Andre Niyongabo Rubungo's repositories
africanlp-public-datasets
A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.
KINNEWS-and-KIRNEWS-Corpus
Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre Niyongabo, Hong Qu, Julia Kreutzer, and Li Huang.
nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
BangBei-APP
BangBei is an android app which was designed to be used inside the campus of UESTC to let students help each other and make money at the same time. It has won 2017 UESTC programing competition.
UESTC_2016_Freshman_web
This is a web developed in UESTC-IUSTU workshop which was designed for new members to learn about web development, mobile app development (Android&ios), etc.
annotated_latex_equations
Examples of how to create colorful, annotated equations in Latex using Tikz.
bitextor
Bitextor generates translation memories from multilingual websites
cgcnn
Crystal graph convolutional neural networks for predicting material properties.
Data-Science-Articles
A collection of my data science articles published in Towards Data Science and Towards AI.
lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Llama-2-notebooks
All the projects related to Llama
LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
masakhane-community
All our community docs! Start here! Lets put Africa on the NLP Map
masakhane-preprocessing
Building an effective preprocessing tool for African languages
ML-Papers-Explained
Explanation to key concepts in ML
Neo4j-ParticleFiltering
A user-defined procedure based on Markov-chains to approximate the Personalized PageRank algorithm in Neo4j
pytorch-sentiment-analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
pytorch-seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
speechbrain
A PyTorch-based Speech Toolkit
TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.