Andre Niyongabo Rubungo's repositories

africanlp-public-datasets

A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.

KINNEWS-and-KIRNEWS-Corpus

Data, embeddings, stopword lists, code, and baselines for the COLING 2020 paper "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre Niyongabo, Hong Qu, Julia Kreutzer, and Li Huang.

Language: Python | License: MIT | Stargazers: 11 | Issues: 3 | Issues: 0

nlp-datasets

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

BangBei-APP

BangBei is an Android app designed for use on the UESTC campus that lets students help each other and earn money at the same time. It won the 2017 UESTC programming competition.

Language: Java | License: Apache-2.0 | Stargazers: 1 | Issues: 2 | Issues: 0

UESTC_2016_Freshman_web

A website developed in the UESTC-IUSTU workshop, designed to help new members learn about web development, mobile app development (Android and iOS), etc.

Language: HTML | Stargazers: 1 | Issues: 1 | Issues: 0

afromt

Code for the EMNLP 2021 paper "AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages" by Machel Reid, Junjie Hu, Graham Neubig, and Yutaka Matsuo.

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

annotated_latex_equations

Examples of how to create colorful, annotated equations in LaTeX using TikZ.

Language: TeX | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

bert

TensorFlow code and pre-trained models for BERT

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

bitextor

Bitextor generates translation memories from multilingual websites

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

cgcnn

Crystal graph convolutional neural networks for predicting material properties.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Data-Science-Articles

A collection of my data science articles published in Towards Data Science and Towards AI.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

lafand-mt

LAFAND-MT: Lacuna Anglo & Franco Africa News Dataset for low-resourced MT

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

lit-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Llama-2-notebooks

All the projects related to Llama

Stargazers: 0 | Issues: 0 | Issues: 0

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

masakhane-community

All our community docs! Start here! Let's put Africa on the NLP map.

License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

masakhane-preprocessing

Building an effective preprocessing tool for African languages

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

ML-Papers-Explained

Explanations of key concepts in ML

Stargazers: 0 | Issues: 0 | Issues: 0

Neo4j-ParticleFiltering

A user-defined procedure based on Markov chains to approximate the Personalized PageRank algorithm in Neo4j

Language: Java | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

PLMpapers

Must-read Papers on pre-trained language models.

License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

pytorch-seq2seq

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

synthesis

Data synthesis by contextualizing glossary translations

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

TAADpapers

Must-read Papers on Textual Adversarial Attack and Defense

Stargazers: 0 | Issues: 0 | Issues: 0

TextBlob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0