Idris Abdulmumin's repositories
acl-anthology
Data and software for building the ACL Anthology.
afrisent-semeval-dataset
Dataset for AfriSenti-Semeval
afrolid
AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.
ALMA
This is repository for ALMA translation models.
ArewaDS-Test
My 30 days of python exercise at Arewa DS
Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
fsdp_qlora
Training LLMs with QLoRA + FSDP
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
lafand-mt
MAFAND-MT
mlcontests.github.io
A list of public machine learning/data science/AI contests.
ndjuka.github.io
PyLator is a free translating service created by CyberPy, also known as LeShuriken1, using Python and HTML, hosted on Replit, and translated by the translate module which uses MyMemory API.
Panlex-Lexicon-Extractor
A library to extract bilingual lexicons from Panlex Database
PLM4MT
Code for our work "MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators" in ACL 2022
Rabiu_Msc
My Msc Project work
sentiNaija
This is a Lacuna Funded Project to develop sentiment and emotion corpus for three Nigerian languages: Igbo, Hausa, and Yoruba.
timelms
TimeLMs: Diachronic Language Models from Twitter
unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
YOSM
YOSM: A NEW YORUBA SENTIMENT CORPUS FOR MOVIE REVIEWS