David Adelani's repositories
africanlp-resources
List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond
nepali-ner
Nepali NER dataset and code
How-to-distill-your-BERT
Code for the paper: How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives (ACL 2023)
NLP_DL_Intro
Deep learning for NLP
ANEC-An-Amharic-Named-Entity-Corpus-
A Dataset for Amharic Named Entity Recognition
dadelani.github.io
David Adelani website
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
DeBERTa
The implementation of DeBERTa
finetune-hf-vits
Finetune VITS and MMS using HuggingFace's tools
flores
Facebook Low Resource (FLoRes) MT Benchmark
HornMT
Machine translation (MT) benchmark dataset for languages in the Horn of Africa.
kb_bart
Pretraining scripts for BART transformer model
langrank
A program to choose transfer languages for cross-lingual learning
MultilingualSIFT
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
open-bible-scripts
scipts for working with open.bible data
portuguese-bert
Portuguese pre-trained BERT models
setfit
Efficient few-shot learning with Sentence Transformers
ViDeBERTa
ViDeBERTa: A powerful pre-trained language model for Vietnamese
wtpsplit
Code for Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation
zindi_masakhane_pos
Code for Lacuna Masakhane Parts of Speech Classification Challenge