David Adelani's repositories

sib-200

SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects

Language: Python · License: Apache-2.0 · Stargazers: 15 · Issues: 1 · Issues: 0

africanlp-resources

List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond

License: Apache-2.0 · Stargazers: 7 · Issues: 1 · Issues: 0

nepali-ner

Nepali NER dataset and code

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 6 · Issues: 2 · Issues: 0

How-to-distill-your-BERT

Code for the paper: How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives (ACL 2023)

Language: Python · License: MIT · Stargazers: 2 · Issues: 0 · Issues: 0

hstm

Code and data for "Heterogeneous Supervised Topic Models"

Language: Python · License: MIT · Stargazers: 1 · Issues: 0 · Issues: 0

NLP_DL_Intro

Deep learning for NLP

Language: Python · License: Apache-2.0 · Stargazers: 1 · Issues: 1 · Issues: 0
Language: TeX · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

ANEC-An-Amharic-Named-Entity-Corpus-

A Dataset for Amharic Named Entity Recognition

License: GPL-3.0 · Stargazers: 0 · Issues: 0 · Issues: 0
Language: Python · License: NOASSERTION · Stargazers: 0 · Issues: 1 · Issues: 0

dadelani.github.io

David Adelani website

Language: JavaScript · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0
Language: HTML · Stargazers: 0 · Issues: 1 · Issues: 0

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

DeBERTa

The implementation of DeBERTa

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

finetune-hf-vits

Finetune VITS and MMS using HuggingFace's tools

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

flores

Facebook Low Resource (FLoRes) MT Benchmark

Language: Python · License: CC-BY-SA-4.0 · Stargazers: 0 · Issues: 0 · Issues: 0

ftac

FTAC text

Stargazers: 0 · Issues: 1 · Issues: 0

HornMT

Machine translation (MT) benchmark dataset for languages in the Horn of Africa.

Stargazers: 0 · Issues: 0 · Issues: 0

kb_bart

Pretraining scripts for BART transformer model

Language: Python · Stargazers: 0 · Issues: 0 · Issues: 0

langrank

A program to choose transfer languages for cross-lingual learning

Language: Python · License: BSD-3-Clause · Stargazers: 0 · Issues: 0 · Issues: 0
Language: JavaScript · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

muliwai

Text pre-processing for NLP datasets

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 1 · Issues: 0

MultilingualSIFT

MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0
Language: Perl · Stargazers: 0 · Issues: 0 · Issues: 0

open-bible-scripts

scripts for working with open.bible data

Language: Shell · License: Apache-2.0 · Stargazers: 0 · Issues: 1 · Issues: 0

portuguese-bert

Portuguese pre-trained BERT models

Language: Python · License: NOASSERTION · Stargazers: 0 · Issues: 0 · Issues: 0
Language: Python · Stargazers: 0 · Issues: 0 · Issues: 0

setfit

Efficient few-shot learning with Sentence Transformers

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

ViDeBERTa

ViDeBERTa: A powerful pre-trained language model for Vietnamese

Language: Jupyter Notebook · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

wtpsplit

Code for Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

zindi_masakhane_pos

Code for Lacuna Masakhane Parts of Speech Classification Challenge

Stargazers: 0 · Issues: 0 · Issues: 0