Genta Indra Winata's repositories
end2end-asr-pytorch
End-to-End Automatic Speech Recognition on PyTorch
code-switching-papers
A curated list of research papers and resources on code-switching
lstm-attention
Attention-based bidirectional LSTM for Classification Task (ICASSP)
few-shot-lm
The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
indonesian-nlp
A curated list of research papers and resources on Indonesian languages
gentaiscool.github.io
My website
matrix_fact
Matrix Factorization Library
acl-anthology
Data and software for building the ACL Anthology.
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
mesh-transformer-jax
Model parallel transformers in JAX and Haiku
mt-metrics-eval
Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.
mteb
MTEB: Massive Text Embedding Benchmark
NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
promptsource
Toolkit for creating, sharing and using natural language prompts.