Genta Indra Winata's repositories
end2end-asr-pytorch
End-to-End Automatic Speech Recognition on PyTorch
code-switching-papers
A curated list of research papers and resources on code-switching
few-shot-lm
The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
indonesian-nlp
A curated list of research papers and resources on Indonesian languages
gentaiscool.github.io
My website
rnn-transducer
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
matrix_fact
Matrix Factorization Library
nlp-id-progress
The latest progress on the NLP research for the Indonesian language
speech-recognition-papers
Towards hot directions in industrial speech recognition
acl-anthology
Data and software for building the ACL Anthology.
al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
BIG-bench
Beyond the Imitation Game collaborative benchmark for enormous language models
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
GitHubGraduation-2021
Join the GitHub Graduation Yearbook and "walk the stage" on June 5.
indonlu
The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained models, and a starter code! (AACL-IJCNLP 2020)
lowresource-nlp-bootcamp-2020
The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020
mesh-transformer-jax
Model parallel transformers in JAX and Haiku
NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
promptsource
Toolkit for creating, sharing and using natural language prompts.
pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.