Dongji Gao's repositories
CLAP
Contrastive Language-Audio Pretraining
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
espnet
End-to-End Speech Processing Toolkit
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
gtn_applications
Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"
k2
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
dongjigao.github.io
My personal website
llama3
The official Meta Llama 3 GitHub site
ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
NeMo
NeMo: a toolkit for conversational AI
queue-utils
Utility scripts to submit multi-GPU jobs on CLSP, COE, and MARCC
text_search
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision