gaopeng's starred repositories
Megatron-LM
Ongoing research on training transformer models at scale
flash-attention
Fast and memory-efficient exact attention
GigaSpeech
Large, modern dataset for speech recognition
speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems for speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many other tasks.
speechbrain
A PyTorch-based Speech Toolkit
common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
warp-transducer
A fast parallel implementation of RNN Transducer.
kaldi-io-for-python
Python functions for reading Kaldi data formats. Useful for rapid prototyping with Python.