Mars's repositories
android-vad
This VAD library can process audio in real-time utilizing GMM which helps identify presence of human speech in an audio sample that contains a mixture of speech and noise.
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
kaldi_x-vector_aishell
Using Kaldi x-vector method to train speaker recognition model on aishell database.
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit