yangliu1992's repositories
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
bert
TensorFlow code and pre-trained models for BERT
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA/Streaming ASR witch punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.
speech-synthesis-paper
List of speech synthesis papers.
supervoice
VoiceBox neural network implementation
tensorflow
An Open Source Machine Learning Framework for Everyone
tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
textlesslib
Library for Textless Spoken Language Processing
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
world-class
A C++ library of "World" - A high-quality speech analysis, manipulation and synthesis system -