yangliu1992's repositories

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CNTN

ChiNese Text Normalization (CNTN) tool for Text-to-speech system

Language:PythonStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR witch punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

speech-synthesis-paper

List of speech synthesis papers.

License:MITStargazers:0Issues:0Issues:0

supervoice

VoiceBox neural network implementation

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

tensorflow

An Open Source Machine Learning Framework for Everyone

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

textlesslib

Library for Textless Spoken Language Processing

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:0Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

world-class

A C++ library of "World" - A high-quality speech analysis, manipulation and synthesis system -

Language:C++License:BSD-3-ClauseStargazers:0Issues:1Issues:0