户建坤's repositories
allosaurus
Pretrained Model for ICASSP 2020 "Universal Phone Recognition with a Multilingual Allophone System"
audioset_models
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
NOASSERTION000
deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
dpss-exp3-VC-PPG
Voice Conversion Experiments for THUHCSI Course : <Digital Processing of Speech Signals>
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
PPG-CL-TTS
PPG-CL-TTS
speech-recognition-papers
Towards hot directions in industrial speech recognition