Fu Guanyu's starred repositories
Demo-of-Text-to-Speech-based-on-Deep-Learning
Text-to-speech for Mandarin
mfa-models
Collection of pretrained models for the Montreal Forced Aligner
StructuredLM_RTDT
A library for building hierarchical text representation and corresponding downstream applications.
AEC-Challenge
AEC Challenge
tensorflow
An Open Source Machine Learning Framework for Everyone
book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
multimodal-speech-emotion
TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18
Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer-based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.
TTS-papers
🐸 collection of TTS papers
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
hangzhou-house-guide
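Hangzhou home-buying guide: a walkthrough distilled from personal home-buying experience, covering the new-home purchase lottery and choosing second-hand homes, with extensive Hangzhou urban planning material.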
mandarin-tts
Chinese Mandarin text-to-speech (TTS) based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets
End-to-End-Speech-Recognition-Learning
ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition
kaldi-dnn-ali-gop
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.
pinyin-tapt-wav2vec2
(Re)-Pre-training Wav2Vec2 on Converting Pinyin to Chinese Characters
wavenet_SR
WaveNet Speech Recognition to ARPABET phonemes
MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
efficient_tts
PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"
tacotronv2_wavernn_chinese
Chinese speech synthesis with Tacotron 2 + WaveRNN (TensorFlow + PyTorch)