Wendy510's starred repositories
zju-icicles
浙江大学课程攻略共享计划
100-Days-Of-ML-Code
100-Days-Of-ML-Code中文版
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Speech-Emotion-Recognition
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
AvStackDocs
音视频基础知识整理和相关协议文档说明
python_ebook
收集了一些Python相关资料
PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
LSTM_PIT_Speech_Separation
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
speaker-recognition-py3
Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)
FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
x-vector-pytorch
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
Audio_Classification_using_LSTM
Classification of Urban Sound Audio Dataset using LSTM-based model.
Build-SE-Dataset
Build speech enhancement dataset.
kaldi-script
初学者笔记不多
youtube-lid-data
Scripts for collecting audio data from Youtube for building spoken language identification models.