Yun Zhao's repositories
chinese_text_normalization
Chinese text normalization for speech processing
hybrid-multi-spk-vc
a hybrid multi-speaker voice conversion system
asr_dataset
The dataset of Speech Recognition
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
BaiduSpider
BaiduSpider,一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
espnet
End-to-End Speech Processing Toolkit
FastVocoder
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
ICASSP2021_paper_list-VC
ICASSP 2021 accepted papers in term of voice conversion (VC)
kaldi-cmake
create CMakeLists.txt for kaldi
kaldi-onnx
Kaldi model converter to ONNX
MetaDialog
Platform for few-shot natural language processing: Text Classification, Sequene Labeling.
multi-speaker-tacotron
VCTK multi-speaker tacotron for ICASSP 2020
nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
pointer_summarizer
pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
x-vector-pytorch
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch