CrazyCharles6's repositories
DNN-HMM-Course
DNN-HMM-related experiments for the THUHCSI course "Digital Processing of Speech Signals"
AudioSignalProcessingForML
Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning"
auorange
Audio LPC (linear predictive coding) using mel spectrogram, compatible with LPCNet
BVAE-TTS
Official implementation of BVAE-TTS
coder2gwy
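The internet's first guide to the civil service exam for programmers, presented jointly by three former big-tech programmers who now work in the public sector.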
Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
fft-conv-pytorch
Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
leeml-notes
Notes on Hung-yi Lee's "Machine Learning" course; read online at https://datawhalechina.github.io/leeml-notes
multiband-melgan
Multiband MelGAN implementation with PyTorch
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
Phonetisaurus
Phonetisaurus G2P
Prune-Tune
Official code repository for AAAI2021 paper Finding Sparse Structures for Domain Specific Neural Machine Translation
Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
qualtreats
Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
rnnoise
Recurrent neural network for audio noise reduction
Tacotron
A Tacotron implementation with location relative attention
tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, Korean, Chinese, and German, and is easy to adapt to other languages)
txtfilemerge
Text corpus data cleaning for TXT files: (1) merge TXT files; (2) filter out noise strings; (3) mask personal names, place names, and organization names; (4) convert other encodings to UTF-8
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
visqol
Perceptual Quality Estimator for speech and audio
WG-WaveNet
Real-Time High-Fidelity Speech Synthesis without GPU
zhrtvc
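Chinese real-time voice cloning (VC) and Chinese text-to-speech (TTS). An easy-to-use Chinese voice cloning and speech synthesis system comprising a speech encoder, a speech synthesizer, a vocoder, and a visualization module.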