jcl-gx's starred repositories
fake-voice-detection
Using temporal convolution to detect Audio Deepfakes
deepfake-whisper-features
Implementation of the paper "Improved DeepFake Detection Using Whisper Features"
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
contentvec
speech self-supervised representations
vits_chinese
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
TranSpeech
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation
WenetSpeech
A 10000+ hours dataset for Chinese speech recognition
GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
SpeechSplit2
Official implementation of SpeechSplit2
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
DeepLearning
深度学习入门教程, 优秀文章, Deep Learning Tutorial
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
PythonTrain
Python程序设计基础_嵩天编