jcl-gx

followers

following

stars

jcl-gx's starred repositories

TDL-ADD

This is the pytorch implementation of our work titled "An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially Spoofed Audio Detection”.

Language:Python700

fake-voice-detection

Using temporal convolution to detect Audio Deepfakes

Language:PythonApache-2.034000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT6581500

deepfake-whisper-features

Implementation of the paper "Improved DeepFake Detection Using Whisper Features"

Language:PythonMIT7900

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language:PythonNOASSERTION860400

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

MIT292600

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonMIT2200700

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonMIT63700

contentvec

speech self-supervised representations

Language:PythonMIT44100

CLAP

Contrastive Language-Audio Pretraining

Language:PythonCC0-1.0128500

CLAP

Learning audio concepts from natural language supervision

Language:PythonMIT44100

AdaSpeech

An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"

Language:Python9500

vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Language:PythonMIT113400

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonNOASSERTION43500

knn-vc-1

Voice Conversion With Just Nearest Neighbors

Language:PythonNOASSERTION100

knn-vc

Voice conversion with just k-nearest neighbors

MIT400

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonMIT56800

TranSpeech

PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation

Language:PythonMIT16400

WenetSpeech

A 10000+ hours dataset for Chinese speech recognition

Language:ShellApache-2.048400

GenerSpeech

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Language:PythonMIT31000

SpeechSplit2

Official implementation of SpeechSplit2

Language:Python12400

NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

Language:Python22400

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Language:PythonMIT771600

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonMIT125000

SpeechSplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Language:PythonMIT63200

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonApache-2.0196500

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonMIT335000

DeepLearning

深度学习入门教程, 优秀文章, Deep Learning Tutorial

Language:Jupyter NotebookApache-2.01374400

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonMIT234300

PythonTrain

Python程序设计基础_嵩天编

Language:HTML600