jcl-gx's starred repositories

TDL-ADD

This is the pytorch implementation of our work titled "An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially Spoofed Audio Detection”.

Language:PythonStargazers:7Issues:0Issues:0

fake-voice-detection

Using temporal convolution to detect Audio Deepfakes

Language:PythonLicense:Apache-2.0Stargazers:340Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:65815Issues:0Issues:0

deepfake-whisper-features

Implementation of the paper "Improved DeepFake Detection Using Whisper Features"

Language:PythonLicense:MITStargazers:79Issues:0Issues:0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language:PythonLicense:NOASSERTIONStargazers:8604Issues:0Issues:0

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

License:MITStargazers:2926Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonLicense:MITStargazers:22007Issues:0Issues:0

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonLicense:MITStargazers:637Issues:0Issues:0

contentvec

speech self-supervised representations

Language:PythonLicense:MITStargazers:441Issues:0Issues:0

CLAP

Contrastive Language-Audio Pretraining

Language:PythonLicense:CC0-1.0Stargazers:1285Issues:0Issues:0

CLAP

Learning audio concepts from natural language supervision

Language:PythonLicense:MITStargazers:441Issues:0Issues:0

AdaSpeech

An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"

Language:PythonStargazers:95Issues:0Issues:0

vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Language:PythonLicense:MITStargazers:1134Issues:0Issues:0

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonLicense:NOASSERTIONStargazers:435Issues:0Issues:0

knn-vc-1

Voice Conversion With Just Nearest Neighbors

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

knn-vc

Voice conversion with just k-nearest neighbors

License:MITStargazers:4Issues:0Issues:0

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonLicense:MITStargazers:568Issues:0Issues:0

TranSpeech

PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation

Language:PythonLicense:MITStargazers:164Issues:0Issues:0

WenetSpeech

A 10000+ hours dataset for Chinese speech recognition

Language:ShellLicense:Apache-2.0Stargazers:484Issues:0Issues:0

GenerSpeech

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Language:PythonLicense:MITStargazers:310Issues:0Issues:0

SpeechSplit2

Official implementation of SpeechSplit2

Language:PythonStargazers:124Issues:0Issues:0

NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

Language:PythonStargazers:224Issues:0Issues:0

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Language:PythonLicense:MITStargazers:7716Issues:0Issues:0

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonLicense:MITStargazers:1250Issues:0Issues:0

SpeechSplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Language:PythonLicense:MITStargazers:632Issues:0Issues:0

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonLicense:Apache-2.0Stargazers:1965Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonLicense:MITStargazers:3350Issues:0Issues:0

DeepLearning

深度学习入门教程, 优秀文章, Deep Learning Tutorial

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:13744Issues:0Issues:0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:2343Issues:0Issues:0

PythonTrain

Python程序设计基础_嵩天编

Language:HTMLStargazers:6Issues:0Issues:0