dragnDriver's starred repositories

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 32359 | Issues: 273 | Issues: 1071

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 20329 | Issues: 198 | Issues: 368

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 12565 | Issues: 170 | Issues: 505

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language: Python | License: NOASSERTION | Stargazers: 9911 | Issues: 131 | Issues: 48

VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

Language: Python | License: MIT | Stargazers: 7469 | Issues: 82 | Issues: 151

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language: Python | License: MIT | Stargazers: 6536 | Issues: 56 | Issues: 201

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language: Python | License: MIT | Stargazers: 3332 | Issues: 58 | Issues: 70

riffusion

Stable diffusion for real-time music generation

Language: Python | License: MIT | Stargazers: 3311 | Issues: 38 | Issues: 93

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language: Jupyter Notebook | License: MIT | Stargazers: 2627 | Issues: 33 | Issues: 96

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language: Python | License: NOASSERTION | Stargazers: 2353 | Issues: 42 | Issues: 102

Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Language: Python | License: MIT | Stargazers: 726 | Issues: 71 | Issues: 13

glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Language: Python | License: MIT | Stargazers: 652 | Issues: 20 | Issues: 73

lora-svc

Singing voice conversion based on Whisper, with LoRA for singing voice cloning

Language: Python | License: MIT | Stargazers: 610 | Issues: 24 | Issues: 69

fish-diffusion

An easy-to-understand TTS / SVS / SVC framework

Language: Python | License: MIT | Stargazers: 609 | Issues: 22 | Issues: 60

KAN-TTS

KAN-TTS is a speech-synthesis training framework. Demos are posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

Language: Python | License: MIT | Stargazers: 472 | Issues: 14 | Issues: 68

knn-vc

Voice Conversion With Just Nearest Neighbors

Language: Python | License: NOASSERTION | Stargazers: 433 | Issues: 14 | Issues: 35

MB-iSTFT-VITS

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Language: Python | License: Apache-2.0 | Stargazers: 404 | Issues: 17 | Issues: 25

StyleTTS

Official Implementation of StyleTTS

Language: Python | License: MIT | Stargazers: 381 | Issues: 32 | Issues: 71

univnet

Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

Language: Python | License: BSD-3-Clause | Stargazers: 258 | Issues: 12 | Issues: 9

iSTFTNet-pytorch

iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Language: Python | License: Apache-2.0 | Stargazers: 215 | Issues: 10 | Issues: 15

bigvsan

PyTorch implementation of BigVSAN

Language: Python | License: MIT | Stargazers: 191 | Issues: 29 | Issues: 6
Language: Python | License: Apache-2.0 | Stargazers: 190 | Issues: 4 | Issues: 6

cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

Language: Python | License: MIT | Stargazers: 184 | Issues: 22 | Issues: 14

StyleTTS-VC

Official Implementation of StyleTTS-VC

Language: Python | License: MIT | Stargazers: 153 | Issues: 18 | Issues: 8

avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI 2023)

Language: Python | License: NOASSERTION | Stargazers: 150 | Issues: 4 | Issues: 5

BigVGAN

Unofficial PyTorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training

Language: Python | License: MIT | Stargazers: 130 | Issues: 8 | Issues: 12
Language: Python | License: MIT | Stargazers: 57 | Issues: 6 | Issues: 2

snake

Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"

Language: Jupyter Notebook | License: MIT | Stargazers: 50 | Issues: 7 | Issues: 1

NSF-HiFiGAN

Vocoder NSF-HiFiGAN (Moved into deepaudio)

Language: Python | License: MIT | Stargazers: 44 | Issues: 6 | Issues: 0

chinese-poetry

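The most comprehensive database of Chinese poetry 🧶 It covers nearly 14,000 poets of the Tang and Song dynasties, with close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci lyrics by 1,564 ci poets of the Song period.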

Language: JavaScript | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0