Beast code in Giters

dragnDriver's starred repositories

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.032359 273 1071

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT20329 198 368

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookApache-2.012565 170 505

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language:PythonNOASSERTION9911 131 48

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT7469 82 151

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonMIT6536 56 201

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonMIT3332 58 70

riffusion

Stable diffusion for real-time music generation

Language:PythonMIT3311 38 93

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language:Jupyter NotebookMIT2627 33 96

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language:PythonNOASSERTION2353 42 102

Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Language:PythonMIT726 71 13

glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Language:PythonMIT652 20 73

lora-svc

singing voice change based on whisper, and lora for singing voice clone

Language:PythonMIT610 24 69

fish-diffusion

An easy to understand TTS / SVS / SVC framework

Language:PythonMIT609 22 60

KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

Language:PythonMIT472 14 68

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonNOASSERTION433 14 35

MB-iSTFT-VITS

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Language:PythonApache-2.0404 17 25

StyleTTS

Official Implementation of StyleTTS

Language:PythonMIT381 32 71

univnet

Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

Language:PythonBSD-3-Clause258 12 9

iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Language:PythonApache-2.0215 10 15

bigvsan

Pytorch implementation of BigVSAN

Language:PythonMIT191 29 6

RMVPE

Language:PythonApache-2.0190 4 6

cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

Language:PythonMIT184 22 14

StyleTTS-VC

Official Implementation of StyleTTS-VC

Language:PythonMIT153 18 8

avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

Language:PythonNOASSERTION150 4 5

BigVGAN

Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training

Language:PythonMIT130 8 12

diffiner

Language:PythonMIT57 6 2

snake

Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"

Language:Jupyter NotebookMIT50 7 1

NSF-HiFiGAN

Vocoder NSF-HiFiGAN (Moved into deepaudio)

Language:PythonMIT44 60

chinese-poetry

最全古诗词数据库 The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人，21050首词。

Language:JavaScriptMIT100