Beast code in Giters

Suwon Yang's repositories

parler-tts

Inference and training library for high-quality TTS models.

Language: Python; License: Apache-2.0

LivePortrait

Bring portraits to life!

License: NOASSERTION

diff2lip

Language: Python; License: NOASSERTION

fish-speech

Brand new TTS solution

Language: Python; License: NOASSERTION

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language: Python; License: Apache-2.0

SenseVoice

Multilingual Voice Understanding Model

License: NOASSERTION

awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation, zero-shot TTS, ASR, and audio generation

License: MIT

suno-api

Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.

License: LGPL-3.0

BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Language: Python; License: MIT

IMAGDressing

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Language: Python; License: Apache-2.0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language: Python; License: NOASSERTION

AudioLCM

PyTorch implementation of [AudioLCM]: efficient and high-quality text-to-audio generation with a latent consistency model.

Language: Python

speechbrain

A PyTorch-based Speech Toolkit

Language: Python; License: Apache-2.0

MusiConGen

License: MIT

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language: Python; License: MIT

ChatTTS

A generative speech model for daily dialogue.

License: AGPL-3.0

EmoSphere-TTS

The official implementation of EmoSphere-TTS


CosyVoice

Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

Language: Python; License: Apache-2.0

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Language: Jupyter Notebook; License: AGPL-3.0

NeMo

NeMo: a toolkit for conversational AI

Language: Python; License: Apache-2.0

Stable-Hair

Stable-Hair: Real-World Hair Transfer via Diffusion Model

License: Apache-2.0

OpenVoice

Instant voice cloning by MyShell

Language: Python; License: MIT

DEX-TTS

DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability

Language: Python; License: MIT

instruct-MusicGen

The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning".

Language: Python; License: Apache-2.0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

License: MIT

TTS-papers

🐸 collection of TTS papers

License: MPL-2.0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook; License: NOASSERTION

honeybee

Official implementation of Honeybee

Language: Python; License: NOASSERTION

wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Language: Python; License: Apache-2.0

FoleyCrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝

License: Apache-2.0