Suwon Yang's repositories

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LivePortrait

Bring portraits to life!

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SenseVoice

Multilingual Voice Understanding Model

License:NOASSERTIONStargazers:0Issues:0Issues:0

awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

License:MITStargazers:0Issues:0Issues:0

suno-api

Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.

License:LGPL-3.0Stargazers:0Issues:0Issues:0

BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

IMAGDressing

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

AudioLCM

PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.

Language:PythonStargazers:0Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ChatTTS

A generative speech model for daily dialogue.

License:AGPL-3.0Stargazers:0Issues:0Issues:0

EmoSphere-TTS

The official implementation of EmoSphere-TTS

Stargazers:0Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

NeMo

NeMo: a toolkit for conversational AI

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Stable-Hair

Stable-Hair: Real-World Hair Transfer via Diffusion Model

License:Apache-2.0Stargazers:0Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DEX-TTS

DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

instruct-MusicGen

The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning".

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

License:MITStargazers:0Issues:0Issues:0

TTS-papers

🐸 collection of TTS papers

License:MPL-2.0Stargazers:0Issues:0Issues:0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

honeybee

Official implementation of Honeybee

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FoleyCrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝

License:Apache-2.0Stargazers:0Issues:0Issues:0