jwang1993's starred repositories
fish-speech
Brand new TTS solution
whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
parler-tts
Inference and training library for high-quality TTS models.
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
contentvec
speech self-supervised representations
TransferTTS
TransferTTS (Zero-Shot learning of VITS)
VITS-fast-fine-tuning
This repo is a pipeline for fine-tuning VITS for fast speaker-adaptation TTS and many-to-many voice conversion
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
DeepLearing-Interview-Awesome-2024
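A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, which also includes new ideas, new problems, new resources, and new projects from work and research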
StableStudio
Community interface for generative AI
stable-audio-tools
Generative models for conditional audio generation
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
ColossalAI
Making large AI models cheaper, faster and more accessible
Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.