HanShin, Park's starred repositories
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
fastapi-with-tailwindcss
How to setup FastAPI with TailwindCSS
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
HierSpeechpp
The official implementation of HierSpeech++
one-click-installers-tts
Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos
Genshin_Datasets
Genshin Datasets For SVC/SVS/TTS
StarRail_Datasets
StarRail Datasets For SVC/SVS/TTS
ai-audio-datasets
AI Audio Datasets 🎵. A list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
ChatWaifu-API
A ChatWaifu Version with official ChatGPT API
professional-programming
A collection of learning resources for curious software engineers
descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
StableCascade
Official Code for Stable Cascade
WutheringWaves
Wuthering Waves ps (0.9.0)
metavoice-src
Foundational model for human-like, expressive TTS