vick-wuwei's starred repositories
friendly-stable-audio-tools
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.
lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
stable-audio-tools
Generative models for conditional audio generation
riffusion-manipulation
tools to manipulate audio with riffusion
kohya-trainer
Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning
Bert-VITS2
vits2 backbone with multilingual-bert
StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
polyffusion
Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls
AccoMontage-3
Code and demo for paper: Zhao et al., AccoMontage-3: Full-Band Accompaniment Arrangement via Sequential Style Transfer and Multi-Track Function Prior.
AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
dashboard-icons
🚀 The best source for dashboard icons.
awesome-selfhosted
A list of Free Software network services and web applications which can be hosted on your own servers
douyin-downloader
抖音批量下载工具,去水印,支持视频、图集、合集、音乐(原声)。免费!免费!免费!
duangcloud
duangcloud官网最新地址
DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。