vick-wuwei

vick-wuwei

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

vick-wuwei's starred repositories

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonLicense:Apache-2.0Stargazers:1956Issues:0Issues:0
Language:C++License:NOASSERTIONStargazers:3733Issues:0Issues:0

friendly-stable-audio-tools

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Language:PythonLicense:MITStargazers:96Issues:0Issues:0

lp-music-caps

LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]

Language:PythonStargazers:259Issues:0Issues:0

stable-audio-tools

Generative models for conditional audio generation

Language:PythonLicense:MITStargazers:2388Issues:0Issues:0

riffusion-manipulation

tools to manipulate audio with riffusion

Language:PythonStargazers:85Issues:0Issues:0

kohya-trainer

Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1830Issues:0Issues:0

stable-ts

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Language:PythonLicense:MITStargazers:1433Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32386Issues:0Issues:0

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:7604Issues:0Issues:0

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language:PythonLicense:Apache-2.0Stargazers:9356Issues:0Issues:0

polyffusion

Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls

Language:PythonLicense:MITStargazers:68Issues:0Issues:0

AccoMontage-3

Code and demo for paper: Zhao et al., AccoMontage-3: Full-Band Accompaniment Arrangement via Sequential Style Transfer and Multi-Track Function Prior.

Language:PythonLicense:MITStargazers:20Issues:0Issues:0

CLAP

Contrastive Language-Audio Pretraining

Language:PythonLicense:CC0-1.0Stargazers:1275Issues:0Issues:0

AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

Language:PythonLicense:MITStargazers:178Issues:0Issues:0

AudioLDM2

Text-to-Audio/Music Generation

Language:PythonLicense:NOASSERTIONStargazers:2169Issues:0Issues:0

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Language:PythonLicense:MITStargazers:4395Issues:0Issues:0

cryptpad

Collaborative office suite, end-to-end encrypted and open-source.

Language:JavaScriptLicense:AGPL-3.0Stargazers:5439Issues:0Issues:0

dashboard-icons

🚀 The best source for dashboard icons.

Language:PythonLicense:NOASSERTIONStargazers:4488Issues:0Issues:0

homepage

A highly customizable homepage (or startpage / application dashboard) with Docker and service API integrations.

Language:JavaScriptLicense:GPL-3.0Stargazers:17533Issues:0Issues:0

awesome-selfhosted

A list of Free Software network services and web applications which can be hosted on your own servers

License:NOASSERTIONStargazers:189255Issues:0Issues:0

douyin-downloader

抖音批量下载工具,去水印,支持视频、图集、合集、音乐(原声)。免费!免费!免费!

Language:PythonStargazers:1020Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:34039Issues:0Issues:0

wechaty

Conversational RPA SDK for Chatbot Makers. Join our Discord: https://discord.gg/7q8NBZbQzt

Language:TypeScriptLicense:Apache-2.0Stargazers:19734Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:11350Issues:0Issues:0

navidrome

🎧☁️ Modern Music Server and Streamer compatible with Subsonic/Airsonic

Language:GoLicense:GPL-3.0Stargazers:10900Issues:0Issues:0

duangcloud

duangcloud官网最新地址

Stargazers:24Issues:0Issues:0

DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Language:PythonLicense:Apache-2.0Stargazers:2640Issues:0Issues:0
Language:PythonLicense:MITStargazers:1954Issues:0Issues:0

ChatGPT-Next-Web

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

Language:TypeScriptLicense:MITStargazers:73517Issues:0Issues:0