warma16's repositories
diffsinger-sovits
a diffsinger enhance for sovits
audio-preprocess
Preprocess Audio for training
audio_dataset_vpr
A voiceprint recognition classifier for audio dataset
Autoformer
About Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008
basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
Colaboratory-Notebook-for-Ultimate-Vocal-Remover
Colaboratory Notebook for Ultimate Vocal Remover
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
edgetunnel
Running V2ray inside edge/serverless runtime
english-labeling-guide
guidelines for correct and repeatable english labeling (Diffsinger, NNSVS, etc)
fish-diffusion
An easy to understand TTS / SVS / SVC framework
GPT-SoVITS-ocero
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
gpt4all
gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue
handle
A Chinese Hanzi variation of Wordle - 汉字 Wordle
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
OneForAll
OneForAll是一款功能强大的子域收集工具
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
SenseVoice
Multilingual Voice Understanding Model
so-vits-svc
SoftVC VITS Singing Voice Conversion
StudyWithMiku
STUDY WITH MIKU web version cover
xmind-sdk-js
This is a lightweight official software development kit to help people who wants to build the mapping file without the UI client and It's also supported to run in Browser or Node.js.