splinter21's repositories
bert-vits
vits with bert
Bili.Copilot
BiliBili Copilot
Chat-Haruhi-Suzumiya
Chat凉宫春日, 由李鲁鲁, 冷子昂等同学开发的模仿二次元对话的聊天机器人。
ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
GhostReview
A Framework Code of Reviewing Stable Diffusion Checkpoint
github-trending
Tracking the most popular Github repos, update daily(Python version)
JK-VITS
Bilingual-TTS (Japanese and Korean)
libf0
A Python Library for Fundamental Frequency Estimation in Music Recordings
LLaMA-Efficient-Tuning
Fine-tuning LLaMA with PEFT (PT+SFT+RLHF with QLoRA)
Mangio-RVC-Fork
Mangio-RVC-Fork
MoeMusicTranscription
An automatic music transcription application
natsume
A Japanese text frontend processing toolkit
NSF-BigVGAN
BigVGAN with Neural Source-Filter
so-vits-svc
SoftVC VITS Singing Voice Conversion
SoundStorm
The reproduced code for Google's SoundStorm
StableDiffusionWebUIScan
Simple Stable Diffusion WebUI Scanner (for seeta cloud)
storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
sukasuka-vocal-dataset-builder
すかすかアニメボカロデータセット。1st anime vocal dataset. Extract audio (vocal) files from video based on .ass subtitle files; manually label vocal files to characters. Will be used for PITS/VITS/Diffusion text-to-speech/SVC. 根据字幕,从视频里抽取全部语音,然后手动按角色标注。
vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
VocalForge
Your one-stop solution for voice dataset creation
vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
WavThruVec_pytorch
An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"
XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)