WhiteFu's repositories
audio-dataset
Audio Dataset for training CLAP and other models
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
audio-preprocessing-scripts
Dataset creation: scripts covering the pipeline from stream recordings to accompaniment separation to slicing
chinese-dialect-lexicons
Grapheme-to-Phoneme lexicons for Chinese dialects
control-vc
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
espnet_onnx
ONNX wrapper for ESPnet inference models
fluenttts
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
g2pE_mobile
G2P for English TTS
GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
hello-algo
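"Hello Algo": an introductory book on data structures and algorithms with animated illustrations, runnable code, and support for reader questions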
IMS-Toucan
IMS-Toucan is a toolkit to train state-of-the-art Speech Synthesis models. Everything is pure Python and PyTorch based to keep it as simple and beginner-friendly, yet powerful as possible.
larynx2
A fast, local neural text to speech system
musika
Fast Infinite Waveform Music Generation
NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
so-vits-svc-toolkit
A toolkit and documentation version of so-vits-svc.
StyleTTS
Official Implementation of StyleTTS
T2A
Project page for "T2A: Robust Text-to-Animation" for ICASSP2023
torch-nansypp
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
vlabeler
Open source voice labeling application
zac2022-lyric-alignment
Solution for Zalo AI Challenge 2022 - Lyrics Alignment