powei-C's starred repositories
Languagecodec
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models
SpeechTokenizer
This is the code for SpeechTokenizer, presented in the paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on
vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
singaligner
A compact audio-to-phoneme aligner for singing voice
so-vits-svc-4.0-v2
SoftVC VITS Singing Voice Conversion
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
Towards-Training-Explainable-Singing-Quality-Assessment-Network-with-Augmented-Data
Code for the paper "Towards Training Explainable Singing Quality Assessment Network with Augmented Data"
SingingVoice-Auto-Alignment-Revised
Revised version of the auto-annotation workflow
phonemizer
Simple text to phones converter for multiple languages
imitation-learning
Imitation learning algorithms
RL-pytorch
Implementation of reinforcement learning in PyTorch
lets-do-irl
Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)
diffwave-sashimi
Implementation of DiffWave and SaShiMi audio generation models
DiffWave-Vocoder
PyTorch reimplementation of the DiffWave vocoder: a high-quality, fast, and small neural vocoder.
DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
iSTFTNet-pytorch
iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform