powei-C's starred repositories

Languagecodec

Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models

Language: Python | License: MIT | Stargazers: 186 | Issues: 0
Language: Python | License: MIT | Stargazers: 242 | Issues: 0

FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.

Language: Python | License: MIT | Stargazers: 323 | Issues: 0

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on

Language: Python | License: Apache-2.0 | Stargazers: 385 | Issues: 0

vector-quantize-pytorch

Vector (and Scalar) Quantization, in PyTorch

Language: Python | License: MIT | Stargazers: 2260 | Issues: 0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language: Python | License: MIT | Stargazers: 1057 | Issues: 0

BigVGAN

BigVGAN with Neural Source-Filter

Language: Python | License: MIT | Stargazers: 50 | Issues: 0

singaligner

A compact audio-to-phoneme aligner for singing voice

Language: Python | Stargazers: 9 | Issues: 0

golf

A DDSP-based neural voice synthesiser.

Language: Jupyter Notebook | License: MIT | Stargazers: 90 | Issues: 0

BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Language: Python | License: MIT | Stargazers: 810 | Issues: 0
Language: Python | Stargazers: 43 | Issues: 0

so-vits-svc-4.0-v2

SoftVC VITS Singing Voice Conversion

Language: Python | License: MIT | Stargazers: 548 | Issues: 0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language: Python | License: NOASSERTION | Stargazers: 8593 | Issues: 0

Towards-Training-Explainable-Singing-Quality-Assessment-Network-with-Augmented-Data

Code for the paper "Towards Training Explainable Singing Quality Assessment Network with Augmented Data"

Language: Python | Stargazers: 13 | Issues: 0

SingingVoice-Auto-Alignment-Revised

Revised version of the automatic annotation workflow

Language: Jupyter Notebook | Stargazers: 4 | Issues: 0

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language: Python | License: MIT | Stargazers: 633 | Issues: 0

PHONEix

PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor

Stargazers: 5 | Issues: 0

phonemizer

Simple text to phones converter for multiple languages

Language: Python | License: GPL-3.0 | Stargazers: 1165 | Issues: 0

USVG

A unified model for zero-shot singing voice conversion and synthesis

Language: Python | Stargazers: 21 | Issues: 0

GMVAE

Implementation of Gaussian Mixture Variational Autoencoder (GMVAE) for Unsupervised Clustering

Language: Python | License: MIT | Stargazers: 298 | Issues: 0

imitation-learning

Imitation learning algorithms

Language: Python | License: MIT | Stargazers: 409 | Issues: 0

tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 5006 | Issues: 0

RL-pytorch

Implementation of reinforcement learning in PyTorch

Language: Python | License: MIT | Stargazers: 9 | Issues: 0

DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Language: Python | License: Apache-2.0 | Stargazers: 2853 | Issues: 0

lets-do-irl

Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)

Language: Python | License: MIT | Stargazers: 677 | Issues: 0

diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Language: Python | License: Apache-2.0 | Stargazers: 742 | Issues: 0

diffwave-sashimi

Implementation of DiffWave and SaShiMi audio generation models

Language: Python | License: MIT | Stargazers: 112 | Issues: 0

DiffWave-Vocoder

PyTorch reimplementation of the DiffWave vocoder: a high-quality, fast, and small neural vocoder.

Language: Python | License: MIT | Stargazers: 85 | Issues: 0

DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Language: Python | License: Apache-2.0 | Stargazers: 2640 | Issues: 0

iSTFTNet-pytorch

iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Language: Python | License: Apache-2.0 | Stargazers: 214 | Issues: 0