XinLiu's repositories
cdr-dereverb
Coherence-based Dereverberation for Speech Enhancement
Causal-U-Net
unofficial PyTorch implementation of 《A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement》
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
AQUA-Tk
AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)
AudioLDM2
Text-to-Audio/Music Generation
Child-ASR-Paper
A list of papers for child ASR
DeepFilterNet
Noise supression using deep filtering
DSP-Digital-Audio-Processors-in-MATLAB
Digital audio processors such as a compressor/limiter, expander/gate, phase vocoder, multi-tap delay, flanger, reverb, dereverb, and others.
FilterBanks_FastPythonImplementation
Filter Banks, Fast Python Implementation
FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
groove2groove
Code for "Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data"
HierSpeechpp
The official implementation of HierSpeech++
MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
MNNKit
MNNKit is a collection of AI solutions for mobile developers, powered by MNN engine.
odas
ODAS: Open embeddeD Audition System
rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
so-vits-svc
SoftVC VITS Singing Voice Conversion
so-vits-svc-4.0-v2
SoftVC VITS Singing Voice Conversion
TinyNeuralNetwork
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
traditional-speech-enhancement
语音增强传统方法
Using-spectral-analysis-and-mapping-to-enhance-the-harmonicity-of-a-sound
Research MATLAB Project which analyses Inharmonic sounds, tries to find its most likely fundamental frequency and harmonic template, and performs spectral mapping to make it sound more harmonic while retaining most of its sound quality.
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
visqol
Perceptual Quality Estimator for speech and audio
voicefixer
General Speech Restoration
voicefixer2
The second generation of VoiceFixer, a toolkit for general speech restoration. *Not affiliated with the original VoiceFixer repo*