xiaozhuo's repositories
3dti_AudioToolkit
3D Tune-In Toolkit is a custom open-source C++ library developed within the EU-funded project 3D Tune-In. The Toolkit provides a high level of realism and immersiveness within binaural 3D audio simulations, while allowing for the emulation of hearing aid devices and of different typologies of hearing loss.
all-in-one
All-In-One Music Structure Analyzer
AntiFake
https://dl.acm.org/doi/10.1145/3576915.3623209
Applio
VITS-based Voice Conversion focused on simplicity, quality and performance.
AudioLDM2
Text-to-Audio/Music Generation
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Bert-VITS2
vits2 backbone with bert
CoMoSVC
CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone
DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
DelayCat
DelayCat Feature Based Delay Line Audio Plugin
dry_sing_multi_eval
Five-Dimensional Acapella Singing Evaluation System based on funASR, include pronunciation, pitch accuracy, rhythm, fluency, and emotion.
FxNorm-automix
FxNorm-Automix - Implementation of automatic music mixing systems. We show how we can use wet music data and repurpose it to train a fully automatic mixing system
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
grok-1
Grok open release
HAAQI-Net
HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.
mustango
Mustango: Toward Controllable Text-to-Music Generation
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
OpenVoice
Instant voice cloning by MyShell.
POPDG
Data and PopDanceSet are coming soon.
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
rvc-eval
Simplified implementation of the RVC (Retrieval-based Voice Conversion) evaluation for easy integration into other projects. Removes unnecessary features and provides a sample CLI for real-time conversion.
stream-vc
An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)
StreamVC
An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".
tinyvc
a lightweight voice conversion
XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web