xiaozhuo's repositories

3dti_AudioToolkit

3D Tune-In Toolkit is a custom open-source C++ library developed within the EU-funded project 3D Tune-In. The Toolkit provides a high level of realism and immersiveness within binaural 3D audio simulations, while allowing for the emulation of hearing aid devices and of different typologies of hearing loss.

Language:C++License:GPL-3.0Stargazers:0Issues:0Issues:0

all-in-one

All-In-One Music Structure Analyzer

License:MITStargazers:0Issues:0Issues:0

AntiFake

https://dl.acm.org/doi/10.1145/3576915.3623209

License:MITStargazers:0Issues:0Issues:0

Applio

VITS-based Voice Conversion focused on simplicity, quality and performance.

License:NOASSERTIONStargazers:0Issues:0Issues:0

AudioLDM2

Text-to-Audio/Music Generation

License:NOASSERTIONStargazers:0Issues:0Issues:0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

License:MITStargazers:0Issues:0Issues:0
License:AGPL-3.0Stargazers:0Issues:0Issues:0

Bert-VITS2

vits2 backbone with bert

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

CoMoSVC

CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone

License:MITStargazers:0Issues:0Issues:0

DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

License:MITStargazers:0Issues:0Issues:0

DelayCat

DelayCat Feature Based Delay Line Audio Plugin

Stargazers:0Issues:0Issues:0

dry_sing_multi_eval

Five-Dimensional Acapella Singing Evaluation System based on funASR, include pronunciation, pitch accuracy, rhythm, fluency, and emotion.

License:GPL-3.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

FxNorm-automix

FxNorm-Automix - Implementation of automatic music mixing systems. We show how we can use wet music data and repurpose it to train a fully automatic mixing system

License:MITStargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

License:MITStargazers:0Issues:0Issues:0

grok-1

Grok open release

License:Apache-2.0Stargazers:0Issues:0Issues:0

HAAQI-Net

HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

mustango

Mustango: Toward Controllable Text-to-Music Generation

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

License:Apache-2.0Stargazers:0Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

License:NOASSERTIONStargazers:0Issues:0Issues:0

POPDG

Data and PopDanceSet are coming soon.

Stargazers:0Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

License:MITStargazers:0Issues:0Issues:0

rvc-eval

Simplified implementation of the RVC (Retrieval-based Voice Conversion) evaluation for easy integration into other projects. Removes unnecessary features and provides a sample CLI for real-time conversion.

Stargazers:0Issues:0Issues:0

stream-vc

An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)

Stargazers:0Issues:0Issues:0

StreamVC

An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".

License:MITStargazers:0Issues:0Issues:0

tinyvc

a lightweight voice conversion

License:Apache-2.0Stargazers:0Issues:0Issues:0

XNNPACK

High-efficiency floating-point neural network inference operators for mobile, server, and Web

License:NOASSERTIONStargazers:0Issues:0Issues:0