Beast code in Giters

xiaozhuo's repositories

3dti_AudioToolkit

3D Tune-In Toolkit is a custom open-source C++ library developed within the EU-funded project 3D Tune-In. The Toolkit provides a high level of realism and immersiveness within binaural 3D audio simulations, while allowing for the emulation of hearing aid devices and of different typologies of hearing loss.

Language:C++GPL-3.0000

all-in-one

All-In-One Music Structure Analyzer

MIT000

AntiFake

https://dl.acm.org/doi/10.1145/3576915.3623209

MIT000

Applio

VITS-based Voice Conversion focused on simplicity, quality and performance.

NOASSERTION000

AudioLDM2

Text-to-Audio/Music Generation

NOASSERTION000

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

MIT000

BABE2

AGPL-3.0000

Bert-VITS2

vits2 backbone with bert

Language:PythonAGPL-3.0000

CoMoSVC

CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone

MIT000

DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

MIT000

DelayCat

DelayCat Feature Based Delay Line Audio Plugin

000

dry_sing_multi_eval

Five-Dimensional Acapella Singing Evaluation System based on funASR, include pronunciation, pitch accuracy, rhythm, fluency, and emotion.

GPL-3.0000

FCPE

MIT000

FSPEN

000

FxNorm-automix

FxNorm-Automix - Implementation of automatic music mixing systems. We show how we can use wet music data and repurpose it to train a fully automatic mixing system

MIT000

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

MIT000

grok-1

Grok open release

Apache-2.0000

HAAQI-Net

HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.

000

hilcodec

000

mustango

Mustango: Toward Controllable Text-to-Music Generation

MIT000

NeuCoSVC

000

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Apache-2.0000

OpenVoice

Instant voice cloning by MyShell.

NOASSERTION000

POPDG

Data and PopDanceSet are coming soon.

000

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

MIT000

rvc-eval

Simplified implementation of the RVC (Retrieval-based Voice Conversion) evaluation for easy integration into other projects. Removes unnecessary features and provides a sample CLI for real-time conversion.

000

stream-vc

An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)

000

StreamVC

An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".

MIT000

tinyvc

a lightweight voice conversion

Apache-2.0000

XNNPACK

High-efficiency floating-point neural network inference operators for mobile, server, and Web

NOASSERTION000