zql's repositories
speech-music-detection
tensorflow for speech-music-detection task,acc 96%+
audio_tagging_onnx
Easy to use Audio Tagging in Onnx
ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
all-in-one
All-In-One Music Structure Analyzer
bark
🔊 Text-Prompted Generative Audio Model
beat_tracker
Beat tracker assignment for Music Informatics
Bert-VITS2
vits2 backbone with bert
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
MetaGPT
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
OpenVoice
Instant voice cloning by MyShell
RepCodec
Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization
Sentiment-classification
LSTM Sentiment-classification
soundstorm-speechtokenizer
Implementation of SoundStorm built upon SpeechTokenizer.
SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
CED_audiotagging
Source code for Consistent ensemble distillation for audio tagging
chorus-detection
A machine learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input a YouTube link and utilize a pre-trained CRNN model to detect chorus sections from a song on YouTube
FoleyCrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
grok-1
Grok open release
gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Kolors-TensorRT-libtorch
Kolors with TensorRT and libtorch
MoneyPrinter
Automate Creation of YouTube Shorts using MoviePy.
odeval
Benchmarking the accelerated generation quality of OneDiff.
PaddleVideo
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.