Beast code in Giters

C00reNUT's repositories

ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

License: MIT

AI-Song-Cover-RVC

All-in-one version: YouTube WAV download, vocal separation, audio splitting, training, and inference using Google Colab

Language: Jupyter Notebook

AICoverGen

A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.

Language: Python, License: MIT

aria-amt

Efficient and robust implementation of seq-to-seq automatic piano transcription.

License: Apache-2.0

auto_dataset_tts

A simple script to prepare a dataset for training the Tortoise TTS model via https://git.ecker.tech/mrq/ai-voice-cloning

Language: Python

clapper

Clapper.app, the video editor designed for the age of AI cinema

Language: TypeScript, License: GPL-3.0

ComfyUI-SaveAsScript

A powerful tool that translates ComfyUI workflows into executable Python code - now as a UI button.

Language: Python, License: MIT

courses

Anthropic's educational courses

License: NOASSERTION

ctc-forced-aligner

Text to speech alignment using CTC forced alignment


e2_tts


finetune-musicgen

A notebook containing scripts, documentation, and examples for fine-tuning MusicGen


gpt-author

Language: Jupyter Notebook, License: MIT

InfiniteMusicGen

Create seamless, infinite music generation using the MusicGen model


joy-caption-jupyter


LLaVA-OneVision-jupyter

Language: Jupyter Notebook

Mistral-7B-south-park-fanatic

Training and data generation scripts needed to train a South Park fanatic AI

Language: Python, License: MIT

narrator

David Attenborough narrates your life

Language: Python

Pandrator

Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with subtitles and more) using local models (XTTS, Silero or VoiceCraft), plus voice cloning, LLM pre-processing, RVC enhancement, and automatic evaluation

License: AGPL-3.0

resemble-enhance

AI-powered speech denoising and enhancement

License: MIT

speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

License: MIT

SpeechMOS

Easy-to-Use Speech MOS predictors

License: MIT

stable-audio-controlnet

Fine-tune Stable Audio Open with DiT ControlNet.

License: NOASSERTION

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Language: Python

StyleTTS2FineTune

Language: Python

Tile-Upscaler

Image upscaler with Tile ControlNet, fully integrated in Hugging Face Diffusers

Language: Python, License: Apache-2.0

Train_Hifigan_XTTS

This is an implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS.


whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Language: Python, License: AGPL-3.0

xtts-finetune-tests

In this repository I will be running various experiments on fine-tuning different parts of XTTS.

Language: Python, License: MIT

youtube-transcript-api

This is a Python API that lets you get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles, and it requires neither an API key nor a headless browser, unlike other Selenium-based solutions!

License: MIT

ziplora-pytorch

Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"

License: MIT