0417itsuki's repositories

VALL-E-X-Trainer-by-CustomData

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:58Issues:5Issues:0

JEN-1-pytorch

Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.04729)

Language:PythonStargazers:45Issues:0Issues:0

PromptTTS2

[WIP] Unofficial Implementation of Microsoft's PromptTTS2

Language:PythonStargazers:45Issues:5Issues:0

JEN-1-COMPOSER-pytorch

Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.19180)

UTAUTAI

UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)

Language:PythonStargazers:8Issues:2Issues:0

SpeechTokenizer_trainer

Trainer of Speech Tokenizer(https://arxiv.org/abs/2308.16692)

Language:PythonStargazers:5Issues:0Issues:0

music_dataset_generator

This repo is a necessary style prompt for generating music with accompaniment, and for transcribing lyrics.

Stargazers:2Issues:0Issues:0

Control-JBDiff

[WIP] ControlNet for Jukebox-diffusion

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

MAGNET

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

all-in-one

All-In-One Music Structure Analyzer

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

llark

Code for the paper "LLark: A Multimodal Foundation Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

License:NOASSERTIONStargazers:0Issues:0Issues:0

Music_DiT

Diffusion Transformer(DiT) for Music / Audio using pretrained audicraft model

Stargazers:0Issues:1Issues:0

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

License:MITStargazers:0Issues:0Issues:0

soundstorm-speechtokenizer

Implementation of SoundStorm built upon SpeechTokenizer.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

vampnet

music generation with masked transformers!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0