0417keito

0417itsuki's repositories

VALL-E-X-Trainer-by-CustomData

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

Language: Python, License: MIT, 58 50

JEN-1-pytorch

Unofficial implementation of JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models (https://arxiv.org/abs/2308.04729)

Language: Python, 4500

PromptTTS2

[WIP] Unofficial implementation of Microsoft's PromptTTS2

Language: Python, 45 50

JEN-1-COMPOSER-pytorch

Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.19180)

Language: Python, 25 3 1

UTAUTAI

UTAUTAI (Unrestricted Tune Automated Technology Artificial Intelligence)

Language: Python, 8 20

SpeechTokenizer_trainer

Trainer for Speech Tokenizer (https://arxiv.org/abs/2308.16692)

Language: Python, 500

music_dataset_generator

This repo generates the style prompts needed for generating music with accompaniment, and transcribes lyrics.

200

Control-JBDiff

[WIP] ControlNet for Jukebox-diffusion

Language: Python, License: MIT, 100

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python, License: MIT, 100