annn's repositories
AmphionPublic
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
audiolm-pytorch
Implementation of AudioLM, a state-of-the-art language modeling approach to audio generation from Google Research, in PyTorch
bark
🔊 Text-Prompted Generative Audio Model
espnet
End-to-End Speech Processing Toolkit
muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in PyTorch
musicgen-dreamboothing
Fine-tune your own MusicGen with LoRA
voicefixer_main
General Speech Restoration