moiseshorta

Moisés Horta Valenzuela's repositories

RAVE-Latent-Diffusion

Generate new latent codes for RAVE with Denoising Diffusion models.

Language: Python | License: MIT | 157 15 3

MelSpecVAE

Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis

Language: Jupyter Notebook | License: MIT | 125 4 5

ADD-audio-dataset-downloader

Simple Python CLI script for downloading N hours of audio from YouTube, based on a list of music genres.

Language: Python | License: GPL-3.0 | 27 10

MelGAN-VC

MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms

Language: Jupyter Notebook | License: MIT | 900

ravejs

Language: JavaScript | 400

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language: Python | License: MIT | 300

RAVE

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Language: Python | License: MIT | 300

aphantasia

CLIP + FFT/DWT/RGB = text to image/video

Language: Jupyter Notebook | License: MIT | 100

audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Language: Jupyter Notebook | License: GPL-3.0 | 100

jukebox

Code for the paper "Jukebox: A Generative Model for Music"

Language: Python | License: NOASSERTION | 100

PaddleGAN

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, wav2lip, picture repair, image editing, photo2cartoon, image style transfer, and so on.

Language: Python | License: Apache-2.0 | 100

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language: Python | License: Apache-2.0 | 100

rave_vst

Language: C++ | License: NOASSERTION | 100

unagan

Code for Unconditional Audio Generation with GAN and Cycle Regularization

Language: Python | License: MIT | 100

alis

Aligning Latent and Image Spaces to Connect the Unconnectable

Language: Jupyter Notebook | 000

ddsp_pytorch

Implementation of Differentiable Digital Signal Processing (DDSP) in PyTorch

Language: C | 000

diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Language: Python | License: Apache-2.0 | 000

jukebox-saveopt

License: NOASSERTION | 000

prism-samplernn

Neural sound synthesis with TensorFlow 2

Language: Python | License: MIT | 000

rembg-comfyui-node

Rembg Background Removal node for ComfyUI

Language: Python | 000

sample-generator

Tools to train a generative model on arbitrary audio samples

Language: Jupyter Notebook | License: MIT | 000

spectrogram-inversion

Spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io

Language: Python | 000

stable-audio-tools

Generative models for conditional audio generation

License: MIT | 000

StyleGAN-nada

License: MIT | 000

stylegan2

StyleGAN2 - Official TensorFlow Implementation (Tweaks for latent space manipulation)

Language: Python | License: NOASSERTION | 000

stylegan2-ada-pytorch

StyleGAN2-ADA - Official PyTorch implementation

License: NOASSERTION | 000

stylegan2_eps

StyleGAN2 for practice

License: NOASSERTION | 000

Tacotron2AutoTrim

Automatic trimming and transcription of audio for use with Tacotron 2

000

textual_inversion

Language: Jupyter Notebook | License: MIT | 000

trajectories

Easy generation of latent trajectories

000