João Felipe Santos's repositories
jfsantos.github.io
My research blog
alias-free-torch
Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample
altium-projects
Altium PCBs for guitar effects pedals
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
anticipation
Anticipatory Autoregressive Models
cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
DaisyExamples
Examples for the Daisy Platform
ddim
Denoising Diffusion Implicit Models
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
diffsptk
A differential version of SPTK
HiFiplusplus-pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
im2wav
Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation
ltspice-guitar-pedals
A collection of LTSpice simulation files for popular guitar effects. :guitar: :electron: :musical_note: :chart_with_upwards_trend: Pull requests welcome :smiley:
lyrebird-wav2clip
Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP
NeMo
Neural Modules: a toolkit for conversational AI
NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
nisqalib
This is a Python package for NISQA.
open_flamingo
An open-source framework for training large multimodal models
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
phaseaug
Submitted to ICASSP 2023
sample-generator
Tools to train a generative model on arbitrary audio samples
state-spaces
Sequence Modeling with Structured State Spaces
univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
uxnds
NDS port of the uxn virtual machine
visqol
Perceptual Quality Estimator for speech and audio