jfsantos

@NVIDIA

Vancouver, BC, Canada

http://www.seaandsailor.com

Organizations

JuliaDSP

MuSAELab

João Felipe Santos's repositories

uxnatr

Port of the uxn virtual machine to Atari computers (800/1200XL)

Language: C, License: MIT, 7 10

ataritools

Tools to convert text files from ASCII to ATASCII

Language: C, License: Apache-2.0, 500

jfsantos.github.io

My research blog

Language: HTML, 3 20

alias-free-torch

Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample

Language: Python, License: Apache-2.0, 100

altium-projects

Altium PCBs for guitar effects pedals

License: MIT, 100

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language: Python, License: MIT, 100

CleanUNet

Official PyTorch Implementation of CleanUNet (ICASSP 2022)

Language: Python, License: MIT, 100

cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

Language: Python, License: MIT, 000

DaisyExamples

Examples for the Daisy Platform

Language: C, License: MIT, 000

DaisySP

A Powerful, Open Source DSP Library in C++

License: MIT, 000

ddim

Denoising Diffusion Implicit Models

Language: Python, License: MIT, 000

DeepAFx

Third-party audio effects plugins as differentiable layers within deep neural networks.

Language: Jupyter Notebook, License: NOASSERTION, 000

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.

Language: Python, License: NOASSERTION, 000

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language: Python, License: MIT, 000

diffsptk

A differentiable version of SPTK

Language: Python, License: Apache-2.0, 000

HiFiplusplus-pytorch

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

License: MIT, 000

ltspice-guitar-pedals

A collection of LTSpice simulation files for popular guitar effects. Pull requests welcome.

Language: AGS Script, 000

lyrebird-wav2clip

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Language: Python, License: MIT, 000

minimal_diffusion_models

000

NeMo

Neural Modules: a toolkit for conversational AI

Language: Jupyter Notebook, License: Apache-2.0, 000

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language: Python, License: MIT, 000

open_flamingo

An open-source framework for training large multimodal models

Language: Python, 000

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

License: MIT, 000

phaseaug

Submitted to ICASSP 2023

Language: Python, License: BSD-3-Clause, 000

sample-generator

Tools to train a generative model on arbitrary audio samples

Language: Jupyter Notebook, License: MIT, 000

state-spaces

Sequence Modeling with Structured State Spaces

Language: Python, License: Apache-2.0, 000

terrarium-stand

A template repository for creating effects with the terrarium from PedalPCB.

Language: C++, License: MIT, 000

univnet

Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

Language: Python, License: BSD-3-Clause, 000

uxnds

NDS port of the uxn virtual machine

Language: C, License: MIT, 000

visqol

Perceptual Quality Estimator for speech and audio

License: Apache-2.0, 000