emmanuelinfante

Emmanuel Infante's starred repositories

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT13961 108 305

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT4336 58 141

RePaint

Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022

Language:Python1889 42 55

Palette-Image-to-Image-Diffusion-Models

Unofficial implementation of Palette: Image-to-Image Diffusion Models by Pytorch

Language:PythonMIT1459 17 96

DDNM

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

Language:PythonMIT1096 27 72

segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Language:PythonMIT804 45 79

image-restoration-sde

Image Restoration with Mean-Reverting Stochastic Differential Equations, ICML 2023. Winning solution of the NTIRE 2023 Image Shadow Removal Challenge.

Language:PythonMIT532 5 93

Awesome-diffusion-model-for-image-processing

one summary of diffusion-based image processing, including restoration, enhancement, coding, quality assessment

Apache-2.0529 15 3

NeuralSVB

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

Language:PythonGPL-3.0416 13 19

segan_pytorch

Speech Enhancement Generative Adversarial Network in PyTorch

Language:PythonMIT375 12 35

DeepAFx-ST

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

Language:PythonNOASSERTION352 12 9

DiffPIR

"Denoising Diffusion Models for Plug-and-Play Image Restoration", Yuanzhi Zhu, Kai Zhang, Jingyun Liang, Jiezhang Cao, Bihan Wen, Radu Timofte, Luc Van Gool.

Language:PythonMIT350 11 39

classifier-free-guidance-pytorch

Implementation of Classifier Free Guidance in Pytorch, with emphasis on text conditioning, and flexibility to include multiple text embedding models

Language:PythonMIT302 8 4

WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

Language:Jupyter NotebookMIT256 11 53

GenerativeDiffusionPrior

Generative Diffusion Prior for Unified Image Restoration and Enhancement (CVPR2023)

Language:ShellApache-2.0249 5 40

CDiffuSE

Conditional Diffusion Probabilistic Model for Speech Enhancement

Language:PythonApache-2.0198 9 13

griffin_lim

Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram.

Language:PythonBSD-3-Clause168 5 1

IRM-based-Speech-Enhancement-using-LSTM

Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM

Language:PythonMIT111 3 5

diffwave-sr

Language:Jupyter NotebookMIT78 5 7

fast-ode

Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023

Language:Python74 2 2

audio-inpainting-diffusion

Language:Jupyter NotebookMIT61 7 2

Awesome-Bandwidth-Extension

This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purpose of this repo is to organize the world’s resources for speech bandwidth extension, and make them universally accessible and useful.

MIT59 6 2

emmanuelinfante