Emmanuel Infante's starred repositories

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:13961Issues:108Issues:305

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4336Issues:58Issues:141

RePaint

Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022

Palette-Image-to-Image-Diffusion-Models

Unofficial implementation of Palette: Image-to-Image Diffusion Models by Pytorch

Language:PythonLicense:MITStargazers:1459Issues:17Issues:96

DDNM

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

Language:PythonLicense:MITStargazers:1096Issues:27Issues:72

segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Language:PythonLicense:MITStargazers:804Issues:45Issues:79

image-restoration-sde

Image Restoration with Mean-Reverting Stochastic Differential Equations, ICML 2023. Winning solution of the NTIRE 2023 Image Shadow Removal Challenge.

Language:PythonLicense:MITStargazers:532Issues:5Issues:93

Awesome-diffusion-model-for-image-processing

one summary of diffusion-based image processing, including restoration, enhancement, coding, quality assessment

NeuralSVB

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

Language:PythonLicense:GPL-3.0Stargazers:416Issues:13Issues:19

segan_pytorch

Speech Enhancement Generative Adversarial Network in PyTorch

Language:PythonLicense:MITStargazers:375Issues:12Issues:35

DeepAFx-ST

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

Language:PythonLicense:NOASSERTIONStargazers:352Issues:12Issues:9

DiffPIR

"Denoising Diffusion Models for Plug-and-Play Image Restoration", Yuanzhi Zhu, Kai Zhang, Jingyun Liang, Jiezhang Cao, Bihan Wen, Radu Timofte, Luc Van Gool.

Language:PythonLicense:MITStargazers:350Issues:11Issues:39

classifier-free-guidance-pytorch

Implementation of Classifier Free Guidance in Pytorch, with emphasis on text conditioning, and flexibility to include multiple text embedding models

Language:PythonLicense:MITStargazers:302Issues:8Issues:4

WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

Language:Jupyter NotebookLicense:MITStargazers:256Issues:11Issues:53

GenerativeDiffusionPrior

Generative Diffusion Prior for Unified Image Restoration and Enhancement (CVPR2023)

Language:ShellLicense:Apache-2.0Stargazers:249Issues:5Issues:40

CDiffuSE

Conditional Diffusion Probabilistic Model for Speech Enhancement

Language:PythonLicense:Apache-2.0Stargazers:198Issues:9Issues:13

griffin_lim

Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram.

Language:PythonLicense:BSD-3-ClauseStargazers:168Issues:5Issues:1

IRM-based-Speech-Enhancement-using-LSTM

Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM

Language:PythonLicense:MITStargazers:111Issues:3Issues:5
Language:Jupyter NotebookLicense:MITStargazers:78Issues:5Issues:7

fast-ode

Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023

Awesome-Bandwidth-Extension

This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purpose of this repo is to organize the world’s resources for speech bandwidth extension, and make them universally accessible and useful.

fakeflac

A command-line tool to detect "fake" FLAC files

Language:PythonLicense:MITStargazers:35Issues:5Issues:7

FLAD

Fake Lossless Audio Detector

Language:PythonLicense:Apache-2.0Stargazers:34Issues:2Issues:3
Language:PythonLicense:MITStargazers:23Issues:2Issues:0

true-bitrate

Little command-line tool to find out true audio file bit rate

Language:PythonStargazers:14Issues:2Issues:0

LA-2A

Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".

Language:PythonLicense:MPL-2.0Stargazers:13Issues:1Issues:0

ICASSP-2024-BEAFX-using-DDSP

Github repository for the paper accepted in ICASSP 2024 : Blind estimation of audio effects using an auto-encoder approach and differentiable signal processing

Language:Jupyter NotebookStargazers:10Issues:1Issues:0

FakeFLac-Lossless-audio-checker

Python GUI Lossless audio checker

Language:PythonLicense:GPL-3.0Stargazers:9Issues:2Issues:0

ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0