cohendrake's starred repositories

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:52021Issues:937Issues:1078

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:30106Issues:428Issues:4181

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24986Issues:193Issues:3987

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonLicense:Apache-2.0Stargazers:4057Issues:90Issues:1024

s4

Structured state space sequence models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2343Issues:52Issues:133

DeepFilterNet

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:2329Issues:32Issues:274

nnom

A higher-level Neural Network library for microcontrollers.

Language:CLicense:Apache-2.0Stargazers:892Issues:45Issues:130

sox

SoX, Swiss Army knife of sound processing

Language:CLicense:NOASSERTIONStargazers:689Issues:28Issues:0

setk

Tools for Speech Enhancement integrated with Kaldi

Language:PythonLicense:Apache-2.0Stargazers:394Issues:22Issues:10
Language:C++License:Apache-2.0Stargazers:385Issues:36Issues:13

diffq

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.

Language:PythonLicense:NOASSERTIONStargazers:230Issues:10Issues:8

MetaAF

Control adaptive filters with neural networks.

nn-gev

Neural network supported GEV beamformer

Language:PythonLicense:NOASSERTIONStargazers:193Issues:14Issues:13

MTFAA-Net

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

Language:PythonLicense:MITStargazers:186Issues:7Issues:12

Noise2Noise-audio_denoising_without_clean_training_data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.

Language:Jupyter NotebookLicense:MITStargazers:172Issues:7Issues:6

FAST-RIR

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Language:PythonLicense:AGPL-3.0Stargazers:150Issues:6Issues:3

pygsound

Impulse response generation based on state-of-the-art geometric sound propagation engine.

Language:C++License:NOASSERTIONStargazers:140Issues:1Issues:9

AFRCNN-For-Speech-Separation

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

Language:PythonLicense:MITStargazers:134Issues:6Issues:3

Neural-Speech-Dereverberation

Machine and Deep Learning models for speech dereverberation

Language:PythonLicense:GPL-3.0Stargazers:103Issues:2Issues:4

Bayesian-Pitch-Tracking-Using-Harmonic-model

Pitch detection and pitch tracking, voicing unvoicing detection (VAD),基音检测

Language:MATLABLicense:GPL-2.0Stargazers:86Issues:2Issues:1

PhoneFortifiedPerceptualLoss

Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement

Language:PythonLicense:MITStargazers:68Issues:4Issues:5

Awesome-Bandwidth-Extension

This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purpose of this repo is to organize the world’s resources for speech bandwidth extension, and make them universally accessible and useful.

VAD-LTSD

Efficient voice activity detection algorithm using long-term speech information

Emergency-Vehicle-Detection

Python implementation of papers on emergency vehicle detection using audio signals

Language:Jupyter NotebookStargazers:44Issues:6Issues:3

RIR-Generator

为音频加混响的代码

HarmonicLowering

Implementation of Harmonic Convolution by Harmonic Lowering

Language:PythonLicense:NOASSERTIONStargazers:17Issues:1Issues:3

dcase23_task5_scl

System that ranked 2nd in DCASE 2023 Challenge Task 5: Few-shot Bioacoustic Event Detection

Language:PythonStargazers:4Issues:0Issues:0