breizhn

followers

following

stars

Oldenburg

https://uol.de/en/kommunikationsakustik

Nils L. Westhausen's starred repositories

pyenv

Simple Python version management

Language:RoffMIT37959 386 1752

HandBrake

HandBrake's main development repository

Language:CNOASSERTION16745 287 4587

sonnet

TensorFlow-based neural network library

Language:PythonApache-2.09734 422 193

NoiseTorch

Real-time microphone noise suppression on Linux.

Language:GoNOASSERTION9119 71 312

Soundflower

MacOS system extension that allows applications to pass audio to other applications. Soundflower works on macOS Catalina.

Language:Objective-CMIT8835 4190

einops

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Language:PythonMIT8191 69 172

ddsp

DDSP: Differentiable Digital Signal Processing

Language:PythonApache-2.02833 67 174

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonMIT1766 20 180

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Language:PythonNOASSERTION1617 38 149

svoice

We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.

Language:PythonNOASSERTION1185 25 88

nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

Language:C++MIT937 44 205

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language:PythonMIT908 11 105

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonMIT626 24 46

qkeras

QKeras: a quantization deep learning library for Tensorflow Keras

Language:PythonApache-2.0527 30 92

gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

Language:CudaAGPL-3.0466 10 51

scaper

A library for soundscape synthesis and augmentation

Language:PythonBSD-3-Clause369 9 105

AEC-Challenge

AEC Challenge

pyminiaudio

python interface to the miniaudio audio playback, recording, decoding and conversion library

Language:CNOASSERTION159 8 63

RealRIRs

Python loaders for many Real Room Impulse Response databases

Language:Python82 60

PLC-Challenge

This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.

Language:PythonMIT70 8 6

python_kaldi_features

python codes to extract MFCC and FBANK speech features for Kaldi

Language:PythonMIT62 7 4

reseval

Reproducible Subjective Evaluation

Language:PythonMIT55 1 24

se_relativisticgan

Keras framework for speech enhancement using relativistic GANs

Language:PythonMIT53 4 15

GC3

Language:PythonMIT48 3 1

storir

Language:PythonMIT41 10 4

pymushra

pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.

Language:PythonMIT34 3 5

flopco-keras

FLOPs and other statistics COunter for tf.keras neural networks

Language:PythonMIT29 20

MAPS-Scripts

A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.

Language:MATLABGPL-3.021 40

tPLCnet

This repository contains the trained models and some audio samples for the tPLCnet.

Language:PythonMIT19 3 1

SpotifyDataAnalyzer

Analyzer of User Data saved by Spotify

Language:PythonMIT4 20