zhaoforever's repositories
bitsandbytes
Library for 8-bit optimizers and quantization routines.
bssaec2020
A New Perspective of Auxiliary-Function-Based Independent Component Analysis in Acoustic Echo Cancellation
EasyComDataset
The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmented-reality (AR)-motivated, multi-sensor egocentric world view.
flops-counter.pytorch
FLOPs counter for convolutional networks in the PyTorch framework
HRTF-construction
Code for HRTF database construction
model-compression
Model compression based on PyTorch: (1) quantization: 16/8/4/2-bit (DoReFa; "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference") and ternary/binary weights (TWN/BNN/XNOR-Net); (2) pruning: normal, regular, and group-convolution channel pruning; (3) group convolution structure; (4) batch-normalization folding for quantization
pedalboard
A Python library for adding effects to audio.
PseudoBinaural_CVPR2021
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
python-pesq-1
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
RIR-Generator
Generating room impulse responses
room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
SepStereo_ECCV2020
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
sofamyroom
Room acoustic simulator with a SOFA file loader.
speechbrain
A PyTorch-based Speech Toolkit
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
Subband-Music-Separation
PyTorch: Channel-wise subband input for better voice and accompaniment separation
svoice
A PyTorch implementation of the paper "Voice Separation with an Unknown Number of Multiple Speakers," which presents a new method for separating a mixed audio sequence in which multiple voices speak simultaneously. The method employs gated neural networks trained to separate the voices over multiple processing steps while keeping the speaker in each output channel fixed. A separate model is trained for each possible number of speakers, and the model with the largest number of speakers is used to estimate the actual number of speakers in a given sample. The method greatly outperforms the prior state of the art, which, as the authors show, is not competitive for more than two speakers.
unified2021
A Unified Speech Enhancement Front-End for Online Dereverberation, Acoustic Echo Cancellation, and Source Separation
voicefixer
General Speech Restoration