zhaoforever's repositories

Language:CLicense:Apache-2.0Stargazers:0Issues:1Issues:0

bitsandbytes

Library for 8-bit optimizers and quantization routines.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

bssaec2020

A New Perspective of Auxiliary-Function-Based Independent Component Analysis in Acoustic Echo Cancellation

Stargazers:0Issues:0Issues:0

DTLN-aec

This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

EasyComDataset

The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmented-reality (AR) -motivated multi-sensor egocentric world view.

License:NOASSERTIONStargazers:0Issues:1Issues:0

flops-counter.pytorch

Flops counter for convolutional networks in pytorch framework

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

HRTF-construction

code for HRTF database construction

Language:MATLABStargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

model-compression

model compression based on pytorch (1、quantization: 16/8/4/2 bits(dorefa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、ternary/binary value(twn/bnn/xnor-net);2、 pruning: normal、regular and group convolutional channel pruning;3、 group convolution structure;4、batch-normalization folding for quantization)

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

nnom

A higher-level Neural Network library for microcontrollers.

Language:CLicense:Apache-2.0Stargazers:0Issues:1Issues:0

openMHA

The open Master Hearing Aid (openMHA)

Language:C++License:AGPL-3.0Stargazers:0Issues:1Issues:0

paderwasn

Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pedalboard

A Python library for adding effects to audio.

Language:C++License:GPL-3.0Stargazers:0Issues:1Issues:0

PercepNet

(Work In Progress) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

Language:C++License:BSD-3-ClauseStargazers:0Issues:1Issues:0

PseudoBinaural_CVPR2021

Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:1Issues:0

python-pesq-1

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

Language:CLicense:MITStargazers:0Issues:1Issues:0

RAdam

On the Variance of the Adaptive Learning Rate and Beyond

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

RIR-Generator

Generating room impulse responses

Language:C++License:MITStargazers:0Issues:1Issues:0

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Language:ShellStargazers:0Issues:1Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

SepStereo_ECCV2020

Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:1Issues:0

sofamyroom

Room acoustic simulator with a SOFA file loader.

Language:CLicense:EUPL-1.2Stargazers:0Issues:1Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonStargazers:0Issues:1Issues:0

Subband-Music-Separation

Pytorch: Channel-wise subband input for better voice and accompaniment separation

Stargazers:0Issues:0Issues:0

svoice

We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

unified2021

A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION

Language:MATLABStargazers:0Issues:1Issues:0

voicefixer

General Speech Restoration

Language:PythonLicense:MITStargazers:0Issues:1Issues:0