Rongzhi Gu's repositories

TasNet-tensorflow

A tensorflow implementation of TasNet (ICASSP 2018)

Language:CSSLicense:NOASSERTIONStargazers:1Issues:1Issues:0

TASNET

Time-domain Audio Separation Network

Language:PythonStargazers:1Issues:1Issues:0
Language:CSSLicense:MITStargazers:0Issues:1Issues:0

ASAM

This is the code&dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]

Language:PythonStargazers:0Issues:1Issues:0

asru2021.github.io

3D spatial features

Language:CSSLicense:MITStargazers:0Issues:2Issues:0

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

CountNet

Deep Neural Network for Speaker Count Estimation

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

cplxmodule

A lightweight extension for pytorch that implements complex-valued layers and bayesian sparsification for them.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

DANet

Dual Attention Network for Scene Segmentation

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

DaNet-Tensorflow

Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

dc_integration

Tight integration of spatial and spectral features for BSS with Deep Clustering embeddings

Language:PythonStargazers:0Issues:0Issues:0

gcc-nmf

Real-time GCC-NMF Blind Speech Separation and Enhancement

License:MITStargazers:0Issues:0Issues:0
Language:MatlabStargazers:0Issues:1Issues:0

huxpro.github.io

My Blog / Jekyll Themes / PWA

Language:CSSLicense:Apache-2.0Stargazers:0Issues:1Issues:0

InnerSelf

Experiments & Papers & Research tips

Stargazers:0Issues:1Issues:0

kaldi

This is now the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:0Issues:1Issues:0

ladder

Ladder network is a deep learning algorithm that combines supervised and unsupervised learning

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Machine-Learning-Ex

My first machine learning exercise.

Language:MatlabStargazers:0Issues:1Issues:0

models

Models and examples built with TensorFlow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

multisensory

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Nabu-MSSS

Code for Multi Speaker Source Separation with neural networks, build with TensorFlow

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

nn-gev

Neural network supported GEV beamformer

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

SE_DCUNet

Deep Complex UNet for speech enhancement, init from "https://github.com/chanil1218/DCUnet.pytorch"

Stargazers:0Issues:0Issues:0

SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Language:PythonStargazers:0Issues:1Issues:0

SpectralNet

Deep network that performs spectral clustering

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

tensorflow-vrnn

A variational recurrent neural network implementation in tensorflow

Language:PythonStargazers:0Issues:1Issues:0

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Language:MatlabStargazers:0Issues:1Issues:0

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Language:PythonStargazers:0Issues:1Issues:0