Moplast

Rongzhi Gu's repositories

TasNet-tensorflow

A tensorflow implementation of TasNet (ICASSP 2018)

Language:Python15 1 3

moplast.github.io

Language:CSSNOASSERTION1 10

TASNET

Time-domain Audio Separation Network

Language:Python1 10

3d.github.io

Language:CSSMIT010

ASAM

This is the code&dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]

Language:Python010

asru2021.github.io

3D spatial features

Language:CSSMIT020

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

Language:PythonMIT010

CountNet

Deep Neural Network for Speaker Count Estimation

Language:PythonMIT010

cplxmodule

A lightweight extension for pytorch that implements complex-valued layers and bayesian sparsification for them.

Language:PythonMIT010

DANet

Dual Attention Network for Scene Segmentation

Language:PythonNOASSERTION010

DaNet-Tensorflow

Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"

Language:PythonMIT010

dc_integration

Tight integration of spatial and spectral features for BSS with Deep Clustering embeddings

Language:Python000

gcc-nmf

Real-time GCC-NMF Blind Speech Separation and Enhancement

MIT000

hopcs

Language:Matlab010

huxpro.github.io

My Blog / Jekyll Themes / PWA

Language:CSSApache-2.0010

InnerSelf

Experiments & Papers & Research tips

010

kaldi

This is now the official location of the Kaldi project.

Language:ShellNOASSERTION010

ladder

Ladder network is a deep learning algorithm that combines supervised and unsupervised learning

Language:PythonMIT010

Machine-Learning-Ex

My first machine learning exercise.

Language:Matlab010

models

Models and examples built with TensorFlow

Language:PythonApache-2.0000

multisensory

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

Language:PythonApache-2.0000

Nabu-MSSS

Code for Multi Speaker Source Separation with neural networks, build with TensorFlow

Language:PythonMIT010

nn-gev

Neural network supported GEV beamformer

Language:PythonNOASSERTION010

SE_DCUNet

Deep Complex UNet for speech enhancement, init from "https://github.com/chanil1218/DCUnet.pytorch"

000

SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Language:Python010

SpectralNet

Deep network that performs spectral clustering

Language:PythonMIT010

tensorflow-vrnn

A variational recurrent neural network implementation in tensorflow

Language:Python010

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Language:Matlab010

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Language:Python010