Rpersie

followers

following

stars

Rpersie's repositories

espnet

End-to-End Speech Processing Toolkit

Language:ShellApache-2.0000

cwavegan

Conditional WaveGAN: Generating audio samples conditioned on class labels

Language:PythonMIT000

Automatic-Music-Transcription

Automatic music transcription performed on jazz solos in the presence of noise using TDNN acoustic model, HMM language model

Language:Python000

Tensorflow-MultiGPU-VAE-GAN

A single jupyter notebook multi gpu VAE-GAN example with latent space algebra and receptive field visualizations.

Language:Jupyter NotebookMIT000

char-aware

Language:PythonMIT100

tensorflow-generative-model-collections

Collection of generative models in Tensorflow

Language:PythonApache-2.0000

pykaldi

A Python wrapper for Kaldi

Language:PythonApache-2.0000

uPIT-for-speech-separation

Speech separation with utterance-level PIT experiments

Language:Python000

xdecoder

Fast, portable, enhanced ASR decoder

Language:C++000

pix2pix

Tensorflow implementation of pix2pix(cGAN) for audio source separation

Language:Python000

rsrgan

Robust Speech Recognition Using Generative Adversarial Networks (GAN)

Language:Shell000

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Language:Perl000

Joint-GAN

Language:Python000

Singing_Voice_Separation_RNN

Singing-Voice Separation From Monaural Recordings Using Deep Recurrent Neural Networks

Language:PythonMIT000

generative_model_speech

Phone generation model/VAE/GAN/VAE+GAN

Language:Python000

Multi-channel-speech-extraction-using-DNN

A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction

Language:PythonMIT000

Listen-Attend-and-Spell-Pytorch

Listen Attend and Spell (LAS) implement in pytorch

Language:Jupyter NotebookMIT000

Cross-Domain-CWS

Code for IJCAI 2018 paper "Neural Networks Incorporating Unlabeled and Partially-labeled Data for Cross-domain Chinese Word Segmentation"

Language:PythonMIT000

aes-lac-2018

Pytorch code of "A new automatic speech recognizer for Brazilian Portuguese based on deep neural networks and transfer learning" submitted to AES-LAC 2018

Language:PythonMIT000

cublasHgemm-P100

Code for testing the native float16 matrix multiplication performance on Tesla P100 and V100 GPU based on cublasHgemm

Language:CudaMIT000

music-source-separation

Separating singing voice from music based on deep neural networks in Tensorflow

Language:Python000

PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Language:PythonMIT000

speech-denoising-wavenet

A neural network for end-to-end speech denoising

Language:PythonMIT000

CycleGAN

Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.

Language:LuaNOASSERTION000

SeGAN

SeGAN: Segmenting and Generating the Invisible (https://arxiv.org/pdf/1703.10239.pdf)

Language:LuaNOASSERTION000

pit-speech-separation

Language:Python000

CGMM-MVDR

Implement of CGMM-MVDR beamforming

Language:Python000

nn_mask

multichannel linear filters based on mask estimation neural networks for CHiME4

Language:PythonNOASSERTION000

Speech_Enhancement_MMSE-STSA

A statistical model-based Speech Enhancement Using MMSE-STSA

Language:Python000

Weikun-Zhengshuang

over-the-air_speech_recogniztion_attack

Language:Jupyter NotebookBSD-2-Clause000