Rpersie's repositories

espnet

End-to-End Speech Processing Toolkit

Language:ShellLicense:Apache-2.0Stargazers:0Issues:0Issues:0

cwavegan

Conditional WaveGAN: Generating audio samples conditioned on class labels

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Automatic-Music-Transcription

Automatic music transcription performed on jazz solos in the presence of noise using TDNN acoustic model, HMM language model

Language:PythonStargazers:0Issues:0Issues:0

Tensorflow-MultiGPU-VAE-GAN

A single jupyter notebook multi gpu VAE-GAN example with latent space algebra and receptive field visualizations.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

tensorflow-generative-model-collections

Collection of generative models in Tensorflow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pykaldi

A Python wrapper for Kaldi

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

uPIT-for-speech-separation

Speech separation with utterance-level PIT experiments

Language:PythonStargazers:0Issues:0Issues:0

xdecoder

Fast, portable, enhanced ASR decoder

Language:C++Stargazers:0Issues:0Issues:0

pix2pix

Tensorflow implementation of pix2pix(cGAN) for audio source separation

Language:PythonStargazers:0Issues:0Issues:0

rsrgan

Robust Speech Recognition Using Generative Adversarial Networks (GAN)

Language:ShellStargazers:0Issues:0Issues:0

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Language:PerlStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Singing_Voice_Separation_RNN

Singing-Voice Separation From Monaural Recordings Using Deep Recurrent Neural Networks

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

generative_model_speech

Phone generation model/VAE/GAN/VAE+GAN

Language:PythonStargazers:0Issues:0Issues:0

Multi-channel-speech-extraction-using-DNN

A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Listen-Attend-and-Spell-Pytorch

Listen Attend and Spell (LAS) implement in pytorch

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Cross-Domain-CWS

Code for IJCAI 2018 paper "Neural Networks Incorporating Unlabeled and Partially-labeled Data for Cross-domain Chinese Word Segmentation"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

aes-lac-2018

Pytorch code of "A new automatic speech recognizer for Brazilian Portuguese based on deep neural networks and transfer learning" submitted to AES-LAC 2018

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

cublasHgemm-P100

Code for testing the native float16 matrix multiplication performance on Tesla P100 and V100 GPU based on cublasHgemm

Language:CudaLicense:MITStargazers:0Issues:0Issues:0

music-source-separation

Separating singing voice from music based on deep neural networks in Tensorflow

Language:PythonStargazers:0Issues:0Issues:0

PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

speech-denoising-wavenet

A neural network for end-to-end speech denoising

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CycleGAN

Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.

Language:LuaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

SeGAN

SeGAN: Segmenting and Generating the Invisible (https://arxiv.org/pdf/1703.10239.pdf)

Language:LuaLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

CGMM-MVDR

Implement of CGMM-MVDR beamforming

Language:PythonStargazers:0Issues:0Issues:0

nn_mask

multichannel linear filters based on mask estimation neural networks for CHiME4

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Speech_Enhancement_MMSE-STSA

A statistical model-based Speech Enhancement Using MMSE-STSA

Language:PythonStargazers:0Issues:0Issues:0

Weikun-Zhengshuang

over-the-air_speech_recogniztion_attack

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:0Issues:0Issues:0