Beast code in Giters

5iding's repositories

singing_transcription_ICASSP2021

The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"

000

auditory-slow-fast

Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch

NOASSERTION000

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

MIT000

KoSpeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Apache-2.0000

awesome-speech-recognition-speech-synthesis-papers

Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling

MIT000

end2end-asr-pytorch

End-to-End Automatic Speech Recognition on PyTorch

MIT000

deep_avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

MIT000

Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

MIT000

DNS-Challenge

This repo contains the scripts, models, and required files for the ICASSP 2021 Deep Noise Suppression (DNS) Challenge.

CC-BY-4.0000

PhoneFortifiedPerceptualLoss

Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement

MIT000

EHNet

This in an implementation of EHNet in PyTorch and PyTorch Lightning. EHNet is a convolutional-recurrent neural network for single channel speech enhancement.

000

Self-Supervised-Speech-Pretraining-and-Representation-Learning

Official implementation of the S3PRL toolkit: self-supervised pre-training of Mockingjay, TERA, AALBERT, APC, and more to come. With easy-to-use standard downstream evaluation scripts including phone classification, speaker recognition, and ASR. (All in Pytorch)

MIT000

espnet

End-to-End Speech Processing Toolkit

Apache-2.0000

suggested-notation-for-machine-learning

This introduces a suggestion of mathematical notation protocol for machine learning.

000

LAS_Mandarin_PyTorch

Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)

MIT000

WavAugment

A library for speech data augmentation in time-domain

MIT000

Tensor-Train-Neural-Network

Jun and Huck's Tensor-Train Network Toolbox

000

libri-light

dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.

MIT000

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

000

pase

Problem Agnostic Speech Encoder

MIT000

wav2letter.pytorch

A fully convolution-network for speech-to-text, built on pytorch.

MIT000

Data-Science-Notes

数据科学的笔记以及资料搜集

000

i-revnet-based-time-frequency-transform

MIT000

audio_visual_speech_enhancement

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

000

magenta

Magenta: Music and Art Generation with Machine Intelligence

Apache-2.0000

Wave-U-Net-for-Speech-Enhancement-1

Implement [Wave-U-Net](https://arxiv.org/abs/1806.03185) by PyTorch, and migrate it to the speech enhancement area.

MIT000

CPC_audio

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

MIT000

5iding

5iding's repositories

nlp_paper_study

singing_transcription_ICASSP2021

auditory-slow-fast

leaf-audio

deepspeech.pytorch

KoSpeech

awesome-speech-recognition-speech-synthesis-papers

end2end-asr-pytorch

deep_avsr

Awesome-Speech-Enhancement

DNS-Challenge

PhoneFortifiedPerceptualLoss

AEC-Challenge

EHNet

Self-Supervised-Speech-Pretraining-and-Representation-Learning

espnet

suggested-notation-for-machine-learning

LAS_Mandarin_PyTorch

WavAugment

Tensor-Train-Neural-Network

libri-light

pytorch-kaldi

pase

wav2letter.pytorch

Data-Science-Notes

i-revnet-based-time-frequency-transform

audio_visual_speech_enhancement

magenta

Wave-U-Net-for-Speech-Enhancement-1

CPC_audio