Sundy1219's repositories
kaldifeat
Kaldi-compatible feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
rasr
The RWTH ASR Toolkit.
python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
GigaSpeech
Large, modern dataset for speech recognition
warp-rnnt
CUDA-Warp RNN-Transducer
TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
CAT
A CRF-based ASR Toolkit
speechbrain
A PyTorch-based Speech Toolkit
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
athena
an open-source implementation of sequence-to-sequence based speech processing engine
tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
neural_sp
End-to-end ASR/LM implementation with PyTorch
Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
pychain
PyTorch implementation of LF-MMI for End-to-end ASR
TENet-kws
Tensorflow implementation of "Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution"(INTERSPEECH 2020)
CTCWordBeamSearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
SkipConvNet
Speech Dereverberation using Fully Convolutional Networks
cat_tensorflow
Crf-based Asr Toolkit with TensorFlow implement
ESC-50
ESC-50: Dataset for Environmental Sound Classification
ctc_decoders
Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model
fast-ctc-decode
Blitzing Fast CTC Beam Search Decoder
KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
tf-code-acoustics
it's a train acoustics model code lib
Speech-enhancement
Deep learning for audio denoising
minimize-chain-decoder
Minimize kaldi nnet3 chain decoder