iwaterxt's repositories
voiceprint
text-independent speaker identification
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
CAT
A CRF-based ASR Toolkit
cmake-demo
《CMake入门实战》源码
compound-loss-pytorch
Compound loss for PyTorch
DeepSpeech
A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.
E2E-ASR
PyTorch Implementations for End-to-End Automatic Speech Recognition
espnet
End-to-End Speech Processing Toolkit
gdrive.sh
Download a file or a folder easily. curl gdrive.sh | bash -s $fileid
iwaterxt.github.io
Template for a blog hosted on GitHub Pages
kaldi
This is now the official location of the Kaldi project.
kaldi-dnn-ali-gop
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.
kaldi-gop
Computes the Goodness of Pronunciation (GOP). Bases on Kaldi.
kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
kaldi-io-for-python
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
neural_sp
End-to-end ASR/LM implementation with pytorch.
nn-vad
simple dnn based vad
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
python_kaldi_features
python codes to extract MFCC and FBANK speech features for Kaldi
Socket-Programming-Python
Client Server running code described with comments here.
sparse_image_warp_pytorch
Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for Automatic Speech Recognition https://arxiv.org/abs/1904.08779
wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit
xdecoder
Fast, portable, enhanced ASR decoder