YIZHAO's repositories
audio_augment
A tool/script for batch speech data enhancement with speed/volume/RIRS/MUSAN
xvector-cnceleb
kaldi based x-vector trained on Cn-Celeb
CVTE_chain_model_finetune
finetune the chain model based on cvte open source model without traing any GMM for frame alignment
kaldi-asr-server
this is a kaldi based flask asr service with server and client,which supports Multi-threaded concurrent
Classical-Speech-Algorithms
Classical speech recognition and speaker recognition algorithms
OpenTransformer
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
text_classifer
基于pytroch的文本分类相关算法实现
Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
rnn-transducer
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain