Shylock's repositories
ASR_Theory
语音识别理论、论文和PPT
DNN-HMM-Course
DNN-HMM related Experiments for THUHCSI Course : <Digital Processing of Speech Signals>
SparseSelfAttention
Sparse Attention Mechanism, accepted in KSC 2019
cudafst
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
speechbrain
A PyTorch-based Speech Toolkit
AIF-PyTorch
(NOT Official) Implementation Auto-regressive Integrate-and-Fire (AIF)
asr-decode-simple
从Kaldi中裁剪的轻量级语音识别解码推理框架,目前实现了MFCC+GMM+Viterbi,不依赖OpenFST、OpenBLAS等库
Bayesian_TDNN
This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition"
chinese-xinhua-important
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
GigaS2S
S2ST Data
ksponspeech
Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.
KWS_pytorch
Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM
neurst
Neural end-to-end Speech Translation Toolkit
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
SimilarCharacter
对常用的6700个汉字进行音、形比较,输出音近字、形近字的列表。 # 相近字
speech-to-speech-translation
S2ST 伪标签
speechllm
We Speech Transcript based on LLM, in 300 lines of code.
whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.