zw76859420

This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition"

Language:C++010

chinese-xinhua-important

:orange_book: 中华新华字典数据库。包括歇后语，成语，词语，汉字。

Language:PythonMIT010

CTC-OptimizedLoss

Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.

Language:Python000

emoASR

End-to-end MOdeling of ASR (Automatic Speech Recognition)

Language:Python010

gfcc

gfcc features

Language:C++MIT010

GigaS2S

S2ST Data

CC-BY-4.0000

ksponspeech

Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.

Language:PythonMIT010

KWS_pytorch

Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM

Language:Python010

moshi

Apache-2.0000

neurst

Neural end-to-end Speech Translation Toolkit

Language:PythonNOASSERTION000

pkwrap

A pytorch wrapper for LF-MMI training and parallel training in Kaldi

Language:PythonNOASSERTION010

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language:C++Apache-2.0010

SimilarCharacter

对常用的6700个汉字进行音、形比较，输出音近字、形近字的列表。 # 相近字

Language:PythonMIT010

snowfall

Language:PythonApache-2.0010

speech-to-speech-translation

S2ST 伪标签

Language:PythonMIT000

Speech2Unit

Language:Python000

speechllm

We Speech Transcript based on LLM, in 300 lines of code.

Language:PythonApache-2.0000

whisper

Language:Jupyter NotebookMIT010

whisper-finetune

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

Language:PythonMIT000