Agrover112 / Kaldi-notes

Resources helpful for Kaldi

Kaldi Notes

This repo contains some of the Kaldi material I love to refer back to. I add important links and lectures that help in using Kaldi. My notes are also included in this repository as PDFs. Resources and information on the Kaldi toolkit are spread out across the internet, and although many similar repositories exist, their links are often outdated as of 2022. This repository serves as a list of the great links I found online that can be helpful for learning Kaldi and its internal workings. It should help demystify how Kaldi works.

I won't accept Pull Requests that only fix spelling errors. I consider it the responsibility of other users to raise meaningful Pull Requests that help with the cause of learning Kaldi.

Kaldi Lectures

These links point to lectures on Kaldi given by Dan Povey.

Text Preprocessing

Text preprocessing is an important part of ASR, whether you are preparing transcripts from raw data or cleaning transcripts before building lexicon files. Doing the preprocessing on Linux can be helpful and prevents errors further downstream in the pipeline.
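As a rough illustration of the kind of cleaning meant here, the sketch below normalizes lines in the `<utterance-id> <transcript>` layout used by Kaldi `text` files and collects the word list a lexicon would need to cover. The function names `normalize_line` and `vocabulary`, and the choice to keep only lowercase ASCII letters, digits, and apostrophes, are assumptions made for this example; adapt them to your language and lexicon. Python is used here for readability, and the same steps can be done with standard Linux text tools.

```python
import re

# Characters outside this set are replaced by a space.
# Assumption: English transcripts where a-z, 0-9 and the apostrophe suffice.
_DISALLOWED = re.compile(r"[^a-z0-9' ]+")


def normalize_line(line: str) -> str:
    """Normalize one '<utt-id> <transcript>' line, leaving the utt-id untouched."""
    parts = line.strip().split(maxsplit=1)
    if not parts:
        return ""
    utt_id = parts[0]
    words = parts[1] if len(parts) > 1 else ""
    words = words.lower()                # one consistent case
    words = _DISALLOWED.sub(" ", words)  # drop punctuation, tabs, stray symbols
    words = " ".join(words.split())      # collapse repeated whitespace
    return f"{utt_id} {words}".rstrip()


def vocabulary(lines):
    """Return the sorted set of words seen after normalization."""
    vocab = set()
    for line in lines:
        vocab.update(normalize_line(line).split()[1:])
    return sorted(vocab)


if __name__ == "__main__":
    raw = ["utt1  Hello,   World! It's  fine.", "utt2 hello there"]
    for line in raw:
        print(normalize_line(line))
    print(vocabulary(raw))
```

Any word in this vocabulary that is missing from the lexicon is an out-of-vocabulary word, so checking the list early is one way to catch problems before they surface later in the pipeline.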

Kaldi Miscellaneous

Theory

Some links related to theory.

WFST

Maximum Likelihood Estimation

Signal Processing

Decoding

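The decoding process is important to understand, as it is responsible for the final output. Kaldi builds its decoding graph (HCLG) by composing weighted finite-state transducers (WFSTs); lattices are what the decoder produces when it searches that graph. I think of composition as loosely analogous to a dot product of tensors: the output labels of one transducer are matched against the input labels of the next, much like a shared index being contracted.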
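To make the idea of composition concrete, here is a minimal sketch of composing two toy transducers in plain Python. It is not how Kaldi or OpenFst implement it: the dictionary layout, the `compose` function, and the labels are all hypothetical, epsilon labels are not handled, and weights are simply added, which assumes the tropical semiring (costs such as negative log probabilities).

```python
from collections import deque

# A toy transducer is a dict with:
#   "start":  start state
#   "finals": set of final states
#   "arcs":   list of (src, input_label, output_label, weight, dst)


def compose(a, b):
    """Compose two epsilon-free transducers; arc weights are added."""
    start = (a["start"], b["start"])
    arcs, finals = [], set()
    seen = {start}
    queue = deque([start])
    while queue:
        sa, sb = queue.popleft()
        # A paired state is final only if both component states are final.
        if sa in a["finals"] and sb in b["finals"]:
            finals.add((sa, sb))
        for src_a, ilabel, mid_a, w1, dst_a in a["arcs"]:
            if src_a != sa:
                continue
            for src_b, mid_b, olabel, w2, dst_b in b["arcs"]:
                # Match the output label of `a` with the input label of `b`.
                if src_b != sb or mid_b != mid_a:
                    continue
                nxt = (dst_a, dst_b)
                arcs.append(((sa, sb), ilabel, olabel, w1 + w2, nxt))
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
    return {"start": start, "finals": finals, "arcs": arcs}


if __name__ == "__main__":
    first = {"start": 0, "finals": {1},
             "arcs": [(0, "a", "x", 0.5, 1), (0, "b", "y", 0.25, 1)]}
    second = {"start": 0, "finals": {1},
              "arcs": [(0, "x", "X", 1.0, 1)]}
    for arc in compose(first, second)["arcs"]:
        print(arc)
```

Only the `a:x` arc survives, because `second` has no arc that reads `y`. The intermediate label plays the role of the shared index: it is matched and then disappears from the result, which is where the dot-product analogy comes from.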

Common Kaldi Errors & Questions

A list of some notable errors and questions from Kaldi users that I bookmarked. Note: You might need to join the Google Group to view them.