There are 18 repositories under kaldi topic.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Command line utility for forced alignment using Kaldi
the open-source virtual assistant for Ubuntu based Linux distributions
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Tools for handling speech data in machine learning projects.
Offline speech recognition for Android with Vosk library.
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Open tools and data for cloudless automatic speech recognition
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
End-to-End Neural Diarization
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Проект для распознавания речи на русском языке на основе pykaldi.
Dockerfile for kaldi-gstreamer-server.
A pure python module for reading and writing kaldi ark files
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
ASR with PyTorch