Beast code in Giters

Karel Vesely's repositories

kaldi-io-for-python

Python functions for reading kaldi data formats. Useful for rapid prototyping with python.

Language:PythonApache-2.0375 12 34

kaldi

Karel's development fork of official kaldi repo.

Language:ShellNOASSERTION200

atco2-corpus

A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

Language:PythonMIT000

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonMIT000

CQT_toolbox_python

Constant-Q Transform Toolbox for Python/MATLAB

Language:MMIT000

cylimiter

A C++/Cython audio limiter for Python.

Language:C++Apache-2.0000

espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Language:PythonNOASSERTION000

fixwav

Quick utility to fix WAV files with incorrect lengths

000

GigaSpeech

Large, modern dataset for speech recognition

Apache-2.0000

gpt4all

gpt4all: open-source LLM chatbots that you can run anywhere

MIT000

grive2

Google Drive client with support for new Drive REST API and partial sync

Language:C++GPL-2.0010

icefall

NOASSERTION000

json

JSON for Modern C++

MIT000

k2

FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar

NOASSERTION000

kaldi-native-fbank

Kaldi-compatible online fbank extractor without external dependencies

Apache-2.0000

kaldi_native_io

python wrapper for kaldi's native I/O

Language:C++NOASSERTION000

kaldilm

Python wrapper for kaldi's arpa2fst

NOASSERTION000

lhotse

Tools for handling speech data in machine learning projects.

Language:PythonApache-2.0000

libriheavy

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Language:PythonApache-2.0000

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Apache-2.0000

mitlm

MIT Language Modeling Toolkit

BSD-3-Clause000

personalVAD

An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.

Language:PythonGPL-3.0000

Phonetisaurus

Phonetisaurus G2P

BSD-3-Clause000

pocolm

Small language toolkit for creation, interpolation and pruning of ARPA language models

NOASSERTION000

sherpa

Speech-to-text server framework with next-gen Kaldi

Language:PythonNOASSERTION000

soundslike_icefall

Icefall recipe for the SoundsLike project under JSALT 2023 (voxpopuli recipe)

Apache-2.0000

VBx

Variational Bayes HMM over x-vectors diarization on DIHARD II

Language:Python010

vocode-python

🤖 Build voice-based LLM agents. Modular + open source.

MIT000

w2v2-air-traffic

MIT000

wikiextractor

A tool for extracting plain text from Wikipedia dumps

AGPL-3.0000