Karel Vesely (KarelVesely84)



Company: Brno University of Technology

Location: Brno, Czech Republic


Karel Vesely's repositories

kaldi-io-for-python

Python functions for reading Kaldi data formats. Useful for rapid prototyping with Python.

Language: Python | License: Apache-2.0 | Stargazers: 375 | Issues: 12 | Issues: 34

kaldi

Karel's development fork of the official Kaldi repo.

Language: Shell | License: NOASSERTION | Stargazers: 2 | Issues: 0 | Issues: 0

atco2-corpus

A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

CQT_toolbox_python

Constant-Q Transform Toolbox for Python/MATLAB

Language: M | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

cylimiter

A C++/Cython audio limiter for Python.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

fixwav

Quick utility to fix WAV files with incorrect lengths

Stargazers: 0 | Issues: 0 | Issues: 0

GigaSpeech

Large, modern dataset for speech recognition

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

gpt4all

gpt4all: open-source LLM chatbots that you can run anywhere

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

grive2

Google Drive client with support for new Drive REST API and partial sync

Language: C++ | License: GPL-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

json

JSON for Modern C++

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

k2

FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

kaldi-native-fbank

Kaldi-compatible online fbank extractor without external dependencies

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

kaldi_native_io

Python wrapper for Kaldi's native I/O

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

kaldilm

Python wrapper for Kaldi's arpa2fst

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

lhotse

Tools for handling speech data in machine learning projects.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

libriheavy

Libriheavy: a 50,000-hour ASR corpus with punctuation, casing, and context

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

mitlm

MIT Language Modeling Toolkit

License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

personalVAD

An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Phonetisaurus

Phonetisaurus G2P

License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

pocolm

Small language toolkit for creation, interpolation and pruning of ARPA language models

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

sherpa

Speech-to-text server framework with next-gen Kaldi

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

soundslike_icefall

Icefall recipe for the SoundsLike project under JSALT 2023 (voxpopuli recipe)

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

VBx

Variational Bayes HMM over x-vectors diarization on DIHARD II

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

vocode-python

🤖 Build voice-based LLM agents. Modular + open source.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

wikiextractor

A tool for extracting plain text from Wikipedia dumps

License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0