Tanel Alumäe's repositories

kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.

Language:PythonLicense:BSD-2-ClauseStargazers:1059Issues:68Issues:222

kaldi-offline-transcriber

Offline transcription system for Estonian using Kaldi

Language:PythonLicense:NOASSERTIONStargazers:228Issues:36Issues:28

gst-kaldi-nnet2-online

GStreamer plugin around Kaldi's online neural network decoder

Language:C++License:Apache-2.0Stargazers:185Issues:26Issues:75

online_speaker_change_detector

Online streaming speaker change detection model in Pytorch

Language:Jupyter NotebookLicense:MITStargazers:34Issues:3Issues:5

sv_score_calibration

Score calibration for speaker verification

Language:PythonLicense:Apache-2.0Stargazers:22Issues:5Issues:0
Language:PythonLicense:MITStargazers:17Issues:6Issues:0

et-g2p

Grapheme to phoneme converter for Estonian

voxlingua107_sb

VoxLingua107 recipe for SpeechBrain

Language:PythonStargazers:11Issues:3Issues:0

et-g2p-fst

FST-based rule-based grapheme-to-phoneme (and vice versa) converter for Estonian

voxceleb_weakly_supervised_segments

Speaker segmentations for the Voxceleb 1 and 2 datasets, generated using weakly supervised training

kaldi-estonian-recipe

Recipe to train a general-purpose Estonian ASR system with all the bells and whistles (eventually)

Language:ShellLicense:Apache-2.0Stargazers:4Issues:3Issues:0

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:C++License:Apache-2.0Stargazers:2Issues:2Issues:0

pl-whisper-finetuner

Whisper finetuning with Pytorch Lightning

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

tts_preprocess_et

Preprocessing for Estonian text-to-speech

Language:PythonStargazers:1Issues:1Issues:0

acl-anthology

Data and software for building the ACL Anthology.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language:PythonLicense:BSD-2-ClauseStargazers:0Issues:1Issues:0

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

k6net6lke-benchmark

Eesti kõnetõlkeprojekti test-andmed ja skriptid

Language:MathematicaStargazers:0Issues:1Issues:0

libcaption

Free open-source CEA608 / CEA708 closed-caption encoder/decoder

Language:CLicense:MITStargazers:0Issues:1Issues:0

nodalida2023_gen_proc

Generator of NoDaLiDa 2023 proceedings

Language:TeXStargazers:0Issues:1Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

streamlit-asr-leaderboard

Streamlit-based leaderboard for my NLP course home asssignment

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:2Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0