Tanel Alumäe's repositories
kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
kaldi-offline-transcriber
Offline transcription system for Estonian using Kaldi
gst-kaldi-nnet2-online
GStreamer plugin around Kaldi's online neural network decoder
online_speaker_change_detector
Online streaming speaker change detection model in Pytorch
sv_score_calibration
Score calibration for speaker verification
voxlingua107_sb
VoxLingua107 recipe for SpeechBrain
et-g2p-fst
FST-based rule-based grapheme-to-phoneme (and vice versa) converter for Estonian
voxceleb_weakly_supervised_segments
Speaker segmentations for the Voxceleb 1 and 2 datasets, generated using weakly supervised training
kaldi-estonian-recipe
Recipe to train a general-purpose Estonian ASR system with all the bells and whistles (eventually)
pl-whisper-finetuner
Whisper finetuning with Pytorch Lightning
tts_preprocess_et
Preprocessing for Estonian text-to-speech
acl-anthology
Data and software for building the ACL Anthology.
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
faster-whisper
Faster Whisper transcription with CTranslate2
k6net6lke-benchmark
Eesti kõnetõlkeprojekti test-andmed ja skriptid
libcaption
Free open-source CEA608 / CEA708 closed-caption encoder/decoder
nodalida2023_gen_proc
Generator of NoDaLiDa 2023 proceedings
speechbrain
A PyTorch-based Speech Toolkit
streamlit-asr-leaderboard
Streamlit-based leaderboard for my NLP course home asssignment
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.