Aditya Yadavalli's starred repositories
private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
speechbrain
A PyTorch-based Speech Toolkit
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
torchdistill
A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc. are implemented so far. 🎁 Trained models, training logs and configurations are available to ensure reproducibility and support benchmarking.
forced-alignment-tools
A collection of links and notes on forced alignment tools
azure-storage-azcopy
The new Azure Storage data transfer utility - AzCopy v10
huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
language_tool_python
A free Python grammar checker 📝✅
ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
kaldi-dnn-ali-gop
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.
number-parser
Parse numbers written in natural language
flutter_pytorch_mobile
A Flutter plugin for PyTorch model inference. Supports image models as well as custom models.
awesome-asr-contextualization
A curated list of awesome papers on contextualizing E2E ASR outputs
indic-wx-converter
Python library for converting UTF to WX and vice-versa for Indian languages.
rttm-viewer
Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way
Kaldi-notes
Resources helpful for Kaldi
kaldi-helpers
Helper scripts to work with Kaldi
mucs_2021_dialpad
Dialpad team's submission to the MUCS 2021 workshop
SBCSAE-preprocess
Preprocessing and downloading scripts for the Santa Barbara Corpus of Spoken American English (SBCSAE).