AdityaYadavalli1

Aditya Yadavalli's starred repositories

private-gpt

Interact with your documents using the power of GPT, 100% privately, no data leaks

Language:PythonApache-2.052930 453 1126

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

NOASSERTION25779 280 37

localGPT

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.

Language:PythonApache-2.019602 164 529

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonApache-2.08235 128 1044

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:Jupyter NotebookApache-2.07406 119 1469

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Language:PythonAGPL-3.01689 26 133

gentle

gentle forced aligner

Language:PythonMIT1409 45 234

A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.

Language:PythonMIT1319 19 46

forced-alignment-tools

A collection of links and notes on forced alignment tools

Language:PythonNOASSERTION856 38 6

azure-storage-azcopy

The new Azure Storage data transfer utility - AzCopy v10

Language:GoMIT593 49 1481

huggingsound

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

Language:PythonMIT427 14 47

language_tool_python

a free python grammar checker 📝✅

Language:PythonGPL-3.0415 10 73

simalign

Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)

Language:PythonMIT345 10 33

ctc-segmentation

Segment an audio file and obtain utterance alignments. (Python package)

Language:PythonApache-2.0309 13 28

textgrid

A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat

Language:PythonMIT274 16 16

kaldiio

A pure python module for reading and writing kaldi ark files

Language:PythonNOASSERTION247 12 16

kaldi-dnn-ali-gop

Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.

Language:C++NOASSERTION218 160

rVADfast

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Language:PythonMIT122 7 2