Aditya Yadavalli (AdityaYadavalli1)

AdityaYadavalli1

Geek Repo

Company:Karya Inc

Location:Remote

Home Page:https://adityayadavalli1.github.io

Twitter:@AdityaYadavall2

Github PK Tool:Github PK Tool

Aditya Yadavalli's starred repositories

private-gpt

Interact with your documents using the power of GPT, 100% privately, no data leaks

Language:PythonLicense:Apache-2.0Stargazers:52930Issues:453Issues:1126

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

localGPT

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.

Language:PythonLicense:Apache-2.0Stargazers:19602Issues:164Issues:529

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8235Issues:128Issues:1044

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7406Issues:119Issues:1469

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Language:PythonLicense:AGPL-3.0Stargazers:1689Issues:26Issues:133

gentle

gentle forced aligner

Language:PythonLicense:MITStargazers:1409Issues:45Issues:234

torchdistill

A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.

Language:PythonLicense:MITStargazers:1319Issues:19Issues:46

forced-alignment-tools

A collection of links and notes on forced alignment tools

Language:PythonLicense:NOASSERTIONStargazers:856Issues:38Issues:6

azure-storage-azcopy

The new Azure Storage data transfer utility - AzCopy v10

huggingsound

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

Language:PythonLicense:MITStargazers:427Issues:14Issues:47

language_tool_python

a free python grammar checker 📝✅

Language:PythonLicense:GPL-3.0Stargazers:415Issues:10Issues:73

simalign

Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)

Language:PythonLicense:MITStargazers:345Issues:10Issues:33

ctc-segmentation

Segment an audio file and obtain utterance alignments. (Python package)

Language:PythonLicense:Apache-2.0Stargazers:309Issues:13Issues:28

textgrid

A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat

Language:PythonLicense:MITStargazers:274Issues:16Issues:16

kaldiio

A pure python module for reading and writing kaldi ark files

Language:PythonLicense:NOASSERTIONStargazers:247Issues:12Issues:16

kaldi-dnn-ali-gop

Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.

Language:C++License:NOASSERTIONStargazers:218Issues:16Issues:0

rVADfast

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Language:PythonLicense:MITStargazers:122Issues:7Issues:2

realbook

Easier audio-based machine learning with TensorFlow.

Language:PythonLicense:Apache-2.0Stargazers:112Issues:10Issues:1

minicons

Utility for behavioral and representational analyses of Language Models

Language:PythonLicense:MITStargazers:109Issues:6Issues:15

number-parser

Parse numbers written in natural language

Language:PythonLicense:BSD-3-ClauseStargazers:104Issues:9Issues:39

flutter_pytorch_mobile

A flutter plugin for pytorch model inference. Supports image models as well as custom models.

Language:JavaLicense:NOASSERTIONStargazers:95Issues:3Issues:25

awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

spelling

This is a neural spell checker

indic-wx-converter

Python library for converting UTF to WX and vice-versa for Indian languages.

Language:PythonLicense:MITStargazers:48Issues:3Issues:6

rttm-viewer

Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way

Language:PythonLicense:MITStargazers:35Issues:3Issues:1

Kaldi-notes

Resources helpful for Kaldi

kaldi-helpers

Helper scripts to work with Kaldi

Language:PythonLicense:MITStargazers:6Issues:5Issues:3

mucs_2021_dialpad

Dialpad team's submission to the MUCS 2021 workshop

Language:PythonLicense:NOASSERTIONStargazers:5Issues:6Issues:0

SBCSAE-preprocess

Preprocessing and downloading scripts for the Santa Barbara Corpus of Spoken American English (SBCSAE).

Language:PythonLicense:MITStargazers:3Issues:1Issues:1