widdiot

followers

following

stars

gnani.ai

Bengaluru

Vishay Raina's repositories

Bag-of-Visual-Words

This has he BoVW model to classify the images of same object together among: airplanes, bikes, cars, faces.

Language:Jupyter Notebook100

Text-Binarization

Language:Python101

APLUS_track

000

arabic_pronounce

Pronounce Arabic words

Language:Python000

asr_labs

ASR labs

Language:Jupyter NotebookMIT000

Best-README-Template

An awesome README template to jumpstart your projects!

MIT000

camel_tools

A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

Language:PythonMIT000

ctcdecode

PyTorch CTC Decoder bindings

Language:C++MIT000

da-lang-id

Domain Adaptation for Spoken Language ID

Language:Python000

demo

example code for remind myself, especial the api

Language:Python000

Digit-Recognition

A CNN LeNet model to classify images of digits as 0 - 9.

Language:Python000

E2E-ASR

PyTorch Implementations for End-to-End Automatic Speech Recognition

Language:Python000

EEND

End-to-End Neural Diarization

Language:PythonMIT000

kaldi

This is the official location of the Kaldi project.

NOASSERTION000

kaldi-postproc

Language:Python010

marytts-lexicon-de

German lexicon for MaryTTS

NOASSERTION000

neural_sp

End-to-end ASR/LM implementation with PyTorch

Apache-2.0000

pika

a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi

Apache-2.0000

pychain_example

Language:Python000

pytorch-streamloader

000

speech-training-recorder

Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.

Language:PythonAGPL-3.0000

spoteno

Spoken text normalization for asr

Language:PythonMIT000

TIPR-assignment-1

Language:Jupyter Notebook000

TIPR_ASSIGNMENT_2

Language:Jupyter Notebook000

triplet-entropy-loss

Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Systems

Language:PythonGPL-3.0000

Tuplemax-Loss

Unofficial implementation of pairwise tuplemax loss. TUPLEMAX LOSS FOR LANGUAGE IDENTIFICATION https://arxiv.org/pdf/1811.12290.pdf Eq. (2). works only for batch_size = 1

Language:Python000

UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Apache-2.0000

VGG-Speaker-Recognition

Utterance-level Aggregation For Speaker Recognition In The Wild

Language:Python000

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonApache-2.0000

youtube-dl

Command-line program to download videos from YouTube.com and other video sites

Unlicense000