Vishay Raina (widdiot)

widdiot

Geek Repo

Company:gnani.ai

Location:Bengaluru

Github PK Tool:Github PK Tool

Vishay Raina's repositories

Bag-of-Visual-Words

This has he BoVW model to classify the images of same object together among: airplanes, bikes, cars, faces.

Language:Jupyter NotebookStargazers:1Issues:0Issues:0
Stargazers:0Issues:0Issues:0

arabic_pronounce

Pronounce Arabic words

Language:PythonStargazers:0Issues:0Issues:0

asr_labs

ASR labs

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Best-README-Template

An awesome README template to jumpstart your projects!

License:MITStargazers:0Issues:0Issues:0

camel_tools

A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ctcdecode

PyTorch CTC Decoder bindings

Language:C++License:MITStargazers:0Issues:0Issues:0

da-lang-id

Domain Adaptation for Spoken Language ID

Language:PythonStargazers:0Issues:0Issues:0

demo

example code for remind myself, especial the api

Language:PythonStargazers:0Issues:0Issues:0

Digit-Recognition

A CNN LeNet model to classify images of digits as 0 - 9.

Language:PythonStargazers:0Issues:0Issues:0

E2E-ASR

PyTorch Implementations for End-to-End Automatic Speech Recognition

Language:PythonStargazers:0Issues:0Issues:0

EEND

End-to-End Neural Diarization

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

kaldi

This is the official location of the Kaldi project.

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

marytts-lexicon-de

German lexicon for MaryTTS

License:NOASSERTIONStargazers:0Issues:0Issues:0

neural_sp

End-to-end ASR/LM implementation with PyTorch

License:Apache-2.0Stargazers:0Issues:0Issues:0

pika

a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

speech-training-recorder

Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

spoteno

Spoken text normalization for asr

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

triplet-entropy-loss

Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Systems

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

Tuplemax-Loss

Unofficial implementation of pairwise tuplemax loss. TUPLEMAX LOSS FOR LANGUAGE IDENTIFICATION https://arxiv.org/pdf/1811.12290.pdf Eq. (2). works only for batch_size = 1

Language:PythonStargazers:0Issues:0Issues:0

UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

License:Apache-2.0Stargazers:0Issues:0Issues:0

VGG-Speaker-Recognition

Utterance-level Aggregation For Speaker Recognition In The Wild

Language:PythonStargazers:0Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

youtube-dl

Command-line program to download videos from YouTube.com and other video sites

License:UnlicenseStargazers:0Issues:0Issues:0