asr-model

There are 3 repositories under asr-model topic.

sovaai / sova-asr
SOVA ASR (Automatic Speech Recognition)
asr asr-model stt speech-recognition speech-to-text speech wav2letter automatic-speech-recognition
Language:Python 167
at16k / at16k
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
speech-recognition speech-to-text speech-api speech-recognizer speech-analysis voice-recognition voice-commands automatic-speech-recognition asr asr-model pretrained-models
Language:Python 130
ASR
vietai / ASR
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
asr asr-model wav2vec2 ctc-loss pretrained-weights end-to-end-speech-recognition
85
IS2AI / TurkicASR
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
asr-model deep-learning speech-recognition speech-synthesis speech-to-text turkic-languages
Language:Python 51
iamjanvijay / rnnt
An implementation of RNN-Transducer loss in TF-2.0.
transducer-loss rnnt ctc-loss asr-decoder asr-model
Language:Python 45
Hamtech-ai / wav2vec2-fa
fine-tune Wav2vec2. an ASR model released by Facebook
asr-model speech-to-text asr nlp huggingface transformer wav2vec2
Language:Jupyter Notebook 36
juan-csv / GPT3-text-summarization
Summarization, topic generation using GPT3
asr-model gpt-3 sentiment-analysis speech-to-text summarization topic-modeling
Language:Jupyter Notebook 31
oleges1 / quartznet-pytorch
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
quartznet quartznet-pytorch automatic-speech-recognition asr asr-model pytorch common-voice librispeech
Language:Jupyter Notebook 25
antouanbg / Bulgarian_Linguistic
Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding tools/apps.
bulgarian-dataset asr asr-model tts tts-engines nlp machine-translation bulgarian-models lematization stemmer
Language:Java 24
robmsmt / SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
speech-recognition speech-to-text speech asr asr-model asr-benchmark speech-analysis speech-api python speechrecognition
Language:Python 18
ccoreilly / deepspeech-catala
Deepspeech ASR Model for the Catalan Language
deepspeech catalan asr catalan-language asr-model
Language:Python 17
tifaniwarnita / indonesian-asr
Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js.
speech-recognition hidden-markov-model asr-model htk
Language:Lex 17
WOLOF-ASR-Wav2Vec2
kingabzpro / WOLOF-ASR-Wav2Vec2
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
asr-model wav2vec2 wolof africa audio-processing audio facebook transcription
Language:Jupyter Notebook 15
Kirili4ik / QuartzNet-ASR-pytorch
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.
asr asr-model beam-search ctc-loss pytorch pytorch-implementation quartznet quartznet-pytorch
Language:Jupyter Notebook 14
MegEngine / End-to-end-ASR-Transformer
An end to end ASR Transformer model training repo
asr-model automatic-speech-recognition megengine transfomer attention-mechanism
Language:Python 13
ccoreilly / catalan-speech-recognition-benchmark
A benchmark of speech recognition solutions for the Catalan language
speech-recognition asr asr-model catalan-language catala catalan speech-to-text vosk deepspeech
9
BudEcosystem / BarkingGPT
Audio to Audio (Whisper+ChatGPT+Bark)
bark chatgpt generative-model singing asr-model texttospeech
Language:JavaScript 8
KrishnaDN / LAS-Pytorch
Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch
speech-re speech-to-text listen-attend-and-spell seq2seq-model timit asr-model asr
Language:Python 7
fquirin / kaldi-adapt-lm
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
speech-recognition asr-model language-model g2p jsgf-grammars kaldi-asr ngram-models kenlm zamia
Language:Python 6
isadrtdinov / quartznet
QuartzNet implementation for Automatic Speech Recognition task
deep-learning asr-model pytorch ljspeech
Language:Python 4
LaurentVeyssier / Automatic-Speech-Recognizer
Build end-to-end Deep Neural Network to translate speech to text (ASR model)
keras asr-model speech-to-text deep-neural-networks rnn temporal-convolutional-network gru ctc-loss
Language:Jupyter Notebook 3
Nexdata-AI / 800-Hours-Sichuan-Dialect-Conversational-Speech-Data-by-Mobile-Phone
The dataset of Sichuan dialect conversational speech
speech-recognition asr asr-model audio automatic-speech-recognition dataset deep-learning human-machine-interaction machine-learning speech speech-processing speech-to-text voice-interaction voice-recognition wav
3
Nexdata-AI / Conversational_Speech_Dataset
Mega Conversational Speech Datasets for Speech Recognition
asr asr-model audio automatic-speech-recognition conversational-ai dataset deep-learning speech speech-processing speech-recognition speech-synthesis speech-to-text tts voice-recognition
3
SzLeaves / asr-webapp
ASR Web APP 中文语音识别实验室APP，使用Django构建，包含中文语音转文字与中文语音聊天机器人模块
asr asr-model chatbot chatgpt-api django django-channels paddlepaddle tensorflow2 speech-recognition speech-to-text tensorflow
Language:Python 3
blademoon / Whisper_Train
Ноутбук для тонкой настройки Whisper на наборе данных Mozilla Сommon Voice.
asr asr-model huggingface-transformers whisper
Language:Jupyter Notebook 2
hammaad2002 / SimpleASRmodel
A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.
asr asr-model librispeech pytorch pytorch-implementation pytorch-tutorial speech-recognition supervised-learning timit timit-dataset crdnn
Language:Jupyter Notebook 2
LuluW8071 / Automatic-Speech-Recognition-with-PyTorch
End-to-End Automatic Speech Recognition on PyTorch with CTC Decoder and Ken LM
asr-model cnn-lstm-models ctc-decode cuda-support kenlm pytorch pytorch-lightning deep-neural-networks python
Language:Python 2
Nexdata-AI / 1000-Hours-American-English-Conversational-Speech-Data-by-Mobile-Phone
American English Conversational Speech Dataset
asr asr-model audio dataset deep-learning machine-learning speech speech-recognition speech-to-text
2
Nexdata-AI / 200-People-Chinese-Wake-up-Words-Speech-Data-by-Mobile-Phone
Chinese Wake-up Words Speech Dataset
asr asr-model audio dataset deep-learning speech speech-recognition speech-to-text
2
Nexdata-AI / 240-Hours-Hindi-Speech-Data-by-Mobile-Phone_Reading
Hindi Speech Dataset
asr asr-model audio dataset deep-learning speech speech-recognition speech-synthesis speech-to-text tts
2
Nexdata-AI / 292-Hours-Thai-Speech-Data-by-Mobile-Phone_Reading
Thai Speech Dataset
asr asr-model audio dataset deep-learning speech speech-recognition speech-to-text
2
Nexdata-AI / 300-Hours-Mixed-Speech-with-Korean-and-English-Data-by-Mobile-Phone
Mixed Speech with Korean and English Dataset
asr asr-model audio code-switching dataset deep-learning speech speech-recognition speech-to-text
2
Nexdata-AI / 359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading
Indonesian Speech Dataset
asr asr-model audio dataset deep-learning speech speech-recognition speech-to-text
2
Nexdata-AI / 500-Hours-Henan-Dialect-Conversational-Speech-Data-by-Mobile-Phone
The dataset of Henan Dialect conversational speech
asr asr-model audio automatic-speech-recognition dataset deep-learning speech-processing speech-recognition speech-to-text tts wav
2
Shuyib / zindi_mcv_swahilli
How I used Seamless m4t large to get to the top 5 of the mozilla common voice competition hosted on Zindi
asr-model hackathon mozilla-common-voice seamlessm4t stt swahili voice-recognition zindi-hackathon
Language:Python 1
yandex-cloud-examples / yc-speechkit-async-recognizer
SpeechKit Asynchronous Batch Recognizer.
asr-model python3 speech-recognition speechkit yandex-cloud yandex-speechkit-api yandexcloud
Language:Python 1

asr-model

sovaai / sova-asr

at16k / at16k

vietai / ASR

IS2AI / TurkicASR

iamjanvijay / rnnt

Hamtech-ai / wav2vec2-fa

juan-csv / GPT3-text-summarization

oleges1 / quartznet-pytorch

antouanbg / Bulgarian_Linguistic

robmsmt / SpeechLoop

ccoreilly / deepspeech-catala

tifaniwarnita / indonesian-asr

kingabzpro / WOLOF-ASR-Wav2Vec2

Kirili4ik / QuartzNet-ASR-pytorch

MegEngine / End-to-end-ASR-Transformer

ccoreilly / catalan-speech-recognition-benchmark

BudEcosystem / BarkingGPT

KrishnaDN / LAS-Pytorch

fquirin / kaldi-adapt-lm

isadrtdinov / quartznet

LaurentVeyssier / Automatic-Speech-Recognizer

Nexdata-AI / 800-Hours-Sichuan-Dialect-Conversational-Speech-Data-by-Mobile-Phone

Nexdata-AI / Conversational_Speech_Dataset

SzLeaves / asr-webapp

blademoon / Whisper_Train

hammaad2002 / SimpleASRmodel

LuluW8071 / Automatic-Speech-Recognition-with-PyTorch

Nexdata-AI / 1000-Hours-American-English-Conversational-Speech-Data-by-Mobile-Phone

Nexdata-AI / 200-People-Chinese-Wake-up-Words-Speech-Data-by-Mobile-Phone

Nexdata-AI / 240-Hours-Hindi-Speech-Data-by-Mobile-Phone_Reading

Nexdata-AI / 292-Hours-Thai-Speech-Data-by-Mobile-Phone_Reading

Nexdata-AI / 300-Hours-Mixed-Speech-with-Korean-and-English-Data-by-Mobile-Phone

Nexdata-AI / 359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading

Nexdata-AI / 500-Hours-Henan-Dialect-Conversational-Speech-Data-by-Mobile-Phone

Shuyib / zindi_mcv_swahilli

yandex-cloud-examples / yc-speechkit-async-recognizer