speechrecognition

There are 2 repositories under the speechrecognition topic.

speechbrain / speechbrain
A PyTorch-based Speech Toolkit
speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition asr voice-recognition spoken-language-understanding speaker-diarization speaker-verification pytorch huggingface transformers language-model deep-learning
Language:Python 10430
revdotcom / reverb
Open source inference code for Rev's model
asr asr-model canary deeplearning diarization docker huggingface neural-network open-source opensource pyannote rev revai speaker-diarization speech-recognition speech-to-text speechrecognition wenet whisper
Language:Python 396
speechbrain / speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
beamforming deep-learning deeplearning librispeech neural-network neural-networks speaker-identification speaker-recognition speaker-verification speech speech-analysis speech-api speech-emotion-recognition speech-processing speech-recognition speech-recognizer speech-separation speech-to-text speechrecognition timit
Language:HTML 371
SamirPaulb / real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
final-year-project machine-learning ml real-time-transcription translates-audio voice-translator deep-translator googletranslator gtts playsound speaker-recognition speech-to-speech speech-to-text speechrecognition text-to-speech tkinter translation gui python linguasync
Language:Tcl 261
robmsmt / KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
keras deepspeech asr ctc coreml speechrecognition speech-to-text deep-learning machine-learning neural-networks baidu speech deeplearning neural-network nn
Language:Python 242
Azure-Samples / SpeechToText-WebSockets-Javascript
SDK & sample for speech recognition using WebSockets in JavaScript
microsoft speech speechtotext sdk javascript typescript ts js browser websocket cognitive-services speech-recognition websockets microsoft-speech-service recognition speechrecognition
Language:TypeScript 219
roshan9419 / PersonalAssistantChatbot
A personal assistant chatbot capable of performing many of the same tasks as Google Assistant, plus extra features...
chatbot assistatant speechrecognition tkinter pyttsx3 opencv
Language:Python 132
by2101 / OpenASR
A PyTorch-based end-to-end speech recognition system.
speech speech-recognition speech-to-text speechrecognition speech-recognizer transformer las end2end asr
Language:Python 113
shangeth / wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
deeplearning pytorch speech-recognition audio-processing speech-processing speechrecognition representation-learning unsupervised-learning semi-supervised-learning voice-recognition speaker-recognition hacktoberfest
Language:Python 89
Open-Speech-EkStep / vakyansh-wav2vec2-experimentation
Repository containing an experimentation platform for training and running inference on wav2vec2 models.
asr indic-languages indic-scripts open-source pytorch speech speech-recognition speech-recognition-model speechrecognition speechrecognition-python
Language:Python 86
goxr3plus / java-google-speech-api
🙊 Speech Recognition, Text To Speech, Google Translate
google-translate speechrecognition text-to-speech
Language:Java 81
solyarisoftware / WeBAD
Web Browser Audio Detection/Speech Recording Events API
audio audio-processing speechrecognition push-to-talk browser javascript volume volume-control microphone speech recording recording-button voice voice-recognition webrtc audi-capture voice-interface dom
Language:JavaScript 74
botbahlul / autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
google-translate-api python speech-recognition srt-subtitle voice-recognition captions ffmpeg subtitle speechrecognition voicerecognition auto-caption auto-subtitle subriptext
Language:Python 60
jindongwang / EasyEspnet
Making Espnet easier to use
asr easy-to-use espnet speech speech-recognition speechrecognition toolkit
Language:Python 54
syntithenai / opensnips
Open source projects related to Snips https://snips.ai/.
speech speechrecognition rasa kaldi docker dialog snips snowboy hotwords hark porcupine nlu snips-skills asr audio-server
Language:JavaScript 54
IS2AI / ISSAI_SAIDA_Kazakh_ASR
The first industrial-scale open-source Kazakh speech corpus. The KSC2 corpus subsumes two previously introduced corpora, KSC and KazakhTTS2, and supplements them with additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
speech-recognition speech-synthesis speech-to-text speechrecognition
Language:Shell 50
rollingstarky / Python-Voice-Assistant
A Python based Voice Assistant like Siri
python ai chatbot tts stt speechrecognition
Language:Python 42
AppleHolic / PytorchSR
Pytorch based phoneme recognition (TIMIT phoneme classification)
pytorch paper timit speechrecognition minimalgru cbhg
Language:Python 34
ng-web-apis / speech
A library for using Web Speech API with Angular
speech-recognition speech-to-text speech speech-synthesis speech-api speechrecognition text-to-speech angular
Language:TypeScript 32
botbahlul / pyvosklivesubtitle
PySimpleGUI-based DESKTOP APP that can RECOGNIZE any live stream in 23 languages supported by VOSK, then TRANSLATE it (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE
google-translate-api live-subtitle python speech-recognition voice-recognition vosk auto-caption live-caption caption pysimplegui speechrecognition subtitle voicerecognition ffmpeg
Language:Python 28
Kushal997-das / Pyautogui-module-using-audio
📌 This repo shows how we combined three modules (pyttsx3, speech_recognition, and colored) with the pyautogui module.
pyttsx3 colored pyautogui speechrecognition python3 git github project
Language:Python 28
srinivr / kaldi-long-audio-alignment
Long audio alignment using Kaldi
kaldi longaudio-alignment audio-segments asr automatic-speech-recognition split-audio speech-recognition speech-to-text speechrecognition transcription speech-transcription
Language:Shell 24
botbahlul / whisper_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
auto-caption auto-subtitle caption faster-whisper ffmpeg google-translate-api openai openai-whisper python speech-recognition speechrecognition subtitle voice-recognition voicerecognition whisper
Language:Python 23
G10DRAS / RoboCop
Artificially Intelligent Machine with Computer Vision, Natural Language Processing, AI, Sense and Feelings.
artificial-intelligence natural-language-processing speechrecognition voice-control
Language:Python 23
LinkonBSMRSTU / Speech-To-Text-App-iOS
A simple iOS App that can convert speech/voice into text. Only English voice is supported for now. Used Swift 5, AVKit and Speech.
swift5 avkit speech-to-text speech-recognition xcode11 speech microphone speech-analysis voice-to-text speechrecognition ios xcode ios-app
Language:Swift 23
franchesoni / s2t
🗣️ ⌨️ Speech-to-text on key for Linux
linux onkey openai speech speech-recognition speech-to-text speechrecognition utilities whisper
Language:Shell 21
untemps / react-vocal
React component and hook to initiate a SpeechRecognition session
speech-to-text speechrecognition speech web-speech-api react reactjs component hook javascript
Language:JavaScript 20
robmsmt / SpeechLoop
Many ASRs under one roof, with benchmarking, answering the question: what is the best ASR for my dataset?
speech-recognition speech-to-text speech asr asr-model asr-benchmark speech-analysis speech-api python speechrecognition
Language:Python 19
ShawnPi233 / EatecPlayerMaster
食课: a multi-function PyQt5 video player (data management, notes, subtitle recognition, video keyword generation)
python pyqt5 video-player srt-subtitles abstract sql jdbc opengauss speechrecognition
Language:Python 18
botbahlul / android-autosrt-v2
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files using 2 ACTIVITIES
android caption ffmpeg google-translate-api googletranslate java python speech-recognition speechrecognition subtitle voice-recognition voicerecognition chaquopy speech-to-text voice-to-text
Language:Java 17
azu / transcript-audio
Transcribe your audio files, such as podcasts, using SpeechRecognition and a virtual audio device.
audio transcript chrome speechrecognition blackhole
Language:TypeScript 16
botbahlul / android-autosrt
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files
chaquopy google-translate-api speech-recognition voice-recognition srt-subtitle captions ffmpeg speechrecognition subtitle voicerecognition mobile-ffmpeg android java python speech-to-text voice-to-text
Language:Java 16
Abhishek-op / SR
💡Kivy-android speech recognition
python3 kivy-android speechrecognition wrap api speech-recognition android python
Language:Python 15
scottykwok / cantonese-selfish-project
Cantonese Selfish Project 廣東話自肥企劃 at PYCON HK 2021
pycon pyconhk cantonese deepspeech wav2vec2 speechrecognition cantonese-speech-recognition
Language:Jupyter Notebook 15
Manasvi070902 / Meetify
Microsoft Engage program 2021
webrtc socket-io video-conferencing notes chat-application peerjs speechrecognition
Language:JavaScript 14
IlyaZaprutski / bluetooth-lamp
Demo project for bluetooth lamp
webbluetooth speechrecognition webaudioapi emotion-recognition
Language:JavaScript 13
