dictation

There are 4 repositories under dictation topic.

collabora / WhisperLive
A nearly-live implementation of OpenAI's Whisper.
dictation obs openai text-to-speech translation voice-recognition whisper tensorrt tensorrt-llm whisper-tensorrt
Language:Python 2220
savbell / whisper-writer
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
dictation faster-whisper openai openai-api openai-whisper speech-recognition speech-to-text typing-assistant whisper
Language:Python 385
daanzu / kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
coding command-and-control dictation grammars kaldi kaldi-asr kaldi-grammar python speech-recognition speech-to-text voice voice-coding voice-commands voice-control
Language:Python 340
Nikorasu / LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
ai assistant chatbot dictation numpy openai openai-whisper python sounddevice speech-recognition speech-to-text terminal text-to-speech transcription translation tts voice voice-assistant voice-recognition whisper
Language:Python 336
themanyone / whisper_dictation
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
assistant-chat-bots assistive-technology client client-server coding continuous dictation hands-free launcher server speech-recognition stable-diffusion stable-diffusion-webui voice-assistant voice-recognition whisper-api whisper-cpp star-trek voice-control ai
Language:Python 187
jinserk / pytorch-asr
ASR with PyTorch
speech pytorch ctc pyro kaldi speech-recognition lvcsr python resnet lattice decoder transcription dictation pytorch-binding kaldi-decoder asr deepspeech densenet capsule-network ss-vae
Language:Python 140
yohasebe / whisper-stream
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
command-line dictation openai transcription voice-to-text whisper
Language:Shell 94
QuantiusBenignus / blurt
Gnome shell extension for accurate speech to text input in Linux using whisper.cpp. Input text from speech anywhere.
gnome-shell-extension input-method speech-to-text whisper-cpp gnome machine-learning speech-recognition whisper ai bloat-free dictate dictation kiss input asr linux gnome-extension
Language:JavaScript 57
Mohamad-Hussein / speech-assistant
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
dictation distil-whisper huggingface offline openai-whisper speech-to-text whisper whisper-ai desktop-app speech transcription translation
Language:Python 53
dhruvyad / uttertype
Short code for dictation using OpenAI Whisper for transcription.
dictation openai openai-whisper speech-recognition speech-to-text transcription
Language:Python 42
frazy
ApayRus / frazy
Educational player with phrasal playback and parallel multi-language subtitles. Online subtitles/captions editor.
dictation audition languages translations firebase material-ui constructor subtitles audio parallel-texts subtitles-editor
Language:JavaScript 19
eellak / gsoc2019-sphinx
Creation of an online Greek mail dictation system, using Sphinx and personalized acoustic/language model training
sphinx4-speech dictation gsoc-2019 angular8 flask-application speech-recognition adaptation mongodb
Language:Python 19
C-Loftus / sight-free-talon
Integrate Talon voice dictation commands with TTS, screen readers, braille, and more!
talonvoice accessibility nvda eyestrain screenreader voice-dictation hci human-computer-interaction blind dictation
Language:Python 17
saypi-userscript
Pedal-Intelligence / saypi-userscript
An independent voice interface for Inflection AI's conversational assistant, Pi
ai chat chatbot dictation heypi inflection-ai llm openai pi speech-recognition speech-to-text transcription voice voice-recognition whisper
Language:TypeScript 17
pluteski / speech-to-text
Python helper for Google and IBM Watson speech-to-text cloud APIs.
transcription ibm-watson-speech speech-to-text google-speech dictation python watson-speech-sdk watson-speech
Language:Python 15
mrob95 / mathfly
A complete system for dictating mathematics and LaTeX using Dragon
caster latex python dictation voice dragon dragonfly math mathematics
Language:Python 13
sloganking / desk-talk
A desktop transcription software
desktop dictation transcription whisper
Language:Rust 13
GoogleChromeLabs / dictation_support
This SDK allows web-based apps/pages to interact with dictation devices
dictation webhid webhid-api
Language:TypeScript 12
ntkme / Swift-Dictation
:speech_balloon: The missing frontend for dictation in OS X. Switch Dictation Language on the fly when Input Method Language changes. Just like iOS keyboard.
dictation osx
Language:Objective-C 12
redocrepus / Whisper-Paste
Chrome extension that allows dictating anywhere using OpenAI Whisper
chrome-extension dictation openai openai-api text-to-speech voice-recognition voice-typing whisper whisper-ai
Language:JavaScript 12
eladcn / nuance-nodejs
A simple Nuance HTTP Client for NodeJS
dictation nodejs nuance nuance-nodejs speech
Language:JavaScript 10
redocrepus / ahk-whisper-paste
Allows dictating anywhere in Windows using AutoHotKey and OpenAI's Whisper speech-to-text engine.
dictation openai openai-api text-to-speech voice-typing whisper whisper-ai windows
Language:Go 10
otacke / h5p-dictation
Let your students train their listening comprehension and spelling skills
h5p students teachers dictation authoring-tool
Language:JavaScript 8
PanosAntoniadis / personalized_asr
[MTAP] Official implementation: A mechanism for personalized Automatic Speech Recognition for less frequently spoken languages: the Greek case
automatic-speech-recognition clustering dictation personalization
Language:Python 8
theawless / Dict-O-nator
A dictation plugin for gedit (the GNOME text editor).
gedit-plugin gedit speech-recognition dictation
Language:Python 8
A-AhkUser / Dictation-Interface
dictation interface using UI automation via a chrome extension
autohotkey chrome-extension dictation speech-recognition uiautomation
Language:AutoHotkey 6
ognistik / alfred-superwhisper
Use Alfred to Control SuperWhisper - AI Powered Voice to Text
ai dictation llm productivity text-processing transcription
Language:JavaScript 6
DrDictaphone
olekli / DrDictaphone
Dictation app for the terminal and Neovim, using Whisper for transcription and ChatGPT for post-processing.
chatgpt dictate dictation terminal terminal-based transcription neovim neovim-plugin openai openai-chatgpt openai-whisper speech-to-text
Language:Python 6
jtara1 / dictation
self-hosted dictation for speech to text anywhere on linux as exec & declarative build
build dictation executable speech-to-text
Language:Nix 5
ognistik / km-ai-memos
Dictate with Just Press Record and transcribe with Whisper AI using Keyboard Maestro
ai creativity dictation memos note-taking productivity transcription
5
bobbymay / Dictation-for-macOS
Speech Recognition for macOS that allows you to define words, phrases, or sentences to perform keyboard and mouse operations.
apple dictation dictation-commands mac macos nsspeechrecognizer speech-recognition speech-to-text swift
Language:Swift 3
ricky0123 / vocoder
Source code dictation and voice control
developer-tools dictation speech-recognition voice-control
Language:Python 3
scottjoyner / Sophia
Natural Language Processing, Speech Dictation, API controllers
sophia dictation deepspeech python nlp machine-learning spotify-api
Language:Python 3
arunkv / dictation
Dictation game for kids to practice their spelling
dictation english english-learning google-cloud openai-api tts
Language:Python 2
voidism / LiTy
LiTy: Listen & Type - Efficient Dictation Training Tool for English Learners.
english-learning dictation toefl toefl-ibt
Language:Python 2
sbadulin / obsidian-dictation-plugin
Obsidian dictation plugin
dictation gpt-35-turbo obsidian obsidian-plugin openai speech-to-text whisper
Language:TypeScript 1

dictation

collabora / WhisperLive

savbell / whisper-writer

daanzu / kaldi-active-grammar

Nikorasu / LiveWhisper

themanyone / whisper_dictation

jinserk / pytorch-asr

yohasebe / whisper-stream

QuantiusBenignus / blurt

Mohamad-Hussein / speech-assistant

dhruvyad / uttertype

ApayRus / frazy

eellak / gsoc2019-sphinx

C-Loftus / sight-free-talon

Pedal-Intelligence / saypi-userscript

pluteski / speech-to-text

mrob95 / mathfly

sloganking / desk-talk

GoogleChromeLabs / dictation_support

ntkme / Swift-Dictation

redocrepus / Whisper-Paste

eladcn / nuance-nodejs

redocrepus / ahk-whisper-paste

otacke / h5p-dictation

PanosAntoniadis / personalized_asr

theawless / Dict-O-nator

A-AhkUser / Dictation-Interface

ognistik / alfred-superwhisper

olekli / DrDictaphone

jtara1 / dictation

ognistik / km-ai-memos

bobbymay / Dictation-for-macOS

ricky0123 / vocoder

scottjoyner / Sophia

arunkv / dictation

voidism / LiTy

sbadulin / obsidian-dictation-plugin