There are 4 repositories under the dictation topic.
A nearly-live implementation of OpenAI's Whisper.
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Private voice keyboard, AI chat, images, webcam, recordings, and voice control. Requires >= 4 GiB of VRAM.
ASR with PyTorch
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
GNOME Shell extension for accurate speech-to-text input in Linux using whisper.cpp. Input text from speech anywhere.
Desktop application for Linux and Windows that uses distil-whisper models from Hugging Face to enable real-time offline speech-to-text dictation.
Creation of an online Greek mail dictation system, using Sphinx and personalized acoustic/language model training
Integrate Talon voice dictation commands with TTS, screen readers, braille, and more!
An independent voice interface for Inflection AI's conversational assistant, Pi
Python helper for Google and IBM Watson speech-to-text cloud APIs.
This SDK allows web-based apps/pages to interact with dictation devices
💬 The missing frontend for dictation in OS X. Switch the dictation language on the fly when the input method language changes, just like the iOS keyboard.
Chrome extension that allows dictating anywhere using OpenAI Whisper
A simple Nuance HTTP client for Node.js
Allows dictating anywhere in Windows using AutoHotkey and OpenAI's Whisper speech-to-text engine.
Let your students train their listening comprehension and spelling skills
[MTAP] Official implementation: A mechanism for personalized Automatic Speech Recognition for less frequently spoken languages: the Greek case
A dictation plugin for gedit (the GNOME text editor).
Dictation interface using UI automation via a Chrome extension
Use Alfred to Control SuperWhisper - AI Powered Voice to Text
Dictation app for the terminal and Neovim, using Whisper for transcription and ChatGPT for post-processing.
Dictate with Just Press Record and transcribe with Whisper AI using Keyboard Maestro
Speech Recognition for macOS that allows you to define words, phrases, or sentences to perform keyboard and mouse operations.
Natural Language Processing, Speech Dictation, API controllers
Obsidian dictation plugin