There are 5 repositories under the dictation topic.
A nearly-live implementation of OpenAI's Whisper.
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
ASR with PyTorch
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
GNOME Shell extension for accurate OFFLINE speech-to-text input in Linux using whisper.cpp. Input text from speech anywhere.
Use Alfred to Control Superwhisper - AI Powered Voice to Text
Desktop application for Linux and Windows that uses distil-whisper models from Hugging Face to enable real-time offline speech-to-text dictation.
Educational player with phrasal playback and parallel multi-language subtitles. Online subtitles/captions editor.
Creation of an online Greek mail dictation system, using Sphinx and personalized acoustic/language model training
Integrate Talon voice dictation commands with TTS, screen readers, braille, and more!
An independent voice interface for Inflection AI's conversational assistant, Pi
Python helper for Google and IBM Watson speech-to-text cloud APIs.
This SDK allows web-based apps/pages to interact with dictation devices
Dictate with Just Press Record and transcribe with Whisper AI using Keyboard Maestro
Chrome extension that allows dictating anywhere using OpenAI Whisper
💬 The missing frontend for dictation in OS X. Switch the dictation language on the fly when the input method language changes, just like the iOS keyboard.
A simple Nuance HTTP client for Node.js
Allows dictating anywhere in Windows using AutoHotkey and OpenAI's Whisper speech-to-text engine.
A dictation plugin for gedit (the GNOME text editor).
Let your students train their listening comprehension and spelling skills
[MTAP] Official implementation: A mechanism for personalized Automatic Speech Recognition for less frequently spoken languages: the Greek case
Self-hosted dictation for speech-to-text anywhere on Linux, as an executable and a declarative build
Dictation interface using UI automation via a Chrome extension
Dictation app for the terminal and Neovim, using Whisper for transcription and ChatGPT for post-processing.
Speech Recognition for macOS that allows you to define words, phrases, or sentences to perform keyboard and mouse operations.
Natural Language Processing, Speech Dictation, API controllers