There are 4 repositories under dictation topic.
A nearly-live implementation of OpenAI's Whisper.
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
ASR with PyTorch
Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, with images, voice control, in under 4 GiB of VRAM.
A bash script that uses the OpenAI Whisper API to transcribe continuous spoken audio into text
Gnome shell extension for accurate speech to text input in Linux using whisper.cpp. Input text from speech anywhere.
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
Creation of an online Greek mail dictation system, using Sphinx and personalized acoustic/language model training
Python helper for Google and IBM Watson speech-to-text cloud APIs.
Integrate Talon voice dictation commands with TTS, screen readers, braille, and more!
:speech_balloon: The missing frontend for dictation in OS X. Switch Dictation Language on the fly when Input Method Language changes. Just like iOS keyboard.
This SDK allows web-based apps/pages to interact with dictation devices
A simple Nuance HTTP Client for NodeJS
[MTAP] Official implementation: A mechanism for personalized Automatic Speech Recognition for less frequently spoken languages: the Greek case
An independent voice interface for Inflection AI's conversational assistant, Pi
Allows dictating anywhere in Windows using AutoHotKey and OpenAI's Whisper speech-to-text engine.
A dictation plugin for gedit (the GNOME text editor).
dictation interface using UI automation via a chrome extension
Let your students train their listening comprehension and spelling skills
Speech Recognition for macOS that allows you to define words, phrases, or sentences to perform keyboard and mouse operations.
Dictation app for the terminal and Neovim, using Whisper for transcription and ChatGPT for post-processing.
Chrome extension that allows dictating anywhere using OpenAI Whisper
Natural Language Processing, Speech Dictation, API controllers
Uses Whisper AI to transcribe and process audio files, with the output being useful to psychotherapists.
This program will allow you to dictate into your linux computer and have text inputed into the window of focus!