neofob / whisper-realtime

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

A front-end, "real-time" implementation for Whisper

hi

Features:

  • Keyboard emulation to "type" your transcription into the currently active text box (can be disabled via settings).
  • Option to save audio (can be enabled via settings).
  • Local logging of prior transcriptions.

While running, hold F9 to record.

Release when you're done talking.

This is a work in progress

It might crash.

It might crash your audio device (though, I think I've fixed that).

How to run

Then either

  • git clone this repo

or

  • Click the green Code button on the top right, Download ZIP, and extract to wherever you want it.

Then

  • run.bat

todo

  • Wrangle pyinstaller to correctly include whisper for a pre-built version.
  • Allow custom hotkey for audio recording.
  • Test on other systems to make sure it's capturing the audio device correctly.
  • Clean up the jank.

About


Languages

Language:Python 94.9%Language:Batchfile 5.1%