appvoid / vosper

Real-Time Whisper Voice Recognition with vosk model feedback.

Home Page:https://opencollective.com/vosper

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

vosper

πŸ•‘ Real-Time Whisper Voice Recognition with vosk model feedback πŸŽ™

🌏 News

New logo: Hopefully everyone likes it.
vosper 2.0: The codebase was rewritten and more customization was added!
Pip is coming: A proper, easier installation and update alternative will be launched soon.
vosper 2.1: The codebase is being refactorized for highly-optimized implementation.

πŸ“‘ Features

  • Minimal approach
  • Easy installation
  • Easy modification
  • Fast text feedback thanks to vosk
  • Detects human voices (It records audio for Whisper only when needed)

⭐ Installation

git clone https://github.com/appvoid/vosper.git && 
cd vosper && 
chmod +x install.sh &&
./install.sh

β–Ά Usage

python3 main.py # It's pretty minimal...

β˜• Donations and Support

paypal donation or patreon support

πŸ”­ Full Roadmap

  • Vosk Real-Time inference and Whisper VAD support
  • Class-Based implementation
  • Easier way to choose a whisper model
  • Improved code quality, comments, readability, etc...
  • Verbosity switch
  • Customizable settings
  • Custom VAD model support
  • Python's pip installation method
  • Keyboard support
  • Documentation

πŸ”΄ Disclaimer

Real-Time usage scenarios (like a voice assistant for example) requires a GPU with at least 2-4~ gb of vram. The more the vram, the largest the model you can load, the better the transcription and the slower it gets.

Star History Chart

About

Real-Time Whisper Voice Recognition with vosk model feedback.

https://opencollective.com/vosper

License:MIT License


Languages

Language:Python 94.7%Language:Shell 5.3%