qhduan / streaming-sensevoice

Pseudo Streaming SenseVoice with Hotwords

Repository from Github https://github.comqhduan/streaming-sensevoiceRepository from Github https://github.comqhduan/streaming-sensevoice

streaming-sensevoice

Streaming SenseVoice processes inference in chunks of SenseVoice.

Usage

  • transcribe wav file
$ python main.py

  • transcribe from microphone
$ python realtime.py
  • transcribe from websocket

A basic WebSocket service built with Recorder and FastAPI; the frontend uses MP3 format to transmit audio information to reduce latency and increase stability.

pip install -r requirements-ws-demo.txt
python realtime_ws_server_demo.py

# check cli options
python realtime_ws_server_demo.py --help

About

Pseudo Streaming SenseVoice with Hotwords

License:Apache License 2.0


Languages

Language:Python 72.9%Language:HTML 27.1%