xenova / whisper-web

ML-powered speech recognition directly in your browser

Home Page: https://hf.co/spaces/Xenova/whisper-web

Chrome Extension Empty Output

pythonlearner1025 opened this issue · comments

Versions:

  • "@xenova/transformers": "^2.1.0"
  • Manifest v3.0

Issue
The final result of the transcription is always an empty text.

Steps to Replicate

  • Clone this repository
  • Load it as a Chrome extension
  • Enable the microphone by clicking "Record"
  • Activate the extension popup (click the icon, or press "Ctrl+B" on Windows or "Cmd+B" on Mac)
  • Open the DevTools inspector by right-clicking the extension popup and selecting "Inspect"
  • Hold down "v" to record; release "v" to stop recording and begin transcribing
  • Observe the console output
  • The completed output (the return value of "transcriber", line 105 of background.js) is always empty
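
One way to narrow this down is to check the recorded samples just before they are handed to the transcriber, since an empty transcription can come from the input as well as from the model. The sketch below is only a diagnostic aid under stated assumptions: `describeAudio` is a hypothetical helper (not part of whisper-web or transformers.js), and it assumes the audio is expected as a mono `Float32Array` sampled at 16 kHz. The issue itself does not confirm that the input is the cause.

```javascript
// Hypothetical helper (not part of whisper-web or transformers.js):
// summarizes an audio buffer so an empty transcription can be traced
// back to the input rather than the model.
// Assumption: samples are mono floats at `sampleRate` Hz (16000 by default).
function describeAudio(audio, sampleRate = 16000) {
  const isFloat32 = audio instanceof Float32Array;
  // Anything that is not a Float32Array is reported as having no usable samples.
  const length = isFloat32 ? audio.length : 0;

  // Find the largest absolute sample value; 0 means the buffer is pure silence.
  let peak = 0;
  for (let i = 0; i < length; i++) {
    const v = Math.abs(audio[i]);
    if (v > peak) peak = v;
  }

  return {
    isFloat32,
    length,
    seconds: length / sampleRate,
    peak,
    silent: peak === 0,
  };
}

// Example: one second of a 440 Hz tone versus one second of silence.
const tone = Float32Array.from(
  { length: 16000 },
  (_, i) => Math.sin((2 * Math.PI * 440 * i) / 16000)
);
console.log(describeAudio(tone));
console.log(describeAudio(new Float32Array(16000)));
```

If the summary logged in background.js reports `isFloat32: false`, a length of zero, or `silent: true`, the recording or the hand-off to the extension worker is worth inspecting before the model call.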

Notes:

  • I had to downgrade to @xenova/transformers@2.1.0 or below, otherwise I get the error:
    Uncaught (in promise) Error: Unsupported model type: whisper
        at Function.from_pretrained (background.bundle.js:2:640741)
        async background.bundle.js:2:689608
  • I am mostly copy-pasting code from this repo's worker.js into my repo's background.js (extension worker).

Screenshots:
Screenshot 2023-09-09 at 11 25 15 PM

Hi there. The "unsupported" issue should be fixed in v2.6.1 (it was an issue related to minification). Could you try again with that version?