fastapi huggingface speech-recognition speech-to-text stt vad voice-activity-detection wav2vec2

Hugging Face Wav2Vec2 Demonstration API

Install poetry - a python dependency manager
poetry install
poetry run python main.py
Pop open your browser to http://localhost:8001/ for the docs.

Resource Warning - this will use all your CPU cores! This is the price you pay for not needing a GPU.

Youtube Transcribe Example

Used the following video: https://www.youtube.com/watch?v=dZ7GiP4vPts See output transcript under readme_assets/example_youtube_output.json

URL Transcribe

If you have some wavs locally spin up a local file server with python -m http.server 8081 and supply a local url to the transcribe_url endpoint like so:

File Transcribe

Just upload the file to the transcribe_file endpoint!

Enhancements / "Hey, wanna do a pull request?"

About

Demonstration of Hugging Face's (https://huggingface.co/) newly released Wav2Vec2 model for easy, reasonably coherent, Speech to Text!

fastapi huggingface speech-recognition speech-to-text stt vad voice-activity-detection wav2vec2

BSD 3-Clause "New" or "Revised" License

Languages

Language:Python 100.0%