Heng-xiu / whisper-asr-webservice

OpenAI Whisper ASR Webservice API

Home Page: https://ahmetoner.github.io/whisper-asr-webservice

Whisper ASR Webservice

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitask model that can perform multilingual speech recognition as well as speech translation and language identification. For more details: github.com/openai/whisper

Features

The current release (v1.3.0) supports the following Whisper models:

Quick Usage

CPU

docker run -d -p 9000:9000 -e ASR_MODEL=base -e ASR_ENGINE=openai_whisper onerahmet/openai-whisper-asr-webservice:latest

GPU

docker run -d --gpus all -p 9000:9000 -e ASR_MODEL=base -e ASR_ENGINE=openai_whisper onerahmet/openai-whisper-asr-webservice:latest-gpu
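The two commands above differ only in the `--gpus all` flag and the image tag. The sketch below assembles either variant from a few parameters, which can help when scripting deployments. The function name `build_docker_command` and the `host_port` parameter are hypothetical and not part of the project. The image name, tags, and environment variables are taken from the commands above.

```python
import shlex


def build_docker_command(model="base", engine="openai_whisper",
                         gpu=False, host_port=9000):
    """Return a `docker run` argument list mirroring the Quick Usage commands.

    Hypothetical helper: only the image name, tags, and environment
    variables come from the README; the rest is illustrative.
    """
    image = "onerahmet/openai-whisper-asr-webservice"
    # The GPU variant uses a different tag and needs the --gpus flag.
    tag = "latest-gpu" if gpu else "latest"

    args = ["docker", "run", "-d"]
    if gpu:
        args += ["--gpus", "all"]
    args += [
        "-p", f"{host_port}:9000",    # the container port is 9000 in both commands
        "-e", f"ASR_MODEL={model}",
        "-e", f"ASR_ENGINE={engine}",
        f"{image}:{tag}",
    ]
    return args


if __name__ == "__main__":
    # Print shell-ready command lines for the CPU and GPU variants.
    print(shlex.join(build_docker_command()))
    print(shlex.join(build_docker_command(gpu=True)))
```

With the defaults, the printed lines reproduce the CPU and GPU commands above. The returned list can also be passed directly to `subprocess.run` on a host where Docker is installed.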

For more information, see the Documentation section below.

Documentation

Explore the documentation at https://ahmetoner.github.io/whisper-asr-webservice

Credits

  • This software uses libraries from the FFmpeg project under the LGPLv2.1

About

License: MIT License


Languages

Python 86.8%, Dockerfile 13.2%