centre-for-humanities-computing / whisper-cpp-transcription-pipeline

Transcription pipeline for Whisper C++

whisper-cpp-transcription-pipeline

Containerized transcription pipeline for Whisper C++

Usage

The pipeline transcribes every .mp3 and .m4a file in a folder, including files in subdirectories. Each transcript is written as a text file in the same directory as its source audio, with a .transcription.txt extension.
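To preview which files would be picked up before running anything, a minimal sketch like the one below can help. The list_audio helper is hypothetical and not part of the pipeline; it assumes the extensions are matched in lowercase exactly as written above, which the pipeline itself may or may not do.

```shell
#!/bin/sh
# Hypothetical helper (not part of the pipeline): list the audio files under
# a directory, including subdirectories, that carry the two extensions named
# in this README. Matching is case-sensitive, which is an assumption.
list_audio() {
    find "$1" -type f \( -name '*.mp3' -o -name '*.m4a' \) | sort
}
```

Calling list_audio ./dat prints one matching path per line.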

To download the pipeline, pull the image from Docker Hub:

docker pull cicaharcos/whisper-transcriber

To transcribe the files, mount the target directory at /dat in the container. Running Docker in interactive mode (-i) also lets you follow the progress as it is logged.

The container stops once the task is complete.

docker run -v ./dat:/dat -i cicaharcos/whisper-transcriber
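Some Docker versions reject a relative host path such as ./dat in a bind mount, so it can be safer to resolve the directory to an absolute path first. The sketch below wraps the command above in that way and adds a small check for the output files. Both function names, transcribe_dir and count_transcripts, are hypothetical helpers rather than part of the pipeline.

```shell
#!/bin/sh
# Hypothetical wrapper: resolve the given directory to an absolute path and
# mount it at /dat, the mount point used in the command above.
transcribe_dir() {
    dir=$(cd -- "$1" && pwd) || return 1
    docker run -v "$dir:/dat" -i cicaharcos/whisper-transcriber
}

# Hypothetical check: count the transcripts found under a directory, relying
# only on the .transcription.txt extension described in this README.
count_transcripts() {
    find "$1" -type f -name '*.transcription.txt' | wc -l | tr -d ' '
}
```

For example, run transcribe_dir ./dat and, once the container has stopped, count_transcripts ./dat to see how many transcripts were written.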


License: MIT License


Languages

Shell 76.9%, Dockerfile 23.1%