mikeesto / subber

A small CLI tool for converting video & audio to a text transcription

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Subber

This is a Golang CLI tool for generating transcripts of video and audio files. It wraps FFmpeg and Whisper.

Usage

go run *.go [--format] <file>

Options

  • --format: Optional flag to trigger formatting the transcript into paragraphs using basic topic analysis with TF-IDF

Example

go run *.go --format video.mp4

This command will:

  1. Convert video.mp4 to a WAV
  2. Transcribe the audio using Whisper
  3. Format the transcript into paragraphs using topic analysis with TF-IDF
  4. Save the formatted transcript to formatted_transcript.txt

Some very rough benchmarks.

Installation

I'm still working on the best way to distribute this. For now, you can clone the repository, ensure you have the required dependencies installed, and update the WhisperPath and ModelPath constants in the code to match your system.

To build the CLI:

go build -o subber

Then move the binary to somewhere on your PATH (e.g. /usr/local/bin).

About

A small CLI tool for converting video & audio to a text transcription


Languages

Language:Go 100.0%