dnanhkhoa / whisper-ui

A GUI interface for Open AI Whisper based on Tauri and Sveltekit

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Whiskey

Whiskey is a GUI for Open AI's Whisper built using Tauri and Sveltekit. Using cpp binaries for Whisper from whisper.cpp

Screenshot of Whiskey

Features

  • Transcribe audio or video files into written text
  • Real-time text highlighting during playback
  • Export transcriptions as .txt or .vtt files

Planned features:

  • Export files
  • Rename files
  • Save already opened files
  • Upload more than wav files
  • Upload video
  • Drag and drop
  • Start audio playback from line
  • Record mic audio directly
  • Apple Silicon, Linux, and Windows binaries
  • Editable text
  • Event and errors show in UI
  • Prediction accuracy

About

A GUI interface for Open AI Whisper based on Tauri and Sveltekit


Languages

Language:Svelte 54.5%Language:TypeScript 33.1%Language:JavaScript 6.0%Language:CSS 2.8%Language:Rust 2.8%Language:HTML 0.8%