timelery / Whisperboard

iOS app to record and transcribe speech to text with the help of the OpenAI Whisper model


Whisperboard

An iOS app for recording and transcribing audio on the go, based on OpenAI's Whisper model.



Twitter: @sa1k0s · Powered by Tuist

Features

  • Easy-to-use voice recording and playback
  • Transcription of recorded audio using Whisper from OpenAI
  • Import and export audio files
  • Select microphone for recording
  • Model selection screen with the ability to download any Whisper model

Future Plans

  • Skip silent portions of the audio before transcription to reduce processing time.
  • Implement resumable transcription so that users can continue transcribing after the app has been terminated during the transcription process.
  • Enable background transcription when the app is minimized, allowing users to perform other tasks while the transcription proceeds.
  • Improve visualization of the current state of the transcription process, providing users with a clearer understanding of its progress.
  • Provide an estimated time remaining for the transcription to complete, helping users plan accordingly.
  • Introduce a transcription queue that allows users to select multiple files for transcription, streamlining the process for bulk transcriptions.
  • Implement real-time transcription using smaller, more efficient models, offering users faster results.
  • Offer an optional remote transcription mode so that larger models can be used, potentially giving more accurate transcriptions or faster processing.
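
The silence-skipping idea in the first plan above could be sketched roughly as follows. This is not code from Whisperboard: the function name removeSilence, the 100 ms frame size, and the RMS threshold are all hypothetical choices for illustration, and the sketch assumes mono Float samples at 16 kHz (the sample rate Whisper models take as input).

```swift
import Foundation

/// Hypothetical helper (not part of Whisperboard): drops fixed-size frames whose
/// RMS energy falls below `threshold` and returns the remaining samples.
/// Defaults assume 16 kHz mono audio, so 1600 samples is one 100 ms frame.
func removeSilence(from samples: [Float],
                   frameSize: Int = 1600,
                   threshold: Float = 0.01) -> [Float] {
    precondition(frameSize > 0, "frameSize must be positive")
    var kept: [Float] = []
    kept.reserveCapacity(samples.count)

    var start = 0
    while start < samples.count {
        let end = min(start + frameSize, samples.count)
        let frame = samples[start..<end]

        // Root-mean-square energy of this frame.
        let sumOfSquares = frame.reduce(Float(0)) { $0 + $1 * $1 }
        let rms = (sumOfSquares / Float(frame.count)).squareRoot()

        // Keep the frame only if it is loud enough to plausibly contain speech.
        if rms >= threshold {
            kept.append(contentsOf: frame)
        }
        start = end
    }
    return kept
}

// Usage: one second of digital silence followed by one second of a 440 Hz tone.
let sampleRate = 16_000
let silence = [Float](repeating: 0, count: sampleRate)
let tone = (0..<sampleRate).map { index -> Float in
    Float(sin(2.0 * Double.pi * 440.0 * Double(index) / Double(sampleRate))) * 0.5
}
let trimmed = removeSilence(from: silence + tone)
print("kept \(trimmed.count) of \(silence.count + tone.count) samples")
```

A practical version would likely keep a short margin of audio around each retained frame so that word onsets are not clipped, and would map timestamps in the trimmed audio back to the original recording.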

Installation

  1. Clone this repository
  2. Run make
  3. Open the project in Xcode

License

This project is licensed under the GPL-3.0 license.

Links

Buy Me A Coffee



Languages

  • Swift 99.4%
  • Shell 0.3%
  • Makefile 0.3%
  • C 0.0%