rishikanthc / Scriberr

Self-hosted AI audio transcription

Home Page:https://scriberr.app

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Scriberr

ci

This is Scriberr, a self-hostable AI audio transcription app. Scriberr uses the open-source Whisper models from OpenAI, to transcribe audio files locally on your hardware. It uses the Whisper.cpp high-performance inference engine for OpenAI's Whisper. Scriberr also allows you to summarize transcripts using ollama or OpenAI's ChatGPT API, with your own custom prompts. From v0.2.0 Scriberr supports offline speaker diarization. Check out the documentation website for more details and instructions.

Features

  • Fast transcription with support for hardware acceleration across a wide variety of platforms
  • Customizable compute settings. Choose #threads, #cores and your model size
  • Transcription happens locally on device
  • Exposes API endpoints for automation pipelines and integrating with other tools
  • Optionally summarize transcripts with ChatGPT or Ollama
  • Use your own custom prompts for summarization
  • Mobile ready
  • Simple & Easy to use
  • Speaker Diarization (New)

and more to come. Checkout the planned features section.

Demo and Screenshots

Note

Demo was run locally on my Macbook Air M2 using docker. Performance depends on the size of the model used and also number of cores and threads you assign. Was running a lot of things in the background and this is in dev mode so it's really slow.

CleanShot.2024-10-04.at.14.55.46.mp4

CleanShot 2024-10-04 at 14 42 54@2x CleanShot 2024-10-04 at 14 48 31@2x CleanShot 2024-10-04 at 14 49 08@2x CleanShot 2024-10-04 at 15 11 27@2x

Installation

For installation and usage instruction refer the documentation website at scriberr.app

Note

This app is under development, so expect a few rough edges and minor bugs. Expect breaking changes in the first few minor releases. Will smooth out and try to avoid it as best as I can

If you like this project I would really appreciate it if you could star this repository.

About

Self-hosted AI audio transcription

https://scriberr.app

License:MIT License


Languages

Language:Svelte 53.8%Language:TypeScript 36.7%Language:CSS 4.0%Language:Python 2.4%Language:Dockerfile 1.4%Language:JavaScript 1.1%Language:Shell 0.4%Language:HTML 0.3%