transcription

There are 40 repositories under the transcription topic.

BasedHardware / omi
AI wearables. Put it on, speak, transcribe, automatically
ai app flutter friend mobile necklace omi python summary transcription wearable bci c nextjs personas smartglasses
Language:C 6298
abus-aikorea / voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
faster-whisper tts whisper gradio subtitles transcription translator webui speech-recognition speech-synthesis speech-to-text text-to-speech yt-dlp voice-cloning podcasts audiobook voice-conversion karaoke whisperx
Language:Python 4808
spotify / basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
audio lightweight machine-learning midi music pitch-detection polyphonic python transcription typescript
Language:Python 4238
pluja / whishper
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
ai audio-to-text golang subtitles sveltekit transcription whisper ui webapp speech-recognition speech-to-text stt web web-whisper
Language:Svelte 2653
speaches-ai / speaches
docker docker-compose faster-whisper openai-api openai-whisper openai-whisper-translation transcription whisper whisper-ai
Language:Python 2377
floneum / floneum
Instant, controllable, local pre-trained AI models in Rust
ai candle constrained-generation dioxus floneum-v3 kalosm llama llamacpp llm mistral rust transcription whisper
Language:Rust 2015
sindresorhus / awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
ai artificial-intelligence awesome awesome-list gpt openai speech-to-text transcription
1855
bugbakery / audapolis
an editor for spoken-word audio with automatic transcription
audio-editing speech-to-text transcription video-editing
Language:TypeScript 1732
juanmc2005 / diart
A python package to build AI-powered real-time audio applications
speaker-diarization streaming-audio real-time speaker-embedding deep-learning transcription voice-activity-detection
Language:Python 1460
hardhackerlabs / book
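"HardHacker (硬地骇客): The Road to $12,000 ARR in Two Months" is written by the HardHacker team. The book is a faithful record of the Podwise product journey, in five chapters: Inspiration, Build, Launch, Growth, and Retrospective. If reading alone is not enough, join the official HardHacker Knowledge Planet (知识星球) community to discuss it with the experts. The Podwise story has only just begun, and we will keep sharing what we learn there. Success may not be reproducible, but failure can always be learned from. Click the link below to join now!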
ai book indie-hackers mindmap mrr podcast summarizer transcription
Language:MDX 1266
azuwis / pianotrans
Simple GUI for ByteDance's Piano Transcription with Pedals
piano transcription ai
Language:Nix 1242
rishikanthc / Scriberr
Self-hosted AI audio transcription
ai audio transcript transcription
Language:Go 1241
YaoFANGUK / video-subtitle-generator
A GUI tool that generates subtitles from video and audio and writes SRT files. No third-party API is required: audio-to-text conversion runs locally. A Transformer-based video subtitle generation framework.
audio2text generation srt subtitle transcription whisper
Language:Python 982
Saik0s / Whisperboard
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
openai ios speech-recognition speech-to-text swiftui transcription audio-to-text composable-architecture tca tuist whisper whisper-cpp
Language:Swift 912
transcriptionstream / transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
automation diarization llm speaker-diarization speech-recognition transcription whisper ollama mistral-7b whisperx
Language:Python 889
aschmelyun / subvert
Generate subtitles, summaries, and chapters from videos in seconds
chatgpt openai transcription translation video-editing whisper
Language:PHP 830
nyrahealth / CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
asr audio detection filler recognition speech speech-recognition timestamps transcription verbatim whisper speech-processing
Language:Python 814
mayeaux / generate-subtitles
Generate transcripts for audio and video content with a user-friendly UI, powered by OpenAI's Whisper, with automatic translations and automatic video downloads via yt-dlp integration
expressjs libretranslate machine-learning nodejs transcription translation whisper gpu yt-dlp
Language:JavaScript 786
locaal-ai / obs-localvocal
OBS plugin for local speech recognition and captioning using AI
ai obs plugin speech-to-text whisper live-streaming livestream obs-studio obs-studio-plugin openai-whisper real-time-transcription realtime-transcribe realtime-translator speech-recognition transcription translation whisper-cpp
Language:C++ 748
freedmand / textra
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
command-line-tool macos ocr transcription
Language:Swift 741
kaixxx / noScribe
Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification)
audio-transcription interview pyannote qualitative-research transcription whisper-cpp
Language:Python 735
exPHAT / SwiftWhisper
🎤 The easiest way to transcribe audio in Swift
ios swift whisper macos openai speech-recognition speech-to-text transcription whisper-cpp
Language:Swift 660
Picovoice / cheetah
On-device streaming speech-to-text engine powered by deep learning
speech-to-text asr automatic-speech-recognition online-speech-recognition speech-recognition stt transcription voice-recognition streaming-speech-to-text
Language:Python 637
bbc / react-transcript-editor
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
bbc-news-labs news-labs transcript transcription transcript-editor stt kaldi react textav
Language:JavaScript 596
Dicklesworthstone / bulk_transcribe_youtube_videos_from_playlist
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
playlists transcription transcripts whisper youtube
Language:Python 541
dsymbol / decipher
Effortlessly add AI-generated transcription subtitles to your videos
openai transcription translation whisper
Language:Python 541
vilassn / whisper_android
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
android asr automatic-speech-recognition embedded mobile offline openai speech-recognition tensorflow tensorflowlite text-to-speech texttospeech tflite transcribe transcription translation tts whisper
Language:C++ 530
sveinbjornt / hear
Command line interface for the built-in speech recognition and transcription capabilities in macOS.
speech-recognition macos command-line-tool command-line transcribe-audio-files transcription transcribe-audio command-line-interface macosx osx subtitles subtitles-generator transcription-tool
Language:Objective-C 521
baxtree / subaligner
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
subtitles captions alignment subrip ttml voice-activity-detection subtitle-synchronization webvtt substation-alpha microdvd mpl2 tmp sami ebu-stl advanced-substation-alpha subtitle-translation subtitle-conversion scc sbv transcription
Language:Python 484
Picovoice / leopard
On-device speech-to-text engine powered by deep learning
stt speech-to-text asr automatic-speech-recognition on-device speech-recognition transcription voice-recognition voice-to-text
Language:Python 458
OpenNewsLabs / autoEdit_2
Fast text-based video editing: a Node Electron OS X desktop app with a Backbone front end.
video-editing dmg edl watson speech-to-text stt gentle gentle-stt osx electron backbone ibm-watson ibm-watson-speech mac speechmatics video-sequences autoedit transcription desktop backbonejs
Language:JavaScript 438
haydenbleasel / orate
The AI toolkit for speech.
ai change-voice isolation speech-recognition speech-to-text text-to-speech transcribe transcription
Language:TypeScript 432
bugbakery / transcribee
open source audio and video transcription software
transcription collaborative speech-to-text
Language:TypeScript 429
jim-schwoebel / voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
voice voice-assistant voice-recognition voice-recording transcription featurization data data-cleaning visualization generation voice-activity-detection voice-control server security encryption-decryption python3 machine-learning wake-word-detection voice-computing
Language:Python 386
Hugo-Dz / on-device-transcription
A ready-to-use, minimal app that converts any speech into text.
ai-transcription electronjs on-device-ai svelte sveltekit transcription
Language:JavaScript 375
Nikorasu / LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
assistant dictation openai python sounddevice speech-recognition speech-to-text text-to-speech transcription whisper ai chatbot openai-whisper tts voice voice-assistant voice-recognition numpy translation terminal
Language:Python 357
