There are 40 repositories under transcription topic.
AI wearables. Put it on, speak, transcribe, automatically
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
A python package to build AI-powered real-time audio applications
「硬地骇客 - 两个月 $12000 ARR 实践之路」是由 硬地骇客 团队编著,本书是关于 Podwise 产品历程的忠实记录:内容包含 灵感 - 构建 - 发布 - 增长 - 复盘 五个章节。如果你觉得一个人读不够过瘾,欢迎加入「硬地骇客」官方知识星球与专家们一起讨论!Podwise 的故事才刚刚开始,我们也将在星球持续分享我们的认知,成功可能无法复制,但失败一定可以借鉴。现在就点击下方链接加入吧!
视频音频生成字幕,生成srt文件。无需申请第三方API,本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
turnkey self-hosted offline transcription and diarization service with llm summary
Generate subtitles, summaries, and chapters from videos in seconds
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration
OBS plugin for local speech recognition and captioning using AI
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
🎤 The easiest way to transcribe audio in Swift
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Command line interface for the built-in speech recognition and transcription capabilities in macOS.
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
Fast text based video editing, node Electron Os X desktop app, with Backbone front end.
The AI toolkit for speech.
open source audio and video transcription software
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
A ready-to-use, minimal app that converts any speech into text.
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.