There are 32 repositories under the transcription topic.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
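"硬地骇客: The Road to $12,000 ARR in Two Months" is written by the 硬地骇客 team. The book is a faithful record of the Podwise product journey, in five chapters: Inspiration, Build, Launch, Growth, and Retrospective. If reading it alone isn't enough for you, join the official 硬地骇客 community on 知识星球 to discuss it with the experts! Podwise's story has only just begun, and we will keep sharing what we learn in the community. Success may not be reproducible, but failure can always be learned from. Click the link below to join now!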
A python package to build AI-powered real-time audio applications
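Generates subtitles from video and audio and outputs SRT files. No third-party API required; audio-to-text conversion runs locally. A Transformer-based framework for video subtitle generation. A GUI tool for generating subtitles from videos and producing SRT files.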
Generate subtitles, summaries, and chapters from videos in seconds
Generate transcripts for audio and video content with a user-friendly UI, powered by OpenAI's Whisper, with automatic translations and automatic video downloads through yt-dlp integration
Turnkey self-hosted offline transcription and diarization service with LLM summary
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
🎤 The easiest way to transcribe audio in Swift
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
OBS plugin for local speech recognition and captioning using AI
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
Fast text-based video editing: a Node/Electron OS X desktop app with a Backbone front end.
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
A ready-to-use, minimal app that converts any speech into text.
Command line speech recognition and transcription for macOS
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
open source audio and video transcription software
Transcribe and translate audio to text using Whisper and DeepL.
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
Tero Subtitler is an open source, cross-platform, and free subtitle editing software.
Talk to ChatGPT in real time using LiveKit