There are 8 repositories under the audio-transcription topic.
Pybind11 bindings for Whisper.cpp
The main repo for Stage Whisper: a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
A static site demonstrating real-time audio transcription via Amazon Transcribe over a WebSocket.
Free speech to text
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI's Whisper for free.
Streamlit Audio Transcription with OpenAI's Whisper: an interactive Streamlit app demonstrating real-time audio transcription using OpenAI's Whisper.
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
Speakscribe is a web application that lets users transcribe audio using OpenAI and interact with a chatbot. The web application is built in Python using NiceGUI.
Transcription and annotation interface for recorded audio or video files
Generate subtitles for long movies / podcasts with OpenAI Whisper API.
Generate text captions for audio files and YouTube videos using OpenAI Whisper on Google Colab. Supports multiple languages.
A portal that offers a transcription chain for multi upload and processing of audio files using ASR, OCTRA, MAUS and EMU-webApp.
[Russian] This script splits an audio file on silence, transcribes it with Google speech recognition, and saves the result in the LJSpeech-1.1 dataset format.
Cloud audio transcription with Whisper or WhisperX.
Python package to scrape webpages and transcribe video content from a video sharing platform.
A cross-platform, fully functional, full-featured GUI implementation of the OpenAI API.
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
Uses the WhisperS2T and CTranslate2 libraries to batch transcribe multiple files.
ChatGPT API-based video game audio translator application.
The Real-Time Speech Recognition System is an innovative tool designed to revolutionize the way we interact with audiovisual content. Developed by Miguel Kallemback, this system uses cutting-edge speech recognition technology to transcribe audio in real time, making content accessible to a wider audience.
ClearSpeak is a real-time audio transcription application using Google's Speech-to-Text API. It features a Tkinter-based GUI, filtering background noise, and providing clear speech transcription.
A speech-to-text project that uses OpenAI's Whisper model.
A web demo of OpenAI's automatic speech recognition system Whisper, built with Gradio.
PyScribe is a command-line tool to transcribe audio files. It uses `ffmpeg` for audio conversion and `pywhisper` for transcription.
Landing page of Media Podium, an AI-powered, privacy-focused audio and video transcription app. Unlimited free transcription time.
Deepgram Transcription Processor is a Python program designed to process transcription output obtained from Deepgram's transcription service. It extracts key information such as conversation, summary, and paragraphs from the transcription output JSON and writes them to separate text files for further analysis and reference.
Automate Audio Transcription with OpenAI: Fast, Accurate, and Easy!
WhisperAudioTranscriber is an asynchronous audio recording and transcription tool built using Python. It uses the Hugging Face API to run OpenAI's Whisper model.
A Streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP