There are 2 repositories under the whisperx topic.
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
faster_whisper GUI with PySide6
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Turnkey self-hosted offline transcription and diarization service with LLM summary
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
A simple GUI to use Whisper.
Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.
WhisperX FastAPI integration
A cross-platform, customizable VLC video player that can generate subtitles using the WhisperX model
Transcribe Like a Pro, Without Paying a Penny!
User-friendly toolkit for generating immersion language learning tools, including downloading media, generating subtitles, and creating Anki decks
A sleek, web-based audio player featuring synchronized subtitle display, speaker diarization support, and keyboard controls in a modern, responsive interface
This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (csv, txt, json, & vtt).
Generate fully aligned subtitles for any Video or Audio file on your local system for free using the amazing capabilities of WhisperX.
An AI-powered tool for end-to-end video translation and dubbing.
Code for our INTERSPEECH 2024 paper: Comparing ASR Systems in the Context of Speech Disfluencies.
A streamlined CLI tool combining `yt-dlp` and `whisperx` (or `openai-whisper`) for quick and efficient audio transcription from various video platforms.
VideoWise is a video transcription and AI-powered analysis tool that helps users easily upload, transcribe, and interact with video content. Using WhisperX for high-quality transcriptions and Ollama for AI-driven insights, VideoWise makes it easy to search, analyze, and export video data.
A tool for automatically adding subtitles to short social media videos
Llama2 fine-tuning framework for Q&A from transcriptions of YouTube videos
WhisperX Slack bot for transcribing audio files
FastAPI service capable of generating police reports (boletins de ocorrência) from an audio recording.
Transcribrr is a Python desktop application that transcribes audio/video files or YouTube videos and summarizes the output with OpenAI's GPT models, using a variety of preset prompts.
Python GUI to interact with WhisperX
Dockerized transcription pipeline using WhisperX.
Topic Modelling (LDA) and Dynamic Topic Modelling for Top Gear
A practical collection of ASR models and tools — including Whisper variants and Google STT — with implementations for real-time, batch transcription, and multi-platform integration.
Cog implementation of a transcription and diarization pipeline with Whisper & Pyannote
WhisperX deployment on Replicate, forked to expose initial_prompt