There are 147 repositories under the whisper topic.
Port of OpenAI's Whisper model in C/C++
Faster Whisper transcription with CTranslate2
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Open source real-time translation app for Android that runs locally
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
A free and open-source, self-hosted, AI-based live meeting note taker and minutes summary generator that runs entirely on your local device (macOS and Windows are supported; Linux support is in progress). Website: https://meetily.ai/
On-device Speech Recognition for Apple Silicon
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Mac app for crushing tech interviews with AI
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms
A nearly-live implementation of OpenAI's Whisper.
ChatGPT Java SDK with support for streaming output, GPT plugins, and web access. Supports all official OpenAI APIs. A Java client for ChatGPT. OpenAI GPT-3.5-Turbo and GPT-4 API client for Java
🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python
OpenAI API + Ruby! 🤖❤️ GPT-5 & Realtime WebRTC compatible!
ML-powered speech recognition directly in your browser
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks
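A fully automatic (audio) video translation project. It uses Whisper to recognize speech, a large AI model to translate the subtitles, and finally merges the subtitles with the video to produce the translated video.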
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
faster_whisper GUI with PySide6
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
A Web UI for easy subtitle generation using the Whisper model.
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!
Automatically generate and overlay subtitles for any video.