cafew's starred repositories

WhisperHallu

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

Language:PythonStargazers:245Issues:0Issues:0

CTranslate2

Fast inference engine for Transformer models

Language:C++License:MITStargazers:3125Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:10722Issues:0Issues:0

whisper-typer-tool

This is a python script using whisper to type with your voice

Language:PythonLicense:MITStargazers:48Issues:0Issues:0

diart

A python package to build AI-powered real-time audio applications

Language:PythonLicense:MITStargazers:938Issues:0Issues:0

whispercppGUI

GUI for whispercpp, a high performance C++ port of OpenAI's whisper

Language:PythonLicense:MITStargazers:56Issues:0Issues:0

whispercpp

Pybind11 bindings for Whisper.cpp

Language:C++License:Apache-2.0Stargazers:319Issues:0Issues:0

WhisperSubsWindows

Generate video subtitles using Whisper cpp on Windows

Language:BatchfileStargazers:4Issues:0Issues:0

whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

Language:PythonLicense:MITStargazers:775Issues:0Issues:0

yt-whisper

Using OpenAI's Whisper to automatically generate YouTube subtitles

Language:PythonLicense:MITStargazers:1340Issues:0Issues:0

buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Language:PythonLicense:MITStargazers:11510Issues:0Issues:0

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1501Issues:0Issues:0

efficient_whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:Jupyter NotebookLicense:MITStargazers:19Issues:0Issues:0

Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

Language:C++License:MPL-2.0Stargazers:7894Issues:0Issues:0

tgisper

Telegram bot with ASR

Language:PythonLicense:MITStargazers:21Issues:0Issues:0

StoryToolkitAI

An editing tool that uses AI to transcribe, understand content and search for anything in your footage, integrated with ChatGPT and other AI models

Language:PythonLicense:GPL-3.0Stargazers:652Issues:0Issues:0

riva-asrlib-decoder

Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva

Language:PythonStargazers:78Issues:0Issues:0

podalize

Podalize: Podcast Transcription and Analysis

Language:PythonLicense:MITStargazers:143Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:10361Issues:0Issues:0

TUM-Live-Voice-Service

Microservice that generates subtitles for TUM-Live

Language:PythonLicense:MITStargazers:16Issues:0Issues:0

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Language:PythonLicense:AGPL-3.0Stargazers:1761Issues:0Issues:0

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:2565Issues:0Issues:0

punctuation-restoration

Punctuation Restoration using Transformer Models for High-and Low-Resource Languages

Language:PythonLicense:MITStargazers:201Issues:0Issues:0

whispering

Streaming transcriber with whisper

Language:PythonLicense:MITStargazers:686Issues:0Issues:0

WhisperWithVAD

Whisper combined with Silero VAD, for improved long-form transcriptions

Language:Jupyter NotebookLicense:MITStargazers:38Issues:0Issues:0

VoiceAssistant

A VoiceAsistant with WhisperAI speech recognition

Language:PythonLicense:MITStargazers:28Issues:0Issues:0

transcriber_app

Real time speech to text transcription app.

Language:PythonStargazers:366Issues:0Issues:0

whisper-openvino

openvino version of openai/whisper

Language:Jupyter NotebookLicense:MITStargazers:151Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:3571Issues:0Issues:0
Language:PythonLicense:MITStargazers:31Issues:0Issues:0