There are 3 repositories under speech-transcription topic.
A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Long audio alignment using Kaldi
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.
SPEAR-ASR and SPEAR-WakeUp Software Development Kit for Android
Speech transcription and speech diarization
SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Python for Linux
Real time caption generator using Microsoft Azure speech services
Whisper Transcription Service
A yarp plugin to perform speech transcription using openai whisper