There are 2 repositories under speech-transcription topic.
A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Long audio alignment using Kaldi
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Speech transcription and speech diarization
SPEAR-ASR and SPEAR-WakeUp Software Development Kit for Android
Real time caption generator using Microsoft Azure speech services
SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Python for Linux
A yarp plugin to perform speech transcription using openai whisper