speech-transcription

There are 14 repositories under the speech-transcription topic.

Dadangdut33 / Speech-Translate
A real-time speech transcription and translation application using OpenAI Whisper and a free translation API. Interface built with Tkinter. Written entirely in Python.
python speech-transcription speech-translation tkinter-python translate whisper
Language:Python 456
Appen / UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
speech-processing speech-annotation speech-recognition speaker-diarization speech-seperation gender-classification speaker-identification synthetic-speech-detection speech-transcription topic-detection audio-segmentation accent-detection
Language:Forth 100
srinivr / kaldi-long-audio-alignment
Long audio alignment using Kaldi
kaldi longaudio-alignment audio-segments asr automatic-speech-recognition split-audio speech-recognition speech-to-text speechrecognition transcription speech-transcription
Language:Shell 25
jhauret / vibravox
Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.
bandwidth-extension datasets hydra pytorch pytorch-lightning speaker-verification speech-transcription speech-enhancement
Language:Python 19
PranavPutsa1006 / Speaker-Diarization
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
deep-learning embeddings-extraction mfcc neural-networks speaker-diarization spectral-clustering speech-detection speech-segmentation speech-to-text speech-transcription voice-activity-detection
Language:Jupyter Notebook 16
KevKibe / African-Whisper
🚀 Framework for seamless fine-tuning of the Whisper model on a multilingual dataset and deployment to production.
asr speech speech-recognition speech-to-text speech-transcription speech-translation whisper
Language:Python 15
capjamesg / awsnap.js
Navigate websites by clicking your fingers and saying the link you want to visit.
audio-classification tensorflow-js speech-transcription webaudio-api
Language:HTML 3
otonomee / mic2transcript
CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.
audio cli cli-tool openai speech speech-transcription transcription whisper
Language:Python 2
Think-A-Move / SPEAR-SDK-Java-Android
SPEAR-ASR and SPEAR-WakeUp Software Development Kit for Android
android offline voice-commands speech voice-recognition speech-recognition speech-to-text stt asr command-and-control on-device speech-transcription voice-enable voice-transcription java voice-control natural-language-processing noise-robust speech-processing high-noise
Language:Java 2
adam-aalah / Speech-transcription
Speech transcription and speaker diarization
diarization python speech-diarization speech-to-text speech-transcription speechbrain transcription whisper-ai
Language:Python 1
Think-A-Move / SPEAR-SDK-Python-Linux
SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Python for Linux
python linux offline voice-commands speech voice-recognition speech-recognition speech-to-text voice-control stt asr command-and-control on-device speech-transcription voice-enable voice-transcription high-noise natural-language-processing noise-robust speech-processing
Language:Python 1
JadenChun / real-time-caption-generator
Real-time caption generator using Microsoft Azure speech services
azure-speech-service speech-transcription speech-translation windows-application real-time-caption cpp gui-application qt-widgets
Language:C++ 0
ksquarekumar / whisper-stream
Whisper Transcription Service
automatic-speech-recognition deep-learning flax inference jax speech-to-text speech-transcription speech-translation transformer whisper openai
Language:Jupyter Notebook
robotology / yarp-device-speechTranscription-whisper
A YARP plugin to perform speech transcription using OpenAI Whisper
openai speech-to-text speech-transcription whisper yarp
Language:C++