Sefa Alper's repositories
Audio-and-text-based-emotion-recognition
A multimodal approach to emotion recognition using audio and text.
faster-whisper
Faster Whisper transcription with CTranslate2
NeMo
NeMo: a toolkit for conversational AI
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
recognito
Java Speaker Recognition Framework
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
unimrcp
Open source cross-platform implementation of the MRCP protocol
whisper-jni
A JNI wrapper for whisper.cpp that allows transcribing speech to text in Java.