sefaalper

Sefa Alper's repositories

A multimodal approach to emotion recognition using audio and text.

Language: Jupyter Notebook; License: Apache-2.0

Faster Whisper transcription with CTranslate2

Language: Python; License: MIT

Language: C

Language: C++

NeMo: a toolkit for conversational AI

Language: Python; License: Apache-2.0

Python interface to the WebRTC Voice Activity Detector

Language: C; License: NOASSERTION

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, and speaker embedding

Language: Jupyter Notebook; License: MIT

Java Speaker Recognition Framework

Language: Java; License: Apache-2.0

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Python; License: NOASSERTION

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Jupyter Notebook; License: MIT

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language: Jupyter Notebook; License: Apache-2.0

Open source cross-platform implementation of the MRCP protocol

Language: C; License: Apache-2.0

A JNI wrapper for whisper.cpp that allows transcribing speech to text in Java.

Language: Java; License: Apache-2.0