Alpha Cephei's repositories
vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
vosk-android-demo
Offline speech recognition for Android with Vosk library.
awesome-russian-speech
Russian speech technology links
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
vosk-unity-asr
Automatic Speech Recognition in Unity using Vosk library
whisper-prompts
OpenAI Whisper Prompt Examples
vosk-space
Website and documentation
awesome-speech
Resources that make every language unique
sherpa-onnx
Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
Irene-Voice-Assistant
Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
faster-whisper
Faster Whisper ASR transcription with CTranslate2
text_search
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup