Alpha Cephei's repositories
vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
vosk-android-demo
Offline speech recognition for Android with Vosk library.
awesome-russian-speech
Russian speech technology links
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
vosk-unity-asr
Automatic Speech Recognition in Unity using Vosk library
awesome-speech
Resources that make every language unique
vosk-space
Website and documentation
sherpa-onnx
Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Irene-Voice-Assistant
Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
faster-whisper
Faster Whisper ASR transcription with CTranslate2
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis