Ciaran O'Reilly's repositories
vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
NetflixEnCatala
Extensió pel Chrome que automàticament silencia l'àudio de Netflix i reprodueix el doblatge en català.
commonvoice-utils
Linguistic processing for Common Voice
coqui-ai-tensorflow
An Open Source Machine Learning Framework for Everyone
QuickVC-VoiceConversion
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
raspberry-pi-pwm-fan-control
raspberry pi pwm fan control
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
speechbrain
A PyTorch-based Speech Toolkit
streaming-source-separation
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
telegram-deepspeech-bot
A Telegram bot that infers text from voice notes using DeepSpeech
tevr-asr-tool
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
VoiceActivityProjection
Voice Activity Projection Models: Self-supervised learning of Turn-taking Events
vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
vscode-audio-preview
VS Code Extension to preview and play wav file.