Ciaran O'Reilly's starred repositories
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
open-lid-dataset
Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)
webtorrent
⚡️ Streaming torrent client for the web
whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
whisper-turbo
Cross-Platform, GPU Accelerated Whisper 🏎️
rapidpages
Generate React and Tailwind components using AI
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
whisper-edge
OpenAI Whisper for edge devices
SpeechTokenizer
Code for the SpeechTokenizer presented in "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on
VoiceActivityProjection
Voice Activity Projection Models: Self-supervised learning of Turn-taking Events
speculative-decoding
Explorations into some recent techniques surrounding speculative decoding
spear-tts-pytorch
Implementation of Spear-TTS, a multi-speaker text-to-speech attention network, in PyTorch
WavJourney
WavJourney: Compositional Audio Creation with LLMs
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
getTV3Videos
Download videos from the TV3 channel (tv3.cat)
PolyLangVITS
Multi-speaker Speech Synthesis Using VITS (KO, JA, EN, ZH)
stable-diffusion.cpp
Stable Diffusion in pure C/C++
langstream
Build robust LLM applications with true composability 🔗