Ciaran O'Reilly's starred repositories
vits_chinese
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
dp-rewrite
DP-Rewrite: Towards Reproducibility and Transparency in Differentially Private Text Rewriting
vanmoof-encryption-key-exporter
Export all bike details (such as encryption key) of your VanMoof bikes.
tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
telegram_media_downloader
Download media files from a telegram conversation/chat/channel up to 2GiB per file
piping-ssh-web
SSH over HTTPS via Piping Server on Web browser
piping-server
Infinitely transfer between every device over pure HTTP with pipes or browsers
go-webrtc-piping
WebRTC P2P tunneling/duplex with Piping Server WebRTC signaling
willow-inference-server
Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS
actinia-core
Actinia Core is an open source REST API for scalable, distributed, high performance processing of geographical data that uses mainly GRASS GIS for computational tasks (DOI: https://doi.org/10.5281/zenodo.5879231) | Tutorial: https://actinia-org.github.io/actinia-core/ | Docker: https://hub.docker.com/r/mundialis/actinia-core
INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
DeepFilterNet
Noise supression using deep filtering
noise-repellent
Lv2 suite of plugins for broadband noise reduction
open-archaeo
A list of open source archaeological software and resources
Auto_Tuning_Zeroshot_TTS_and_VC
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis", Interspeech 2023
zm-text-tts
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS