Luis Armendariz's starred repositories
faster-whisper
Faster Whisper transcription with CTranslate2
wavesurfer.js
Audio waveform player
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
mlx-examples
Examples in the MLX framework
podman-compose
A script to run docker-compose.yml using Podman
big-AGI
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, and much more. Deploy on-prem or in the cloud.
awesome-conformal-prediction
A professionally curated list of awesome Conformal Prediction videos, tutorials, books, papers, PhD and MSc theses, articles and open-source libraries.
bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
CTranslate2
Fast inference engine for Transformer models
dateparser
Python parser for human-readable dates
DeepFilterNet
Noise suppression using deep filtering
jupyterlab-git
A Git extension for JupyterLab
musicinformationretrieval.com
Instructional notebooks on music information retrieval.
Thorsten-Voice
Thorsten-Voice: A free-to-use, offline, high-quality German TTS voice that should be available to every project without licensing hassles.
ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
fastfeedforward
A repository for log-time feedforward networks
pyannote-metrics
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
VocalForge
Your one-stop solution for voice dataset creation
ozen-toolkit
Audio datasets, easier.