Gió's repositories
autogen
A programming framework for agentic AI 🤖
awesome-italian-public-datasets
A selection of interesting open datasets from the Italian Public Administration and civic data use cases
bergamot-translator
Cross-platform C++ library focused on optimized machine translation on consumer-grade devices.
glombardo-se
Config files for my GitHub profile.
ctc-forced-aligner
Text to speech alignment using CTC forced alignment
DeepFilterNet
Noise suppression using deep filtering
ECCV2022-RIFE
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
gpt4v-video-voiceover
Video Voiceover with GPT4V
home-assistant-core
🏡 Open source home automation that puts local control and privacy first.
linguist
Translate web pages, highlighted text, Netflix subtitles, private messages, speak the translated text, and save important translations to your personal dictionary to learn words even offline
llama-coder
Replace Copilot with local AI
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Practical-RIFE
More practical frame interpolation approach.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
python-iso3166
Standalone ISO 3166-1 country definitions
qtlanguageserver
An implementation of the Language Server Protocol
rife-ncnn-vulkan
RIFE, Real-Time Intermediate Flow Estimation for Video Frame Interpolation implemented with ncnn library
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
vits2_pytorch
Unofficial VITS2 TTS implementation in PyTorch
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
whisper.cpp
Port of OpenAI's Whisper model in C/C++
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
zlib.install
ZLIB installer for Windows.