Charleno Pires's starred repositories
stable-diffusion-webui-aesthetic-gradients
Aesthetic gradients extension for web ui
stable-diffusion-aesthetic-gradients
Personalization for Stable Diffusion via Aesthetic Gradients 🎨
stable-diffusion-webui-extensions
Extension index for stable-diffusion-webui
whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
coremltools
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
webtransport-go
Lightweight but fully-capable WebTransport server for Go
whisper-rs
Rust bindings to https://github.com/ggerganov/whisper.cpp
pocketsphinx
A small speech recognizer
ExtractThinker
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
tensorflow-speech-recognition
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
speechbrain
A PyTorch-based Speech Toolkit
faster-whisper
Faster Whisper transcription with CTranslate2
speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.