rivermonster's starred repositories
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate of the original model.
awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
ComfyUI_VLM_nodes
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
LCM_Inpaint_Outpaint_Comfy
ComfyUI custom nodes for inpainting/outpainting using the new latent consistency model (LCM)
firefly-iii
Firefly III: a personal finances manager
FasterThanSight
Desktop speed reading software for Windows, Mac and Linux
keepyourmouthshut
Acid Reflux for your Ears!
Microsoft-Activation-Scripts
A Windows and Office activator using HWID / Ohook / KMS38 / Online KMS activation methods, with a focus on open-source code and fewer antivirus detections.
unstable_journey
A desktop Paint application powered by Stable Diffusion, automatic1111 webui and PieCasso!
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
stable_diffusion_sketch
Stable Diffusion Sketch, an Android client app that connects to your own automatic1111's Stable Diffusion Web UI
text-generation-webui
A Gradio web UI for Large Language Models.
canvas-zoom
Zoom and pan functionality
alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM
Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality