rivermonster

rivermonster's starred repositories

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language: Python | License: MIT | 337500

awesome-vlm-architectures

Famous Vision Language Models and Their Architectures

Language: Markdown | License: CC0-1.0 | 20000

ComfyUI_VLM_nodes

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

Language: Python | License: Apache-2.0 | 31100

bangs

Repository of bangs used by Kagi Search

Language: Ruby | License: MIT | 13300

LCM_Inpaint_Outpaint_Comfy

ComfyUI custom nodes for inpainting/outpainting using the new latent consistency model (LCM)

Language: Python | 22800

firefly-iii

Firefly III: a personal finances manager

Language: PHP | License: AGPL-3.0 | 1498700

jetzt

Speed reader extension for Chrome

Language: JavaScript | License: NOASSERTION | 48500

FasterThanSight

Desktop speed reading software for Windows, Mac and Linux

Language: C++ | License: MIT | 7100

reasy

Reasy Firefox Extension

Language: JavaScript | 900

crowdcast

Converts a subreddit into a podcast

Language: Python | License: MIT | 10600

keepyourmouthshut

Acid Reflux for your Ears!

Language: Python | License: GPL-3.0 | 6900

Microsoft-Activation-Scripts

A Windows and Office activator using HWID / Ohook / KMS38 / Online KMS activation methods, with a focus on open-source code and fewer antivirus detections.

Language: Batchfile | License: GPL-3.0 | 8781800

vpinball

Visual Pinball

Language: C++ | License: NOASSERTION | 50900

czkawka

Multi-functional app to find duplicates, empty folders, similar images, etc.

Language: Rust | License: NOASSERTION | 1859600

unstable_journey

A desktop Paint application powered by Stable Diffusion, automatic1111 webui and PieCasso!

Language: Python | License: Apache-2.0 | 7400

aiyabot

A neat Discord bot for AUTOMATIC1111's Web UI

Language: Python | License: GPL-2.0 | 30500

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | 5164700

stable_diffusion_sketch

Stable Diffusion Sketch, an Android client app that connects to your own automatic1111's Stable Diffusion Web UI

Language: Java | License: GPL-3.0 | 9100

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language: Python | License: NOASSERTION | 1130200

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | 3221100

leon

🧠 Leon is your open-source personal assistant.

Language: TypeScript | License: MIT | 1503000

automatic

SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models

Language: Python | License: AGPL-3.0 | 533600

text-generation-webui

A Gradio web UI for Large Language Models.

Language: Python | License: AGPL-3.0 | 3860900

canvas-zoom

Zoom and pan functionality

Language: JavaScript | 34800

alpaca.cpp

Locally run an Instruction-Tuned Chat-Style LLM

Language: C | License: MIT | 1026500

Kandinsky-2

Kandinsky 2: multilingual text2image latent diffusion model

Language: Jupyter Notebook | License: Apache-2.0 | 273000

progress_knight_2

Mod

Language: JavaScript | 3100

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | 1253300

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | 683300

ultimate-upscale-for-automatic1111

Language: Python | License: GPL-3.0 | 157200