whalemonster's starred repositories
faster-whisper
Faster Whisper transcription with CTranslate2
Speech-Translate
A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
Piper-Read
Piper Read is a lightweight Piper TTS GUI written in C#.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
anything-llm
The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
MSEdgeRedirect
A Tool to Redirect News, Search, Widgets, Weather and More to Your Default Browser
WhisperLive
A nearly-live implementation of OpenAI's Whisper.
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
Fuck_off_EA_App
Keep using Origin instead of EA App
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
local_llm_assistant
World's Easiest GPT-like Voice Assistant
llm_voice_chatbot_rpi
Local LLM-based voice chatbot running on Raspberry Pi
ComfyUI-TTS
A set of TTS nodes for ComfyUI
ComfyUI-AnimateAnyone-Evolved
Improved AnimateAnyone implementation that allows you to use the opse image sequence and reference image to generate stylized video
vid2densepose
Convert your videos to densepose and use it on MagicAnimate
Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
sd_api_pictures_tag_injection
Based on @Brawlence's extension
oobaboogas-webui-langchain_agent
Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work
long_term_memory_with_qdrant
RAG implementation for Ooba characters. dynamically spins up new qdrant vector DB and manages retrieval and commits for conversations based entirely on character name. Provides characters with access to past chat conversations