Mithrandil's starred repositories
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
autogenstudio-skills
Repo of skills for autogenstudio
Qwen-Agent
Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
comfyui-colab
comfyui colabs templates new nodes
faster-whisper
Faster Whisper transcription with CTranslate2
alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
StoryDiffusion
Create Magic Story!
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
XTTS-RVC-UI
A Gradio UI for XTTSv2 and RVC.
xtts-finetune-webui
Slightly improved official version for finetune xtts
xtts-webui
Webui for using XTTS and for finetuning it
rs-mainspring-winder
Free design of a 3D printed watch mainspring winder with a rising sun knurling pattern.
anthropic-cookbook
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
lossless-cut
The swiss army knife of lossless video/audio editing
SuperSlicer_to_Orca_scripts
Script(s) to convert SuperSlicer data for use in Orca Slicer
Local-LLM-Comparison-Colab-UI
Compare the performance of different LLM that can be deployed locally on consumer hardware. Run yourself with Colab WebUI.