renolynx's starred repositories
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
facefusion
Next generation face swapper and enhancer
StableCascade
Official Code for Stable Cascade
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
k-diffusion
Karras et al. (2022) diffusion models for PyTorch
DynamiCrafter
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
clip-interrogator-ext
Stable Diffusion WebUI extension for CLIP Interrogator
DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
freecontrol
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
sdweb-merge-block-weighted-gui
Merge models with separate rate for each 25 U-Net block (input, middle, output). Extension for Stable Diffusion UI by AUTOMATIC1111
ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
Comfy_Dungeon
At the moment this is mostly a tech demo to show how to build a web app on top of ComfyUI
ComfyUI-Qwen-VL-API
QWen-VL-Plus & QWen-VL-Max in ComfyUI
CartoonSegmentation
Instance segmentation for cartoon/anime characters and some visual techniques building around it.
img-txt_viewer
Display an image and text file side-by-side for easy manual caption editing.