Beast code in Giters

renolynx's starred repositories

GPT-SoVITS

Just 1 minute of voice data is enough to train a good TTS model (few-shot voice cloning).

Language: Python | License: MIT | 29596 189 974

maybe

The OS for your personal finances

Language: Ruby | License: AGPL-3.0 | 28827 149 299

facefusion

Next generation face swapper and enhancer

Language: Python | License: NOASSERTION | 16755 161 365

StableCascade

Official Code for Stable Cascade

Language: Jupyter Notebook | License: MIT | 6451 61 121

CogVLM

A state-of-the-art open visual language model | multimodal pre-trained model

Language: Python | License: Apache-2.0 | 5691 66 405

moondream

tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | 4576 54 98

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal dialogue model with performance approaching GPT-4o.

Language: Python | License: MIT | 4320 43 357

MeloTTS

High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.

Language: Python | License: MIT | 4103 39 136

SUPIR

SUPIR aims to develop practical algorithms for photo-realistic image restoration in the wild. A new online demo is available at suppixel.ai.

Language: Python | License: NOASSERTION | 3995 70 123

WhisperSpeech

An open-source text-to-speech system built by inverting Whisper.

Language: Jupyter Notebook | License: MIT | 3607 72 96

ComfyUI_IPAdapter_plus

Language: Python | License: GPL-3.0 | 3374 36 582

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language: Python | License: MIT | 2198 41 63

DynamiCrafter

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language: Python | License: Apache-2.0 | 2181 30 105

notesGPT

Record voice notes, then transcribe them, summarize them, and extract tasks

Language: TypeScript | License: MIT | 1603 21 20

EveryDream2trainer

Language: Python | License: NOASSERTION | 785 18 106

GPT4V-Image-Captioner

Language: Python | License: GPL-3.0 | 706 10 43

taggui

Tag manager and captioner for image datasets

Language: Python | License: GPL-3.0 | 521 9 145

clip-interrogator-ext

Stable Diffusion WebUI extension for CLIP Interrogator

Language: Python | License: MIT | 477 10 75

segmoe

Language: Python | License: Apache-2.0 | 390 6 23

DragAnything

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

Language: Python | 381 17 20

freecontrol

Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"

Language: Python | 377 29 8

sdweb-merge-block-weighted-gui

Merge models with a separate rate for each of the 25 U-Net blocks (input, middle, output). Extension for the Stable Diffusion UI by AUTOMATIC1111

Language: Python | 314 8 16

LECO

Low-rank adaptation for Erasing COncepts from diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | 302 7 27

ai-toolkit

Various AI scripts. Mostly Stable Diffusion stuff.

Language: Python | License: MIT | 224 17 26

Comfy_Dungeon

At the moment, this is mostly a tech demo showing how to build a web app on top of ComfyUI

Language: JavaScript | License: Apache-2.0 | 194 10 2

ComfyUI-Qwen-VL-API

QWen-VL-Plus & QWen-VL-Max in ComfyUI

Language: Python | License: GPL-3.0 | 185 4 5

CartoonSegmentation

Instance segmentation for cartoon/anime characters, plus some visual techniques built around it.

Language: Jupyter Notebook | 128 9 5

waifuset

Language: Python | License: MPL-2.0 | 7300

PuTT

Language: Python | License: MIT | 3700

img-txt_viewer

Display an image and text file side-by-side for easy manual caption editing.

Language: Python | License: CC0-1.0 | 34 1 9