jotoy's starred repositories

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:23640Issues:251Issues:289

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10646Issues:123Issues:207
Language:PythonLicense:Apache-2.0Stargazers:8984Issues:83Issues:1844
Language:PythonLicense:Apache-2.0Stargazers:7041Issues:67Issues:69

StableCascade

Official Code for Stable Cascade

Language:Jupyter NotebookLicense:MITStargazers:6476Issues:61Issues:121

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5803Issues:47Issues:75

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4239Issues:62Issues:93

mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:3321Issues:30Issues:767

pytorch-fid

Compute FID scores with PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:3270Issues:15Issues:85

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2604Issues:46Issues:0

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language:PythonLicense:MITStargazers:2438Issues:34Issues:260

DynamiCrafter

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language:PythonLicense:Apache-2.0Stargazers:2222Issues:30Issues:110

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:1864Issues:23Issues:85

Cream

This is a collection of our NAS and Vision Transformer work.

Language:PythonLicense:MITStargazers:1628Issues:36Issues:156

server-bot-quick-start

Tutorial for Poe server bots

sd-dynamic-thresholding

Dynamic Thresholding (CFG Scale Fix) for Stable Diffusion (StableSwarmUI, ComfyUI, and Auto WebUI)

Language:PythonLicense:MITStargazers:1081Issues:6Issues:85

muse-maskgit-pytorch

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Language:PythonLicense:MITStargazers:839Issues:34Issues:36

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Language:PythonLicense:Apache-2.0Stargazers:719Issues:17Issues:41

res-adapter

Official implementation of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".

Language:PythonLicense:Apache-2.0Stargazers:718Issues:17Issues:17

Wuerstchen

Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models

Language:Jupyter NotebookLicense:MITStargazers:517Issues:23Issues:22

piecewise-rectified-flow

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:378Issues:17Issues:9

mvit

Code Release for MViTv2 on Image Recognition.

Language:PythonLicense:Apache-2.0Stargazers:374Issues:14Issues:19

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language:PythonLicense:MITStargazers:319Issues:12Issues:14

ComfyUI-AutomaticCFG

If your image was a pizza and the CFG the temperature of your oven: this is a thermostat that ensures it is always cooked like you want. Also adds a 30% speed increase. For ComfyUI / StableDiffusion

TCD

Official Repository of the paper "Trajectory Consistency Distillation"

Language:PythonLicense:MITStargazers:64Issues:1Issues:2

learning-to-cache

Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching