jotoy

followers

following

stars

jotoy's starred repositories

generative-models

Generative Models by Stability AI

Language:PythonMIT23640 251 289

Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonApache-2.010646 123 207

kohya_ss

Language:PythonApache-2.08984 83 1844

LWM

Language:PythonApache-2.07041 67 69

StableCascade

Official Code for Stable Cascade

Language:Jupyter NotebookMIT6476 61 121

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION5803 47 75

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonMIT4239 62 93

mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language:PythonApache-2.03321 30 767

pytorch-fid

Compute FID scores with PyTorch.

Language:PythonApache-2.03270 15 85

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonAGPL-3.02604 460

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language:PythonMIT2438 34 260

DynamiCrafter

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language:PythonApache-2.02222 30 110

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonApache-2.01864 23 85

Cream

This is a collection of our NAS and Vision Transformer work.

Language:PythonMIT1628 36 156

server-bot-quick-start

Tutorial for Poe server bots

Language:Python1296 13 18

sd-dynamic-thresholding

Dynamic Thresholding (CFG Scale Fix) for Stable Diffusion (StableSwarmUI, ComfyUI, and Auto WebUI)

Language:PythonMIT1081 6 85

muse-maskgit-pytorch

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Language:PythonMIT839 34 36

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Language:Python730 11 22

DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Language:PythonApache-2.0719 17 41

res-adapter

Official implementation of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".

Language:PythonApache-2.0718 17 17

Wuerstchen

Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models

Language:Jupyter NotebookMIT517 23 22

piecewise-rectified-flow

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

Language:Jupyter NotebookBSD-3-Clause378 17 9

mvit

Code Release for MViTv2 on Image Recognition.

Language:PythonApache-2.0374 14 19

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language:PythonMIT319 12 14

ComfyUI-AutomaticCFG

If your image was a pizza and the CFG the temperature of your oven: this is a thermostat that ensures it is always cooked like you want. Also adds a 30% speed increase. For ComfyUI / StableDiffusion

Language:Python308 2 43

TCD

Official Repository of the paper "Trajectory Consistency Distillation"

Language:Python294 10 18

ctm

Language:Python207 18 7

comfy-todo

Language:PythonMIT64 1 2

learning-to-cache

Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching

Language:Python48 3 1