IronMan's starred repositories

VAR

[GPT beats diffusionšŸ”„] [scaling laws in visual generationšŸ“ˆ] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3505Issues:107Issues:50

fastcomposer

FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Language:PythonLicense:MITStargazers:599Issues:21Issues:30

OMG

OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models

sdxs

Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"

Language:PythonLicense:Apache-2.0Stargazers:503Issues:24Issues:15

PAIR-Diffusion

[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor

Language:PythonLicense:MITStargazers:474Issues:19Issues:14

Magic-Me

Codes for ID-Specific Video Customized Diffusion

Language:PythonLicense:Apache-2.0Stargazers:430Issues:14Issues:13

MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

Language:PythonLicense:NOASSERTIONStargazers:380Issues:14Issues:5

Prompt-Diffusion

Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"

Language:PythonLicense:Apache-2.0Stargazers:351Issues:7Issues:12

T-GATE

T-GATE: Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

Language:PythonLicense:MITStargazers:215Issues:6Issues:10

Perturbed-Attention-Guidance

Official implementation of "Perturbed-Attention Guidance"

Language:Jupyter NotebookLicense:MITStargazers:189Issues:1Issues:0

zigma

A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model"

Language:PythonLicense:Apache-2.0Stargazers:156Issues:11Issues:9

LoRAMoE

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

moe_attention

Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"

Language:PythonLicense:MITStargazers:78Issues:7Issues:2

attention-interpolation-diffusion

Interpolation Between Text-to-Image Generation!

PlainMamba

PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition

Language:PythonLicense:Apache-2.0Stargazers:45Issues:3Issues:4

PD-Quant

[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric

Language:PythonLicense:Apache-2.0Stargazers:39Issues:4Issues:1

FLAP

[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:24Issues:1Issues:4

DiffusionNAG

Official PyTorch implementation of "DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models" (ICLR 2024)

SLEB

Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks

Language:PythonLicense:MITStargazers:21Issues:4Issues:0

PAE

[CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation

Language:PythonStargazers:21Issues:0Issues:0

MoCLE

MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)

Language:Jupyter NotebookStargazers:19Issues:0Issues:0

vid-TLDR

Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".

Language:PythonLicense:MITStargazers:18Issues:8Issues:1

APQ-DM

This is the official pytorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models.(CVPR24 Poster Highlight)

Language:PythonLicense:Apache-2.0Stargazers:16Issues:0Issues:0

optin-transformer-pruning

[ICLR 2024] The Need for Speed: Pruning Transformers with One Recipe

Language:PythonLicense:MITStargazers:12Issues:0Issues:0

DCP-GAN

[CVPR 2024] Diversity-aware Channel Pruning for StyleGAN Compression

RepresentationSurgery

Representation Surgery for Multi-Task Model Merging. ICML, 2024.

rg-lcd

Reward Guided Latent Consistency Distillation

Stargazers:8Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

SCott

The code of SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

Stargazers:1Issues:0Issues:0

FLAP

[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0