lilujunai

IronMan's starred repositories

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python · MIT · 3505 107 50

fastcomposer

FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Language: Python · MIT · 599 21 30

OMG

OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models

Language: Python · 530 12 11

sdxs

Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"

Language: Python · Apache-2.0 · 503 24 15

PAIR-Diffusion

[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor

Language: Python · MIT · 474 19 14

Magic-Me

Codes for ID-Specific Video Customized Diffusion

Language: Python · Apache-2.0 · 430 14 13

MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

Language: Python · NOASSERTION · 380 14 5

Prompt-Diffusion

Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"

Language: Python · Apache-2.0 · 351 7 12

T-GATE

T-GATE: Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

Language: Python · MIT · 215 6 10

Perturbed-Attention-Guidance

Official implementation of "Perturbed-Attention Guidance"

Language: Jupyter Notebook · MIT · 189 10

zigma

A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model"

Language: Python · Apache-2.0 · 156 11 9

LoRAMoE

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Language: Python · 110 2 7

moe_attention

Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"

Language: Python · MIT · 78 7 2

attention-interpolation-diffusion

Interpolation Between Text-to-Image Generation!

Language: Python · 74 3 1

PlainMamba

PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition

Language: Python · Apache-2.0 · 45 3 4

PD-Quant

[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric

Language: Python · Apache-2.0 · 39 4 1

FLAP

[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models

Language: Python · Apache-2.0 · 24 1 4

DiffusionNAG

Official PyTorch implementation of "DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models" (ICLR 2024)

Language: Python · 23 3 5

SLEB

Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks

Language: Python · MIT · 21 40

PAE

[CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation

Language: Python · 2100

MoCLE

MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)

Language: Jupyter Notebook · 1900

vid-TLDR

Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".

Language: Python · MIT · 18 8 1

APQ-DM

This is the official PyTorch implementation of the paper "Towards Accurate Post-training Quantization for Diffusion Models" (CVPR24 Poster Highlight).

Language: Python · Apache-2.0 · 1600

optin-transformer-pruning

[ICLR 2024] The Need for Speed: Pruning Transformers with One Recipe

Language: Python · MIT · 1200

DCP-GAN

[CVPR 2024] Diversity-aware Channel Pruning for StyleGAN Compression

Language: Python · 10 3 2

RepresentationSurgery

Representation Surgery for Multi-Task Model Merging. ICML, 2024.

Language: Python · 9 3 2

rg-lcd

Reward Guided Latent Consistency Distillation

800

BinaryDM

Language: Python · 200

SCott

The code of SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

100

FLAP

[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models

Language: Python · Apache-2.0 · 100