Repositories under the text-to-image-generation topic:
LTX-Video Support for ComfyUI
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
[AAAI 2025] 👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.
Awesome Unified Multimodal Models
Rich-Text-to-Image Generation
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
The most advanced Nano Banana image generator and editor application. Your central hub for AI image generation and revisions. The intuitive UI features reference images, mask-based editing, version history, and more. Powered by the Gemini 2.5 Flash Image API.
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
A paper list on large multi-modality models (perception, generation, unification), parameter-efficient finetuning, vision-language pretraining, and conventional image-text matching, for preliminary insight.
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Colab notebook for Stable Diffusion Hyper-SDXL.
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
Official repository for "CFG++: Manifold-constrained classifier-free guidance for diffusion models" (ICLR 2025); a minimal sketch of plain classifier-free guidance, for context, follows this list.
Faster generation with text-to-image diffusion models.
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
🔥 [CVPR 2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)".
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
[NeurIPS 2024] Official implementation of "BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models".
Official PyTorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.
Scale-wise Distillation of Diffusion Models
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
[NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion
[IJCAI 2025 (Oral)] Official implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".
A Large-scale Dataset for training and evaluating models' ability on Dense Text Image Generation
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
[IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks
The largest multilingual image-text classification dataset. It contains fashion products.
[ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
[CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation
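For context on the guidance-related entries above (e.g. the CFG++ repository), here is a minimal sketch of vanilla classifier-free guidance, the baseline those works build on. The tensor shapes, function name, and `guidance_scale` default are illustrative assumptions, not taken from any repository listed here.

```python
import torch

def cfg_noise_prediction(eps_cond: torch.Tensor,
                         eps_uncond: torch.Tensor,
                         guidance_scale: float = 7.5) -> torch.Tensor:
    # Vanilla classifier-free guidance: extrapolate from the unconditional
    # noise prediction toward the text-conditional one.
    return eps_uncond + guidance_scale * (eps_cond - eps_uncond)

# Toy usage: random tensors stand in for a denoiser's conditional and
# unconditional outputs at one sampling step (batch 1, 4 latent channels, 64x64).
eps_c = torch.randn(1, 4, 64, 64)
eps_u = torch.randn(1, 4, 64, 64)
guided = cfg_noise_prediction(eps_c, eps_u, guidance_scale=7.5)
print(guided.shape)  # torch.Size([1, 4, 64, 64])
```

Higher guidance scales generally trade sample diversity for prompt adherence; methods such as CFG++ modify how this baseline extrapolation is applied during sampling.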