There are 5 repositories under the text-to-image-generation topic.
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.
Rich-Text-to-Image Generation
LTX-Video Support for ComfyUI
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
A paper list covering large multi-modality models (perception, generation, unification), parameter-efficient finetuning, vision-language pretraining, and conventional image-text matching, for preliminary insight.
Colab notebook for Stable Diffusion Hyper-SDXL.
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
Faster generation with text-to-image diffusion models.
Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models"
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)"
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
[NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion
Official implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".
[ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation
[CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
[IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks
[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation
This repository illustrates how to use the Hotpot.ai API. Our API provides Stable Diffusion, image generator, text-to-image generator, background removal, image upscaler, photo restoration, and picture colorization.
[ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Liquid: Language Models are Scalable Multi-modal Generators
Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
(ECCV 2024) Official implementation of the paper "DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation"
[CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models