There are 5 repositories under text-to-image-generation topic.
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
đź‘”IMAGDressingđź‘”: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual try-on.
Rich-Text-to-Image Generation
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Colab notebook for Stable Diffusion Hyper-SDXL.
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Faster generation with text-to-image diffusion models.
Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models"
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
[NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion
Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".
[ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation
[CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation
[IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks
This repository illustrates how to use the Hotpot.ai API. Our API provides Stable Diffusion, image generator, text-to-image generator, background removal, image upscaler, photo restoration, and picture colorization.
[ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
(ECCV 2024) Official implementation of Paper ''DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation''
[CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models
The official code implementation of "LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis".
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
[ICASSP 2024] The official repo for Harnessing the Power of Large Vision Language Models for Synthetic Image Detection
"Experience the magic of the 'Text to Image' project, where JavaScript transforms your text into captivating visuals using HTML5 and CSS3. Unlock the creative potential of digital storytelling and data visualization in a visually immersive experience."
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]