zhuxiangru's starred repositories
LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
LaVi-Bridge
[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
Structure-CLIP
[Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations
style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
SyncDreamer
[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
sdxl_prompt_test
Testing prompts with SDXL
RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
SceneGraphGenZeroShotWithGSAM
Scene Graph Generate Zero Shot
torch-LLM4SGG
Official PyTorch implementation Source code for LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at CVPR 2024
docker-prompt-generator
Using a Model to generate prompts for Model applications. / 使用模型来生成作图咒语的偷懒工具,支持 MidJourney、Stable Diffusion 等。
docker-stable-diffusion-xl-turbo
Stable Diffusion XL Turbo 实时文生图、图生图
llmblueprint
[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"
T2I-Adapter
T2I-Adapter
GraphDreamer
[CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.