zhuxiangru's starred repositories

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

License:Apache-2.0Stargazers:1992Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1377Issues:0Issues:0

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:PythonLicense:MITStargazers:650Issues:0Issues:0

DiffusionDPO

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Language:PythonLicense:Apache-2.0Stargazers:232Issues:0Issues:0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2700Issues:0Issues:0

LaVi-Bridge

[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Language:PythonLicense:MITStargazers:307Issues:0Issues:0

EqBen

[ICCV'23 Oral] The introduction and toolkit for EqBen Benchmark

Language:PythonLicense:Apache-2.0Stargazers:125Issues:0Issues:0

Structure-CLIP

[Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations

Language:PythonStargazers:106Issues:0Issues:0

ELLA

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Language:PythonLicense:Apache-2.0Stargazers:1054Issues:0Issues:0

style-aligned

Official code for "Style Aligned Image Generation via Shared Attention"

Language:PythonLicense:Apache-2.0Stargazers:1197Issues:0Issues:0

SyncDreamer

[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

Language:PythonLicense:MITStargazers:880Issues:0Issues:0

sdxl_prompt_test

Testing prompts with SDXL

Language:Jupyter NotebookStargazers:13Issues:0Issues:0

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Language:Jupyter NotebookLicense:MITStargazers:1654Issues:0Issues:0

Caption-Anything

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything

Language:PythonLicense:BSD-3-ClauseStargazers:1662Issues:0Issues:0

SceneGraphGenZeroShotWithGSAM

Scene Graph Generate Zero Shot

Language:Jupyter NotebookStargazers:17Issues:0Issues:0

torch-LLM4SGG

Official PyTorch implementation Source code for LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at CVPR 2024

Language:PythonStargazers:77Issues:0Issues:0

docker-prompt-generator

Using a Model to generate prompts for Model applications. / 使用模型来生成作图咒语的偷懒工具,支持 MidJourney、Stable Diffusion 等。

Language:PythonLicense:MITStargazers:1160Issues:0Issues:0

docker-stable-diffusion-xl-turbo

Stable Diffusion XL Turbo 实时文生图、图生图

Language:PythonLicense:Apache-2.0Stargazers:12Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:6268Issues:0Issues:0

MVDream

Multi-view Diffusion for 3D Generation

Language:PythonLicense:MITStargazers:772Issues:0Issues:0

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonLicense:MITStargazers:1777Issues:0Issues:0

llmblueprint

[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"

Language:Jupyter NotebookStargazers:65Issues:0Issues:0

T2I-Adapter

T2I-Adapter

Language:PythonStargazers:3409Issues:0Issues:0

GraphDreamer

[CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.

Language:PythonLicense:MITStargazers:160Issues:0Issues:0

PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画

Language:PythonLicense:Apache-2.0Stargazers:882Issues:0Issues:0

Mega

Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".

Language:PythonLicense:MITStargazers:88Issues:0Issues:0

VISTA

VISTA: VIsual-Textual Knowledge Graph Representation Learning (Findings of EMNLP 2023)

Language:PythonStargazers:19Issues:0Issues:0

MNRE

Resource and Code for ICME 2021 paper "MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social Media Posts"

Stargazers:48Issues:0Issues:0

DSG

Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)

Language:Jupyter NotebookStargazers:74Issues:0Issues:0
Stargazers:133Issues:0Issues:0