ToTheBeginning / PuLID

Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

question about fig2 in the paper.

dingangui opened this issue · comments

Hello, thanks for your incredible work!

In the 'Accurate ID Loss' section in the bottom right corner of Figure 2 of the paper, there are two generated images both denoted as 'predict x_0'. Are both of these images produced by the Lighting T2I? I guess they represent T2I w/ ID and T2I w/o ID, respectively. However, upon closer inspection, it appears that the IDs of both images are well-preserved, which contradicts my speculation. What are these two images' actual meanings and why do you connect them with a vertical line?
image

When calculating the ID loss, the two images involved in the calculation are both generated by the Lightning T2I training branch, and both images are generated under the T2I w/ ID setting. Additionally, only when calculating the Alignment loss, the contrastive pair is composed of T2I w/ ID and T2I w/o ID.