jpthu17 / DiffusionRet

[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Question about how to get the ground truth x0?

chuanshen-chen opened this issue · comments

Hello, I would like to ask how x0 (ground truth) in formula 8 and formula 9 in the article is obtained?

maybe the distribution x_start is the ground truth [1,0,0,0,0,0,0],which means only the video which matters the query is one,others are all zeros?

maybe the distribution x_start is the ground truth [1,0,0,0,0,0,0],which means only the video which matters the query is one,others are all zeros?

You're right. That's exactly what we do.