The dimension of localized conditionings
jmkim0309 opened this issue
@huanglianghua In Section 2.3, the paper states:
"For localized representations including sketches, segmentation masks, depthmaps, intensity images, and masked images, we project them into uniform-dimensional embeddings with the same spatial size as the noisy latent x_t using stacked convolutional layers."
What is the value of the uniform dimension here?
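
To make the question concrete, here is a minimal sketch of how I read that sentence. It is not the authors' implementation. It only tracks tensor shapes through a hypothetical stack of convolutions, using the standard convolution output-size formula. The input resolution (256), the latent size (64), the per-condition input channels, the layer widths, and `UNIFORM_DIM` are all placeholders I made up. `UNIFORM_DIM` is the value I am asking about.

```python
from dataclasses import dataclass


@dataclass
class ConvSpec:
    """Hyperparameters of one hypothetical 2D convolution layer."""
    out_channels: int
    kernel: int = 3
    stride: int = 1
    padding: int = 1


def conv_out_size(size: int, kernel: int, stride: int, padding: int) -> int:
    # Standard output-size formula for a convolution with dilation 1.
    return (size + 2 * padding - kernel) // stride + 1


def stacked_output_shape(in_shape, layers):
    """Return (channels, height, width) after applying the layers in order."""
    c, h, w = in_shape
    for layer in layers:
        c = layer.out_channels
        h = conv_out_size(h, layer.kernel, layer.stride, layer.padding)
        w = conv_out_size(w, layer.kernel, layer.stride, layer.padding)
    return (c, h, w)


# Placeholder values, NOT taken from the paper.
UNIFORM_DIM = 4   # the "uniform dimension" in question (assumed value)
LATENT_HW = 64    # assumed spatial size of the noisy latent x_t

# Assumed input shapes (channels, height, width) for each localized condition.
conditions = {
    "sketch": (1, 256, 256),
    "segmentation_mask": (1, 256, 256),
    "depthmap": (1, 256, 256),
    "intensity_image": (1, 256, 256),
    "masked_image": (3, 256, 256),
}

# Two stride-2 convolutions downsample 256 -> 128 -> 64; the last layer
# maps to UNIFORM_DIM channels without changing the spatial size.
layers = [
    ConvSpec(out_channels=32, stride=2),
    ConvSpec(out_channels=64, stride=2),
    ConvSpec(out_channels=UNIFORM_DIM),
]

embedding_shapes = {
    name: stacked_output_shape(shape, layers)
    for name, shape in conditions.items()
}

for name, shape in embedding_shapes.items():
    print(name, shape)
```

Under these assumptions every condition ends up with the same `(UNIFORM_DIM, LATENT_HW, LATENT_HW)` shape, so the embeddings could be combined with x_t. What I cannot tell from the paper is the actual channel count that plays the role of `UNIFORM_DIM`.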