IIGROUP / MM-CelebA-HQ-Dataset

[CVPR 2021] Multi-Modal-CelebA: A Large-Scale Text-Driven Face Generation and Understanding Dataset

Home Page:https://arxiv.org/abs/2012.03308

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to calculate the FID score for this dataset?

Corleone-Huang opened this issue · comments

The proposed dataset is split as 24k for training and 6k for testing. We use the testing split of our dataset for FID calculation. We randomly sampled one from ten captions to generate one image, meaning to produce 6k generated images. These generated images are used to calculate FID (and LPIPS) with the 6k real ones in the testing set. Since previous text-to-image generation methods can only produce images with 256*256 resolution, we use a 14-layer StyleGAN implementation for fair comparisons. I tend to use more images, eg 10 images for each caption, after recent discussions with other researchers. You can set a fair weight and keep it constant when generating batches of images for the validation.