pytorch / examples

I don't know why this code use drop_last=True on validation mode.
Also, this code only uses batch_size dividable datas for calculating average top1,5 errors.
And then re-generate auxiliary validation data&dataloader for printing remaining logs.

Can anyone tell me why this code uses this method?

Hi @DY112 , there are many examples in this repo. Could you share a code pointer of the example you are talking about?

Oh sorry.
My question was about imagenet training code.

examples/imagenet/main.py

Line 256 in f82f562

    
           val_sampler = torch.utils.data.distributed.DistributedSampler(val_dataset, shuffle=False, drop_last=True)

@DY112 Good question! Because DistributedSampler would pad the last uncompleted batch to become a full batch by default, which leads to wrong validation metrics. To get the correct metrics, we can either 1) use single GPU to run validation(it's slow though) or 2) use DistributedSampler for all batches until the last batch and use auxiliary dataset + regular Dataloader for the last batch.

question about drop_last=True on validation mode