Evaluate on PACS

Question

Evaluate on PACS

GA-17a opened this issue 3 years ago · comments

GA-17 commented 3 years ago

Hi @KaiyangZhou ,
Thanks for sharing the code. I have the following questions:

How many times did you repeat on PACS?
Would you mind sharing the standard deviation of your PACS performances?
How did you select the model to report your results?

Thanks!

Kaiyang · Answer 1 · Tue Jun 29 2021 12:08:53 GMT+0800 (China Standard Time)

How many times did you repeat on PACS?

The provided script runs the code 5 times

In the paper, I think I have run more than 5 times to have a smaller std

Would you mind sharing the standard deviation of your PACS performances?

Please see the paper

How did you select the model to report your results?

simply use the last-step checkpoint

another way is to use the median performance among the last, e.g., 10 steps, or use their average

GA-17 · Answer 2 · Tue Jun 29 2021 20:39:59 GMT+0800 (China Standard Time)

Thanks for your reply.
For the last question, I think the common way is to use the checkpoint which has the highest accuracy on validation dataset. And it usually has a relatively high variance. What do you think about it?

Kaiyang · Answer 3 · Tue Jun 29 2021 23:38:57 GMT+0800 (China Standard Time)

due to domain shift, a higher performance on the same-domain val set doesn't necessarily give a better result on unseen domains (e.g. overfitting), but it's not uncommon to follow the standard cross-val way for selecting checkpoints

as I mentioned, you could also simply use the last, converged checkpoint which might lead to a smaller variance

I'd assume there is no further issue regarding the code so I'm closing it now