novelty in QM9 dataset is so small, why?

Question

novelty in QM9 dataset is so small, why?

FairyFali opened this issue a year ago · comments

Fali Wang commented a year ago

This is the running results for QM9. I have two questions:

the running time is 9 hours, not 1 hour metioned in the paper, why?
why the novelty is so small?

Clement Vignac · Answer 1 · Wed Jun 21 2023 23:27:05 GMT+0800 (China Standard Time)

Hello, as explained in the table of the paper, " Training time is the time needed to reach 99% validity. "

Novelty is so small because QM9 is an exhaustive enumeration of molecules that satisfy some constraints. Cf section 5.4 of https://arxiv.org/pdf/2110.02096.pdf for a discussion of why high novelty is not a good thing for QM9