weihua916 / powerful-gnns

How Powerful are Graph Neural Networks?


Hyperparameters to replicate reported performance

Tiiiger opened this issue · comments

Thank you for the great work!

I would like to ask for the correct hyperparameters for each of the datasets, in order to replicate the results reported in the paper. Thank you!

Hi @Tiiiger, have you been able to find the parameter settings for all the datasets? I ran MUTAG and found that the default settings can (roughly) recreate the result. Given that early stopping and similar selection effects are in play, the mean accuracy could be higher and match the paper.

batch_size = 128
num_layers = 2
lr = 0.01
num_mlp_layers = 1
hidden_dim = 64
# --epochs can be much smaller than the default 350 (~90 suffices)

mean acc = 0.8775
std acc = 0.0422
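The mean and std above are presumably aggregated over the 10 cross-validation folds. A minimal sketch of that aggregation, using hypothetical per-fold accuracies (illustrative values only, not from an actual run):

```python
import statistics

# Hypothetical per-fold test accuracies from a 10-fold run
# (illustrative values only, not from an actual run).
fold_accs = [0.85, 0.90, 0.84, 0.92, 0.88, 0.86, 0.91, 0.83, 0.89, 0.87]

mean_acc = statistics.mean(fold_accs)
std_acc = statistics.pstdev(fold_accs)  # population std, like NumPy's np.std default

print(f"mean acc = {mean_acc:.4f}")
print(f"std acc  = {std_acc:.4f}")
```

Note that with only 10 folds on a small dataset, a std of ~4 points easily explains run-to-run differences of a percent or two.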

I tried tuning some hyperparameters, but possibly due to the high variance, the result averaged over 10 folds is usually worse than the number reported in the paper. Also, switching between an MLP and a single linear layer does not seem to make a difference in my experiments.

I see. I am searching for COLLAB hyperparameters now. So far, varying the number of GNN layers around 5 doesn't help: mean acc = ~60% ± 2%. If I take only the highest test accuracy over the whole training process (assuming early stopping...), mean acc = ~70%. That is still a 10% gap from the paper, so please let me know if you find a setting that works for COLLAB.
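The "highest test accuracy over the training process" selection described above can be sketched as: record the per-epoch test accuracy for each fold, take each fold's maximum, then average. A toy version with hypothetical accuracy curves (illustrative values only):

```python
# per_fold_curves[i][e] = test accuracy of fold i at epoch e
# (hypothetical values, not from an actual run).
per_fold_curves = [
    [0.55, 0.62, 0.71, 0.68],
    [0.58, 0.65, 0.66, 0.73],
    [0.52, 0.60, 0.69, 0.67],
]

# Take the best epoch per fold, then average across folds.
best_per_fold = [max(curve) for curve in per_fold_curves]
mean_best = sum(best_per_fold) / len(best_per_fold)

print(f"mean of best-epoch accuracies = {mean_best:.4f}")
```

This is an optimistic proxy for early stopping, since the best epoch is picked on the test curve itself rather than on a held-out validation set.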

Hi, thanks for your interest. There are a number of hyperparameters to be tuned; please refer to our paper for the detailed procedure. For COLLAB, did you set --degree_as_tag?
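For context, the --degree_as_tag option matters on COLLAB because its nodes have no labels, so node degrees are used as discrete tags and one-hot encoded as input features. A minimal sketch of that idea on a toy adjacency list (the repo's actual implementation may differ in details):

```python
# Toy graph as an adjacency list: node -> list of neighbors.
adj = {0: [1, 2], 1: [0], 2: [0, 3], 3: [2]}

# Use each node's degree as its discrete "tag".
degrees = {v: len(nbrs) for v, nbrs in adj.items()}

# One-hot encode the tags over the set of distinct degrees in the graph.
distinct = sorted(set(degrees.values()))       # e.g. [1, 2] here
index = {d: i for i, d in enumerate(distinct)}

features = {
    v: [1.0 if i == index[d] else 0.0 for i in range(len(distinct))]
    for v, d in degrees.items()
}

print(features)  # node 0 has degree 2, nodes 1 and 3 have degree 1, ...
```

Without this, the model sees uninformative (constant) node features on unlabeled graphs, which plausibly accounts for the ~10% gap observed above.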

Besides, the variance is indeed rather high due to the small dataset sizes. Still, all the results should be reproducible with the exact procedure described in our paper.

@weihua916 Thanks! I indeed did not set --degree_as_tag for COLLAB, since I switched over from MUTAG and forgot to use degrees as the feature vectors. By the way, are the default 350 epochs necessary to achieve the results in the paper?

Great! I do not think 350 epochs are necessary for most of the datasets. :)

Cool! Thanks for making the code available btw :).