Grid to pick hyperparameter for each model

Question

Grid to pick hyperparameter for each model

kudkudak opened this issue 7 years ago · comments

Stanislaw Jastrzebski commented 7 years ago

Proper fitting of models to old and new dev split. Right now hyperparameters are shared, which might be bias heavily results.

For each model tune learning rate and l2 regularization.

Stanislaw Jastrzebski · Answer 1 · Fri Jan 05 2018 21:17:30 GMT+0800 (China Standard Time)

For NAACL we will just fix hyperparams to the ones tuned for DNNs. Performance of factorized and prototypical is approximately stable wrt to l2_a and lr hyperparameter. I haven't checked others.

If we submit it somewhere else I would tune these hyperparams, but closing for now

Stanislaw Jastrzebski · Answer 2 · Sun Jan 07 2018 18:16:20 GMT+0800 (China Standard Time)

But it actually seems important for glove.. will try to run proper grid.