Exp14: NVP_4 vs NVP

Question

Exp14: NVP_4 vs NVP

idhumphrey opened this issue 4 years ago · comments

Bella Humphrey commented 4 years ago

Run NVP_4 for 90 epochs with a lsdim of 500.

Meekail Zain commented 4 years ago

Running

Bella Humphrey · Answer 1 · Tue Feb 25 2020 10:54:24 GMT+0800 (China Standard Time)

Script has been written and ready to run.

Meekail Zain · Answer 2 · Thu Feb 27 2020 01:35:20 GMT+0800 (China Standard Time)

Experiment terminated after ~60 epochs. Loss failed to reduce past 140 consistently while NVP succeeds at a significantly reduced run time. Considerations for improvements on NVP_4 are welcome. It's worth noting that we may want to revisit this experiment once learning-rate/batch-size scheduling is up and running. I'm inclined to believe we may get different results upon either increasing the learning rate, or decreasing the batch size at the start and scheduling them. Another approach is to implement #39.

Meekail Zain · Answer 3 · Thu Feb 27 2020 08:16:58 GMT+0800 (China Standard Time)

Running again with increased learning rate (1e-4 vs 1e-5) just for comparison's sake

Meekail Zain · Answer 4 · Sat Feb 29 2020 09:37:29 GMT+0800 (China Standard Time)

Significant improvement has been observed in NVP_4 when utilizing learning rate of 1e-4. For future considerations we ought to test 1e-3, as well as learning rate scheduling strategies.