Some confusion about the code.
yangorwell opened this issue · comments
When I use the Powell damping to hold the PSD property of the metric matrix, I find the Bs update are written as "Bs.copy_(g_Sk.mul(-t))". Is it right?
Hi, thanks for taking a look at the code!
Powell damping in the code actually occurs in the modification of y in this part of the code:
PyTorch-LBFGS/functions/LBFGS.py
Lines 384 to 389 in e79e7d9
Bs is explicitly computed by noting that
Please let me know if this helps clarify things!
Thanks for your prompt response! This solve my problem!