What are conjugate gradients and line_search in TRPO?
Dreamlikec opened this issue · comments
Alan Feng commented
Could you please give me a sense of, or a reference for, what these two functions are for?
Jiachen Wang commented
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.