Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

What's Conjugate gradients and line_search in TRPO?

Dreamlikec opened this issue · comments

Could you please give me a sense/reference what these two func meaing for?