Critic baseline for Policy Gradients
fedebotu opened this issue · comments
The critic baseline is not implemented yet. I will work on it alongside fixing the Rollout baseline problem.
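For reference, here is a minimal sketch of what a critic baseline for REINFORCE usually computes. It is not this library's API: the function name `reinforce_loss_with_critic` and its arguments are hypothetical, and it assumes the critic network has already produced one value estimate per instance.

```python
import torch
import torch.nn.functional as F


def reinforce_loss_with_critic(reward, log_likelihood, critic_value):
    """REINFORCE loss with a learned critic baseline (illustrative sketch).

    All names here are hypothetical, not part of the library.
    reward:         [batch] reward of each sampled solution (higher is better)
    log_likelihood: [batch] log-probability of each sampled solution
    critic_value:   [batch] critic's estimate of the reward per instance
    """
    # Detach the baseline so the policy term sends no gradient into the critic.
    advantage = reward - critic_value.detach()
    policy_loss = -(advantage * log_likelihood).mean()
    # The critic is trained by regression onto the observed reward.
    critic_loss = F.mse_loss(critic_value, reward)
    return policy_loss + critic_loss, policy_loss, critic_loss
```

Detaching the critic output in the advantage keeps the two objectives separate: the policy is pushed toward solutions that beat the critic's estimate, while the critic only learns from the regression term.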
A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)