ai4co / rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)

Home Page:https://rl4.co

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Critic baseline for Policy Gradients

fedebotu opened this issue · comments

At the moment, the critic baseline is still not implemented - will be working on this alongside solving the Rollout baseline problem