OptMLGroup / VRP-RL

Reinforcement Learning for Solving the Vehicle Routing Problem

About Train

ychen-2000 opened this issue

Hello!
I appreciate your excellent work very much, so I tried running your code to look at your results more closely. However, when I trained on problem sizes 20, 50, and 100, the actor loss became NaN and the critic loss became 0 after a few steps. Why does that happen?