[Question] Control PPO training

Question

[Question] Control PPO training

mwalidcharrwi opened this issue 4 months ago · comments

Mohammad Walid Charrwi commented 4 months ago

❓ Question

Hi
I wanted to inquire is there a way to control the training of the MaskablePPO when it finishes the data set that is to be trained on (when ep_len_mean reaches 1 and ep_rew_mean reaches 100 in the logger) or at any specific point during the learning?

Checklist

I have checked that there is no similar issue in the repo
I have read the documentation
If code there is, it is minimal and working
If code there is, it is formatted using the markdown code blocks for both code and stack traces.