bad performance not like the paper

Question

bad performance not like the paper

nguyenviettuan96 opened this issue 2 years ago · comments

The paper I read represents the result is very good convergence, but when I train using your code (not change anything), the model not converage and so result's chart is up and down, chaotic. Could you explaine that, please?

Daochen Zha · Answer 1 · Sat Nov 05 2022 03:10:57 GMT+0800 (China Standard Time)

@nguyenviettuan96 Thanks for asking. The environment and RL implementation have been updated with multiple iterations. So the results are not comparable. But You should be able to see similar trends.

chanyukyu · Answer 2 · Mon Feb 20 2023 06:29:27 GMT+0800 (China Standard Time)

I've tried changing the parameters and let it run for more iterations , but it doesn't converge at all and I couldn't see a similar trend. what would be the possible issues?