Performance of DQN agent on "small-linear" scenario
GeneveyC opened this issue · comments
Hi,
I have run multiple training runs with the DQN on the "small-linear" scenario with training_steps=5000000 and fully_obs=False. The network doesn't seem to find a compromise between the two targets in this scenario (and doesn't seem to converge in terms of episodic return). Have you ever obtained results showing that the DQN converges and achieves a positive episodic return on this scenario with the parameters mentioned above?
Best regards,
Hey,
I haven't tried it with fully_obs=False, but it's not surprising that the DQN implementation that ships with the library isn't able to solve it, since it's very basic. To solve the partially observable setting you need an agent that can handle partially observable environments, e.g. one that uses a recurrent network.
There are a lot of resources and existing implementations of such algorithms; the following are a couple of examples I found after a brief search, and I'm sure there are many more:
- https://sb3-contrib.readthedocs.io/en/master/modules/ppo_recurrent.html
- https://github.com/mynkpl1998/Recurrent-Deep-Q-Learning
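To illustrate the idea behind the second link, here is a minimal sketch (not from the library, and all names like `RecurrentQNetwork`, `obs_dim`, and `n_actions` are illustrative) of a DRQN-style Q-network in PyTorch: an LSTM aggregates the observation history, so the Q-values can depend on past observations rather than only the current, partial one.

```python
# Hypothetical sketch of a recurrent Q-network for partially observable
# environments. Dimensions are made up for illustration.
import torch
import torch.nn as nn


class RecurrentQNetwork(nn.Module):
    def __init__(self, obs_dim: int, n_actions: int, hidden: int = 64):
        super().__init__()
        self.encoder = nn.Linear(obs_dim, hidden)
        # batch_first=True: inputs are shaped (batch, time, features)
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)
        self.q_head = nn.Linear(hidden, n_actions)

    def forward(self, obs_seq, hidden_state=None):
        # obs_seq: (batch, time, obs_dim); hidden_state carries memory
        # across calls when acting step by step in the environment.
        x = torch.relu(self.encoder(obs_seq))
        x, hidden_state = self.lstm(x, hidden_state)
        return self.q_head(x), hidden_state  # Q-values per timestep


net = RecurrentQNetwork(obs_dim=8, n_actions=4)
q_values, h = net(torch.zeros(2, 5, 8))  # batch of 2 sequences of length 5
print(q_values.shape)  # (2, 5, 4): one Q-vector per timestep
```

At acting time you would feed one observation at a time and pass `hidden_state` along, resetting it at episode boundaries; the RecurrentPPO link above handles this bookkeeping for you.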
Best of luck
Thanks for your answer! I will try with recurrent networks.