About Crawler for Continuous Control

Question

About Crawler for Continuous Control

ZeratuuLL opened this issue 5 years ago · comments

I am trying to solve the Crawler environment in Continuous control task. I have read the Unity webpage and realized that there were two environments. One with static target and one with dynamic target. Which one is provided through the links? Thank you!

Samuel Pun · Answer 1 · Sat Apr 13 2019 02:30:49 GMT+0800 (China Standard Time)

I too wanted to know the answer please.

Lifeng Wei · Answer 2 · Sat Apr 13 2019 02:42:43 GMT+0800 (China Standard Time)

After some test it should be the fixed goal....

Samuel Pun · Answer 3 · Sat Apr 13 2019 02:44:50 GMT+0800 (China Standard Time)

I think so too. I was training my humble agent and it’s gaiining score and the visual shows that its always facing the same direction. I guess it is a fixed goal. Thanks man! sam

…

On 13 Apr 2019, at 2:42 AM, Lifeng Wei ***@***.***> wrote: After some test it should be the fixed goal.... — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#16 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AMvt5Sz9ja7TCTA79SP0BcSpfUGoH_cIks5vgNOpgaJpZM4bJKm->.

Lifeng Wei · Answer 4 · Sat Apr 13 2019 05:26:50 GMT+0800 (China Standard Time)

Np! What kind of results are you getting? The dimension info is different from Unity page so I am not sure if the benchmarks are still reliable.... I can get an average around 1800 but cannot get further.... Lifeng Wei. From my phone and apology for any typos

…

On Apr 12, 2019 at 11:44 AM, <Samuel Pun ***@***.***)> wrote: I think so too. I was training my humble agent and it’s gaiining score and the visual shows that its always facing the same direction. I guess it is a fixed goal. Thanks man! sam > On 13 Apr 2019, at 2:42 AM, Lifeng Wei ***@***.***> wrote: > > After some test it should be the fixed goal.... > > — > You are receiving this because you commented. > Reply to this email directly, view it on GitHub <#16 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AMvt5Sz9ja7TCTA79SP0BcSpfUGoH_cIks5vgNOpgaJpZM4bJKm->. > — You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub (#16 (comment)), or mute the thread (https://github.com/notifications/unsubscribe-auth/AVj1AKuNAhHfjsJ0xpvm-oJ4soxzgyvQks5vgNQmgaJpZM4bJKm-).

Samuel Pun · Answer 5 · Sat Apr 13 2019 14:06:56 GMT+0800 (China Standard Time)

envy you! I have just started and I am trying to use PPO to solve the problem. I am only able to get to around 100 scores (average of rewards of pass 100 episodes across agents once any of the agent reached ‘done’). May I ask how many layers of network did you use?

…

On 13 Apr 2019, at 5:26 AM, Lifeng Wei ***@***.***> wrote: Np! What kind of results are you getting? The dimension info is different from Unity page so I am not sure if the benchmarks are still reliable.... I can get an average around 1800 but cannot get further.... Lifeng Wei. From my phone and apology for any typos > > On Apr 12, 2019 at 11:44 AM, <Samuel Pun ***@***.***)> wrote: > > > I think so too. I was training my humble agent and it’s gaiining score and the visual shows that its always facing the same direction. I guess it is a fixed goal. Thanks man! > > sam > > On 13 Apr 2019, at 2:42 AM, Lifeng Wei ***@***.***> wrote: > > > > After some test it should be the fixed goal.... > > > > — > > You are receiving this because you commented. > > Reply to this email directly, view it on GitHub <#16 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AMvt5Sz9ja7TCTA79SP0BcSpfUGoH_cIks5vgNOpgaJpZM4bJKm->. > > > > > > — > You are receiving this because you modified the open/close state. > Reply to this email directly, view it on GitHub (#16 (comment)), or mute the thread (https://github.com/notifications/unsubscribe-auth/AVj1AKuNAhHfjsJ0xpvm-oJ4soxzgyvQks5vgNQmgaJpZM4bJKm-). > > > — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#16 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AMvt5UiNFbU-q7GvI0PgoYfGZsrKodrwks5vgPoggaJpZM4bJKm->.

Lifeng Wei · Answer 6 · Sat Apr 13 2019 15:33:58 GMT+0800 (China Standard Time)

You can check my repo here. https://github.com/ZeratuuLL/Reinforcement-Learning/tree/master/Continuous%20Control/Crawler

It's not easy to say the structure directly....