instadeepai / Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Home Page:https://id-mava.readthedocs.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

[BUG] Crash due to floating point instead of int

EdanToledo opened this issue · comments

Describe the bug

The timestep variable $t$ is sometimes a float and other times an int. It depends on the configs.

Super simple fix:

wherever you see :

t = steps_per_rollout * (eval_step + 1)

make it:

t = int(steps_per_rollout * (eval_step + 1))

Expected behavior

A crash not happening.