Error when running the Single Item Recommender System Notebook

Question

Error when running the Single Item Recommender System Notebook

davera-017 opened this issue 2 months ago · comments

Daniel Ávila Vera commented 2 months ago

Hi, I'm trying to run the notebook on how to use Pearl for recommender systems, but when I run the online_learning() function I keep getting the same error, which I copy below:

    [161](Pearl/pearl/pearl_agent.py:161) if isinstance(safe_action_space, DiscreteActionSpace):
--> [162](Pearl/pearl/pearl_agent.py:162)     self._latest_action = safe_action_space.actions_batch[int(action.item())]
    [163](Pearl/pearl/pearl_agent.py:163) else:
    [164](Pearl/pearl/pearl_agent.py:164)     self._latest_action = action

RuntimeError: a Tensor with 100 elements cannot be converted to Scalar

On the other hand, I'm having a hard time understanding how the environment is being built. Could someone please explain further how they are creating the RecEnv object?

rodrigodesalvobraz · Answer 1 · Wed Apr 03 2024 13:03:15 GMT+0800 (China Standard Time)

I am debugging this and will let you know as soon as possible.

rodrigodesalvobraz · Answer 2 · Thu Apr 04 2024 23:43:13 GMT+0800 (China Standard Time)

Update: we've identified the bug and are currently working on a solution. Should be out today or tomorrow.

rodrigodesalvobraz · Answer 3 · Mon Apr 08 2024 23:50:37 GMT+0800 (China Standard Time)

Update: started the fix but it had a wider range than initially expected. It might take a few days to get everything set correctly.

rodrigodesalvobraz · Answer 4 · Thu Apr 18 2024 08:35:01 GMT+0800 (China Standard Time)

Update: we've fixed the bug but we don't see the same learning behavior as previously observed, so now we are working on identifying the cause of that.

rodrigodesalvobraz · Answer 5 · Fri Apr 26 2024 02:26:26 GMT+0800 (China Standard Time)

Glad to let you know this has been finally fixed! It took a few iterations and the removal of a couple of issues.