hotpotqa / hotpot

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

MemoryError still presists with n_jobs=3

mominali12 opened this issue · comments

commented

memoryerrorBaseline

Try n_jobs=1?

commented

Still not working...
This time it gets stuck even earlier.
Capture

I am trying to replicate the baseline model and for that purpose I need to get the files generated after pre processing. Is there any way anyone could share a link to those files so that I could proceed with the remaining steps.
Thanks in advance

@mominali12 not sure I'm seeing it in the screenshot -- warnings can be safely ignored, and if you're missing the ujson package, try pip install ujson

commented

The system stops at Done 9 and won't progress any further. I believe that I should be getting a train_record.pkl file and this file is not created neither is there any progress in pre processing

@mominali12 That's strange. Could you try running the code with a machine with more RAM?

I am experiencing a similar issue. I am running the preprocessing step on my local machine which has 16GB of RAM, and nothing is printed after 9 out of 9.

My memory consumption keeps growing reaching 100% too. What would you advise?

image

Same problem for me on linux machine with 32GB of RAM. Stuck after job 9 and RAM usage is going up until crash.
image

Most of our work is done on server machines with 64G or more memory, so it's possible that even larger RAM will help resolve the issue. But also if people are stuck at example 9, it might also be worth investigating into that particular example and skip/truncate if necessary.