Make sure we don't write anything (of significant size) to /tmp

Question

dlwh opened this issue 2 years ago

Per JohnT, Hugging Face `datasets` is apparently still writing some files to /tmp, which causes problems on CodaLab (which has a small /tmp).

David Hall · Answer 1 · Mon Apr 25 2022 09:06:01 GMT+0800 (China Standard Time)

My best guess is that we're running some transform on a dataset without specifying cache files, but I'm not sure where.
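One possible stopgap, sketched here and not necessarily what the project ended up doing: point both Python's temp directory and the `datasets` cache at a scratch location with more room before anything imports `datasets`. The helper name `redirect_scratch` and the directory layout are hypothetical; `TMPDIR`, `HF_DATASETS_CACHE`, and `tempfile.tempdir` are the real knobs.

```python
import os
import tempfile
from pathlib import Path


def redirect_scratch(root: str) -> Path:
    """Send temp files and the HF datasets cache to `root` instead of /tmp.

    Hypothetical helper for illustration. Call it before importing
    `datasets`, since that library reads HF_DATASETS_CACHE at import time.
    """
    base = Path(root).expanduser().resolve()
    tmp_dir = base / "tmp"
    cache_dir = base / "hf_datasets"
    tmp_dir.mkdir(parents=True, exist_ok=True)
    cache_dir.mkdir(parents=True, exist_ok=True)

    # Environment variables are inherited by subprocesses and read by libraries.
    os.environ["TMPDIR"] = str(tmp_dir)
    os.environ["HF_DATASETS_CACHE"] = str(cache_dir)

    # tempfile caches its directory after first use, so override it directly.
    tempfile.tempdir = str(tmp_dir)
    return base
```

For the transform itself, `Dataset.map` accepts a `cache_file_name` argument, so a call along the lines of `ds.map(fn, cache_file_name=str(base / "hf_datasets" / "tokenized.arrow"))` would pin that output to a known path (the file name here is made up). This does not tell us which transform is the culprit, only where its output would land once found.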

David Hall · Answer 2 · Fri Jun 17 2022 06:29:13 GMT+0800 (China Standard Time)

OK, pretty sure this is done.