ShoufaChen / clone-anonymous4open

clone/download codes from https://anonymous.4open.science/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Crashes on missing files in anonymized repo

gwern opened this issue · comments

I can run successfully the provided example, but in trying the repo of interest to me, it crashes on

https://anonymous.4open.science/repository/5c1049ba-109f-4a12-9d74-9a4a5130ce97/IN256_imgs.npz

That file has no <code> element or otherwise, and is an error page:

The file IN256_imgs.npz is too big to be anonymized (beyond 1MB, Github limit)

Probably it should just skip such files?

diff clone.py clone-fixed.py                                                                                                                                                                    
61c61,64
<             f.write(source_code.get_text())
---
>             try:
>                 f.write(source_code.get_text())
>             except:
>                 print("Err: %s" % file_name)

Thanks very much for pointing out this problem.

Add a temporary workaround at 0ceb4ac