timhutton / twitter-archive-parser

Python code to parse a Twitter archive and output in various ways

Feature request: check with Internet Archive for media that can't be downloaded

cooljeanius opened this issue

Some media simply isn't on Twitter anymore, because it was deleted or otherwise removed, so retrying the download with a longer sleep time won't help at all. Instead, I think a better solution might be to check whether the Internet Archive already has a copy of the missing media. All that ought to be necessary is to prepend http://web.archive.org/ to the media URL; if the archive has a copy, the request should find it.
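To make the idea concrete, here is a rough sketch of what such a fallback could look like. It is not code from twitter-archive-parser: every function name below is hypothetical, and the media URL in the usage note is made up. `wayback_prefix_url` does exactly what is proposed above. As an alternative (my assumption, not part of the request), `find_archived_copy` asks the Wayback Machine availability endpoint at `https://archive.org/wayback/available` whether a snapshot exists before anything is downloaded.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional

# Prefix suggested in this issue; the Wayback Machine is expected to
# redirect such a request to a capture if it has one.
WAYBACK_PREFIX = "http://web.archive.org/"

# Wayback Machine availability endpoint (an alternative to the plain prefix).
WAYBACK_AVAILABILITY_API = "https://archive.org/wayback/available"


def wayback_prefix_url(media_url: str) -> str:
    """Build the fallback URL by prepending the Wayback prefix (hypothetical helper)."""
    return WAYBACK_PREFIX + media_url


def availability_query_url(media_url: str) -> str:
    """Build the availability query URL for a given media URL (hypothetical helper)."""
    return WAYBACK_AVAILABILITY_API + "?" + urllib.parse.urlencode({"url": media_url})


def closest_snapshot_url(api_response: dict) -> Optional[str]:
    """Extract the closest snapshot URL from a decoded availability response.

    Returns None when the response lists no available snapshot.
    """
    closest = api_response.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


def find_archived_copy(media_url: str, timeout: float = 10.0) -> Optional[str]:
    """Query the availability endpoint over the network (hypothetical helper).

    Returns the snapshot URL, or None if no snapshot is reported.
    """
    query = availability_query_url(media_url)
    with urllib.request.urlopen(query, timeout=timeout) as response:
        return closest_snapshot_url(json.load(response))
```

A downloader could call `find_archived_copy("https://pbs.twimg.com/media/example.jpg")` only after the normal Twitter download has failed, and fetch the returned URL if it is not `None`. Checking availability first avoids downloading an HTML "not archived" page in place of the image, which is a risk with the plain prefix approach.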