timhutton / twitter-archive-parser

Python code to parse a Twitter archive and output in various ways

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Feature Request: Handling for Quote Retweets

beadsland opened this issue · comments

Presently QRTs are only provided as unshortened links back to Twitter, where RTs are reproduced (in 140-character part?) in the markdown output.

Here there are two issues.

For self-referential QRTs, presumably we'll be able to reconstruct those relationships by tweet ID found in the twitter URLs themselves at any time. That means any solution to the below can skip server queries on those references, as they're already identifiable in the archive somewhere.

For QRTs of others, however, that data can only be recovered while Twitter remains operational. So, yeah.

sigh

Thank you for all the work you're doing on this, BTW.

The PR #97 is merged into the branch downloadtweets now and provides that basic functionality (which is still evolving) to download quoted tweets. The downloaded tweet is not used in any way yet. See that PR for the overall idea on how to proceed.