Get issues data
mihneadb opened this issue
To compute all the issue-based metrics, it would be easier to process the issue data itself[1][2] rather than live events.
We need a function like:
download_issues_data(path, owner, repo, open=True, since=None)
that saves every JSON object it receives to a file, one per line. Each object must also have a "type" property set to "issuedata".
[1] https://api.github.com/repos/gabrielfalcao/lettuce/issues
[2] http://developer.github.com/v3/issues/#list-issues-for-a-repository
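For illustration, here is a rough sketch of what such a function might look like. It is not the implementation that ended up in the river. Assumptions: open=True maps to state=open and open=False to state=closed, since is passed through as an ISO 8601 timestamp string, pages are walked with the page/per_page query parameters until an empty page comes back, and the extra fetch argument is a hypothetical hook (not part of the proposed signature) so the function can be tried without hitting the network.

```python
import json
import urllib.parse
import urllib.request
from pathlib import Path

API_ROOT = "https://api.github.com"


def _fetch_json(url):
    """Default fetcher: GET the URL and decode the JSON body."""
    request = urllib.request.Request(
        url, headers={"Accept": "application/vnd.github+json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


def download_issues_data(path, owner, repo, open=True, since=None,
                         fetch=_fetch_json):
    """Save a repository's issues to `path`, one JSON object per line.

    `fetch` is a hypothetical hook added for this sketch only.
    Returns the number of issues written.
    """
    # Assumption: the boolean `open` flag selects the issue state.
    params = {"state": "open" if open else "closed", "per_page": 100}
    if since is not None:
        # Assumption: `since` is already an ISO 8601 timestamp string.
        params["since"] = since

    written = 0
    # `open` is shadowed by the parameter, so use Path.open instead.
    with Path(path).open("w", encoding="utf-8") as out:
        page = 1
        while True:
            params["page"] = page
            url = "%s/repos/%s/%s/issues?%s" % (
                API_ROOT, owner, repo, urllib.parse.urlencode(params)
            )
            issues = fetch(url)
            if not issues:
                break  # an empty page means there is nothing left
            for issue in issues:
                issue["type"] = "issuedata"  # marker requested above
                out.write(json.dumps(issue) + "\n")
                written += 1
            page += 1
    return written
```

Something like download_issues_data("issues.json", "gabrielfalcao", "lettuce") would then dump the open issues for the example repo in [1]. Unauthenticated requests are rate limited, so a real version would probably want to send credentials as well.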
A bit curious - why is it easier to process issue data in batches?
We could process them one at a time, no problem. But for that you need to call the issues endpoint[1] first to get the IDs. However, besides the IDs, that endpoint already gives you all the details. :)
BTW, this is only helpful if we want "all time" issue data. Otherwise we get IssuesEvent data in the default chunk of events.
I think this should be moved to https://github.com/uberVU/elasticsearch-river-github
Fixed in the river.