ContentMine / getpapers

Get metadata, fulltexts or fulltext URLs of papers matching a search query

Failure when EuPMC doesn't correctly report the number of hits

tarrow opened this issue

At the time of writing, queries like:
http://www.ebi.ac.uk/europepmc/webservices/rest/search/query=FIRST_PDATE%3A2017-05-22%20OPEN_ACCESS%3Ay&resulttype=core&pageSize=1000&cursorMark=*

report m hits but actually return only m-n results.

This leaves getpapers trying to load the next page, but that page is empty, so it keeps trying forever.

We shouldn't do this. We should also start writing tests to cover these increasingly odd issues with the API, so that fixing one doesn't break normal functionality or an earlier fix.
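
As a rough illustration of the kind of guard and test I have in mind (not the actual getpapers code), here is a minimal sketch. The names `shouldFetchNext` and `collectAll`, the `{ results, nextCursorMark }` page shape and the stubbed `pages` object are all hypothetical, and the driver is synchronous only to keep the example short. The idea is to stop paging when a page comes back empty or the cursor stops advancing, rather than trusting the reported hit count alone.

```javascript
'use strict'

// Hypothetical helper: decide whether another page should be requested.
// `page` is assumed to look like { results: [...], nextCursorMark: '...' }.
function shouldFetchNext (page, cursorMark, collected, reportedHits) {
  // An empty page means the API has nothing more, whatever the hit count says.
  if (!page.results || page.results.length === 0) return false
  // A missing or unchanged cursor would make us request the same page again.
  if (!page.nextCursorMark || page.nextCursorMark === cursorMark) return false
  return collected < reportedHits
}

// Illustrative synchronous driver; `fetchPage` is injected so it can be stubbed.
// `maxPages` is an extra safety cap (an assumption, not an existing option).
function collectAll (fetchPage, reportedHits, maxPages = 1000) {
  const all = []
  let cursorMark = '*'
  for (let i = 0; i < maxPages; i++) {
    const page = fetchPage(cursorMark)
    all.push(...(page.results || []))
    if (!shouldFetchNext(page, cursorMark, all.length, reportedHits)) break
    cursorMark = page.nextCursorMark
  }
  return all
}

// Stubbed API that reports 5 hits but only ever returns 3 results.
const pages = {
  '*': { results: ['a', 'b'], nextCursorMark: 'c1' },
  'c1': { results: ['c'], nextCursorMark: 'c2' },
  'c2': { results: [], nextCursorMark: 'c2' }
}
const fetched = collectAll(cursor => pages[cursor], 5)
console.log(fetched.length)
```

With a stub like this, a regression test can assert that the loop terminates and returns the three results that actually exist, even though five were reported.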