EleutherAI / the-pile

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Paperity

leogao2 opened this issue · comments

Multidisciplinary open-access research aggregator.

Size: 1.5 million papers (rough estimate: about 70GiB, give or take)

Overlap with other sets: There is probably some overlap with arXiv and PubMedCentral, but Paperity seems to have a lot of papers from subjects not currently in any of our sets.

Quote from website:

We can provide dumps of Paperity data, full and incremental, for use in external services and applications. For more information please contact us at: services (at) paperity.org.

TODO: contact paperity

https://paperity.org/