zbMATHOpen / mscHarvester

Demo Metadata Harvester to extract MSCs

Home Page:https://oai.zbmath.org

Repository from Github https://github.comzbMATHOpen/mscHarvesterRepository from Github https://github.comzbMATHOpen/mscHarvester

MSC harvester

DOI python workflow

The MSC harvester obtains Open Mathematics Subject Classification (MSC) data from the zbMATH Open OAI-PMH-API using the Sickle client. The data is written in a CSV file. See sample.csv for an example.

The meaning of the columns is the following:

de: Eight-digit internal zbMATH identifier (not to confuse with the public zbl identifier in the form Zbl 0910.34036)

msc: MSC of the article

keywors: keywords of the article

title: title of the article

refs: MSCs occurring in the references

By default, only columns are exported with all values present.

Notes on the de-indentifier

One can navigate to the corresponding article in the zbMATH open web interface by prefixing the string https://zbmath.org/?an=0. For example for de=1209934 navigate to https://zbmath.org/?an=01209934. In rare cases, there are de values below 1000000; if so, as many zeros have to be prefixed so that the final number in the URL has eight digits. To retrieve the de from the zbMATH Open website, a little trick is required: One can obtain the DE by clicking on the BibTeX button below the article. A BibTeX entry with the key zbMATH01209934 will be downloaded for this example. The DE number is the number after the word zbMATH, i.e., 1209934.

Links and references

About

Demo Metadata Harvester to extract MSCs

https://oai.zbmath.org

License:GNU General Public License v3.0


Languages

Language:Python 99.2%Language:Dockerfile 0.8%