hornc / plotMARC

Visually characterise a bibliographic collection in terms of publication dates and available bibliographic identifier coverage

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

plotMARC

A command line tool to visually characterise a bibliographic collection in terms of publication dates and available bibliographic identifier coverage.

Basic usage

From a directory containing one or more binary MARC21 format for bibliographic data files (with extension .mrc) representing a bibliographic collection, run:

./plotMARC.py

The script will process the MARC files using pymarc, and produce a single <directoryname>.png image containing a 3-way Venn diagram displaying the number of records with the following bibliographic identifiers:

and a histogram showing the publication dates in the bib records.

Example output

Example plotMARC output plot

Summary for Test Collection:
Record counts for bibliographic identifiers present in this collection:
Total:  248632	100.00%
ISBN:   175593	 70.62%
LCCN:   130780	 52.60%
OCN :    82801	 33.30%
No Id:   42263	 17.00%

Requirements

Install these using pip:

 pip install -r requirements.txt

License

Copyright © 2022 Charles Horn.

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

About

Visually characterise a bibliographic collection in terms of publication dates and available bibliographic identifier coverage

License:GNU General Public License v3.0


Languages

Language:Python 100.0%