emilymae / imls-cdx

working files, data, notebooks for museum group at Archives Unleashed DC

These are working programs, data, and notebooks from the Museums Group at the [Archives Unleashed] workshop in Washington, DC, June 14-15.

The Internet Archive crawled museum websites documented by IMLS and made the CDX files for the crawl available. We then attempted to process the CDX files to learn something about the museum websites.
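
As a rough illustration of what processing a CDX file can look like, here is a minimal sketch that counts captures per host. It assumes the common 11-field, space-separated CDX layout (header ` CDX N b a m s k r M S V g`); the files from this crawl may use a different layout. The function names and sample lines are made up for the example and are not taken from the notebook.

```python
from collections import Counter
from urllib.parse import urlsplit

# Assumed 11-field layout (" CDX N b a m s k r M S V g").
# The CDX files from the crawl may differ; adjust FIELDS if so.
FIELDS = ["urlkey", "timestamp", "original", "mimetype", "status",
          "digest", "redirect", "metatags", "size", "offset", "filename"]


def parse_cdx_lines(lines):
    """Yield one dict per capture, skipping the header and malformed lines."""
    for line in lines:
        line = line.strip()
        if not line or line.startswith("CDX"):
            continue  # blank line or format header
        parts = line.split(" ")
        if len(parts) != len(FIELDS):
            continue  # does not match the assumed layout
        yield dict(zip(FIELDS, parts))


def captures_per_host(records):
    """Count captures by the host name of the original URL."""
    return Counter(urlsplit(r["original"]).netloc.lower() for r in records)


# Made-up sample lines, for illustration only.
sample = [
    " CDX N b a m s k r M S V g",
    "org,example)/ 20000101000000 http://example.org/ text/html 200 AAAA - - 1234 0 sample.warc.gz",
    "org,example)/about 20000101000005 http://example.org/about text/html 200 BBBB - - 2345 1234 sample.warc.gz",
    "org,example-museum)/ 20000101000010 http://www.example-museum.org/ text/html 200 CCCC - - 3456 3579 sample.warc.gz",
]

counts = captures_per_host(parse_cdx_lines(sample))
```

To run this against a real file, pass an open file handle instead of `sample`, since `parse_cdx_lines` accepts any iterable of lines.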

For more, check out the notebook in this repository.

Languages

Jupyter Notebook 96.1%, Python 3.9%