jwilber / USDA_Pomological_Watercolors

Open data for the USDA Pomological Watercolor Collection

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

USDA_Pomological_Watercolors

This repo stores data from the USDA Pomological Watercolor Collection:

The USDA Pomological Watercolor Collection documents fruit and nut varieties developed by growers or introduced by USDA plant explorers around the turn of the 20th century. Technically accurate paintings were used to create lithographs illustrating USDA bulletins, yearbooks, and other series distributed to growers and gardeners across America

Collage of watercolor paintings

The following data is provided in this repo:

data/images/*

Image directory containing a .jpg for each individual painting. There are 7584 images in total.

data/usda_pomological_watercolors.csv

Csv file containing metadata for each painting.

Column Description Data Type
painting_number Painting number as enumerated in collection. number
fruit Pomological name of the primary fruit depicted in the painting. text
authors Author(s) of the given watercolor painting. text
subjects Broader classification of fruit(s) depicted in the painting. (e.g. 'apple') text
year Year painting was published. number
thumbnail_image Link to thumbnail jpg of watercolor painting. text
image Link to jpg image of watercolor painting. text

scripts/get_pomological_data.py

Python script used to scrape the paintings.

Example use:

# call without arguments
$ python get_pomological_data.py

# call with arguments
$ python get_pomological_data.py --start=20 --end=400 --csv_name fruits.csv --verbose 1

Calling without arguments scrapes all of the watercolor painting data and stores the information to a local csv titled usda_pomological_watercolors.csv.

About

Open data for the USDA Pomological Watercolor Collection

License:MIT License


Languages

Language:Python 100.0%