A jupyter notebook to scrape sail boat listings to a pandas dataframe for graphing and storing in CSV.
Scrapes the following sites:
- Get Jupyter, I suggest miniconda.
conda install
libraries, list of imports in the first notebook cell- Open das-boot.ipynb in Jupyter.
- Run the whole notebook
Try something like
summary(
listings(
'Swan',
max_year=1990,
min_loa=12
)
)
in a new cell.
Calls to listings
will generate .csv
files of the results, if you prefer continuing in a spreadsheet.
This is a web scraper in a notebook; not proper, maintained code. By the time anyone lands here, it's probably broken by changes the websites made.
Hopefully someone wanting the same data can get it faster by fixing this rather than starting from scratch.