I love the data on the website motorcyclespecs.co.za from my South African compatriots. It's my goto place to find motorcycle info, even if its a little clunky and sometimes wrong.
I also love spreadsheets. Google Sheets is the most useful cognitive tool I know of. I'd love to have a giant spreadsheet with specs of all the motorcycles I'm interested in.
This project scrapes data from motorcyclespecs.co.za into a nice spreadsheet format.
There's more than just motorcyclespecs.co.za. Here's a few useful websites. They don't always agree, which is just infuriating:
- https://www.motorcyclespecs.co.za/
- https://www.webbikeworld.com/
- https://www.ultimatespecs.com/motorcycles-specs
Python Code Layout and Structure:
- https://pycon.switowski.com/03-project-structure-2-parts/cookiecutter/
- https://github.com/cookiecutter/cookiecutter#a-pantry-full-of-cookiecutters
- https://github.com/audreyfeldroy/cookiecutter-pypackage
Need to use some sort of virtual environment. Dropping everything into global surely isnt the best idea.
- For each page, record the transformation applied from raw data to cleaned data, as a way to log all the errors I find to generate a report for the site authors.