acbraith / crossfit_scraper

Crossfit Leaderboard Scraper and Analysis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Crossfit Web Scraper and Data Analysis

crossfit_api.py provides an API for scraping data from the 2017 Crossfit leaderboard. Make sure there's a /cache folder in the same directory as crossfit_scaper.py; big queries will be stored here (including the full leaderboards you get, manually delete the files if you want to refresh your data).

data_analysis.py produces some pretty box plots, and allows you to interact with them by left clicking to set 'your' benchmarks, or right clicking to remove them (one click is needed to fix the GUI, which is messed up before for some reason...). The predicted open placement will then be calculated using an XGB model trained on open leaderboard data. This code needs some pretty serious cleaning up though, I'll get around to that sometime...

Here are some nice box plots of the scraped data.

About

Crossfit Leaderboard Scraper and Analysis

License:MIT License


Languages

Language:Python 100.0%