An exploratory data analysis of crime in London from 2008 to present. I moved to London recently so I decided to do a project on data relating to my new home city.
I initially stumbled on this dataset on Kaggle. The London crime data is provided as a public dataset on Google BigQuery. Accessing the data on Big Query does require some SQL knowledge to write the query to get the data you want.
To access the data in R, I used the bigrquery package made by Hadley Wickham. Google also has great documentation for setting up your machine to get data from BigQuery that is long but extremely useful. Big Query also provides a UI for querying the dataset that was very useful for testing out my SQL code.
The easiest way to start working the data is to use a kernel on Kaggle. This will save time since you will not have to do the standard BigQuery setup and authentication process. However, this can only be done using Python. To get started with the dataset on in a kernel, you can fork this kernel.