gridl / data_wrangling_exercises

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Practical Python: Data Wrangling and Data Quality

This repo contains draft coding exercises for the early-release version of the book Practical Python: Data Wrangling and Data Quality to be published by O'Reilly Media in 2021.

Before You Begin

Below you will find an overview of this repo's contents, as well as important tips and information on how to use these files. In general, all exercises are accessible as standalone .py files, and as Jupyter Notebooks. The notebooks can either be downloaded to your device and run locally, or opened and run in Google Colab (https://colab.research.google.com/). The draft text of Chapter 1 includes basic instructions on how to get started with some of these tools; this text will be updated/completed before final publication.

Working with data files

Because data sets can often be quite large, the data sets for these exercises are available for download here.

If you are working locally

Data sets should be downloaded/copied in the same folder as the Python file or notebook, unless otherwise indicated.

About

License:GNU Affero General Public License v3.0


Languages

Language:Jupyter Notebook 75.1%Language:Python 24.9%