The aim of the workshop is to put into action the Pandas Python library to deal with real data. It is intended for beginners and will be based on a notebook comprising 2 datasets to perform in-depth analysis. Basic to advanced functionalities of Pandas and Matplotlib will be experimented during the session.
The workshop is based on 2 datasets:
- Names from US Social Security from 1880 to 2016 : https://www.ssa.gov/oact/babynames/limits.html (National data)
- Names from French Insee from 1900 to 2015 : https://www.insee.fr/fr/statistiques/2540004 (Fichier France)
When the notebook is saved in a folder, let's say "PyParis/", run in a shell: jupyter notebook --notebook-dir=PyParis/
The notebook has been developped using Python 3.5. It should work with Python 2.7 and Python 3.3+.
Python modules needed for executing the notebook:
- numpy: 1.11.3
- pandas: 0.18.1
- matplotlib: 1.5.3
- ipython: 5.1.0
- jupyter: 1.0.0
We recommend to use the Anaconda distribution of Python which includes all necessary modules.
© 2017 Yotta Conseil - Francis Wolinski