emonson / pandas-jupyterlab

Pandas and Jupyterlab workshop – Python for Data Science

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Python for Data Science: Pandas and JupyterLab

Workshop modules:

Pandas 101 – Spring 2022

  1. Data structures for spreadsheets in Python
  2. Accessing your tabular data · (inline exercises)
  3. Pandas built-in plotting
  4. Plotting exercise

Assignment Pandas 101 – Fall 2022

  1. Pandas 101 essential review
  2. Assignment Pandas 101

Pandas 102 – Fall 2022

  1. Intro to Tidy Data
  2. Intro to tidy data plotting with Seaborn · (inline exercises)
  3. Melt into tidy data
  4. Melt data exercise
  5. Merge (JOIN) datasets
  6. Merge (JOIN) exercise

Pandas 103 – Spring 2023

  1. Groupby Basics · (inline exercises)
  2. Exploring NC emissions with Groupby and Pivot Tables · (inline exercises)

Extras

Eric E Monson
Duke Libraries Center for Data and Visualization Sciences


Tidy Data Exercises are from the blog post by Jean-Nicholas Hould blog post and hist accompanying GitHub repository


Extra Resources

Minimally Sufficient Pandas · A really great post by Ted Petrou with an opinionated guide to which commands and styles of coding should be used with Pandas. There are so many ways of doing the same thing in Pandas and he helps clarify which to choose and why. Not a beginner article, but essential for intermediate to advanced users.

Jupyter Notebook Tutorial · From Datacamp, gives history, setup and basic usage, with great links to good notebooks and how to set up interactive dashboards

Github Markdown cheat sheet · Super-concise summary of (github-style) Markdown syntax

Pandas cheat sheet · Modeled after R cheat sheet. Very good summary of basic Pandas functions

Pandas Tutor · Visualizes simple Python Pandas code, showing transformation diagrams

JupyterLite · JupyterLab distribution that runs entirely in the web browser, backed by in-browser language kernels

Dramatic Tour of Python visualization tools · Entertaining and illuminating comparison of how various plots would be done with many major Python visualization packages (Matplotlib, Pandas, Seaborn, ggplot, Altair), including code repository

Pandas tutorials and documentation · Good documentation, examples and quick tutorials from the official Pandas site

Python for Data Science workshop · Python for Data Science workshop repository, given at IEEE VIS 2015

NBViewer – Jupyter viewer site · Place to both put your notebooks for sharing and find great content by others

About

Pandas and Jupyterlab workshop – Python for Data Science


Languages

Language:Jupyter Notebook 100.0%