click-here / Pandas-vs-Power-Query

Compare similar tasks in both Power Query and Python

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Pandas-vs-Power-Query

It has been my experience that people often work with data inefficiently in Excel. The purpose of this repo is to introduce users to some of the basic tools at their disposal using the Excel Add-in PowerQuery as well as to introduce them to similar methods with the Python data science library Pandas.

If you are new to Power Query I strongly recommend viewing Part 1 as subsequent Parts will assume basic knowledge explained in Part 1.

Part 1 - Reading and Grouping

Read multiple csv files and an Excel file. Group, filter, and aggregate data. Perform column level math and basic grouping/aggregation math.

Links

Full Data set https://collegescorecard.ed.gov/data/

More info regarding this data https://collegescorecard.ed.gov/data/documentation/

Part 2 - Finding the highest ranked x over n years.

Read a csv file from the web. Unpivot/data normalization, table buffering, and duplicate removal.

Part 3 - Time Card Nightmare! - Coming Soon

About

Compare similar tasks in both Power Query and Python


Languages

Language:Jupyter Notebook 62.8%Language:MATLAB 27.6%Language:Python 9.7%