7cb15 / Big-Data-Scripts

Scripts for a mix of data processing tasks using various big data tools.

Big-Data-Scripts

report_streaming.py - takes two command-line arguments: the path to an input CSV file and the path to an output CSV file. The script streams the input file row by row and aggregates unique customer counts and revenue by product. The output file is aggregated by product code.
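The kind of streaming aggregation described above can be sketched as follows. This is a minimal illustration, not the actual contents of report_streaming.py: the column names `customer_id`, `product_code`, and `revenue`, the output header, and the function name `aggregate_by_product` are all assumptions made for the example.

```python
import csv
import sys
from collections import defaultdict


def aggregate_by_product(in_path, out_path):
    """Stream in_path row by row and write per-product totals to out_path.

    Assumes (hypothetically) that the input has the columns
    customer_id, product_code, and revenue.
    """
    customers = defaultdict(set)   # product code -> set of customer ids seen
    revenue = defaultdict(float)   # product code -> running revenue total

    # DictReader yields one row at a time, so the file is never fully loaded.
    with open(in_path, newline="") as src:
        for row in csv.DictReader(src):
            code = row["product_code"]
            customers[code].add(row["customer_id"])
            revenue[code] += float(row["revenue"])

    with open(out_path, "w", newline="") as dst:
        writer = csv.writer(dst)
        writer.writerow(["product_code", "unique_customers", "revenue"])
        for code in sorted(customers):
            writer.writerow([code, len(customers[code]), round(revenue[code], 2)])


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage: python report_streaming.py input.csv output.csv
    aggregate_by_product(sys.argv[1], sys.argv[2])
```

Only the per-product sets and totals are held in memory, so memory use grows with the number of distinct products and customers rather than with the number of input rows.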

Languages

Jupyter Notebook 80.5%, Python 19.5%