samuelmaina / data-analysis-with-pyspark

Perfom data analysis using pyspark. Covers Spark functions(trigonometric functions, windows and lags), SQL views and queries and Parallelizing Spark operations.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data-Analysis-with-pyspark

Perform data analysis using pyspark. Skills learnt incude:

  1. Reading and loading data into data frames, use of RDD,
  2. Use of SQL views in Spark.
  3. Parallellizing data querying and processing.

About

Perfom data analysis using pyspark. Covers Spark functions(trigonometric functions, windows and lags), SQL views and queries and Parallelizing Spark operations.


Languages

Language:Python 100.0%