Rindhujatreesa / Big_Data_Processing_Projects

This repository contains coursework for Big Data, completed as part of the Master's in Data Science program at UMBC.

This repository consists of several projects that process big data using a range of big data processing tools.

The technologies and tools include:

  • Apache Spark
    • Spark SQL
    • Spark ML Library
    • Spark Streaming
  • Python
    • Pandas
    • Matplotlib
  • Hadoop
    • HDFS

Languages

  • Jupyter Notebook 82.0%
  • HTML 18.0%