This repository contains several projects that process Big Data using a range of Big Data processing tools.
The technologies and tools include:
- Apache Spark
- Spark SQL
- Spark ML Library
- Spark Streaming
- Python
- Pandas
- Matplotlib
- Hadoop
- HDFS
This repository contains the coursework for the Big Data course, part of the Master's in Data Science program at UMBC.