epap011 / Spark-EMR-HiBench-Performance-Testing

Analyzing Spark Cluster Performance in Amazon EMR

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Analyzing Spark Cluster Performance in Amazon EMR

CS543: Big Data Project | UOC

stzagkarak@csd.uoc.gr papageorgiou@csd.uoc.gr apostolou@csd.uoc.gr

This git repository contains all available resources used in our project in CS543: Big Data .

Overview

Under the plot directory you can find all generated plots used on our report.

Under the run directory you can find instructions to run similar experiments on an Amazon EMR cluster.

Spring 2024

About

Analyzing Spark Cluster Performance in Amazon EMR


Languages

Language:Python 84.5%Language:Shell 15.5%