ssavvides / Nitro-Spark

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Confidential Computing Data Analytics with Apache Spark in AWS NitroEnclaves

Leveraging AWS Nitro for secure distributed data processing

Security is one of the major concerns in the development of dependable distributed systems. In this project, we leverage AWS NitroEnclaves to deploy an Apache Spark cluster securely.

Tested on:

  • Apache Spark version: Spark 3.2.0 (Oct 13 2021)
  • AWS Nitro EC2 instance: [TBD]

Getting started

Prerequisites

For an overview of Apache Spark and AWS NitroEnclaves see here.

For a guide on how to run Apache Spark in docker see here.

For a guide on how to run Apache Spark in docker in AWS EC2 see here.

Running the TPC-H benchmark on Apache Spark - NitroEnclaves

[TBD]

Links and References

About


Languages

Language:Scala 100.0%