CPSSD / cerberus

MapReduce framework

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

cerberus

CA4019 Project

Build Status


Requirements:

Development

  • Rust Nightly
  • Protobuf Compiler

Deployment

  • Docker
  • Docker Compose

Building the project

Build everything by running:

$ cargo build --all

Running AWS benchmarks

The following dependancies are required to run the benchmarking script:

  • python3-tk
  • matplotlib

Ubuntu dependancy installation:

apt-get install python3-pip python3-tk
pip3 install numpy matplotlib

The AWS benchmarking script is located at aws/benchmarking.py Run cargo build --release before running the script.


System Requirements

The project currently only works on Linux. macOS and other platforms are planned for the future.

OpenSSL is required - See http://github.com/sfackler/rust-openssl#building for installation instructions.


Setting up deployment on AWS

Requirements:

pip install boto3

Deployment steps:

  1. Add AWS credentials to ~/.aws/credentials

    A sample file is located in aws/credentials. Simply replace ACCESS_KEY_ID and SECRET_ACCESS_KEY with their respective values.

  2. Update parameters in aws.py script

    You will need to create and download an EC2 key pair from AWS and update the python script to use your .pem file containing the created key.

  3. Ensure that you push the latest version of the master/worker containers to DockerHub

    This can be done by running ./production-deployment.sh in the cerberus root directory.

  4. Configure Launch Templates

    You need to create two Launch Templates for EC2.

    Each template MUST have the same name as described below and must have the given tag associated with it. Any other settings can be changed as you see fit.

    Template Name Tag
    Master Key: "type", Value: "master"
    Worker Key: "type", Value: "worker"
  5. Deploy Instances

    To create 1 master and N workers and deploy our containers to them we can run the following command: python aws.py --create N --deploy

Useful commands:

  • To restart currently running instances we can run python aws.py --terminate --deploy
  • To kill all of the instances we can use python aws.py --kill

About

MapReduce framework

License:MIT License


Languages

Language:Rust 88.9%Language:Shell 3.4%Language:Python 3.1%Language:JavaScript 2.5%Language:CSS 1.1%Language:HTML 0.7%Language:Makefile 0.2%