rshad / Automate-Spark-Hadoop-HDFS-Configuration-Ansible

Using Ansible playbooks to automate the installation and the configuration of a cluster of Spark,Hadoop and HDFS

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Automating SPARK-YARN-HADOOP-HDFS Cluster Deployment using Ansible

Summary

This repository contains a part of my Computer Science BSc Degree's Final Graduation Project. In this project I wanted to implement a Cloud Computing platform with simple already-implmemented services for data storage and processing. This part coincide with the automation of the deployment process of a cluster formed by " SPAR-YARN-HADOOP-HDFS " using Ansible.

Collaborators

This project was devloped by me " Rshad Zhran " under the supervision of my tutors, D. José Manuel Benitez Sanchez and D. Manuel J. Parra Royón.

Contact Details

Table of Contents

  1. Introduction
  2. Cluster Architecture and Components
  3. Components Distribution
  4. Ansible Scripting
  5. Furhter Studies
  6. References

1. Introduction

First of all, in order to start implementing a Cloud Computing Platfrom we need an Architecture Orchestrator, in this case I use OpenStack.

About

Using Ansible playbooks to automate the installation and the configuration of a cluster of Spark,Hadoop and HDFS


Languages

Language:Python 78.4%Language:Shell 21.6%