rshad/Automate-Spark-Hadoop-HDFS-Configuration-Ansible

ansible-playbook ansible spark hadoop hdfs automated-deployment yarn

Automating SPARK-YARN-HADOOP-HDFS Cluster Deployment using Ansible

Summary

This repository contains part of the final graduation project for my BSc degree in Computer Science. The goal of the project was to implement a Cloud Computing platform offering simple, already-implemented services for data storage and processing. This part covers automating the deployment of a "SPARK-YARN-HADOOP-HDFS" cluster using Ansible.

Collaborators

This project was developed by me, Rshad Zhran, under the supervision of my tutors, D. José Manuel Benitez Sanchez and D. Manuel J. Parra Royón.

Contact Details

Rshad Zhran: rashzk95@gmail.com
José Manuel Benitez: j.m.benitez@decsai.ugr.es
Manuel J. Parra Royón: manuelparra@decsai.ugr.es

1. Introduction

First of all, to start implementing a Cloud Computing platform we need an architecture orchestrator; in this case I use OpenStack.

About

Using Ansible playbooks to automate the installation and configuration of a Spark, Hadoop and HDFS cluster.

