eapostol / hadoop-on-colab

An example demonstrating setting up a Hadoop / Spark environment on Google Colab

Hadoop on Colab

This repository shows how to install Apache Hadoop 3.2.3, configure it, and run it on Google Colaboratory. The notebook has the following sections:

  1. Installing Java 8
  2. Installing Secure Shell Server (SSHD)
  3. Installing Hadoop 3.2.3
  4. Running Hadoop in Standalone mode
  5. Running Hadoop in Pseudo-distributed mode
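As a rough illustration of what the pseudo-distributed step involves, the sketch below writes the two configuration files that a single-node HDFS setup usually needs. It is not taken from the notebook: the helper names `render_site_xml` and `write_pseudo_distributed_config` are hypothetical, and the property values (`fs.defaultFS` set to `hdfs://localhost:9000`, `dfs.replication` set to `1`) are assumed from the stock Apache single-node setup guide rather than confirmed by this repository.

```python
from pathlib import Path
from xml.sax.saxutils import escape

# Assumed values from the stock Hadoop single-node guide, not from the notebook.
CORE_SITE = {"fs.defaultFS": "hdfs://localhost:9000"}
HDFS_SITE = {"dfs.replication": 1}


def render_site_xml(properties):
    """Render a dict of Hadoop properties as the text of a *-site.xml file."""
    lines = ['<?xml version="1.0"?>', "<configuration>"]
    for name, value in properties.items():
        lines += [
            "  <property>",
            f"    <name>{escape(name)}</name>",
            f"    <value>{escape(str(value))}</value>",
            "  </property>",
        ]
    lines.append("</configuration>")
    return "\n".join(lines) + "\n"


def write_pseudo_distributed_config(conf_dir):
    """Write core-site.xml and hdfs-site.xml into conf_dir; return their paths."""
    conf = Path(conf_dir)
    conf.mkdir(parents=True, exist_ok=True)
    written = []
    for filename, props in (("core-site.xml", CORE_SITE),
                            ("hdfs-site.xml", HDFS_SITE)):
        path = conf / filename
        path.write_text(render_site_xml(props))
        written.append(path)
    return written
```

In a Colab session, `conf_dir` would typically be the `etc/hadoop` directory under the unpacked Hadoop installation. Standalone mode needs neither file, since it runs as a single Java process against the local filesystem.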

About

License: MIT License


Languages

Jupyter Notebook 98.7%, Python 1.3%