There are 9 repositories under the hadoop-cluster topic.
Apache Hadoop docker image
Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.
Jumbune, an open source Big Data APM & Data Quality Management platform for data clouds. An enterprise feature offering is available at http://jumbune.com. More details of the open source offering are at,
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Ansible playbook to deploy Cloudera Hadoop components to a cluster
Dockerizing an Apache Spark Standalone Cluster
A system designed to analyse big data collected from Wi-Fi probes
A fully functional Hadoop YARN cluster as a docker-compose deployment.
Apache Ignite Guide
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Hadoop 3.2 in single-node or cluster mode, with the gotty web terminal, Spark, Jupyter with PySpark, Hive, and other ecosystem components.
Workshop for the Professional Master's in Computer Science at UGR. Cloud Computing course.
Run Hadoop Cluster within Docker Containers
Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker
Docker image builds for Hadoop sandbox.
Collection of various clustering algorithms, including k-means, HAC, and DBSCAN. Also includes a Hadoop MapReduce implementation of the k-means algorithm
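For reference, the k-means algorithm that such repositories distribute over MapReduce can be sketched on a single machine as plain Lloyd's algorithm: the point-to-centroid assignment corresponds to the map phase and the centroid recomputation to the reduce phase. This is an illustrative sketch, not code from any of the listed repositories:

```python
import random

def dist2(a, b):
    """Squared Euclidean distance between two points (tuples)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, iters=20, seed=0):
    """Lloyd's algorithm: repeat assignment and centroid-update steps."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        # Assignment step (the "map" phase in a MapReduce version).
        clusters = {i: [] for i in range(k)}
        for p in points:
            nearest = min(range(k), key=lambda c: dist2(p, centroids[c]))
            clusters[nearest].append(p)
        # Update step (the "reduce" phase): mean of each non-empty cluster.
        for i, members in clusters.items():
            if members:
                centroids[i] = tuple(sum(x) / len(members)
                                     for x in zip(*members))
    return centroids
```

In the distributed variant, each mapper assigns its shard of points to the nearest centroid and each reducer averages the points of one cluster; the driver iterates until the centroids stabilise.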
A reference to a comprehensive guide on installing Hadoop on Windows
Self-documentation of learning distributed data storage, parallel processing, and Linux using Apache Hadoop, Apache Spark, and Raspbian OS. In this project, a 3-node cluster is set up on Raspberry Pi 4 boards, HDFS is installed, and Spark processing jobs are run via YARN.
In this task, we calculate the average temperature for each year from the given dataset stored in Hadoop HDFS, using a MapReduce job written for that purpose.
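The per-year average described above follows the standard MapReduce pattern: the mapper emits (year, temperature) pairs, the framework groups them by year, and the reducer averages each group. The minimal sketch below simulates that pipeline locally and assumes input records of the form `year,temperature` (the actual dataset format is not given in the description):

```python
from itertools import groupby

def mapper(lines):
    """Emit (year, temperature) pairs from 'year,temperature' records."""
    for line in lines:
        year, temp = line.strip().split(",")
        yield year, float(temp)

def reducer(pairs):
    """Average the temperatures for each year (pairs grouped by key)."""
    for year, group in groupby(sorted(pairs), key=lambda kv: kv[0]):
        temps = [t for _, t in group]
        yield year, sum(temps) / len(temps)

if __name__ == "__main__":
    # Local simulation of the map -> shuffle/sort -> reduce pipeline.
    records = ["1950,12.0", "1950,14.0", "1951,10.0"]
    for year, avg in reducer(mapper(records)):
        print(f"{year}\t{avg:.1f}")
```

On a real cluster the same mapper and reducer could be run as Hadoop Streaming scripts reading stdin and writing tab-separated key/value lines, with the shuffle/sort stage replacing the explicit `sorted` call.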
Hadoop cluster in Docker, created with docker-compose. Spin up a Hadoop cluster in under 5 minutes.
Analyses customer logs for big data components such as HDFS, Hive, HBase, YARN, MapReduce, Storm, Spark, Spark 2, Knox, Ambari Metrics, NiFi, Accumulo, Kafka, Flume, Oozie, Falcon, Atlas, and ZooKeeper.
This project shows how to perform spatio-temporal hot-spot analysis using Apache Spark.
Kubernetes operator for managing the lifecycle of Apache Hadoop YARN tasks on Kubernetes.
Design, build, and execute effective big data strategies with advanced Hadoop concepts
Movie rating prediction application
BigData Cluster with Docker
A repository of scripts that help create a distributed big data ecosystem on the Grid5000 platform.
This project creates a Hadoop and Spark cluster on Amazon AWS with Terraform
Containerized Hadoop cluster with Spark, Hive, Pig, HBase, and Zookeeper for scalable Big Data processing using Docker.
Deploy a big data platform on Kubernetes