hadoop-ecosystem

There are 6 repositories under hadoop-ecosystem topic.

madd86 / awesome-system-design
A curated list of awesome System Design (A.K.A. Distributed Systems) resources.
distributed-systems hadoop-ecosystem interview message-broker microservices microservices-architecture nosql relational-database stream-processing
9937
dhkdn9192 / data_engineer_career
DE직무에 필요한 모든 것
data-engineer hadoop-ecosystem interview-questions
Language:Jupyter Notebook 192
ZuInnoTe / hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
office excel bigdata hadoop spark hadoopoffice analyze-office-documents poi hive flink hadoop-ecosystem
Language:Java 63
Cigna / ibis
IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.
hadoop hadoop-ecosystem hadoop-framework ibis oozie sqoop sqoop2 ingestion workflow-automation workflow-scheduler workflow cigna
Language:Python 50
Jayvardhan-Reddy / BigData-Ecosystem-Architecture
Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.
spark hadoop-ecosystem hadoop yarn-hadoop-cluster hadooparchitecture architecture-components hive sqoop hdfs kafka spark-streaming architecture bigdata bigdata-module big-data big-data-essentials hbase hbase-cluster zookeeper
Language:Shell 15
hyeonsangjeon / dataplatform
Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.
hadoop hadoop-cluster hadoop-docker hadoop-mapreduce hadoop-ecosystem hive pyspark-notebook zeppelin-notebook
Language:Shell 11
jodth07 / hadoop-installation
Instructions on setting up Hadoop, HDFS, java, sbt, kafka, scala, spark and flume on Ubuntu 18.04
hadoop hadoop-hdfs hadoop-ecosystem kafka installation scala sbt spark flume spark-installation hadoop-installation kafka-installation sbt-installation scala-installation
Language:Shell 8
pfisterer / apache-knox-docker
Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker
apache-knox dockerfile hadoop-cluster hadoop hadoop-ecosystem rest-api gateway-server
Language:Dockerfile 8
SarahAyaz / YouTube_Data_Analysis
Analysis of YouTube Data using Hadoop Mapreduce framework in Java.
hadoop hdfs mapreduce hadoop-mapreduce hadoop-filesystem hadoop-ecosystem mapreduce-java java youtube analysis partitioner hadoop-hdfs linux
Language:Java 3
satyajeetmaharana / floodprediction
The goal of this project is to identify the flood-prone areas with probabilities of flood in counties in a future date, using Spark MLLib.
spark hadoop spark-mllib sparksql hdfs machine-learning flood-predictions geospatial geojson-schema geojson-polygon geojson-data hadoop-framework hadoop-ecosystem big-data-analytics tableau
Language:Scala 3
alex-ber / docker-hive
EMR 5.25.0 cluster single node Hadoop docker image. With Amazon Linux, Hadoop 2.8.5 and Hive 2.3.5
hadoop-docker hive docker docker-compose dockerfile docker-image hadoop-hdfs hadoop-mapreduce hadoop-cluster hadoop-ecosystem hadoop-framework hadoop-filesystem yarn-hadoop-cluster yarn hiveserver2 dockerfiles docker-images hadoop emr emr-cluster
Language:Shell 2
meliodaseren / spark-sql-demo
SparkSQL Quick Start Tutorial
sparksql spark hadoop-ecosystem
Language:Scala 2
pfisterer / apache-knox-helm
Helm chart for Apache Knox
helm-charts knox apache-knox apache yaml-configuration hadoop-ecosystem hadoop
Language:Mustache 2
saitejavishalj / Hotspot-analysis-of-Geospatial-data
Built a Large Scale Distributed Data Processing system for Streaming Analytics using Hadoop Ecosystem (Apache Spark and HDFS), in Cloud for real-time spatial analytics.
hdfs sparksql apache-spark apache-hadoop hadoop-ecosystem data-analysis distributed-systems large-scale streaming-analytics
Language:Scala 2
AnkitaSinha98 / Customer360-Data-Analysis
Big Data is Stored and analyzed of various Customer using Hadoop and other tools like Hive, Zookeeper, Hbase and sqoop and all details of the customer is analyzed then result are given.This result is very useful for companies.
big-data-analytics dataset hadoop-ecosystem sqoop hbase zookeeper hive pig hadoop
1
ArwaEiad / TMDB-Project
This project focuses on analyzing movie data using Pyspark tailored for efficient data processing on Hadoop Distributed File System (HDFS)
hadoop-ecosystem hdfs pyspark
Language:Jupyter Notebook 1
f2e-awesome / HadoopEcosystem
Hadoop 生态体系(ecosystem)
hadoop hadoop-mapreduce hadoop-filesystem hadoop-ecosystem bigtable hdfs hbase hive flume mahout zookeeper ambari avro hcatalog sqoop pig
Language:JavaScript 1
mayankskb / Hadoop-Times
Practise programs in hadoop ecosystem for refrence
hadoop hive mapreduce hadoop-ecosystem
1
meliodaseren / avro-file-format
Avro File Format Quick Start Tutorial
hadoop-ecosystem
Language:Java 1
meliodaseren / spark-streaming-kafka-demo
Spark Streaming & Kafka Quick Start Tutorial
hadoop-ecosystem
Language:Scala 1
nirmalagra / MovieLensDataAnalysis
Mapreduce program developed in Java for analyzing movie dataset
hadoop-mapreduce hadoop-ecosystem hdfs big-data data-science
Language:Java 1
oykuyildirim / Flume-Service
Getting tweets using Flume service and analyzing tweets
flume big-data-analytics big-data tweets hadoop-hdfs hadoop-filesystem hadoop-ecosystem mapreduce tutorial
1
rakeshdey0018 / Weblog-Analysis-using-PIG
[BigData] one year weblog analysis using PIG
bigdata hadoop hadoop-ecosystem pig pig-latin tableau weblog-analysis piggybank
Language:PigLatin 1
simple-learning / Hadoop
Hadoop Projects
hadoop hadoop-mapreduce hadoop-streaming hadoop-ecosystem java-8 hadoop-testing hadoop-mrunit mrunit
Language:Java 1
PykaAlexandro / A-MapReduce-Vademecum-via-Hadoop
Some basic procedures for parallel computing in the Hadoop environment
mapreduce hadoop-ecosystem parallel-computing
Language:Python 0
rahulsakore7 / Unstructured-data-mart-sentimental-analysis
predictive-modeling datamart unstructured-data tableau visualization dataanalytics hadoop-ecosystem
Language:Jupyter Notebook 0
Rohit-Jain-2801 / HadoopInstallGuide
Apache Hadoop Components Installation Guide on Windows
apache hadoop hadoop-ecosystem installation installation-guide windows java hdfs hbase hive pig apache-hbase apache-hive apache-pig
0
tingjhenjiang / bigdata_docker_images
資料平行批次與串流處理以及搭建機器學習環境會用到的container
dockerfile hadoop-ecosystem jupyterhub spark
Language:Dockerfile 0
uncleislearning / learning-Hadoop
HDFS、MapReduce、Hive、Zookeeper原理以及实践操作
hadoop-filesystem hadoop-mapreduce hadoop hadoop-cluster hadoop-ecosystem
0
vineetdcunha / Hadoop_Ecosystem
Processing and transforming data via Hadoop Ecosystem
hadoop hadoop-mapreduce hadoop-hdfs hadoop-cluster hadoop-streaming hadoop-ecosystem hbase hbase-standalone hiveql hive pig pyspark mahout multinode python python-script
Language:Python 0
DiegoBulhoes / hadoop-ansible-single-node
Ambiente com o objetivo de praticar o uso das ferramentas Ansible e Hadoop usando uma única instância
hadoop hadoop-ecosystem ansible vagrant single-node
Language:Shell
m-r-tanha / Hadoop-Ecosystem
This repository is going to update based on my challenges in installing and using the Hadoop's tools Spark
hadoop-ecosystem
meliodaseren / hive-udf-demo
Hive
hive hadoop hadoop-ecosystem
Language:Java
meliodaseren / structure-streaming-demo
Structure Streaming Quick Start Tutorial
hadoop-ecosystem
Language:Scala
PrathameshNimkar / Big-Data-Analysis-using-the-Hadoop-Ecosystem
Learn and implement the Hadoop Ecosystem to drive Big Data Analytics.
hadoop-ecosystem tutorials big-data big-data-analytics cloudera cloudera-manager
reggert / cumulative
[Work in progress] Client library for simplified access to Apache Accumulo
scala accumulo bigdata hadoop-ecosystem spark
Language:Scala

hadoop-ecosystem

madd86 / awesome-system-design

dhkdn9192 / data_engineer_career

ZuInnoTe / hadoopoffice

Cigna / ibis

Jayvardhan-Reddy / BigData-Ecosystem-Architecture

hyeonsangjeon / dataplatform

jodth07 / hadoop-installation

pfisterer / apache-knox-docker

SarahAyaz / YouTube_Data_Analysis

satyajeetmaharana / floodprediction

alex-ber / docker-hive

meliodaseren / spark-sql-demo

pfisterer / apache-knox-helm

saitejavishalj / Hotspot-analysis-of-Geospatial-data

AnkitaSinha98 / Customer360-Data-Analysis

ArwaEiad / TMDB-Project

f2e-awesome / HadoopEcosystem

mayankskb / Hadoop-Times

meliodaseren / avro-file-format

meliodaseren / spark-streaming-kafka-demo

nirmalagra / MovieLensDataAnalysis

oykuyildirim / Flume-Service

rakeshdey0018 / Weblog-Analysis-using-PIG

simple-learning / Hadoop

PykaAlexandro / A-MapReduce-Vademecum-via-Hadoop

rahulsakore7 / Unstructured-data-mart-sentimental-analysis

Rohit-Jain-2801 / HadoopInstallGuide

tingjhenjiang / bigdata_docker_images

uncleislearning / learning-Hadoop

vineetdcunha / Hadoop_Ecosystem

DiegoBulhoes / hadoop-ansible-single-node

m-r-tanha / Hadoop-Ecosystem

meliodaseren / hive-udf-demo

meliodaseren / structure-streaming-demo

PrathameshNimkar / Big-Data-Analysis-using-the-Hadoop-Ecosystem

reggert / cumulative