hadoop-framework

There are 27 repositories under the hadoop-framework topic.

linkedin / dynamometer
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
hadoop hadoop-filesystem hdfs hdfs-dfs testing testing-tools scale scale-up performance-testing performance-test performance-analysis performance-metrics hadoop-framework hadoop-hdfs
Language:Java 129
Cigna / ibis
IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.
hadoop hadoop-ecosystem hadoop-framework ibis oozie sqoop sqoop2 ingestion workflow-automation workflow-scheduler workflow cigna
Language:Python 51
spoddutur / cloud-based-sql-engine-using-spark
Cloud-based SQL engine using Spark, where data is accessible as a JDBC/ODBC data source via Spark ThriftServer.
apache-spark thrift-server spark-thrift-server sql-engine sparksql jdbc beeline hadoop-framework
Language:Java 32
SAKET-SK / Semester6-SPPU-Data-Analysis-Lab
I installed Hadoop on a virtual machine, and all assignments are performed on Ubuntu. Refer to this repo to complete the Hadoop assignments. A stable internet connection is recommended while working through them.
hadoop hadoop-mapreduce hadoop-bigdata-assignments hadoop-framework hadoop-assignments r tableau data-visualization charts plot
Language:Rebol 14
James-QiuHaoran / distributed-computing-platform-mapreduce
This repository contains a simple Hadoop-like (MapReduce) distributed computing platform implemented in Java. It extends a UIUC course project that was awarded best Java implementation, and it is open-sourced for reference.
mapreduce hadoop hadoop-mapreduce distributed-computing distributed-file-system membership-management failure-detection distributed-systems cloud-computing cloud-computing-applications hadoop-framework
Language:Java 12
waltherg / distributable_docker_sql_on_hadoop
Toy Hadoop cluster combining various SQL-on-Hadoop variants
hadoop hadoop-mapreduce hadoop-filesystem hadoop-cluster hadoop-docker hadoop-hdfs hadoop-framework hive hue spark sparksql hbase hbase-client yarn yarn-hadoop-cluster zookeeper zookeeper-deployment tez impala presto
Language:Shell 12
giovannigarifo / bigdata
Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark
big-data bigdata hadoop hadoop-mapreduce hadoop-framework spark spark-streaming sparkjava spark-sql sparksql spark-mllib spark-ml polito mapreduce politecnico-di-torino
Language:Java 9
suselong / bigData-30-Days
Big data study notes for complete beginners
bigdata hadoop hadoop-mapreduce hadoop-framework
Language:Java 8
Shwetabhdixit / Hadoop-2.7.3-Installation-Guide-for_windows
A storage reference to a comprehensive guide on installing Hadoop on Windows
hadoop-mapreduce hadoop-cluster hadoop-framework
Language:Shell 6
MohammedRayyanAli / Twitter-Data-Analysis-using-Hadoop-Framework
Twitter data analysis using Hadoop (HDFS), Flume, MapReduce, and Hive. Sentiment analysis is also performed on tweets related to the Indian election, using the AFINN dictionary.
hadoop-mapreduce hadoop-hdfs hadoop-framework hive-table flume twitter-sentiment-analysis twitter4j twitter-streaming-api twitter-oauth hiveql
Language:Java 3
satyajeetmaharana / floodprediction
The goal of this project is to identify flood-prone areas and estimate the probability of flooding in counties on a future date, using Spark MLlib.
spark hadoop spark-mllib sparksql hdfs machine-learning flood-predictions geospatial geojson-schema geojson-polygon geojson-data hadoop-framework hadoop-ecosystem big-data-analytics tableau
Language:Scala 3
Akilankm / Hadoop-Installation
This repo contains the steps for setting up a single-node Hadoop 3.2.1 cluster on Ubuntu 20.04 LTS.
hadoop hadoop-installation big-data big-data-hadoop hadoop3 data-science hadoop-framework hadoop-hdfs
2
alex-ber / docker-hive
EMR 5.25.0 single-node Hadoop cluster Docker image, with Amazon Linux, Hadoop 2.8.5, and Hive 2.3.5.
hadoop-docker hive docker docker-compose dockerfile docker-image hadoop-hdfs hadoop-mapreduce hadoop-cluster hadoop-ecosystem hadoop-framework hadoop-filesystem yarn-hadoop-cluster yarn hiveserver2 dockerfiles docker-images hadoop emr emr-cluster
Language:Shell 2
Rohit9314 / my-hadoop
Set up a Hadoop cluster manually and automatically
hadoop-docker hadoop-cluster hadoop-mapreduce hadoop-filesystem hadoop-framework hadoop-distributions hdfs-docker docker-container dockerfiles docker-implemented-hadoop automated-hadoop-implementation complete-hadoop-setup hadoop-using-devops
Language:Python 2
shienlong / parallel
WQD7008 Parallel and Distributed Computing Project
raspberry-pi hadoop-framework hipi node-red hdfs mapreduce parallel
2
vinitS101 / knn
This project focuses on creating a KNN MapReduce program for the Hadoop framework.
knn mapreduce scala java spark hadoop hadoop-framework
Language:Java 2
BigWheel92 / PageRank-Algorithm-using-MapReduce
PageRank algorithm written in Java using the MapReduce framework
java mapreduce hadoop hadoop-mapreduce hadoop-mapreduce-framework hadoop-framework
Language:Java 1
rahul-dhavalikar / Product-Recommendation-System
Product recommendation system on Amazon product dataset using Apache Spark framework
recommender-system amazon-data amazon spark hadoop-framework python jupyter-notebook apache-spark spark-framework spark-ml
Language:Jupyter Notebook 1
SakhriHoussem / MapReduce-Python
MapReduce Python Example
python python3 python-3 hadoop hadoop-mapreduce hadoop-streaming hadoop-hdfs hadoop-mini-clusters hadoop-cluster hadoop-framework hadoop-distributions hdfs hdfs-dfs exemple fichier anagrams anagram anagram-solver mapreduce mapreduce-server
Language:Python 1
akshaytambe / Big-Data-Scripts
Python Scripts for working with Big Data Files
big-data-analytics big-data hadoop-mapreduce hadoop-cluster hadoop-hdfs hadoop-framework
Language:Python 0
jayantakumar / Hadoop-In-Action-Introductory-Patent-Dataset-Analysis
A basic introductory example of using Hadoop's MapReduce libraries to load and analyse large datasets, in this case a US patent dataset sourced from https://www.nber.org/research/data/us-patents
hadoop hadoop-framework hadoop-mapreduce mapreduce-java
Language:Java 0
imdeepanshugpt / Hadoop
Hadoop-Cluster
hadoop hadoop-mapreduce hadoop-filesystem hadoop-cluster hadoop-docker hadoop-streaming hadoop-framework docker docker-compose docker-container docker-image
Language:Python
JayLohokare / distributed-GIS-framework
Distributed Hadoop and Spark based framework for in-memory GIS queries
inmemory-db spatial-analysis spark-framework hadoop-framework distributed-computing
Language:C++
jd268 / spark-examples
Basic spark examples to scratch some ground
java spark hadoop-framework
parasgulati8 / Hadoop-Cluster
MapReduce in Cluster.
mapreduce hadoop hadoop-mapreduce hadoop-cluster hadoop-hdfs hadoop-filesystem hadoop-framework wordcount master-slave pseudo-distributed-hadoop single-node-cluster standalone-mode-operation hadoop-clusters
Language:Java
shubhambhardwaj007 / Ansible-Hadoop-Hive-Role
An Ansible role to configure and set up a Hive data warehouse on a client node.
hadoop hive dataanalysis dataanalytics datawarehouse datawarehousing hadoop-cluster hadoop-framework ansible ansible-role ansible-galaxy ansible-playbooks
tahamohkawy / titanicDataAnalysis_Hadoop
Titanic data analysis with Hadoop
hadoop hadoop-mapreduce hadoop-framework hadoop-hdfs
Language:Java
