cloudera-hadoop

There are 36 repositories under the cloudera-hadoop topic.

sergevs / ansible-cloudera-hadoop
Ansible playbook to deploy Cloudera Hadoop components to a cluster
cloudera-hadoop hadoop-cluster impala oozie hbase kafka
Language:Shell 51
tilakpatidar / cdh5
Docker image for Cloudera Hadoop components (CDH5)
docker docker-compose cloudera-hadoop hive hdfs postgresql mysql zookeeper
Language:Shell 9
smartlin5228 / CCA175
cloudera-hadoop cloudera spark sparksql scala
Language:Java 7
Ranjandas / Dirty-CDH-Docker
A quick and dirty CDH cluster skeleton using Docker for Testing
cdh docker cloudera cloudera-hadoop
Language:Shell 6
dengshaochun / cdh-tools
Automated installation of Cloudera Hadoop
ansible cloudera-hadoop auto-install
Language:Shell 4
achintya-kumar / BD2017
Otto-von-Guericke Universität Magdeburg - Big Data SoSe 2017
bigdata java cloudera-hadoop cluster-computing ovgu
Language:Java 2
haspdecrypted / OS-for-Big-Data-and-Hadoop
Getting Started with Hadoop and Big Data
bigdata hadoop spark cloudera-hadoop cloudera
2
kwartile / spark-benchmark
Spark Benchmark suite to evaluate cluster configuration and compare the performance with other big data frameworks.
spark apache-spark cloudera-hadoop cdh benchmark benchmarking-suite scala hadoop performance impala hive
Language:Scala 2
rapsoulhaonan / graphic-theoretic-problems
Hadoop/MapReduce Streaming
hadoop-mapreduce python virtualbox cloudera-hadoop
Language:Python 2
arunkthomasuncc / Query_Search_Using_TF-IDF
This repository contains the TF-IDF score calculation for the documents in the Canterbury dataset for a user-given search query
hadoop hadoop-mapreduce java cloudera-hadoop tfidf
Language:Java 1
dorianbg / cloudera-quickstart-installation-guide
How to install Cloudera quickstart
big-data hadoop oozie hue cloudera-hadoop cloudera
1
Ishuan / Page-Rank-Implementation
The goal of this programming assignment is to compute the PageRanks of an input set of hyperlinked Wikipedia documents using Hadoop MapReduce. The PageRank score of a web page serves as an indicator of the importance of the page. Many web search engines (e.g., Google) use PageRank scores in some form to rank user-submitted queries. The goals of this assignment are to: 1. Understand the PageRank algorithm and how it works in MapReduce. 2. Implement PageRank and execute it on a large corpus of data. 3. Examine the output from running PageRank on Simple English Wikipedia to measure the relative importance of pages in the corpus. To run your program on the full Simple English Wikipedia archive, you will need to run it on the dsba-hadoop cluster to which you have access.
mapreduce-java cloud-computing java hadoop-mapreduce cloudera-hadoop
Language:Java 1
JohnnyFoulds / local-hadoop
This project creates a small local Hadoop cluster using Cloudera CDH and CentOS.
hadoop cloudera-hadoop cloudera vmware-vsphere vmware-esxi centos powercli
Language:Python 1
SakhriHoussem / Apache-Hive-Tutorial
Learn how Hive works through a simple example
hive cloudera cloudera-hadoop
1
syscrest / cloudera-manager-hipchat-chatbot
Chatbot for HipChat (cloud or on-premises) that lets you talk to your Cloudera Manager
chatbot hipchat cloudera-manager cdh communication chatops devops hadoop cloudera-hadoop
Language:Java 1
vodkolav / DataEngineerProject
This is my final project for Data Engineer Expert course at Naya College.
hadoop hdfs hive spark spark-structured-streaming python3 jupyter-notebook kafka cloudera-hadoop twitter
Language:Jupyter Notebook 1
aastha-ghub / Airlines-Analysis-project-HADOOP
This project involves analysing the airline datasets to solve the problem statements using HADOOP.
airline-booking airline-datasets airlines airlines-analysis-hadoop cloudera-hadoop country data data-analysis hadoop hadoop-hdfs hadoop-mapreduce hive hiveql involves-analysing problem-statement virtualbox
Language:Rich Text Format 0
akshay-madar / MovieTycoon-gcp-based-BI-tool
GCP hosted product for over 1 million movie investors on HSX.com, aiding online movie trading and box-office investments by leveraging Big Data technologies like Hive and Hadoop, and Tableau dashboards
gcp cloud cloudera-hadoop hive big-data tableau movies hsx rotten-tomatoes box-office investment trading product reviewsanalysis-nlp
Language:Jupyter Notebook 0
akshaydake123 / Sentiment-Analysis-on-Twitter-Data
Shows how to perform sentiment analysis on tweets from Twitter using Hive. Collect the tweets from Twitter using Flume. Because the incoming tweets are in JSON format, load them into Hive using a JSON input format; the Cloudera Hive JSON SerDe serves this purpose.
hive flume json cloudera-hadoop sentiment-analysis twitter
0
bishalpaudel / HadoopProductPurchaseProbability
Anticipatory customer order prediction after the purchase of item(s).
hadoop hadoop-mapreduce cloudera-hadoop java
Language:Java 0
jcrespoortega / Docker-Twitter-Sentiment-analysis
docker sentiment-analysis twitter cloudera-hadoop map-reduce mrjob mongodb
Language:Python 0
marycboardman / Assessment-Attempts
Data processing using Docker containers, Kafka, Spark, and Hadoop
kafka spark pyspark docker docker-image docker-compose docker-container zookeeper cloudera cloudera-hadoop cloudera-hadoop-framework hadoop hadoop-hdfs hadoop-docker spark-sql sparksql digitalocean digital-ocean
0
nandanosql / fundamental-hadoop
fundamental-hadoop is an introduction to Apache Hadoop and its ecosystem.
hadoop cloudera-hadoop
0
Rifat392000 / BigDataAnalytics
big-data-analytics big-data-processing cloudera-hadoop clustering google-colab-notebook hadoop-mapreduce hue pyspark-notebook python3 rdbms sql visualization eclipse hadoop-filesystem java-mapreduce virtual-machine
Language:Jupyter Notebook 0
Rishi500067313 / Twitter-data-stream-into-MySQL-table-using-NiFI
nifi mysql cloudera-hadoop
0
VaishnavJois / CLOUDERA
Cloudera commands used for Big Data Analytics
cloudera cloudera-hadoop big hadoop-mapreduce pig hdfs
0
ankitssh / Flume-Twitter-Sentimental-Analysis
hadoop sentimental-analysis cloudera-hadoop
guptasaumya / navigator-data-service
Navigator is a data service that prepares the content for travel agencies, ready for exploration in EWNS (East-West-North-South) direction and hence allows them to render content to the end-user based on their desire to travel.
mapreduce-java cloudera-hadoop hive navigator travel eclipse-ide
Language:Java
Johnny1110 / Hadoop_Note
Hadoop study notes
hadoop java spark hbase hive cloudera-hadoop
Language:Shell
Mantej-Singh / Apache-Spark-Under-the-hood--WordCount
Running my first pyspark app in CDH5
pyspark wordcount apache-spark cdh5 cloudera-hadoop
Language:Jupyter Notebook
nikitaeverywhere / hadoop-network-of-keywords
Keywords network builder based on TF-IDF with the use of Hadoop platform
hadoop cloudera cloudera-hadoop tf-idf hadoop-platform keywords-builder term-frequency document-frequency mapreduce
Language:Python
SakhriHoussem / Apache-Spark-Tutorial
A simple Apache Spark tutorial
spark cloudera scala cloudera-hadoop cloudera-hadoop-framework
SakhriHoussem / HBase-Tutorial
A simple HBase tutorial
hbase hbase-shell cloudera cloudera-hadoop filter distance
SakhriHoussem / HBase-With-Hive
Learn how Hive works with HBase through a simple example
hive hbase hbase-shell hive-hbase cloudera cloudera-hadoop
SakhriHoussem / SparkSQL-Tutorial
A simple SparkSQL tutorial
spark sparksql spark-sql cloudera cloudera-hadoop cloudera-hadoop-framework
shubnimkar / Hadoop
This repository includes two versions of hadoop management tools
cloudera-hadoop hortonworks-hdp
