There are 6 repositories under hadoop-streaming topic.
Installation and configuration of Hadoop on Google Colaboratory
Text Processing Using Hadoop
[MAP543] Hadoop Streaming (MapReduce) and Spark implementations of the Dijkstra shortest path algorithm
:elephant: :heavy_plus_sign: :snake: Learning Hadoop with Python
Bootcamp ministrado pela IGTI com o objetivo de abordar de forma intensiva conceitos e práticas da análise de dados, habilitando o aluno para atuar profissionalmente na área.
A case study on mining association rules between different factors related to deaths of people in the United States
A REST-based service that translates the SQL query into MapReduce and Spark jobs. It runs these jobs and provides the JSON object. SQL to MapReduce and Spark translator.
Market Basket Analysis using Hadoop MapReduce in Python
AWS Elastic Map Reduce Streaming Templates
Twitter Streaming Analytics Project (Big Data Analysis using Hadoop)
hadoop mapreduce algorithm with hadoop streaming (Python)
Learning Hadoop MapReduce Using Python
Step By Step guide for Hadoop installation on Ubuntu 16.04.3 with MapReduce example using Streaming
MapReduce Python Example
Hadoop Projects
Exercise files for Apache Hadoop Big Data Training
A small library example how to work with binary files with Hadoop Streaming.
K-Means, Hierarchical Agglomerative, Density based and Map Reduce K-Means Clustering implemented on 2 Gene Datasets in Python
Worked on Hadoop file streaming
First project for Big Data course held at Roma Tre University
Mutations
This repo contains implementations of Mapreduce program in a large text corpus with Apache Hadoop Environment | Nilufa Yeasmin | https://www.linkedin.com/in/nilufayeasmin/
Repository to the needs of Big Data course at university
Построение рекомендательной системы на основе алгоритма коллаборативной фильтрации и технологии Hadoop Streaming
Processing and transforming data via Hadoop Ecosystem
Implementation of Word2Vec for large datasets as a Map-Reduce Job using Hadoop Streaming.
PageRank algorithm using Hadoop Streaming
A Hadoop MapReduce application to find the maximum temperature in every day of the years 1901 and 1902 from the NCDC weather records.
Leveraging the mapreduce paradigm we propose a solution to parallelize the feedforward operation of neural networks in order to speed it up for sufficiently large NN architectures and for sufficiently large datasets. Tested Using the MNIST dataset results can be found in the results.html and results.ipynb files.