rohanm9 / hadoop_assignment

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Lab1-Hadoop

This Lab consists of Map Reduce programs on Hadoop.

Question-2: Input file(input.txt) consists of the NCDC weather data and the MapReduce program returns yearly the Maximum Temperature recorded in output.txt.

Question-3: Input file(web_access_log.txt) consists of web access log roduced by a web server and the MapReduce program counts the number of times GIF, JPG, and other image files that have been accessed by clients.

Question-4: We are still using the same input file as used in question3, the MapReduce program return the output (Total number of requests and Total download size on Monthly basis) in output.txt.

Question-5: We are still using the same input file as used in question3, the MapReduce program lists Timestamp, URL for which http response status was 404.

About


Languages

Language:Java 100.0%