tahtaciburak / bigdata-term-project

Yildiz Technical University Computer Engineering BLM4120 Course Term Project

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

BLM4120 Big Data Processing Term Project

This repo contains Yildiz Technical University BLM4120 Course's term project codes. In this project a dataset processed in Hadoop Ecosystem with MapReduce programming model. The dataset contains Crimes in Chicago and it is publicly available at https://www.kaggle.com/chicago/chicago-crime. 5 different MapReduce jobs written in java. To visualize calculated results, a web app developed with Python/Flask and Chart.js

Web Application

MapReduce Jobs

  • Crime Type Count: Count the types of crime
  • FBI Code Count: Count the types of FBI Code
  • Average Latitude: Calculate the average value of Latitude's
  • Average Longitude: Calculate the average value of Latitude's
  • Most Frequent Year: Calculate the most frequent year

About

Yildiz Technical University Computer Engineering BLM4120 Course Term Project


Languages

Language:Java 46.6%Language:HTML 33.2%Language:Python 14.9%Language:Shell 5.3%