hagarmar / crunch-movie-analysis

Analyzing movie data with Apache Crunch

This is an exercise in data analysis using Apache Crunch.

The dataset

The dataset used is the MovieLens 10M dataset, which contains about 10 million movie ratings along with user-applied tags.

Types of analyses

  1. Find the most common tag per movie
  2. Find the most common movie genre per rater
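
To make the first analysis concrete, here is a minimal sketch of the "most common tag per movie" logic in plain Java, run on a small in-memory sample. It is not the Crunch pipeline from this repository and uses no Crunch API; the class name `MostCommonTag`, the method name, the sample lines, the lowercasing of tags, and the alphabetical tie-break are all assumptions made for illustration. It assumes the `UserID::MovieID::Tag::Timestamp` line format of the MovieLens 10M `tags.dat` file.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not part of this repository.
class MostCommonTag {

    /** Returns, for each movie ID, the tag that was applied to it most often. */
    static Map<Integer, String> mostCommonTagPerMovie(List<String> tagLines) {
        // movieId -> (tag -> number of times the tag was applied)
        Map<Integer, Map<String, Integer>> counts = new HashMap<>();
        for (String line : tagLines) {
            String[] fields = line.split("::");
            if (fields.length != 4) {
                continue; // skip lines that do not match the assumed format
            }
            int movieId = Integer.parseInt(fields[1]);
            // Assumption: tags differing only in case count as the same tag.
            String tag = fields[2].trim().toLowerCase();
            counts.computeIfAbsent(movieId, k -> new HashMap<>())
                  .merge(tag, 1, Integer::sum);
        }

        // Pick the highest-count tag per movie; ties go to the
        // alphabetically smaller tag so the result is deterministic.
        Map<Integer, String> result = new TreeMap<>();
        for (Map.Entry<Integer, Map<String, Integer>> movie : counts.entrySet()) {
            String best = null;
            int bestCount = 0;
            for (Map.Entry<String, Integer> t : movie.getValue().entrySet()) {
                int c = t.getValue();
                if (c > bestCount || (c == bestCount && t.getKey().compareTo(best) < 0)) {
                    best = t.getKey();
                    bestCount = c;
                }
            }
            result.put(movie.getKey(), best);
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up sample lines in the assumed tags.dat format.
        List<String> sample = List.of(
            "1::10::funny::100",
            "2::10::Funny::101",
            "3::10::dark::102",
            "1::20::classic::103");
        System.out.println(mostCommonTagPerMovie(sample));
    }
}
```

The second analysis follows the same count-then-pick-the-maximum pattern, keyed by rater instead of movie, after each rating has been joined to the genres of the rated movie. A Crunch pipeline would express the same steps as distributed operations rather than in-memory maps.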

Languages

Language:Java 100.0%