j-marchadier / Series-R-Dashboarding

Dashboard of Series in R language

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

output
pdf_document html_document
default
default

LinkedIn Contributors


Series and Movies R Dashboarding

Table of Contents
  1. About The Project
  2. Getting Started
  3. Developer_Guide
  4. Roadmap
  5. Contributing
  6. License
  7. Contact
  8. Acknowledgments

About The Project

This is a project of our second year of engineering study. It took place in september and october. Our goal is to create an interactive dashboard including a map and an histogram. The Dataset is present on kaggle : www.kaggle.com/ashishgup/netflix-rotten-tomatoes-metacritic-imdb.

This dataset combines data sources from Netflix, Rotten Tomatoes, IMBD, posters, box office informations, trailers on YouTube, and more using a variety of APIs. Note that there is no official Netflix API.

"Hidden Gem Score" is calculated using low review count and high rating. Lower the review count and higher the user rating, higher the hidden gem score.

(back to top)

Getting Started

To get the projet, you need to clone it with the following command

  • Clone git repository
    git clone https://github.com/juju312000/Series_R_Dashboarding.git

https://github.com/juju312000/Series_R_Dashboarding.git

This repository is storage on github. To get a local copy, run follow this simple step.

User_Guide

You need to install packages and call them to run the project. The simpler way is to run in RStudio the few lines in the packages.R file. After that, you can see the dashboard on the url below : http://127.0.0.1:3711/

Built With

This project writed in R. Here are the different library used in the project :

(back to top)

Once the packages are well installed and imported in your RStudio, just run the app by writing 'runApp('Projet')' and you will be able to see the dashboard.

Developer_Guide

This project is divided in 2 parts :

  • Data cleaning
  • Dashboard

Data Cleaning

In "global.R" file we first download the dataset.

download.file(response[["url"]], "netflix-rotten-tomatoes-metacritic-imdb.zip", mode="wb")

When it's done, we read the .csv and select columns we want. We clean the first issues we have seen which are the most easy to fix. That's what we do in the global.R file after downloading and importing the datas.

In the original dataset in the column "country_availability", countries are present and split by comma. In the function clean_country we create as much columns as countries and we put True or False if the country is in or not.

We have so a dataframe with a lot of columns. We will pivot it with the function pivot_longer. That returns a dataframe with number of original rows * number of country.

We delete rows which are False in pivot column since it's an information we won't use, and rename rename the 'country' column into 'region' in order to make a geographical plot.

We make the exact same with genre is the function clean_genre.

We have now 3 different datasets. One which is specialised in the plots linked to the genre category, another specialised in the geographical plots and a classic one for all other types of plots.

However, our datas are not totally ready for the plots since there is few changements to do in the server in order to plot according to our filters.

Dashboard

In R Dashboard, there are 2 main parts which are :

  • UI
  • Server

UI

This part is located in the ui.R file, where we just create several Inputs in the SidebarMenu. For this project, we chose the type, which is a movie, a serie or both as a variable. We took the imdb_score and the run time as well.

Secondly, in the Dashboard Body, we just plot our the diffenrent graphs made in the server.R file

Server

Here comes the most difficult part. Located in the server.R file this is where we at first filter the datas according to our Inputs declared in the UI. Once this filtering is done, we now have to create reactive functions to produce our datas which goes into our different plot functions.

We have once again a particular Dataset for the countries and another for the genres.

Here is a visual of our R Dashboard

Analysis Report

We can show several things according to the user's desire thanks to our filters. For example :

  • Thanks to the graph representing the evolution of the world box office we can see that the curve follows an increasing trend for the movies which isn't the case for the series.

  • Whatever the filter applied, we can see that the 2 dominant genres are Drama and Comedy (for both series and movies).

  • We want to observe in which region are the best productions. In order to have our information we look at the map with the best 30 score filter. We can see that in general, North America and Europe concentrate the most high rated movies. The above examples show only a small part of what can be represented with this type of dashboard. It's up to you to draw the one you like.

Warning: Applying too many filters can cause errors due to lack of data.

Contact

Project Link: https://github.com/juju312000/Series_R_Dashboarding

(back to top)

About

Dashboard of Series in R language


Languages

Language:R 100.0%