nazareno / imdb-series

Series data from IMDB

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

IMDB Ratings for TV/Streaming Series

A dataset of ratings given in IMDB to episodes of popular TV and Streaming series. And the R code to generate your version of it.

Each line in the dtaset is an episode from a series from a handcrafted list of series. The variables are as follows:

* series_name <chr> Self explanatory
* series_ep   <int> Episode index in the series from 1 onwards.
* season      <int> From 1 onwards
* season_ep   <int> Episode index in the season
* url         <chr> IMDB url for the episode (eg "http://www.imdb.com/title/tt5174246/")
* Episode     <chr> Episode title
* UserRating  <dbl> IMDB User Rating calculated [as explained in their site](http://www.imdb.com/help/show_leaf?votestopfaq).
* UserVotes   <dbl> Num of votes for the rating
* r1          <dbl> Proportion of users who rated this episode with score 1
* r2          <dbl> Proportion of users who rated this episode with score 2
* ...
* r10         <dbl> Proportion of users who rated this episode with score 10

Analysing series

Go for data/series_from_imdb.csv and start analysing series/episode data.

Fetching the data / more data yourself

Dependencies

You'll need tidyverse and rvest.

Runnnig

Run get_series_data.R. It will fetch ratings for every episode of the series in series_urls.csv and save the result in data/series_from_imdb.csv.

About

Series data from IMDB

License:GNU General Public License v2.0


Languages

Language:R 100.0%