skent259 / schrute

The Entire Transcript from The Office in Tidy Format

Home Page: https://bradlindblad.github.io/schrute/

schrute

Analyze and have fun with the text from the best series of all time

Installation

You can install the released version of schrute from CRAN with:

install.packages("schrute")

Usage

The schrute package has one and only one purpose: to share the complete script transcription of The Office (US) television show. Users are encouraged to use the tidy text data for exploration, learning, and fun.

Check out the data like so:

library(schrute)
library(tibble)

tibble::glimpse(schrute::theoffice)
#> Observations: 55,130
#> Variables: 12
#> $ index            <int> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 1…
#> $ season           <chr> "01", "01", "01", "01", "01", "01", "01", "01", "01"…
#> $ episode          <chr> "01", "01", "01", "01", "01", "01", "01", "01", "01"…
#> $ episode_name     <chr> "Pilot", "Pilot", "Pilot", "Pilot", "Pilot", "Pilot"…
#> $ director         <chr> "Ken Kwapis", "Ken Kwapis", "Ken Kwapis", "Ken Kwapi…
#> $ writer           <chr> "Ricky Gervais;Stephen Merchant;Greg Daniels", "Rick…
#> $ character        <chr> "Michael", "Jim", "Michael", "Jim", "Michael", "Mich…
#> $ text             <chr> "All right Jim. Your quarterlies look very good. How…
#> $ text_w_direction <chr> "All right Jim. Your quarterlies look very good. How…
#> $ imdb_rating      <dbl> 7.6, 7.6, 7.6, 7.6, 7.6, 7.6, 7.6, 7.6, 7.6, 7.6, 7.…
#> $ total_votes      <int> 3706, 3706, 3706, 3706, 3706, 3706, 3706, 3706, 3706…
#> $ air_date         <fct> 2005-03-24, 2005-03-24, 2005-03-24, 2005-03-24, 2005…

Or view the short vignette with:

vignette("theoffice")
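
As a starting point for the kind of exploration encouraged above, here is a minimal sketch in base R. It assumes the column names and types shown in the glimpse() output (season and episode stored as character, air_date as a factor). The helpers tidy_types() and top_speakers() are hypothetical names made up for this illustration and are not part of schrute.

```r
# Sketch only: tidy_types() and top_speakers() are hypothetical helpers,
# not functions exported by schrute.

# Convert the character/factor columns shown by glimpse() to integer/Date.
tidy_types <- function(df) {
  df$season   <- as.integer(df$season)               # "01" -> 1L
  df$episode  <- as.integer(df$episode)
  df$air_date <- as.Date(as.character(df$air_date))  # factor -> Date
  df
}

# Count rows (spoken lines) per character, most frequent first.
top_speakers <- function(df, n = 5) {
  counts <- sort(table(df$character), decreasing = TRUE)
  head(data.frame(character = names(counts),
                  lines = as.integer(counts),
                  stringsAsFactors = FALSE), n)
}

# Run against the real data only if schrute is installed.
if (requireNamespace("schrute", quietly = TRUE)) {
  office <- tidy_types(schrute::theoffice)
  print(top_speakers(office))
}
```

Both helpers accept any data frame with the same column names, so they can be tried on a small hand-built table before running them on the full transcript.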

About

License: Other

Languages: R 74.5%, Python 25.5%