jkiv / jeopardy_clue_dataset

A dataset containing 376,000 Jeopardy clues.

jeopardy_clue_dataset

This dataset contains Jeopardy clues from Season 1 through Season 36 (June 2020). It does not contain every clue that has appeared on the show. The data source prefers not to be credited.

There are 376,266 clues in total. They can be found in combined_season1-36.tsv.

There are also individual files for each season.

  • Seasons 1-12 average 5,088 clues each.
  • Seasons 13-36 average 13,134 clues each.

There is a kids_teen.tsv file which contains only clues that appeared in Kids and Teen Tournament matches.

There is a separate goat_tournament_jan2020.tsv file which covers the Jennings-Holzhauer-Rutter event.

Note that combined_season1-36.tsv is distributed as a zip archive. Uncompressed, it is approximately 56 MB.
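The combined file can be read straight from the archive without unpacking it to disk first. The following is a minimal sketch, not part of the dataset itself. It assumes the archive contains a single .tsv member, that the first row holds the column labels, that the text is UTF-8, and that fields are not wrapped in quote characters. The function name `read_clues_from_zip` is made up for illustration.

```python
import csv
import io
import zipfile


def read_clues_from_zip(zip_path, member=None):
    """Yield each clue as a dict keyed by the header row of the TSV."""
    with zipfile.ZipFile(zip_path) as archive:
        # Assumption: the archive holds one .tsv file unless `member` is given.
        name = member or next(n for n in archive.namelist() if n.endswith(".tsv"))
        with archive.open(name) as raw:
            # Assumption: UTF-8 text; newline="" lets the csv module handle line ends.
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            # QUOTE_NONE keeps any quotation marks inside a clue as literal text.
            reader = csv.DictReader(text, delimiter="\t", quoting=csv.QUOTE_NONE)
            yield from reader
```

Because the function is a generator, rows are streamed one at a time, so the roughly 56 MB file never has to be held in memory as a whole. Pass the path of the downloaded archive as `zip_path`; its exact file name is not stated here, so check the repository listing.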


Column Labels:

  • round

    • 1 for Single Jeopardy
    • 2 for Double Jeopardy
    • 3 for Final Jeopardy
  • value – The clue's value on the board. If the clue was a Daily Double, this column will be the wagered amount.

  • daily_double – yes or no.

  • category

  • comments – Usually the host's comments about the category. Occasionally it holds other miscellaneous information.

  • answer

  • question

  • air_date – The calendar date on which the episode first aired.

  • notes – Indicates whether a clue appeared in a special tournament match.
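Every field arrives as text when the TSV is read, so a small conversion step can make the columns above easier to work with. The sketch below is illustrative only. It assumes `value` is stored as plain digits and that `daily_double` holds a literal "yes" or "no". `parse_clue`, `ROUND_NAMES`, and the added `round_name` key are hypothetical names, and the example row is placeholder data rather than a real clue.

```python
ROUND_NAMES = {1: "Single Jeopardy", 2: "Double Jeopardy", 3: "Final Jeopardy"}


def parse_clue(row):
    """Return a copy of `row` with round, value, and daily_double converted."""
    parsed = dict(row)
    parsed["round"] = int(row["round"])
    parsed["round_name"] = ROUND_NAMES.get(parsed["round"], "unknown")
    # Assumption: value is plain digits with no "$" sign or thousands separator.
    parsed["value"] = int(row["value"])
    # Assumption: the flag is a literal "yes" or "no" (case-insensitive).
    parsed["daily_double"] = row["daily_double"].strip().lower() == "yes"
    return parsed


# Placeholder row for illustration; not taken from the dataset.
example_row = {
    "round": "2",
    "value": "1200",
    "daily_double": "yes",
    "category": "EXAMPLE CATEGORY",
    "comments": "",
    "answer": "placeholder text",
    "question": "placeholder text",
    "air_date": "placeholder date",
    "notes": "",
}
clue = parse_clue(example_row)
```

Since a Daily Double's `value` is the wagered amount rather than the board value, the converted `daily_double` flag is useful for excluding those rows when analysing clue values by board position.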


All data is property of Jeopardy Productions, Inc. and protected under law. I am not affiliated with Jeopardy Productions, Inc. Please don't use the data to make a public-facing web site, app, or any other commercial product.
