iurshina / fiction_chapters

Small data set of fiction chapters with plot-revealing names

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Fiction chapters dataset

A small data set of fiction chapters with plot-revealing names.

The data set contains 1253 data points (chapters and the corresponding titles). All the books are in English. The following literary works create the dataset:

  • ”Around the World in Eighty Days” by Jules Verne (37 chapters)
  • ”Baboo Hurry Bungsho Jabberjee” by Thomas Anstey Guthrie (32 chapters)
  • ”Lonely O’Malley” by Arthur Stringer (14 chapters)
  • ”The Mill on the Floss” by George Eliot (59 chapters)
  • ”Nibelungenlied”, Anonymous (24 chapters, only chapters available on wikisource were used)
  • ”Omoo” by Herman Melville (82 chapters)
  • ”Redburn. His First Voyage” by Herman Melville (62 chapters)
  • ”Gargantua and Pantagruel” by Fran ̧cois Rabelais (258 chapters)
  • ”Don Quixote” by Miguel de Cervantes Saavedra (126 chapters)
  • ”Le Morte d’Arthur” by Sir Thomas Malory (around 525 chapters)
  • ”Candide” by Voltaire (30 chapters)
  • ”The Adventures of Pinocchio” by Carlo Collodi (36 chapters)

The first 8 books were downloaded from wikisource and the rest (”Gargantua and Pantagruel”, ”Don Quixote”, ”Le Morte d’Arthur”, ”Candide” and ”The Adventures of Pinocchio”) are from Project Gutenberg.

About

Small data set of fiction chapters with plot-revealing names