johndpope / book-dataset

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Book Dataset

This dataset contains 137,788 books from the Amazon.com, Inc. marketplace.

Challenges

Results and related papers

Task 1: Classification

A. Book Cover Image to Genre (BookCover30)

The purpose of this task is to classify the books by the cover image. The BookCover30 dataset contains 57,000 book cover images divided into 30 classes. The training set and test set is split into 90% - 10% respectively.

Technical details

Task 2: Data Mining (Book32)

This task is to explore the entire book database. There are 137,788 books in 32 classes. This dataset contains book cover images, title, author, and category for each respective book.

Technical details

Citation

Paper on arXiv

B. K. Iwana, S. T. Raza Rizvi, S. Ahmed, A. Dengel, and S. Uchida, "Judging a Book by its Cover," arXiv preprint arXiv:1610.09204 (2016).

@article{iwana2016judging,
  title={Judging a Book by its Cover},
  author={Iwana, Brian Kenji and Raza Rizvi, Syed Tahseen and Ahmed, Sheraz and Dengel, Andreas and Uchida, Seiichi},
  journal={arXiv preprint arXiv:1610.09204},
  year={2016}
}

Contact

brian@human.ait.kyushu-u.ac.jp

Disclaimer

All book cover images are hosted by and copyright Amazon.com, Inc. The the use of the book cover images is fair use for academic purposes.

About

License:Apache License 2.0