This is a repository for scripts to parse idlebrain.com reviews.
We explore the distribution of reviews across the years.
pip install -e . # picks up packages from requirements.txt and installs
Idlebrain Archive page lists movies.
Create reviews.db
file. Open it with SqliteBrowser and create movies
table with the following schema:
CREATE TABLE `movies` ( `id` INTEGER, `name` TEXT, `url` TEXT, `release_date` TEXT, `rating` TEXT )
We fetch the archive list, save the links in a sqlite
database named reviews.db
(movies
table). For this, run parse_reviews()
in parse.py
.
This updates movies
table with movie entries.
Create data
directory in the root directory.
For this, run fetch_data_from_IB()
in parse.py
. This creates movie_name.html
in data/
directory.