IhorPletnov / oscar-predictor

Home Page:http://oscarpredictor.github.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

oscar-predictor

This is an ipython notebook that walks through our process of using IMBD data to predict the success of any given movie.

Movie success predictor written by @dhanus, @dtomc, @rmazumdar, & @sbuschbach for a Harvard Data Science final project. We were advised by @lfcampos.

Getting Started

Install required python packages:

pip install -r requirements.txt

Road Map

We did two analyses for this project: Oscar Predictor and Box Office Sales.

alt text

####Oscar Predictor The data scraper for this analysis can be found in ipython notebook oscar_scraper.ipynb. This takes in the xls file "Academy_Awards_2006.xls" and outputs "AAdictfinal", a dataset in dictionary form. ("AAdict.p" can be used to skip a portion of this notebook, the output will still be "AAdictfinal") To run this data scraper run:

ipython notebook oscar_scraper.ipynb

The process notebook can be found in ipython notebook oscar_process_notebook.ipynb. This notebook uses "AAdictfinal". To run the analysis for the Oscar Predictor run:

ipython notebook oscar_process_notebook.ipynb

####Box Office Sales The data scraper for this analysis can be found in ipython notebook box_office_scraper.ipynb. This notebook outputs "BOdict", a dataset in dictionary form. To run this:

ipython notebook box_office_scraper.ipynb

The process notebook can be found in ipython notebook oscar_process_notebook.ipynb. This notebook uses "BOdict". Run ipython notebook:

ipython notebook box_office_process_notebook.ipynb

About

http://oscarpredictor.github.io

License:MIT License


Languages

Language:OpenEdge ABL 71.1%Language:Jupyter Notebook 28.9%