Web Scrapping & Movie Popularity Prediction Model

📗 Table of Contents

📖 About the Project
- 🛠 Built With
  - Key Features
💻 Getting Started
👥 Authors
🤝 Contributing
⭐️ Show your support
🙏 Acknowledgements

📖 Project Name

Web Scrapping & Movie Popularity Prediction Model

About this project

The purpose of this project is to build a machine learning model from a non existing database to predict the popularity of a movie based on several features. The database is collected using web scrapping technique. In this project the website named Sens Critique has been used for scrapping to collect necessary data.

N.B. The author does not support web scrapping for piracy or any sort of unauthorized use of data from any website. This project is for education purpose only.

🛠 Built With

Python3, Pandas, NumPy, and SQLLite3

Key Features

500 pages of the website containing 16 movies each is scrapped and about page of each movie is also scrapped to collect necessary data.
This project is a great example of scrapping dynamically loaded data on a website.
Several features are considered in order to build the machine learning model for the prediction of a movie popularity.
The project uses selenium package to simulate a browser experience in order to load every page smoothly.
Pandas is used to generate a dataframe.

(back to top)

💻 Getting Started

To clone the repository in local environment try following steps.

Prerequisites

A web browser like Google Chrome.
A code editor like Visual Studio Code with Git, Python3, Pandas, Selenium.

You can check if Git is installed by running the following command in the terminal.

$ git --version

Likewise for python and pip for package installation.

$ python --version

$ pip show pandas

$ pip show numpy

$ pip show selenium

N.B. If any required package is missing, they must be installed using pip install {package_name} to ensure the proper use of the notebook.

Setup

Clone the repository using this link.

Install

In the terminal, go to your file directory and run this command.

$ git clone https://github.com/PrangonGhose/movie-popularity-prediction-model.git

Get into development

In the terminal, run these commands to get into development.

$ cd movie-popularity-prediction-model

👥 Authors

👤 Prangon Ghose

GitHub: @PrangonGhose
LinkedIn: Prangon Ghose

(back to top)

🤝 Contributing

Contributions, issues, and feature requests are welcome! Add suggestions by opening new issues.

PrangonGhose / movie-popularity-prediction-model

Web Scrapping & Movie Popularity Prediction Model

📗 Table of Contents

📖 Project Name