kumarAnand05 / IMDb-Data-Scraper

Java Selenium based scraper to collect all media details from IMDb website.

Home Page:https://www.imdb.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

IMDb Data Scraper

By Anand Kumar

Features

  • Entire Database Collection : Extracts all the media data stored on IMDb website.
  • CSV Output : Extracts the data in CSV format.

Instructions

After you have downloaded the project files. Follow the instructions below to setup your machine to make code functional.

Downloading/Installing dependencies

Of course you need JDK and an IDE like VSCode, IntelliJ etc. installed on your machine.

Download Dependencies

Open the project in your IDE and connect to internet. Using the pom.xml file present in the project directory, download the dependencies by performing specific actions for the IDE that you are using.

Dos and Don'ts

Do's

  • You can use your machine during the process.
  • You can keep the browser and IDE in background.

Don'ts

  • Do not click on any element of the webpage as it can lead to termination of the code.
  • Do not use console during the process.
  • Do not turn off internet or close the automated browser session.

About

Java Selenium based scraper to collect all media details from IMDb website.

https://www.imdb.com


Languages

Language:Java 100.0%