The IMDB Scraper is a Python script that allows you to scrape movie data from the IMDb website. It retrieves information such as movie titles, ratings, cast members, and saves it in a structured format for further analysis or use.
- Scrapes movie data from IMDb
- Retrieves movie titles, ratings, directors, and cast members
- Saves data in a structured format JSON format
- Clone the repository:
git clone https://github.com/Leomv55/IMDB-movie-scrapper.git
- Navigate to the project folder:
cd IMDB-movie-scrapper
- Install the required dependencies:
pip3 install pipenv && pipenv install
- Run the script:
pipenv run python main.py "<query>"
, where<query>
is the movie title you want to search for. - The script will scrape the data and save it in
output
dir. - To run the tests:
pipenv run python tests.py
Sample output:
[
{
"title": "The Matrix",
"release_date": "1999",
"ratings": "8.7",
"directors": [
"Lana Wachowski",
"Lilly Wachowski"
],
"cast": [
"Keanu Reeves",
"Laurence Fishburne",
"Carrie-Anne Moss"
],
"plot_summary": "When a beautiful stranger leads computer hacker Neo to a forbidding underworld, he discovers the shocking truth--the life he knows is the elaborate deception of an evil cyber-intelligence."
},...
]
Contributions are welcome! If you have any suggestions, bug reports, or feature requests, please open an issue or submit a pull request.
This project is licensed under the MIT License.
For any questions or inquiries, please contact leomv3@gmail.com.