A Node.js web scraping project to extract book information from a specific website and save it to a MongoDB database.
I built this project to practise different concepts. It is a web scraper built with Node.js, using Axios for HTTP requests and Cheerio for parsing HTML. It scrapes book data from http://books.toscrape.com and stores it in a MongoDB database. The scraped data includes book titles, prices, and star ratings.
- Web scraping of book data from multiple pages.
- Rate limiting to avoid overloading the target website.
- Data storage in a MongoDB database.
- Basic error handling and logging.
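
The multi-page scraping, rate limiting, and error handling listed above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the names `pageUrl`, `sleep`, and `scrapePages` are hypothetical, the `/catalogue/page-N.html` URL pattern is an assumption about how the target site paginates, and `fetchPage` stands in for whatever function makes the request (for example with Axios) and parses the HTML (for example with Cheerio).

```javascript
// Hypothetical helpers: none of these names come from the project itself.
const BASE_URL = "http://books.toscrape.com";

// Assumed listing URL pattern for page n of the catalogue.
function pageUrl(pageNumber) {
  return `${BASE_URL}/catalogue/page-${pageNumber}.html`;
}

// Resolve after `ms` milliseconds; used to space out requests.
function sleep(ms) {
  return new Promise((resolve) => setTimeout(resolve, ms));
}

// Visit pages 1..pageCount one at a time, pausing `delayMs` between requests.
// `fetchPage` is any async function that takes a URL and returns its result.
async function scrapePages(fetchPage, pageCount, delayMs) {
  const results = [];
  for (let page = 1; page <= pageCount; page++) {
    try {
      results.push(await fetchPage(pageUrl(page)));
    } catch (err) {
      // Log and continue so one failed page does not stop the whole run.
      console.error(`Failed to scrape page ${page}: ${err.message}`);
    }
    if (page < pageCount) {
      await sleep(delayMs);
    }
  }
  return results;
}
```

A call such as `scrapePages(fetchPage, 5, 1000)` would then request five pages with a one-second pause between them. Requesting pages sequentially with a fixed delay is the simplest form of rate limiting; the project may use a different approach.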
- Clone the repository to your local machine
- Navigate to the project directory
- Install dependencies with:
npm install
- Configure your MongoDB connection settings in config/database.js.
- Customize scraping parameters in app.js.
- Run the scraper:
npm start
This project is licensed under the MIT License.