A simple web scraper for popular property aggregator websites, like idealista and supercasa. It can be extended to handle other property websites.
- Node 16+ installed [https://nodejs.org]
- Valid web sraping api key [https://www.webscrapingapi.com]
git clone git@github.com:dafcoe/property-web-scraper.gitcd property-web-scrapernpm installnpm run buildSCRAPER_API_KEY=PlaceHereYouApiKeynode dist/property-scraper.js [options]-u, --urls <urls> | comma-separated urls of the property aggregator websites to scrape (required)
-f, --full | perform subsequent scrapes based on pagination (default: false)
-o, --output <output> | the output format, either "list" or "count" (default: "list")
// Get a list of properties from a single url
node dist/property-scraper.js -u https://www.idealista.pt/comprar-casas/lisboa/
// Get a list of properties from multiple urls
node dist/property-scraper.js -u https://www.idealista.pt/comprar-casas/lisboa/,https://www.idealista.pt/comprar-casas/lisboa/pagina-5,https://supercasa.pt/comprar-casas/lisboa
// Get the size of a list of properties from a single url
node dist/property-scraper.js -u https://www.idealista.pt/comprar-casas/lisboa/ -o count
// Get a full list of properties from a single url
// Goes through all pages and perform a scrape
// Use with caution as it will take some time
node dist/property-scraper.js -u https://www.idealista.pt/comprar-casas/lisboa/ -f