hellozeyu / Web-Scraping-AMA

This is a curated collections of WORKING spiders written in Python.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Web-Scraping-AMA

This is a curated collections of WORKING spiders written in Python.


Scraped the following fields of each movie using the Scrapy package.

  • movie_title
  • genre
  • release_date
  • metascore
  • meta_positive
  • meta_mixed
  • meta_negative
  • userscore
  • user_positive
  • user_mixed
  • user_negative

Scraped the listings of four example regions in NYC area using the Selenium package. Each listing contains the following fields.

Note: this website uses Distil Networks or similar technology to ban automated browser like Selenium. To bypass that, follow the solution here to change the cdc_ variable in your chromedriver otherwise you will see the reCAPTCHA after a few pages.

  • property
  • bed
  • bath
  • sqft
  • price

About

This is a curated collections of WORKING spiders written in Python.

License:MIT License


Languages

Language:Python 100.0%