kkweon / cragislist-apt-crawler

scrapy and python

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Craigslist Apartment Crawler

crawl apartment and update the database. It also finds the closest Caltrain station and returns its distance.

targetsite: https://sfbay.craigslist.org/search/apa

Requirements

  1. scrapy
  2. python3

File Description

  • README.md (current file)
  • result.csv
    • result file
  • craigslit_apt/
    • scrapy project

How to use

scrapy crawl cl # run crawler
(source) mongo2csv # from mongo to result.csv

About

scrapy and python


Languages

Language:Python 99.4%Language:Shell 0.6%