seiyawati / docker-scrapy-selenium-chrome

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Docker Scrapy Selenium Chrome

Usage

list scrapy commands

docker run --rm jamesway/scrapy

create project

#the container WORKDIR is /code
docker run --rm -v $(pwd):/code jamesway/scrapy startproject [scrapy_project_name]

create a spider for a domain

cd [scrapy_project_name]
docker run --rm -v $(pwd):/code jamesway/scrapy genspider [spider_name] [domain.com]

#eg
docker run --rm -v $(pwd):/code jamesway/scrapy genspider example example.com

crawl

# -o specifies output type eg: json list (.jl)
docker run --rm -v $(pwd):/code jamesway/scrapy crawl [spider_name] -o [output_file.jl]

About


Languages

Language:Python 63.2%Language:Dockerfile 19.1%Language:Shell 17.7%