chanway0602 / PythonSpiderDemo

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PythonSpiderDemo

Python spider lesson learn demo

python spider 基本步骤:

  1. 判断网页类型(静态网页?动态网页?)
  2. 静态网页 -> html -> dada 动态网页 -> 抓包 -> 分析 -> data
  3. 保存数据

技术:requests + (re, xpath); 复杂动态页面用selenium

ref:http://c.biancheng.net/python_spider/what-is-spider.html

About


Languages

Language:Python 88.0%Language:HTML 12.0%