http://www.bcactc.com/home/gcxx/index.aspx?gs&sg_gs
安装命令:
pip install -r requirements.txt
运行命令(目录下crawl_bectin\crawl\crawl):
python debug.py crawl bcactc -o re1.csv
结果:
re1.csv:抓取的基本信息
pdf_result:抓取的pdf
http://www.bcactc.com/home/gcxx/index.aspx?gs&sg_gs
安装命令:
pip install -r requirements.txt
运行命令(目录下crawl_bectin\crawl\crawl):
python debug.py crawl bcactc -o re1.csv
结果:
re1.csv:抓取的基本信息
pdf_result:抓取的pdf