luzihang123 / scrapyd

A service daemon to run Scrapy spiders

Scrapyd

Scrapyd is a service for running Scrapy spiders.

It allows you to deploy your Scrapy projects and control their spiders using an HTTP JSON API.
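As a rough illustration of the HTTP JSON API, the sketch below schedules a spider run through the schedule.json endpoint using only the Python standard library. It assumes Scrapyd is listening on its default address (http://localhost:6800); the project and spider names are placeholders, not names taken from this repository.

```python
import json
from urllib import parse, request

# Assumption: Scrapyd is running locally on its default port.
SCRAPYD_URL = "http://localhost:6800"


def build_schedule_request(project, spider, base_url=SCRAPYD_URL, **spider_args):
    """Build the POST request for schedule.json without sending it."""
    fields = {"project": project, "spider": spider, **spider_args}
    data = parse.urlencode(fields).encode("utf-8")
    return request.Request(f"{base_url}/schedule.json", data=data, method="POST")


def schedule(project, spider, **spider_args):
    """Send the request and return the decoded JSON reply from Scrapyd."""
    req = build_schedule_request(project, spider, **spider_args)
    with request.urlopen(req, timeout=10) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # "myproject" and "myspider" are placeholder names for illustration.
    print(schedule("myproject", "myspider"))
```

Building the request separately from sending it keeps the form-encoding step easy to inspect without a running daemon.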

The documentation (including installation and usage) can be found at: http://scrapyd.readthedocs.org/

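Changes in this fork

Features:

1. The log_suffix option in the scrapyd.conf configuration file lets you customize the log format, which makes it easier for a company ELK stack to collect the logs.

2. A UTF_8 view was added to the front end. It fixes garbled characters and makes Chinese logs easy to read.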
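The log_suffix option mentioned above might be set and read roughly as follows. This is only a sketch: the [scrapyd] section and the logs_dir option follow the usual Scrapyd configuration layout, but the value ".elk.log" and the fallback ".log" are assumptions made for illustration, so check the fork's source for the format it actually accepts.

```python
import configparser

# Sample scrapyd.conf content. The log_suffix value is hypothetical.
SAMPLE_CONF = """
[scrapyd]
logs_dir = logs
log_suffix = .elk.log
"""


def read_log_suffix(conf_text, default=".log"):
    """Return log_suffix from the [scrapyd] section, or the default if unset."""
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(conf_text)
    return parser.get("scrapyd", "log_suffix", fallback=default)


if __name__ == "__main__":
    print(read_log_suffix(SAMPLE_CONF))
```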

TODO:

1. Log size control, to cope with the large logs produced by long-running jobs and high-volume crawls.

Install:

1. Install with pip (no git login required)

Gitee: pip install git+https://gitee.com/WoAiChiZhuSun/scrapyd.git

GitHub: pip install git+https://github.com/luzihang123/scrapyd.git

2. Copy the scrapyd/scripts/logs/UTF-8.html file into the logs_dir specified in the scrapyd configuration file.
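The copy step could be scripted along the following lines. This is a minimal sketch, not part of the fork: install_utf8_viewer is a hypothetical helper, and it assumes logs_dir lives in the [scrapyd] section of the configuration file and falls back to "logs" when unset. A relative logs_dir is resolved against the current working directory.

```python
import configparser
import shutil
from pathlib import Path


def install_utf8_viewer(conf_path, viewer_html):
    """Copy UTF-8.html into the logs_dir named in the given scrapyd.conf."""
    parser = configparser.ConfigParser(interpolation=None)
    parser.read(conf_path, encoding="utf-8")
    # Assumption: fall back to "logs" when logs_dir is not set.
    logs_dir = Path(parser.get("scrapyd", "logs_dir", fallback="logs"))
    logs_dir.mkdir(parents=True, exist_ok=True)
    return Path(shutil.copy(viewer_html, logs_dir / "UTF-8.html"))


if __name__ == "__main__":
    # Both paths are placeholders; adjust them to your checkout and config.
    print(install_utf8_viewer("scrapyd.conf", "scrapyd/scripts/logs/UTF-8.html"))
```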

License: BSD 3-Clause "New" or "Revised" License
