Jafeney / node-spider

Web crawler program based on the node

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

基于node的网络爬虫项目

@author Jafeney @dateTime 2016-08-05

项目说明

本项目采用express快速构建HTTP服务,用Request模块请求需要扒取的网页,然后用 Cheerio进行处理,然后结果以JSON格式存储。

    "body-parser": "~1.15.1",
    "cheerio": "^0.20.0",
    "cookie-parser": "~1.4.3",
    "debug": "~2.2.0",
    "ejs": "~2.4.1",
    "express": "~4.13.4",
    "morgan": "~1.7.0",
    "request": "^2.74.0",
    "serve-favicon": "~2.3.0"

欢迎fork

项目在线演示地址: http://jafeney.com:9999


欢迎关注我的 个人博客Jafeney or link me at 692270687@qq.com

About

Web crawler program based on the node


Languages

Language:JavaScript 98.2%Language:HTML 1.3%Language:CSS 0.5%