essiembre / collector-http-ci-tests

For testing GitHub actions. Not a "real" fork.

Home Page:https://opensource.norconex.com/crawlers/web

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Norconex HTTP Collector

Norconex HTTP Collector

Norconex HTTP Collector is a full-featured web crawler (or spider) that can manipulate and store collected data into a repositoriy of your choice (e.g. a search engine). It very flexible, powerful, easy to extend, and portable. Can be used command-line with file-based configuration on any OS, or can be embedded into Java applications using well documented APIs.

Visit the web site for binary downloads and documentation:

About

For testing GitHub actions. Not a "real" fork.

https://opensource.norconex.com/crawlers/web

License:Apache License 2.0


Languages

Language:Java 97.9%Language:HTML 1.6%Language:JavaScript 0.3%Language:Batchfile 0.1%Language:Shell 0.1%