kulak / spider

The fastest web crawler written in Rust

Home Page:https://docs.rs/spider/latest/spider

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Spider

Build Status Crates.io Documentation Rust

The fastest web crawler and indexer. Foundational building blocks for data curation workloads.

  • Concurrent
  • Streaming
  • Decentralization
  • Headless Chrome Rendering
  • HTTP Proxies
  • Cron Jobs
  • Subscriptions
  • Smart Mode
  • Blacklisting and Budgeting Depth
  • Changelog

Getting Started

The simplest way to get started is to use the Spider Cloud for a pain free hosted service. View the spider or spider_cli directory for local installations. You can also use the spider with node.js using the spider-nodejs project.

Benchmarks

See BENCHMARKS.

Examples

See EXAMPLES.

License

This project is licensed under the MIT license.

Contributing

See CONTRIBUTING.

About

The fastest web crawler written in Rust

https://docs.rs/spider/latest/spider

License:MIT License


Languages

Language:Rust 100.0%