houseme / spider

The fastest web crawler written in Rust. Maintained by @a11ywatch.

Home Page: https://docs.rs/spider/latest/spider/

Spider

The fastest web crawler and indexer. Foundational building blocks for data curation workloads.

  • Concurrent
  • Streaming
  • Decentralization
  • Headless Chrome Rendering
  • HTTP Proxies
  • Cron Jobs
  • Subscriptions
  • Smart Mode
  • Blacklisting and Budgeting Depth
  • Dynamic AI Prompt Scripting Headless
  • Changelog
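To make the "Blacklisting and Budgeting Depth" item concrete, here is a minimal sketch of the idea using only the Rust standard library. It is not the spider crate's API: the `crawl` function, the `sample_graph` helper, and the in-memory link graph are hypothetical names invented for illustration, and no network requests are made. It assumes a blacklist is a set of path prefixes to skip and a depth budget is the maximum number of link hops from the start page.

```rust
use std::collections::{HashMap, HashSet, VecDeque};

/// Breadth-first traversal of an in-memory link graph (a stand-in for real pages).
/// `blacklist` holds path prefixes to skip; `max_depth` is the depth budget,
/// counted in link hops from the start page (which is depth 0).
fn crawl(
    graph: &HashMap<&str, Vec<&str>>,
    start: &str,
    blacklist: &[&str],
    max_depth: usize,
) -> Vec<String> {
    let mut visited: HashSet<String> = HashSet::new();
    let mut order: Vec<String> = Vec::new();
    let mut queue: VecDeque<(String, usize)> = VecDeque::new();

    visited.insert(start.to_string());
    queue.push_back((start.to_string(), 0));

    while let Some((page, depth)) = queue.pop_front() {
        order.push(page.clone());
        // Budget exhausted: record the page but do not follow its links.
        if depth == max_depth {
            continue;
        }
        if let Some(links) = graph.get(page.as_str()) {
            for &link in links {
                let blocked = blacklist.iter().any(|prefix| link.starts_with(*prefix));
                // `insert` returns false for pages that were already seen.
                if !blocked && visited.insert(link.to_string()) {
                    queue.push_back((link.to_string(), depth + 1));
                }
            }
        }
    }
    order
}

/// Hypothetical site layout used only for this illustration.
fn sample_graph() -> HashMap<&'static str, Vec<&'static str>> {
    HashMap::from([
        ("/", vec!["/docs", "/admin", "/blog"]),
        ("/docs", vec!["/docs/api", "/"]),
        ("/blog", vec!["/blog/post-1"]),
        ("/docs/api", vec!["/docs/api/v2"]),
    ])
}

fn main() {
    let graph = sample_graph();
    // Skip everything under /admin and follow links at most one hop deep.
    for page in crawl(&graph, "/", &["/admin"], 1) {
        println!("{page}");
    }
}
```

With a depth budget of 1, the traversal visits the start page and its direct links, minus anything under `/admin`. Raising the budget to 2 also reaches `/docs/api` and `/blog/post-1`, but not `/docs/api/v2`.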

Getting Started

The simplest way to get started is to use the Spider Cloud hosted service. For local installation, see the spider or spider_cli directory. Spider is also available for Node.js through spider-nodejs and for Python through spider-py.

Benchmarks

See BENCHMARKS.

Examples

See EXAMPLES.

License

This project is licensed under the MIT license.

Contributing

See CONTRIBUTING.

Languages

Language: Rust 100.0%