ilazaridis / http-status-check

CLI tool to crawl a website and check HTTP status codes

Home Page: https://murze.be/2015/11/building-a-crawler-in-php/

Check the status code of all links on a website

This repository provides a tool to check the HTTP status code of every link on a given website.

Spatie is a webdesign agency in Antwerp, Belgium. You'll find an overview of all our open source projects on our website.

Postcardware

You're free to use this package (it's MIT-licensed), but if it makes it to your production environment you are required to send us a postcard from your hometown, mentioning which of our package(s) you are using.

Our address is: Spatie, Samberstraat 69D, 2060 Antwerp, Belgium.

The best postcards will get published on the open source page on our website.

Install

Via Composer

composer global require spatie/http-status-check
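
After a global install, the http-status-check command is only found if Composer's global bin directory is on your PATH. The following is a minimal sketch for a POSIX shell, not part of the package itself. The helper name prepend_path is hypothetical, and the sketch assumes that composer global config bin-dir --absolute prints that directory, as it does on a typical Composer install.

```shell
# Hypothetical helper: prepend a directory to PATH unless it is already listed.
prepend_path() {
  case ":$PATH:" in
    *":$1:"*) ;;                 # already present, leave PATH unchanged
    *) PATH="$1:$PATH" ;;
  esac
}

# Ask Composer for its global bin directory and add it to PATH.
# stderr is silenced because Composer prints status messages there.
if command -v composer >/dev/null 2>&1; then
  bin_dir=$(composer global config bin-dir --absolute 2>/dev/null)
  [ -n "$bin_dir" ] && prepend_path "$bin_dir"
  export PATH
fi
```

To make the change permanent, the same lines could go in your shell's startup file.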

Usage

This tool will scan all links on a given site.

http-status-check scan https://example.com

It outputs a line per link found.

When the crawl is finished, a summary is shown.

By default, it uses 10 concurrent connections to speed up the crawling process. You can change that number by passing a different value to the --concurrency option.

http-status-check scan https://example.com --concurrency=20

You can also write all URLs that returned neither a 2xx nor a 3xx response to a file:

http-status-check scan https://example.com --output=log.txt
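
The rule for which responses end up in the log can be sketched in shell. This only illustrates the 2xx/3xx rule described above and is not the tool's actual implementation. The function name is_logged is hypothetical.

```shell
# Hypothetical helper: succeed (exit 0) when a status code would be logged,
# i.e. when it falls outside both the 2xx and 3xx ranges.
is_logged() {
  case "$1" in
    2[0-9][0-9]|3[0-9][0-9]) return 1 ;;  # success or redirect: not logged
    *) return 0 ;;                        # anything else: logged
  esac
}

# Classify a few sample status codes.
for code in 200 301 404 500; do
  if is_logged "$code"; then
    echo "$code logged"
  else
    echo "$code skipped"
  fi
done
```

Here 200 and 301 are reported as skipped, while 404 and 500 are reported as logged.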

Changelog

Please see CHANGELOG for more information on what has changed recently.

Contributing

Please see CONTRIBUTING for details.

Security

If you discover any security-related issues, please email freek@spatie.be instead of using the issue tracker.

Testing

To run the tests, you'll first have to start the included Node-based server in a separate terminal window.

cd tests/server
./start_server.sh

With the server running, you can start testing.

vendor/bin/phpunit

Credits

License

The MIT License (MIT). Please see License File for more information.
