There are 4 repositories under website-crawler topic.
It allows you to download a website from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer.
Python-based web crawling script with randomized intervals, user-agent rotation, and proxy server IP rotation to outsmart website bots and prevent blocking.
Scraper for https://marvelsnapzone.com to retrieve metadata of Marvel SNAP cards.
An almost generic web crawler built using Scrapy and Python 3.7 to recursively crawl entire websites.
A tutorial and code samples of web scraping with PHP
A Simple Script To Scrape DuckDuckGo Search Results Using Python And Selenium WebDriver.
:dizzy: Crawl urls from a webpage and provide a DomCrawler with Scraper Library
This a project to demonstrate the use of standard python libraries like os, urllib, HTMLParser to create a minimalist webpage crawler that crawls webpages on a website to gather hyperlinks (URLs)
Crawls a website to generate insights
A tutorial on using Oxylabs' E-Commerce Scraper
A quick-start guide on using Web Scraper API
The most advanced Lightshot (or prnt.sc) scraper ever!
Java website crawler - library for analyze and testing websites
Web Link Crawler: A Python script to crawl websites and collect links based on a regex pattern. Efficient and customizable.
Grabs images off webpages.
Recursive website crawler
Created a website-crawler in bash. Note, it's for a specific website and will not work unless you know the site.
a python script that crawls website sitemap in a very quick way with multi threading and extract, write SEO based data to CSV file
The most advanced Imgur scraper ever!
Parses data using json file as instruction and writes to SQL server database
Sitesweeper is a python package to help you automate your web scraping process, outputting pages to a file
Simple website crawler to get Meta tags and <H1> on Python