There are 96 repositories under the scraping topic.
Scrapy, a fast high-level web crawling & scraping framework for Python.
Elegant Scraper and Crawler Framework for Golang
Pythonic HTML Parsing for Humans™
Tabula is a tool for liberating data tables trapped inside PDF files
Generate code from cURL commands
Distributed crawler powered by Headless Chrome
Declarative web scraping
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Mechanize is a Ruby library that makes automated web interaction easy.
Do you want to LEARN NEW STUFF for FREE? Don't worry, with the power of web-scraping and automation, this script will find the necessary Udemy coupons & enroll you for PAID UDEMY COURSES, ABSOLUTELY FREE!
Collection of useful data science topics along with code and articles
A browser testing and web crawling library for PHP and Symfony
A Python module to scrape several search engines (such as Google, Yandex, Bing, and DuckDuckGo), with asynchronous networking support.
Getting started with Puppeteer and Chrome Headless for Web Scraping
Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / CloudFlare IUAM)
A curated list of awesome puppeteer resources.
Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.
Snoop is an intelligence-gathering tool based on open data (OSINT world)
Scrape Facebook public pages without an API key
Creating Scrapy scrapers via the Django admin interface
This Scrapy project uses Redis and Kafka to create a distributed, on-demand scraping cluster.
Scrape the Instagram frontend. Inspired by twitter-scraper by @kennethreitz.
[Unmaintained] A simple and clean video/music/image downloader 👾
DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with minimal code changes on your scraper. Integrates with any scraper in 5 minutes.
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Tools for various online judges. Downloading sample cases, generating additional test cases, testing your code, and submitting it.
✂️ High-performance, multi-threaded image scraper
Simple but useful Python web scraping tutorial code.
🧹 Python package for text cleaning
Internet-in-a-Box - Build your own LIBRARY OF ALEXANDRIA with a Raspberry Pi!
Generate Free Edu Mail(s) within minutes