There are 151 repositories under webscraping topic.
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
Python scraper based on AI
⚡ Easiest no code web data extraction platform • Instantly turn any website into API or spreadsheet ⚡
A cli tool to browse and play anime
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
List of libraries, tools and APIs for web scraping and data processing.
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️♂️ when scraping the web?
Scrapoxy is a super proxies manager that orchestrates all your proxies into one place, rather than spreading management across multiple scrapers. It manages IP rotation and fingerprinting, and smartly routes traffic to avoid bans.
Web Scraper in Go, similar to BeautifulSoup
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
Undetected version of the Playwright testing and automation library.
The web scraping open project repository aims to share knowledge and experiences about web scraping with Python
Persistent HTTP cache for python requests
👻 Experimental library for scraping websites using OpenAI's GPT API.
LinkedIn enumeration tool to extract valid employee names from an organization through search engine scraping
Creating Scrapy scrapers via the Django admin interface
Make your job hunt easy by automating your application process with this Auto Applier
Undetected Python version of the Playwright testing and automation library.
A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Scalable Python web scraping scripts for +40 popular domains
Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.
A python bot to automatically apply all Linkedin,Glassdoor, etc Easy Apply jobs based on your preferences. Auto login, auto fill additional questions, apply automatically!
Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically