There are 10 repositories under website-scraper topic.
Download website to local directory (including all css, images, js, etc.)
Website Cloner - Utilizes powerful goroutines to clone websites to your computer within seconds.
🕵️ LinkedIn profile scraper returning structured profile data in JSON.
Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.
Plugin for website-scraper which returns HTML for dynamic websites using Puppeteer
🕸 Generates RSS feeds of any website and serves them to the web! Docker. Automatic scraping, use the built-in configs or create your own. Rolling release for speedy updates.
DPULSE - Tool for a complex approach to domain OSINT
A server to collect & archive websites that also supports video downloads
ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on the scraped content. The bot utilizes Retrieval Augmented Generation and webscraping to return natural language answers to the user's queries.
Python-based web crawling script with randomized intervals, user-agent rotation, and proxy server IP rotation to outsmart website bots and prevent blocking.
Plugin for website-scraper which returns HTML for dynamic websites using PhantomJS.
Automatically curates and posts content to LinkedIn. It can optionally use web scraping to gather data, which is then fed to ChatGPT to craft engaging LinkedIn posts.
Scraper for https://marvelsnapzone.com to retrieve metadata of Marvel SNAP cards.
JSON collection of scraped file extensions, along with their description and type, from FileInfo.com
Now you can keep track of your followers from YouTube, Instagram and Twitter accounts - Followers scraper API on AWS serverless
Website Penetration Testing Tool With DoS Attack Feature
Plugin for website-scraper which allows saving resources to an existing directory
Scraping websites made easy! A minimalistic yet powerful tool for collecting data from websites.
Download ALL the images (JPEG/GIF/PNG) from any Tumblr website! This project employs Python3 and BeautifulSoup4 to scrape a Tumblr site (with the URL provided by the user) to download, page by page, all the images from the Tumblr site's posts. Ideal for archiving other people's Tumblrs <3
This is a website URL scraper built using Python.
Article Dataset Generator for Internet News Sites. Crawls news sites, analyses them with NLP (sentiment analysis), and pushes to a database.
Simple library which parses web pages into objects using attributes
This script scrapes the verses and references from an openbible.info page into a JSON file. If needed, it uses bible-api.com to translate to another Bible version.
A Python script for automatically collecting links from a web page.
Scrapes any website to retrieve all hyperlinks from it in a matter of seconds. Scraping made easy!
This script downloads manhua, manga or manhwa and saves them in a directory of the same name.
Alexa Bulk Website Rank Checker PHP Script 2020 Latest! You can grab the website rankings of 200+ URLs at once!
A script for scraping the yellowpages.com website for name, contact, address and link