There are 123 repositories under git-scraping topic.
Better GitHub statistics images for your profile, with stats from private repos too
World Factbook Country Profiles in JSON - Free Open Public Domain Data - No API Key Required ;-)
Daily snapshots of public Spotify playlists
An up-to-date export of cloud provider IP address ranges
this shows how to use github actions to do periodic data scraping
Common Release Data for various projects in a consumable format, automatically updated.
Git scraping of AT Protocol/Bluesky instances
Unified datasets for public cloud provider IP ranges. Providers include AWS, Azure, CloudFlare, DigitalOcean, Fastly, Google Cloud and Oracle Cloud.
Data extraction of Google's COVID-19 Mobility Reports
Git scraping of Bluesky labelers/label providers
This repository contains the full dataset of AWS IAM data (services, actions, resource types and conditions keys). It's updated on a daily basis at 4AM UTC.
The open-source web scrapers that feed the Los Angeles Times California coronavirus tracker.
Scrapers for disaster data - writes to https://github.com/simonw/disaster-data
Tracking the history of trees in San Francisco
International Securities Identification Numbers for various Indian Securities
Get information about Indian Mutual Funds from their ISIN numbers.
Pulling a history of the holdings for ark invest funds https://ark-funds.com/
Data scraped by https://github.com/simonw/disaster-scrapers
Scrape various open data directories to create an index of what's available out there
Archive of German legal acts (weekly archive of gesetze-im-internet.de)
A repository demonstrating the use of real-estate-scrape to store the estimated value of a property on Redfin and Zillow every night using Github Actions.
Historical Mutual Funds data
Spotify genre attributes from EveryNoise