There are 8 repositories under the web-archive topic.
Free web archiving and sharing service based on Cloudflare.
Serverless replay of web archives directly in the browser
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
Hunt down the secrets from the WebArchives for Fun and Profit
Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.
Summarize web archive capture index (CDX) files.
🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...
📜 The Archive Query Log.
A Tool to Summarize Web Archive Holdings
Build rich Git project history discovery apps with ease. Used by Gitstory.
A continuation of the legacy XUL version of DownThemAll! ✔️ preserves web.archive.org timestamps, ✔️ advanced filters for remote directory tree mirroring, ✔️ UI tweaked for better UX
Read Web ARChive (WARC) files in Java.
Docker image for ReplayWeb.page
Crawls the web to generate a huge dataset for training
Miscellaneous utility scripts
Wubbzy archived sites
A CLI toolkit for working with web archives
Easily scrape, download and preview websites.
PalaceRadio | A Next.js app built from a web archive | Freelance project @upwork
Redirect to a live website or an archived version if it's down.
Periodically crawl a set of websites and ensure that all of their pages are archived on the Wayback Machine. Mirror of https://codeberg.org/meadowingc/waybacker
Tool to archive websites and other Internet content to the content-addressed S5 Network
Farsky | A Next.js app built from a web archive | Freelance project @upwork