ArchiveBox

ArchiveBox

Organization data from Github https://github.com/ArchiveBox

The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres

Location:Montréal, Quebec

Home Page:https://docs.archivebox.io

GitHub:@ArchiveBox

Twitter:@ArchiveBoxApp

ArchiveBox's repositories

ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Language:PythonLicense:MITStargazers:25509Issues:175Issues:1023

good-karma-kit

😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

Language:JavaScriptLicense:MITStargazers:374Issues:8Issues:34

electron-archivebox

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

Language:JavaScriptLicense:GPL-3.0Stargazers:178Issues:6Issues:6

abx-dl

⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...

Language:JavaScriptLicense:MITStargazers:87Issues:5Issues:1

docker-archivebox

Home of the official docker image for ArchiveBox

readability-extractor

Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

pocket-exporter

[FREE] A service to help export your pocket bookmarks, tags, saved article text, and more...

Language:TypeScriptStargazers:31Issues:0Issues:11

archivebox-proxy

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

Language:PythonLicense:MITStargazers:29Issues:2Issues:0

homebrew-archivebox

Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

Language:RubyLicense:GPL-3.0Stargazers:28Issues:1Issues:0

abx-pkg

📦 Modern strongly typed Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

Language:PythonLicense:MITStargazers:20Issues:1Issues:0

abx-spec-behaviors

🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.

Language:JavaScriptLicense:MITStargazers:19Issues:1Issues:3

DigestBox

DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

debian-archivebox

Home of the official apt/deb package for Ubuntu/Debian-based systems.

Language:PythonLicense:GPL-3.0Stargazers:17Issues:2Issues:2

docs

Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

internet-archiving-talk

🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

Language:JavaScriptStargazers:15Issues:1Issues:0

pip-archivebox

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

License:GPL-3.0Stargazers:13Issues:1Issues:0

community

A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.

squasher-browser-extension

Extension to collect all open browser tabs for a given domain into a new window (with suspender support).

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0