ArchiveBox

ArchiveBox

Geek Repo

The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres

Location:Montréal, Quebec

Home Page:https://docs.archivebox.io

Twitter:@ArchiveBoxApp

Github PK Tool:Github PK Tool

ArchiveBox's repositories

ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Language:PythonLicense:MITStargazers:20399Issues:174Issues:870

good-karma-kit

😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

Language:TypeScriptLicense:MITStargazers:187Issues:10Issues:23

electron-archivebox

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

Language:JavaScriptLicense:GPL-3.0Stargazers:176Issues:9Issues:6

docker-archivebox

Home of the official docker image for ArchiveBox

Language:DockerfileLicense:GPL-3.0Stargazers:45Issues:3Issues:1

readability-extractor

Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

homebrew-archivebox

Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

Language:RubyLicense:GPL-3.0Stargazers:26Issues:3Issues:0

debian-archivebox

Home of the official apt/deb package for Ubuntu/Debian-based systems.

Language:PythonLicense:GPL-3.0Stargazers:18Issues:3Issues:2

DigestBox

DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

Language:HTMLStargazers:14Issues:0Issues:1

pip-archivebox

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

License:GPL-3.0Stargazers:14Issues:2Issues:0

docs

Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

internet-archiving-talk

🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

Language:JavaScriptStargazers:13Issues:2Issues:0

archivebox-proxy

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

Language:PythonLicense:MITStargazers:10Issues:1Issues:0

pydantic-pkgr

A modern Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

Language:PythonLicense:MITStargazers:7Issues:1Issues:0

community

A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.

Stargazers:4Issues:0Issues:0

archivebox-spreadsheet-bot

This is a bot that provides ArchiveBox integration with Google Sheets for new URL ingestion, archived URL management, and automated QA (optionally AI-powered).

License:GPL-3.0Stargazers:2Issues:0Issues:0