John Berlin's repositories

Squidwarc

Squidwarc is a high-fidelity, user-scriptable archival crawler that uses Chrome or Chromium with or without a head

Language: JavaScript | License: Apache-2.0 | Stargazers: 164 | Issues: 10 | Issues: 32

wail

🐋 One-Click User Instigated Preservation

Language: JavaScript | License: GPL-3.0 | Stargazers: 120 | Issues: 13 | Issues: 105

node-warc

Parse And Create Web ARChive (WARC) files with Node.js

Language: JavaScript | License: MIT | Stargazers: 92 | Issues: 9 | Issues: 15

userAgentLists

Get your lists of User-Agent Strings here

Language: Python | License: MIT | Stargazers: 78 | Issues: 8 | Issues: 2

chrome-remote-interface-extra

Like fs-extra but for the chrome-remote-interface by cyrus-and

Language: JavaScript | License: Apache-2.0 | Stargazers: 18 | Issues: 1 | Issues: 3

simplechrome

Webrecorder's DevTools Protocol Automation Library

chrome-remote-interface-py

Chrome Debugging Protocol interface for Python asyncio

Language: Python | License: Apache-2.0 | Stargazers: 11 | Issues: 3 | Issues: 3

controlChromeHeadless

This project is dead. Use Squidwarc (https://github.com/N0taN3rd/Squidwarc)

Language: JavaScript | License: GPL-3.0 | Stargazers: 1 | Issues: 4 | Issues: 4

asyncio-promise

JS Promises For Asyncio

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

base-browser

Base Containerized Browser Image

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0
Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

browser-chrome

Chrome containerized browser for Webrecorder

Language: Shell | Stargazers: 0 | Issues: 2 | Issues: 0

browsers

oldweb.today Remote/Containerized Browser System

Language: JavaScript | Stargazers: 0 | Issues: 2 | Issues: 0

dot-files

My setup :)

Language: Shell | Stargazers: 0 | Issues: 2 | Issues: 0
Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

LegionY530Ubuntu

Guide for installing Ubuntu on the Legion Y530

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

mantha

Webpack 4 Vue.js typescript-friendly starter kit with A LOT of automated processes

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

pyee2

Node.js EventEmitter3 for Python

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

pywb

Python WayBack for web archive replay and url-rewriting HTTP/S web proxy

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 2 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 2 | Issues: 0
Language: JavaScript | Stargazers: 0 | Issues: 2 | Issues: 0

urlcanon

URL canonicalization library for Python and Java

Language: Java | Stargazers: 0 | Issues: 2 | Issues: 0
Language: JavaScript | Stargazers: 0 | Issues: 2 | Issues: 0

warcio

Streaming WARC/ARC library for fast web archive IO

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

warcit

Convert Directories, Files and ZIP Files to Web Archives (WARC)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

warcworker

A dockerized, queued, high-fidelity web archiver based on Squidwarc

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 2 | Issues: 0

webrecorder

Web Archiving For All!

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

webrecorder-player

Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)

Language: JavaScript | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

webrecorder-tests

QA tests for webrecorder player (WORK IN PROGRESS)

Language: Shell | Stargazers: 0 | Issues: 0 | Issues: 0