Kieran O'Leary's starred repositories
ArchiveBox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
duckstation
Fast PlayStation 1 emulator for x86-64/AArch32/AArch64/RV64
bulk_extractor
This is the development tree. Production downloads are at:
archiveweb.page
A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!
instamancer
Scrape Instagram's API with Puppeteer
webrecorder-desktop
Webrecorder Desktop App!
pdf-issues
Industry-based resolutions for issues and errata reported against any PDF-related specification
IFIscripts
IFIscripts is an open-source digital preservation tool which facilitates collection management workflows within the IFI and further afield. It is freely available from the GitHub repository and subject to modification depending on the progressive needs of collections and based upon policies and preservation standards.
demystify
Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store them within a SQLite database; create additional columns to augment the output where useful; and query the SQLite database, outputting results in a readable form useful for analysis by researchers and archivists within digital preservation departments in memory institutions. The tool will find duplicates, unidentified files, blacklisted objects, character encoding issues, and more.
Digital-Preservation-Headaches
Digital Preservation Headaches
sumfolder1
What is the checksum of a directory?
sight-and-sound
Linked Open Data experiment with data from the 2012 Sight and Sound Film Poll.
2021-4-26-sfc-dc
SFC-DC-2021 Workshop 3: R, Regular Expressions, SQL
untl-schemas
A collection of XML schemas used by the UNT Digital Libraries.