httpreserve suite's repositories
httpreserve
Digital Preservation of HTTP in documentary heritage.
tikalinkextract
Tika based link (URL) extractor for httpreserve
linkscanner
A helper package to tokenize textual content and retrieve hyperlinks
awesome-web-archiving
An Awesome List for getting started with web archiving
conventoarchiver
Repository for collecting scripts to help capture MyConvento newsroom press-releases from the MyConvento PR management suite. The README provides an analysis of the MyConvento URL architecture for users hoping to develop a solution for themselves.
phantomjsscreenshot
A wrapper for phantom.js commands for headless screenshots.
million-dollar-webpage
HTTPreserve Analysis of Million Dollar Web Page
eaccession-research
A repository to store data associated with HTTPreserve research on Archive NZ's born digital material.
gnomescreenshot
Wrapper for gnome-web-photo for httpreserve demos
simplerequest
Minimal HTTP requests for Golang
urlgetter
Script to disambiguate domain names from where they actually point to.