Shawn M. Jones (shawnmjones)

Company: Los Alamos National Laboratory, Old Dominion University

Location: Santa Fe, NM

Home Page: https://www.shawnmjones.org

Twitter: @shawnmjones


Organizations
lanl
oduwsdl

Shawn M. Jones's repositories

OffTopic-Detection

This system evaluates a series of mementos (archived web pages) to determine which are off topic. The series can be part of an Archive-It collection, a single TimeMap, or stored in a WARC file.

Language: Python | License: MIT | Stargazers: 1 | Issues: 3 | Issues: 0

archivenow

A Tool To Push Web Resources Into Web Archives

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

brozzler

brozzler - distributed browser-based web crawler

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

collection-stories

This repository exists to share stories generated from the Dark and Stormy Archives project.

Stargazers: 0 | Issues: 1 | Issues: 0

cs595-f13

Shared repository for ODU CS 495 / 595 Fall 2013

Language: DOT | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

cs895-f20

ODU CS 795/895 Web Archiving Forensics, Fall 2020.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

dsa-rainpuddle

This project implements the visualization components for the Dark and Stormy Archives project.

Language: HTML | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

government-sites-archive-projects

This repository contains work done to determine how much of www.guideline.gov and qualitymeasures.ahrq.gov was archived.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

hr-contracting

CS 825 project showing the geography of federal contracting in Hampton Roads

Language: TeX | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

iipc-dsa-work

This repository contains work done on the IIPC Dark and Stormy Archives grant.

Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0

JCDL2023-website-source

The source of the JCDL 2023 website.

Language: HTML | Stargazers: 0 | Issues: 2 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0

mediawiki

A Memento Plugin for MediaWiki

Language: PHP | Stargazers: 0 | Issues: 1 | Issues: 0

py-memento-client

A Memento Client Library in Python

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

python-boilerpipe

Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

pywb

Core Python Web Archiving Toolkit for replay and recording of web archives

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

robustlinks

Links on the web break all the time, robustify them!

Language: JavaScript | Stargazers: 0 | Issues: 0 | Issues: 0

shawnmjones.github.io

Shawn's GitHub Web Site

Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0

shot-scraper-test

https://www.ap.org/en

Stargazers: 0 | Issues: 0 | Issues: 0

sqlite3worker

A threadsafe sqlite worker for Python

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

sumgram

sumgram is a tool that summarizes a collection of text documents by generating the most frequent sumgrams (multiple ngrams)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Timemap.py

Python class to parse and simplify access to Memento timemaps.

Language: Python | License: GPL-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

unix-env

These are the generic login scripts I use.

Language: Shell | Stargazers: 0 | Issues: 1 | Issues: 0

VisHash

Visual Hash for matching copies of visually similar images.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

wren

Experiments in testable, scalable crawler architectures

Language: PHP | License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

wsdlthesis

ODU WS-DL Thesis/Dissertation LaTeX Template

Language: TeX | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0