Thomas Arrow's repositories
quickscrape
A scraping command line tool for the modern web
aaraa
ContentMine Dictionary Browser, Visualiser and Creator
BiblioWikidata
Various scripts for adding bibliographic information to Wikidata
canary
Canary is a UI to the contentmine tools getpapers, quickscrape, norma, and ami.
contentmine.org
The static site
elasticsearch-dump
Import and export tools for elasticsearch
elasticsearch-js
Official Elasticsearch client library for Node.js and the browser
es-stress
Stress an ES cluster with repeated connections
gerrit-cli
Gerrit in your command lines.
getpapers
Get metadata, fulltexts or fulltext URLs of papers matching a search query
hypothesisapi
A Python wrapper for the nascent hypothes.is web API
journal-scrapers
Journal scraper definitions for the ContentMine framework
lhasademos
Task Tracking and Code for the Lhasa Demos
mediawiki-docker-dev
Development environment for MediaWiki.
norma
Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML
thresher
Headless scraperJSON scraping for Node.js
wd-presentation-2017-06
The HTML Presentation Framework
wdic
Wikidata Item Converter
WikidataIntegrator
A Wikidata Python module integrating the MediaWiki API and the Wikidata SPARQL endpoint
wikifactmine-ms
workspace for improving main subject statement for paper items