Jonathan Bowker's repositories
ahmia-crawler
Collection of crawlers used by the ahmia search engine
AzureML-BERT
End-to-end recipes for pre-training and fine-tuning BERT using Azure Machine Learning service
cortana-intelligence-personalized-offers
Generate real-time personalized offers on a retail website to engage more closely with customers.
csdl-samples
Sample CSDL filters showcasing the functionality of the DataSift platform
dbpedia-spotlight
DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text.
ELK_twitter
This is a data pipeline for Twitter (ETL) using the elastic stack Elasticsearch, Logstash and Kibana (version 6.1)
fusion-examples
This repository contains various examples for running and working with Lucidworks Fusion. Lucidworks Fusion may be downloaded at http://lucidworks.com/fusion/download/
intelmq-feeds-documentation
Cyber Threat Intelligence Feeds
kibi
Kibi is a friendly - kept in sync - Kibana fork which add support for joins across indexes and external sources, tabbed navigation interface and more
major-scrapy-spiders
Scrapy spiders of major websites. Play Store, Facebook, Instagram, Ebay, Amazon
memex-explorer
Viewers for statistics and dashboarding of Domain Search Engine data
neo4j-dbpedia-importer
DBpedia.org RDF to CSV for import into Neo4j
noslegal
noslegal taxonomy facets and release notes
panama-papers-dataset-2016
Structured data about Panama papers collected from official ICIJ website
scrapy-elasticsearch
A scrapy pipeline which send items to Elastic Search server
Scrapy-Samples
Scrapy examples crawling Craigslist
spotlight-docker
Docker containers for DBpedia Spotlight