Jon Zajac's starred repositories
nginx-ultimate-bad-bot-blocker
Nginx Block Bad Bots, Spam Referrer Blocker, Vulnerability Scanners, User-Agents, Malware, Adware, Ransomware, Malicious Sites, with anti-DDOS, Wordpress Theme Detector Blocking and Fail2Ban Jail for Repeat Offenders
yacy_search_server
Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance
AmpliGraph
Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org
awesome-semantic-web
A curated list of various semantic web and linked data resources.
sqllineage
SQL Lineage Analysis Tool powered by Python
spf-dkim-dmarc-simplified
Email security is a key part of internet communication. But what are SPF, DKIM, and DMARC, and how do they work? This guide will explain it all in simple terms to make these concepts clearer.
wtf_wikipedia
a pretty-committed wikipedia markup parser
mwparserfromhell
A Python parser for MediaWiki wikicode
awesome-public-real-time-datasets
A list of publicly available datasets with real-time data maintained by the team at bytewax.io
view_component-contrib
A collection of extension and developer tools for ViewComponent
security-policies
Security policies for Tailscale
dumpster-dive
roll a wikipedia dump into mongo
skills-api
Skills API
annotaterb
A Ruby Gem that adds annotations to your Rails models and route files.
wiki-tools
Code for my Wikimedia Labs Tools account
simple-wikidata-db
A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.
hooli-data-eng-pipelines
Example Dagster Cloud code for the Hooli Data Engineering organization.
awesome-dagster
All things awesome related to Dagster!
awesome-wikidata
Curated list of Wikidata Projects
dumpster-dip
parse a wikipedia dump into tiny files