Robert Kreuzer's repositories
dataset-popular
A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.
dataset-random
A dataset of random pages with manually marked up semantic blocks.
python-boilerpipe
Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages
googleads-python-lib
The Python client library for Google's Ads APIs
minijinja
MiniJinja is a powerful but minimal dependency template engine for Rust compatible with Jinja/Jinja2
Apache-2.0000
sandcastle
A simple and powerful sandbox for running untrusted JavaScript.
TiddlyWiki5
A reboot of TiddlyWiki for the next 25 years
validictory
general purpose python data validator