Dumky de Wilde's starred repositories
Crawling-Infrastructure
Distributed crawling infrastructure running on top of severless computation, cloud storage (such as S3) and sophisticated queues.
IP-Address-API
This repository contains instructions how to use the free IP Address API. The databases are: ASN database, Geolocation database, hosting ranges database.
Open-Cookie-Database
The Open Cookie Database is an effort to describe and categorise all major cookies. All cookie descriptions are saved in a downloadable CSV file. All contributions to the CSV file are welcomed.
ActivitySchema
Repository for the ActivitySchema spec and supporting materials
dbt-snowplow-web
A fully incremental model, that transforms raw web event data generated by the Snowplow JavaScript tracker into a series of derived tables of varying levels of aggregation.
cloud_cost_monitoring
Repo for holding the dbt project used to make sense of cloud cost data from the major cloud platforms
snowplow-mini
An easily-deployable, single-instance version of Snowplow
awesome-dbt
A curated list of awesome dbt resources
bigquery-utils
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
compromise
modest natural-language processing
awesome-analytics
A curated list of analytics frameworks, software and other tools.
gtm-doctags
Automatically generate a GTM documentation site using the Google Tag Manager notes and Docsify
analytics-testing-puppeteer
A simple example of using Puppeteer to test your analytics setup
gtm-export-tools
GTM Export Tools gives you the power to easily manipulate your GTM container without giving others access.
knitr-examples
A collection of knitr examples