Richard Haussmann's starred repositories
airflow_backfill_plugin
An Airflow plugin providing an admin UI to conveniently start backfills. Usable with Airflow 1, Airflow 2, and Cloud Composer.
tricking-data-science
A book of subtle code tricks and gem resources for all things data, machine learning and deep learning.
awesome-falsehood
😱 Falsehoods Programmers Believe in
Bash-Oneliner
A collection of handy Bash One-Liners and terminal tricks for data processing and Linux system maintenance.
basic-computer-games
An updated version of the classic "Basic Computer Games" book, with well-written examples in a variety of common MEMORY SAFE, SCRIPTING programming languages. See https://coding-horror.github.io/basic-computer-games/
macos-setup
Quickly setting up my new macOS machines with a Brewfile, dotfiles, and simple bash scripts.
SwiftDefaultApps
Replacement for RCDefaultApps, written in Swift.
net.jgp.books.spark.ch01
Spark in Action, 2nd edition - chapter 1 - Introduction
pyspark-example-project
Implementing best practices for PySpark ETL jobs and applications.
White-box-Cartoonization
Official TensorFlow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”
sentry_airflow
Airflow integration with Sentry (https://sentry.io)
discreETLy
ETLy is an add-on dashboard service on top of Apache Airflow.
ge_tutorials
Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.
senator-filings
Scrape public filings of the buy + sell orders of U.S. senators and calculate their returns
avatarify-python
Avatars for Zoom, Skype and other video-conferencing apps.
org-journal
A simple org-mode based journaling mode
1on1-questions
Mega list of 1 on 1 meeting questions compiled from a variety of sources
powerful-questions
Powerful questions - catalyzing insight, innovation, action
ohmyzsh
🙃 A delightful community-driven (with 2,300+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python, etc.), 140+ themes to spice up your morning, and an auto-update tool that makes it easy to keep up with the latest updates from the community.
awesome-data-engineering
A curated list of data engineering tools for software developers
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
cce-python
Python tools for processing data from the Catalog of Copyright Entries