Bill's starred repositories
open-data-contract-standard
Home of the Open Data Contract Standard (ODCS).
awesome-duckdb
🦆 A curated list of awesome DuckDB resources
dbt-metabase
dbt + Metabase integration
spotify-stream-analytics
Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consumes and processes Kafka data, saving it to the Datalake. Airflow orchestrates the pipeline. dbt moves data to Snowflake, transforms it, and creates dashboards.
fastapi-alembic-sqlmodel-async
This is a project template which uses FastAPI, Pydantic 2.0, Alembic and async SQLModel as ORM. It shows a complete async CRUD using authentication and role base access control.
system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
yaml2pyclass
Code generator that produces a Python class from a YAML input file. Can be used to facilitate code completion for config objects.
github-issue-templates
🔣 A collection of GitHub issue, pull request and security templates
little-book-of-pipelines
This repository goes over how to handle massive variety in data engineering
audiophile-e2e-pipeline
Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard.
ActivitySchema
Repository for the ActivitySchema spec and supporting materials
awesome-public-datasets
A topic-centric list of HQ open datasets.