Bill's starred repositories

open-data-contract-standard

Home of the Open Data Contract Standard (ODCS).

Language:ShellLicense:Apache-2.0Stargazers:277Issues:0Issues:0

v4

Fourth iteration of my personal website built with Gatsby

Language:JavaScriptLicense:MITStargazers:7376Issues:0Issues:0

awesome-duckdb

🦆 A curated list of awesome DuckDB resources

License:CC0-1.0Stargazers:1127Issues:0Issues:0

dbt-metabase

dbt + Metabase integration

Language:PythonLicense:MITStargazers:448Issues:0Issues:0

sqlglot

Python SQL Parser and Transpiler

Language:PythonLicense:MITStargazers:6113Issues:0Issues:0

cockroach

CockroachDB - the open source, cloud-native distributed SQL database.

Language:GoLicense:NOASSERTIONStargazers:29596Issues:0Issues:0

spotify-stream-analytics

Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consumes and processes Kafka data, saving it to the Datalake. Airflow orchestrates the pipeline. dbt moves data to Snowflake, transforms it, and creates dashboards.

Language:PythonStargazers:65Issues:0Issues:0

fastapi-alembic-sqlmodel-async

This is a project template which uses FastAPI, Pydantic 2.0, Alembic and async SQLModel as ORM. It shows a complete async CRUD using authentication and role base access control.

Language:PythonLicense:MITStargazers:903Issues:0Issues:0

system-design-101

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

License:NOASSERTIONStargazers:60929Issues:0Issues:0

arguably

The best Python CLI library, arguably.

Language:PythonLicense:NOASSERTIONStargazers:354Issues:0Issues:0

yaml2pyclass

Code generator that produces a Python class from a YAML input file. Can be used to facilitate code completion for config objects.

Language:PythonLicense:BSD-3-ClauseStargazers:24Issues:0Issues:0

github-issue-templates

🔣 A collection of GitHub issue, pull request and security templates

License:NOASSERTIONStargazers:4087Issues:0Issues:0

little-book-of-pipelines

This repository goes over how to handle massive variety in data engineering

Language:ScalaStargazers:72Issues:0Issues:0
License:NOASSERTIONStargazers:13Issues:0Issues:0

audiophile-e2e-pipeline

Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard.

Language:PythonStargazers:186Issues:0Issues:0

ActivitySchema

Repository for the ActivitySchema spec and supporting materials

License:Apache-2.0Stargazers:391Issues:0Issues:0

Rdatasets

A collection of datasets originally distributed in R packages

Language:HTMLLicense:NOASSERTIONStargazers:296Issues:0Issues:0

awesome-public-datasets

A topic-centric list of HQ open datasets.

License:MITStargazers:59488Issues:0Issues:0