JP (he/him)'s starred repositories
octocatalog
Nicely modeled data built on the GitHub Archive.
system-design-101
Explains complex systems using visuals and simple terms. Helps you prepare for system design interviews.
datamesh-architecture.com
Data Mesh Architecture
postgresml
The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.
pvldb-announce
PVLDB Paper Announcement Script
ec2instances.info
Amazon EC2 instance comparison site
data-engineering-meetup-in-a-box
A collection of guides, resources, and support for DE meetup organizers.
data-engineering-salaries
A Streamlit app to explore data engineering salary data.
SparkLearning
A comprehensive Spark guide collated from multiple sources, useful for learning more about Spark or as an interview refresher.
data-engineering-practice
Data Engineering Practice Problems
obsidian-modular-css-layout
CSS Layout hack for Obsidian.md
dbt-codegen
Macros that generate dbt code
little-book-of-pipelines
This repository covers how to handle massive variety in data engineering.
data_engineering_project_template
A template repository for creating a data project with IaC, CI/CD, data migrations, and testing.
metricflow
MetricFlow allows you to define, build, and maintain metrics in code.
private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
countries-states-cities-database
🌍 Discover our global repository of countries, states, and cities! 🏙️ Get comprehensive data in JSON, SQL, PSQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, timezones (for countries), and more. #countries #states #cities
wiki-reddit-bot
Public repo of the bot