There are 4 repositories under analytics-engineering topic.
Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, DuckDB, Pandas, and Plotly, Matplotlib, etc. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.
A curated list of awesome dbt resources
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Main repo including core data model, data marts, data quality tests, and terminology sets.
The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)
dbt adapter for SQL Server and Azure SQL
🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.
Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate
Example repository showing how to build a data platform with Prefect, dbt and Snowflake
A Python package that creates fine-grained dbt tasks on Apache Airflow
🥪🏭 A simple CLI for generating synthetic Jaffle Shop data.
A data and analytics engineering platform designed for real-time sports betting analytics.
Kotlin Multiplatform Analytics with a debug viewer
A dbt (data build tool) project you can use for testing purposes or experimentation
Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.
Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.
A starter dbt project and synthetic claims dataset for trying out the Tuva Project.
dbt starter code for enterprise Snowflake usage data artifacts
✍️ dbt doc generator for advanced data teams
Automatically generate DBML files from Snowflake databases for quickly reverse engineer interactive ER diagrams and documentation from your Snowflake DB. Ideal for data engineers and analysts, it supports custom primary key configurations and relationship inference.
Readings for Analytics Engineers
This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.
The Tuva Project Docs i.e. where we write and share our knowledge about healthcare data and analytics.
Getting Started with Analytics Engineering
DataHut-DuckHouse is a modern, modular, and multi-tenant analytics platform that combines DuckDB, Apache Iceberg, Arrow Flight, dbt, and Trino to build a hybrid, lightweight, and scalable data stack ready for SaaS.
A universal metrics layer. Compatible with definitions in LookML, MetricFlow, Cube with DuckDB, Snowflake, Clickhouse, Bigquery & more!
Maps Medicare LDS claims data to the Tuva Input Layer so you can easily run the Tuva Project.
A dbt project that transforms messy public provider datasets into usable data for the Tuva Project.
Get started with Prefect by scheduling your Prefect flows with GitHub Actions
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
SQLContext is a tool for generating LLM context from database tables for consumption from IDEs