John Schroeder (johnlovesdata)

Company: Craftjack

Location: Chicago, IL

John Schroeder's starred repositories

etl_spark_airflow_emr

Capstone project of the data engineer course at Udacity

Language: Jupyter Notebook | License: MIT | Stargazers: 4 | Issues: 0

polars-cookbook

Recipes for using Python's polars library

Language: Jupyter Notebook | Stargazers: 242 | Issues: 0

vscode-python

A tutorial for setting up a Python development environment with VS Code and Docker

Language: Shell | Stargazers: 837 | Issues: 0

awesome-data-leadership

A curated list of awesome posts, videos, and articles on leading a data team (small and large)

Stargazers: 516 | Issues: 0

machine-learning-zoomcamp

Learn ML engineering for free in 4 months!

Language: Jupyter Notebook | Stargazers: 8830 | Issues: 0

data-engineering-zoomcamp

Free Data Engineering course!

Language: Jupyter Notebook | Stargazers: 24365 | Issues: 0

lightdash

Self-serve BI to 10x your data team ⚡️

Language: TypeScript | License: MIT | Stargazers: 3720 | Issues: 0

dbt-tips

Collection of dbt Tips and Tricks

License: GPL-3.0 | Stargazers: 359 | Issues: 0

spectacles

A continuous integration tool for Looker and LookML.

Language: Python | License: MIT | Stargazers: 214 | Issues: 0

data_engineering_on_gcp_book

A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform.

Stargazers: 120 | Issues: 0

prefect-demo-flow

This is the code accompanying the blog article on makeitnew.io. It defines a Prefect flow that can be visualized, run locally, or registered in Prefect Cloud.

Language: Python | Stargazers: 28 | Issues: 0

prefect

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Language: Python | License: Apache-2.0 | Stargazers: 15749 | Issues: 0

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language: Python | License: Apache-2.0 | Stargazers: 36085 | Issues: 0

howtheydbt

A curated collection of publicly available resources on dbt best practices and how data-driven organizations around the world utilize dbt

License: CC0-1.0 | Stargazers: 113 | Issues: 0

documentation-pipeline-generator

Documentation and Pipeline Generator.

Language: Python | License: MIT | Stargazers: 25 | Issues: 0

datasets-for-good

List of datasets to apply stats/machine learning/technology to the world of social good.

Stargazers: 235 | Issues: 0

best-of-ml-python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

License: CC-BY-SA-4.0 | Stargazers: 16209 | Issues: 0

airflow-repo-template

The easiest way to run Airflow locally, with linting & tests for valid DAGs and Plugins.

Language: Python | License: MIT | Stargazers: 238 | Issues: 0

data-engineering-book

Accumulated knowledge and experience in the field of Data Engineering

Stargazers: 844 | Issues: 0

gazpacho

🥫 The simple, fast, and modern web scraping library

Language: Python | License: MIT | Stargazers: 742 | Issues: 0

amundsen

Amundsen is a metadata-driven application for improving the productivity of data analysts, data scientists, and engineers when interacting with data.

Language: Python | License: Apache-2.0 | Stargazers: 4374 | Issues: 0

howtheytest

A collection of public resources about how software companies test their software

Language: HTML | License: CC0-1.0 | Stargazers: 5937 | Issues: 0

data-engineer-roadmap

Roadmap to becoming a data engineer in 2021

Stargazers: 12279 | Issues: 0

az-kung-fu

Repo for the Azure CLI Kung Fu series on Build5Nines.com

Language: Shell | License: MIT | Stargazers: 79 | Issues: 0

goodreads_etl_pipeline

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

Language: Python | License: MIT | Stargazers: 1270 | Issues: 0

Awesome-CV

📄 Awesome CV is a LaTeX template for your outstanding job application

Language: TeX | License: LPPL-1.3c | Stargazers: 22678 | Issues: 0

dbt-init

A dbt-init script for consulting projects

Language: Python | License: Apache-2.0 | Stargazers: 59 | Issues: 0

sql-style-guide

An opinionated guide for writing clean, maintainable SQL.

Stargazers: 1013 | Issues: 0

azure-mol-samples

Supporting resources for "Learn Azure in a Month of Lunches" (Manning Publications)

Language: Shell | License: MIT | Stargazers: 715 | Issues: 0

awesome-apache-airflow

Curated list of resources about Apache Airflow

Language: Shell | Stargazers: 3646 | Issues: 0