Conrad's starred repositories
unitycatalog
Open, Multi-modal Catalog for Data & AI
awesome-apache-airflow
Curated list of resources about Apache Airflow
spark-monitoring
Monitoring Azure Databricks jobs
kafka-delta-ingest
A highly efficient daemon for streaming data from Kafka into Delta Lake
pyre-check
Performant type-checking for Python.
dbt-expectations
Port(ish) of Great Expectations to dbt test macros
delta-sharing
An open protocol for secure data sharing
podman-compose
A script to run docker-compose.yml using podman
applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
awesome-data-catalogs
📙 Awesome Data Catalogs and Observability Platforms.
azure.datafactory.tools
Tools for deploying Data Factory (v2) in Microsoft Azure
databricks-nutter-repos-demo
Demo of using Nutter to test Databricks notebooks in a CI/CD pipeline
notebook-best-practices
An example showing how to apply software engineering best practices to Databricks notebooks.
azure-sdk-for-python
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://docs.microsoft.com/python/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-python.
azure-pipelines-terraform
Azure Pipelines tasks for installing Terraform and running Terraform commands in a build or release pipeline.
pre-commit
A framework for managing and maintaining multi-language pre-commit hooks.
azure-pipelines-tasks
Tasks for Azure Pipelines