databricks / notebook-best-practices

An example showing how to apply software engineering best practices to Databricks notebooks.

Software engineering best practices for Databricks notebooks

This repository is a companion for the example article "Software engineering best practices for Databricks notebooks" (AWS | Azure | GCP).

Going through the example, you will:

  • Add notebooks to Databricks Repos for version control.
  • Extract portions of code from one of the notebooks into a shareable component.
  • Test the shared code.
  • Automatically run notebooks from Git on a schedule using a Databricks job.
  • Optionally, apply CI/CD to the notebooks and the shared code.

The example is hands-on. We recommend working it step-by-step to learn how to apply these techniques to your own Databricks notebooks.
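As a rough illustration of the "extract" and "test" steps listed above, the sketch below moves a piece of notebook logic into plain Python functions and checks them with a pytest-style test. The function names (`filter_by_country`, `total_cases`), the row layout, and the sample data are hypothetical and are not taken from this repository. The sketch uses plain dictionaries instead of Spark or pandas so it runs anywhere.

```python
# Hypothetical shared module: logic that previously lived in a notebook cell.
# All names and data shapes here are illustrative assumptions.
from typing import Iterable, List


def filter_by_country(rows: Iterable[dict], country: str) -> List[dict]:
    """Return only the rows whose 'country' field equals `country`."""
    return [row for row in rows if row.get("country") == country]


def total_cases(rows: Iterable[dict]) -> int:
    """Sum the 'cases' field, treating missing or None values as zero."""
    return sum(row.get("cases") or 0 for row in rows)


# Pytest-style test; in practice it would live in a separate test file
# and import the functions above from the shared module.
def test_filter_and_total():
    rows = [
        {"country": "USA", "cases": 10},
        {"country": "USA", "cases": None},
        {"country": "Canada", "cases": 5},
    ]
    usa_rows = filter_by_country(rows, "USA")
    assert len(usa_rows) == 2
    assert total_cases(usa_rows) == 10
```

Once logic like this sits in a module, a notebook can import and call the functions instead of redefining them, and the same test can be run locally or as part of a CI/CD pipeline.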

License: Apache License 2.0

Languages: Python 100.0%