databand-ai / dbnd

DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.

Home Page:https://databand.ai/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

pipeline status

PyPI - Downloads PyPI PyPI - Python Version PyPI - License Code style:

DBND

DBND an open source framework for building and tracking data pipelines. DBND is used for processes ranging from data ingestion, preparation, machine learning model training and production.

DBND includes a Python library, set of APIs, and CLI that enables you to collect metadata from your workflows, create a system of record for runs, and easily orchestrate complex processes.

DBND simplifies the process of building and running data pipelines from dbnd import task

from dbnd import task

@task
def say_hello(name: str = "databand.ai") -> str:
    value = "Hello %s!" % name
    return value

And makes it easy to track your critical pipeline metadata

from dbnd import log_metric, log_dataframe

log_dataframe("my_dataset", my_dataset)
log_metric("r2", r2)

Getting Started

See our documentation with examples and quickstart guides to get up and running with DBND.

The Latest and Greatest

For using DBND, we recommend that you work with a virtual environment like Virtualenv or Conda. Update to the latest and greatest:

pip install dbnd

If you would like access to our latest features, or have any questions, feedback, or contributions we would love to here from you! Get in touch through contact@databand.ai

About

DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.

https://databand.ai/

License:Apache License 2.0


Languages

Language:Python 90.2%Language:Java 8.2%Language:C 0.8%Language:Scala 0.4%Language:Makefile 0.2%Language:Jupyter Notebook 0.1%Language:Shell 0.1%Language:HiveQL 0.0%Language:Dockerfile 0.0%