dah33 / dbt-dada

dada: data about data [for dbt]

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

dada: data about data [for dbt]

dada is an add-on package for dbt that helps you, the heroic data analyst or engineer, rapidly explore and profile the data in your data warehouse.

It can be used interactively for exploration, or on a schedule to track the evolution of data.

Supported databases

  • PostgreSQL

Other databases may work as much of the code is standard SQL. Please contribute by testing dada on your database, and raising an issue if it doesn't work as expected.

Inspiration

  • R's summary() and glimpse() functions
  • Python's pandas_profiling module
  • data-mie's dbt_profiler package
  • csvkit's csvstat command
  • Adam Aspin's article Data Profiling with T-SQL

About

dada: data about data [for dbt]

License:Apache License 2.0