duyet / soda-core

Data profiling, testing, and monitoring for SQL accessible data.

Home Page:https://docs.soda.io/soda-core/overview.html

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Soda logo

Soda Core

Data testing, monitoring and profiling for SQL-accessible data.

License: Apache 2.0 Slack

Soda Core is a free, open-source, command-line tool that enables you to use the Soda Checks Language to turn user-defined input into aggregated SQL queries.

When it runs a scan on a dataset, Soda Core executes the checks to find invalid, missing, or unexpected data. When your Soda Checks fail, they surface the data that you defined as “bad”.

Connect Soda Core to your data source, then define your Soda Checks for data quality in a checks.yml file. Use Soda Core to run scans of your data to execute the checks you defined.

Get started

Soda Core currently supports PosgreSQL, Amazon Redshift, GCP BigQuery, and Snowflake. To get started, use the install command, replacing soda-postgres with the package that matches your data source.

pip install soda-postgres

  • soda-core-postgres
  • soda-core-redshift
  • soda-core-bigquery
  • soda-core-snowflake

Documentation

About

Data profiling, testing, and monitoring for SQL accessible data.

https://docs.soda.io/soda-core/overview.html

License:Apache License 2.0


Languages

Language:Python 99.1%Language:ANTLR 0.5%Language:Shell 0.4%