Evanto / awesome-data-catalogs

Awesome Data Catalogs and Observability Solutions

Awesome Data Discovery and Observability

This repository contains a curated collection of awesome data catalogs and observability platforms that help you discover, observe, and manage data in your organization.


Contents: Existing Data Discovery and Observability Solutions

| OSS | Proprietary | Monocloud | Observability |
|---|---|---|---|
| 📓 Amundsen | 📓 Collibra | 📓 Google DC | 🔍 Monte Carlo |
| 📓 DataHub | 📓 Informatica | 📓 Azure DC | 🔍 Databand |
| 📓 Marquez | 📓 Alation | | 🔍 Datafold |
| 📓 Atlas | 📓 Atlan | | 🔍 Ataccama |
| 📓 CKAN | 📓 Stemma | | |
| 📓 Magda | | | |

Open-Source Data Catalogs

Amundsen

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: No
  • AI autowiring: No
  • Network-based: Yes
  • Rich data profiling: No
  • Search-based: Yes
  • Recommendations: Yes
  • Schemas, Description: Yes
  • Complex schemas: No
  • Data preview: Yes
  • Column statistics: Yes
  • Data owner: Yes
  • Top data users: Yes
  • Change notifications: No
  • Change feed: No
  • Supported data sources: Hive, Redshift, Druid, RDBMS, Presto, Snowflake

DataHub

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
More features
  • Strategy: Push, Pull
  • UX personalization: No
  • AI autowiring: No
  • Network-based: Yes
  • Rich data profiling: No
  • Supported data sources: Hive, Kafka, RDBMS

Marquez

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| OpenLineage | ❌ | ❌ | ❌ | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: No
  • AI autowiring: No
  • Network-based: No
  • Rich data profiling: No
  • Supported data sources: S3, Kafka

Atlas

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: No
  • AI autowiring: No
  • Network-based: No
  • Rich data profiling: No
  • Supported data sources: HBase, Hive, Sqoop, Kafka, Storm

CKAN

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: No
  • AI autowiring: No
  • Network-based: No
  • Rich data profiling: No
  • Supported data sources:

Magda

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ✔️ | ❌ | ❌ | ❌ | ❌ |
More features
  • Strategy: Push via UI
  • UX personalization: No
  • AI autowiring: No
  • Network-based: No
  • Rich data profiling: No
  • Supported data sources: Mostly GeoData

Proprietary Data Catalogs

Collibra

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ? | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: Yes
  • AI autowiring: ?
  • Network-based: No
  • Rich data profiling: ?
  • Supported data sources:

Informatica

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ? | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: ?
  • AI autowiring: ?
  • Network-based: Yes
  • Rich data profiling: Yes
  • Supported data sources:

Alation

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: Yes
  • AI autowiring: No
  • Network-based: No
  • Rich data profiling: No
  • Supported data sources:

Atlan

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ |
More features
  • Strategy: Pull
  • UX personalization: ?
  • AI autowiring: ?
  • Network-based: No
  • Rich data profiling: ?
  • Supported data sources: Presto, Deequ, Atlas, Airflow, Hudi

Ataccama

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ |
More features
  • Strategy: Pull
  • UX personalization: Yes
  • AI autowiring: No
  • Network-based: No
  • Rich data profiling: Yes
  • Supported data sources:

Stemma

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: No
  • AI autowiring: No
  • Network-based: No
  • Rich data profiling: No
  • Supported data sources:

Talend

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: Yes
  • AI autowiring: ?
  • Network-based: ?
  • Rich data profiling: Yes
  • Supported data sources:

Monocloud Data Catalogs

Google Cloud Data Catalog

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ? | ❌ | ❌ |
More features
  • Strategy: Pull
  • UX personalization: ?
  • AI autowiring: ?
  • Network-based: No
  • Rich data profiling: No
  • Supported data sources:

Azure Data Catalog

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ? | ❌ | ❌ |
More features
  • Strategy: Pull
  • UX personalization: ?
  • AI autowiring: ?
  • Network-based: ?
  • Rich data profiling: ?
  • Supported data sources:

Data Observability Platforms

Monte Carlo

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ✔️ | ❌ | ✔️ |
More features
  • Strategy: Pull
  • UX personalization: ?
  • AI autowiring: ?
  • Network-based: ?
  • Rich data profiling: ?
  • Supported data sources: Snowflake, Hive, Kafka, Looker, Redshift, Tableau, BigQuery, Airflow, Fivetran, Presto, Mode, Periscope, Databricks, Glue, dbt, Chartio, Spark, AWS, S3, data.world, Google Cloud Platform

Databand

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: ?
  • AI autowiring: ?
  • Network-based: ?
  • Rich data profiling: ?
  • Supported data sources:

Datafold

| Based on Open Standard | Federation | ML 1st Citizen | Data Quality | End-to-end Lineage | Observability |
|---|---|---|---|---|---|
| ❌ | ❌ | ❌ | ✔️ | ❌ | ❌ |
More features
  • Strategy: Push
  • UX personalization: ?
  • AI autowiring: ?
  • Network-based: ?
  • Rich data profiling: ?
  • Supported data sources:
