There are 12 repositories under data-validation topic.
A React component for building Web forms from JSON Schema.
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
A light-weight, flexible, and expressive statistical data testing library
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Validation library with type-safe schemas and rules
Automatically find issues in image datasets and practice data-centric computer vision.
Data quality assessment and metadata reporting for data frames and database tables
Google ReCaptcha package for Laravel
Coercion and validation for data structures
The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.
A proxy that validates responses and requests against an OpenAPI document. https://www.npmjs.com/package/openapi-cop https://hub.docker.com/r/lxlu/openapi-cop
Data validation toolkit for assessing and monitoring data quality.
Data Cleaning Libraries with Python
The Open Data Editor (ODE) is a no-code application to explore, validate and publish data in a simple way. Forever free and open source project powered by the Frictionless Framework.
Powerful CSV & Excel Import experience for SaaS 🚀 Save months building data import experience from scratch 💰
âš“ Eurybia monitors model drift over time and securizes model deployment with data validation
A lightweight JSON decoding library for TypeScript
Validator for the Brain Imaging Data Structure
Typical: Fast, simple, & correct data-validation using Python 3 typing.
AtroPIM is a flexible, highly configurable, modular, open-source product information management (PIM) system that extends the AtroCore data management and system integration platform.
Open Source Data Quality Monitoring.
Accelerates migrations to Databricks by automating key migration activities
A tool to validate data, built around Apache Spark.
Create "immutable" objects with no setters, just getters.
A simple and easy to use Data Validation library for Python.