Tvkoushik / gq-great-expectations

Great Expectations Data Quality Checks is a specialized repository designed to harness the capabilities of the great_expectations Python library. With a focus on ensuring data quality, this project provides robust tools and methodologies to validate and check data across various sources.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Great Expectations Data Quality Checks

Great Expectations Data Quality Checks is a specialized repository designed to harness the capabilities of the great_expectations Python library. With a focus on ensuring data quality, this project provides robust tools and methodologies to validate and check data across various sources.

Key Features

  • Great Expectations Integration: Seamlessly integrates with the great_expectations library to deliver advanced data quality checks.
  • Data Quality Framework: Explore the framework.ipynb notebook for a comprehensive look at setting up and running data quality checks on CSV data.
  • API Data Fetching: The fetch_api_data.py script offers a utility to fetch real-time weather data from an API.

Prerequisites

  • Python: Ensure you have a suitable version of Python installed, preferably 3.6 or newer.
  • Great Expectations: The core library that powers the data quality checks in this repository.

Getting Started

  1. Setup Environment: Install the required packages using the provided requirements.txt.
  2. Data Quality Framework: Navigate to the framework.ipynb notebook to get acquainted with the data quality checks and validations.
  3. Fetch Data: Use the fetch_api_data.py script to fetch real-time weather data from an API.

Contribution

Contributions are always welcome! If you have any enhancements, bug fixes, or suggestions:

  1. Fork the repository.
  2. Make the necessary changes or additions.
  3. Submit a pull request.

Feedback, bug reports, and general discussions can also be initiated. We aim for a collaborative environment where everyone's input is valued.

License

For licensing details, please refer to the project's license documentation.

About

Great Expectations Data Quality Checks is a specialized repository designed to harness the capabilities of the great_expectations Python library. With a focus on ensuring data quality, this project provides robust tools and methodologies to validate and check data across various sources.


Languages

Language:Jupyter Notebook 83.8%Language:Python 14.1%Language:Dockerfile 2.1%