Nayanexx.py (nayanex)

nayanex

Geek Repo

Location:localhost

Github PK Tool:Github PK Tool

Nayanexx.py 's starred repositories

Language:Jupyter NotebookStargazers:83Issues:0Issues:0

databricks-sdk-py

Databricks SDK for Python (Beta)

Language:PythonLicense:Apache-2.0Stargazers:327Issues:0Issues:0

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Language:ScalaLicense:Apache-2.0Stargazers:7319Issues:0Issues:0

bundle-examples

Examples of Databricks Asset Bundles

Language:PythonLicense:NOASSERTIONStargazers:56Issues:0Issues:0

DependencyCheck

OWASP dependency-check is a software composition analysis utility that detects publicly disclosed vulnerabilities in application dependencies.

Language:JavaLicense:Apache-2.0Stargazers:6212Issues:0Issues:0

ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Language:PythonLicense:MITStargazers:12282Issues:0Issues:0

PyData-Global-2023-Improving-Open-Data-Quality-using-Python

This repo has the complete materials of the tutorial session Improving Open Data Quality using Python, presented at PyData Global 2023 conference

Language:Jupyter NotebookStargazers:7Issues:0Issues:0

community-demo-2024-04-16

Demos for the community meeting

Language:Jupyter NotebookStargazers:2Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:15Issues:0Issues:0

gx-databricks-bigquery-public

How to leverage the power of Databricks notebooks and GX data quality checks to create validated data workflows

Language:PythonLicense:MITStargazers:5Issues:0Issues:0

hands-on-great-expectations-with-spark

How to evaluate the Quality of your Data with Great Expectations and Spark.

Language:Jupyter NotebookLicense:MITStargazers:28Issues:0Issues:0

Azure-Databricks-NYC-Taxi-Workshop

An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset

Language:ScalaLicense:MITStargazers:104Issues:0Issues:0

DataFactoryCICD

Complete Azure Data Factory CICD Process Via Azure Pipeline

Language:BicepStargazers:15Issues:0Issues:0

gq-great-expectations

Great Expectations Data Quality Checks is a specialized repository designed to harness the capabilities of the great_expectations Python library. With a focus on ensuring data quality, this project provides robust tools and methodologies to validate and check data across various sources.

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

great_expectations

Always know what to expect from your data.

Language:PythonLicense:Apache-2.0Stargazers:9711Issues:0Issues:0

lakeFS

lakeFS - Data version control for your data lake | Git for data

Language:GoLicense:Apache-2.0Stargazers:4258Issues:0Issues:0

data_quality

The repo contains a few notebooks for you to get started with Databricks and Great Expectations. Have fun!

Language:PythonStargazers:1Issues:0Issues:0

drunken-data-quality

Spark package for checking data quality

Language:ScalaLicense:Apache-2.0Stargazers:222Issues:0Issues:0

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Language:ScalaLicense:Apache-2.0Stargazers:3195Issues:0Issues:0

dataframe-rules-engine

Extensible Rules Engine for custom Dataframe / Dataset validation

Language:ScalaLicense:NOASSERTIONStargazers:134Issues:0Issues:0

python-deequ

Python API for Deequ

Language:PythonLicense:Apache-2.0Stargazers:677Issues:0Issues:0

GitPython

GitPython is a python library used to interact with Git repositories.

Language:PythonLicense:BSD-3-ClauseStargazers:4512Issues:0Issues:0
Language:C#License:NOASSERTIONStargazers:477Issues:0Issues:0

azure.datafactory.tools

Tools for deploying Data Factory (v2) in Microsoft Azure

Language:PowerShellLicense:MITStargazers:208Issues:0Issues:0

azure-devops-extension-sample

Sample web extension for Azure DevOps

Language:TypeScriptLicense:MITStargazers:236Issues:0Issues:0

cli

Databricks CLI

Language:GoLicense:NOASSERTIONStargazers:117Issues:0Issues:0

terragrunt

Terragrunt is a flexible orchestration tool that allows Infrastructure as Code written in OpenTofu/Terraform to scale.

Language:GoLicense:MITStargazers:7833Issues:0Issues:0

azure-agent-self-hosted-toolkit

Toolkit to run azure agents under linux

Language:ShellStargazers:21Issues:0Issues:0

python-certifi

(Python Distribution) A carefully curated collection of Root Certificates for validating the trustworthiness of SSL certificates while verifying the identity of TLS hosts.

Language:PythonLicense:NOASSERTIONStargazers:799Issues:0Issues:0

azure-devops-cli-extension

Azure DevOps Extension for Azure CLI

Language:PythonLicense:MITStargazers:618Issues:0Issues:0