sathya-reddy-m's repositories

actionlint

:octocat: Static checker for GitHub Actions workflow files

Language:GoLicense:MITStargazers:0Issues:0Issues:0

architecture-center

Open Source documentation for the Azure Architecture Center on Microsoft Docs

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

awesome-azure-architecture

AWESOME-Azure-Architecture - https://aka.ms/AwesomeAzureArchitecture

License:CC0-1.0Stargazers:0Issues:0Issues:0

awesome-database-design

:zap: A collection of resources and tutorials to design a better database schema.

Stargazers:0Issues:0Issues:0

aws-saas-factory-eks-reference-architecture

This repository provides a reference architecture for building an end to end SaaS solution using Amazon Elastic Kubernetes Service (EKS)

License:MIT-0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

AzureTRE

An accelerator to help organizations build Trusted Research Environments on Azure.

License:MITStargazers:0Issues:0Issues:0

best-of-ml-python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

brickflow

Pythonic Programming Framework to orchestrate jobs in Databricks Workflow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ctakes

Apache cTAKES is a Natural Language Processing (NLP) platform for clinical text.

License:Apache-2.0Stargazers:0Issues:0Issues:0

data-diff

Compare tables within or across databases

License:MITStargazers:0Issues:0Issues:0

data-engineering-wiki

The best place to learn data engineering. Built and maintained by the data engineering community.

License:CC0-1.0Stargazers:0Issues:0Issues:0

DataAISummit2024

This repository contains code of Brickflow and SparkExpectations for DataAISummit2024

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

datacompy

Pandas and Spark DataFrame comparison for humans and more!

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ddedocs

Data Developer & Engineer Documents and Hands-On

License:MITStargazers:0Issues:0Issues:0

delta-examples

Delta Lake examples

License:Apache-2.0Stargazers:0Issues:0Issues:0

inlong

Apache InLong - a one-stop, full-scenario integration framework for massive data

License:Apache-2.0Stargazers:0Issues:0Issues:0

lakehouse-engine

The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

learn-databricks

Notebooks to learn Databricks Lakehouse Platform

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

metricflow

MetricFlow allows you to define, build, and maintain metrics in code.

License:NOASSERTIONStargazers:0Issues:0Issues:0

openhouse

Open Control Plane for Tables in Data Lakehouse

License:BSD-2-ClauseStargazers:0Issues:0Issues:0

OpenLineage

An Open Standard for lineage metadata collection

License:Apache-2.0Stargazers:0Issues:0Issues:0

OpenMetadata

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

Language:TypeScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

quinn

pyspark methods to enhance developer productivity 📣 👯 🎉

Language:PythonStargazers:0Issues:0Issues:0

seatunnel

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

License:Apache-2.0Stargazers:0Issues:0Issues:0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

system-design-101

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

License:NOASSERTIONStargazers:0Issues:0Issues:0

terraform-databricks-lakehouse-blueprints

Set of Terraform automation templates and quickstart demos to jumpstart the design of a Lakehouse on Databricks. This project has incorporated best practices across the industries we work with to deliver composable modules to build a workspace to comply with the highest platform security and governance standards.

License:NOASSERTIONStargazers:0Issues:0Issues:0

terraform-databricks-sra

The Security Reference Architecture (SRA) implements typical security features as Terraform Templates that are deployed by most high-security organizations, and enforces controls for the largest risks that customers ask about most often.

License:NOASSERTIONStargazers:0Issues:0Issues:0