sathya-reddy-m's repositories
actionlint
:octocat: Static checker for GitHub Actions workflow files
amazon-quicksight-embedding-sdk
A SDK to help users embed QuickSight dashboards on other pages.
api-guidelines
adidas group API design guidelines
architecture-center
Open Source documentation for the Azure Architecture Center on Microsoft Docs
awesome-azure-architecture
AWESOME-Azure-Architecture - https://aka.ms/AwesomeAzureArchitecture
awesome-database-design
:zap: A collection of resources and tutorials to design a better database schema.
awesome-leading-and-managing
Awesome List of resources on leading people and being a manager. Geared toward tech, but potentially useful to anyone.
aws-saas-factory-eks-reference-architecture
This repository provides a reference architecture for building an end to end SaaS solution using Amazon Elastic Kubernetes Service (EKS)
AzureTRE
An accelerator to help organizations build Trusted Research Environments on Azure.
brickflow
Pythonic Programming Framework to orchestrate jobs in Databricks Workflow
ctakes
Apache cTAKES is a Natural Language Processing (NLP) platform for clinical text.
data-engineering-wiki
The best place to learn data engineering. Built and maintained by the data engineering community.
data-factory-testing-framework
A stand-alone test framework that allows to write unit tests for Data Factory pipelines on Microsoft Fabric, Azure Data Factory and Azure Synapse Analytics.
delta-examples
Delta Lake examples
dlt-meta
This is metadata driven DLT based framework for bronze/silver pipelines
dotcms-core
Headless/Hybrid Content Management System for Enterprises
fhir-omop-ig
A FHIR implementation guide that supports conversion of data from FHIR to OMOP and OMOP to FHIR
inlong
Apache InLong - a one-stop, full-scenario integration framework for massive data
lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
metricflow
MetricFlow allows you to define, build, and maintain metrics in code.
openhouse
Open Control Plane for Tables in Data Lakehouse
polaris
The interoperable, open source catalog for Apache Iceberg
pyspark-style-guide
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
unitycatalog
Open, Multi-modal Catalog for Data & AI
WhiteRabbit
WhiteRabbit is a small application that can be used to analyse the structure and contents of a database as preparation for designing an ETL. It comes with RabbitInAHat, an application for interactive design of an ETL to the OMOP Common Data Model with the help of the the scan report generated by White Rabbit.
www-project-top-10-for-large-language-model-applications
OWASP Foundation Web Respository