sathya-reddy-m's repositories

License: CC0-1.0 | Stargazers: 0 | Issues: 0

cobrix

A COBOL parser and Mainframe/EBCDIC data source for Apache Spark

License: Apache-2.0 | Stargazers: 0 | Issues: 0

CommonDataModel

Definition and DDLs for the OMOP Common Data Model (CDM)

License: Apache-2.0 | Stargazers: 0 | Issues: 0

pyspark-cheatsheet

PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster

License: CC0-1.0 | Stargazers: 0 | Issues: 0

xbar

Put the output from any script or program into your macOS Menu Bar (the BitBar reboot)

License: MIT | Stargazers: 0 | Issues: 0

meltano

Your open source DataOps Platform Infrastructure to let you manage all the data tools in your stack in one place, and turn them into your ideal end-to-end data platform

License: MIT | Stargazers: 0 | Issues: 0

CDM

The Common Data Model (CDM) is a standard and extensible collection of schemas (entities, attributes, relationships) that represents business concepts and activities with well-defined semantics, to facilitate data interoperability. Examples of entities include: Account, Contact, Lead, Opportunity, Product, etc.

License: CC-BY-4.0 | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

fastapi-lakehouse

Connect FastAPI to a Databricks Lakehouse

Stargazers: 0 | Issues: 0

aws-cloud-mindmaps

Mindmaps about AWS based on public information

Stargazers: 0 | Issues: 0

superset

Apache Superset is a Data Visualization and Data Exploration Platform

License: Apache-2.0 | Stargazers: 0 | Issues: 0

spark-data-standardization

Excellent Validation Schema Validation and transformation for streaming

License: Apache-2.0 | Stargazers: 1 | Issues: 0

aws-athena-query-federation

The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

the-book-of-secret-knowledge

A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

License: MIT | Stargazers: 0 | Issues: 0

redpanda

Redpanda is a streaming data platform for developers. Kafka API compatible, 10x faster, ZooKeeper free, JVM free! See more at redpanda.com

Stargazers: 0 | Issues: 0

dbt-databricks

A dbt adapter for Databricks.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

sql-style-guide

An opinionated guide for writing clean, maintainable SQL.

Stargazers: 0 | Issues: 0

ra_data_warehouse

This dbt package contains a set of pre-built, pre-integrated Load and Transform dbt models for common SaaS applications.

License: Apache-2.0 | Stargazers: 1 | Issues: 0

Miscellaneous

Scripts and code examples. Includes Spark notes, Jupyter notebook examples for Spark, Impala and Oracle.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

data_engineering_tools

data transformation functions and snippets

Stargazers: 0 | Issues: 0

God-Of-BigData

Focused on big data study and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

Stargazers: 0 | Issues: 0

ckan

CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.

License: NOASSERTION | Stargazers: 0 | Issues: 0

iceberg

Apache Iceberg

License: Apache-2.0 | Stargazers: 0 | Issues: 0

apicurio-registry

An API/Schema registry - stores APIs and Schemas.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

aim42

public repository for the "architecture improvement method reference"

License: Apache-2.0 | Stargazers: 0 | Issues: 0

prefect

The easiest way to automate your data

License: Apache-2.0 | Stargazers: 0 | Issues: 0

spark-metrics

Spark metrics related custom classes and sinks (e.g. Prometheus)

License: Apache-2.0 | Stargazers: 0 | Issues: 0

synth

The Declarative Data Generator

License: Apache-2.0 | Stargazers: 0 | Issues: 0

snapflow

Functional reactive data pipelines

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

spec

CloudEvents Specification

License: Apache-2.0 | Stargazers: 0 | Issues: 0