stikkireddy

stikkireddy

Geek Repo

Company:Databricks

Location:Philadelphia, PA

Github PK Tool:Github PK Tool

stikkireddy's starred repositories

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonLicense:Apache-2.0Stargazers:32297Issues:476Issues:18093

great_expectations

Always know what to expect from your data.

Language:PythonLicense:Apache-2.0Stargazers:9708Issues:83Issues:1848

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Language:ScalaLicense:Apache-2.0Stargazers:7317Issues:219Issues:1452

terraform-cdk

Define infrastructure resources using programming constructs and provision them using HashiCorp Terraform

Language:TypeScriptLicense:MPL-2.0Stargazers:4798Issues:62Issues:1557

scala-style-guide

Databricks Scala Coding Style Guide

delta-sharing

An open protocol for secure data sharing

Language:ScalaLicense:Apache-2.0Stargazers:735Issues:31Issues:133

terraform-provider-databricks

Databricks Terraform Provider

Language:GoLicense:NOASSERTIONStargazers:427Issues:35Issues:1617

overwatch

Capture deep metrics on one or all assets within a Databricks workspace

Language:ScalaLicense:NOASSERTIONStargazers:222Issues:31Issues:735

slrp

rotating open proxy multiplexer

Language:GoLicense:MITStargazers:160Issues:3Issues:40

dataframe-rules-engine

Extensible Rules Engine for custom Dataframe / Dataset validation

Language:ScalaLicense:NOASSERTIONStargazers:134Issues:13Issues:20

mlflow-export-import

Export and import MLflow experiments, runs or registered models

Language:HTMLLicense:Apache-2.0Stargazers:77Issues:6Issues:48

databricks-sync

An experimental tool to synchronize source Databricks deployment with a target Databricks deployment.

Language:PythonLicense:NOASSERTIONStargazers:46Issues:7Issues:63

dlt-with-debug

A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT run and Non-DLT interactive notebook run.

Language:PythonLicense:MITStargazers:41Issues:1Issues:6

deltaray

Delta reader for the Ray open-source toolkit for building ML applications

Language:PythonLicense:Apache-2.0Stargazers:40Issues:5Issues:8

terraform-module-azure-datalake

Terraform module for an Azure Data Lake

Language:HCLLicense:MITStargazers:29Issues:11Issues:21

kdbspark

Spark Data Source (V2) for Kx Systems kdb+ Database

Language:ScalaLicense:Apache-2.0Stargazers:19Issues:3Issues:1

edw-best-practices

Git Repo for EDW Best Practice Assets on the Lakehouse

Language:PythonLicense:MITStargazers:15Issues:4Issues:0
Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0

datafussion-delta-rust

Query Delta without SPARK

Language:RustStargazers:2Issues:0Issues:0
Language:RustLicense:Apache-2.0Stargazers:1Issues:0Issues:0