Conrad (conraddd)

conraddd

Geek Repo

Location:Hong Kong

Github PK Tool:Github PK Tool

Conrad's starred repositories

unitycatalog

Open, Multi-modal Catalog for Data & AI

Language:JavaLicense:Apache-2.0Stargazers:1490Issues:0Issues:0

openhouse

Open Control Plane for Tables in Data Lakehouse

Language:JavaLicense:BSD-2-ClauseStargazers:268Issues:0Issues:0

kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

Language:PythonLicense:Apache-2.0Stargazers:9443Issues:0Issues:0

awesome-apache-airflow

Curated list of resources about Apache Airflow

Language:ShellStargazers:3599Issues:0Issues:0

spark-monitoring

Monitoring Azure Databricks jobs

Language:ScalaLicense:MITStargazers:206Issues:0Issues:0

kafka-delta-ingest

A highly efficient daemon for streaming data from Kafka into Delta Lake

Language:RustLicense:Apache-2.0Stargazers:334Issues:0Issues:0

pyre-check

Performant type-checking for python.

Language:OCamlLicense:MITStargazers:6727Issues:0Issues:0

dbt-expectations

Port(ish) of Great Expectations to dbt test macros

Language:ShellLicense:Apache-2.0Stargazers:977Issues:0Issues:0

delta-sharing

An open protocol for secure data sharing

Language:ScalaLicense:Apache-2.0Stargazers:708Issues:0Issues:0

podman-compose

a script to run docker-compose.yml using podman

Language:PythonLicense:GPL-2.0Stargazers:4819Issues:0Issues:0

podman

Podman: A tool for managing OCI containers and pods.

Language:GoLicense:Apache-2.0Stargazers:22224Issues:0Issues:0

bamboolib

bamboolib - a GUI for pandas DataFrames

Language:Jupyter NotebookStargazers:936Issues:0Issues:0

petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Language:PythonLicense:MITStargazers:8841Issues:0Issues:0

delta-rs

A native Rust library for Delta Lake, with bindings into Python

Language:RustLicense:Apache-2.0Stargazers:1935Issues:0Issues:0

dbx

🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.

Language:PythonLicense:NOASSERTIONStargazers:434Issues:0Issues:0

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

License:MITStargazers:26159Issues:0Issues:0

awesome-data-catalogs

📙 Awesome Data Catalogs and Observability Platforms.

License:MITStargazers:619Issues:0Issues:0

jina

☁️ Build multimodal AI applications with cloud-native stack

Language:PythonLicense:Apache-2.0Stargazers:20404Issues:0Issues:0

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Language:ScalaLicense:Apache-2.0Stargazers:7172Issues:0Issues:0

procfwk

A cross tenant metadata driven processing framework for Azure Data Factory and Azure Synapse Analytics achieved by coupling orchestration pipelines with a SQL database and a set of Azure Functions.

Language:C#License:NOASSERTIONStargazers:177Issues:0Issues:0

azure.datafactory.tools

Tools for deploying Data Factory (v2) in Microsoft Azure

Language:PowerShellLicense:MITStargazers:205Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:12Issues:0Issues:0

databricks-nutter-repos-demo

Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline

Language:PythonLicense:MITStargazers:145Issues:0Issues:0

notebook-best-practices

An example showing how to apply software engineering best practices to Databricks notebooks.

Language:PythonLicense:Apache-2.0Stargazers:113Issues:0Issues:0

azure-sdk-for-python

This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://docs.microsoft.com/python/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-python.

Language:PythonLicense:MITStargazers:4329Issues:0Issues:0

azure-pipelines-terraform

Azure Pipelines tasks for installing Terraform and running Terraform commands in a build or release pipeline.

Language:TypeScriptLicense:MITStargazers:92Issues:0Issues:0

pre-commit

A framework for managing and maintaining multi-language pre-commit hooks.

Language:PythonLicense:MITStargazers:12300Issues:0Issues:0
Language:C#License:MITStargazers:33Issues:0Issues:0

azure-pipelines-tasks

Tasks for Azure Pipelines

Language:TypeScriptLicense:MITStargazers:3417Issues:0Issues:0

mlops-v2

Azure MLOps (v2) solution accelerators. Enterprise ready templates to deploy your machine learning models on the Azure Platform.

Language:ShellLicense:MITStargazers:468Issues:0Issues:0