Ravi Hindocha (RaviHindocha)

Company: CultData

Location: Rabbit Hole

Ravi Hindocha's repositories

dbt-core-rh

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0

ingestd-tpcdi

Data Integration via Confluent Kafka

Language: Python | License: GPL-3.0 | Stargazers: 1 | Issues: 0

lens

Lenses, Folds, and Traversals - Join us on freenode #haskell-lens

Language: Haskell | License: NOASSERTION | Stargazers: 1 | Issues: 0

tpc-di_benchmark

Benchmark for Airflow with BigQuery as the data warehouse, using TPC-DI

Language: Python | Stargazers: 1 | Issues: 0

amundsen

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

aqueduct

The control center for ML in the cloud

Language: Go | License: Apache-2.0 | Stargazers: 0 | Issues: 0

awesome

😎 Awesome lists about all kinds of interesting topics

License: CC0-1.0 | Stargazers: 0 | Issues: 0

awesome-production-machine-learning

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

License: MIT | Stargazers: 0 | Issues: 0

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

Stargazers: 0 | Issues: 0

codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0

Computer-Science-Education-Resources

A place for programming language instructors to share educational materials

Stargazers: 0 | Issues: 0

datacontract-specification

The Data Contract Specification Repository

License: MIT | Stargazers: 0 | Issues: 0

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

License: Apache-2.0 | Stargazers: 0 | Issues: 0

free-for-dev

A list of SaaS, PaaS and IaaS offerings that have free tiers of interest to devops and infradev

Language: HTML | Stargazers: 0 | Issues: 0

free-programming-books

📚 Freely available programming books

License: NOASSERTION | Stargazers: 0 | Issues: 0

h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs

License: Apache-2.0 | Stargazers: 0 | Issues: 0

hudi

Upserts, Deletes And Incremental Processing on Big Data.

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0

iceberg

Apache Iceberg

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0

jitsu

Jitsu is an open-source Segment alternative. Fully scriptable data ingestion engine for modern data teams. Set up a real-time data pipeline in minutes, not days.

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 0

mage-ai

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

mlflow

Open source platform for the machine learning lifecycle

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

onnx

Open standard for machine learning interoperability

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

presto

The official home of the Presto distributed SQL query engine for big data

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0

python-mastery

Advanced Python Mastery (course by @dabeaz)

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 0 | Issues: 0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 0

spec

The AsyncAPI specification allows you to create machine-readable definitions of your asynchronous APIs.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0

zenml

ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0