Amey Chaugule's repositories
DataflowJavaSDK
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
datafusion
Apache DataFusion SQL Query Engine
Deep-Learning-World
📡 Organized Resources for Deep Learning Researchers and Developers
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive, and with APIs for Scala, Java, Rust, Ruby, and Python.
smart-contract-sanctuary
🐦🌴🌴🌴🦕 A home for ethereum smart contracts. 🏠
testing-distributed-systems
Curated list of resources on testing distributed systems
kafka-in-production
📚 Tech blogs & talks by companies that run Kafka in production
Leaflet.TimeDimension
Add time dimension capabilities on a Leaflet map.
mev_bundle_generator
A MEV bundle generator written in Rust
minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
nbexamples
Distribute notebooks for users to copy via a web interface
parquet-format
Apache Parquet
pinot
Apache Pinot - A realtime distributed OLAP datastore
Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Prompt-Engineering-Guide
🐙 Guides, papers, lectures, and resources for prompt engineering
react-ipython-notebook
React component for nbconvert.js
scala-style-guide
Databricks Scala Coding Style Guide
sdb
Source code for the book Building a Debugger
spark-pinot-connector
Spark-pinot connector to read and write data from/to Pinot.
talent-plan
Open-source training courses about distributed databases and distributed systems