Amogh Jahagirdar's repositories
arrow-datafusion-comet
Apache Arrow DataFusion Comet Spark Accelerator
async-file-io
Async FileIO implementation
aws-sdk-java
The official AWS SDK for Java.
brotli
Brotli compression format
calcite
Apache Calcite
iceberg
Apache Iceberg
iceberg-docs
Apache Iceberg Documentation Site
iceberg-rust
Apache Iceberg
ClickHouse
ClickHouse® is a free analytics DBMS for big data
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs for Scala, Java, Rust, Ruby, and Python.
druid
Apache Druid: a high performance real-time analytics database.
flink
Apache Flink
hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
iceberg-python
Apache PyIceberg
neon
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, branching, and bottomless storage.
parquet-mr
Apache Parquet
presto
The official home of the Presto distributed SQL query engine for big data
pyspark-ai
English SDK for Apache Spark
spark
Apache Spark - A unified analytics engine for large-scale data processing
spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
yunikorn-core
Apache YuniKorn Core
yunikorn-k8shim
Apache YuniKorn K8shim