Venki Korukanti's repositories
spark-docker-compose
Spark + HDFS cluster using Docker Compose
connectors
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.
arrow
Mirror of Apache Arrow
calcite
Mirror of Apache Calcite
calcite-avatica
Mirror of Apache Calcite - Avatica
de.flapdoodle.embed.mongo
...will provide a platform-neutral way of running MongoDB in unit tests.
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
delta-sharing
An open protocol for secure data sharing
docker-presto-cluster
Multi-node Presto cluster running in Docker containers
dremio-oss
Dremio - the missing link in modern data
drill
Mirror of Apache Drill (Incubating)
drill-test-framework
Test Framework for Apache Drill
flink
Apache Flink
geometry-api-java
The Esri Geometry API for Java enables developers to write custom applications for analysis of spatial data. This API is used in the Esri GIS Tools for Hadoop and other 3rd-party data processing solutions.
hive
Mirror of Apache Hive
hive-testbench
Testbench for experimenting with Apache Hive at any data scale.
incubator-hudi
Upserts, deletes, and incremental processing on big data.
incubator-pinot
Apache Pinot (Incubating) - A realtime distributed OLAP datastore
jillow-core
Java library using Zillow's API
presto
Distributed SQL query engine for big data
presto-hive-apache
Shaded version of Apache Hive for Presto
spark
Apache Spark - A unified analytics engine for large-scale data processing
trino-hadoop-apache
Shaded version of Apache Hadoop for Trino
trino-hive-apache
Shaded version of Apache Hive for Trino
unitycatalog
Open, Multi-modal Catalog for Data & AI