Ashish's repositories
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
arrow-adbc
Apache arrow
arrow-cookbook
Apache Arrow Cookbook
aws-sa-pro
Course Files for AWS Certified Solutions Architect - Professional - Adrian Cantrill
beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
calcite
Apache Calcite
cassandra
Mirror of Apache Cassandra
debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
flink-kubernetes-operator
Apache Flink Kubernetes Operator
hollow
Hollow is a java library and toolset for disseminating in-memory datasets from a single producer to many consumers for high performance read-only access.
hudi
Upserts, Deletes And Incremental Processing on Big Data.
incubator-xtable
OneTable is an omni-directional converter for table formats that facilitates interoperability across data processing systems and query engines.
jvm-tools
Small set of tools for JVM troublshooting, monitoring and profiling.
kafka
Mirror of Apache Kafka
LeetCode-Java
LeetCode fun
openhouse
Open Control Plane for Tables in Data Lakehouse
orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
orc-format
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
ozone
Scalable, redundant, and distributed object store for Apache Hadoop
papers
Repo to keep track of Paper Reading
parquet-java
Apache Parquet Java
polaris
The interoperable, open source catalog for Apache Iceberg
pulsar
Apache Pulsar - distributed pub-sub messaging system
ratis
Open source Java implementation for Raft consensus protocol.
shenyu
Apache ShenYu is a Java native API Gateway for service proxy, protocol conversion and API governance.
sofa-jraft
A production-grade java implementation of RAFT consensus algorithm.
System-Performance-Book-Notes
Repository to Host Book Reading notes on System Performance
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
unitycatalog
Open, Multi-modal Catalog for Data & AI