Rohit Rastogi's starred repositories
iceberg-rust
Apache Iceberg
iceberg-rust
Rust implementation of Apache Iceberg with integration for Datafusion
Scrapegraph-ai
Python scraper based on AI
prost-arrow
prost-arrow derives arrow array builders for protobuf messages generated by prost
RemoteShuffleService
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
s3xz-caching-solution
Reference Architecture to automate the use of S3 Express One Zone as a caching layer for S3 Regional Buckets.
incubator-xtable
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Hoptimator
Multi-hop declarative data pipelines
superlinked
A compute framework for turning complex data into vectors. Build multimodal vectors with ease and define weights at query time so you don't need a custom reranking algorithm to optimise results. Go straight from notebook to production with the same SDK.
perspective
A data visualization and analytics component, especially well-suited for large and/or streaming datasets.
datafusion-comet
Apache DataFusion Comet Spark Accelerator