elans2's repositories
lakehouse-sharing
A Table format agnostic data sharing framework
bastionlab
A simple framework for privacy-friendly data science collaboration
blaze
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
dbt-trino
The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)
delta-rs
A native Rust library for Delta Lake, with bindings into Python
delta-sharing-rust-client
Delta Sharing client library for Rust
gridstudio
Grid studio is a web-based application for data science with full integration of open source data science frameworks and languages.
hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
mo-sql-parsing
Let's make a SQL parser so we can provide a familiar interface to non-sql datastores!
glaredb
GlareDB is a fast SQL database for querying and analyzing distributed data.
graphic-walker
An open source alternative to Tableau. Easily embedded as a component in web apps.
iceberg-rust
Rust implementation of Apache Iceberg with integration for Datafusion
llama_cpp-rs
High-level, optionally asynchronous Rust bindings to llama.cpp
mctx
Monte Carlo tree search in JAX
protobuf
Protocol Buffers - Google's data interchange format
Rath
Automated data exploratory analysis and visualization tools.
robusta
Easy interop between Rust and Java
rubix
Cache File System optimized for columnar formats and object stores
rust-llama.cpp
LLama.cpp rust bindings
rust-wasm-dynamic-module-study
A Study on how to load dynamic WASM modules in Rust
sharing
A Minimalistic Rust Implementation of Delta Sharing Server.
sql-ds-cache
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
sql-translator
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
tensorbase
TensorBase is a new big data warehousing with modern efforts.
transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
velox
A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.