stuartcarnie / arrow-datafusion

Apache Arrow DataFusion SQL Query Engine

Home Page:https://arrow.apache.org/datafusion

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DataFusion

Coverage Status

logo

DataFusion is a very fast, extensible query engine for building high-quality data-centric systems in Rust, using the Apache Arrow in-memory format. Python Bindings are also available.

DataFusion offers SQL and Dataframe APIs, excellent performance, built-in support for CSV, Parquet, JSON, and Avro, extensive customization, and a great community.

https://arrow.apache.org/datafusion/ contains the project's documentation.

Using DataFusion

The example usage section in the user guide and the datafusion-examples code in the crate contain information on using DataFusion.

Contributing to DataFusion

The developer’s guide contains information on how to contribute.

About

Apache Arrow DataFusion SQL Query Engine

https://arrow.apache.org/datafusion

License:Apache License 2.0


Languages

Language:Rust 99.1%Language:Python 0.5%Language:Shell 0.5%Language:Dockerfile 0.0%