aplunk / delta-rs

A native Rust library for Delta Lake, with bindings into Python

Home Page:https://delta-io.github.io/delta-rs/python/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

delta-rs

Build Status Crate Docs Python binding Docs

Delta-rs logo

A native interface to Delta Lake.

About

This library provides low level access to Delta tables in Rust, which can be used with data processing frameworks like datafusion, ballista, polars, vega, etc. It also provides bindings to other higher level language Python.

Features

Supported backends:

  • Local file system

  • AWS S3

  • Azure Blob Storage / Azure Datalake Storage Gen2

  • Google Cloud Storage

Table 1. Support features
Operation/Feature Rust Python

Read table

✔️

✔️

Stream table update

✔️

✔️

Filter files with partitions

✔️

✔️

Vacuum (delete stale files)

✔️

✔️

History

✔️

✔️

Write transactions

✔️

Checkpoint creation

✔️

High-level file writer

✔️

Optimize

✔️

#622

Get Involved

Development Meeting

We have a standing development sync meeting for those that are interested. The meeting is held every two weeks at 9am PST on Tuesday mornings. The direct meeting URL is shared in the Slack channel above ☝️ before the meeting.

These meetings are also streamed live via YouTube if you just want to listen in.

Development

delta-rs requires the Rust compiler, which can be installed with the rustup command.

Running tests can be done with cargo test in the root directory, or one of the directories below:

Rust

The rust/ directory contains core Rust APIs for accessing Delta Lake from Rust, or for higher-level language bindings.

Python

The python/ directory contains the deltalake Python package built on top of delta-rs

About

A native Rust library for Delta Lake, with bindings into Python

https://delta-io.github.io/delta-rs/python/

License:Apache License 2.0


Languages

Language:Rust 84.8%Language:Python 11.3%Language:TLA 3.1%Language:Shell 0.5%Language:Makefile 0.3%Language:Batchfile 0.1%