chr1st1ank / dataframe-io

Read and write dataframes anywhere

Home Page: https://chr1st1ank.github.io/dataframe-io/

dataframe-io

Read and write dataframes from and to any storage.

Features

Dataframe types supported:

  • pandas DataFrame
  • Python dictionary

Supported storage backends:

  • Parquet files
  • PostgreSQL database

More backends will come. Open an issue if you are interested in a particular backend.

Implementation status for reading data:

Storage       | Select columns | Filter rows | Max rows | Sampling | Drop duplicates
Parquet files | ✔️             | ✔️          | ✔️       | ✔️       | ✔️ ¹
PostgreSQL    | ✔️             | ✔️          | ✔️       | ✔️       | ✔️

¹ only for pandas DataFrames
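
To make the column names in the table above concrete, here is a minimal sketch of what the five read options mean when applied to a plain Python dictionary of columns. It is not dframeio's code or API: the function `read_sketch`, its parameter names, and the order in which the options are applied are all assumptions made for illustration.

```python
import random
from typing import Callable, Dict, List, Optional


def read_sketch(
    data: Dict[str, list],
    columns: Optional[List[str]] = None,
    row_filter: Optional[Callable[[dict], bool]] = None,
    limit: Optional[int] = None,
    sample: Optional[int] = None,
    drop_duplicates: bool = False,
    seed: Optional[int] = None,
) -> Dict[str, list]:
    """Hypothetical helper illustrating the read options on a dict of columns."""
    names = list(columns) if columns is not None else list(data)
    n_rows = len(next(iter(data.values()), []))

    # Build row dictionaries over all columns so the filter can use any of them.
    rows = [{name: data[name][i] for name in data} for i in range(n_rows)]

    # Filter rows.
    if row_filter is not None:
        rows = [row for row in rows if row_filter(row)]

    # Select columns.
    projected = [tuple(row[name] for name in names) for row in rows]

    # Drop duplicates, keeping the first occurrence (values must be hashable).
    if drop_duplicates:
        projected = list(dict.fromkeys(projected))

    # Sampling: pick `sample` rows at random without replacement.
    if sample is not None and sample < len(projected):
        projected = random.Random(seed).sample(projected, sample)

    # Max rows.
    if limit is not None:
        projected = projected[:limit]

    return {name: [row[i] for row in projected] for i, name in enumerate(names)}


data = {
    "id": [1, 2, 2, 3],
    "city": ["Oslo", "Rome", "Rome", "Oslo"],
    "temp": [3, 18, 18, 5],
}
warm_cities = read_sketch(
    data,
    columns=["city"],
    row_filter=lambda row: row["temp"] > 4,
    drop_duplicates=True,
)
print(warm_cities)
```

With the real library, the backends are expected to push these operations down to the storage layer where possible instead of materializing every row first, as this sketch does.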

Implementation status for writing data:

Storage       | write append | write replace
Parquet files | ✔️           | ✔️
PostgreSQL    | ✔️           | ✔️

Installation

pip install dframeio

# Including pyarrow to read/write parquet files
# (quoted so that shells such as zsh do not expand the brackets):
pip install "dframeio[parquet]"

# Including PostgreSQL support:
pip install "dframeio[postgres]"

Show installed backends:

>>> import dframeio
>>> dframeio.backends
[<class 'dframeio.parquet.ParquetBackend'>]
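
The same check can be done from a script, for example to fail early when an optional extra is missing. The following is a small sketch that relies only on the `dframeio.backends` list shown above; the helper name `installed_backend_names` is hypothetical and not part of the library.

```python
from typing import List


def installed_backend_names() -> List[str]:
    """Return the class names of the available dframeio backends.

    Returns an empty list if dframeio itself is not installed.
    """
    try:
        import dframeio
    except ImportError:
        return []
    return [backend.__name__ for backend in dframeio.backends]


names = installed_backend_names()
if "ParquetBackend" not in names:
    print('Parquet support is missing; try: pip install "dframeio[parquet]"')
```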

About

License: Apache License 2.0


Languages

Language:Python 100.0%