treebee / kartothek

A consistent table management library in python

Home Page:https://kartothek.readthedocs.io/en/latest

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

kartothek

Build Status Documentation Status codecov.io License: MIT Anaconda-Server Badge Anaconda-Server Badge

Datasets are a collection of files with the same schema that reside in a storage. kartothek offers a metadata definition to handle these datasets efficiently. In addition, the kartothek.io module provides building blocks to create and modify these datasets. Handling of I/O, tracking of dataset partitions and selecting subsets of data are handled transparently.

Installation

Installers for the latest released version are availabe at the Python package index and on conda.

# Install with pip
pip install kartothek
# Install with conda
conda install -c conda-forge kartothek

What is a (real) Kartothek?

A Kartothek (or more modern: Zettelkasten/Katalogkasten) is a tool to organize (high-level) information extracted from a source of information.

About

A consistent table management library in python

https://kartothek.readthedocs.io/en/latest

License:MIT License


Languages

Language:Python 99.9%Language:Shell 0.1%