stariy95 / dflib

In-memory Java DataFrame library

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Maven Central Build Status

DFLib

DFLib ("DataFrame Library") is a lightweight pure Java implementation of a common DataFrame data structure. With DFLib you get essentially the same data manipulation tools you may be used to in SQL (such as queries and joins), only you can use them in-memory and over dynamically defined data structures.

It is somewhat close to Python pandas, except it is intended to be used with Java and other JVM languages.

DFLib comes with connectors for a variety of data formats: CSV, Excel, RDBMS (JDBC), Avro, JSON, etc.

Project Links

Older Stuff

  • a 20 min presentation of an early version of DFLib. While the DFLib API has evolved substantially since then, the video still demonstrates the idea of the project...

DFLib at WODay Frankfurt

About

In-memory Java DataFrame library

License:Apache License 2.0


Languages

Language:Java 99.9%Language:PLpgSQL 0.1%