vibhatha / cylon

Cylon is a fast, scalable distributed memory data parallel library for processing structured data

Home Page:https://cylondata.org

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Cylon

Build Status License

Cylon is a fast, scalable distributed memory data parallel library for processing structured data. Cylon implements a set of relational operators to process data. While ”Core Cylon” is implemented using system level C/C++, multiple language interfaces (Python and Java (R in future)) are provided to seamlessly integrate with existing applications, enabling both data and AI/ML engineers to invoke data processing operators in a familiar programming language. By default it works with MPI for distributing the applications.

Internally Cylon uses Apache Arrow to represent the data in a column format.

The documentation can be found at https://cylondata.org

Email - cylondata@googlegroups.com

Mailing List - Join

About

Cylon is a fast, scalable distributed memory data parallel library for processing structured data

https://cylondata.org

License:Apache License 2.0


Languages

Language:C++ 67.1%Language:Python 28.1%Language:CMake 2.1%Language:Java 1.1%Language:Jupyter Notebook 1.0%Language:Shell 0.6%Language:Dockerfile 0.0%Language:Makefile 0.0%