There are 24 repositories under the dataframe topic.
Modin: Scale your Pandas workflows by changing a single line of code
Apache DataFusion SQL Query Engine
Danfo.js is an open-source JavaScript library providing high-performance, intuitive, and easy-to-use data structures for manipulating and processing structured data.
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
Koalas: pandas API on Apache Spark
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
C++ DataFrame for statistical, financial, and ML analysis, in modern C++ using native types and contiguous memory storage
AI code-writing assistant that understands data content
📺(tv) Tidy Viewer is a cross-platform CLI CSV pretty printer that uses column styling to maximize viewer enjoyment.
Fastest library to load data from DB to DataFrames in Rust and Python
Distributed DataFrame for Python designed for the cloud, powered by Rust
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage and metadata. Runs and scales everywhere Python does.
Apache Arrow Ballista Distributed Query Engine
Clean APIs for data cleaning. Python implementation of the R package Janitor
A curated list of amazingly awesome Cybersecurity datasets
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
A connector for Spark that allows reading from and writing to a Redis cluster
A nimble options backtesting library for Python