There are 28 repositories under the dataframe topic.
Modin: Scale your Pandas workflows by changing a single line of code
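A minimal sketch of that one-line change, assuming a local CSV named data.csv (both the file and the column name are placeholders):

```python
# The advertised change: `import pandas as pd` becomes the line below.
import modin.pandas as pd

df = pd.read_csv("data.csv")          # parallelized across cores by Modin
print(df.groupby("region").size())    # "region" is a hypothetical column
```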
Vaex: an out-of-core hybrid Apache Arrow/NumPy DataFrame for Python, for ML, visualization, and exploration of big tabular data at a billion rows per second 🚀
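A hedged sketch of Vaex's lazy, out-of-core style, using the sample dataset that ships with the library:

```python
import vaex

df = vaex.example()       # bundled sample data, memory-mapped rather than loaded
print(df.mean(df.x))      # aggregations stream over the data out of core
```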
Apache DataFusion SQL Query Engine
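A short sketch of DataFusion's Python bindings, assuming a local file data.csv (a placeholder):

```python
from datafusion import SessionContext

ctx = SessionContext()
ctx.register_csv("t", "data.csv")              # expose the file as table "t"
ctx.sql("SELECT COUNT(*) AS n FROM t").show()  # plan and run the query over Arrow batches
```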
Danfo.js is an open-source JavaScript library providing high-performance, intuitive, and easy-to-use data structures for manipulating and processing structured data.
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
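A minimal sketch (provider and locale names follow recent Mimesis releases; older versions accept plain locale strings like "en"):

```python
from mimesis import Person
from mimesis.locales import Locale

person = Person(Locale.EN)                 # locale-aware fake-data provider
print(person.full_name(), person.email())  # e.g. a plausible name and address
```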
Koalas: pandas API on Apache Spark
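A minimal sketch of the pandas-style API (Koalas was later upstreamed into Spark itself as pyspark.pandas):

```python
import databricks.koalas as ks

kdf = ks.DataFrame({"x": [1, 2, 3], "y": [4.0, 5.0, 6.0]})
print(kdf.mean())   # looks like pandas, executes on Spark
```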
C++ DataFrame for statistical, financial, and ML analysis in modern C++
Mars is a tensor-based unified framework for large-scale data computation that scales numpy, pandas, scikit-learn, and Python functions.
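A small sketch of Mars' lazy, chunked execution model:

```python
import mars.tensor as mt

t = mt.random.rand(10000, 10000, chunk_size=1000)  # tiled into 1000x1000 chunks
print(t.sum().execute())                           # builds a task graph, then runs it
```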
ConnectorX: the fastest library to load data from databases into DataFrames in Rust and Python
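A hedged sketch; the connection string, query, and partition column are placeholders:

```python
import connectorx as cx

df = cx.read_sql(
    "postgresql://user:pass@localhost:5432/db",
    "SELECT * FROM lineitem",
    partition_on="l_orderkey",  # split the query so partitions load in parallel
    partition_num=4,
)
```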
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. It runs and scales everywhere Python does.
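A minimal sketch of the idea: function names define outputs and parameter names declare inputs, so the dataflow documents its own lineage (the module and column names are placeholders):

```python
# funcs.py
import pandas as pd

def spend_per_signup(spend: pd.Series, signups: pd.Series) -> pd.Series:
    """Lineage is encoded by the signature: this node depends on spend and signups."""
    return spend / signups

# run.py
from hamilton import driver
import funcs

dr = driver.Driver({}, funcs)  # config dict, then the module(s) defining the dataflow
print(dr.execute(
    ["spend_per_signup"],
    inputs={"spend": pd.Series([10.0, 20.0]), "signups": pd.Series([1, 2])},
))
```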
AI code-writing assistant that understands data content
📺(tv) Tidy Viewer is a cross-platform CLI CSV pretty printer that uses column styling to maximize viewer enjoyment.
ArcticDB is a high-performance, serverless DataFrame database built for the Python data science ecosystem.
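A minimal sketch using a local LMDB backend (the URI, library, and symbol names are placeholders):

```python
import pandas as pd
import arcticdb as adb

ac = adb.Arctic("lmdb://./arctic_demo")                  # serverless: storage is just a path
lib = ac.get_library("quotes", create_if_missing=True)
lib.write("prices", pd.DataFrame({"close": [1.0, 2.0]}))
print(lib.read("prices").data)                           # versioned read returns the DataFrame
```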
Apache DataFusion Ballista Distributed Query Engine
A curated list of amazingly awesome Cybersecurity datasets
Machine learning with dataframes
pyjanitor: clean APIs for data cleaning. A Python implementation of the R package janitor.
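A minimal sketch of the method-chaining style it registers on pandas DataFrames:

```python
import pandas as pd
import janitor  # noqa: F401 -- importing registers the cleaning methods

df = pd.DataFrame({"First Name": ["Ada"], "Age": [36]})
print(df.clean_names().columns.tolist())  # -> ['first_name', 'age']
```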
DataFrames for Go: for statistics, machine learning, and data manipulation/exploration
A nimble options backtesting library for Python
GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs
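A minimal PySpark sketch, assuming the graphframes package is on the Spark classpath:

```python
from pyspark.sql import SparkSession
from graphframes import GraphFrame

spark = SparkSession.builder.getOrCreate()
v = spark.createDataFrame([("a", "Alice"), ("b", "Bob")], ["id", "name"])           # vertices need `id`
e = spark.createDataFrame([("a", "b", "follows")], ["src", "dst", "relationship"])  # edges need `src`/`dst`
g = GraphFrame(v, e)
g.inDegrees.show()
```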
A connector for Spark that allows reading from and writing to a Redis cluster
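A hedged PySpark sketch, assuming the spark-redis jar is on the classpath and spark.redis.host/port are configured (the table and key column are placeholders):

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()
df = spark.createDataFrame([("Alice", 30)], ["name", "age"])

# write rows to Redis under the "people" table namespace
df.write.format("org.apache.spark.sql.redis") \
    .option("table", "people") \
    .option("key.column", "name") \
    .save()

# read them back as a DataFrame
people = spark.read.format("org.apache.spark.sql.redis") \
    .option("table", "people").load()
people.show()
```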
A scalable, general-purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
The most advanced data processing framework, for building scalable data processing pipelines and moving data between various data sources and destinations.