Matthew Powers's repositories
spark-fast-tests
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
spark-style-guide
Spark style guide
python-parquet-examples
Using the Parquet file format with Python
mrpowers-benchmarks
MrPowers benchmarks for Dask, Polars, DataFusion, and pandas
datafusion-book
DataFusion book
data-scrapbook
A collection of images and captions to explain core data concepts
mrpowers-book
Book on MrPowers OSS projects, blogs, and other assets
mrpowers.github.io
Documentation and stuff
pydata-examples
Examples of various PyData technologies like pandas, DataFusion, DuckDB, and Polars
pyspark-examples
PySpark example notebooks
arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
arrow-datafusion-python
Apache Arrow DataFusion Python Bindings
db-benchmark
reproducible benchmark of database-like ops
lance-examples
Examples with Lance table format
Language:Jupyter Notebook000
spark-website
Apache Spark Website