sabman / aicasts.com

Weekly AI Tutorials

Home Page:https://aicasts.org

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

vaex 🐍

sabman opened this issue Β· comments

https://vaex.io/docs/index.html
https://towardsdatascience.com/how-to-speed-up-data-processing-in-pandas-a272d3485b24

Features:

  • Performance: works with huge tabular data, processes rows/second
  • Lazy / Virtual columns: compute on the fly, without wasting ram
  • Memory efficient no memory copies when doing filtering/selections/subsets.
  • Visualization: directly supported, a one-liner is often enough.
  • User friendly API: you will only need to deal with the DataFrame object, and tab completion + docstring will help you out: ds.mean, feels very similar to Pandas.