dtables is a Python based minimalistic framework for exploratory data analysis. This project is under development. A rough API map is available.
- Minimal - Quick to learn and get going. Useful for programmers who are just dabbling in data analysis. Not necessarily for pros whose day job is data analysis. Focus on "one way to solve problems" rather than "different ways to match different situations".
- Complete - Minimal does not mean incomplete. Do not leave important use cases unaddressed. Try to address them without adding concepts.
Performance is not an initial goal. The current objective is to make it work well for smallish datasets (10s of MBs).
- load_dict
- load_tuples
- load_csv
- load_json
- load_table
- column names
- head
- tail
- dtypes
- shape
- describe
- name based
- position based
- single column name
- array of column names
- column name range
- single position
- array of position
- range of position
- boolean array based
- single position
- array of position
- range of position
- boolean array based
- overwrite existing columns
- add new columns
- overwrite existing rows
- arithmetic
- comparison
- broadcasting
- row apply
- column apply
- pivot
- melt
- append
- join
- sort
- group apply