dannguyen / csvmedkit

csvkit's extended family of awkward tools

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

csvmedkit

The unofficial extended family of csvkit, i.e. even more tools for command-line data parsing and wrangling.

Status as of 2020-11-11: Alpha, but working toward release version with these utilities:

  • csvflatten: reformat data for easier browsing of "wide" data
  • csvheader: add/alter a data file's column names
  • csvnorm: normalize unprintable characters and whitespace
  • csvpivot: do pivot tables
  • csvsed: do sed substitution on a per-column basis
  • csvslice: return rows by index, including head/tail to get first/last n rows

Read the in-progress docs at: csvmedkit.readthedocs.io

Note: many of the data examples incorporate the original csvkit tools, including a couple of post 1.0.6 fixes. Until csvkit makes a new official release, here's what I used:

$ pip install git+https://github.com/wireservice/csvkit.git@73d5bdc4a2f5c07b91737ea007bb3510f970aad7

About

csvkit's extended family of awkward tools

License:MIT License


Languages

Language:Python 99.0%Language:Makefile 1.0%