jdanbrown / feather-util

Misc. utils for https://github.com/wesm/feather

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Install feather:

Convert a row-major tsv file to many tsv-split-cols files:

  • Straightforward to make this work with csv, but it's ~5x slower, so I didn't
cat ebd_US-CA_relFeb-2017.txt | pv -terbl -s34m | ./tsv-split-cols ebd_US-CA_relFeb-2017.txt.tsv-split-cols/

Convert the tsv-split-cols files into one column-major feather file:

./feather-from-tsv-split-cols ebd_US-CA_relFeb-2017.txt.tsv-split-cols/ ebd_US-CA_relFeb-2017.feather

Use feather:

About

Misc. utils for https://github.com/wesm/feather


Languages

Language:Python 100.0%