bens / hadoop-formats

Read/write file formats commonly used by Hadoop

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Hadoop Formats Hackage version Build Status

Read/write file formats commonly used by Hadoop.

Currently this package only supports reading snappy encoded sequence files.

Installation

You will need to have libsnappy installed to build this project. If you are using OSX and homebrew to install snappy then the following should get everything installed successfully.

$ brew install snappy
$ SNAPPY=$(brew --prefix snappy)
$ export C_INCLUDE_PATH=$SNAPPY/include
$ export LIBRARY_PATH=$SNAPPY/lib
$ cabal install hadoop-formats

About

Read/write file formats commonly used by Hadoop

License:Apache License 2.0


Languages

Language:Haskell 73.0%Language:C 21.3%Language:Java 3.8%Language:Shell 2.0%