dmvianna / framesy

Frames tutorial loosely followed

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

framesy

I'm learning how to parse big CSV files in Haskell. This is my second attempt. I'll be following (loosely) the frames tutorial. No, not the one formatted for web. I'm slowly realising one can learn Haskell by following code (it is readable, and authors are generally good at commenting it). Also, that one cannot do so without reading the source code of the libraries used. Tutorials and plain English documentation are relatively inexistent.

First task

Reading from a stream.

Second dask

Doing some basic data analysis, like counting records.

Second and a half task

Being able to inspect the stream using something like take or show with indexing. I assume I would be doing it in GHCi.

Third task

Extracting relevant info from unstructured text, such as addresses. That's a big part of what I do for work, and the main motivation for looking beyond Python. I want to move away from regular expressions and do it fast.

Fourth task

GROUP BY

At some point

  • Encoding results back into an output file.

Finally, I eagerly welcome help to move this forward. Get in touch!

About

Frames tutorial loosely followed

License:BSD 3-Clause "New" or "Revised" License


Languages

Language:Haskell 100.0%