adeb09 / data-lake-code

Code for the Data Lake Talk

Creating a Local Data Lake

Companion code and slides for my talk on whether you need Hadoop. A lot of data processing can be done in memory on a reasonably modern laptop.

For example, you can run MapReduce with PyPy.
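
As a rough illustration of the idea, here is a minimal in-memory MapReduce sketch in pure Python that counts words. It is not taken from this repository: the function names (`map_phase`, `shuffle`, `reduce_phase`, `map_reduce`, `word_count_mapper`, `sum_reducer`) and the sample input are assumptions made up for this example. It uses only the standard library, so it should run unchanged under either CPython or PyPy.

```python
from collections import defaultdict
from itertools import chain


def map_phase(records, mapper):
    """Apply the mapper to every record and flatten the (key, value) pairs."""
    return chain.from_iterable(mapper(record) for record in records)


def shuffle(pairs):
    """Group values by key, as the shuffle step would do between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups, reducer):
    """Apply the reducer to each key and its list of values."""
    return {key: reducer(key, values) for key, values in groups.items()}


def map_reduce(records, mapper, reducer):
    """Run map, shuffle, and reduce entirely in memory."""
    return reduce_phase(shuffle(map_phase(records, mapper)), reducer)


# Hypothetical word-count job, used here only for illustration.
def word_count_mapper(line):
    for word in line.lower().split():
        yield word, 1


def sum_reducer(key, values):
    return sum(values)


if __name__ == "__main__":
    lines = ["the quick brown fox", "the lazy dog", "the fox"]
    print(map_reduce(lines, word_count_mapper, sum_reducer))
```

Saved under a file name of your choice (say, `wordcount.py`), this can be run with `pypy3 wordcount.py`. Loop-heavy pure-Python code like this is the kind of workload where PyPy's JIT tends to help, though any actual speedup depends on the data and the job.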

Languages

Python 95.1%, Shell 4.9%