lkuffo / lsde2021-assignment1

Large Scale Data Engineering 2021 – Assignment 1

https://event.cwi.nl/lsde/2021/practical1.shtml

⚠️ Warning: the code pushed to the master branch is executed on the leaderboard machine. This might take a significant amount of time and thus increase queueing times. Therefore, please work on a development branch (e.g. dev) and only merge & push to master when you would like to benchmark your code on the leaderboard machine.
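The workflow in the warning above can be sketched as a small shell helper. This is only an illustration: the function name `publish_to_master` is hypothetical, and it assumes the remote is called `origin`, the development branch already exists locally, and the working tree is clean.

```shell
# Hypothetical helper (not part of the assignment code): merge a development
# branch into master and push master, which queues a leaderboard run.
# Assumes a remote named "origin" and a clean working tree.
#
# Day-to-day work stays on the development branch, for example:
#   git checkout -b dev        # create the branch once
#   git push -u origin dev     # share work without pushing to master
# When a benchmark run is wanted:
#   publish_to_master dev
publish_to_master() {
    ptm_branch="${1:-dev}"               # development branch, defaults to "dev"
    git checkout master &&
    git merge --no-edit "$ptm_branch" && # bring the development work into master
    git push origin master &&            # this push is what gets benchmarked
    git checkout "$ptm_branch"           # return to the development branch
}
```

Because each step is chained with `&&`, a failed merge stops the helper before anything is pushed to master.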

Assignments 1a and 1b

To build and run the cruncher binary on SF100, use:

make cruncher
./cruncher /opt/lsde/dataset-sf100-bidirectional/ queries-test.csv out.csv
# in a separate terminal
tail -f out.csv

In 1a, your task is to optimize the code in cruncher.c so that it finishes within the timeout on the leaderboard machine.

In 1b, your task is to implement the reorganizer (reorg.c), adjust the code in cruncher.c accordingly, and potentially add further optimizations.

Assignment 1c

See the spark/ directory.
