pdoup / ddp-task-docker

Distributed Data Processing - Spring '22

ddp-task

Implement three join algorithms on two key-value stores (Redis and Apache Ignite) in a containerized environment:

  1. Double-Pipelined Hash Join
  2. Semi Join
  3. Two-Way Join Processing Using Bloom Filter Intersection
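
As an illustration of the first algorithm in the list above, here is a minimal in-memory sketch of a double-pipelined (symmetric) hash join. It is not the repository's implementation: plain `HashMap`s stand in for the Redis or Ignite stores, and the class name `PipelinedHashJoinSketch`, the methods `onLeft` and `onRight`, and the `key:left,right` output format are all hypothetical names chosen for this example. The idea shown is that each input keeps its own hash table, and every arriving tuple is first inserted into its own side's table and then probed against the other side's table, so results are produced as soon as both matching tuples have arrived.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** In-memory sketch of a double-pipelined (symmetric) hash join on string keys. */
final class PipelinedHashJoinSketch {
    // One hash table per input; each maps a join key to the values seen so far.
    // Assumption: in a real setup these would live in Redis or Ignite.
    private final Map<String, List<String>> leftTable = new HashMap<>();
    private final Map<String, List<String>> rightTable = new HashMap<>();

    /** Consumes one left tuple and returns the join results it produces. */
    List<String> onLeft(String key, String value) {
        // Build step: remember the tuple for later arrivals from the right.
        leftTable.computeIfAbsent(key, k -> new ArrayList<>()).add(value);
        // Probe step: match against everything already seen on the right.
        List<String> results = new ArrayList<>();
        for (String right : rightTable.getOrDefault(key, Collections.emptyList())) {
            results.add(key + ":" + value + "," + right);
        }
        return results;
    }

    /** Consumes one right tuple and returns the join results it produces. */
    List<String> onRight(String key, String value) {
        rightTable.computeIfAbsent(key, k -> new ArrayList<>()).add(value);
        List<String> results = new ArrayList<>();
        for (String left : leftTable.getOrDefault(key, Collections.emptyList())) {
            results.add(key + ":" + left + "," + value);
        }
        return results;
    }
}
```

Tuples from the two inputs can be fed in any interleaving. Each matching pair is emitted exactly once, by whichever of the two tuples arrives second.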

TODO

  • Implement pipelined hash-join
  • Add log level for verbose printing
  • Implement semi-joins
  • Implement Bloom filter joins
  • Incorporate my implementation of Bloom Filter
  • Add functionality to specify number of characters to store as key
  • Add comments
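
For the Bloom filter items in the list above, the following is a minimal sketch of a Bloom filter with an intersection operation, which is the building block of a Bloom filter intersection join. It is an assumption-laden illustration, not the author's implementation: the class name `BloomFilterSketch`, its methods, and the choice of hash functions (`Arrays.hashCode` combined with FNV-1a through double hashing) are all made up for this example. Two filters can only be intersected when they share the same size and number of hash functions.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.BitSet;

/** Minimal Bloom filter sketch with bitwise intersection (hypothetical API). */
final class BloomFilterSketch {
    private final BitSet bits;
    private final int size;       // number of bits in the filter
    private final int hashCount;  // number of hash functions per key

    BloomFilterSketch(int size, int hashCount) {
        if (size <= 0 || hashCount <= 0) {
            throw new IllegalArgumentException("size and hashCount must be positive");
        }
        this.size = size;
        this.hashCount = hashCount;
        this.bits = new BitSet(size);
    }

    /** 32-bit FNV-1a hash, used as the second hash for double hashing. */
    private static int fnv1a(byte[] data) {
        int hash = 0x811c9dc5;
        for (byte b : data) {
            hash ^= (b & 0xff);
            hash *= 16777619;
        }
        return hash;
    }

    /** Bit position for the i-th hash function: (h1 + i * h2) mod size. */
    private int index(byte[] data, int i) {
        int h1 = Arrays.hashCode(data);
        int h2 = fnv1a(data);
        return Math.floorMod(h1 + i * h2, size);
    }

    void add(String key) {
        byte[] data = key.getBytes(StandardCharsets.UTF_8);
        for (int i = 0; i < hashCount; i++) {
            bits.set(index(data, i));
        }
    }

    /** False means definitely absent; true means possibly present. */
    boolean mightContain(String key) {
        byte[] data = key.getBytes(StandardCharsets.UTF_8);
        for (int i = 0; i < hashCount; i++) {
            if (!bits.get(index(data, i))) {
                return false;
            }
        }
        return true;
    }

    /** Returns a new filter whose bits are the bitwise AND of both filters. */
    BloomFilterSketch intersect(BloomFilterSketch other) {
        if (size != other.size || hashCount != other.hashCount) {
            throw new IllegalArgumentException("filters must share size and hashCount");
        }
        BloomFilterSketch result = new BloomFilterSketch(size, hashCount);
        result.bits.or(this.bits);
        result.bits.and(other.bits);
        return result;
    }
}
```

In a join, each side would build a filter over its join keys, the two filters would be intersected, and only tuples whose key passes the intersected filter would be shipped. Keys present on both sides always pass. Some non-matching keys may pass too, so the final join still has to check the actual keys.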


Languages

  • Java 98.2%
  • Shell 1.1%
  • Dockerfile 0.7%