Extended NexMark Query 13 from Beam
gangliao opened this issue · comments
-- -------------------------------------------------------------------------------------------------
-- Query 13: Bounded Side Input Join (Not in original suite)
-- -------------------------------------------------------------------------------------------------
-- Joins a stream to a bounded side input, modeling basic stream enrichment.
-- -------------------------------------------------------------------------------------------------
I like this query. And I will implement this query to join a stream with a bounded side input in S3. It's an important feature that is also highlighted in Kinesis Data Streams.
For this query, I think we can use the recent S3 feature --- S3 Object Lambda.
https://aws.amazon.com/cn/blogs/aws/introducing-amazon-s3-object-lambda-use-your-code-to-process-data-as-it-is-being-retrieved-from-s3/
Q13 is actually equivalent to the YSB benchmark since they both need to read a side input (lookup table) from external storage.
- Replace YSB Redis with AWS S3
- Replace NEXMark local file with AWS S3
- Support side-input query in benchmark