Amazon Managed Service for Apache Flink was formerly known as Amazon Kinesis Data Analytics.
This sample is an Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data stream, processes and aggregates them, and publishes the results to Amazon CloudWatch for visualization.
The Beam pipeline can also run in batch mode for backfilling, reading a finite dataset from S3.
Application configuration
- Group ID: `BeamApplicationProperties`
- `InputStreamName`: name of the input Kinesis stream; ignored if `Source=s3`
- `OutputBoroughs`: `true` or `false`; counts trips per borough
- `Source`: either `kinesis`, for stream processing mode, or `s3`, for batch mode
- `InputS3Pattern`: pattern to read backfilling input in batch mode, `s3://<bucket-name>/<path>/*/*/*/*/*`; ignored if `Source=kinesis`
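
As an illustration, the properties above could be supplied as a runtime property group along the following lines. This is a minimal sketch that assumes the `PropertyGroupId` / `PropertyMap` JSON form used for Managed Service for Apache Flink runtime properties. `<stream-name>` is a placeholder, not a value taken from this sample, and all property values are passed as strings.

```json
[
  {
    "PropertyGroupId": "BeamApplicationProperties",
    "PropertyMap": {
      "Source": "kinesis",
      "InputStreamName": "<stream-name>",
      "OutputBoroughs": "true"
    }
  }
]
```

For a backfill run in batch mode, the same group would presumably set `Source` to `s3` and add `InputS3Pattern` with a value such as `s3://<bucket-name>/<path>/*/*/*/*/*`; `InputStreamName` is then ignored.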
This sample code is made available under the MIT-0 license. See the LICENSE file.