Spark (Scala) app that joins tab-delimited master data with events and computes some aggregations.

Difficulties solved so far:
- Input is tab-delimited with no header row -> apply column names when transforming the RDD into a Dataset
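
A minimal sketch of how header-less, tab-delimited lines could be turned from an RDD into a typed Dataset. The `Event` case class, its field names and types, and the `TsvToDataset` object are hypothetical; the actual columns of the master data and events are not listed here. The sample lines are parallelized in place of a real input file so the sketch stays self-contained.

```scala
import org.apache.spark.sql.{Dataset, SparkSession}

// Hypothetical schema: the real column names and types are assumptions for illustration.
final case class Event(userId: String, eventType: String, amount: Double)

object TsvToDataset {
  // Parse header-less, tab-delimited lines; the case class supplies the column names.
  def toEvents(spark: SparkSession, lines: Seq[String]): Dataset[Event] = {
    import spark.implicits._                 // brings toDS() and the Event encoder into scope
    spark.sparkContext
      .parallelize(lines)                    // stands in for sparkContext.textFile(path)
      .map(_.split("\t", -1))                // limit -1 keeps trailing empty fields
      .filter(_.length == 3)                 // drop rows with the wrong number of fields
      .map(f => Event(f(0), f(1), f(2).toDouble)) // assumes the third field is numeric
      .toDS()                                // RDD[Event] -> Dataset[Event]
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("tsv-to-dataset")
      .master("local[*]")
      .getOrCreate()
    val events = toEvents(spark, Seq("u1\tclick\t1.5", "u2\tview\t0.0"))
    events.printSchema()                     // columns are named after the case class fields
    events.show()
    spark.stop()
  }
}
```

Once the columns have names, the Dataset can be joined with the master data and aggregated using the usual `join` and `groupBy` operations. If a typed case class is not needed, reading with `spark.read.option("sep", "\t").csv(path).toDF(...)` is an alternative way to attach column names.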