jsousa-Resulto / teched2020-DAT263

DAT263 - Create and Manage End-to-End Data Pipelines with SAP Data Intelligence

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DAT263 - Create and Manage End-to-End Data Pipelines with SAP Data Intelligence

REUSE status

Description

This repository contains the material for the SAP TechEd 2020 session called Session ID - DAT263.

Overview

This session introduces attendees to use the SAP Data Intelligence Modeler to create data pipelines. We try to touch as many aspects as possible within an interactive 2h workshop. We will follow a use case that is based on a customer request in the area of IoT and quality management. The background story is quite simple. There are two workstreams to choose from

  • 2h workshop
  • 3h workshop with 2 more exercises (File concatenation and Jupyter Notebook analysis)

Setup

The customer gets on a daily basis the configuration of several devices that means the nominal value the device should produce. During the day additional files are received that contain the actual values of each device. All files are stored in an object store in a separate folder.

Process

  1. Append all configuration files and all performance files into corresponding single files and store them to another object store location. (3h Workshop only)

  2. Merge the 2 resulting files into a HANA table by using projections, aggregation and joining.

  3. Do a simple data validation and create for the failed data a quality management service ticket.

  4. In order to improve the quality check a data scientist should be able to do an analysis of the IoT data to eventually developer an early alert schema (3h Workshop only).

  5. The central device configuration and performance table should be exposed via a webservice to retrieve the device status from outside.

Acquired Skills

After having done all the tasks you are familiar with the general concept of **SAP Data Intelligence Modeler **

Requirements

Exercises

2h Workshop

3h Workshop

How to obtain support

Support for the content in this repository is available during the actual time of the online session for which this content has been designed. Otherwise, you may request support via the Issues tab.

License

Copyright (c) 2020 SAP SE or an SAP affiliate company. All rights reserved. This file is licensed under the Apache Software License, version 2.0 except as noted otherwise in the LICENSE file.

About

DAT263 - Create and Manage End-to-End Data Pipelines with SAP Data Intelligence


Languages

Language:Jupyter Notebook 99.1%Language:Python 0.9%