fasl / Client-Retention-Through-Data-Analysis-On-zOS

Analyze customer churn data using Python and Jupyter Notebooks via IBM Open Data Analytics for z/OS

Home Page:https://developer.ibm.com/patterns/client-retention-through-data-analysis-on-zos/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

The client retention demo

This demo is an exploratory Jupyter Notebook. It analyzes customer data and customer credit card transactions data on the mainframe to discover potential factors for why they would leave the bank.

So let's set up the back story for this demo: You are a bank that is experiencing customer retention issues, or in data scientist terms, customer churn. Now, the data science team (you) will need to analyze the data to see what features are good indicators of potential churn.

This demo is written in Python and includes many of the top Python libraries like Pandas and MatPlotLib. It also does machine learning with Scikit-learn.

flow

Flow

  1. A data scientist interacts with a web UI.
  2. The web UI is the browser-based user interface for Jupyter Notebook.
  3. Jupyter Notebook is part of IzODA, running on z/OS.
  4. A VSAM dataset is accessed via a virtural table defined in Optimized Data Layer (ODL).
  5. DSDBC queries ODL for the VSAM date.

Included components

  • IzODA: An assembly of key open source and proprietary technologies that allow data scientists and application developers to analyze and visualize the large volumes of data hosted on IBM Z.
  • Conda: An open source package management system and environment management system.
  • Jupyter Notebook: An open source web application that allows you to create and share documents that contain live code, equations, visualizations, and explanatory text.
  • Pandas: An open source, BSD-licensed library that provides high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
  • Scikit-learn: A simple and efficient set of tools for data mining and data analysis.
  • MatPlotLib: A Python 2D plotting library that produces publication quality figures in a variety of hard-copy formats and interactive environments across platforms.
  • Seaborn: A Python data visualization library that's based on MatPlotLib. It provides a high-level interface for drawing attractive and informative statistical graphics.
  • Numpy: The fundamental package for scientific computing with Python. It contains among other things:
  • DSDBC
  • ODL: The Optimized Data Layer is also known as the IBM Mainframe Data Service for Apache Spark (MDS, or MDSS).

Featured technologies

  • Artificial Intelligence: Can be applied to disparate solution spaces to deliver disruptive technologies.
  • Data Science: Systems and scientific methods for analyzing structured and unstructured data in order to extract knowledge and insights.
  • Python: A programming language that lets you work more quickly and integrate your systems more effectively.

Steps

Either run locally:

  1. Clone the repo
  2. Run Jupyter Notebook

Run locally

1. Clone the repo

Clone the Client-Retention-Through-Data-Analysis-On-zOS locally. In a terminal, run:

$ git clone git@github.com:IBM/Client-Retention-Through-Data-Analysis-On-zOS.git

2. Run Jupyter Notebook

The code included in this code pattern runs in a Jupyter Notebook.

  • Start your Jupyter Notebooks. Starting in your Client-Retention-Through-Data-Analysis-On-zOS cloned repo directory will help you find the notebook and the output as described below. Jupyter Notebooks will open in your browser.

    cd Client-Retention-Through-Data-Analysis-On-zOS
    jupyter notebook
    
  • Navigate to the notebooks directory and open the notebook file named client_retention_demo.ipynb by clicking on it.

Links

Learn more

  • IBM Open Data Analytics for z/OS: Read more here.
  • Artificial intelligence code patterns: Enjoyed this code pattern? Check out our other AI code patterns.
  • Data analytics code patterns: Enjoyed this code pattern? Check out our other data analytics code patterns.
  • AI and data code pattern playlist: Bookmark our playlist with all of our code pattern videos.

License

This code pattern is licensed under the Apache Software License, Version 2. Separate third party code objects invoked within this code pattern are licensed by their respective providers pursuant to their own separate licenses. Contributions are subject to the Developer Certificate of Origin, Version 1.1 (DCO) and the Apache Software License, Version 2.

Apache Software License (ASL) FAQ

About

Analyze customer churn data using Python and Jupyter Notebooks via IBM Open Data Analytics for z/OS

https://developer.ibm.com/patterns/client-retention-through-data-analysis-on-zos/

License:Apache License 2.0


Languages

Language:Jupyter Notebook 100.0%