SweetAdjPotato / Arvato_Customer_Segmentation

Customer segmentation report and customer conversion probability prediction

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Arvato_Customer_Segmentation

Table of Contents

  1. Installation
  2. Introduction
  3. File Descriptions
  4. Results
  5. Licensing, Authors, and Acknowledgements

Installation

  1. Python versions 3.7
  2. Library and packages: numpy, pandas, matplotlib, sklearn, seaborn, scipy, xgboost, bayes_opt, pickle, time

Introduction

In this project, I analyzed demographics data for customers of a mail-order sales company in Germany, comparing it against demographics information for the general population. I first used unsupervised learning techniques (PCA and K-Means) to perform customer segmentation, identifying the parts of the population that best describe the core customer base of the company. Then, I applied what I've learned on a third dataset with demographics information for targets of a marketing campaign for the company, and use a supervised learning model (XGBClassifier) to predict which individuals are most likely to convert into becoming customers for the company. The data used is provided by Udacity partners at Bertelsmann Arvato Analytics, and represents a real-life data science task. My main deliverable for this project will be a blog post on medium reporting important findings and conclusions.

File Descriptions

Data cannot be shared due to a confidentiality agreement. There is one notebook Arvato_Project_Workbook.ipynb, with all the functions needed are written in the notebook along with the code.

Results

The main findings of the code can be found at a Medium post available here. The supervised learning model is evaluated in a Kaggle competition.

Licensing, Authors, and Acknowledgements

License: MIT

Acknowledge to Udacity and Arvato Financial Services for the data.

Acknowledge to a few repos I studied on Github: [1], [2], [3].

About

Customer segmentation report and customer conversion probability prediction

License:MIT License


Languages

Language:Jupyter Notebook 100.0%