claudio1975 / Customer_Lifetime_Value

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Unsupervised Learning applied to the Customer Lifetime Value (CLV)

The core business of Insurance Companies is to enable individuals and firms to protect themselves against rarely events paying a small premium compared to the eventually damage incurred.

Customer Lifetime Value (CLV) evaluates the value of the customer for the Company, in other words, it’s the Net Present Value of the cash flows ascribed to the relationship with a customer.

In this work, from the collection of portfolio contracts by one insurance year, will be predicted the Customer Lifetime Value of the last three months of the year, also looking at the effects coming from the use of Unsupervised Learning.

Unsupervised Learning describes tasks that involves using a model to discover a good internal representation of input data useful for subsequent Supervised Learning.

In this job Unsupervised Leaning are used to provide a low-dimensional representation of inputs and clustering numerical variables to provide a better portfolio analysis of customers. In both situations, Unsupervised Learning can be used as feature engineering.

Go to the notebooks:

Analysis

-> Exploratory Data Analysis

-> Dimensionality Reduction Analysis

-> Clustering Analysis

Recap Experiments

-> CLV Modelling

-> CLV prediction with reduced features space

-> CLV prediction with augmented features space by dimensionality reduction

-> CLV prediction with augmented features space by clustering

-> Clustering as Portfolio Analysis

Notebooks for each tool:

-> Flat (baseline) model

-> Principal Component Analysis

-> Autoencoder

-> Isomap

-> K-Means

-> DBSCAN

-> Gaussian Mixture Models

-> Hierarchical Clustering

Comparison between ML models to evaluate CLV prediction

Comparison between Flat model (baseline) and other models built only with extracted features from dimensionality reduction

Comparison between Flat model (baseline) and other augmented models with feature engineering from dimensionality reduction

Comparison between several clustering methods applied to numerical features

About


Languages

Language:Jupyter Notebook 100.0%