JCabral88 / Lab-Customer-Analysis-Round-1

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

logo_ironhack_blue 7

Lab | Customer Analysis Round 1

Remember the process:

  1. Case Study
  2. Get data
  3. Cleaning/Wrangling/EDA
  4. Processing Data
  5. Modeling
  6. Validation
  7. Reporting

Abstract

The objective of this data is to understand customer demographics and buying behavior. Later during the week, we will use predictive analytics to analyze the most profitable customers and how they interact. After that, we will take targeted actions to increase profitable customer response, retention, and growth.

For this lab, we will gather the data from 3 csv files that are provided in the files_for_lab folder. Use that data and complete the data cleaning tasks as mentioned later in the instructions.

Instructions

  • Read the three files into python as dataframes

  • Show the DataFrame's shape.

  • Standardize header names.

  • Rearrange the columns in the dataframe as needed

  • Concatenate the three dataframes

  • Which columns are numerical?

  • Which columns are categorical?

  • Understand the meaning of all columns

  • Perform the data cleaning operations mentioned so far in class

    • Delete the column education and the number of open complaints from the dataframe.
    • Correct the values in the column customer lifetime value. They are given as a percent, so multiply them by 100 and change dtype to numerical type.
    • Check for duplicate rows in the data and remove if any.
    • Filter out the data for customers who have an income of 0 or less.

About


Languages

Language:Jupyter Notebook 100.0%