muleykeyy / EDA-ML

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

EDA-ML

Data Set : https://www.kaggle.com/datasets/kamilpytlak/personal-key-indicators-of-heart-disease?datasetId=1936563

Content:

  1. LOAD and FIRST LOOK to DATA

  2. VARIABLE DESCRIPTION

  3. MISSING VALUES

    • Find Missing Value
    • Fill Missing Value
  4. UNIVARIATE VARIABLE ANALYSIS

    • Categorical Variable
    • Numerical Variable
  5. BASIC DATA ANALYSIS

    • Converting to Ordinal Variable
    • HeartDisease-BMI
    • HeartDisease-SleepTime
    • Sex-MentalHealth
    • AgeCategory-MentalHealth
  6. OUTLIER DETECTION

    • Calculation of Interquartile Range
    • Finding Outlier Values
    • Solutions for Outlier Values
      • Trimming
      • Imputation
      • Winsorization
  7. VISUALIZATION

    • Correlation Between BMI-PhysicalHealth-MentalHealth-SleepTime
    • AGE
    • BMI
    • SMOKING
    • ALCOHOL
  8. MODELLING

    • PREPROCESSING
    • NORMALIZATION
    • TRAIN-TEST SPLIT
    • MODELS
    • ROC - AUC CURVE

About


Languages

Language:Jupyter Notebook 100.0%