Here vehicle_prices is used to perform EDA
, find features by feature engineering
that affects price
of vehicles and create a regression
model that can predict price of vechicles.
While doing this we'll go through
- Feature engineering on
categorical
andcontinuous
features - Dealing with
multi-collinearity
issues - Dealing with
outliers
The notebook is available on Kaggle to work in the same environment where this notebook was created i.e. use the same version packages used, etc...
Correlation matrix
Multi collinearity issue
Price distribution
Correlation matrix for vehicle prices
To see more go to the notebook.
Learning curve
RMS error and R2 square metrics
Actual Vs Predicted values