I try to simulate a multiple linear regression which I have downloaded the data from Kaggle (https://www.kaggle.com/elikplim/eergy-efficiency-dataset?select=ENB2012_data.csv). I try to find out the relationship between these variables:;
- X1 Relative Compactness
- X2 Surface Area
- X3 Wall Area
- X4 Roof Area
- X5 Overall Height
- X6 Orientation
- X7 Glazing Area
- X8 Glazing Area Distribution
- y1 Heating Load
- Import related libraries
- Extracting into data panda frames
- Define X and Y variables through this line of code
- Run this code to get the result of OLS. All variables are significant with high Rsquare stood at 91,6%
lm_multi = sn.OLS(y_multi, X_multi_cons).fit()
lm_multi.summary()
- Visualize the graph in 3D graph