As we can see the distribution of the classes is not balanced so we this give us indication of imbalancing problem should be resolved before modelling phase. Methods that could be used are:
- Customize the lose functions of models by giving a heigher weight to the lower class
- using Sampling techniques such as Upsampling or DownSampling
- using SMOTE which is not recommmanded in most cases due to the irrelavent data generated by it
Plot indicate the most affected people by the covid are the Women also they have the most recoverd percentage!
From the previous plots , we can notice that the data is not linearly seperable in the 2d dimension, may they are seperable in the heigher dimension, We can figure it out through out the analysis!



