Data Wrangling and small dataset insights.
“Data is not always clean like Kaggle Dataset” - someone on the internet
- Check data shape (num of Rows & Columns)
- Check each data type of columns and missing values
- Splitting values
- Change the data type
- Check the percentages of missing value
- Summary Statistics
- Check value counts for a specific column
- Check duplicate values and deal with it
- See the data distribution and data anomaly
- Check the correlation between variables in the data
- Based on above pointers, proceed to specific direction like
- Start to replace, transform and modify the data to make it ready for further analysis.