This is a machine learning classification model that predicts whether a person has a salary of more than 50,000 dollars, given background information such as age, education, gender, marital status, siblings, work class, etc. It is implemented in Python.
-
The dataset (adult_train.txt and adult_test.txt) includes 48,842 samples and 14 features.
-
Five different models are trained: Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, and Neural Network.
-
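As a rough illustration of training the five model families listed above, here is a minimal sketch using scikit-learn. The library choice, the hyperparameters, and the synthetic 14-feature data standing in for the preprocessed adult_train.txt are all assumptions; the project's actual preprocessing and settings may differ. The scale-sensitive models are wrapped in a pipeline with StandardScaler, which is a common choice rather than something the project states.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the encoded Adult data (assumption: 14 numeric
# features, with the positive ">50K" class in the minority).
X, y = make_classification(
    n_samples=600, n_features=14, n_informative=6,
    weights=[0.75, 0.25], random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# One estimator per model family; all hyperparameters are illustrative.
models = {
    "Logistic Regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "Support Vector Machine": make_pipeline(StandardScaler(), SVC()),
    "Decision Tree": DecisionTreeClassifier(max_depth=8, random_state=0),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "Neural Network": make_pipeline(
        StandardScaler(),
        MLPClassifier(hidden_layer_sizes=(32,), max_iter=500, random_state=0),
    ),
}

# fit() returns the fitted estimator, so the dict holds trained models.
fitted = {name: model.fit(X_train, y_train) for name, model in models.items()}

for name, model in fitted.items():
    print(f"{name}: test accuracy = {model.score(X_test, y_test):.3f}")
```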
The performance of each model is evaluated using F1 scores, and a normalized confusion matrix is plotted.
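
The evaluation step can be sketched as follows, assuming scikit-learn and matplotlib. The helper names `evaluate` and `plot_normalized_confusion`, the class labels `<=50K` / `>50K`, and the toy predictions are hypothetical and only serve the illustration. Here "normalized" is taken to mean row-normalized (each row of true labels sums to 1), which is one common convention; the project may normalize differently.

```python
import numpy as np
from sklearn.metrics import confusion_matrix, f1_score

LABELS = ["<=50K", ">50K"]  # assumed display names for classes 0 and 1


def evaluate(y_true, y_pred):
    """Return the F1 score for class 1 and the row-normalized confusion matrix."""
    f1 = f1_score(y_true, y_pred, pos_label=1)
    cm = confusion_matrix(y_true, y_pred, labels=[0, 1], normalize="true")
    return f1, cm


def plot_normalized_confusion(cm, title, path):
    """Save a heatmap of the normalized confusion matrix (requires matplotlib)."""
    import matplotlib
    matplotlib.use("Agg")  # render to a file without needing a display
    from sklearn.metrics import ConfusionMatrixDisplay

    disp = ConfusionMatrixDisplay(confusion_matrix=cm, display_labels=LABELS)
    disp.plot(cmap="Blues", values_format=".2f")
    disp.ax_.set_title(title)
    disp.figure_.savefig(path, bbox_inches="tight")


# Toy labels and predictions standing in for one model's test-set output.
y_true = np.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1])
y_pred = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 0])

f1, cm = evaluate(y_true, y_pred)
print(f"F1 = {f1:.3f}")  # F1 = 0.750 (precision 3/4, recall 3/4)
print(cm)                # rows are true classes; each row sums to 1
```

Calling `plot_normalized_confusion(cm, "Random Forest", "rf_confusion.png")` would write the plot to a file; running `evaluate` once per trained model gives a per-model F1 comparison.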