Predicting if person's yearly income will be more than $50K or less.
Analyzing income of the most similar people in data set. Similarity is counted as sum of gradation (Euclidean distance), power gradation (Minkowski distance) and equality of every of 14 essential parameters such as education level, marital status, sex etc.
Data set: https://archive.ics.uci.edu/ml/datasets/adult
There are two running modes:
- -r [path] - learning mode, reads passed CSV file and adds it to known data set.
- -c [path] - checking mode, checks given people and predicts their income
Here are sample learning and checking data files.
P.S Take a break when reading sample learning data of 32K people :)