timothymiko / CarAuctionEvaluator

The goal of this project is to predict if a car purchased at an auction is a good or bad buy.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CarAuctionEvaluator

The goal of this project is to use machine learning to build a predictive model to help auto dealers avoid purchasing potentially bad vehicles. The model will be based on a set of 72,000 historical records that contains 32 attributes as well as a unique id for each purchase and a label indicating if the vehicle was a bad buy or not.

This project is for the Kaggle Don't Get Kicked! challenge.

Preliminary Analysis

  • Bad Buy Percentage: 12.30%
  • Years range from 2001 to 2010. The median year was 2005 and the mean year was 2005. The most popular year is 2006.
  • Age ranges from 0 to 9 years old. The median age was 4 and the mean age was 4.
  • The most popular make is CHEVROLET.
  • The most popular color is SILVER.
  • Odometer readings range from 4825 to 115717 miles. The median reading was 73363 and the mean reading was 71502.

Strategies

  • Random Forest
  • Naive Bayes
  • Neural Network
  • Decision Tree
  • Adaboost (base=Decision Tree)

I ultimately chose to use the Adaboost algorithm as decision trees were producing the best results. I focused on maximizing the accuracy of predicting bad buys without taking a huge hit to the accuracy of the good buys.

Dependencies

About

The goal of this project is to predict if a car purchased at an auction is a good or bad buy.

License:MIT License


Languages

Language:Python 100.0%