Rohit7594 / Predicting-the-Sale-Price-of-Bulldozers-Kaggle

Predict the Sale Price of Bulldozers (Regression Problem) The data and evaluation metric to be used is root mean square log error or RMSLE mentioned in Kaggle Bluebook for Bulldozers competition.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Predict the Sale Price of Bulldozers (Regression Problem)

The data and evaluation metric to be used is root mean square log error or RMSLE mentioned in Kaggle Bluebook for Bulldozers competition.

1. Problem Definition

The goal is to predict the future sale price of a bulldozer based on its characteristics. (Given previous sales price & specification of similar types of bulldozers.)

2. Data

dataset from Kaggle

In the data set, historical sales data of bulldozers. Include things like, model type, size, sale date and more.

There are 3 datasets:

  1. Train.csv - Historical bulldozer sales examples up to 2011 (close to 400,000 examples with 50+ different attributes, including SalePrice which is the target variable).
  2. Valid.csv - Historical bulldozer sales examples from January 1 2012 to April 30 2012 (close to 12,000 examples with the same attributes as Train.csv).
  3. Test.csv - Historical bulldozer sales examples from May 1 2012 to November 2012 (close to 12,000 examples but missing the SalePrice attribute, as this is what we'll be trying to predict).

3. Evaluation

For this problem, Kaggle has set the evaluation metric to being root mean squared log error (RMSLE). As with many regression evaluations, the goal will be to get this value as low as possible.

4. Features

For this dataset, Kaggle provide a data dictionary which contains information about what each attribute of the dataset means. You can download this file directly from the Kaggle competition page

About

Predict the Sale Price of Bulldozers (Regression Problem) The data and evaluation metric to be used is root mean square log error or RMSLE mentioned in Kaggle Bluebook for Bulldozers competition.


Languages

Language:Jupyter Notebook 100.0%