deepak525 / Boston-Housing-Prices

Predict the best selling price of a new home in Boston

Boston-Housing-Prices

This project applies basic machine learning concepts to data collected on housing prices in the Boston, Massachusetts area in order to predict the selling price of a new home.

Software and Libraries

This project uses the following software and Python libraries:

  • Python 2.7
  • NumPy
  • pandas
  • matplotlib
  • scikit-learn

Introduction

The dataset for this project originates from the UCI Machine Learning Repository. The Boston housing data was collected in 1978, and each of the 506 entries represents aggregated data about 14 features for homes from various suburbs in Boston, Massachusetts. For the purposes of this project, the following preprocessing steps have been applied to the dataset:

  • 16 data points have an 'MDEV' value of 50.0. These data points likely contain missing or censored values and have been removed.
  • 1 data point has an 'RM' value of 8.78. This data point can be considered an outlier and has been removed.
  • The features 'RM', 'LSTAT', 'PTRATIO', and 'MDEV' are essential. The remaining features are not relevant to this project and have been excluded.
  • The feature 'MDEV' has been multiplicatively scaled to account for 35 years of market inflation.
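The four steps above could be reproduced with pandas roughly as follows. This is only a sketch under stated assumptions, not necessarily how the dataset in this repository was prepared: the function name `preprocess_housing` and the `inflation_factor` parameter are hypothetical, and the README does not state the multiplier that was actually used. Column names follow the spelling used in this README ('MDEV'); the original UCI data names the median-value column 'MEDV', so adjust the names to match your file.

```python
import pandas as pd

# Column names as spelled in this README; change them if your file differs.
FEATURES = ['RM', 'LSTAT', 'PTRATIO']
TARGET = 'MDEV'


def preprocess_housing(raw, inflation_factor):
    """Apply the four preprocessing steps listed above to a raw DataFrame.

    `inflation_factor` is a hypothetical parameter: the README does not give
    the multiplier that was used, so the caller has to supply one.
    """
    # 1. Drop rows whose target equals 50.0 (likely missing or censored).
    data = raw[raw[TARGET] != 50.0]
    # 2. Drop the outlier with an 'RM' value of 8.78.
    data = data[data['RM'] != 8.78]
    # 3. Keep only the three retained features and the target.
    data = data[FEATURES + [TARGET]].copy()
    # 4. Scale the target multiplicatively to account for inflation.
    data[TARGET] = data[TARGET] * inflation_factor
    return data.reset_index(drop=True)


# Toy rows invented purely for illustration; they are not from the dataset.
toy = pd.DataFrame({
    'RM': [6.5, 8.78, 7.0],
    'LSTAT': [5.0, 4.0, 3.0],
    'PTRATIO': [15.0, 16.0, 17.0],
    'TAX': [300.0, 310.0, 250.0],   # stands in for an excluded feature
    'MDEV': [24.0, 30.0, 50.0],
})

# 2.0 is a placeholder multiplier, not the value used in this project.
clean = preprocess_housing(toy, inflation_factor=2.0)
```

On the toy frame, the second row is dropped as the 'RM' outlier, the third as a censored 'MDEV' value, and the 'TAX' column is excluded. On the real data, pass the DataFrame loaded from the raw file instead of `toy`.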

Languages

  • HTML 66.5%
  • Jupyter Notebook 32.9%
  • Python 0.6%