nishchaychawla / EDA-White-Wine-Dataset-using-R

Univariate, bivariate, and multivariate analysis of a white wine data set using R.


EDA-White-Wine-Dataset-using-R

Overview:

Exploratory data analysis (EDA) is an approach to analysing data sets that summarizes their main characteristics and general trends using visualizations and summary statistics. This project is an exploratory data analysis of a tidy data set of 4,898 white wines (observations), with 11 variables quantifying the chemical properties of each wine. At least 3 wine experts rated the quality of each wine on a scale from 0 (very bad) to 10 (very excellent). The project includes univariate, bivariate, and multivariate analysis of the data, mainly using the ggplot2 package.
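The three levels of analysis could be sketched with ggplot2 roughly as follows. This is an illustration only, not the code from the project's R Markdown file: the column names (`quality`, `alcohol`, `residual.sugar`, `density`) are assumed from the Udacity copy of the data set, and the helper function names are hypothetical.

```r
library(ggplot2)

# Univariate: distribution of a single variable (the quality ratings).
plot_univariate <- function(wine) {
  ggplot(wine, aes(x = factor(quality))) +
    geom_bar() +
    labs(x = "Quality rating", y = "Number of wines")
}

# Bivariate: one variable against another (alcohol content per quality rating).
plot_bivariate <- function(wine) {
  ggplot(wine, aes(x = factor(quality), y = alcohol)) +
    geom_boxplot() +
    labs(x = "Quality rating", y = "Alcohol (% by volume)")
}

# Multivariate: two chemical properties, with quality mapped to colour.
plot_multivariate <- function(wine) {
  ggplot(wine, aes(x = residual.sugar, y = density, colour = factor(quality))) +
    geom_point(alpha = 0.5) +
    labs(x = "Residual sugar", y = "Density", colour = "Quality rating")
}

# Assumes the CSV sits in the working directory; nothing is drawn otherwise.
csv_path <- "wineQualityWhites.csv"
if (file.exists(csv_path)) {
  wine <- read.csv(csv_path)
  print(plot_univariate(wine))
  print(plot_bivariate(wine))
  print(plot_multivariate(wine))
}
```

Treating `quality` as a factor keeps the ratings as discrete categories on the axis and in the colour legend, which suits an integer score better than a continuous scale.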

This data set is provided by Udacity using the following research article: Cortez, P.; Cerdeira, A.; Almeida, F.; Matos, T.; Reis, J. Modeling wine preferences by data mining from physicochemical properties. Decision Support Systems. 2009, 47, 547-553.

Files:

EDA_Wh_Wine_Nishchay.rmd: R Markdown file containing all the code that generated the graphs shown in EDA_pdf.pdf.

EDA_pdf.pdf: PDF copy of the project knitted from the rmd file, containing all the graphs, analysis, and summary of the project.

wineQualityWhites.csv: Data set for the project.
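A quick sanity check after loading the CSV can confirm that it matches the description in the overview. The following is a minimal sketch in base R, assuming the file sits in the working directory; the helper name `check_wine_data` is hypothetical, the `quality` column name is an assumption, and the default of 4,898 rows is taken from the overview.

```r
# Hypothetical helper: read the CSV and compare it with the README's description.
check_wine_data <- function(path, expected_rows = 4898) {
  wine <- read.csv(path)
  list(
    rows_match = nrow(wine) == expected_rows,    # one row per wine
    has_quality = "quality" %in% names(wine),    # assumed name of the rating column
    missing_values = sum(is.na(wine))            # total NA cells in the table
  )
}

# Runs only if the data file is present in the working directory.
csv_path <- "wineQualityWhites.csv"
if (file.exists(csv_path)) {
  str(check_wine_data(csv_path))
}
```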
