ali-unlu / EDA-Car-Dataset

In this analysis, I will demonstrate how to explore the cars dataset using Python. I will first load the dataset and then process the data. I will also visualize the dataset and then finally apply ttest to explore the relationship between variables.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

EDA Car Dataset

In this analysis, I will demonstrate how to explore the cars dataset using Python. I will first load the dataset and then process the data. I will also visualize the dataset and then finally apply ttest to explore the relationship between variables. When the final dataset is prepared, the same dataset can be used to develop various models.

Briefly:

  1. I converted some variables from chr format into numbers. To do so, I first removed dollar and punction from the data and then changed the data type.
  2. I replaced missing variable with the set median.
  3. I identified outliers and then remove some of them. I also prepared some codes for data normalization procedure.
  4. I created three graphs about the dataset.
  5. I created three hypothesis testing models and used ttest to compare groups' means.
  6. I also created boxplots to show the ttest results in context.

To see the file, please click this link.

About

In this analysis, I will demonstrate how to explore the cars dataset using Python. I will first load the dataset and then process the data. I will also visualize the dataset and then finally apply ttest to explore the relationship between variables.


Languages

Language:Jupyter Notebook 100.0%