radikz / us-insurance

Exploring real-world medical insurance costs dataset using Python for independent analysis and insights

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

U.S. Medical Insurance Costs

The goal with this project will be to analyze various attributes within insurance.csv to learn more about the user information in the file and gain insight into potential use cases for the dataset.

insurance.csv contains the following columns:

  • Patient Age
  • Patient Sex
  • Patient BMI
  • Patient Number of Children
  • Patient Smoking Status
  • Patient U.S Geopraphical Region
  • Patient Yearly Medical Insurance Cost

There are no signs of missing data. To store this information, seven empty lists will be created hold each individual column of data from insurance.csv.

The following operations will be implemented:

  • Find out the average age of the patients in the dataset.
  • Analyze where a majority of the individuals are from.
  • Return the number of smokers vs. non-smokers counted in the dataset
  • Analyze the smokers then grouping based on the age of users
  • creating a dictionary that contains all patient information

About

Exploring real-world medical insurance costs dataset using Python for independent analysis and insights


Languages

Language:Jupyter Notebook 100.0%