Prashant812 / Data-Visualization

Data Visualization and Statistics using Pandas

Data-visualization

Data Visualization and Statistics using Pandas. In this repository, I have used the Pima Indians Diabetes Database as a CSV file. This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective is to predict, based on diagnostic measurements, whether a patient has diabetes. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage. It contains the following 9 attributes:
1. pregs: Number of times pregnant
2. plas: Plasma glucose concentration at 2 hours in an oral glucose tolerance test
3. pres: Diastolic blood pressure (mm Hg)
4. skin: Triceps skin fold thickness (mm)
5. test: 2-hour serum insulin (mu U/mL)
6. BMI: Body mass index (weight in kg / (height in m)^2)
7. pedi: Diabetes pedigree function
8. Age: Age (years)
9. class: Class variable (0 or 1)
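
As a rough illustration of how these nine attributes might be loaded and summarized with Pandas, here is a minimal sketch. It is not taken from the repository's notebooks: the helper names `load_diabetes` and `summarize` are hypothetical, the CSV is assumed to have no header row, and the two inline rows are made-up placeholders (not real patient records) used only so the snippet runs on its own.

```python
import io

import pandas as pd

# Column names follow the attribute list above. That the CSV has no header
# row is an assumption; drop header=None/names=... if the file has one.
COLUMNS = ["pregs", "plas", "pres", "skin", "test", "BMI", "pedi", "Age", "class"]


def load_diabetes(source):
    """Read the CSV from a file path or file-like object into a DataFrame."""
    return pd.read_csv(source, header=None, names=COLUMNS)


def summarize(df):
    """Return overall descriptive statistics and per-class feature means."""
    overall = df.describe()                # count, mean, std, min, quartiles, max
    by_class = df.groupby("class").mean()  # mean of each feature for class 0 and 1
    return overall, by_class


# Two made-up placeholder rows, only to demonstrate the call pattern.
sample = io.StringIO(
    "1,100,70,20,80,25.0,0.5,30,0\n"
    "2,150,80,30,120,35.0,0.7,45,1\n"
)
df = load_diabetes(sample)
overall, by_class = summarize(df)
print(overall.loc[["mean", "std"]])
print(by_class)
```

With the real file, the `io.StringIO` object would be replaced by the path to the CSV. Comparing the per-class means is one simple way to see which measurements differ between the two values of `class` before plotting anything.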

Languages

Jupyter Notebook 98.4%, Python 1.6%