NishadiSS / Diabetes-Prediction-system

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Diabetes-Prediction-system

💫ABOUT DATASET

This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective of the dataset is to diagnostically predict whether a patient has diabetes based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.

💫COLUMN DESCRIPTION FOR DIABETES DATA:

• Pregnancies

• Glucose

• Blood Pressure

• Skin Thickness

• Insulin

• BMI

• Diabetes

• Age

• Outcome

From the data set in the (.csv) File We can find several variables, some of them are independent (several medical predictor variables) and only one target dependent variable (Outcome).

📲STEP 1:

• I downloaded and opened my dataset and attempted to understand the type of analysis expected.

image

📲STEP 2

1. Open Jupyter Notebook

2. Import the important packages

3. Before importing , some packages must be installed in jupyter notebook

🌟pip install numpy
🌟pip install pandas
🌟pip install scikit-learn
🌟pip install matplotlib
🌟pip install seaborn

📲STEP 3- IMPORTING REQUIRED LIBRARIES

image

Pandas: Data analysis and manipulation library for working with structured data using Data Frame and Series.

NumPy: Numerical computing library supporting large, multi-dimensional arrays and matrices, with high-level mathematical functions.

Seaborn: Statistical data visualization library for creating attractive and informative graphics, based on Matplotlib.

Matplotlib: Comprehensive plotting library providing interface for creating various plots like line, scatter, bar, and histograms.

📲STEP 4-IMPORTING IMPORTANT LIBRARIES FOR PREDICTION

image

Train Test Split: Technique for splitting data into training and testing sets to assess model performance.

Logistic Regression: Method for predicting the probability of a binary

outcome using the logistic function.

Accuracy: Metric measuring the proportion of correctly classified instances in a classification model.

Sklearn: Python's Scikit-learn, a powerful machine learning library providing tools for data analysis and model building.

📲STEP 5-LOADING THE DATASET

image

📲STEP 6-CHECKING FOR MISSING VALUES

image

📲STEP 7-CO RELATION MATRIX

image

image

📲STEP 8-TRAINING THE MODEL WITH THE HELP OF TRAIN TEST SPLIT

image

image

📲STEP 9-MAKING PREDICTION

image

📲STEP 10-CREATE FRONT_END USING DJANGO

image

image

💫Conclusion

In conclusion, this project demonstrates the feasibility of using machine learning techniques to predict diabetes in female individuals of Pima Indian heritage. The trained model can serve as a valuable tool for healthcare professionals in early diagnosis and intervention, ultimately improving patient outcomes.

About


Languages

Language:Python 94.9%Language:Jupyter Notebook 2.0%Language:HTML 1.1%Language:JavaScript 1.0%Language:CSS 0.7%Language:PowerShell 0.2%Language:Batchfile 0.0%