mccurcio / Logistic_with_R

Using Framingham Heart Disease data I will use Logisitic Regression to determine Probability of Heart Disease and its factors.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Logistic_with_R

To view report: An Investigation Of The Framingham Heart Study

  1. Purpose: This R markdown document displays my understanding of logistic regression and R. This report is part one of two articles describing Logit.

    A. Logisitc Regression Report B. Under construction

  2. Data: Framingham Heart Disease Study, FHS_data

  3. Conclusion: We find seven (7) factors and their related odds leading to cardiovascular disease.

No. Factors Approximate Odds Over Mean
1 Prevalence Of Stroke In Family History.       240%
2 Male Vs Female       150%
3 Prevalence Of Hypertension In Family History       130%
4 Age < 2,800%
5 Cigarettes Per Day <    210%
6 Systolic Blood Pressure <    780%
7 Glucose Levels <    250%

NOTES: I will use Logistic Regression to determine the Probability of Heart Disease and its factors.

Assumptions of Logistic Regression

  1. Logistic Regression does not require a linear relationship between the independent and dependent variables.

  2. The residuals from the model do not need to follow the normal distribution.

  3. Logistic Regression does not require the assumption of homoscedasticity. Homoscedasticity means all the variables in the model have the same variance. So, the variables may have different variances in the Logistic Regression model.

  4. The dependent variable in Logistic Regression is not measured on an interval or ratio scale.

  5. Logistic Regression algorithm requires little or no multicollinearity among the independent variables. It means that the independent variables should not be too highly correlated with each other.

  6. Logistic Regression model assumes linearity of independent variables and log odds.

  7. The success of Logistic Regression model depends on the sample sizes. Typically, it requires a large sample size to achieve the high accuracy.

About

Using Framingham Heart Disease data I will use Logisitic Regression to determine Probability of Heart Disease and its factors.

License:Creative Commons Zero v1.0 Universal


Languages

Language:HTML 100.0%