1 Introduction to Modeling Beyond the Basics (Chapter 7)


1 1 Introduction to Modeling Beyond the Basics (Chapter 7)

2 2 Content
Simple and multiple linear regression
Simple logistic regression
– The logistic function
– Estimation of parameters
– Interpretation of coefficients
Multiple logistic regression
– Interpretation of coefficients
– Coding of variables

3 3 How can we analyse these data? Table 1 Age and systolic blood pressure (SBP) among 33 adult women

4 4 Figure: SBP (mm Hg) against age (years). Adapted from Colton T. Statistics in Medicine. Boston: Little, Brown, 1974

5 5 Simple linear regression
Relation between 2 continuous variables (SBP and age)
Regression coefficient β1
– Measures association between y and x
– Amount by which y changes on average when x changes by one unit
– Least squares method
Figure: y against x, with the slope marked
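The least squares method named on this slide can be sketched in a few lines. This is a minimal illustration, not the analysis behind the slides: the function name `least_squares` and the five age/SBP pairs are hypothetical (Table 1 itself is not reproduced in the transcript).

```python
def least_squares(x, y):
    """Return (intercept, slope) of the least-squares line y = a + b*x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Slope = sum of cross-products / sum of squares of x (both about the means)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Hypothetical ages (years) and SBP (mm Hg); NOT the data of Table 1.
age = [30, 40, 50, 60, 70]
sbp = [120, 128, 135, 146, 151]

intercept, slope = least_squares(age, sbp)
print(f"SBP = {intercept:.1f} + {slope:.2f} x age")
```

With these made-up values the slope reads as the average change in SBP (mm Hg) per additional year of age, which is the interpretation of β1 given on the slide.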

6 6 Multiple linear regression
Relation between a continuous variable and a set of i continuous or categorical variables
Partial regression coefficients βi
– Amount by which y changes on average when xi changes by one unit and all the other xi remain constant
– Measures association between xi and y adjusted for all other xi
Example
– SBP versus age, weight, height, etc.

7 7 Multiple linear regression
Dependent            Independent variables
Predicted            Predictor variables
Response variable    Explanatory variables
Outcome variable     Covariables

8 8 Multivariate analysis
Model                  Outcome
Linear regression      continuous
Poisson regression     counts
Cox model              survival
Logistic regression    binomial
......
Choice of the tool according to study, objectives, and the variables
– Control of confounding
– Model building, prediction

9 9 Logistic regression
Models the relationship between a set of variables xi
– dichotomous (eat: yes/no)
– categorical (social class, ...)
– continuous (age, ...)
and
– a dichotomous variable Y
Dichotomous (binary) outcome: the most common situation in biology and epidemiology

10 10 How can we analyse these data? Table 2 Age and signs of Coronary Heart Disease (CHD), 33 women

11 11 How can we analyse these data?
Comparison of the mean age of diseased and non-diseased women
– Non-diseased: 38.6 years
– Diseased: 58.7 years (p<0.0001)
Linear regression?

12 12 Dot-plot: Data from Table 2

13 13 Figure: CHD (NO/YES) with the linear regression line Y = -0.527 + 0.20 × AGE

14 14 Table 3 - Prevalence (%) of signs of CHD according to age group

15 15 Dot-plot: Data from Table 3 (age groups: 20-29, 30-39, 40-49, 50-59, 60-69, 70-79, 80-89)

16 16 Dot-plot: Data from Table 3 (axes: Diseased % against Age (years); P and 1-P marked)

17 17 Dot-plot: Data from Table 3 (axes: Diseased % against Age (years))

18 18 The logistic function (2): logit of P(y|x)

19 19 The logistic function (1) Figure: probability of disease against x

20 20 The logistic function (2)
logit of P(y|x): ln[ P(y|x) / (1 - P(y|x)) ] = α + βx

21 21 The logistic function (3)
Advantages of the logit
– Simple transformation of P(y|x)
– Linear relationship with x
– Can be continuous (logit between -∞ and +∞)
– Known binomial distribution (P between 0 and 1)
– Directly related to the notion of odds of disease
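The properties listed above can be checked numerically. The sketch below uses the standard definitions of the logit and its inverse (the logistic function); the helper names `logit` and `inverse_logit` are illustrative choices, not taken from the slides.

```python
import math


def logit(p):
    """Log-odds of a probability p, with 0 < p < 1."""
    return math.log(p / (1 - p))


def inverse_logit(z):
    """Logistic function: maps any real z back to a probability in (0, 1)."""
    return 1 / (1 + math.exp(-z))


for p in (0.01, 0.25, 0.5, 0.75, 0.99):
    odds = p / (1 - p)
    z = logit(p)
    print(f"P = {p:.2f}  odds = {odds:.3f}  logit = {z:+.3f}  "
          f"back to P = {inverse_logit(z):.2f}")
```

Probabilities close to 0 or 1 map to large negative or positive logits, while P = 0.5 maps to a logit of 0 (odds of 1).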

22 22 Interpretation of β (1)

23 23 Practice 1. MI and hyperhomocysteinemia?
              Hyperhomocysteinemia
              no     yes    Total
control       62     21      83
case          42     41      83
Total        104     62     166

24 24 Practice 1
              Normal Homocysteine    High Homocysteine
MI (%)        40.38                  66.13
Odds          0.68                   1.95
Ln(Odds)      -0.39                  0.67

25 25 Normal HC → X = 0 → ln(Odds) = α + β × 0 → α = ln(Odds) → α = -0.39
High HC → X = 1 → ln(Odds) = α + β × 1 → β = ln(Odds) - α → β = 0.67 - (-0.39) = 1.06
OR = e^β = 2.88
SE(β) = 0.33
How can you interpret β/OR?
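The arithmetic of Practice 1 can be reproduced from the four cell counts of the 2x2 table. The sketch below assumes the standard error is the usual large-sample (Woolf) formula for a log odds ratio; the slides report SE = 0.33 without saying how it was obtained. Variable names are illustrative.

```python
import math

# Cell counts from the Practice 1 table (MI status by homocysteine level)
cases_normal, cases_high = 42, 41
controls_normal, controls_high = 62, 21

odds_normal = cases_normal / controls_normal  # odds of MI, normal HC (x = 0)
odds_high = cases_high / controls_high        # odds of MI, high HC (x = 1)

alpha = math.log(odds_normal)                 # ln(odds) when x = 0
beta = math.log(odds_high) - alpha            # change in ln(odds) from x = 0 to x = 1
odds_ratio = math.exp(beta)

# Assumed formula: SE of a log odds ratio = sqrt of the sum of reciprocal cell counts
se_beta = math.sqrt(1 / cases_normal + 1 / cases_high
                    + 1 / controls_normal + 1 / controls_high)

print(f"alpha = {alpha:.2f}, beta = {beta:.2f}, "
      f"OR = {odds_ratio:.2f}, SE(beta) = {se_beta:.2f}")
```

Rounded to two decimals, these match the values on the slide: α = -0.39, β = 1.06, OR = 2.88 and SE = 0.33.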

26 26 Interpretation of β (2)
β = increase in log-odds for a one unit increase in x
Test of the hypothesis that β = 0 (Wald test)
Interval testing (OR)
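As a rough illustration of the Wald test and the interval for the OR, the sketch below plugs in β = 1.06 and SE = 0.33 from Practice 1. It assumes the usual normal approximation (z = β / SE, 95% limits at ±1.96 SE on the log-odds scale); the slides do not show the formulas, so treat this as one common way to do it rather than the authors' exact computation.

```python
import math

beta = 1.06      # log odds ratio from Practice 1
se_beta = 0.33   # its standard error from Practice 1

# Wald test of H0: beta = 0, using the standard normal approximation
z = beta / se_beta
p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value

# 95% confidence interval: computed on the log-odds scale, then exponentiated
z_crit = 1.96
ci_low = math.exp(beta - z_crit * se_beta)
ci_high = math.exp(beta + z_crit * se_beta)

print(f"z = {z:.2f}, two-sided p = {p_value:.4f}")
print(f"OR = {math.exp(beta):.2f}, 95% CI {ci_low:.2f} to {ci_high:.2f}")
```

An interval for the OR that excludes 1 leads to the same conclusion as rejecting β = 0 at the 5% level.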

27 27 If you run Linear Regression …
Y = 0.404 + 0.257 × High HC
% MI in High HC = 66.13
% MI in Normal HC = 40.38
Diff = 25.7 % → β1
What is your interpretation of β1?
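To see what linear regression returns with a binary outcome and a single binary predictor, the Practice 1 table can be expanded into one record per person and fitted by ordinary least squares. This is a sketch under that assumption; the variable names are illustrative.

```python
# One record per person, rebuilt from the Practice 1 table
# x = 1 for high homocysteine, 0 for normal; y = 1 for MI (case), 0 for control
x = [0] * 104 + [1] * 62
y = [1] * 42 + [0] * 62 + [1] * 41 + [0] * 21

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# Ordinary least-squares slope and intercept
sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
sxx = sum((xi - mean_x) ** 2 for xi in x)
slope = sxy / sxx
intercept = mean_y - slope * mean_x

print(f"Y = {intercept:.3f} + {slope:.3f} x HighHC")
```

In this setting the intercept equals the proportion with MI in the normal group (42/104) and the slope equals the difference in proportions between the two groups, so β1 is an absolute difference in risk rather than an odds ratio.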

28 28 Example Age (<55 and 55+ years) and risk of developing coronary heart disease (CD)

29 29 Results of fitting Logistic Regression Model

30 30 Interpretation of β (1)

31 31 Multiple logistic regression
More than one independent variable
– Dichotomous, ordinal, nominal, continuous …
Interpretation of βi
– Increase in log-odds for a one unit increase in xi with all the other xi constant
– Measures association between xi and log-odds adjusted for all other xi

32 32 Multiple logistic regression
Effect modification
– Can be modelled by including interaction terms
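For two predictors, an interaction term can be written out as follows. The symbols x_1, x_2 and the coefficient β_3 are illustrative labels, not taken from the slides; this is the standard form of a product-term interaction.

```latex
\operatorname{logit} P(y \mid x_1, x_2)
  = \alpha + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_1 x_2

% Odds ratio for a one-unit increase in x_1, holding x_2 fixed:
\mathrm{OR}_{x_1 \mid x_2} = e^{\beta_1 + \beta_3 x_2}
```

When β_3 = 0 the odds ratio for x_1 is the same at every level of x_2 (no effect modification); otherwise it varies with x_2.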

33 33 Reference Hosmer DW, Lemeshow S. Applied logistic regression. Wiley & Sons, New York, 1989

