Applied Epidemiologic Analysis - P8400 Fall 2002

Slides:



Advertisements
Similar presentations
Questions From Yesterday
Advertisements

Qualitative predictor variables
Research Support Center Chongming Yang
Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.
© McGraw-Hill Higher Education. All Rights Reserved. Chapter 2F Statistical Tools in Evaluation.
Collinearity. Symptoms of collinearity Collinearity between independent variables – High r 2 High vif of variables in model Variables significant in simple.
Creating Graphs on Saturn GOPTIONS DEVICE = png HTITLE=2 HTEXT=1.5 GSFMODE = replace; PROC REG DATA=agebp; MODEL sbp = age; PLOT sbp*age; RUN; This will.
Confidence Intervals Underlying model: Unknown parameter We know how to calculate point estimates E.g. regression analysis But different data would change.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Linear Regression.
Reading – Linear Regression Le (Chapter 8 through 8.1.6) C &S (Chapter 5:F,G,H)
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
Treatment Effects: What works for Whom? Spyros Konstantopoulos Michigan State University.
Understanding Multivariate Research Berry & Sanders.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
Growth Curve Models Using Multilevel Modeling with SPSS David A. Kenny January 23, 2014.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 10 Missing Data Henian Chen, M.D., Ph.D.
Applied Epidemiologic Analysis - P8400 Fall 2002
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 9 Survival Analysis Henian Chen, M.D., Ph.D.
Copyright © 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Dummy Variable Regression Models chapter ten.
Education 793 Class Notes Multiple Regression 19 November 2003.
1 Experimental Statistics - week 14 Multiple Regression – miscellaneous topics.
Practice You collect data from 53 females and find the correlation between candy and depression is Determine if this value is significantly different.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
With the growth of internet service providers, a researcher decides to examine whether there is a correlation between cost of internet service per.
9.1 Chapter 9: Dummy Variables A Dummy Variable: is a variable that can take on only 2 possible values: yes, no up, down male, female union member, non-union.
FIXED AND RANDOM EFFECTS IN HLM. Fixed effects produce constant impact on DV. Random effects produce variable impact on DV. F IXED VS RANDOM EFFECTS.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
1 Modeling change Kristin Sainani Ph.D. Stanford University Department of Health Research and Policy
Multiple Regression David A. Kenny January 12, 2014.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 3 Type I, II Error, Sample Size, and Power Henian Chen, M.D., Ph.D.
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
1 Linear Regression Model. 2 Types of Regression Models.
Module II Lecture 1: Multiple Regression
Regression Models First-order with Two Independent Variables
Learning Objectives For two quantitative IVs, you will learn:
Interactions Interaction: Does the relationship between two variables depend on a third variable? Does the relationship of age to BP depend on gender Does.
Chapter 2. Two-Variable Regression Analysis: Some Basic Ideas
Multiple Regression Analysis and Model Building
General principles in building a predictive model
Basics of Group Analysis
Multiple Regression.
Multiple Regression Example
Regression Statistics
Correlation A bit about Pearson’s r.
Least Squares ANOVA & ANCOV
Two Way ANOVAs Factorial Designs.
Multiple Regression – Part II
Multiple logistic regression
تصنيف التفاعلات الكيميائية
Mediation MODERATION THREE WAYS OF DOING ANALYSIS Kun
Chapter 2: Steps of Econometric Analysis
Our theory states Y=f(X) Regression is used to test theory.
Simple Linear Regression
Correlation and Regression
Chapter 8: DUMMY VARIABLE (D.V.) REGRESSION MODELS
24/02/11 Tutorial 2 Inferential Statistics, Statistical Modelling & Survey Methods (BS2506) Pairach Piboonrungroj (Champ)
Correlation and Covariance
Chapter 2: Steps of Econometric Analysis
Financial Econometrics Fin. 505
STA 291 Spring 2008 Lecture 23 Dustin Lueker.
Introduction to Regression
Graziano and Raulin Research Methods: Chapter 12
Presentation transcript:

Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 4 Intercept, Variable Centered & Interaction Henian Chen, M.D., Ph.D. Applied Epidemiologic Analysis - P8400 Fall 2002

Description of the Life Satisfaction Data   % N Mean Range LIFESAT 751 74 10 ~ 100 Age 22 17 ~ 28 Sex Female=0 50.2 377 Male=1 49.8 374 Income Low=0 57.0 428 High=1 43.0 323 Applied Epidemiologic Analysis - P8400 Fall 2002

Applied Epidemiologic Analysis - P8400 Fall 2002 Intercept Y intercept is the estimated Y when all Xi=0 Interaction The circumstance in which the impact of one variable on Y is conditional on (varies across) the values of another predictor. Applied Epidemiologic Analysis - P8400 Fall 2002

Applied Epidemiologic Analysis - P8400 Fall 2002 Centering Subtracting the sample mean on a variable X from each subject’s score on X. x = X - Mx Center X if X doesn’t have a meaningful zero. With centered variable x, the mean is zero. Thus the regression of Y on x at x=0 (intercept) becomes meaningful. Y = α + β1 age Y = α + β1 age + β2 X + β3 age*X β1: regression of Y on age at X=0 β2: regression of Y on X at age=0 Applied Epidemiologic Analysis - P8400 Fall 2002

Applied Epidemiologic Analysis - P8400 Fall 2002 SAS Program proc import datafile='a:life-satisfaction751.txt' out=lifesat dbms=tab replace; getnames=yes; run; data lifesat1; set lifesat; agec=age-22; age17=age-17; age28=age-28; proc reg data=lifesat1; model lifesat= ; /* model 1 */ model lifesat=sex; /* model 2 */ model lifesat=income; /* model 3 */ model lifesat=age; /* model 4 */ model lifesat=age17; /* model 5 */ model lifesat=age28; /* model 6 */ model lifesat=agec; /* model 7 */ model lifesat=sex income /* model 8 */ model lifesat=sex income sex_inco; /* model 9 with interaction */ run;  Applied Epidemiologic Analysis - P8400 Fall 2002

Dependent Variable: LIFESAT Model 2 proc reg data=lifesat; model lifesat=sex; run; Dependent Variable: LIFESAT Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > |t| Intercept 1 74.98674 0.77556 96.69 <.0001 SEX 1 -1.94529 1.09901 -1.77 0.0771 74.98674 is the average life satisfaction score for Females. –1.94529 is the difference on life satisfaction score between male and female. Male is 1.94529 less than Female. Male’s average score=74.98674 – 1.94529 = 73.04145. Applied Epidemiologic Analysis - P8400 Fall 2002

Dependent Variable: LIFESAT Model 4 proc reg data=lifesat; model lifesat=age; run; Dependent Variable: LIFESAT Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > |t| Intercept 1 80.63050 4.49283 17.95 <.0001 AGE 1 -0.29990 0.20223 -1.48 0.1385 80.63050 is the average life satisfaction score for subjects at age=0. This intercept does not make sense for us because we do not have a age=0 in our data. –0.29990 means, on the average, each additional year from 17 to 28 is associated with a decrease in life satisfaction score of 0.29990. Applied Epidemiologic Analysis - P8400 Fall 2002

Dependent Variable: LIFESAT Model 5 proc reg data=lifesat; model lifesat=age17; /* age17 = age - 17 */ run; Dependent Variable: LIFESAT Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > |t| Intercept 1 75.53220 1.15972 65.13 <.0001 AGE17 1 -0.29990 0.20223 -1.48 0.1385 75.53220 is the average life satisfaction score for subjects at age=17. The regression coefficient is the same (-0.29990). Only the regression intercept has changed. Applied Epidemiologic Analysis - P8400 Fall 2002

Applied Epidemiologic Analysis - P8400 Fall 2002 Two-way Interaction Lifesat = α + β1 sex + β2 income + β3 sex*income   For female (0) and low Income (0): Life Satisfaction score = α + β1*0 + β2*0 + β3*0*0 = α For female (0) and high Income (1): Life Satisfaction score = α + β1*0 + β2*1 + β3*0*1   = α + β2 Male (1) and low Income (0): Life Satisfaction score = α + β1*1 + β2*0 + β3*1*0   = α + β1 Male (1) and high Income (1): Life Satisfaction score = α + β1*1 + β2*1 + β3*1*1 = α + β1 + β2 + β3 Applied Epidemiologic Analysis - P8400 Fall 2002

Three-way Interaction Three independent variables: A, B, C Y = α + β1A + β2B + β3C + β4AB + β5AC + β6BC + β7ABC All lower order terms must be included in the regression model for the β7 coefficient to represent the effect of the three-way interaction on Y. To test d-way interaction, the model must be included: all main effect variables all two-way interaction all three-way interaction all (d-1)-way interaction even though some of them are not significant Applied Epidemiologic Analysis - P8400 Fall 2002