Multilevel and Random Coefficients Models School of Nursing “Multi-Level Models: What Are They and How Do They Work? ” Melinda K. Higgins, Ph.D. 30 March.

Slides:



Advertisements
Similar presentations
Questions From Yesterday
Advertisements

Multilevel modelling short course
{ Multilevel Modeling using Stata Andrew Hicks CCPR Statistics and Methods Core Workshop based on the book: Multilevel and Longitudinal Modeling Using.
By Zach Andersen Jon Durrant Jayson Talakai
Statistical Analysis Overview I Session 2 Peg Burchinal Frank Porter Graham Child Development Institute, University of North Carolina-Chapel Hill.
Nested Example Using SPSS David A. Kenny January 8, 2014.
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
© 2003 Prentice-Hall, Inc.Chap 14-1 Basic Business Statistics (9 th Edition) Chapter 14 Introduction to Multiple Regression.
Clustered or Multilevel Data
Dr. Mario MazzocchiResearch Methods & Data Analysis1 Correlation and regression analysis Week 8 Research Methods & Data Analysis.
Longitudinal Data Analysis: Why and How to Do it With Multi-Level Modeling (MLM)? Oi-man Kwok Texas A & M University.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Correlation and Regression Analysis
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Lorelei Howard and Nick Wright MfD 2008
Introduction to Regression Analysis, Chapter 13,
Linear Regression/Correlation
Unit 3b: From Fixed to Random Intercepts © Andrew Ho, Harvard Graduate School of EducationUnit 3b – Slide 1
Correlation and Regression
Leedy and Ormrod Ch. 11 Gray Ch. 14
Issues in Experimental Design Reliability and ‘Error’
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Categorical Data Analysis School of Nursing “Categorical Data Analysis 2x2 Chi-Square Tests and Beyond (Multiple Categorical Variable Models)” Melinda.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
Sampling and Nested Data in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
G Lecture 5 Example fixed Repeated measures as clustered data
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Introduction Multilevel Analysis
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Multilevel Linear Models Field, Chapter 19. Why use multilevel models? Meeting the assumptions of the linear model – Homogeneity of regression coefficients.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Examining Relationships in Quantitative Research
Linear correlation and linear regression + summary of tests
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Chapter 16 Data Analysis: Testing for Associations.
The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Conducting post-hoc tests of compound coefficients using simple slopes for a categorical.
2nd meeting: Multilevel modeling: intra class correlation Subjects for today:  Multilevel data base construction  The difference between single level.
Analysis of Covariance Combines linear regression and ANOVA Can be used to compare g treatments, after controlling for quantitative factor believed to.
Examining Relationships in Quantitative Research
Introduction to Basic Statistical Tools for Research OCED 5443 Interpreting Research in OCED Dr. Ausburn OCED 5443 Interpreting Research in OCED Dr. Ausburn.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Introduction to testing statistical significance of interactions Jane E. Miller, PhD The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
FIXED AND RANDOM EFFECTS IN HLM. Fixed effects produce constant impact on DV. Random effects produce variable impact on DV. F IXED VS RANDOM EFFECTS.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
Sampling and Nested Data in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
G Lecture 71 Revisiting Hierarchical Mixed Models A General Version of the Model Variance/Covariances of Two Kinds of Random Effects Parameter Estimation.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Topics, Summer 2008 Day 1. Introduction Day 2. Samples and populations Day 3. Evaluating relationships Scatterplots and correlation Day 4. Regression and.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Methods of Presenting and Interpreting Information Class 9.
Stats Methods at IC Lecture 3: Regression.
Regression Analysis.
Melinda K. Higgins, Ph.D. 11 April 2008
12 Inferential Analysis.
Stats Club Marnie Brennan
CHAPTER 29: Multiple Regression*
Linear Regression/Correlation
LEARNING OUTCOMES After studying this chapter, you should be able to
12 Inferential Analysis.
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Multilevel and Random Coefficients Models School of Nursing “Multi-Level Models: What Are They and How Do They Work? ” Melinda K. Higgins, Ph.D. 30 March 2009

School of Nursing Multilevel and Random Coefficients Models Why are MLM’s important? Ordinary Least Squares (OLS) and all regression models under the General Linear Model (including logistic and poisson regression) assumes that all observations are independent of one another. Likewise, repeated measures (time1, time2, …) assume that all time points for a given individual are independent of one another (all pairs of time points), including serial dependency (“Durbin–Watson statistic”). When data are “clustered,” OLS may lead to inaccuracies in inference – standard errors may be too small – leads to overestimation of significance (“alpha-inflation”). “Clusters” may include individuals within: dyads; twins; families; counseling groups; households; neighborhoods; schools; nursing units; hospitals (and repeated measures over time within an individual – the individual becomes the “cluster”).

School of Nursing Multilevel and Random Coefficients Models An example Subjects: 386 women across 40 women’s groups with a focus on diet and weight control Group sizes range from 5-15 Independent variable (measured for each individual) is “motivation.” Outcome – weight loss The groups meet regularly and may have some level of cohesion – we may expect some correlation between the individuals with a given group. In fact, there is substantial “clustering” indicated by the fact that the groups differ substantially in average weight loss (9.75lbs-24.43lbs)

School of Nursing Multilevel and Random Coefficients Models Average Weight Loss – 40 Groups

School of Nursing Multilevel and Random Coefficients Models Weight Loss by Motivation (individual)

School of Nursing Multilevel and Random Coefficients Models Regression – Motivation(X) vs Wt Loss (Y) (no consideration of groups)

Regression – 40 separate groups

School of Nursing Multilevel and Random Coefficients Models Regression of Group Averages (avg motivation & avg weight loss) WARNING: “ECOLOGICAL FALLACY” – cannot make generalizations from the group down to the individual level. This approach only tells us the relationship between the group mean motivation and the group’s average weight loss – NOT about individuals performance.

School of Nursing Multilevel and Random Coefficients Models Regression – control for groups 39 (g-1) dummy variables

School of Nursing Multilevel and Random Coefficients Models ANCOVA – same as (g-1) dummy vars

School of Nursing Multilevel and Random Coefficients Models How to estimate the “degree” of clustering? Intraclass Correlation (ICC) The ICC measures the proportion of the total variance of an outcome variable that is accounted for by the clustering of the cases. OLS regression (and GLM) assumes that there is no amount of variance accounted for by these “clusters”/groups; ICC=0. ICC varies from 0 to 1 (normally). [NOTE: ICC can be negative, but it is rare – this would mean that individuals within a given “cluster” were more different from one another within the group than with individuals outside the group.] The ICC may be initially estimated from a one-way (“Fixed Effects”) ANOVA on the groups or “clusters.” Even for ICC’s as small as 0.01, for n=25 subjects per group, the actual alpha=0.11 (which is more than 2 times the nominal alpha=0.05). “Alpha-inflation” increases as ICC and number of subjects per group increases.

School of Nursing Multilevel and Random Coefficients Models Intraclass Correlation Where M n mean number of cases per group; sd 2 (n j ) is the variance of the number of cases per group; g=number of groups Estimated ICC=0.22 indicates substantial clustering ** [for ICC=0.20, n=10, actual alpha=0.28 not 0.05 (Kreft, deLeeuw 1998)] One-Way Fixed Effects

School of Nursing Multilevel and Random Coefficients Models “Fixed vs. Random Coefficients” OLS regression – only the outcomes and errors/residuals are considered to be random (i.e. samples from a larger population) Random Fixed “Random Coefficient” regression – now the intercepts and slopes are also considered to be RANDOM (i.e. samples from a larger population)

School of Nursing Multilevel and Random Coefficients Models “Multi-Level / Hierarchical Linear Models” Level 1: Individual Level; “micro-level” Level 2: Group Level; “macro-level” Combine Level 1 and Level 2 & “Additional New Parameters” – Variance Components – help to capture the impact of the group structure on the relationship of the predictors on the dependent variable. i=individualj=group

School of Nursing Multilevel and Random Coefficients Models Unconditional Cell Means Model MIXED pounds BY group /CRITERIA=CIN(95) MXITER(100) MXSTEP(5) SCORING(1) SINGULAR( ) HCONVERGE(0, ABSOLUTE) LCONVERGE(0, ABSOLUTE) PCONVERGE( , ABSOLUTE) /FIXED=| SSTYPE(3) /METHOD=REML /PRINT=CPS CORB COVB DESCRIPTIVES G LMATRIX R SOLUTION TESTCOV /RANDOM=INTERCEPT | SUBJECT(group) COVTYPE(UN) /EMMEANS=TABLES(OVERALL). Calculate ICC from the RC model using only the group membership (no level 1 predictor) – equivalent to a “One-way Random Effects ANOVA” [for ICC=0.20, n=10, actual alpha=0.28 not 0.05 (Kreft, deLeeuw 1998)] =

School of Nursing Multilevel and Random Coefficients Models Unconditional Cell Means Model = is significant (pval<0.001) – units of within class variance in pounds lost that might be significantly accounted for by motivation predictors. = is also significant (pval=0.002) – units of between class variance in pounds lost (reflects differences between groups in mean pounds lost); and since this is significantly > 0, indicates that there is random variation among the intercepts of the various groups – WE SHOULD NOT IGNORE CLUSTERING. =

School of Nursing Multilevel and Random Coefficients Models Random Coefficient Model MIXED pounds BY group WITH motivatc /CRITERIA=CIN(95) MXITER(100) MXSTEP(5) SCORING(1) SINGULAR( ) HCONVERGE(0, ABSOLUTE) LCONVERGE(0, ABSOLUTE) PCONVERGE( , ABSOLUTE) /FIXED=motivatc | SSTYPE(3) /METHOD=REML /PRINT=CPS CORB COVB DESCRIPTIVES G LMATRIX R SOLUTION TESTCOV /RANDOM=INTERCEPT motivatc | SUBJECT(group) COVTYPE(UN) /EMMEANS=TABLES(OVERALL). Level 2 Level 1 Mixed Level 1 & 2

School of Nursing Multilevel and Random Coefficients Models Consider the Fixed Effects Portion of the Model Weight Loss (Y ij ) = lbs units_motivation/lbs *(motivation_ctr (x ij )) Both the intercept and slope are significant (both pval<0.001). So, for every unit increase in motivation, more lbs are lost; with an average weight loss per group of lbs. Fixed Random

School of Nursing Multilevel and Random Coefficients Models Consider the Random Effect Portion of the Model variance of the slopes across groups is significant (again do not ignore clustering) the covariance between slopes and intercepts was positive indicating the higher the average pounds lost in a group (intercept), the stronger the positive relationship of motivation on weight loss; but this was not significant. the addition of motivation as a predictor reduced the within group variance in pounds lost by 63% (from to 5.933) the addition of motivation as a predictor also reduced the between group differences in avg. pounds lost (intercepts) by 51% (4.906 to 2.397) SO, overall there were fluctuations between the groups in terms of mean pounds lost – these fluctuations were systematically related to the mean motivation level of individuals within these groups. Cannot test this via ANCOVA nor (g-1) dummy variables in regression.

School of Nursing Multilevel and Random Coefficients Models MLM with Predictor at Level 2 (group variable) “treatment” = diet program vs control MIXED pounds BY group WITH motivatc treatc /CRITERIA=CIN(95) MXITER(100) MXSTEP(5) SCORING(1) SINGULAR( ) HCONVERGE(0, ABSOLUTE) LCONVERGE(0, ABSOLUTE) PCONVERGE( , ABSOLUTE) /FIXED=motivatc treatc motivatc*treatc | SSTYPE(3) /METHOD=REML /PRINT=CPS CORB COVB DESCRIPTIVES G LMATRIX R SOLUTION TESTCOV /RANDOM=INTERCEPT motivatc | SUBJECT(group) COVTYPE(UN) /EMMEANS=TABLES(OVERALL). Fixed Random

School of Nursing Multilevel and Random Coefficients Models Repeated Measures Investigate the following hypothetical data set. YA is the outcome measures over 5 time points for 60 individuals

School of Nursing Multilevel and Random Coefficients Models MIXED YA BY ID /CRITERIA=CIN(95) MXITER(100) MXSTEP(5) SCORING(1) SINGULAR( ) HCONVERGE(0, ABSOLUTE) LCONVERGE(0, ABSOLUTE) PCONVERGE( , ABSOLUTE) /FIXED=| SSTYPE(3) /METHOD=REML /PRINT=CPS CORB COVB DESCRIPTIVES G LMATRIX R SOLUTION TESTCOV /RANDOM=INTERCEPT | SUBJECT(ID) COVTYPE(UN) /EMMEANS=TABLES(OVERALL). YA-Model 1 (random intercept – cell means model) DO NOT IGNORE CLUSTERING =

School of Nursing Multilevel and Random Coefficients Models MIXED YA BY ID WITH TIME /CRITERIA=CIN(95) MXITER(100) MXSTEP(5) SCORING(1) SINGULAR( ) HCONVERGE(0, ABSOLUTE) LCONVERGE(0, ABSOLUTE) PCONVERGE( , ABSOLUTE) /FIXED=| SSTYPE(3) /METHOD=REML /PRINT=CPS CORB COVB DESCRIPTIVES G LMATRIX R SOLUTION TESTCOV /RANDOM=INTERCEPT TIME | SUBJECT(ID) COVTYPE(UN) /EMMEANS=TABLES(OVERALL). YA-Model 2 (random intercept and slope)

School of Nursing Multilevel and Random Coefficients Models MIXED YA BY ID WITH TIME /CRITERIA=CIN(95) MXITER(100) MXSTEP(5) SCORING(1) SINGULAR( ) HCONVERGE(0, ABSOLUTE) LCONVERGE(0, ABSOLUTE) PCONVERGE( , ABSOLUTE) /FIXED=TIME | SSTYPE(3) /METHOD=REML /PRINT=CPS CORB COVB DESCRIPTIVES G LMATRIX R SOLUTION TESTCOV /RANDOM=INTERCEPT TIME | SUBJECT(ID) COVTYPE(UN) /EMMEANS=TABLES(OVERALL). YA-Model 3 (random intercept and slope & fixed time) i=time; j=individual

School of Nursing Multilevel and Random Coefficients Models Notes on the MLM approach to RM-ANOVA for YA – Model 3 Model 3 added “time” as a fixed predictor to the model to determine whether there was a significant “linear” effect of time. There is a significant effect of adding time to the model ( residual error dropped from 3.02 down to 0.228), which is still significant. Adding time to the model also accounted for all the significant variance in the slopes of YA on time, e.g. differences between slopes were no longer significant (pval = 0.140). And the covariance (between slopes and intercepts) is also not significant. Individual mean differences (intercepts) (pval<0.001) remain a significant source of variance.

School of Nursing Multilevel and Random Coefficients Models Summary If you suspect group (or cluster) effects, i.e. subjects within your groups may exhibit some level of “cohesion” or “alikeness,” leading you to think they are more alike within groups than between groups, the ICC should be calculated and evaluated. If ICC is close to 0, run both OLS regression and a Random Coefficient (“mixed”) model and compare. Evaluate the “random variance components.” Try comparing RM-ANOVA with the “mixed” multi-level model approach where “time” is level 1 and the individual is level 2. Evaluate the impact of adding time to the model.

School of Nursing Multilevel and Random Coefficients Models References Cohen, Jacob; Cohen, Patricia; West, Stephen; Aiken, Leona. “Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences” 3 rd edition, Lawrence Erlbaum Associates Inc., [All data/examples in this lecture came from this book – mostly Chapter 14 (Randon Coefficient Regression and Multilevel Models) and Section 15.4 (Multilevel Regression of Individual Changes Over Time)] Raudenbush; Stephen W.; Bryk, Anthony S. “Hierarchical Linear Models: Applications and Data Analysis Methods” 2 nd edition, SAGE publications, Kreft, Ita; de Leeuw, Jan. “Introducing Multilevel Modeling,” SAGE publications, Tabachnick, Barbara G.; Fidell, Linda S. “Using Multivariate Statistics,” 5 th edition, Pearson Education Inc., [Chapter 15 focuses on Multilevel Linear Modeling.] *

School of Nursing Multilevel and Random Coefficients Models VIII. Statistical Resources and Contact Info SON S:\Shared\Statistics_MKHiggins\website2\index.htm [updates in process] Working to include tip sheets (for SPSS, SAS, and other software), lectures (PPTs and handouts), datasets, other resources and references Statistics At Nursing Website: [website being updated] And Blackboard Site (in development) for “Organization: Statistics at School of Nursing” Contact Dr. Melinda Higgins Office: / Mobile: