Using Multilevel Modeling in Institutional Research

Slides:



Advertisements
Similar presentations
Questions From Yesterday
Advertisements

Contextual effects In the previous sections we found that when regressing pupil attainment on pupil prior ability schools vary in both intercept and slope.
Multilevel modelling short course
Simple Linear Regression Analysis
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
By Zach Andersen Jon Durrant Jayson Talakai
Statistical Analysis Overview I Session 2 Peg Burchinal Frank Porter Graham Child Development Institute, University of North Carolina-Chapel Hill.
School of Veterinary Medicine and Science Multilevel modelling Chris Hudson.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.

Clustered or Multilevel Data
Correlation and Regression Analysis
Foster Care Reunification: The use of hierarchical modeling to account for sibling and county correlation Emily Putnam-Hornstein, MSW Center for Social.
Basic Analysis of Variance and the General Linear Model Psy 420 Andrew Ainsworth.
Analysis of Clustered and Longitudinal Data
A simulation study of the effect of sample size and level of interpenetration on inference from cross-classified multilevel logistic regression models.
Introduction to Multilevel Modeling Using SPSS
Inference for regression - Simple linear regression
Chapter 11 Simple Regression
Linear Regression Inference
Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.
Sampling and Nested Data in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
Hierarchical Linear Modeling (HLM): A Conceptual Introduction Jessaca Spybrook Educational Leadership, Research, and Technology.
Introduction Multilevel Analysis
Funded through the ESRC’s Researcher Development Initiative Prof. Herb MarshMs. Alison O’MaraDr. Lars-Erik Malmberg Department of Education, University.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Data Analysis – Statistical Issues Bernd Genser, PhD Instituto de Saúde Coletiva, Universidade Federal da Bahia, Salvador
Introduction to Multilevel Modeling Stephen R. Porter Associate Professor Dept. of Educational Leadership and Policy Studies Iowa State University Lagomarcino.
Part IV Significantly Different: Using Inferential Statistics
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Chapter 16 Data Analysis: Testing for Associations.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Data Analysis in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine October.
FIXED AND RANDOM EFFECTS IN HLM. Fixed effects produce constant impact on DV. Random effects produce variable impact on DV. F IXED VS RANDOM EFFECTS.
Sampling and Nested Data in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
Funded through the ESRC’s Researcher Development Initiative Department of Education, University of Oxford Session 2.1 – Revision of Day 1.
Introduction to Multilevel Analysis Presented by Vijay Pillai.
An Introduction to Latent Curve Models
Multiple Regression.
Multilevel modelling: general ideas and uses
The simple linear regression model and parameter estimation
Introduction to Multilevel Modeling Using HLM 6
EXCEL: Multiple Regression
Ling Ning & Mayte Frias Senior Research Associates Neil Huefner
RDI Meta-analysis workshop - Marsh, O'Mara, & Malmberg
Stephen W. Raudenbush University of Chicago December 11, 2006
Virtual COMSATS Inferential Statistics Lecture-26
Chapter 11: Simple Linear Regression
Linear Mixed Models in JMP Pro
Analysis of Variance in Matrix form
Eastern Michigan University
An introduction to basic multilevel modeling
HLM with Educational Large-Scale Assessment Data: Restrictions on Inferences due to Limited Sample Sizes Sabine Meinck International Association.
SAMPLING (Zikmund, Chapter 12.
Chapter 11 Simple Regression
Lecture 4 - Model Selection
Generalized Linear Models
Multivariate Analysis Lec 4
S519: Evaluation of Information Systems
Analyzing Intervention Studies
CHAPTER 29: Multiple Regression*
Multiple Regression.
ELEMENTS OF HIERARCHICAL REGRESSION LINEAR MODELS
Simple Linear Regression
Simple Linear Regression
Longitudinal Data & Mixed Effects Models
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
MGS 3100 Business Analysis Regression Feb 18, 2016
Inferences 10-3.
Presentation transcript:

Using Multilevel Modeling in Institutional Research Ling Ning & Mayte Frias Senior Research Associate Neil Huefner Associate Director Timo Rico Executive Director Research interests: - Developing effect size measures to quantify multilevel interventions - Explore robustness of MLM to assumption violations - Methods to evaluate measurement invariance, and the impact of non-invariance on statistical modeling

Multilevel Data Structure In institutional research, the data structure in the population is usually hierarchical, with students nested within instructors, majors, departments, colleges and campus divisions. Sampling is conducted using multi-stage cluster sampling, rather than simple random sampling. The above two factors give rise to multilevel data, in which lower level units are nested within higher level units

Overview Motivations for using Multilevel Modeling (MLM) in Institutional Research Consequences of not using MLM with clustered data What is Multilevel Modeling (MLM)? Random intercept, random slope and cross-level model Computation software

How can the use of Multilevel Modeling (MLM) positively contribute to Institutional Research? Allows one to calculate standard error more accurately Offers insight into how effects vary at higher levels within the structure Permits the inclusion of predictors at different levels within the model

Motivations for MLM | Standard Error Adjustment Conventional methods assume independent observations Knowing the information of one student tells nothing about another student’s information Often not realistic in institutional research Students in the same majors/student support programs  same context  more similar Characteristics of one student  information for the major  information of another student in the same major

Intraclass Correlation (ICC; ρ) Motivations for MLM | Standard Error Adjustment Intraclass Correlation (ICC; ρ) One way to quantify the degree of clustering (i.e., common effect associated with majors ) = variance associated with majors = student variance Roughly speaking, correlation between students in the same major Let’s use the previous example, divide the variance into two parts

Consequences of Clustering Motivations for MLM | Standard Error Adjustment Consequences of Clustering Clustering  Overlapping information Clustering  Information  SE Ignore clustering    Inflated Type I error (α)  Spurious results Inflated information Overall area less than two unique circles, Animation: inflated information, enlarge SE too small

Example: Effect of a Tutoring Program on retention Motivations for MLM | Standard Error Adjustment Example: Effect of a Tutoring Program on retention 1000 students nested within 63 majors 315 Participants , 685 Non-participants Model: Participation  Retention  The data set I used for the Catholic example

Value of the Intraclass Correlation Motivations for MLM | Standard Error Adjustment The Inflation of the alpha level of 0.05 in the presence of intra-class correlation (ICC) Average n per group Value of the Intraclass Correlation (ICC) 0.00 0.01 0.05 0.2 10 0.06 0.11 0.28 25 0.08 0.19 0.46 50 0.30 0.59 100 0.17 0.43 0.70 The data set Which I will give more info later, Mention students nested within schools Source: Based on Barcikowski, R.S.(1981). Statistical power with group mean as the unit of analysis. Journal of Educational Statistics, 6, 267-285.

Level1: log(Pij /(1-Pij ))= β0j (j = 1, 2, … , 63) What is MLM | Unconditional Random Intercept Model (RIM) Level1: log(Pij /(1-Pij ))= β0j (j = 1, 2, … , 63) Level 2: 0j =γ00 +U0j  Parameter Unconditional RIM   Fixed effects Parameter estimate SE Intercept (γ00) 2.91 1.26 Tutoring (β1) - STEM (γ01) HS GPA (β2) Tutoring*STEM(γ11) Variance estimates Level-one variance (σ2) 3.29 Intercept variance (τ00) 0.44 0.34 Slope variance (τ11) Model fit measures Deviance 353.62 AIC 357.62 BIC 362.83 

What is MLM | Two-level Random Intercept Model (RIM) Level1: log(Pij /(1-Pij ))= β0j + β1Tutoringij +β2HS GPAij (j = 1, 2, … , 63) Level 2: 0j =γ00 + γ01 STEM0j+U0j  Parameter Unconditional RIM (M1)   Conditional RIM (M2) Fixed effects Parameter estimate SE Intercept (γ00) 2.91 1.26 3.12 1.41 Tutoring (β1) - 1.86* 0.60 STEM (γ01) -0.81* 0.36 HS GPA (β2) 3.35 3.84 Tutoring*STEM(γ11) Variance estimates Level-one variance (σ2) 3.29 Intercept variance (τ00) 0.44 0.34 0.32 0.68 Slope variance (τ11) Model fit measures Deviance 353.62 331.86 AIC 357.62 341.86 BIC 362.83  354.88 

What is MLM | Random Intercept Model Level1: log(Pij /(1-Pij ))= β0j + β1Tutoringij +β2HS GPAij (j = 1, 2, … , 63) Level 2: 0j =γ00 + γ01 STEM0j+U0j logit (Retention) Major 1 … Major 5 Average Tutoring Effect … Major 63 0j : γ00 : Tutoring

Random intercept model What is MLM | Random Slope Model Level1: log(Pij /(1-Pij ))= β0j + β1jTutoringij +β2HS GPAij (j = 1, 2, … , 63) Level 2: 0j =γ00 + γ01 STEM0j+U0j 1j =γ10 + U1j  Parameter Unconditional model (M1)   Random intercept model (M2) Random slope model (M3) Fixed effects Parameter estimate SE Intercept (γ00) 2.91 1.26 3.12 1.41 3.22 1.52 Tutoring(γ10) - 1.86* 0.60 2.75* 0.10 STEM (γ01) -0.81* 0.36 -1.11* 0.41 HS GPA(β2) 3.35 3.84 3.78 4.12 Tutoring*STEM(γ11) Variance estimates Level-one variance (σ2) 3.29 Intercept variance (τ00) 0.44 0.34 0.32 0.68 0.13 0.48 Slope variance (τ11) 1.33 1.05 Model fit measures Deviance 353.62 331.86 319.56 AIC 357.62 341.86 331.56 BIC 362.83  354.88  347.19

What is MLM | Random Slope Model Level1: log(Pij /(1-Pij ))= β0j + β1jTutoringij +β2HS GPAij (j = 1, 2, … , 63) Level 2: 0j =γ00 + γ01 STEM0j+U0j 1j =γ10 + U1j Major 1 logit (Retention) … Major 5 Average Tutoring Effect …… Major 63 0j : 1j γ00 : Tutoring

Unconditional model (M1) What is MLM | Cross-level Model Level1: log(Pij /(1-Pij ))= β0j + β1jTutoringij +β2HS GPAij (j = 1, 2, … , 63) Level 2: 0j =γ00 + γ01 STEM0j+U0j 1j =γ10 + γ11 STEM0j+U1j Parameter Unconditional model (M1)   Random intercept Model (M2) Random slope model (M3) Cross-level Model (M4) Fixed effects estimate SE Intercept (γ00) 2.91 1.26 3.12 1.41 3.22 1.52 3.68 1.54 Tutoring(γ10) - 1.86* 0.60 2.75** 0.10 2.29* 0.30 STEM (γ01) -0.81* 0.36 -1.11** 0.41 -1.14* HS GPA (γ02) 3.35 3.84 3.78 4.12 3.82 4.13 Tutoring*STEM (γ11) 5.39** 0.14 Variance estimates Level-one variance (σ2) 3.29 Intercept variance (τ00) 0.44 0.34 0.32 0.68 0.26 0.48 0.24 Slope variance (τ11) 1.33 1.05 1.61 2.13 Model fit measures Deviance 353.62 331.86 319.56 311.64 AIC 357.62 341.86 331.56 323.64 BIC 362.83  354.88  347.19 340.88

What is MLM | Retention in each major 3.01 2.85 Major 2 3.13 2.91 Major 3 2.51 2.12 Major 4 3.74 1.16 … Major 60 3.49 2.32 Major 61 3.38 2.58 Major 62 4.06 2.73 Major 63 2.89 2.47 Mean 3.22 2.75 Variance 0.48 1.33 63 Majors RandomSlope Average intercept, average slope, overall average model, gamma00, gamma10 Random Intercept

MLM | Computing Software Specialized programs for fitting multilevel models HLM MLWin General-purpose statistical software SAS R Stata Mplus

MLM | Motivations Recap. Learning about effects that vary by higher level cluster e.g., A student support program that is more effective in some majors than others Using all the data to perform inferences for groups with small sample size e.g., A program director wants to know how effective her program is for majors with very small number of students. Prediction e.g., To predict a new student’s outcome

MLM | Motivations Recap. 4. Analysis of data from cluster sampling Many national surveys [e.g., PISA(the Program for International Student Assessment)] used multi-stage probability sample design. 5. Including predictors at different levels e.g., A student support program that is more effective in some majors than others due to some characteristics associated with the major such as STEM or Non-STEM. 6. Getting the right standard error The estimated standard errors of regression coefficients might be wrong when we use multiple regression to analyze multilevel data.

MLM | Good References. Hoox, Joop (2010).Multilevel Analysis, Techniques and Applications, Routledge  http://joophox.net/mlbook2/MLbook.htm Gelman, A., and Hill, J. (2006) Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge.  http://www.stat.columbia.edu/~gelman/arm/ Singer, J. (2003) Applied Longitudinal Data Analysis: Modeling Change and Event Occurance. Oxford  http://gseacademic.harvard.edu/alda/ Snijders, T., and Bosker, R. (2012) Multilevel Analysis: An Introduction to Basic and Advanced Multilevel Modeling, 2nd Ed. Sage. http://www.stats.ox.ac.uk/~snijders/

MLM | Questions http://csaa.ucdavis.edu/contact.html Contact us