Treating Stimuli as a Random Factor in Social Psychology: A New and Comprehensive Solution to a Pervasive but Largely Ignored Problem Jacob Westfall University.

Slides:



Advertisements
Similar presentations
Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Advertisements

A Conceptual Introduction to Multilevel Models as Structural Equations
By Zach Andersen Jon Durrant Jayson Talakai
Statistical Analysis Overview I Session 2 Peg Burchinal Frank Porter Graham Child Development Institute, University of North Carolina-Chapel Hill.
Multilevel Modeling in Health Research April 11, 2008.
Lecture 8 (Ch14) Advanced Panel Data Method
Nested Example Using SPSS David A. Kenny January 8, 2014.
Optimal Experimental Design in Experiments With Samples of Stimuli Jacob Westfall University of Colorado Boulder David A. Kenny Charles M. Judd University.
Statistical power in experiments in which samples of participants respond to samples of stimuli Jake Westfall University of Colorado Boulder David A. Kenny.
Multilevel modeling in R Tom Dunn and Thom Baguley, Psychology, Nottingham Trent University
Symposium: Statistical Power and Optimal Design Principles for Improving the Efficiency of Psychological Research.
Mitigating Risk of Out-of-Specification Results During Stability Testing of Biopharmaceutical Products Jeff Gardner Principal Consultant 36 th Annual Midwest.
HLM – ESTIMATING MULTI-LEVEL MODELS Hierarchical Linear Modeling.
Direct replication of random stimulus materials Jake Westfall University of Texas at Austin Charles M. Judd David A. Kenny University of Colorado Boulder.
Equivalence margins to assess parallelism between 4PL curves
Overlooking Stimulus Variance Jake Westfall University of Colorado Boulder Charles M. Judd David A. Kenny University of Colorado BoulderUniversity of Connecticut.
Economics 20 - Prof. Anderson1 Fixed Effects Estimation When there is an observed fixed effect, an alternative to first differences is fixed effects estimation.
Longitudinal Data Analysis: Why and How to Do it With Multi-Level Modeling (MLM)? Oi-man Kwok Texas A & M University.
Foster Care Reunification: The use of hierarchical modeling to account for sibling and county correlation Emily Putnam-Hornstein, MSW Center for Social.
Lorelei Howard and Nick Wright MfD 2008
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
Relationships Among Variables
Analysis of Clustered and Longitudinal Data
Correlation and Regression
Introduction to Multilevel Modeling Using SPSS
Multilevel Modeling: Other Topics
Multilevel and Random Coefficients Models School of Nursing “Multi-Level Models: What Are They and How Do They Work? ” Melinda K. Higgins, Ph.D. 30 March.
Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.
MANAGEMENT AND ANALYSIS OF WILDLIFE BIOLOGY DATA Bret A. Collier 1 and T. Wayne Schwertner 2 1 Institute of Renewable Natural Resources, Texas A&M University,
Montecarlo Simulation LAB NOV ECON Montecarlo Simulations Monte Carlo simulation is a method of analysis based on artificially recreating.
Hierarchical Linear Modeling (HLM): A Conceptual Introduction Jessaca Spybrook Educational Leadership, Research, and Technology.
Growth Curve Models Using Multilevel Modeling with SPSS David A. Kenny January 23, 2014.
Multilevel Linear Models Field, Chapter 19. Why use multilevel models? Meeting the assumptions of the linear model – Homogeneity of regression coefficients.
Reliability & Agreement DeShon Internal Consistency Reliability Parallel forms reliability Parallel forms reliability Split-Half reliability Split-Half.
Multilevel Linear Modeling aka HLM. The Design We have data at two different levels In this case, 7,185 students (Level 1) Nested within 160 Schools (Level.
All Hands Meeting 2005 The Family of Reliability Coefficients Gregory G. Brown VASDHS/UCSD.
CHAPTER 12 Descriptive, Program Evaluation, and Advanced Methods.
Evaluating generalised calibration / Fay-Herriot model in CAPEX Tracy Jones, Angharad Walters, Ria Sanderson and Salah Merad (Office for National Statistics)
Statistical Models for the Analysis of Single-Case Intervention Data Introduction to:  Regression Models  Multilevel Models.
BUSI 6480 Lecture 8 Repeated Measures.
EDCI 696 Dr. D. Brown Presented by: Kim Bassa. Targeted Topics Analysis of dependent variables and different types of data Selecting the appropriate statistic.
Multilevel Modeling: Other Topics David A. Kenny January 7, 2014.
Mixed Effects Models Rebecca Atkins and Rachel Smith March 30, 2015.
Generalized linear MIXED models
Data Analysis in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine October.
Machine Design Under Uncertainty. Outline Uncertainty in mechanical components Why consider uncertainty Basics of uncertainty Uncertainty analysis for.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
FIXED AND RANDOM EFFECTS IN HLM. Fixed effects produce constant impact on DV. Random effects produce variable impact on DV. F IXED VS RANDOM EFFECTS.
Analysis of Experiments
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
Introduction to Multilevel Analysis Presented by Vijay Pillai.
G Lecture 71 Revisiting Hierarchical Mixed Models A General Version of the Model Variance/Covariances of Two Kinds of Random Effects Parameter Estimation.
Data Screening. What is it? Data screening is very important to make sure you’ve met all your assumptions, outliers, and error problems. Each type of.
Two Crossed Random Factors
Multilevel modelling: general ideas and uses
Analysis for Designs with Assignment of Both Clusters and Individuals
Let’s continue to do a Bayesian analysis
Linear Mixed Models in JMP Pro
Statistical Models for the Analysis of Single-Case Intervention Data
HLM with Educational Large-Scale Assessment Data: Restrictions on Inferences due to Limited Sample Sizes Sabine Meinck International Association.
How to handle missing data values
Repeated Measures Analysis Using Multilevel Modeling with SPSS
G Lecture 6 Multilevel Notation; Level 1 and Level 2 Equations
Statistical Models for the Analysis of Single-Case Intervention Data
Two Crossed Random Factors
Motion Models (cont) 2/16/2019.
Rick Hoyle Duke Dept. of Psychology & Neuroscience
Fixed, Random and Mixed effects
Statistical Models for the Analysis of Single-Case Intervention Data
Presentation transcript:

Treating Stimuli as a Random Factor in Social Psychology: A New and Comprehensive Solution to a Pervasive but Largely Ignored Problem Jacob Westfall University of Colorado Boulder Charles M. Judd David A. Kenny University of Colorado BoulderUniversity of Connecticut

What to do about replicability? Mandatory reporting of all DVs, studies, etc.? Journals or journal sections devoted to straight replication attempts? Pre-registration of studies? Many of the proposed solutions involve large-scale institutional changes, restructuring incentives, etc. These are good ideas worthy of discussing, but surely not quick or easy to implement

One way to increase replicability: Treat stimuli as random Failure to account for uncertainty associated with stimulus sampling (i.e., treating stimuli as fixed rather than random) leads to biased, overconfident estimates of effects (Clark, 1973; Coleman, 1964) The pervasive failure to model stimulus as a random factor is probably responsible for many failures to replicate when future studies use different stimulus samples

Doing the correct analysis is easy! Recently developed statistical methods solve the statistical problem of stimulus sampling These mixed models with crossed random effects are easy to apply and are already widely available in major statistical packages (R, SAS, SPSS, Stata, etc.)

Outline of rest of talk 1.The problem – Illustrative design and typical RM-ANOVA analyses – Estimated type 1 error rates 2.The solution – Introducing mixed models with crossed random effects for participants and stimuli – Applications of mixed model analyses to actual datasets

Illustrative Design Participants crossed with Stimuli – Each Participant responds to each Stimulus Stimuli nested under Condition – Each Stimulus always in either Condition A or Condition B Participants crossed with Condition – Participants make responses under both Conditions Sample of hypothetical dataset:

Typical repeated measures analyses (RM-ANOVA) M Black M White Difference How variable are the stimulus ratings around each of the participant means? The variance is lost due to the aggregation “By-participant analysis”

Typical repeated measures analyses (RM-ANOVA) Sample 1 v.s. Sample 2 “By-stimulus analysis”

Simulation of type 1 error rates for typical RM-ANOVA analyses Design is the same as previously discussed Draw random samples of participants and stimuli – Variance components = 4, Error variance = 16 Number of participants ∈ {10, 30, 50, 70, 90} Number of stimuli ∈ {10, 30, 50, 70, 90} Conducted both by-participant and by-stimulus analysis on each simulated dataset True Condition effect = 0

Type 1 error rate simulation results The exact simulated error rates depend on the variance components, which although realistic, were ultimately arbitrary The main points to take away here are: 1.The standard analyses will virtually always show some degree of positive bias 2.In some (entirely realistic) cases, this bias can be extreme 3.The degree of bias depends in a predictable way on the design of the experiment (e.g., the sample sizes)

The old solution: Quasi-F statistics Although quasi-Fs successfully address the statistical problem, they suffer from a variety of limitations – Require complete orthogonal design (balanced factors) – No missing data – No continuous covariates – A different quasi-F must be derived (often laboriously) for each new experimental design – Not widely implemented in major statistical packages

The new solution: Mixed models Known variously as: – Mixed-effects models, multilevel models, random effect models, hierarchical linear models, etc. Most social psychologists familiar with mixed models for hierarchical random factors – E.g., students nested in classrooms Less well known is that mixed models can also easily accommodate designs with crossed random factors – E.g., participants crossed with stimuli

Grand mean = 100

Mean A = -5 Mean B = 5

Participant Intercepts

Stim. Intercepts:

Participant Slopes

Everything else = residual error

The linear mixed-effects model with crossed random effects Fixed effects Random effects

The linear mixed-effects model with crossed random effects Intercept Slope 6 parameters

Fitting mixed models is easy: Sample syntax library(lme4) model <- lmer(y ~ c + (1 | j) + (c | i)) proc mixed covtest; class i j; model y=c/solution; random intercept c/sub=i type=un; random intercept/sub=j; run; MIXED y WITH c /FIXED=c /PRINT=SOLUTION TESTCOV /RANDOM=INTERCEPT c | SUBJECT(i) COVTYPE(UN) /RANDOM=INTERCEPT | SUBJECT(j). R SAS SPSS

Mixed models successfully maintain the nominal type 1 error rate (α =.05)

Applications to existing datasets 1.Representative simulated dataset (for comparison) 2.Afrocentric features data (Blair et al., 2002, 2004, 2005) 3.Shooter data (Correll et al., 2002, 2007) 4.Psi / Retroactive priming data (Bem) – Forward-priming condition (classic evaluative priming effect) – Reverse-priming condition (psi condition)

Comparison of effects between RM-ANOVA and mixed model analyses DatasetRM-ANOVA (by-participant)Mixed modelStimulus ICC F ratioD.F.pF ratioD.F.p Simulated example 30.48(1, 29)< (1, 38.52).005r = Shooter data57.89(1, 35)< (1, 48.1).072r = Afrocentric features data 6.40(1, 46) (1, 51.1).043r = Bem (2011) Forward- priming condition 22.18(1, 98)< (1, 46.91).029Targets: r = Primes: r = Bem (2011) Reverse- priming condition 6.60(1, 98) (1, 27.58).136Targets: r = Primes: r = 0.0

Comparison of effects between RM-ANOVA and mixed model analyses DatasetRM-ANOVA (by-participant)Mixed modelStimulus ICC F ratioD.F.pF ratioD.F.p Simulated example 30.48(1, 29)< (1, 38.52).005r = Shooter data57.89(1, 35)< (1, 48.1).072r = Afrocentric features data 6.40(1, 46) (1, 51.1).043r = Bem (2011) Forward- priming condition 22.18(1, 98)< (1, 46.91).029Targets: r = Primes: r = Bem (2011) Reverse- priming condition 6.60(1, 98) (1, 27.58).136Targets: r = Primes: r = 0.0

Comparison of effects between RM-ANOVA and mixed model analyses DatasetRM-ANOVA (by-participant)Mixed modelStimulus ICC F ratioD.F.pF ratioD.F.p Simulated example 30.48(1, 29)< (1, 38.52).005r = Shooter data57.89(1, 35)< (1, 48.1).072r = Afrocentric features data 6.40(1, 46) (1, 51.1).043r = Bem (2011) Forward- priming condition 22.18(1, 98)< (1, 46.91).029Targets: r = Primes: r = Bem (2011) Reverse- priming condition 6.60(1, 98) (1, 27.58).136Targets: r = Primes: r = 0.0

Conclusion Many failures of replication are probably due to sampling stimuli and the failure to take that into account Mixed models with crossed random effects allow for generalization to future studies with different samples of both stimuli and participants

The end Further reading: Judd, C. M., Westfall, J., & Kenny, D. A. (2012). Treating stimuli as a random factor in social psychology: A new and comprehensive solution to a pervasive but largely ignored problem. Journal of personality and social psychology, 103(1),