Contextual effects In the previous sections we found that when regressing pupil attainment on pupil prior ability schools vary in both intercept and slope.

Slides:



Advertisements
Similar presentations
Questions From Yesterday
Advertisements

Multilevel modelling short course
Making Inferences about Causality In general, children who watch violent television programs tend to behave more aggressively toward their peers and siblings.
Simple Linear Regression Analysis
Multiple Regression and Model Building
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
Lecture 11 (Chapter 9).
Regression and correlation methods
Lesson 10: Linear Regression and Correlation
Kin 304 Regression Linear Regression Least Sum of Squares
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Inference for Regression
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
Objectives (BPS chapter 24)
CORRELATON & REGRESSION
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
From last time….. Basic Biostats Topics Summary Statistics –mean, median, mode –standard deviation, standard error Confidence Intervals Hypothesis Tests.
School of Veterinary Medicine and Science Multilevel modelling Chris Hudson.
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Clustered or Multilevel Data
Intro to Statistics for the Behavioral Sciences PSYC 1900
Treatment Effects: What works for Whom? Spyros Konstantopoulos Michigan State University.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
Lecture 17 Interaction Plots Simple Linear Regression (Chapter ) Homework 4 due Friday. JMP instructions for question are actually for.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Analysis of Clustered and Longitudinal Data
Example of Simple and Multiple Regression
Chapter 13: Inference in Regression
Correlation and Linear Regression
Correlation and regression 1: Correlation Coefficient
Copyright © Cengage Learning. All rights reserved. 12 Simple Linear Regression and Correlation.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 25 Categorical Explanatory Variables.
1 Research Method Lecture 6 (Ch7) Multiple regression with qualitative variables ©
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Moderation & Mediation
Lecture 3-3 Summarizing r relationships among variables © 1.
Regression. Regression  y= output, such as income, support for welfare  c= constant, does not change  b= coefficient (an increase of one year of education.
Linear Functions 2 Sociology 5811 Lecture 18 Copyright © 2004 by Evan Schofer Do not copy or distribute without permission.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Introduction Multilevel Analysis
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
1 G Lect 7M Statistical power for regression Statistical interaction G Multiple Regression Week 7 (Monday)
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Scatterplots & Regression Week 3 Lecture MG461 Dr. Meredith Rolfe.
Simple Linear Regression In the previous lectures, we only focus on one random variable. In many applications, we often work with a pair of variables.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
1 Psych 5510/6510 Chapter 13 ANCOVA: Models with Continuous and Categorical Predictors Part 2: Controlling for Confounding Variables Spring, 2009.
General Linear Model.
Applied Quantitative Analysis and Practices LECTURE#31 By Dr. Osman Sadiq Paracha.
Basic Statistics Linear Regression. X Y Simple Linear Regression.
Chapter 8: Simple Linear Regression Yang Zhenlin.
FIXED AND RANDOM EFFECTS IN HLM. Fixed effects produce constant impact on DV. Random effects produce variable impact on DV. F IXED VS RANDOM EFFECTS.
Sharon Wolf NYU Abu Dhabi Additional Insights Summer Training Institute June 15,
ANOVA, Regression and Multiple Regression March
CORRELATION ANALYSIS.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
The simple linear regression model and parameter estimation
Regression Analysis.
Simple Linear Regression
B&A ; and REGRESSION - ANCOVA B&A ; and
Lecture 4 - Model Selection
Multiple Regression – Split Sample Validation
An Introductory Tutorial
Presentation transcript:

Contextual effects In the previous sections we found that when regressing pupil attainment on pupil prior ability schools vary in both intercept and slope resulting in crossing lines. The next question is are there any higher level variables that can explain these patterns of variation? Similarly, we found that when regressing person hedonism on person income countries vary in both intercept and slope resulting in crossing lines. That is are there school or country level variables that can explain some of this between school and between country variance we have found.

Example Research questions Interest lies in how the outcome of individuals in a cluster is affected by their social contexts (measures at the cluster level). Typical questions are Does school type effect students' educational progress? Do teacher characteristics effect students' educational progress? Is a students progress effected by the ability of his or her peers? Does area deprivation effect the health status of individuals in the area? Children within families is also a multilevel structure. Does family background effect child developmental outcomes?

An Educational example A question of widespread interest is do children make better academic progress in single or mixed. This is a question about the contextual effect of school gender on child progress. Consider a random intercept model on our educational data set

Including school gender contextual effects Make inference that girls in girl schools progress better than girls in mixed schools. Z=0.245/0.85=2.9, p=0.005 Do not make inference that boys in boy schools progress better than boys n mixed schools. Z=0.097/0.109=0.89, p=0.390

Comparing the models with and without contextual effects Between school variance reduced by 13% as a result of fitting school gender Between pupil variance unchanged, addition of higher level variables cannot alter variance at lower levels

Comparing random intercept contextual effects model, with single level contextual effects model Inference changes, under single level model we infer boys do better in single sex schools. Single level models which ignore clustering will give rise to mis-estimated precision, particularly for contextual effects. Why?

Mis-estimated precision In this data set we have 4059 pupils in 65 schools. We therefore only have 65 independent bits of information with which to identify the school effects. The standard errors calculated for the single level model assume we have 4059 bits of information. With level 1 predictors we also get mis-estimated precision because the clustering at higher levels induces correlations between individuals in the same group, so we also have less than 4059 bits of information. However, the problem is typically far less severe. Indeed in the above analysis there are no differences between the standard errors for the coefficients of level 1 predictor variable prior ability between the single level and random intercept models.

Why not just include dummy variables for each school in the single level model This would be the equivalent of fitting an analysis of covariance model. Unfortunately this will not work. We only have 65 pieces of information at the school level and if we use this up by fitting 65 dummy variables we can add no more variables at the school level. So we can not add the school gender dummies. Multilevel modelling provides a richer and more accurate exploratory framework for modelling contextual effects. We get the right standard errors We can see how much of the higher level variation was explained by the contextual effects and how much remains.

Within group, between group and contextual effects of for level 1 predictors We will simulate a small data set with 100 individuals from four groups, for the sake of argument lets say that our response is happiness, we have a predictor variable that is income and the groups are 4 different countries. Lets simulate the mean income in each country as 2,4,6,8. We then simulate our response as Note that this model does not include a country level random effect. Therefore the only differences between country happiness levels are produced by differing incomes in the countries. Often contextual variables are constructed by aggregating level 1 predictors, country level income, family level aggression etc

Fitting a regression model ignoring average income And we might conclude as income rises happiness reduces

Including the average income variable Now we have a different picture. Within all countries the relationship between income and happiness is positive. This is the within country slope(1.128) Recall the country average incomes are {2,4,6,8}, which gives intercepts of {1.13,-3.51, , -12.8} However the intercept for country j is av_income j is the contextual effect. Now within all countries we have a positive relationship between income and happiness, however people in richer countries tend to be less happy than poorer ones.

What about the between group effect? The between group effect is the slope of the regression of the country mean for happiness on the country mean for income. We have identified the within group and contextual effects of the income predictor. That is the slope of the regression line through the four red points and is within group = contextual = between group = Notice that the between group = the within group + the contextual B = W + C =1.128+(-1.204) Or C = B – W that is the contextual effect is the the difference of the between and within regressions

Examples of different contextual effects W=1,B= -1,C=-2W = 1, B=1,C=0W=1,B=3,C=2 Contextual effect pulls down intercept as av_income increases Contextual effect 0 so intercept unchanged as av_income increases Contextual effect pushes up intercept as av_income increases

Why does B=W+C?

Group mean centering of a level 1 predictor If we… Something interesting happens. In our example this corresponds to The intercept and within slope remain unchanged, but 2 now estimates the between group coefficient not the contextual coefficient for income…

Comparison of uncentered and group mean centered level 1 income predictor Note that if you grand mean centre income then 2 remains the contextual coefficient – ie behaves in the same way as the uncentered case

With group mean centering why is 2 the between group effect

And now to some real data.. Consider the random slopes model on the hedonism data where we regress hedonism on income Note income centered around grand mean of 6.

Adding contextual effect for income The contextual effect is positive but non-significant. If we obtain model predictions for a low(3.5) and high(8) average income country we obtain Low = ( ) income = income High = ( ) income = income Thus the positive contextual effect means countries with higher average incomes have higher intercepts and this has reduce the intercept variation by a small amount from to 0.066

Cross-level interactions The fixed part of the previous model assumes the within country income slopes do not change as functions of country level variables. So the within country lines for low and high income countries are parallel. It may be that the with country income slopes may change as a function of the contextual variable av_income. We can test this assumption by allowing the within country income slopes to change as a function of av_income that is fit an interaction between income and av_income

Cross level interaction The cross level interaction is negative and statistically significant. The model predictions for countries with low and high average incomes are now Low = ( ) + ( ) income = income High = ( ) + ( ) income = income

Interpreting the graph At low levels of individual income people in rich countries are more hedonistic. As individual income rises the rich-poor country differential diminishes For what incomes is the rich country-poor country gap significant?

Contextual effects explaining level 2 variance Given a random slope model Contextual variables explain level 2 variation in the intercept and cross-level interactions of contextual variables with x 1 explain level variability in the slope coefficient This can be seen by re-arranging