# Multilevel Modeling in Health Research April 11, 2008.

## Presentation on theme: "Multilevel Modeling in Health Research April 11, 2008."— Presentation transcript:

Multilevel Modeling in Health Research April 11, 2008

Outline Introduction Research Design Model Building Software Tools Example with Florida infant health and Florida CHARTS

Multilevel Modeling The multilevel model is also known as a ‘mixed and random-effects model,’ ‘variance component model’ and ’hierarchical model.’ Multilevel modeling tries to avoid two problems confronted in data analysis: Atomistic fallacy: individual level analysis misses the context in which behavior occurs Ecological fallacy: aggregate results might not apply to individual behavior

Multilevel Modeling Single-level Analysis Random Intercepts

Multilevel Modeling MLM helps resolve the tension between the atomistic and ecological fallacy.

Multilevel Modeling Two level models: individuals and communities, “the key feature of multilevel models is that they specify the potentially different intercepts and slopes for each community as coming from a distribution at a high level.” The alpha and beta parameter in the above equations are fixed, while the variances and covariance of the error term are random.

Multilevel Modeling “The most important point in favor of multilevel models is that they are more realistic and faithful to the nature of the social world with its many layers and complex structure. Previously, researchers have been on the horns of a dilemma. They have had to work at either the level of the aggregate or the individual. Choosing to work at the aggregate level lays one open to the charge of the ecological fallacy and aggregation bias, while choosing to work at the individual level risks being found guilty of the atomistic fallacy. The latter approach misses the context in which individual behavior occurs, while the former fails to recognize that it is individuals who act, not aggregates” (Jones 1995, 255).

Multilevel Modeling Technical advantages of multilevel modeling: 1. Fix for autocorrelation: single level models are misspecified, this causes incorrect standard errors, confidence limits and tests, and incorrect estimates of precision 2. Higher-level variables are included: both macro and micro-level analysis is considered 3. Precision-weighted estimation is used: multilevel modes are precision weighted so as to reflect the number of observations on which they are based (Jones 1993).

Multilevel Modeling Critiques of this method are that it requires a discrete set of spatial units and assumes a discontinuous spatial process. Large samples are needed to examine within group and between group variation – 30/30 rule

Software Tools SAS STATA R MLwiN

Example My case: I am interested in individual determinants of low birth weight infants, but I understand that place is interesting and has a role in health, and that place characteristics can help substitute for the lack of information about individuals. Apply the Gllamm model in STATA Plot results, residuals Plot results on map Discuss results and way forward for research

Using Florida Birth Records to Study Infant Health Odds Ratios for Key Paths from Three Generation Biosocial Model (singletons). Mother (G2) LBW Mother (G2) LTHS Mother (G2) UNMAR Infant (G3) LBW Total (N=55,430) a Grandmother (G1) Unmarried 1.446**2.593**2.178**1.041 Mother (G2) Low Birth Weight 1.080*0.9721.781** Mother (G2) Low Education 4.567**1.260** Mother (G2) Unmarried 1.336** White (N=34,267) Grandmother (G1) Unmarried 1.467**3.423**2.358**1.061 Mother (G2) Low Birth Weight 1.176**0.9931.939** Mother (G2) Low Education 4.726**1.496** Mother (G2) Unmarried 1.336** Black (N=20,808) Grandmother (G1) Unmarried 1.435**1.981**2.005**1.042 Mother (G2) Low Birth Weight 1.0280.9541.688** Mother (G2) Low Education 3.996**1.110* Mother (G2) Unmarried 1.212** * p<0.05; ** p<0.01 a Includes controls for race.

Births of Residents in Two Generation Dataset

. gllamm birth_weight mother_education, i(county) Iteration 0: log likelihood = -508012.85 Iteration 1: log likelihood = -507870.83 Iteration 2: log likelihood = -507863.57 (not concave) Iteration 3: log likelihood = -507863.42 Iteration 4: log likelihood = -507862.58 Iteration 5: log likelihood = -507862.51 Iteration 6: log likelihood = -507862.51 number of level 1 units = 64813 number of level 2 units = 67 Condition Number = 2403.9866 gllamm model log likelihood = -507862.51 ------------------------------------------------------------------------------ birth_w | Coef. Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- mothered | 45.22001 1.577136 28.67 0.000 42.12888 48.31114 _cons | 3064.606 6.542607 468.41 0.000 3051.782 3077.429 ------------------------------------------------------------------------------ Variance at level 1 ------------------------------------------------------------------------------ 374160.99 (2079.2281) Variances and covariances of random effects ------------------------------------------------------------------------------

Fixed and Random Effects Shown

Other Variables of Interest that are Contextual

Correlation between Slope for Mother’s Education on LBW and County-Level Variables

Conclusions and Discussion Multilevel modeling offers a way to address structural constrains, neighborhood context, social networks and individual behavioral factors simultaneous. Multilevel modeling embodies a family of statistical approaches, some of which are linear and some of which are non-linear. What are the ways that you see multilevel modeling potentially fitting into your analyses?