Lecture 6 (chapter 5) Revised on 2/22/2008. Parametric Models for Covariance Structure We consider the General Linear Model for correlated data, but assume.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

GENERAL LINEAR MODELS: Estimation algorithms
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Inference for Regression
Correlation and regression
Lecture 4 (Chapter 4). Linear Models for Correlated Data We aim to develop a general linear model framework for longitudinal data, in which the inference.
Objectives (BPS chapter 24)
Multiple Regression [ Cross-Sectional Data ]
Chapter 13 Multiple Regression
Statistics for Managers Using Microsoft® Excel 5th Edition
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 12 Multiple Regression
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Clustered or Multilevel Data
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Longitudinal Data Analysis: Why and How to Do it With Multi-Level Modeling (MLM)? Oi-man Kwok Texas A & M University.
Correlation and Regression Analysis
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Analysis of Clustered and Longitudinal Data
Regression and Correlation Methods Judy Zhong Ph.D.
Lecture 9: Marginal Logistic Regression Model and GEE (Chapter 8)
Inference for regression - Simple linear regression
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 11 Simple Regression
STA291 Statistical Methods Lecture 27. Inference for Regression.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 25 Categorical Explanatory Variables.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
Lecture 1 (Chapter 1). Introduction This course describes statistical methods for the analysis of longitudinal data, with a strong emphasis on applications.
Covariance structures in longitudinal analysis Which one to choose?
Lecture 8: Generalized Linear Models for Longitudinal Data.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 22 Regression Diagnostics.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
HSRP 734: Advanced Statistical Methods June 19, 2008.
Modeling Repeated Measures or Longitudinal Data. Example: Annual Assessment of Renal Function in Hypertensive Patients UNITNOYEARAGESCrEGFRPSV
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Lecture 7: Parametric Models for Covariance Structure (Examples)
Lecture 10: Correlation and Regression Model.
Simple linear regression Tron Anders Moger
Time Series Basics Fin250f: Lecture 8.1 Spring 2010 Reading: Brooks, chapter
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Correlation & Regression Analysis
Dynamic Models, Autocorrelation and Forecasting ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Copyright © 2011 Pearson Education, Inc. Regression Diagnostics Chapter 22.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Tutorial I: Missing Value Analysis
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
G Lecture 71 Revisiting Hierarchical Mixed Models A General Version of the Model Variance/Covariances of Two Kinds of Random Effects Parameter Estimation.
Chapter 13 Simple Linear Regression
Chapter 15 Multiple Regression Model Building
Inference for Regression
Linear Mixed Models in JMP Pro
Chapter 6: Autoregressive Integrated Moving Average (ARIMA) Models
G Lecture 6 Multilevel Notation; Level 1 and Level 2 Equations
Statistical Methods For Engineers
CHAPTER 29: Multiple Regression*
Least Squares Regression Line LSRL Chapter 7-continued
A Gentle Introduction to Linear Mixed Modeling and PROC MIXED
BY: Mohammed Hussien Feb 2019 A Seminar Presentation on Longitudinal data analysis Bahir Dar University School of Public Health Post Graduate Program.
Chapter 5 LSRL.
CHAPTER 12 More About Regression
Product moment correlation
Presentation transcript:

Lecture 6 (chapter 5) Revised on 2/22/2008

Parametric Models for Covariance Structure We consider the General Linear Model for correlated data, but assume that the covariance structure of the sequence of measurements on each unit is to be specified by the values of unknown parameters.

Parametric Models for Covariance Structure

Parametric Models for Covariance Matrices Model for the mean Model for the covariance matrix

Parametric Models for Covariance Matrices We now consider more general models for the covariance matrix which can be specified by looking at the empirical variogram.

Interpretation of the Variogram

Variance of the random effects Serial correlation Measurement error corr

Interpretation of the Variogram or Variogram: with variance and correlation function with variance Total variance:

Variogram for a model with Random Intercept plus serial correlation plus measurement error variance of the random effect measurement error variance variance of the serial process

Cow Data 19 weeks of measurement, 3 diets: barley, mixed, lupins (barley data shown above)

Example: Protein Content of Milk The “roof” of the variogram is placed at 0.087

Example: Protein Content of Milk (cont’d) the variogram does not reach the total variance the variogram does not start at zero

Example: Protein Content of Milk (cont’d) Variogram of protrs (17 percent of v_ij’s excluded) variance of the random intercept Measurement error

Example: Protein Content of Milk (cont’d)

Parametric Models for Covariance Structure We will cover this in the multi-level course

Models To develop a model, we need to understand the sources of variation: 1.Random Effects (Intercept): random variation between units 2.Serial Correlation: time-varying random process within a unit 3.Measurement Error: measurement process introduces a component of random variation

How to incorporate these qualitative features into specific models

Pure Serial Correlation Exponential correlation model for equally spaced measurements

Autoregressive Model of Order 1 Another way to build a serial correlation model is to assume an explicit dependence of the on its predecessors The simplest example is a AR(1) where AR(1) models are appealing for equally spaced data, less so for unequally spaced. For example, it would be hard to interpret if the measurements were not equally spaced in time xtgee + corr(AR1)

Exponential Correlation Model for Unequally Spaced data The exponential correlation model can handle unequally spaced data. We can assume Then prais command in stata allow a different coefficient for each time difference

Model-Fitting 1. Formulation: choosing the general form of the model Mean Association 2. Estimation: fitting the model 1.Weighted least squares for 2.ML for covariance parameters or subset 3.Iterate (1) and (2) to convergence

Model-Fitting (cont’d) 3. Diagnostic: checking that the model fits the data by examining residuals for lack of fit, correlation 4. Inference: calculating confidence intervals or testing hypotheses about parameters of interest

Step 1. Formulation Formulation of the model is a continuation of exploratory data analysis. Focus on the mean and covariance structures 1.Look at the residuals 2.Create time plots, scatterplot matrices and empirical variograms 3.Do you have stationarity in the residuals? If not, you need to transform the data or use inherently non-stationary models as a model with a random intercept and random slope 4.Once stationarity has been achieved, use the empirical variogram to estimate the underlying covariance structure

Step 2. Estimation

Step 4. Diagnostics The aim is to compare the data with the fitted model. How? 1.Super-impose the fitted mean response profiles on a time plot of the average observed responses within each combination of treatment and times. 2.Super-impose the fitted variogram on a plot of the empirical variogram.

Examples and Summary: Nepal Dataset This dataset contains anthropologic measurements on Nepalese children. The study design called for collecting measurements on 2258 kids at 5 time points, spaced approximately 4 months apart Q: Estimate the association between arm circumference and child’s weight, adjusted by age and gender, and accounting for the correlation in the repeated measures within the same child

Examples and Summary: Nepal Dataset Time varying confounder Goal: estimate the average change in arm circumference for one unit change In weight accounting for the potential confounding effect of the time varying covariate age and for sex. The standard errors of the estimated association Between arm circumference and weight are estimated accounting for the correlation of the repeated measures within a child,

Examples and Summary: Nepal Dataset Models for the covariance matrix: (measurement error only)

Examples and Summary: Nepal Dataset (More) Models for the covariance matrix: + measurement error

Independence Model (measurement error) 0

Independence Model (cont’d)

Uniform Model (also, “Exchangeable” or “Compound Symmetry”) A better model (than the Independence Model) is to assume same correlation for all pairs of observations: This is called the uniform, exchangeable, or compound symmetry correlation model.

Uniform (Random Intercept) plus measurement error

Uniform Model (also, “Exchangeable” or “Compound Symmetry”) (cont’d)

Uniform Model

Uniform Model (also, “Exchangeable” or “Compound Symmetry”) (cont’d)

Exponential Correlation Model A different model is to assume that the correlation of observations closer together in time is larger than that of observations farther apart in time. One model for this is the exponential model:

Exponential Correlation Model Within subject correlation at lag 1 (4 months separation approx)

Exponential Correlation Model (cont’d)

Exchangeable + Exponential Model

Random Intercept + Serial Correlation

Summary of Example We see that the exchangeable correlation (0.691) is similar to the model without the exponential correlation (0.748), and the exponential correlation (0.185) is now much smaller. The regression parameter estimates are also similar to the exchangeable case. This suggests that the exchangeable correlation model may be capturing the main correlation pattern.

Summary Modelling the correlation in longitudinal data is important to be able to obtain correct inferences on regression coefficients. This leads to: 1.Statistical Efficiency 2.Correct Standard Errors These are marginal models because the interpretation of the regression coefficients is the same as that in cross-sectional data. 1.Exchangeable correlation model: subject-specific formulation 2.Exponential correlation model: transition model formulation

Summary (cont’d) Three basic elements of correlation structure 1.Random effects 2.Auto-correlation or serial dependence 3.Observation-level noise or measurement error

Evaluating Covariance Models Once you have chosen a (set of) covariance model(s), how do you evaluate whether it fits the data well? Or, how do you compare several of them? Several tools (each work with either ML or ReML): 1.Likelihood Ratio Tests (LRTs) for comparing nested models 2.Akaikie’s Information Criterion (AIC) 3.QIC to compare models fitted with GEE 4.Examining fitted model variograms

Comparing Covariance Models with Akaike’s Information Criterion (AIC)

Comparing Covariance Models with Akaike’s Information Criterion (AIC) (cont’d)