ERCIM May 2001 Analysis of variance, general balance and large data sets Roger Payne Statistics Department, IACR-Rothamsted, Harpenden, Herts AL5 2JQ.

Slides:



Advertisements
Similar presentations
The Simple Linear Regression Model Specification and Estimation Hill et al Chs 3 and 4.
Advertisements

Weighted Least Squares Regression Dose-Response Study for Rosuvastin in Japanese Patients with High Cholesterol "Randomized Dose-Response Study of Rosuvastin.
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Properties of Least Squares Regression Coefficients
1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: The Linear Prediction Model The Autocorrelation Method Levinson and Durbin.
GENERAL LINEAR MODELS: Estimation algorithms
Component Analysis (Review)
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Experimental Design, Response Surface Analysis, and Optimization
Lecture 6 (chapter 5) Revised on 2/22/2008. Parametric Models for Covariance Structure We consider the General Linear Model for correlated data, but assume.
4.3 Confidence Intervals -Using our CLM assumptions, we can construct CONFIDENCE INTERVALS or CONFIDENCE INTERVAL ESTIMATES of the form: -Given a significance.
Lecture 13 - Eigen-analysis CVEN 302 July 1, 2002.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Factor Analysis Purpose of Factor Analysis
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Chapter 4 Multiple Regression.
Probability theory 2011 The multivariate normal distribution  Characterizing properties of the univariate normal distribution  Different definitions.
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
Ordinary least squares regression (OLS)
Linear and generalised linear models
Linear and generalised linear models
Basics of regression analysis
Linear and generalised linear models Purpose of linear models Least-squares solution for linear models Analysis of diagnostics Exponential family and generalised.
Probability theory 2008 Outline of lecture 5 The multivariate normal distribution  Characterizing properties of the univariate normal distribution  Different.
The t Tests Independent Samples.
Linear regression models in matrix terms. The regression function in matrix terms.
Meta-Analysis and Meta- Regression Airport Noise and Home Values J.P. Nelson (2004). “Meta-Analysis of Airport Noise and Hedonic Property Values: Problems.
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
Stats & Linear Models.
Separate multivariate observations
1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 5QF Introduction to Vector and Matrix Operations Needed for the.
Structural Equation Modeling Continued: Lecture 2 Psy 524 Ainsworth.
H 0 (z) x(n)x(n) o(n)o(n) M G 0 (z) M + vo(n)vo(n) yo(n)yo(n) H 1 (z) 1(n)1(n) M G 1 (z) M v1(n)v1(n) y1(n)y1(n) fo(n)fo(n) f1(n)f1(n) y(n)y(n) Figure.
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
Some matrix stuff.
Applications The General Linear Model. Transformations.
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
Chapter 12 Multiple Linear Regression Doing it with more variables! More is better. Chapter 12A.
1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 6 Solving Normal Equations and Estimating Estimable Model Parameters.
Chem Math 252 Chapter 5 Regression. Linear & Nonlinear Regression Linear regression –Linear in the parameters –Does not have to be linear in the.
Model Comparison for Tree Resin Dose Effect On Termites Lianfen Qian Florida Atlantic University Co-author: Soyoung Ryu, University of Washington.
Colorado Center for Astrodynamics Research The University of Colorado 1 STATISTICAL ORBIT DETERMINATION ASEN 5070 LECTURE 11 9/16,18/09.
1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 8 Analysis of Variance.
Analysis Overheads1 Analyzing Heterogeneous Distributions: Multiple Regression Analysis Analog to the ANOVA is restricted to a single categorical between.
Environmental Modeling Basic Testing Methods - Statistics III.
Repeated Measures Analysis of Variance Analysis of Variance (ANOVA) is used to compare more than 2 treatment means. Repeated measures is analogous to.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Psychology 202a Advanced Psychological Statistics October 22, 2015.
1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 9 Review.
G Lecture 71 Revisiting Hierarchical Mixed Models A General Version of the Model Variance/Covariances of Two Kinds of Random Effects Parameter Estimation.
Learning Theory Reza Shadmehr Distribution of the ML estimates of model parameters Signal dependent noise models.
Designs for Experiments with More Than One Factor When the experimenter is interested in the effect of multiple factors on a response a factorial design.
Boot Camp in Linear Algebra TIM 209 Prof. Ram Akella.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Econometrics III Evgeniya Anatolievna Kolomak, Professor.
Université d’Ottawa / University of Ottawa 2003 Bio 8102A Applied Multivariate Biostatistics L4.1 Lecture 4: Multivariate distance measures l The concept.
Chapter 7. Classification and Prediction
Linear Mixed Models in JMP Pro
CJT 765: Structural Equation Modeling
Evgeniya Anatolievna Kolomak, Professor
Chapter 3: TWO-VARIABLE REGRESSION MODEL: The problem of Estimation
12 Inferential Analysis.
G Lecture 6 Multilevel Notation; Level 1 and Level 2 Equations
OVERVIEW OF LINEAR MODELS
12 Inferential Analysis.
OVERVIEW OF LINEAR MODELS
Fixed, Random and Mixed effects
Presentation transcript:

ERCIM May 2001 Analysis of variance, general balance and large data sets Roger Payne Statistics Department, IACR-Rothamsted, Harpenden, Herts AL5 2JQ

General balance.. ERCIM May 2001 is a very useful concept for small to medium sized data sets it caters for several sources of variation it leads to very efficient algorithms & to clear output, test statistics etc but what about large to very large data sets...?..

General balance.. ERCIM May 2001 fits a mixed model with several error (or block) terms total sum of squares partitioned into components known as strata, one for each block term: each stratum contains the sum of squares for the treatment terms estimated between the units of that stratum a residual representing the random variability of those units..

General balance: example.. ERCIM May 2001

General balance: properties.. ERCIM May 2001 block (error) terms mutually orthogonal treatment terms mutually orthogonal contrasts of each treatment term all have equal efficiency factors in each of the strata where they are estimated..

General balance: theory.. ERCIM May 2001 mixed model: y =   Z    X  dispersion structure: Var(y) = V =     Š  Š  known symmetric matrices with Š  Š  =   Š  (i.e. orthogonal)   Š  = I..

General balance: theory.. ERCIM May 2001 random effects model: y  E(y) =   Z    where E(  ) = 0 and Var(  ) =   2 I then Var(y) =     2 Z  Z  and if terms orthogonal S  = Z  Z  Z    Z  = (1 / n  ) Z  Z  (if equal rep.) S  is the projection operator (form and project means) Š  = S  ( I   term  marginal to term } Š  )   = n    2 +  term  marginal to term } n    2..

General balance: theory.. ERCIM May 2001 treatment structure E(y) = X  =  i X i  i (X = [ X 0 | X 1 | X 2 |...] ) E(y) =  = T  (T = X ( X X )  X ) treatment terms are orthogonal, so E(y) =  i Ť i  i T i = X i X i X i   X i = (1 / n i ) X i X i (if equal rep.) Ť i = T i ( I   jterm j marginal to term i} Ť j )..

General balance: theory.. ERCIM May 2001 orthogonal block structure implies independent least squares analysis within each stratum residual s.s.(Š  y  Š  T  ) (Š  y  Š  T  ) normal equations T Š  T  ^  = T Š  y final condition Ť i Š  Ť j =  ij i Ť i normal equations now  i i Ť i  ^ i =  i Ť i Š  y solved by  ^ i = (1/ i ) Ť i Š  y with var-cov matrix   Ť i / i i is efficiency factor of term i and eigenvalue of Ť i Š  Ť i..

Analysis by “sweeps”.. ERCIM May 2001 requires a first-order balance all effects of each model term have an equal efficiency factor, at each point where the term is estimated (see Wilkinson 1970, Biometrika; Payne & Wilkinson 1977, Applied Statistics) similar to general balance, but that has block (i.e. error) terms mutually orthogonal (note: always true if nested) treatment terms mutually orthogonal (see Payne & Tobias 1992, Scandinavian J. Stats.)..

Analysis by “sweeps”.. ERCIM May 2001 requires a working vector v which initially contains the data values finally contains the residuals terms fitted sequentially: sweeps estimate and remove effects of a term i in stratum  by v (+1) = { I  ( 1 / i ) T i } v () and are then followed by a repeat of the sweeps up to this point (a reanalysis sequence) can omit reanalysis sweeps of terms orthogonal to term i (so none if i  1, & much simpler if general balance) notice projection operator T i simply calculates tables of means so no matrix inversion (unless there are covariates)..

Pictorial representation.. ERCIM May 2001 efficiency factor = sin 2  (Payne & Wilkinson 1977, Applied Statist.)

Analysis by “sweeps”.. ERCIM May 2001 with general balance: initial working vector for stratum  calculated by S   < (I  S  ) y S  is a pivot (calculate means, and insert into vector) and fitted values for treatment term i in stratum  calculated by (1/ i )T i  j: j>0; j<i {R j (I-(1/ j )T j )} S   < (I  S  )y R i = Iif i = 1 R i = S   < (I  S  )yif i < 1..

Other issues.. ERCIM May 2001 analysis of covariance analyse the response (y) variate and the covariates calculate the covariate regression coefficients (regression of y residuals on covariate residuals) adjust treatment estimates and sums of squares combination of information form treatment (and covariate) estimates combining information from all the strata where each is estimated weighted combination of estimates with general balance estimate stratum variances   to calculate weights see Payne & Tobias (1992, Scand.J.Stats.)..

Workspace requirements.. ERCIM May 2001 sweep algorithm working vector: N vectors for effects of each term: n  or n i analysis of covariance - symmetric matrices: ncov(ncov+1)/2, n i (n i +1)/2 c.f. multiple-regression style algorithms (including REML) which typically require matrix: neffects(neffects+1)/2 where neffects is total no block & treatment effects excluding residual vector(s): N much more efficient for large models (see Payne & Welham 1990, COMPSTAT)..

Large data sets.. ERCIM May 2001 data may be unbalanced take a balanced sample..? adapt the algorithm..? Wilkinson (1970, Biometrika) “general recursive algorithm” requires as many sweep sequences for each term i in stratum  as Ť i Š  Ť i has eigenvalues Iterative algorithms Hemmerle (1974, JASA) Worthington (1975, Biometrika)..

Large data sets.. ERCIM May 2001 Hemmerle (1974, JASA) also uses sweep-type operations does not require first-order balance instead performs a sequence of “balanced sweeps” for each term until estimation converges but only one error term fits whole model at once (so a sequence of increasingly large models is required to assess individual terms) data must be “connected” (i.e. no aliasing) does not provide sed’s..

Large data sets.. ERCIM May 2001 Worthington (1975, Biometrika) performs sequence of operations analogous to projections and sweeps assumes additional (unspecified) algorithm to determine the strata (and their projectors) assumes orthogonal (equal replicated) block structure and assumes equally replicated treatment combinations again fits whole model at once..

Generalizing the algorithms.. ERCIM May 2001 both algorithms are based on the result (I  M) 1 = I + M + M (when this converges) apply this to the general form T Š  T  ^  = T Š  y T S   < (I  S  ) T  ^  = T S   < (I  S  ) y  ^  = (I +  m (I  TS   < (IS  )T) m ) TS   < (IS  ) y  ^  (1)  = TS   < (I  S  ) y  ^  (m+1)  =  ^  (m) + (ITS   < (IS  )T) m TS   < (IS  )y relatively straightforward algorithm sequences of sweeps & pivots with efficiency 1

Generalizing the algorithms.. ERCIM May 2001 iterative scheme  ^  (1)  = TS   < (IS  ) y  ^  (m+1)  =  ^  (m) + (ITS   < (IS  )T) m TS   < (IS  )y implementation  ^  (m+1)  =  ^  (m) +  (m)  (m) =(I  T S   < (I  S  ) T) m TS   < (I  S  ) y =(I  T S   < (I  S  ) T)  (m1) calculation of  (m).. project  (m1) into the treatment space T  (m1) project into stratum  S   < (IS  )T  (m1) project into treatment spaceTS   < (IS  )T  (m1) subtract result from  (m1) (ITS   < (IS  )T)  (m1)