# Dynamic Factor Analysis Ellen L. Hamaker Methods and Statistics Faculty of Social Sciences Utrecht University The Netherlands.

## Presentation on theme: "Dynamic Factor Analysis Ellen L. Hamaker Methods and Statistics Faculty of Social Sciences Utrecht University The Netherlands."— Presentation transcript:

Dynamic Factor Analysis Ellen L. Hamaker Methods and Statistics Faculty of Social Sciences Utrecht University The Netherlands

Outline i.Introduction ii.Time series analysis iii.Linear Kalman filter iv.Illustration 1 v.Regime-switching Kalman filter vi.Illustration 2 vii.Discussion

2 kinds of statistical techniques Concerning means of populations t-test ANOVA MANOVA Concerning covariance structure of populations correlation regression analysis factor analysis path analysis Means and covariance structures combined in SEM

How did it start? In 1884 Galton established his anthropometric laboratory and measured mental faculties and physical appearances of 9000 visitors. His research subject was: variation in the population. Galton believed most mental and physical features were inherited. He was worried that the protection of the weak (i.e., the poor) would interfere with the mechanisms of natural selection. Galton is the founder of eugenics.

Other important eugenicists Pearson follower of Galton, and inventor of the product-moment correlation coefficient Spearmanstudent of Wundt, and inventor of factor analysis, and the concept of general intelligence Fishermathematician, and inventor of: ANOVA, experimental designs, principle of maximum likelihood, inferential statistics, null-hypothesis testing, F-test, Fisher information, non-parametric statistics, et cetera, et cetera…

Mathematical statistics The statistical techniques used in the social sciences were developed to study heredity. Hence, they have two important features: a.heredity operates at level of population: same holds for these techniques b.biometrics is concerned with studying trait- like variables, not processes

What is the problem? Our standard techniques focus on characteristics of the population (means, correlations, proportions). BUT… results are not always generalizable to the individual. For instance: -if we find a beneficial effect of therapy at the group level, this does not guarantee that every individual improved -if we find a smooth change at the group level, it is possible that at the individual level there is a sudden change -if 20% of clients are cured after treatment, this does not imply that an individual has a 20% change of being cured

E.g., correlation words per minute words per minute mistakes interindividualintraindividual mistakes

Who makes this mistake? sociable shy Personality processes, by definition, involve some change in thoughts, feelings and actions of an individual; all these intra-individual changes seem to be mirrored by interindividual differences in characteristic ways of thinking, feeling and acting. McCrae & John (1992)

The same in formulas Let i be the subject index, and x and y be two variables. INTRAindividual correlation: INTERindividual correlation

Questions about processes Is the relationship at the INTRAindividual level identical to the relationship at the INTERindividual level? If not, is there an universal relationship? If not, can the differences between individuals with respect to their dynamics be related to other individual differences?

Outline i.Introduction ii.Time series analysis iii.Linear Kalman filter iv.Illustration 1 v.Regime-switching Kalman filter vi.Illustration 2 vii.Discussion

Dynamic system A DS is a set of equations that describe how the state of the system changes as a function of its previous state. Characteristics of a DS: -1 or more variables -s-state = values of the variables -s-stochastic/deterministic -d-discrete or continuous time -l-linear or nonlinear Time series analysis is a technique to study uni- or multivariate, stochastic systems in discrete time, which may be linear or nonlinear.

Autoregressive models ytyt y t-1 y t-2 y t+1 y t+2 a t-2 a t-1 atat a t+1 a t+2 y* t y* t-1 y* t-2 y* t+1 y* t+2 a* t-2 a* t-1 a* t a* t+1 a* t+2

Time series Unrelated series: first series contains autocorrelation second series is white noise Two related series: first contains positive autocorrelation second contains negative autocorrelation

Dynamic factor model A DFM relates multiple indicators to 1 or more latent variables (factor model). Because the variables are measured repeatedly (T>50), the dynamics can be modeled (i.e., the structure in the changes over time). Two ways of including lagged relationships: -l-lagged factor loadings -l-latent VARMA process

DFM with lagged factor loadings y t+1 ytyt ytyt ytyt ytyt y t-1 ftft f t+1 f t-1 y t-2 f t-2

DFM with latent VARMA process y t+1 ytyt ytyt ytyt ytyt y t-1 ftft f t+1 f t-1 y t-2 f t-2 a t-1 atat a t+1 a t-1

Outline i.Introduction ii.Time series analysis iii.Linear Kalman filter iv.Illustration 1 v.Regime-switching Kalman filter vi.Illustration 2 vii.Discussion

Kalman filter The Kalman filter is an algorithm for estimating the latent states, and for predicting time series models. It requires the model to be reformulated in state- space format, i.e.:

t = T ? Goal of Kalman filter Obtain estimates for the states a t (and predict future observations).

t = T ? Estimation of model parameters

Outline i.Introduction ii.Time series analysis iii.Linear Kalman filter iv.Illustration 1 v.Regime-switching Kalman filter vi.Illustration 2: nonlinear KF extension vii.Discussion

Daily measures of E & N Data: 90 repeated measures in 22 subjects of states associated with the Five Factor Model of personality. Extraversion items Neuroticism items total variance state variance trait variance

Results 1. Does every one have the same 2-factor structure? - 3 persons out of 22 not - only small groups with same factor loadings 2. Are there similarties in dynamics? NtNt N t-1 EtEt E t-1 a t-1 u t-1 atat utut + + - - + -

Outline i.Introduction ii.Time series analysis iii.Linear Kalman filter iv.Illustration 1 v.Regime-switching Kalman filter vi.Illustration 2 vii.Discussion

State-space model with regime-switching Regimes can be thought of as states that differ from each other with respect to their parameters. where S t is an unobserved discrete-valued Markov chain.

Markov-switching process Lets focus on a 2-regimes first-order Markov- switching process. Thus we have: S t = 1,2. For each regime there is a probability of staying in the same regime, and a probability of switching to the other regime.

KF with Markov-switching Because we do not know in which regime the process is at any occasion, we have to estimate all possibilities. Hence, we get 4 (M*M) predictions and 4 updates:

Collapsing the posteriors This implies that at each step we get an M-fold increase in cases (2,4,8,16,32,…). To overcome this problem, the M 2 updates are reduced to M updatesthrough: Hence, to collapse the M 2 posteriors in M posteriors, we need the probabilities Pr[S t-1 = i|S t = j, Y t ]. These are obtained with the Hamilton filter.

Hamilton filter of the probabilities

Outline i.Introduction ii.Time series analysis iii.Linear Kalman filter iv.Illustration 1 v.Regime-switching Kalman filter vi.Illustration 2 vii.Discussion

Positive and negative affect Daily measurements with palm handheld using the PANAS. Question: Are there distinct regimes in daily affect fluctuations? Positive affectNegative affect

Negative affect subject 10 Linear model: AIC: 108.52 BIC: 115.95 Two regime model: AIC: 72.32 BIC: 92.05

Negative affect subject 5 Linear model: AIC: 80.79 BIC: 88.35 Two regime model: AIC: 69.04 BIC: 89.21

Outline i.Introduction ii.Time series analysis iii.Linear Kalman filter iv.Illustration 1 v.Regime-switching Kalman filter vi.Illustration 2 vii.Discussion

Conclusion Today we looked at models for: -multiple indicators -multiple subjects -regime switching TSA allows us to model processes where they take place: at the level of the individual. There are different ways in which we can combine information obtained from multiple subjects.

Aint seen nothing yet! Other possibilities: -transition probabilities as functions of observed variables -smoothly changing parameters -deterministic trends and cycles (weekly, monthly) -difference scores -intervention analysis -change-point models -threshold models -ordinal data -include predictors (situational features) -include a partner (spouses, therapist-client, mother-child) -and much much more…

Thank you email: e.l.hamaker@uu.nl

Download ppt "Dynamic Factor Analysis Ellen L. Hamaker Methods and Statistics Faculty of Social Sciences Utrecht University The Netherlands."

Similar presentations