Measuring abstract concepts: Latent Variables and Factor Analysis.

Slides:



Advertisements
Similar presentations
Agenda Levels of measurement Measurement reliability Measurement validity Some examples Need for Cognition Horn-honking.
Advertisements

1 Factor Analysis Introduction to concept Reading = Individual differences by Colin Cooper.
Structural Equation Modeling
Confirmatory Factor Analysis
Surface normals and principal component analysis (PCA)
Factor Rotation & Factor Scores: Interpreting & Using Factors Well- & Ill-defined Factors Simple Structure Simple Structure & Factor Rotation Major Kinds.
Chapter Nineteen Factor Analysis.
Linear Regression and Binary Variables The independent variable does not necessarily need to be continuous. If the independent variable is binary (e.g.,
Principal Components Analysis with SAS Karl L. Wuensch Dept of Psychology East Carolina University.
Lecture 7: Principal component analysis (PCA)
Psychology 202b Advanced Psychological Statistics, II April 7, 2011.
Principal Components An Introduction Exploratory factoring Meaning & application of “principal components” Basic steps in a PC analysis PC extraction process.
Principal Components An Introduction exploratory factoring meaning & application of “principal components” Basic steps in a PC analysis PC extraction process.
Factor Analysis Research Methods and Statistics. Learning Outcomes At the end of this lecture and with additional reading you will be able to Describe.
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
CHAPTER 30 Structural Equation Modeling From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach,
Goals of Factor Analysis (1) (1)to reduce the number of variables and (2) to detect structure in the relationships between variables, that is to classify.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Relationships Among Variables
SECTION 12-1 Visual Displays of Data Slide
Measuring Social Life Ch. 5, pp
Separate multivariate observations
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Factor Analysis Psy 524 Ainsworth.
Principal Components An Introduction
Scales and Indices While trying to capture the complexity of a phenomenon We try to seek multiple indicators, regardless of the methodology we use: Qualitative.
Measuring the Unobservable
Principal Components Analysis BMTRY 726 3/27/14. Uses Goal: Explain the variability of a set of variables using a “small” set of linear combinations of.
Factor Analysis- Path Analysis (SEM Foundations).
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
1 Brief Review of Research Model / Hypothesis. 2 Research is Argument.
SEM: Basics Byrne Chapter 1 Tabachnick SEM
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Multidimensionality Multi-dimensionality & Factor analysis.
Correlation Analysis. A measure of association between two or more numerical variables. For examples height & weight relationship price and demand relationship.
Advanced Correlational Analyses D/RS 1013 Factor Analysis.
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
Factor Analysis Psy 524 Ainsworth. Assumptions Assumes reliable correlations Highly affected by missing data, outlying cases and truncated data Data screening.
Factor Analysis Revealing the correlational structure among variables Understanding & Reducing Complexity.
Confirmatory Factor Analysis Psych 818 DeShon. Construct Validity: MTMM ● Assessed via convergent and divergent evidence ● Convergent – Measures of the.
Appraisal and Its Application to Counseling COUN 550 Saint Joseph College For Class # 3 Copyright © 2005 by R. Halstead. All rights reserved.
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
Testing hypotheses Continuous variables. H H H H H L H L L L L L H H L H L H H L High Murder Low Murder Low Income 31 High Income 24 High Murder Low Murder.
CFA: Basics Beaujean Chapter 3. Other readings Kline 9 – a good reference, but lumps this entire section into one chapter.
Lecture 12 Factor Analysis.
Correlation & Regression Analysis
Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.
Module III Multivariate Analysis Techniques- Framework, Factor Analysis, Cluster Analysis and Conjoint Analysis Research Report.
Applied Quantitative Analysis and Practices
Education 795 Class Notes Factor Analysis Note set 6.
Exploratory Factor Analysis Principal Component Analysis Chapter 17.
Measuring abstract concepts: Latent Variables and Factor Analysis.
Factor Analysis I Principle Components Analysis. “Data Reduction” Purpose of factor analysis is to determine a minimum number of “factors” or components.
Advanced Statistics Factor Analysis, I. Introduction Factor analysis is a statistical technique about the relation between: (a)observed variables (X i.
Feature Extraction 主講人:虞台文. Content Principal Component Analysis (PCA) PCA Calculation — for Fewer-Sample Case Factor Analysis Fisher’s Linear Discriminant.
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Lecture 2 Survey Data Analysis Principal Component Analysis Factor Analysis Exemplified by SPSS Taylan Mavruk.
Principal Components & Common Factoring An Introduction
Dimension Reduction via PCA (Principal Component Analysis)
Descriptive Statistics vs. Factor Analysis
EPSY 5245 EPSY 5245 Michael C. Rodriguez
Product moment correlation
Confirmatory Factor Analysis
Lecture 8: Factor analysis (FA)
Rachael Bedford Mplus: Longitudinal Analysis Workshop 23/06/2015
Presentation transcript:

Measuring abstract concepts: Latent Variables and Factor Analysis

Correlation as the shape of an ellipse of plotted points o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o oo o High correlation (people’s arm & leg lengths?) Lower correlation (arm length & body weight?) No correlation (arm length & income?) Correlation shows how accurately you can predict the score on a second variable if you are told the first. It suggests that there may be some underlying connection: growth, for example.

Multiple dimensions We can show correlations among 3 variables (e.g. length of your arm & leg, and head circumference). If they are correlated, the diagram becomes an ellipsoid. It has a central axis running through it, forming a single summary indicator of the latent variable (size). Mathematically, we can also summarize correlations between more than 3 dimensions (but I can’t draw it) Head circum. arm leg

General measurement approach ϕψsocial Health Conceptual model Selection of indicators (sampling) Scoring system e.g. ϕ x 2 +Ψ x soc.

Possible hierarchies ϕψsocial ϕψ Health In a multi-level construct we need to specify how the different levels relate to each other. This comes entirely from a conceptual approach: there is no empirical way to assert one or the other model. ??

Indicators Latent trait Indicators inter-correlate because they all reflect the latent trait Indicators correlate because they reflect a common (latent) trait

Modeling the link between manifest (measured) and latent (inferred) variables Income Expenditure Indicator scores Health(Probability model) For variables like income & expenditure we can give a relatively fixed model For health there is more variation between people, so a less precise model.

Principal Components analysis Translates a complex system of correlations between many variables into fewer underlying dimensions (or ‘principal components’). Developed by Charles Spearman in 1904 to identify a simpler underlying structure in large matrices of correlations between measures of mental abilities. Later greatly misused in ‘defining’ intelligence.

Common variance (what we are trying to measure) Unique variance in this item (irrelevant bias in the measurement) Spearman’s 1904 core idea: each item contains some common (shared) variance plus some specific variance. The latter (red circles) sometimes raises and sometimes lowers the score, so they cancel out if you have enough items. +-+-

One principal component Red lines show scores on 8 tests as vectors Cosine of angles between them represent correlations: if 2 vectors overlap the correlation is perfect (Cosine 0° = 1.0) Principal component 1 resolves most of the variance in the 8 measures: it’s the best fit, or grand average. 1

Dimensionality & Rotation. The principal component is that which accounts for the most variance; this depends on the conceptual shape of the latent trait being measured. For Chile, one dimension will account for most of the variance in distance between cities; for HK a more complex model is required. To find the dominant dimension with the maximal variation, axes need to be rotated.

Variance ‘explained’ Here 2 vectors, B & C, are only partially correlated. Resolving power of the principal component is shown by comparing length of the vector (B or C) and its projection onto the axis (Bʹ, C ʹ) Here, axis 1 ‘explains’ more variance for B than for C (Bʹ > C ʹ) A second (horizontal) component may be required for C: axis 2 resolves much of the variance in C, but very little for B. Principal axis B Bʹ C Cʹ Second axis

Thurstone’s 1930 multi-factor idea: each item contains some common variance plus several types of unique variance. The latter (colored circles) can compose an additional factor being measured, or just random ‘error’ Common variance (what we are trying to measure) Unique variance in this item (irrelevant bias in the measurement) Second theme in the measurement

Factor Loadings and Validity In the second example, the latent variable is more strongly reflected in the item; it has a higher loading on the variable and is a purer indicator of the underlying variable. The blue rectangle represents the contribution of the latent variable to the indicator. The green segment represents the contribution of other latent variables; the red section shows all other sources of variance (error, etc).

Example of a two-factor solution (here related to concepts in the Health Belief Model) Source: K.S. Lewis, PhD thesis “An examination of the Health Belief Model when applied to Diabetes Mellitus” University of Sheffield, 1994.

Solution with rotated axes 1 Anxiety items Depression items Using factor 1 alone = general mental health factor? 1 Using 2 factors clarifies different groups, but neither explains substantial variance 2 Anxiety factor Depression factor

To rotate or not to rotate? Dimensions are traditionally shown perpendicular to each other: independent & uncorrelated (measures of different things should not be confounded). Applied to example of anxiety & depression there are various options: 1.as they are both are both facets of mental distress, they could be summarized along a single factor 2.perhaps it is diagnostically useful to keep anxiety & depression conceptually distinct: 2 orthogonal factors. If so, our indicators are not terrible good (low variance explained) 3.anxiety & depression often co-occur, so in reality are correlated; the axes could therefore be rotated obliquely to resolve the maximum variance (next slide)

Oblique rotation Allow the axes to correlate Resolves more variance But does not create conceptually independent entities Do you like this approach? Anxiety factor Depression factor

An example of turning principal components analysis results into linear modeling (LISREL), The Health Belief Model. Source: Cao Z-J, Chen Y, Wang S-M. BMC Public Health 2014, 14:26

Cautions to ponder… Correlations between measures do not prove that they record anything concrete. Test scores may or may not result from (or be caused by) the underlying factor. The principal component is a mathematical abstraction; it may not represent anything real – (correlate your age for successive years with the population of Mexico, the weight of your pet turtle, the price of cheese and the distance between any 2 galaxies: this will produce a strong principal component). – Rotating the axes causes the principal component to disappear, so it has no reality We cannot declare that a factor represents an underlying reality (intelligence or health, etc.) unless we have clear evidence from other sources.

Questions to debate Would you use a 1- or a 2-factor solution for anxiety & depression questions? – What sort of rotation? What type of evidence could demonstrate that your presumed health measures really do measure health? Should we ever use oblique rotation?