A quick introduction to the analysis of questionnaire data John Richardson.



Measurement scales
nominal: categorisation (=)
ordinal: rank ordering (>, =, <)
interval: equal intervals (>, =, <, +, –)
ratio: absolute zero (>, =, <, +, –, ×, ÷)

Frequency distributions A frequency distribution shows all the possible scores in a distribution and how often each score was obtained. A bar graph shows the frequency distribution of a set of scores where the scores are arranged on the x axis and their frequencies are shown on the y axis. A histogram shows the frequency of a set of scores measured on an interval or ratio scale. The bars correspond to successive intervals on the scale. (A histogram is a bar graph in which the bars are touching each other.)
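
As a minimal sketch of the definition above, the following Python snippet tabulates a frequency distribution, including possible scores that nobody chose, and prints a crude text bar graph. The response data and the 1 to 5 scale are hypothetical, chosen only for illustration.

```python
from collections import Counter

def frequency_distribution(scores, possible_scores):
    """Map every possible score to the number of times it was obtained."""
    counts = Counter(scores)  # missing scores count as 0
    return {s: counts[s] for s in possible_scores}

# Hypothetical responses to a single 5-point questionnaire item.
responses = [4, 5, 3, 4, 2, 4, 5, 3, 4, 2]
distribution = frequency_distribution(responses, range(1, 6))

for score, freq in distribution.items():
    # One '#' per response: scores down the page, frequencies as bar length.
    print(f"{score}: {'#' * freq} ({freq})")
```

Passing the full set of possible scores keeps the zero-frequency categories visible, which matters when the bars are later compared across items.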

Measures of central tendency The (arithmetic) mean: the sum of the scores in a distribution divided by the number of scores (ΣX / N). The median: the point on the scale below which 50% of the scores in a distribution fall. If the number of scores is odd, the median is the middle score when the scores are ranked. If the number of scores is even, the median is the average of the two middle scores when the scores are ranked. The mode: the most frequent score in a distribution.

Measures of central tendency, ctd. The mean assumes an interval or ratio scale. The median assumes an ordinal, interval or ratio scale. The mode assumes a nominal, ordinal, interval or ratio scale.
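
The three measures can be illustrated with Python's standard statistics module. This is only a sketch on a hypothetical set of six scores; with an even number of scores the median is the average of the two middle values, as described above.

```python
import statistics

# Hypothetical scores, already measured on an interval scale.
scores = [2, 3, 3, 4, 5, 7]

mean = statistics.mean(scores)      # sum of scores / number of scores
median = statistics.median(scores)  # average of the two middle scores (3 and 4)
mode = statistics.mode(scores)      # most frequent score

print(mean, median, mode)
```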

Measures of variability The range is the difference between the highest and lowest scores in a distribution. A deviation score is the difference between the original score and the mean of the entire distribution. The variance of a set of scores is the average squared deviation score. The standard deviation of a set of scores is the square root of the variance.
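
A short Python sketch of these definitions follows, using hypothetical scores. It divides by the number of scores, which matches the "average squared deviation" wording above; note that many statistics packages divide by N – 1 instead when estimating a population variance from a sample.

```python
import math

def variability(scores):
    """Range, variance and standard deviation as defined in the text."""
    n = len(scores)
    mean = sum(scores) / n
    deviations = [x - mean for x in scores]        # deviation scores
    variance = sum(d * d for d in deviations) / n  # average squared deviation
    return {
        "range": max(scores) - min(scores),
        "variance": variance,
        "sd": math.sqrt(variance),
    }

# Hypothetical scores with a mean of 5.
result = variability([2, 4, 4, 4, 5, 5, 7, 9])
print(result)
```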

Correlation A linear relationship between two variables is one that can be most accurately represented by a straight line. A perfect relationship is one in which all of the points fall on the line. An imperfect relationship is one where a relationship exists but not all of the points fall on the line. A positive relationship exists when there is a direct relationship between the two variables (high scores on one tend to go with high scores on the other). A negative relationship exists when there is an inverse relationship between the two variables (high scores on one tend to go with low scores on the other).

Correlation, ctd. A correlation coefficient expresses quantitatively the magnitude and direction of a relationship:
+1: a perfect positive relationship
between 0 and +1: an imperfect positive relationship
0: no relationship
between –1 and 0: an imperfect negative relationship
–1: a perfect negative relationship

Correlation, ctd. The linear correlation coefficient Pearson r is a measure of the extent to which pairs of scores occupy the same (or opposite) positions within their respective distributions. The square of Pearson r quantifies the proportion of the total variability in one of the variables that is accounted for by the other variable. [If r = 0.80, r² = 0.64, so y explains 64% of the variability in x.]
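
The calculation can be sketched in a few lines of Python. The function name pearson_r and the paired scores are hypothetical; the formula is the usual one, the sum of cross-products of deviation scores divided by the square root of the product of the two sums of squares.

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sum_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sum_xx = sum((a - mean_x) ** 2 for a in x)
    sum_yy = sum((b - mean_y) ** 2 for b in y)
    return sum_xy / math.sqrt(sum_xx * sum_yy)

# Hypothetical paired scores from five participants.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

r = pearson_r(x, y)
r_squared = r ** 2  # proportion of variability accounted for
print(round(r, 3), round(r_squared, 3))
```

For these data r is about 0.77 and r² is 0.60, so 60% of the variability in one variable is accounted for by the other.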

Correlation, ctd. Pearson r assumes that the data are measured on an interval or ratio scale. For ordinal scales, use the Spearman rank order correlation coefficient rho (r_s). For nominal scales with two categories each, use the phi (φ) coefficient. Finally, note that correlation does not imply causation.
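
One way to see what Spearman's rho does is to compute it as a Pearson correlation on the ranks of the scores. The sketch below assumes that tied scores receive the average of the ranks they span; the helper names and the data are hypothetical.

```python
import math

def average_ranks(values):
    """Rank values from 1 upwards, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def spearman_rho(x, y):
    """Spearman's rho: Pearson r computed on the ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Hypothetical ordinal ratings from five participants.
a = [3, 1, 4, 2, 5]
b = [2, 1, 5, 3, 4]
rho = spearman_rho(a, b)
print(round(rho, 3))
```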

Reliability A research instrument is reliable if it yields consistent results when used repeatedly under the same conditions with the same participants (that is, it is relatively unaffected by errors of measurement). It can be measured by various coefficients of reliability, all of which vary between zero (reflecting total unreliability) and one (reflecting perfect reliability). (In practice, instruments of poor reliability may actually yield estimates that are less than zero.)

Reliability, ctd. Test-retest reliability is obtained by calculating the correlation coefficient between the scores obtained by the same individuals on successive administrations of the same instrument. If the interval is too short, the participants will become familiar with the instrument and may even recall the responses that they gave at the first administration. If the interval is too long, there may be genuine changes in the personal qualities being measured. In any case, longitudinal studies are hard to carry out because of drop-out between the two administrations.

Reliability, ctd. An alternative approach is to estimate an instrument’s reliability by examining the consistency among the scores obtained on its constituent parts at a single administration. One such measure is split-half reliability: the items are divided into two subsets, and a correlation coefficient is calculated between the scores obtained on the two halves.
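
As an illustration, the sketch below splits a hypothetical four-item instrument into odd-numbered and even-numbered items and correlates the two half scores. The odd/even split is only one of several possible ways of dividing the items, and the response matrix is invented for the example.

```python
import math

def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def split_half_reliability(responses):
    """Correlate scores on odd-numbered items with scores on even-numbered items."""
    odd_half = [sum(row[0::2]) for row in responses]   # items 1, 3, ...
    even_half = [sum(row[1::2]) for row in responses]  # items 2, 4, ...
    return pearson_r(odd_half, even_half)

# Hypothetical data: one row per participant, one column per item.
responses = [
    [4, 5, 4, 5],
    [2, 3, 2, 2],
    [3, 3, 4, 3],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
]
split_half = split_half_reliability(responses)
print(round(split_half, 3))
```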

Reliability, ctd. The most common measure of reliability is Cronbach’s coefficient alpha. This estimates the internal consistency of an instrument by comparing the variance of the total scores with the variances of the scores on the individual items. (It is formally equivalent to the average value of split-half reliability across all the possible ways of dividing the items into two distinct subsets.)
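
The comparison of variances can be sketched with the standard formula alpha = (k / (k – 1)) × (1 – sum of item variances / variance of total scores), where k is the number of items. The function name and the response matrix below are hypothetical.

```python
import statistics

def cronbach_alpha(responses):
    """Coefficient alpha from a participants-by-items matrix of scores."""
    k = len(responses[0])                       # number of items
    items = list(zip(*responses))               # one tuple of scores per item
    item_variances = [statistics.variance(item) for item in items]
    totals = [sum(row) for row in responses]    # total score per participant
    total_variance = statistics.variance(totals)
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)

# Hypothetical data: one row per participant, one column per item.
responses = [
    [4, 5, 4, 5],
    [2, 3, 2, 2],
    [3, 3, 4, 3],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

Because alpha depends only on the ratio of the two variances, it makes no difference whether they are computed with N or N – 1 in the denominator, as long as the same choice is used for both.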

Factor analysis Factor analysis is a technique for identifying a small number of underlying dimensions from a large number of variables measured on the same participants. Principal component analysis assigns the variance associated with the original variables to the same number of independent dimensions or components. It is based on the original correlation matrix among the variables. However, whereas the diagonal elements of this matrix have a value of 1.00 by definition, the off-diagonal elements are reduced by test-retest unreliability.

Factor analysis, ctd. The various forms of common factor analysis are only concerned with the variance that is common to two or more of the variables. They use an amended correlation matrix in which the diagonal elements are replaced by estimates of the communality of the corresponding variables, and so they acknowledge that the other elements in the matrix are reduced by test-retest unreliability. The most commonly used form of common factor analysis is called principal axis factoring in SPSS.

Factor analysis, ctd. The next problem is to determine the number of factors or components to be extracted. The eigenvalues express the amount of variance accounted for by each factor; in a principal component analysis of a correlation matrix, dividing an eigenvalue by the number of variables gives the proportion of the total variance. One commonly used rule of thumb is that of extracting the number of factors whose eigenvalues are greater than one in a principal component analysis. This is often inaccurate when tested on artificially generated data. With large numbers of variables, the eigenvalues-one rule tends to overestimate the true number of factors.
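
The eigenvalues-one rule can be illustrated with NumPy, assuming it is available. The correlation matrix below is hypothetical: four items that form two uncorrelated pairs. The sketch also prints the drops between successive eigenvalues, which is the information that is inspected when several criteria are compared.

```python
import numpy as np

# Hypothetical correlation matrix for four questionnaire items:
# items 1 and 2 correlate 0.6, items 3 and 4 correlate 0.5.
R = np.array([
    [1.0, 0.6, 0.0, 0.0],
    [0.6, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.5],
    [0.0, 0.0, 0.5, 1.0],
])

# eigvalsh handles symmetric matrices and returns ascending eigenvalues.
eigenvalues = np.sort(np.linalg.eigvalsh(R))[::-1]

# Each eigenvalue divided by the number of variables is a proportion of variance.
proportions = eigenvalues / R.shape[0]

# Eigenvalues-one rule: count the components with eigenvalue > 1.
n_factors = int(np.sum(eigenvalues > 1.0))

# Drops between successive eigenvalues, largest component first.
drops = -np.diff(eigenvalues)

print(eigenvalues, proportions, n_factors, drops)
```

Here two eigenvalues exceed one (1.6 and 1.5), and the large drop after the second eigenvalue points to the same two-component solution.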

Factor analysis, ctd. An alternative procedure works by extracting factors up to the point where the difference between the successive eigenvalues reflects a relatively constant increment attributable to random error. This rule is known as the scree test, and it is more accurate than the eigenvalues-one rule when used with artificially generated sample data. In general, at least two different criteria should be used to justify extracting a particular number of factors.

Factor analysis, ctd. The extracted factors are then usually rotated to yield a more interpretable solution. Rotation tries to maximise the number of variables that show high or low correlations (or loadings) with each factor and to minimise the number of variables with moderate loadings. Orthogonal rotation results in factors that are independent of one another. This may make them easier to interpret.

Factor analysis, ctd. Oblique rotation results in factors that may be correlated with one another. This may be more plausible if the various dimensions result from overlapping sets of mental processes. If a factor analysis results in a number of oblique factors, then one can calculate the participants’ scores on those factors and subject them to a further (second-order) factor analysis.
