Measurement in Psychology I: RELIABILITY Lawrence R. Gordon.


Do you support the civil union legislation?
- What are some of the ways in which you can ask this question?
- How do you measure the response (operational definitions)?

Levels of Measurement
- Nominal scales
  - giving names to data, putting into categories
  - Examples: sex, race labels; baseball uniform numbers
- Ordinal scales
  - numbers give order but not distance
  - Examples: mailbox numbers; class rankings

Levels of Measurement (cont.)
- Interval scales
  - numbers indicate order and distance (they are separated by equal distances or intervals)
  - Example: Fahrenheit temperature
- Ratio scales
  - numbers indicate order, distance, AND have a true zero point (zero = there isn’t any)
  - Examples: height; weight; miles per hour; time
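The interval vs. ratio distinction can be checked with a little arithmetic. The sketch below is only an illustration: the temperatures and heights are made-up values, and the helper names are assumptions, not part of the lecture. It shows that a ratio of two Fahrenheit readings changes when the same temperatures are expressed in Celsius (no true zero), while a ratio of two heights stays the same across units (true zero).

```python
def f_to_c(deg_f):
    """Convert degrees Fahrenheit to degrees Celsius."""
    return (deg_f - 32.0) * 5.0 / 9.0


def inches_to_cm(inches):
    """Convert inches to centimeters."""
    return inches * 2.54


# Interval scale: hypothetical temperatures. 80 F looks like "twice" 40 F,
# but the ratio is different once the same temperatures are in Celsius.
f_low, f_high = 40.0, 80.0
ratio_f = f_high / f_low
ratio_c = f_to_c(f_high) / f_to_c(f_low)

# Ratio scale: hypothetical heights. The ratio is the same in any unit
# because zero means "there isn't any" in both units.
h_short, h_tall = 30.0, 60.0
ratio_in = h_tall / h_short
ratio_cm = inches_to_cm(h_tall) / inches_to_cm(h_short)

print("Temperature ratio, Fahrenheit:", ratio_f)
print("Temperature ratio, Celsius:   ", round(ratio_c, 3))
print("Height ratio, inches:         ", ratio_in)
print("Height ratio, centimeters:    ", round(ratio_cm, 3))
```

Statements such as "twice as much" are therefore only safe on a ratio scale.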

Levels of Measurement Example Auto race which started at 2 pm

Closed vs. Open Responses
- Closed responses (a.k.a. forced choice)
  - Examples (rate civil union support on a scale 1 to 9)
  - Advantages
    - you know what the responses will be (or what they should be!) because of restrictions on choice
    - easy to empirically evaluate (relatively)
    - gives data that answer the question in a straightforward way, in the form you asked it
    - coding not necessary, usually

Closed vs. Open Responses
- Closed responses (a.k.a. forced choice)
  - Disadvantages
    - may not be sensitive enough to get some interesting information
    - will not give you as clear an indication of what participants think/feel/report
- “Do you agree that same-sex couples should have the right to marry/civil union?”
  Disagree Agree Completely

Support Civil Union (histogram)

Support Civil Union (area graph)

Compare the Graphs: Same Info
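Both graphs plot the same frequency counts of the closed 1-to-9 ratings. A minimal sketch of that tally follows; the ratings are hypothetical (they are not the class data), and the text histogram is just one way to display the counts.

```python
from collections import Counter

# Hypothetical ratings on the 1-to-9 civil union support scale.
ratings = [9, 7, 9, 1, 5, 8, 9, 3, 7, 9, 5, 1]

# Count how many respondents chose each scale point, including
# scale points nobody chose (count 0).
counts = Counter(ratings)
scale_points = list(range(1, 10))
freq = [counts.get(value, 0) for value in scale_points]

# A text histogram: one '#' per respondent at each scale point.
for value, n in zip(scale_points, freq):
    print(f"{value}: {'#' * n}")
```

No coding step is needed here, which is the advantage of closed responses noted earlier.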

Closed vs. Open Responses
- Open responses (a.k.a. free response)
  - Examples (Do you support the civil union legislation? Why?)
- Example from the survey used the first day?
  - “Please describe yourself in 12 words or less” (more on this in a bit...)
- Advantages
  - gives any answer participant wants
  - not restricted by choices

Closed vs. Open Responses
- Open responses (cont.)
  - Disadvantages
    - have to code to empirically evaluate (time intensive, need to find people who will do it)
    - reliability issues!

Reliability
- Consistency (stays the same)
- Repeatable (get the same results again and again)
- Measures need to be reliable to be good measures
- Now, some nitty-gritty...

Reliability (cont.)
- Measuring closed responses
  - you don’t need to put things into categories
  - reliable over time (do you get the same answers again and again?)
  - if the answers vary greatly from one time of measurement to the next, the measurement is not reliable

Reliability (cont.)
- Measuring closed responses (cont.)
  - scales (sets of questions designed to measure something) need to be given multiple times, or in multiple forms, and the answers must remain similar for the scale to be reliable
  - Example (personality scale?)
- Types of reliability
  - Stability (“test-retest reliability”)
  - Equivalence (“parallel forms reliability”)
  - Consistency (“split-half reliability”)
  - Homogeneity (“internal consistency reliability”)
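Stability is usually summarized as a correlation between two administrations of the same measure. The sketch below computes a Pearson correlation by hand for eight examinees; the scores are hypothetical, and `pearson_r`, `time1`, and `time2` are names introduced here for illustration only.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of the same length (at least 2 scores)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squares around the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical scores for the same eight people on the same test,
# given on two different dates.
time1 = [42, 35, 48, 29, 38, 44, 31, 40]
time2 = [40, 37, 47, 30, 36, 45, 33, 41]

r = pearson_r(time1, time2)
print("Test-retest r:", round(r, 2))
```

The same function applies to parallel forms: correlate Form A scores with Form B scores instead of time 1 with time 2.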

Reliability Quick Example
Any test, scale, inventory with items. E.g., a 50-item test, scored 0-50.
Data columns: Examinee; Form A, 9/4; Form A, 9/25; Form B, 9/4; Form A 9/4, Odd items; Form A 9/4, Even items
Examinees: George, Alice, Mary, Larry, Linda, Doug, Chuck, Judy
- Test-retest: Form A, 9/4 vs 9/25 (r = .92): Stability
- Parallel forms: Form A vs Form B, 9/4 (r = .69): Equivalence
- Cross form: Form A 9/25 vs Form B 3/19 (r = .72): Stability & Equivalence
- Split-half: Odd vs Even, Form A 9/4 (r = .79): Consistency
- Alpha reliability: no example, needs data from all 50 items: Internal consistency
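The split-half and alpha entries both need item-level data. As a rough sketch, the code below uses a small hypothetical matrix (six examinees by six right/wrong items, not the 50-item test from the slide) to compute an odd-vs-even split-half correlation and Cronbach's alpha with the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). All function and variable names are assumptions made for this illustration.

```python
from math import sqrt
from statistics import variance


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def split_half_r(item_scores):
    """Correlate each examinee's odd-item total with their even-item total."""
    odd_totals = [sum(row[0::2]) for row in item_scores]   # items 1, 3, 5, ...
    even_totals = [sum(row[1::2]) for row in item_scores]  # items 2, 4, 6, ...
    return pearson_r(odd_totals, even_totals)


def cronbach_alpha(item_scores):
    """Cronbach's alpha from a matrix with one row per examinee."""
    k = len(item_scores[0])                                  # number of items
    item_vars = [variance(col) for col in zip(*item_scores)]
    total_var = variance([sum(row) for row in item_scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical data: 1 = item correct, 0 = item incorrect.
scores = [
    [1, 1, 1, 1, 1, 0],
    [1, 0, 1, 1, 0, 0],
    [1, 1, 1, 0, 1, 1],
    [0, 0, 1, 0, 0, 0],
    [1, 1, 0, 1, 1, 1],
    [0, 0, 0, 0, 1, 0],
]

half_r = split_half_r(scores)
alpha = cronbach_alpha(scores)
print("Split-half r:", round(half_r, 2))
print("Cronbach's alpha:", round(alpha, 2))
```

Split-half uses one particular division of the items, whereas alpha draws on all items at once, which is why the slide lists it under internal consistency.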

Reliability (cont.)
- Measuring open responses
  - Will often code into categories (Examples)
  - How do you assess reliability?

Reliability (cont.)
- Measuring open responses (cont.)
  - Does everyone put the response into the same category? If yes, you have good inter-coder reliability
  - more specific operational definitions will increase this reliability
- Coding personality responses into categories
  - Using positive, negative, and neutral descriptors

Reliability (cont.)
- Measuring behavioral responses through observation
  - special cases of open response, can’t really control what participants do
  - coding and/or rating what you observe
  - reliability of ratings (interrater reliability? If all raters agree on the rating, then yes.)
  - need to be very clear on operational definitions
- Baggage claim study (Scherer & Ceschi, 2000)

Assessing Reliability
- Steps
  - decide on operational definitions of your variables and scale(s) of measurement
  - train your coders/raters, answer questions, and alleviate confusion
  - do the coding and rating
  - compare responses
  - were the measurements reliable?

Reliability Exercise
- Measuring your personality
- Looking for “big” traits
  - defining big traits and training coders
  - The Big Five Personality Factors
    1. Open to Experience (O) vs. Closed to Experience (NO)
    2. Conscientious (C) vs. Nonconscientious (NC)
    3. Extraverted (E) vs. Introverted (NE)
    4. Agreeable (A) vs. Unagreeable (NA)
    5. Neurotic (N) vs. Nonneurotic (NN)
- Which one best fits the description?
- Do the coding!

Reliability Exercise
- Measuring your personality
- Looking for “big” traits
- compare responses to other coders
  - intercoder reliability
  - List number on which you agreed
  - List number on which you disagreed
  - Calculate the percentages
- were the measurements reliable?
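The agree/disagree tally in this exercise is simple percent agreement. A minimal sketch, assuming two coders who each assigned one Big Five code to the same ten descriptions (the codes below are invented for illustration, and `percent_agreement` is a hypothetical helper name):

```python
def percent_agreement(codes_a, codes_b):
    """Return (number agreed, number disagreed, percent agreed) for two coders."""
    if len(codes_a) != len(codes_b) or not codes_a:
        raise ValueError("both coders must code the same non-empty set of responses")
    agreed = sum(a == b for a, b in zip(codes_a, codes_b))
    disagreed = len(codes_a) - agreed
    return agreed, disagreed, 100.0 * agreed / len(codes_a)


# Hypothetical codes for ten self-descriptions, using the labels
# from the exercise (O, NO, C, NC, E, NE, A, NA, N, NN).
coder_1 = ["O", "C", "E", "A", "N", "NE", "C", "E", "NA", "O"]
coder_2 = ["O", "C", "E", "NA", "N", "NE", "C", "NE", "NA", "O"]

agreed, disagreed, pct = percent_agreement(coder_1, coder_2)
print(f"Agreed on {agreed}, disagreed on {disagreed}: {pct:.0f}% agreement")
```

With these made-up codes the two coders agree on 8 of 10 descriptions, or 80%.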

And for next time… is reliability enough?
- If your measurement is reliable, does that mean that it is good?
- Does being reliable make your measurement valid?