Creating Assessments AKA how to write a test



Creating Assessments All good assessments have three key features: –Validity –Reliability –Usability

Reliability Next to validity, reliability is the most important characteristic of assessment results. Why? 1. It provides the consistency to make validity possible. 2. It indicates the degree to which various kinds of generalizations are justifiable.

Reliability re·li·a·ble adj. Capable of being relied on; dependable. re·li·a·bil·i·ty or re·li·a·ble·ness n. re·li·a·bly adv. (American Heritage Dictionary)

Reliability Reliability: the consistency of measurement, i.e. how consistent test scores or other assessment results are from one measurement to another.

Reliability Which is more reliable?

Reliability Classical Test Theory: X = T + e, where: X = observed score; T = “true score”; e = error

Reliability X = observed score: The score the student receives on the exam. T = “true score”: What the student “really” knows.
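The model X = T + e can be explored with a small simulation. The sketch below is illustrative only: the score distribution, the standard deviations, and the function names are assumptions, not part of the slides. It relies on the standard classical test theory result that, when error is purely random, reliability equals true-score variance divided by observed-score variance, and it checks that the correlation between two administrations of the same test lands near that ratio.

```python
import random
import statistics


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of scores."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def simulate_test_retest(n_students=5000, true_sd=10.0, error_sd=5.0, seed=0):
    """Give the same simulated students two tests; return the correlation."""
    rng = random.Random(seed)
    # T: each student's true score (hypothetical mean of 70).
    true_scores = [rng.gauss(70.0, true_sd) for _ in range(n_students)]
    # X = T + e, with fresh random error on each administration.
    first = [t + rng.gauss(0.0, error_sd) for t in true_scores]
    second = [t + rng.gauss(0.0, error_sd) for t in true_scores]
    return pearson(first, second)


# Theoretical reliability: var(T) / (var(T) + var(e)) = 100 / 125 = 0.8
expected_reliability = 10.0 ** 2 / (10.0 ** 2 + 5.0 ** 2)
observed_reliability = simulate_test_retest()
```

With these assumed settings the simulated test-retest correlation should sit close to 0.8; shrinking `error_sd` pushes it toward 1.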

Reliability e = error Error variance is the variability that exists in a set of scores and is due to factors other than the one being assessed. –Systematic: errors that are consistent. –Random: errors that have no pattern.

Reliability e = error Positive error (i.e. raises score): –Lucky guesses. –Items that give clues to the answer. –Cheating (students, aides, teachers).

Reliability e = error Negative error (i.e. lowers score): –Not following directions. –Mismarking items. –Room climate/atmosphere. –Hunger, fatigue, illness, “need to go potty”. –Assemblies, ball games, fire drills, etc. –Break-up of a relationship.
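The difference between systematic and random error described above can be seen by averaging many repeated measurements of one student. This is a minimal sketch under assumed values (a true score of 80, a constant bias of -4 points, normally distributed random error); the function name and all numbers are hypothetical. Random error tends to cancel out in the average, while a consistent (systematic) error does not.

```python
import random
import statistics


def mean_observed(true_score, bias, error_sd, n_repeats, seed=0):
    """Average of repeated observed scores X = T + bias + random error."""
    rng = random.Random(seed)
    scores = [
        true_score + bias + rng.gauss(0.0, error_sd)
        for _ in range(n_repeats)
    ]
    return statistics.fmean(scores)


# Random error only: the average drifts back toward the true score.
random_only = mean_observed(80.0, bias=0.0, error_sd=5.0, n_repeats=10000)

# Systematic error (e.g. a room that is always too hot): the average
# stays shifted by the bias no matter how many times we measure.
with_bias = mean_observed(80.0, bias=-4.0, error_sd=5.0, n_repeats=10000)
```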

Circle the figures that are half shaded.

Reliability Determining Reliability: –Test-retest method –Equivalent forms –Split-half method –KR-20 method –Interrater reliability –Intrarater reliability
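Two of the methods listed above, KR-20 and split-half, can be computed directly from a table of right/wrong item scores. The sketch below is one possible way to do it, not a prescribed procedure: the six-student, four-item data set is made up, the function names are hypothetical, the total-score variance uses the population formula (some texts divide by n - 1 instead), and the split-half estimate is stepped up with the Spearman-Brown formula. It does not guard against zero variance.

```python
import statistics


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of scores."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def kr20(responses):
    """KR-20 for rows of 0/1 item scores (one row per student)."""
    n_students = len(responses)
    n_items = len(responses[0])
    totals = [sum(row) for row in responses]
    total_var = statistics.pvariance(totals)  # population variance (assumed)
    pq_sum = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in responses) / n_students  # proportion correct
        pq_sum += p * (1.0 - p)
    return (n_items / (n_items - 1)) * (1.0 - pq_sum / total_var)


def split_half(responses):
    """Odd/even split-half reliability with Spearman-Brown correction."""
    odd = [sum(row[0::2]) for row in responses]   # items 1, 3, ...
    even = [sum(row[1::2]) for row in responses]  # items 2, 4, ...
    r = pearson(odd, even)
    return 2.0 * r / (1.0 + r)


# Made-up data: 1 = correct, 0 = incorrect.
scores = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
]
kr20_value = kr20(scores)
split_half_value = split_half(scores)
```

Both values fall between 0 and 1 for this data set; real classroom tests have far more students and items, so these numbers only demonstrate the arithmetic.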

Reliability Standard Error of Measurement (SEM) = the estimated amount by which an observed score is expected to vary from the true score because of measurement error.

Reliability Example: If Sara scored 78 on a standardized test with an SEM of 6, we can be: –68% certain her true score is between 72 and 84 –95% certain her true score is between 66 and 90 –99% certain her true score is between 60 and 96
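The bands in Sara's example come from adding and subtracting 1, 2, and 3 SEMs around the observed score. The sketch below reproduces that arithmetic; the function names are hypothetical. It also includes the commonly used relation SEM = SD × sqrt(1 - reliability), which the slides do not state, with an assumed standard deviation of 15 and reliability of 0.84 chosen only because they happen to give an SEM of 6.

```python
import math


def sem_from_reliability(score_sd, reliability):
    """SEM from the test's score SD and its reliability coefficient."""
    return score_sd * math.sqrt(1.0 - reliability)


def score_bands(observed, sem):
    """Approximate 68/95/99% bands: observed score +/- 1, 2, 3 SEMs."""
    return {
        level: (observed - k * sem, observed + k * sem)
        for level, k in ((68, 1), (95, 2), (99, 3))
    }


# Sara's example from the slide: score of 78, SEM of 6.
bands = score_bands(78, 6)

# Assumed SD and reliability, for illustration only.
example_sem = sem_from_reliability(15.0, 0.84)
```

Here `bands[68]` is `(72, 84)`, `bands[95]` is `(66, 90)`, and `bands[99]` is `(60, 96)`, matching the slide.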

Reliability Summation of Reliability: 1. Reliability refers to the results and not to the instrument itself. 2. Reliability is a necessary but not sufficient condition for validity. 3. The more reliable the assessment, the better.

Usability The practical aspects of a test cannot be neglected: –Ease of administration –Time (administration, scoring) –Ease of interpretation –Availability of equivalent forms –Cost