Presentation on theme: "AN OVERVIEW OF THE FAMILY OF RASCH MODELS Elena Kardanova"— Presentation transcript:
1 AN OVERVIEW OF THE FAMILY OF RASCH MODELS Elena Kardanova University of OstravaCzech republic26-31, March, 20122
2 The family of Rasch measurement models is a way to make sense of the world. Benjamin D. Wright
3 Advantages of Rasch Models The simplest models that provide parameter invarianceInclude minimal number of parametersParameters have simple interpretation, can be easily estimated (on the interval scale with estimate of precision)Can be applied to all item types which use in educational and psychological testsTheory of item and examinee analysis is well developedAll specific testing problems can be easily solved
4 Family of Rasch models: Dichotomous Rasch ModelPartial Credit ModelRating Scale ModelBinomial and Poisson ModelsMany-Facet Rasch ModelMultidimensional Rasch Model
5 Criteria for model choice: The number of response categories: two vs. more than twoThe structure of response alternatives in polytomous items: common vs. individualThe number of attempts to an item: one attempt vs. more than oneThe number of examinee parameters: one ability vs. more than oneThe number of factors influencing the examinee performance: only item difficulty vs. plus additional factors
7 SoftwareWinsteps (Dichotomous Model, PCM, RSM, Binomial and Poisson Models)ConQuest (all models except Binomial and Poisson Models)Facets (Many-Facet Model)Other IRT software (depends on the software)
8 Dichotomous Rasch Model P(Xni =1/ θn, δi) is the probability that an examinee n (n=1,…,N) with ability θn answers item i (i=1,…,I) with difficulty δi correctly.The model is called one-parameter because the probabilty Pni is a function of difference (θn - δi).The model is also called logistic because the function is logistic
9 Item Characteristic Curve in Rasch Dichotomous model δi – the point on the ability scale where the probability of a correct response is 0.5. The greater the value of this parameter, the greater the ability that is required for an examinee to have a 50% chance of getting the item correct; hence, the harder the item.In theory δi parameter can vary from -∞ to +∞, but typically values of δi vary from about -3 to +3.
10 ICCs of three items in Rasch model with difficulties δ1= -1, δ2= 0 и δ3= +1
11 Assumptions of Rasch model ICCs differ only in their location along the ability scale, they don’t cross (are parallel).Item difficulty is the only item characteristic that influences examinee performance.All items are equally discriminating.The lower asymptote of the ICC is zero: examinees of very low ability have zero probability of correctly answering the item (no guessing).
12 Parameter Interpretation in Dichotomous Rasch Model An ability level of any examinee is defined as logarithm chance for this examinee to answer correctly an item with 0 difficulty:A difficulty level of any item is defined as logarithm chance to answer correctly this item by an examinee with 0 ability:
13 Parameter Separation in Rasch Model Log odds that a person passes an item is just difference between examinee ability level and item difficulty.Item and examinee parameters are completely separated, making it possible to estimate examinee ability independently of item difficulty, and to estimate item difficulty independently of examinee ability.Item and examinee parameters lie on the same linear scale.The unit of measurement on this scale is one logit (shortening of log-odds unit – the unit of logarithm chances).
14 Concept of “Specific Objectivity” in Rasch Model Comparisons between objects must be invariant over the specific conditions under which they were observed :- comparisons between persons must be invariant over the specific items used to measure them,- comparisons between items must be invariant over the specific persons used to calibrate them.Only Rasch models guarantee this property.
15 Invariant-Person Comparisons: the same differences are observed regardless of the items Consider the Rasch model predictions for log odds ratio for two persons with abilities θ1 and θ2 for an item with difficulty δi :Subtracting the differences yields the following:Thus, the difference in log odds for any item is simply the difference between the two abilities: the item difficulty δi dropped out of the equation. So, the same difference in performance between the two persons is expected, regardless of item difficulty.
16 Invariant-Item Comparisons: differences between items don’t depend on the particular persons used to compare themConsider two items with difficulties δ1 and δ2 and the following two equations for the log odds of two items for any person n:Subtracting the differences yields:The ability level dropped out of the equation. So, the expected difference in performance for any examinee is the difference between item difficulties.
17 Other IRT Models (2PL and 3PL) fail to meet “specific objectivity”: For example, comparison of two persons in the framework of 2PL model yields the following:And furtherThe right part of this equation contains a discrimination parameter aiof the item. So, unlike the Rasch model, the expected difference in performance does not depend only on abilities; it is proportional to their difference with the proportion ai depending on the particular item.
18 Parameter Estimation in Rasch Model Total number of parameters to be estimated in dichotomous Rasch model is N+I, where N is the number of examinees, I is the number of items.Methods of mathematical statistics are used for parameter point estimation. Most estimation methods employ some form of the method of maximum likelihood (without distributional assumptions or with distributional assumptions regarding the parameters).Under Rasch model raw scores are sufficient statistics for both items and persons measures. It means that all examinees with the same raw score will get the same ability estimate. Similarly for items. Due to this property, all measures can be estimated simultaneously.
19 Probability Curves for Rasch Dichotomous Model πni0 and πni1 – probabilities of getting by an examinee score 0 and 1 for item i.In dichotomous case πni1=Pni and πni0 = 1- πni1 = 1- Pni .
20 Partial Credit ModelA simple extension of dichotomous Rasch model: one or more intermediate levels of performance are allowed.Different levels of performance are labelled 0 (no steps taken), 1, 2, …, m (the highest level of performance possible).In order to reach the highest category m, an examinee must complete m steps consecutively, getting 1 point for each of them. Each step can be taken only if the previous step has been completed.Difficulty of each step doesn’t depend on difficulties of other steps.
21 Two-step item (m=2)Performance levels: 0 (absolutely correct, superior quality) ,1 (particular correct, good quality) and 2 (incorrect, poor quality).An item has an intermediate scoring level which allows to award an additional point for particular completed item.Such item has three possible categories and two steps.
22 The probability of completing each step can be described by a Rasch model: Pni1 - probability of person n scoring 1 rather 0 on item iPni2 - probability of person n scoring 2 rather 1 on item iθn - ability level of examinee nδi1 and δi2 – step difficulties in item i.
23 Item Operating Curves for two-step item (Step Characteristics Curves)
24 Partial Credit Modelπ nik is the probability of examinee n with ability θn to get score k for item i.k is the count of the completed item steps.k=0,1,…, mi , where mi is the number of item steps.
26 Category Probability Curves for Two-Step Item for the case δi1 > δi2 When the second step is easier than the first, the probability curve for the middle response category doesn’t dominate on any part of the ability scale.Even though the second step is easier than the first, the defined order of the response categories requires that this easier second step be undertaken only after the harder first step has been successfully completed.
27 PCM can be written as:For any step k log odds for this examinee is only defined by the difficulty of the step δik
28 Step Characteristics Curves in PCM (Operaing Curves) These operating curves have the same slope (so don’t cross) and differ only in their location on the ability continuum.
29 Item Characteristic Curve for two-step item ICC for polytomous item represents an expected score on the item as a function of examinee ability level
30 ICCs of several two-step items Unlike ICCs in the dichotomous Rasch model, ICCs of different polytomous items are not parllel, they can cross
31 Rating Scale ModelCan be considered as a particular case of PCM when all items have the common response format (for example, Likert scale)Usually is used to collect attitude dataEach item is provided with a stem (or statement of attitude) and a few response alternatives where a respondent is required to chose one, indicating the extent to which the statement in the stem is endorsedThus, all items have m response alternatives and they are the same for all items. Completing of the k-th step can be considered as choosing the k-th alternative over the (k-1)-th in response to the item.
32 Example: Likert Scale SD D N A SA Has 4 or 5 categories: Strongly Disagree, Disagree, Undecided (or Neutral) - may be omitted, Agree, Strongly Agree:SD D N A SAResponse alternatives are ordered to represent a respondent’s increasing inclination towards the concept questionedA person who chooses to Agree with a statement on an attitude questionnaire can be considered to have chosen Disagree over Strongly Disagree (1-st step taken), and also Neutral over Disagree (2-nd step taken), and also Agree over Neutral (3-rd step taken), but to have failed to choose Strongly Agree over Agree (4-th step not taken).All responses are coded as 1,2,3,4,5, where the higher number indicates a higher degree of agreement with the statement.
33 Concept of item difficulty Consider two statements from the test of computer anxiety :I am so afraid of computers I avoid using them SD D N A SAI am afraid that I’ll make mistakes when I use my computer SD D N A SAIt is more than likely that the first stem indicates much higher levels of computer anxiety that does the second stem. Indeed, the children who respond SA on the “mistakes” stem might endorse N on the “avoid using” stem. And we should use :I am so afraid of computers I avoid using them SD D N A SAI am afraid that I’ll make mistakes when I use my computer SD D N A SAThe first item can be considered as more difficult than the second item. So each item can be accorded a difficulty estimate (location of the item on the variable axis)
34 Concept of threshold parameter As the same set of rating points is used with every item, it is usually thought that the relative difficulties of the steps in each item should not vary from item to item.The pattern of item steps around an item location is supposed to be determined by the fixed set of threshold parameters, that is fixed set of rating points used with all items.These threshold parameters are estimated once for the entire item set.
35 Difficulty of any step can be resolved into two components : δik- difficulty of completing the k-th step or choosing the k -th alternative in the response to the item iδi - the location of item i (item difficulty)τk – the location of the k –th step in each item relative to that item’s location (threshold parameter for k-th step)The only difference between items is the difference in their location on the variable (or difference in their difficulty). The pattern of item steps around this location is described by the threshold parameters τk, k =1,…,m, that is fixed set of rating points used with all items.
36 Probabilities of passing each threshold can be described by a Rasch model (for two two-step items): Pnik - probability of person n scoring k rather k-1 (choosing the k-th alternative over (k-1)-th) in response to the item i; k=1,2.θn - ability level of examinee nδi – the location of item i on the variable axis (item difficulty)τi1 and τi2 – threshold parameters in item i.
37 Item Operating Curves (Step Characteristics Curves) for two Rating Scale Items with Three Response Categories
38 Rating Scale Modelπ nik is the probability of examinee n with ability θn to get score k for item i (to chose the k-th alternative).k=0,1,…, m , where m is the number of item steps in any item.
39 RSM can be written as:For any step k (or the k-th response category) log odds of choosing the category over the previous adjacent one for this examinee is only defined by the difficulty of the item δi and difficulty of the k-th step τk