DIF Analysis Galina Larina 28-31 of March, 2012 University of Ostrava.

Slides:

Advertisements

Similar presentations

Copyright © 2006 Educational Testing Service Listening. Learning. Leading. Using Differential Item Functioning to Investigate the Impact of Accommodations.

Advertisements

The effect of differential item functioning in anchor items on population invariance of equating Anne Corinne Huggins University of Florida.

Item Response Theory in a Multi-level Framework Saralyn Miller Meg Oliphint EDU 7309.

LOGO One of the easiest to use Software: Winsteps

How Should We Assess the Fit of Rasch-Type Models? Approximating the Power of Goodness-of-fit Statistics in Categorical Data Analysis Alberto Maydeu-Olivares.

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.

Item Response Theory in Health Measurement

AN OVERVIEW OF THE FAMILY OF RASCH MODELS Elena Kardanova

Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.

Models for Measuring. What do the models have in common? They are all cases of a general model. How are people responding? What are your intentions in.

N.D.GagunashviliUniversity of Akureyri, Iceland Pearson´s χ 2 Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted.

Overview of field trial analysis procedures National Research Coordinators Meeting Windsor, June 2008.

Item Response Theory. Shortcomings of Classical True Score Model Sample dependence Limitation to the specific test situation. Dependence on the parallel.

A second example of Chi Square Imagine that the managers of a particular factory are interested in whether each line in their assembly process is equally.

7-2 Estimating a Population Proportion

AN ALGORITHM FOR TESTING UNIDIMENSIONALITY AND CLUSTERING ITEMS IN RASCH MEASUREMENT Rudolf Debelak & Martin Arendasy.

© UCLES 2013 Assessing the Fit of IRT Models in Language Testing Muhammad Naveed Khalid Ardeshir Geranpayeh.

SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.

SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Split Sample Validation General criteria for split sample validation Sample problems.

1 Nominal Data Greg C Elvers. 2 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics.

Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013.

1 Reducing the duration and cost of assessment with the GAIN: Computer Adaptive Testing.

Measurement Problems within Assessment: Can Rasch Analysis help us? Mike Horton Bipin Bhakta Alan Tennant.

Identification of Misfit Item Using IRT Models Dr Muhammad Naveed Khalid.

Chapter 7 Confidence Intervals and Sample Sizes

Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides

DIFFERENTIAL ITEM FUNCTIONING AND COGNITIVE ASSESSMENT USING IRT-BASED METHODS Jeanne Teresi, Ed.D., Ph.D. Katja Ocepek-Welikson, M.Phil.

Modern Test Theory Item Response Theory (IRT). Limitations of classical test theory An examinee’s ability is defined in terms of a particular test The.

Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics, A First Course 4 th Edition.

Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.

Random Sampling, Point Estimation and Maximum Likelihood.

Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.

Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.

Types of Data in FCS Survey Nominal Scale – Labels and categories (branch, farming operation) Ordinal Scale – Order and rank (expectations, future plans,

Rasch trees: A new method for detecting differential item functioning in the Rasch model Carolin Strobl Julia Kopf Achim Zeileis.

1 An Investigation of The Response Time for Maths Items in A Computer Adaptive Test C. Wheadon & Q. He, CEM CENTRE, DURHAM UNIVERSITY, UK Chris Wheadon.

Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.

1 Differential Item Functioning in Mplus Summer School Week 2.

Formula-Free Geometry. Area and Volume Exact Geometry Easy! Pretty easy!

Differential Item Functioning. Anatomy of the name DIFFERENTIAL –Differential Calculus? –Comparing two groups ITEM –Focus on ONE item at a time –Not the.

The Impact of Missing Data on the Detection of Nonuniform Differential Item Functioning W. Holmes Finch.

Nonparametric Tests of Significance Statistics for Political Science Levin and Fox Chapter Nine Part One.

University of Ostrava Czech republic 26-31, March, 2012.

Estimation. The Model Probability The Model for N Items — 1 The vector probability takes this form if we assume independence.

NATIONAL CONFERENCE ON STUDENT ASSESSMENT JUNE 22, 2011 ORLANDO, FL.

Item Response Theory in Health Measurement

FIT ANALYSIS IN RASCH MODEL University of Ostrava Czech republic 26-31, March, 2012.

Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Heriot Watt University 12th February 2003.

Rating Scale Examples. A helpful resource

Review: Stages in Research Process Formulate Problem Determine Research Design Determine Data Collection Method Design Data Collection Forms Design Sample.

1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.

Stats/Methods II JEOPARDY. Jeopardy Estimation ANOVA shorthand ANOVA concepts Post hoc testsSurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400.

Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &

Using Simulation to evaluate Rasch Models John Little CEM, Durham University

IRT Equating Kolen & Brennan, 2004 & 2014 EPSY

The Chi Square Test A statistical method used to determine goodness of fit Chi-square requires no assumptions about the shape of the population distribution.

BINARY LOGISTIC REGRESSION

Mixture Modeling of the p-value Distribution

Methodologies & Procedures for Evaluation

Classical Test Theory Margaret Wu.

Item Analysis: Classical and Beyond

POINT ESTIMATOR OF PARAMETERS

Rating Scale Examples.

Multiple Regression – Split Sample Validation

UNIT V CHISQUARE DISTRIBUTION

Lecture Slides Elementary Statistics Twelfth Edition

S.M.JOSHI COLLEGE, HADAPSAR

Item Analysis: Classical and Beyond

Evaluating Multi-item Scales

Item Analysis: Classical and Beyond

Presentation transcript:

DIF Analysis Galina Larina of March, 2012 University of Ostrava

DIF analysis Definitions Item impact – “significant group difference on an item, e.g., when one group has a higher proportion of examinees answering an item correctly than another group ” – Due to the true group differences in proficiency or due to item bias Differential Item Functioning (DIF) – “It occurs when test-takers having identical levels on the latent trait that the test was designed to measure but belonging to different groups, have different probabilities of endorsing (or answering correctly) a particular item” – Examinees in different groups are matched on the proficiency If an item is found to be poor-fitting in the whole data set or within any group of test-takers, it should be remove from subsequent DIF analysis

DIF analysis Effectless of fit statistics WinstepsConquest InfitOutfitInfitOutfit Mean1.00 Maximum Minimum Item Infit and outfit mean square errors for simulated 50-item test in which item 25 has DIF

DIF analysis Types of DIF Uniform DIFNon-uniform DIF Non-uniform mixed DIF

DIF analysis Statistical methods for evaluating DIF CTT methods – Conditional p-value difference – Delta plot – Standardization Chi-square methods – Mantel-Haenszel – etc. IRT methods

DIF analysis Mantel-Haenszel method Base group Focal group

DIF analysis Mantel-Haenszel method Average factor by which the likelihood that a base group member gets the item correct exceeds the corresponding likelihood for comparable focal group members For statistically significant DIF on an item, Prob. < 0.05

DIF analysis Mantel-Haenszel method MH procedure is an extension of the chi-square test of independence Advantages: – Easy to compute – Modest sample size requirements – Effect size ETS DIF classification rules – ‘Large DIF’ absolute value of MH D-DIF greater than or equal to 1.5, chi-square test sig. at 0.05 level/ Category C – ‘Moderate DIF’ at least 1.0 (and less) than 1.5) and the chi- square test sig. at 0.05 level/ Category B

DIF analysis Rasch approaches Separate calibration t-test first proposed by Wright and Stone Where d i1 is the difficulty of item I in calibration 1, d i2 is the difficulty of item i in calibration 2 based on groups 2, s 2 i1 is the standard error of estimate for d i1, and s 2 i2 is the standard error of estimate for d i2 Winsteps applies the above formula in DIF analysis

DIF analysis IRT approaches The between fit approach is based on a single calibration that contains at least two subpopulations of interest. where J is a number of subpopulations, N is a number of person in each populations, x ni is the score for person n responding to item i, and p ni is the probability of person n responding correctly to item i given the overall estimates for the ability of the person and the difficulty of the item

DIF analysis Winsteps DIF label start in person label column 20 DIF label start in person label with a width 1 Column 20 with width 1

DIF analysis Winsteps Press OK Press Entry Number

DIF analysis Winsteps Pairwise comparison This should be at least 0.5 logits for DIF to be noticeable For statistically significant DIF on an item, Prob. < 0.05 For statistically significant DIF on an item, t > |2|

DIF analysis Winsteps Item 1

DIF analysis Winsteps Item 1

DIF analysis Winsteps. Plots Press OK

DIF analysis Winsteps. Plots. Item 1