Dimensionality of the latent structure and item selection via latent class multidimensional IRT models FRANCESCO BARTOLUCCI.

Slides:



Advertisements
Similar presentations
: INTRODUCTION TO Machine Learning Parametric Methods.
Advertisements

Introduction Simple Random Sampling Stratified Random Sampling
Nonparametric estimation of non- response distribution in the Israeli Social Survey Yury Gubman Dmitri Romanov JSM 2009 Washington DC 4/8/2009.
ECG Signal processing (2)
How Should We Assess the Fit of Rasch-Type Models? Approximating the Power of Goodness-of-fit Statistics in Categorical Data Analysis Alberto Maydeu-Olivares.
Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.
SAMPLE DESIGN: HOW MANY WILL BE IN THE SAMPLE—DESCRIPTIVE STUDIES ?
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT
Item Response Theory in Health Measurement
Copyright EM LYON Par accord du CFC Cession et reproduction interdites Research in Entrepreneurship- The problem of unobserved heterogeneity Frédéric Delmar.
Exploring the Full-Information Bifactor Model in Vertical Scaling With Construct Shift Ying Li and Robert W. Lissitz.
Statistical Estimation and Sampling Distributions
Model generalization Test error Bias, variance and complexity
3 rd Place Winning Project, 2009 USPROC Author: Kinjal Basu Sujayam Saha Sponsor Professor: S. Ghosh A.K. Ghosh Indian Statistical Institute, Kolkata,
Model Assessment, Selection and Averaging
Model assessment and cross-validation - overview
A procedure for dimensionality analyses of response data from various test designs Jinming Zhang William Stout.
A Bayesian Approach to Joint Feature Selection and Classifier Design Balaji Krishnapuram, Alexander J. Hartemink, Lawrence Carin, Fellow, IEEE, and Mario.
Supervised classification performance (prediction) assessment Dr. Huiru Zheng Dr. Franscisco Azuaje School of Computing and Mathematics Faculty of Engineering.
Speaker Adaptation for Vowel Classification
Midterm Review Goodness of Fit and Predictive Accuracy
On Comparing Classifiers: Pitfalls to Avoid and Recommended Approach Published by Steven L. Salzberg Presented by Prakash Tilwani MACS 598 April 25 th.
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (13-15 May 2008) Sample results expected accuracy in the Italian Population and Housing.
1 CSI5388 Data Sets: Running Proper Comparative Studies with Large Data Repositories [Based on Salzberg, S.L., 1997 “On Comparing Classifiers: Pitfalls.
Mixture Modeling Chongming Yang Research Support Center FHSS College.
Identification of Misfit Item Using IRT Models Dr Muhammad Naveed Khalid.
Today Evaluation Measures Accuracy Significance Testing
June 19, 2008Stat Lecture 12 - Testing 21 Introduction to Inference More on Hypothesis Tests Statistics Lecture 12.
2015 AprilUNIVERSITY OF HAIFA, DEPARTMENT OF STATISTICS, SEMINAR FOR M.A 1 Hastie, Tibshirani and Friedman.The Elements of Statistical Learning (2nd edition,
1 Selection bias and auditing policies on insurance claims Leuven, July 20, 2006 Jean Pinquet, Montserrat Guillén & Mercedes Ayuso.
沈致远. Test error(generalization error): the expected prediction error over an independent test sample Training error: the average loss over the training.
Introduction Neuropsychological Symptoms Scale The Neuropsychological Symptoms Scale (NSS; Dean, 2010) was designed for use in the clinical interview to.
Chapter 7 Estimation. Section 7.3 Estimating p in the Binomial Distribution.
Measuring Mathematical Knowledge for Teaching: Measurement and Modeling Issues in Constructing and Using Teacher Assessments DeAnn Huinker, Daniel A. Sass,
Outline 1-D regression Least-squares Regression Non-iterative Least-squares Regression Basis Functions Overfitting Validation 2.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
June 5, 2006University of Trento1 Latent Semantic Indexing for the Routing Problem Doctorate course “Web Information Retrieval” PhD Student Irina Veredina.
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
Using Inactivity to Detect Unusual behavior Presenter : Siang Wang Advisor : Dr. Yen - Ting Chen Date : Motion and video Computing, WMVC.
Controlling for Common Method Variance in PLS Analysis: The Measured Latent Marker Variable Approach Wynne W. Chin Jason Bennett Thatcher Ryan T. Wright.
CROSS-VALIDATION AND MODEL SELECTION Many Slides are from: Dr. Thomas Jensen -Expedia.com and Prof. Olga Veksler - CS Learning and Computer Vision.
Differential Item Functioning. Anatomy of the name DIFFERENTIAL –Differential Calculus? –Comparing two groups ITEM –Focus on ONE item at a time –Not the.
INTRODUCTION TO Machine Learning 3rd Edition
Slides for “Data Mining” by I. H. Witten and E. Frank.
The Impact of Missing Data on the Detection of Nonuniform Differential Item Functioning W. Holmes Finch.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Assessing Estimability of Latent Class Models Using a Bayesian Estimation Approach Elizabeth S. Garrett Scott L. Zeger Johns Hopkins University Departments.
Unsupervised Mining of Statistical Temporal Structures in Video Liu ze yuan May 15,2011.
Over-fitting and Regularization Chapter 4 textbook Lectures 11 and 12 on amlbook.com.
Item Response Theory in Health Measurement
FIT ANALYSIS IN RASCH MODEL University of Ostrava Czech republic 26-31, March, 2012.
Validation methods.
Section 6.4 Inferences for Variances. Chi-square probability densities.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Computational Intelligence: Methods and Applications Lecture 15 Model selection and tradeoffs. Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.
Two Approaches to Estimation of Classification Accuracy Rate Under Item Response Theory Quinn N. Lathrop and Ying Cheng Assistant Professor Ph.D., University.
Next, this study employed SVM to classify the emotion label for each EEG segment. The basic idea is to project input data onto a higher dimensional feature.
Synthetic Approaches to Data Linkage Mark Elliot, University of Manchester Jerry Reiter Duke University Cathie Marsh Centre.
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
Data Science Credibility: Evaluating What’s Been Learned
Inference: Conclusion with Confidence
One-Sample Inference for Proportions
Inference: Conclusion with Confidence
Multivariable Logistic Regression Split Cohort into Development &
Estimating a Population Proportion
A Simulation and Cautionary Assessment
Methodology Week 5.
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Dimensionality of the latent structure and item selection via latent class multidimensional IRT models FRANCESCO BARTOLUCCI

Outline Introduction Data set Statistic Methodology Strategy of Analysis Application to the Dataset Conclusion

Introduction Dimensionality issue of health conditions: Subjects show a degenerative health status to a specific pathology, but have overall good health status. Assume the population is divided into a certain number of latent classes. Address the issue of item selection.

The ULISSE Dataset A network on health care services for older people Longitudinal survey Filled out by the nursing assistant Since 2004 through the repeated administration every 6months 79 items – 1: presence of a specific health problem – 0: its absence

Model Latent class Multidimensional 2PL – Constraint:

Latent class model

Model log-likelihood number of free parameter – LC: – 2PL: – Difference between them:

Estimate Expectation-Maximization (EM) E-step: conditional expected value M-step: maximizing the log-likelihood where is replaced.

Strategy of Analysis Selection of the number of latent class Validation of the multidimensional 2PL model Assessment of the number of dimensions Reduction of the number of items

Step 1. Selection of the number of latent classes BIC: – LC or 2PL #par: penalization term – Number of classes increasing, #par rising AIC tends to overestimate the number of classes

Step 2. Validation of the Multidimensional 2PL model Compare the LC and 2PL model by BIC. For validate the structure of the questionnaire. LC, which is completely unconstrained, allows each item to measure a separate dimension. If 2PL proves preferable in BIC, the evidence of item structure is found.

Step 3. Assessment of the Number of Dimensions Chi-square, df=k-2 – is the probability under the s dimensions – is the probability under the s-1 dimensions When sample size is large, the criterion is too severe, it may lead to overestimating the number of dimensions. Adopt BIC

Step 4. Reduction of the Number of Items Discrimination index between 0 and 1 (constraint) Minimum number of items is 5 retained for each dimension. However, indices are not comparable across dimensions, so latent trait standardized for each dimension is required.

Step 4. Reduction of the Number of Items Standardized ability: Transform the items parameter: normalized Garma:

Step 4. Reduction of the Number of Items Item reduction changes the classification of the subjects. – Posterior on the full set items, and then on the subset. – Use the same parameter obtained with the initial set.

Application to the ULISSE Dataset Selection of the Number of Latent Classes

Application to the ULISSE Dataset Validation of the 2PL Model – 2PL: BIC=68, < – LC: BIC= PL proves preferable The structure is also validated, and the assumption (each section measures each dimension) is supported.

Application to the ULISSE Dataset Assessment of the Number of Dimensions

Application to the ULISSE Dataset

The initial number of dimension (8) may be excessive.

Application to the ULISSE Dataset The latent classes can be interpretation of different degrees of impairment of health status.

Application to the ULISSE Dataset investigate the stability of 5 dimension, compare between s=5 and s=4 model. – Cross –validated log-liklihood.

Cross-validated log-liklihood 2 Randomly chosen partitions of equal size – Training data – Test data Training: s=4  Test: s=4 Training: s=5  Test: s=5 BIC: s=5 is a proper solution

Reduction of the Number of Items

Conclusion More general structure Missing responses Polytomous items

Question why not studying the number of latent classes and dimensionality simultaneously? MNSQ item-fit statistic used to reducing items could be tried in this process. Simulation studies should be conducted to confirm its efficiency and accuracy of the proposed approach.