
The Impact of Criterion Noise in Signal Detection Theory: An Evaluation across Recognition Memory Tasks
Julie Linzer, David Kellen, Henrik Singmann, Karl Christoph Klauer
Albert-Ludwigs-Universität Freiburg

Background

Signal Detection Theory (SDT; Green & Swets, 1966) is the most prominent and successful measurement model in recognition memory. According to SDT, studied and non-studied items vary in terms of familiarity values, which are compared with fixed response criteria in producing recognition judgments (for a depiction of the model see Figure 1).

Figure 1: Left Panel: Representation of the standard SDT model with mean μ_n (μ_s) and standard deviation σ_n (σ_s) of the noise distribution (signal distribution). c1, c2, c3, c4, c5 depict different response criteria. Right Panel: Representation of the SDT model with criterion noise (σ_c).

A recent debate has questioned the assumption that response criteria are fixed, and a number of researchers have argued that response-criteria positions vary substantially across trials. Such criterion noise has the potential of severely distorting the characterization of the data via standard SDT models. Benjamin, Tullis and Lee (2013) found that recognition-memory performance estimates based on responses to rating scales with many options (e.g., an 8-point confidence scale) were lower than estimates obtained with scales with fewer options (e.g., binary responses), a result that is predicted given criterion noise (for a depiction of the aggregate-data confidence-rating ROCs as well as the binary condition's hit and false-alarm rates see Figure 2). However, there are problematic aspects in Benjamin et al.'s analysis that suggest that the evidence for criterion noise was overstated (see Kellen, Singmann, Klauer & Flade, submitted).

Figure 2: Aggregate-data (uncorrected frequencies) Yes-No ROC from Benjamin et al. (2013). Hit rates correspond to "Yes" responses to old items and "No" responses to new items. False-alarm rates correspond to "No" responses to old items and "Yes" responses to new items.

Experiments

We conducted two experiments that built upon Benjamin et al.'s (2013) approach. In order to directly compare the performance measures across conditions, we used two different memory tasks in which symmetrical ROC curves are expected. In this case, the d' measure becomes a suitable way of summarizing memory performance.

EXPERIMENT 1: two-alternative forced choice task
Participants. 60 students (mean age = 21.98, SD = 2.1, ranging from 18 to 28 years).
Design and Procedure. The computer-based experiment consisted of a single study phase followed by a single test phase.
Study phase. 316 words, presented in black over a grey background for 1200 ms each, with a 200 ms inter-stimulus interval (ISI).
Test phase. Two-alternative forced choice task; two test-scale conditions: binary and 8-point scale.

[Test-display schematic: an old item and a new item presented together; response options "left" and "right" in the binary condition, and "sure left" to "sure right" in the 8-point condition.]

EXPERIMENT 2: source discrimination task
Participants. 47 students (mean age = 21.28, SD = 1.9, ranging from 18 to 29 years).
Study phase. 160 words (90 Source A and 90 Source B items), presented for 2000 ms each (250 ms ISI). Source A items were presented in red on the left side of the screen and Source B items in green on the right side of the screen.
Test phase. Source discrimination task (only old words tested), with the same test conditions as in Experiment 1.

Results

The median d' estimates were 1.22 in both scale conditions in Exp. 1 and 1.07 in both scale conditions in Exp. 2. Not surprisingly, no significant differences were found (smallest p = .70, largest Bayes Factor (alt/null) = 0.21). The d' estimates were correlated across conditions in both experiments (smallest r = 0.65, p < .001).

Figure 3: Left Panel: Aggregate-data 2AFC ROC from Experiment 1. Hit and false-alarm rates correspond to the rate of "Right" responses to old-right and old-left trials, respectively. Right Panel: Aggregate-data source-discrimination ROC from Experiment 2. Hit and false-alarm rates correspond to the rate of "Source A" responses to source-A and source-B items, respectively.

Conclusions

In both experiments, no differences in performance were found between the binary and 8-point scale conditions. This result is at odds with Benjamin et al.'s prediction of a performance decrease in the 8-point condition, and it joins previous failures to demonstrate the contribution of criterion noise (Kellen et al., 2012).
→ The standard SDT model is a suitable measurement model, whose characterization of recognition judgments is not compromised by unaccounted criteria variability.

References

Benjamin, A. S., Tullis, J. G., & Lee, J. H. (2013). Criterion noise in ratings-based recognition: Evidence from the effects of response scale length on recognition accuracy. Journal of Experimental Psychology: Learning, Memory, and Cognition, 39.
Green, D. M., & Swets, J. A. (1966). Signal detection theory and psychophysics. New York: Wiley.
Kellen, D., Klauer, K. C., & Singmann, H. (2012). On the measurement of criterion noise in signal detection theory: The case of recognition memory. Psychological Review, 119.
Kellen, D., Singmann, H., Klauer, K. C., & Flade, F. (submitted). The impact of criterion noise in signal detection theory: An evaluation across recognition memory tasks.
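
To illustrate why unaccounted criterion noise would lower d' estimates, the following is a minimal sketch and not the authors' analysis. It assumes an equal-variance SDT model (familiarity N(0, 1) for new items, N(d', 1) for old items) and a criterion drawn on each trial from a normal distribution with standard deviation sigma_c. Under these assumptions the standard estimate z(H) - z(F) equals d' / sqrt(1 + sigma_c^2). The function names and parameter values are hypothetical and chosen for illustration only. The sketch covers a single yes-no criterion and does not model the scale-length manipulation used in the experiments.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal: provides Phi and its inverse


def yes_no_rates(d_prime, criterion, sigma_c=0.0):
    """Hit and false-alarm rates for an equal-variance SDT model.

    Assumption: the criterion on each trial is drawn from
    N(criterion, sigma_c**2), independently of familiarity.
    sigma_c = 0 gives the standard model with a fixed criterion.
    """
    # Standard deviation of (familiarity - criterion) on a single trial.
    scale = sqrt(1.0 + sigma_c ** 2)
    hit = _STD_NORMAL.cdf((d_prime - criterion) / scale)
    false_alarm = _STD_NORMAL.cdf((0.0 - criterion) / scale)
    return hit, false_alarm


def estimate_d_prime(hit, false_alarm):
    """Standard estimate d' = z(H) - z(F), which ignores criterion noise."""
    return _STD_NORMAL.inv_cdf(hit) - _STD_NORMAL.inv_cdf(false_alarm)


if __name__ == "__main__":
    # Illustrative values only; they are not taken from the experiments.
    for sigma_c in (0.0, 0.5, 1.0):
        h, f = yes_no_rates(d_prime=1.5, criterion=0.75, sigma_c=sigma_c)
        d_hat = estimate_d_prime(h, f)
        print(f"sigma_c={sigma_c:.1f}  H={h:.3f}  F={f:.3f}  d'={d_hat:.3f}")
```

With sigma_c = 0 the estimate recovers the true d'; as sigma_c grows, the estimate shrinks even though the underlying memory strength is unchanged.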