Dynamics of Reward and Stimulus Information in Human Decision Making Juan Gao, Rebecca Tortell & James L. McClelland With inspiration from Bill Newsome.

Slides:



Advertisements
Similar presentations
Dynamic Models.
Advertisements

Chapter 4: Basic Properties of Feedback
Exploring uncertainty in cost effectiveness analysis NICE International and HITAP copyright © 2013 Francis Ruiz NICE International (acknowledgements to:
LVAD is a battery-operated, mechanical pump-type device that's surgically implanted. Goal of LVAD: providing the patient with as close to a normal lifestyle.
Quasi-Continuous Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Joel Lachter, Greg Corrado, and Jim Johnston.
LHC Collimation Working Group – 19 December 2011 Modeling and Simulation of Beam Losses during Collimator Alignment (Preliminary Work) G. Valentino With.
Decision Dynamics and Decision States: the Leaky Competing Accumulator Model Psychology 209 March 4, 2013.
Experimental Uncertainties: A Practical Guide What you should already know well What you need to know, and use, in this lab More details available in handout.
HON207 Cognitive Science Sequential Sampling Models.
Controller Tuning: A Motivational Example
THE PHYSICAL BASIS OF SST MEASUREMENTS Validation and evaluation of derived SST products 1.To develop systematic approaches to L4 product intercomparison.
T T18-05 Trend Adjusted Exponential Smoothing Forecast Purpose Allows the analyst to create and analyze the "Trend Adjusted Exponential Smoothing"
Efficient Estimation of Emission Probabilities in profile HMM By Virpi Ahola et al Reviewed By Alok Datar.
From T. McMillen & P. Holmes, J. Math. Psych. 50: 30-57, MURI Center for Human and Robot Decision Dynamics, Sept 13, Phil Holmes, Jonathan.
Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research.
Distinguishing Evidence Accumulation from Response Bias in Categorical Decision-Making Vincent P. Ferrera 1,2, Jack Grinband 1,2, Quan Xiao 1,2, Joy Hirsch.
Chapter 3 Methods for Recording Behavior EDP 7058.
GUIDE to The… D U M M I E S’ DCM Velia Cardin. Functional Specialization is a question of Where? Where in the brain is a certain cognitive/perceptual.
Theory of Decision Time Dynamics, with Applications to Memory.
Exposure Definition and Measurement in Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
Cascade and Ratio Control
Sequence of Events in Spatial Cueing Paradigm +. +  time.
Introduction to cognitive modeling Marieke van Vugt University of Groningen The Netherlands.
Dynamic Decision Making in Complex Task Environments: Principles and Neural Mechanisms Annual Workshop Introduction August, 2008.
Dynamic Decision Making in Complex Task Environments: Principles and Neural Mechanisms Progress and Future Directions November 17, 2009.
Experimental Psychology PSY 433
The Perception of Visual Walking Speed While Moving Frank Durgin, Krista Gigone, Rebecca Scott Swarthmore College In Press: Journal of Experimental Psychology:
BUILDING THE REGRESSION MODEL Data preparation Variable reduction Model Selection Model validation Procedures for variable reduction 1 Building the Regression.
SD modeling process One drawback of using a computer to simulate systems is that the computer will always do exactly what you tell it to do. (Garbage in.
Spatial coding of the Predicted Impact Location of a Looming* Object M. Neppi-Mòdona D. Auclair A.Sirigu J.-R. Duhamel.
Decision Dynamics and Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Juan Gao, Marius Usher and others.
Eurostat Accuracy of Results of Statistical Matching Training Course «Statistical Matching» Rome, 6-8 November 2013 Marcello D’Orazio Dept. National Accounts.
Progress in MURI 15 ( ) Mathematical modeling of decision behavior. AFOSR, Alexandria, VA, Nov 17th, 2009 Phil Holmes 1. Optimizing monkeys? Balancing.
Signal detection Psychophysics.
Modeling Speed-Accuracy Tradeoffs in Recognition Darryl W. Schneider John R. Anderson Carnegie Mellon University.
Expectations and Macroeconomic Stabilization Policies Adaptive and Rational Expectations.
Demand Management and Forecasting Chapter 11 Portions Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Millisecond Time Interval Estimation in a Dynamic Task Jungaa Moon & John Anderson Carnegie Mellon University.
Psychology and Neurobiology of Decision-Making under Uncertainty Angela Yu March 11, 2010.
Effect of Circadian rhythm of temperature on embryo culture Ronny Janssens, Neelke De Munck, Greta Verheyen. Centre for Reproductive Medicine, UZ Brussel,
Dynamic Causal Modeling (DCM) A Practical Perspective Ollie Hulme Barrie Roulston Zeki lab.
Does the brain compute confidence estimates about decisions?
Dynamics of Reward Bias Effects in Perceptual Decision Making Jay McClelland & Juan Gao Building on: Newsome and Rorie Holmes and Feng Usher and McClelland.
Dynamics of Reward Bias Effects in Perceptual Decision Making
Jay McClelland Stanford University
Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.
Controller Tuning: A Motivational Example
A Classical Model of Decision Making: The Drift Diffusion Model of Choice Between Two Alternatives At each time step a small sample of noisy information.
On the Nature of Decision States: Theory and Data
Volume 28, Issue 8, Pages e6 (April 2018)
Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.
Decision Making during the Psychological Refractory Period
Using Time-Varying Motion Stimuli to Explore Decision Dynamics
Volume 72, Issue 4, Pages (November 2011)
Human Reward / Stimulus/ Response Signal Experiment: Data and Analysis
Marius Usher, Phil Holmes, Juan Gao, Bill Newsome and Alan Rorie
Recency vs Primacy -- an ongoing project
Braden A. Purcell, Roozbeh Kiani  Neuron 
Banburismus and the Brain
Volume 36, Issue 5, Pages (December 2002)
Jack Grinband, Joy Hirsch, Vincent P. Ferrera  Neuron 
Jay and Juan building on Feng and Holmes
Confidence Is the Bridge between Multi-stage Decisions
Moran Furman, Xiao-Jing Wang  Neuron 
Wallis, JD Helen Wills Neuroscience Institute UC, Berkeley
Redmond G. O’Connell, Michael N. Shadlen, KongFatt Wong-Lin, Simon P
Volume 28, Issue 8, Pages e6 (April 2018)
Analysis Assumptions -x m - m + c
Predictive Neural Coding of Reward Preference Involves Dissociable Responses in Human Ventral Midbrain and Ventral Striatum  John P. O'Doherty, Tony W.
An Upside to Reward Sensitivity: The Hippocampus Supports Enhanced Reinforcement Learning in Adolescence  Juliet Y. Davidow, Karin Foerde, Adriana Galván,
Presentation transcript:

Dynamics of Reward and Stimulus Information in Human Decision Making Juan Gao, Rebecca Tortell & James L. McClelland With inspiration from Bill Newsome and Phil Holmes

Questions Can we track the time course of reward bias as stimulus information is accumulated over time? How well can human participants adjust their bias to optimize reward when optimal bias varies over time? How do humans achieve the observed bias effects? Can we distinguish between alternative accounts of the data?

Design follows Rorie et al. but there is a variable delay between stimulus onset and go cue. Reward cue signals which alternative is worth 2 points 750 msec before Stimulus onset. Stimulus is a rectangle 1,3, or 5 pixels longer to the Left or Right. Participant must respond within 250 msec of go cue.

4 or 5 Participants Show Reward Bias Effect

Accuracy Analysis

Individual Differences in Accuracy and Time Parameters

Leak and Inhibition Dominant LCA: Both can fit the d’ data

2-D inhibition-dominant LCA can fit the data too Final time slice

Optimal Criterion Placement

Optimal vs. Observed Bias Effects Reward harvest rates For short lags:

Reward bias in leak-dominant LCA Reward as input to the accumulators Reward as offset to initial conditions Reward as constant shift or shifted criterion Like the data!

Excellent fits are obtained under leak dominance with reward as a constant offset

But there are drawbacks to the leak-dominant model Leak-dominance produces equivocal decision states, while inhibition dominance produces more categorical activations. –These states may leave the participant better prepared to respond when the signal comes. Evidence Juan will present later favors inhibition dominance in similar paradigms

Reward Bias in Inhibition-dominant LCA ( < 0) Reward as input to the accumulators Reward as offset to initial conditions Reward as constant shift or shifted criterion Like the data!

Simulation of Inhibition-Dominant LCA using Parameters Derived from 1-D Reduction

Relationship between response speed and choice accuracy

Different levels of activation of correct and incorrect responses in Inhibition-dominant LCA Final time slice correct errors

Preliminary Simulation of High- Threshold LCA

Conclusion and Future Directions Two viable models remain, though we favor the leak-dominant LCA model. –Juan Gao will present other evidence relevant to this later. There is evidence that the decision state remains continuous until the response is made, consistent with the high-threshold model –Further tests of the details of this model are necessary. We plan to examine whether the same approach can fit the primate physiology data We look forward to seeing the activations of the accumulators in Human MEG/EEG data