Decision Dynamics and Decision States: the Leaky Competing Accumulator Model Psychology 209 March 4, 2013.

Slides:



Advertisements
Similar presentations
What is the neural code? Puchalla et al., What is the neural code? Encoding: how does a stimulus cause the pattern of responses? what are the responses.
Advertisements

Timing of the brain events underlying access to consciousness during the attentional blink Claire Sergent, Sylvain Baillet, & Stanislas Dehaene.
Institute for Theoretical Physics and Mathematics Tehran January, 2006 Value based decision making: behavior and theory.
Neural Correlates of Visual Awareness. A Hard Problem Are all organisms conscious?
Quasi-Continuous Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Joel Lachter, Greg Corrado, and Jim Johnston.
A saliency map model explains the effects of random variations along irrelevant dimensions in texture segmentation and visual search Li Zhaoping, University.
For stimulus s, have estimated s est Bias: Cramer-Rao bound: Mean square error: Variance: Fisher information How good is our estimate? (ML is unbiased:
PSYCHOPHYSICS What is Psychophysics? Classical Psychophysics Thresholds Signal Detection Theory Psychophysical Laws.
Sequential Hypothesis Testing under Stochastic Deadlines Peter Frazier, Angela Yu Princeton University TexPoint fonts used in EMF. Read the TexPoint manual.
1 Information Markets & Decision Makers Yiling Chen Anthony Kwasnica Tracy Mullen Penn State University This research was supported by the Defense Advanced.
The Decisive Commanding Neural Network In the Parietal Cortex By Hsiu-Ming Chang ( 張修明 )
Reward processing (1) There exists plenty of evidence that midbrain dopamine systems encode errors in reward predictions (Schultz, Neuron, 2002) Changes.
From T. McMillen & P. Holmes, J. Math. Psych. 50: 30-57, MURI Center for Human and Robot Decision Dynamics, Sept 13, Phil Holmes, Jonathan.
Distinguishing Evidence Accumulation from Response Bias in Categorical Decision-Making Vincent P. Ferrera 1,2, Jack Grinband 1,2, Quan Xiao 1,2, Joy Hirsch.
Theory of Decision Time Dynamics, with Applications to Memory.
An Integrated Model of Decision Making and Visual Attention Philip L. Smith University of Melbourne Collaborators: Roger Ratcliff, Bradley Wolfgang.
Understanding the Variability of Your Data: Dependent Variable Two "Sources" of Variability in DV (Response Variable) –Independent (Predictor/Explanatory)
1 / 41 Inference and Computation with Population Codes 13 November 2012 Inference and Computation with Population Codes Alexandre Pouget, Peter Dayan,
Chapter 50 The Prefrontal Cortex and Executive Brain Functions Copyright © 2014 Elsevier Inc. All rights reserved.
Neuronal Adaptation to Visual Motion in Area MT of the Macaque -Kohn & Movshon 지각 심리 전공 박정애.
Biological Modeling of Neural Networks: Week 12 – Decision models: Competitive dynamics Wulfram Gerstner EPFL, Lausanne, Switzerland 12.1 Review: Population.
Decision Making Theories in Neuroscience Alexander Vostroknutov October 2008.
Dynamic Decision Making in Complex Task Environments: Principles and Neural Mechanisms Annual Workshop Introduction August, 2008.
Dynamic Decision Making in Complex Task Environments: Principles and Neural Mechanisms Progress and Future Directions November 17, 2009.
BCS547 Neural Decoding. Population Code Tuning CurvesPattern of activity (r) Direction (deg) Activity
BCS547 Neural Decoding.
The Computing Brain: Focus on Decision-Making
Modeling interactions between visually responsive and movement related neurons in frontal eye field during saccade visual search Braden A. Purcell 1, Richard.
What’s optimal about N choices? Tyler McMillen & Phil Holmes, PACM/CSBMB/Conte Center, Princeton University. Banbury, Bunbury, May 2005 at CSH. Thanks.
Decision Dynamics and Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Juan Gao, Marius Usher and others.
Machine Learning 5. Parametric Methods.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
Optimal Eye Movement Strategies In Visual Search.
Progress in MURI 15 ( ) Mathematical modeling of decision behavior. AFOSR, Alexandria, VA, Nov 17th, 2009 Phil Holmes 1. Optimizing monkeys? Balancing.
Dynamics of Reward and Stimulus Information in Human Decision Making Juan Gao, Rebecca Tortell & James L. McClelland With inspiration from Bill Newsome.
Neural correlates of risk sensitivity An fMRI study of instrumental choice behavior Yael Niv, Jeffrey A. Edlund, Peter Dayan, and John O’Doherty Cohen.
Psychology and Neurobiology of Decision-Making under Uncertainty Angela Yu March 11, 2010.
Event-Related Potentials Chap2. Ten Simple Rules for Designing ERP Experiments (2/2) 임원진
Does the brain compute confidence estimates about decisions?
Dynamics of Reward Bias Effects in Perceptual Decision Making Jay McClelland & Juan Gao Building on: Newsome and Rorie Holmes and Feng Usher and McClelland.
Optimal Decision-Making in Humans & Animals Angela Yu March 05, 2009.
Bayesian inference in neural networks
Dynamics of Reward Bias Effects in Perceptual Decision Making
Determining How Costs Behave
Jay McClelland Stanford University
Synaptic Dynamics: Unsupervised Learning
Piercing of Consciousness as a Threshold-Crossing Operation
Sequential sampling models of the choice process
Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.
A Classical Model of Decision Making: The Drift Diffusion Model of Choice Between Two Alternatives At each time step a small sample of noisy information.
On the Nature of Decision States: Theory and Data
Changing environment task.
Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.
Decision Making during the Psychological Refractory Period
Using Time-Varying Motion Stimuli to Explore Decision Dynamics
Human Reward / Stimulus/ Response Signal Experiment: Data and Analysis
Marius Usher, Phil Holmes, Juan Gao, Bill Newsome and Alan Rorie
Recency vs Primacy -- an ongoing project
Choice Certainty Is Informed by Both Evidence and Decision Time
Braden A. Purcell, Roozbeh Kiani  Neuron 
Probabilistic Population Codes for Bayesian Decision Making
Variance as a Signature of Neural Computations during Decision Making
Banburismus and the Brain
C. Shawn Green, Alexandre Pouget, Daphne Bavelier  Current Biology 
Decision Making as a Window on Cognition
Volume 24, Issue 13, Pages (July 2014)
Volume 36, Issue 5, Pages (December 2002)
Interaction of Sensory and Value Information in Decision-Making
Confidence Is the Bridge between Multi-stage Decisions
Redmond G. O’Connell, Michael N. Shadlen, KongFatt Wong-Lin, Simon P
Presentation transcript:

Decision Dynamics and Decision States: the Leaky Competing Accumulator Model Psychology 209 March 4, 2013

Is the rectangle longer toward the northwest or longer toward the northeast?

Longer toward the Northeast! 2.00” 1.99”

A Classical Model of Decision Making: The Drift Diffusion Model of Choice Between Two Alternative Decisions At each time step a small sample of noisy information is obtained; each sample adds to a cumulative relative evidence variable. Mean of the noisy samples is + for one alternative, – for the other, with standard deviation . When a bound is reached, the corresponding choice is made. Alternatively, in ‘time controlled’ or ‘interrogation’ tasks, respond when signal is given, based on value of the relative evidence variable.

The DDM is an optimal model, and it is consistent with some data from neurophysiology It achieves the fastest possible decision on average for a given level of accuracy It can be tuned to optimize performance under different kinds of task conditions –Different prior probabilities –Different costs and payoffs –Variation in the time between trials… The activity of neurons in a brain area associated with decision making seems to reflect the DD process

Neural Basis of Decision Making in Monkeys (Shadlen & Newsome; Roitman & Shadlen, 2002) RT task paradigm of R&T. Motion coherence and direction is varied from trial to trial.

Neural Basis of Decision Making in Monkeys: Results Data are averaged over many different neurons that are associated with intended eye movements to the location of target.

Hard Prob. Correct Easy A Problem with the DDM Accuracy should gradually improve toward ceiling levels as more time is allowed, even for very hard discriminations, but this is not what is observed in human data. Two possible fixes: –Trial-to-trial variance in the direction of drift –Evidence accumulation may reach a bound and stop, even if more time is available

Goals for a Neurally Inspired Model of Decision Making Incorporate principles of neural processing Build a bridge between abstract statistically- grounded approaches and details of physiology Explain existing data Make predictions and see if they are borne out in data Offer a new way of thinking about the nature of decision states

Usher and McClelland (2001) Leaky Competing Accumulator Model Addresses the process of deciding between two alternatives based on external input, with leakage, mutual inhibition, and noise: dy 1 /dt = I 1 -y 1 –f(y 2 )+ 1 dy 2 /dt = I 2 -y 2 –f(y 1 )+ 2 f(y) = [y] + Participant chooses the most active accumulator when the go cue occurs This is equivalent to choosing response 1 iff y 1 -y 2 > 0 Let y = (y 1 -y 2 ). While y 1 and y 2 are positive, the model reduces to: dy/dt = I-y+ I=I 1 -I 2 =-=  -   11 22 y1y1 y2y2

Wong & Wang (2006) ~Usher & McClelland (2001)

The Full Non-Linear LCA i Model y1y1 y2y2 Although the value of the difference variable is not well-captured by the linear approximation, the sign of the difference is approximated very closely.

Time-accuracy curves for different |k-| or || |k-  = 0 |k-  =.2 |k-  =.4

Prob. Correct

Kiani, Hanks and Shadlen 2008 Random motion stimuli of different coherences. Stimulus duration follows an exponential distribution. ‘go’ cue can occur at stimulus offset; response must occur within 500 msec to earn reward.

The earlier the pulse, the more it matters (Kiani et al, 2008)

These results rule out leak dominance X Still viable

Quasi-Continuous, Quasi-Discrete, Reversible Decision States in the Non- linear LCA i Quasi-continuous, quasi-discrete decision states

Predictions We should be able to find signs of differences in decision states associated with correct and incorrect responses. We should be able to see signs of bifurcation even if we ask for a continuous response. We should be able see evidence of rebound of suppressed alternatives if the input changes.

Predictions  We should be able to find signs of differences in ‘strength’ of decision states associated with correct and incorrect responses. We should be able to see signs of bifurcation when we ask for a continuous response. We should be able see evidence of recovery of suppressed alternatives if the input changes.

v v Distribution of winner’s activations when incorrect alternative wins Distribution of winner’s activations when correct alternative wins

Gao, Tortell and McClelland (in press) Experiment on Effect of Reward on Decision Dynamics

Proportion of Choices toward Higher Reward

Fits based on full LCA i * *Reward affects the initial state of the accumulators, before the stimulus starts to affect them.

Relationship between choice and RT for each participant and combined Data from Gao et al (2012)

An Account: Two-Stage LCA i

Response Stage Model

Predictions We should be able to find signs of differences in ‘strength’ of decision states associated with correct and incorrect responses.  We should be able to see signs of bifurcation even when we ask for a continuous response. We should be able see evidence of recovery of suppressed alternatives if the input changes.

Bifurcation in the LCA i

Bimodality in Decision States Lachter, Corrado, Johnston & McClelland (in progress)

Results and Descriptive Model of Data from 1 Participant

Experiment 2 Used much finer scale, much more practice & data per participant Found evidence that some participants show a bifurcation while others show un-imodal responses Mapping to response scale appears to be non-linear in many participants

Predictions We should be able to find signs of differences in ‘strength’ of decision states associated with correct and incorrect responses. We should be able to see signs of bifurcation even when we ask for a continuous response.  We should be able see evidence of recovery of suppressed alternatives if the input changes.

Reversability in the LCA i If activation of loser cannot go below 0, reversal of decision states can occur This leads to a predicted interaction of timing by duration.

Experiment Participants viewed random dot motion stimulus presentations of varying durations Three types of trials: –Constant evidence fixed non-zero coherence throughout trial –Early evidence non-zero coherence in first half, 0 in second half –Late evidence 0 coherence in first half, non-zero in second half

Interaction of timing by duration in one participant

Predictions We should be able to find signs of differences in ‘strength’ of decision states associated with correct and incorrect responses. We should be able to see signs of bifurcation when we ask for a continuous response. We should be able see evidence of recovery of suppressed alternatives.

Conclusions Evidence from several studies is consistent with the idea of quasi-continuous, quasi discrete, sometimes reversible, decision states, although, in general, data from only some of the participants plays a strong role in selection between models. The LCA i model provides a simple yet powerful framework in which such states arise. Alternative models considered have difficulties addressing aspects of the data. More work is needed to understand if the LCA i will turn out to be fully adequate, and how the data might be addressed with other approaches. Quasi-continuous, quasi-discrete decision states