Does Math Matter to Gray Matter? (or, The Rewards of Calculus). Philip Holmes, Princeton University with Eric Brown (NYU), Rafal Bogacz (Bristol, UK),

Slides:



Advertisements
Similar presentations
What is the neural code? Puchalla et al., What is the neural code? Encoding: how does a stimulus cause the pattern of responses? what are the responses.
Advertisements

Action Potentials and Limit Cycles Computational Neuroeconomics and Neuroscience Spring 2011 Session 8 on , presented by Falk Lieder.
Quasi-Continuous Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Joel Lachter, Greg Corrado, and Jim Johnston.
Neurophysics Part 1: Neural encoding and decoding (Ch 1-4) Stimulus to response (1-2) Response to stimulus, information in spikes (3-4) Part 2: Neurons.
Artificial Spiking Neural Networks
Introduction: Neurons and the Problem of Neural Coding Laboratory of Computational Neuroscience, LCN, CH 1015 Lausanne Swiss Federal Institute of Technology.
1 3. Spiking neurons and response variability Lecture Notes on Brain and Computation Byoung-Tak Zhang Biointelligence Laboratory School of Computer Science.
Decision Dynamics and Decision States: the Leaky Competing Accumulator Model Psychology 209 March 4, 2013.
Introduction to the mathematical modeling of neuronal networks Amitabha Bose Jawaharlal Nehru University & New Jersey Institute of Technology IISER, Pune.
HON207 Cognitive Science Sequential Sampling Models.
Neural Networks Marco Loog.
Sequential Hypothesis Testing under Stochastic Deadlines Peter Frazier, Angela Yu Princeton University TexPoint fonts used in EMF. Read the TexPoint manual.
Evaluating Hypotheses
The Decisive Commanding Neural Network In the Parietal Cortex By Hsiu-Ming Chang ( 張修明 )
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
How facilitation influences an attractor model of decision making Larissa Albantakis.
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.
From T. McMillen & P. Holmes, J. Math. Psych. 50: 30-57, MURI Center for Human and Robot Decision Dynamics, Sept 13, Phil Holmes, Jonathan.
Distinguishing Evidence Accumulation from Response Bias in Categorical Decision-Making Vincent P. Ferrera 1,2, Jack Grinband 1,2, Quan Xiao 1,2, Joy Hirsch.
Basic Models in Theoretical Neuroscience Oren Shriki 2010 Integrate and Fire and Conductance Based Neurons 1.
Biological Modeling of Neural Networks: Week 15 – Population Dynamics: The Integral –Equation Approach Wulfram Gerstner EPFL, Lausanne, Switzerland 15.1.
Theory of Decision Time Dynamics, with Applications to Memory.
An Integrated Model of Decision Making and Visual Attention Philip L. Smith University of Melbourne Collaborators: Roger Ratcliff, Bradley Wolfgang.
MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way
THE ROLE OF NEURONS IN PERCEPTION Basic Question How can the messages sent by neurons represent objects in the environment?
Artificial Neural Networks
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 23 Nov 2, 2005 Nanjing University of Science & Technology.
Neural coding (1) LECTURE 8. I.Introduction − Topographic Maps in Cortex − Synesthesia − Firing rates and tuning curves.
Benk Erika Kelemen Zsolt
Neural codes and spiking models. Neuronal codes Spiking models: Hodgkin Huxley Model (small regeneration) Reduction of the HH-Model to two dimensions.
Optimality, robustness, and dynamics of decision making under norepinephrine modulation: A spiking neuronal network model Joint work with Philip Eckhoff.
Biological Modeling of Neural Networks: Week 12 – Decision models: Competitive dynamics Wulfram Gerstner EPFL, Lausanne, Switzerland 12.1 Review: Population.
Decision Making Theories in Neuroscience Alexander Vostroknutov October 2008.
Collective neural dynamics and drift-diffusion models for simple decision tasks. Philip Holmes, Princeton University. Eric Brown (NYU), Rafal Bogacz (Bristol,
Motor Control. Beyond babbling Three problems with motor babbling: –Random exploration is slow –Error-based learning algorithms are faster but error signals.
Dynamic Decision Making in Complex Task Environments: Principles and Neural Mechanisms Annual Workshop Introduction August, 2008.
Dynamic Decision Making in Complex Task Environments: Principles and Neural Mechanisms Progress and Future Directions November 17, 2009.
Introduction to Neural Networks. Biological neural activity –Each neuron has a body, an axon, and many dendrites Can be in one of the two states: firing.
The Computing Brain: Focus on Decision-Making
Modeling interactions between visually responsive and movement related neurons in frontal eye field during saccade visual search Braden A. Purcell 1, Richard.
What’s optimal about N choices? Tyler McMillen & Phil Holmes, PACM/CSBMB/Conte Center, Princeton University. Banbury, Bunbury, May 2005 at CSH. Thanks.
Decision Dynamics and Decision States in the Leaky Competing Accumulator Model Jay McClelland Stanford University With Juan Gao, Marius Usher and others.
Dr.Abeer Mahmoud ARTIFICIAL INTELLIGENCE (CS 461D) Dr. Abeer Mahmoud Computer science Department Princess Nora University Faculty of Computer & Information.
Chapter 3. Stochastic Dynamics in the Brain and Probabilistic Decision-Making in Creating Brain-Like Intelligence, Sendhoff et al. Course: Robots Learning.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
Response dynamics and phase oscillators in the brainstem
Progress in MURI 15 ( ) Mathematical modeling of decision behavior. AFOSR, Alexandria, VA, Nov 17th, 2009 Phil Holmes 1. Optimizing monkeys? Balancing.
Network Models (2) LECTURE 7. I.Introduction − Basic concepts of neural networks II.Realistic neural networks − Homogeneous excitatory and inhibitory.
CSC321: Neural Networks Lecture 1: What are neural networks? Geoffrey Hinton
The Physics of Decision-Making: Cognitive Control as the Optimization of Behavior Gary Aston-Jones ∞ Rafal Bogacz * † ª Eric Brown † Jonathan D. Cohen.
Psychology and Neurobiology of Decision-Making under Uncertainty Angela Yu March 11, 2010.
CPH Dr. Charnigo Chap. 11 Notes Figure 11.2 provides a diagram which shows, at a glance, what a neural network does. Inputs X 1, X 2,.., X P are.
Does the brain compute confidence estimates about decisions?
Dynamics of Reward Bias Effects in Perceptual Decision Making Jay McClelland & Juan Gao Building on: Newsome and Rorie Holmes and Feng Usher and McClelland.
Memory Network Maintenance Using Spike-Timing Dependent Plasticity David Jangraw, ELE ’07 Advisor: John Hopfield, Department of Molecular Biology 12 T.
Optimal Decision-Making in Humans & Animals Angela Yu March 05, 2009.
Mechanisms of Simple Perceptual Decision Making Processes
Artificial Neural Networks
Jay McClelland Stanford University
Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.
Machine Learning Today: Reading: Maria Florina Balcan
A Classical Model of Decision Making: The Drift Diffusion Model of Choice Between Two Alternatives At each time step a small sample of noisy information.
Dynamical Models of Decision Making Optimality, human performance, and principles of neural information processing Jay McClelland Department of Psychology.
Using Time-Varying Motion Stimuli to Explore Decision Dynamics
Volume 40, Issue 6, Pages (December 2003)
Banburismus and the Brain
Decision Making as a Window on Cognition
Volume 36, Issue 5, Pages (December 2002)
by Kenneth W. Latimer, Jacob L. Yates, Miriam L. R
Presentation transcript:

Does Math Matter to Gray Matter? (or, The Rewards of Calculus). Philip Holmes, Princeton University with Eric Brown (NYU), Rafal Bogacz (Bristol, UK), Jeff Moehlis (UCSB), Juan Gao, Patrick Simen & Jonathan Cohen (Princeton); Ed Clayton, Janusz Rajkowski & Gary Aston-Jones (Penn). Thanks to: NIMH, NSF, DoE and the Burroughs-Wellcome Foundation. IMA, December 8th, 2005.

Contents Introduction: The multiscale brain. Part I: Decisions and behavior, or Making the most of a stochastic process. Part II: Spikes and gain changes, or Let them molecules go! Morals: Mathematical and Neurobiological, or You bet math matters !

The multiscale brain: Ingredients: ~ neurons, ~ synapses. Structure: layers and folds. Communication: via action potentials, spikes, bursts. Sources: webvision.med.utah.edu/VisualCortex.html

Multiple scales in the brain and in math: Part I Part II … or Cultural Studies …

What neuroscience is and will become: A painstaking accumulation of detail: differentiation. Assembly of the parts into a whole: integration. And what does math do well? Integration and differentiation! (This is not just a corny joke.)

Part I: Decisions and behavior, or Making the most of a stochastic process. (A macroscopic tale: integration) Underlying hypothesis: Human and animal behaviors have evolved to be (near) optimal. (Bialek et al., : Fly vision & steering )

A really simple decision task: “ On each trial you will be shown one of two stimuli, drawn at random. You must identify the direction (L or R) in which the majority of dots are moving. ” The experimenter can vary the coherence of movement (% moving L or R) and the delay between response and next stimulus. Correct decisions are rewarded. “ Your goal is to maximize rewards over many trials in a fixed period. ” You gotta be fast, and right! 30% coherence 5% coherence Courtesy: W. Newsome Behavioral measures: reaction time distributions, error rates. More complex decisions: buy or sell? Neural economics.

An optimal decision procedure for noisy data: the Sequential Probability Ratio Test Mathematical idealization: During the trial, we draw noisy samples from one of two distributions p L (x) or p R (x) (left or right-going dots). The SPRT works like this: set up two thresholds and keep a running tally of the ratio of likelihood ratios: When first exceeds or falls below, declare victory for R or L. Theorem: (Wald, Barnard) Among all fixed sample or sequential tests, SPRT minimizes expected number of observations n for given accuracy. p L (x)p R (x)

Interlude: a mathematical DDance: Take logarithms: multiplication in becomes addition. Take continuum limit: addition becomes integration. The SPRT becomes a drift-diffusion (DD) process (a cornerstone of 20th century physics): drift rate noise strength and is the accumulated evidence (the log likelihood ratio). When reaches either threshold, declare R or L the winner. But do humans (or monkeys, or rats) drift and diffuse? Evidence comes from three sources: behavior, neurons, and mathematical models.

Behavioral evidence: RT distributions Human reaction time data can be fitted nicely to the first passage threshold crossing times of a DD process. (Ratcliff et al., Psych Rev. 1978, 1999, 2004.) thresh. +Z thresh. -Z drift A

Neural evidence: firing rates Spike rates of neurons in oculomotor areas rise during stimulus presentation, monkeys signal their choice after a threshold is crossed. J. Schall, V. Stuphorn, J. Brown, Neuron, Frontal eye field recordings. J.I Gold, M.N. Shadlen, Neuron, Lateral interparietel area recordings. thresholds

Model evidence: integration of noisy signals We can model the decision process as the integration of evidence by competing accumulators. (Usher &McClelland, 1995,2001) Subtracting the accumulated evidence yields a DD process for. thresh. 1 thresh. 2 OK, maybe. But do humans (or monkeys, or rats) optimize?

Optimal decisions redux 1 RT DD The task: maximize your rewards for a succession of trials in a fixed period. Reward Rate: (% correct/average time for resp.) response-to-stimulus interval Threshold too low Too high Optimal D RT $ $ DDD D D D $$ XXX $$$ X DDD

Optimal decisions redux 2 How fast to be? How careful? The DDM delivers an explicit solution to the speed-accuracy tradeoff in terms of just 3 parameters: normalized threshold and signal-to-noise ratio and D. So, setting we can express RT in terms of ER and calculate a unique, parameter-free Optimal Performance Curve: RT/(total delay) = F(ER)

A behavioral test 1 Do people adopt the optimal strategy? Some do; some don ’ t. Is this because they are optimizing a different function, e.g. weighting accuracy more? Or are they trying, but unable to adjust their thresholds? OPC NOT TODAY A mathematical theory delivers precise predictions. Its successes and failures generate further precise questions, suggest new experiments.

A behavioral test, 2 A modified reward rate function with a penalty for errors gives a family of OPCs with an extra parameter: the weight placed on accuracy. (It fits the whole dataset better, but what’s explained?) Bottom line: Too much accuracy is bad for your bottom line. (Princeton undergrads don’t like to make mistakes.) OPC accuracy weight increasing data fit Short version: Holmes et al., IEICE Trans., Long version (182pp): in review,

Choosing a threshold Q: Suboptimal behavior could be reckless (threshold too low) or conservative (threshold too high)? Why do most people tend to be conservative? Could it be a rational choice? Which type of behavior leads to smaller losses? A: Examine the RR function. Slope on high threshold side is smaller than slope on low threshold side, so for equal magnitudes, conservative errors cost less. threshold threshold too high threshold too low

Thresholds and gain changes How might thresholds be adjusted ‘ on the fly ’ when task conditions change? Neurons act like amplifiers, transforming input spikes to output spike rates. Gain improves discrimination. (Servan-Schreiber et al., Science, 1990.) input output (spikes) gain threshold Neurotransmitter release can increase gain. Specifically, norepinephrine can assist processing and speed response in decision tasks, collapsing the multilayered brain to a single near-optimal DD process.

Part II: Spikes and gain changes, or Let them molecules go! (A microscopic tale: differentiation.) Underlying hypotheses: Threshold and gain changes in the cortex are mediated by transient spike dynamics in brainstem areas. Transients determined by inherent circuit properties and stimuli. (Aston-Jones & Cohen, )

A tale of the locus coeruleus (LC) The LC, a neuromodulatory nucleus in the brainstem, releases norepinephrine (NE) widely in the cortex, tuning performance. The LC has only ~ 30,000 neurons, but they each make ~ 250,000 synapses. Transient bursts of spikes triggered by salient stimuli cause gain changes, thus bigger response to same stimulus. Devilbiss and Waterhouse, Synapse, 2000 Aston-Jones & Cohen, Ann. Rev. Neurosci., same stimulus

LC dynamics: tonic and phasic states In waking animals, the LC ‘ spontaneously ’ flips between two states: tonic (fast average spike rate, poor performance) and phasic (slow average spike rate, good performance). Tonic: small transient resp. Phasic: big transient resp. Spike histograms (PSTHs) Usher et al., Science, Transients are crucial: the LC delivers NE just when it ’ s needed.

Modeling LC neurons 1 Hodgkin & Huxley ( J. Physiol., 1952 ) developed a biophysical model of a single cell. Charged ions pass through the cell membrane via gates. Electric circuit equations + gating models fitted to data describe the dynamics. The HH model (for squid giant axon) has been generalized to many types of neurons. It ’ s a keystone of neuroscience; it describes the spikes beautifully, but the equations are really nasty! Rose and Hindmarsh, Proc. R. Soc. Lond. B., However, …… LC cells are spontaneous spikers and we can use this to reduce the HH equations to a simple phase model. Voltage

Modeling LC neurons 2 In phase space, periodic spiking is a closed curve: fire So we may change to ‘ clock face ’ coordinates that track phase -- progress through the firing cycle -- and by marking time in a nonuniform manner, we collapse HH to simply: Voltage Ion gate

Modeling LC neurons 3 Well, it ’ s not quite that simple: External inputs, stimuli and synaptic coupling from other cells, are all ‘ filtered ’ through the phase response curve (PRC), which describes inherent oscillator properties: but given this, we can compute their effects. (external stimuli speed up the spikes most at 9 o ’ clock) And we can find the PRC:

Modeling LC neurons 4 There are many such oscillating ‘ clocks ’ in LC, and the stimulus reorders and coordinates their random phases. The size of this effect depends upon the intrinsic frequency. Tonic LC: fast on average, gives a small burst. Phasic LC: slow on average, gives a big burst.

Modeling LC neurons 5 Adding noise and weak coupling, we can match the experimental PSTH data. After stimulus ends, noise and random frequencies redistribute the phases. decay and reset

Comparison with LC PSTH data Matching the PSTHs reveals that intrinsic frequency and its variability and stimulus duration are key parameters. 1.Slower oscillators deliver bigger coherent bursts. 2.Burst envelopes decay exponentially. 3. Depressed firing rates follow short stimuli. (Brown et al., J. Comp. Neurosci ) The latter may be responsible for attentional blink. (Niewenhuis et al., J. Exp. Psych ) data model theory simulations

Summary and Morals 1.Neural activity in simple decisions is like a DD process: the model predicts optimal speed-accuracy tradeoffs. 2. Threshold adjustments can optimize rewards. 3. The LC-NE system provides a control mechanism: the model reveals roles of intrinsic vs. stimulus properties. 4. There ’ s very pretty mathematics at all scales: stochastic ODE, dynamical systems, freshman calculus. 5.Large gaps remain: we must bridge the scales. Morals: Good mathematical models are not just (reasonably) faithful; they’re also (approximately) soluble. They focus and simplify. _____________________________________________________________________ Thanks for your attention!

Learning a threshold An algorithm based on reward rate estimates and a linear reward rate rule can make rapid threshold updates by iteration. But … Can RR be estimated sufficiently accurately? Can the rule be learned? Does noise cause overestimates? (Simen et al., 2005.) Threshold