1 Asymmetric Encoding of Positive and Negative Values in the Basal Ganglia Mati Joshua Avital Adler Hagai Bergman Hagai Bergman.

Slides:



Advertisements
Similar presentations
Reinforcement Learning I: prediction and classical conditioning
Advertisements

Learning, Volatility and the ACC Tim Behrens FMRIB + Psychology, University of Oxford FIL - UCL.
Lecture 12: olfaction: the insect antennal lobe References: H C Mulvad, thesis ( Ch 2http://
Computational Analysis of Motor Learning. Three paradigms Force field adaptation Visuomotor transformations Sequence learning Does one term (motor learning)
Attention as Information Selection. Early Selection Early Selection model postulated that attention acted as a strict gate at the lowest levels of sensory.
N = 6 males. no trick subjects apprised of drink type just before scan is this different from Urban? Is it different from Yoder?
Dopamine, Uncertainty and TD Learning CNS 2004 Yael Niv Michael Duff Peter Dayan Gatsby Computational Neuroscience Unit, UCL.
Fig. 1. Reward-based, free-choice task and monkey's performance. (A) Time chart of events that occurred during the task. (B) Diagram of large-reward probabilities.
The dopaminergic reward system
Introduction: What does phasic Dopamine encode ? With asymmetric coding of errors, the mean TD error at the time of reward is proportional to p(1-p) ->
Dopamine, Uncertainty and TD Learning CoSyNe’04 Yael Niv Michael Duff Peter Dayan.
Connected Populations: oscillations, competition and spatial continuum (field equations) Lecture 12 Course: Neural Networks and Biological Modeling Wulfram.
FIGURE 4 Responses of dopamine neurons to unpredicted primary reward (top) and the transfer of this response to progressively earlier reward-predicting.
Basal Ganglia. Involved in the control of movement Dysfunction associated with Parkinson’s and Huntington’s disease Site of surgical procedures -- Deep.
Neural circuits for bias and sensitivity in decision-making Jan Lauwereyns Associate Professor, Victoria University of Wellington, New Zealand Long-term.
Neuronal Coding in the Retina and Fixational Eye Movements Friday Seminar Talk November 6, 2009 Friday Seminar Talk November 6, 2009 Christian Mendl Tim.
In Vitro Analog of Operant Conditioning in Aplysia. I. Contingent Reinforcement Modifies the Functional Dynamics of an identified Neuron Interdisciplinary.
Michael S. Beauchamp, Ph.D. Assistant Professor Department of Neurobiology and Anatomy University of Texas Health Science Center at Houston Houston, TX.
Testing computational models of dopamine and noradrenaline dysfunction in attention deficit/hyperactivity disorder Jaeseung Jeong, Ph.D Department of Bio.
CS344 : Introduction to Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 26- Reinforcement Learning for Robots; Brain Evidence.
BURSTING Hasaeam Cho Bio-NanoStructure Lab of Prof. MC Choi.
Show Me the Money! Dmitry Kit. Outline Overview Reinforcement Learning Other Topics Conclusions.
Operant Conditioning of Cortical Activity E Fetz, 1969.
Chapter 16. Basal Ganglia Models for Autonomous Behavior Learning in Creating Brain-Like Intelligence, Sendhoff et al. Course: Robots Learning from Humans.
Dana Ballard - University of Rochester1 Distributed Synchrony: a model for cortical communication Madhur Ambastha Jonathan Shaw Zuohua Zhang Dana H. Ballard.
Biological Modeling of Neural Networks Week 6 Hebbian LEARNING and ASSOCIATIVE MEMORY Wulfram Gerstner EPFL, Lausanne, Switzerland 6.1 Synaptic Plasticity.
Theoretical Neuroscience Physics 405, Copenhagen University Block 4, Spring 2007 John Hertz (Nordita) Office: rm Kc10, NBI Blegdamsvej Tel (office)
A View from the Bottom Peter Dayan Gatsby Computational Neuroscience Unit.
Global plan Reinforcement learning I: –prediction –classical conditioning –dopamine Reinforcement learning II: –dynamic programming; action selection –Pavlovian.
What is meant by “top-down” and “bottom-up” processing? Give examples of both. Bottom up processes are evoked by the visual stimulus. Top down processes.
Two Mean Neuronal Waveforms Distribution of Spike Widths Interaction of Inhibitory and Excitatory Neurons During Visual Stimulation David Maher Department.
What is it? nervous system composition: 2 cell types neurons glia.
Electrophysiology & fMRI. Neurons Neural computation Neural selectivity Hierarchy of neural processing.
Information Processing by Neuronal Populations Chapter 6: Single-neuron and ensemble contributions to decoding simultaneously recoded spike trains Information.
Neural correlates of risk sensitivity An fMRI study of instrumental choice behavior Yael Niv, Jeffrey A. Edlund, Peter Dayan, and John O’Doherty Cohen.
Does the Nucleus Accumbens mediate Aversive Behavior? Clarissa Staton.
Comparing Single and Multiple Neuron Simulations of Integrated Dorsal and Ventral Striatal Pathway Models of Action Initiation Selin Metin1, Neslihan Serap.
Volume 62, Issue 5, Pages (June 2009)
Neuroimaging of associative learning
למידה, התניה ומח – מי עושה מה?
Neuroimaging of associative learning
Learning to Simulate Others' Decisions
The Orbitofrontal Oracle: Cortical Mechanisms for the Prediction and Evaluation of Specific Behavioral Outcomes  Peter H. Rudebeck, Elisabeth A. Murray 
Probabilistic Population Codes for Bayesian Decision Making
Volume 62, Issue 5, Pages (June 2009)
Feature- and Order-Based Timing Representations in the Frontal Cortex
Dopamine in Motivational Control: Rewarding, Aversive, and Alerting
Decoding Cognitive Processes from Neural Ensembles
Hyoung F. Kim, Hidetoshi Amita, Okihide Hikosaka  Neuron 
Translational Perspectives for Computational Neuroimaging
Ju Tian, Naoshige Uchida  Neuron 
Volume 91, Issue 6, Pages (September 2016)
Ronald Keiflin, Patricia H. Janak  Neuron 
Wallis, JD Helen Wills Neuroscience Institute UC, Berkeley
Ethan S. Bromberg-Martin, Masayuki Matsumoto, Okihide Hikosaka  Neuron 
Volume 95, Issue 5, Pages e5 (August 2017)
Neuroimaging of associative learning
Volume 75, Issue 5, Pages (September 2012)
Genela Morris, David Arkadir, Alon Nevet, Eilon Vaadia, Hagai Bergman 
Learning to Simulate Others' Decisions
Neuromodulation of Attention
Antagonistic but Not Symmetric Regulation of Primary Motor Cortex by Basal Ganglia Direct and Indirect Pathways  Ian A. Oldenburg, Bernardo L. Sabatini 
Volume 75, Issue 5, Pages (September 2012)
Proposed encoding of VLS MSNs
Characterization of non-DA firing.
Closed-loop experiments to probe the range of stimulus sensitivity.
Volume 27, Issue 1, Pages e6 (April 2019)
Orbitofrontal Cortex as a Cognitive Map of Task Space
Nonlinear transfer of signal and noise correlations in vivo.
Connectivity in the basal ganglia.
Presentation transcript:

1 Asymmetric Encoding of Positive and Negative Values in the Basal Ganglia Mati Joshua Avital Adler Hagai Bergman Hagai Bergman

2 Layout Introduction Introduction Experimental design Experimental design Reward vs. Aversive Reward vs. Aversive Comparing Basal Ganglia populations Comparing Basal Ganglia populations Summary Summary

3 Introduction - Anatomy Main axis Main axis –Mainly Feed forward –GABA. Glutamate –Ionotropic –MSN,GP,SNr Modulators Modulators –Lateral connections –DA, ACh, 5-HT –Metabotropic –TAN, SNc, DR Breakefield et al, Nature Review 2008

4 The Basal Ganglia as a soccer fan: Penalty Kick Orange fan Basal Ganglia Blue fan Basal Ganglia

5 Introduction – Function Fiorillo et al Science2003 Houk et al 1995 Probabilistic Classical conditioning 75% 25% Reinforcement Learning models Experimental results Dopaminergic (DA) neurons code error signal

6 See also: Fiorillo et. al. Science 2003 Satoh et. al JNS 2003 Pan et. al JNS 2005 Bayer et. al Neuron 2005 Morris et. al. Neuron 2004 Floor effect does not enable symmetric encoding of prediction errors by DA neurons RewardNo Reward Time(ms) 0400 Time(ms) 0400

7 Research objectives Positive vs. Negative values in the Basal Ganglia Positive vs. Negative values in the Basal Ganglia –Encoding of aversive events Comparing neuro-modulators and main axis Comparing neuro-modulators and main axis Reward+AversiveReward only

8 Probabilistic classical conditioning with aversive and reward

9 Monkey is expecting food and air-puff

10 Extracellular recording of Basal Ganglia activity Modulators Main axis

11 The Basal Ganglia database Inclusion criteria: Analysis database: 2 monkeys (L and S) 890 neurons ( TANs-180; SNc-106; SNr-145; GPe-310; GPi-149) Mean Isol. Score = 0.91 (TANs:0.91; SNc:0.78; SNr,GPe, GPi:0.95) Mean fraction of ISIs < 2 ms = (~2/1000) Mean recording time = 56 minutes Mean recording trials= 320 trials Joshua M; Elias; Levine and Bergman, Quantifying the isolation quality of extracellularly recorded action potentials, J. of Neuroscience Methods, 163(2):267-82, 2007 Fraction of ISIs < 2 ms < 0.02 Stable Recording time > 20 minutes Isolation score > 0.8 (SNc>0.5) Recording database : 2002 units

12 Example - GPe

13 Population response to cue Reward > Aversive Reward > Aversive Aversive = Neutral Aversive = Neutral High reward probability> low reward probability High reward probability> low reward probability SNc positive response to aversive expectation SNc positive response to aversive expectation

14 All populations But But –Outliers –Cancellation of opposite effects Single cell analysis Single cell analysis SNc

15 Response index analysis Response index = abs (reward/aversive – neutral) Encoding expectation: Reward > Aversive Example Population Reward Response index (Spike/s) Aversive Response index (spike/s)

16 Encoding expectation: Reward > Aversive

17 Encoding probability: Reward > Aversive Probability coding index analysis Example Population Probability Coding index = abs (high – low)

18 Encoding probability: Reward > Aversive

19 Mid - summary Basal Ganglia encodes positive expectations Basal Ganglia Not BG Cerebellum? Amygdala?

20 Modulator phasic response vs. main axis sustained response Main Axis Modulators

21 Homogenous vs. Diverse Main Axis Modulators Increase/Decrease analysis Response correlation Reward Aversive

22 Summary main axis vs. modulators Modulators Modulators –Fast phasic homogeneous response –Coding the difference between expectations and reality Main Axis Main Axis –Sustained and diverse response –Coding the value of the current state Critic(error)Actor(value) ShortSustained Response pattern HomogenousDiverse Response type ++ Cue probability coding Opposite cue trend - Outcome probability coding

23 From cue to outcome Reminder – DA encoding of reward omission is limited Reminder – DA encoding of reward omission is limited –Learning values with limited RL error is impaired (Mitelman et. al. IBAGS IX) Morris et. al. Neuron 2004

24 Population analysis of the Basal Ganglia modulators - SNc Same polarity to aversive and reward events – More than reward error signal. Same polarity to aversive and reward events – More than reward error signal. No difference between omissions No difference between omissions Time(s) Spike/s

25 Population analysis of the Basal Ganglia modulators -TAN n=180 TANs Time (s ) Responses to cue are similar Responses to cue are similar Large difference in the no outcome epoch Large difference in the no outcome epoch Time(s) Spike/s

26 Modulators have complementary response SNc encodes cue but not omission SNc encodes cue but not omission TAN encode omission but not cue TAN encode omission but not cue Modulators encode more than reward error Modulators encode more than reward error

27 Summary Reward >> Aversive Reward >> Aversive Main axis vs. modulators Main axis vs. modulators TAN vs. SNc TAN vs. SNc Na ï ve model Cerebellum ? Amygdala? Value segregated Value-error Multiple errors

28 Future directions I Event related correlation Reward Aversive ITI not Correlated ITI Correlated

29 Future directions II Learning new cues 66% 33% SNr cell

30 Thanks Avital Adler Avital Adler Boris Rosin Boris Rosin Hagai Bergman Hagai Bergman Inna Finkes Inna Finkes Rea Mitelman Rea Mitelman Yael Renernt Yael Renernt Yifat Prut Yifat Prut All the BG lab All the BG lab

31

32 All population analysis Main axis Modulators

33 Monkey is expecting food and air-puff

34 Response index - modulators Cue OutcomeNo Outcome

35 Summary Main Axis vs. Modulators OmissionOutcomeCue 0+++reward 0---aversive OmissionOutcomeCue-++reward +--aversive Value – SustainedError -Phasic