Show Me the Money! Dmitry Kit. Outline Overview Reinforcement Learning Other Topics Conclusions.

Slides:



Advertisements
Similar presentations
Why do we move our eyes? - Image stabilization
Advertisements

Review: General Control Theory
Cortically OriginatedMovement Cortically Originated Movement I. Motor Tract (corticospinal tractus) Extrapyramidal System (basal ganglia) Cerebellum Praxis.
Visual Attention Attention is the ability to select objects of interest from the surrounding environment A reliable measure of attention is eye movement.
Pre-frontal cortex and Executive Function Squire et al Ch 52.
Charles Pierrot-Deseilligny Dan Milea René M. Müri Eun H. Kim February 21, 2008 Eye Movement Control by the Cerebral Cortex.
1.Exams due 9am 16 th. (grades due 10am 19 th ) 2.Describe the organization of visual signals in extra-striate visual cortex and the specialization of.
Covert Attention Mariel Velez What is attention? Attention is the ability to select objects of interest from the surrounding environment Involuntary.
Attention Wolfe et al Ch 7, Werner & Chalupa Ch 75, 78.
BASAL NUCLEI (Basal Ganglia).
The Basal Ganglia Maryann Martone, Ph. D. NEU257 2/22/2011.
Basal Ganglia and Motor Control L21
Control of Attention and Gaze in the Natural World.
Basal Ganglia, Cerebellum and Movement
THE BRAIN’S CONTROL OF HORIZONTAL SACCADIC EYE MOVEMENTS Shirley H. Wray, M.D., Ph.D.
BASAL GANGLIA Prof. Ashraf Husain Basal Ganglia Structure. Functions. Metabolic features. Connections. Disorders.
The Nervous System A network of billions of nerve cells linked together in a highly organized fashion to form the rapid control center of the body. Functions.
Introduction to Neurobiology Lecture 2: “Structure of the Nervous System-Basic concept in neuroanatomy-”
Neural Control of Muscle. Quick Review: Spinal Cord Flexors Extensors.
Assess Prof. Fawzia Al-Rouq Department of Physiology College of Medicine King Saud University Functional Anatomy of the Nervous System.
Dementia. Definition Loss of function in multiple cognitive abilities Assuming the individual had normal abilities before the onset Many of the 70 recognized.
The Basal Ganglia.
Neural circuits for bias and sensitivity in decision-making Jan Lauwereyns Associate Professor, Victoria University of Wellington, New Zealand Long-term.
Cortical vs. subcortical loops. Lateral inhibition in striatum.
Making Things Happen - 2 Motor Disorders How Your Brain Works - Week 7 Dr. Jan Schnupp HowYourBrainWorks.net.
Prediction in Human Presented by: Rezvan Kianifar January 2009.
The Basal Ganglia. I.Functional anatomy A. Input and output components cerebral cortex  BG  thalamus (VA)  frontal lobe. B. Parallel circuits C. Neurotransmitters.
Neuroscience 1 Pause for thought: Probably 100 million neurones fire during skilled movements.
Michael S. Beauchamp, Ph.D. Assistant Professor Department of Neurobiology and Anatomy University of Texas Health Science Center at Houston Houston, TX.
PhD MD MBBS Faculty of Medicine Al Maarefa Colleges of Science & Technology Faculty of Medicine Al Maarefa Colleges of Science & Technology Lecture – 9:
Testing computational models of dopamine and noradrenaline dysfunction in attention deficit/hyperactivity disorder Jaeseung Jeong, Ph.D Department of Bio.
Sensorimotor systems Chapters 8.
Describe 2 kinds of eye movements and their function. Describe the specialized gaze patterns found by Land in cricket. Describe your results in the ball-catching.
Background The physiology of the cerebral cortex is organized in hierarchical manner. The prefrontal cortex (PFC) constitutes the highest level of the.
Central nervous system (CNS) Brain + Spinal Cord
Subject wearing a VR helmet immersed in the virtual environment on the right, with obstacles and targets. Subjects follow the path, avoid the obstacels,
Author(s): Peter Hitchcock, PH.D., 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Chapter 16. Basal Ganglia Models for Autonomous Behavior Learning in Creating Brain-Like Intelligence, Sendhoff et al. Course: Robots Learning from Humans.
The Role of the Basal Ganglia in Habit Formation Week 10 Group 1 Amanda Ayoub Kindra Akridge Barbara Kim Alyona Koneva.
Decision Making Theories in Neuroscience Alexander Vostroknutov October 2008.
A neural mechanism of response bias Johan Lauwereyns Laboratory of Sensorimotor Research National Eye Institute, NIH.
The Reward Factor in the Control of Action: A Neurophysiological Theory Johan Lauwereyns Victoria University of Wellington, New Zealand.
Saccadic Eye Movements: A New Diagnostic Tool with Eye-Opening Possibilities for FAS Research James N. Reynolds Department of Pharmacology & Toxicology,
A new neural framework for visuospatial processing Group #4 Alicia Iafonaro Alyona Koneva Barbara Kim Isaac Del Rio.
Primary Cortical Sub-divisions The mapping of objects in space onto the visual cortex.
Figure 18.1 Motor components of the human basal ganglia.
Copyright © 2007 Wolters Kluwer Health | Lippincott Williams & Wilkins Neuroscience: Exploring the Brain, 3e
Laurent Itti: CS564 - Brain Theory and Artificial Intelligence. Saccades 1 1 L. Itti: CS564 - Brain Theory and Artificial Intelligence University of Southern.
Eye Movements – Target Selection & Control READING Schall JD (2002) The neural selection and control of saccades by frontal eye field. Philosophical Transactions.
What is meant by “top-down” and “bottom-up” processing? Give examples of both. Bottom up processes are evoked by the visual stimulus. Top down processes.
What is meant by “top-down” and “bottom-up” processing? Give examples of both. Bottom up processes are evoked by the visual stimulus. Top down processes.
Neural Circuitry underlying generation of saccades and pursuit Lab # 1 - Catching a ball - What to expect/ think about.
BRAIN AND BEHAVIOR. WHY DO PSYCHOLOGISTS STUDY THE NERVOUS SYSTEM? The nervous system is the direct source of all behavior The nervous system is shaped.
Does the brain compute confidence estimates about decisions?
Introduction to Neurobiology Lecture 2: “Structure of the Nervous System; Basic concept in neuroanatomy” 1.
Organization of Neural Structures Involved in the Control of Movement
Comparing Single and Multiple Neuron Simulations of Integrated Dorsal and Ventral Striatal Pathway Models of Action Initiation Selin Metin1, Neslihan Serap.
Volume 62, Issue 5, Pages (June 2009)
Zhejiang University Ling Shucai
Volume 92, Issue 5, Pages (December 2016)
Basal ganglia movement modulation
Organization of Neural Structures Involved in the Control of Movement
Basal ganglia function
Basal Ganglia.
Volume 62, Issue 5, Pages (June 2009)
Volume 75, Issue 5, Pages (September 2012)
Circuitry of self-control and its role in reducing addiction
Neuromodulation of Attention
Social Attention and the Brain
Volume 75, Issue 5, Pages (September 2012)
Presentation transcript:

Show Me the Money! Dmitry Kit

Outline Overview Reinforcement Learning Other Topics Conclusions

Learning Models Hebbian Learning  Strengthens the relationship between neurons that exhibit similar activity patterns and/or in close proximity  Might Explain Topological Features of the Brain Population Coding Basis function learning Area allocation to different functions Reinforcement Learning (RL)  Strengthens the relationship between choices that are causally connected in obtaining some reward

RL Framework and the Brain Reward Signal Representation  Dopamine Local Action selection structures  Lateral Intra-parietal Area (LIP)  Supplementary Eye Field (SEF)  Frontal Eye Field (FEF) Global Mechanism for action selection  Basal Ganglia

Outline Overview Reinforcement Learning  Reward Signal (Dopamine)  Decision Variables (SEF, LIP, Other)  Global Mechanism for Choice (Basal Ganglia) Other Topics Conclusions

Reward Signal (Dopamine) Located in Nigra Pars Compacta (SNc) Modulates neurons in many different regions Tonic low frequency activity Only sends the error signal between expected and actual rewards

RL and Dopamine

Decision Variables in LIP Contains neurons that code for:  expected gain  relative rewards between different actions This activity was observed to be before the choices were actually presented and movement was made  Suggesting that these neurons were used to decide on appropriate action

LIP Neuron Activity Expectation of:  High reward produced high firing frequency (black line)  Low reward produced high firing frequency (gray line) The firing rate was correlated with gain expectation early in the trial

Overall Neural Activity in LIP A large portion of examined neurons showed a significant activity related to gain expectation, outcome probability and estimated value  These were mostly exhibit in the early part of the trial These neurons were also modulated by the actual movement

Neural Features of SEF Three types of neurons found  Active upon failure to perform task Not responsible for executing actions Not related to spatial stimuli Possible error signal coding  Active upon success Not a response to visual stimuli Not responsible for motor control Related to some internal coding of performance  Active before and during the delivery of reinforcement Possibly interconnected to other regions of the brain Seem to code expected reward versus actual reward received

Function of SEF Monitoring and controlling:  Perception and production systems during decision making  Error Correction  Production of responses that are not well-learned  Overcoming habitual responses Evidence:  Neurons do not generate eye movements  Monitor performance and reward

Reward Coding in Other Structures Neurons in orbitofrontal cortex show:  Selectivity to the type of physical reward Solid Liquid Etc.,  Distinguish between rewards and punishers Some neurons in amygdala respond to magnitude of reward

Local Choices Multiple areas in charge of decision making  Frontal Eye Field (FEF)  LIP  Supplementary Eye Field (SEF)  Etc., Might have different goals Need a global mechanism to arbitrate between these different goals

Physiology (Basal Ganglia) Located at the base of the cerebrum Consists of:  Caudate Nucleus (CD) and putamen (PUT) (collectively called striatum) Input from cerebral cortex and part of the thalamus  Globus pallidus External Segment (GPe) Internal Segment (GPi)  Subthalamic Nucleus (STN) Receives direct input from cerebral cortex  Substantia Nigra Pars reticulate (SNr) Pars compacta (SNc)  Output Stations (GPi and SNr) To thalamus and brain stem motor areas

Anatomical Locations (BG)

BG: Function Controls  Thalamocortical networks Mainly involved in hand or arm movements  Brain stem motor networks Superior Colliculus  eye-head orienting the pedunculopontine nucleus  locomotion periaqueductal gray  vocalization autonomic responses

BG-SC connection Exists in many lower mammals Method of control  CD inhibits neural activity in SNr  SNr projects inhibitory connections to SC Inhibition is the main method of control Appropriate action is selected by inhibiting all except the desired action

Neural Properties of BG Contains memory-guided neurons Contains neurons that code expectation of task specific events SNr  Only effected by planned movements  Response fields of neurons is the same to those they connect to in SC

Circuit Diagram

Coordinated Activity Model Use GPe to select just the activity you need (Focus) Use STN to inhibit a planned future activity (Sequencing) Might be an incorrect model if we emphasize the direct cortical input to the STN  Direct control over movement suppression

Learning of Sequential Procedures Frontoparietal association cortices and anterior part of the basal ganglia learn new sequences  Uses visuospatial coordinates Motor-premotor cortices and the mid- posterior part of the basal ganglia exploit learned sequences  Uses motor coordinates

BG and Decision Making Ventral striatum receives input from neocortical areas (cognition) and limbic (emotional) areas Speed of saccades are related to emotional or motivational state As with SEF and LIP many BG neurons respond to the expectation of reward Uses dopaminergic neurons to:  Modulate selectivity of individual neurons  Modulate response magnitude of individual neurons

Circuit Diagram Revisited

Consequences of BG Disorders Involuntary movement Random movement  Visually guided saccades Shorter saccades Problems with coordinated movements Responds deficit to memory-guided saccades Trouble holding fixation  Especially if STN is damaged Inability to learn sequential procedures Lack of motivation to perform actions

Why Disinhibition? Possibly an evolutionary by-product Need a gating mechanism not an enhancement mechanism

Outline Overview Reinforcement Learning Other Topics Attention Vs. Reward Credit Assignment Problem Conclusions

Attention Or reward? Attention is a more global concept than reward  Defined as the study of vigilance, selective processing of stimuli, and control systems for complex behavior Attention can modulate neurons before the onset of stimuli, just like reward expectation neurons Attention is dependant on task difficulty How does one distinguish between reward expectation signal and attention to a particular stimuli at a single neuron?  Some studies of attention might have been looking at the same neural signal as those studying reward Provide better definitions for reward and attention  Attention might be defined in terms of rewards

The Credit Assignment Problem What chain of actions resulted in reward? Which of the action to the right got you your steak? Action/order1st2nd3rd4th5th Take a car Take a bike Open Door Sit down Shift in seat Point at Menu Item Tapped the table

Solution Start from the point of receiving the reward  Recently shown that in rats the hippocampus replays their daily experiences backwards Back propagate the reward at a discounted rate  In monkeys the neurons coding for reward expectation decreased their activity, when the delay between the cue and reward was increased Converges to optimal policy if:  Any action has some positive probability of being chosen  Infinite time

Infinite Time Humans do not need an optimal policy The inherent randomness in our environment, behaviors, and tasks make it unlikely that a set of truly unrelated actions coincide frequently

Conclusion Choose a set of actions (eg., SEF, LIP, etc.,) Execute a subset of actions that do not violate physical limitations (Basal Ganglia) Compare the final result against the expected result (Dopamine) Try to do the task again Almost every task that we execute uses the eyes to locate the target of interest and therefore it is not surprising that the eye is closely related to the current task. Might be a huge oversimplification: “Correlation does not imply causation”  More experiments are needed to show these relationships

The End Thank You