Perceptual Learning, Roving and the Unsupervised Bias By Aaron Clarke, Henning Sprekeler, Wolfram Gerstner and Michael Herzog Brain Mind Institute École.

Slides:



Advertisements
Similar presentations
Flexible Shaping: How learning in small steps helps Hierarchical Organization of Behavior, NIPS 2007 Kai Krueger and Peter Dayan Gatsby Computational Neuroscience.
Advertisements

Modelling the classic Attentional Blink and its emotional variant Nikos Fragopanagos* & John Taylor** *Sponsored by the BBSRC **Sponsored by the BBSRC.
On the Effects of Immediate Feedback Ido Erev, Adi Luria & Annan Erev Technion Israeli Institute of Technology The Max-Wertheimer Minerva Center for Cognitive.
Thomas Trappenberg Autonomous Robotics: Supervised and unsupervised learning.
Introduction to Artificial Neural Networks
1 st Level Analysis: design matrix, contrasts, GLM Clare Palmer & Misun Kim Methods for Dummies
Success Factors in New Technology Implementation Pat Holahan Kai Wang Howe School of technology Management Stevens Institute of Technology HSATM Roundtable.
Experiment Basics: Variables Psych 231: Research Methods in Psychology.
Martyn Clark Short course on “Model building, inference and hypothesis testing in hydrology” May, 2012.
© 2012 Aptima, Inc. The Science of Game-based Training Effectiveness 29 March 2012 Krista Langkamer Ratwani Kara L. Orvis.
Computational Analysis of Motor Learning. Three paradigms Force field adaptation Visuomotor transformations Sequence learning Does one term (motor learning)
1 Part I Artificial Neural Networks Sofia Nikitaki.
Reward-based decision making under social interaction Damon Tomlin MURI Kick-Off meeting September 13, 2007.
Neural Networks Dr. Peter Phillips. Neural Networks What are Neural Networks Where can neural networks be used Examples Recognition systems (Voice, Signature,
Abstract Cognitive control processes reduce the effects of irrelevant or misleading information on performance. We report a study suggesting that effective.
Theoretical Models of Attention. Broadbent (1958) conceptualized attention as information processing Used a cuing paradigm to show that attentional selection.
Goals of Adaptive Signal Processing Design algorithms that learn from training data Algorithms must have good properties: attain good solutions, simple.
What is Cognitive Science? … is the interdisciplinary study of mind and intelligence, embracing philosophy, psychology, artificial intelligence, neuroscience,
Theoretical Models of Attention. Broadbent (1958) conceptualized attention as information processing Used a cuing paradigm to show that attentional selection.
Artificial Neural Networks (ANNs)
Disorders of Attention Orienting
Chapter 5 Data mining : A Closer Look.
Experimental Design Tali Sharot & Christian Kaul With slides taken from presentations by: Tor Wager Christian Ruff.
Improving auditory skills through training Beverly A. Wright Northwestern University Knowles Hearing Center.
Modeling Item Difficulty for Perceptual Speed and Accuracy Tests Stephanie Taylor, MA Alan Mead, PhD.
1 st Neural Network: AND function Threshold(Y) = 2 X1 Y X Y.
Reinforcement learning This is mostly taken from Dayan and Abbot ch. 9 Reinforcement learning is different than supervised learning in that there is no.
Artificial Intelligence Techniques Multilayer Perceptrons.
CS344 : Introduction to Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 26- Reinforcement Learning for Robots; Brain Evidence.
Computational Modeling Lab Wednesday 18 June 2003 Reinforcement Learning an introduction Ann Nowé By Sutton and.
Modelling Language Evolution Lecture 1: Introduction to Learning Simon Kirby University of Edinburgh Language Evolution & Computation Research Unit.
© 2001 Dr. Laura Snodgrass, Ph.D.1 Conducting Experiments Choosing methods Sampling and sample size Independent variables Dependent variables Controls.
Operant Conditioning of Cortical Activity E Fetz, 1969.
Methodology of Simulations n CS/PY 399 Lecture Presentation # 19 n February 21, 2001 n Mount Union College.
What is modularity good for? Michael S. C. Thomas, Neil A. Forrester, Fiona M. Richardson
Additional Statistical Investigations A paired t-test was performed to evaluate whether a perceptual learning process occurs between the initial baseline.
1 Challenge the future Longitudinal Driving Behavior in case of Emergency situations: An Empirically Underpinned Theoretical Framework Dr. R.(Raymond)
Localization of Auditory Stimulus in the Presence of an Auditory Cue By Albert Ler.
Training Phase Results The RT difference between gain and loss was numerically larger for the second half of the trials than the first half, as predicted,
©2010 John Wiley and Sons Chapter 3 Research Methods in Human-Computer Interaction Chapter 3- Experimental Design.
Results Introduction Nonconditional Feedback Selectively Eliminates Conflict Adaption Summary Methods 38 participants performed a parity judgment task.
Neural Networks Steven Le. Overview Introduction Architectures Learning Techniques Advantages Applications.
Role of Working Memory in Visual Selective Attention de Fockert, Rees, Frith, Lavie (2001)
The effects of working memory load on negative priming in an N-back task Ewald Neumann Brain-Inspired Cognitive Systems (BICS) July, 2010.
GALLERY WALK. GALLERY WALK: CHALLENGES & SOLUTIONS 1.Form groups by area. 2.Visit one poster at a time. Read the challenge and brainstorm solutions. Write.
Supervised Learning. Teacher response: Emulation. Error: y1 – y2, where y1 is teacher response ( desired response, y2 is actual response ). Aim: To reduce.
Effects of Financial Incentives on Herding in Simulated Financial Markets Maria Andersson Martin Hedesström Tommy Gärling University of Gothenburg, Göteborg,
Longer or Shorter? An examination of line length discrimination using manipulated feedback Amanda Hostiuc, Alicia Kim, Melanie Laking, and Matt Pachai.
Perception of Danger Signals: The Role of Control Jochen Brandtstadter, Andreas Voss, and Klaus Rothermund.
SLA Effects of Recasts as Implicit Knowledge Young-ah Do Fall, College English Education.
Perceptrons Michael J. Watts
How Recurrent Dynamics Explain Crowding Aaron Clarke & Michael H. Herzog Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale.
Structuring the Learning Experience Chapter 9. Objectives Discuss the concept of practice structure and explain its importance to goal achievement and.
Neural correlates of risk sensitivity An fMRI study of instrumental choice behavior Yael Niv, Jeffrey A. Edlund, Peter Dayan, and John O’Doherty Cohen.
Does the brain compute confidence estimates about decisions?
Intro. ANN & Fuzzy Systems Lecture 11. MLP (III): Back-Propagation.
Dynamics of Reward Bias Effects in Perceptual Decision Making Jay McClelland & Juan Gao Building on: Newsome and Rorie Holmes and Feng Usher and McClelland.
Kimron Shapiro & Frances Garrad-Cole The University of Wales, Bangor
Change Score Analysis versus ANCOVA in Pretest/Posttest Designs:
FUNDAMENTAL CONCEPT OF ARTIFICIAL NETWORKS
Amie Fairs, Sara Bögels, Antje S. Meyer
Using Time-Varying Motion Stimuli to Explore Decision Dynamics
بسمه تعالی کارگاه ارزشیابی پیشرفت تحصیلی
Supervised vs. unsupervised Learning
3.1.1 Introduction to Machine Learning
Controlling BOH4M.
Disruption of Perceptual Learning by a Brief Practice Break
Active avoidance is impaired in aged mice in both multimodal and visual versions of the task. Active avoidance is impaired in aged mice in both multimodal.
JJ Orban de Xivry Hands on session JJ Orban de Xivry
Impaired Associative Learning with Food Rewards in Obese Women
Presentation transcript:

Perceptual Learning, Roving and the Unsupervised Bias By Aaron Clarke, Henning Sprekeler, Wolfram Gerstner and Michael Herzog Brain Mind Institute École Polytechnique Fédérale De Lausanne Switzerland

Talk Outline Perceptual Learning & Roving The Unsupervised Bias Critical Experiment

Perceptual Learning

Block Number d'

Talk Outline Perceptual Learning & Roving The Unsupervised Bias Critical Experiment

Roving 1200” Learning Task 1

Roving 1200” 1800” Learning Task 1 Learning Task 2

Roving Adapted from Tartaglia, Bamert, Mast & H. Herzog (2009)

Hypotheses Roving may disrupt memory-trace-buildup for the roved stimuli (Yu et al., 2004). Roving may diminish the stimuli’s predictability (Adini et al., 2004). Roving may prevent the participants from conceptually tagging each stimulus type in order to switch their attention to the appropriate perceptual template (Zhang et al., 2008).

Roving 1200” 1800” Learning Task 1 Learning Task 2

Hypotheses Roving may disrupt memory-trace-buildup for the roved stimuli (Yu et al., 2004). Roving may diminish the stimuli’s predictability (Adini et al., 2004). Roving may prevent the participants from conceptually tagging each stimulus type in order to switch their attention to the appropriate perceptual template (Zhang et al., 2008).

Talk Outline Perceptual Learning & Roving The Unsupervised Bias Critical Experiment

Talk Outline Perceptual Learning & Roving The Unsupervised Bias Critical Experiment

Model Predictions SupervisedUnsupervisedReward-Based No feedback Trial by trial feedback Error feedback Teacher signal Output Desired Output Error Input Output Desired Output Error Reward Feedback after many trials Error feedback Teacher signal i j Input Δw ij = pre i × e ij Δw ij = Cov(R,w ij ) + ‹R› ‹w ij ›Δw ij = pre i × post j

Δw ij = pre i × e ij Model Predictions UnsupervisedReward-Based No feedback Input Output Desired Output Error Reward Feedback after many trials Error feedback Teacher signal i j Supervised Trial by trial feedback Error feedback Teacher signal Input Output Desired Output Error Herzog & Fahle (1998) Feedback improves performance. Learning is possible without feedback Δw ij = Cov(R,w ij ) + ‹R› ‹w ij ›Δw ij = pre i × post j

Reward-Based Learning Δw ij = Cov(R,w ij ) + ‹R› ‹w ij › weight change Covariation between reward weight change Average reward Averages of past trialsReward & current activations

Reward-Based Learning Δw ij = Cov(R,w ij ) + ‹R› ‹w ij › weight change Covariation between reward weight change Average reward = 0 Averages of past trialsReward & current activations

Reward-Based Learning Δw ij = Cov(R 1 +R 2,w ij ) + ‹R 1 +R 2 › ‹w ij › weight change Covariation between reward weight change Average reward Averages of past trials Learning is impossible with two stimuli. Reward & current activations

Roving Adapted from Tartaglia, Bamert, Mast & H. Herzog (2009)

Talk Outline Perceptual Learning & Roving The Unsupervised Bias Critical Experiment

Hypothesis Roving impairs perceptual learning when the average reward for the two learned stimuli differs significantly. – This kind of situation occurs when the two roved tasks differ in their difficulty levels.

Roving 1200” 1800” Learning Task 1 Learning Task 2

Results H 0 : Mean Hard Slopes = 0: t(7) = , p = ” 1800” H 0 : Mean Easy Slopes = 0: t(7) = , p = Block Number d' Easy Hard

Results Block Number d' Easy Hard Block Number d' H 0 : Mean Non-Roved Slopes = 0: t(7) = 2.144, p = 0.035

Summary There are three types of learning models: supervised, unsupervised and reward-based. Only reward-based learning withstands empirical falsification, and it suffers from the unsupervised bias. When roving two tasks, easy and hard, learning fails, as can be shown mathematically. And that is why roving occurs empirically. A strange prediction from this is that roving a hard and a very easy task should deteriorate performance. Roving two hard tasks might make learning easier than roving a hard and an easy task, and this has actually been shown in other studies.

Thank for your attention.

When is Learning During Roving Successful? Vs. 150 ms500 ms Vs.

Experiment Used two stimuli: 1800” and 1200”. Measured pre-training thresholds for both stimuli in isolation. Trained subjects with fixed offsets (easy = 1.5 × pre- training threshold, hard = 0.9 × pre-training threshold). In 20 blocks of 80 trials. Roved stimuli. 1800” Easy 1200” Hard Easy 1200”

Other Hypotheses Roving may interact with the participants’ initial performance levels where worse initial performers learn more than high initial performers. Roving might cause low-level interference between stimulus types (Tartaglia et al., 2009; Zhaoping, Herzog, & Dayan, 2003).