Lectures 14: Instrumental Conditioning (Basic Issues) Learning, Psychology 5310 Spring, 2015 Professor Delamater.

Slides:



Advertisements
Similar presentations
Schedules of reinforcement
Advertisements

Operant Conditioning Skinner, positive & negative reinforcement, response cost, punishment and schedules of reinforcement.
January 26 th, 2010 Psychology 485.  History & Introduction  Three major questions: What is learned? Why learn? How does learning happen?
Instrumental Conditioning Also called Operant Conditioning.
Lecture 11: Pavlovian Conditioning (Associative Content) Learning, Psychology 5310 Spring, 2015 Professor Delamater.
Lecture 17: Instrumental Conditioning (Associative Structures) Learning, Psychology 5310 Spring, 2015 Professor Delamater.
Psychology 5310: Learning Lecture 1 Spring, 2015 Professor Delamater.
Operant or Instrumental Conditioning Psychology 3306.
Instrumental Learning A general class of behaviors inferring that learning has taken place.
Lecture 20: Extinction (Pavlovian & Instrumental) Learning, Psychology 5310 Spring, 2015 Professor Delamater.
More Instrumental (Operant) Conditioning. B.F. Skinner Coined the term ‘Operant conditioning’ Coined the term ‘Operant conditioning’ The animal operates.
Lecture 18&19: Stimulus Control (Pavlovian & Instrumental) Learning, Psychology 5310 Spring, 2015 Professor Delamater.
Lecture 21: Avoidance Learning & Punishment Learning, Psychology 5310 Spring, 2015 Professor Delamater.
Lectures 7&8: Pavlovian Conditioning (Determining Conditions) Learning, Psychology 5310 Spring, 2015 Professor Delamater.
PSY 402 Theories of Learning Chapter 7 – Behavior & Its Consequences Instrumental & Operant Learning.
Copyright © 2005 Pearson Education Canada Inc. Learning Chapter 5.
Instrumental Learning All Learning where an animal operates on its environment to obtain a reinforcement. Operant (Skinnerian) conditioning.
Instrumental Conditioning: Foundations. Name Game Instrumental: subject instrumental in producing outcome Operant: subject operates on environment to.
PSY 402 Theories of Learning Chapter 10 – A Synthetic Perspective on Instrumental Learning.
OPERANT CONDITIONING DEF: a form of learning in which responses come to be controlled by their consequences.
Learning.
Operant vs. Classical Operant responses understood by comprehending the consequences they produce vs. Pavlovian responses may have environmental effects,
Learning Part II. Overview Habituation Classical conditioning Instrumental/operant conditioning Observational learning.
Lectures 12 & 13: Pavlovian Conditioning (Learning-Performance) Learning, Psychology 5310 Spring, 2015 Professor Delamater.
Operant Conditioning Unit 4 - AoS 2 - Learning. Trial and Error Learning An organism’s attempts to learn or solve a problem by trying alternative possibilities.
Chapter 6: Learning. Classical Conditioning Ivan Pavlov Terminology –Unconditioned Stimulus (UCS): evokes an unconditioned response without previous conditioning.
Chapter 6: Learning. Classical Conditioning Ivan Pavlov Terminology –Unconditioned Stimulus (UCS) –Conditioned Stimulus (CS) –Unconditioned Response (UCR)
Learning.
EXPLORING PSYCHOLOGY EIGHTH EDITION IN MODULES David Myers PowerPoint Slides Aneeq Ahmad Henderson State University Worth Publishers, © 2011.
OPERANT CONDITIONING Changing Behavior Through Reinforcement and Punishment.
Chapter 6 Learning. Table of Contents Classical conditioning Ivan Pavlov Terminology –Unconditioned Stimulus (UCS) –Conditioned Stimulus (CS) –Unconditioned.
B.F. SKINNER - "Skinner box": -many responses -little time and effort -easily recorded -RESPONSE RATE is the Dependent Variable.
Chapter 6 Learning. Table of Contents Learning Learning defined on page –Classical conditioning –Operant/Instrumental conditioning –Observational learning.
© 2013 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner.
Operant Conditioning Unit 4 - AoS 2 - Learning. Trial and Error Learning An organism’s attempts to learn or solve a problem by trying alternative possibilities.
Chapter 6: Learning 1Ch. 6. – Relatively permanent change in behavior due to experience 1. Classical Conditioning : Pairing 2. Operant Conditioning :
PSY 402 Theories of Learning Chapter 10 – A Synthetic Perspective on Instrumental Learning.
OPERANT CONDITIONING. DIFFERENT FROM CLASSICAL CLASSICAL: Experimenter presents UCS and CS and then observes the behavior CLASSICAL: Experimenter presents.
Unit 4: Learning “Operant Conditioning”. Behaviorism To a Behaviorist: Everything you know, everything you are is the result of human behavior. Psychology.
Table of Contents CHAPTER 6 Learning. Table of ContentsLEARNING  Learning  Classical conditioning  Operant/Instrumental conditioning  Observational.
PSY402 Theories of Learning Chapter 6 – Appetitive Conditioning.
Copyright McGraw-Hill, Inc Chapter 5 Learning.
Operant Conditioning A learning process by which the likelihood of a particular behaviour occurring is determined by the consequences of that behaviour.
Operant Conditioning. Operant Conditioning – A form of learning in which voluntary responses come to be controlled by their consequences. What does this.
OPERANT CONDITIONING. DIFFERENT FROM CLASSICAL CLASSICAL: Experimenter presents UCS and CS and then observes the behavior CLASSICAL: Experimenter presents.
Module 10 Operant & Cognitive Approaches. Thorndike’s Law of Effect l Behaviors followed by positive consequences are strengthened while behaviors followed.
Operant conditioning (Skinner – 1938, 1956)
Experimental Psychology PSY 433 Chapter 9 Conditioning and Learning.
Chapter 6 Learning and Behavior Learning n A more or less permanent change in behavior that results from experience.
Steven I. Dworkin, Ph.D. 1 Basic Principles of Operant Conditioning Chapter 6.
Warm-Up You eat a new food and then get sick because of the flu. However, you develop a dislike for the food and feel nauseated whenever you smell it.
Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.
Chapter 11 Inferences and the Representations of Knowledge in Operant Conditioning.
Operant Conditioning. Learning when an animal or human performs a behavior, and the following consequence increases or decreases the chance that the behavior.
PSY402 Theories of Learning Chapter 4 – Appetitive Conditioning.
CHS AP Psychology Unit 6: Learning (Behaviorism) Essential Task 6.3: Predict the effects of operant conditioning with specific attention to (primary, secondary,
Table of Contents Chapter 6 Learning. Table of Contents Learning –Classical conditioning –Operant/Instrumental conditioning –Observational learning Ivan.
Edward Thorndike Acquisition of Goal-directed responses. Law of Effect - behavior occurs in a random, trial-and-error fashion. Consequences of a behavior.
Making Responses for your Reinforcer. Instrumental Conditioning Procedures There are 4 basic types of instrumental conditioning Table 5.1 Instrumental.
Operant or Instrumental Conditioning
Psychology 3510: Learning Lecture 1
Learning.
Module 20 Operant Conditioning.
PSY402 Theories of Learning
Operant Conditioning Unit 4 - AoS 2 - Learning.
Learning from our behavior’s consequences
UNIT 4 BRAIN, BEHAVIOUR & EXPERIENCE
Do-Now: Describe the following phenomena of Classical Conditioning:
PSY 402 Theories of Learning Chapter 7 – Behavior & Its Consequences
Conditioning and Learning
Presentation transcript:

Lectures 14: Instrumental Conditioning (Basic Issues) Learning, Psychology 5310 Spring, 2015 Professor Delamater

Associative Learning Pavlovian vs Instrumental Conditioning: CS - US Contingency R - US Contingency The main procedural difference between Pavlovian and instrumental conditioning is that in Pavlovian learning the critical environmental contingency is that between two stimuli (CS and US), whereas in instrumental conditioning the critical environmental contingency is that between the animals own response and the events that follow (for instance, the rewarding US).

Instrumental Learning Three Key Questions 1. What are the major determinants of learning? 2. What is the content of learning? 3. Motivational factors affecting performance? Basic Paradigms 4 Basic Instrumental Contingencies Determining Conditions Nature of the Response Nature of the Reinforcer Response-Reinforcer Relations Temporal Contiguity, Reinforcer Surprise, Contingency

Instrumental Learning: Basic Paradigms 1.Discrete Trials vs Free Operant 2.Puzzle Box (Thorndike), Mazes (Runway, T-Maze, etc) 3.Operant Conditioning chambers (BF Skinner) Early approaches used discrete trials tasks. The animal was placed in the apparatus and removed at the end of the conditioning trial. Then it was returned for another discrete trial later. In Free operant tasks, the animal is placed in the apparatus and permitted to respond freely during the entire conditioning session without interruption.

Discrete Trials Approaches Cats learned to escape puzzle box to obtain food reward. Thorndike assumed this was caused by the development of an S-R association. Reinforcement “stamped in” this association, without itself being learned about.

Discrete Trials Approaches Rats learn to run down the runway (leaving the start box to the goal box) to get food. They can also learn to choose the correct side in a T maze. Learning is assessed with running speed in runway, and % Correct choice in T maze.

Free Operant Approaches Rats learn to press a lever to get food, continue responding uninterrupted. Response rate is one measure of instrumental learning. The “operant” response is defined as that set of actions that result in delivery of the reinforcement (namely, pressing the lever in some manner). In this way, responding and hence, learning, can be easily assessed.

Shaping of Instrumental Responses Learning a response often consists of shaping by successive approximations. Reinforcement is given for successively closer approximations to the target response. These responses may consist of new combinations of existing responses or the learning of entirely new responses.

4 Basic Instrumental Contingencies Response leads to Positive Event (Positive Reinforcement) Response leads to Negative Event (Punishment) Response leads to termination or prevention of Negative Event (Negative Reinforcement) Response leads to termination or prevention of a Positive Event (Omission) Reinforcement procedures increase responding, the others decrease responding.

Instrumental Learning: Determining Conditions Variable responses can be reinforced and increase in frequency when they are. Responses tend to be less variable and more consistent when variability is not reinforced. 1. Nature of the Response (Complex vs Simple Dimensions) Ross & Neuringer (2002) Study Some humans were asked to draw Rectangles to earn points. In one group only rectangles that differed from previously drawn ones were reinforced. The “yoked” group were given reinforcements for drawing rectangles whenever the first group earned reinforcement, but there was no requirement to draw rectangles in any particular way.

Instrumental Learning: Determining Conditions This indicates that some responses-reinforcer relations are better learned about than others. Remember the CS-US relevance effect in Pavlovian learning (Garcia & Koelling, 1966)??? Maybe those responses that are “relevant” to the particular behavior system being tapped into are most easily reinforced. 1. Nature of the Response (Response “Belongingness”) Shettleworth (1975) Study Hamsters were observed when hungry (food deprived) or not hungry. Some responses increased when hungry and others did not increase. Different groups of hamsters could earn food reward when hungry, but which response was required was varied. Those responses that also increased under baseline conditions when the Hamster was hungry were reinforced more easily than the other responses.

Instrumental Learning: Determining Conditions This is called “Instinctive Drift” and reflects the intrusion of Pavlovian CRs that can compete with learning the instrumental task. 1. Nature of the Response (Response “Belongingness”) Breland and Breland (1961) “Miserly Raccoon” Study Raccoons asked to deposit coins (1 or 2) in a piggy bank. They had a hard time doing this, instead they kept ahold of the coin and treated it as a food object (licking it, biting it, etc).

Instrumental Learning: Determining Conditions 2. Nature of the Reinforcer (Reward Magnitude) The larger the reward, the better the reinforcing effect Child had to press a button in order to obtain a social attention reward (tickles, praise,etc) Greater “Progressive Ratio” responding with longer duration rewards. Trosclair-Lasserre et al. (2008) Autism Study

Instrumental Learning: Determining Conditions 2. Nature of the Reinforcer (Contrasting Rewards) Behavioral Contrast Effects (e.g., Negative Contrast) A small reward is rated as less valuable when the animal has prior experience with a larger reward. Shows up here as a decrease in drinking 4% sucrose after the rat had earlier consumed 32%. Ortega et al. (2011) Study

Instrumental Learning: Determining Conditions 3. Nature of the Response-Reinforcer Relationship (Contiguity) Temporal Contiguity (between Response and Reward) Rats lever press more when food reward is given soon after the response Lever press responding decreases as the R-Reward contiguity is reduced This clearly shows that temporal contiguity is important for instrumental learning. Dickinson, Watt, & Griffiths (1992) Study

Instrumental Learning: Determining Conditions 3. Nature of the Response-Reinforcer Relationship (Reward Surprise) Reward Surprise Rats lever press with a 30-s R-Reward interval A 5-s Light occurs either just after the Response, just before the reward, or not at all. A No Light group acquires responding, but the Late Light group does not, and the Early Light group acquires responding best of all. The Light acts as a conditioned reinforcer (or “marking” stimulus) if it occurs just after the Response, but it Blocks instrumental learning if it occurs just before the reward. This blocking effect occurs because the Light Predicts the food reward and renders it unsurprising. This interferes with instrumental learning. Williams (1999) Study

Instrumental Learning: Determining Conditions 3. Nature of the Response-Reinforcer Relationship (Contingency) Contingency (between Response and Outcome) or Temporal Contiguity? Experimental Design Lever Press -> Pellets Chain Pull -> Sucrose (Freely Delivered) Pellets Instrumental Contingency Experiment (Colwill & Rescorla, 1986) Research Question: Does contingency or contiguity determine instrumental learning?

Instrumental Learning: Determining Conditions 3. Nature of the Response-Reinforcer Relationship (Contingency) Contingency (between Response and Outcome) or Temporal Contiguity? Experimental Design Lever Press -> Pellets Chain Pull -> Sucrose (Freely Delivered) Pellets Instrumental Contingency Experiment (Colwill & Rescorla, 1986) Research Question: Does contingency or contiguity determine instrumental learning? Hypothesis: If its Contiguity, then both responses should be = But, if its Contingency, then notice that the Chain-Sucrose Contingency is stronger than the Lever-Pellet contingency. So, Chain Pull should be > than Lever Press

Instrumental Learning: Determining Conditions 3. Nature of the Response-Reinforcer Relationship (Contingency) Contingency (between Response and Outcome) or Temporal Contiguity? Experimental Design Lever Press -> Pellets Chain Pull -> Sucrose (Freely Delivered) Pellets Instrumental Contingency Experiment (Colwill & Rescorla, 1986) Research Question: Does contingency or contiguity determine instrumental learning? Results: The Rats Chain Pull Responses > Lever Press This confirms that Contingency is important during Instrumental learning, much like it has been shown in Pavlovian learning.