Operant Conditioning Unit 4 - AoS 2 - Learning. Trial and Error Learning An organism’s attempts to learn or solve a problem by trying alternative possibilities.

Slides:



Advertisements
Similar presentations
Operant Conditioning Skinner, positive & negative reinforcement, response cost, punishment and schedules of reinforcement.
Advertisements

3. Operant Conditioning = A form of learning for which the likelihood of a particular response occurring is determined by the consequences of that response.
Instrumental Conditioning Also called Operant Conditioning.
Operant Conditioning What is Operant Conditioning?
Classical Conditioning Pavlov’s experiment - psychic secretions. Pavlov was a Russian physiologists who studied digestion. He won the Nobel prize in physiology.
Chapter 13, Unit 4 Psychology.  While CC is useful for explaining learned behaviour, there are many other learned behaviours that CC cannot explain,
Operant Conditioning. I. Operant Conditioning A type of learning that occurs when we receive rewards or punishments for our behavior A type of learning.
Operant Conditioning What the heck is it? Module 16.
Operant Conditioning Big Question: Is the organism learning associations between events that it does not control (classical) OR is it learning associations.
Copyright © 2005 Pearson Education Canada Inc. Learning Chapter 5.
OPERANT CONDITIONING DEF: a form of learning in which responses come to be controlled by their consequences.
Learning the Consequences of Behavior
Reward and Punishment.  Cats escape from box to get a treat  At first its all trial and error  When successful the behaviour is rewarded  This good.
O PERANT C ONDITIONING Year 12 Psychology Unit 4 Area of Study 1 (chapter 10, page 476)
Learning.
Chapter 6: Learning. Classical Conditioning Ivan Pavlov Terminology –Unconditioned Stimulus (UCS): evokes an unconditioned response without previous conditioning.
What is Operant Conditioning? Module 16: Operant Conditioning.
A Brief Introduction to Learning Theory The concept of learning is fundamental to education We can teach. We can re-teach. We can teach alternatives.
Chapter 5: Learning Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
OPERANT CONDITIONING Changing Behavior Through Reinforcement and Punishment.
© 2008 The McGraw-Hill Companies, Inc. Chapter 6: Learning.
© 2013 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner.
What is Operant Conditioning?. Operant Conditioning A type of learning in which the frequency of a behavior depends on the consequence that follows that.
Operant Conditioning Unit 4 - AoS 2 - Learning. Trial and Error Learning An organism’s attempts to learn or solve a problem by trying alternative possibilities.
OPERANT CONDITIONING.  Many of the behaviours in animals and humans cannot be explained in terms of classical conditioning.  Many complex behaviours.
Chapter 7 Learning. Classical Conditioning Learning: a relatively permanent change in behavior that is brought about by experience Ivan Pavlov: – Noticed.
Learning (Part II) 7-9% of AP Exam Classical Conditioning UCS + UCR + N, etc… Acquisition Extinction Biological Predisposition Pavlov Watson Operant Conditioning.
Learning Classical Conditioning Classical Conditioning in Real Life Operant Conditioning Operant Conditioning in Real Life Social-Cognitive Learning Theories.
Operant Conditioning Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if.
Review Unit 7. Observational Learning Learning by watching others.
Classical Conditioning
Chapter 6 Learning.
Unit 6 (C): Operant Conditioning
Operant Conditioning E.L. Thorndike and B.F. Skinner.
Copyright McGraw-Hill, Inc Chapter 5 Learning.
Learning Experiments and Concepts.  What is learning?
Operant Conditioning A learning process by which the likelihood of a particular behaviour occurring is determined by the consequences of that behaviour.
Operant Conditioning. Operant Conditioning – A form of learning in which voluntary responses come to be controlled by their consequences. What does this.
B. F. Skinner Radial Behaviorism B.F. Skinner ( ) 1925: Hamilton College (NY): degree in English, no courses in psychology Read about Pavlov’s.
Operant conditioning (Skinner – 1938, 1956)
Chapter 6 Learning and Behavior Learning n A more or less permanent change in behavior that results from experience.
Chapter 5 Learning. Copyright © 1999 by The McGraw-Hill Companies, Inc. 2 Defining Learning Learning –a relatively permanent change in behavior that occurs.
Def: a relatively permanent change in behavior that results from experience Classical Conditioning: learning procedure in which associations are made.
Operant Conditioning. Learning when an animal or human performs a behavior, and the following consequence increases or decreases the chance that the behavior.
Chapter 8 Learning. A relatively permanent change in an organism’s behavior due to experience. learning.
Learning 7-9% of the AP Psychology exam. Thursday, December 3 Sit with your group from yesterday’s test review!
Module 27 Operant Conditioning
Operant Conditioning Type of learning in which the frequency of a behavior depends on the consequence that follows that behavior. Another form of learning.
Behavioral Learning Theory : Pavlov, Thorndike & Skinner M. Borland E.P. 500 Dr. Mayton Summer 2007.
CHS AP Psychology Unit 6: Learning (Behaviorism) Essential Task 6.3: Predict the effects of operant conditioning with specific attention to (primary, secondary,
Copyright © Allyn and Bacon Chapter 6 Learning This multimedia product and its contents are protected under copyright law. The following are prohibited.
Trial and error learning Thorndike’s puzzle box. Trial and error learning This type of learning occurs when an organism attempts to learn by undertaking.
Trial and error learning Thorndike 1898, 1911
Operant Conditioning. A type of learning in which the frequency of a behavior depends on the consequence that follows that behavior. The frequency will.
Operant conditioning Learning by consequences. Ratatouille Ratatouille is hungry and performs various exploratory behaviours By chance he presses the.
3 types of Learning 1. Classical 2. Operant 3. Social This Is our second type of Learning.
Thinking About Psychology: The Science of Mind and Behavior Charles T. Blair-Broeker Randal M. Ernst.
Chapter 6 LEARNING. Learning Learning – A process through which experience produces lasting change in behavior or mental processes. Behavioral Learning.
Conditioning and Learning Unit 6 Conditioning and Learning Modules
Operant Conditioning Module 15. Operant Conditioning A type of learning in which the frequency of a behavior depends on the consequence that follows that.
Before Class… Pick up each of the two worksheets on the cart Submit any late work you may have for me Get a RED book from the shelf Begin to work on the.
Operant Conditioning The Main Features of Operant Conditioning: Types of Reinforcement and Punishment.
Learning by consequences
Learning by consequences
Module 20 Operant Conditioning.
Operant Conditioning Unit 4 - AoS 2 - Learning.
UNIT 4 BRAIN, BEHAVIOUR & EXPERIENCE
Thorndike Interested in studying animal intelligence
Operant Conditioning.
Thorndike Interested in studying animal intelligence
Presentation transcript:

Operant Conditioning Unit 4 - AoS 2 - Learning

Trial and Error Learning An organism’s attempts to learn or solve a problem by trying alternative possibilities until a correct solution or desired outcome is achieved. Often involves many attempts (trials) and incorrect choices (errors) Was called instrumental learning,now Operant conditioning - the learner ‘operates’ on the envioronment

Thorndike’s Puzzle Boxes Put hungry cats into a ‘puzzle box’, food outside box outside of reach Cat had to get out of box to get food. The more times a cat was put in the box, the faster it got out (fewer trials) After 7 trials would go straight for lever and get out immediately. Lever pushing now learnt, not random

Thorndike’s Law of effect a behaviour that is followed by ‘satisfying’ consequences is strengthened (more likely to occur) and a behaviour that is followed by ‘annoying’ consequences is weakened (less likely to occur) Instrumental learning because cat is instrumental in obtaining its release

Operant Conditioning first used by Burrhus Skinner. Operant is a response (or set of responses) that occurs and acts (operates) on the environment to produce some kind of effect. behaviour that has consequences ALL behaviour can be explained this way

Operant vs Respondent respondents are behaviours that are elicited by known or recognised stimuli. Pavlov’s dogs responded by salivating to meat powder, then a bell. Thorndikes cats made responses not prompted by stimuli. In CC, behaviour has no effect on consequences

Skinner Boxes small chamber where an animal learns to make a response for which the consequences can be controlled by experimenter. A lever that delivers food / water into a dish. Some have lights / buzzers Some have a flaw that can shock

Reinforcement Reinforcement - applying a positive stimulus OR removing a negative stimulus to subsequently strengthen or increase the likelihood of a particular response that it follow. Reinforcer - any object or event that changes the probability that an operant behaviour will occur again. Interchangeable with reward, but different

Reinforcement Initially, most success if behaviour is continually reinforced. Continuous Reinforcement - reinforcing every correct response after it occurs Partial Reinforcement - process of reinforcing some correct responses but not all of them. Partial may be delivered by different schedules

Fixed-Ratio Schedules When the reinforcer is given after a set (fixed) and unvarying number (ratio) of desired responses have been made eg every third response, one response for every 10 correct responses (1:10) during acquistion phase must be frequent workers who are paid ‘piecework’ eg commission, amount per basket picked.

Variable-Ratio when the reinforcer is given after an unpredictable number of correct responses. A mean number of correct responses that receive reinforcement. Very effective, fast acquisition and doesn’t cease easily. Poker machines - expected payout, but don’t know when

Fixed-Interval schedule when the reinforcer is delivered after a specific period of time has elapsed since the previous reinforcer, provided the correct response has been made. One correct response is all that is needed, like pressing the crossing button. Often erratic, since we realise time not responses are the factor, so wait until time

Variable-Interval Schedule when the reinforcer is delivered after an irregular period of time has elapsed, provided the correct response has been made. a mean period of time, but unpredictable. responses before the delivery time are not reinforced even if correct. Fishing, speed cameras, booze busses.

Positive Reinforcement giving or applying a positive reinforcer after the desired response has been made. positive reinforcer - provides a satisfying consequence (reward), so strenghtens the likelihood of a response.

Negative Reinforcement Removal or avoidance of an unpleasant stimulus. Negative Reinforcer - any unpleasant stimulus that when removed strengthens the likelihood of a desired response occurring. In negative reinforcemnt the reinforcer is removed or avoided, not given (positive)

Examples Getting and A on your exam (positive reinforcer)can be achieved by studying, so studying will be repeated (increased behaviour) Failing your exam (Negative reinforcer) is avoided by studying, so studying will be repeated (increased behaviour) Both lead to desirable / positive consequence.

Punishment delivery of an unpleasant stimulus following a response, or removal of a pleasant stimulus. Consequence of punishment is weakening of response, or decrease in probability of response occuring again

Activity partial or continuous? positive or negative or punishment?

Order of presentation for reinforcement and punishment, it must be presented immediately after a desired response not before. the rat needs to press the lever before getting positive reinforcer

Timing most effective when given immediatley after the response, so they are associated directly. Delay will cause learning to be slow or unsuccessful. Easier in lab than real life. Eg student reports, delayed response.

Appropriateness reinforcers must provide pleasing consequences, Punishments must provide unpleasant consequences. but how do you know what will please each person? Not all reinforcers will work in all situations. Inappropriate punishers can become reinforcers - eg. attention seekers

Key processes - Acquisition In OC, acquisition is the establhsiment of a response through reinforcement. speed depends on whether continuous or partial reinforcement. For complex behaviours successive approximations can be reinforced building up to target behaviour.

Acquisition Shaping - a procedure in which reinforcement is given for any response that successively approximates a final target response, Also known as method or successive approximations eg skinner’s pigeon will have to turn more and more to get same reward.

Extinction the gradual decrease in the strength or rate of a conditioned response following consistent non-reinforcement of the response. eg when does the pigeon stop turning after it isn’t being fed. may actually increase at first, to try to get the reinforcement. don’t want to stop

Spontaneous Recovery can also occur with operant conditioning, when the response occurs in absence of reinforcement after extinction has occurred. likely weaker and temporary

Stimulus Generalisation when the correct response is made ot another stimulus that is similar to the stimulus that was present when the CR was reinforced. Usually at a reduced level (weaker or less often)

Stimulus Generalisation when an oranism makes the correct response to a stimulus and is reinforced, but doesn’t respond to other stimuli, even when similar. eg if reinforced for red lights not green lights, will only respond for red.

CC and OC Role of Learner Timing of Stimulus and Response Nature of Response - Reflex or Voluntary? LA 12