PowerPoint® Presentation by Jim Foley Learning © 2013 Worth Publishers.

Slides:



Advertisements
Similar presentations
Operant Conditioning Skinner, positive & negative reinforcement, response cost, punishment and schedules of reinforcement.
Advertisements

A.P. Psychology Modules 20-22
Welcome! Please write down your homework: –Test next class. Ch. 8 and all review chapters –Notecards due next class.
Associative Learning Operant Conditioning. Foundations Edward Thorndike ( ) –Puzzle Box –Cats became more efficient with each trial –Law of.
Learning Unit 5. Topics in Learning Unit Defining Learning Classical Conditioning Operant Conditioning Cognitive Learning.
Operant Conditioning Module 16 Demo Activity HO 16.1 Pkt. p. 7 See outline in pkt. p. 6 ½ DVD: Discovering Psychology: Disc 2: “Learning”
Operant Conditioning What is Operant Conditioning?
Chapter 8 Learning.  Learning  relatively permanent change in an organism’s behavior due to experience.
Classical Conditioning, Operant Conditioning, and Observational Learning.
Thinking About Psychology: The Science of
Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.
Myers EXPLORING PSYCHOLOGY (6th Edition in Modules) Module 19 Operant Conditioning James A. McCubbin, PhD Clemson University Worth Publishers.
Chapter 8 Operant Conditioning.  Operant Conditioning  type of learning in which behavior is strengthened if followed by reinforcement or diminished.
Operant Conditioning. I. Operant Conditioning A type of learning that occurs when we receive rewards or punishments for our behavior A type of learning.
Operant Conditioning What the heck is it? Module 16.
Introduction to Operant Conditioning. Operant & Classical Conditioning 1. Classical conditioning forms associations between stimuli (CS and US). Operant.
Operant Conditioning Big Question: Is the organism learning associations between events that it does not control (classical) OR is it learning associations.
1 PSYCHOLOGY (8th Edition, in Modules) David Myers PowerPoint Slides Aneeq Ahmad Henderson State University Worth Publishers, © 2007.
Learning is a relatively permanent change in an organism’s behavior due to experience. Learning is more flexible in comparison to the genetically- programmed.
What is Operant Conditioning? Module 16: Operant Conditioning.
EXPLORING PSYCHOLOGY EIGHTH EDITION IN MODULES David Myers PowerPoint Slides Aneeq Ahmad Henderson State University Worth Publishers, © 2011.
Learning Chapter. Operant Conditioning Module 20.
What is Operant Conditioning?. Operant Conditioning A type of learning in which the frequency of a behavior depends on the consequence that follows that.
Learning (Part II) 7-9% of AP Exam Classical Conditioning UCS + UCR + N, etc… Acquisition Extinction Biological Predisposition Pavlov Watson Operant Conditioning.
Operant Conditioning Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if.
Classical Conditioning
Learning. LEARNING  Learning  relatively permanent change in an organism’s behavior due to experience.
Unit 6 (C): Operant Conditioning
Operant Conditioning E.L. Thorndike and B.F. Skinner.
Learning Chapter 7. Overview: Topics in this Chapter  Goals  Classical conditioning  Operant conditioning  ____________ ____________ learning What.
Chapter 7 Learning PowerPoint® Presentation by Jim Foley.
+ Operant Conditioning AP Psychology: Learning. + What is Operant Conditioning? Type of learning in which the frequency of a behavior increases or decreases.
LEARNING  a relatively permanent change in behavior as the result of an experience.  essential process enabling animals and humans to adapt to their.
OPERANT CONDITIONING. Learning in which a certain action is reinforced or punished, resulting in corresponding increases or decreases in behavior.
Schedules of Reinforcement 11/11/11. The consequence provides something ($, a spanking…) The consequence takes something away (removes headache, timeout)
Operant Conditioning. Operant Conditioning – A form of learning in which voluntary responses come to be controlled by their consequences. What does this.
General Psychology. Scripture James 1:17-18 Every good gift and every perfect gift is from above, and cometh down from the Father of lights, with whom.
Thinking About Psychology: The Science of Mind and Behavior Charles T. Blair-Broeker Randal M. Ernst.
Unit 5: Learning (Behaviorism)
Learning  relatively permanent change in an organism’s behavior due to experience  Helps us …
Myers’ PSYCHOLOGY (7th Ed) Chapter 8 Learning James A. McCubbin, PhD Clemson University Worth Publishers.
Unit 6: Learning. How Do We Learn? Learning = a relatively permanent change in an organism’s behavior due to experience. 3 Types:  Classical  Operant.
AP Psychology Learning
Classical Conditioning. It’s unconditioned because it happens automatically!
AP PSYCHOLOGY UNIT VI Part Two: Operant Conditioning: Reward and Punishment.
Operant conditioning Zimbardo, P.G. & Johnson, R.L. & McCann, V. (2009). Learning and human nurture. In S. Frail (Ed.) Psychology: Core Concepts.
Module 27 Operant Conditioning
Operant Conditioning Type of learning in which the frequency of a behavior depends on the consequence that follows that behavior. Another form of learning.
CHS AP Psychology Unit 6: Learning (Behaviorism) Essential Task 6.3: Predict the effects of operant conditioning with specific attention to (primary, secondary,
Copyright © Allyn and Bacon Chapter 6 Learning This multimedia product and its contents are protected under copyright law. The following are prohibited.
Operant Conditioning Chapter 6.
Operant Conditioning Overview
Chapter 8 pt. 2: Operant Conditioning and Observational Learning
Operant Conditioning. A type of learning in which the frequency of a behavior depends on the consequence that follows that behavior. The frequency will.
Thinking About Psychology: The Science of Mind and Behavior 2e Charles T. Blair-Broeker Randal M. Ernst.
Learning Principles & Applications 7-9% of AP Exam.
3 types of Learning 1. Classical 2. Operant 3. Social This Is our second type of Learning.
Thinking About Psychology: The Science of Mind and Behavior Charles T. Blair-Broeker Randal M. Ernst.
Chapter 6 LEARNING. Learning Learning – A process through which experience produces lasting change in behavior or mental processes. Behavioral Learning.
Conditioning and Learning Unit 6 Conditioning and Learning Modules
Unit 6: Learning (Behaviorism)
PowerPoint® Presentation by Jim Foley
Module 20 Operant Conditioning.
PowerPoint® Presentation by Jim Foley
Operant conditioning.
Chapter 7 (C): Operant Conditioning
PowerPoint® Presentation by Jim Foley
Operant Conditioning.
PowerPoint® Presentation by Jim Foley
Operant Conditioning What the heck is it?
Presentation transcript:

PowerPoint® Presentation by Jim Foley Learning © 2013 Worth Publishers

Module 19: Operant Conditioning

Topics you may find rewarding Skinner’s Experiments: Shaping Behavior Types of Reinforcers:  Positive and Negative  Primary and Conditioned  Immediate and Delayed Reinforcement Schedules  Fixed- and Variable- Ratio and Interval Punishment:  Positive and Negative  The downsides Applications of Operant Conditioning, in School, Work, Sports, and Home Operant vs. Classical Conditioning, revisited

How it works: An act of chosen behavior (a “response”) is followed by a reward or punitive feedback from the environment. Results:  Reinforced behavior is more likely to be tried again.  Punished behavior is less likely to be chosen in the future. Operant Conditioning Response: balancing a ball Consequence: receiving food Behavior strengthened Operant conditioning involves adjusting to the consequences of our behaviors. Examples:  We may smile more at work after this repeatedly gets us bigger tips.  We learn how to ride a bike using the strategies that don’t make us crash.

Thorndike’s Law of Effect Edward Thorndike placed cats in a puzzle box; they were rewarded with food (and freedom) when they solved the puzzle. Thorndike noted that the cats took less time to escape after repeated trials and rewards. Thorndike’s law of effect: behaviors followed by favorable consequences become more likely, and behaviors followed by unfavorable consequences become less likely.

B.F. Skinner: Behavioral Control B. F. Skinner saw potential for exploring and using Edward Thorndike’s principles much more broadly. He wondered:  how can we more carefully measure the effect of consequences on chosen behavior?  what else can creatures be taught to do by controlling consequences?  what happens when we change the timing of reinforcement? B.F. Skinner trained pigeons to play ping pong, and guide a video game missile.

B.F. Skinner: The Operant Chamber  B. F. Skinner, like Ivan Pavlov, pioneered more controlled methods of studying conditioning.  The operant chamber, often called “the Skinner box,” allowed detailed tracking of rates of behavior change in response to different rates of reinforcement. Recording device Bar or lever that an animal presses, randomly at first, later for reward Food/water dispenser to provide the reward

Reinforcement  Reinforcement: feedback from the environment that makes a behavior more likely to be done again.  Positive + reinforcement: the reward is adding something desirable  Negative - reinforcement: the reward is ending something unpleasant For the meerkat, this warm light is desirable. This meerkat has just completed a task out in the cold

Shaping Behavior Reinforcing Successive Approximations Shaping: guiding a creature toward the behavior by reward behavior that comes closer and closer to the desired behavior. Shaping the teacher: Students could smile and nod more when the instructor moves left, until the instructor stays pinned to the left wall.

A cycle of mutual reinforcement 10 Children who have a temper tantrum when they are frustrated may get positively reinforced for this behavior when parents occasionally respond by giving in to a child’s demands. Result: stronger, more frequent tantrums Parents who occasionally give in to tantrums may get negatively reinforced when the child responds by ending the tantrum. Result: parents giving-in behavior is strengthened (giving in sooner and more often)

Discrimination  Discrimination: the ability to become more and more specific in what situations trigger a response.  Shaping can increase discrimination, if reinforcement only comes for certain discriminative stimuli.  For examples, dogs, rats, and even spiders can be trained to search for very specific smells, from drugs to explosives.  Pigeons, seals, and manatees have been trained to respond to specific shapes, colors, and categories. Bomb-finding rat Manatee that selects shapes

Why we might work for money  If we repeatedly introduce a neutral stimulus before a reinforcer, this stimulus acquires the power to be used as a reinforcer.  A primary reinforcer is a stimulus that meets a basic need or otherwise is intrinsically desirable, such as food, sex, fun, attention, or power.  A secondary/conditioned reinforcer is a stimulus, such as a rectangle of paper with numbers on it (money) which has become associated with a primary reinforcer (money buys food, builds power).

A Human Talent: Responding to Delayed Reinforcers  Dogs learn from immediate reinforcement; a treat five minutes after a trick won’t reinforce the trick.  Humans have the ability to link a consequence to a behavior even if they aren’t linked sequentially in time.  A piece of paper (paycheck) can be a delayed reinforcer, paid a month later, if we link it to our performance.  Delaying gratification, a skill related to impulse control, enables longer-term goal setting.

How often should we reinforce?  Do we need to give a reward every single time? Or is that even best?  B.F. Skinner experimented with the effects of giving reinforcements in different patterns or “schedules” to determine what worked best to establish and maintain a target behavior.  In continuous reinforcement (giving a reward after the target every single time), the subject acquires the desired behavior quickly.  In partial/intermittent reinforcement (giving rewards part of the time), the target behavior takes longer to be acquired/established but persists longer without reward.

 Fixed interval schedule:  Every so often  Variable interval schedule:  Unpredictably often We may schedule our reinforcements based on an interval of time that has gone by.  Fixed ratio schedule:  Every so many behaviors  Variable ratio schedule:  After an unpredictable number of behaviors We may plan for a certain ratio of rewards per number of instances of the desired behavior. Different Schedules of Partial/Intermittent Reinforcement

Which Schedule of Reinforcement is This? Ratio or Interval? Fixed or Variable? 1.Rat gets food every third time it presses the lever 2.Getting paid weekly no matter how much work is done 3.Getting paid for every ten boxes you make 4.Hitting a jackpot sometimes on the slot machine 5.Winning sometimes on the lottery you play once a day 6.Checking cell phone all day; sometimes getting a text 7.Buy eight pizzas, get the next one free 8.Fundraiser averages one donation for every eight houses visited 9.Kid has tantrum, parents sometimes give in 10.Repeatedly checking mail until paycheck arrives FR FI FR VR VI/VR VI FR VR FI

Rapid responding near time for reinforcement Fixed interval Rapid responding near time for reinforcement Fixed interval Results of the different schedules of reinforcement Which reinforcements produce more “responding” (more target behavior)?  Fixed interval: slow, unsustained responding If I’m only paid for my Saturday work, I’m not going to work as hard on the other days.  Variable interval: slow, consistent responding If I never know which day my lucky lottery number will pay off, I better play it every day. Steady responding Variable interval

Reinforcers Effectiveness of the ratio schedules of Reinforcement  Fixed ratio: high rate of responding Buy two drinks, get one free? I’ll buy a lot of them!  Variable ratio: high, consistent responding, even if reinforcement stops (resists extinction) If the slot machine sometimes pays, I’ll pull the lever as many times as possible because it may pay this time! Variable ratio Fixed ratio

Operant Effect: Punishment Punishments have the opposite effects of reinforcement. These consequences make the target behavior less likely to occur in the future. + Positive Punishment You ADD something unpleasant/aversive (ex: spank the child) - Negative Punishment You TAKE AWAY something pleasant/ desired (ex: no TV time, no attention)-- MINUS is the “negative” here  Positive does not mean “good” or “desirable” and negative does not mean “bad” or “undesirable.”

When is punishment effective?  Punishment works best in natural settings when we encounter punishing consequences from actions such as reaching into a fire.  In that case, operant conditioning helps us to avoid dangers.  Punishment is less effective when we try to artificially create punishing consequences for other’s choices;  Severity of punishments is not as helpful as making the punishments immediate and certain.

 Punished behaviors may simply be suppressed, and restart when the punishment is over.  Instead of learning behaviors, the child may learn to discriminate among situations, and avoid those in which punishment might occur.  Instead of behaviors, the child might learn an attitude of fear or hatred, which can interfere with learning. This can generalize to a fear/hatred of all adults or many settings.  Physical punishment models aggression and control as a method of dealing with problems. Applying operant conditioning to parenting Problems with Physical Punishment

Don’t think about the beach Don’t think about the waves, the sand, the towels and sunscreen, the sailboats and surfboards. Don’t think about the beach. Are you obeying the instruction? Would you obey this instruction more if you were punished for thinking about the beach?

Problem: Punishing focuses on what NOT to do, which does not guide people to a desired behavior.  Even if undesirable behaviors do stop, another problem behavior may emerge that serves the same purpose, especially if no replacement behaviors are taught and reinforced. Lesson: In order to teach desired behavior, reinforce what’s right more often than punishing what’s wrong.

More effective forms of operant conditioning The Power of Rephrasing  Positive punishment: “You’re playing video games instead of practicing the piano, so I am justified in YELLING at you.”  Negative punishment: “You’re avoiding practicing, so I’m turning off your game.”  Negative reinforcement: “I will stop staring at you and bugging you as soon as I see that you are practicing.”  Positive reinforcement: “After you practice, we’ll play a game!”

Summary: Types of Consequences Adding stimuliSubtract stimuliOutcome Positive + Reinforcement (You get candy) Negative – Reinforcement (I stop yelling) Strengthens target behavior (You do chores) Positive + Punishment (You get spanked) Negative – Punishment (No cell phone) Reduces target behavior (cursing) uses desirable stimuli uses unpleasant stimuli

 The way to modify behavior is through consequences.  Behavior is influenced only by external feedback, not by thoughts and feelings.  We should intentionally create consequences to shape the behavior of others.  Humanity improves through conscious reinforcement of positive behavior and the punishment of bad behavior.  This leaves out the value of instruction and modeling.  Adult humans have the ability to use thinking to make choices and plans  Natural consequences are more justifiable than manipulation of others.  Humanity improves through free choice guided by wisdom, conscience, and responsibility. B.F. Skinner’s Legacy B.F. Skinner’s View Critique

School: long before tablet computers, B.F. Skinner proposed machines that would reinforce students for correct responses, allowing students to improve at different rates and work on different learning goals. Sports: athletes improve most in the shaping approach in which they are reinforced for performance that comes closer and closer to the target skill (e.g., hitting pitches that are progressively faster). Work: some companies make pay a function of performance or company profit rather than seniority; they target more specific behaviors to reinforce. Applications of Operant Conditioning

More Operant Conditioning Applications Parenting 1.Rewarding small improvements toward desired behaviors works better than expecting complete success, and also works better than punishing problem behaviors. 2.Giving in to temper tantrums stops them in the short run but increases them in the long run. Self-Improvement Reward yourself for steps you take toward your goals. As you establish good habits, then make your rewards more infrequent (intermittent).

Contrasting Types of Conditioning Organism associates events. Classical ConditioningOperant Conditioning Basic Idea Associating events/stimuli with each other Associating chosen behaviors with resulting events Response Involuntary, automatic reactions such as salivating Voluntary actions “operating” on our environment Acquisition NS linked to US by repeatedly presenting NS before US Behavior is associated with punishment or reinforcement Extinction CR decreases when CS is repeatedly presented alone Target behavior decreases when reinforcement stops Spontaneous Recovery Extinguished CR starts again after a rest period (no CS) Extinguished response starts again after a rest (no reward) Generalization When CR is triggered by stimuli similar to the CS Response behavior similar to the reinforced behavior. Discrimination Distinguishing between a CS and NS not linked to U.S. Distinguishing what will get reinforced and what will not

If the organism is learning associations between events that it does not control, it is... If the organism is learning associations between its behavior and the resulting events, it is... operant conditioning classical conditioning Operant vs. Classical Conditioning