Title: Operant Conditioning I
1Operant Conditioning I
2Priscilla the Fastidious Pig
- http//www.youtube.com/watch?vy6tpXC2mwlQ
3Thorndike and Law of Effect
- Rewarded behavior is likely to occur
4B. F. Skinner
- Operant conditioning shapes behavior as a
sculptor shapes a lump of clay -
Its all a matter of consequences
5Operant Conditioning
- Learning where responses come to be controlled by
their consequences - Classical conditioning regulating reflexive,
involuntary responses - Operant conditioning voluntary responses
6Skinner Pigeons
- http//www.youtube.com/watch?vvGazyH6fQQ4
7What the what!? How did he do that?
- http//www.youtube.com/watch?vTtfQlkGwE2U
- Shaping and reinforcement
- Shaping operant technique, reward for closer
and closer approximation of desired response
8Volunteer?
9Skinner says
- Organisms tend to repeat responses that are
followed by favorable consequences - Understood best through idea of reinforcement
response is strengthened because it leads to
rewarding consequences - Defined AFTER THE FACT
10Tendency to patronize Chipotle increases
Rewarding stimulus presented Most delicious
meal you will ever have ever
Response Go to Chipotle for a meal
REINFORCEMENT IN OPERANT CONDITIONING
Tendency to tell jokes increases
Response Tell jokes
Rewarding stimulus presented Friends laugh
11Graphic Portrayal of Operant Conditioning
- Steep slope ?
- Shallow slope ?
12Primary vs. Secondary Reinforcers
13Generalization vs. Discrimination
- Which is which?
- Kids only ask parents for sweets when they know
the parents are in a good mood. - A cat runs into the kitchen whenever a can opener
is being utilized
14Basic Processes in Classical and Operant
Conditioning
VS.
15Operant Conditioning II
16Reinforcement Schedules
Fixed
Variable
Variable-Ratio (VR) Higher resistance to
extinction
Fixed-ratio (FR) Lower resistance to extinction
Rapid Responding
High, steady rate without pauses
Short pause after reinforcement
Ratio
Cumulative Responses
Note Higher ratios generate higher response rates
Note higher ratios generate higher response rates
Variable-Interval (VI) Higher resistance to
extinction
Fixed-interval (FI) Lower resistance to
extinction
Long pause after reinforcement yields scalloping
effect
Interval
Cumulative Responses
Low, steady rate without pauses
Note Shorter intervals generate higher rates
overall
Note Shorter intervals generate higher rates
overall
Time
Time
17Fixed-ratio schedule
- A rat is reinforced for every tenth lever press
- A salesperson receives a bonus for every fourth
gym membership sol
18Reinforcement Schedules
Fixed
Variable
Variable-Ratio (VR) Higher resistance to
extinction
Fixed-ratio (FR) Lower resistance to extinction
Rapid Responding
High, steady rate without pauses
Short pause after reinforcement
Ratio
Cumulative Responses
Note Higher ratios generate higher response rates
Note higher ratios generate higher response rates
Variable-Interval (VI) Higher resistance to
extinction
Fixed-interval (FI) Lower resistance to
extinction
Long pause after reinforcement yields scalloping
effect
Interval
Cumulative Responses
Low, steady rate without pauses
Note Shorter intervals generate higher rates
overall
Note Shorter intervals generate higher rates
overall
Time
Time
19Variable-ratio schedule
- A slot machine in a casino pays off once every
six tries on the average. - The number of winning responses between payoffs
varies greatly from one time to the next.
20Reinforcement Schedules
Fixed
Variable
Variable-Ratio (VR) Higher resistance to
extinction
Fixed-ratio (FR) Lower resistance to extinction
Rapid Responding
High, steady rate without pauses
Short pause after reinforcement
Ratio
Cumulative Responses
Note Higher ratios generate higher response rates
Note higher ratios generate higher response rates
Variable-Interval (VI) Higher resistance to
extinction
Fixed-interval (FI) Lower resistance to
extinction
Long pause after reinforcement yields scalloping
effect
Interval
Cumulative Responses
Low, steady rate without pauses
Note Shorter intervals generate higher rates
overall
Note Shorter intervals generate higher rates
overall
Time
Time
21Fixed-interval schedule
- A man washing his clothes periodically check to
see whether each load is finished - The reward (clean clothes) is available only
after a fixed time interval - The man who checks his laundry before it is
completed in the cycle does not receive
reinforcement of clean clothes because theyre
not done yet
22Reinforcement Schedules
Fixed
Variable
Variable-Ratio (VR) Higher resistance to
extinction
Fixed-ratio (FR) Lower resistance to extinction
Rapid Responding
High, steady rate without pauses
Short pause after reinforcement
Ratio
Cumulative Responses
Note Higher ratios generate higher response rates
Note higher ratios generate higher response rates
Variable-Interval (VI) Higher resistance to
extinction
Fixed-interval (FI) Lower resistance to
extinction
Long pause after reinforcement yields scalloping
effect
Interval
Cumulative Responses
Low, steady rate without pauses
Note Shorter intervals generate higher rates
overall
Note Shorter intervals generate higher rates
overall
Time
Time
23Variable-interval schedule
- Person wants to win a radio contest, so calls the
station and gets a busy signal - Getting through to the DJ is the reinforcer
- A rat is reinforced for the first lever press
after a 1 minute interval, but the following
intervals are 3 min, 2 min, and 4 min (average of
2 min)
24Reinforcement Schedules
Fixed
Variable
Variable-Ratio (VR) Higher resistance to
extinction
Fixed-ratio (FR) Lower resistance to extinction
Rapid Responding
High, steady rate without pauses
Short pause after reinforcement
Ratio
Cumulative Responses
Note Higher ratios generate higher response rates
Note higher ratios generate higher response rates
Variable-Interval (VI) Higher resistance to
extinction
Fixed-interval (FI) Lower resistance to
extinction
Long pause after reinforcement yields scalloping
effect
Interval
Cumulative Responses
Low, steady rate without pauses
Note Shorter intervals generate higher rates
overall
Note Shorter intervals generate higher rates
overall
Time
Time
25Conclusion
- Faster responding leads to reinforcement sooner
when ratio is in effect - Variable schedules tend to generate steadier
response rates - Greater resistance to extinction
26Reinforcement Schedules
Fixed
Variable
Variable-Ratio (VR) Higher resistance to
extinction
Fixed-ratio (FR) Lower resistance to extinction
Rapid Responding
High, steady rate without pauses
Short pause after reinforcement
Ratio
Cumulative Responses
Note Higher ratios generate higher response rates
Note higher ratios generate higher response rates
Variable-Interval (VI) Higher resistance to
extinction
Fixed-interval (FI) Lower resistance to
extinction
Long pause after reinforcement yields scalloping
effect
Interval
Cumulative Responses
Low, steady rate without pauses
Note Shorter intervals generate higher rates
overall
Note Shorter intervals generate higher rates
overall
Time
Time
27Positive vs. Negative Reinforcement
- Study hint THEY BOTH HAVE THE WORD
REINFORCEMENT IN IT ITS ABOUT REINFORCEMENT
28- Positive reinforcement response is strengthened
because it is followed by the presentation of a
rewarding stimulus - Ex good grades, tasty meals, paychecks,
scholarship, promotions, nice clothes, attention,
flattery
29- Negative Reinforcement occurs when a response is
strengthened because it is followed by the
removal of an averse stimulus - ex you rush home in winter to get out of the
cold, you clean a house to get rid of a mess, you
give in to an argument to avoid an unpleasant
situation
30REINFORCEMENT IS REINFORCEMENT
- Both positive and negative reinforcement involve
a favorable outcome that STRENGTHENS - a response tendency
31Positive vs. Negative Reinforcement in a Skinner
Box
Behavior
Consequence
Rewarding stimulus presented food delivered
Response Press lever
Tendency to press lever increases
Aversive Stimulus removed shock turned off
Response Press lever
Tendency to press lever increases
-
32Negative reinforcement applications
- 1. Escape learning organism acquires a response
that decreases or ends some aversive stimulation - Ex you leave a party where you were getting
picked on by peers
33- 2. Avoidance learning organism acquires a
response that prevents some aversive stimulation
from occurring - Ex you quit going to parties because of your
concern about being picked on
34- How does avoidance learning present an example of
how classical conditioning and operant
conditioning work together to regulate behavior? - Ex Rat, shuttle box, shock
35Punishment Consequences that weaken responses
- Punishment occurs when event following a
response weakens the tendency to make that
response - Super easy to mix-up
- How is this different from negative reinforcement?
36Punishment examples
- If you wear a new outfit and your classmates make
fun of it, your behavior will have been punished
and your tendency to wear the same clothing will
probably decline. - 2) If you have a bad meal and a restaurant,
your response will have been punished, and you
will be less likely to go to the restaurant again.