Overview
This lecture explains operant conditioning, including how behaviors are increased or decreased through reinforcement or punishment, different reinforcement schedules, and the role of cognition in learning.
Operant Conditioning Basics
- Operant conditioning is learning to associate a behavior with its consequence.
- Pleasant consequences (reinforcement) increase the likelihood of repeating a behavior; unpleasant ones (punishment) decrease it.
- B.F. Skinner built on Thorndike’s law of effect: behaviors followed by satisfaction are repeated, while those with unpleasant outcomes are avoided.
- Skinner used the "Skinner box" to study how animals learn via operant conditioning.
Types of Reinforcement and Punishment
- Positive means adding a stimulus; negative means removing one.
- Reinforcement (positive or negative) increases behavior.
- Punishment (positive or negative) decreases behavior.
- Positive reinforcement: adding a desirable stimulus (e.g., giving a toy for cleaning a room).
- Negative reinforcement: removing an undesirable stimulus (e.g., turning off a beeping seatbelt alarm).
- Positive punishment: adding an unpleasant stimulus (e.g., scolding for texting).
- Negative punishment: removing a pleasant stimulus (e.g., taking away a toy for misbehavior).
Shaping and Behavior Modification
- Shaping involves reinforcing successive steps toward a target behavior.
- Used to teach complex behaviors by reinforcing closer approximations.
- Token economies and sticker charts are common in schools and homes for behavior modification.
- Time-out is a negative punishment technique by removing a child from a desirable activity.
Primary and Secondary Reinforcers
- Primary reinforcers are innate (e.g., food, water).
- Secondary reinforcers gain value through association with primary ones (e.g., money, praise, tokens).
Reinforcement Schedules
- Continuous reinforcement: behavior is reinforced every time; best for new behaviors.
- Partial (intermittent) reinforcement: behavior is reinforced sometimes.
- Fixed interval: reinforcement after a set time (e.g., hourly pain relief).
- Variable interval: reinforcement after unpredictable time intervals (e.g., checking social media).
- Fixed ratio: reinforcement after a set number of responses (e.g., commission sales).
- Variable ratio: reinforcement after varying number of responses (e.g., gambling); most resistant to extinction.
Gambling and the Brain
- Variable ratio schedules create persistent behaviors, as seen in gambling addiction.
- Gambling activates brain reward centers involving dopamine, norepinephrine, and serotonin.
- Gambling is now considered an addiction, not just an impulse control disorder.
Cognition and Latent Learning
- Tolman’s experiments showed learning can occur without immediate reinforcement, called latent learning.
- Cognitive maps are mental representations used to navigate environments.
- Latent learning is demonstrated when a learned behavior appears only when needed.
Key Terms & Definitions
- Operant Conditioning — learning by associating behavior with consequences.
- Reinforcement — increases likelihood of a behavior.
- Punishment — decreases likelihood of a behavior.
- Positive — adding a stimulus.
- Negative — removing a stimulus.
- Shaping — reinforcing successive steps toward a target behavior.
- Primary Reinforcer — naturally rewarding stimulus (e.g., food).
- Secondary Reinforcer — acquires value by association with primary reinforcers.
- Reinforcement Schedule — plan for how often a behavior is reinforced.
- Latent Learning — learning occurring without immediate demonstration, shown when needed.
- Cognitive Map — mental image of spatial layout.
Action Items / Next Steps
- Review definitions and examples of reinforcement and punishment types.
- Practice identifying reinforcement schedules in real-life scenarios.
- Reflect on examples of latent learning or cognitive maps in your experience.