🧠

6.3 Operant Conditioning Overview

Sep 5, 2025

Overview

This lecture explains operant conditioning, including how behaviors are increased or decreased through reinforcement or punishment, different reinforcement schedules, and the role of cognition in learning.

Operant Conditioning Basics

  • Operant conditioning is learning to associate a behavior with its consequence.
  • Pleasant consequences (reinforcement) increase the likelihood of repeating a behavior; unpleasant ones (punishment) decrease it.
  • B.F. Skinner built on Thorndike’s law of effect: behaviors followed by satisfaction are repeated, while those with unpleasant outcomes are avoided.
  • Skinner used the "Skinner box" to study how animals learn via operant conditioning.

Types of Reinforcement and Punishment

  • Positive means adding a stimulus; negative means removing one.
  • Reinforcement (positive or negative) increases behavior.
  • Punishment (positive or negative) decreases behavior.
  • Positive reinforcement: adding a desirable stimulus (e.g., giving a toy for cleaning a room).
  • Negative reinforcement: removing an undesirable stimulus (e.g., turning off a beeping seatbelt alarm).
  • Positive punishment: adding an unpleasant stimulus (e.g., scolding for texting).
  • Negative punishment: removing a pleasant stimulus (e.g., taking away a toy for misbehavior).

Shaping and Behavior Modification

  • Shaping involves reinforcing successive steps toward a target behavior.
  • Used to teach complex behaviors by reinforcing closer approximations.
  • Token economies and sticker charts are common in schools and homes for behavior modification.
  • Time-out is a negative punishment technique by removing a child from a desirable activity.

Primary and Secondary Reinforcers

  • Primary reinforcers are innate (e.g., food, water).
  • Secondary reinforcers gain value through association with primary ones (e.g., money, praise, tokens).

Reinforcement Schedules

  • Continuous reinforcement: behavior is reinforced every time; best for new behaviors.
  • Partial (intermittent) reinforcement: behavior is reinforced sometimes.
  • Fixed interval: reinforcement after a set time (e.g., hourly pain relief).
  • Variable interval: reinforcement after unpredictable time intervals (e.g., checking social media).
  • Fixed ratio: reinforcement after a set number of responses (e.g., commission sales).
  • Variable ratio: reinforcement after varying number of responses (e.g., gambling); most resistant to extinction.

Gambling and the Brain

  • Variable ratio schedules create persistent behaviors, as seen in gambling addiction.
  • Gambling activates brain reward centers involving dopamine, norepinephrine, and serotonin.
  • Gambling is now considered an addiction, not just an impulse control disorder.

Cognition and Latent Learning

  • Tolman’s experiments showed learning can occur without immediate reinforcement, called latent learning.
  • Cognitive maps are mental representations used to navigate environments.
  • Latent learning is demonstrated when a learned behavior appears only when needed.

Key Terms & Definitions

  • Operant Conditioning — learning by associating behavior with consequences.
  • Reinforcement — increases likelihood of a behavior.
  • Punishment — decreases likelihood of a behavior.
  • Positive — adding a stimulus.
  • Negative — removing a stimulus.
  • Shaping — reinforcing successive steps toward a target behavior.
  • Primary Reinforcer — naturally rewarding stimulus (e.g., food).
  • Secondary Reinforcer — acquires value by association with primary reinforcers.
  • Reinforcement Schedule — plan for how often a behavior is reinforced.
  • Latent Learning — learning occurring without immediate demonstration, shown when needed.
  • Cognitive Map — mental image of spatial layout.

Action Items / Next Steps

  • Review definitions and examples of reinforcement and punishment types.
  • Practice identifying reinforcement schedules in real-life scenarios.
  • Reflect on examples of latent learning or cognitive maps in your experience.