Unit 6: Learning

Search this chapter

Audio Overview

0:00 / 0:00

Autoplay next chapter

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replaced the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

You know, if you look closely at the life cycle of a Chinook salmon,

it almost reads like a piece of science fiction.

Oh, totally.

It's wild.

Right.

Because when a Chinook salmon first emerges from its egg, and it's usually buried deep in this gravel bed of some freezing freshwater stream, it arrives into the world completely alone.

Yeah.

There's no parent hanging around.

Exactly.

It doesn't have a manual.

It doesn't have a parent teaching it the ropes.

And yet, its genes provide almost every single behavioral instruction it will ever need for its entire life.

It's just built in.

It instinctively knows exactly how to swim, what to eat,

and crucially, you know, how to avoid predators.

Following this unyielding genetic blueprint, it just makes its way out of that stream and into the massive expanse of the ocean.

Where it spends, what, roughly four years?

Yeah, about four years out there in the deep water.

And then, and this is the part that completely blows my mind, it navigates hundreds, sometimes thousands of miles back to the exact mouth of its home river.

Which is incredible.

It uses the highly specific olfactory signature, like the unique scent of its home stream, to swim upstream, fighting currents, fighting predators, just to find the perfect temperature in gravel bed.

It breathes, and then it dies.

It's an incredibly beautiful, but entirely rigid biological program.

I mean, it was a brilliantly engineered, completely closed loop.

Yeah.

You could say it's essentially binary.

The salmon's life path, its ultimate fate, is effectively written into its DNA before it even hatches.

There's just zero room for improvisation.

Yeah, and then you look at us, you know?

You look at a human infant.

We are born with exactly zero genetic plans for our life direction.

We are pretty pathetic at birth.

So pathetic.

We are, comparatively speaking, completely and utterly helpless.

We can't walk, we can't forage, we can't defend ourselves.

We don't even know how to hold our own heads up.

Which seems like at first glance like a massive evolutionary disadvantage.

And yet, that lack of a rigid genetic plan is exactly what makes us the apex species on this planet.

Unlike the salmon, our greatest gift from nature isn't a hard -wired instruction manual.

It's our adaptability.

Adaptability, right.

Because we aren't locked into building a specific type of nest in a specific temperature of water, we have this behavioral flexibility that is just unmatched.

We can learn to build grass huts in the sweltering savanna, or carve snow shelters in the freezing Arctic.

We engineer submarines for the deep ocean.

Exactly.

And space stations to live in a vacuum.

We can adjust to almost any environment imaginable for one simple reason, which is that we have the profound capacity to learn.

Which brings us perfectly to our mission for this deep dive.

We are unpacking the psychology of how we adapt.

We're looking at the fundamental mechanics of learning.

And we really want to treat this like an ultimate masterclass for you.

Getting into the absolute bedrock of how human beings and animals alter their behavior to survive.

So let's start with the big picture.

How do psychologists actually define this concept?

Because you know, learning is a word we throw around all the time in everyday conversation, but we usually just mean like memorizing facts for a test.

But in the realm of psychology, the definition is much broader and much more foundational.

Learning is formally defined as a relatively permanent behavior change due to experience.

A relatively permanent behavior change due to experience.

So it's not just about acquiring knowledge.

It's about a measurable shift in what we actually do.

Precisely.

If an organism's behavior hasn't changed in a relatively permanent way, true learning hasn't occurred.

And to understand how this incredible survival mechanism actually works, we have to ask a fundamental question.

Which is?

How do our brains take the chaos of the world like?

The millions of sights, sounds, and physical sensations we experience every day and extract meaningful patterns from them.

Right.

How do we glue these disparate experiences together so that they actually help us navigate the future?

Because, well, a single experience is totally useless if it doesn't inform what you do the next time.

The answer to that question is elegantly simple.

And it's actually something philosophers figured out long before modern neuroscience even existed.

Over 2 ,000 years ago, Aristotle proposed the concept, and centuries later, enlightenment philosophers like John Locke and David Hume echoed it.

Which is that we learn by association.

Exactly.

Our minds are essentially pattern recognition machines.

We naturally connect events that occur in a sequence.

Okay, let's dig into this idea of association because it sounds a little abstract, but it's happening to us every single second of the day.

Think about a deeply comforting experience.

So suppose you walk into a kitchen.

You see a loaf of freshly baked bread on the counter, and you smell that incredible yeast and butter aroma.

Oh, the best.

You cut a piece, eat it, and you find it incredibly satisfying.

Well, the next time you see and smell fresh bread, your mind has already connected those sequential events.

That sensory experience automatically leads you to expect that eating it will, once again, be satisfying.

You don't have to relearn the concept of bread every single time you see it.

A great visual for this, and this is a classic one, is the association of lightning and thunder.

Stimulus one is the lightning, you know, a bright flash in the sky.

And then a few seconds later, stimulus two happens, the loud booming crack of thunder.

As children, we might just be startled by the thunder, but after repeating this sequence enough times in our lives, our brain just glues stimulus one and stimulus two together.

We don't just wait passively for the thunder anymore.

No, the moment we see the lightning, we wince.

We physically brave ourselves.

We've learned to predict the immediate future.

And it's not just passive expectations either.

This associative power drives active behaviors too.

Yeah, consider a seal at an aquarium.

The seal learns over time that performing a specific behavior like balancing a red ball on its nose is consistently followed by a specific consequence.

Usually a trainer tossing at a herring.

Exactly.

The seal associates his own behavior with a rewarding outcome.

So if associative learning is this powerful, it must be the foundation of our habits, right?

Like the things we do automatically without even thinking.

It absolutely is.

Learned associations fee our most deeply ingrained habitual behaviors.

Think about the simple act of eating popcorn in a movie theater.

Oh, I'm guilty of this one.

Right.

The context.

Walking into the theater, smelling the butter, seeing the dim lights in the big screen.

All of that becomes inextricably associated with the behavior of eating popcorn.

So your next experience of that context automatically triggers the habitual response.

Yeah, you might not even be hungry at all, but the environment commands the behavior.

Which explains why habits, especially destructive ones, are so incredibly difficult to break.

When someone is trying to kick a smoking habit, they might do perfectly fine when they are on vacation or out of their normal routine.

But the moment they return to the exact context where they used to smoke.

Like a specific chair on their back porch or the break room at work, they are hit with a massive, almost uncontrollable urge to light up.

The environment itself is acting as a signal to the brain.

Just firing up those old associative pathways.

And if we zoom out and look at this from a broader biological perspective, associative learning isn't just some quirky mechanism for human habits.

It is a universal survival tool.

Take something as simple as a sea slug.

Specifically the Apligia.

Okay.

This is an incredibly basic organism.

Out in the ocean, if this slug gets disturbed by a squirt of choppy water, its natural defense mechanism is to withdraw its gill to protect itself.

It's just a reflex.

Right.

Now if the water keeps squirting and nothing bad happens, the slug's withdrawal response slowly diminishes.

It realizes the squirt isn't a threat.

This phenomenon is called habituation.

But if you change the environmental sequence like, if researchers repeatedly give that shock just a fraction of a second after the squirt of water, the gill withdrawal response doesn't diminish.

It actually grows much stronger.

This simple, primitive sea slug has learned to associate the squirt of water with the impending shock.

It is using association to predict danger.

And out in the wild, the ability to predict danger is literally the difference between life and death.

There's a profoundly heartbreaking story that perfectly illustrates what happens when an animal lacks this associative history.

The wolves.

Yeah.

Back in 1998,

conservationists released 11 captive -bred Mexican gray wolves into the Apache National Forest.

These animals had been extinct in the wild in the U .S.

for over 20 years.

And these wolves had impeccable genetics.

They had the nature side of the equation fully intact.

They instinctively knew how to hunt prey, how to pack together, they even had a natural instinct to keep roughly a hundred foot distance from humans.

But within eight short months, out of those 11 wolves, only a single lone survivor was recaptured.

The rest had been killed.

Because while they had the genetic predispositions, they lacked a history of appropriate associative learning.

Right.

Having been raised in captivity, they had never learned to associate a human being with a gun.

They didn't know that a human pointing a stick at them was a lethal threat that required immediate escape.

It's a tragic but powerful demonstration of reality.

Successful adaptation requires both nature and nurture.

The genes give you the baseline hardware.

But associative learning is the software that allows you to survive the hyperspecific, ever -changing threats of your local environment.

And this basic pairing of two stimuli, the lightning and the thunder, the squirt of water and the shock, the human and the gun, this concept is the bedrock of our first major area of exploration, which is classical conditioning.

To really grasp classical conditioning, we need to travel back to the early 20th century, a time when the field of psychology was undergoing this massive radical shift.

We have to talk about the era of behaviorism, championed heavily by a man named John B.

Watson.

Watson was kind of a renegade, wasn't he?

He was entirely dismissive of the status quo.

At the time, psychology was heavily focused on introspection, you know, asking patients to describe their inner thoughts, their feelings, their hidden motives.

Think of the whole Freudian approach.

Watson argued that this was completely unscientific.

He believed that if psychology was ever going to be taken seriously as a true objective science like physics or chemistry, it needed to entirely discard all that invisible mentalistic stuff.

Basically saying, if I can't see it, it doesn't belong in science.

No more guessing what someone is feeling.

Exactly.

Watson declared that psychology should focus purely on how organisms respond to stimuli in their environments.

Observable, measurable behavior, period.

This doctrine was called behaviorism, and it completely dominated North American psychology for decades.

But the fascinating thing is Watson's revolutionary ideas didn't spring from his own original experiments.

No, they didn't.

They were built almost entirely on the foundation laid by a Russian scientist named Ivan Pavlov.

And here is where the story gets so wonderfully bizarre.

Ivan Pavlov, the man whose name is literally synonymous with classical conditioning, you know, the bell guy, wasn't even a psychologist.

He had zero interest in the mind.

Not a bit.

He was a strict, rigorous physiologist.

He had a medical degree and spent two decades meticulously studying the digestive system.

His work was so groundbreaking that it actually earned him Russia's first Nobel Prize in 1904.

He was studying saliva, gastric juices, and the mechanical processes of digestion.

So how on earth does a Nobel winning digestion researcher stumble into becoming the godfather of a massive psychological movement?

Well, like many great scientific discoveries, it started as an extreme annoyance.

A happy accident.

Yeah.

Pavlov was trying to measure exactly how much dogs salivated when various types of food were placed in their mouths.

But he kept running to a highly frustrating problem.

The dogs were salivating before the food even arrived.

They were ruining his data.

Precisely.

They drooled at the mere sight of the food,

or the sight of the empty food dish, or even just the sight of the laboratory assistant who normally delivered the food.

Sometimes just the sound of the assistants approaching footsteps down the hall would trigger a fountain of saliva.

Pavlov was so irritated by this that he initially just called them psychic secretions, basically writing them off as a nuisance.

But then his brilliant, objective mind realized that this wasn't just a quirk.

It was an incredibly profound observation.

Right.

These dogs weren't just leaking saliva randomly.

They were anticipating the future.

They were exhibiting a simple, yet highly effective form of learning.

And because he was a rigorous scientist, Pavlov didn't sit around philosophizing about what the dog was feeling when it heard the footsteps.

He didn't ask if the dog was happy or anxious.

No.

He designed an incredibly strict experiment to isolate the variables.

He put a dog in a small, completely isolated room.

He secured it in a harness to prevent extraneous movement, and he surgically attached a device to the dog's cheek to precisely, mathematically measure every single drop of saliva.

OK.

We need to break down exactly what happened in that room, because the terminology Pavlov created is the absolute core of understanding how humans and animals learn to anticipate events.

Let's walk you through the exact mechanics.

It all starts with a biological baseline, what happens naturally, without any training or learning whatsoever.

If you blow meat powder into a dog's mouth, the dog will salivate.

It is a hardwired biological reflex,

because this response requires absolutely no learning.

It is unconditioned by the environment.

Pavlov called the drooling the unconditioned response, or UR.

So unconditioned just means unlearned.

It is automatic.

Nature programmed it.

Right.

And the thing that triggers this automatic response, in this case the food in the mouth, is called the unconditioned stimulus, or US.

So the unconditioned stimulus, the food,

reliably and automatically produces the unconditioned response, the drool.

OK.

The baseline is set.

Food equals drool.

Unconditioned stimulus leads to unconditioned response.

Now Pavlov introduces something entirely new and meaningless.

Let's say a tone from a tuning fork.

A completely neutral stimulus.

Exactly.

If you play a tone for a normal dog, it might perk its ears up, it might look around, but it is certainly not going to start drooling.

But here is where the magic of conditioning happens.

Pavlov sounds the tone and then immediately blows the meat powder into the dog's mouth.

Tone then food, tone then food, over and over again.

After several repetitions of this highly specific sequence, the dog's brain connects the dots.

It learns the association.

And the result is that now, if you just sound the tone, with absolutely no food present, the dog begins to salivate.

The dog's physiology is actively anticipating the meat powder based purely on an auditory cue.

Exactly.

And because this new salivation response is entirely conditional upon the dog having learned this specific association, we call it the conditioned response, or CR.

And the tone itself, which used to be completely neutral but has now acquired the almost magical power to trigger a physiological reflex, is called the conditioned stimulus, or CS.

It is an incredibly elegant system.

Conditioned simply equals learned.

Unconditioned equals unlearned or natural.

Let me throw out another example just to make sure we've completely anchored this concept for you, because it's so foundational.

Let's say you're participating in a psychology experiment.

You're sitting in a chair, and a researcher sounds a sharp tone, and then immediately delivers a puff of air right into your eye.

Ouch.

Right?

Naturally, you blink.

After this happens five or six times, the researcher sounds the tone, delivers no air puff, but you blink anyway.

In this scenario, the air puff is the unconditioned stimulus.

It naturally causes the reflex.

Blinking to the air puff is the unconditioned response.

And the tone starts out neutral, but becomes the conditioned stimulus.

And your learned habit of blinking to the tone alone is the conditioned response.

That is perfectly mapped out, and what's wonderful is how intuitively we actually understand this, even in pop culture.

Think of the classic Peanuts comic strip featuring Snoopy.

Oh yeah, the can opener one?

Yes.

Snoopy is lounging on top of his doghouse.

In the film's panel, his ears suddenly perk up.

The caption reads, the ears hear the can opener.

In the second panel, Snoopy is smiling blissfully, thinking, right away the stomach knows that supper is coming.

And in the third panel, he's leaping off the doghouse, pondering this deep neurological question, how do the ears tilt a stomach?

Snoopy is literally outlining a dissertation on associative learning.

The sound of the electric can opener is the conditioned stimulus, and the stomach's joyous anticipation, the gastric juices flowing, is the conditioned response.

It's textbook.

So once Pavlov definitively demonstrated this basic mechanism, he didn't just stop there.

He essentially converted his lab into a massive research factory.

Over the next three decades, he and his team published over 500 scientific papers exploring every conceivable nuance of this phenomenon.

And through all that work, they identified five major conditioning processes that govern how these associations are formed, maintained, and broken.

Let's walk you through those five processes.

The first one is acquisition.

This is the initial stage of learning, the moment when the stimulus -response relationship is actually acquired.

And the absolute most critical factor here is timing.

Timing is everything.

Pavlov discovered that in order to successfully acquire the association, the neutral stimulus, the tone, need to be presented before the unconditioned stimulus, the food.

And it can't just be generally before, it has to be immediately before.

Usually an interval of about half a second works best.

But why?

What if the researcher gave the dog the food, let the dog eat it, and then sounded the tone?

Would the dog still learn the association?

In almost every case, no.

Conditioning rarely happens if the sequence is reversed.

And if we look at this through the lens of evolutionary biology, it makes perfect sense.

The entire purpose of classical conditioning is biologically adaptive, you know.

It's designed to help an animal prepare for an impending good or bad event.

Oh, that makes total sense.

To Pavlov's dogs, the tone was a signal that food was impending.

So the body prepped the digestive system.

In the wild, imagine a deer in the forest.

The snapping of a twig is the conditioned stimulus.

It signals the approach of a predator, which is the unconditioned stimulus.

If the predator has already pounced and bitten the deer, hearing a twig snap afterward doesn't offer any survival advantage.

The conditioned stimulus has to be a herald.

It must predict the future.

There's a fascinating and somewhat amusing experiment conducted by researcher Michael Domjohn that perfectly illustrates how acquisition serves biological relevance.

He used male Japanese quail for this study.

Quail?

Yeah.

Domjohn arranged a setup where he would turn on a red light in a cage just before presenting an approachable sexually receptive female quail.

So the red light was the neutral stimulus and the female quail was the unconditioned stimulus that naturally triggers arousal.

I assume the male quail figured this out pretty quickly.

Incredibly quickly.

After a few pairings, the red light alone caused the male quail to become highly sexually aroused.

The birds actually developed a behavioral preference for staying in the area of their cage where the red light shone, their own little red light district, so to speak.

But the most crucial finding was biological.

When a female did finally appear after the light, the conditioned males actually mated much more quickly and released significantly more sperm than males who had not been conditioned.

So this isn't just a quirky behavioral trick.

The conditioning gave the male quail a literal, measurable reproductive advantage.

Their bodies were prepped for reproductive success.

Exactly.

It prepares the organism for survival and reproduction.

And we see this in humans too.

Take the story of a psychologist who noted how classical conditioning shaped his own romantic life.

His first serious girlfriend had a deep love for eating onions.

Uh oh.

Consequently, whenever they kissed, which is an unconditioned stimulus that naturally produces romantic arousal, the unconditioned response, he was constantly smelling strong onion breath, which was initially a neutral stimulus.

But through the process of acquisition,

that neutral smell became deeply conditioned.

The onion breath became the conditioned stimulus.

And he noted that before long, just the smell of onion breath alone, even if she wasn't around, would send a romantic tingle up his spine.

The conditioned stimulus triggered the conditioned response, which is a perfect segue into the second process Pavlov discovered, which is higher order conditioning, sometimes referred to as second order conditioning.

This happens when you build a chain of associations.

So like expanding the network.

Yes.

You link a new neutral stimulus to an already existing conditioned stimulus.

Imagine our dog that has already been perfectly conditioned to salivate to the tone.

Now the experimenter turns on a bright light and then sounds the tone.

Light then tone.

Importantly, the light is never paired with the food directly.

But eventually the dog's brain connects the light to the tone, which is connected to the food.

Exactly.

The dog will start salivating to the light alone.

The light has become a new second order conditioned stimulus.

It's like a game of neurological telephone.

And even though this second order conditioning is usually a bit weaker than the primary conditioning, it shapes so much of our daily lives and fears.

For example, let's say you were severely bitten by a large dog as a child.

The bite is the US.

The intense fear is the UR.

As a result, the sight of a large dog becomes a conditioned stimulus triggering fear.

But because of higher order conditioning, just hearing the sound of a barking dog in the distance, which brings to mind the image of the big dog, which brings to mind the bite,

can be enough to make your heart race.

The sound is a step removed, but it still has the power to trigger the visceral emotion.

It also plays a massive role in how we form our attitudes and prejudices, often completely outside of our conscious awareness.

There is a brilliant study by Olson and Fazio, where they demonstrated how higher order conditioning could subtly manipulate attitudes.

Right, the Pokemon study.

Yes.

They had adult participants stare at a video screen, telling them they were playing the role of a security guard looking for a specific target image.

Which was a total misdirection, completely.

The real experiment was happening in the margins.

While they were guarding, two different neutral Pokemon characters kept flashing on the screen very briefly.

Unbeknownst to the participants, one Pokemon character was consistently paired with positive words and images, things like the word awesome or a picture of a hot fudge sundae.

And the other one.

The other Pokemon character was consistently paired with negative things, like the word awful or a picture of a cockroach.

And the participants had no idea this was happening?

None at all.

But later, when the researchers assessed the participants' feelings toward various images, they found that the adults had formed a gut -level positive attitude toward the Pokemon associated with the positive words, and a distinct negative attitude toward the Pokemon paired with the cockroaches.

The neutral characters had become conditioned stimuli for positive or negative affect, just shaping their preferences without them ever realizing it.

Which explains so much about advertising and political campaigns.

You constantly pair a candidate with a sweeping musical score and cheering crowds,

or conversely, you pair their opponent with dark, ominous music and grainy black and white photos.

You are building higher -order conditioned responses in the voters' brains.

Absolutely.

So that covers acquisition and higher -order conditioning.

But what happens when the environment changes?

This brings us to the third process, which is extinction.

What happens if Pavlov keeps sounding the tone day after day, but completely stops delivering the food?

Does the dog just drool forever at the sound of the tone?

Does the conditioning last a lifetime?

No, it doesn't.

Pavlov found that if you present the conditioned stimulus repeatedly without the unconditioned stimulus, the conditioned response begins to steadily diminish.

The dog salivates less and less until it starves entirely.

So this declining response is called extinction.

The tone no longer reliably predicts the food, so the brain adapts and stops preparing for it.

But here is the crucial nuance, and this leads right into the fourth process.

Extinction does not mean the memory is erased.

The dog hasn't suffered amnesia and forgotten the association.

We know this because of a phenomenon Pavlov called spontaneous recovery.

Right.

After Pavlov fully extinguished the drooling response, he would take the dog out of the harness and let it rest for several hours.

Then he would bring the dog back, put it in the room, and sound the tone again, and suddenly, Seemingly out of nowhere, the salivation response reappeared.

It was usually weaker than it was at its peak, but the fact that it spontaneously recovered proves something profound.

Extinction isn't the elimination of a learned response.

It is merely the suppression of it.

This explains so much about human relationships and trauma.

Let's go back to our psychologist friend and his onion -loving girlfriend.

Right.

They eventually broke up.

So the conditioned stimulus, the smell of onion breath, was no longer being paired with the unconditioned stimulus, the kissing.

Over months and years, he experienced extinction.

Smelling onions at a restaurant no longer caused romantic arousal.

But he noted that occasionally, years after the breakup, if he suddenly caught a whiff of strong onion breath,

it would unexpectedly awaken a tiny fleeting flutter of that old emotional response.

That ghost of a feeling is spontaneous recovery in action.

And we see the exact same mechanism in addiction recovery.

A person might go to rehab and successfully extinguish their cravings in that safe environment.

But years later, a sudden exposure to an old trigger,

a specific song, a specific street corner can cause a spontaneous recovery of the intense craving.

Which brings us to the fifth and final set of classical conditioning processes, which is generalization and discrimination.

Generalization is the brain's tendency to respond to stimuli that are similar to the original conditioned stimulus.

Pavlov demonstrated this beautifully.

He conditioned a dog to drool when a specific localized spot on its thigh was scratched.

Then he tested the dog's reaction to being scratched on other parts of its body.

The dog didn't just drool for the thigh scratch.

It drooled a bit when its pelvis was scratched and a little bit when its trunk was scratched.

The closer the scratch was to the original conditioned spot, the stronger the drool response.

The learned response had generalized across the dog's body map.

And from a survival standpoint, generalization is incredibly adaptive.

If a toddler has a terrifying experience with a moving car in the street and learns to fear it, it is highly beneficial for their survival if that fear automatically generalizes to moving trucks, buses, and motorcycles.

The brain is basically saying this whole category of large, fast -moving metal objects is dangerous.

Stay away.

But generalization can also be the source of profound psychological trauma.

Take the harrowing case of an Argentine rider who was imprisoned and brutally tortured by the military junta.

Years after his release, long after he was safe, he still recoiled with visceral, intense fear and anxiety just at the sight of black leather shoes.

Because when he was lying on the floor of his cell, his first glimpse of his torturers approaching was always their black military shoes, right?

Exactly.

The intense terror of the torture generalized to the mundane sight of the footwear.

We see this with abused children as well.

Researchers conducted a study where they showed abused children and non -abused children a digital image of an angry face on a computer screen while monitoring their brain waves.

And the neurological responses of the abused children were dramatically stronger, and the alarm bells in their brains lasted much longer.

Their deeply conditioned fear of a specific, abusive adult's anger had generalized to a generic, digital representation of an angry face.

Their brains were overgeneralizing the threat.

And the necessary counterbalance to generalization is discrimination.

This is the learned ability to distinguish between a conditioned stimulus that predicts an event and other similar but irrelevant stimuli.

It's the ability to see the nuance.

If you are walking down an alley at night and you are confronted by a loose guard dog bearing its teeth and growling, your heart is going to race.

You experience a fear response.

But if you are walking in a park and you see a guide dog in a harness calmly leading its owner, your heart will not race.

Both are dogs.

But you have learned to discriminate between the two stimuli because they predict vastly different consequences.

So to recap, those are the five foundational pillars mapped out by Pavlov acquisition, higher order conditioning, extinction, spontaneous recovery, generalization, and discrimination.

Now Pavlov and Watson were so confident in these findings that they believed these rules applied perfectly, mechanically, and universally to every animal, every human, and every situation.

They thought they had found the ultimate equation for behavior.

But here is where the story takes a fascinating turn.

Because as groundbreaking as Pavlov and Watson were, their dogmatic insistence on strict behaviorism created some massive blind spots.

They explicitly, almost aggressively, ignored cognition, meaning thoughts, perceptions, and expectations.

And they severely underestimated the power of evolutionary biology.

So let's explore how modern psychology pushed back against strict behaviorism.

Let's start with the cognitive pushback.

A strict behaviorist believed that learning was just a mindless, automatic stamping in of associations.

The animal is just a biological machine reacting to inputs.

But a researcher named Martin Seligman proved that animals are actually processing complex expectations about their reality.

This is the learned helplessness research, right?

Yes.

Seligman conducted an experiment where dogs were strapped in a harness and subjected to repeated electric shocks.

Crucially, in this first phase, there was absolutely nothing the dogs could do to avoid or escape the shocks.

The shocks were completely inescapable.

So no matter how much they struggled, the pain came anyway?

Correct.

Later, these same dogs were placed in a completely different environment, a shuttle box, where they could easily escape the shocks simply by leaping over a small, low hurdle to the other side.

But shockingly, the dogs didn't even try to jump.

They just lay down, cowered in fear, and passively took the shocks.

Why?

I mean, they had the physical ability to escape?

Because they had learned a cognitive expectation.

During the harness phase, they learned that their actions had no effect on their environment.

They had developed a sense of helplessness.

Their cognition, their internal belief was that they had no control, so they simply gave up.

Psychologists call this passive resignation learned helplessness, and it totally shattered the behaviorist idea that animals don't think or expect.

And we see this cognitive override in human therapies, too.

Imagine a treatment center trying to cure alcohol dependency using strict classical conditioning.

The therapists give the patient alcohol, but they spike it with a powerful drug that causes severe agonizing nausea.

The behaviorist theory is that the alcohol, the conditioned stimulus, will get paired with the vomiting, the unconditioned response, creating a lifelong aversion to alcohol.

It seems logical based on Pavlov's rules.

But it frequently fails.

Why?

Because human cognition gets in the way.

The patient's conscious mind knows that the nausea is being caused by the hidden drug, not the alcohol itself.

Because they are aware of the trick, the cognitive awareness severely weakens the association.

Even in a foundational process like classical conditioning,

the organism's thoughts matter.

Okay, so that's the cognitive limit.

Now let's look at the biological pushback.

Because in the mid -1950s, behaviorists were so arrogant about their theories that researchers like Gregory Kimball confidently declared that basically any response could be conditioned to any stimulus in any animal.

It didn't matter if it was a rat, a pigeon, or a person.

That was the prevailing dogma.

But 25 years later, Kimball had to humbly retract that statement, acknowledging that hundreds of new studies had proven him completely wrong.

The most famous and devastating of these studies was conducted by a researcher named John Garcia.

John Garcia's rats.

This is a legendary study that basically broke two of Pavlov's cardinal rules simultaneously.

Garcia wasn't even looking for a psychological breakthrough.

He was studying the physical effects of radiation on rats.

But he noticed a strange behavioral quirk.

The rats were actively avoiding drinking water from the plastic bottles inside the radiation chambers.

He wondered if classical conditioning was at play.

Did the rats subconsciously associate the plastic -tasting water with the nausea caused by the radiation?

To test this hypothesis, Garcia and his colleague Robert Kohling set up an incredibly clever experiment.

They gave rats three different types of novel stimuli, a novel flavor to taste, a novel sight to see, or a novel sound to hear.

Then they exposed the rats to radiation or drugs that made them severely nauseous and vomit.

And the results were completely startling to the behaviorist establishment.

First, the rats learned to avoid the novel flavor, even if they didn't get sick until hours after tasting it.

This was a massive blow to Pavlov's rule of acquisition, which insisted that the conditioned stimulus must be followed by the unconditioned stimulus almost immediately, usually within half a second.

Right.

Garcia proved that for taste aversion, the brain can link a cause and effect across a gap of several hours.

But the second finding was even more destructive to the behaviorist worldview.

The sickened rats only developed an aversion to the tastes.

They did absolutely not develop an aversion to the novel sights or the novel sounds, even though those sights and sounds were present right before they got sick.

This directly contradicted the behaviorist claim that any perceivable stimulus could serve as a conditioned stimulus.

But if you stop thinking like a strict laboratory behaviorist and start thinking like an evolutionary biologist, Garcia's findings make perfect, elegant sense.

How does a rat in the wild identify tainted, poisonous food?

It tastes it.

It doesn't identify poison by what it sounds like or what color it is.

Therefore, over millions of years of evolution, the rat's brain has been biologically hardwired, prepared to associate taste with nausea.

It is a biological imperative.

And humans are wired the exact same way.

If you go to a new seafood restaurant, eat some contaminated oysters, and become violently ill four hours later in the middle of the night, you are going to develop a severe, long -lasting aversion to the taste and smell of oysters.

You are not going to develop an aversion to the music that was playing in the restaurant, or the color of the tablecloth, or the friends you were eating with.

Our biology dictates what we are capable of associating.

Conversely, look at birds.

Birds hunt primarily by sight, not by smell or taste.

So research shows that birds are biologically primed to develop aversions to the sight of tainted food.

You can't condition them to avoid a taste as easily.

Every species is heavily constrained by its unique evolutionary biology, prepared to learn only those specific associations that enhance its survival.

There is actually a fascinating study regarding human biological predispositions conducted by Elliot and Yesta.

They hypothesize that humans, specifically men, are naturally biologically disposed to learn associations between the color red and female sexuality.

And this isn't just cultural conditioning, it's rooted in primate biology.

Female primates visibly display red swellings when nearing ovulation to signal fertility, and human females experience enhanced blood flow blushing during flirtation and sexual excitation.

So yes, our culture frequently pairs red with Sexing of Valentine's Heart's Red Light District's Red Lipstick.

But Elliot and Yesta's experiments showed that, completely without men's conscious awareness, the presence of the color, red naturally and reliably enhances their attraction to women in photographs.

We are biologically prepared, deep in our primate brains, to make that specific association.

And yet, even acknowledging these cognitive and biological constraints, classical conditioning remains an incredibly potent force in human life, especially in medical and clinical settings.

Take a patient undergoing chemotherapy.

The patient understandably feels severe nausea, the unconditioned response from the toxic chemotherapy drugs, which are the un -renditioned stimulus.

But over a series of treatments, the clinic's waiting room itself becomes a conditioned stimulus.

The environment takes on the power of the drug.

Exactly.

Just walking into that specific waiting room, smelling the antiseptic, or seeing the triage nurses can provoke intense conditioned nausea and anxiety before the drugs are even administered.

We see the exact same mechanism driving the tragedy of drug relapse.

Former drug users who have gotten clean often experience sudden, intense physical cravings when they return to their old drug -using context.

The specific people, the specific street corner, or the specific room they associate with their previous highs act as powerful conditioned stimuli.

The environment itself is screaming at the brain to expect the drug.

This is exactly why addiction counselors emphasize that recovering addicts must completely change their environment to survive.

You really can't discuss real -world classical conditioning without addressing one of the most famous and frankly most ethically disturbing experiments in the history of psychology, the case of Little Albert.

Right.

This takes us back to John B.

Watson, the founder of behaviorism.

Watson wanted to prove that complex human emotions, specifically fears and phobias, weren't the result of deep subconscious Freudian conflicts.

He believed fears were just a bundle of simple conditioned responses.

So in 1920, he and his graduate student, Rosalie Rayner, acquired an 11 -month -old infant known in the literature as Little Albert.

Like most healthy babies, Albert was relatively fearless.

He wasn't afraid of white rats.

In fact, when they placed a white rat in front of him, he happily reached out to pet it.

It was a neutral stimulus.

But Albert, like all humans, had a natural, unconditioned, startle reflex to loud noises.

So Watson and Rayner set up their conditioning trial.

They presented the white rat to Albert, but the exact moment the baby reached out to touch the animal, Watson stood right behind the baby's head and violently struck a heavy steel bar with a hammer.

The terrifying deafening noise was the unconditioned stimulus, which immediately caused Albert to exhibit the unconditioned response.

He burst into tears and exhibited intense fear.

They repeated this brutal sequence just seven times, rat bang cry, rat bang cry.

And sure enough, after only seven pairings, the sight of the white rat alone, now a highly charged conditioned stimulus, caused Little Albert to break down in terrified tears.

And the conditioning didn't stop there.

Five days later, Albert demonstrated massive generalization.

He reacted with intense, conditioned fear to a rabbit, a dog, and even a sealskin coat.

He didn't react to dissimilar objects like wooden toys showing some discrimination, but Watson had successfully and permanently classically conditioned a profound phobia into a human child.

It's an incredibly tough story to read, and it's a study that would be unequivocally banned by every ethical review board in the world today.

But the historical footnote to this story is absolutely wild.

Shortly after this experiment, Watson's career imploded.

He had an affair with his assistant, Rosalie Rainer.

It became a massive public scandal, and he was fired from his prestigious professorship at Johns Hopkins University.

He completely left academia.

But he didn't stop using behaviorism.

He took his deep knowledge of associative learning and joined the massive J.

Walter Thompson advertising agency in New York, where he essentially revolutionized modern advertising.

He used behavioral conditioning to invent the concept of the Maxwell House coffee break.

He realized that if you pair a product, coffee, with a reliable, comforting pause in the stressful workday, the coffee becomes a conditioned stimulus for relaxation.

He fundamentally shaped American consumer culture using the exact same principles he used on Little Albert.

It is a profound legacy, for better or worse.

And that brings us to a major pivot in our deep dive.

Up until now, we have been exclusively discussing classical conditioning.

But there is a massive limitation to classical conditioning.

Right.

Classical conditioning is entirely about things happening to you.

It forms associations between stimuli that you don't control, leading to automatic respondent behavior like a dog drooling, or a person flinching, or a baby crying.

But what if you want to teach a complex voluntary action?

What if you want to teach an elephant to walk on its hind legs?

Or a child to politely say, please and thank you.

You cannot classically condition an elephant to walk on its hind legs by pairing a bell with food.

For intentional voluntary behavior, we have to explore an entirely different mechanism, operant conditioning.

In operant conditioning, organisms aren't just reacting to the environment, they are learning to associate their own actions with consequences.

Behaviors that are followed by reinforcers increase in frequency.

Behaviors followed by punishers decrease.

The animal is operating on the environment to produce a specific result.

The undisputed titan of operant conditioning is B .F.

Skinner.

But to understand Skinner, we briefly have to acknowledge the man whose work he built upon, a psychologist named Edward Thorndyke, who formulated the law of effect.

Thorndyke was the guy with the cats, right?

Yes.

In the late 1800s, Thorndyke designed these elaborate puzzle boxes for cats.

He would take a hungry cat and place it inside the box.

Outside the box, in plain sight, he would place a piece of fish.

To get out, the cat had to figure out how to manipulate a specific mechanism like pressing a lever and pulling a string to unlatch the door.

Thorndyke meticulously recorded exactly how long it took the cat to escape on successive trials.

And if you look at the graph of Thorndyke's data, the escape time drops dramatically over a series of trials.

The first time, the cat is just frantically thrashing around the box.

By pure accident, it steps on a lever, the door pops open, and it gets the fish.

The next time you put the cat in, it thrashes a little less and hits the lever a little sooner.

Eventually, you put the cat in, and it immediately walks over and presses the lever.

This simple observation led Thorndyke to formulate the law of effect, which states that rewarded behavior is more likely to be repeated, and punished behavior is less likely to be repeated.

It sounds like common sense, but it is the absolute fundamental principle of operant conditioning.

B .F.

Skinner took Thorndyke's basic concept and turned it into an ultra -precise science.

He built a much more sophisticated testing environment, which became famously known as the operant chamber, or the Skinner box.

Inside a Skinner box, you have a highly controlled environment.

There is a bar or a key that an animal, usually a rat or a pigeon can, press or peck.

Pressing the bar triggers a dispenser that releases a reward, like a food pellet or a drop of water.

The genius of the Skinner box is that it is hooked up to electronic devices that continuously and automatically record every single response the animal makes, allowing for massive, precise data collection without the researcher even needing to be in the room.

But here's the practical problem.

You can't just drop a naive rat into a Skinner box and wait for it to magically know how to press a bar.

A rat might wander around that box for days without ever accidentally pressing the tiny lever.

You have to actively train it.

And Skinner developed a brilliant method for this, which is called shaping.

Shaping relies on a process called successive approximations.

You carefully observe the animal's natural behavior and gradually guide it.

If your goal is to get the rat to press the bar, you don't wait for the final behavior.

First, you drop a food pellet when the rat simply turns to face the bar.

The rat learns, OK, looking this way gets me food.

Once it does that reliably,

you raise the criteria.

You stop rewarding it just for looking.

Now, you only give it food when it takes a step toward the bar.

You are demanding a closer approximation.

Exactly.

Then, you only reward it when it physically touches the bar with its nose.

Finally, you withhold all rewards until it actually presses the bar down.

You have shaped a complex behavior by reinforcing a chain of tiny successive steps.

It's how animal trainers do almost everything.

Think of a marine biologist training a massive manatee in a pool.

They use a target stick, a long pole with a buoy on the end.

They start by rewarding the manatee just for swimming near the stick.

Then, they reward it only when it bumps the stick with its snout.

Eventually, they can guide the 1 ,000 -pound animal through complex medical examinations just by moving the target stick, shaping its movements step by step.

Skinner used shaping to teach pigeons to perform absolutely incredible cognitive feats.

Because shaping allows you to train an animal to respond to one specific stimulus and ignore another, you can test the limits of their perception.

Skinner actually shaped pigeons to peck a key when they were shown a picture of a human face, but not when they were shown a picture without a face.

The pigeons essentially learn to categorize the abstract concept of a human face.

They learn to distinguish between categories like cars versus chairs.

It shows how unbelievably powerful this mechanism is.

But operant conditioning isn't just a parlor trick for pigeons and manatees.

We are using it on each other every single day, often completely unconsciously.

Think of the dynamic between exhausted parents and a crying baby.

The baby wakes up screaming in the middle of the night.

The parents rush in, pick the baby up, rock it, and soothe it.

And from the baby's perspective, what just happened?

The behavior crying produced a highly desirable consequence of tension and comfort.

The crying has been reinforced.

Exactly.

There's a comic strip where a baby is thinking to himself, I'll have to wake up crying in the middle of the night more often.

The parent thinks they are soothing the child, but the baby is actually shaping the parent's behavior.

Who is really training whom in that scenario?

It's a reciprocal process.

And to really analyze who is training whom, we have to define our terms carefully.

We need to distinguish between the different types of reinforcement.

A reinforcer is defined simply as any event that strengthens the behavior it follows.

But there are two distinct types, positive reinforcement and negative reinforcement.

And this right here is where so many people get confused.

Let's make this absolutely crystal clear.

Positive reinforcement is intuitive.

You add a desirable stimulus to increase a behavior.

A dog sits, it gets a treat.

An employee finishes a project, they get a bonus.

A child cleans their room, they get praise.

The behavior is strengthened by adding something good.

But negative reinforcement is widely misunderstood.

People constantly use the term negative reinforcement when they actually mean punishment.

Let's be definitive.

Negative reinforcement is not punishment.

To understand it, you have to think of positive and negative in mathematical terms, not emotional terms.

Positive means adding something.

Negative means subtracting or removing something.

Precisely.

Negative reinforcement strengthens a behavior by removing or subtracting an aversive, unpleasant stimulus.

The consequence of the behavior is relief.

The ultimate everyday example is a headache.

You have a terrible pounding headache.

You engage in a behavior, you take an aspirin.

Fifteen minutes later, the headache goes away.

The behavior, taking the pill, removed the bad thing, the pain.

That relief negatively reinforces the behavior of taking aspirin.

You are highly likely to take aspirin the next time your head hurts.

Another perfect example is your morning alarm clock.

The alarm is a harsh, highly aversive sound.

You reach over and hit the snooze button and the sound instantly stops.

That welcome result, the sudden silence and the ability to sleep for ten more minutes, is powerful negative reinforcement.

It significantly increases the odds that you will hit the snooze button again tomorrow morning.

We also see the tragic power of negative reinforcement in severe drug addiction.

When a heroin addict goes into withdrawal, they experience agonizing physical pain and psychological terror.

When they inject the drug, those horrific withdrawal symptoms instantly vanish.

That profound relief is a massive negative reinforcement.

They are not chasing a high anymore.

They are desperately repeating the behavior to remove the punishment of withdrawal.

Understanding that distinction is crucial.

Furthermore, we must distinguish between primary and secondary reinforcers.

Primary reinforcers are unlearned.

They are innately satisfying biological needs.

Getting food when you are starving or experiencing pain relief when you are hurt are primary reinforcers.

Secondary reinforcers, which are also called conditioned reinforcers, have no innate biological value.

They only get their power because we have learned to associate them with primary reinforcers.

If a rat in a skinner box learns that a green light turning on means food is about to drop, the green light becomes a secondary reinforcer.

The rat will actually work just to turn on the green light.

For human beings, the ultimate secondary reinforcer is money.

A physical dollar bill has no innate biological value.

You cannot eat it to survive.

But because we have learned that in the exchange for food, shelter and security, it becomes a tremendously powerful motivator.

There was actually a fascinating European study that explored how deeply interconnected our primary drives and our secondary reinforcers really are.

The researchers hypothesized that if our desire for money is ultimately derived from our biological desire for food, then making someone physically hungry might actually make them more money hungry as well.

And the results confirmed it.

They found that people who were hungry were significantly less likely to donate money to charity than people who had just eaten.

They even found that participants who were simply sitting in a room permeated with hunger arousing aromas, like the smell of baking bread, were much less likely to share their cash with a partner.

The primary biological drive for food subconsciously bled over into an increased stinginess with their secondary reinforcers.

Our biology is always running in the background.

The final major concept recording reinforcement is the timing of the consequences, immediate versus delayed reinforcement.

This is a major area where humans differ from rats.

If a rat in a skinner box presses the bar, the food pellet must arrive immediately.

If there is a delay of even 30 seconds between the bar press and the food,

the rat won't learn the association.

It will likely associate the food with whatever random movement it made in the 29th second.

But humans have the cognitive capacity to respond to vastly delayed reinforcers.

We can work an exhausting 40 -hour week totally unrewarded in the moment because we know a paycheck is coming on Friday.

A teenager can study for four years to get a college acceptance letter.

We have the unique ability to delay gratification.

However, we are far from perfect at it.

Our evolutionary hardware still heavily biases us toward immediate payoffs.

Small immediate consequences frequently overpower massive delayed consequences.

Oh, constantly.

Staying up until 2 a .m.

watching random YouTube videos offers the immediate tiny gratification of entertainment.

That immediate reward frequently overpowers the delayed much larger reward of feeling rested and energetic the next morning.

We see this in major societal issues as well.

The immediate gratification of risky unprotected sex often prevails over the delayed rational consideration of safe sex.

The immediate convenience and status of driving a massive gas guzzling vehicle frequently prevails over the delayed abstract consequence of global climate change.

Our brains are stubbornly hardwired to value the bird in the hand over the two in the bush.

Which is a brilliant transition into our next area of exploration because how and when a payoff is delivered radically fundamentally changes how an organism behaves.

Skinner spent years obsessively studying reinforcement schedules.

Skinner compared continuous reinforcement where reward is given every single time the behavior occurs with partial or intermittent reinforcement.

He mapped out four distinct schedules of partial reinforcement and they produce wildly different patterns of behavior.

Let's visualize these four schedules because they explain so much about our daily routines and our addictions.

The first is the fixed ratio schedule.

This schedule reinforces behavior after a set predictable number of responses.

The classic real world example is a coffee shop loyalty punch card.

You buy exactly 10 coffees.

You get the 11th one free.

The ratio of behavior to reward is fixed.

When you put an animal on a fixed ratio schedule in the lab, you see a very specific pattern.

After they receive the reward, they pause briefly and then they return to a very high frantic rate of responding to get to the next fixed number as fast as possible.

They are grinding to the next level.

Now compare that to the second schedule, the variable ratio schedule.

This provides reinforcers after an unpredictable random number of responses.

You never know exactly how many times you have to pull the lever to get the payoff.

This is the schedule that builds Las Vegas.

Slot machines are pure variable ratio conditioning.

You might win the jackpot after your first pull or it might take 500 pulls.

You never know.

Fly fishing operates on the exact same psychological principle.

You cast your line over and over and over, completely blind to which specific cast will result in a strike.

Because the reinforcers generally increase as the number of responses increases, but the exact timing is a total mystery.

A variable ratio schedule produces incredibly high persistent rates of responding.

The behavior becomes highly addictive and incredibly difficult to extinguish.

The animal or the gambler always thinks maybe the very next pull is the one.

Then we shift from ratios, which are based on the number of physical responses, to intervals, which are based on the passage of time.

The third schedule is fixed interval.

This reinforces the first response only after a set predictable period of time has passed.

Imagine you are making jello.

You put it in the fridge and you know it takes exactly four hours to set.

You don't check the fridge constantly during the first hour.

But as the clock ticks closer to that four hour mark, you start opening the fridge more and more frequently.

On a data graph, this produces a choppy stop start pattern of behavior.

Responding accelerates only as the antizominal time approaches.

Finally, we have the variable interval schedule.

This reinforces the first response after an unpredictable random amount of time has passed.

You don't know when the reward is coming, only that it will eventually.

This is exactly what it feels like to wait for an incredibly important email or text message.

You don't know if the reply will arrive in two minutes, two hours, or two days.

Because there is no way of knowing when the waiting period will end.

This schedule produces a slow, steady,

relentless rate of responding.

You just keep checking your phone over and over at a steady pace.

Skinner was deeply, profoundly obsessed with these schedules.

He famously reviewed thousands of data charts and claimed that it didn't matter what species you were testing.

He said, catch a given species in the act and you cannot tell whether it is pigeon, rat, monkey, or human.

Behavior shows astonishingly similar properties.

According to Skinner, the schedule dictates the behavior.

Universally.

Free will is an illusion, the schedule is everything.

Now up to this point, we've only been talking about reinforcement strategies designed to increase the frequency of a behavior.

But the dark side of operant conditioning is punishment consequences designed to decrease the frequency of a preceding behavior.

And just like reinforcement, punishment can be positive or negative.

Positive punishment means adding an aversive stimulus to stop a behavior.

A rat pressing a lever receives a painful electric shock.

A child running into the street receives a harsh spanking.

You're adding pain to stop the action.

Negative punishment, on the other hand, means withdrawing a desirable stimulus.

A teenager breaks curfew so their parents take away their driving privileges for a week.

A patron at a library refuses to pay their fines, so the library revokes their borrowing card.

You are subtracting something good to stop the bad behavior.

But we need to pause here because the psychological community has spent decades analyzing the efficacy of punishment, particularly the use of physical positive punishment on children like spanking.

And the vast consensus highlights four severe drawbacks to this approach.

First, and perhaps most importantly, punished behavior is usually suppressed, not forgotten.

This creates a dangerous illusion for parents.

A child throws a tantrum, the parent spanks them, the child cries, but stops the tantrum.

The parent's behavior of stanking is negatively reinforced because the annoying tantrum stopped.

The parent thinks, ah, it worked.

But the child hasn't actually unlearned the underlying desire to throw the tantrum, they're just temporarily suppressing it out of fear.

Second, punishment teaches discrimination.

The child quickly learns the specific context where the punishment occurs.

A child might learn that swearing around their parents results in a spanking.

So they don't unlearn swearing, they simply learn to discriminate.

They become highly proficient at swearing when their parents aren't around.

Third, physical punishment teaches fear.

Through the mechanisms of classical conditioning we discussed earlier, a child can begin to associate fear and anxiety, not just with the undesirable behavior, but with a specific person delivering the punishment.

This can lead the child to actively avoid the punishing parent altogether, damaging the relationship.

And fourth,

physical punishment may inadvertently increase aggressiveness by modeling aggression as a legitimate way to cope with problems.

The child observes that when the parent is frustrated, they use physical force to achieve compliance.

Studies frequently show that abusive parents often come from abusive families.

The behavior is modeled across generations.

Now, we do have to add a critical scientific caveat here regarding the research on spanking.

It is the classic correlation versus causation problem.

Studies consistently show that spanked children are at a higher risk for aggression, depression, and low self -esteem later in life.

But some critical researchers ask, is it a chicken and egg scenario?

Does the physical spanking cause the aggression?

Or do children with innate pre -existing severe behavioral problems simply draw more exasperated physical punishment from their parents?

It is a deeply complex dynamic.

However, looking at the totality of the behavioral evidence, Skinner himself strongly opposed punishment.

He believed it simply taught organisms how to avoid the punisher.

The consensus of almost all psychologists today is to emphasize reinforcement over punishment.

Actively notice people behaving well and reward them for it rather than waiting for them to fail and striking them.

This debate over how we process consequences naturally moves us into our next major topic, cognition in operant conditioning.

And this is an area where B .F.

Skinner received intense pushback from the broader scientific community.

Skinner harbored a deep, almost visceral hatred for cognitive science.

He believed that talking about internal expectations, mental maps, or thoughts was a massive step backward for the hard science of psychology.

He died in 1990, and right up to his final days, he vehemently resisted the idea that cognition belonged in the study of conditioning.

But the evidence for internal mental processes just kept piling up.

Ironically, it was visible even in his own experiments.

Think back to the fixed interval schedule, the jello in the fridge.

The animals were responding more and more frequently as the time approached for the reward.

Even though Skinner refused to use the word, the animals were clearly behaving as if they expected the reward was coming.

Expectation is a cognitive process.

And the definitive blow to strict operant behaviorism came from the phenomenon of latent learning.

Researchers conducted experiments where they placed rats in a complex maze.

But crucially, they provided absolutely no food reward for finding the end of the maze.

The rats just wandered around sniffing and exploring for 10 straight days.

According to strict Skinnerian behaviorism, since there was no reinforcement,

absolutely no learning should be taking place.

The rats were just aimlessly walking.

But they weren't.

They were actually building a complex cognitive map of the maze in their minds.

On day 11, the researchers finally placed a food reward at the end of the maze.

And instantly, the rats ran straight to the end, navigating the blind alleys perfectly, just as fast as a control group of rats who had been rewarded every single day.

They had learned the layout of the maze lately without any reinforcement.

But they only demonstrated that learning when there was a sudden motivation to do so.

They had a mental map.

We also see the undeniable power of cognition in the phenomenon insight.

Sometimes learning doesn't happen through a slow plotting process of trial and error shaping.

Sometimes it happens in a sudden brilliant flash of understanding.

There's a wonderful real world story about a 10 year old boy named Johnny Appleton.

A group of adult construction workers were totally stumped.

A young Robin had fallen to the bottom of a very narrow 30 inch deep hole in a cement block wall.

The workers couldn't reach their hands in and they couldn't grab the bird with tools without crushing it.

They were giving up.

Johnny walked up, looked at the situation and solved it instantly.

He told the workers to grab some dry sand and slowly pour it down the sides of the hole.

As the sand piled up, the bird naturally stepped up to stay on top of it.

They kept pouring and the bird essentially rode the rising elevator of sand right to the top of the block where they grabbed it.

Johnny didn't use behavioral shaping or trial and error.

He perceived the entire solution in a sudden cognitive flash of insight.

Cognition also plays a massive complicated role in motivation.

Psychologists draw a sharp line between extrinsic motivation and intrinsic motivation.

Extrinsic motivation is doing something solely to get an external reward or avoid an external punishment.

Like a student studying a subject they hate just to get an A and avoid being grounded.

Intrinsic motivation is entirely different.

It is doing something purely for its own sake because you find it inherently interesting, satisfying or challenging.

And here is the fascinating counterintuitive twist that completely undermines strict behaviorism.

Providing excessive external rewards can actually destroy intrinsic motivation.

It is called the over justification effect.

Let's say you have a child who loves reading fantasy novels just for the pure joy of the story.

They have high intrinsic motivation.

Then the parents decide to encourage this by implementing a reward system, paying the child five dollars for every book they finish.

What happens?

The child starts reading faster, maybe picking shorter books just to get the cash.

They start to view reading as a chore, a job they're being paid to do.

The external reward has effectively killed their natural internal desire.

When the money stops, the reading stops.

Despite Skinner's stubborn blind spot for these cognitive realities, his operant principles still have incredible, undeniable real world applications.

He actually envisioned a radical transformation for the education system.

He dreamed of teaching machines,

computers that could case educational material perfectly to each individual student's unique rate of learning.

He hated the traditional classroom model where one teacher tries to lecture 30 kids simultaneously, leaving the fast learners bored and the slower learners hopelessly confused.

Skinner imagined a computer program that provides small chunks of information, asks a question and provides immediate right or wrong feedback, reinforcing correct answers step by step.

He essentially predicted modern interactive software apps like Duolingo decades before the technology existed.

We can also directly apply operant conditioning to our own lives for self -improvement.

If you want to build a better habit, you can use a four -step behavioral plan.

First, explicitly state your goals in measurable terms.

Second,

monitor how often you currently engage in the desired behavior.

Third,

reinforce the desired behavior.

Give yourself a tangible reward, but only after you actually finish the task.

And fourth, gradually reduce the external rewards as the behavior becomes an intrinsic habit.

This principle of taking control of our own conditioning extends even to our physiology through biofeedback.

This is an incredible system of recording, amplifying and feeding back information about subtle physiological responses.

Imagine a person sitting in a chair looking at a digital dial that is precisely measuring their own blood pressure or the micro tension in their forehead muscles.

By seeing the immediate visual consequence of trying to relax their breathing, they can literally learn to consciously control bodily functions that we used to think were completely involuntary.

So to summarize the first two massive pillars of our discussion, Classical conditioning is all about associating different external stimuli to produce automatic respondent behavior.

Operant conditioning is all about associating our own voluntary actions with consequences to shape our operant behavior.

But there is a third pillar of learning, and it might be the most uniquely human of them all.

We don't just learn by having things happen to us or by fumbling around through trial and error, because if we did, we'd probably be dead.

We don't have to actually put our bare hand on a glowing hot stove to learn that it burns.

We don't have to step off a cliff to learn about gravity.

We can learn simply by watching someone else.

If we see someone touch a hot stove and scream in agony, we learn the lesson without ever enduring the blisters.

We learn by observing and imitating others, a profound process known as observational learning or modeling.

And while humans are the masters of this, we aren't the only ones who do it.

We see observational learning throughout the animal kingdom.

Recess macaque monkeys are generally known to be a fairly aggressive species.

But researchers found that if you take young macaques and raise them with older, more forgiving monkeys,

the young macaques actually learn to forgive and make up after fights.

They model the peacemaking behavior.

Chimpanzees do this as well.

They learn complex tool use like stripping leaves off a branch and using it to fish for termites deep in a mound, purely by watching older, experienced chimps perform the action.

And human infants are observational sponges.

Studies have shown 14 -month -old babies imitating specific acts they saw on a television screen.

The old saying monkey see monkey do is vastly more accurate as children see children do.

And we now actually understand the neurological hardware that makes this possible.

In the 1990s, brain scans revealed a fascinating specific type of neuron located in the frontal lobe, which researchers dubbed mirror neurons.

These neurons fire when we perform an action.

But remarkably, they also fire when we simply watch someone else perform that exact same action.

It's as if our brain is generating a highly realistic, silent inner simulation.

If I want you to take a sip of a glass of water, the specific neurons in my brain that would normally command my own arm to reach out and drink water are lighting up.

My brain is practicing the movement internally.

Exactly.

Mirror neurons are believed to be the biological foundation for human empathy and our theory of mind, our crucial ability to infer another person's mental and emotional state.

When you see a loved one in physical pain, brain scans show that it triggers activity in the exact same pain regions in your own brain.

You quite literally feel their pain.

This simulation is so powerful that even just quietly reading a fictional story triggers intense mental simulation in the brain's motor cortex.

Some researchers have even pointed out a potential link between mirror neurons and autism spectrum disorder.

They suggest that some individuals with autism may display reduced mirror neuron activity, a phenomenon they describe as broken mirrors.

This neurological difference might help explain some of the well -documented challenges with imitating behaviors and intuitively reading complex social cues.

But if we want to talk about the absolute titan of observational learning, we have to discuss the psychologist Albert Bandura.

His work permanently altered how we view the media in childhood development.

And we must discuss his legendary Bobo doll experiment.

It is a masterclass in illustrating exactly how powerfully and how quickly imitation shapes human behavior.

Okay, let's set the scene for this experiment because the details are what make it so chilling.

A preschool -aged child is brought into a room and sits at a table quietly drawing pictures.

On the other side of the room, an adult is playing with a Tinker toy set.

Suddenly, the adult stands up and aggressively turns toward a five -foot tall inflated Bobo doll.

For ten straight minutes, the adult viciously attacks the doll.

They punch it, they hit it with a wooden mallet, they kick it across the room, and while they are attacking it, they shout very specific aggressive phrases like, Sock him in the nose and hit him down.

It's a shocking display for a child to watch.

After observing this violent outburst, the child is taken out of that room and placed into another room full of highly appealing, wonderful toys.

But after just a few minutes of playing, the experimenter purposely frustrates the child.

The experimenter interrupts them and says, Actually, these are my best toys.

I'm saving them for the other children to play with.

The child is now highly frustrated and irritated.

They are then moved to a third room which contains only a few broken or unappealing toys and a Bobo doll.

And the results were stark.

Compared to a control group of children who had not witnessed the adult's violent outburst, the children who had observed the aggressive model were vastly more likely to physically lash out at the doll.

But the most striking part wasn't just that they were generally aggressive.

It was the precision of the imitation.

They didn't just push the doll.

They imitated the exact violent actions they had seen.

They sought out the wooden mallet and struck the doll with it.

And they shouted the exact same specific phrases, Sock him in the nose, that they had heard the adult use.

They hadn't just been riled up.

They had perfectly modeled the specific behaviors of violence.

This experiment forced psychology and society at large to confront the profound impact of the models we are constantly exposed to.

The good news is that observational learning works beautifully for pro -social models, those demonstrating positive, helpful,

constructive behavior.

If parents are avid readers and keep books in the home, their children are significantly more likely to become readers.

Historical figures like Mahatma Gandhi or Martin Luther King Jr.

modeled nonviolent civil action, prompting millions of followers to imitate that specific approach to create massive social change.

But those models have to be genuinely consistent.

Many parents operate on a deeply flawed, do -as -I -say -not -as -I -do policy.

If a parent constantly lectures their child about the importance of being honest, but the child consistently watches the parent lie to neighbors or fake illnesses to get out of social obligations, the child doesn't learn honesty.

The child actually learns to imitate the hypocrisy.

They will learn to say that you should be honest while actively continuing to lie.

They model the contradiction.

And the deeply troubling news is that anti -social models have devastatingly powerful anti -social effects.

It is a tragic reality that abusive parents frequently raise aggressive children.

And to prove that this isn't simply a matter of shared aggressive genetics,

studies on animals have shown that young monkeys who are separated from their aggressive mothers and subjected to high levels of aggression from surrogate models grow up to be highly aggressive themselves.

The environment models the behavior.

Which brings us to the ultimate ongoing societal debate surrounding observational learning TV and media violence.

The statistics are absolutely staggering.

During the late 20th century, researchers calculated that the average child viewed 8 ,000 televised murders and 100 ,000 other acts of violence before they even finished elementary school.

And it isn't just the sheer overwhelming quantity of the violence.

It's the specific context in which it is portrayed.

Studies analyzing television programming note that in these violent shows,

nearly 75 % of the violent acts went completely unpunished.

Almost 60 % of the violent scenes did not show the victim experiencing any realistic pain.

Nearly half the incidents involved violence that was explicitly portrayed as justified.

And nearly half involved a highly attractive, charismatic perpetrator.

If you look at that through the lens of Bandura's Gobo doll experiment, it is the absolute perfect recipe for the violence viewing effect.

You have an attractive model performing a justified action, receiving no punishment, and causing no visible pain.

It practically begs for imitation.

But the big rigorous question in psychology is always correlation versus causation.

We know there is a correlation between heavy viewing of media violence and aggressive behavior.

But does watching violent TV cause kids to be aggressive?

Or do kids who already have innately aggressive personalities simply prefer to watch violent TV?

To definitively pin down causation, you have to move away from observational studies and conduct controlled experiments.

Psychologists have done this.

They randomly assign groups of kids to watch violent programming while other equivalent groups watch non -violent programming.

And the consensus from the scientific research community is remarkably clear.

Exposure to violence on television and in media does directly lead to increase in aggressive behavior in the children and teenagers who watch it.

And beyond direct imitation, chronic exposure to media violence causes deep psychological desensitization.

A study by Edward Donnerstein demonstrated this chilling effect.

He exposed adult participants to a graded series of violent scenes over several days, moving from simple fistfights to killings to graphic slasher movies.

He found that prolonged exposure literally fosters an emotional indifference to brutality.

They became completely numb to it.

There is a deeply unsettling photograph commonly shown in psychology lectures.

A young boy, maybe seven years old, standing in a dark arcade.

He is dual wielding two plastic assault rifles firing away at a screen and his face is completely entirely blank.

There is no shock, no stress, no excitement.

Just a deadpan stare.

Watching chronic cruelty fundamentally alters our emotional baseline.

It creates a terrifying indifference.

And that sober reality brings us to the end of our deep dive into the architecture of learning.

We have covered an immense amount of intellectual ground today, driven by the profound paradigm shifting legacy of four true pioneers.

Yvonne Pavlov showed us the sheer power of association, proving how our biology and our environment link together to form our most basic expectations.

John B.

Watson, flawed as he was, forced psychology to look at objective behavior, trying to build the entirety of human emotion on those simple associative links.

B .F.

Skinner took the next leap, proving with mathematical precision how the consequences of our actions shape our voluntary behavior, for better or worse.

And finally, Albert Bandura showed us that we are not isolated individuals learning only from our own pain and pleasure, but that we are deeply social creatures, constantly absorbing the world around us just by watching.

Together, they prove that learning isn't just about memorization.

It is our ultimate most vital survival tool.

It is the very engine of our adaptability.

I want to leave you with a final thought to mull over as you go about your day.

Take out your smartphone and just look at the layout of the apps on your home screen.

Think about the little red notification badges.

Are those red dots acting as classical condition stimuli, making your heart rate spike slightly with anticipation before you even open the app?

Or think about the endless feed of a social media app.

Are the variable ratio totally unpredictable rewards of endless scrolling, actively shaping your operant behavior, keeping you swiping like a gambler at a slot machine?

Who is the pigeon in the Skinner box today?

Are you the one doing the shaping, or are you the one being shaped?

The schnook salmon might be trapped by its genetics, but at least it isn't trapped by an algorithm.

It's worth considering.

Thanks for joining us in this deep dive.

And a special thanks to you on behalf of the Last Minute Lecture Team.

Keep questioning your environment.

We'll see you next time.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers

Learning constitutes a relatively permanent shift in behavior arising from experience, distinguishing it from temporary fluctuations caused by fatigue or maturation. While many organisms operate primarily on genetic programming, humans demonstrate remarkable adaptability, allowing them to acquire novel behaviors in response to environmental challenges and opportunities. Psychological research has identified three fundamental categories of learning mechanisms: classical conditioning, operant conditioning, and observational learning. Classical conditioning involves an organism learning to associate two distinct stimuli, enabling prediction of future events. Ivan Pavlov's pioneering experiments with dogs demonstrated how neutral stimuli could become capable of triggering responses through repeated pairing with naturally stimulating events, establishing the foundation for behaviorism as an objective science of observable behavior. Key components of classical conditioning include the unconditioned stimulus and response, conditioned stimulus and response, along with processes of acquisition, extinction, spontaneous recovery, generalization, and discrimination. However, organisms do not simply form mechanical associations; research by Rescorla and Wagner revealed that animals learn based on the predictability of events, and biological predispositions like taste aversion show that survival-relevant associations form more readily than arbitrary ones. Operant conditioning operates on a different principle, where organisms learn associations between their own actions and the consequences those actions produce. B.F. Skinner's work expanded Thorndike's law of effect, demonstrating that reinforcement strengthens behavior while punishment weakens it. Critical distinctions exist between positive and negative reinforcement, between primary and conditioned reinforcers, and among various schedules of reinforcement that differentially affect learning speed and behavioral persistence. Cognitive and biological influences permeate operant conditioning as well, evident in phenomena such as latent learning, the development of cognitive maps, and instinctive drift. Observational learning, the third major category, enables individuals to acquire behaviors by watching and imitating models in their environment. Albert Bandura's research on modeling, particularly his Bobo doll experiments, established that exposure to observed behavior—whether prosocial or aggressive—significantly influences behavioral development. Mirror neurons provide a neural mechanism underlying this capacity for imitation and empathetic response.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥

Unit 6: Learning

Related Chapters