Chapter 9: Hearing: Physiology & Psychoacoustics

Search this chapter

Audio Overview

0:00 / 0:00

Autoplay next chapter

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replace the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

Hello and welcome back to the Deep Dive.

We are trying something a little different today, aren't we?

We are.

We're kicking off what we're calling the Last Minute Lecture Series.

It sounds a bit frantic when you put it that way, but honestly, we know the reality.

Sometimes you have an exam in 24 hours.

Sometimes you have a meeting where you need to sound like an expert on auditory processing by, you know, lunchtime.

Exactly.

We are here to bridge that gap.

That is the mission.

So for you, our listener, we're taking a single source text.

In this case, it's chapter nine, Hearing,

Physiology and Psychoacoustics from the Sensation and Perception sixth edition textbook.

And we are going to, well, we're going to dismantle it.

We are going to walk through it page by page, figure by figure in the exact order the author presents it.

We aren't just skimming the bold terms.

That's not the point.

We are going to actually explain the mechanics.

The goal is that by the end of this hour, you really understand how air pressure becomes a symphony inside your brain.

A lofty goal, but I think we can do it.

Let's jump right in.

The chapter opens with a really, um, a really striking observation.

It compares the eye and the ear, but not in the way you usually hear.

It talks about urlids.

Or more to the point, the lack of them.

It's such a simple physiological fact, but the implications are just, they're profound.

Right.

You have eyelids.

If the visual world becomes too much, or if you need to sleep, you close them.

You can shut out the signal.

You have control.

You cannot do that with your ears.

The ears are always open.

They're always listening.

It's a 24 hour surveillance system.

Precisely.

And the text makes this point very clearly.

Even when you are sound asleep, your auditory system is processing the environment.

It has to.

Why?

Has to?

From an evolutionary perspective.

Absolutely.

The text frames this as a survival necessity.

Vision is the spotlight.

It's fantastic, but it only tells you what is in front of you.

And only when there's light.

Hearing is a radar.

It tells you what is happening behind you, around corners, and crucially in the dark.

It is the primary warning system.

That twig snapping behind you in the night vision is useless there.

That distinction really sets the tone for the entire chapter.

And there's another sentiment in the intro that really stuck with me.

The text discusses the, um, the isolation of sensory loss.

Yeah.

That part is heavy.

It says essentially that blindness separates you from things, but deafness separates you from people.

And that's a concept you really have to sit with.

Yeah.

Think about how we connect.

It's through speech, through the subtle tone of voice, through shared laughter, through music.

The human connection.

It's all auditory.

A world without sound is not just quiet.

The text argues it is socially severed.

And physically it's a more dangerous world.

You lose that omnidirectional alarm.

If a car is coming up behind you or someone shouts a warning.

You are completely vulnerable without that sense.

So yes, hearing is our social tether and it's also our bodyguard.

So hearing is both our social tether and our bodyguard.

Now, before we get lost in the philosophy, let's look at the roadmap for this deep dive.

Chapter nine is actually the first of a trilogy in this book, right?

It is.

Yeah.

The authors, they very wisely realized that hearing is just too complex for one chapter.

So chapter nine, our focus today is the hardware.

The nuts and bolts.

Exactly.

Yeah.

It's the physics of sound, the anatomy of the ear, and the basic neural coding of loudness and pitch.

The absolute fundamentals.

And the complex stuff.

I'm talking music, speech, figuring out where a sound is in 3D space.

That is all reserved for chapters 10 and 11.

Today, we are strictly looking at the machinery.

You have to understand the instrument before you can appreciate the music.

Makes sense.

Okay.

Let's dive in.

Section 9 .2, the physics of sound.

We use the word sound constantly, but physically, what are we actually talking about?

At the most fundamental level, sound is vibration,

but we need to be a little more precise.

The text uses the analogy of a pebble dropped into a still pond, which we've all seen.

That's figure 9 .1 in the book.

You drop the rock and ripples move outward in perfect circles.

Right.

But there is a trap in that analogy.

A common misunderstanding.

Okay.

When you watch those ripples, it looks like the water itself is moving away from the center.

Like it's flowing to the edge of the pond.

Yeah, that's what it looks like.

But it's not.

If you put a little rubber duck on that pond, it doesn't surf to the edge.

It just bobs up and down in place.

The water molecules are moving vertically.

The wave of energy is what's moving horizontally.

Ah, okay.

So in the air, when I speak, I'm not literally shooting air molecules out of my mouth at you.

Thankfully, no.

That would be unpleasant.

Sound in air is a pressure wave.

My vocal cords vibrate and they push a little group of air molecules together.

That's called compression.

Right.

Then as my vocal cords move back, they create a space and the molecules spread apart.

That's called rarefaction.

So it's a pattern of high pressure, low pressure, high pressure, low pressure.

Precisely.

And that pattern travels across the room.

The individual air molecules just kind of wiggle back and forth.

But the wave itself, the pattern, moves.

And just like in the pond, that wave changes as it travels, right?

It does.

As that circle of ripples gets wider and wider, the energy is spread out over a larger and larger circumference.

So the height of the wave, what we call the intensity,

has to drop.

Which is why sound gets quieter with distance.

That's it.

The pattern of the wave, which we'll call frequency, stays the same.

But the energy, the intensity dissipates.

Speaking of traveling, let's talk about speed.

We all have a general sense of the speed of sound, maybe from movies.

It's roughly 340 meters per second in air.

But the text throws a curve ball when it talks about water.

It's so counterintuitive, isn't it?

Yeah.

You would think that water, being so much heavier and denser, would slow sound down.

Like trying to run through a swimming pool versus running on a track.

Exactly.

It should offer more resistance.

And yet sound actually travels faster in water.

Way faster.

About 1500 meters per second.

Okay, why?

What's the physics behind that?

It all comes down to how efficiently the molecules can bump into their neighbors.

Water is denser, yes, but it's also much less compressible than air.

Less spongy.

Very much so.

When you push on a water molecule, it shoves its neighbor almost instantly.

Air is spongy.

There's a little bit of a lag as you compress it before it pushes the next molecule.

So sound zips through water and it goes even faster through something solid, like steel.

But compared to the speed of light, it's still crawling.

Oh, it's a snail's pace.

Light is nearly a million times faster, which gives us that classic lightning and thunder rule the text mentions.

Right, the five second rule.

Yeah.

Because the light from the lightning flash gets to you effectively instantly.

The delay you experience is just how long it takes for the sound wave of the thunder to travel that distance.

Sand takes about five seconds to travel one mile.

So you see the flash, you start counting one, one thousand, two, one thousand, up to five.

Boom, that storm is one mile away.

And if it's 10 seconds, it's two miles away.

A very handy bit of physics.

Okay, let's get to the two big fundamental properties of these waves.

Figure 9 .2 in the text breaks sound down into amplitude and frequency.

We have to nail these definitions because really the rest of the chapter relies on them.

Absolutely.

Let's start with amplitude.

Physically, this is the intensity of the wave.

The size of the pressure change.

Exactly.

And in our pond analogy is how high the ripple rises above the still water level.

In the air, it's the magnitude of that compression and rarefaction.

And what is that to us perceptually?

It's loudness.

Simple as that.

High amplitude is a shout.

Low amplitude is a whisper.

The text uses a great visual analogy here to help keep it straight.

It does.

It says that amplitude in hearing is like brightness and vision.

A bright light versus a dim light.

Okay.

Amplitude is brightness.

That's a good mental hook.

Now, the second property,

frequency.

Frequency is all about the speed of the vibration.

It's how many times that wave pattern, the compression and rarefaction cycle, repeats itself in one second.

And we measure this in?

Hertz, abbreviated hertz.

So a sound that is 100 hertz means the wave is cycling 100 times every second.

And this corresponds to pitch.

Correct.

Low frequency is a low pitch.

Think of a tuba or the rumble of thunder.

High frequency is a high pitch of flute.

A squeaking door hinge.

So to complete the visual analogy from the book.

Frequency is like color and vision.

Red versus blue.

Low pitch versus high pitch.

So amplitude is brightness.

Frequency is color.

That's super helpful.

Now, what is the range we're working with here?

What are the actual limits of human hearing?

Figure 9 .3 shows this spectrum beautifully.

A healthy young human ear, and the young part is important, as we'll see later, can detect frequencies from about 20 hertz at the very bottom, the lowest rumble, up to about 20 ,000 hertz at the top.

But that's just our little window on the world.

The animal kingdom plays by very different rules.

Oh, very different.

And there is a general biological rule of thumb here.

Size matters.

Oh, so.

Larger animals tend to hear lower frequencies.

The text uses elephants as the prime example.

They can communicate using infrasound frequencies that are below our 20 hertz threshold.

We can't perceive them as sound.

And these waves can travel for miles.

For miles, sometimes through the ground itself.

They are incredibly long, powerful waves.

And on the other end of the spectrum,

the high frequencies.

Smaller animals usually hear higher.

Dogs, and especially bats, are masters of ultrasound frequencies way, way above our 20 ,000 hertz ceiling.

Which is why a silent dog thistle works.

Exactly.

To us, it's silent.

But to a dog, that whistle might be screaming at 30 ,000 hertz.

They hear it perfectly clearly.

You hear absolutely nothing.

Okay, now we have to tackle the elephant in the room.

Or maybe the math in the room.

The decibel.

I feel like this is where students' eyes usually start to glaze over.

Why can't we just measure loudness with a normal ruler?

It's a great question.

And the answer is because the ears is just too good.

That's the problem.

Too good.

The text explains that the range of pressures that the human ear can process is absurdly vast.

The difference in pressure between the faintest sound you can possibly detect and the loudest sound you can tolerate without immediate pain is a ratio of roughly one to a million.

Wow.

One to a million.

Yeah.

So if we used a linear scale, like a ruler.

The numbers would get ridiculous.

Ridiculous.

If the faintest sound was represented by one inch on your ruler, the loudest sound would be, I think the math works out to about 15 miles away.

You can't put that on a graph.

It's unreadable.

So we need to compress that range.

We need to compress it.

And that's what a logarithmic scale does.

It squashes that huge range into manageable numbers.

That scale is the decibel, or dB.

Okay.

Here's the thing that confuses everyone, including me sometimes.

Zero dB.

That sounds like it should mean zero sound.

Like absolute silence.

Like zero degrees Kelvin means zero heat.

But it doesn't.

No.

And this is a crucial point to understand.

Zero dB is not silence.

It is a reference point.

A baseline.

Scientists just had to agree on a starting line.

So they defined a specific, very faint pressure level, 0 .0002 dynes per square centimeter, if you want to be technical, which is roughly the average threshold of human hearing for a thousand hertz tone.

Zero dB just means the sound I am measuring is equal in pressure to that reference point.

So you can actually have negative decibels.

Absolutely.

Yeah.

If a sound is quieter than that reference threshold, it will have a negative dB value.

It's just like temperature on the Celsius scale.

Zero degrees isn't the absence of heat.

It's just the freezing point of water.

You can go colder.

And table 9 .1 in the chapter gives us some real world landmarks to anchor ourselves.

Right.

Which is really helpful.

Leaves rustling in the wind are about 20 dB.

A quiet residential neighborhood is maybe 40 dB.

Normal conversation, like we're having now, is around 60 dB.

And then it starts to ramp up.

A chainsaw is about 100 dB.

A jet takeoff from nearby is 120 dB, which is getting into the pain threshold.

But you have to remember, because it's logarithmic, a small jump in the dB number is a huge jump in physical energy.

Give us a sense of that scale.

How does it work?

The basic rule is that every 20 dB increase represents a tenfold increase in the actual sound pressure.

So going from, say, 20 dB to 40 dB?

That's not twice the pressure.

It's 10 times the pressure.

Yeah.

And going from 20 dB to 60 dB isn't three times the pressure.

It's 10 times 10.

It's 100 times the physical pressure.

So by the time you get to 100 dB chainsaw, you are talking about a massive, massive amount of physical energy compared to a 20 dB whisper.

That really changes how you think about hearing safety.

Okay, let's complicate things a little.

We've been talking about these perfect, smooth sine waves.

But real -world sounds, a guitar, a voice, a slamming door, they don't look like that on an oscilloscope.

Not at all.

A sine wave is what we call a pure tone.

It's a laboratory construct.

You almost never hear one in nature.

Nature is messy.

But here is where the math gets really beautiful.

The text introduces a concept called Fourier analysis.

This is a mathematical theorem from the 1800s that states that any complex sound, no matter how jagged or messy the wave looks, can be broken down into a combination of simple sine waves.

It's like unbaking a cake.

That is the perfect analogy.

You present the ear with a chocolate cake, which is a complex sound, and the ear's job, as we'll see when we get to the cochlea, is to break it back down into its ingredients.

The flour, the eggs, the sugar, the cocoa.

Each of those is a simple sine wave.

And we can visualize this breakdown with a graph called a spectrum.

Right.

A spectrum, shown in figure 9 .6, is just a graph that instead of showing the wave over time, it shows how much energy or amplitude is present at each individual frequency.

When we look at the spectrum of musical instruments or the human voice, we see a very specific pattern emerge.

It's called a harmonic spectrum.

Right.

Let's go with the text example.

Imagine plucking a guitar string.

It vibrates as a whole from end to end at a certain speed.

Let's say 100 hertz.

We call that the fundamental frequency.

And that's what determines the pitch we hear, the musical note.

That's the note.

The fundamental is the pitch.

But here's the cool part.

The string doesn't just vibrate as a whole.

At the same time, it's also vibrating in halves, in thirds, in quarters, all superimposed on each other.

And those smaller vibrations are the harmonics.

Yes.

And they follow a strict mathematical rule.

They are always integer multiples of the fundamental.

So if the fundamental is 100 hertz, the second harmonic will be exactly 200 hertz.

The third will be 300 hertz.

The fourth, 400 hertz, and so on up the scale.

This finally brings us to the concept of timbre.

I love this word.

It explains why a trumpet and a piano playing the exact same note can sound so completely different.

Yes.

They are playing the same note, which means they have the same fundamental frequency.

The reason they sound different is the recipe of their harmonics.

The recipe.

A trumpet puts a ton of energy into its higher harmonics, which gives it that bright, brassy, piercing bite.

A flute, on the other hand, puts almost all of its energy into the fundamental frequency with very weak harmonics.

That's why it sounds so smooth and pure.

So our brain analyzes this recipe of harmonics, the shape of the spectrum, to identify the instrument.

That's timbre.

It's the quality of a sound, distinct from its pitch or loudness.

Okay, fantastic.

We have the physics down.

We have our pressure wave ready to go.

Now let's follow that wave into the head.

Section 9 .3, the structure of the auditory system.

We are going on a journey from the outside in.

We start logically with the outer ear.

First stop, the piranha.

That's the crinkly, fleshy cartilage structure on the side of your head.

We tend to just think of it as a funnel, which it is, but it's actually much more clever than that.

How so?

The text notes that these external ear flaps are unique to mammals.

And because of all those specific nooks and crannies, it actually filters the sound in a subtle way.

That filtering changes depending on whether the sound is coming from above you, below you, or in front of you.

So it helps with sound localization.

It gives your brain crucial clues about the elevation of a sound source.

It's not just a simple funnel.

Then the wave travels down the ear canal.

The auditory metis, technically, it's about 25 millimeters long in an adult, It has two main jobs.

One is protection.

It keeps the delicate eardrums safely recessed inside the skull.

And the second job.

It acts as a resonance tube.

Just by the specific length and shape, it naturally amplifies or boosts frequencies in a specific range.

Let me guess.

The range used for speech.

You got it.

It specifically enhances frequencies between about 2 ,000 and 6 ,000 hertz.

Evolution is clever.

The anatomy is physically tuned to amplify the signal we need to hear most.

At the end of that canal, we finally hit the tympanic membrane.

The eardrum.

It's a very thin sheet of skin that seals off the canal completely.

It vibrates in sympathy with the incoming sound pressure wave.

This marks the border.

Everything before this was the air -filled outer ear.

Now we cross into the middle ear.

The middle ear.

This is a small air -filled cavern inside the skull, and it contains the ossicles.

The malleus, which means hammer.

The incus, or anvil.

And the stapes, the stirrup.

These are famously the three smallest bones in the human body.

And they form a little chain, right?

A mechanical chain.

The malleus is attached to the eardrum.

When the eardrum vibrates, it pushes the malleus, which pushes the incus, which pushes the stapes.

And the footplate of the stapes presses against a little membrane called the oval window.

Here is the big question.

Why?

Why does this Rube Goldberg machine of tiny bones even exist?

Why not just have the sound wave hit the inner ear directly?

This is maybe the single most important engineering problem the ear has to solve.

It's a problem called impedance mismatch.

And we really need to understand this to appreciate the ear.

Okay.

Impedance mismatch.

The inner ear, which we'll get to next, is not filled with air.

It's filled with fluid.

And fluid is much, much heavier and harder to move than air.

Exactly.

The textbook is a great example.

Imagine you are standing at the edge of a swimming pool, and your friend is underwater.

You shout their name.

Do they hear you?

Barely.

It's all muffled and distant.

Right.

Because when that sound wave traveling through the light, fluffy air hits the surface of the dense, heavy water.

99 .9 % of the sound energy bounces off.

It reflects, like a tennis ball hitting a concrete wall.

The air is just too weak to make the water move.

Precisely.

If our ears were just open holes leading directly to that fluid -filled inner ear, we would be effectively deaf to all but the very loudest sounds.

So the middle ear, the ossicles, are an amplifier.

They are an impedance matching device.

Their job is to overcome that reflection by amplifying the pressure.

And they do it in two brilliant ways.

The first is leverage.

Right.

The bones are hinged in a way that creates a lever action, much like using a crowbar to lift something heavy.

It provides a small amount of force amplification.

But the bigger factor, by far, is the second mechanism, the concentration of surface area.

This is the stiletto heel analogy from the book.

It's the perfect analogy.

Think about standing on soft ground wearing snowshoes.

A snowshoe has a very large surface area, so your body weight is distributed.

You don't sink.

Right.

Now imagine trying to stand on that same soft ground wearing a stiletto heel.

All of your body weight is focused onto that tiny sharp point.

It creates immense pressure.

You'd sink right in.

So in this analogy, the eardrum is the snowshoe.

Exactly.

The eardrum is large.

It collects a lot of sound energy from the air.

The oval window, the membrane where the stapes pushes in, is tiny.

It's about 18 times smaller.

Ah, so you're taking all the force collected by the big snowshoe and focusing it down onto the tiny stiletto heel of the stapes.

And that's how you do it.

That concentration of force creates enough pressure to punch the sound wave into the fluid of the inner ear and create a meaningful wave.

It's an incredible piece of biological engineering.

But the middle ear isn't just an amplifier.

It also has a built -in brake system.

The text calls it the acoustic reflex.

Yes.

There are two tiny muscles in the middle ear.

The tensor tympani and the stapedius.

When your brain detects a very loud, potentially damaging sound, it sends a signal to these muscles.

And they contract.

They contract and they pull on the ossicles, stiffening the entire chain.

This makes it much harder for the bones to vibrate, which reduces the amount of sound energy that gets through to the inner ear.

It's like putting a mute on the system to protect it.

It is a protective mechanism.

But, and this is big a pam, but it has a fatal flaw.

It's slow.

The reflex has a latency of about 200 milliseconds or one -fifth of a second.

So for a sudden sharp bang, like a gunshot or a firecracker next to your ear.

The damage is done long before the muscles have time to clamp down.

They're too slow to protect against compulsive sounds.

They're better for sustained loud noises, like at a rock concert.

The text also points out they activate for another reason.

Right.

They also activate when we talk, chew, or swallow.

They dampen our own self -generated noises, so we don't deafen ourselves with the sound of our own voice or a crunchy carrot.

Okay, we've crossed the middle ear.

The stapes is now pushing on the oval window.

We are officially entering the inner ear.

The cochlea.

The star of the show.

The name means snail in Greek, because that's what it looks like.

It's a spiral structure, shaped like a tiny snail shell, buried deep inside the temporal bone of the skull.

It is minuscule by the size of a baby pea.

The text asks us to perform a thought experiment.

Unroll this snail shell into a straight tube.

When we do that, we see it's divided into three parallel chambers.

Right.

It helps to visualize it as a long tube that's been divided into three floors or levels.

The top floor is the vestibular canal.

The bottom floor is the tympanic canal.

These two canals are actually connected at the very tip, the far end of the cochlea, at a point called the helicotrema.

They are both filled with a fluid called perilymph.

What about the middle floor?

The middle floor sandwiched in between is the middle canal or the scolomedia.

This is the VIP section.

It's a self -contained tube.

Separated by membranes.

Yes.

It's separated from the top canal by Reisner's membrane and from the bottom canal by the basilar membrane.

It's filled with a different special fluid called endolymph, which is very rich in potassium ions.

Sitting on the basilar membrane, so on the floor of this middle canal, is the main event, the organ of corti.

This is it.

This is the actual organ of hearing.

This is the structure that translates the fluid vibration into electricity that the brain can understand.

And it contains the crucial sensory cells, the hair cells.

Two types.

First, the inner hair cells.

They are arranged in a single neat row.

There are only about 3 ,500 of them per ear.

These are the workhorses.

They are responsible for sending about 90 % of the auditory signals to the brain.

And the other type.

The outer hair cells.

These are much more numerous, about 10 ,500.

And they are arranged in three rows, often in a V or W shape.

We used to think they were just less important backups, but their job is actually incredibly specialized and mechanical, which we will definitely get to.

And sticking out of the top of all these hair cells are the stereocilia.

The hairs themselves.

They are actually tiny, stiff bristles.

And they are arranged on each cell in rows of graduating height, from short to tall.

And resting gently on top of the longest of these bristles is a gelatinous, flap -like structure called the tectorial membrane.

Okay, the whole stage is set.

Let's run the machine.

The stapes pushes the oval window in.

What happens next?

That push creates a pressure wave in the fluid of the top chamber, the vestibular canal.

That wave travels down the canal, creating a bulge that pushes down on the flexible middle canal.

So the whole middle chamber gets squished down.

Which means the basilar membrane, the floor of that chamber, is forced to move downwards.

As the stapes pulls back out, the membrane moves back up.

The whole central partition is bouncing up and down with the frequency of the sound.

And there's another membrane, the round window, that helps with this.

Right.

Since fluid can't be compressed when the oval window pushes in, something has to bulge out to relieve the pressure.

That's the round window at the base of the tympanic canal.

So the floor of the organ of Corti is bouncing.

It's bouncing.

And because of the way the organ of Corti is hinged, when the basilar membrane moves up and down, the tectorial membrane that flap on top slides back and forth horizontally across the tops of the hair cells.

Creating a shearing force.

It literally bends the stereocilia back and forth, like wind blowing through a field of wheat.

This is the moment.

This is transduction.

This is where the physics becomes biology.

So how does bending a tiny hair create a neural signal?

The mechanism is just breathtakingly simple and beautifully mechanical.

The text explains, and figure 9 .11 shows this, that the tips of the stereocilia are connected to each other by tiny little filaments called tip links.

Like tiny ropes between them.

Exactly.

Imagine a tiny string connecting the very tip of a shorter hair to the side of the taller hair right behind it.

OK.

So when the shearing force bends the bundle of hairs toward the tallest one, the hair spread apart slightly, which pulls that little string tight.

And the string is attached to a gate.

A trap door.

Yeah.

Literally.

It's a mechanically gated ion channel.

The tip link physically yanks open a pore, a little trap door, on the tip of the stereocilia.

That's wild.

It's not a chemical reaction.

No.

And that's key.

In the eye, turning light into a signal is a relatively slow, complex chemical cascade.

In the ear, it's a direct physical pull.

The trap door opens.

Positively charged potassium ions, which are abundant in the endolymph, rush into the cell.

Which depolarizes the cell.

It depolarizes the cell,

which triggers voltage -gated calcium channels to open at the base of the cell.

And the cell then releases neurotransmitters to the auditory nerve fiber waiting just below it.

Signal sent.

And the text really highlights the incredible speed of this system.

It has to be fast.

To hear a high -frequency sound like 5 ,000 hertz, this entire process -bending, gate -opening ion flow neurotransmitter release has to happen 5 ,000 times per second.

A chemical reaction would be far too slow for that.

It has to be a direct physical linkage.

And the sensitivity is just, it's almost unbelievable.

Well, it's at the atomic scale.

The text mentions that one of the faintest sounds you can hear bends the stereocilia by just one nanometer.

How big is that?

That is about the diameter of a large atom.

The text is an amazing analogy.

If the bundle of stereocilia were the size of the Eisel Tower, the movement required to hear a whisper would be about one centimeter.

Just a tiny nudge.

That really puts the fragility of hearing into perspective.

A little too much movement, and you could imagine those delicate structures just snapping.

Which is exactly what happens in noise -induced hearing loss.

Okay, so the hair cell is firing.

Now we move to section four, coding.

How does the brain know what we are hearing?

Right.

The firing of the nerve is just a spike, a yes signal.

The brain needs to decode two crucial pieces of information from these signals,

amplitude, or loudness, and frequency, or pitch.

Let's start with amplitude.

That seems like the easier one.

It is.

It's pretty straightforward.

A louder sound creates a bigger pressure wave in the cochlear fluid.

This makes the basilar membrane bounce up and down with a greater excursion, a bigger movement.

Which means the stereocilia bend further.

They bend more, which pulls the trapdoors open wider, or keeps them open for a longer portion of the cycle.

More positive ions flow in, more neurotransmitter gets released, and the auditory nerve fiber connected to that hair cell fires faster.

So faster firing rate equals louder sound.

Simple.

To a point, yes.

More on that later.

But basically, that's the code for loudness.

Now frequency coding.

This is the tricky one.

This is where the magic happens.

The text calls the cochlea an acoustic prism.

I love that description.

It's so evocative.

A glass prism takes white light, which is a mix of all colors, and it physically splits it apart into a rainbow.

You see the red, the green, the blue, all laid out in space.

And the cochlea does the same for sound.

It takes a complex mound, which is a mix of all different frequencies, and it physically splits it apart.

It sorts the frequencies, laying them out along its length.

It separates the high pitch from the low pitch.

How?

This relies on the physical properties of the basilar membrane, right?

It's not the same all the way down.

No, it is a highly specialized structure, and figure 9 .12 is crucial for understanding this.

Jorg von Beckezi won a Nobel Prize for figuring this out.

What did he find?

He found that the basilar membrane systematically changes its physical properties from one end to the other.

At the base of the cochlea, near the oval window where the sound comes in, the membrane is narrow, it's thick, and it's very stiff.

Okay.

As you move down towards the other end, the apex, the membrane gradually gets wider, thinner, and much more flexible or floppy.

So think of it like a set of guitar strings.

From the thin, tight, high E string to the thick, floppy, low E string.

That's a great way to think about it.

The stiff bass resonates with, and is moved most by, high frequencies.

The floppy apex resonates with, and is moved most by, low frequencies.

So when a sound wave enters the cochlea, it creates this wave in the fluid that travels down the spiral.

Exactly.

It creates a traveling wave.

The wave moves along the membrane, and the membrane starts to vibrate.

But the wave doesn't just go on forever.

It travels until it hits the specific spot on the membrane that is tuned to its frequency.

And then what?

At that point, the wave transfers all its energy to the membrane, creating a peak of displacement, a point of maximum vibration, and then the wave quickly dissipates and dies out.

So a high -pitched sound?

A high note, say 10 ,000 hertz, will create a traveling wave that peaks very early near the stiff bass, and then it's gone.

It never reaches the apex.

And a low bass note?

A low rumble, maybe 100 hertz, creates a wave that travels all the way down the entire length of the cochlea before it finally peaks at the floppy apex.

So the brain just has to look at where along the cochlea the signal is coming from.

Precisely.

This is the famous place code for frequency.

If the nerve fibers connected to the base of the cochlea are firing like crazy, the brain interprets that as, I hear a high pitch.

If the fibers at the apex are firing, the brain says, I hear a low pitch.

The cochlea has turned frequency into a physical location.

But wait a minute.

The text mentions a complication.

It says that this passive resonance, just the stiffness of the membrane alone, isn't actually sharp enough.

The tuning would be sloppy.

The peak of the wave would be too broad.

This is where the story gets really amazing.

This is a revolutionary discovery.

For a long time, we thought the ear was a passive device, like a microphone, just receiving sound.

It turns out the ear is an active living amplifier.

And this is where the outer hair cells come back into the story.

The ones we said had a special mechanical job?

Yes.

The outer hair cells have a property that is almost unique in the body.

It's called electromotility.

Electromotility.

It moves the electricity.

It means they physically move.

When an outer hair cell is stimulated by an electrical change, it doesn't just release neurotransmitters.

It physically elongates and contracts.

It gets longer and shorter like a tiny muscle.

They dance.

They pump up and down in time with the sound wave.

Why on earth would they do that?

They do it to amplify the movement of the basilar membrane, but only at that one specific spot.

It's like pushing a child on a swing.

If you push it just the right time, you can make the swing go much higher.

The outer hair cells are pushing the basilar membrane at its resonant frequency.

So they sharpen the peak of the traveling wave.

They make it incredibly sharp and precise.

Figure 9 .14 in the book shows the tuning curve of the membrane, with and without healthy outer hair cells.

Without them, the tuning is a blunt, rounded hill.

It's sloppy.

With them, the tuning curve becomes a sharp, precise needle.

They provide about 50 dB of amplification.

So the ear is mechanically zooming in on the frequency it's hearing.

It's a cochlear amplifier.

It is.

And here is a crazy side effect.

Because these cells are physically moving and pumping energy back into the system, the ear actually makes its own sound.

A speaker?

You're kidding.

No.

They're called otoacoustic emissions.

If you put a tiny, very sensitive microphone into a healthy ear canal in a quiet room, you can actually record the sound that the ear itself is producing.

That is incredible.

It's now used as a standard hearing test for newborn babies.

They can't tell you if they can hear, so you play a little click sound into their ear.

If the cochlea is healthy, the outer hair cells will amplify that click and send a tiny echo back out.

If the microphone picks up the echo, you know the cochlea is working.

If it's silent, there might be hearing loss.

Wow.

Okay, my mind is a little blown.

Let's move on to section five, the auditory nerve.

So we have our precisely coded signal now leaving the cochlea.

We are now officially in the domain of neuroscience.

We're dealing with neurons.

Figure 9 .33 shows what's called a threshold tuning curve for a single auditory nerve fiber.

And what is that showing us?

It's just a graph that confirms what we've been talking about.

It plots how loud a sound needs to be at different frequencies to make that one specific neuron fire.

And you see a sharp V shape.

There's one frequency where the neuron is most sensitive.

It requires the least amount of energy to fire.

That's its characteristic frequency, or CF.

Exactly.

It's the pitch that that specific neuron likes the best.

And that CF corresponds to its physical location on the basilar membrane.

But the nervous system isn't perfect.

The coding runs into some problems.

The first one the book discusses is rate saturation.

This is a fundamental limitation of biology.

A neuron can't fire infinitely fast.

After it fires an action potential, it needs a moment to recharge the refractory period.

This limits its maximum firing rate to maybe 500 or 1000 spikes per second.

So what happens when a sound gets really, really loud?

The neuron just maxes out.

It hits its ceiling.

It saturates.

It's firing as fast as it possibly can.

And what's the problem with that?

When it saturates, it gets sloppy.

Figure 9 .16 shows this.

At low volumes, the neuron is very picky, only responding to its characteristic frequency.

But as the sound gets louder and the neuron saturates, it starts responding to a much broader range of frequencies it would normally ignore.

Its tuning curve widens out.

So how can we tell the difference between different notes at a loud rock concert if all our neurons are maxed out and responding to everything?

The brain is clever.

It uses a team approach.

The text explains that the auditory nerve is not one type of fiber.

It has at least two.

High spontaneous fibers and low spontaneous fibers.

You can think of them like the rods and cones in your eye.

That's a great parallel.

The high spontaneous fibers are like the rods.

They are incredibly sensitive.

They fire even in response to random background noise.

They're great for hearing whispers in a quiet room.

But because they're so sensitive, they saturate instantly when a sound gets even moderately loud.

So they're useless at the rock concert.

Useless.

That's where the low spontaneous fibers come in.

They are like the cones.

They are stubborn.

They completely ignore quiet sounds.

They only start firing when the sound is already quite loud.

But because they wait to start firing.

They have a much higher ceiling before they saturate.

They maintain their sharp frequency selectivity even at very high volumes.

So the brain gets the full picture by listening to both the sensitive but easily saturated fibers and the stubborn but robust fibers at the same time.

That makes sense.

Now, we talked about the place code for pitch where the wave peaks.

But the book says there's a problem with the place code for very low frequencies.

Right.

Down at the very floppy apex of the cochlea, the tuning is a bit broad.

The peaks for, say, 100 hertz and 150 hertz are very wide and overlapping.

It's hard to tell them apart just based on location.

So the brain uses a second complementary strategy.

It's called temporal coding.

Temporal, meaning based on timing.

Instead of looking where the neurons are firing, the brain looks at when they are firing.

This involves a phenomenon called phase locking.

Yes.

Figure 9 .18 shows this perfectly.

For low frequency sounds,

auditory nerve neurons tend to fire their action potentials at a specific consistent point in the sound wave cycle.

Usually it's at the peak of the pressure wave.

So they are firing in sync with the wave?

They are locked to the phase of the wave.

Yeah.

If a sound is vibrating at 100 hertz, the neuron will fire 100 times a second in perfect rhythm with the sound.

The brain doesn't need to look at where the neuron is coming from.

It can just count the rhythm of the spikes.

100 pulses per second must be 100 hertz sound.

But wait, you just told me a single neuron can't fire much faster than 1000 hertz.

So how can we use temporal coding for a sound that's, say, 3000 hertz?

This is where the volley principle comes in.

This is brilliant.

The text uses an analogy.

Imagine a line of old -timey soldiers with muskets.

A single soldier shoots, but it takes him a long time to reload.

Can't shoot very fast on his own.

Okay.

But if you have a group of five soldiers and they take turns firing, one, two, three, four, five, and by then the first one is reloaded, the group as a whole can maintain a very rapid continuous barrage of fire.

So different neurons take turns firing on different peaks of the sound wave.

Exactly.

Neuron A fires on the first wave peak.

Neuron B fires on the second.

Neuron C on the third.

No single neuron has to fire on every peak.

But when the brain sums up the activity of the entire bundle of nerve fibers, the overall pattern of firing perfectly matches the high frequency of the sound.

It's teamwork.

It's neural teamwork.

And this volley principle allows the brain to use temporal coding for frequencies up to about 4000 or 5000 hertz.

And above that, what happens for really high -pitched sounds?

Above 5000 hertz, the wave is just too fast.

Even for a group, the timing gets jittery and unreliable.

So for the very highest pitches, the brain relies entirely on the place code for the base of the cochlea.

Let's quickly trace the signal's path from the nerve to the brain.

This is section six, auditory brain instructions.

It's a bit more of a relay race than the visual pathway, with more stops along the way.

So first stop from the auditory nerve.

The cochlear nucleus, which is in the brain stem.

This is the first synapse.

A lot of basic processing starts here.

Then the superior olive.

And this one's really important.

This is the first place in the brain where inputs from the left ear and the right ear converge and meet.

And that convergence is crucial for...

For figuring out where a sound is in space.

By comparing the tiny timing and intensity differences between what the two ears hear, the superior olive calculates the location of the sound source.

After that, it goes up to the inferior colliculus in the midbrain.

Then to the medial geniculate nucleus, or MGN, in the thalamus.

Just like in vision, the thalamus is the main sensory relay station.

And finally, from the thalamus, it projects to the cortex.

To the primary auditory cortex called A1, which is located in the temporal lobe, tucked inside a deep fold called the Sylvian fissure.

And the text makes a big point that the tonotopic organization is preserved the entire way up this pathway.

It's amazing.

That beautiful map of frequencies we saw on the basilar membrane, low frequencies at one end, high frequencies at the other.

That organization is maintained in the nerve, in the brain stem nuclei, in the thalamus, and is laid out right across the surface of the primary auditory cortex.

If you were to record from neurons in A1, you'd find a literal piano keyboard of frequencies mapped onto the brain.

Okay, that was a whirlwind tour of the anatomy.

Let's get to section 7.

Psychoacoustics.

This is where we leave the biology and ask, what do we actually perceive?

Psychoacoustics is the study of the relationship between the physical signal, like frequency and hertz, and the psychological experience, like the perception of pitch.

They are always a one -to -one match.

For example, let's talk about audibility thresholds.

Right.

We said earlier that humans can hear from 20 hertz to 20 ,000 hertz, but that doesn't mean we hear all those frequencies equally well.

No.

Figure 9 .22 shows the classic audibility curve, and it's a deep U shape.

It is.

It shows we are incredibly sensitive to sounds in the middle range, specifically between about 2 ,000 and 6 ,000 hertz.

We can hear a pin drop in that range.

The speech range.

The speech range again.

But for very low bass notes or very high treble sounds, the sound has to be physically much, much louder, have a much higher decibel level for us to even detect that it's there.

This directly explains why older stereo systems used to have a loudness button.

Exactly.

When you listen to music at a low volume, the first thing to disappear perceptually is the bass, because our ears are naturally bad at hearing quiet bass sounds.

The loudness button was just an equalizer that boosted the low and high frequencies at low volumes to compensate for our ears' natural deficiency.

And this concept is mapped out in what the book calls equal loudness curves.

Right.

Those curves show you that a hundred hertz tone has to be physically more intense, have more decibels to sound just as loud as a thousand hertz tone.

Our perception of loudness depends heavily on the frequency of the sound.

Let's touch on another psychoacoustic concept, masking.

Masking is a really powerful tool that researchers used to figure out how the ear processes different frequencies.

The basic idea is simple.

I play a pure tone that you're trying to hear, and at the same time I play some noise to try and hide it or mask it.

And what did they find when they did this?

They found something that proves the cochlear works like a set of filters.

If you're trying to hide a thousand hertz tone, only the noise that is also right around a thousand hertz is effective at masking it.

You can blast low bass noise or high treble noise at the same time, and it won't really affect your ability to hear that one thousand hertz tone.

This proves the ear really does act like a set of independent channels or filters, the critical bandwidth.

Exactly.

The noise has to fall within the target frequency's channel to interfere with it.

But there is a fascinating asymmetry to this, which the book calls the upward spread of masking.

Okay, what's that?

This brings us all the way back to the physics of the traveling wave on the basilar membrane.

Remember, a low frequency wave has to travel all the way to the apex of the cochlea.

So it sweeps across the entire length of the membrane on its way.

It disturbs everything.

It creates motion all along the membrane.

So a loud low frequency sound like the rumble of a truck outside is very, very effective at masking higher frequency sounds like speech.

But the reverse isn't true.

The reverse is not true.

A high frequency wave peaks at the base of the cochlea and dies out immediately.

It never reaches the low frequency part of the membrane at the apex.

So a high pitch noise like a hiss does not mask a low pitch tone effectively at all.

The interference mostly goes one way from low to high.

That makes perfect mechanical sense based on what we learned.

Okay, finally, let's wrap up with section eight, hearing loss.

The book says about 30 million Americans are dealing with this to some degree.

And we can categorize most hearing loss into two main buckets,

conductive and sensorino.

Conductive sounds like a mechanical problem.

It is.

It's any problem in the outer or middle ear that prevents sound from being conducted properly to the cochlea.

This could be as simple as earwax completely blocking the canal.

Or a middle ear infection.

Right, otitis media, which is very common in children, where the middle ear fills up with fluid instead of air.

Or a condition called otis sclerosis, where there's abnormal bone growth that fuses the stapes bone in place so it can't move.

The good news is these are often fixable.

Often, yes, with medicine or surgery.

Sensorineural is the more serious one.

This is damage inside the cochlea or to the auditory nerve itself.

And it's generally permanent.

Sensory loss is hair cell damage.

From what?

Loud noise exposure is the big one.

But also certain antibiotics, some cancer chemotherapy drugs,

and just aging can kill the hair cells.

And once they are dead, in mammals, they do not regenerate.

Neural loss is when the auditory nerve fibers themselves die off.

The text details a very common type of this called presbycusis.

Which is just the technical term for age -related hearing loss.

And it follows a very specific predictable pattern.

You lose the high frequencies first.

If you think about the cochlea's layout,

the high frequency area is at the base, right to the front door, where every sound wave enters.

It gets the most wear and tear over a lifetime.

The low frequency apex is protected at the far end.

This is why older adults often struggle to hear women's voices or children's voices, which are higher pitched.

Exactly.

And they struggle with the high frequency consonant sounds in speech, like S and T and F, which makes it sound like everyone is mumbling.

And the treatments, let's talk about hearing aids.

Modern hearing aids are incredibly sophisticated computers.

They are not just simple amplifiers that turn up the volume on everything.

No.

Because many people with hearing loss have a condition called recruitment, where quiet sounds are inaudible, but loud sounds are still perceived as painfully loud.

So a simple amplifier would be unbearable.

So what do they do?

They use complex compression.

They provide a lot of amplification for soft sounds to make them audible, but provide very little or no amplification for loud sounds.

They can also shed frequencies from dead regions of the cochlea to regions that are still working.

And for profound deafness, where hearing aids aren't enough, there are cochlear implants.

Yes.

Figure 9 .29 shows one.

This is a true marvel.

It's a thin wire with an array of electrodes that is surgically threaded into the spiral of the cochlea itself.

And it bypasses the dead hair cells.

It completely bypasses them and stimulates the auditory nerve fibers directly with electrical pulses.

For people who get them, it can be life -changing, especially for understanding speech.

The text does note, though, that because it only has a limited number of electrodes, maybe 22 compared to thousands of hair cells, the fidelity is low.

Music, especially, often sounds distorted or like garbage.

There is one last really interesting and recent discovery mentioned, hidden hearing loss.

This is a bit of a medical detective story.

For years, people, especially younger veterans or musicians, would come to the doctor complaining that they couldn't hear in a noisy restaurant.

They had trouble with the cocktail party problem.

But when they took a standard hearing test, they would pass with flying colors.

Their audiogram, which tests the ability to detect quiet tones, would be perfectly normal.

Their hair cells were fine.

So what was actually broken?

Researchers discovered it was the synapse, the connection point.

It turns out that exposure to loud noise, even if it doesn't kill the hair cell outright, can destroy the delicate synapse between the inner hair cell and the auditory nerve fiber.

So the information is being generated by the hair cell, but it can't cross the bridge to get to the brain.

Exactly.

And because standard hearing tests only check if the hair cell is working, this loss remains hidden.

It reveals the need for more sophisticated tests that can assess our hearing in noisy real -world situations.

I want to end on the sidebar in the book about manatees.

It feels like it combines everything we've learned into one perfect real -world example.

It really does.

So for years, there was a big problem in Florida with manatees getting hit and killed by boats.

The common assumption among biologists was just that manatees were sort of dumb and had bad hearing.

But that wasn't true.

Not at all.

When they were finally tested properly, it turns out manatees have great hearing,

but only for high frequencies.

Their hearing is terrible for low frequencies.

And boat engines produce low -frequency rumbles.

Exactly.

So the manatee literally might not be able to hear the boat's engine.

And now adding what we learned about the upward spread of masking, the general background noise of the water might be enough to completely mask the faint low thrum of a distant boat engine.

They can't hear the boat until it's right on top of them.

And the tragic irony of the solution they came up with.

The initial laws were to create slow -speed zones for boats in manatee areas.

It makes sense, right?

Give them more time to get away.

But a slower moving engine produces an even lower pitch sound.

Making the boats even more invisible or inaudible to the manatees.

Potentially, yes.

It's a perfect example of why you have to understand the specific physiology and psychoacoustics of a system.

The tech suggests the boats might need high -frequency pingers like on a submarine to alert them.

It's a perfect case study of why understanding all this matters in the real world.

It really is.

Hearing is just this incredible mechanical marvel.

We take subtle changes in air pressure.

We turn that into bone leverage, which becomes fluid hydraulics, which creates a shearing force, which opens a microscopic trap door.

And that finally becomes a neural spark that the brain can interpret as music or language.

It's a fragile,

absolutely beautiful system.

Well, that is Chapter 9 from Physics to Perception.

We really hope this deep dive helps you ace that exam.

Or maybe just helps you appreciate the incredible machinery inside your head a little more next time you listen to your favorite song.

Thanks for studying with us.

And maybe turn down the volume on your earbuds.

Just a little.

We'll see you in the next deep dive.

Goodbye.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers

Sound waves represent pressure variations propagating through a medium, characterized fundamentally by frequency, which determines perceived pitch, and amplitude, which correlates with perceived loudness. The auditory system begins its sensory journey when the pinna and ear canal collect these vibrations and direct them toward the tympanic membrane, initiating a cascade of mechanical transformations. Within the middle ear, three interconnected ossicles—the malleus, incus, and stapes—function as a lever system that amplifies incoming vibrations by concentrating the force from the larger tympanic membrane onto the considerably smaller oval window, thereby efficiently coupling mechanical energy into the fluid-filled cochlea. Once inside this spiral structure, sound-induced pressure changes generate traveling waves that propagate along the basilar membrane in a tonotopically organized manner, with different frequencies reaching peak displacement at specific locations. The organ of Corti, sitting atop the basilar membrane, contains the sensory receptors responsible for converting mechanical displacement into neural signals. Inner hair cells bend their stereocilia in response to basilar membrane movement, and molecular structures called tip links mechanically open ion channels, allowing the electrochemical transduction process to occur. Outer hair cells contribute an active mechanical amplification mechanism called electromotility, which enhances the sharpness of frequency tuning and increases overall auditory sensitivity. The auditory nerve encodes frequency information through two complementary mechanisms: place coding, which reflects the specific tonotopic location activated on the basilar membrane, and temporal coding, which relies on phase locking and the volley principle to represent low-frequency stimuli based on the timing of neural firing patterns. Central auditory processing involves a hierarchical neural pathway ascending through the cochlear nucleus, the superior olive where sound localization computations occur, the inferior colliculus, and the medial geniculate nucleus before reaching the primary auditory cortex. Psychoacoustic measurements characterize hearing capabilities through audibility thresholds and equal-loudness contours, while additional phenomena such as masking and critical bandwidth reveal how the auditory system separates and prioritizes competing sound sources. Understanding these normal mechanisms provides context for examining various hearing pathologies, including conductive losses from middle ear dysfunction, age-related presbycusis, sensorineural damage from inner ear injury, and hidden hearing loss affecting neural processing despite normal audiometric thresholds, with treatment options ranging from hearing aid amplification to cochlear implant restoration of auditory signals.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥

Chapter 9: Hearing: Physiology & Psychoacoustics

Related Chapters