Chapter 53: The Sense of Hearing

Search this chapter

Audio Overview

0:00 / 0:00

Autoplay next chapter

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replaced the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

You know, usually when we talk about a medical diagnosis, there's this expectation of structural precision.

You fall, you break your arm, the x -ray shows that jagged white line, and the doctor just points and says, well, there it is.

Yeah, it feels very binary, right?

And entirely mechanical.

Exactly.

Broken or not broken, it's clean.

But then you step into the world of human senses and specifically hearing.

And suddenly we aren't just looking at structural mechanics anymore.

No, not at all.

We're looking at this incredible, almost

magical translation of invisible air pressure into a conscious thought.

So welcome to this deep dive.

Our mission today is a journey through Guyton and Hall textbook of medical physiology,

specifically the chapter on the sense of hearing.

Chapter 53 for those following along.

Right.

And for everyone using this as their last minute lecture prep for a physiology exam, we're doing this just for you.

We're going to trace a single sound wave from the exact moment it hits your outer ear to the very millisecond your brain interprets what it means.

And we're tackling this chronologically starting with the anatomy because, well, in human physiology, physical anatomy perfectly dictates function, right?

And that function requires regulation.

And finally, all of that leads to our integrated perception of sound.

It is this continuous logical chain where every single microscopic part has a vital job.

Okay, let's unpack this.

We have to start with the physical problem of getting sound into the head.

So when sound waves travel down your ear canal, they hit the tympanic membrane,

right?

But the ultimate destination for that sound is the inner ear, the cochlea, which is completely filled with fluid.

So we are transitioning sound from air into fluid, which is an enormous physics problem, honestly, because of the inertia.

Exactly.

Fluid has significantly more inertia than air.

It really resists movement.

If an airborne sound wave just hit a fluid boundary directly, about 99 .9 % of the acoustic energy would just bounce right off the surface.

Oh, wow.

I mean, it's like trying to swing a baseball bat underwater.

Have you ever stood in the shallow end of a pool and tried to swing your arms fast?

Oh, it's exhausting.

The water's inertia just fights you.

Moving fluid takes a massive amount of force compared to moving air.

So our bodies evolve this brilliant mechanical workaround called impedance matching, which involves the middle ear bones.

Yeah.

Attached to the back of the eardrum is a chain of three tiny bones,

the malleus, the incus, and the stapes.

And those bones form a lever system, basically bridging the eardrum to the oval window.

And that oval window is the entrance to the fluid filled inner ear.

So the malleus is on the eardrum.

Right.

Physically attached to it.

The incus sits in the middle, and then the flat faceplate of the stapes pushes right up against that oval window.

You know, a lot of people, they assume this chain of bones is designed to increase the distance of the vibration, like to get a bigger swing.

I do assume that, yeah.

But it's actually physically impossible to gain both distance and force from a lever system like this.

The stapes actually moves a shorter distance than the malleus, only about three fourths as much.

Wait, a shorter distance?

Yeah.

What the lever system actually does is decrease distance to increase the force by about 1 .3 times.

But I mean, 1 .3 times isn't nearly enough to swing that bat underwater, right?

You can't overcome the inertia of cochlear fluid with a tiny 30 % boost.

You absolutely cannot.

The real power comes from a drastic difference in surface area.

Okay.

How so?

Well, if you visualize the anatomy, the surface area of the eardrum is about 55 square millimeters.

But the faceplate of the stapes, which is doing the actual pushing against the fluid, is microscopic.

It's only 3 .2 square millimeters.

Oh, so it's taking all the energy collected across a massive drum and just focusing it down onto a tiny pinhead.

Exactly.

It's the difference between stepping on someone's foot with a wide snowshoe versus like a sharp stiletto heel.

That is a perfect analogy.

It creates a 17 -fold concentration of force.

And when you multiply that 17 -fold area advantage by the 1 .3 -fold lever boost from the bones - You get the 22.

Right.

You get about 22 times as much total force exerted on the fluid of the cochlea compared to what hit the eardrum.

That's incredible.

It perfectly matches the impedance of the fluid, which allows up to 75 % of the sound energy to pass through from normal speaking frequencies.

Okay.

But what happens when that system works too well?

Say a bomb goes off or someone drops a heavy pan right next to your head.

How do we keep that 22 times force multiplier from just blowing out our inner ear?

Well, that is handled by the attenuation reflex.

The attenuation reflex.

Right.

After a latent period of about 40 to 80 milliseconds,

a loud noise triggers an automatic central nervous system reflex.

You have two tiny muscles attached to this bone chain,

the tensor tympani and the stapedius.

Okay.

When they're triggered, the tensor tympani pulls the malleus inward and the stapedius pulls the stapes outward.

Wait.

So they are actively pulling the bone chain in opposite directions.

Exactly.

By doing so, the entire acicular system becomes stiff and rigid.

It practically locks up.

Yeah.

Heavily dampening the physical vibration before it reaches the fluid.

So this is basically built in noise canceling headphones.

Functionally, yes.

It reduces the transmission of low frequency sounds by 30 to 40 decibels, which is massive.

It protects your delicate inner ear.

But what's really fascinating here is that it also masks low frequency background noise in your environment.

Oh, really?

Yeah.

If you are in a loud rumbling room, this reflex stiffens your ear bones, tuning out the low hum.

So you can actually hear higher frequency conversations above a thousand cycles per second.

I read that it even decreases your hearing sensitivity to your own speech.

Like your brain actually sends a signal to these tiny muscles when you start talking.

So the sheer volume of your own voice vibrating through your head doesn't deafen you.

It's true.

It's a brilliant system.

That is wild.

Now, before we move on to the inner ear, we should probably quickly note that sound can bypass as bone mechanics entirely, right?

Yes.

Through bone conduction.

Right.

You can have conduction directly through the skull into the cochlear fluid.

But normal sounds in the air just don't have enough energy for that.

You'd need like a tuning fork or an electronic vibrator pressed right against your skull.

Which really just highlights how vital that middle ear amplifier is for our daily existence.

Definitely.

Okay.

So the mechanical forces hit the oval window.

We enter the inner ear, the cochlea.

Right.

For everyone listening, picture a coiled tube looking exactly like a tiny snail shell.

If we were to unroll it and look inside, it's divided into three parallel fluid -filled tubes.

The skeletis tibuli on top, the scala media in the middle, and the scala tympani on the bottom.

And the critical structure here is the partition separating the scala media from the scala tympani.

The basilar membrane.

Exactly.

It contains 20 ,000 to 30 ,000 tiny stiff fibers that project outward.

They're free at one end, which allows them to vibrate like the reeds of a harmonica.

And the physical layout of these fibers is completely counterintuitive.

Yeah.

You would think the fibers right at the entrance, near the oval window, would be long and floppy to catch the sound.

Right.

But they aren't.

They're the exact opposite.

They really are.

At the base of the cochlea, where the sound enters, the basilar fibers are short and incredibly stiff.

As you travel up the coil toward the far end, which is called the apex, or helicotrema, the fibers get progressively longer, and their diameter actually decreases.

They become more than a hundred times less stiff.

So short and rigid at the entrance, long and limber at the end.

Yes.

Therefore, high -frequency sounds, which vibrate very quickly, they resonate perfectly with those short, stiff fibers at the base.

And the low rooms.

Low -frequency sounds travel much further down the tube and resonate with the long limber fibers near the apex.

When a sound wave enters the fluid, it travels along this basilar membrane, building an amplitude.

It gets stronger and stronger.

Right.

Until it hits the exact physical spot on the membrane that shares its natural resonant frequency.

Okay.

Here is the part that completely blew my mind.

At that exact resonant point, the membrane vibrates back and forth with such ease that all the energy in the wave is completely absorbed.

The physical wave just stops dead.

It just dies at that exact spot and travels no further down the tube.

Is that really how it works?

That is exactly how it works.

This is the absolute foundation for how we process pitch.

High frequencies die early, and low frequencies travel all the way to the end.

The physical properties of the anatomy have literally sorted the sound by pitch.

Okay.

So the physical wave hit a dead end, and a specific spot on the membrane is vibrating.

But our brain doesn't understand physical vibrations.

No, it only speaks electricity.

Right.

So how do we translate a bouncing membrane into a nerve signal?

Well, we have to look at the surface of the basilar membrane at a structure called the organ of Corti.

This is the true sensory receptor of the ear.

It's covered in specialized receptor cells called hair cells.

There are about 3 ,500 inner hair cells and 12 ,000 outer hair cells.

And above these hair cells, hovers another structure called the tectorial membrane.

And the hair cells have these tiny microscopic hairs called stereocilia, sticking straight up, embedding themselves into the gel -like coating of that tectorial membrane above them.

Right.

So when a sound wave causes the basilar membrane to bounce up and down, the entire structure rocks back and forth.

Like a hinge.

Exactly.

Because the hair cells and the tectorial membrane are pivoting on slightly different anatomical hinges,

this rocking motion forces the tiny hairs to shear back and forth against the membrane above them.

So they physically bend.

They physically bend.

And that mechanical bending pulls open microscopic trap -doors cation channels on the very tips of the stereocilia.

Yep.

And here's where it gets really interesting.

Because if you're studying physiology, you know that in almost every other cell in the human body, sodium is the ion that rushes in to depolarize the cell.

But here, the textbook specifically states it is potassium.

Yes.

This is a brilliant evolutionary adaptation known as the endococcal potential.

The endococcal potential.

Let's unpack that.

So those hair cells are bathing in the fluid of the scala media, which is called endolymph.

Now, unlike the perilymph in the other chambers, which is basically just cerebrospinal fluid endolymph, is actively secreted by a highly vascular area called the stria vascularis.

Okay.

And the stria vascularis is constantly pumping potassium into this space.

Creating a completely unique fluid environment.

Right.

It creates a massive electrical battery.

The endolymph is incredibly positive, sitting at roughly plus 80 millivolts.

Meanwhile, the inside of the hair cell itself is very negative, about negative 70 millivolts.

Wow.

So across the very top of that hair cell, right where those tiny trap -doors are, you have a staggering electrical gradient of 150 millivolts.

It's huge.

That enormous electrical pressure is just waiting to explode into the cell.

It makes the hair cells insanely sensitive to even the slightest atomic level bending.

So the moment the hairs bend, the trap doors open, potassium violently floods in, and the cell depolarizes.

This instantly opens voltage sensitive calcium channels, calcium rushes in, and the cell releases the neurotransmitter glutamate to the auditory nerve endings.

Boom.

The physical force of a sound wave has officially become a an electrical nerve signal.

Exactly.

But that raises an important question.

How does the brain

decode this electrical storm?

How do I know if I'm hearing a high -pitched scream versus a low -bass drum?

Well, for frequency or pitch, the auditory system primarily relies on the place principle.

The place principle.

Right.

Because we know the physical wave stops dead at a very specific spot on the basilar membrane.

The brain simply knows that if nerve fiber number 4000 is firing, that corresponds to a highly specific frequency.

So the place in the membrane tells the brain the pitch.

Exactly.

But there's a limitation to that, isn't there?

Because for extremely low frequencies, under 200 cycles per second, the wave reaches the absolute end of the cochlea.

It all hits the exact same spot.

So the brain can't use the place principle anymore.

That's absolutely right.

For those deep booming low frequencies,

the ear shifts to the volley or frequency principle.

Okay.

How does that work?

Instead of relying on location, the nerve fibers actually fire in direct synchrony with the sound waves themselves.

So if a 100 cycle per second tone enters the ear, the nerve fires exactly 100 impulses per second.

The brain reads the firing rate directly as the pitch.

Okay.

So pitch makes complete sense.

It's either about where the wave stops or the direct firing rhythm.

But volume has to be different, right?

Well, because you can have a deafening low bass note or whisper quiet low bass note, and both are vibrating the exact same spot at the end of the cochlea.

How does the ear translate loudness?

The brain determines loudness in three ways.

First, a louder sound simply vibrates the basilar membrane with a higher physical amplitude, which causes the hair cells to shear harder and fire at a more rapid rate.

Okay.

That makes sense.

Second is spatial summation.

As that wave gets bigger and more violent, it recruits more hair cells on the fringes of the resonant point.

So the brain suddenly receives signals from dozens of neighboring nerve fibers instead of just a few.

Got it.

And the third way.

The third way involves those outer hair cells we mentioned earlier.

You have mostly outer hair cells, but they actually don't get stimulated significantly until the vibration reaches a relatively high intensity.

Oh.

So when the brain starts seeing signals from the outer hair cells, it essentially says, Okay, this is getting loud.

Yeah.

And we absolutely must talk about how the ear compresses that volume, which physiology calls the power law.

Yes, the power law.

If researchers map out human hearing thresholds on a graph, the sheer range of physical energy our ears can handle is just staggering.

The difference in actual physical sound energy between the softest whisper you can barely detect and the loudest noise you can tolerate experiencing pain.

It's roughly a one trillion fold increase in actual sound energy.

It's hard to even wrap your head around that.

A trillion times more energy.

But when a jet engine takes off, we obviously don't perceive it as a trillion times louder than a whisper.

Our brains would overload.

They absolutely would.

The ear brilliantly compresses that trillion fold energy increase into roughly a 10 ,000 fold change in perceived sensation.

How?

It operates on an inverse power function where perceived loudness changes in proportion to the cube root of the actual sound intensity.

This is exactly why we use the decibel scale to measure sound.

It's a logarithmic scale specifically designed to compress this massive trillion fold reality into manageable number.

And that threshold graph also proves we don't hear all pitches equally, right?

Definitely not.

You can hear a 3000 cycle per second sound, which is close to a baby crying at a tiny, fraction of the energy required to hear a deep 100 cycle hum.

Our ears are aggressively tuned for human survival frequencies.

Yes they really are.

So the signal has been fully translated and compressed.

It leaves the ear and travels up the auditory nerve.

Let's trace it into the brain.

All right.

The pathway from the ear to the cortex is essentially a massive neural relay race.

Okay.

It moves from the spiral ganglion to the cochlear nuclei in the medulla.

From there, it crosses over to the superior olivary nucleus, travels up through the lateral lemniscus to the inferior colliculus, stops at the medial geniculate nucleus of the thalamus, and finally reaches the primary auditory cortex in the temporal lobe.

You know, when I first read that, my eyes completely glazed over.

It sounds like reading a bus schedule.

It does, a bit.

But the insight here isn't just memorizing the stops.

It's realizing that this pathway isn't just a simple extension cord plugging the ear into the brain.

The signal is being actively dissected, filtered, and processed at every single handoff.

Heavily processed.

By the time the electrical signal finally reaches the auditory cortex, it is highly organized into tonotopic maps.

Tonotopic maps.

Yes.

Just like the physical basal membrane is laid out by pitch, the cortex physically maps high frequencies to one end of the tissue and low frequencies to the other.

There are at least six of these different physical maps in the brain.

Six of them.

Yeah.

Each likely dissecting out different qualities of the sound, like how fast the sound started or how it changes over time.

And the frequency response gets violently sharpened, doesn't it?

Down in the cochle, a single neuron might respond to a somewhat broad range of frequencies because the physical wave isn't perfectly sharp.

But by the time it reaches the cortex,

that neuron is hyper specific to one exact pitch.

That sharpening happens through lateral inhibition.

Lateral inhibition, right?

As the signal travels up the brainstem, collateral nerve fibers branch off and actively inhibit the adjacent pathways.

Oh, wow.

Yeah.

It essentially silences the background noise of neighboring frequencies to make the exact target pitch stand out dramatically.

It's very similar to how the visual system sharpens the edges of an image by darkening the borders.

And then there's the interpretation phase.

A fascinating clinical correlation from the text involves the auditory association area,

specifically Wernicke's area.

If a patient suffers damage to this specific part of the cortex, their anatomical hearing remains perfectly intact.

They can differentiate tones, they can hear words, they can even repeat those words back to you.

But they lose all ability to understand what the words actually mean.

It's a profound physiological reminder that hearing is physical, but understanding is purely cortical.

The translation of sound into meaning has been completely severed there.

Wow.

Now, beyond just identifying a sound, the central pathways have another critical function,

locating where a sound is coming from in space.

Oh, right.

Biologically, there are two main ways our bodies handle this on a horizontal plane, time lag and intensity difference.

Time lag is used for low frequencies.

So if a dog barks on your left, that sound wave hits your left ear literally a fraction of a millisecond before it wraps around your head and hits your right ear.

Exactly.

But for high frequencies, the physical mass of your head actually blocks the sound wave, creating an intensity difference.

It is noticeably louder in the left ear than the right.

And the real mathematical genius of this happens down in the brainstem, specifically in the superior olivary nucleus.

It is physically divided into two sections to do this exact calculus.

Okay.

How so?

Well, the lateral portion detects those intensity differences for high pitches, but the medial superior olivary nucleus is hardwired to detect the time lag.

So it's essentially a highly tuned biological stopwatch.

Exactly.

The neurons in the medial section have dendrites stretching in both directions.

A sound from the right ear hits the right dendrite.

A sound from the left hits the left.

And these neurons are mapped so precisely across the tissue that specific neurons will only fire if there is an exact specific millisecond delay between the left and right signals.

It calculates the geometry of the sound source based purely on microsecond delays.

That is amazing.

And if we look at the very end of this physiological chain, there's a concept that completely flips everything we've talked about.

Yeah.

Retrograde signals or centrifugal pathways.

Oh, this is one of my favorite parts.

Up until now, everything's been traveling from the ear up to the brain, but there are nerve fibers that travel backwards from the brainstem all the way down to the tiny hair cells in the cochlea themselves.

And these are inhibitory signals.

Right.

It creates an active descending feedback loop.

So what does this all mean for you, the listener?

This is the exact mechanism that explains how you can stand in a loud crowded party and somehow tune out all the overlapping chatter to focus on a single voice.

Or how you can listen to a symphony orchestra and isolate the sound of one single oboe.

Yes.

Your brain is reaching all the way back into your inner ear and actively turning down the volume on the hair cells, responding to the frequencies you want to ignore.

It highlights the brilliant integrated outcomes of the entire system.

And to fully appreciate this, we have to look at what happens when the anatomy fails.

Glingle hearing loss is divided into two primary categories, nerve deafness and conduction deafness.

Right.

And if you know how to read an audiogram, which is a graph plotting a patient's hearing thresholds, you can use pure logic to deduce exactly which system is broken.

Exactly.

An audiogram tests two different things.

Air conduction, which is sound played through headphones,

and bone conduction, which is sound vibrated directly into the skull behind the ear.

So in a case of old age nerve deafness or presbycusis, the graph will show the lines for both air conduction and bone conduction dropping off drastically at the high frequencies.

Why both?

Because the damage is located at the base of the cochlea or the auditory nerve itself.

It doesn't matter if the sound travels through the air in the ear bones or directly through the skull.

The physical receptor is broken, so neither method works.

Right.

That makes sense.

But then you have middle ear sclerosis, which causes conduction deafness.

In this patient's graph, the air conduction line is deeply depressed, meaning they struggle immensely to hear sound through the air.

But their bone conduction line is completely normal.

Because if a disease causes the middle ear bones to literally freeze up and calcify, completely blocking the oval window, sound can't get through the air pathway.

But the cochlea and the nerve are completely healthy.

So vibrating the skull bone delectly bypasses the frozen middle ear entirely, yielding perfect hearing.

Understanding the separate anatomical pathways perfectly explains the clinical diagnostics.

That is just incredibly satisfying logic.

To quickly recap the beautiful chain we just traced, we started with physical air pressure moving the eardrum.

Those tiny bones multiplied the force to overcome the inertia of fluid, creating a wave in the cochlea.

Right.

That wave sheared microscopic hairs, triggering a 150 millivolt electrical gradient to flood the cell with potassium.

That depolarization became a nerve signal, heavily processed and sharpened through complex relays in the brainstem for timing and location, before finally reaching the cortex where it is mapped out and perceived as meaningful sound.

It's quite a journey.

And if we connect this to the bigger picture, I want to leave you with a final thought to ponder.

That's it.

We talked about those retrograde signals the brain sends back down to the outer hair cells.

Research suggests that stimulating these nerve fibers can cause the outer hair cells to physically shorten and change their stiffness.

Wait, really?

Yes.

Think about the implications of that.

Your brain isn't just passively receiving sound like a microphone.

It is actively physically altering the microscopic shape of your inner ear in real time to change what you are physically capable of hearing.

Oh, wow.

You are literally reshaping your own sensory organs right now just by listening to this.

That is the perfect place to leave it.

Thank you so much for joining us on this deep dive.

And for everyone using this as their last -minute lecture prep, we sincerely thank you for listening.

And the last -minute lecture team hopes this helps you absolutely ace your physiology studies.

Keep asking questions, and we'll see you next time.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers

Sound transmission through the ear involves an elegant cascade of mechanical and electrical processes that convert vibrations in air into neural signals the brain can interpret. The middle ear addresses a fundamental physical challenge: cochlear fluid has much greater inertia than air, requiring force amplification to stimulate inner ear structures effectively. The ossicular chain of malleus, incus, and stapes, combined with the surface area differential between the tympanic membrane and stapes footplate, magnifies sound vibrations approximately twenty-fold. The attenuation reflex protects against acoustic trauma by stiffening the ossicular system in response to intense sounds, simultaneously enhancing the ear's ability to focus on speech frequencies while suppressing low-frequency background noise. Within the cochlea's three-chambered structure, the basilar membrane exhibits remarkable frequency-selective properties due to variations in fiber stiffness along its length, with short rigid fibers at the base responding to high frequencies and longer flexible fibers at the apex responding to low frequencies. Sound waves generate traveling waves that propagate along this membrane and dissipate at locations matching the sound's resonant frequency. The organ of Corti, sitting atop the basilar membrane, contains inner and outer hair cells whose stereocilia bend in response to basilar membrane displacement. This mechanical deformation opens ion channels, allowing potassium influx from the potassium-rich endolymph and triggering glutamate release that stimulates auditory nerve fibers. The sustained positive endocochlear potential of eighty millivolts enhances hair cell sensitivity to minimal vibrations. Frequency perception relies on the place principle, whereby specific basilar membrane regions signal particular frequencies to the brain, supplemented by the volley principle for lower frequencies where neural firing synchronizes with sound wave oscillations. Loudness encoding involves increased firing rates from stimulated hair cells, recruitment of additional sensory elements, and activation of outer hair cells, allowing the auditory system to compress an enormous intensity range spanning one trillion-fold. Central processing routes auditory information through successive brain structures including cochlear nuclei, superior olivary complex, inferior colliculus, medial geniculate nucleus, and ultimately primary auditory cortex in the temporal lobe, where tonotopic organization preserves frequency information. Sound localization emerges from interaural time differences at lower frequencies and intensity disparities at higher frequencies, analyzed principally in superior olivary nuclei. Deafness manifests as either conduction impairment from middle ear pathology or nerve deafness from cochlear or neural damage, with bone conduction sometimes providing an alternative pathway when ossicular transmission fails.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥

Chapter 53: The Sense of Hearing

Related Chapters