Chapter 3: Protein Structure & Function

Search this chapter

Audio Overview

0:00 / 0:00

Autoplay next chapter

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replaced the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

Okay, let's unpack this.

Welcome to the Deep Dive, where we take the intricate, complex, molecular worlds of the cell and try to translate them into foundational knowledge for you.

Today, we are diving deep into the absolute workhorses of all cellular life proteins.

And this isn't just about memorizing structures.

No, not at all.

This is really a master class in molecular cell biology.

We're exploring how those initial linear chains of amino acids can almost magically transform into these complex three -dimensional architectures that make everything in biology possible.

That transformation is the whole game.

It's the central concept.

The entire field operates on this one fundamental principle, structure dictates function.

So, our mission today is to take this really detailed chapter and translate it into clear, step -by -step concepts.

We want to emphasize the cause and effect logic, how a tiny molecular detail like the placement of one side chain leads directly to a crucial cellular outcome.

And when you look at the sheer diversity of tasks proteins do, I mean, from catalyzing reactions to moving our muscles,

it's just astounding.

But what's really amazing is that functionally, they only exploit three core mechanisms.

That's such a crucial insight because everything a protein does boils down to one of these three things.

The first and maybe the simplest is binding.

Okay.

Proteins bind to other molecules, which we call ligands.

And this binding is completely driven by molecular complementarity.

So like a lock and key, a perfect fit.

Exactly.

The shapes, the chemical groups,

everything on the protein's binding side has to match its partner perfectly.

And the second mechanism.

That would be enzymatic catalysis.

These proteins, the enzymes, are designed so their 3D shape aligns specific amino acid side chains just right.

And that positioning lets them do what, exactly?

It lets them rearrange the covalent bonds in other molecules, the substrates, and they do it incredibly quickly.

Okay.

So binding and catalysis was the third one.

The third is regulation, which is all about control.

A protein's activity gets altered usually by binding a partner or getting some kind of chemical modification.

And that induces a huge conformational change.

A shape shift.

A shape shift, yeah.

And that acts like a molecular switch.

It turns the protein's function precisely on or off in response to some kind of signal.

So these proteins fall into several key classes.

You've got the structural proteins.

I like to think of them as molecular girders or guide wires.

That's a perfect analogy.

They define the cell shape, things like actin or collagen.

Then you have the scaffold proteins.

These sound like the organizers.

They are the organizational experts, the molecular assembly lines.

They bring other proteins together in these ordered arrays, which makes sure that, you know, multi -step processes can happen way more efficiently than if everything was just floating around on its own.

It cuts down on the diffusion time, right, makes the whole system faster.

Exactly.

Then, of course, you have the enzymes, the chemists of the cell.

You have membrane transport proteins managing flow across boundaries,

regulatory proteins acting as signals and sensors, and the phenomenal motor proteins that are responsible for pretty much all cellular movement.

And it's important to remember that these rules often get combined.

Oh, absolutely.

Many proteins are multi -talented.

A cell surface receptor, for example.

It's a regulatory protein because it senses a signal, but it's also an enzyme because it transmits that signal by, say, catalyzing a reaction inside the cell.

And when these multi -talented proteins come together, we call them molecular machines.

Yes.

So to really get how all this function emerges, we need to map out the architecture.

We're going to follow the standard four -level hierarchy of protein structure.

That's the roadmap.

Primary, secondary, tertiary, and finally, quaternary structure.

So let's start at the absolute foundation, which is the primary structure, the simple linear sequence of amino acids.

This is the blueprint.

It's encoded directly by your DNA and it dictates absolutely everything that comes after.

It's just a specific order in which amino acids are linked together.

And those links are called peptide bonds.

Right.

They're formed by a condensation reaction.

The carboxyl group of one amino acid joins the amino group of the next and a water molecule is lost in the process.

And because of how they're formed, the resulting chain has an inherent directionality.

It does.

We always talk about the N -terminus, the amino end where synthesis starts, and the C -term is the carboxyl end where it finishes.

It's a one -way street, which is crucial for how it folds.

And just a quick technical point that comes up all the time size, we measure protein mass in Daltons or day.

Or kilodaltons, KDA.

And if you're in the lab, you can do a quick back of the envelope calculation.

The average amino acid residue weighs about 113 day.

So if you know you have a 50 kiloday protein.

You can estimate it's roughly 440 amino acids long.

Exactly.

It's a useful shorthand.

Okay.

Now we get to the geometry, which is where things get really interesting.

Secondary structure, the local folding.

The key insight here is that the peptide bond itself is surprisingly rigid.

It's very rigid and it's

That's because it has a partial double bond character, which locks it in place almost always in what we call the trans configuration.

So all the flexibility for folding has to come from somewhere else.

It does.

It's limited to rotation around just two bonds attached to the central alpha carbon,

the phi angle and the psi angle.

It's those two angles that determine the orientation of adjacent planes,

but the side chains, the R -groups, they get in the way.

They do.

They are just physically impossible.

This dramatically limits the folding options and channels the chain into a very small number of stable arrangements.

And the two most common of those are the major secondary structures.

The first is the alpha helix.

Right.

Think of this as a stable rod -like cylinder.

What holds that rod shape together?

It's a really beautiful pattern of hydrogen bonds that form within the backbone itself.

The carbonyl oxygen of one residue, let's call it residue N, forms a hydrogen bond with the unmide hydrogen of the residue at position N plus four.

So it's a repeating regular pattern that holds the backbone straight.

Perfectly.

And all the amino acid side chains, they project outward from this cylinder.

Their chemistry, whether they're hydrophobic or hydrophilic, determines where that helix is going to end up in the final protein.

And there's one famous exception to this role,

the amino acid

Yes.

Proline is known as a helix breaker.

Its structure is unique.

The amino group is part of this rigid ring.

It can't contribute the right hydrogen atom for that end to end plus four pattern.

So it just introduces a kink.

A sharp kink.

Yeah.

It breaks the helix.

Okay.

So the second major secondary structure is the beta sheet.

If the helix is a rod, this is more like a flexible flattened pleated structure.

It is.

It's made of laterally packed strands called beta strands.

These are short, almost fully extended segments of the polypeptide.

And here the stabilization happens between the strands, not within one.

Right.

Hydrogen bonds form perpendicularly between the carbonyl oxygen of one strand and the emanate hydrogen of an adjacent strand.

And because the R groups alternate, pointing above and below the plane,

the whole sheet looks pleated.

And there are two ways these strains can line up, correct?

Correct.

They can be anti -parallel, where adjacent strands run in opposite directions.

This gives you perfectly straight, really strong hydrogen bonds.

Or.

Or they can be parallel, where all the strands run in the same direction.

The hydrogen bonds are a little bit angled here, so they're slightly weaker, but still very effective at holding the sheet together.

I've also seen cases where a beta sheet curves around on itself.

That's a beta barrel.

These are fascinating, especially in membranes.

The outside of the barrel can be hydrophobic to sit nicely in the lipid part of the membrane.

While the inside forms a hydrophilic pore.

Exactly.

Like a channel or a hydrophilic chimney that lets ions and small molecules pass through.

And to make all these compact shapes, the chain has to be able to turn back on itself.

That's the job of the beta turn.

These are just sharp U -shaped bends made of four residues.

They're essential for folding long chains into compact globular shapes.

And you often find glycine and proline here, because their unique structures help facilitate that really tight 180 degree turn.

Okay.

Moving up a level, we have structural motifs or super secondary structure.

These are recurring combinations of secondary structures.

Yes.

And they often have a specific reusable function.

The classic example is the coiled coil.

This is where two, three, or even four alpha helices wrap tightly around each other.

And they're incredibly stable because of a pattern in the primary sequence called a heptad repeat.

Right.

A repeating seven amino acid unit.

The genius of this repeat is that it ensures that hydrophobic side chains, like glycine, always land at the first and fourth positions.

So they line up along one face of the helix.

They do.

And when two or more of these helices come together, these hydrophobic strips act like molecular Velcro.

They interlock, hiding away from the water in the core of the coil, making the whole structure incredibly stable.

We also see other common motifs like the EF hand, which is a helix loop helix that's crucial for binding calcium ions.

And the zinc finger, which often involves helices and strands stabilized by a zinc ion.

You see that a lot in DNA binding proteins.

All right.

Let's get to the full picture.

Tertiary structure.

This is the overall complete 3D confirmation of a single polypeptide chain, the whole functional shape.

And this final fold is stabilized primarily by non -covalent interactions.

So hydrogen bonds, ionic bonds, and especially the hydrophobic effect.

But for proteins that are going to be secreted or live on the cell surface.

You see covalent disulfide bonds.

Exactly.

They form between two cysteine residues and act like molecular staples, locking the fold into place and making it much more stable in the harsh environment outside the cell.

The single biggest driver of this final fold, especially for water -soluble proteins, is the oil drop model.

This is a fundamental concept.

It's not so much that the hydrophobic side chains like each other, but the water hates them.

Right.

It's the hydrophobic effect.

The hydrophobic effect forces these non -polar oily side chains to cluster together in the center of the protein away from the water.

This minimizes their exposed surface area and maximizes the entropy of the surrounding water molecules.

So the water is really the ultimate architect.

It shoves all the oily parts into the middle.

It does.

And at the same time, the charged and polar side chains, the hydrophilic ones, stay on the outer surface where they can happily interact with water.

Trichiary structure also introduces the idea of domains.

These are distinct modular regions of the protein.

And we categorize them in a few ways.

A functional domain is a region with a specific activity, like the ability to bind DNA.

You can often cut just that piece of the protein out and it will still perform that one function.

Then you have a structural domain, which is a part of the protein, usually 40 amino acids or more, that can fold up into a stable shape all by itself.

And many proteins are just mosaics of these domains linked by flexible spacers.

If the amino acids are letters, the domains are like words.

Each one has a distinct transferable meaning or function.

We see this all the time.

The EGF domain, for example, is found in dozens of different proteins and in each one it's conferring a specific binding ability.

We also define topological domains just based on where they are.

So cytoplasmic domain, a membrane -spanning domain, and so on.

Before we leave structure, we should touch on evolutionary relationships.

Proteins from a common ancestor are called homologs.

And they're grouped into families.

It's really remarkable how similar their core structures can be, even if their sequences have diverged over millions of years.

But the truly revolutionary discovery of the last couple of decades has been the existence of intrinsically disordered proteins or IDPs.

Yes, these are segments or even entire proteins that just don't have a fixed stable 3D structure.

They're incredibly flexible, like molecular spaghetti.

That seems to fly in the face of everything we just said about structure dictating function.

How can something without a stable shape have a function?

Its function is its formlessness.

Their amino acid composition is key.

They're rich in polar amino acids, but very poor in hydrophobic ones.

So that oil drop clustering can't happen.

They can't form a stable core?

Exactly.

So they remain flexible.

They serve as flexible linkers or binding sites or signals.

And very often, they only fold into a stable structure when they bind to a partner.

The POA protein is a classic example.

So it's a binding -induced folding event.

And there are two models for how this happens.

Right.

One is conformational selection, where the disordered protein is constantly sampling different shapes and its partner just selects or captures the one that fits.

And the other?

The other is induced fit.

In this model, the protein starts to bind while it's still flexible and the act of binding itself induces it to fold into the correct final shape.

And the evidence seems to favor induced fit for many of these interactions.

Finally, we get to quaternary structure, the highest level of organization.

This describes proteins made of more than one separate polypeptide chain or subunit that assemble together.

Hemoglobin is a classic example, a tetramer of two alpha and two beta chains.

And sometimes this assembly is all about efficiency.

Yes, metabolic coupling.

You see this in large multi -enzyme complexes.

By putting enzymes that work in a sequence right next to each other, the product of one reaction is immediately available to the next enzyme.

It channels the intermediates.

Exactly.

It prevents them from diffusing away, which makes the whole pathway vastly more efficient.

More recently, we've learned that proteins with lots of these disordered regions can assemble into what we call biomolecular condensates.

These are like membrane -less liquid -like compartments that concentrate molecules to facilitate specific reactions.

Okay, we've mapped the final structure.

So the next big question is, how does it get there?

How does this chaotic linear chain find its one specific 3D shape out of a nearly infinite number of possibilities?

This is the heart of the folding problem.

And the foundational work here was done by Christian Anthensen.

His experiments showed that the primary sequence,

that string of amino acids, contains all the information needed for folding.

He could denature a protein, make it a useless mess, and it would spontaneously refold in a test tube.

Exactly.

In vitro, it can self -assemble.

So if it can do it on its own, why does the cell need folding assistance in vivo, in a living cell?

Because the environment inside a cell is nothing like a dilute test tube.

The cytosol is incredibly crowded.

It's a molecular traffic jam.

Okay.

And in that crowded space,

unfolded or partially folded proteins are really dangerous because they expose their sticky hydrophobic side chains.

The high concentration makes it very likely they'll just stick to other unfolded proteins before they get a chance to fold properly.

And that leads to aggregation, toxic clumps.

Precisely.

So to fight this, the cell has a quality control team, the chaperones.

These are the folding assistants?

They are.

They're highly conserved, absolutely essential proteins.

They bind unfolded proteins, sequester them, and give them the time and space to fold correctly away from the chaos.

They also help disassemble toxic aggregates if they do form.

And their whole function is tied to energy.

Absolutely.

Chaperones are ATP dependent.

They use cycles of ATP binding, hydrolysis, and nucleotide exchange to drive these big conformational changes in themselves.

This lets them control how tightly they bind their substrate.

And crucially, for how long?

Let's start with the molecular chaperones, like the Hsp70 family.

They seem like the first responders.

They really are.

Hsp70 binds to short hydrophobic stretches of a polypeptide.

And it's often right there at the ribosome, binding to the protein chain as it's being synthesized.

So it gets involved immediately?

Immediately.

And the Hsp70 cycle is this beautiful example of regulated timing.

The substrate first binds to Hsp70 when the chaperone is in its open state, bound to ATP.

Then what happens?

A co -chaperone, like Hsp40, comes in.

It stimulates Hsp70 to hydrolyze its ATP to ADP.

And what does that hydrolysis do?

It triggers a huge conformational change.

Flipping Hsp70 to a closed state that binds the substrate very tightly.

This traps the sticky regions and prevents them from aggregating.

So to restart the cycle, it has to open back up.

Right.

Another factor comes in and promotes the exchange of that ADP for a new ATP.

This flips Hsp70 back to the open state, and the substrate is released, getting another chance to fold.

If it fails, the cycle just repeats.

There's another type, Hsp90.

This one seems to handle proteins that are already partially folded.

Yes.

Hsp90 is critical, especially in eukaryotes.

It usually recognizes clients that are a bit further along in the folding process, including a lot of really important regulatory proteins like kinases.

Hsp70 often hands its clients off to Hsp90 for the final steps.

So it's more of a finishing school for proteins?

You could say that.

It uses its ATP cycle to promote the client's final remodeling and activation, really pushing it into its final active shape.

Okay, now for the second major family, the chaperonins.

These are like physical folding chambers.

They are.

They're massive supramolecular assemblies.

The bacterial one, Gro -Groes, is the classic example.

Gro -Ale is a barrel made of two stacked rings, forming two little chambers, and Groes is the lid.

And how does the Gro -Groes cycle work?

It sounds like a piece of molecular machinery.

It is.

A partially folded protein enters one of the open chambers of the Gro -Ale barrel.

Then, when ATP binds to that ring, it triggers a conformational change that allows the Gro -Ale lid to seal the chamber.

So the key here is isolation.

Isolation, exactly.

The inside of the chamber becomes more hydrophilic, giving the protein a protected private space to try and fold correctly.

And it has a built -in timer.

It does.

The hydrolysis of the ATP in that sealed chamber is the slow rate limiting step.

That's the timer.

And once it's hydrolyzed, that signals for the next step.

ATP and a new Gro -Ale's lid bind to the opposite ring.

And that binding on the second ring forces the first one open.

It physically pries the lid off the first chamber, releasing the now hopefully folded protein.

It's an incredibly coordinated two -stroke engine.

But sometimes, despite all this help, folding fails.

And that can lead to devastating consequences.

Misfolding in amyloids.

Misfolded proteins can accumulate in these toxic aggregates or plaques.

And the most common highly organized structure they form is called the amyloid state.

And what's wild is that all sorts of different proteins with completely unrelated sequences can all form these amyloid fibrils that have the same basic structure.

That structure is the cross -beta sheet structure.

Short segments from many copies of the protein -hydrogen bond together into these long arrays of beta sheets.

The key thing to visualize is that the individual beta strands run perpendicular to the long axis of the fibril.

Like rungs on a ladder.

Exactly, like rungs on a ladder.

And that perpendicular, highly organized structure is what makes them so incredibly stable and resistant to being broken down by the cell's normal machinery.

Which is why they accumulate in diseases like Alzheimer's and Parkinson's.

It's a fundamental breakdown of this quality control system.

A complete breakdown in proteostasis, yes.

All right.

With structure and quality control in place, let's talk about how proteins actually do their jobs.

We'll start with the basics.

Ligand binding.

A ligand is just any molecule a protein binds to.

And the two defining features of that interaction are specificity and affinity.

Specificity is the ability to bind just one particular molecule, right?

Yes, sometimes distinguishing between molecules that differ by just a single atom.

The antibody is the classic example.

It binds its target, the epitope, with incredible precision, all because of that perfect molecular complementarity.

And affinity is just the strength of that binding, how tightly it holds on.

Right.

And importantly, ligand binding almost always causes a conformational change in the protein.

This isn't a side effect, it's the main event.

It's the action that allows the protein to do its job.

Okay, let's move on to enzymes.

Nature's catalysts.

These are the proteins responsible for all the chemical reactions in the cell.

They are biochemical marvels.

They can increase reaction rates by factors of, you know, a trillion or more.

But crucially, they are true catalysts.

They do not change the overall equilibrium of the reaction.

They just make it happen much, much faster.

Incredibly faster.

They do it by lowering the activation energy.

They provide a special environment, the active site, that perfectly stabilizes the unstable high -energy transition state of the reaction.

And the architecture that does this is the enzyme active site.

This is usually a cleft or a pocket on the enzyme, formed by amino acids that might be very far apart in the linear sequence, but are brought together by folding.

And it has two distinct regions.

First is the substrate binding site, which is responsible for that exclusive specificity we talked about.

And second is the catalytic site, which has the chemical groups that actually carry out the reaction once the substrate is locked in place.

To measure how well they work, we use enzyme kinetics.

The process starts with the enzyme E binding the substrate S to form an enzyme substrate complex, ES.

Right, and that ES complex then converts to the product P, and the free enzyme is released to start again.

By plotting the reaction rate versus the substrate concentration, we can get two key numbers.

The first is Vmax, the maximal velocity.

That's the enzyme's top speed when it's completely saturated with substrate.

It's a measure of its raw catalytic power.

The second number is the Michaelis constant, or Kaolin -Molliston.

This is the substrate concentration you need to reach half of Vmax.

And this is really informative, because it's a rough measure of the enzyme's affinity for its substrate.

It is.

So if an enzyme needs to be active, even when its substrate is rare, you'd expect it to have a very low galler.

Right.

It's very efficient at low concentrations.

Exactly.

Whereas an enzyme with a high moller only really gets going when the substrate is abundant, it links the enzyme's kinetics directly to the cell's physiology.

To see how all this works in practice, let's look at the serine proteases like trypsin and chymotrypsin.

They're a perfect case study.

Their specificity is determined entirely by the chemistry of a little pocket in the active site, the side chain specificity binding pocket.

So trypsin has a negatively charged amino acid at the bottom of its pocket.

An aspartate, yes.

So it loves a binding cleave next to long positively charged side chains like arginine or lysine.

The opposite charges attract.

But chymotrypsin has a large hydrophobic pocket.

So it cleaves next to large hydrophobic side chains like phenylalanine.

And the lastase has a tiny little pocket, so it only cleaves next to small side chains like glycine.

Just a few amino acid changes completely redefine the enzyme's target.

OK, once the substrate is bound, the catalytic mechanism kicks in using the famous catalytic triad of serine, histidine, and aspartate.

How does it work?

The histidine and aspartate work together to make the hydroxyl group on the serine a very powerful nucleophile.

This activated serine then attacks the peptide bond of the substrate.

Creating a really unstable intermediate structure.

The first tetrahedral intermediate, which is the transition state.

And the enzyme has a special feature called the oxyanion hole that uses hydrogen bonds to stabilize this very unstable state.

That's the key to catalysis.

OK, so once that's stabilized, the peptide bond breaks.

One half of the product is released.

And the other half is left covalently attached to the serine.

This is called the acyl enzyme intermediate.

Then the rest of the process is just regeneration.

Right.

A water molecule comes in, attacks the acyl enzyme, forms a second tetrahedral intermediate, which is again stabilized.

And then finally, the bond to the serine breaks.

The second product is released.

And the enzyme is back to its original state, ready to go again.

That is an incredibly elegant system.

And it brings us back to this idea of cooperation and metabolic coupling.

Efficiency is everything in the cell.

Yeah.

So instead of letting molecules diffuse randomly, enzymes in a pathway are often assembled into huge multi -enzyme complexes or onto a scaffold protein.

This allows them to channel the product of one reaction directly to the active site of the next.

Exactly.

It minimizes diffusion time, prevents side reactions, and increases the overall speed of the pathway by orders of magnitude.

All right.

Once a protein is built and working, the next challenge is control.

The cell needs to be able to switch its activity on and off and manage its lifespan.

And there are really three major strategies for this.

First, control the amount of the protein through synthesis and degradation.

Second, control its intrinsic activity through modifications.

And third, control its location.

Let's start with controlling the amount through degradation.

This is the concept of proteostasis.

Right.

Protein lifespan is very wildly.

Some last for minutes, others for decades.

And regulated protein degradation is vital for two reasons.

First, it's the ultimate quality control that gets rid of toxic, misfolded proteins.

And second.

Second, it allows for rapid changes in protein levels.

If the cell needs to quickly down -regulate a pathway,

destroying a key enzyme is a very fast and powerful way to do it.

And the machine that does this destroying is the proteasome, the cellular chamber of doom.

It's a massive multi -subunit machine that degrades proteins.

And the process involves a few key steps.

First, the protein has to be tagged for destruction.

Then the proteasome binds it, unfolds it, and feeds it into an inner chamber.

Where proteolytic subunits chop it into little peptides.

So what does this machine look like?

The core is the cylindrical 20S core particle.

It's made of four stacked rings.

The two inner rings house the three different proteolytic active sites that can cleave after hydrophobic, acidic, or basic residues.

So it can chew up pretty much any protein.

And on top of that core, you have the regulatory part.

The 19S regulatory particle, or RP.

This is the brains of the operation.

It recognizes the tag.

And it has six powerful AAAAT passes in a ring.

These AT passes use energy to unfold the target protein and thread it into the 20S core.

So the tag that marks a protein for this fate is ubiquitin.

How does that work?

It's a three enzyme cascade.

You have E1, the activating enzyme, which uses ATP to activate a ubiquitin molecule.

Then E2, the conjugating enzyme, takes the ubiquitin from E1.

And the key step is E3.

E3, the ubiquitin ligase, is the critical one.

It recognizes the specific target protein and transfers the ubiquitin from E2 onto a lysine residue on that target.

It keeps doing this, building a long polyubiquitin chain.

The sheer number of E3 ligases is what gives the system its specificity.

Humans have over 600 E3s, each targeting a different set of proteins.

And for the proteasome to see the tag, you usually need a chain of at least four ubiquitins linked in a specific way.

And once the protein is degraded, other enzymes called dubs recycle the ubiquitin monomers.

It's a very economical system.

Okay, let's move to controlling intrinsic activity.

First, non -covalent allosteric switches.

Allostery is just communication within a protein.

A ligand binds at one site, site A, and that causes a conformational change that alters the activity at a totally different site, site B.

The classic example is feedback inhibition in metabolic pathways.

The final product of the pathway comes back and shuts down one of the first enzymes.

Exactly.

And we see two other critical allosteric switches all the time in signaling.

The first is CARI2 plus calmodulin switching.

The cell keeps free calcium levels extremely low.

But a signal can cause a rapid massive spike.

And that spike is sensed by proteins like calmodulin.

Calmodulin has four binding sites for calcium called EF hands.

When calcium binds, it causes this huge conformational change in calmodulin, exposing a big hydrophobic patch.

And that patch then allows it to bind to and regulate other proteins.

Exactly.

It wraps around specific parts of its target proteins and changes their activity.

It's a very sensitive calcium sensor.

The other essential switch is the GTPase switching family, which includes proteins like rays.

These proteins are molecular switches that cycle between two states.

They're active or on when they're bound to GTP.

And they're inactive or off when they're bound to GTP.

And they need helper proteins to switch between these states.

They do.

To turn the switch on, they need a GEF, a guanine nucleotide exchange factor, which helps them kick out the old GTP and bind a new GTP.

And to turn off, they hydrolyze the GTP to GTP.

Right.

And that hydrolysis is sped up dramatically by another helper protein called a GK, a GTPase -activating protein.

The GKP acts like a timer, making sure the on state doesn't last too long.

OK, finally, let's cover regulation by covalent modifications or PTMs.

This is a whole system of writers, readers, and erasers.

A writer enzyme adds the modification.

A reader protein recognizes it.

And an eraser enzyme removes it.

And the most important PTM is phosphorylation and dephosphorylation.

This is the reversible addition of a phosphate group, usually from ATP, onto a serine, threonine, or tyrosine residue.

The writers are the protein kinases.

And the erasers are the phosphatases.

This is the cell's main on -off switch.

Adding that big, doubling negative phosphate group can instantly change a protein's shape, activity, location, pretty much everything.

To see how this works, let's look at protein kinase A, or PKA.

All kinases have a conserved kinase domain with two lobes and an active site in the cleft between them.

And they are highly specific.

PKA, for example, recognizes a very specific sequence motif around the serine or threonine that's going to phosphorylate.

It does.

It requires a sequence like Lyre -XSS -Phi.

And that's because its active site is perfectly shaped to recognize that motif.

It has negative pockets for the two positive arginines and a hydrophobic pocket for the hydrophobic residue.

It's pure molecular complementarity.

And the kinase itself is regulated by phosphorylation.

Yes.

It's often part of a cascade.

Many kinases are activated by phosphorylation of a key residue in a region called the A -loop, or activation loop.

This phosphorylation causes a conformational change that opens up the active site and turns the kinase on.

So while phosphorylation is reversible, some modifications aren't.

We have to mention proteolytic cleavage as an irreversible form of regulation.

This is for permanent activation.

It's how you turn inactive precursors, called zymogens, into fully active enzymes.

This is what kicks off massive cascades like blood clotting.

It's a way to get a very rapid all -or -nothing response.

To understand all this at an atomic level requires some incredibly powerful tools.

We need ways to separate, identify, and actually see these molecules.

And we can start with the basic separation techniques.

The foundation is centrifugation.

For really big stuff, we use differential centrifugation.

You spin your sample, and the heaviest things, like nuclei, pellet out first.

And for finer separation by mass.

You use rate zonal centrifugation, which uses a density gradient.

As you spin, proteins move through the gradient at different rates, based on their mass and shape, separating into discrete bands.

Next up is electrophoresis.

And the technique that really revolutionized protein science is SDS -PAGE.

SDS -PAGE is so powerful because it gets rid of the native differences in shape and charge.

You treat your proteins with the detergent SDS, which denatures them and coats them all with a uniform negative charge.

So now everything has the same charge to mass ratio.

Exactly.

So when you run them through the polyacrylamide gel, which acts like a sieve, they separate almost purely based on their size.

Smaller proteins move faster.

And to get even higher resolution for really complex mixtures, you can use two -dimensional gel electrophoresis.

Right.

This combines two different separation methods.

The first dimension is isoelectric focusing, which separates proteins based on their isoelectric point, PI, which is the pH where they have no net charge.

Then you take that strip and run it in the second dimension.

Which is a standard SDS -PAGE gel, to separate them by size.

You end up with this beautiful 2D map of spots, where each spot is a unique protein defined by both its charge and its mass.

Okay, moving from gels to columns, we have chromatography.

First up, gel filtration.

This separates purely by size, using porous beads.

And it's a bit counterintuitive.

The bigger proteins can't fit in the pores, so they flow around the beads and come out first.

The smaller proteins get trapped in the pores and travel slower.

Then there's ion exchange chromatography.

This separates based on net charge, using charged beads.

Proteins with the opposite charge stick, and you will loot them off by running a salt gradient through the column.

And finally, the most targeted technique, affinity chromatography.

This is the gold standard for purification.

You attach a specific ligand for your protein of interest to the beads.

Only your protein sticks.

Then you can elute it by, for example, flooding the column with a high concentration of the free ligand.

Okay, once we've separated, then we need to detect them.

And immunoblotting, or western blotting, is the workhorse here.

This combines the power of SDS page with the specificity of antibodies.

You run your gel, then you transfer or blot the proteins onto a membrane.

Then you use a primary antibody that's specific for your protein of interest.

And then you use a secondary antibody, which is linked to an enzyme that binds to the first one.

When you add the substrate for that enzyme, you get a signal, either color or light, that tells you exactly where your protein is and roughly how much of it there is.

And to figure out what proteins are interacting inside the cell, we use co -immunoprecipitation, or CoIP.

With CoIP, you use an antibody to pull down your protein of interest.

And anything that's physically stuck to it comes along for the ride.

It's a fantastic way to map out protein complexes.

We can also track a protein's fate over time using pulse chase experiments.

Here, you use radioactive amino acid to label newly made proteins.

The pulse is a short exposure to the radioactive label.

The chase is when you flood the cells with tons of non -radioactive amino acid, so no more labeling occurs.

And by taking samples over time, you can watch what happens to that initial pool of labeled protein.

You can see if it gets modified or if it gets degraded.

Exactly.

It's like putting a tracker on the protein and following its life story.

Okay, finally, to get the actual 3D atomic structure, we have a few major methods.

The classic is X -ray crystallography.

Here, you have to grow a perfect crystal of your protein.

You shoot X -rays at it, they diffract.

And from that diffraction pattern, you can computationally reconstruct the 3D atomic model.

It gives you incredible high -resolution detail.

But the bottleneck is getting the crystal.

It's a huge bottleneck, which is why the newer method, cryo -electron microscopy, cryo -EM,

has been such a revolution.

You don't need a crystal.

You just flash freeze your sample in a thin layer of ice.

And then you take thousands of pictures of individual molecules in random orientations.

And a powerful computer averages all those images together to reconstruct the 3D structure.

It's perfect for huge flexible complexes that you could never crystallize.

And for small proteins, there's NMR spectroscopy.

Right.

NMR works on proteins and solutions so you can see their dynamics.

It measures distances between atoms and uses those constraints to calculate the 3D structure.

The last technique, which is the engine of modern systems biology, is mass spectrometry, or MS.

Mass spec is an incredibly sensitive way to measure the exact mass of molecules.

But its real power for us comes from tandem MS, or MS -MS.

What does that do?

MS -MS, you take a peptide, you weigh it, then you smash into pieces with a gas, and then you weigh all the fragments.

And because it breaks up the peptide bonds, the mass differences between the fragments tell you the amino acid sequence.

Exactly.

And when you couple this to liquid chromatography, you get LC -MS -MS, a machine that can identify and sequence thousands of different peptides from a complex mixture in a single run.

And LC -MS -MS is what enables the entire field of proteomics, the systematic study of the entire set of proteins in a cell, the proteome.

It's about getting that global view.

And this approach tries to answer a few core questions.

First, what proteins are actually there and how much of each?

Which is critical, because we know that the amount of a protein often doesn't correlate well at all with the amount of its messenger RNA.

You have to measure the protein directly.

Second, what are the modifications?

What's the PTM landscape?

Phosphor proteomics, for example, is just mapping all the phosphorylation sites in a cell at once.

Right, which lets you map entire signaling networks.

Third, who's interacting with whom?

How do protein complexes change in different conditions?

And changes in protein levels or modifications that are characteristic of a certain state, like a disease, are called biomarkers.

Yes, and proteomics is hugely important for discovering new biomarkers for diagnosis and treatment.

This technology also lets us map out the contents of organelles.

That's organelle proteome profiling.

You separate out the organelles, like mitochondria, and then use LC -MS -MS to identify every single protein inside.

It's like taking a complete inventory of each cellular compartment.

And to make really precise comparisons between two different states, like a healthy cell versus a cancer cell, we can use quantitative proteomics with a technique called SILAC.

SILAC is brilliant.

You grow one set of cells with normal light amino acids and the other set with heavy amino acids that have stable isotopes.

Then you mix the samples together and analyze them at the same time in the mass spec.

So for every peptide, you see two peaks in the mass spectrometer, a light one and a heavy one.

And the ratio of the heights of those two peaks gives you a super accurate measurement of the relative abundance of that protein between your two samples.

It removes all the technical variability.

Finally, there's a cool technique for mapping who's near whom inside a living cell.

Proximity -dependent labeling.

This is great for figuring out the composition of compartments that are hard to purify.

You fuse your protein of interest to a special enzyme.

And when you add a substrate, that enzyme releases a highly reactive chemical that just tags everything nearby.

Everything within about 10 to 20 nanometers.

It covalently attaches a label, usually biotin, to any neighboring proteins.

Then you can just pull down all the biotin -labeled proteins and identify them by mass spec.

It gives you a snapshot of the local molecular neighborhood.

And that brings us to the end of our deep dive into protein architecture and control.

Here's where it all comes together.

We started with that simple linear chain, the primary structure, and saw how basic physical principles, like the rigidity of the peptide bond and the massive force of the hydrophobic effect, drive that chain to find its one specific functional shape.

We detailed the vital role of the ATP -powered chaperones, the molecular timing devices that ensure quality control and prevent the formation of those devastating amyloid aggregates.

And we saw the cell's sophisticated regulatory toolkit, instantly flipping activity with allosteric switches like calcium or the GTPase timer, or with the reversible covalent switch of phosphorylation.

And when all else fails, the molecular shredder, the podism, takes over.

Finally, we explored the amazing tools that let us see this world, from the atomic precision of X -ray and cryo -EM to the global systems -level view of proteomics and mass spectrometry.

The whole chapter, the whole field, really, revolves around this idea that form dictates function.

If the shape is wrong, the job doesn't get done.

But then you have this fascinating paradox we discussed.

The intrinsically disordered proteins, the IDPs.

They lack a stable shape, yet they're absolutely essential.

So this raises a really interesting question for you to think about.

In a world where order is life, is controlled disorder or flexibility actually the ultimate strategy for achieving functional complexity?

We hope this deep dive into the architecture regulation and detection of proteins has given you the foundational concepts you need to build your understanding of molecular cell biology.

Thank you for diving deep with us today.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers

Proteins are linear polymers of amino acids whose three-dimensional architecture fundamentally determines their biological roles in the cell. The relationship between molecular structure and function emerges through a hierarchical organization spanning primary, secondary, tertiary, and quaternary levels, where the amino acid sequence encodes the instructions for proper folding into a functional conformation. Stable structural motifs such as alpha helices and beta sheets provide the framework for protein stability, while modular protein domains enable independent functions within larger multidomain complexes. Achieving the native state requires sophisticated cellular machinery, particularly molecular chaperones like Hsp70 and GroEL, which guide nascent polypeptide chains through the folding pathway and prevent catastrophic aggregation. When this process fails, proteins may accumulate as amyloid fibrils, structures implicated in neurodegenerative conditions and other pathological states. Biological activity depends critically on molecular complementarity between proteins and their ligands, allowing enzymes to catalyze reactions by binding substrates in their active sites, stabilizing transition states, and reducing the activation energy barrier. Cellular responsiveness to environmental signals relies on regulatory mechanisms including allosteric switches mediated by calcium-binding proteins such as calmodulin and nucleotide-dependent conformational changes in GTPase superfamily members. Reversible phosphorylation catalyzed by kinases and phosphatases provides rapid, dynamic control over protein activity without requiring new protein synthesis. Damaged or surplus proteins are eliminated through the ubiquitin-proteasome pathway, which maintains cellular proteostasis by targeting proteins for degradation. Understanding protein structure and function requires powerful experimental approaches tailored to different scales of organization. SDS-PAGE separates proteins by molecular weight, while liquid chromatography and mass spectrometry enable high-resolution purification and identification of protein sequences and their post-translational modifications. Atomic-level structural information comes from X-ray crystallography, cryo-electron microscopy, and NMR spectroscopy, techniques that reveal how amino acid arrangements create functional three-dimensional shapes. Collectively, these analytical tools support proteomics, the comprehensive analysis of all proteins within a biological system that provides systems-level insight into cellular organization and physiology.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥

Chapter 3: Protein Structure & Function

Related Chapters