Chapter 19: Gene Expression II: Protein Synthesis & Sorting
Welcome to Last Minute Lecture.
This free chapter overview is designed to help students review and understand key concepts.
These summaries supplement not replaced the original textbook and may not be redistributed or resold.
For complete coverage, always consult the official text.
Welcome back to The Deep Dive.
Our mission here is to take the deepest, densest source
and today that's the grand conclusion of gene expression from our cell biology text and really turn it into a targeted, thorough, and hopefully highly engaging conversation.
I think we can manage that.
We've covered the blueprint, which is the DNA, and the working copy, the RNA.
Okay.
So today we're finally hitting the manufacturing floor.
We are.
We're focused on the ultimate purpose of that genetic code, which is translation.
Right.
This is that irreversible shift from let's call it the informational language of nucleic acids to the functional language of amino acids.
So this is where the action happened.
This is where it all happens.
And our central theme today, I think the biggest conceptual hurdle for a lot of people, is understanding the precise mechanism by which a cell not only translates this linear message into a complex three -dimensional machine.
But also directs traffic.
But also performs the necessary traffic control.
Exactly.
Making sure that every single one of those thousands of proteins is folded correctly, modified properly, and delivered to its exact address.
Whether that's the cytoplasm, the nucleus, or being shipped out of the cell entirely.
Precisely.
So our journey today is going to follow the source material pretty sequentially.
We'll start with the components, move through the three stages of translation, look at quality control and modification, and then finish up with the really complex pathways of protein targeting and sorting.
And it's so important to do it that way.
Why do you say that?
Why dedicate a whole deep dive to this?
Well, because understanding these sequential steps that cause and effect logic of the cell's main engine is just essential to grasping structure, function, and life itself.
I mean, this is the moment the information becomes reality.
Okay, let's start with the central factory then.
The complex machine that orchestrates this entire process.
The ribosome.
The ribosome.
You know, it's really less of a simple workbench.
It's not a passive table.
No, not at all.
It's more of a highly specialized, almost robotic assembly line.
It's this massive particle structurally composed of ribosomal RNA, the rRNA, and a whole collection of structural proteins.
And we find them all over the cell, don't we?
We do.
They're either floating free in the cytosol, or they're bound to the endoplasmic reticulum membrane and the nuclear envelope in eukaryotes.
And critically, because of their evolutionary history, you'll also find specialized ribosomes inside mitochondria and chloroplasts.
And their structure.
That's one of the clearest dividing lines between the major domains of life, right?
When we look at the size comparison, the differences between prokaryotic and eukaryotic ribosomes are, well, they're not just apparent.
They're functionally crucial.
Oh, absolutely.
The source material highlights this using something called the Svedberg unit, or just S.
In prokaryotes of bacteria and archaea, the intact ribosome is a 70S particle.
70S.
Got it.
And that machine is built from two subunits.
Oh.
A smaller 30S subunit and a larger 50S subunit.
Okay.
And in eukaryotes?
In eukaryotic cells, everything is just a bit bigger.
The intact structure is 80S, and it's assembled from a 40S small subunit and a 60S large subunit.
And these eukaryotic parts, they contain significantly more proteins and much larger rRNA molecules than their bacterial counterparts.
I have to pause on the S values, because this is always a point of confusion for students.
You have a 30S and a 50S subunit, but they don't add up to 80S.
They make 70S.
Right.
And 40S plus 60S somehow only equals 80S.
What's going on there?
That's a really necessary clarification.
The S stands for the Svedberg unit, and the key thing to remember is that it is not a unit of mass.
Okay.
So it's not weight.
It's not weight.
It's a measure of the sedimentation velocity of the particle when you spin it in a centrifuge.
And that velocity depends on mass, yes, but also density, and critically, the shape of the particle.
Ah, the shape.
The shape.
When the two subunits click together, the resulting 70S or 80S complex has a much different, more compact shape than the individual parts.
So it's sediments at a different rate, one that isn't just the sum of its parts.
It's a measure of how efficiently the machine moves through a liquid, not just how heavy it is.
So the difference isn't a failure of math.
It's a consequence of changing hydrodynamics.
Okay.
Now, functionally, the biggest surprise in recent decades has been about catalysis.
We used to just assume the dozens of ribosomal proteins were doing all the enzymatic work.
That was the dogma for decades, right?
Proteins are the enzymes of life.
But the major revelation was proving that it's actually the ribosomal RNA itself that performs many of the ribosome's most critical functions.
Which is huge.
It's a massive deal.
It elevated RNA to the status of an enzyme, a ribozyme.
Specifically, the catalytic activity, the part that forms the actual peptide bond, is called peptidyl transferase.
And that's in the large subunit.
It resides within the large subunit.
In bacteria, this activity is localized to the 23S RNA molecule.
I mean, this discovery fundamentally changed our understanding of the origins of life and how enzyme function even evolved.
The machine is structured to facilitate this complex chemical reaction,
designed like an assembly line.
Can you walk us through the four critical binding sites, the ones located right where the two subunits meet?
Absolutely.
So think of the small subunit as the DCOGI platform and the large subunit as the catalytic engine.
Okay.
And right at their interface, we have four sequential sites.
First, there's the mRNA binding site.
Its job is just to hold and orient the messenger RNA correctly so the start codon is positioned perfectly for the first tRNA.
And then you have the tRNA pockets.
Then you have three tRNA binding pockets.
First is the A site, or aminoatel site.
This is the receiving bay.
It binds the newly arriving transfer RNA, which is already charged with its specific amino acid.
Next to that.
Next is the P site, or peptidyl site.
This is the working station.
It holds the tRNA that is carrying the growing polypeptide chain.
And the last one.
And finally, the E site, or exit site.
This is the disposal chute.
It's where the now empty discharged tRNAs are jettisoned from the ribosome, ready to be recycled.
This ordered progression A to P to E is the core mechanism of elongation.
So if the ribosome is the factory, the transfer RNAs, or tRNAs, they must be the critical adapter molecules.
They're the linguistic bridge.
And this whole concept actually started as a theoretical prediction from Francis Crick.
A remarkable one, too.
Crick recognized this inherent chemical incompatibility.
He knew that an amino acid side chain couldn't chemically read a nucleotide base sequence.
They don't speak the same language.
They don't.
So he proposed the adapter hypothesis.
He reasoned there must be some small molecule, an adapter, that could read the nucleotide code on one end and carry the corresponding amino acid on the other.
This was pure theoretical genius.
And it didn't stay theoretical for long.
It was confirmed by Mahlon Hoagland, who showed experimentally that radioactive amino acids first bind covalently to these small soluble RNA molecules.
The tRNAs before protein synthesis even kicks off.
And that experiment established the dual specificity that defines the tRNA.
Each tRNA molecule has to meet two criteria.
First, it must bind one specific amino acid, say alanine.
And second.
Second, it must recognize the one or more specific mRNA codons that call for that amino acid.
Once that amino acid is attached via a high energy ester bond to the tRNA's three prime end, we call it a charged tRNA.
Like a lanol tRNA alum.
Exactly.
And the energy stored in that high energy bond is the chemical fuel that will be used later to drive the formation of the peptide bond.
The part that does the recognizing is the anticodon.
How do we properly orient that sequence to understand its function?
Understanding the orientation is absolutely key to understanding the reading frame.
So we conventionally read the mRNA codon in the five prime to three prime direction.
Right.
The anticodon on the tRNA is the three nucleotide sequence that binds to the codon.
And crucially, we write the anticodon sequence in the three prime to five prime orientation.
So they're anti -parallel.
Exactly.
So for the alanine codon five prime and GCC three prime, the tRNA anticodon is three prime CGG five prime.
This precise anti -parallel base pairing is what guarantees the accurate reading of the genetic code.
It aligns the correct amino acid delivery with the specific codon call.
Now, if the base pairing is so specific and we have 61 codons that specify amino acids, logic would tell you we need 61 unique tRNAs.
Yeah.
But the cell is, wow, it's much more frugal than that.
Why is that?
And this brings us to Crick's second great contribution here, the wobble hypothesis.
Wobble.
This principle introduces a bit of flexibility, or wobble, specifically in the pairing between the third base of the mRNA codon, that's the three prime end, and the corresponding first base of the tRNA anticodon, the five prime end.
The kind of controlled sloppiness.
That's a great way to put it.
And it significantly reduces the total inventory of tRNAs the cell has to produce.
So certain non -standard pairings are allowed, but only at that third position.
Precisely.
For example, the base guanine usually pairs with cytosine, but in the wobble position, guanine can also form a stable pair with umicil.
And there's an even more versatile player.
There is.
The most versatile wobble base is inosine, or I, which is a modified base often found in that wobble position of tRNAs.
Inosine can pair effectively with uracil, cytosine, or adenine.
Wow.
So that dramatically simplifies the code and provides efficiency without really compromising the quality.
It does.
I mean, think about the amino acid isoleucine.
It has three codons, AUU, AUC, and AUA.
Right.
If a tRNA has the anticodon three prime, UAI five prime, that inosine at the five prime position can recognize U, C, or A at the three prime position of the codon.
So one tRNA reads all three.
One tRNA is efficient to read all three isoleucine codons.
And the critical insight from the source material here is that this flexibility does not introduce error.
All codons read by a single tRNA always code for the same amino acid.
It's a mechanism for molecular resource conservation, not for ambiguity.
The entire integrity of this translation process then rests on one single step that's performed before the ribosome even gets involved, linking the correct amino acid to its specific tRNA.
It has to be.
Because if an error is made there, the ribosome can't fix it.
It only recognizes the anticodon, not the cargo it's carrying.
This is perhaps the single most critical checkpoint in all of gene expression.
And this task is monopolized by a group of enzymes called the aminoacyl tRNA synthetases.
And there's one for each amino acid?
Generally, yes.
There are 20 distinct synthetases, one for each amino acid.
They catalyze that crucial reaction where the amino acid is covalently linked to the three prime OH of the adenine nucleotide at the end of its appropriate tRNA.
And this linkage, it's not free.
It requires a significant investment of energy.
It absolutely does.
The process is called amino acid activation and it requires the hydrolysis of ATP all the way down to AMP and pyrophosphate.
Two high energy bonds.
Two high energy bonds.
And this activation step creates that high energy ester bond we mentioned earlier.
That bond is paramount because its stored energy is what the ribosome will ultimately use without any further energy input to form the subsequent peptide bond.
So these synthetases have a dual specificity.
They have to recognize the correct amino acid and the correct tRNA.
How do they ensure this match with such high fidelity?
The recognition mechanism is incredibly complex and rigorous.
Synthetases don't just check the anticodon.
They recognize multiple features of the tRNA structure, including bases on the acceptor stem near that three prime end.
And they can proofread.
They do.
They function as sophisticated quality control mechanisms.
Many synthetases have a proofreading domain.
If the wrong smaller amino acid is accidentally attached, the enzyme can actually hydrolyze the bond and release the incorrect amino acid, which contributes significantly to that impressively low error rate of protein synthesis.
And the classic experimental proof for this, the Chaikville -Libman experiment, that really drove home the absolute supremacy of the tRNA as the adaptor molecule.
Oh, that experiment is foundational.
What they did was they took a charged cysteine tRNA -cis complex and then chemically modified the cysteine, convoding it into alanine.
But the alanine was still attached to the tRNA for cysteine.
That was the key.
The alanine remained attached to the tRNA specific for cysteine.
So when this modified complex was supplied to the ribosome, the ribosome inserted alanine into positions that were dictated by cysteine codons, proving the ribosomes blind to the cargo.
It proved definitively that the ribosome only reads the tRNA's anticodon.
Therefore, that initial check performed by the aminoacyl tRNA synthetase is the single most important fidelity checkpoint in the entire pathway.
OK, with the components assembled and charged, we can move into the dynamic process of translating the blueprint.
Let's start with the structure of the message itself, the mRNA.
Right, the template.
And its structure varies between life forms.
In prokaryotes, it's very common to find what's called polycystronic mRNA.
Meaning multiple genes on one message.
Exactly.
A single mRNA molecule can encode multiple different polypeptides, often organized sequentially in an operon.
This allows multiple genes involved in a single pathway to be regulated and translated together.
And eukaryotic mRNA is different.
Eukaryotic mRNA is generally monocystronic, encoding just one polypeptide.
And the eukaryotic message has those unique hallmarks that are necessary for its function in the nucleus and its export.
Exactly.
Eukaryotic mRNA features the 5' cap, a result of processing, and the 3' polyA tail.
And crucially for translation, prokaryotic mRNA contains a distinct upstream sequence called the Shine -Dalgarnov sequence, which eukaryotic mRNA lacks.
And that one difference necessitates fundamentally different ways of starting translation.
Completely different initiation strategies.
But regardless of the cell type, the direction of reading is universal.
Yes, that is constant.
Translation occurs uniformly in the 5' to 3' direction on the mRNA template.
This was confirmed really elegantly using artificial RNA templates.
How did they do that?
Well, by synthesizing a message like 5' RNA and so on, AAC3', scientists knew that the resulting polypeptide would be a string of lysine residues, the AAA codon ending with an asparagine from the AAC codon.
And since the asparagine was found at the C -terminus.
Which is always the last part of the protein to be synthesized.
It proved that the reading must proceed from 5' to 3'.
So, getting the ribosome complex to lock on to the correct AUG start codon is a moment of high choreography.
Let's tackle bacterial initiation first.
How does the small subunit find its starting position?
Bacterial initiation is driven by three initiation factors, or IFs.
And it happens in a very rapid sequence.
Step one.
The 30S small subunit binds IF1, IF2, which is carrying GCP, and IF3.
And IF3's role is important here.
It's crucial.
It prevents the large 50S subunit from binding prematurely.
Okay, well step two.
Step two.
The mRNA and the initiator tRNA join the complex.
This is where the Shine -Delgarno sequence, often 5' AGA3', becomes critical.
It's located just upstream of the AUG start codon.
And it base pairs with the ribosome itself.
It base pairs specifically with a complementary sequence on the 3' end of the 16S rRNA, which is part of the small subunit.
And this specific interaction is what precisely positions the AUG start codon into the P -side ribosome.
And the initiator tRNA is unique in bacteria.
It is.
It carries n -formylmethionine, or FMET.
The IF2 factor has the specific job of distinguishing this initiator FMET tRNA from all other tRNAs, ensuring that it, and not a regular methionine tRNA, binds only to the start AUG.
Which guarantees the reading frame is set correctly from the very first base.
Exactly.
Then step three.
IF3 is released.
The 50S large subunit is recruited.
The GTP that was bound to IF2 is hydrolyzed, and IF1 and IF2 are released.
The result is the complete 70S initiation complex, with that FMET tRNA positioned perfectly in the P -site, ready for the first aminoacyl tRNA to enter the A -site.
The mechanism is so precise that disrupting just one small part of that complex can be devastating.
I'm thinking the effect of colicin E3.
Oh, colicin E3 is a perfect illustration of this structure function dependence.
It's a bacterial toxin that cleaves a specific site on the 16S rRNA.
The exact site that base pairs with the Shine -Delgarno sequence.
Precisely.
By destroying that binding site, colicin E3 makes it impossible for the 30S subunit to properly align the mRNA.
It halts initiation cold and effectively kills the bacterium.
Now, eukaryotic initiation is far more complex.
It relies on the 5' cap and the 3' polyA tail, instead of a Shine -Delgarno sequence.
Let's explore the roles of all those EIFs.
It is much more complex.
Eukaryotic initiation, which uses regular MET instead of FMET, involves at least 12 distinct eukaryotic initiation factors, or EIFs.
The goal is to first assemble the machinery and then locate the start codon.
So what's the first step?
Step one is the formation of the 43S pre -initiation complex.
EIF2 -GTP binds the initiator MET tRNA.
This complex, along with a few others like EIF1, moniker 3, and medical 5, binds to the 40S small subunit.
Okay, then step two is grabbing the mRNA.
Step two is mRNA recruitment and circularization.
The 5' cap is recognized by EIF4E.
This protein then recruits EIF4G, which is a big scaffolding protein.
And EIF4G is the bridge.
It's the critical bridge, because it simultaneously interacts with the polyA binding protein, or PABP, which is attached way down at the 3' polyA tail.
Here's where it gets really interesting.
This connection essentially circularizes the mRNA.
Why is that circularization so critical for initiation?
It provides robust stability and it dramatically enhances efficiency.
The cell is basically ensuring the integrity of both ends of the message before it commits all that energy to translation.
It also helps with the reinitiation, right?
Massively.
Circularization makes it much easier for the ribosome subunits that have just finished one round of translation to quickly hop back on and reinitiate on the same mRNA molecule.
It's a huge accelerator for protein production.
Then comes the searching phase, which often involves the COZAC sequence.
That's step three.
Scanning.
The small subunit complex scans along the mRNA, typically looking for the very first AUG triplated encounters.
To increase fidelity, this AUG is often embedded within a consensus sequence called the COZAC sequence.
So once the met tRNA base pairs with that AUG start codon, a bunch of factors are released and EIF5B -GTP facilitates the critical final step.
The joining of the 6DS large subunit.
The final GCP hydrolysis releases all the remaining factors, leaving you with a complete actively translating ADS ribosome complex.
That system sounds like it leaves very little room for error.
But there are exceptions to this cap -dependent scanning rule.
Yes.
Certain viruses, and even some of our own cellular genes under stress, can utilize what are called internal ribosome entry sequences, or IERS.
What do they do?
These IERS sequences allow the ribosome to completely bypass the need for cap recognition and that lengthy scanning process.
It can bind directly to an internal site near the start codon.
And this mechanism is critical because it allows the cell to translate specific, often survival -related messages even when general cap -dependent translation has been shut down.
Right.
Once initiation is complete, the ribosome is primed to start synthesis.
The elongation phase is this rapid three -step cycle that adds amino acids with incredible precision.
And this cycle begins with the A site ready to receive the second charge tRNA, guided by specialized elongation factors, or EFs.
So let's talk about cycle one.
Cycle one.
Aminoacyl tRNA binding to the A site.
The correct incoming aminoacyl tRNA is escorted by the elongation factor EF2N bacteria, which is bound to GTP.
This whole complex enters the A site.
And it checks for a match.
It does.
If the anticodon correctly base pairs with the codon, a conformational change happens, GTP is hydrolyzed, and the EF2 -GDP complex is rapidly released.
If the pairing is incorrect, the complex usually just dissociates before GTP hydrolysis can even occur.
So that's a significant proofreading step.
It is.
And then another factor, EFTs, is required to regenerate the active EF2 -GTP for the next cycle.
This factor dependency ensures the process is both rapid and highly accurate.
So once the new tRNA is stabilized in the A site, we get to the actual work of building the protein.
That's cycle two.
Peptide bond formation.
This is catalyzed by the peptidyl transferase activity of the large subunit, the ribozyme.
The amino group of the amino acid in the A site attacks that high energy ester bond connecting the polypeptide chain to the tRNA in the P site.
And that transfers the whole chain over.
It transfers the growing chain from the P site tRNA to the A site amino acid, forming a peptide bond.
So now the entire polypeptide chain, one amino acid longer, is attached to the tRNA sitting in the A site.
The tRNA in the P site is now empty.
And cycle three requires moving the entire assembly forward by exactly one codon.
Three nucleotides.
That's cycle three.
Translocation.
This step is driven by a second elongation factor, EFG, also known as translocase, which is also bound to GTP.
And it ratchets the ribozyme forward.
It does.
EFG associates with the ribozyme.
And upon GTP hydrolysis, it induces this massive conformational change that effectively ratchets the ribozyme three nucleotides down the mRNA.
This movement physically shifts the contents of the sites.
The peptidyl tRNA moves from A to P.
And the empty tRNA moves from P to E, where it's promptly released.
And the ribosome is reset.
The A site is vacant.
The next codon is exposed, ready for a new tRNA.
Ready to start the cycle all over again.
The efficiency of this is just stunning.
Multiple ribosomes can simultaneously translate a single mRNA, forming a structure called a polyribosome, which just exponentially increases the speed of production.
And we really have to emphasize the energetic cost of this high -speed, high -fidelity production.
Elongation is incredibly expensive.
We calculate that at least four high -energy phosphon hydride bonds are consumed for every single amino acid added.
Four?
Where do they come from?
Two from ATP during the initial charging of the tRNA by the synthetase.
And two from GTP during elongation.
One for EF2 binding to the A site, and one for EFG driving translocation.
This enormous energy sink highlights just how fundamental translation is, and as we'll see, its inherent vulnerability.
Okay.
The machine runs until it encounters one of the three nonsense codons, the stop signs.
Since there are no tRNAs that recognize UAG, UAA, or UGA, how is the ribosome prompted to stop and dismantle the complex?
The termination signal is recognized not by an RNA molecule, but by protein release factors.
So when one of the three stop codons arrives in the A site, a release factor along with TTP binds there instead of a tRNA.
And what action do these release factors take that differs from the elongation factors?
They activate the peptidyl transferase ribosome, but they force it to use a water molecule as the recipient instead of an incoming amino acid.
Ah, so it's hydrolysis.
It triggers a hydrolytic cleavage reaction that transfers the completed polypeptide chain to that water molecule.
This action immediately releases the polypeptide, creating its free C -terminus.
Then subsequent GTP hydrolysis releases the release factor, and the entire complex, the mRNA, the ribosomal subunits, the empty tRNAs, it all just dissociates ready to be recycled.
So now we have a long linear chain of amino acids.
It's chemically perfect, but it's structurally useless until it assumes its specific functional three -dimensional shape.
And this folding process is incredibly high stakes.
It is.
The genetic sequence contains all the information needed for folding.
But in the crowded, often hostile environment inside a cell, the protein needs help to prevent inappropriate interactions, specifically aggregation, before it reaches its final state.
And that assistance comes from molecular chaperones.
It does.
The Hsp70 and Hsp60 families of heat shock proteins are the primary assistance here.
And they're called that because they're expressed under stress, like high heat.
Correct.
The Hsp70 chaperones are often the first responders.
They bind transiently to short exposed hydrophobic patches on newly synthesized polypeptides or on misfolded ones.
They use ATP hydrolysis to drive conformational changes, which stabilizes the chain and provides these short windows of opportunity needed for correct folding.
And the Hsp60 chaperonins are,
well, they're something else entirely.
They're like a contained environment for folding.
They are essentially tiny molecular refolding chambers.
In bacteria, the system is GroL -GroA.
GroL forms this large barrel -like cage.
OK.
A misfolded or partially folded protein is fed into that large GroL -L chamber.
A cap subunit, GroES, then attaches.
This docking, driven by ATP hydrolysis, causes a conformational change inside the chamber, temporarily transforming it into a highly favorable hydrophilic shielded environment.
So the protein gets a private room to fold in.
It does.
It gets a moment to fold correctly without any external interference.
When the ATP is hydrolyzed and the cap detaches, the correctly folded protein is released and the chamber is ready for its next client.
Now, folding often isn't the final step.
To become fully active, the protein usually needs some chemical or structural tuning, known as post -translational modifications.
And these modifications are vital for function, for localization, for everything.
The simplest one is cleavage.
Just removing the initiating groups, like the FMET, or the starting methionine and eukaryotes, or cutting off N -terminal signal sequences after they've done their job.
And the processing of insulin is still the most elegant example of this maturation process.
It perfectly illustrates sequential cleavage and modification.
Insulin starts as a single polypeptide called pre -pro insulin.
Okay, pre -pro insulin.
First, the N -terminal pre -sequence, which is a signal sequence, is cleaved off, leaving pro insulin.
While it's still a single chain, pro insulin forms the necessary disulfide bonds that will link its future A and B chains.
Then the final cut.
Then the final cut.
Specific peptidases excise the internal pro -amino acids, releasing the mature functional two -chain insulin protein held together by those essential disulfide bonds.
Without that precise series of cleavages, the protein is completely inactive.
Beyond structural cleavage, proteins can also be chemically tagged with all sorts of different groups.
Indeed.
We see regulatory additions like phosphorylation adding phosphate groups, which acts as a major on -off switch for protein activity.
We also see methylation and acetylation.
And glycosylation.
Glycosylation, the addition of carbohydrate side chains, is critical for secreted proteins and for cell surface identity.
And then proteins can also associate with non -protein prosthetic groups, like the iron -containing heme in hemoglobin.
There is also this remarkable process that's analogous to RNA splicing, but it's happening at the protein level.
That's protein splicing.
It's wild.
Just as introns are removed from pre -mRNA, internal amino acid sequences called intanes are excised from the polypeptide.
The remaining segments, the exions, are spliced together to yield the mature protein.
It's a fascinating mechanism for regulating protein function after it's already been synthesized.
Accuracy is everything in gene expression.
And the cell has molecular systems to cope when the DNA blueprint itself is flawed.
These changes in the nucleotide sequence are mutations and their severity varies wildly.
A simple point mutation can have huge consequences.
A missense mutation changes a single base pair, which results in a different amino acid being incorporated.
Sickle cell anemia is the classic example.
A single base change substitutes valine for glutamic acid.
And then there's the really damaging one.
The nonsense mutation.
That's where a base change creates a stop codon UAG, UAA, or UGA, leading to premature termination and usually a non -functional truncated protein fragment.
And on the flip side, sometimes the mutation is functionally invisible.
That would be the silent mutation.
The change in the DNA sequence still encodes the same amino acid, often thanks to the redundancy of the code and the flexibility of that wobble position.
And the worst for structure.
The worst structural damage usually comes from frame shift mutations.
These are caused by the insertion or deletion, an indel of one or two bases.
This shifts the entire reading frame downstream and generally results in a massive sequence change followed very quickly by a nonsense stop codon.
The cell even has mechanisms to try and counteract the most severe type.
The nonsense mutation.
It does.
It employs suppressor tRNAs.
These are produced by independent mutations in tRNA genes.
A suppressor tRNA has a mutant anticodon that, instead of stopping, actually recognizes a stop codon and inserts an amino acid.
So it reads through the stop sign.
It allows the ribosome to read through the premature termination signal.
This can restore a full length, sometimes partially functional protein.
But these have to be inefficiently regulated.
If they were too good at their job, they would interfere with normal, legitimate termination signals at the true end of the mRNA, which would cause massive cellular problems.
Beyond suppressor tRNAs, the cell has dedicated systems to ensure that flawed mRNAs, which represent wasted energy and potentially toxic protein products, are degraded before too much damage is done.
This is mRNA decay quality control.
In eukaryotes, the primary system is nonsense mediated decay, or NMD.
This targets mRNAs that contain premature stop codons.
How does it know the stop codon is premature?
During pre -mRNA splicing, protein complexes, glonex unjunction complexes, or NJCs, are deposited on the mRNA.
The cell interprets a stop codon as premature if it occurs before the final EJC.
And that signals the mRNA for rapid degradation, preventing the synthesis of truncated proteins.
And what about the opposite problem messages that never stop?
That's handled by non -stop decay in eukaryotes.
If an mRNA lacks a stop codon, the ribosome just reaches the end and stalls indefinitely.
Specific machinery recognizes the stalling, and recruits nucleases to degrade both the stalled complex and the mRNA.
And bacteria have their own system for this?
They do, and it's fascinating.
They use a hybrid molecule called TMRNA, which stands for transfer messenger RNA.
It binds to the stalled ribosome, adds a short sequence of amino acids, a destruction tag to the C -terminus of the polypeptide, and then recruits proteases and nucleases to degrade both the protein and the non -stop mRNA.
This entire system, particularly the structure of the ribosome, is where the evolutionary difference between prokaryotes and eukaryotes becomes a matter of targeted chemical vulnerability.
I mean, antibiotics are all about exploiting the differences between the 70S bacterial ribosome and the 80S eukaryotic ribosome.
It's highly effective chemical warfare.
You're targeting the most energy -intensive and fundamental life process in the pathogen while sparing the host.
And we can categorize antibiotics based on which part of the translation machine they bind to and disrupt.
Okay, let's start with the small subunit inhibitors.
Streptomycin binds to the 30S subunit.
It inhibits the proper alignment and initiation of translation.
But even if synthesis does start, it causes massive misreading of the genetic code, rendering the resulting proteins non -functional and usually toxic.
And tetracycline.
Tetracycline also binds the 30S subunit, but its mechanism is to physically block the acite.
It prevents the charged aminoacyl tRNA from even binding.
Without that delivery, elongation just stops immediately.
Then you have the large subunit inhibitors which target the chemistry and the movement.
Exactly.
Chloramphenicol binds the 50S subunit and blocks the ribozyme activity, the peptidyl transferase, preventing the formation of the peptide bond itself.
Erythromycin also binds the 50S subunit, but it specifically blocks the tunnel through which the growing polypeptide chain exits, physically preventing the ribozyme from translocating.
It effectively locks it in place.
This precision is why they work so well.
But the constant bacterial evolution leads to resistance, creating these so -called super bugs.
How exactly do bacteria counteract these powerful targeted weapons?
Resistance often involves specific molecular countermeasures that either physically alter the drug or the target.
For chloramphenicol resistance, for example, the cat gene produces an enzyme that acetylates the antibiotic, chemically modifying it so it can no longer bind to the 50S subunit.
And for erythromycin?
For erythromycin resistance, some bacteria produce methylase enzymes that chemically modify the 23S rRNA, right at the site where erythromycin binds, blocking the drug's access.
The resistance mechanism against tetracycline is particularly ingenious because it leverages the cell's own machinery.
It does.
This resistance is often mediated by ribosomal protection proteins, or RPPs.
These proteins are structurally very similar to the elongation factors, EF2 and EFG.
So it's molecular mimicry.
It is.
They bind to the ribosome and, using energy from GTP hydrolysis, they physically displace the tetracycline molecule from its binding site in the A -site region.
This allows the charged tRNA to bind correctly, overcoming the block and permitting elongation to proceed.
It just highlights the extraordinary resourcefulness of bacterial evolution in this ongoing arms race.
We've established how the protein is made, folded, and quality controlled.
Now for the logistics, delivering it to its correct cellular address.
I mean, if a secretory protein ends up in the nucleus, the cell fails.
The cell's success depends on molecular traffic control.
It does.
And every single protein synthesis event that's initiated by nuclear genes starts on ribosomes that are free in the cytosol.
The destination of that polypeptide is determined extremely early, usually by the first 15 to 30 amino acids that emerge from the ribosome exit tunnel.
And that single decision splits cellular delivery into two fundamental pathways.
It does.
The first pathway is caught translational import.
This pathway is reserved for proteins destined for the endomembrane system.
The ER, the goal key, lysosomes, secretory vesicles, and the plasma membrane, or for export out of the cell.
And the key feature is timing.
The crucial feature is that the ribosome attaches to the ER membrane while translation is still in progress.
The nascent polypeptide is imported simultaneously.
And the second pathway handles all the internal addresses.
That is post -translational import.
Here, synthesis finishes entirely on ribosomes free in the cytosol.
The completed, or nearly completed, polypeptide is then delivered to its final destination.
The cytosol itself, the nucleus, mitochondria, chloroplasts, or peroxisomes.
The sorting for these organelles happens after synthesis is complete.
The genius behind this sorting mechanism was first articulated by Gunter Blobel in the signal hypothesis for which he won the Nobel Prize.
Blobel's insight was just that proteins contain these intrinsic amino acid sequences, molecular postal codes, that function as signals to direct them to the correct cellular location.
And these signals are recognized by dedicated cellular machinery that ensures accurate transport.
So let's follow the journey of a protein taking that co -translational highway to the ER.
The process begins with the ER signal sequence.
This sequence is typically located at the end terminus of the nascent polypeptide.
It's usually 15 to 30 amino acids long and has a characteristic structure.
A few positively charged residues, a core stretch of 1015 hydrophobic amino acids, and then a polar region with the cleavage site.
This structure is basically non -negotiable for ER targeting.
The signal sequence emerges from the ribosome, but it doesn't immediately bind the ER.
It binds a large complex particle first.
That's the signal recognition particle, or SRP.
It's an impressive complex made of six polypeptides and a small 7S RNA molecule.
The SRP binds with high affinity to that hydrophobic core of the ER signal sequence as soon as it emerges.
And it does something else too.
Critically, this binding temporarily blocks further translation.
And this pause is vital because it ensures the polypeptide cannot fold incorrectly in the cytosol before it even reaches the ER membrane.
So the SRP is both the physical escort and the temporary breaking mechanism.
How does this complex find and bind to the ER membrane?
The SRP ribosome complex is guided to the ER by the SRP receptor, which is a protein embedded in the ER membrane.
GDP binding and hydrolysis are central to this docking process.
Once the SRP binds its receptor, the ribosome complex docks onto the translocon.
And the translocon is the physical channel or pore that the protein will pass through.
The core of the translocon is the SEX61 complex.
GDP hydrolysis, driven by the factors bound to the SRP and its receptor, causes the SRP to be released and, at the same time, opens the SEX61 pore.
And the protein starts threading through.
The signal sequence, now inserted into the pore, allows the nascent polypeptide chain to thread through the channel and pass into the ER lumen as translation resumes.
And a dedicated N -LIME, signal peptidase, recognizes and cleaves the ER signal sequence off the growing chain as it enters the lumen.
This works seamlessly for soluble proteins destined for the lumen or secretion.
But what about integral membrane proteins that have to remain anchored within the ER membrane?
They need a way to stop the translocation process halfway through.
They utilize internal hydrophobic sequences that act as anchors.
The first type is the type -I transmembrane protein.
These proteins start with the normal N -terminal ER signal sequence that initiates import.
However, further down the sequence, they contain a hydrophobic stop transfer sequence.
And that does exactly what it sounds like.
It does.
When the ribosome synthesizes this sequence, it enters the translocon, but the hydrophobic patch stalls the entire translocation process.
This stop transfer sequence then moves laterally sideways out of the translocon channel and embeds permanently into the lipid bilayer.
Which sets the orientation.
It results in the N -terminus of the protein being in the ER lumen, and the C -terminus remaining exposed to the cytosol.
And the second type, type II, doesn't even use an N -terminal signal sequence to start the process.
That's right.
Type II transmembrane proteins lack that cleavable N -terminal signal.
Instead, they possess an internal start transfer sequence, a hydrophobic stretch that simultaneously functions as both the signal recognized by the SRP and the permanent membrane anchor.
The orientation of the sequence dictates the final topology of the protein.
So once a protein is in the ER lumen,
it's subject to intense folding assistance and quality control before it can move on to the Golgi.
Oh, the ER lumen is a hypervigilant sorting and folding environment.
The molecular assistants here are specialized luminal chaperones.
B by P is the most abundant.
It binds to hydrophobic regions of newly arriving proteins, using ATP hydrolysis to help promote proper folding and prevent aggregation.
And there's another key enzyme for structure.
Protein desulfide isomerase, or PDI, which catalyzes the rapid formation, shuffling, and breakage of desulfide bonds until the thermodynamically most stable and correct configuration is achieved.
But sometimes folding just fails.
What are the cellular responses when a protein persistently misfolds?
This triggers the unfolded protein response, or UPR.
The cell senses an overload of misfolded proteins via three main transmembrane sensors.
These sensors are normally kept inactive by BP.
But when BP is pulled away to assist misfolded proteins, the sensors activate.
And they send a signal to the nucleus.
A critical signal that causes a global slowdown of general protein synthesis, while simultaneously dramatically increasing the transcription and production of more folding machinery, more chaperones, more PDI.
And if the protein is just deemed unsalvageable.
It is marked for destruction via ER -associated degradation, or ER.
The misfolded protein is retro -translocated, so it's exported back across the ER membrane from the lumen back into the cytosol.
Once it's in the cytosol, it is tagged with ubiquitin and rapidly destroyed by the proteasome.
We've established that proteins move from the ER to the Golgi.
But some critical players, like BP and PDI, have to stay in the ER to do their jobs.
How does the cell make sure these resident proteins don't accidentally get shipped out?
They carry a molecular return ticket.
Soluble ER resident proteins possess a specific C -terminal amino acid sequence, typically KDEL, bicephaglulu.
If these proteins are accidentally carried forward to the Golgi, a dedicated KDEL receptor in the Golgi recognizes this sequence and packages them into vesicles that bud off and traffic them specifically backward, or retrogradely, right back to the ER lumen.
Finally, let's turn to the second major pathway.
Post -translational import into organelles like mitochondria and chloroplasts.
Since these proteins are finished in the cytosol, they need a dedicated delivery signal.
And that signal is the N -terminal transit sequence.
These are structurally distinct from ER signals.
They're often amphipathic alpha helices, meaning they have positively charged and hydrophobic amino acids on opposite sides of the helix.
And like the ER signal, once the protein is correctly delivered, the transit sequence is cleaved off inside the organelle by transit peptidase.
And since these organelles have two membranes, the proteins have to be kept unfolded in the cytosol, ready to be threaded through two sets of dedicated pores.
Exactly.
Cytosolic HSP70 chaperones bind to the completed polypeptide, maintaining it in an unfolded transport competence state.
The translocase complexes are the Tom -Tim translocase of the outer and inner mitochondrial membrane and the analogous TF -CITIC for chloroplasts.
The transit sequence binds to the outer membrane receptor, and the polypeptide is threaded through both membranes at sites where they transiently touch.
What are the energy requirements for dragging a polypeptide across two membranes
Mitochondrial import is highly energy demanding.
It requires ATP hydrolysis for the chaperone cycling to keep the protein unfolded, and for the matrix HSP70 to literally pull the protein into the matrix.
And it needs something else too.
Crucially, it also requires the electrochemical proton gradient across the inner mitochondrial membrane to facilitate the initial translocation steps through that TIM complex.
Chloroplast import, on the other hand, relies mainly on ATP hydrolysis and doesn't necessarily require a membrane potential.
And for proteins destined for an internal subcompartment like the inner membrane, does the process use multiple signals?
It does.
Targeting to subcompartments is highly complex.
A protein targeting the inner mitochondrial membrane, for instance, often possesses the N -terminal transit sequence plus an internal hydrophobic sorting signal.
After the protein passes through the TOM complex, the internal sorting signal acts as a second specific signal.
It can function like a stop transfer sequence within the inner membrane translocase, halting full entry and promoting the lateral release of the protein into the inner membrane.
So we have completed the journey of gene expression from reading the mRNA code to the final delivery and installation of the functional protein.
We've seen the revolutionary catalytic ability of the ribozyme, the elegance of the tRNA adapters and the Wobble hypothesis,
and the absolute necessity of that aminoacyl tRNA synthetase checkpoint.
Right, the one that prevents catastrophic error.
And we wrapped up by detailing the incredible logistical mastery of protein targeting, how these tiny intrinsic molecular postal codes dictate the fate of a polypeptide, sending it either down the CAU translational pathway to the ER system or the post -translational route to the cytosol, nucleus, and other organelles, each requiring unique machinery.
It is a system built for speed, efficiency, and near -perfect fidelity.
To reinforce the theme of cause and effect and cellular priority, let's just circle back to the energetic cost for a moment.
The fact that the cell dedicates the energy equivalent of at least four high -energy phosphate bonds ATP and GTP to incorporate every single amino acid, it just underscores that protein synthesis is arguably the most energetically demanding task the cell performs.
And that colossal budget makes the translational machinery a prime weak point.
And that weak point, as we detailed with antibiotics like tetracycline and erythromycin, is precisely what we and pathogens exploit.
The ongoing molecular arms race against superbugs proves that tinkering with this high -stakes, high -energy process remains one of the most fundamental challenges in molecular biology.
So for your final thought today?
For your final thought, consider the financial strain on the cell.
We've talked about the immense energy used for synthesis, folding, and delivery.
Now consider the cost of failure.
When a disease or stress triggers the unfolded protein response and activates ERAD, the cell is forced to spend additional energy in the form of massive chaperone production and the energy for retrotranslocation and proteasimal degradation, just to destroy proteins that have already cost so much ATP and GTP to create in the first place.
So it's paying twice.
It is.
The cellular cost of quality control beyond just the initial synthesis is a staggering economic burden.
A truly massive investment in molecular currency.
Thank you for joining us for this deep dive into protein synthesis and traffic control.
We hope you feel thoroughly informed and ready to take on the complexities of cellular biology.
ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.
Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.
Support LML ♥Related Chapters
- Translation: Synthesis of ProteinsMarks' Basic Medical Biochemistry: A Clinical Approach
- Gene Expression: Translation and Protein SynthesisiGenetics: A Molecular Approach
- Protein Synthesis & the Genetic CodeHarper's Illustrated Biochemistry
- Translation and ProteinsEssentials of Genetics
- Protein SynthesisPrinciples of Biochemistry
- Gene Expression: From Gene to ProteinCampbell Biology in Focus