Chapter 15: Regulation of Gene Expression in Bacteria
Welcome to Last Minute Lecture.
This free chapter overview is designed to help students review and understand key concepts.
These summaries supplement not replaced the original textbook and may not be redistributed or resold.
For complete coverage, always consult the official text.
Welcome to the Deep Dive.
We're your shortcut to getting really well informed on some complex stuff.
Today we are plunging right into the world of gene expression regulation in bacteria.
It sounds technical, maybe, but it's fundamental.
It's how life at a very basic level makes decisions.
Exactly.
We're talking about how cells like E.
coli manage to turn genes on and off and the precision is just incredible.
You see these huge swings in protein concentration, right, from maybe five or ten molecules per cell.
To a hundred thousand.
Yeah, and they adapt so fast to whatever's in their environment, making enzymes only when they're actually needed.
It's peak evolutionary efficiency, really.
It is, and that's why bacteria like E.
coli have been, you know, such workhorses for research in this area.
Their life cycles are so fast, right?
Super fast.
Generations in a day, you get billions of cells, easy to grow in a pure culture, and they react directly to environmental changes.
It makes studying genetic control much more straightforward.
And this isn't just about food metabolism, is it?
Oh no, not at all.
This control covers everything.
DNA replication, how they repair DNA damage, recombination, cell division,
even how they fight off viruses, bacteriophages.
Okay, let's maybe unpack some of the basic concepts here first.
Good idea.
We talk about inducible enzymes.
Those are the ones made only when a specific chemical, like a food source, is actually present.
Correct.
Like lactose, for example.
If there's lactose, the cell makes the enzymes to break it down.
If not, it doesn't bother.
On -demand production.
Then there are constitutive enzymes.
Right, those are sort of the housekeeping genes, produced continuously, pretty much regardless of the environment, always on.
And the flip side,
repressible systems.
Yeah, that's where the presence of a molecule actually stops gene expression.
Often it's the end product of a metabolic pathway.
Like if the cell has enough tryptophan, it stops making the enzymes to synthesize more tryptophan.
Exactly.
Why waste the energy?
Makes sense.
And generally, these strategies fall into two main control categories.
There's negative control.
That means the gene is basically on by default.
And expression happens unless a specific regulator molecule comes along and actively shuts it off.
Like a brake pedal.
Kind of, yeah.
Then there's positive control.
Here, transcription only happens if a regulator molecule is present to actively stimulate it.
More like an accelerator pedal.
Exactly.
And often, bacterial systems cleverly combine both negative and positive control for really fine -tuned regulation.
Which brings us, I think, to the classic example everyone learns about, the lac operon in E.
coli.
The absolute classic.
It's the textbook case for inducible negative control.
How bacteria manage lactose metabolism.
So what exactly is an operon?
Right.
An operon is essentially a cluster of genes on the bacterial chromosome that have related functions.
They're grouped together along with their regulatory sequences right there on the DNA.
All controlled as one unit.
Pretty much.
And the regulatory region, things like the promoter, where the transcription machinery binds, and the operator site is usually just upstream at the 5 -fin.
And these DNA sites, the promoter and operator, they're called cis -acting sites.
Correct.
Because they have to be on the same piece of DNA right next to the genes they control.
They work by being binding sites for transacting factors.
Which are?
Usually proteins.
Molecules that can diffuse through the cell, hence trans -alo, and bind to those cis sites to regulate transcription.
Like the repressor protein that we'll talk about.
Okay.
So for the lac operon, what are the actual genes involved, the structural genes?
There are three main ones.
LACC is probably the most famous.
It codes for the enzyme B -galactosidase.
And that's the one that breaks lactose down into glucose and galactose for energy.
Then there's LAC, which codes for permise.
Permise.
Like permission.
It lets lactose in.
Precisely.
It's a protein in the cell membrane that helps transport lactose into the cell.
And the third one, LAC -O.
Yeah, LAC -O codes for transacetylase.
Its exact role is a bit less clear, honestly.
Maybe involved in detoxifying some byproducts of lactose metabolism.
The key thing, though, is how they're transcribed.
Absolutely key.
These three genes, Z, Y, and A, are transcribed together onto a single messenger RNA molecule, a polycystronic mRNA.
Polycystronic?
Many messages on one molecule.
Right.
So the cell makes all three enzymes simultaneously in a coordinated fashion whenever that single mRNA is made.
Super efficient.
And controlling this whole unit, we have the regulatory bits?
Yep.
There's the lac I gene.
It's located nearby, but it's actually a separate gene that produces the repressor protein.
The transacting factor we mentioned.
Exactly.
Then on the operon itself, you have the promoter, P, which is the binding site for RNA polymerase, the enzyme that does the transcribing.
And right next to that.
The operator.
O.
That's the critical DNA sequence where the repressor protein binds.
Okay, so now we can put it together.
The operon model from Jacob and Monod.
How does the negative control work?
It's really elegant.
The lac I gene constantly produces a small amount of this repressor protein, and this repressor is allosteric.
Allosteric, meaning its shape can change.
Exactly.
Its shape, therefore its function, changes when it binds to another molecule.
So in the absence of lactose, the repressor protein is in a shape that allows it to bind very tightly to the operator DNA sequence.
And when it's sitting there on the operator, it physically blocks the RNA polymerase from moving down the DNA and transcribing the lac Z, Y, and A genes.
So no lactose, genes off.
Simple.
Simple, but effective.
Now in the presence of lactose, things change.
A derivative of lactose, actually called allolactose, acts as the inducer.
Ah, the signal.
Right.
This inducer molecule binds to the repressor protein.
And because the repressor is allosteric, this binding causes it to change shape.
And in this new shape.
It can no longer bind to the operator DNA.
It just falls off.
So the operator site is now clear.
Exactly.
The roadblock is gone.
RNA polymerase can now bind to the promoter and transcribe the structural genes.
Producing beta -galactosidase, permice,
transacetylase, all the things needed to use the lactose that's now present.
Precisely.
The presence of the substrate induces the production of the enzymes needed to metabolize it.
That is clever.
And the evidence for this came from looking at what happens when it breaks, right?
The regulatory mutations.
Absolutely.
That was crucial.
For instance, mutations in the lac I gene called lac I mutants, what happened?
They made the enzymes all the time.
Constitutively.
Exactly.
Because the repressor protein was either faulty or just not made.
It couldn't bind the operator, so the genes were always on even without lactose.
And mutations in the operator itself lack subsisa.
Same result.
Constitutive expression, but for a different reason.
Here, the operator DNA sequence was altered.
So even a normal repressor protein couldn't recognize and bind to it.
Again, genes always on.
Right.
These mutations really help nail down the roles of the repressor protein and
And they could prove this definitively using clever genetic tricks, like partial diploids.
Yeah, marigolds.
They used plasmids, F -factors, to introduce extra copies of genes into E.
coli.
So you could have a cell that was, say, I on the chromosome, but you introduce a plasmid carrying a normal I plus gene.
And what happened?
Inducibility was restored.
The neural repressor protein produced from the plasmid, I plus, could diffuse through the cell, proving it's transacting, and bind to the operator on the chromosome, shutting off expression in the absence of lactose.
But if you did the same thing with the operator,
introduce a normal O plus on a plasmid into an ossepsis subcell.
No effect.
The ossepsis mutation on the chromosome still caused constitutive expression of the genes next to it.
Why?
Because the operator is cis -acting.
It only controls the genes immediately adjacent to it on the same DNA molecule.
That makes perfect sense.
You can't fix a broken binding site on one piece of DNA by providing a good one somewhere else.
Exactly.
And they even found another type, isopsup or super repressor mutants.
Super repressor.
Yeah.
These repressors bound the operator okay, but they couldn't bind the inducer, lactose.
So they were stuck on the operator permanently, keeping the genes off even when lactose was present.
Which neatly showed the repressor has two distinct functional parts.
One for binding DNA, one for binding the inducer.
Precisely.
Two separate domains.
And then the ultimate proof, actually isolating the repressor protein itself, that must have been hard.
Incredibly hard.
We're talking maybe 10 molecules per cell,
but Gilbert and Miller -Hill pulled it off in 66.
Ingenious.
They used a mutant strain, isopscope.
They just happened to make about 10 times more repressor.
Then they used a gratuitous inducer, IPTG.
Gratuitous.
Meaning it binds the repressor like lactose does, causing it to fall off the operator, but the cell can't actually metabolize IPTG, it just acts as a mimic.
And they used equilibrium dialysis.
They had the cell extract in a dialysis bag, bathed in a solution with radioactive IPTG.
If something inside the bag bound IPTG, the radioactivity would be trapped inside.
And it was.
It was.
They labeled the proteins with radioactive sulfur, and showed the IPTG binding stuff was indeed a protein.
Then, crucially, they showed this isolated protein, specifically down to DNA, that contained the normal LACO plus operator sequence, but not to DNA with the osub -C submutation.
Direct physical proof.
Wow.
Okay, so the cell turns on laxines when lactose is around, but E.
coli prefers glucose, right?
What happens if both are available?
Ah, good question.
This leads us to catabolite repression.
Basically, the cell says thanks, but no thanks to lactose if glucose is present.
Glucose is the preferred, easier -to -use sugar.
So even if lactose is there,
inducing the repressor to fall off, the lac operon doesn't get turned on strongly if glucose is also around.
That's right.
Transcription is diminished.
This involves positive control.
Remember we said operons can have both?
Yes.
This positive control involves a protein called CKP, the catabolite -activating protein.
Okay.
For the lac operon to be transcribed really efficiently, maximally, two things need to happen.
One, the repressor needs to be off the operator, so lactose present.
Two,
CKP needs to be bound to a specific site near the promoter, the CAP binding site.
And CKP binding helps RNA polymerase?
It does.
It helps RNA polymerase bind more efficiently to the promoter and start transcription.
It's like an extra boost switch.
So what controls CKP binding?
It must be linked to glucose levels?
It is, indirectly.
CAP itself needs to bind to another molecule to become active.
That molecule is cyclic AMP, or CAMP.
CAMP.
I've heard of that.
It's a signaling molecule.
A very important one.
Only the CAMP -CapCamplex can bind to the CK site on the DNA and stimulate transcription.
Okay, so where does glucose fit in?
Glucose levels regulate CMP levels.
High glucose inhibits the enzyme adenyl cyclase, which is the enzyme that makes the AMP.
Ah, so high glucose, low -call AMP.
Exactly.
And low -CAMP means very little CAMP -CapComplex forms.
So CMP doesn't bind to the lac operon promoter region effectively.
And transcription stays low, even if lactose is present and the repressor is gone.
Correct.
Conversely, when glucose is absent or low, adenyl cyclase is active.
CMP levels rise.
Lots of CAMP -CapComplex.
Yep.
This complex binds to the CMP site, strongly promotes RNA polymerase binding.
And if lactose is also present, keeping the repressor away, you get high levels of lac operon transcription.
Wow.
That's a really sophisticated two -factor authentication system.
You need lactose present, removing the negative block, and glucose absent, providing the positive boost.
It ensures the cell always goes for the most efficient energy source first, a beautiful combination of negative and positive regulation.
Okay.
Let's switch gears from turning things on, like the lac operon, to turning things off, the repressible TREP operon.
This is for making tryptophan, right?
And amino acid.
That's right.
Tryptophan is essential, but making it costs energy.
So if E.
coli finds tryptophan readily available in its environment, it stops making its own, saves resources.
Exactly.
This is a repressible system.
Like the lac operon, it involves a repressor protein and an operator site.
But it works slightly differently.
How so?
The TREP repressor, encoded by the TPR gene, is synthesized in an inactive form.
By itself, it cannot bind to the TREP operator.
Okay.
So the default is on here too, initially.
But tryptophan itself acts as a core pressor.
Core pressor.
When tryptophan levels in the cell are high, tryptophan molecules bind to the inactive repressor protein.
This binding causes an allosteric change.
Ah, that shape change again.
Which activates the repressor.
Now, this active repressor tryptophan complex can bind tightly to the TREP operator sequence, TRPO.
And just like with lac, binding to the operator blocks RNA polymerase.
Precisely.
Blocking transcription of the structural genes, TRPEDCBA,
which are the genes needed to synthesize tryptophan.
So high tryptophan leads to the being shut off.
Still negative control because the active complex inhibits transcription.
Correct.
It's just that the repressor needs the core pressor tryptophan to become active, whereas the ACK repressor is active until the inducer, allolactose, inactivates it.
Different logic, same principle of negative control via an operator site.
And the TREP structural genes, like lac, are right there next to their promoter, TRPP, and operator TRPO.
Five contiguous genes all involved in the tryptophan synthesis pathway.
But regulation isn't just about proteins binding DNA, is it?
RNA itself gets involved.
Oh, absolutely.
RNA plays some really crucial regulatory roles.
One amazing example, especially in the TREP operon, is attenuation.
Attenuation, like weakening something.
Kind of.
It's a second layer of control for the TREP operon, a fine -tuning mechanism.
Even when the repressor isn't fully bound, or maybe let's go occasionally,
transcription can still be stopped prematurely.
How does that work?
There's a region at the beginning of the TREP mRNA, before the structural genes, called the leader sequence.
Transcription starts, and it makes this leader RNA about 140 nucleotides long.
Okay.
This leader sequence RNA can fold into different shapes, specifically stem -loop structures or hairpins, and which shape it forms depends on the availability of tryptophan during translation of a short peptide within that leader sequence.
Whoa, okay.
Translation affects transcription.
In bacteria, they happen simultaneously.
The ribosome hops onto the mRNA almost as soon as it's being transcribed.
So, within this leader peptide sequence, there are two tryptophan codons right next to each other.
UG, UGG.
Okay, codons for tryptophan.
Now, if tryptophan is scarce in the cell, the ribosome translating this leader peptide will stall when it hits those UGG codons, because there isn't enough charged tRNA sub -tripsub available.
It pauses, waiting for the right amino acid.
Exactly.
And this stalling physically prevents a certain hairpin structure, the terminator, from forming the RNA behind it.
Instead, a different hairpin forms the anti -terminator.
Anti -terminator.
So, let's transcription continue.
Yeah.
Precisely.
RNA polymerase just keeps going, transcribing the TREPEDCBA genes downstream.
Clever.
But if tryptophan is abundant...
Then charged tRNA sub -tripsub is plentiful.
The ribosome doesn't stall at the UGG codons.
It reads right through them smoothly.
Okay.
And because it moves quickly, it allows a different hairpin structure to form in the RNA just ahead of it, the terminator hairpin, also called the attenuator.
And this structure signals the RNA polymerase.
To stop transcription.
It terminates prematurely right after the leader sequence before it even gets to the structural genes.
So, abundant tryptophan leads to termination.
Scarse tryptophan allows read -through.
That's an incredibly sensitive feedback loop based on translation speed.
It's remarkably elegant.
And attenuation is used in other amino acid biosynthesis operons, too.
And RNA's role doesn't stop there.
You mentioned riboswitches.
Right.
Riboswitches are another fascinating form of direct RNA regulation.
These are specific structures, usually within the 5 -untranslated region, 5 -UTR, of an mRNA molecule.
So part of the message itself acts as a sensor.
Exactly.
They have a region called the aptamer that can directly bind to a small molecule ligand, often a metabolite, related to the protein encoded by the mRNA.
Like the product of the pathway, perhaps.
Could be, or an intermediate.
When the ligand binds to the aptamer, it causes a conformational change, a shape shift, in the RNA structure.
L -asteric RNA?
Sort of.
This change in the aptamer triggers a change in another part of the riboswitch, called the expression platform.
Very often, this change causes the expression platform to fold into a transcription terminator structure.
Just like in attenuation.
Similar outcome, yeah.
The RNA polymerase gets knocked off, shutting down expression of the gene.
So the metabolite itself directly controls the expression of the genes involved in its own metabolism, just by binding to the mRNA.
No protein intermediate needed.
That's direct feedback.
And these are common.
Surprisingly common.
Found in maybe 5 % of genes in bacteria like bacillus subtilis, and also in archaea, fungi, even plants.
A very ancient form of regulation, it seems.
And then there are the small non -coding RNAs, or sRNAs.
These are distinct RNA molecules, not mRNA, not tRNA, not rRNA.
Usually 50 to 500 nucleotides long.
And they regulate other genes.
They do, often by base pairing with target mRNA molecules.
They can be transcribed from locations that overlap their target gene, maybe even on the opposite strand, or from totally different places in the genome.
How do they regulate?
Couple of main ways.
For negative regulation, an sRNA might bind to an mRNA near the star codon, physically blocking the ribosome from binding to the ribosome binding site, RBS.
No ribosome binding, no translation.
So it shuts down protein production from that specific mRNA.
Yep.
But they can also do positive regulation.
Sometimes an mRNA might naturally fold up in a way that hides its own ribosome binding site.
An sRNA can bind to that mRNA,
change its structure, and unmask the RBS.
Allowing the ribosome to bind and translation to occur.
Exactly.
So sRNAs can act as either inhibitors or activators.
Any examples?
Sure.
In E.
coli, dsRa is an sRNA that helps activate gene expression when the temperature drops suddenly.
RiHb is another one that, when iron levels are low, inhibits the translation of mRNAs for non -essential enzymes that use iron, conserving it for critical functions.
Which ties back neatly to clinical relevance like MRSA.
Absolutely.
Think about the Mecca gene in MRSA.
Yeah.
Conferring resistance to beta -lactam antibiotics like methicillin.
Its expression is tightly regulated.
Induced by the antibiotic.
Yes.
When the antibiotic is present, the gene is turned on, producing the resistance protein.
When the antibiotic is absent, it's repressed.
It's very much like the lack -operon logic on -demand resistance.
Understanding this regulation is crucial for tackling antibiotic resistance.
It really highlights how these fundamental mechanisms have huge real -world impact.
To finish off, let's talk about something truly amazing bacteria having their own adaptive immune system.
CRISPR -Cas.
It's mind -blowing, really.
We usually think of adaptive immunity the kind that learns and remembers specific threats as something complex animals have.
Like our immune system with antibodies and memory cells.
Right.
Bacteria have innate defenses.
Sure, things that just generally block viruses,
phages.
But CRISPR -Cas is different.
It allows bacteria to specifically recognize and fight off phages they've encountered before.
It's a molecular memory system.
How is this even discovered?
Those repeats.
It started back in 87.
Researchers noticed these weird repeating DNA sequences in E.
coli.
Clustered regularly, interspaced, short, palindromic repeats, CRISPR.
Between the identical repeats were unique sequences called spacers.
And nobody knew what the spacers were for.
Not for a long time.
Then, around 2005,
computational analysis revealed something stunning.
Many of these spacer sequences perfectly match chunks of DNA from bacteriophages, the viruses that infect bacteria.
Like snapshots of past invaders stored in the bacterial genome.
Exactly.
A genetic most wanted list.
And the proof this was actually an immune system.
Brilliant experiments, actually.
Some came from Denisco, the food ingredients company working with Streptococcus thermophilus for yogurt cultures.
Phage infections are a big problem in industrial fermentation.
Makes sense.
They exposed sensitive S -thermophilus cultures to a specific phage.
Most died, but some survived.
And the survivors were now resistant specifically to that phage, but not to others.
They adapted.
They adapted.
And when they look at the CRISPR locus in these resistant bacteria, they have new spacers have been added.
Spacers whose sequences perfectly matched the phage they had just survived exposure to.
Wow.
Direct evidence.
Even better, they showed that if you artificially deleted those new spacers, the bacteria lost their resistance.
And if you engineered specific viral -derived spacers into sensitive bacteria, you can make them resistant.
Game -changing work.
Okay.
So how does the mechanism work?
Three steps, right?
Yep.
Three main stages.
First is spacer acquisition.
When a new phage injects its DNA, specialized Cas proteins, particularly Cas1 and Cas2, recognize it as foreign, cut out a small fragment and insert it into the CRISPR array, usually at one end, as a new spacer.
They also duplicate one of the repeats so the pattern stays intact.
Repeat spacer, repeat spacer.
So it learns adding the new threat to its memory banks.
Precisely.
Step two is CRRNA biogenesis.
The entire CRISPR array repeats and all the spacers gets transcribed into one long RNA molecule.
Okay.
This long pre -KRNA is then processed, chopped up by other Cas proteins into small, mature CRISPR RNAs or secret RNAs.
Each CRNA typically contains a single spacer sequence, plus some bits of the repeat.
So now you have individual guide molecules corresponding to past infections.
Exactly.
Okay.
Which leads to step three.
Yeah.
Target interference.
These mature CRNAs team up with other Cas proteins, which are nucleases, enzymes that cut DNA.
The most famous is Cas9 from Streptococcus pyogenes.
It's a gene editing tool.
The very same.
Right.
CRNA acts as a guide, using its spacer sequence to find a matching DNA sequence in any invading phage DNA.
It scans the incoming viral DNA.
Finds the match and the Cas nucleus associated with it makes a precise cut in the viral DNA, usually a double strand break.
Which effectively destroys the phage genome.
Infection neutralized.
Neutralized.
Based on the memory stored in the CRISPR array.
It's incredibly specific and effective.
Incorporate viral DNA, make guide RNAs, guide a nucleus back to destroy that same viral DNA.
Simple, elegant, powerful.
And while the basic steps are the same, there's variation in the details, the specific Cas proteins involved across different bacteria.
Lots of variation, yes.
It's a diverse and evolving system.
But that core logic is conserved.
And of course, harnessing this system, particularly Cas9, has revolutionized genetic engineering.
A tool taken directly from bacterial warfare.
It's one of the most impactful biological discoveries of recent decades, no question.
It really is stunning thinking about the sheer level of control these supposedly simple organisms exert over their genes.
From deciding whether to eat lactose or glucose.
To carefully regulating amino acid production.
To deploying this incredibly sophisticated memory -based immune system against viruses.
It's just remarkable biological programming.
It really underscores how studying these fundamental processes in bacteria teaches us so much about the core principles of life.
About efficiency, adaptation, defense.
And the tools we derive from them, like CRISPR, are transforming medicine and biotechnology.
Absolutely.
It makes you wonder, doesn't it?
If bacteria evolved these elegant solutions for regulation and memory, what inspiration can we draw from them for designing other complex systems?
Maybe even artificial ones.
Something to think about.
Well, thank you for diving deep into the intricate world of bacterial gene regulation with us today on the Deep Dive.
ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.
Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.
Support LML ♥Related Chapters
- Gene Regulation in BacteriaGenetics: Analysis and Principles
- Gene Expression Control in ProkaryotesBiochemistry
- Gene Regulation in Bacteria and BacteriophagesiGenetics: A Molecular Approach
- Control of Gene Expression in BacteriaGenetics: A Conceptual Approach
- Regulation of Bacterial Processes & Gene ControlPrescott's Microbiology
- Regulation of Gene ExpressionMarks' Basic Medical Biochemistry: A Clinical Approach