Chapter 18: Genetics of Viruses

0:00 / 0:00
Report an issue

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replaced the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

Welcome.

Today we're diving deep into a topic that's, well, both fascinating and really critical.

The genetics of viruses will explore how these tiny entities operate, reproduce,

and even how they manage to evade our defenses.

We're drawing our insights from genetics, analysis and principles.

The seventh edition by Robert J.

Brooker, a really great source.

It really is comprehensive.

Our mission today is basically to unpack the core concepts from their basic structure right through to the incredibly intricate genetic mechanisms that govern their existence where maybe their non -existence will get into that.

Yeah, definitely.

And our goal is to make sure you walk away feeling genuinely well -informed, understanding

the fundamental principles, the clever experimental methods that revealed all this stuff, and some fascinating case studies too.

I want to give you a clear, thorough understanding, but without getting totally bogged down in every single detail.

Okay, so let's start with a foundational question then.

What is a virus really?

And why are they considered, well, not quite alive?

It's a great place to begin.

At their core,

viruses are non -living infectious particles.

They carry a nucleic acid genome.

Non -living.

Why non -living?

Well, the non -living part is key because unlike living cells, they just can't perform metabolism or generate energy or maintain internal balance, you know, homeostasis, by themselves.

They absolutely need a living host cell to replicate.

Outside of a host, they're essentially just, well, dormant packages of genetic info.

It's pretty remarkable how long it took us to even figure these things out.

Our source talks about this really interesting history, starting way back with the tobacco mosaic virus, TMV, in the late 19th century.

Oh, yes, TMV.

Yeah.

This virus causes these distinct mosaic patterns on plants.

You see patches of normal green mixed with lighter areas, and it damages leaves, flowers, fruit.

Adolf Mayer, back in 1883, he noticed the disease could be spread just by spraying sap from an infected plant onto a healthy one.

Simple as that.

But then it became more complex.

Right.

Building on Mayer's work, Dmitry Ivanovsky did a really critical experiment.

He proved it wasn't a bacterium.

He showed that even sap filtered through pores tiny enough to trap bacteria could still transmit the disease.

Oh, okay.

So smaller than bacteria.

Much smaller.

Then, Martinez Bejerink took it another step.

He ruled out a simple chemical toxin because he showed the agent kept multiplying through many plant generations.

A static chemical just couldn't do that.

So the key thing was that it multiplied.

That must have been a huge realization.

Absolutely.

That was a differentiator.

And soon after, they started finding animal viruses, like foot and mouth disease, and then, bang, 1900, the first human virus identified was yellow fever.

Wow.

And now we know of, what, over 120 different viruses that infect humans.

At least, yeah.

And despite all that diversity, they do share some core things.

They're incredibly small, and they all absolutely rely on a living host cell to replicate.

But the differences are also pretty striking, right?

Yeah.

Like their host range.

Host range varies massively.

You've got TMV, the plant virus we mentioned that can infect hundreds of different plant species.

Super broad.

Okay.

But then you have something like HIV, which has a really narrow host range.

It specifically targets just one type of human immune cell, the helper T cells.

Very specific.

And their structure.

You said they're tiny.

How tiny?

Oh, incredibly tiny.

We couldn't even really see them until the electron microscope came along in the 1930s.

They range from about 20 to 400 nanometers in diameter.

Which means?

Give me some perspective.

Okay.

Well, think about a typical bacterium.

That's around 1 ,000 nanometers.

So these viruses are significantly smaller.

Just imagine that.

It's kind of hard to wrap your head around how something that small can be so effective.

What's the secret to their, let's say, design?

Well, at their core, all viruses have this protective protein shell.

It's called a capsid.

Okay, a capsid.

Yeah.

And you can think of it like Lego bricks snapping together to build the shell, those individual protein bricks.

We call them capsimers.

And this capsid shell encloses their genetic material.

They come in different shapes, too.

Some are helical, like TMV, others are polyhedral, sort of like a geodesic dome, like you see in adenoviruses, which, you know, cause colds.

And I remember reading that many animal viruses have an extra layer.

Exactly.

Many do.

Like the influenza virus.

They have an outer viral envelope.

Where does that come from?

Good question.

They actually steal it.

It's a lipid bilayer they snag from the host cell's plasma membrane as they're exiting.

And embedded in this envelope are these virally encoded spike glycoproteins, sometimes called peplomers.

Spikes?

Yeah.

Like the ones we hear about with, say, coronaviruses.

Precisely like those.

And these spikes, along with the capsid itself, are absolutely crucial.

They act like keys, basically allowing the virus to bind to specific lox receptors on the host cell surface and get the infection started.

Interesting.

And what about viruses that infect bacteria?

Bacteriophages.

Ah, phages.

Yeah, they often have even more complex capsids.

Some look almost like little lunar landers with accessory structures like tail fibers for grabbing onto the bacterium and injecting their nucleic acid.

Really quite intricate.

So lots of variation in the coats, but the real diversity, the kind of core instruction manual, that's the genetic material inside, right?

The viral genome.

That's where it gets really interesting.

Unlike, well, any other form of life we know, viruses sort of play by their own rules genetically.

They can have either DNA or RNA as their genetic material.

Which is weird, because all living things use DNA, right?

Exactly.

That's a fundamental difference.

And the variations don't stop there.

Their nucleic acid can be single -stranded or double -stranded.

SsDNA, dsDNA, ssRNA, dsRNA.

All possible.

All possible.

And it can be linear or circular.

Some viruses even carry more than one copy of their genome.

The size varies hugely, too.

How much variation?

Well, you get tiny ones like phage Q, just a few thousand nucleotides, maybe four genes.

And then you get really complex ones like phage T4 with, you know, many, many more genes needed to build its elaborate structure over 100 ,000 base pairs sometimes.

OK, this is probably a good point to talk about that key experiment.

The one that proved RNA could actually be the genetic material.

Because by 1956,

everyone pretty much accepted DNA was the stuff of heredity.

Yeah, that was a pivotal moment.

Alfred Yerrer and Gerhard Schramm did this work with TMV.

They managed to isolate just the RNA from the virus particle.

Just the RNA.

Just the RNA.

And when they applied this purified RNA directly to tobacco plants, the plants developed the exact same mosaic lesions as if they had been infected with the whole virus.

Wow.

So the RNA alone was enough.

It was strong evidence, yeah.

The RNA carried the infectious instructions.

And then Franklin Conrad and Singer really nailed it down, didn't they, with that reconstitution experiment?

They did.

It was brilliant.

They worked with two different TMV strains.

The standard wild type, giving those mottled yellow -green spots, and another strain called Holmes ribgrass, or HR.

And the HR strain was different how?

It caused different symptoms, streaks, and rings.

And importantly, its capsid protein was slightly different chemically.

It contained specific amino acids, histidine, and methionine that the wild type protein lacked.

Okay, so distinct symptoms and distinct proteins.

Exactly.

And they knew that you could take purified TMV RNA and purified TMV protein, mix them in a test tube, and they would spontaneously self -assemble into infectious virus particles in vitro.

Clever.

So what did they mix?

They did two key mixes.

First, wild type RNA with HR strain proteins.

Second, HR strain RNA with wild type proteins.

Then they infected tobacco leaves with these reconstituted viruses.

And what happened?

What were the results?

The results were crystal clear.

The mix with wild type RNA and HR protein.

It caused wild type symptoms.

And when they analyzed the new viruses produced in the infected leaves, the proteins were wild type.

They lacked the histidine and methionine of the HR strain.

So the RNA dictated the symptoms and the type of protein made.

Precisely.

And the other mix confirmed it.

HR RNA with wild type protein caused HR symptoms and the new viral proteins contained histidine and methionine.

They were HR proteins.

So absolutely conclusive.

The RNA was carrying the genetic blueprint for TMV.

Without a doubt.

It showed RNA could be genetic material just like DNA.

A huge finding.

That really is fundamental.

Okay.

So we've got to handle on what viruses are, their structures, their unique genomes.

Now how do they actually, you know, take over a cell and make more of themselves, the viral reproductive cycle?

Right.

It's a series of steps all kicked off by the viral genome expressing itself inside a host cell.

And the end goal is always making new virus particles.

The exact details vary a lot between viruses.

But there are sort of six basic steps you see in most cycles.

And we can illustrate these using phage, that bacteria infecting virus, and HIV.

Yeah, they make a great comparison.

Two very different viruses, two different hosts.

Okay.

Step one.

Attachment.

The virus has to specifically latch onto the host cell surface.

It needs to find the right molecules to bind to.

Like a key finding its lock.

Exactly.

For phageome, its tail fibers grab onto specific proteins in the E.

coli outer membrane.

For HIV, those spike glycoproteins in its envelope bind to particular receptors, CD4 primarily, on human helper T cells.

Highly specific.

Okay.

Attached.

What's next?

Step two.

Entry.

The viral genome has to get inside the cell.

Phageo is pretty direct.

It basically injects its DNA straight into the bacterial cytoplasm like a syringe.

And HIV.

Being enveloped.

It's different.

HIV's envelope fuses with the host cell's plasma membrane.

That fusion releases the HIV capsid and its contents the RNA genome, reverse transcriptase, integrates into the cytosol.

Then host enzymes come along and break down the capsid proteins.

That's called uncoding.

Uncoding.

Okay.

So the genome is inside and released.

Right.

And then depending on the virus, the genes might start being expressed right away.

Or there might be a delay, maybe involving integration.

Ah, integration.

Step three, but it's optional.

Optional, but really key for viruses like phage and HIV.

This is where the viral genome actually inserts itself into the host's own chromosome.

Usually needs a viral enzyme integrase to do the job.

So how does that work for phageo?

For phageo, its double -stranded DNA integrates directly into the bacterial chromosome.

Once it's in there, we call it a profage.

Profage.

And this kicks off the lysogenic cycle.

In this cycle, no new phages are made right away.

The host cell isn't destroyed.

The profage DNA just gets copied along with the host DNA every time the bacterium divides.

It's kind of a stealth mode.

And why would it do that?

Often favored when conditions aren't great for the host, like if nutrients are scarce, the phage just waits it out, integrate it safely.

Okay, but HIV's is an RNA virus.

How does it integrate into our DNA?

That seems impossible.

Well, that's where a special enzyme comes in.

Reverse transcriptase.

The one that breaks the rules.

Exactly.

It's HIV's secret weapon, really.

This enzyme, which is carried inside the HIV particle, does something amazing.

It makes DNA using the viral RNA as a template.

First, a single DNA strand complementary to the RNA, then it makes a second DNA strand, creating a double -stranded viral DNA copy.

So it's an RNA to DNA, backwards from the usual DNA to RNA.

Precisely.

That's why HIV is called a retrovirus.

This resulting viral DS DNA, now called a provirus, can then enter the host cell nucleus and integrate into one of our chromosomes, again, using the enzyme integrase.

And like the profage, this provirus can just sit there, latent, for potentially long periods.

Incredible.

Okay, so step four must be actually making the parts for new viruses.

Right.

Synthesis of viral components, replication of the viral genome, and making all the viral proteins, capsid proteins, enzymes, everything needed.

How does that work for phageal, especially if it was hiding out as a profage?

Good question.

If it was lysogenic, the profish first has to pop back out of the host chromosome.

That needs help from another viral enzyme called excisionase.

Once it's free, a separate DNA, host enzymes get hijacked to copy the phage DNA multiple times.

Host enzymes also transcribe viral mRNA, and then host ribosomes translate that mRNA into viral proteins.

Often, the phage directs the destruction of the host's own DNA, focusing all resources on making more phages.

And for HIV, with its integrated provirus.

The provirus DNA, sitting in our nucleus, gets transcribed by our own cellular machinery into many copies of viral RNA.

These RNA molecules then move out into the cytosol, and they have two jobs.

Some are translated to make viral proteins, and others are set aside to become the genome for the new virus particles being assembled.

Which brings us to step five, viral assembly, putting all the pieces together.

Exactly.

For some simple viruses, like TMV, it's almost magical the parts can actually self -assemble.

Protein and RNA just spontaneously come together correctly, even in a test tube.

Wow.

But not for the more complex ones.

Not usually.

Phage D, for example, needs other non -capsid proteins, like stafelting proteins, to help guide the assembly process.

For HIV, the components gather at the inner surface of the host cell's plasma membrane.

The capsid proteins cluster around two copies of the viral RNA genome, plus the essential enzymes like reverse transcriptase and integrase.

Assembled.

Final step.

Release.

Getting out of the cell.

Step six.

Right.

Bacteriophages often use what's called the flitic cycle for release.

After assembly, they produce an enzyme, lysozyme, which digests the bacterial cell wall from the inside.

The cell bursts open, it lysis -releasing a flood of newly made phages.

Quite dramatic.

But enveloped viruses like HIV are different.

Yeah.

They usually have a more subtle exit.

They escape by budding.

The assembled capsid pushes out against the plasma membrane, and a portion of that host membrane wraps around the capsid, forming the new viral envelope.

It then pinches off, budding away from the cell.

Crucially, this usually doesn't slice a cell immediately.

So the cell can survive and keep pumping out more viruses.

Potentially, yes, for a while.

It's a different strategy.

This brings up the idea of latency, right?

That dormant phase you mentioned.

Exactly.

Whether it's a prophage in bacteria or a provirus in our cells, the viral genome can just hang out, inactive.

Most viral genes are shut off.

No new viruses are made.

In bacteriophages, you call that latency lysogeny.

Correct.

And a bacterium carrying a prophage is called lysogenic.

It just copies the prophage along with its own DNA when it divides.

Phage is called a temperate phage because it can choose between the lysogenic cycle, latency,

and the elitic cycle, active reproduction and lysis.

What influences that choice?

Often environmental cues.

Good conditions for the host might favor the light -lylic cycle, while stress or poor conditions might favor lysogeny.

But some phages, called virulent phages, only do the lytic cycle.

They can't integrate or become latent.

And human viruses show latency, too.

You mentioned HIV.

Yes.

HIV's integrated provirus is a classic example of latency, potentially lasting years.

Herpes viruses are another big one.

Think chickenpox caused by the varicella zoster virus.

The virus doesn't necessarily get eliminated.

Its genome can persist in nerve cells, sometimes as an episome, a separate piece of DNA that replicates along with the cell, though it can sometimes integrate, too.

Years later, maybe decades, if immunity wanes, it can reactivate and cause shingles.

Same virus, different disease manifestation due to latency and reactivation.

That leads kind of naturally into emerging viruses, doesn't it?

These viruses that seem to pop up or change and cause new problems.

It does.

Emerging viruses are basically those that have appeared recently or are causing infections more frequently or severely than previous strains.

They often cause, understandably, a lot of public concern.

Like new flu strains.

Exactly.

Influenza is a prime example.

New strains, like H1N1 or swine flu back in 2009, emerge regularly because the virus mutates easily.

H1N1 had a mix of genes from pig, bird, and human flu viruses.

That's why we need updated flu vaccines so often.

And Zika.

That caused a lot of alarm a few years ago.

Zika virus, yes.

A flavivirus, single -stranded RNA spread by eggy's mosquitoes.

The symptoms themselves could be fairly mild fever, rash, joint pain.

But the big concern was infection during pregnancy, which could lead to severe birth defects, particularly microcephaly, and it spread globally quite rapidly.

And then there's HIV, arguably one of the most impactful emerging viruses of the last century.

Without a doubt.

HIV, the human immunodeficiency virus.

Research points to it being a mutated form of SIV simian immunodeficiency virus from chimpanzees in West Africa.

How did it jump to humans?

Likely through hunting and contact with infected blood, probably in the early 20th century.

It then spread gradually.

As we know, HIV specifically targets and destroys those crucial helper T cells in our immune system.

Which are vital for coordinating immune response.

Absolutely.

And losing them compromises the whole immune system, leading to acquired immunodeficiency syndrome or AIDS.

This makes people vulnerable to opportunistic infections, things like rare pneumonias or cancers that a healthy immune system would normally handle easily.

What makes HIV so difficult to fight, both for our bodies and for medicine?

A key factor is its incredibly high mutation rate.

Remember that reverse transcriptase enzyme it uses?

The one that makes DNA from RNA.

That's the one.

Well, it's notoriously sloppy.

It lacks the proofreading function that our own DNA copying enzymes have.

So it makes a lot of errors when it synthesizes that viral DNA.

Meaning lots of mutations.

Lots and lots of mutations.

This generates huge diversity in the virus population, even within a single infected person.

It constantly creates new mutant strains.

This helps it evade the immune system.

And it's the main reason why antiviral drug resistance can develop so quickly.

It's just an incredibly adaptable moving target.

That challenge really highlights why we need to understand these viral mechanisms in such detail.

And maybe zooming in on phage again as a model system can show us just how intricate that control can be.

You called it a genetic switch masterclass.

It really is.

Phage studied since the 50s has taught us so much about gene regulation, about how proteins can act like switches to turn genes on or off.

These principles apply everywhere in biology, not just viruses.

So when OMAIL infects E.

coli, its linear DNA gets injected and then forms a circle inside the bacterium.

Right.

And its genome is beautifully organized.

You can literally see clusters of genes dedicated to either the lysogenic pathway or the lysolytic pathway.

The expression of certain early genes determines which path the phage commits to.

Let's talk about the lysogenic path first.

Integration.

OK.

If lysogyny is chosen, the key enzyme is integrase from the int gene.

Integrase is highly specific.

It recognizes identical short DNA sequences called attachment sites.

There's one on the phage DNA called at -TP and a matching one on the E.

coli chromosome at at -TP.

So it knows exactly where to integrate.

Pretty much, yes.

Integrase makes staggered cuts at both sites,

swaps the DNA strands between the phage and the host chromosome, and then seals them up, ligates them.

That seamlessly integrates the ELNA into the E.

coli chromosome, forming the prophage.

And it can stay there, latent, replicating with the host for generations.

But it can get out again.

It can.

If the host cell gets stressed, say by UV light, which damages DNA, that can trigger excision.

This requires integrase plus another protein called excisionase from the exos gene to basically reverse the integration process, popping the prophage back out.

A clever escape plan if the ship is sinking.

Exactly.

If the host is in trouble, it's time for the phage to switch to the lytic cycle, reproduce rapidly, and find new hosts.

So the lytic cycle, what happens then?

That's the all -out reproduction phase.

Lots of phage DNA copies are made, and all the necessary coat proteins are synthesized.

Specific genes get turned on to make the head proteins, shaft proteins, tail fibers, things needed for assembly, packaging the DNA inside the new heads, and finally the lysis enzymes to burst the cell open.

And the choice between these two cycles, lysogyny versus lysis, comes down to that genetic switch involving the CDI and Cro protein.

That's the heart of it.

It's a regulatory decision based on the relative levels and activities of these key proteins, especially CPI and Cro.

Right after the DNA enters the cell, two promoters, PL and PR, get turned on.

And they make?

They produce short initial RNA transcripts for two crucial proteins, the N protein and the Cro protein.

What does N protein do?

N protein is really interesting.

It's an anti -terminator.

It prevents the transcription machinery from stopping at certain downstream termination signals.

So let's transcription continue further down the DNA.

Precisely.

This allows for longer transcripts to be made, including the genes for integrase, calp -X's, excisionase, and very importantly CPI.

There are others too, like O and P needed for DNA replication, and Q, another anti -terminator for late molletic genes.

And you mentioned CPI protects the eye.

Yes, the CPLY protein inhibits a specific host cell protease that likes to chew up the C -CHI protein.

So C3 helps CTI accumulate.

Okay, so if CI manages to build up to high levels, which cycle gets favored?

High CTI levels push the phage towards the lysogenic cycle.

And when would that happen?

Usually when the host cell is stressed or starved.

Under those conditions, the levels of that CII degrading protease are low.

So C -CHI isn't broken down as fast and accumulates.

What does high CTI actually do?

It acts as an activator.

It turns on two specific promoters.

One is PRE, the promoter for repressor establishment.

This turns on the CI gene, which makes the all -important ol -repressor.

The lambda -repressor.

That's the key to lysogeny.

Absolutely.

The ol -repressor is what establishes and maintains the lysogenic state.

The second promoter CI activates is PI, the promoter for integrase, turning on the INT gene needed for integration.

How does the lambda -repressor maintain lysogeny?

It binds to specific operator sites, AWELL and OR, which are located near those initial promoters, PL and PR.

When ol -repressor binds there, it physically blocks RNA polymerase, shutting down the transcription of most phage genes, especially the lytic genes.

So it represses for lytic development.

Exactly.

But here's a really elegant twist.

While it represses PL and PR, the ol -repressor actually activates its own transcription from a different, weaker promoter called PRM, the promoter for repressor maintenance.

Wait, it activates itself?

Kind of.

The strong PRE promoter, turned on by CCRI, gets the initial high levels of repressor established.

Then, once repressor binds to the operators, it ensures a continuous low -level supply of itself from PRM, even after site AU levels might drop.

It creates a stable feedback loop to maintain the lysogenic state.

Very clever.

How does it switch out of lysogeny, then?

Like with UV light?

UV light causes DNA damage.

The host cell has a protein called REPE that senses this damage.

Activator Rekay then interacts with the O -repressor and causes it to be cleaved and activated.

Ah, so the repressor gets destroyed.

Right.

And once the repressor is gone, the block on the PR promoter is lifted.

This allows transcription of the lytic genes, including Crow, to begin pushing the phage into the lytic cycle.

Makes sense, right?

Host is damaged.

Time to multiply and leave.

Okay, so that's the lysogenic path driven by high CTI and the lambda repressor.

What about the other way?

The lytic cycle driven by Crow?

That happens when conditions are good for the host bacterium.

Plenty of nutrients.

Rapid growth.

Under these conditions, that host protease that degrades CTI is highly active.

So CTI gets chewed up quickly and never reaches high levels.

But Crow protein isn't degraded.

No, Crow is stable.

So with CTI levels low, Crow protein accumulates instead.

And what does Crow do when it builds up?

Crow protein also binds to those same operator sites, AWIL and OR, but with different effects and affinities compared to the O -repressor.

It acts primarily as a repressor.

It represses.

It binds to OR and strongly shuts down transcription from PRM, the promoter for repressor maintenance.

So it prevents the O -repressor from being made, effectively blocking the lysogenic pathway.

Turning off the path to latency.

Exactly.

It also binds to OL and eventually OR to turn down transcription from PL and PR.

But crucially, it does this in a way that allows the essential early solitic genes, like OP for replication and Q, to be expressed first.

Q protein.

You said that was another anti -terminator.

Yes.

Once enough Q protein is made, it acts as an anti -terminator specifically for a promoter called PR, PR prime.

This allows transcription of a large block of late -lative genes, all the genes needed for making the phage structural proteins, head, tail, assembling the new phages, packaging DNA, and finally, lasting the cell.

So it's really this competition at the OR operator region that acts as the switch.

Precisely.

The OR operator region is the genetic switchboard.

It actually consists of three distinct operator sites, OR1, OR2, and OR3.

And controlling these sites are two promoters, PRM for repressor and PR for Cro -inflative genes pointing in opposite directions.

How do repressor and Cro compete there?

They have different binding preferences.

Yeah.

The O -repressor binds most tightly to OR1 than cooperatively to OR2.

Binding to OR1 and OR2 shuts down PR, blocking lysis, and simultaneously activates PRM, maintaining lysogeny.

Okay.

Crotein binds most tightly to OR3.

Binding there immediately shuts down PRM, blocking lysogeny.

As Cro levels rise further, it then binds to OR2 and OR1, eventually shutting down PR as well, but only after the necessary early slitic functions have occurred.

It's like a molecular seesaw, tilting one way or the other based on which protein wins the binding competition at OR.

That's a great analogy.

It's a beautiful example of how complex developmental decisions can be controlled by the interplay of just a few regulatory proteins binding to specific DNA sites.

These kinds of genetic switches are fundamental not just in viruses, but in bacteria and even in our own cells during development.

It's truly amazing the level of sophistication packed into such a tiny entity.

You almost have to admire the biological elegance of it.

Absolutely.

It really underscores how evolution can craft these incredibly precise molecular machines.

But if you think lambda phage is intricate, let's switch gears to HIV.

Its interaction with human cells is a whole other level of complexity and frankly, molecular cunning.

Especially since it targets our immune system itself, right?

Specifically helper T cells.

Exactly.

Helper T cells are absolutely crucial commanders in our immune response.

HIV targets them, infects them, and ultimately destroys them.

And the loss of these cells is what devastates the immune system, leading to AIDS.

Okay, so let's look at the HIV genome again.

It's single -stranded RNA, about 9 ,700 nucleotides, and it has those identical sequences at the ends, the LTRs.

Yes, the long -terminal repeats LTRs.

They're not just padding, they are critical regulatory regions.

They contain signals for replication, integration into the host's DNA, and controlling the expression of viral genes.

Think of them as the virus's command and control centers.

And within those LTRs are the actual genes.

How many?

HIV packs incredible efficiency into its small genome.

It has nine main genes, but some of them are processed to make multiple proteins.

They fall into a few categories.

You've got structural genes, like GAG, which makes proteins for the viral core and capsid.

Then POLL, which encodes the crucial enzymes, reverse transcriptase, integrase, and protease.

And NV makes the envelope glycoproteins, GP120 and GP41, that sped the outside of the virus.

Those are the structural and enzymatic parts.

What else?

Then there's a whole suite of smaller regulatory and accessory genes, TAT, REV, NEF, VIF, VPR, VPU.

These are like the master controllers and facilitators.

They regulate viral gene expression, help the virus counteract host defenses, transport the viral DNA into the nucleus, and manage the assembly and release of new viruses.

It's a complex toolkit.

Okay, so after HIV enters a T cell and uncoats, the first big job is that unique step, reverse transcription.

Right.

This happens out in the cytoplasm.

That viral reverse transcriptase enzyme, brought in with a virus particle, gets to work using the viral RNA as its template.

And it makes double -stranded DNA.

Yes.

Through a pretty complex process, it uses a host tRNA molecule that binds to the viral RNA as a starting point, a primer.

It synthesizes a complementary DNA strand.

Then an RNAase H activity that's part of the reverse transcriptase enzyme degrades the original RNA template.

It then uses that first DNA strand as a template to make the second DNA strand.

There's some clever template jumping involved using those LTR sequences.

The end result.

The end result is a linear, double -stranded DNA copy of the HIV genome, now with complete LTR sequences at both ends.

It's ready for the next step.

Which is integration into the host chromosome,

also mediated by a viral enzyme.

Correct.

By viral integrase, also packaged in the incoming virus particle.

Integrase binds to the ends of the newly made viral DNA, near the LTRs.

And then what happens is just find a random spot.

It forms a structure called the preintegration complex.

This is the viral DNA bundled with integrase and some other viral proteins, like VPR.

VPR actually helps shuttle this whole complex through the nuclear pore into the host cell's nucleus.

Once inside, the complex binds to the host chromosomal DNA.

It's not entirely random.

There seem to be some preferences for actively transcribed regions, and integrase gets to work.

How does it actually insert the DNA?

Integrase makes staggered cuts in the host's DNA.

Then it covalently links the ends of the viral DNA to the cut ends of the host's DNA.

The host cell's own DNA repair enzymes then fill in the small gaps, sealing the deal.

The result is the HIV DNA, now called a pro -virus, seamlessly integrated into the host genome.

And it can just sit there, blatant.

It can, potentially for years, silent, hidden within our own DNA.

That's a major challenge for curing HIV.

So how does it get activated to actually start producing new viruses?

Activation often requires the host T -cell itself to become activated.

Remember NF -HU, that host transcription factor.

Yeah, you mentioned it could be activated when a T -cell fights infection.

Exactly.

When a T -cell encounters its specific antigen and gets activated, signaling pathways turn on NF -HU.

Activated NF -HU moves into the nucleus, and it turns out the HIV LTR promoter region has binding sites for NF -HU.

So when the host cell gets activated to do its immune job.

It inadvertently activates the latent HIV pro -virus as well.

NF -HU binds to the HIV promoter and kickstarts transcription of the viral genes by the host cell's RNA polymerase.

So ironically, the very cells trying to fight infection become factories for producing more HIV.

That's incredibly insidious.

Okay, so transcription starts.

What happens to the HIV RNA that's made?

You mentioned complex splicing.

Yes.

The primary HIV RNA transcript can be processed in multiple ways.

It can be fully spliced, incompletely spliced, or remain unspliced.

This differential splicing is crucial for regulating which viral proteins are made at which time.

What gets made first?

Early on, you get mostly fully spliced HIV RNA.

This RNA exits the nucleus easily and is translated in the cytoplasm to produce the early regulatory proteins.

NEF, TAT, and REV.

Cat and REV sound important.

They are critical.

TAT dramatically boosts the rate of HIV transcription from the LTR promoter.

It's a powerful positive feedback loop, amplifying virus production.

REV is also essential.

It shuttles back into the nucleus and binds to specific sequences on the other forms of viral RNA, the incompletely spliced and unspliced ones.

Why does it need to do that?

Normally, incompletely spliced or unspliced RNAs are retained in the nucleus in our cells.

REV acts like an export permit, binding to these RNAs and allowing them to be transported out to the cytoplasm.

So REV allows the later RNAs to get out, what do they encode?

Incompletely spliced HIV RNA, once it gets out, thanks to REV, is translated to make other viral proteins like VIF, VPR, VPU, and importantly the N -violi protein, which gets cleaved into the GP120 and GP41 envelope spikes.

And the unspliced RNA.

The full -length unspliced RNA has two vital roles.

First, it serves as the actual genome that will be packaged into new virus particles.

Second, it's translated to produce the large GAG polyprotein and occasionally via a ribosomal frameshift or read -through event, the even larger GAG -POL polyprotein.

GAG and GAG -POL, what do they become?

GAG is the main structural polyprotein.

It contains the sequences for the matrix, capsid, and nucleocapsid proteins that form the core of the new virus.

GAG -POL contains the GAG proteins plus the POL proteins, protase, reverse transcriptase, and integrase.

Okay, so all the components are being made.

How do they come together?

The assembly, budding, and maturation phase.

It's a coordinated process, mainly occurring at the host cell's plasma membrane.

The GAG polyprotein is the master organizer of assembly.

How does GAG do that?

GAG has different domains.

One part binds to the interface of the plasma membrane.

Another part interacts with other GAG molecules, driving the formation of the spherical particle.

Another domain binds and captures the two copies of the unspliced HIV RNA genome.

And a small domain at the end interacts with other viral and host proteins needed for budding.

So GAG molecules gather at the membrane, bringing the RNA with them.

Exactly.

They cluster together, trapping the genome and enzymes inside, and start to form a spherical bud that pushes outward from the cell surface.

This budding process.

How does it complete?

How does the virus pinch off?

It actually hijacks a host cell pathway called the ESCRT pathway.

This pathway is normally used by the cell for things like membrane repair and forming vesicles.

HIV, particularly through its GAG protein and another protein called VPU, recruits components of the ESCRT machinery to the neck of the budding virus.

And ESCRT helps it pinch off?

The ESCRT proteins mediate the membrane fission event that releases the immature HIV particle from the cell.

VPU also plays a role here by counteracting a host defense protein called tetherin, which would otherwise literally tether the budding virus to the cell surface and prevent its release.

So it escapes.

But you call it an immature particle.

Right.

The final step, which actually begins during or just after budding, is maturation.

This is absolutely critical for the virus to become infectious.

What happens during maturation?

This is where HIV protease comes in.

Remember, it was packaged into the particle as part of the GAG -Pol polyprotein.

Now, HIV protease becomes active and starts cleaving the GAG and GAG -Pol polyproteins at specific sites.

Cutting them into individual proteins?

Precisely.

It cuts GAG into the separate matrix MA, capsid CA, nucleocapsid NC, and P6 proteins.

These proteins then rearrange.

The matrix proteins line the inner surface of the viral envelope.

The capsid proteins assemble into the characteristic conical core structure that encloses the RNA genome and enzymes.

Nucleocapsid proteins coat the RNA.

And cleaving GAG -Pol releases the enzymes.

Yes, it releases more active protease, which can continue the cleavage process, plus the mature reverse transcriptase and integrase enzymes, all packaged neatly inside the capsid, ready to infect the next cell.

This structural rearrangement transforms the immature non -infectious particle into a mature infectious virium.

Wow.

That whole cycle from entry to maturation is just incredibly complex and coordinated.

It truly is.

Every step is finely tuned and relies on interactions between viral and host components.

So reflecting on all this, what's the big takeaway?

We've gone from the basic definition of a virus, these non -living entities, right through the detailed molecular steps of phage lambda's genetic switch and HIV's hijacking of our cells.

You see this incredible precision, almost like molecular engineering, and how they operate.

Lambda choosing between lysis and lysogeny based on its host condition, HIV using reverse transcriptase, integrating using our own NFVB against us, the complex splicing, the protease -driven maturation, it's quite something.

It really demonstrates the power of molecular processes and evolution, and it highlights something crucial.

Understanding these intricate cycles isn't just, you know, interesting science for its own sake.

Right.

It has real -world implications.

Huge implications.

It directly informs how we develop antiviral therapies and vaccines.

Think about HIV treatment.

Many of the most successful drugs target specific steps in that cycle we just discussed.

Like reverse transcriptase inhibitors.

Exactly.

Or protease inhibitors that block that final maturation step.

Integrase inhibitors are another important class.

Each step represents a potential vulnerability, a target we can hit with drugs.

The more we understand these fundamental processes at a molecular level, the better equipped we are to design effective strategies against existing viruses and, importantly, future viral threats.

Absolutely.

These tiny particles, technically non -living, have such a profound and complex impact on the living world.

And our deep dive today, drawing on Brooker's text, really shows that by digging into the of these smallest biological players, we uncover fundamental secrets about genetics, regulation, and life itself, secrets that can directly improve human health.

Well said.

The study of viruses continues to be incredibly revealing.

Well thank you for joining us on this deep dive into the fascinating world of viral genetics.

We certainly hope you found it as enlightening and maybe even as awe -inspiring as we did.

And I thank you, as always, for being part of the Last Minute Lecture family.

We really appreciate you listening and can't wait to explore more science with you next time.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers
Viral genetics encompasses the mechanisms by which viruses replicate their genetic material, maintain their genetic information, and evolve through mutation and recombination. Unlike cellular organisms, viruses exhibit remarkable diversity in their genetic organization, employing single-stranded or double-stranded DNA, single-stranded or double-stranded RNA, or even hybrid strategies to encode the proteins necessary for their replication cycle. The fundamental processes of viral genetic replication vary substantially depending on viral type and host cell environment, requiring specialized enzymes and strategies distinct from those found in cellular systems. Mutation rates in viruses characteristically exceed those observed in their host organisms, driven by error-prone replication machinery, absence of proofreading mechanisms in many viral polymerases, and selective pressures favoring rapid adaptation to host immune responses. These elevated mutation frequencies generate genetic diversity that enables viral populations to establish persistent infections, escape immune recognition, and develop resistance to antiviral treatments. Viral recombination occurs when multiple viruses infect the same cell, allowing exchange of genetic segments through complementation or reassortment events, particularly significant in segmented genome viruses such as influenza and reoviruses. Population genetics principles applied to viral systems reveal quasi-species dynamics, where viral populations exist as clouds of closely related genetic variants rather than uniform entities, conferring substantial evolutionary flexibility. Viral genetics also encompasses the regulation of gene expression through temporal control mechanisms that coordinate early, middle, and late gene expression phases with viral lifecycle stages, ensuring efficient resource allocation during infection. Horizontal gene transfer between viruses and hosts, or between different viral species, contributes to the evolution of viral genomes and the acquisition of genes that enhance virulence or transmissibility. Understanding viral genetic mechanisms has proven essential for developing vaccines, designing antiviral therapies, predicting viral evolution, and identifying zoonotic spillover events that initiate pandemic threats in human populations.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥