Chapter 2: Genome Structure and Gene Expression
Welcome to Last Minute Lecture.
This free chapter overview is designed to help students review and understand key concepts.
These summaries supplement not replaced the original textbook and may not be redistributed or resold.
For complete coverage, always consult the official text.
Welcome back.
We've got a deep dive today taking a chapter from a plant physiology textbook and really pulling out the core knowledge.
Right.
We're looking at plant genomes, what they are, how they're organized.
How plants actually use that information and importantly how we study and even modify them.
Exactly.
It's like getting the blueprint for a plant and it helps understand how it grows, responds to its environment.
And how the traits we see the phenotype come from those genetic instructions, the genotype.
Plus of course epigenetics and environment play huge roles.
Yeah, it's that whole complex picture.
We've touched on DNA basics before, the double helix, chromosomes, DNA to RNA to protein.
But this time we're going deeper into the nuclear genome organization.
What's in there besides just protein coding genes?
And looking outside the nucleus too, right?
At the other genomes.
Yes, exactly.
And how gene expression is regulated, the tool scientists use, all that good stuff.
Okay, so let's start inside that nucleus.
That's where most of the essential genes are.
And the first thing, wow, the size variation is just enormous, isn't it?
It really is staggering.
You take something like Arabidopsis thaliana, Thale Crest.
The workhorse model plant.
Right, one of the first plant genomes sequenced.
Pretty small, relatively speaking, about 157 million base pairs, five chromosomes, roughly 27 ,000 protein coding genes.
Seems manageable.
But then you compare that to, what was it, Paris Japonica?
Yeah, it's something like 150 billion base pairs.
It's just vast, hundreds of times larger than Arabidopsis.
So clearly, more DNA doesn't just mean more protein coding genes.
What else is taking up all that space, even in Arabidopsis?
Well that's a key point from the chapter.
Besides the genes we usually think about, there are pseudo genes, kind of like broken gene copies.
Okay.
And mobile DNA elements, transposons, we'll definitely circle back to those.
Plus, a whole lot of non -protein coding RNAs, things like ribosomal RNA, transfer RNA, sure.
But also RNAs that regulate other genes.
How do they fit it all in, especially those giant genomes, billions of base pairs?
It's all about packaging.
The DNA wraps around these proteins called histones.
Like beads on a string.
Exactly.
That forms nucleosomes.
And this whole DNA protein complex is called chromatin.
And this chromatin isn't all the same, is it?
There are different types.
No, it exists in different states.
The textbook talks about euchromatin and heterochromatin.
Heterochromatin is really tightly packed, condensed, stains darker under a microscope.
And euchromatin is looser.
Looser, less condensed.
And this difference in packing is really important for function.
Because the active genes are in the looser stuff.
Precisely.
Most of the genes that are being actively transcribed into RNA are found in the euchromatin, where the cellular machinery can actually access the DNA.
Okay, so the super condensed heterochromatin, that's mostly in active genes.
Or other things.
Yeah, mostly in active genes or regions with very few genes.
It's also where you find a lot of repetitive DNA sequences.
And structural parts of the chromosome, like centromeres and telomeres.
Ah, repetitive DNA.
So that's a big feature of heterochromatin.
A very big feature.
You have tandem repeats, patterns repeated right next to each other.
And dispersed repeats scattered around.
Things like microsatellites and those transposons again.
And these repeats help build important chromosome structures.
You mentioned centromeres and telomeres.
They do.
Scientists can actually see these landmarks using techniques like FESH fluorescent in -situ hybridization.
The chapter shows some neat maze examples.
So centromeres, that's the constricted bit in the middle.
Usually near the middle, yeah.
It's where the spinal fibers attach during cell division via a protein complex called the kinetochore.
And centromeres are built from just massive stretches of repetitive DNA.
Which makes them hard to sequence.
Extremely hard.
Millions of base pairs of repeats sometimes.
And telomeres are the ends.
Like caps.
Exactly.
Protective caps.
They stop the chromosome ends from degrading during replication and prevent them from sticking together.
Essential for stability.
And N -ORs.
What are they?
Nucleolar organizer regions.
This is where the genes for ribosomal RNA are clustered.
You need tons of ribosomes for protein synthesis.
So you need many copies of these RNA genes.
Hundreds, often.
And they form the nucleolus.
Right.
These N -ORs organize that dense structure within the nucleus where ribosomes are actually assembled.
Interestingly, even though they're repetitive, this RNA is highly transcribed.
Okay, let's go back to transposons.
Jumping genes.
They sound pretty disruptive.
They can be.
They're a major component, especially in heterochromatin.
And yes, they can move around the genome.
How do they move?
Are there different ways?
Two main classes, based on the mechanism.
Class one are retrotransposons.
They use a copy and paste method.
Copy and paste.
They make an RNA copy of themselves first.
Then reverse transcriptase makes a DNA copy from that RNA, and that new DNA copy inserts somewhere else.
So their numbers can increase rapidly.
Wow.
And they can make up a lot of the genome.
A huge amount.
The source mentioned 58 % in Norway's spruce.
Just these elements.
Incredible.
And class two.
Class two are DNA transposons.
They use more of a cut and paste mechanism.
An enzyme called transposase cuts the transposin DNA out and pastes it into a new spot.
Usually this doesn't increase the copy number.
So why are these jumping genes so important biologically?
Because they cause mutations.
If one lands inside a gene, it can wreck it.
If it lands near a gene, it can change how that gene is turned on or off.
So they drive genetic variation.
Maybe even evolution?
Potentially, yes.
By creating new mutations and rearranging things.
But plants aren't just passive victims.
They have ways to control them.
Primarily through epigenetic silencing.
DNA methylation, histone modifications, basically locking them down in that tightly packed heterochromatin.
And there are experiments showing this.
Yeah, the textbook describes mutants like DDM1 and Arabidopsis that can't maintain proper methylation.
Over generations, transposons that were silent can wake up and start jumping again.
And cause problems.
Exactly.
There's a figure showing a transposon jumping into a growth hormone gene, DWF4, making the plant a dwarf.
But then, if the transposon jumps back out.
The plant goes back to normal.
It reverts.
It's a really clear demonstration of how important methylation and heterochromatin are for keeping the genome stable by silencing these mobile elements.
Okay.
So chromosomes have structure, they have territories, even when the cell isn't dividing.
What about when the plant needs to make pollen or egg cells?
That's meiosis, right?
Right.
Meiosis.
It's a very special kind of cell division.
Its job is two -fold.
Cut the chromosome number in half, deploy it to haploid, and shuffle the genetic deck through recombination.
Two divisions.
But only one DNA replication step.
That's how the number gets halved.
Correct.
And the shuffling part, the crossing over, happens mainly during a long stage called prophase I.
Which has sub -stages.
Leptotene, zygotene.
Zygotene, where homologous chromosomes pair up, forming these structures called syneptonomal complexes.
Then, pecatine, where crossing over happens, diplatine, where you can see the crossover points, the chiasmata.
This exchange of DNA segments is vital for genetic diversity, isn't it?
Absolutely critical.
Then meiosis separates the homologous pairs, and meiosis II separates the sister chromatids, ending up with four unique haploid cells.
It's a beautiful, intricate process.
Now, something else common in plants, maybe more so than animals, is polyploidy.
Having extra sets of chromosomes.
Yes.
A really major feature of plant genomes and evolution.
It means having more than two complete sets of chromosomes.
It can happen just in certain tissues, or it can happen in the germ line and create a whole new polyploid organism.
And there are different kinds, auto and allopolyploids.
That's right.
Autopolyploids have multiple genome sets, all from the same species.
Allopolyploids have combined genome sets from different species.
How do they even arise?
Errors in meiosis are a big one.
If meiosis messes up and produces diploid gametes instead of haploid, and two of those diploid gametes from the same species fuse, you get an autotetraploid four sets.
Or sometimes somatic cells just duplicate their genome without dividing.
And allopolyploids often involve hybridization first.
Often, yes.
Two different species cross, making a hybrid.
That hybrid might be sterile because the chromosomes don't pair well in meiosis, but if that hybrid undergoes genome duplication… It gets two matched sets of each parental chromosome.
Exactly.
Which can restore fertility.
The brassica triangle is the classic example.
Different brassica species hybridizing and duplicating to form things like canola and other crops.
We can even induce it artificially with chemicals.
Is this the same as hybrid vigor?
Good question.
No, it's distinct.
Hybrid vigor, or heterosis, is usually the boost you see when crossing different inbred lines of the same species.
It seems more related to increased heterozygosity than just having more chromosome sets.
Because more sets doesn't automatically mean better.
Not necessarily.
The textbook shows data from maize where doubling from haploid to diploid increases vigor, but going higher to tetraploid actually decreases it in those lines, so it's complex.
But many important crops are allopolyploids, right?
Like wheat, cotton, coffee… Yes, definitely.
Allopolyploids often benefit from both the hybrid nature, combining different gene versions, and the genome duplication, gene redundancy, buffering.
They're often very robust.
What happens at the molecular level when you suddenly match two different genomes together in an allopolyploid?
It can be quite dramatic and happen fast.
The chapter mentions genome reorganization, big shifts in epigenetics, changes in gene expression patterns, altered splicing, transposons waking up.
A lot happens.
Tools like RNAseq are vital for tracking all this.
And those extra gene copies from polyploidy?
What happens to them over time?
Evolution gets to work.
One copy might be lost, one might gain a new function, or the two copies might divide up the original functions, that sub -functionalization, or they might just evolve different expression patterns.
So species that look diploid now might have been polyploid way back.
Exactly.
They're called paleopolyploids.
They went through duplication and then lost a lot of the extra DNA, but you can still see the evidence.
Arabidopsis, maize, brassicas, they all show signs of ancient polyploidy.
And this is totally different from aneuploidy.
Completely.
Aneuploidy is having an extra or missing individual chromosome, like trisomy 21 in humans.
That's often really detrimental, especially in animals.
Plants, particularly polyploids, tend to tolerate aneuploidy much better.
Is polyploidy always a good thing, evolutionarily?
It's debated.
It clearly creates novelty.
But polyploids often seem to be younger species, leading some to think it might be an evolutionary dead end sometimes, maybe because meiosis gets complicated.
Still, it's so common in plants, the advantages must often outweigh the disadvantages.
Okay, we've been deep in the nucleus.
But plants have DNA elsewhere, too.
They do.
In mitochondria, the power plants, and in plastids like chloroplasts, the solar panels.
And these came from bacteria originally, endosymbiosis.
That's the leading theory, yeah.
An ancient cell engulfed an aerobic panaterium, became mitochondria, and later maybe another engulfed a photosynthetic cyanobacterium, became plastids.
The evidence, like their double membranes and bacterial -like DNA, supports this.
Their DNA sits in regions called nucleoids, no nuclear envelope.
Are the genomes big?
Plastid genomes are fairly consistent, maybe 120, 160 ,000 base pairs, coding for photosynthesis genes and their own expression machinery.
Mitochondria are weirder in plants, much more variable in size than in animals.
From under 200 ,000 base pairs up to 11 million.
Wow, why so variable?
Mostly non -coding repetitive DNA again.
They code for energy metabolism genes and their own expression stuff.
But interestingly, most genes needed for organelle function actually migrated to the nucleus over time.
The proteins are made outside and then imported back in.
And the DNA isn't just simple circles.
Often not.
It seems they can form complex, linear, and branched structures, sometimes multiple genome copies linked together.
Not the simple picture we used to have.
So inheritance of these organelle genes,
it's not standard Mendelian genetics?
No, it's non -Mendelian.
Usually you see uniparental inheritance.
Meaning from only one parent.
Right.
In most flowering plants, mitochondria and plastids come solely from the mother, through the egg cell.
Conifers are an exception where plastids can come from the father, but it's typically one -sided.
And there's another weird thing.
Vegetative segregation.
Yeah.
If a cell starts with a mix of, say, normal and mutant chloroplasts, when that cell divides by mitosis, the organelles get randomly distributed.
Just by chance, some daughter cells might end up with mostly mutant, others mostly normal.
Leading to patches in the plant.
Exactly.
That can cause variegation, like those leaves with white or yellow patches mixed with green.
It's the visible result of this random sorting of organelles during cell division.
Okay, fascinating.
So we have the blueprint, we know where it is.
How does the plant actually read and use it?
Gene expression regulation.
This is key.
Getting from a gene to a functioning protein is tightly controlled.
The right amount, right time, right place.
And the control happens at different levels.
Multiple levels.
The main one is transcription, deciding whether to even make the RNA copy.
But then there's post -transcriptional control, what happens to the RNA, and post -translational control managing the protein itself.
Let's start a transcription.
How does the machinery know where to start?
RNA polymerase II, the enzyme for most protein -coding genes, needs to find the promoter region of a gene.
This promoter has a core part right near the start site, plus other regulatory sequences nearby and sometimes far away.
And the core promoter has specific signals, like the TATA box.
Yeah, conserved elements like the TATA box, an initiator element, INR, maybe a DPE or BRE.
These bind general transcription factors, which then help recruit the RNA polymerase.
And other sequences nearby fine -tune it.
Right.
Propsimal promoter elements like the CC box or GC box bind specific transcription factors.
These DNA sequences are cis -acting elements.
The proteins that bind them are trans -acting factors.
And some regulatory bits can be really far away, enhancers.
Tens of thousands of base pairs away sometimes.
These bind activator or repressor proteins.
How do they work from so far?
The DNA loops.
It's flexible.
The DNA can actually bend around so that factors bound way out at an enhancer can physically touch the transcription machinery assembled at the promoter, influencing its activity.
The diagrams show this looping really well.
Flavor.
And how does it know when to stop comping?
There are termination signals in the DNA.
In plants,
specific sequences signal the end and also trigger the addition of that poly -A tail to the mRNA, which helps protect it.
Okay.
And epigenetics comes back in here too, right?
Controlling access.
Absolutely.
Those epigenetic marks, chemical tags on DNA or histones are crucial for transcriptional control.
They don't change the sequence, but they change gene activity.
Like DNA methylation, adding methyl groups to cytosines.
Yes.
In plants, it happens at CG, CHG, and CHH sequences.
Enzymes put the marks on, others take them off.
Methylation, especially in promoters, usually means off transcriptional silencing.
And modifying the histones.
Yeah.
Those histone tails can be methylated, acetylated, phosphorylated, lots of things.
Acetylation usually opens chromatin up, promoting transcription.
Methylation can activate or repress depending on which specific amino acid gets methylated.
It's complex.
Like a code.
A histone code.
That's the idea.
These marks are read by other proteins, including chromatin remodeling complexes.
These use ATP energy to physically slide or restructure nucleosomes, opening or closing access to the DNA.
So transcription is done, mRNA is made.
Is it smooth sailing to protein synthesis now?
Not quite.
There's post -transcriptional control.
The mRNA gets processed introns spliced out, a cap added, the poly a tail added, then exported.
But its lifespan is also regulated.
It doesn't last forever.
No.
N -LIMES can chew it up from the ends once the cap or tail is removed.
This is a way to quickly stop making a protein when it's not needed anymore.
Specific sequences in the mRNA can affect how stable it is.
And then there's RNA interference, RNAi.
That's post -transcriptional too.
A major one, yes.
It uses small non -coding RNAs to silence gene expression, often by targeting mRNA.
It's triggered by double -stranded RNA.
Where does DSRNA come from?
Several sources.
MicroRNAs, muRNAs, are encoded by the plant's own genome and regulate development.
Short interfering RNAs, CERNAs, can come from transposons, viruses, or even artificially introduced genes, transgenes.
And these get chopped up?
Yes, by enzymes called dicer -like, or DCLs, into small 2124 nucleotide pieces.
Then what happens?
These small RNAs get loaded into a complex called RISC, the RNA -induced silencing complex, which contains an argonaut protein.
And RISC uses the small RNA as a guide.
Exactly.
If it's in mRNA, RISC usually finds a target mRNA.
The argonaut protein can then slice the mRNA, destroying it, or sometimes just block it from being translated into protein.
And CERNAs, especially those from transposons.
They often guide RISC back to the DNA sequence they came from.
This recruits enzymes that add methylation to the DNA and histones, leading to chromatin compaction and long -term silencing.
It's a key way plants keep transposons quiet.
And it works against viruses, too.
Yes, it's a major antiviral defense.
The plant makes CERNAs from the viral RNA, which then target the viral RNA for destruction, or the viral DNA, if any, for methylation.
Plants can even send these silencing signals systemically.
That petunia example makes sense now.
Adding the gene triggered silencing.
Right.
The plant likely saw the overexpressed RNA, or aberrant RNAs, as foreign or excessive, triggering a CERNA response that silenced both the introduced gene and the plant's own copy, hence co -suppression and the white patches.
OK, mRNA dealt with.
What about the protein itself?
Is its life regulated?
Definitely.
Post -translational control.
Proteins have varying lifespans.
The amount of any protein depends on its synthesis rate versus its degradation rate.
How are proteins targeted for destruction?
A major route is the ubiquitin proteasome system.
Proteins marked for removal get tagged with chains of a small protein called ubiquitin.
By specific enzymes?
Yes.
Particularly E3 ubiquitin legacies, which recognize specific target proteins.
This ubiquitination process uses ATP.
And the tag is a death sentence?
Pretty much.
The tagged protein is recognized by the 26S proteasome, a huge molecular machine that unfolds and chops up the protein.
It's vital for getting rid of damaged proteins and for regulating levels of key signaling proteins, like those involved in hormone responses.
Wow, OK.
So much complexity.
How do scientists figure all this out?
What are the tools?
A cornerstone is mutant analysis.
Find a plant with an interesting trait, figure out the gene that's changed, and infer the gene's function from what went wrong.
How do you find mutants?
You can cause random mutations with radiation or chemicals, then screen thousands of plants,
or use transposin tagging, insert a known DNA tag randomly, find plants with mutations, then use the type sequence to identify the disrupted gene.
Then techniques like map -based cloning help pinpoint the gene's location.
And figuring out when and where a gene is active.
You measure the mRNA.
Older methods like northern blots exist, but now it's mostly genome -wide approaches.
Microarrays were big, but RNAseq is really the standard now.
Remind me how RNAseq works.
You sequence basically all the mRNA from your sample, map those sequence reads back to the genome, and count how many reads hit each gene.
More reads means higher expression.
You can compare different tissues, treatments, mutants.
It gives you a snapshot of the entire transcriptome.
And you can look at proteins, too.
Proteomics.
Yeah.
And metabolites, metabolomics, epigenetic marks, epigenomics.
The whole anomics approach tries to be comprehensive.
Proteomics is still challenging, but uses things like mass spectrometry.
What about watching gene expression live?
You mentioned GFP.
Right.
Gene fusions are great for that.
You link a gene's promoter, its on -off switch, to a reporter gene like GFP, green fluorescent protein.
Wherever that promoter is active, the cell makes GFP and glows green.
Or GUS turns blue when you add a chemical, and lets you visualize expression patterns directly in the plant.
All this knowledge.
Yeah.
Leads to genetic engineering, right?
Yeah.
Modifying plants intentionally.
Exactly.
Humans have done it for millennia via selective breeding, think Teosinte to maize.
But traditional breeding shuffles whole genomes randomly.
Whereas biotechnology is more precise.
Much more.
You can identify a specific gene, even from a totally different organism, and insert just that gene into the plant.
That's a GMO, a genetically modified organism.
And the main tool for doing that in plants is that bacterium.
Agrobacterium.
Agrobacterium tumifaciens, yeah.
It naturally inserts a piece of its DNA, the tDNA, into the plant genome to cause tumors.
Scientists basically hijack this system.
They take out the tumor genes from the tDNA on agrobacterium's T -plasmid, and replace them with the gene they want to introduce, plus usually a marker gene, like for antibiotic resistance, so they can select the transformed cells.
Then they infect plant tissue with this engineered agrobacterium.
And the bacterium injects the desired gene into the plant's DNA.
Yes.
The modified tDNA gets integrated into the plant's chromosomes.
You can then regenerate a whole plant from those transformed cells.
Other methods exist too, like the gene gun shooting DNA -coated particles into cells.
And this has led to crops resistant to herbicides or pests.
Those are major examples, yes.
Engineering crops to tolerate glyphosate herbicide, or to produce bleat toxins that kill specific insect larvae.
It addresses real agricultural problems, though like any technology, it comes with ongoing discussions about safety and environmental impact.
So to wrap up this deep dive based on the chapter, we've covered a huge amount of ground.
We really have, from the basic idea of genotype, phenotype, and environment.
To the structure of the nuclear genome, its size variation, the non -coding parts, chromatin packaging, those key landmarks like centromeres and telomeres.
The whole dynamic world of transposons, the significance and mechanisms of polyploidy.
Then we looked at the organelle or genome's mitochondria and plastids, their origins, inheritance patterns like uniparental inheritance, vegetative segregation.
And then dove into the regulation of gene expression at the transcriptional level with promoters and enhancers, the crucial role of epigenetics like methylation and histone modification.
Post -transcriptional control via mRNA stability, and that whole intricate RNAi system with mirenase and CERNase.
And finally, post -translational control through protein degradation, like the ubiquitin -protesome pathway.
Plus, we covered the tools scientists use mutants, mapping,
RNA -seq reporter genes, and how that leads to genetic engineering applications using things like agrobacterium.
We've definitely summarized the key physiology and mechanisms from the source material.
It shows the plant genome isn't static at all.
It's dynamic, regulated, full of mobile bits, capable of huge changes.
Which makes you wonder, given how complex and dynamic plant genomes are, and how fast they can evolve, what other novel ways might they adapt to future challenges?
What else is hiding in all that DNA we used to just call junk?
That's the exciting part, isn't it?
There's likely still so much more to discover about how these amazing organisms work at their most fundamental level.
ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.
Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.
Support LML ♥Related Chapters
- Regulation of Gene Expression in EukaryotesPrinciples of Genetics
- Regulation of Gene ExpressionCampbell Biology in Focus
- Control of Gene Expression in EukaryotesGenetics: A Conceptual Approach
- Differential Gene Expression: Mechanisms of Cell DifferentiationDevelopmental Biology
- Gene Expression Control in EukaryotesBiochemistry
- Regulation of Gene ExpressionCampbell Biology