Chapter 21: Genomes and Their Evolution
Welcome to Last Minute Lecture.
This free chapter overview is designed to help students review and understand key concepts.
These summaries supplement not replace the original textbook and may not be redistributed or resold.
For complete coverage, always consult the official text.
Hello everyone and welcome back to the Deep Dive.
We are doing something a little distinct today.
Usually we take a stack of articles, maybe a new non -fiction release, and we pull out the golden nuggets.
But today we are going back to school.
Literally.
We really are.
We are taking on the heavy hitter today.
Exactly.
We are diving into Campbell Biology 12th edition.
Specifically, we are tackling chapter 21, which is titled Genomes and Their Evolution.
Now I know what you're thinking.
A textbook chapter.
Is this going to be dry?
And the answer is absolutely not.
No, not at all.
This chapter is, it's actually the pivot point of modern biology.
It really is.
It really is.
It represents the moment we stopped looking at life through a keyhole, you know, studying one gene at a time in isolation.
And we basically kipped the door wide open to look at the entire library of genetic information all at once.
Right.
It changes everything about how we understand life, disease, and even evolutionary history.
So for our listeners, specifically college students out there who might be using this as your ultimate study companion, here is our mission.
We are going to walk you through this chapter section by section, concept by concept, where I'm going to skip the hard stuff.
No, we're going to unpack it.
We're going to translate those really dense paragraphs into something you can actually visualize and hold on to.
And here is the roadmap for the next hour.
We're going to start with the massive revolution in sequencing, like how we actually read the DNA book.
Then we'll move to bioinformatics, which is basically the strategy for how scientists keep from drowning in all that data.
And from there, we'll look at the physical reality of the genome itself, the size, the number of genes, and the density.
And spoiler alert for everyone listening, there are some massive paradoxes there.
Huge surprises.
Then we get into the dark matter of the genome, the non -coding DNA and jumping genes.
We'll look at how genomes evolve through duplication and rearrangement, and we will finish up with comparative genomics in a field called EvoDevo.
Evolutionary developmental biology.
That is the section that explains how you get a giraffe versus a mouse using basically the exact same toolkit.
So grab your notebook, maybe open your text to chapter 21 if you have it in front of you, and let's get started.
Section one, the sequencing revolution.
This is concept 21 .1 in the text.
And this story really begins with the human genome project.
I think it's hard for us sitting here today with things like 23andMe and Ancestry .com to appreciate just how audacious this project was when it started in 1990.
The text describes it as an international, publicly funded consortium.
This wasn't just one guy in a lab coat tinkering away.
This was big science.
Right.
Involved 20 large sequencing centers across six different countries.
The goal was simple to state, but nearly impossible to execute at the time, which was to sequence the entire human genome, read every single letter.
And it was largely completed by 2003.
But the thing the text really emphasizes and which you need to understand for the context of this entire chapter is the speed of progress.
The numbers are honestly mind -blowing.
They really are.
The text draws a huge contrast between the 1980s and the modern era.
In the 80s, a really productive, lab -like, a top -tier lab could sequence maybe 1 ,000 base pairs a day.
Just imagine that.
You are sitting there manually reading a gel, writing down A, T, C, G, 1 ,000 letters a day.
The human genome is 3 billion letters.
So at that rate, it would take...
Thousands of years, literally.
Fast forward to the year 2000, research centers were suddenly doing 1 ,000 base pairs per second.
Per second.
And today, the text mentions these next -generation machines that can sequence nearly 35 million base pairs per second.
It's an exponential leap.
It's like going from walking across the country to teleporting.
And because of that sheer speed, the cost has just plummeted.
I love the financial comparison the chapter provides here.
The first human genome took 13 years and cost somewhere between $500 million and $1 billion.
That is a very hefty price tag for one genome.
Right.
And today, the text says a machine can sequence 48 individual genomes in just 44 hours for under $1 ,000 each.
That democratization of data is exactly what makes this chapter possible.
We went from having one single genome to compare to having thousands of them.
But we need to talk about how they actually do it.
The text highlights a specific methodology called the whole -genome shotgun approach.
Yes, this is crucial to understand.
If you have the book, this is visualized in figure 21 .2.
And shotgun is a very descriptive name for it.
It really is, because the old way was to map everything out carefully, page by page, chromosome by chromosome.
The shotgun approach is messier, but it's way faster.
If you are looking at figure 21 .2, it breaks this down into three distinct steps.
Okay, so imagine you have a massive encyclopedia volume.
That's your chromosome.
What's step one?
Step one is to basically shred the book.
Seriously.
You cut the DNA from many copies of a chromosome into overlapping fragments.
In the figure, these look like scattered little red curves.
You don't try to keep them in order at all.
You just chop them up.
Okay, so you have a massive pile of confetti.
What's step two?
Step two is cloning and sequencing each of those fragments.
Step two is cloning and sequencing each of those fragments.
letters on every little shred.
Since they are small, the sequencing machines can handle them easily.
But now you just have millions of little sentences with no page numbers.
How do you get the book back together?
Because that's step three, right?
Exactly.
Step three is where the computers come in.
You order the sequences into one continuous strand.
And the key word here is overlapping.
Because you shredded many different copies of the chromosome, the fragments overlap with each other.
So the computer just looks for matches.
Exactly.
If fragment A ends with G -A -T -A -C -C and fragment B starts with G -G -T -A -C -C, the computer infers they probably belong together.
It stitches the sequence back into one continuous strand.
It's like putting together a jigsaw muzzle, but you are using the letter patterns on the edges of the pieces to figure out exactly where they go.
And this brings us to a bit of historical drama mentioned in the text.
You had the public consortium, which was being very methodical and map -based.
And then in 1998, Jay Cranston, Greg Venter, and his company, Solera Genomics, stepped in.
The disruptors of the biology world.
Venter basically said, we can do this faster using the shotgun method for the whole genome at once.
And it was controversial.
Highly controversial.
People thought the computer wouldn't be able to handle the repetitive parts of DNA, that it would assemble the puzzle wrong because too many pieces looked identical.
But he proved that with enough computing power, it actually worked.
Today, this shotgun strategy is basically the standard.
So we have the sequence.
We have the data.
But data isn't knowledge.
That leads us directly to section two, which is concept 21 .2.
Bioinformatics.
Right.
The text defines bioinformatics as the application of computational methods to store and analyze biological data.
Because you obviously can't analyze three billion letters with a pen and paper.
You need sophisticated algorithms.
And the text highlights a specific database that every biology student needs to know.
NCBI.
The National Center for Biotechnology, Information.
They maintain the database.
It's like the Library of Congress for DNA.
And they have a tool called BLAST.
I feel like every biology student ends up using BLAST eventually.
It's a rite of passage.
They absolutely do.
It stands for Basic Local Alignment Search Tool.
It's essentially the Google of genetics.
The text uses figure 21 .3 to show you how to interpret a BLAST search.
This is a visual skill you definitely might need for an exam.
So walk us through figure 21 .3.
What are we actually looking at?
Okay.
The computer window.
At the top, you have your query sequence.
In the figure, this is a partial amino acid sequence from a musk melon.
A musk melon.
Okay.
So we have this string of protein letters from a melon, and we have absolutely no idea what it does.
Right.
So you run a BLAST search.
The figure shows the query aligning with sequences from other organisms.
Specifically, it shows a match with a nematode worm and a yeast cell.
And we are looking for similarities between them.
We are looking for matches.
The figure highlights these yellow sections, called the WD -40 domain.
Now that sounds like a lubricant, but it's definitely not.
No.
Not the spray can in your garage.
It's a specific structural motif in the protein.
Because the musk melon sequence matches the WD -40 domain in the yeast and the nematode, and we know what the WD -40 domain does in those organisms, we can predict the function of the musk melon protein.
That is the power of bioinformatics right there.
You don't have to start from scratch in the lab.
You can use the evolutionary connections that are already mapped, mapped out.
Precisely.
If it looks like a duck in yeast, and it quacks like a duck in a nematode, it's probably a duck in a melon.
And this ability to look at everything at once has shifted biology from just studying individual genes to something called systems biology.
This is a key definition in the text.
It's the shift from reductionism to integration.
Instead of just looking at one gear, you look at how the whole engine runs.
The text mentions genomics, which is studying whole sets of genes, and proteomics, which is studying sets of proteins.
And there is a really cool, slightly chaotic visual for this in the book.
Figure 21 .4 is the protein interaction map of yeast.
Yes.
If you look at figure 21 .4, it looks like a giant hairball or a really dense constellation.
You have thousands of dots connected by lines.
How do we even begin to interpret this map?
The dots represent proteins.
The lines represent interactions, meaning those two proteins touch or work together physically inside the cell.
The colors are important, too.
The figure groups proteins by their function.
So all the proteins involved in RNA processing might be red, and you see them all clustered tightly together in one part of the web.
It really illustrates that the cell is a dynamic network.
It's not just a bag of loose parts floating around.
Everything is deeply connected.
And the text gives a great medical application for the systems approach, which is cancer research.
Instead of looking for one single cancer gene, researchers can compare the gene sequences and expression patterns in cancer tissue.
They can see how the whole network changes.
It's a much more powerful way to understand complex diseases.
Okay, let's move on to section three, concept 21 .3.
This is about the physical reality of genomes, size, number, and density.
And this is where our intuition often fails us entirely.
Completely.
We have this very human -centric bias.
We think humans are complex, bacteria are simple, so humans must have way more DNA.
That's the logical assumption.
But let's look at the statistics.
The text compares the three domains, bacteria, archaea, and eukarya.
Bacteria and archaea generally have small genomes, but eukaryotes, it's all over the place.
The text calls this the genome size paradox.
And they give a specific example to highlight this.
Fritillaria.
It's a flowering plant.
A fritillaria.
It looks like a standard beautiful garden flower, but it has a genome that is 40 times the size of the human genome.
40 times.
Yes.
So clearly, genome size does not correlate with phenotypic complexity.
A lily is not 40 times more complex than you are.
That is a humbling fact for us humans.
What about the number of genes?
Also very surprising, the text contrasts E.
coli, which has about 4 ,400 genes, with humans.
How many do we actually have?
The current estimate cited in the text is about 21 ,300.
That seems really low.
It is low.
When the Human Genome Project started, scientists actually placed bets on this.
They thought there would be 50 ,000 to maybe 100 ,000 genes to explain human complexity.
But we only have a few.
We have about 21 ,000.
To put that in perspective, the text mentions that maize regular corn has 32 ,000 genes.
Corn has more genes than us.
It does.
We have fewer genes than a stalk of corn.
So if we don't have more genes, how are we more complex?
The text explains that vertebrates are very good at something called alternative splicing of RNA transcripts.
Basically, we can use one single gene to make three or four different proteins.
So even though our gene count is low, our functional protein count is very high.
That makes perfect sense.
We're just more efficient with the file compression, so to speak.
Now, there's another metric here.
Gene density.
This is the number of genes per given length of DNA.
And this really explains the size difference we were just talking about.
Bacteria have very high gene density.
Their genomes are incredibly efficient.
There's very little wasted space.
Humans and other mammals have very low gene density.
Why is that?
Because of what lies between the genes.
The vast amount of non -coding DNA.
Humans have methamphetamine.
Massive stretches of DNA that don't code for protein, as well as introns within the genes themselves.
Which is the perfect segue to section four, concept 21 .4, the nature of non -coding DNA.
This is one of the most important concepts in the entire chapter.
For a long time, this was called junk DNA.
But the text makes it very clear that label is totally outdated.
This DNA plays crucial roles.
Let's look at figure 21 .6.
It's a pie chart of the composition of the human genome.
And this visual is honestly shocking.
It is.
If you look at the slice of the pie that represents exons, the regions that actually code for proteins, rRNA or tRNA, it is tiny.
It's 1 .5%.
Just 1 .5%.
That means 98 .5 % of your genome does not code for proteins.
So what is the rest of it doing?
What is it made of?
The pie chart breaks it down for you.
You have introns, which are cut out during RNA processing.
That's about 20%.
You have unique non -coding DNA.
But a huge chunk, about 44 % of the whole pie, is made up of repetitive DNA.
Specifically, transposable elements.
Jumping genes.
Exactly.
And this is where the history of biology gets really good.
The text tells the story of Barbara McClintock.
Barbara McClintock is an absolute giant in the field.
In the 1940s and 50s, she was studying Indian corn.
Maize.
And she noticed something strange about the colors of the kernels.
Right.
If you look at figure 21 .7, you see corn with spotted kernels.
Some are purple.
Some are yellow.
Some are spotted.
McClintock deduced that these changing colors weren't just random mutations.
They were evidence of geneticism.
She proposed that a genetic element could move from one location and land right in the middle of a pigment gene.
If it lands there, it breaks the gene.
No purple pigment gets made, so the kernel is yellow.
If it jumps out again, the gene works again, and you get a purple spot.
And the text mentions that at the time, people just didn't believe her.
It was a radical idea.
Genes were supposed to be stable, fixed particles on a string.
The idea that they could hop around was basically heresy.
But she was right.
She eventually won the Nobel Prize for discovering these transposable elements.
The text distinguishes between two different types of these mobile elements.
Transposons and retrotransposons.
We need to clarify the difference here because it explains why our genome is so big today.
It's all about the mechanism of how they move.
Transposons move via a DNA intermediate.
The text calls it a cut -and -paste mechanism.
So the DNA physically moves from one spot to another.
Right.
Right.
Right.
Right.
It literally cuts it out of spot A and pastes it into spot B.
Contrast that with retrotransposons.
These move via an RNA intermediate.
Retro, like a retrovirus.
Exactly.
The retrotransposon is transcribed into RNA first.
Then an enzyme called reverse transcriptase turns that RNA back into DNA, which then gets inserted somewhere else in the genome.
So the original stays put, and a brand new copy lands somewhere else.
It's strictly a copy -and -paste job.
Correct.
And because it always leaves a copy behind,
retrotransposons can really grow.
They can grow the size of the genome over evolutionary time.
This is exactly why 44 % of our genome is repetitive stuff.
It's ancient elements copying themselves over millions of years.
Okay.
Moving on from dumping genes, the text mentions another type of repetitive DNA,
STRs, short tandem repeats.
This is what the text calls simple sequence DNA.
These are just short units, like GTTAC, repeated over and over.
GTTTAC, GTTAC, GTTAC.
And the key here is, is the variation between people, right?
Yes.
The number of repeats varies wildly between individuals.
I might have 10 repeats at a certain spot.
You might have 15.
This is the entire basis of genetic profiling or DNA fingerprinting used in forensics.
So when you watch a crime show and they match the DNA to the suspect, they're usually looking at these STRs.
Exactly.
Now, not all DNA is repetitive dunk or unique single genes.
Some genes come in families.
Gene families.
The text uses the globin family as the prime example here.
Hemoglobin.
The stuff that carries oxygen, your red blood cells.
It's not just coded by one single gene.
Let's look at figure 21 .7.
It shows the globin genes on completely different chromosomes.
You have the alpha globin family on chromosome 16 and the beta globin family on chromosome 11.
But what the figure really emphasizes is that you have different versions of these genes for different times in your life.
Right.
The figure shows there's an embryonic version, a fetal version, and an adult version.
And that's a crucial adaptation.
The fetal hemoglobin needs a higher affinity for oxygen so it can affect the fetal hemoglobin.
It can effectively steal oxygen from the mother's blood across the placenta.
By having a family of genes, we can switch them on and off as we develop.
Which naturally leads us to the question, where do these families come from in the first place?
How does a genome get a family of genes?
That brings us to section five, mechanisms of genome evolution.
This is concept 21 .5.
The central theme here is duplication.
You can't evolve a new function if your only copy of a gene is busy doing a critical job keeping you alive.
It's a spare tire to experiment with.
So duplication provides the raw material for evolution.
Let's talk about the biggest scale of duplication first,
polyploidy.
Polyploidy is the duplication of entire sets of chromosomes.
Instead of two sets, which is diploid, you get three or four.
This happens due to accidents during meiosis.
The text notes this is common in plants, but usually lethal in animals.
Right.
But in plants, it can actually lead to instant speciation.
Wheat, tobacco, chrysanthemums, these are all, all polyploids.
Moving down in scale, let's look at alterations of chromosome structure.
There is a smoking gun in human evolution mentioned here regarding our chromosomes versus chimpanzees.
This is fascinating.
Humans have 23 pairs of chromosomes.
Chimpanzees have 24 pairs.
So where did the missing pair go?
It didn't disappear.
It fused.
The text explains that human chromosome 2 is virtually identical to two distinct ancestral chromosomes that are still found separately in chimpanzees.
Specifically, chromosomes 12 and 13, in the chimp.
So somewhere in our evolutionary lineage, two chromosomes stuck together end to end to form our chromosome 2.
Exactly.
And the evidence is right there in the sequence.
We see the banding patterns match up perfectly.
We see remnants of telomeres, which are usually only at the very ends of chromosomes, sitting right in the middle of chromosome 2.
It's exactly like a scar from the fusion.
Now let's zoom in even further to the duplication of gene -sized regions.
How do we get two copies of a single gene?
The primary mechanism discussed is unequal crossing over.
We need to visualize figure 21 .13 here to understand this.
Okay.
Picture two homologous chromosomes lining up during meiosis.
Usually they align perfectly side by side, but if you have those repetitive transposable elements we talked about earlier, the chromosomes might get confused by the matching sequences and misalign.
So they line up slightly off -center.
Right.
So when crossing over happens, when they swap chunks of DNA, one chromated gives up a chunk, but doesn't get one back.
It ends up with a deletion.
The other chromated gets an extra chunk.
It has a duplication.
So now that second chromosome has two copies of the gene sitting next to each other.
And this is exactly how gene families are born.
The text uses the globin genes again as the model in figure 21 .4 time.
Walk us through that evolutionary model.
You start with a single ancestral globin gene.
Hundreds of millions of years ago, there was a duplication event.
Now you have two copies.
One keeps doing the original job, of carrying oxygen.
The other is free to accumulate random mutations without killing the organism.
It has the freedom to experiment.
Exactly.
Divergence.
Over time, these copies drifted apart structurally to become the ancestors of the alpha and beta families.
Then they duplicated again and again to create the modern complex families we discussed earlier.
So duplication leads to divergence.
The text gives another really interesting example of this.
Lysozyme and alpha -lectalbumin.
This is a classic.
This is a classic textbook example of genes with novel functions.
Lysozyme is an enzyme that breaks down bacterial cell walls.
It protects us from infection.
You have it in your tears and saliva.
And alpha -lectalbumin.
It's a non -enzymatic protein that plays a role in milk production in mammals.
Those seem totally unrelated.
They do at first glance.
But structurally, the amino acid sequences of the proteins are very similar.
The genetic evidence suggests the lysozyme gene duplicated in the mammalian ancestor.
One copy kept fighting bacteria.
The other kept fighting bacteria.
The other copy mutated over time and took on a completely new role in milk production.
That is evolution in action.
Repurposing old tools for completely new jobs.
And sometimes it's not just whole genes that get duplicated or moved.
Sometimes it's pieces of genes.
This is called exon shuffling.
Figure 21 .16 illustrates this beautifully with the TPA gene.
Tissue plasminogen activator.
It's a vital protein that helps dissolve blood clots.
It's actually a lifesaver for heart attack victims.
The figure shows it looks like a blood clot.
It's almost like a patchwork cold.
It is a patchwork.
The TPA gene contains exons coding segments that are identical to exons found in three other completely different genes.
Epidermal growth factor, fibronectin, and plasminogen.
So it's like someone took Lego bricks from three different sets, say a castle, a spaceship, and a fire truck, and built a completely new car.
That's a perfect analogy.
Occasional errors in meiotic recombination mixed and matched these functional domains, these exons, to create a protein with a brand new, highly complex, function.
Incredible.
Okay, we've reached the final section.
Concept 21 .6, comparative genomics and evo -devo.
This is where we step back and look at the big picture.
We compare different genomes to understand evolutionary history.
The text starts with the very deep history.
Broad comparisons between the domains of bacteria, archaea, and eukaryotes.
Figure 21 .17 shows the tree of life.
It confirms that these three domains diverged somewhere between two and four billion years ago.
But despite that, it's not a big deal.
But despite that vast amount of time, we still share core genes.
There is a specific study mentioned in the text where human genes were put into yeast.
Yes, this is a mind -blowing experiment.
They replaced essential yeast genes with the human versions of those same genes.
And in 47 % of the cases, the human gene worked perfectly in the yeast cell.
The cell survived.
That is wild to think about.
We're separated by billions of years of evolution, but the core cellular machinery is so conserved that our parts are literally interchangeable.
It really emphasizes the importance of the human genes.
It really emphasizes the fundamental unity of life.
Now let's look at more closely related species, humans and chimpanzees.
We differ by only about 1 .2 % in single nucleotide substitutions, but obviously we are quite different organisms phenotypically.
So scientists look very closely for the specific genes that are different.
And the star of this section is the FOXP2 gene.
The FOXP2 story.
This is a gene that functions in vocalization invertebrates.
The text mentions that humans with mutations, in this specific gene, suffer from severe speech and language deficits.
Right.
So researchers wanted to test its function.
They looked at the gene in mice, and they actually did a genetic swap.
They replaced the mouse FOXP2 gene with the human version.
And what happened?
Did the mice start talking?
No, they didn't start reciting Shakespeare.
Yeah.
But their ultrasonic vocalizations, the tiny squeaks they make to communicate, actually changed.
The underlying structure of the sound was altered by the human gene.
That strongly suggests this gene played, a major role in the evolution of human speech.
Exactly.
It shows how we can use comparative genomics to pinpoint the exact molecular changes that make us human.
Finally, we arrive at evo -devo, evolutionary developmental biology.
This field asks a fundamental question.
How do you build a different body shape?
Do you need all new genes to get a new shape?
And the answer seems to be a resounding no.
No.
You just use the same old genes differently.
The key players here are the homeotic genes, specifically a type called, the Hox genes.
Figure 21 .18 is the primary visual here.
It compares a fruit fly embryo and a mouse embryo.
And it is beautifully color -coded to show the relationships.
You see a sequence of genes on the chromosome, colored yellow, orange, red, and so on.
And then you see where those genes are expressed in the embryo.
So the yellow gene controls the development of the head.
The red gene controls the thorax.
Exactly.
And the amazing thing is, the order of the genes on the chromosome is the exact same in the fly and the mouse.
The mouse has four sets of these genes, due to duplication.
But the sequence and the function telling the embryo, this is the front, this is the back, is conserved for hundreds of millions of years.
And the text defines the homeobox here.
That specific 180 nucleotide sequence within these Hox genes that is widely conserved across animals.
So anatomical diversity doesn't come from inventing new genes from scratch.
It comes from regulatory changes.
Explain that for us.
The text mentions crustaceans and insects.
They have varied genes.
They have very different body plans.
Different numbers of legs, wings, different segments.
But they use the exact same Hox genes.
The difference is where and when those genes are turned on during development.
So it's like having the same set of light switches in a house, but flipping them in a different pattern creates a totally different mood or function for the room.
Perfect.
Small changes in regulation, just changing when the switch flips, can lead to massive changes in body form over evolutionary time.
That is the core lesson of Evodevo.
So what does this all mean for us?
Let's synthesize this whole change.
We started with sequencing, the modern ability to quickly read the book of life.
Then we saw that the book is full of surprises, jumping genes, ancient viruses, duplicated pages.
And we learned that evolution is essentially a very messy editor.
It cuts, pastes, duplicates, and shuffles pieces around.
But it's also a highly conservative editor.
It keeps what works.
The machinery that builds a yeast cell or a fruit fly is largely the exact same machinery that builds you.
The unity of life is found in the parts, the incredible diversity is in the assembly.
Well said.
I want to leave you with a final thought from the very end of the chapter.
We often focus on how genes make us,
but we are now entering an era where we can read and potentially rewrite those genes.
The genome is no longer just a history book.
It's becoming an instruction manual that we are learning to edit.
What happens when the editor of the genome is no longer natural selection, but us?
That is the big question for the next century of biology.
Definitely something to mull over.
Thank you for listening to this deep dive into Campbell Chapter 21.
Good luck with your studies, and a warm thank you from the entire Last Minute Lecture team.
Happy studying, everyone!
ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.
Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.
Support LML ♥Related Chapters
- Genomes and Their EvolutionCampbell Biology in Focus
- Genomics and ProteomicsGenetics: A Conceptual Approach
- Cells, Genomes, and the Diversity of LifeMolecular Biology of the Cell
- DNA, Chromosomes, and GenomesMolecular Biology of the Cell
- Evolution: Molecules, Genes, Cells & OrganismsMolecular Cell Biology
- Evolutionary GeneticsGenetics: A Conceptual Approach