Chapter 22: Quantitative Genetics and Complex Traits

Search this chapter

Audio Overview

0:00 / 0:00

Autoplay next chapter

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replaced the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

Welcome back to The Deep Dive.

Our mission today is to take a stack of challenging source material and turn complexity into clarity.

And we are definitely plunging into one of the most intellectually rigorous and, well, often socially relevant areas of biology, quantitative genetics.

It really is.

It feels like the field that was built to solve a massive biological puzzle.

It is, because if you look at the world around you, you see traits that just, they defy the simple Mendelian rules we all learned first.

Mendel's peas, you know, they were either tall or short, yellow or green.

Right.

They were distinct categories, what we call discontinuous traits.

Exactly.

They fall into neat boxes.

But look at us.

No one is simply tall or short.

We exist on a spectrum.

We're talking about continuous variation.

Whether it's human height or the exact weight of a newborn, or even something as detailed as the ridge patterns on your fingerprints,

these traits show the smooth, wide, continuous range of phenotypes.

They don't land in neat three -to -one boxes.

And that's the central question we're tackling.

If heredity is governed by these discreet, individual genes,

how do we end up with this smooth, continuous spectrum?

So our mission today is to unpack the analytical toolkit, the stats, the molecular methods,

all the things geneticists use to figure out how multiple genes.

What way genes?

Solid genes, right, interact with the environment to create all this complexity.

It's the only field that explicitly tries to answer that age -old nature versus nurture question, but with mathematical rigor.

Okay, let's frame this.

What are the core questions quantitative geneticists are always asking?

Well, the field really boils down to about six core questions.

First, and this is the big one, you have to partition the influence.

To what degree does the total phenotypic variation we see result from differences in genotype versus differences in the environment?

The fundamental split, nature versus nurture, right there in an equation.

Right.

Second, okay, so genetics is involved, but how many genes are we talking about?

Is it three?

Is it 30?

A hundred.

Finding the number of loci is incredibly hard when their individual effects are so small.

They're all masked by each other.

Exactly.

Third, are their contributions equal?

Is every gene adding the same small amount to the final height, for example?

Or are there a few major effect genes doing the heavy lifting, with the rest just being minor modifiers?

That would totally change how you'd try to additive.

Meaning, if one allele adds an inch, do two of them add exactly two inches, or do they interact?

You mean like dominance, where one masks the other, or a pistasis, where they interact across different genes?

Decisely.

That complicates the math significantly.

Fifth, if we apply selection, a breeder choosing the biggest cow or nature favoring drought resistant plants, how fast can that trait actually change?

That's basically measuring the speed of evolution.

It is.

Finally, number six,

what are the side effects?

If we select for just one thing we want, do we unintentionally drag another, maybe a detrimental trait along for the ride?

Those are what you call genetic correlations.

Right.

And you have to understand those constraints before you can do anything useful.

These six questions,

they're really the backbone of our whole deep dive.

Okay, so to really get this, we need to firm up that first distinction you made.

Discontinuous versus continuous?

Let's do it.

Discontinuous traits, or qualitative traits, are the easy ones.

They're defined by discrete classes, you know, like the Snailsapia numeralis.

Its shell is either brown, pink, or yellow.

That's it.

You're in one category.

And that simplicity means the genotype -phenotype relationship is pretty clear.

You know, the genotype at maybe one or two loci, you can predict the color, it's a simple mapping.

Exactly.

Continuous traits, or quantitative traits, are the opposite.

You have to describe them with numbers.

Millimeters, kilograms, bushels per acre.

They show a wide, bill -shaped distribution.

Most individuals are near the average, with fewer and fewer out at the extremes.

So after Mendel's work was rediscovered, people saw these continuous traits and thought, well, Mendel must be wrong, or at least incomplete.

How did the field reconcile this blending look of height with the particulate nature of genes?

That was the big intellectual jump.

And it brings us to the polygene hypothesis.

Okay.

The groundwork was laid by Wilhelm Johansson in 1903.

He was studying seed weight in beans.

And he was the one who showed, crucially, that the variation he saw was multi -factorial.

Meaning both genetic and environmental.

Both.

He proved it.

And since a single gene couldn't explain that smooth curve he was seeing,

the logical next step was to propose that many genes, each contributing a tiny amount, must be controlling the trait.

Hence, polygenes.

Many genes.

And the classic, just beautiful experiment that proved the polygene hypothesis came from Herman Nilsen L.

in 1909 with wheat kernel color.

This is the landmark.

It showed how continuous variation emerges naturally from discrete Mendelian parts.

Let's walk through that cross, because the numbers are the whole story here.

Okay.

So you crossed a true breeding dark red kernel plant with a true breeding white kernel plant.

The P generation.

And the F1.

The F1 generation was the first clue.

All the F1 plants were uniform, and they all had an intermediate red color.

Uniformity means they're all genetically identical.

All heterozygous.

And intermediate.

Well, that looks a lot like blending, which is what the anti -Mendelians were arguing for.

Exactly.

It looked like they were right.

But then he crossed the F1s to get the F2 generation, and that's where the truth came out.

He didn't get a simple 3 to 1 red to white ratio.

Nope.

He didn't even get a 15 to 1 ratio, which you might expect with two simple dominant genes.

What he found was that he could sort the F2 kernels into five distinct classes of color.

Dark red, medium red, intermediate red, light red, and pure white.

And when he counted them, he got a very specific ratio.

A very specific one.

Approximately 1 to 4 to 6 to 4 to 1.

And that ratio is the historical lightning strike.

Why is that number so important?

Because the 1 .4 .6 .4 .1 ratio perfectly fits the binomial expansion of A plus B to the fourth power.

It's a mathematical fingerprint that proves the color was controlled by two independently segregating gene pairs.

So let's call them R and C.

The dark red parent was big R, big R, big C, big C.

Four contributing factors.

The white parent was little r, little r, little c, little c.

Zero contributing factors.

And the F1 was heterozygous for both.

Big R, little r, big C, little c.

It has two contributing factors, hence the intermediate red color.

This gets us to the idea of additive alleles.

Yes.

The interpretation is just so elegant.

We can classify the alleles as either contributing, like big R and big C, or non -contributing, like little r and little c.

And the final color, the phenotype, is just a function of the total number of contributing alleles you have.

It doesn't matter which gene they come from.

It's the count that matters.

That's it.

Four contributing alleles.

Like in the RRCC parent, you get dark red.

Three, you get medium red.

Two, contributing alleles.

Well, there are a few ways to get that genotype, which is why the six in the ratio is the biggest group.

That gives you intermediate red.

And this proved the principle of additive effects.

Each contributing allele adds a small, equal, predictable amount to the final phenotype.

It was the intellectual bridge.

It showed that the apparent blending is just the result of many discrete Mendelian genes adding their tiny effects together, plus a little environmental noise.

It saved Mendel and created a whole new field.

So that sets the stage perfectly.

We know continuous traits are polygenic and environmental.

But if you can't use simple ratios, you can't track individual genes.

You have to turn to statistics to describe whole populations.

This is where genetics becomes explicitly mathematical.

The effect of any single gene is just lost in the noise of all the others.

So we have to analyze the population's characteristics, specifically its variation.

Back to that first equation.

VP equals VG plus VE.

The total phenotypic variance is the sum of you need good data.

And that starts with sampling.

So critical.

You define your population, say, every single newborn in New York City this year.

And then you take a sample, which is the subset you actually measure.

And for your estimates to mean anything, that sample has to be large and more importantly, has to be random.

If you only sample babies from one hospital in a wealthy neighborhood, your estimates for the whole city are going to be completely biased.

Completely.

So once you have that good random sample, you start to describe its shape.

Not with ratios, but with a frequency distribution.

Which is basically a histogram.

You group your data into bins like 100 -milligram weight classes for beans, and you count how many fall into each bin.

And for so many continuous traits, when you plot that histogram, you get that familiar, symmetrical bell -shaped curve.

The normal distribution.

And the great thing about the normal distribution is you can describe the entire complex data set with just two key numbers.

Exactly.

The first is the mean, which is just the average.

It tells you the center of the distribution.

Let's use EM East's tobacco flower study to make this concrete.

He crossed a short flowered strain with a mean of about 40 millimeters, with a long flowered strain with a mean of 93 millimeters.

And the F1 generation had a mean of about 63 millimeters.

Right in the middle.

So the mean immediately tells you the F1 is intermediate.

It gives you the expected value, the starting point.

Right.

But the mean highs a lot.

Two groups can have the same average, but one could be super uniform, and the other could be all over the place.

Which brings us to the second number, the measure of spread, the variance.

Variance is the engine of this entire field.

It measures the spread around the mean.

Low variance, you get a tall, narrow curve.

High variance, you get a broad, flat curve.

And the calculation is, it's a bit specific.

It's the squared deviation from the mean.

Why squared?

Great question.

Two reasons.

First, if you just average the deviations, the positive ones and negative ones would cancel out and you'd always get zero.

Squaring makes every deviation, whether above or below the mean, a positive number that contributes to the measure of spread.

It gets rid of the direction.

It does.

And second, we divide by N1 instead of N.

That's a statistical correction that gives us a better, less biased estimate of the true population variance, especially when our sample size is small.

Okay, so variance is powerful, but it's in squared units, like square millimeters, which is weird to think about.

It is.

So for description, we usually take the square root of the variance and that gives us the standard deviation.

And the standard deviation is back in the original units, like millimeters, so it's easier to interpret.

And it's incredibly useful.

For any normal distribution, about two thirds of your data falls within one standard deviation of the mean, and 95 % falls within two.

It's a quick way to understand the spread.

Let's go back to East's Tobacco to show why you need both the mean and the variance.

Okay, so the F1 generation had a mean of about 63 millimeters and a very low variance, only about 8 .6.

Because they were genetically uniform.

Right.

The F2 generation had a similar mean, about 68 millimeters, but the variance exploded to 42 .4.

It was five times bigger.

So even though their averages were close, the F2 population was way more variable.

And since the environment was the same for both, that extra variability must be genetic.

It's the result of all those alleles segregating and recombining from the heterozygous F1 parents.

You can only see that fundamental genetic difference by calculating the variance.

Okay, so that's describing one trait.

But often we want to know how two traits relate to each other.

That's where correlation comes in.

Correlation measures the strength and direction of the association between two variables, like arm length and leg length.

The actual number you get, the correlation coefficient r, ranges from minus one to plus one.

Plus one is a perfect positive correlation.

As one goes up, the other goes up.

Minus one is a perfect negative correlation.

As one goes up, the other goes down.

And zero means there's no linear relationship at all.

But this is where we have to put up the big flashing warning sign.

We have to.

We absolutely have to.

Correlation does not imply causation.

Just because two things are associated doesn't mean one causes the other.

The textbook example is always the best.

There's a strong positive correlation in cities between the number of ministers and the amount of liquor consumed.

It sounds absurd.

Because it is.

You wouldn't conclude that ministers are causing more drinking or that drinking is causing more ministers.

Right.

There's a third factor population size.

Bigger cities have more of both.

Exactly.

And failing to see that third factor is one of the most common errors in science and in, well, in daily life.

Correlation just shows you a pattern.

It's not an explanation.

So if correlation tells us if two traits are related, regression tells us how to predict one from the other.

Regression is about defining the precise quantitative relationship.

We fed a straight line to the data, the regression line, that best describes the trend.

And the key is the slope of that line, the regression coefficient.

The slope is your predictive power.

If we're regressing a son's height against his father's height, and we get a slope of .5, that means for every one -inch increase in the father's height, we predict, on average, a half -inch increase in the son's height.

And that predictive power is fundamental to how we estimate heritability, which we'll get to.

And the last tool in the box, the one that lets us slice up the variance pie,

is the Analysis of Variance, or InnoVe.

InnoVe.

It's a statistical workhorse.

First, it can tell you if the mean differences between groups are significant.

But for us, its most important job is partitioning the variance.

It's the tool that lets you take that total VP.

And attribute specific percentages to different causes.

You can say, okay, 40 % of the variation in corn yield is due to the different fertilizer treatments, the environment, and 60 % is due to the different genetic strains we planted.

Without InnoVe, we couldn't really do quantitative genetics.

So let's apply all that to another classic experiment.

The Emerson and East Corn Ear Length Study from 1913.

Perfect example.

They started with two pure breeding strains.

One had short ears, around 6 .6 cm, and one had long ears, around 16 .8 cm.

And because they were pure breeding, any little variation within those parent groups was basically all environmental, all VE.

That's the assumption.

Then they crossed them to get the F1 generation.

And what did they see?

First, the F1 ear length, 12 .89 cm, was right in between the two parents.

Second, the F1s were very uniform, low variability, just like the parents.

This confirms they're all genetically identical heterozygotes, and their variance is almost purely environmental.

Then came the F2, from crossing the F1s.

Right.

And here, observation three.

The mean ear length was about the same as the F1, but the variability exploded.

The variance shot way up.

That's the smoking gun for genetic variance, VG.

All the recombination and segregation in the F2 created a huge new array of genotypes.

Exactly.

And finally, observation four.

The range of ear lengths in the F2 was much wider.

The extremes in the F2 actually stretched out and got close to, and in some cases even surpassed, the original parental values.

Wait, how can the F2 be more extreme than the original parents?

Weren't the parents the most extreme genotypes they started with?

That's the magic of recombination.

The short parent might have had some long ear alleles, and the long parent might have had some short ear alleles.

Oh, I see.

So if the short parent was, say, AA little b little b, and the long parent was a little ABB.

Then recombination in the F2 can create a new combination, like AABB, which might be even longer than the original long parent.

It shows that the genetic variation was truly reshuffled and maximized in that F2 generation.

It's a hallmark of polygenic inheritance.

This brings us to probably the most complex and definitely the most misused concept in the whole field, heritability.

Now we're trying to formally quantify what proportion of that phenotypic variation is actually due to genetic factors.

And this is so important for breeders.

If a treat has high heritability, selection will work well.

If it's low, selection won't do much because it's all environmental noise.

Right.

And we started with that simple equation, VP equals VG plus VE.

But the real world is much messier.

We have to expand it.

Okay.

The full equation is VP equals VG plus VE plus two other terms.

Right.

Plus two COVGE and VGBI.

Let's break those down because they are critical.

Let's start with the covariance term, genetic environmental covariance.

This term reflects a bias.

It happens when certain genotypes are not randomly distributed across environments.

Think of a dairy farmer.

If she gives her best feed, the superior environment, only to her cows with the best milking genes, the superior genotype.

Then the good genes and the good environment are reinforcing each other.

They are.

The high milk yield she sees is due to this positive covariance.

If we don't account for that, we'll overestimate how much of the success was due to the genes alone.

Okay.

So that's a non -random association.

What about the other term, genotype by environment interaction?

That sounds different.

It is different.

A G by E interaction exists when the relative performance of different genotypes changes across different environments.

Give me an example.

Okay.

Imagine two strains of corn,

genotype A and genotype B.

In a well -watered field, genotype A yields 20 % more than B.

It's the clear winner.

But in a drought year, genotype B proves to be more drought tolerant and now it yields 10 % more than A.

The winner changed depending on the environment.

That's a G by E interaction.

You can't say one genotype is just better.

It depends on the context.

This is why high yield crops are often bred for very specific climates.

Okay.

So that complicates the VP equation, but now we have to break down VG, the genetic variance itself.

Yes.

Because not all genetic variation is equally useful for predicting what the next generation will look like.

So we partition VG into three parts, VA, VD, and VI.

And the most important by far is VA, the additive genetic variance.

VA is the variance from genes that has simple additive effects, just like Nielsen -Ella's wheat genes.

One allele adds a little, a second one adds a little more.

This is the only component that is reliably passed from parent to offspring in a predictable way.

It's the predictable stackable part of genetic inheritance.

That's a good way to put it.

Then you have VD, the dominance variance.

This is when one allele masks the effect of another at the same locus.

Right.

And because of that masking, a heterozygous parent, big A little a, might look phenotypically identical to a homozygous dominant parent, big A big A.

That effect isn't passed on in a simple, predictable way because the alleles segregate.

And the last part is VI, epistatic variance.

That's the variance due to interactions between alleles at different loci.

This is when the effect of a gene at locus A depends entirely on which allele is present at locus B.

These are complex, non -additive interactions that are really hard to predict because the specific combinations get broken up by recombination.

So really, VA is the only part that breeders and evolution can consistently work with from one generation to the next.

For the most part, yes.

And we can even partition the environmental variance VE.

We have general effects like poor nutrition early in life that has a permanent effect.

We have special effects like a sunburn that's temporary.

And then you have the tricky ones, things shared by families that aren't genetic.

Right.

Common family environmental effects and maternal effects.

This is huge in mammals.

The uterine environment, milk quality, parental care.

These can make siblings resemble each other for purely environmental reasons, mimicking a genetic effect.

You have to be so careful to disentangle those.

All of this leads us to the two formal definitions of heritability.

Let's start with the broad one.

Broad sense heritability, or H squared B, is the proportion of the total phenotypic variance that's due to all genetic factors.

So it's VG divided by VP.

It gives you a general sense of how much genes matter, including all the complex dominance and epistatic interactions.

But the more useful, more predictive measure is?

Narrow sense heritability, or H squared N.

This is defined as VA divided by VP.

It only considers the proportion of variation due to those simple, predictable additive genetic effects.

And that's the gold standard for breeders.

It is, because VA is what determines the response to selection.

It's the part you can count on.

Okay, now we have to do the critical thinking segment.

Heritability is maybe the most misunderstood concept in all of biology.

So let's go to the five big misunderstandings.

Number one.

Okay, number one, heritability does not define the complete genetic basis of a trait.

This is a tough one.

A trait can be 100 % determined by genes, but have a heritability of zero.

How is that possible?

Think about the number of heads humans have.

It's 100 % genetic, but is there any variation in the number of heads in the population?

No.

So the genetic variance, VG, is zero.

And if VG is zero, heritability is zero.

Heritability is about the genetic contribution of the variation we see, not the existence of the trait itself.

Okay, that's a great point.

Misunderstanding number two.

Heritability applies only to a population, never to an individual.

You can't say 80 % of my height is genetic.

That's meaningless.

It's a property of the variance within a group.

Number three.

Heritability is not a fixed number.

No, it depends completely on the population and the specific environment it's in.

If you measure height heritability in a wealthy population where everyone has perfect nutrition,

VE is tiny, so heritability will be very high.

But if you measure it in a population with huge nutritional disparities, VE will be large, and the heritability estimate for the very same genes will plummet.

Exactly.

You can't transfer a heritability value from one environment to another.

This leads to the fourth and maybe most dangerous misunderstanding.

It is.

High heritability within a group does not mean that differences between groups are genetic.

Let's use the mouse example from the text to make this crystal clear.

Okay.

You have two groups of mice that are genetically identical.

You feed group one a rich diet and they grow huge.

You feed group two a poor diet and they're all stunted.

So the difference in the average size between the two groups is 100 % environmental.

It's the diet.

Right.

But now look within each group.

Within the well -fed group, any small differences in size are likely due to tiny genetic variations.

So heritability is very high, say 0 .9.

Same thing within the poorly fed group.

So you could find high heritability within both groups and still be totally wrong if you concluded the difference between them was genetic.

And that is a massively important lesson when people try to apply this concept to human populations.

It's a huge logical error.

And finally, number five.

Familial traits are not necessarily highly heritable.

Familial just means it runs in families.

That could be because of shared genes or it could be because of a shared environment, diets, culture, whatever.

You have to do the experiment to find out which it is.

Wow.

Okay.

So given all those pitfalls, how do we actually calculate the useful metric, the narrow -sense heritability?

The most common way is to look at the resemblance between relatives.

We use parent -offspring regression.

The logic is if additive genes are important, then offspring should resemble their parents.

And you plot the offspring's phenotype against the mid -parent value.

The mid -parent value is just the average of the two parents' phenotypes.

When you plot that data for a whole population and fit a regression line, the slope of that line B is your estimate of the narrow -sense heritability.

So if the slope is one, the parents perfectly predict the offspring and h squared n is one.

And if the slope is zero, there's no relationship and h squared n is zero.

That's a really powerful and direct way to measure it.

And now we get to the payoff.

Once you know that narrow -sense heritability, you can predict how a population will evolve under selection.

This is evolution in action.

The amount the mean phenotype changes in one generation is called the response to selection, or R.

And breeders use a simple formula for this, the breeders equation.

The equation is just R equals h squared n times s.

The response is the product of the narrow -sense heritability and the selection differential s.

Okay, what is s, the selection differential?

S is your measure of how strong the selection is.

It's the difference between the mean of the parents you chose to breed and the mean of the entire original population.

So if the average fly weighs 1 .3 milligrams, but you only pick the biggest flies to be parents, and their average is 3 .0 milligrams.

Then your selection differential s is 1 .7 milligrams.

That's the pressure you're applying.

And if the offspring of those parents have a new mean of 2 .0 milligrams, the response R is on 0 .7 milligrams.

And now you can use the equation backwards.

h squared n equals R divided by s, or 0 .7 divided by 1 .7.

So you've just estimated heritability from your selection experiment.

We know this response to selection can't go on forever, though.

We see in long -term experiments that it eventually plateaus.

Right, the response tapers off.

That can happen for two reasons.

Either you've exhausted all the additive genetic variants, all the favorable alleles are now fixed in the population, or more often, you run into the problem of detrimental genetic correlations.

The hidden cost of selection.

This is when two or more traits are genetically locked together.

They don't vary independently.

If you select for one, the other comes along for the ride, whether you want it to or not.

And what causes this genetic linkage?

There are two main causes.

The most important one for polygenic traits is pleiotropy.

That's when a single gene influences multiple, seemingly unrelated traits.

A growth hormone gene might affect height, bone density, and muscle mass all at once.

So if you select for more muscle, you might inadvertently be selecting for taller height, too.

Exactly.

The other cause is just physical genetic linkage, where the genes for two different traits are located close together on the same chromosome and tend to be inherited as a single block.

And these correlations create trade -offs.

If it's a positive correlation, great.

But negative correlations are what constrain evolution.

The garter snake example is perfect.

These snakes eat toxic newts.

So they need two things.

They need to be fast to catch prey, and they need to be resistant to the newts neurotoxin.

And scientists found a negative genetic correlation between speed and resistance.

A strong one.

The snakes that are the most resistant to the toxin are also the slowest.

And the fastest snakes are the most vulnerable to the poison.

So you can't be both.

Natural selection has to find a compromise.

It's a trade -off.

In an area with super toxic newts, selection favors resistance, even if it means being slow.

In another area, speed might be more important.

The negative correlation prevents the evolution of a perfect snake that's both super fast and super resistant.

And breeders face this all the time.

Milk yield and butterfat content in cattle are negatively correlated.

It's a constant struggle.

If you select for cows that produce huge volumes of milk, the butterfat percentage tends to drop.

The genetics are working against you if you want both.

Okay.

So for a century, this was all done with statistics.

But then molecular biology came along, and suddenly we could find the actual genes.

Yes.

This is the modern era.

We can move beyond just variance components and identify the specific genes, the quantitative trait loci, or QTLs.

A QTL is a specific spot on a chromosome that contributes to a quantitative trait.

Right.

And the whole set of QTLs for a trait is its genetic architecture.

The goal of QTL mapping is to find the physical addresses of the genes responsible for the variation.

So how do you do it?

What's the strategy?

You start with a cross between two phenotypically different lines, like a huge tomato and a tiny one.

You generate an F2 population.

Then you genotype every individual in that population for hundreds of molecular markers like SNPs that are spread all across the genome.

So you have a genetic map for every individual.

You do.

And then for each marker, you split the F2s into groups based on their genotype at that marker.

And you ask a simple question using Innovo.

Is there a significant difference in the average fruit weight between the different marker genotype groups?

And if there is, that means the marker must be physically linked to a QTL that actually affects fruit weight.

That's the logic.

You've just found the neighborhood where a gene for fruit weight lives.

And this has led to some incredible insights.

The monkey flower study is a great example of evolution.

It is.

You have two closely related species, one adapted for hummingbird pollination, and one for bee pollination.

Their flowers are totally different color, shape, everything.

And when they did QTL mapping, they found something surprising.

They found that a lot of these big differences were controlled by just a few QTLs of major effect.

Sometimes a single QTL could explain 25 % or more of the variation in a trait -like flower color.

So that suggests that big, rapid evolutionary shifts might not need hundreds of tiny changes.

A mutation in just one or two key genes could create a dramatic new adaptation.

It does.

And often, these key genes are involved in regulation, not just building blocks.

The corn gene, TB1, is a perfect case study.

This is the gene that controls branching.

It's a big part of the difference between modern corn and its wild ancestor, Teosinte.

Right.

When they cloned the gene, they found the critical difference between the corn allele and the Teosinte allele wasn't in the protein sequence itself.

It was in a non -transcribed regulatory region nearby.

So the difference was in gene expression, when and how much of the gene gets turned on.

Exactly.

The domestication of corn was largely about selecting for changes in gene regulation.

We see the same thing with the tomato gene, FW2 .2, which controls a huge chunk of fruit weight.

The variation is all about the timing and level of the gene's expression during fruit development.

And that tomato gene also had a trade -off, a negative correlation.

The allele for small fruit actually produced more total fruits on the plant.

The breeder's dilemma.

A few big tomatoes or lots of small ones?

The genetic architecture dictates your choices.

Since we can't do experimental crosses in humans, we use association studies.

Right.

We use naturally existing variation.

We scan the genomes of thousands of people, looking for correlations between specific SNPs and a trait like height or risk for diabetes.

If a SNP is consistently associated with a trait, it's likely near a QTL.

And the final example from the text brings everything together, dissecting the genetics of aggression in Drosophila.

A fantastic study.

First, they did artificial selection, breeding high and low aggression lines of flies.

So they knew there was additive genetic variants to work with.

Then they used microarrays to compare the gene expression profiles between the two lines.

They found about a thousand genes were expressed differently.

And were there any patterns?

Yes.

The aggressive flies overexpressed genes for chemical detection, for smell and taste.

The low aggression flies overexpressed genes for learning and memory.

But that's still just a correlation.

They needed proof.

So they took 19 of those candidate genes and systematically knocked them out one by one and tested the fly's behavior.

The result was stunning.

For 15 of the 19 genes, mutating them caused a clear measurable change in aggression.

So they went from a complex behavior all the way down to a list of specific validated genes.

A complete dissection.

It shows that even for something as complex as behavior, these modern methods can unravel the genetic architecture.

That was an incredibly detailed deep dive.

Let's try to recap the highest yield principles.

Okay.

We started by seeing that continuous variation comes from the polygene hypothesis.

Many genes with additive effects plus the environment.

We realized we need statistics, especially the mean and the variance to describe populations.

The core task is partitioning the phenotypic variance, VP, into its genetic and environmental parts.

We defined narrow sense heritability, H squared N, as the most important predictive measure because it's based on the additive variance, VA, which is what determines the response to selection.

And we hammered home the five big misunderstandings of heritability.

That it's population -specific, it's not fixed, and crucially, within -group heritability tells you nothing about the cause of between -group differences.

Finally, we saw how modern genetics finds the actual QTLs, the genes themselves, and discovered that the genetic architecture is often about changes in gene regulation and it's always constrained by trade -offs from genetic correlations.

Speaking of those constraints, here's a final provocative thought for you to carry forward.

Imagine you're that tomato breeder and you find that the allele for large fruit size is genetically correlated with lower disease resistance.

In the short term, you might accept that trade -off for the bigger fruit.

But how does that negative correlation constrain the long -term evolutionary path of your crop?

And what kind of advanced genetic engineering or massive breeding program would it take to finally break that undesirable link to uncouple the good from the bad?

It's the central challenge in every complex system.

Genetics gives you the pieces, but correlation often dictates the rules of the game.

Thank you for joining us for this deep dive.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers

Quantitative genetics addresses the inheritance of traits exhibiting continuous phenotypic variation across populations rather than discrete categorical classes. Unlike discontinuous traits governed by simple Mendelian mechanisms, quantitative traits such as human height, body mass, and agricultural yield arise from the combined effects of multiple genes at different loci and substantial environmental influences. The polygene hypothesis, illustrated through the classic example of wheat kernel color inheritance, demonstrates how additive alleles distributed across several chromosomal locations generate a spectrum of intermediate phenotypes within populations. Understanding these complex traits requires mastery of statistical frameworks including frequency distributions, descriptive measures of central tendency and dispersion, and correlation methods that quantify relationships among variables. Regression analysis provides predictive capacity by establishing how parental phenotypes relate to offspring outcomes. Phenotypic variance, the fundamental measure of trait diversity within a population, decomposes into distinct components: genetic variance (itself subdivided into additive, dominance, and epistatic interactions), environmental variance, and covariance effects between genotype and environment. Heritability estimates the relative contribution of genetic versus environmental influences to total phenotypic variation. Broad-sense heritability encompasses all genetic effects, while narrow-sense heritability isolates additive genetic variance and directly predicts evolutionary response under selection pressure. The breeder's equation quantifies how selection intensity and the strength of additive genetic effects determine phenotypic change across generations in artificial and natural populations. Genetic correlations emerge through pleiotrophic gene action or physical linkage between loci affecting different traits. Modern genomic approaches employ molecular markers and statistical mapping techniques to localize Quantitative Trait Loci responsible for continuous variation at the chromosomal level, bridging classical biometrical concepts with contemporary molecular biology.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥

Chapter 22: Quantitative Genetics and Complex Traits

Related Chapters