Chapter 23: Mitochondrial DNA Profiling

Search this chapter

Audio Overview

0:00 / 0:00

Autoplay next chapter

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replaced the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

Welcome to the Deep Dive.

We're the show that takes those complex sources and really boils them down for you so you get the core information without all the noise.

Today we are tackling a crucial, maybe less talked about, forensic tool, mitochondrial DNA profiling dot mtDNA.

Yeah, this isn't your standard TV crime show stuff.

This is really for the most challenging cases.

We're talking about really degraded remains, maybe victims from mass disasters.

Or even evidence like an old hair shaft, maybe found years later, where the regular DNA, the nuclear DNA is just, well, it's either gone or there's barely anything left to work with.

Exactly.

That resilience, that's the key thing here.

Our source today, Chapter 23 of Forensic Biology, really highlights this.

See, every cell has hundreds of these mitochondria and each one has its own little genome, so the sheer number of copies is way, way higher than nuclear DNA.

Even if the nuclear stuff fragments, you've often still got detectable mtDNA.

So our mission today is to give you a really solid understanding of this whole process.

We'll unpack the biology behind it, how it's inherited, then get into the lab techniques they use, and finally how results are interpreted.

What does it all mean in a forensic context?

Okay, let's unpack this.

So let's start right down at the cellular level.

Mitochondria.

We learn in basic bio, they're the powerhouses of the cell, right?

Generating energy.

That's the one.

But the crucial part for forensics is they have their own DNA, completely separate from the main DNA stored in the cell nucleus.

Okay, so it's like an extra set of instructions just floating around in the cell.

Sort of, yeah.

We call it an extra chromosomal genome, and it's tiny compared to nuclear DNA, just 16 ,569 base pairs.

And it's circular.

It's packed really efficiently, too.

37 genes, mostly for its own energy functions.

And interestingly,

none of those non -coding spacer regions you find in nuclear DNA.

It's all pretty much functional sequence.

And he said hundreds per cell.

That's the first big advantage, right?

Just the sheer quantity compared to only copies of nuclear DNA.

Exactly.

Massive copy number advantage.

But maybe the even bigger deal,

forensically speaking, is how it's inherited.

It's non -Mendelian.

Meaning,

not the usual mix for mom and dad.

Right.

You typically get your MTDNA only from your mother.

It's maternal inheritance.

Okay, walk us through that.

How does that work?

Why doesn't the dad's MTDNA count?

Well, think about fertilization.

The sperm's job is basically to deliver the nuclear DNA.

Its mitochondria are mostly in the midpiece, powering the tail.

Usually that part doesn't even enter the egg.

And even if a few paternal mitochondria do sneak in, the egg has a system to get rid of them.

Like a bouncer at the door.

Or more like internal security.

Oh yeah, internal security is a good way to put it.

The current thinking involves a protein called ubiquitin.

It essentially tags the paternal mitochondria.

Marks them for disposal.

Exactly.

Marks them for disosomes.

They just get broken down.

Wow.

So that means me, my siblings, my mom, her siblings, my grandmother,

we all share the exact same MTDNA sequence.

Assuming an unbroken female lying.

That's the principle, yes.

We call that shared sequence a mitotype or haplotype.

And you can see immediately why that's huge for identifying missing persons or victims in mass disasters by comparing their MTDNA to living maternal relatives.

Game changer.

It is.

But.

There's something important here.

The mutation rate.

Oh.

It's not always a perfect copy down the generations.

It mutates faster than nuclear DNA.

Maybe up to 10 times faster.

Which is good for evolution over long timescales.

Increased diversity.

But for forensic ID, comparing, say, a grandchild to a maternal grandmother,

you might actually expect to see a difference just due to mutation.

It complicates things.

We'll come back to how labs handle that.

Mutation rate noted.

So if we're going to compare sequences, we need a standard, right?

Like a ruler or a map.

Absolutely.

You can't just list 16 ,000 odd bases.

The original standard was the Cambridge reference sequence, the CRS from back in 81.

But the one everyone uses today is the revised Cambridge reference sequence, RCRS.

That came out in 1999.

It basically corrected about 10 small errors in the original.

So the RCRS is the universal map.

Every forensic MTDNA sequence aligns to it, and we report differences from that reference.

Got it.

So you have the map.

Now, where on this little circular map do you actually look for the variations that make people different?

You don't sequence the whole thing every time, do you?

Generally, no.

You focus on the hot spots.

The region with the most variation is called the control region, or sometimes the D loop.

It doesn't code for proteins, so mutations can build up there without causing problems for the cell.

Here's where it gets really interesting.

Because forensic zooms in even further, right?

Just tiny parts of that D loop.

Exactly right.

We focus on two specific areas within that control region called hypervariable regions, HV1 and HV2.

Hypervariable, meaning lots of changes between people.

Precisely.

HV1 is about 342 base pairs long, and HV2 is about 268.

That's where most of the sequence differences between unrelated maternal lineages are found.

So sequencing just those two regions gives you a lot of discriminatory power.

Okay, standard sequence.

Focus on HV1 and HV2.

Sounds straightforward enough, but I suspect biology throws another curveball.

You suspect correctly.

The curveball is called heteroplasmy.

Heteroplasmy, meaning mixed types.

Exactly.

It means a single individual can actually have more than one type of MTDNA sequence present in their cells, sometimes even within the same tissue.

We see it quite often, especially in hair samples.

Whoa, so my own cells might not all have the exact same MTDNA.

It's possible, and there are two main kinds we look for.

First is sequence heteroplasmy.

That's when, at a single position in the sequence, you see evidence of two different bases,

like maybe both an A and a G.

On the sequencing data, it looks like two overlapping peaks.

Okay, a mix at one spot.

What's the other kind?

The other is length heteroplasmy.

This one's a bit more technically tricky.

It happens in areas where there's a long run of the same base, usually cytosine.

We call them C stretches.

Sometimes the replication process kind of stutters in these regions, adding or deleting a C.

This variation in the length of the C stretch can mess up the sequencing process downstream from it.

It makes it hard to read the sequence accurately after that point.

This is like a headache for the lab.

Does heteroplasmy ever actually help an investigation?

Oh, absolutely.

It can be incredibly informative.

If you find the same specific heteroplasmy, say that A G mix at position 16 ,129 in both the crime scene sample and the suspect's reference sample.

That makes the match much stronger because it's a more unusual feature to share.

Exactly.

It adds significant weight to the association.

It's another point of comparison and often quite a rare one.

Right.

Okay, let's shift gears into the lab workflow.

Say you get a really challenging sample fragment of old bone, maybe a tooth.

What's step one?

First, meticulous cleaning of the surface is crucial.

You have to remove any potential modern contaminants.

Then, for hard stuff like bone or teeth, you have to physically break it down.

Often involves grinding it into a powder,

pulverizing it, using specialized mills.

This breaks open the cells in matrix to release the DNA.

Labs usually try to do duplicate extractions if they have enough material, just as a check.

Given we're dealing with high amounts of DNA initially, but then amplifying it, contamination must be a huge concern.

Paramount.

Absolutely critical.

You co -extract MTDNA along with any nuclear DNA, then you quantify it.

Often using MTDNA -specific methods like real -time PCR.

And you run tons of controls.

Regent blanks with no sample, negative controls with no DNA template added to the amplification step, positive controls using a known sequence, like from the HL60 cell line.

You have to constantly monitor for any stray DNA getting in.

Okay, so you've extracted, you've quantified, you've controlled for contamination.

Is the next step always full sequencing?

Or is there a quicker way to screen samples?

There often is, yeah.

A preliminary screening technique uses something called an allele -specific oligonucleotide assay, or ASO assay.

Think of it like a spot check.

Kits like the linear array test for known variations at maybe 19 or so common locations within HV1 and HV2.

It's a way to quickly exclude someone.

If the sample doesn't match the reference on these key spots, you know they can't be the source, and you save the time and expense of full sequencing.

Makes sense.

But for the full picture, the gold standard is still Sanger sequencing, even with all the newer, faster sequencing tech out there.

That's still largely true in forensic MTDNA labs, yes.

Sanger sequencing, the chain termination method, is incredibly well validated, robust, and produces very high quality data for specific HV regions.

Forensic science values reliability and established protocols.

And the magic of Sanger relies on tricking the DNA copying process, right, using special ingredients.

Exactly.

The special ingredients are called d -deoxynucleotide triphosphates, or ddNTPs.

You run a DNA synthesis reaction with the sample DNA as a template, a polymerase enzyme, the regular DNA building blocks dNTPs, and a tiny amount of these ddNTPs.

What's the difference between a dNTP and a ddNTP?

The crucial difference is that ddNTPs are missing a specific chemical group, the three prime hydroxyl group, that's needed to add the next base in the chain.

So whenever the polymerase happens to slot in a ddNTP instead of a regular dNTP, boom, the chain stops growing right there.

Termination.

And you do this with ddNTPs for A, T, C, and G, each labeled with a different colored fluorescent dye.

So you end up with millions of DNA fragments, all different lengths, each ending with a specific colored base corresponding to its position in the original sequence.

And then you sort them by size.

Right.

Using capillary electrophoresis.

The machine reads the color of the dye on the very last base of each fragment as it passes a detector from shortest fragment to longest.

That tells you the sequence base by base.

And you mentioned cycle sequencing.

How does that help?

Cycle sequencing just uses thermal cycling like in PCR during the sequencing reaction itself.

It basically amplifies the amount of those terminated fragments, giving you a much stronger signal, which is really important when you might be starting with very little template DNA, makes the whole process more sensitive.

Okay.

So we've got the sequence data, those colored peaks on the electrophorogram.

How does that get reported in a standardized way for court?

It all comes back to that reference sequence, the RCRS.

You align your sample sequence to the RCRS and you only report the positions where your sample differs.

So if the RCRS has a T at position 16 ,223, but your sample has a C, the report will just say 16223C.

It's a very concise notation.

There are also standard ways to report insertions, deletions, and any heteroplasmy using specific codes.

And then the big question, what does it mean?

How do you interpret the comparison between say the crime scene hair and the suspect's reference sample?

Are there strict rules?

Oh, yes, very strict guidelines developed by groups like SWG DAM, the scientific working group on DNA analysis methods and ISFG, the International Society for Forensic Genetics.

There are basically three possible conclusions.

Okay.

What's the first?

First is exclusion.

You can confidently say the samples did not come from the same person or the same maternal lineage, but, and this is crucial, you can only make this call if there are two or more nucleotide differences between the sequences.

Why two?

If they differ at even one spot, aren't they different?

That goes back to the high mutation rate we talked about.

Because MTDNA mutates relatively quickly,

a single difference, especially between relatives, could just be a mutation that occurred between generations.

It's considered too ambiguous to be certain.

The two difference rule provides a necessary safety margin to avoid false exclusions.

Okay, that makes sense.

So, exclusion is two or more differences.

What if they match?

If the sequences are identical, the conclusion is cannot exclude, meaning the suspect could be the source or someone from the same maternal line could be.

And remember, heteroplasmy.

If they both show the same heteroplasmy, that significantly strengthens the association.

Interestingly, though, if one sample shows heteroplasmy and the other doesn't, or they show different heteroplasmy, that does not automatically lead to an exclusion.

Heteroplasmy can vary between tissues in one person or appear, disappear across generations.

Right, the tissue variation aspect.

Okay, so exclusion cannot exclude.

What's the third option?

The third is inconclusive result.

This is what you report if there's only a single nucleotide difference between the questioned and known sample and there's no heteroplasmy involved to potentially explain it.

So that one difference falls into that mutation gray area.

Exactly.

You can't confidently exclude them based on that single difference because of the known mutation rate, but you also certainly can't say they match.

It's just inconclusive.

Wow, okay.

This has been a really thorough look at MTDNA profiling.

We've covered the basics, the high copy number, that unique maternal inheritance, the importance of the RCRS reference focusing on HV1 and HV2, the challenge and potential value of heteroplasmy, and then the whole workflow from pulverizing bone to ASO screening and the details of Sanger sequencing, right down to those crucial interpretation rules.

Exclusion cannot exclude and inconclusive.

It really highlights the power and the limitations, doesn't it?

The very thing that makes MTDNA useful over long evolutionary timescales, that higher mutation rate, is the same thing that forces such caution like the two difference rule in forensic comparisons.

It's this constant balancing act.

Absolutely fascinating.

Thank you so much for walking us through all that.

My pleasure.

It's complex stuff, but incredibly important in certain cases.

And thank you, our listeners, for joining us on this deep dive.

Here's where it gets really interesting.

If you get a cannot exclude result, meaning the sequences match, but doesn't mean it's definitely the suspect.

It means they belong to a particular maternal lineage, a specific mitotype.

So how common is that specific mitotype in the general population?

How do labs figure out the statistical weight, the actual significance of that match?

That's a whole other layer we didn't even get into today.

Something to think about.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers

Mitochondrial DNA profiling represents a powerful forensic technique for establishing biological relationships and ruling out potential sources of evidence through analysis of the maternal genetic material present within mitochondrial organelles. Unlike nuclear DNA, mitochondrial DNA exists in hundreds to thousands of copies per cell as a small circular molecule, following strictly maternal inheritance patterns wherein individuals acquire their complete mitochondrial genome exclusively from their mother, with paternal contributions typically degraded before oocyte development. The forensic analysis centers on two hypervariable regions within the D-loop control region, designated HVI and HVII, which display substantial sequence variation across individuals and populations, providing the discriminatory power necessary for comparative casework. A significant complication in mtDNA interpretation is heteroplasmy, wherein multiple different mitochondrial sequences coexist within a single individual due to the vast number of mitochondria distributed throughout bodily cells, creating potential ambiguity when examining degraded or mixed biological samples. The analytical process begins with preliminary screening to determine whether evidence justifies detailed investigation, followed by polymerase chain reaction amplification of target regions to generate sufficient material for sequencing. Analysts employ either Sanger sequencing or next generation sequencing methodologies to determine precise nucleotide sequences and generate a mitotype, the distinctive mtDNA profile characteristic of an individual or evidence sample. Comparative analysis yields three possible interpretive outcomes: exclusion when sequences demonstrably differ, cannot exclude when sequences match supporting a potential maternal relationship, or inconclusive when data quality or heteroplasmy prevents definitive determination. Professional standards established by organizations including SWGDAM and ISFG guide all analytical conclusions and ensure consistency across laboratories and jurisdictions. Population mtDNA databases provide essential reference data for evaluating the frequency and significance of sequence matches, while specialized protocols address the unique challenges posed by degraded biological evidence.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥

Chapter 23: Mitochondrial DNA Profiling

Related Chapters