Chapter 9: DNA Structure and Analysis

0:00 / 0:00
Report an issue

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replaced the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

Welcome to the Deep Dive, where we sift through complex information to extract the core insights just for you.

Today we're tackling one of biology's, well, most profound questions.

What is the genetic material?

How does this invisible blueprint shape, you know, every living thing from a tiny bacterium to us?

For centuries, it was just obvious that traits were inherited passed down.

But the actual molecule doing that heavy lifting, that remained a complete mystery.

We knew about heredity, but the chemical how was, yeah, totally unknown.

So with this deep dive, we're plunging into a key chapter from Essentials of Genetics, our mission to really distill those groundbreaking discoveries that finally revealed DNA as the blueprint of life.

We'll explore its amazing structure and even touch on how we analyze it today.

We want you to get the important stuff without feeling overwhelmed.

That's right.

We'll trace that whole experimental journey, starting way back with those early scientific debates.

Then we'll unpack the famous Watson -Crick model that really changed everything.

We'll also look at DNA's, let's say, cousins, like RNA, and finally, glance at some of the really cutting edge techniques that have just revolutionized how we understand and even manipulate genetics these days.

It's quite a story, a real scientific detective story.

Okay, let's untack this.

So we know life's blueprint gets passed on, but before scientists really cracked the code of DNA,

what does a molecule even need to qualify as genetic material?

Our source lists four sort of negotiables.

First up, replication.

It has to copy itself, right?

Exactly, accurately.

So that information passes faithfully cell to cell, generation to generation, crucial.

And second, it needs enormous storage of information.

I mean, we're talking vast complex instructions for building and running a whole organism.

Absolutely.

Think about the sheer volume.

Third is expression of information.

So it has to actually direct what cells do, tell them how to operate.

And this is that central dogma comes in, isn't it?

DNA makes RNA, RNA makes protein.

That's the core concept, yeah.

DNA holds the master plan, RNA is the messenger and worker, and proteins are the final product.

The workhorse is doing the jobs in the cell.

And the fourth point, which is maybe less obvious, but just as vital, variation by mutation.

The genetic material can't be perfectly static, it needs to be able to change sometimes.

Right.

Because those small changes, those mutations, are the raw material for evolution.

They allow life to adapt, to diversify.

Without that capacity for change, life would be stuck.

So those are the requirements.

But interestingly, for a long time, scientists actually thought proteins were the better bet for this job, didn't they?

They did, yeah.

And you can see why.

Proteins were known to be incredibly diverse, complex molecules.

They do so many things in the cell, they seem like the obvious candidates.

DNA, on the other hand, well, Friedrich Miescher found it way back in 1868, this nuclein stuff in cell nuclei.

But then, around 1910, Phoebus Levine came up with his tetranucleotide hypothesis.

Ah, right.

The idea that DNA was just a simple, boring, repeating pattern of four bases in equal amounts.

Exactly.

And that made DNA seem way too simple to hold complex genetic information.

So for quite a while, DNA was sort of pushed to the side.

Proteins seem much more likely.

So Levine's idea kind of put brakes on DNA research for a bit.

But then Erwin Chargaff came along in the 1940s and started looking closely at the actual amounts of those bases.

And his findings subtly started to chip away at Levine's hypothesis, didn't they?

They really did.

Chargaff found something Levine's model couldn't explain.

He showed that while the total amount of purines, that's A and G, equal the total pyrimidines, C and T, the individual amounts weren't equal.

Specifically, he discovered the amount of adenine A always seemed to equal thymine T and guanine G always equal cytosine C, AT, GC.

Ah, Chargaff's rules.

That's them.

This hinted at a specific pairing, a hidden complexity that Levine's simple repeating unit just didn't account for.

It was a crucial clue, suggesting DNA might be more complex, more information -rich than previously thought.

Okay, so Chargaff planted a seed of doubt.

The focus then shifted to getting direct experimental proof.

Where did the first big piece of that puzzle come from?

That would be Frederick Griffith back in 1927.

His experiments with the Placoccus pneumonia, the pneumonia bacteria, were groundbreaking.

He worked with two strains, a virulent S -strain S for smooth because it had this protective capsule and an avirulent R -strain R for rough, no capsule.

Right, and only the S -strain could kill mice.

So Griffith did this key experiment.

He injected mice with a mix, live harmless R -strain bacteria plus heat -killed dead S -strain bacteria.

Nether should have been lethal on its own.

Exactly.

Logically, the mice should have been fine, but the astonishing result was they died.

Wow.

And when Griffith looked at their blood, he found living, virulent S -strain bacteria, the deadly kind.

How could that happen?

Well, Griffith concluded that something from the dead S -cells, he called it a transforming principle,

must have somehow changed or transformed the live R -cells into the deadly S -type.

He'd immediately think genetics, more like a physiological change, but it opened a huge door.

A transforming principle.

That really set the stage for Avery, McCloud, and McCarty, didn't it?

They spent, what, a decade trying to figure out exactly what that principle was.

An entire decade, yes.

From 1934 to 1944, incredibly careful work.

They purified the transforming principle from the S -cells and then used enzymes to systematically destroy different types of molecules within it.

So like they added an enzyme to destroy proteins.

Right.

Protease.

And when they did that, the extract could still transform R -cells into S -cells.

So it wasn't protein.

Okay.

Then they tried an enzyme for RNA.

Ribonuclease.

Same result.

Transformation still happened.

So it was an RA either.

But then the DNA enzyme?

Yoxyribonuclease, D -Dase.

When they added that, the transforming activity was completely destroyed.

No more transformation.

So that was it.

DNA had to be the transforming principle.

That was their conclusion.

It was the first really direct experimental evidence suggesting DNA carried through the hereditary information.

A landmark moment.

It basically launched the era of molecular genetics.

An incredible piece of detective work.

Then came the second big confirmation, the Hershey Chase experiment in 1952.

Using viruses, right?

Bacteriophages.

Exactly.

Bacteriophage T2, which infects bacteria.

Phages are beautifully simple, just a protein coat outside and the DNA core inside.

They stick to a bacterium, inject something, take over, make more phages, and burst the cell open.

And the big question was, what gets injected?

The protein coat or the DNA core?

Precisely.

Hershey and Chase used radioactive isotopes.

Brilliant idea.

They labeled the phage DNA with radioactive phosphorus, 32P, because DNA has phosphorus, but protein mostly doesn't.

Okay.

And they labeled the phage protein coat with radioactive sulfur, 35S, because proteins have sulfur, but DNA doesn't.

So they had two batches of labeled phages.

Clever.

So they could track each part separately.

Yep.

They let these labeled phages infect bacteria.

Then critically, they used a blender, literally a kitchen blender, to knock the phage coats off the outside of the bacteria after the injection had happened.

Then they separated the bacteria from the phage coats.

And what did they find?

Where did the radioactivity end up?

The results were crystal clear.

Most of the 32P, the DNA label, was found inside the bacterial cells, and it was passed on to the next generation of phages.

But most of the 75S, the protein label, remained outside the bacteria, with the empty phage coats, the phage ghosts, as they called them.

So DNA went in, protein stayed out.

Undeniably.

It confirmed that DNA was the material carrying the instructions for making new phages.

Together with Avery's work, this really sealed the deal for DNA as the genetic material, at least in bacteria and viruses.

Okay.

Solid proof for bacteria and viruses.

But what about your carryouts?

Like plants, animals, us?

The evidence there was initially more indirect, right?

Based on where DNA is found and how much of it there is.

That's right.

Several lines of circumstantial evidence pointed strongly towards DNA.

First, its distribution.

DNA is mainly found in the nucleus, right where the chromosomes are.

And also in mitochondria and chloroplasts, places we know have their own genetic functions.

Protein, meanwhile, is everywhere in the cell.

That makes sense.

What else?

The ploidy correlation.

If you measure the amount of DNA in different cells from the same organism, you find diploid somatic cells.

Regular body cells have exactly twice the amount of DNA as haploid, gamete, sperm or egg cells.

Half the chromosomes, half the DNA.

Precisely.

Data from humans, chickens, trout, they all showed this consistent 2 .1 ratio.

You don't see that kind of precise correlation with protein amounts.

It strongly suggested DNA quantity was tied to the chromosome sets.

And the third piece of evidence involved UV light and mutations.

Yes, mutagenesis.

UV light is known to cause mutations.

Scientists found that the wavelengths of UV light most effective at causing mutations peaked around 260 nanometers.

And guess what else absorbs light most strongly at 260 nanometers?

Nucleic acids, DNA and RNA.

Exactly.

Proteins, on the other hand, absorb best around 280 nanometers, a wavelength that isn't particularly mutagenic.

So the fact that the mutation -causing spectrum matched the DNA absorption spectrum was another strong pointer.

So lots of converging indirect evidence, but then came the modern era with direct proof in eukaryotes.

Absolutely.

Recombinant DNA technology is a prime example.

We can literally take a human gene, say for insulin, splices DNA into bacteria and those bacteria will start producing human insulin.

That directly shows DNA carries the blueprint for that protein, even across species.

Unequivocally.

And then there's genomics, sequencing entire genomes like the Human Genome Project.

The entire basis of comparing genomes to find genes linked to diseases rests on the fundamental fact that DNA is a genetic material.

We're looking for differences in the DNA sequence that cause those inherited conditions.

It's foundational.

Now, we've established DNA as the rule, but you mentioned exceptions earlier.

Some viruses use RNA instead.

Yes, it's an important point.

Viruses like tobacco mosaic virus, TMV or phage QB, use RNA as their genetic blueprint.

You can actually take purified RNA from TMV, put it on a tobacco leaf, and it will cause the infection.

Proves the RNA itself carries the infectious information.

And then there are retroviruses like HIV.

They have a unique twist involving RNA and DNA, right?

Using reverse transcriptase.

That's right.

Retroviruses store genetic information as RNA, but when they infect a host cell, they use this special enzyme, reverse transcriptase, to make a DNA copy of their RNA genome.

So they go RNA back to DNA, the reverse of the usual flow.

Exactly.

Hence the name reverse transcription.

And the clever part is this newly made viral DNA can then get inserted right into the host cell's own DNA, its own chromosomes.

Wow.

So it becomes part of the host's genome.

It does.

And every time the host cell copies its DNA, it copies the viral DNA too.

When the host genes are read, the viral DNA gets read too, producing new viral RNA and proteins.

It's a very effective and frankly sneaky way for the virus to hide and replicate along with the host.

Okay.

So the mystery of what the genetic material is was solved mostly DNA, sometimes RNA.

The next huge question was what does DNA actually look like, its structure?

This led to that famous scientific race in the early 50s.

They really did.

Culminating in 1953 with Watson and Crick's model.

But we absolutely have to acknowledge the others involved.

Linus Pauling was close with the triple helix idea, incorrect, but close.

And crucially, Roslyn Franklin and Maurice Wilkins, whose X -ray diffraction work provided the critical data.

Before we get to the double helix itself, maybe a quick refresher on the building blocks.

The basic chemistry Watson and Crick were working with.

Good idea.

So DNA is a polymer of nucleotides.

Each nucleotide has three parts, a nitrogenous base, a pentose sugar, deoxyribose in DNA, and a phosphate group.

The bases are A, G, C, and T.

Right.

Adenine A and guanine G are the bigger purines.

Cytosine C and thymine T are the smaller pyrimidines.

And RNA, uracil U, another pyrimidine, replaces thymine.

And the sugar difference is key too, right?

Deoxyribose versus ribose.

Very key.

Deoxyribose in DNA lacks one oxygen atom compared to ribose in RNA.

That small difference makes DNA much more stable, better suited for long -term information storage.

RNA's ribose makes it more reactive.

Got it.

Base plus sugar is a nucleoside out of phosphate.

It's a nucleotide.

Correct.

And these nucleotides link up using phosphodiester bonds.

The phosphate of one links to the sugar of the next, forming a chain, a polynucleotide strand with a sugar phosphate backbone.

And the sequence of those bases, A, G, C, T, along that chain is where the information is stored.

The potential for variation is immense, isn't it?

Astronomical.

Our source mentions a chain just 1 ,000 nucleotides long could be arranged in four to the power of a thousand ways.

That's more possibilities than atoms in the universe, basically.

It explains how DNA can encode so much complexity.

Which brings us back again to Chargaff.

His base composition rules A, T, G, C were vital for Watson and Crick figuring out the structure, weren't they?

Absolutely fundamental.

Chargaff's data, collected between 49 and 53, showed that fixed pairing.

A always pairs with T, G always pairs with C.

It refuted Levine's idea of equal amounts and strongly suggested a specific complementary relationship between the bases.

Like pieces of a puzzle fitting together.

And the other crucial piece was the X -ray diffraction data, from Franklin especially.

Yes.

Rosalind fibers than previous attempts.

Her famous photograph 51 showed a distinct X pattern, which is characteristic of a helix.

It also revealed key dimensions, like a regular spacing of 3 .4 angstroms, suggesting stacked bases and indicated the overall helical diameter.

So armed with Chargaff's rules and Franklin's X -ray data, Watson and Crick built their model.

What did they propose in that landmark 1953 paper?

They proposed the iconic double helix.

Their model had several key features.

One, two long polynucleotide chains coiled around a central axis and it's a right hand of helix.

Two chains, okay.

Two, the chains run in opposite directions.

They're anti -parallel.

Right.

One strand goes five prime to three prime, the other goes three prime to five prime.

Think of it like lanes on a highway.

Got it.

Anti -parallel.

Three, the flat bases are stacked on the inside of the helix perpendicular to the axis, like steps on a spiral staircase 3 .4 angstroms apart.

Bases inside, backbone outside.

Four, the two chains are held together by hydrogen bonds between specific complementary bases.

Adenine pairs with thiamine using two hydrogen bonds, AT.

Guanine pairs with cytosine using three hydrogen bonds, G.

This explains Chargaff's rules perfectly.

A with T, G with C, the specific pairing.

Five, each complete turn of the helix is 34 angstroms long and contains 10 base pairs.

Later refined to about 10 .4, 10 was the initial model.

Six,

the coiling creates alternating wider major groove and narrower minor groove spaces along the molecule surface.

These grooves are important binding sites for proteins.

Major and minor grooves.

Right.

And seven,

the helix has a consistent diameter of 20 angstroms.

That base pairing AT and GC, that complementarity,

that's really the heart of it, isn't it?

Genetically speaking.

It's absolutely central.

It immediately suggested how DNA could be accurately copied.

Each strand can serve as a template for making a new complementary strand.

It also explained how information could be stored in the sequence of bases.

And the structure is remarkably stable.

You have thousands of those hydrogen bonds, weak individually but strong collectively.

Plus, the hydrophobic bases are tucked inside, away from water, while the charged sugar phosphate backbones face outwards.

It's an incredibly stable yet accessible structure.

And Watson and Crick saw these implications right away.

They did.

In their original paper, they famously understated, it has not escaped our notice that the specific pairing we have postulated immediately suggests a possible copying mechanism for the genetic material.

It was revolutionary.

It opened the door to understanding replication,

gene expression,

mutation, basically all of molecular biology.

And the Nobel Prize followed in 62 for Watson, Crick and Wilkins.

But it's important to remember Rosalind Franklin's contribution and the ethical question surrounding how her data was used, as the source points out.

Yes, the case study in the text highlights this.

Franklin had passed away before the prize was awarded, making her ineligible by its rules.

But the way her critical photo 51 was shown to Watson, apparently without her full knowledge or permission, remains a significant point of discussion about credit and collaboration in science.

It's a reminder that discovery involves people, and acknowledging contributions fairly is crucial.

Absolutely.

Now, we usually picture that classic Watson -Crick BDNA helix, but DNA can actually twist into slightly different shapes, can't it?

It can, yes.

BDNA is the standard form found in cells under normal physiological conditions, aqueous, relatively low salt.

It's the right -handed helix we've been discussing.

But under conditions of lower humidity or high salt,

DNA can adopt the ADNA form.

It's still right -handed, but wider and shorter, more compact, with about 11 base pairs per turn, and the bases are tilted relative to the axis.

It's less likely to be the main form in living cells, but might occur in certain DNA -protein interactions or DNA -RNA hybrids.

OK, so ADNA is a variation.

What about ZDNA?

That one sounds quite different.

ZDNA is fascinating.

It's a left -handed helix, which is quite unusual.

It's thinner than BDNA and has a zigzagging sugar phosphate backbone, hence the Z.

It has 12 base pairs per turn.

Left -handed.

Does it actually exist in cells?

Evidence suggests short stretches of ZDNA can form in vivo, particularly in sequences with alternating purines and pyrimidines, like GC repeats, especially when the DNA is under torsional stress.

Its exact biological role is still debated, but it might be involved in regulating gene activity, perhaps acting as a recognition signal for certain proteins.

And there are other rarer or lab -induced forms, too, like C, D, E, and PDNA.

So DNA has some structural flexibility.

Let's see.

Three key differences.

One, RNA uses the sugar ribose instead of deoxyribose.

Two, RNA uses the base uracil, U, instead of thymine, T.

So in RNA, A pairs with U3.

RNA is usually single -stranded, although it can fold into complex 3D shapes by base pairing with itself.

And some viruses do have double -stranded RNA genomes.

And this single -stranded nature allows it to take on many different jobs in the cell, right?

What are the main types?

The three major players involved in making proteins are ribosomal RNA, or RNA, which is the most abundant type, and forms the core structural and catalytic components of ribosomes in the protein synthesis factories.

Okay, rRNA and ribosomes.

The messenger RNA, mRNA.

This molecule carries the actual genetic code.

The instructions copied from a DNA gene to the ribosome.

Its size varies depending on the protein it codes for.

mRNA is the blueprint copy.

And transfer RNA, PRNA.

These are relatively small molecules shaped like a cloverleaf.

Their job is to ferry the correct amino acids to the ribosome, matching them to the codons on the mRNA sequence during translation.

PRNA brings the building blocks.

Got it.

Beyond those three, there's a growing list of other functional RNAs, especially eukaryotes.

Things like telomerase RNA involved in chromosome ends, small nuclear RNA, SNRNA for processing mRNA, and various regulatory RNAs like microRNAs, mRNAs, and long non -coding

RNAs that play complex roles in controlling gene expression.

They're distinguished partly by how they behave in a centrifuge, their Svedberg coefficient, or S -value.

It's an intricate system.

How do scientists actually study these molecules?

What are some key analytical techniques?

Many techniques exploit the hydrogen bonding.

For example, denaturation.

If you heat DNA, the hydrogen bonds break and the two strands separate or melt apart.

This causes changes we measure, like a decrease in viscosity and, importantly, an increase in UV light absorption.

That's the hyperchromic shift you mentioned.

Exactly.

As the strands unwind, the bases absorb more UV light.

If you plot this absorption against temperature, you get a melting curve.

The temperature at the midpoint, where 50 % of the DNA is denatured, is called the melting temperature, TM.

So what does this TM tell you?

Why is it useful?

Well, the TM is directly related to the F2.

So DNA with a higher percentage of GC pairs is more stable and requires a higher temperature to melt.

Its TM will be higher.

Measuring TM gives you an idea of the GC content.

Ah, okay.

So stability relates to GC content.

What about putting strands back together?

That's renaturation or hybridization.

If you cool denatured DNA slowly,

complementary strands can find each other and reform the double helix.

This principle is incredibly useful.

You can hybridize DNA strands from different sources, or even DNA with RNA, if they have complementary sequences.

We use this in molecular hybridization assays, often using labeled probes, short, known sequences of DNA or RNA, to detect the presence of specific target sequences in a sample.

Like finding a specific gene.

Exactly.

A powerful visualization technique based on this is EFISH fluorescence in situ hybridization.

You use fluorescently labeled probes that bind specific locations on chromosomes, allowing you to actually see where a gene is located, or detect chromosomal abnormalities like deletions or translocations right under the microscope.

Very useful in diagnostics.

Okay, hybridization is powerful.

Another technique you hear about all the time is electrophoresis.

What's that used for?

Gel electrophoresis is fundamental for separating nucleic acid fragments based on their size.

How does it work?

You place a mixture of DNA or RNA fragments in a gel matrix, usually agarose or polycrylamide, and apply an electric field.

Since the phosphate backbone gives nucleic acids a negative charge, they migrate towards the positive electrode.

And smaller pieces move faster.

Precisely.

The gel acts like a sieve.

Smaller fragments navigate the pores more easily and travel further down the gel in a given time than larger fragments.

This allows you to separate fragments that differ in length by even a single nucleotide sometimes.

It's essential for DNA sequencing, cloning, mapping, and techniques like southern blotting for DNA and northern blotting for RNA.

And finally, with the explosion of sequence data, bioinformatics has become indispensable.

Absolutely critical.

Bioinformatics uses computational tools and mathematical approaches to manage, analyze, and interpret the massive amounts of biological data, especially DNA and protein sequences.

Think about databases like GenBank at NIH, holding hundreds of billions of base pairs.

Bioinformatics helps us make sense of it all.

Storing, sharing, comparing.

And a key tool for comparison is BLAST.

Yes.

BLAST's basic local alignment search tool.

It's a powerful algorithm that lets you take a sequence you're interested in, say, a newly discovered gene, and rapidly search entire databases to find similar sequences.

So you can see if your gene looks like known genes in other species.

Exactly.

Finding similarities can give you clues about your gene's possible function, its evolutionary history, or potential links to disease.

It's an essential tool for connecting new discoveries to the vast existing knowledge base.

So it's been quite a journey we've traced today.

From those early debates, the elegant experiments proving DNA's role, the unveiling of the double helix, to the sophisticated ways we analyze it now, it really showcases the power of scientific investigation.

It truly does.

And the beauty of the was how it immediately suggested fundamental biological mechanisms, replication, information storage, mutation.

These weren't just abstract ideas anymore.

You could see how the molecule's physical structure enabled these processes.

It really laid the groundwork for everything that followed in molecular biology.

So for you listening, knowing how DNA is structured, how stable yet copyable it is, what new questions pop into your head about how life manages this incredible feat.

How does it maintain and transmit these complex instructions generation after generation with such fidelity, yet also allow for the change that drives evolution?

Something to think about.

Thank you so much for joining us for this deep dive.

We hope exploring the genetic blueprint has given you some valuable insights and maybe sparked some new questions.

Join us next time for another deep dive into the world of fascinating knowledge.

Thank you for being part of our last minute lecture family.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers
Establishing DNA as the central molecule of inheritance requires understanding both the historical investigations that identified its role and the structural principles that explain how it functions. The chapter traces the conceptual journey from early debates about hereditary material through landmark experiments that definitively proved DNA's primacy over competing theories. Griffith's pioneering work with bacterial transformation revealed that some unknown chemical agent could produce heritable changes in phenotype, setting the stage for the systematic identification work performed by Avery, MacLeod, and McCarty, who used selective enzyme treatments to pinpoint DNA as the transforming principle. The Hershey-Chase team supplied the final confirmatory evidence by employing radioactive isotopes to track the fate of viral components during infection, demonstrating conclusively that DNA enters the host cell while protein remains outside. These convergent experimental approaches, supplemented by findings from transfection studies, species distribution analyses, ultraviolet mutagenesis patterns, and advances in recombinant DNA technology, collectively established DNA's universal role across all biological domains. Understanding DNA's physical architecture reveals why it can fulfill these hereditary functions so reliably. The Watson-Crick double helix represents a right-handed structure composed of two antiparallel strands held together by hydrogen bonding between complementary base pairs, where adenine pairs with thymine and guanine pairs with cytosine. This model elegantly synthesized Chargaff's observations about nucleotide ratios with the structural insights from Franklin's pioneering X-ray crystallography work. DNA structure proves more dynamic than the textbook double helix suggests, adopting alternative conformations such as A-form and Z-form depending on environmental conditions and biological context. RNA molecules, while sharing the same basic building blocks and complementary pairing principles, differ structurally through their ribose sugar and uracil substitution, and functionally through their diverse roles as messengers, transfer molecules, ribosomal components, and regulatory agents including microRNAs and siRNAs. Analyzing nucleic acids requires both traditional biochemical approaches and modern computational techniques. Molecular hybridization exploits complementarity for detecting specific sequences, while gel electrophoresis separates molecules by size and charge, enabling physical isolation and characterization. Physicochemical analysis through ultraviolet spectroscopy reveals compositional information and structural stability, with thermal melting temperature serving as a sensitive indicator of base pairing integrity. Contemporary molecular investigation integrates these experimental methods with computational tools like sequence alignment algorithms, bridging classical biochemistry with modern bioinformatics.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥