Chapter 11: Chromosome Structure and DNA Sequence Organization
Welcome to Last Minute Lecture!
This free chapter overview is designed to help students review and understand key concepts.
These summaries supplement not replaced the original textbook and may not be redistributed or resold.
For complete coverage, always consult the official text.
Welcome to the deep dive.
Okay, imagine trying to fit miles, literally miles, of super into a tiny thimble.
Sounds, well, impossible, right?
But that's basically what every cell does with its DNA, whether it's a tiny virus or, you know, our own complex cells.
So today we're diving deep into this incredible, really intricate world of DNA organization.
We're looking at Essentials of Genetics, the 10th edition, to unpack how all this vital genetic stuff is meticulously packaged, unpacked, regulated, and what it all actually means for life.
It's fascinating, isn't it?
Not just how it's packed, but the sheer variety of ways different life forms do it.
Understanding these strategies is just fundamental.
It's key to understanding gene function, inheritance, everything, really.
Absolutely.
So let's start simple, or maybe elegantly simple, thinking about viruses, bacteria, and even the DNA inside some of our own cellular components or organelles.
Right, the sort of minimalist approach first.
Exactly.
Viruses, for instance, they were just masters of packing.
Their chromosomes, often just a single nucleic acid molecule, right, could be DNA, could be RNA, single or double stranded.
And the efficiency is just, well, it's astonishing.
Take bacteriophage lambda.
Its DNA, stretched out, is about 17 millimeters long.
17 millimeters?
That's huge for a virus.
It is, but it gets packed into a viral head that's less than a tenth of a millimeter across.
Wow.
Or the T2 phage, it stuffs over 50 millimeters of DNA into an even tinier head.
It's almost perfect packing, like biological origami with almost no wasted space.
It's like folding a map perfectly back into its tiny square.
And they come in different shapes, too, circular or linear.
Yeah, some are circular, like Nexo 174, others are linear, like the T -even phages.
And lambda phage, it's actually linear outside the host, but then it circularizes once it gets inside.
Clever little adaptation.
Very clever.
Okay, so moving up a bit in complexity, bacteria, their chromosomes are also generally pretty simple.
Generally, yes.
Usually a single double -stranded circular DNA molecule.
The classic example is E.
coli S.
greekia coli.
Right?
Its chromosome, even though it's circular, is about 1 .2 millimeters long if you stretched it out.
1 .2 millimeters?
I'm a microscopic bacterian.
Exactly.
It can actually take up maybe a third of the cell's entire volume.
So how does it keep all that organized, keep it becoming just a tangled mess, especially when it needs to access genes quickly?
Good question.
It uses specific DNA binding proteins, not histones, like we'll see in eukaryotes, but proteins like HU and HNS.
Okay.
These are small, positively charged, so they bind tightly to the negatively charged DNA.
And they basically fold and bend the DNA, supercoiling it, compacting it into a region we call the nucleoid.
The nucleoid, got it.
And interestingly, some of these proteins, like HNS, also help regulate gene activity.
Sort of a hint of the complexity to come.
Okay.
Now, for something really mind -bending, I think, the DNA inside mitochondria and chloroplasts.
These things are inside our cells, but they have their own DNA.
How did we figure that out?
Well, the first clues came from inheritance patterns.
Things that didn't quite follow standard Mendelian rules often passed down maternally.
Ah, okay.
And then later, direct evidence confirmed a unique DNA within these organelles.
And what's really profound is that this DNA, MTDNA in mitochondria, CPDNA in chloroplasts, looks a lot like bacterial and viral DNA.
How so?
It's usually double -stranded, closed circles.
And importantly, it's mostly free of the complex proteins, the histones, that package eukaryotic DNA.
Wow.
And this is really strong evidence supporting the endosymbiotic theory.
Right.
The idea that these organelles were once free -living bacteria.
Exactly.
That they were engulfed by early eukaryotic cells and formed a symbiotic relationship.
We're basically seeing the echoes of that ancient event in their DNA structure today.
That's incredible.
An evolutionary story right there in the cell structure.
Does the size of this organelle DNA vary?
Oh, yes.
Quite a bit.
Especially mitochondrial DNA.
Human MTDNA is pretty compact, around 16, 18 kilobases.
But yeast MTDNA is much larger, maybe 75 kilobates.
And some are more gene -packed.
Generally, yes.
Human MTDNA, for example, has very few non -coding regions, or introns.
Larger ones, like in yeast, can have them.
Choroplast DNA is a bit more uniform in size, usually 100, 200, and 25 kilobands.
Also circular, protein -free, often larger than MTDNA because it has more non -coding bits of duplications.
Okay, so that covers the, let's say, simpler packaging strategies.
But eukaryotes, like us, with our massive genomes, things get way more complicated.
And actually, some specialized eukaryotic chromosomes gave us early clues about how DNA works, like polythene chromosomes.
Ah, yes.
These are amazing structures.
You find them in specific tissues in insect larvae.
The salivary glands of Drosophila, the fruit fly, are the classic example.
What's so special about them?
Well, first, they're huge.
You can see them easily with a light microscope, even during interphase when other chromosomes are usually decondensed and invisible.
Okay.
And second, they represent paired homologs.
The two copies of each chromosome are perfectly aligned side by side.
That's pretty rare in somatic cells.
And they're made of thousands of DNA strands all lined up, giving them that banded look.
Exactly.
Thousands of identical DNA strands in perfect register create that distinctive banding pattern.
But the real breakthrough was observing what scientists called puffs.
Puffs?
What are those?
These are specific bands that look, well, puffed out, uncoiled.
And using radioactive tracers for RNA synthesis,
researchers show that these puffs were sites of intense gene transcription.
So they could actually see where genes were being actively read.
Precisely.
It was one of the first direct visual confirmations that specific DNA regions become active, physically change structure to allow gene expression.
You could literally see genes turning on and off during development.
Incredible.
Like watching the genome at work way before modern molecular tools.
And then there are lamp brush chromosomes.
Similar idea.
Similar idea, but in a different context.
They get their name because they look like the brushes used to clean old oil lamps.
Okay.
I can picture that.
You find these mainly in developing egg cells, oocytes of vertebrates like amphibians, and also in some insect sperm cells.
They are specifically meiotic chromosomes seen during prophase eye of meiosis.
And they're stretched out, not condensed.
Nassively stretched out.
They uncoil and extend, sometimes becoming hundreds of micrometers long.
They have a central axis with condensed blobs called chromomeres.
And sticking out from these chromomeres are pairs of lateral loops.
And I bet those loops are active.
You bet.
Just like the puffs, these loops are sites of intense RNA synthesis.
It's DNA that's been reeled out, made accessible for transcription.
So again, these specialized structures visible long ago give us fundamental insights.
DNA organization is dynamic and directly linked to gene function.
Amazing early glimpses.
Okay, so let's tackle the big eukaryotic challenge.
How do you take, say, in humans over two meters of DNA?
Yeah, two meters per cell nucleus.
And pack it into a nucleus that's maybe five or 10 micrometers across.
I mean, that's like fitting a marathon runner's entire route into a space smaller than a grain of sand.
It's a phenomenal packing ratio, something like 10 ,000 to one.
It's achieved through a complex called chromatin.
Chromatin.
That's the DNA plus associated proteins.
Exactly.
And the main protein players are the histones.
These are small, basic proteins, positively charged.
There are five main types,
H1, H2A, H2B, H3, and H4.
And the positive charge is key.
Absolutely key.
DNA is negatively charged because of its phosphate groups.
So the positive histones bind tightly to the negative DNA through electrostatic attraction.
They're essential structural components.
This leads to that famous model, the beads on a string, right?
The nucleosome.
How did we figure that out?
That was a series of landmark discoveries around the mid -1970s.
First, researchers found that gently digesting chromatin with enzymes cut the DNA into fragments of about 200 base pairs, or multiples of 200.
Suggesting a repeating unit.
Exactly.
Then electron microscopes provided the visual.
Ada and Donald Allens took these incredible pictures showing chromatin fibers looking like, beads on a string.
These beads were named nucleosomes.
A real aha moment seeing that structure.
Definitely.
And biochemical work filled in the details.
Each bead, each nucleosome, consists of a core of eight histone proteins in octamer, with two copies each of H2A, H2B, H3, and H4.
An octamer core.
And around this core, a stretch of DNA, about 147 base pairs long, is wrapped about 1 .7 times.
147 base pairs wrapped around the core.
What about the rest of the 200 pairs from the digestion experiments?
That remaining DNA, roughly 50 base pairs or so, acts as linker DNA, connecting one nucleosome to the next.
And that linker DNA is often associated with the fifth histone, H1.
So the nucleosome is the basic building block, the first level of packing.
But that's not nearly enough compaction, is it?
Not even close.
That first level, forming the 11 nanometer diameter nucleosome fiber, only compacts the DNA maybe six or seven fold.
Reduces its length to about a third of the original.
Okay, so what's next?
How does it get tighter?
The next level involves coiling that 11 nanometer fiber, the beads on a string, into a thicker structure, about 30 nanometers in diameter.
This is often called the solenoid.
A solenoid, like a coil.
Exactly.
This step usually involves histone H1, helping to organize the coil, and it gives another six fold compaction.
This 30 nanometer fiber is thought to be the sort of default state for much of the chromatin in the nucleus during interphase when the cell isn't dividing.
So even when the cell is just doing its normal business, the DNA is already pretty well organized into this 30 nanometer structure.
That's right.
But to get to a visible chromosome, you need more.
The third level involves folding these 30 nanometer fibers into a series of looped domains.
These loops seem to be anchored to a protein scaffold within the nucleus.
This compacts down to about a 300 nanometer diameter fiber.
Looped domains, okay.
Getting tighter still.
And finally, for cell division, these looped fibers undergo further coiling and folding to form the dense arms of a chromatid, which is about 700 nanometers wide.
Two sister chromatids joined together make up the highly condensed metaphase chromosome that we can see under the microscope, about 1 ,400 nanometers across.
Wow.
From 2 nanometers to 1 ,400 nanometers through multiple levels of coiling and folding?
It's an incredible hierarchy, and it's not just about storage.
This packing has to be dynamic.
It controls access.
Right.
That's the paradox, isn't it?
If it's packed so tightly, how does anything get to the DNA, like the enzymes for replication or transcription?
Exactly.
The DNA has to be accessible.
And that's where chromatin remodeling comes in.
The cell has ways to locally unpack and repack chromatin.
How does that work?
Well, detailed studies, like X -ray diffraction, showed the histones aren't just compact blobs.
They have these unstructured histone tails sticking out from the nucleosome core.
Tails?
Like little handles?
Kind of.
These tails are accessible and are targets for various chemical modifications.
These act like signals or switches.
What kind of modifications?
A really important one is acetylation.
Enzymes called HATs, histone acetyltransferases, add acetyl groups to lysine amino acids on the tails.
This neutralizes the positive charge of lysine.
And neutralizing the charge loosens the histone's grip on the DNA.
Precisely.
It tends to open up the chromatin structure, making the DNA more accessible, so acetylation is generally linked to gene activation.
Makes sense.
Open for business.
A great example is the bar body, the inactive X chromosome in female mammals.
Its histone H4 is known to be heavily underacetylated, keeping it tightly shut down.
Fascinating.
Are there other modifications?
Oh yes.
Methylation adding methyl groups to lysines or arginines is more complex.
Depending on which amino acid gets methylated and how many methyl groups are added, it can either activate or repress transcription.
So methylation is more versatile.
Right.
And phosphorylation adding phosphate groups introduces a negative charge and is involved in processes like chromosome condensation during mitosis and DNA repair.
Crucially, all these modifications are reversible.
So the cell can dynamically add and remove these tags.
Exactly.
It's a complex code, sometimes called the histone code, that helps regulate gene expression.
It's a major part of epigenetics changes in gene function without changing the DNA sequence itself.
Epigenetics.
Right.
And this remodeling is critical.
You mentioned Robert syndrome earlier.
Yes.
A tragic example.
It's a rare genetic disorder where there's a defect in how certain chromosome regions, particularly the centromeres and nearby heterochromatin, are maintained and separated during cell division.
This fundamental problem in chromatin organization leads to severe developmental issues, really highlighting how vital proper packaging is.
A stark reminder.
And it's not just histone modifications, right?
The DNA itself can be modified.
That's right.
Cytosine methylation, adding a methyl group directly to cytosine bases in the DNA, usually at CPG sequences, is another important layer of control.
Generally, high methylation in the gene's promoter correlates with that gene being silenced or turned off.
Okay, so multiple layers of control.
You mentioned heterochromatin just now in the context of Robert syndrome.
What exactly distinguishes it from euchromatin?
These are old terms from 1928, actually, based on how chromosomes looked under the microscope.
Euchromatin is the part of the chromosome that's less condensed, stains lightly, and is generally where active genes are located.
It's transcriptionally active.
The busy part of the cell cycle, even during interphase,
it stains darkly and is largely genetically inactive.
So it's transcriptionally silent, mostly?
Mostly.
It either contains few genes or the genes within it are repressed.
It also tends to replicate later in the S phase, the new chromatin.
Classic examples are the telomeres at the ends of chromosomes.
Which protect the ends.
Right.
And the centromeres, crucial for chromosome segregation,
also large chunks like most of the Y chromosome in mammals, and the entire inactive X chromosome, the bar body,
are heterochromatic.
And this location matters.
You mentioned the position effect.
Yes, that's a fascinating phenomenon.
Yeah.
If a normally active gene happens to get moved, maybe through a chromosome rearrangement, next to a block of heterochromatin, that gene can actually become silenced.
Its new neighborhood shuts it down.
Wow.
Location, location, location really applies to genes too.
So chromosomes have these distinct, active, and inactive neighborhoods.
How do scientists even tell different chromosomes apart visually?
You said they looked similar before the 70s.
It was tough.
But then chromosome banding techniques were developed.
These are staining methods that create unique patterns of light and dark bands along each chromosome.
Like R codes for chromosomes.
Exactly like VAR codes.
G banding is very common.
You treat the chromosomes with an enzyme, trypsin, then stain with GMSA.
It preferentially stains AT rich regions,
creating a specific pattern for each chromosome.
Another one is C banding, which specifically stains the constitutive heterochromatin, particularly around the centromeres.
These banding patterns are so consistent and unique that you can identify every chromosome and even spot tiny rearrangements or translocations between them.
It's hugely important for clinical genetics.
Incredible detail.
Now let's shift gears a bit.
One of the big surprises from sequencing genomes was just how much of the DNA is repetitive.
It's not all unique genes coding for proteins, is it?
Not even close.
A huge fraction of eukaryotic genomes consists of repetitive DNA sequences.
We could broadly categorize them.
One type is highly repetitive DNA, often called satellite DNA.
Why satellite?
Because when you centrifuge DNA,
this repetitive stuff often has a slightly different density.
So it separates out as smaller satellite bands away from the main bulk of the DNA.
Ah, I see.
Experiments back in the 60s and their partners again showed this DNA reassociates very quickly.
Why?
Because there are so many identical copies, they find each other easily.
Makes sense.
And where do you find this highly repetitive DNA?
It typically consists of short sequences repeated thousands or even millions of times, usually in long tandem repeats, meaning head to tail arrays.
And it's primarily found in heterochromatic regions, especially surrounding the centromeres.
The centromeres again.
So this DNA is important there.
Absolutely critical.
The specific sequences within the centromere, the CEN region, are essential for binding the protein complex called the kinetochore.
Which is what the spindle fibers attach to during cell division.
Exactly.
So these repetitive sequences aren't just filler.
They form the structural foundation for ensuring chromosomes segregate correctly.
It's fascinating to humans.
The main centromeric is called alphoid DNA, and it forms arrays up to a million base pairs long.
And there's even a special histone variant, CENPA, that replaces histone H3 specifically at the centromere, linking this DNA sequence directly to a kinetochore assembly.
So definitely not junk DNA, vital machinery.
What about other kinds of repeats?
There's also middle or moderately repetitive DNA.
This category includes some functional genes that exist in multiple copies, like the genes for ribosomal RNA.
We need lots of ribosomes, so we have multiple copies of those genes.
Okay, practical repetition.
But much of it is non -coding.
This includes things like VNTRs, variable number tandem repeats, also called minisatellites.
These are repeats of maybe 15 to 100 base pairs.
And the variable number part is key.
Exactly.
The number of times the sequence is repeated varies hugely between individuals.
This variation is the basis for DNA fingerprinting used in forensics.
And then there are STRs, short tandem repeats, or microsatellites.
These are even shorter, just two to five base pairs repeated over and over,
like CAN.
The number of repeats also varies greatly between people, making them incredibly useful as molecular markers for genetic mapping, population studies, and modern forensics.
So those variations in simple repeats are unique identifiers.
Pretty much.
And finally, we have repetitive transposed sequences.
These aren't usually clustered together.
They're scattered individually throughout the genome.
And they are, or were, mobile.
They can jump or be copied to new locations.
The jumping genes.
That's the nickname.
There are two main types in mammals.
Scions, short interspersed elements, and lines, long interspersed elements.
Okay, so nice first.
Scions are, well, short, less than 500 base pairs.
The most common one in humans is the allu family.
There are over half a million allu repeats scattered through our genome.
They make up something like 10 % or more of our DNA.
10%.
Just from this one type of element.
Yeah, it's amazing.
They're thought to have arisen from an RNA molecule that got reverse transcribed into DNA and then inserted back into the genome, potentially moving around via an enzyme called reverse transcriptase.
Reverse transcriptase, like viruses use.
Exactly, like retroviruses use.
Which brings us to lines.
These are much longer.
Around 6 ,000 base pairs.
The main one in humans is the allu family.
There are hundreds of thousands of these too.
And they move similarly.
Yes, but importantly, many lines actually carry the gene for reverse transcriptase themselves.
So the line DNA gets transcribed into RNA.
That RNA is then used as a template by the line's own reverse transcriptase to make a new DNA copy.
And that copy gets inserted somewhere else.
So they carry their own machinery for copying and pasting themselves.
Precisely.
That's why they're called retrotransposons.
Their mechanism is very similar to how retroviruses replicate.
And together, cyanids and lines make up a huge chunk of our genome, maybe over a third.
They've played a big role in shaping genome evolution, sometimes causing mutations or creating new gene variations.
Incredible.
So adding it all up, the repetitive DNA, the introns within genes, the regulatory regions, it leads to this striking fact.
Only a tiny fraction of our DNA actually codes for proteins.
It's one of the biggest surprises from genomics.
For humans, it's estimated that only about 2 % of our DNA consists of protein coding sequences, the exons of our roughly 20 ,000 genes.
Just 2%.
So what is all the rest of that stuff doing?
You hear terms like pseudogenes too.
Right.
Pseudogenes are interesting.
They look like copies of real genes, but they've accumulated mutations so they're no longer functional, usually not even transcribed.
They're like evolutionary relics.
So it really raises the question, if only 2 % makes proteins, what's the point of the other 98 %?
It feels like a library where most of the shelves are filled with something other than readable books.
That's a great analogy.
And for a long time, some of it was dismissed as junk DNA, but we now know that's far too simplistic.
A lot of this non -coding DNA is actually transcribed into RNA molecules that don't make proteins, but play crucial regulatory roles.
They can control gene expression, chromatin structure, and many other cellular processes.
So the dark matter of the genome is actually functional, just not in the way we first thought.
Exactly.
We're realizing it's an incredibly complex regulatory network, and discoveries are still ongoing.
Take copy number variations, or CNVs.
What are those?
These are relatively large chunks of DNA, sometimes including whole genes or groups of genes that can be present in different numbers of copies in different individuals.
So instead of just having two copies of a gene segment, one from each parent, someone might have one, or three, or four, or more.
Due to duplications or deletions?
Precisely.
Thousands of these CNVs have been found in humans.
And research is increasingly linking variations in copy number for certain genes to susceptibility or resistance to various diseases.
For instance, genes involved in immunity, like the defensein, DEF genes,
often show significant CNV, and differences in copy number might affect how individuals respond to infections or inflammatory conditions.
So understanding this large -scale organization, even the number of copies of certain regions, has direct health implications.
Absolutely.
It highlights that understanding DNA organization, from the nucleosome up to chromosome structure and copy number, isn't just academic.
It's fundamental to understanding health and disease.
It really is.
So we've journeyed from the incredibly efficient packing in viruses and bacteria, through those amazing giant chromosomes, the polythene and lamp brush ones, that gave us early peaks at gene activity, to the multi -level hierarchy of folding in eukaryotic chromatin, the dynamic remodeling with histone tails and modifications,
and finally, grappling with the
It truly is.
Nature's solution to an immense information storage and retrieval problem.
So as we wrap up, here's something to think about.
We've seen how intricately DNA is organized, but what are the limits?
Could it be packed even more efficiently?
And as we keep decoding the functions of that vast non -coding genome, that genomic dark matter, what other profound secrets about how life works are still hidden within its folds and sequences?
Great questions to ponder.
There's still so much to learn.
There really is.
Thank you for joining us on this deep dive today.
Keep asking questions, keep exploring and stay curious.
ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.
Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.
Support LML ♥Related Chapters
- Chromosome Structure and Organelle DNAGenetics: A Conceptual Approach
- DNA, Chromosomes, and GenomesMolecular Biology of the Cell
- Genes, Chromatin & ChromosomesMolecular Cell Biology
- DNA & Molecular Structure of ChromosomesPrinciples of Genetics
- DNA Organization in ChromosomesConcepts of Genetics
- DNA Organization, Replication, & RepairHarper's Illustrated Biochemistry