Chapter 19: Microbial Taxonomy, Evolution & Classification
Welcome to Last Minute Lecture.
This free chapter overview is designed to help students review and understand key concepts.
These summaries supplement not replaced the original textbook and may not be redistributed or resold.
For complete coverage, always consult the official text.
Welcome back to The Deep Dive.
Today we are strapping in for an exploration into what is arguably the most revolutionary,
messy, and fascinating field in modern biology.
Microbial taxonomy.
The science of classification.
Right.
We're exploring the, well, the organizational framework of life.
Specifically how advances in genomics are really challenging the established boundaries of the microbial world.
For centuries, classifying life was based on what you could see.
Now we're classifying organisms based on DNA fragments we find floating in the ocean.
It's forcing us to question the fundamentals of what life is and how it's organized.
And the central audacious query from our sources that sort of sets the stage for this entire dive is, is the microbial universe actually expanding?
It absolutely is, yeah.
And most of it currently falls into this category of microbial dark matter.
We sort this into three camps, which helps put the scale in perspective.
Okay.
First, the explored, those we can culture and study easily like, you know, industrial microbes or common pathogens.
The ones we know well.
Exactly.
Second, the unexplored.
These are known only through metagenomic sequencing.
We see their genes, maybe understand some functions, but we just cannot grow them in the lab yet.
And finally, the undiscovered.
The true microbial dark matter that remains, well, completely unknown.
But thanks to next generation sequencing, we're pulling entire genomes out of the environment, which is why we've seen like hundreds of new bacterial and archaeophila proposed recently.
It's incredible.
And here's maybe the most mind bending part of this expansion.
Scientists are now finding gene sequences, sometimes in these large unusual viruses that don't align with any of the three accepted domains, bacteria, archaea, or eukarya.
Whoa.
Okay.
So that's, that's huge.
We are talking about the distinct possibility of a brand new domain of life existing right under our noses.
Okay.
Okay.
Let's unpack this then maybe by first laying the groundwork of how we actually classify these tiny entities.
Sure.
The whole process is called taxonomy.
It involves three core activities.
First classification, which is arranging organisms into hierarchical groups or taxa, then nomenclature, which is just naming them according to established rules.
And finally, identification, determining if a new isolate, something you've just found belongs to a recognized taxa and the broader study encompassing things like ecology, morphology, and genetics.
That's called systematics.
Okay.
So to ground ourselves, let's look back a bit.
For larger organisms like plants and animals, we had natural classification based on shared characteristics, right?
Linnaeus's system.
But you mentioned that didn't really work for something as morphologically simple as a bacterium.
So what did scientists fall back on before they had all this deep genetic data?
Well, they relied heavily on what we call phonetic classification, basically grouping microbes purely by observable similarity.
You know, things like the presence of flagella or motility patterns, metabolic tests, even.
Okay.
So how they looked and behaved.
Exactly.
It was useful for organizing things, for putting some order to the chaos, but it had a fundamental flaw.
Yeah.
Just because two microbes looked alike or acted alike didn't necessarily mean they were evolutionarily related.
Ah, okay.
And that's where the true paradigm shift happened then, moving to phylogenetic classification based purely on evolutionary relationships.
How did Carl Woese and George Fox give us the tool to figure out that evolutionary history that physical traits couldn't?
They leveraged the small subunit, SSU, our RNA sequences.
This was the molecular breakthrough, really, because the gene function is the same across pretty much all life, and it rarely tolerates large mutations.
So it changes very, very slowly.
It provided that stable molecular clock needed to accurately map microbial ancestry.
Something looks alone, couldn't do.
Right, right.
And today, classification isn't just one thing.
Modern species assignment is polyphasic, you said, meaning you have to combine phenotypic, genotypic, and phylogenetic features to make a valid assignment.
Absolutely.
Yeah.
It's a multi -pronged approach.
We still use the established hierarchy domain down through phylum, class, order, family, genus, and the species epithet.
But the sheer volume of new data, especially from sequencing, has led to this somewhat controversial classification,
superphylum.
It sits just above phylum.
Wait, okay.
If this rank is designed to help organize this massive influx of new data, why is it controversial?
Well, because critics argue it's sometimes based on
maybe insufficient data, perhaps only SSU RNA sequences, for example, and doesn't yet have enough robust genomic backing to really warrant a permanent spot in the hierarchy.
It's kind of a pragmatic necessity bumping up against traditional taxonomic rigor, you could say.
I see.
That makes sense.
We still use Linnaeus' binomial system, though, that Latinized, italicized, two -part name, genus plus species epithet.
Yep.
And if you want a great example of this molecular revolution in action, you mentioned Streptococcus faecalis.
It was completely reclassified as Enterococcus faecalis after RNA analysis proved it belonged in a whole different genus.
Exactly.
And we also must distinguish between a species, which is really a collection of strains that share similar stable properties, and a strain itself, which is basically the descendant of a single, pure microbial culture.
Okay.
And for permanence, every species has to have a designated type strain.
That's the official nameholder, the reference point.
Got it.
And if we want to talk about classifying that unexplored dark matter, we have to mention Candidatus.
You know, these microbes, known only through their genetic characterization, like Candidatus mycoplasma gereribii, precisely because they resist being grown in pure culture in the lab.
It's an official way of acknowledging life we can currently only see via its DNA.
Wow.
So we've gone from just looking at flagella under a microscope to sequencing whole lineages that we can't even grow.
That's a huge leap.
That brings us nicely to the modern molecular tool set itself.
How do scientists actually gather all this data?
What's some of the tech involved now beyond the classical stuff like morphology?
Well, before the deep sequencing methods really took off, we were still relying quite a bit on biochemical signatures.
For the culturable microbes, fame analysis, fatty acid, methyl ester, is still very useful.
It analyzes the unique profile of bacterial fatty acids, chain length, saturation, that sort of thing.
It's critical in areas like public health, food and water microbiology.
But as you pointed out, it does require you to be able to culture the organism first.
Right.
But you mentioned a non -sequencing biochemical powerhouse that has really changed clinical labs recently.
MALDI -TUF, matrix -assisted laser desorption ionization time of flight.
That's a mouthful.
Can you break down how that mass spectrometry technique works so quickly?
Yeah, it's really quite elegant.
Essentially, it gives you a super fast molecular barcode of the microbe.
You mix the sample with a matrix on a plate, a laser hits it, this ionizes the highly abundant proteins, mostly ribosomal proteins.
Okay.
Those charged molecules then fly through a vacuum tube towards a detector.
The time it takes for them to travel the time of flight determines their exact mass.
This creates a characteristic mass fingerprint, like a spectrum, allowing labs to identify known bacteria often within minutes by matching into a database.
That's incredible speed, especially for clinical work.
But when we talk about evolution, which is our ultimate focus here, we inevitably turn back to the genome itself.
Given that microbes have almost no fossil record, molecular analysis is pretty much the only way forward, right?
Absolutely.
So why are the SSU RNAs still considered the gold standard for defining genera, even with all the new tech?
Well, they just have the perfect characteristics for a reliable molecular clock.
Yeah.
They're ubiquitous, found in essentially all life.
They evolve very slowly because their function is so critical, they can't tolerate many mutations.
And crucially, they're rarely subject to horizontal gene transfer, HTT, which we'll definitely talk more about.
Okay.
So we use these sequences to find signature sequences, these short conserved stretches of DNA.
They're specific to certain phylogenetic groups.
They act almost like a molecular postcode, helping place organisms in the right family or genus.
And we have a pretty solid molecular roller now too.
You mentioned organisms with less than 98 .65 % SSU rRNA identity are reliably considered different species.
But for really fine tuning species and strain identification, we've shifted more towards whole genome sequencing, WGS metrics.
That's right.
WGS is definitely the future.
And in many ways, it's already here.
The modern standard for species classification is average nucleotide identity, ANI.
This compares the coding regions of two entire genomes head to head.
You generally need at least 95, 96 % identity across those shared genes for them to be considered the same species.
And importantly, ANI is computationally replacing the older, much more labor -intensive DNA hybridization method.
BDH was the old gold standard, but it was cumbersome, required a lot of DNA, and wasn't always reproducible between labs.
ANI is calculated from sequence data, making it faster, cheaper, and more standardized.
And one of the simplest WGS metrics is just G plus C content, right?
Just the percentage of guanine and cytosine bases.
Yep.
A very basic but useful check.
It tends to be relatively constant within a species, or at least within a narrow range, and it's super easy to calculate directly from a full genomic sequence data.
Okay.
So what about classifying below the species level, down to the strain level, where things might change even faster?
Right.
For that, we need techniques that track genes that evolve more rapidly than rRNA.
For example, multi -locus sequence analysis, or MLSA.
Some comes called WGMLSA for whole genome
MLSA.
This compares the sequences of at least five, often more, conserved housekeeping genes across different strains.
Housekeeping genes being?
Genes essential for basic cell function, things involved in metabolism or DNA replication.
If strains share the same versions, the same alleles of these genes, it suggests they're very closely related, likely part of the same recent lineage.
Or we can track SNP single nucleotide polymorphisms.
These are literally single base pair differences in conserved regions of the genome.
Tracking these SNPs gives us incredible resolution to reveal fine scale evolutionary changes, like tracking an outbreak strain.
Okay.
So we have all this data,
SSU, rRNA,
ANI, SNPs.
How do we actually visualize these complex evolutionary relationships?
Right.
If we connect this to the bigger picture,
we visualize these relationships using phylogenetic trees.
Think of them as maps of inferred evolutionary history.
On these trees, the nodes represent divergence events where lineage is split.
And the branch length means something.
Yes, absolutely.
The branch length is actually meaningful.
It usually correlates with the number of molecular changes, like mutations, that have occurred along that lineage since the divergence.
The organisms we're actually studying sit at the tips of the branches.
Those are the operational taxonomic units, or OTUs, and a group of reloaded OTUs sharing a common ancestor is called a clade.
And these trees might be unrooted, just showing relationships, but not necessarily the oldest common ancestor, or they can be rooted, which includes a specific node representing that ancestor.
Exactly.
And we could often root an unrooted tree computationally by simply adding an outgroup, which is just a known distantly related species, including the outgroup, helps establish the historical context, and pinpoints the likely root of the tree you're interested in.
Okay, that makes sense.
But you mentioned a complication earlier.
The largest hurdle in microbial phylogeny remains horizontal gene transfer, HGT.
Yes.
HGT is, well, it's a huge factor.
It's the frequent exchange of genetic material between often distantly related organisms, even across different domains.
It means microbial evolution isn't purely linear, not strictly tree -like, like we see with animals.
Genes jump around.
That sounds like complete chaos.
How do genomic scientists even manage HGT when trying to define a species or understand its evolution?
They address the chaos by defining two essential contrasting components of the genome, the core genome and the pan genome.
The core genome includes only those genes found in all members of a species or group.
Think of these as the essential survival genes, things like informational proteins, RNA genes,
basic metabolic pathways.
Okay, the non -negotiables.
Right.
By contrast, the pan genome is the core genome, plus every additional gene found in at least one strain within that group.
These accessory or flexible genes are very often acquired by HGT, and they're what enables rapid adaptation to new niches, antibiotic resistance, things like that.
And the difference can be huge.
Oh, absolutely.
The contrast is really illustrative.
Take Bacillus anthracis, the cause of anthrax.
Its core and pan genomes are very similar in size.
It reflects a relatively limited stable diversity, probably because it occupies a very specific pathogenic mich.
Right.
But compare that to something like E.
coli.
Its core genome might be around 2 ,800 genes, but its pan genome is estimated to contain potentially up to 37 ,000 different genes found across all known strains.
That massive difference just showcases the incredible genetic diversity that HGT enables in highly adaptable general species like E.
coli.
Wow, 37 ,000 genes.
That incredible flexibility enabled by HGT really raises huge questions about how we define life itself, doesn't it?
So transitioning now perhaps to the deepest questions,
how did complex life actually begin and what truly defines microbial species anyway?
Let's first look at the evolution of eukaryotes, cells like ours with a nucleus.
Yeah, the origin of the nucleus is still debated, but evidence now strongly suggests that the first eukaryotic cell actually arose from within the archaeal lineage.
This is really highlighted by the recent discovery of the Asgard superphylum.
Asgard, like the mythical realm.
Exactly.
These are Archaea, known mostly from DNA sequences found in deep ocean sediments that surprisingly encode many eukaryotic -like proteins.
Things involved in complex processes like membrane trafficking, cytoskeleton formation, even phagocytosis engulfing other cells.
So our distant ancestors might have been Archaea that already had some eukaryotic features.
That's the implication, yeah.
It suggests eukaryotes didn't just emerge alongside Archaea, but rather from a specific group within them.
It's reshaping our view of the base of the eukaryotic tree.
Okay, and while the nucleus's origin is maybe still being worked out, the origin of the organelles inside eukaryotes, mitochondria, and chloroplasts, that's generally accepted under the endosymbiotic theory, right?
Yes, that's very well established.
The theory proposes that a proto -eukaryotic cell, likely one of these Asgard -like Archaea, first engulfed an aerobic proteobacterium.
Over time, that bacterium evolved into the mitochondria, the cell's power and chloroplasts came later.
Right.
A subsequent separate event involved the descendant of that early eukaryote engulfing a cyanobacterium, probably something related to modern Prochlorococcus.
That cyanobacterium then evolved into the chloroplast, enabling photosynthesis.
And the evidence for this is pretty strong.
Very strong.
Mitochondria and chloroplasts still have their own single circular chromosomes, much like bacteria.
They replicate by binary fission, independently of the cell nucleus,
and they possess 70S ribosomes, the bacterial type, not the 80S type found in the eukaryotic cytoplasm.
It all points clearly back to their bacterial ancestry.
Okay, so if that's the origin story for complex cells, what about defining a species down at the microbial level?
Because the traditional biological species concept, where species are defined by their ability to interbreed sexually, that just fails completely for bacteria and archaea, doesn't it?
They mostly reproduce asexually.
Exactly.
It fails entirely, which causes, frankly, intense confusion and debate.
For instance, you mentioned Bacillus anthracis, the deadly pathogen.
Genetically, it's incredibly close to Bacillus serius, a very common soil and food spoilage microbe.
Based purely on DNA similarity, many argue they should be lumped together as one species.
But they're not.
They're not.
Because B.
anthracis it gets a separate species designation based primarily on its pathogenicity, its distinct ecological role, even though its core genome is almost identical to some B.
serius strains.
So what is the current operational gold standard for defining a new microbial species, as recommended by the ICSP, the International Committee on the Systematics of Prokaryotes?
Well, they look for a convergence of evidence, multiple criteria met simultaneously.
The key quantitative ones are usually 70 % or greater DNA, DNA hybridization similarity, or its modern movement around 95, 96 % ANI.
98 .65 % or greater 16S RNA gene sequence homology.
And the G plus C content difference should be less than about five degrees C melting temp difference, which corresponds to a small percentage difference.
Plus, there needs to be significant phenotypic similarity.
They have to look and act alike, too.
Okay, but wait, the sources point out something pretty striking here.
If those strict criteria, 70 % DDH, 98 .7 % CS rRNA similarity, or applied to eukaryotes, all primates that includes monkeys, apes, and us humans would be lumped into a single species.
How can scientists stand by a definition that seems so ridiculously narrow compared to how we classify animals?
Yeah, it really highlights that the microbial species is fundamentally an operational concept, a pragmatic tool for organization, not a reflection of the same biological reality as in sexually reproducing organisms.
We use these type molecular thresholds to avoid confusion, especially in clinical and industrial settings where precise identification is critical.
So it's about utility, not necessarily deep evolutionary disurgence.
Exactly.
We know the definition is flawed when comparing the vast fluid genetic diversity bacteria to say, mammals, but it provides a necessary albeit imperfect framework for communication and classification in microbiology.
It's a tool, not necessarily a perfect mirror of natural evolutionary boundaries.
Okay.
This raises an important question then.
If that gold standard is so strict, how do new microbial species actually evolve and stick around long enough to be recognized?
Well, we think it involves a couple of primary mechanisms.
There's antigenesis, sometimes just called genetic drift.
That's the slow accumulation of small random genetic changes over long periods.
But the mechanism that likely creates clear new lineages more rapidly relies on adaptive mutations.
These are those rare mutations that happen to confer a significant growth advantage in a particular environment.
Because they provide an edge, they are strongly selected for and can sweep through the population becoming fixed.
And this ties into the ecotype model.
Precisely.
An ecotype is defined as a population of microbes that are genetically quite similar, but ecologically distinct.
They suggest that when one of these adaptive mutations arises, it allows that specific lineage to out -compete its close relatives during what are called periodic selection events.
Like a selective sweep.
Exactly.
That successful lineage effectively wipes out the less fit competitors in that niche.
This process, often described as punctuated equilibria periods of stasis, punctuated by rapid change, fixes the new beneficial traits and drives the divergence that ultimately leads to speciation.
And where does HGT fit into this evolutionary process?
That's key.
While mutation might drive the initial divergence, the splitting off from a relatively homogenous population, HGT is often the engine that drives rapid adaptation and niche expansion after that initial split, especially in groups that are already diverse.
It allows microbes to quickly acquire new metabolic capabilities,
resistance mechanisms, or virulence factors from their neighbors, accelerating their adaptation to new environments.
Think of the thermotopia adapting to hot springs by borrowing genes from archaea.
Right.
Adding tools from elsewhere.
Okay, finally, before we wrap up, we should probably mention the ultimate reference source for all this classification.
Absolutely.
Berge's Manual of Systematics of Archaea and Bacteria is the accepted, modern, definitive reference guide.
It's now primarily online and continuously updated.
And crucially, it has completely shifted its organizational principle away from just phonetic traits, how things look or act, to being based almost entirely on phylogeny, the evolutionary relationships derived from molecular data like RNA and genome sequences.
A living document reflecting this molecular revolution.
Exactly.
So to maybe summarize this whole deep dive, microbial classification is, well, it's a dynamic, rapidly evolving, and inherently polyphasic field.
It absolutely relies heavily now on comparing molecular markers like SSU RNA and whole genome metrics like ANI.
And it's constantly grappling with the challenges posed by rampant HGT, often using concepts like the core and pan genomes to make sense of it all.
The scale is just staggering to think about.
Estimates suggest there could be up to a million distinct microbial species out there.
Maybe more.
That means we potentially have about a billion times more microbes on Earth than stars in the entire observable universe.
Just mind -boggling It really is.
And considering the complexity revealed by things like the Asgard superphylum,
remember, organisms known initially only through DNA fragments and the sheer vast volume of microbial dark matter still remaining unexplored, makes you wonder how might our operational definition of a microbial species need to change again in the next decade, especially to accommodate organisms that fundamentally reshape the tree of life, even if they resist culture and are defined solely by their vast, unique genomic footprint.
That's a powerful thought to end on.
It really underscores that our understanding of life's organizational chart is, maybe, still just a rough draft constantly being revised.
Thank you for diving deep into the microscopic world with us today.
We hope this exploration into the messy, expanding, and utterly vital microbial universe has given you, our listeners, a fresh perspective on the vast majority of life on this planet.
We'll catch you next time on the Deep Dive.
ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.
Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.
Support LML ♥Related Chapters
- Genomes and Their EvolutionCampbell Biology
- Microbial Evolution and Genome DynamicsBrock Biology of Microorganisms
- Molecular Evolution and PhylogeneticsiGenetics: A Molecular Approach
- Eukaryotic Microbes – Structure & DiversityPrescott's Microbiology
- Eukaryotic Organelles and the Origin of GenesStrickberger’s Evolution
- Evolution of Microorganisms & MicrobiologyPrescott's Microbiology