Chapter 11: Carbohydrates: Structure & Function

Search this chapter

Audio Overview

0:00 / 0:00

Autoplay next chapter

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replaced the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

Okay, let's unpack this.

When we talk about the great molecular machinery of life,

we tend to reserve the highest praise for the genome, you know, the blueprint and the proteome incredible workers who build everything.

And when most people hear the word carbohydrates, they immediately jump to too much simpler functional ideas.

Right.

Either it's fuel, like glucose providing that rapid energy, or it's just basic structure, like the rigid cellulose in a plant cell wall.

Exactly.

For a very long time, biochemistry really relegated carbohydrates to that secondary status.

They were the underlying girders and, you know, basic fuel for the architecture of the cell.

Right.

Just the scaffolding.

They were seen as peripheral necessary, sure, but certainly not where the real informational action was happening.

Not when you compare them to the vast informational storage capacity of DNA and RNA.

But the reality, and this is what we're digging into today based on our source material, is that this view drastically, drastically undervalues them.

Carbohydrates are far more complex, dynamic, and crucially information rich than we ever gave them credit for.

They form a secret language that's absolutely central to defining cell identity, cellular recognition, and communication between cells.

To immediately shift your perception, let's revisit that analogy of structure.

Think about an athlete running and the shock absorption that has to happen in their knees and feet.

Right.

That protective cushioning function is driven by massive highly charged molecules called glycosaminoglycans.

Which are carbohydrates.

They are large carbohydrate polymers.

They are not simple fuel.

They are complex shock absorbing hydrophilic architecture.

Their chemistry directly dictates their physiological function.

That's a perfect hook.

So our mission today is to go step by step through the structure of carbohydrates, teaching the fundamentals, while at the same time emphasizing how their structure dictates this really complex function.

We're moving the focus decisively from fuel to information.

We're going to explore how their incredible potential for structural diversity, which, and this surprised me, far exceeds the diversity found in proteins or nucleic acids,

is what allows them to act as that secret language defining cellular identity.

That's right.

And this explosion of understanding regarding their informational role has even spawned entirely new fields of study.

We're looking at glycobiology, which is the study of the synthesis and structure of carbohydrates and how they interact with other molecules, especially proteins and lipids.

And there's an omics field for it too, right?

Of course.

You have glycomics, which is the systematic study of the entire glycum.

That's the dynamic and often rapidly changing set of carbohydrates and carbohydrate associated molecules that a cell produces in response to its environment.

And the key, the core theme we will keep returning to, is that structural diversity.

The varying sizes, the stereochemistry, and the almost astronomical number of linkage possibilities.

That's the crucial property that enables them carry so much information.

It really is.

Let's just quantify that complexity early.

If you have, say, three different amino acids, they can only link up in a few ways, but if you have just three different monosaccharide units, you can form

thousands, thousands of unique structures.

That's the informational potential we are going to explore.

Exactly.

And to do that, we need to understand the building blocks first before we can really appreciate the library they create.

Let's do it.

Let's start at the fundamental definition.

What makes a carbohydrate chemically different from, say, a lipid or an amino acid?

At their core, we define carbohydrates as polyhydroxy aldehydes or ketones.

Okay, let's break that down.

Polyhydroxy.

It just means they are carbon -based molecules that are incredibly rich in hydroxyl or EAH groups.

For many of the simple ones, the empirical formula is often a hydrate of carbon, so CH2S of dollars.

A carbon for every water molecule, essentially.

Essentially, yes.

And the simplest functional units, the individual building blocks, are called monosaccharides.

And even the simplest units, which we often think of as just fuel, are fundamental constituents of the cell.

I mean, we all know the five -carbon sugar deoxyribose forms the structural backbone of DNA alternating with those phosphoryl groups.

A perfect example.

So chemically, how do we start classifying them based on their structure?

Classification really begins with the location of the carbonyl group, the C double bondo.

If the monosaccharide contains an aldehyde group, meaning the carbonyl is at the end of the carbon chain, it's an aldose.

Okay, an example.

A smallest one in this category is D -glyceraldehyde, a three -carbon sugar.

Now, if it contains a keto group where the carbonyl is typically within the chain, usually at the second carbon, C2, it's called a ketose, dihydroxyacetone is the classic example there.

So that one carbonyl location dictates the basic chemical family, and then we classify by size, right?

Triose, pentose.

Correct.

We use suffixes based on the number of carbon atoms, N.

So trioses are three -carbon sugars, like the glyceraldehyde and dihydroxyacetone we just mentioned.

Then you have tetrises for four carbons, pentoses for five.

Like ribose and deoxyribose, fundamental to nucleic acid structure.

Exactly.

And then the major players, the hexoses, with six carbons, this includes D -glucose, the essential energy source for pretty much all life, and D -fructose, a common dietary sweetener.

And it's fascinating that despite the large number of carbons, so many of these sugars are just different arrangements of the same basic formula.

It is.

Here's where it gets really interesting and where that diversity starts to build up, and that's with isomerism.

We can start simple with constitutional isomers.

Right.

Constitutional isomers are molecules that have the exact same molecular formula, but differ fundamentally in the order of attachment of their atoms.

So the connections are different?

The connections themselves are different.

As we mentioned, dihydroxyacetone and glyceraldehyde are both C3H60333, but the keto group in one is in a different place than the aldehyde group in the other.

This makes them chemically distinct molecules.

But the real explosion in complexity comes with stereoisomers.

Here, the atoms are connected in the exact same order, but they differ only in their spatial arrangement.

And this is entirely driven by the presence of asymmetric or chiral carbon centers.

Precisely.

And the most basic type of stereoisomer here is an enantiomer.

Mirror images.

They are non -superimposable mirror images.

Think of your left and right hands.

They are mirror images, but you can't lay one perfectly on top of the other.

D -glyceraldehyde and L -glyceraldehyde are enantiomers.

And crucially, the enzymes in living systems are incredibly specific about this.

They can tell left from right.

Absolutely.

Most vertebrate monosaccharides are found in the D configuration.

This D or L configuration is determined by looking at the asymmetric carbon atom that is farthest from the carbonyl group.

And it's a tiny but important detail that dihydroxyacetone is the odd one out.

It is.

It's the only monosaccharide that lacks any asymmetric carbon atoms, which makes it acryl.

It doesn't have a mirror image form.

Okay, so once we move past the simple three -carbon sugars, we enter the world of sugars with multiple asymmetric carbons, and this opens up the possibility of diastereoisomers.

Right.

These are stereoisomers that are not mirror images of each other.

The number of possible stereoisomers increases exponentially and is calculated as $2, where at N is the number of asymmetric carbons.

Wait, let's stop there and just emphasize that potential.

A six -carbon aldose, like D -glucose, has four asymmetric carbon atoms.

It does.

So 2M2432 gives us 16 possible diastereoisomers.

16.

That is 16 chemically distinct, naturally occurring sugars like glucose, mannose, galactose, that all share the exact same formula, C6H12 -66.

That density of structural information embedded in the basic building block structure is just staggering.

It is staggering, and it highlights why enzymes involved in carbohydrate metabolism have to be exquisitely precise.

Nature only selects and uses a small handful of those 16 possibilities.

Why wouldn't it use all of them?

Well, that selection is governed by evolutionary pressures and the specific recognition patterns required by different biological pathways.

The cell doesn't just want fuel, it needs information built into the fuel's very structure.

Okay, and we can narrow this down even further with the specific and really common category of diastereoisomers called epimers.

What defines an epimer?

Epimers are sugars that differ in configuration at only a single asymmetric center.

Everything else is identical.

So they're almost the same molecule, but not quite.

Exactly.

For example, D -glucose and D -mannose are epimeric at C2.

They are functionally different molecules recognized by different enzymes, simply because the hydroxyl group on the second carbon is pointed one way in mannose and the other way in glucose.

And another example.

D -glucose and D -galactose are epimeric at C4.

This subtle difference dictates major functional roles.

For instance, glucose is our primary metabolic fuel, while galactose is often incorporated into complex structural components and those recognition markers on cell surfaces.

So we've been talking about these complex sugars, and we usually draw them as straight chains on paper.

But the reality inside the aqueous environment of the cell is very different.

Very different.

The predominant forms of sugars like glucose are actually rings.

Why do they do that?

Why switch from an open chain to a ring?

It's all about stability and reactivity.

This cyclization happens through an intramolecular reaction.

That means the molecule reacts with itself.

The aldehyde, or ketone group, reacts with the hydroxyl group within the same molecule.

Okay, let's take an example.

Glucose.

For an aldehexose like glucose, the aldehyde group at C1 reacts with the hydroxyl group down at C5.

This forms what we call an intramolecular hemiacetal.

And the resulting ring structure has a name, right?

Based on a molecule it resembles.

Yes.

The resulting six -membered ring structure is called a pyranose, because it looks like the 6 -membered ring molecule pyran.

And what about for ketoses like fructose?

For ketohexoses like fructose, the C2 ketone typically reacts with the C5 hydroxyl group.

This forms an intramolecular hemiacetal, and the result is a five -membered ring structure called a furanose, because it resembles the molecule furan.

I see.

So the difference in ring size is dictated by which hydroxyl group grabs that carbonyl.

And this structure isn't fixed, which leads to some pretty interesting real -world consequences in food chemistry.

It really does.

Fructose is a great example.

Although it often forms that five -membered furanose ring, depending on the concentration and temperature, it can also form a six -membered pyranose ring.

And one is sweeter than the other.

Much sweeter.

The six -membered ring form, beta -D -fructo -pyranose, which you find in cold honey and corn syrup, is incredibly sweet.

But if you heat that solution, say you're making hot fudge, a portion of it converts to the five -membered beta -D -fructo -furanose form, which is noticeably less sweet.

This lability means the physical properties of the molecule are highly dependent on which ring structure it happens to adopt at that moment.

Now, when the sugar cycles, we create a new center of stereoisomerism.

So we're adding yet another layer of complexity.

Correct.

The former carbonyl carbon, that's C1 in glucose or C2 in fructose, is now a saturated carbon with four different groups attached.

That makes it a new asymmetric center.

We give it a special name, the anomeric carbon.

And the two resulting ring structures, which differ only at this new anomeric carbon, are called anamers.

They are.

And we distinguish these two anamers using the Greek letters alpha, alpha,

and beta.

What's the rule there?

For D sugars, the standard convention is that the alpha designation means the hydroxyl group attached to the anomeric carbon is on the opposite side of the ring relative to the C6 group.

Okay, so it's pointing down if you imagine the ring flat.

If you imagine it flat, yes.

The beta designation means the hydroxyl group is on the same side as C6, so it's pointing up.

And in a solution, it's not all one or the other.

It's a mix.

It's a dynamic equilibrium.

In an aqueous solution of glucose, you'll typically find a mixture.

About two -thirds is the betaromer, about one -third is the alphanomer, and that tiny but crucial less than 1 % remains in the open chain form.

And finally, that ring isn't just a simple flat circle on paper.

It exists in a specific three -dimensional shape, a conformation.

We should probably talk about the physical stability of those shapes.

Right.

The six -membered purinose ring is not planar.

The tetrahedral geometry of its saturated carbons dictates bond angles of about 109 .5 degrees.

So to satisfy that, it puckers.

It predominantly adopts the chair form.

The chair form.

And there's another one, the boat form.

Yes, the boat form, which is much less favorable.

Why?

Why doesn't the molecule just stay in the boat form?

What's the mechanical disadvantage there?

The boat form has significant steric hindrance and strain.

You can imagine two atoms sticking up at opposite ends of the boat, trying to occupy the same space.

We call that flagpole crowding.

It's very unstable.

The chair form resolves this by minimizing all steric hindrance and torsional strain.

It's the lowest energy state.

And that leads to the specific prevalence of one particular chair form in nature, betadiglucopyranose.

Exactly.

In the chair form, substituents can be in two primary locations.

Axial, which are perpendicular to the plane of the ring, sort of sticking straight up and down.

Or equatorial, which are parallel to the plane, sticking out around the equator of the ring.

So axial is crowded, equatorial is roomy.

That's the perfect way to think about it.

And betadiglucopyranose predominates because all of its bulky groups, all the hydroxyl groups and the primary alcohol group at C6, can be placed in those roomier, less hindered equatorial positions.

This leaves only small hydrogen atoms in the crowded axial positions.

It is the most energetically favorable arrangement possible for glucose, making it the dominant structure in solution.

Let's talk about reactivity.

Because the alpha and beta animers of glucose are continuously interconverting through that short -lived open chain form.

A process we call mutarotation.

Because of that, glucose still exhibits the chemical properties of a free aldehyde.

It can react with oxidizing agents.

It can.

And this capability defines it as a reducing sugar.

It can reduce the cupric ion, TeXP2 +, in common lab regions like Feelings or Benedict's solution to cuprous ion, TeXP8II.

That causes a color change, while the sugar itself is oxidized to gluconic acid.

And this chemical property has profound clinical relevance, especially in diabetes management.

It really does.

Reducing sugars, because of that reactive carbonyl group in the open chain form, can non -specifically react with free amino groups found on proteins.

This forms stable covalent bonds in a process we call non -enzymatic glycation.

Which leads us to glycosylated hemoglobin, or TeXHBA.

Exactly.

Since red blood cells have a lifespan of several months, about 120 days, monitoring the amount of hemoglobin that has been randomly glycated by circulating glucose gives you a long -term, averaged assessment of blood glucose regulation over the preceding two to three months.

It's invaluable compared to a single snapshot measurement of blood glucose, which can fluctuate wildly.

And while that monitoring is beneficial, the same process happening throughout the body is actually pretty bad.

It's very detrimental.

These reactions form irreversible products called advanced glycation end products, or AGEs.

AGEs, that's appropriate.

Very appropriate.

The accumulation of AGEs modifies the function of vital proteins like collagen, leading to the pathologies we see with long -term, poorly controlled diabetes.

Things like accelerated aging, vascular stiffness,

arteriosclerosis.

So if nature wants to build larger stable structures like chains, or attach these sugars to proteins, it needs to lock that anomeric carbon and stop it from entering the open chain form and becoming a reducing sugar.

Right.

This is where the formation of glycosidic bonds comes in.

A glycosidic bond forms between the anomeric carbon atom of one sugar and another molecule.

If the anomeric carbon links to the oxygen atom of an alcohol, it's an O -glycosidic bond.

These are the critical links used to form polysaccharide polymers and to attach carbohydrates to the hydroxyl groups of proteins.

And the N linkage.

If the anomeric carbon links to the nitrogen atom of an amine, it's an N -glycosidic bond.

We see this prominently in nucleic acids, where it links the ribose or deoxyribose sugar to the nitrogenous bases like adenine and guanine.

There's a third key modification and one that absolutely defines metabolism, and that's phosphorylation.

It seems like almost every sugar that enters a cell is immediately phosphorylated.

Why is this step so necessary?

Phosphorylation is the addition of a phosphoryl group, and it's catalyzed by a kinase enzyme.

Let's take glucose again.

The very first step of glycolysis traps it by forming glucose 6 -phosphate.

This modification serves a dual and really brilliant purpose.

Okay, what's the first one?

First, the phosphoryl group is highly anionic.

It carries a strong negative charge.

This negative charge prevents the sugar from crossing the lipid bilayer membrane.

It effectively traps the fuel inside the cell where it's needed for metabolism.

It's a one -way door.

Highly efficient molecular fence.

It's a perfect one -way door.

And second, the addition of the phosphate group creates highly reactive intermediates.

These are molecules that are now primed and energetically ready for all the subsequent in metabolism and biosynthesis, whether it's breaking the sugar down for ATP or using that derivative to build nucleotides or lipids.

Okay, so we're moving from the single building block, the monosaccharide, and we're starting to assemble them into chains.

This is where that informational capacity just begins to explode.

Right.

These chains are called oligosaccharides if they contain a few units or polysaccharides for many units.

They're built by linking two or more monosaccharides via those O -glycosidic bonds we just mentioned.

And they have a directionality?

They do.

Unlike proteins, which have a simple N -terminus to C -terminus direction, oligosaccharides have a directionality that's defined by the reducing end and the non -reducing end.

The reducing end being the unit with a free anomeric carbon that could, in theory, open up into that reactive open chain form.

That's the one.

And the presence of multiple hydroxyl groups on every single monosaccharide allows for a vast structural complexity that is completely unmatched by proteins or nucleic acids.

Let's revisit that incredible number we mentioned earlier.

The source material states that just three different hexases, say glucose, mannose, and galactose, could theoretically be linked together in the lab to form more than 12 ,000 unique structures.

It's an astronomical number.

Why is that complexity possible here, but not in, say, peptides?

Well, in peptides, the linkage is always the same.

It's between the alpha carboxyl group of one amino acid and the alpha amino group of the next.

It's a single repeating amyamide bond.

That's it.

But with sugars?

With a monosaccharide like glucose, you have four different hydroxyl groups and the anomeric carbon, and they are all available to form different linkages.

You can form an alpha one -dealer for a linkage, a beta one -dollar for a dollar linkage, an alpha dollar six -dollar linkage, and so on.

This ability to vary the linkage point, the stereochemistry of that linkage alpha or beta, and the order of the units means the informational density is radically higher in the glycum than in the proteome.

This is the structural freedom that allows carbohydrates to serve as these complex biological address markers.

Okay, let's ground that complexity in a few common two -sugar structures, or disaccharides that we encounter every day.

Let's start with the most famous one, sucrose.

Sucrose, common table sugar.

It's glucose linked to fructose.

And this molecule is metabolically efficient, but it's structurally unique.

The glycosidic linkage joins the anomeric carbon of both units.

Both of them.

The alpha C1 for glucose and the beta C2 for fructose.

If both anomeric carbons are tied up in that bond, what's the functional consequence?

Is it still a reducing sugar?

No, it becomes a non -reducing sugar.

Because neither sugar component has a free anomeric carbon, neither ring can open up into that reactive open chain form.

This makes sucrose chemically stable and non -reactive, which is great for things like long -term storage or transport in plants.

Then we have lactose, milk sugar, which is galactose joined to glucose by a beta $1, $4 glycosidic linkage that's hydrolyzed by the specific enzyme lactase in our small intestines.

And maltose, or malt sugar, which is two glucose units joined by an alpha $4, $4 linkage.

That often comes from the breakdown of starch.

And these molecules really demonstrate how specific our enzymes have to be to hydrolyze one specific linkage and not another.

Absolutely.

Okay, let's talk about the big storage polymers.

The need for these large polymers arises because, as we discussed earlier, you can't just store free glucose in high concentrations without severe consequences.

Right, the osmotic problem.

Exactly.

It would drastically increase the osmolarity of the cell, drawing in massive amounts of water and potentially causing the cell to just burst.

So nature's solution is to store glucose as units in large, osmotically inactive polymers, which we call polysaccharides.

These are large polymeric homogamers, meaning they are composed entirely of glucose units.

The primary storage homopolymer in animals is glycogen.

You find it most abundantly in muscle and liver tissue.

And its structure is primarily defined by linear chains of?

Of alpha $1, $4, and glycosidic bonds.

But to make it compact and rapidly accessible, it's highly branched.

And the branches are formed by a different kind of linkage.

Correct.

The branches are introduced by alpha $1, $6 glycosidic bonds, and they occur pretty frequently, roughly once every 10 glucose units.

What's the advantage of all that branching?

It means the molecule has many non -reducing ends.

This allows many enzymes to work on it simultaneously to quickly add or remove glucose units as the cell's energy demands change.

It's built for speed.

Plants, meanwhile, use starch as their nutritional reservoir, and that comes in two forms that are chemically similar but structurally distinct from glycogen.

Starch consists of amylose, which is the unbranched polymer of glucose linked only by alpha $4, $4 bonds.

It's just a long helical chain.

And then there's amylopectin, which is the branched form.

How does amylopectin compare to glycogen?

It's structurally similar, but it branches much less frequently, only about once every 30 linkages compared to every 10 for glycogen.

So glycogen is a much more rapid energy source.

Much more rapid.

Plant starch is slower to break down.

Starch constitutes the majority of carbohydrate we ingest, and those alpha $4 linkages are easily cleaved by our enzyme alpha amylase.

This is truly one of the most brilliant examples of how small structural differences dictate massive functional outcomes in biochemistry.

The profound difference between energy storage and structural rigidity comes down to the stereochemistry of that anomeric carbon, the distinction between the alpha and beta glycosidic linkages.

This is the critical takeaway when you're thinking about polysaccharides.

Cellulose, the most abundant organic molecule on Earth and the primary structural component of plant cell walls,

is also a homopolymer of glucose.

But instead of alpha $1 for linkages, it uses beta $1 for all linkages.

And the functional result is completely different.

How does that beta configuration change the molecule's overall architecture?

The beta configuration forces the glucose units to flip relative to one another, about 180 degrees.

This geometry favors the formation of long, rigid, straight chains, like stacked bricks.

These straight chains are perfectly suited to align themselves, allowing for extensive intermolecular hydrogen bonds to form between parallel chains.

What you end up with are these microfibrils that provide extraordinary tensile strength and rigidity.

It's optimal for structure a load -bearing ladder.

So the beta linkage creates a rigid, supportive ladder, while the alpha linkage creates a bend.

That's it, exactly.

The alpha $1 for linkages introduce a slight bend in the chain that is unfavorable for forming those straight parallel sheets.

Instead, the geometry highly favors the molecule twisting into a compact coiled structure, a hollow helix.

More like a spiral staircase.

A perfect analogy.

This helical structure is far more compact, it's more water soluble, and importantly, its non -reducing ends are easily accessed by enzymes.

This makes it optimal for rapid energy storage, like a coiled rope just waiting to be unfurled.

And because we mammals lack the specific cellulose enzymes needed to hydrolyze those beta $1 for linkages, cellulose becomes indigestible.

Yet it's still vital in our diet in the form of insoluble fiber.

Precisely.

Insoluble fibers, like cellulose, are physically abrasive.

They increase the speed at which digestion products pass through the large intestine, which minimizes exposure time to potential toxins.

And soluble fiber.

Soluble fibers, like pectin, form viscous gels that slow the movement of food through the GI tract.

And that's beneficial for controlled nutrient absorption, and it helps regulate blood sugar.

So even when we can't metabolize the sugar, the structure of its polymer dictates a critical function for us.

Okay, now we move to the realm where carbohydrates truly become an informational language.

Glycoconjugates.

Yep.

These are molecules where sugars are covalently attached to proteins or lipids.

Right.

And this process, called glycosylation, is not rare at all.

Approximately 50 % of the entire human proteome consists of glycoproteins.

Half of our proteins have sugars attached.

So we have three major functional classes to distinguish here, based on the ratio of protein to carbohydrate mass.

The first is simply glycoproteins.

In simple glycoproteins, the protein component is the dominant factor by weight.

These are absolutely vital.

You often find them embedded in the cell membrane, where they mediate crucial external communication events, like cell adhesion, pathogen recognition,

even sperm egg binding.

And secreted proteins too, right?

Yes.

Most secreted proteins, including the majority of the serum proteins in your blood, are also glycoproteins.

The second class, proteoglycans, flips that ratio entirely.

Entirely.

Proteoglycans are predominantly carbohydrate by weight, sometimes up to 95%.

They consist of a protein core that's conjugated to a specific type of polysaccharide called a glycosaminoglycan, or a GAG.

Gags.

Gags are polymers of repeating desaccharide units, one unit of which is always an amino sugar derivative.

And crucially, they possess an extremely high negative charge density.

They're covered in carboxylate and sulfate groups.

Why is that high negative charge density so necessary?

What does it do?

That charge is the key to their function as structural components and lubricants.

Because they are so highly anionic, they attract massive amounts of counter ions, and most importantly,

huge volumes of water.

They swell to fill space, providing cushioning and hydration.

Can you give us some examples of gags?

Sure.

Key gags include chondroitin sulfate, keratin sulfate, and heparin, which is a powerful natural anticoagulant.

It works by binding to and activating antithrombin, which then inhibits blood clotting.

And what happens if the body loses the ability to manage these highly charged gags?

If the lysisome, the cell's recycling center, fails to degrade these gags properly, they accumulate inside the cell.

This leads to devastating conditions called mucopolysaccharidosis.

Hurler disease is one of the better known ones.

And what's the pathology there?

The accumulation of these undigested, swelling, osmotically active carbohydrate polymers inside the lysosomes causes cellular dysfunction.

This leads to severe progressive symptoms, skeletal deformities, coarse facial features, developmental delays.

The failure of a single degradative of enzyme has catastrophic systemic consequences.

And the third class are the mucins.

Mucins, or mucoproteins, are also very high in carbohydrate content, up to 80 % by weight.

They are heavily O -glycosylated, meaning they are just bristling with sugar chains.

And they make mucus.

They are the key components of mucus.

They serve as powerful lubricants and create these robust, protective barriers in our respiratory and gastrointestinal tracts, shielding delicate cells from mechanical abrasion, pathogens, and corrosive substances like stomach acid.

And when a single protein can be glycosylated at multiple potential sites, and each of those can have a different branching pattern, this generates a massive number of distinct molecular entities.

Yes, we call them glycoforms.

This is why the complexity of the proteome is immediately and exponentially expanded by the glyco.

So how are these sugars linked to the proteins?

What are the bond types?

The sugars are linked to the protein via one of two primary types of glycosidic bonds.

We have the N -linked and O -linked attachments, named for the atoms they connect to.

The N -linked bond refers to the sugar being attached to the amide nitrogen atom of the side chain of the amino acid asparagine, or asin.

And there's a specific sequence requirement for a protein to be a substrate for N -glycosylation.

There is.

The asparagine must be part of the sequence asinexacer, or asin XCR, where X can be any residue except proline.

So it's a recognition signal.

It's a recognition signal for the enzymes that initiate the process.

Furthermore, all N -linked oligosaccharides, regardless of how complicated they eventually become, share a common structural foundation, a pentasaccharide core.

What's that made of?

It consists of three mannose and two N -acetylglucosamine residues.

This common core acts as a starting anchor for all N -linked additions.

And the O -linked bond?

O -linked glycosylation involves the sugar attaching to the oxygen atom of the hydroxyl groups found in the side chains of either serine or threonine.

This type often occurs as single shiver units or shorter, less complex chains than the N -linked glycans.

The hormone erythropoietin, or EPO, is a fantastic real -world example of how essential glycosylation is for functionality.

EPO is a great case study.

It's secreted by the kidneys, and its job is to stimulate the production of red blood cells.

It's a glycoprotein that is highly decorated.

It's 165 amino acids, yet 40 % of its total mass is carbohydrate.

It is N -glycosylated at three different asparagine residues and O -glycosylated at one serine residue.

So why is that 40 % carbohydrate mass so essential?

What happens if you strip the sugars off?

The glycosylation is absolutely essential for stability and longevity in circulation.

Studies show that the un -glycosylated protein, it still retains the ability to bind its receptor in vitro in a dish.

But not in the body.

But once you inject it into the bloodstream, it has only about 10 % of the bioactivity of the nater form.

The carbohydrates prevent the protein from being rapidly recognized and removed from circulation by clearance receptors in the kidneys and liver.

So the sugars act as a shield.

They essentially act as a shield, extending the protein's half -life so it has time to reach the bone marrow and perform its job.

And this structural difference has major pharmacological and regulatory consequences, particularly in competitive sports.

It does.

Recombinant human EPO, or ORAPO, is produced in cell cultures and is used medically, but it may have slightly different glycosylation patterns compared to the EPO naturally produced in the human body.

How different?

These are subtle differences, often relating to the degree of terminal sialic acid content, which alters the molecule's overall negative charge.

But drug testing labs, using techniques like isoelectric focusing, can distinguish between natural EPO and prohibited performance -enhancing re -EPO based on those tiny differences in the sugar coat.

So beyond stability, glycosylation is also intimately involved in cellular regulation.

It can act as a direct sensor for the cell's nutritional status.

Yes, through a process called GLCNA salation.

Which is the covalent attachment of N -acetylglucosamine, or GLCNAK, to serine or threonine residues.

How does this one modification manage to act as a sensor for the entire cell's nutritional profile?

The enzyme that performs this attachment, OGLCNAK -transferase, uses a molecule called UDPGLCNAK as its substrate.

And the concentration of UDPGLCNAK is directly downstream of the metabolic pathways that break down carbohydrates, amino acids, and fats.

So if there's a lot of food coming in?

Then there's a lot of UDPGLCNAK.

Therefore, the level of GLCNA salation inside the cell directly reflects whether nutrients are abundant or scarce.

High GLCNA salation is a signal of nutrient abundance.

This sounds like it should interfere with a much more famous regulatory mechanism, phosphorylation.

It does, and that interference is the core of its function.

The sites that undergo GLCNA salation are often the very same serine or threonine residues that are targeted for phosphorylation.

So they compete.

They compete.

This leads to crucial crosstalk between the OGLCNAK -transferase and protein kinases.

If GLCNAK is attached, it can block the kinase from attaching a phosphate, and that modulates that protein's signaling activity.

The cell uses this reversible process to rapidly adjust the activity of hundreds of proteins in response to nutrient availability.

Disregulation here is heavily implicated in conditions like insulin resistance and neurological disorders.

Now let's return to our starting analogy,

the shock absorber in the runner's foot.

This highlights the structural and mechanical function of proteoglycans.

Right.

We're talking about the cartilage matrix.

This consists of collagen for tensile strength and the massive proteoglycan agrikin for cushioning.

An agrikin is heavily decorated with those highly anionic glycosaminoglycans we mentioned, like keratin sulfate and chondroitin sulfate.

Those high negative charges are the functional core.

They act as molecular magnets.

Exactly.

The density of negative charges on the gags causes massive electrostatic repulsion between the chains, forcing them to adopt this extended hydrated conformation.

They attract and bind huge amounts of water and counter ions.

So when you apply pressure, when the foot impacts the ground.

The water is squeezed out of the agrikin structure.

This compresses the matrix and Christians the impact.

And the moment the pressure is released.

The negative charges push back.

The chains immediately re -expand due to that repulsion and the osmotic pressure created by the counter ions.

This causes the structure to rapidly reabsorb water and spring back to its original shape.

It makes agrikin the perfect material for resisting compressive forces.

And what happens in osteoarthritis?

The chronic condition of osteoarthritis often involves the progressive loss of these vital proteoglycans and the corresponding loss of water content from the cartilage.

And that leads to pain and stiffness.

Given the incredible complexity and specificity of glycosylation, the synthesis of these molecules must be highly regulated and compartmentalized.

Absolutely.

The major pathway takes place in two organelles that are central to protein trafficking.

The endoplasmic reticulum or ER lumen and the Golgi complex.

N -linked glycosylation starts in the ER.

And then both N -linked modification and all O -linked glycosylation are completed in the Golgi.

How does N -link synthesis even begin in the ER?

It's a pretty complex multi -step process.

A large oligosaccharide core, the precursor to that penicaccharide we mentioned, is first assembled piece by piece on a specialized lipid molecule that's embedded in the ER membrane.

It's called doliculphosphate.

So you build the sugar chain on lipid first.

You do.

This pre -assembled activated oligosaccharide is then transferred on block, all at once, to the asparagine residue of the growing polypeptide chain while the protein is still being synthesized.

And then the Golgi acts as this sophisticated sorting center and kind of finishing school for these carbohydrates.

It's the major processing plant.

As proteins travel through the stacks of the Golgi cisternae moving from the cis face to the medial face and finally to the trans face, the carbohydrate units are extensively trimmed, modified, and elaborated by a massive suite of specialized enzymes.

And this is where the address labels are put on.

Crucially, yes.

The Golgi ensures proteins receive their proper address labels directing them to their final destination, whether it's the lysosome, the plasma membrane, or for secretion out of the cell.

The creation of all these complex, energy -intensive glycosidic linkages requires specialized tools.

They are called glycosyltransferases.

These are the enzymes that catalyze the formation of specific glycosidic bonds.

They are responsible for generating the astronomical diversity we discussed.

And they need energy.

They do.

To make the formation of these new bonds energetically favorable, the carbohydrate donors are almost always activated sugar nucleotides, such as UDP glucose or GDP mannose.

Attaching the sugar to a high -energy nucleotide enhances its energy content, and that powers the biosynthesis reaction forward.

The functional power and specificity of these glycosyltransferases is best illustrated by the human ABO blood groups.

This shows how a tiny change in a surface carbohydrate can dictate life and death.

This is one of the most stunning examples of molecular identity.

All A, B, and O blood groups share a common oligosaccharide foundation that decorates the surface of red blood cells.

We call this the O antigen.

The difference between the blood types boils down to the presence or absence of a single final sugar that's added to that foundation.

So the O antigen is the base model, and the A and B types add a unique molecular hood ornament.

That's a perfect way to put it.

Individuals with type A blood inherit the type A transferase.

This enzyme specifically adds N -acetylgalactosamine via an alpha $1, $3 linkage to the O antigen?

N type B.

Individuals with type B blood inherit the type B transferase.

It's a slightly different enzyme, but it performs a similar function, adding galactose via the same alpha $1, $3 linkage.

And type O blood, what's going on there?

Type O results from a mutation in the transferase gene that causes the synthesis of an inactive enzyme.

It doesn't work.

Therefore, the cell can only make the O antigen foundation and it just stops there.

And that single difference is what your immune system recognizes as self or non -self.

Exactly.

It leads to catastrophic blood lysis if incompatible antigens are introduced during a transfusion.

Speaking of addressing and targeting failure, let's explore eye cell disease.

This is a devastating example of what happens when that Golgi sorting mechanism fails, specifically in adding an address label to a sugar.

Eye cell disease, or mucolypidosis II, is a lysosomal storage disease.

Normally, the cell produces many essential digestive enzymes that are destined for the lysosome, that cellular recycling center.

These enzymes require a specific carbohydrate address label, a terminal mannose 6 -phosphate marker, or M6P.

This is part of an N -linked oligosaccharide and it's added in the Golgi.

This marker is then read by a specific receptor that guides the enzymes to the lysosome.

And in eye cell disease, the enzymes are created, but they're missing that address label.

Exactly.

The patients are deficient in the enzyme that's responsible for adding the phosphate to the mannose.

It's called N -acetylglucosamine phosphotransferase.

So without that M6P marker?

The digestive enzymes are recognized as just generic secreted proteins.

They are incorrectly exported out of the cell into the blood and urine, rather than being delivered to the lysosome where they belong.

So the recycling center receives no workers.

What is the consequence of that failure?

The lysosomes accumulate large inclusions of undigested material glycosaminoglycans and glycolipids because the necessary digestive machinery is missing.

This buildup causes the lysosomes to swell, leading to cellular dysfunction and eventual death.

And the result for the patient?

The result is severe symptoms, including psychomotor retardation and the severe skeletal deformities often seen in children with the disease, all because the cell failed to attach one simple phosphate group to one sugar in the Golgi apparatus.

We've established that the complexity of these carbohydrate units on cell surfaces holds immense information.

It's the cellular alphabet.

Now we need to discuss the proteins that actually read that information.

These are the lectins.

Lectins are specialized glycan binding proteins.

They specifically recognize and bind to carbohydrate structures that are displayed on cell surfaces or on circulating molecules.

They are universal across all life.

They are essentially the readers of the cell's sugary language.

And their primary function is to facilitate specific cell -cell contact.

This is vital for building functional tissues, immune surveillance, and inflammatory responses.

A key aspect of their binding is that a single lectin typically contains two or more carbohydrate binding sites.

This allows it to bridge carbohydrates displayed on two different cells, creating cellular adhesion.

And how do they bind?

The individual binding interaction is achieved through multiple weak non -cavivalent interactions, hydrogen bonds, van der Waals forces.

This is where that Velcro analogy comes in, isn't it?

It is.

Individually, each bond is weak.

This means the connection can eventually be reversed or broken when needed, which ensures dynamic cellular processes.

But compositely, the multiple simultaneous bonds create a strong, highly specific link that holds cells together when required for things like inflammation or tissue repair.

And just to tie things together, the protein that guides those digestive enzymes to the lysosome, the mannose 6 -phosphate receptor we just discussed as a lectin, that is a prime example of a lectin.

It reads the M6P address label on the carbohydrate.

So lectins are organized into different classes based on their structure and requirements.

We have the widespread C -type lectins in animals, where the C stands for calcium requiring.

Right.

The calcium ion plays a key role here.

It often acts as a bridge, linking the protein to the sugar through interactions with the sugar's hydroxyl groups.

The most clinically significant family within the C -type lectins are the selectins.

And selectins are absolutely critical for the inflammatory response.

They mediate the essential binding of immune system cells to the sites of injury or infection.

They mediate that initial crucial interaction.

When inflammation occurs, the endothelial cells that line the blood vessels begin to display P and E selectins.

Passing leukocytes, the white blood cells, have specific carbohydrate ligands on their surface.

The selectins bind to these ligands, causing the leukocytes to slow down and begin rolling along the vessel wall.

This is a necessary step before they can stop, adhere firmly, and then extravasate, leave the bloodstream to enter the tissue site of injury.

And L -selectin even plays a surprising role in early development.

It does.

L -selectin, which is also involved in immune cell trafficking to lymph nodes, is utilized during human implantation.

It helps the developing embryo attached to the mother's uterine endometrium by recognizing a specific cell surface oligosaccharide on the uterine wall.

It really demonstrates the fundamental reliance on this sweet language for every step of life.

And what about the L -type lectins?

L -lectins are structurally different.

You find them abundantly in the seeds of leguminous plants, where they can act as potent insecticides against pests.

But L -type lectins are also critical molecular chaperones inside our own eukaryotic ER.

Like what?

Think of proteins like colexin and calreticulin.

They bind specific end -linked oligosaccharides on newly synthesized proteins, helping them to fold correctly, and ensuring they pass quality control before being shipped off to the Golgi.

Now, the ability of lectins to read cell surface carbohydrates is often hijacked by pathogens to gain entry to host cells.

This is a massive area of ongoing biological warfare at the cellular level.

It absolutely is.

And the influenza virus is the classic, most important example of this exploitation.

The virus uses its surface protein, hemagglutinin, as the molecular key.

Hemagglutinin is a lectin.

It binds specifically to sialic acid residues that are located at the very termini of the host cell's glycoproteins.

This is the lock -and -key mechanism that enables the virus to adhere to the respiratory tract cell surface and be engulfed by the cell.

Okay, that's the entry mechanism.

But after viral replication, the new viral particles bud out and they are still stuck to the cell surface by that same hemagglutinin sialic acid bond.

How do they escape to infect other cells?

To spread the infection, the virus uses a molecular pair of scissors.

It's another protein called neuraminidase, or sialidase.

This enzyme specifically cleaves the glycosidic bonds, linking that terminal sialic acid, freeing the newly formed viral particles to disseminate and infect surrounding cells.

And this viral exit mechanism provides a crucial clinical target for anti -influenza drugs.

Absolutely.

Inhibitors of neuraminidase, like aceltamivir, temiflu, and centamivir, relenza, are important anti -influenza agents because they block the virus's ability to cleave those sialic acid bonds.

They effectively trap the newly formed variants on the surface of the infected cell and prevent them from spreading the infection throughout the host.

And the carbohydrate binding specificity of hemagglutinin also explains species specificity, which is why something like avian influenza is difficult to transmit from human to human.

That's right.

Different species of influenza have evolved hemagglutinins that recognize slightly different carbohydrate sequences.

Avian influenza, H5N1, recognizes a carbohydrate sequence that, in humans, is typically located deep within the alveolar cells of the lungs.

This makes initial infection difficult.

Whereas seasonal flu?

Seasonal human influenza, however, recognizes sequences that are much more abundant in the upper respiratory tract, and this enables easy aerosol transmission via coughing and sneezing.

Finally, the malaria parasite Plasmodium falciparum uses a similar glycan -binding strategy for its complex two -stage invasion process.

It does.

The parasite initially uses glycan -binding proteins to attach to the glycosaminoglycan heparin sulfate on liver cells.

That initiates the first stage of infection.

Later, once it exits the liver and enters the bloodstream, it switches mechanisms.

It uses another glycan -binding protein to attach to the carbohydrate moiety of glycophorin on red blood cells for invasion.

So understanding these precise carbohydrate recognition events is critical for developing new anti -malarial therapies.

It's a key strategy.

The goal is to develop drugs that block that initial host -pathogen interaction.

Okay, so what does this all mean?

The core message, and it's validated so extensively by the source material, is that carbohydrates are not just fuel.

They're not just inert structural components.

They are sophisticated, information -rich molecules.

Nature does not construct complex, varied patterns when simple repeating ones will suffice.

The sheer number of structural possibilities afforded by the varied stereochemistry, the variable ring size, and the numerous possible linkage points, allows the glycum to encode a level of complexity that far exceeds that of a simple peptide sequence.

And we reviewed the primary structural consequence of that molecular difference.

The alpha linkages in starch and glycogen enforce that bent, hollow helix structure, which is optimal for compact, accessible energy storage.

In contrast, the beta linkages in cellulose enforce long, straight chains that are stabilized by hydrogen bonds, providing that necessary structural rigidity and high tensile strength.

And we saw the primary informational consequences all across the cell.

Glycosylation patterns define our blood types by adding a single terminal sugar.

They are essential for protein stability and longevity in circulation, dramatically boosting the half -life of hormones like EPO.

They act as cellular nutrient sensors via GLC and acylation, and they provide crucial address markers like mannose 6 -phosphate to direct enzymes to their proper destination.

The dynamic interplay of lectins and these surface carbohydrates guides vital cell contacts, inflammation, and even dictates how pathogens infect us.

This brings us to a final, provocative thought.

While the sequencing and characterization of the human genome, and increasingly the proteome, have been these monumental tasks, characterizing the entire glycum remains arguably biochemistry's next great frontier.

Because the number of possible unique oligosaccharides so greatly exceeds the number of tripeptides you can make from just three amino acids,

the depth of information encoded in this sweet language is truly immense.

What crucial mechanisms controlling cell recognition, function, and disease, what hidden informational pathways still await full discovery in this dynamic, complex, and beautiful sugary layer that's coating every single cell in your body.

It is the next major puzzle of molecular life.

We are only just beginning to read the language.

Thank you for joining us for this deep dive into the world of carbohydrates.

We hope you feel a little bit more well informed about the secret complexity of life's essential molecules.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers

Carbohydrates function far beyond simple energy provision, serving as information-bearing molecules that govern cellular recognition, structural stability, and intercellular communication through the integrated field of glycobiology. The foundational layer begins with monosaccharides, the elementary sugar units distinguished by their carbonyl positioning into aldoses and ketoses, whose three-dimensional arrangements produce distinct stereoisomers including enantiomers, diastereoisomers, and epimers that profoundly affect biological function. These straight-chain sugars undergo cyclization through hemiacetal or hemiketal condensation, forming five-membered furanose or six-membered pyranose ring structures with anomeric carbons at the ring closure point, capable of adopting multiple spatial configurations such as chair or boat geometries that influence reactivity and recognition. Individual monosaccharides condense through O-glycosidic or N-glycosidic linkages to generate increasingly complex carbohydrate scaffolds, from simple two-sugar disaccharides to elaborate polysaccharides serving distinct biological roles. Energy storage polymers such as glycogen in animals and starch's amylose and amylopectin components in plants employ alpha 1,4-glycosidic bonds in linear chains, enabling efficient glucose mobilization when metabolic demands arise. Structural polysaccharides, exemplified by cellulose's characteristic beta 1,4-linkages, form unbranched, rigid chains that provide mechanical support in plant cell walls and resist enzymatic breakdown. Beyond homopolymers, carbohydrates form covalent conjugates with proteins through glycosylation, generating glycoproteins and proteoglycans with profound physiological consequences. N-linked glycosylation involves asparagine residues within the endoplasmic reticulum, utilizing dolichol phosphate as an activated intermediate, while O-linked glycosylation decorates serine or threonine side chains, both processes requiring specialized glycosyltransferases throughout the secretory pathway. Genetic and biochemical disorders emerge when these systems malfunction, as demonstrated by ABO blood group determination through glycosyltransferase specificity, mucopolysaccharidoses including Hurler disease and I-cell disease resulting from defective glycosaminoglycan processing, and infectious disease mechanisms wherein carbohydrate-binding proteins like hemagglutinin enable viral attachment through recognition of sialic acid residues on host cell surfaces.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥

Chapter 11: Carbohydrates: Structure & Function

Related Chapters