Chapter 7: Weighted Averages

Search this chapter

Audio Overview

0:00 / 0:00

Autoplay next chapter

Welcome to Last Minute Lecture.

This free chapter overview is designed to help students review and understand key concepts.

These summaries supplement not replace the original textbook and may not be redistributed or resold.

For complete coverage, always consult the official text.

Imagine you're tasked with figuring out the exact distance between two geological landmarks.

You hire this top -tier surveying firm, right, and they bring out a state -of -the -art laser rangefinder.

It's calibrated for atmospheric distortion, the works.

So nice.

Yeah.

And they give you a measurement that's accurate to within like a few millimeters.

But then somebody else walks up, maybe just some enthusiastic amateur, and they just pace out the distance foot by foot.

Oh, boy.

And they just shout out their final tally.

So now you, the listener we're talking to right now, you have two measurements for the exact same physical space.

And your confidence in those two numbers is obviously worlds apart.

Right.

Night and day.

Exactly.

But here is the challenge.

You are required to combine that laser reading and the pacing estimate into one single official best estimate for that distance.

Which honestly sounds like a really great way to just ruin perfectly good data.

The immediate instinct for anyone working in a lab or in the field is to just ignore the amateur pacing completely and polish the laser data.

Right.

In a simplified scenario, sure, you just drop it.

But in the actual practice of experimental physics,

it's, well, it's almost never that distinct.

Yeah.

You're routinely faced with multiple laboratories or multiple researchers measuring the exact same physical quantity.

And they all arrive at slightly different answers and they all report different margins of error.

You can't just

arbitrarily crown one the winner and discard the rest.

No, you really can't.

You need a rigorous way to combine them.

So welcome to this deep dive.

We are talking directly to you, whether you're prepping for a high stakes lab, reviewing for a final, or just, you know, trying to understand how the scientific community actually agrees on fundamental constants.

Yeah.

Think of this as your personalized one -on -one last minute lecture tutoring session.

Exactly.

And today we are focusing exclusively on chapter seven of John R.

Taylor's introduction to error analysis.

Right.

And the central mission here is mastering this concept known as weighted averages.

We need to really dig into the mathematical machinery that lets us take disparate measurements,

respect their individual uncertainties, and merge them into one ultimate best estimate.

Yes.

But, and this is a big, but before we get anywhere near the math, there's a strict prerequisite.

Taylor makes it very clear that this entire framework only functions if we assume all systematic errors have been identified and reduced to like a negligible level.

I really want to highlight that assumption because it's a trap a lot of students fall into.

We're assuming no one is using a stretched out tape measure or a faulty sensor that consistently reads like two degrees too hot.

Right.

If there is a fundamental flaw in your experimental design, mathematical tricks won't save your data.

They absolutely will We are operating under the strict condition that we're dealing solely with random error.

Just the random stuff.

Exactly.

And because the errors are random, we can rely on the fact that these measurements will distribute themselves normally, you know, following that classic Gaussian bell curve around the actual unknown true value.

And if you can't confidently assume a normal distribution of random errors, the formulas we're about to build will just, well, output mathematical fiction.

Total 7 .1.

We have two students, student A and student B measuring the same quantity.

We'll just call it X.

Okay.

Bog standard setup.

Right.

Student A does the work and reports a result of XA plus or minus sigma A and that sigma is their standard deviation, which represents their uncertainty.

Right.

Then student B goes through the exact same process and reports XB plus or minus sigma B.

So we have two values and two different bands of confidence.

And the immediate first step here is to check for a discrepancy.

You have to look at the absolute difference between their two core values.

If the gap between student A's answer and student B's answer is significantly larger than both of their uncertainties combined, you have a major problem.

Because that means their ranges of confidence don't even overlap.

Exactly.

It would be like one person saying the distance is 10 meters, give or take a centimeter.

And the other saying it's 30 meters, give or take a meter.

They are entirely inconsistent.

And when measurements are inconsistent like that, the procedure just stops.

You don't pass go.

Right.

You don't average them.

You don't try to find some happy middle ground.

Inconsistency on that scale indicates an unnoticed systematic error or, frankly, a severe blunder by one of the operators.

Someone messed up.

Yeah.

You have to go back to the equipment and figure out what went wrong.

Okay.

But assuming we do that check and we find they are consistent, like their confidence intervals do overlap, meaning they're pointing toward the same general neighborhood for the true value, we hit a bit of a mathematical wall.

We do.

Because the most common instinct for a student is to just add XA and XB together and divide by two, the simple average.

Right.

But applying a simple average to our opening scenario demonstrates exactly why that failed.

It actively degrades the quality of your information.

Right.

If you take a simple average of the laser range finder and the amateur pacing, you are giving 50 % of the voting power to the highly precise laser and 50 % to the guy walking heel to toe.

Which is ridiculous.

It completely dilutes the highly precise data.

We need a method that naturally grants more influence to the tighter, more precise measurement.

Which means we have to step away from basic arithmetic and look at the underlying probability of the errors themselves.

Okay.

So probability.

Right.

We can't just guess how much more to trust the laser.

We have to mathematically derive it.

And this brings us to what's called the principle of maximum likelihood.

And this is where the genius of Gauss comes into play.

Because if we're trying to find the true value, which Taylor represents with a capital X in the text, we have to work backward from the data we actually have.

We know the probability of student A getting their value follows that Gaussian bell curve.

Right.

And the shape of that curve is defined by an exponential function.

The probability is proportional to raise to a negative exponent.

And that exponent is key.

Very much so.

That exponent is the squared difference between the measured value and the true value, all divided by twice the variance or, you know, the uncertainty squared.

Okay.

So the further a measurement strays from the true value, the faster its probability plummets.

And because it's an exponential drop, I mean, it plummets incredibly fast.

So we have that probability curve for student A and a separate independent probability curve for student B.

If we want to find the true value that makes both of these independent measurements highly probable at the same time, we have to combine them.

Yep.

And just like any independent probabilities, like flipping coins, you multiply them together.

And the mathematical rule for multiplying exponential functions is that you just add exponents.

So our combined probability for both measurements relies on a new single exponential function.

And the exponent of this new function is just the sum of the two individual exponents.

I remember this from my statistics classes.

That combined exponent is referred to as chi squared, right?

Yes, chi squared.

But visualizing an exponent built out of fractions is kind of tough.

How should you be thinking about this chi squared value practically?

I like to think of chi squared as like an aggregated penalty score.

Oh, a penalty score.

I like that.

Yeah.

It's the sum of the squared deviations from the true value.

But, and this is the vital part, each deviation is divided by its own uncertainty squared.

So the uncertainty scales the penalty.

Exactly.

So if student A has a very small uncertainty, a really precise measurement, even a tiny deviation from the true value creates a massive penalty in the chi squared sum.

Because you're dividing by a tiny fraction.

Right.

But if student B has a huge uncertainty, they have a lot of leeway.

Their deviation doesn't add much to the penalty at all.

That makes total sense.

So we're trying to find the best estimate for the true value X.

And the principle of maximum likelihood states that our best estimate is the value that makes the actual physical observations the most statistically likely to have occurred.

Exactly.

We want to maximize that combined probability.

Okay.

So look at the structure of the combined probability equation.

The chi squared term sits in the exponent and it carries a negative sign.

Right.

It's e to the negative chi squared.

Because it's a negative exponent, if we want the overall probability to be as large as possible, we need that chi squared penalty score to be as small as possible.

We need to minimize it.

Yes.

And this is the exact origin of the famous method of least squares.

We are literally finding the value of capital X that results in the least possible sum of those squared weighted deviations.

And to find the absolute minimum of that curve, we use calculus.

We just take the derivative of the chi squared equation with respect to X.

And we set that derivative equal to zero.

Right.

Solving that derivative for X achieves our goal.

Gives us the mathematical best estimate.

But yeah.

The resulting algebraic expression is just a sprawling clunky fraction filled with deviations in the numerators and denominators.

I mean, it is computationally sound, but it completely lacks intuitive clarity.

It's just a mess of symbols.

So to clean up the algebra and reveal the mechanics of what the formula is actually doing,

Taylor introduces the concept of weights.

Yes.

We define the weight of any measurement labeled as the reciprocal of its uncertainty squared.

So the weight is one over sigma squared.

And by substituting this weight variable back into our clunky calculus result, the formula transforms into something just elegant.

It really does.

The best estimate for X becomes the sum of each measurement multiplied by its individual weight, all divided by the sum of the weights themselves.

It is a beautiful formula.

And Taylor's uses a fantastic analogy on page 175 to explain why this substitution is so profound.

This weighted average formula is mathematically identical to the formula used in physics to find the center of gravity of multiple objects.

I love that analogy.

It translates a statistical abstraction into a physical reality you can actually visualize.

Yeah.

So if you picture a rigid seesaw, the different measurements are the positions on the board where you place the objects.

The weights we just calculated act as the physical masses of those objects.

If one measurement is incredibly precise, its calculated weight is massive.

It acts like a bowling ball on the seesaw, pulling the balance point, which is the final average aggressively toward its own position.

And a sloppy measurement with a low weight acts like a feather.

It barely shifts the balance point at all.

Yeah.

The crucial mechanism here, though, is why that weight is so commanding.

The formula for weight isn't just the inverse of the uncertainty.

It's the inverse square.

Right.

One over sigma squared.

And that is not an arbitrary choice to make the algebra look pretty.

It emerges directly from the variance of

the Gaussian probability exponent.

And the practical effect of that inverse square is that precision is rewarded exponentially, while sloppiness is punished ruthlessly.

Exactly.

Consider a situation where one measurement is, say, four times less precise than another.

Its standard deviation is four times larger.

Because the weight formula squares that uncertainty in the denominator, you don't end up with a weight that is four times smaller.

You square the

in 16.

That less precise measurement is assigned a weight 16 times smaller than the precise one.

It goes from being a somewhat inferior measurement to having virtually zero voting power in the final calculation.

As Taylor notes, in many practical applications, a measurement with an uncertainty that much larger could simply be ignored entirely.

Because the math just snuffs it out.

Right.

The inverse square naturally isolates and neutralizes noisy data.

So we have our combined best estimate, which we call the weighted average, or x -wav.

But compiling a best estimate is only half the job in error analysis.

Right.

You need the uncertainty.

Exactly.

A combined value is basically useless if we don't also calculate the new overall uncertainty for it.

We need to know our margin of error for this new center of gravity.

And since the weighted average is a function of the original measured values, we determine its uncertainty using standard error propagation.

Taylor walks through this derivation, which results in equation 7 .1.

The uncertainty of the weighted average sigma -wav is calculated by taking the sum of all the individual weights, finding the square root of that sum, and taking the reciprocal.

In simpler terms, you add up every single weight you calculated, take the square root of that grand total, and divide one by that number.

And think about the implications of the denominator there.

You are summing up weights.

And weights are always positive numbers.

Always.

Every time you add another measurement into the mix, no matter how sloppy it is, you are adding a positive fraction to that total sum of weights.

Which means the total weight always increases.

And since the total weight is in the denominator, as it grows larger, the final uncertainty shrinks.

That is so cool.

Your final combined uncertainty will fundamentally always be smaller, meaning more precise than even your single sharpest individual measurement.

By combining data properly, you strictly increase the overall precision of your knowledge.

Additive precision.

And to see this architecture in action, we should really look at the concrete example Taylor provides in section 7 .3.

Oh, the resistance example.

Yeah, it perfectly illustrates how the math processes varying levels of quality without requiring, you know, subjective human intervention.

Okay, so the scenario involves three students in lab all tasked with measuring the exact same electrical resistance.

Right.

Student one is careful and gets a resistance of 11 plus or minus one ohm.

Student two is equally careful and gets 12 plus or minus one ohm.

Sounds good.

But student three clearly rushes through the procedure and reports a value of 10 plus or minus three ohms.

First things first,

the discrepancy check passes.

The core values of 11, 12, and 10 all overlap comfortably within their respective error margins.

There's no glaring inconsistency that would force us to halt the calculation.

But looking at the raw data, student three's uncertainty is three times larger than the other two.

I mean, my immediate reaction and probably the reaction of anyone managing this lab would be to exclude student three's data entirely.

Oh, absolutely.

It feels like it will just drag down the quality of the first two measurements.

But let's trace how the weighted average handles it.

Okay, so the first

uncertainties.

For student one and student two, their uncertainty is one.

And the inverse square of one is just one.

Right.

So both of their measurements carry a weight of exactly one.

But for student three, the uncertainty is three.

We square that to get nine and take the inverse.

So student three's measurement carries a weight of 19.

The next phase is building the numerator for the weighted average.

You multiply each student's reported resistance by their specific weight and sum those products together.

Okay, so the two precise students carry heavy influence there, contributing fully to the sum.

But the sloppy student's value of 10 is multiplied by 19, severely throttling its impact on the numerator.

Exactly.

Then you divide that combined numerator by the sum of the weights.

So we have a weight of one plus one plus 19.

You're dividing by roughly 2 .11.

And when you run those final numbers, the weighted average for the resistance comes out to 11 .42 ohms.

Right.

And then we apply the uncertainty formula we discussed earlier.

One divided by the square root of the sum of the weights.

The sum is 2 .11.

Yep.

The reciprocal of its square root gives a final combined uncertainty of 0 .69 ohms.

So applying the standard rules for significant figures in error analysis,

rounding the uncertainty to one significant digit and matching the main values decimal place, we get a polished final answer of 11 .4 plus or minus 0 .7 ohms.

The result is really clean.

But Taylor points out an underlying comparison that is truly illuminating here.

What would have happened if we followed your initial instinct?

Oh.

What if we subjectively decided student three was a liability and dropped their data entirely, simply averaging the two careful students?

Well, if we only look at student one and student two, their uncertainties are identical.

When uncertainties are equal, the weights are equal and the weighted average just collapses back into a simple average.

Right.

The midpoint between 11 and 12 is 11 .50.

And if you propagate the uncertainty for just those two, it comes out to 0 .71.

So relying exclusively on the two top tier students yields 11 .50 plus or minus 0 .71.

But utilizing the weighted average formula to incorporate all three students yields 11 .42 plus or minus 0 .69.

The results are practically identical.

The mathematical framework automatically executed what we wanted to do subjectively.

Yeah, it did the filtering for us.

Because student three's uncertainty was three times larger, the inverse square law dictated that their data was nine times less important.

The formula digested the sloppy data, extracted the tiny fraction of statistical value it actually contained, which is why the final uncertainty dropped slightly from 0 .71 to 0 .69 and it safely quarantined the noise.

It replaces arbitrary human judgment with a mathematically rigorous filter.

You do not have to debate whether a data set is good enough to include.

As long as you are confident in the calculated uncertainty, the method of least squares will appropriately scale its influence.

So to synthesize this for your own work, always verify consistency first.

If the ranges overlap, calculate the weight of each measurement as the inverse square of its standard deviation.

Use those weights to pull the average toward the most precise data and calculate your new, tighter uncertainty margin.

This isn't just a way to satisfy a grading rubric.

It is the fundamental protocol for how the global scientific community synthesizes distinct research into accepted truth.

It is the absolute bedrock of experimental consensus.

It really is.

But as we wrap up, I want to leave you with a physical and mathematical implication to consider.

Ooh, okay.

This entire elegant structure, the weighting, the method of least squares, the seamless blending of data,

it all rests completely on the assumption that random errors distribute themselves in a symmetric Gaussian bell curve.

Which, to be fair, is a remarkably reliable assumption for most standard physical measurements.

Right.

But what if they don't?

What if the physical phenomenon you're measuring naturally produces asymmetric errors, like a skewed distribution where an error on the high side is vastly more probable than an error on the low side?

How that throws a wrench in it.

Exactly.

If the underlying universe is throwing non -Gaussian curves at you, minimizing the sum of the squares no longer represents the maximum likelihood.

The formula will happily spit out an answer down to the third decimal place, but the structural integrity of that answer will be entirely compromised.

That's a scary thought.

It's a stark reminder that our statistical tools are only as robust as the assumptions we build them on.

Thank you so much for joining us on this deep dive.

Yes, thank you.

On behalf of the Last Minute Lecture team, we really appreciate you tuning in, and we wish you the absolute best of luck applying these concepts to your own lab work.

Just remember, the math will handle the noise, but only if you understand the shape of it.

ⓘ This audio and summary are simplified educational interpretations and are not a substitute for the original text.

Chapter SummaryWhat this audio overview covers

Combining multiple measurements of the same quantity with varying precision requires a systematic approach grounded in statistical optimization and error analysis. The foundation of this process rests on two prerequisites: measurements must exhibit consistency, meaning observed discrepancies align reasonably with stated uncertainties, and only random errors should remain after all systematic components have been identified and corrected. The Maximum Likelihood Principle provides the theoretical basis, establishing that the optimal estimate maximizes the probability of observing the measured values by minimizing chi-squared, a quantity representing the weighted sum of squared deviations from the true value. This optimization framework, formalized through the method of least squares, yields three interconnected mathematical tools. Weight assignment operates inversely to squared uncertainty, ensuring that more precise measurements contribute disproportionately to the final estimate while less certain measurements exert minimal influence. The weighted average emerges from multiplying each measurement by its corresponding weight, summing these products, and normalizing by total weight. The uncertainty attached to this combined estimate follows from error propagation principles and equals the reciprocal square root of the total weight across all measurements. A concrete illustration clarifies these relationships: when one measurement carries three times the uncertainty of others, its weight becomes one-ninth as large, rendering its impact nearly negligible on the final result. This approach transforms the problem of synthesizing diverse measurements into a disciplined mathematical procedure that respects the reliability hierarchy among data points while providing a defensible single estimate supported by quantifiable confidence bounds. The method accommodates real experimental conditions where measurement quality varies considerably, replacing intuitive or arbitrary averaging schemes with a principled framework that physics and engineering practitioners apply universally.

Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.

Support LML ♥

Chapter 7: Weighted Averages

Related Chapters