Chapter 4: Probability
Welcome to Last Minute Lecture.
This free chapter overview is designed to help students review and understand key concepts.
These summaries supplement not replaced the original textbook and may not be redistributed or resold.
For complete coverage, always consult the official text.
Imagine waking up like the morning of a massive life altering exam.
You swallow and there it is, a razor sharp sore throat.
Oh, that is just the absolute worst timing.
Right, it always happens then.
So do you take that cold medicine that might make you terribly drowsy or do you risk taking the exam through the pain?
Yeah, you have to weigh your options.
Exactly.
Without even realizing it, your brain just ran a highly complex mathematical algorithm.
You weighed the chances of the medicine working against the chances of falling asleep at your desk.
You instinctively calculated probability.
And you do that dozens of times a day, honestly.
We all do.
I mean, we tend to think of advanced mathematics as something confined to, you know, engineering laboratories or dusty chalkboards.
Or stressful exams.
Right, but probability is the actual math of real life.
It is how insurance empires decide what premium to charge you by calculating the exact risk of an accident.
Wow, yeah.
It's how governments decide where to allocate billions in infrastructure spending based on like weather patterns.
Which is exactly why we are so thrilled you were sitting at the table with us today.
Welcome to a special last minute lecture edition of our deep dive.
We are so glad you're here.
You are officially the third person in this study group.
And our mission today is very specific.
We are going to help you completely master chapter four on probability from the Cambridge International AS and A level mathematics course book.
We are getting right into it.
Yeah, we're going to decode the logic behind the formula so that you don't just memorize them you actually understand them.
Because honestly, if you understand the underlying mechanism of the math, the exam questions stop looking like these abstract traps and start looking like simple puzzles.
That is the goal.
So let's establish our baseline right away.
Since you are studying at the A level standard, we are going to bypass the elementary school recap of like what a fraction is.
Yeah, we trust you know how fractions work.
Exactly.
We simply need to agree on the scale.
In this universe, probability operates on a strict uncompromising spectrum.
From exactly zero, meaning an event is physically impossible.
Like me suddenly flying.
Right.
Zero chance.
To exactly one, meaning an event is an absolute undeniable certainty.
So anything that can possibly happen exists somewhere between zero and one.
You can dress that number up however you like for the examiner as a fraction, a decimal, or a percentage, but the scale never changes.
Never.
Now before we start calculating the odds of complex multi -step scenarios, we really have to talk about vocabulary.
Oh, absolutely.
The Cambridge text is incredibly specific with its terminology.
And if you don't know the difference between an outcome and an event, the exam will absolutely catch you out.
Language is everything here.
The foundational hierarchy is built on three terms, experiments, outcomes, and events.
Let's break those down.
First, the experiment.
Now in normal conversation, an experiment implies test tubes and lab coats, right?
Yeah, like chemistry class.
But in probability, an experiment is simply the physical action taking place.
So if you roll a standard six -sided die, the physical flick of your wrist, and the die tumbling across the table, that action is the experiment.
Okay, and the result of that tumbling die gives us the next term.
Yes, the outcome.
The textbook also refers to this as an elementary event.
Elementary event, got it.
An outcome is the specific, irreducible, single result of your experiment.
The die stops moving, and the number two is facing up.
Rolling a two is a single outcome.
Makes sense.
Finally, we have the event.
An event is a broader classification.
It is a specific collection or combination of outcomes that we happen to care about.
Let me test that distinction just to make sure I'm wrapping my head around it.
If I say I want to roll an odd number, rolling an odd number is the event.
Correct.
But that event actually contains three distinct outcomes, rolling a one, a three, or a five.
That is the exact distinction you need to hold on to.
An event is basically a bucket, and the outcomes are the specific items inside that bucket.
I like that.
The bucket analogy works.
Now, the text introduces a phrase that dictates how we calculate everything random selection.
Oh yeah, random.
In everyday slang, random means chaotic or unpredictable, like, oh, that was so random.
Right.
But in mathematics, random implies a state of perfect, flawless fairness.
It means the universe isn't playing favorites.
Exactly the opposite of chaotic.
If you draw a name from a hat, random selection means every single flip of paper has the exact same mathematical chance of getting selected.
When a scenario achieves that perfect fairness, we call the outcomes equiprobable.
Equiprobable.
Let's anchor that to the real world with the textbook's demographic example.
Imagine you are looking at a classroom of 19 students.
Okay, 19 total.
Yeah.
And the group is made up of 11 boys and 8 girls.
If the teacher randomly selects one student to answer a question, what is the probability that the chosen student is a boy?
Well, because the selection is truly random, every student is an equiprobable outcome.
Exactly.
So the underlying mechanism is wonderfully simple.
You just take the number of outcomes you actually want, the favorable ones.
Right, the 11 boys.
And divide them by the total number of equally likely outcomes, which is 19.
11 favorable outcomes over 19 total possibilities.
The probability is 11 over 19.
That simple ratio is really the beating heart of basic probability.
Just favorable over total.
Exactly.
But we also need to address what happens when we look at the entire scope of an experiment.
This introduces two crucial concepts,
exhaustive events and complementary events.
Okay, exhaustive.
That sounds like me after a long exam.
I mean, fair enough.
But in this context, it means you have exhausted all physical possibilities.
Oh, I see.
If you flip a coin, the outcome is either heads or it is tails.
There is no third option.
The coin isn't going to just suspend itself in midair.
Right, definitely not.
Because heads and tails cover every single possibility, they are exhaustive.
And because they cover everything, if you add their probabilities together, one half plus one half, they must equal exactly one.
Which naturally leads us to the complement.
If I know the probability of something happening, I automatically know the probability of it not happening.
That is the complement.
In your text, you'll see it written as the letter of the event with a small prime symbol, like a little apostrophe right next to it.
Like a...
Exactly.
Exactly.
Let's say event A is rolling a one on a die,
the probability is one sixth.
The complement, event A prime, is simply not rolling a one.
Okay.
Since the probabilities must sum to one, the complement must be five sixths.
So the formula is just the probability of event A plus the probability of not A equals one.
That feels like a very handy shortcut.
It really is.
Like if a test asks me to calculate the odds of a massive complicated event happening, it might actually be faster to calculate the odds of it not happening and just subtract that from one.
That is a highly advanced exam strategy and we will definitely see it come into play later.
Awesome.
But for now, we have our vocabulary and we can calculate the probability of a single isolated event.
Right.
But life is rarely a single isolated event.
So true.
The real world presents us with multiple options simultaneously.
So let's talk about how we handle parallel choices.
What happens when I need to know the probability of event A or event B happening?
That tiny two -letter word or is perhaps the biggest trap in the entire chapter.
Really?
Just or?
Oh yeah.
The general rule of thumb is that when you see the word or in a probability question, you're going to add the probabilities together.
Okay.
Simple enough.
But there's a massive condition attached to that.
You can only simply add them if the two events are mutually exclusive.
Mutually exclusive.
Let's clearly define what that physical reality looks like.
Mutually exclusive means the two events cannot physically exist at the same time.
They have zero common favorable outcomes.
Imagine our six -sided die again.
What is the probability of rolling an even number or a factor of five?
Let's list the buckets.
The even number bucket contains two, four, and six.
Right.
The factor of five bucket contains one and five.
Are they mutually exclusive?
Yes.
If you look at those two groups, there is absolutely no overlap.
No single face on that die is both an even number and a factor of five.
Perfect.
Because they are completely separate, your math is safe.
You simply add the probability of the even numbers three out of six to the probability of the factors of five two out of six.
So three sixths plus two sixths.
Exactly.
The total is five out of six.
But let's alter the parameters.
What if the exam asks for the probability of rolling an odd number or a factor of five?
Ah, that changes the physical reality completely.
The odd number bucket contains one, three, and five.
The factor of five bucket still contains one and five.
They share outcomes.
The numbers one and five live in both buckets.
They overlap.
They are not mutually exclusive.
This is where students panic.
Because if you blindly follow the add the probabilities rule, you add three sixths to two sixths and get five sixths.
Which feels right, but it's not.
Right.
That math is fundamentally flawed.
You just double counted the one in the five.
To save you from this trap, the Cambridge text utilizes the greatest visual aid in mathematics,
the Venn diagram.
Yes.
The Venn diagram is a lifesaver.
If you are ever staring at a confusing question, just sketch one.
Imagine a large rectangular box on your paper.
That box is the possibility space.
It contains everything.
Inside that box, you draw circles to represent your specific events.
The physical relationship between those circles tells you exactly what math to use.
If the circles are entirely separate, like two distinct islands, the events are mutually exclusive.
Add away.
But if they crash into each other.
Right.
If they crash into each other and overlap in the middle, you have a problem.
Let's explore a classic textbook survey scenario to illustrate this trap.
Let's hear it.
Imagine a survey about computer ownership.
The data shows that 50 % of people own a desktop computer.
60 % of people own a laptop computer.
Okay.
50 and 60.
The question asks what percentage of people own neither.
Okay, wait.
If I just look at the raw numbers, 50 plus 60 is 110%.
You told me earlier that probability cannot exceed 1 or 100%.
I did say that, yes.
So did the survey lie or are we dealing with ghost voters here?
Neither.
The survey is perfectly accurate, but the events are not mutually exclusive.
The crucial missing piece of data in that problem is the intersection.
The text goes on to reveal that 15 % of the participants own both a desktop and a laptop.
There are the double counted people.
Yes.
Picture the Venn diagram.
The desktop circle and the laptop circle overlap, and that shared middle section contains the 15%.
Okay, I see it.
This visual gives us the formal addition law formula.
The probability of A or B equals the probability of A plus the probability B minus the probability of A and B happening together.
So we add the 50 % desktop owners to the 60 % laptop owners, giving us that impossible 110%.
Right.
But then we subtract the 15 % overlap.
We remove the double counting.
110 minus 15 brings us back to reality 95%.
Which means 95 % of the survey group owns at least one type of computer.
The people who own neither are the ones standing outside those circles, inside the rectangular box.
Right.
Because the whole box must equal 100%.
Exactly.
So the neither group makes up the remaining 5%.
That physical visualization of subtracting the overlap makes so much sense.
So for your notes, if you're listening, the word or are usually means we add, but we must interrogate the scenario first.
If there is an intersection, we have to subtract it.
Always look for the overlap.
Now let's pivot from parallel options to sequential actions.
What happens when we want to know the chances of event A happening and then event B happening right after it?
This introduces a concept that is deeply tied to the philosophy of cause and effect independent events.
Independent events.
Two events are mathematically independent if the occurrence of the first event has absolutely no effect on the probability of the second event occurring.
The universe basically completely resets between actions.
And when we are dealing with independent events, the word A and D triggers our next major rule, the multiplication law.
Yes.
The probability of event A and event B happening is simply the probability of A multiplied by the probability of B.
But we should explain why we multiply rather than just reciting the rule.
When you multiply fractions, you are essentially taking a fraction of a fraction.
Okay, give me an example.
If there is a one -half chance that it rains tomorrow and a one -half chance that you forget your umbrella when it rains, you are taking half of a half.
You are left with a one -quarter chance of getting wrecked.
Oh, I see.
You are constantly whittling down the overall probability with each new condition.
The textbook offers a fantastic visual map for these sequential sequences.
The probability tree diagram.
Let's visualize their classic example.
You have a bag containing seven balls, two are blue and five are white.
Okay, got the bag in my head.
You reach in blindly, pick one ball, note its color, and then, and this is the critical physical action, you put the ball back into the bag.
That act of replacement is everything.
By putting the ball back, you reset the bag to its original state.
You ensure that your second draw is entirely independent of your first draw.
Because the bag has no memory of what you just did.
Exactly, the universe recessed.
Let's map it.
You start at a single point on the left side of your paper.
From that point, you draw two branches angling outward for your first selection.
The top branch represents pulling a blue ball, and we write the probability on that line two over seven.
The bottom branch represents pulling a white ball, so we write five over seven.
Now from the end of each of those first branches, we sprout two new branches for the second draw.
Okay.
So from the end of the first blue branch, we draw a new blue branch and a new white branch, writing two sevens and five sevens on them respectively.
We do the exact same thing from the end of the first white branch.
You now have a branching map flowing from left to right.
Here is how you use this tool on an exam.
If the question asks for the probability of a specific exact sequence, like pulling a blue ball and then a white ball, you walk along that specific pathway from left to right.
And as you walk across the branches, you multiply the fractions.
Two sevens multiplied by five sevens gives you 10 over 49.
But the questions are rarely that simple, you know.
What if the exam asks for a combined outcome?
What is the probability of pulling two different colored balls in any order?
Well, looking at our tree, there are two separate pathways that satisfy that requirement.
You could walk the blue then white path, which we just calculated as 10 over 49.
Right.
Or you could walk the white then blue path.
Five sevens multiplied by two sevens is also 10 over 49.
So you have two successful endpoints on the far right of your tree.
To find the total probability, you simply add those final fractions together vertically.
10 over 49 plus 10 over 49 equals 20 over 49.
The golden rule of the tree diagram is this multiply as you move horizontally across the branches and add as you move vertically down the final column.
The text also mentions an alternative to the tree, the possibility diagram, which looks like a grid.
You draw a seven by seven square representing the seven balls on the first draw and the seven balls on the second draw.
Yeah, that gives you 49 individual squares representing every possible equiprobable outcome.
You literally just count the squares that match your desired result.
It's a great tool if you are a highly visual learner who wants to physically see every combination.
It is excellent for finite equiprobable scenarios.
But we really need to address the elephant in the room regarding the tree diagram.
We stressed how important it was to put the ball back into the bag to maintain independence.
Yes.
And I have to challenge that scenario.
In reality, our choices have permanent consequences.
We don't always get to reset the universe.
Very true.
What if I reached into that bag, pulled out a blue ball and put it in my pocket?
The bag has physically changed.
It definitely has.
There are only six balls left and only one of them is blue.
The denominator has completely shifted.
How do we map a scenario where the second event is entirely dependent on the physical reality of the first event?
That challenge leads us directly to the final and most advanced concept in the chapter conditional probability.
Okay, here we go.
This is when the probability of an event is explicitly dependent on some additional information or previous action.
The notation looks a bit strange.
It is the letter of an event,
followed by a straight vertical line, followed by another event.
You read it aloud as the probability of event A, given that event B has already occurred.
Given that, those two words are the signal flare that everything has changed.
Let me offer an analogy to explain what conditional probability actually does to a math problem.
I'd love to hear it.
Imagine I take three blank cards and write the letters A, C, and E on them.
I shuffle them face down and ask you to pick one.
The probability that you select the letter E is one out of three.
A simple, equiprobable experiment.
But before you flip your chosen card over, I decide to give you a hint.
I say, I will tell you this.
The letter you are holding is a vowel.
Ah, the physical reality just shifted.
Exactly.
The letter C is a consonant.
Because of my hint, the letter C is instantly, entirely eliminated from the realm of possibility.
It's just gone.
Your universe of options just shrank from three cards down to two cards, A and E.
The probability that you were holding the E is no longer one third.
Because the universe shrank, the probability is now one half.
That is conditional probability right there.
The given condition acts as a filter that actively shrinks your denominator.
That is a phenomenal analogy.
The textbook demonstrates this shrinking universe perfectly with a data table regarding student subjects.
Imagine a classroom of 25 students.
Okay, 25 total.
The table breaks down exactly who studies biology,
who studies chemistry,
who studies both and who studies neither.
Now, listen very closely to the wording of the question.
Okay, I'm listening.
What is the probability that a randomly selected student studies chemistry,
given that they study biology?
Let's apply the filter.
The phrase is, given that they study biology.
Precisely.
Because that condition is absolute, you must throw out the rest of the data.
You completely ignore the total class size of 25.
Just ignore it completely.
You act as though the students who don't take biology do not exist.
You look at the table and see that 16 students in total study biology.
That number, 16, is your new shrunken universe.
It is your new denominator.
Once we have our new universe, we just look for our favorable outcomes inside it.
Out of those 16 specific biology students, how many of them also study chemistry?
The table tells us that nine of them do.
So the final probability is simply 9 over 16.
We didn't need a complex formula.
We just needed to understand how the condition filtered the data.
We shrank the universe and evaluated the remaining options.
This concept actually allows us to mathematically prove whether two events are independent.
If the probability of event A, given that event B occurred, is exactly the same as the probability of event A on its own meat.
Then the events are strictly inattentive.
Exactly.
Makes perfect logical sense.
If knowing that B happened didn't shrink your universe in a way that affected A, then B has no power over A.
They are independent.
Absolutely.
Okay, let's tie all of these rules, the addition, the multiplication, and the conditional shrinking universe into one final masterful scenario from the textbook.
It's a real -world problem about a brother and sister making weekend plans.
Oh, this is a brilliant synthesis of the entire chapter.
Here is the setup.
Every Saturday, a brother invites his sister to hang out.
70 % of the time, he suggests going to the theater.
30 % of the time, he suggests the cinema.
But her response is not a random coin flip.
It is entirely conditional based on his suggestion.
If he suggests the theater, she accepts 90 % of the time.
If he suggests the cinema, she rejects the invitation 40 % of the time.
We need to calculate the exact probability of her accepting an invitation on any random Saturday.
This sounds like a nightmare of percentages.
It does.
But if we build a probability tree, the logic sorts itself out beautifully.
Let's do it.
We start on the left with the brother's initial choice.
We draw a top branch for theater and assign it a probability of 0 .7.
We draw a bottom branch for cinema and assign it 0 .3.
Notice we're using decimals here instead of fractions.
It operates on the exact same 0 to 1 scale.
Now from the end of the theater branch, we draw the sister's conditional responses.
We draw an accept branch and write 0 .9.
Because exhaustively, she must either accept or reject, the reject branch must be the complement 0 .1.
From the bottom cinema branch, we draw her second set of responses.
The problem stated, she rejects the cinema 40 % of the time.
So the reject branch gets 0 .4.
That means her conditional accept branch for the cinema must be the complement 0 .6.
We now have a complete map of every possible weekend outcome.
We want to find the total probability of her accepting.
Looking at the tree, there are two distinct pathways that end in an accept.
Let's walk the first one.
The brother chooses theater NED, she accepts.
We multiply horizontally across those branches, 0 .7 multiplied by 0 .9.
Let's think about that logically.
Out of 100 Saturdays, he chooses the theater 70 times.
Of those 70 times, she says yes, 90 % of the time.
That results in 63 successful Saturdays, or mathematically 0 .63.
Now the second successful pathway.
He chooses cinema NED, she accepts.
We multiply across the bottom branches, 0 .3 multiplied by 0 .6 gives us 0 .18.
We have calculated the probability of the two separate mutually exclusive paths.
She cannot physically go to the theater and the cinema on the same Saturday.
Right.
So to find the grand total, we employ our very first rule, the addition law.
We add the end results vertically, 0 .63 plus 0 .18 equals 0 .81.
There is an 81 % chance she accepts his invitation.
By breaking it down visually, we turned a dense paragraph of overlapping conditions into a very manageable sequence of logical steps.
Multiply across the independent actions, add down the mutually exclusive results.
It really is that straightforward when you map it out.
Okay, take a deep breath.
Look at how far we've come.
We started by defining the very basic vocabulary of outcomes and events on a scale of 0 to 1.
We learned that the word OR means we add probabilities, but we must use Venn diagrams to interrogate the physical reality and subtract any overlapping intersections so we don't double count data.
We discovered that the word ANDD means we multiply probabilities, taking a fraction of a fraction, and that tree diagrams are the ultimate visual tool for mapping those sequential actions.
And finally, we conquered the phrase given that, realizing that conditional probability is simply the act of using new information as a filter to actively shrink our universe of possibilities, changing the denominator of our equation.
It's a remarkable progression of critical thinking.
And if I could leave you with one final thought to ponder as you close your notebook today.
Please do.
Think back to our opening scenario about the sore throat.
Everything we just mapped with balls and bags and survey data applies to you.
Every choice you make today, from the route you walk to class, to the specific friends you choose to study with, to the exact practice questions you decide to focus on, you are actively navigating a massive invisible probability tree diagram.
Oh, wow.
Every single decision you make is a node.
And every choice filters and changes the denominator of your future outcomes.
That is a staggering thought.
The math really is everywhere.
We truly hope this deep dive helped decode the logic of chapter four for you, and that the formulas now feel like intuitive tools rather than memorized rules.
On behalf of the entire last minute lecture team, thank you so much for sitting down to study with us today.
We wish you the absolute best of luck on your mathematical journey and tremendous success on that upcoming exam.
You have entirely got this.
Keep calculating those odds.
β This audio and summary are simplified educational interpretations and are not a substitute for the original text.
Using this chapter to study? Last Minute Lecture is free and student-run. If it helped, consider supporting the project.
Support LML β₯Related Chapters
- Discrete Probability DistributionsElementary Statistics
- Normal Probability DistributionsElementary Statistics
- ProbabilityElementary Statistics
- Probability Amplitudes & InterferenceThe Feynman Lectures on Physics
- Probability DistributionsCambridge International AS & A Level Mathematics Probability & Statistics 1 Coursebook
- Probability in Physics β Chance & UncertaintyThe Feynman Lectures on Physics