Discrete random variables
This chapter establishes the foundational concepts of discrete random variables, encompassing their probability distributions, expectation, and variance. A thorough understanding of these principles is critical for subsequent advanced topics in probability and statistics, and is consistently evaluated in CMI examinations through both theoretical and applied problems.
---
Chapter Contents
|
| Topic |
|---|-------| | 1 | Probability distribution | | 2 | Expectation | | 3 | Variance at school level | | 4 | Simple modelling problems |---
We begin with Probability distribution.
Part 1: Probability distribution
Probability Distribution
Overview
A probability distribution describes how probability is assigned to the possible values of a random variable. In CMI-style questions, this topic is not just about definitions. It includes checking whether a formula really defines a distribution, building distributions from experiments like coin tosses and die throws, and computing exact probabilities from repeated independent trials. ---Learning Objectives
After studying this topic, you will be able to:
- Define and use the probability mass function of a discrete random variable.
- Check whether a given table or formula is a valid probability distribution.
- Construct distributions from simple experiments.
- Use Bernoulli and binomial ideas in repeated independent trials.
- Compute probabilities such as βexactly β, βat least oneβ, and βat most β.
Core Idea
A discrete random variable is a variable that takes only finitely many or countably many values.
Its probability distribution is given by the function
called the probability mass function or pmf.
Valid Probability Distribution
For a discrete random variable with pmf , the following must hold:
- for every possible value of
The first condition says probabilities cannot be negative.
The second condition says the total probability over all possible values must be exactly .
Distribution Table
A discrete distribution is often written in tabular form. | | | | | | |---|---|---|---|---| | | | | | | where ---Cumulative Distribution Function
The cumulative distribution function of is
- is nondecreasing
- as ,
- as ,
Expectation and Variance
If has pmf , then
The variance of is
where
Bernoulli Distribution
A Bernoulli random variable takes only the values and .
If
then has Bernoulli distribution with parameter .
- success = βhead occursβ
- success = βdie shows an even numberβ
- success = βoutcome divisible by β
Binomial Distribution
If an experiment is repeated times independently, and each trial has success probability , then the number of successes has binomial distribution:
Standard Probability Forms
If , then:
- Exactly successes:
- At least one success:
- At most one success:
PYQ-Style Example 1
One or more of the first three throws is Each throw is independent, and So $\qquad P(\text{at least one }4\text{ in first three throws}) =1-\left(\dfrac{5}{6}\right)^3$ $\qquad =1-\dfrac{125}{216} =\dfrac{91}{216}$ ---PYQ-Style Example 2
Exactly two of the last four throws are divisible by A die outcome is divisible by if it is or , so Let be the number of such throws among the last four. Then Hence $\qquad =6\cdot \dfrac{1}{9}\cdot \dfrac{4}{9} =\dfrac{24}{81} =\dfrac{8}{27}$ ---Constructing a Distribution from an Experiment
To build the distribution of a random variable:
- identify all possible values of the variable
- compute the probability of each value
- check that all probabilities are nonnegative
- check that their sum is
Common Mistakes
- β Forgetting that total probability must add to
- β Using binomial formula without independence
- β Mixing up βat least oneβ with βexactly oneβ
- β Ignoring the support of the random variable
- β Treating expectation as always one of the possible values
CMI Strategy
- First decide whether the question is about a pmf, cdf, or event probability.
- If a formula is given, check nonnegativity and total sum .
- For repeated trials, test whether the setup is Bernoulli/binomial.
- For βone or moreβ, use complement before expanding anything.
- For βexactly β, use the binomial coefficient carefully.
- Write the final answer in exact form whenever possible.
Practice Questions
:::question type="MCQ" question="Which of the following can be a valid probability mass function on the set ?" options=["","","",""] answer="B" hint="Check nonnegativity and whether the probabilities add to ." solution="A valid pmf must have all probabilities nonnegative and total sum . Option A sums to , so it is invalid. Option B has nonnegative probabilities and so it is valid. Option C has a negative probability. Option D sums to , so it is invalid. Hence the correct option is ." ::: :::question type="NAT" question="A fair die is thrown times. Find the probability that appears in one or more of these throws." answer="\\dfrac{91}{216}" hint="Use the complement event 'no throw shows '." solution="The probability that a single throw does not show is So the probability that none of the three throws shows is Therefore the probability that appears in one or more throws is Hence the answer is ." ::: :::question type="MSQ" question="Which of the following statements are true for a discrete random variable with pmf ?" options=[" for every possible value of ",""," is nondecreasing"," must always be an integer"] answer="A,B,C" hint="Recall the basic definitions of pmf, cdf, and expectation." solution="1. True.Summary
- A valid discrete distribution must have nonnegative probabilities summing to .
- The pmf is , while the cdf is .
- Bernoulli distribution models one success-failure trial.
- Binomial distribution models the number of successes in repeated independent identical trials.
- βAt least oneβ is often easiest by complement.
- Exact algebraic probability expressions are preferred over decimal approximations.
---
Proceeding to Expectation.
---
Part 2: Expectation
Expectation
Overview
Expectation is the average or mean value of a random variable in the long run. In school-level and olympiad-style probability, expectation is one of the most powerful summary quantities because it often turns a complicated random process into a simple weighted sum. In exam problems, the main difficulty is modeling the random variable correctly before applying the formula. ---Learning Objectives
After studying this topic, you will be able to:
- Compute expectation for a discrete random variable.
- Build a random variable from a probability problem.
- Use linearity of expectation.
- Interpret expectation even when the value itself need not be achievable.
- Solve basic modelling questions involving expected counts and gains.
Core Idea
If a discrete random variable takes values
with probabilities
,
then the expectation of is
Expectation is the weighted average of possible values of the random variable.
Probability Distribution Conditions
If takes values with probabilities , then:
Linearity of Expectation
For random variables and ,
and for a constant ,
Linearity allows us to compute expectations of complicated quantities by breaking them into simpler parts.
Expectation of Common Variables
- If is Bernoulli with success probability , then
- If a fair die is rolled and is the face value, then
Minimal Worked Examples
Example 1 A fair die is rolled. Let be the outcome. Then --- Example 2 A random variable takes values with probabilities Then So ::: ---Modelling Expectation
In many questions, the main step is deciding what represents.
Examples:
- number of heads in repeated tosses
- score obtained in a game
- number of defective items chosen
- profit or loss in a scheme
Expected Value Need Not Be a Possible Outcome
Expectation need not be one of the actual values taken by the random variable.
For example, in one roll of a fair die:
but the die never actually shows .
Indicator Variable Idea
If is an indicator variable that is when an event occurs and otherwise, then
This is often used to count expected numbers of successes.
CMI Strategy
- Define the random variable clearly.
- List its possible values and probabilities.
- Check that probabilities sum to .
- Compute the weighted average.
- Use linearity when several pieces are involved.
Common Mistakes
- β Averaging values without using probabilities
- β Forgetting that probabilities must sum to
- β Confusing expectation with most likely value
- β Assuming expectation must be one of the actual outcomes
Practice Questions
:::question type="MCQ" question="If a fair die is rolled once and is the outcome, then is" options=["","","",""] answer="B" hint="Use the average of all six equally likely values." solution="We have Hence the correct option is ." ::: :::question type="NAT" question="A random variable takes values with probabilities respectively. Find ." answer="5/3" hint="Compute the weighted average." solution="We compute So the answer is ." ::: :::question type="MSQ" question="Which of the following are true?" options=["Expectation is a weighted average","Expectation of a die roll need not be an integer","Probabilities in a distribution must sum to ","Expectation is always the most likely outcome"] answer="A,B,C" hint="Check definition and interpretation carefully." solution="1. True. 2. True, for example a fair die has expectation . 3. True. 4. False. Expectation need not be the most likely value. Hence the correct answer is ." ::: :::question type="SUB" question="A fair coin is tossed three times. Let be the number of heads obtained. Find ." answer="" hint="List the binomial probabilities or use linearity of expectation." solution="Let be the number of heads in three tosses. Using linearity of expectation, write where if the th toss is a head and otherwise. For each toss, So Hence the expected number of heads is ." ::: ---Summary
- Expectation is the weighted average of a random variable.
- A correct model comes before the formula.
- Linearity of expectation is one of the most useful tools in probability.
- Expectation need not be a possible outcome.
- Expected counts are often computed using indicator variables.
---
Proceeding to Variance at school level.
---
Part 3: Variance at school level
Variance at School Level
Overview
Variance measures how spread out a random variable is around its mean. At school level, the important goals are to compute variance correctly, understand its meaning, and use the shortcut formula In exam problems, the main trap is arithmetic: students often compute the mean correctly but forget to square it or mishandle the second moment. ---Learning Objectives
After studying this topic, you will be able to:
- Define variance of a discrete random variable.
- Compute variance directly or using the shortcut formula.
- Find standard deviation from variance.
- Interpret variance as spread.
- Avoid common errors in mean-and-square calculations.
Core Idea
If is a random variable with mean
,
then the variance of is
Shortcut Formula
A very important identity is
Standard Deviation
The standard deviation of is
Basic Properties
- for a constant random variable
- Small variance means values are tightly clustered near the mean
Minimal Worked Examples
Example 1 Suppose takes values and with probabilities each. Then Also, So --- Example 2 A fair die is rolled and is the outcome. We know Now Hence ---Interpretation
Variance does not tell you where the centre is; expectation already does that.
Variance tells you how much the values typically fluctuate around the mean.
Direct vs Shortcut Computation
You may compute variance directly from
or by the shortcut
The shortcut is usually faster in exam problems.
Common Mistakes
- β Using
- β Forgetting to square the mean
- β Computing as
- β Forgetting that variance cannot be negative
CMI Strategy
- Find carefully.
- Find separately.
- Use
- Simplify only after both pieces are correct.
- If asked, take square root at the end for standard deviation.
Practice Questions
:::question type="MCQ" question="Which of the following is equal to ?" options=["","","",""] answer="B" hint="Recall the shortcut formula." solution="The standard identity is Hence the correct option is ." ::: :::question type="NAT" question="A random variable takes values and with probabilities each. Find ." answer="1" hint="Compute and ." solution="We have Also, Hence So the answer is ." ::: :::question type="MSQ" question="Which of the following are always true?" options=["Variance is never negative","A constant random variable has variance ","Standard deviation is the square root of variance","Variance always equals expectation"] answer="A,B,C" hint="Recall the definitions." solution="1. True. 2. True. 3. True. 4. False. Variance and expectation measure different things. Hence the correct answer is ." ::: :::question type="SUB" question="A fair coin is tossed twice. Let be the number of heads obtained. Find and ." answer="" hint="Use the distribution of ." solution="For two fair tosses, the random variable takes values with probabilities So Also, Hence Therefore ." ::: ---Summary
- Variance measures spread around the mean.
- The shortcut formula is
- Variance is always nonnegative.
- Standard deviation is the square root of variance.
- Careful computation of and is the core skill.
---
Proceeding to Simple modelling problems.
---
Part 4: Simple modelling problems
Simple Modelling Problems
Overview
Simple modelling problems in probability ask you to convert a real situation into a random variable, event structure, or probability distribution. The mathematics is usually not hard once the model is correct β the real challenge is identifying what is random, what the possible outcomes are, and what assumptions are being made. ---Learning Objectives
After studying this topic, you will be able to:
- Translate a verbal situation into a probability model.
- Identify outcomes, events, and random variables clearly.
- Compute probabilities and expectations in basic applied settings.
- Distinguish between model assumptions and conclusions.
- Check whether a model is realistic and internally consistent.
Core Idea
A probability model consists of:
- a sample space,
- a rule assigning probabilities,
- the event or random variable of interest.
Standard Questions in Modelling
When reading a modelling problem, ask:
- What are the possible outcomes?
- Are the outcomes equally likely?
- What is the random variable?
- What probability or expectation is required?
- Are there hidden assumptions such as independence?
Common Modelling Situations
- Tosses of coins
- Rolls of dice
- Drawing cards or balls
- Defective / non-defective items
- Success-failure trials
- Gain-loss games
Choosing the Right Random Variable
If the question asks for:
- number of successes, define that count
- total score, define score sum
- gain or loss, define profit variable
- waiting time, define number of trials
Minimal Worked Examples
Example 1 A fair coin is tossed twice. Let be the number of heads. The sample space is So takes values:- for
- for
- for
- points for a head
- points for a tail
Independence in Modelling
If repeated trials are described as fair and separate, independence is usually intended.
Examples:
- repeated coin tosses
- repeated fair die rolls
But in sampling without replacement, outcomes are not independent.
Common Modelling Errors
- β Assuming equally likely outcomes when they are not
- β Defining the wrong random variable
- β Forgetting restrictions such as βwithout replacementβ
- β Mixing up event probability with expected value
CMI Strategy
- Write the experiment first.
- Write the sample space or value table.
- Define the random variable precisely.
- Compute probabilities before jumping to expectation or variance.
- Check whether independence or replacement is involved.
Practice Questions
:::question type="MCQ" question="A fair coin is tossed once. Let be the score, where for Head and for Tail. Then is" options=["","","",""] answer="B" hint="This is a Bernoulli model with success probability ." solution="The score is with probability and with probability . Therefore So the correct option is ." ::: :::question type="NAT" question="A fair die is rolled once. Let be the indicator of the event 'the outcome is even'. Find ." answer="1/2" hint="For an indicator variable, expectation equals the probability of the event." solution="The event 'even' has probability Since is the indicator of that event, Hence the answer is ." ::: :::question type="MSQ" question="Which of the following are important first steps in a modelling problem?" options=["Identify the random experiment","Define the random variable clearly","Decide whether outcomes are equally likely","Assume all events are independent without checking"] answer="A,B,C" hint="Think about model-building, not guesswork." solution="1. True. 2. True. 3. True. 4. False. Independence must be justified, not assumed automatically. Hence the correct answer is ." ::: :::question type="SUB" question="A fair coin is tossed three times. Let be the number of tails obtained. Construct the probability distribution of ." answer="" hint="Count how many sequences have exactly tails." solution="For three fair tosses, there are equally likely outcomes. Let be the number of tails.- only for , so
- for , so
- for , so
- only for , so
Summary
- A good probability model starts with the experiment and sample space.
- The random variable must match the quantity asked.
- Expectation and variance come only after the model is set correctly.
- Independence and equal likelihood must not be assumed blindly.
- Many modelling problems are easy once the setup is clean.
---
Chapter Summary
- A Discrete Random Variable (DRV) takes on a finite or countably infinite number of distinct values. Its behaviour is described by a Probability Mass Function (PMF) , which satisfies for all and .
- The Expectation (or mean) of a DRV , denoted , is the weighted average of its possible values: .
- Expectation is a linear operator: for constants , .
- The Variance of a DRV , denoted , quantifies the spread of its distribution around its mean: .
- For constants , the variance property is . The standard deviation is .
- Simple modelling problems involve defining a DRV based on a real-world scenario, constructing its PMF, and then using the PMF to calculate probabilities, expectation, and variance, interpreting these values in context.
- A thorough understanding of these concepts is fundamental for analysing discrete data and forms the bedrock for more advanced topics in probability and statistics.
---
Chapter Review Questions
:::question type="MCQ" question="A discrete random variable has the following probability mass function:
What is the probability that is an even number?" options=["","","",""] answer="3/5" hint="First, determine the value of using the property that the sum of all probabilities must equal 1. Then, identify the outcomes for which is even and sum their probabilities." solution="The sum of all probabilities must equal 1:
The probability that is an even number is :
:::
:::question type="NAT" question="A game involves rolling a fair four-sided die (numbered 1, 2, 3, 4). You win points equal to the number rolled, unless you roll a 4, in which case you lose 5 points. What is the expected number of points you will win?" answer="0.25" hint="Define a random variable for the points won. List all possible outcomes and their corresponding probabilities and points. Calculate the expectation using the formula ." solution="Let be the random variable representing the points won.
The possible outcomes for the die roll are 1, 2, 3, 4, each with a probability of .
The corresponding points are:
If die roll is 1, points .
If die roll is 2, points .
If die roll is 3, points .
If die roll is 4, points .
The expectation is:
:::
:::question type="MCQ" question="A random variable has and . What is ?" options=["","","",""] answer="16" hint="Recall the properties of variance for linear transformations: ." solution="Using the property of variance, .
In this case, and .
:::
:::question type="NAT" question="A box contains 3 red balls and 2 blue balls. Two balls are drawn randomly without replacement. Let be the number of red balls drawn. Calculate ." answer="1.2" hint="First, determine the possible values for and their probabilities (PMF). Then, use the expectation formula." solution="Let be the number of red balls drawn. The possible values for are 0, 1, or 2.
Total number of balls = 5. Number of ways to draw 2 balls from 5 is .
: No red balls, meaning 2 blue balls are drawn.
: One red ball and one blue ball are drawn.
: Two red balls are drawn.
Check: . The PMF is correct.
Now, calculate the expectation :
:::
---
What's Next?
This chapter has laid the essential groundwork for understanding discrete random variables, their distributions, and key summary statistics like expectation and variance. Building on this foundation, your CMI journey will next delve into Continuous Random Variables, where outcomes can take any value within an interval, necessitating the use of probability density functions and integral calculus. You will also encounter specific distributions like the Binomial, Poisson, and Geometric, which are vital for modelling various real-world phenomena involving discrete counts or trials. A robust understanding of discrete variables is indispensable for mastering more advanced topics in probability theory and statistical inference.