Statistics – Measures of Central Tendency (Cambridge IGCSE Mathematics 0580)
This note follows the 2025‑2027 Cambridge IGCSE Mathematics (0580) syllabus.
It covers the core statistics content (C9.1–C9.5) and the optional extended content (C9.6–C9.7) that may appear in the Core and Extended papers.
Qualitative (categorical) data – non‑numerical attributes (e.g. favourite colour, type of transport).
Quantitative data – numerical values that can be counted or measured.
Discrete – countable whole numbers (e.g. number of books read).
Continuous – can take any value within a range (e.g. height, time).
Population vs. sample – the whole set of observations vs. a subset used for analysis.
Class interval – a range of continuous values grouped together (e.g. 150 – 159 cm).
It is introduced here so learners see the link to later sections on grouped data.
Frequency table – lists each distinct value (or class interval) and its frequency (f).
Example: Colours of cars in a car‑park (raw data → frequency table).
Car colour
Frequency (f)
Red
4
Blue
7
Black
5
White
2
Other
2
2. Measures of Central Tendency (C9.2)
2.1 Mean (arithmetic average)
The mean is the sum of all observations divided by the number of observations.
Q1. A data set has 15 values. Which position gives the median? Answer: Position \((15+1)/2 = 8\).
Q2. For an even‑sized data set of 20 values, the median is the average of which positions? Answer: Positions 10 and 11.
3. Bounds of Accuracy, Relative & Expected Frequencies (C9.2 (2‑3))
3.1 Bounds of accuracy
For a discrete value x the lower bound is x − 0.5, the upper bound is x + 0.5.
For a class interval the lower bound = lower class limit − 0.5, upper bound = upper class limit + 0.5.
When calculating a grouped mean, use the class midpoint (average of the lower and upper bounds) as the representative value.
Numeric example (bounds → grouped mean)
Class (cm)
Lower limit
Upper limit
Lower bound
Upper bound
Mid‑point
f
140‑149
140
149
139.5
149.5
144.5
4
150‑159
150
159
149.5
159.5
154.5
8
160‑169
160
169
159.5
169.5
164.5
12
Mean calculation (as in Section 2.5) uses the mid‑points 144.5, 154.5, 164.5.
3.2 Relative frequency
Relative frequency = \(\displaystyle \frac{f}{\sum f}\).
It may be expressed as a fraction, decimal, or percentage and shows the proportion of the total that each class/value represents.
3.3 Expected frequency
Only required when a theoretical distribution is supplied (e.g., “if the data were uniformly distributed”).
Formula:
\[
E = N \times p_{\text{theoretical}}
\]
where \(N\) = total observations, \(p_{\text{theoretical}}\) = expected proportion for the class.
Use the expected frequency to compare with the observed frequency – a useful check for “reasonable” data.
4. Grouped Data – Class Intervals (C9.3)
4.1 Example data set (heights of 30 students)
Class interval (cm)
Frequency (f)
140 – 149
4
150 – 159
8
160 – 169
12
170 – 179
5
180 – 189
1
4.2 Step‑by‑step calculations
Find the midpoint of each class (used as \(x\)):
140 – 149 → 144.5
150 – 159 → 154.5
160 – 169 → 164.5
170 – 179 → 174.5
180 – 189 → 184.5
Compute \(\sum (x\cdot f)\) and \(\sum f\):
\[
\begin{aligned}
\sum (x\cdot f) &=144.5(4)+154.5(8)+164.5(12)+174.5(5)+184.5(1)=4\,905\\
\sum f &=30
\end{aligned}
\]
Mean
\[
\displaystyle \bar{x}= \frac{4\,905}{30}=163.5\text{ cm}
\]
Median – locate the \(\frac{n}{2}=15^{\text{th}}\) observation using cumulative frequencies:
Cumulative frequencies: 4, 12, 24, 29, 30.
The 15th lies in the 160 – 169 class.
Grouped‑median formula:
\[
\text{Median}=L+\left(\frac{\frac{n}{2}-F}{f}\right)C
\]
where \(L=160\) (lower bound), \(F=12\) (cumulative freq before the median class), \(f=12\) (freq of median class), \(C=10\) (class width).
\[
\text{Median}=160+\left(\frac{15-12}{12}\right)10=162.5\text{ cm}
\]
Bar width = class width; height = frequency (or relative frequency) per unit width.
Use the lower and upper bounds from the class intervals.
5.6 Summary of key differences
Chart type
Data type
Bars touch?
Typical use
Bar chart
Qualitative / discrete
No
Compare categories
Histogram
Grouped continuous
Yes
Show distribution of intervals
Pie chart
Proportional
N/A
Show parts of a whole
6. Scatter Diagrams, Correlation & Line of Best Fit (C9.5)
6.1 Scatter diagram
Plots ordered pairs \((x, y)\) as points on a Cartesian plane.
Reveals the type of relationship:
Positive correlation – points rise from left to right.
Negative correlation – points fall from left to right.
No correlation – points appear random.
6.2 Drawing a straight line of best fit (by eye)
Identify two points that appear to lie near the centre of the cloud (often the extremes).
Draw a straight line through them; adjust so roughly equal numbers of points lie above and below.
Label the line “Best fit” or write its equation \(y = mx + c\) if you wish to estimate gradient and intercept.
6.3 Using the line of best fit
Estimate a missing or future value by reading the corresponding \(y\) (or \(x\)) on the line.
State the estimate with appropriate units and note that it is an approximation.
6.4 Example – Study time vs. test score
Student
Study time (h)
Score (out of 100)
1
2
58
2
3
65
3
4
71
4
5
78
5
6
84
6
7
88
7
8
92
8
9
95
When plotted, the points form an upward‑sloping cloud. A straight line drawn through the centre gives an approximate equation \(y \approx 5.5x + 48\).
Using the line, a student who studies 5.5 h would be expected to score about \(5.5(5.5)+48 \approx 75\) marks.
6.5 Short exercise (try it yourself)
Data: Number of books read (x) vs. reading enjoyment rating (y, out of 10) for 6 pupils.
Pupil
Books (x)
Rating (y)
1
1
4
2
2
5
3
3
6
4
4
7
5
5
8
6
6
9
Task: Sketch a scatter diagram, draw a line of best fit, and estimate the enjoyment rating for a pupil who reads 7 books.
Solution outline: Points lie on a straight line; the line of best fit is essentially \(y = x + 3\). For 7 books, estimated rating ≈ 10 (capped at 10).
Your generous donation helps us continue providing free Cambridge IGCSE & A-Level resources,
past papers, syllabus notes, revision questions, and high-quality online tutoring to students across Kenya.