Data are raw, unprocessed facts – numbers, text, images, sounds etc. Information is data that has been organised, interpreted or presented so that it is meaningful to a user.
Sources of data
Direct – collected first‑hand (questionnaires, sensors, interviews, experiments).
Indirect – obtained from existing records or secondary sources (databases, reports, the Internet).
Advantages of using data
Supports objective decision‑making.
Provides a basis for trend analysis and forecasting.
Enables automation of routine tasks.
Disadvantages / risks
Data may be incomplete, inaccurate or biased.
Collection can be costly or time‑consuming.
Privacy, security and legal issues may arise.
1.2 Quality of Information – Factors (Cambridge AS/A Level IT 9626)
Cambridge states that “Factors that affect the quality of information include accuracy, relevance, age (timeliness), level of detail and completeness.” These five criteria must be applied to every data set before it is processed, analysed or reported.
Inter‑relationships
Low relevance can mask errors in accuracy because the data may never be used where the mistake matters.
Very detailed data (detail) that is out‑of‑date (age) may be less useful than a concise, recent summary.
Missing elements (completeness) often lead to misleading conclusions about accuracy and relevance.
Key Quality Criteria
Criterion
What to Look For
Impact of Poor Quality
Accuracy
Correctness of facts, figures and statements; matches original source.
Mis‑interpretation, faulty calculations, loss of trust.
Relevance
Degree to which information meets the current need or problem.
Wasted time, irrelevant conclusions.
Age (Timeliness)
How recent the information is; whether it reflects the latest data.
Out‑of‑date decisions, missed opportunities.
Detail (Granularity)
Level of specificity; sufficient depth for the task.
Over‑generalisation, inability to perform detailed analysis.
Completeness
All necessary parts are present; no critical gaps.
Partial answers, need for additional sources, biased outcomes.
Assessing Each Criterion
Accuracy
Cross‑check data against original records or trusted references.
Look for typographical errors, rounding mistakes, or inconsistent units.
Advantages: enables automatic control, improves safety and efficiency.
Disadvantages: complex design, must guarantee response time.
Comparison Table
Method
Typical Use‑case
Timing of Output
Resource Implications
Batch
End‑of‑month payroll
After the batch run (hours‑days later)
Can run on low‑priority servers; scheduled downtime acceptable.
Online
E‑commerce order entry
Immediately (seconds)
Requires high‑availability hardware and network.
Real‑time
Smart‑thermostat temperature control
Within milliseconds
Deterministic processing, often embedded hardware.
Suggested Flowchart (Batch → Online → Real‑time)
Flowchart of the evaluation process – start → check accuracy → check relevance → check age → check detail → check completeness → decision (accept / reject / request more data).
1.6 Algorithms (A‑Level only)
An algorithm is a step‑by‑step procedure for solving a problem or performing a computation.
Characteristics
Clear and unambiguous.
Finite – must terminate after a limited number of steps.
Input and output defined.
Effective – each step is simple enough to be carried out.
Representations
Flowcharts – use standard symbols (process, decision, input/output, connector).
Pseudocode – plain‑language description that mirrors programming logic.
Example: Simple sorting (bubble sort)
FOR i ← 1 TO n‑1
FOR j ← 1 TO n‑i
IF A[j] > A[j+1] THEN
SWAP A[j] AND A[j+1]
END IF
NEXT j
NEXT i
This algorithm repeatedly steps through a list, compares adjacent items and swaps them if they are in the wrong order. It illustrates iteration, comparison and swapping – core concepts examined in the syllabus.
Practical Example 1 – Market Survey
A team receives a CSV file containing customer‑satisfaction scores for a new smartphone.
Accuracy: Verify each score is between 1 and 5 and matches the original questionnaire.
Relevance: Confirm the survey targets the smartphone model under review, not a previous version.
Age: Note the collection date – data older than three months may not reflect recent firmware updates.
Detail: Ensure demographic fields (age, region, usage frequency) are present for segment analysis.
Completeness: Check that every respondent has a score; missing entries indicate incomplete data.
Practical Example 2 – Smart‑Home Sensor Data
A smart‑home system logs temperature, humidity and motion readings every minute. The data will be used to optimise heating schedules.
Accuracy: Compare a sample of sensor readings with a calibrated reference thermometer.
Relevance: Ensure the data covers the rooms that contain heating zones; readings from the garden are irrelevant.
Age: Use only the most recent week of data because seasonal changes affect heating needs.
Detail: Minute‑level granularity is required; hourly averages would be too coarse for fine‑tuned control.
Completeness: Look for gaps in the timestamp sequence – a missing minute may indicate a sensor outage.
Mini‑Exercise – Spot the Quality Issues
Below is a short data set (CSV format) that records weekly sales of three products.
Using the five quality criteria, answer the questions that follow.
Identify any accuracy problems.
Is the data relevant for analysing product performance over the quarter? Explain.
Comment on the age of the data if today is week 5.
Does the data provide sufficient detail for a profit‑margin calculation? Why or why not?
List the completeness issues and suggest how they could be resolved.
Quick‑Evaluation Checklist
Criterion
Questions to Ask
Evidence Needed
Accuracy
Is the data correct and free from errors?
Source documents, verification logs.
Relevance
Does it answer the current question?
Problem statement, user requirements.
Age
When was it created or last updated?
Timestamp, version number.
Detail
Is the level of detail sufficient?
Data dictionary, field definitions.
Completeness
Are any required elements missing?
Requirement checklist, completeness report.
Summary
Evaluating information quality is a systematic, syllabus‑driven activity. By deliberately applying the five criteria—accuracy, relevance, age, detail and completeness—students can ensure that the data they process, analyse and report is fit for purpose. Mastery of these concepts supports the examination requirements (AO2 Apply, AO3 Evaluate) and prepares learners for real‑world IT practice, where poor‑quality data can compromise security, decision‑making and system reliability.
Your generous donation helps us continue providing free Cambridge IGCSE & A-Level resources,
past papers, syllabus notes, revision questions, and high-quality online tutoring to students across Kenya.