Data Storage, Compression & the Wider Digital Context – Cambridge IGCSE 0478 (Topic 1.3)
Learning Objectives
AO1: Explain the purpose of data compression and why it is required in modern computing.
AO2: Convert between binary, hexadecimal and decimal; represent signed numbers using two’s‑complement; calculate file sizes for text, image and sound data.
AO3: Describe the basic hardware and software components that manipulate data, the principles of data transmission, and the role of the Internet, security and digital currency.
1. Data Representation
1.1 Number systems
Binary (base‑2) – each digit is a bit (0 or 1).
Hexadecimal (base‑16) – digits 0‑9 and A‑F; useful for compactly writing binary values.
Conversion shortcuts:
Binary → Hex: group bits in fours (e.g., 1011 1100 = BC16).
Hex → Decimal: multiply each digit by 16ⁿ (e.g., 3A16 = 3·16¹ + 10·16⁰ = 5810).
1.2 Signed numbers – two’s‑complement
Step
Method (8‑bit example)
Positive value
Write the binary magnitude (e.g., 13 = 00001101).
Negative value
Invert all bits and add 1 (‑13 → 11110010 + 1 = 11110011).
Range
‑128 to +127 for 8‑bit.
1.3 Text, image & sound representation
Text – ASCII (7‑bit) or extended ASCII (8‑bit). One byte per character.
Images – colour depth = bits per pixel.
Monochrome: 1 bpp
256‑colour: 8 bpp
True‑colour (24‑bit): 8 bits each for Red, Green, Blue.
Sound (PCM) – sample rate × bit depth × number of channels.
CD quality: 44.1 kHz, 16‑bit, stereo.
1.4 File‑size calculations (binary units)
All exam calculations use binary prefixes: 1 KiB = 1024 bytes, 1 MiB = 1024 KiB.
Run‑Length Encoding (RLE) – Lossless
Replaces a run of identical symbols with a single symbol and a count. Example: AAAAA → A5.
Discrete Cosine Transform (DCT) – Lossy
Used in JPEG images and MPEG video. Converts blocks of pixels to the frequency domain and discards high‑frequency components that are less noticeable to the human eye.
3.2 Optional/advanced algorithms
Algorithm
Category
Key idea
Typical use
Huffman coding
Lossless
Shorter binary codes for more frequent symbols
Text files, PNG images
Lempel‑Ziv‑Welch (LZW)
Lossless
Dictionary of repeated patterns built on‑the‑fly
GIF, ZIP archives
Transform coding (e.g., MP3)
Lossy
Convert audio to frequency components and remove those below hearing thresholds
Exam tip: No calculators are allowed in the written paper. Perform division and subtraction by hand and round only in the final answer if a whole‑number percentage is required.
4.1 Example (Lossless)
File before compression: 500 KiB; after RLE: 200 KiB.
Memory management – allocates RAM for compression algorithms.
Process scheduling – allows compression utilities to run alongside other tasks.
7.3 Interrupts
Hardware signals that temporarily halt the CPU’s current instruction stream to service a higher‑priority event (e.g., I/O completion, timer). After handling, the CPU resumes the interrupted task.
7.4 Language levels & development tools
Machine language – binary instructions executed directly by the CPU.
Assembly language – mnemonic representation of machine instructions; one‑to‑one mapping.
High‑level languages – C, Java, Python; easier for humans, require translation.
Compilers vs. interpreters
Compiler translates source code to machine code before execution (e.g., C → .exe).
Interpreter executes source line‑by‑line at run‑time (e.g., Python).
IDE (Integrated Development Environment) – combines editor, compiler/interpreter, debugger, and project management (e.g., Eclipse, Visual Studio).
8. The Internet & Its Uses (AO3)
8.1 WWW vs. Internet
Internet – global network of interconnected computers using TCP/IP.
World Wide Web (WWW) – a service on the Internet that uses HTTP/HTTPS to exchange hyper‑text documents.
Your generous donation helps us continue providing free Cambridge IGCSE & A-Level resources,
past papers, syllabus notes, revision questions, and high-quality online tutoring to students across Kenya.