Cambridge Notes, Past Papers, Revision Questions

5.2 Language Translators

Learning Objective

Show understanding of why an interpreter is required for the translation and execution of a high‑level language program, how it relates to other translators (assembler, compiler) and to development tools such as an IDE, and how these concepts connect to the wider Cambridge IGCSE/A‑Level syllabus.

1. Why Translation Is Needed

High‑level languages (e.g., Python, JavaScript, Ruby) are written for human readability.

The CPU can only execute machine code – binary instructions that are specific to a processor architecture.

Therefore a translation step is mandatory to convert the programmer’s intent into a form the hardware can execute.

2. Types of Language Translators

2.1 Assembler

Purpose: Translates assembly language (a symbolic representation of machine instructions) into machine code.

Typical use‑cases: Low‑level system software, embedded systems, device drivers where direct hardware control or maximum speed is required.

Link to register‑transfer notation (syllabus 4.2): Each assembly instruction can be expressed as a register‑transfer operation, e.g. LDA #10 ⇢ IR ← #10; ACC ← IR.

Simple example (generic 8‑bit CPU):


; myprog.asm
LDA #10        ; load constant 10 into the accumulator
STA 0x20       ; store accumulator at memory address 0x20
HLT            ; halt the CPU


; myprog.bin (machine code)
00110000 00001010   ; 0x30 0x0A  (LDA #10)
00100000 00100000   ; 0x20 0x20  (STA 0x20)
11111111            ; 0xFF      (HLT)

2.2 Compiler

Purpose: Translates an entire high‑level program into machine code (or an object file) before the program runs.

When it is preferred: Performance‑critical applications, system software, or when the target platform is known in advance.

Typical pipeline: Source → lexical analysis → parsing → optimisation → code generation → object file → linking → executable.

2.3 Interpreter

Purpose: Translates and executes a program line‑by‑line at run‑time.

When it is needed:
- Rapid development and testing – immediate feedback after each statement.
- Platform independence – the same source runs on any system that provides a compatible interpreter.
- Dynamic language features (dynamic typing, reflection, run‑time code generation).
- Educational environments – hides the compilation step so learners can concentrate on algorithms.
- Security‑focused execution – interpreters can sandbox code, limiting access to system resources.

2.4 Partial‑Compilation / Mixed‑Mode Languages (e.g., Java)

Step 1 – Compilation: javac translates Java source into byte‑code (a platform‑independent intermediate representation).

Step 2 – Execution: The Java Virtual Machine (JVM) either
- interprets the byte‑code directly, or
- uses a Just‑In‑Time (JIT) compiler to translate frequently‑executed “hot‑spot” byte‑code into native machine code.

This hybrid approach gives the portability of an interpreter and the speed of native code where it matters most.

3. How an Interpreter Works

The interpreter repeats the following cycle for each statement or expression:

Fetch: Read the next line (or token stream) from the source.

Parse: Check syntax and build an internal representation – usually an Abstract Syntax Tree (AST).

Translate (optional): Convert the AST into an intermediate form such as byte‑code or a stack‑based instruction set.

Execute: Perform the operation directly or via a virtual machine.

Example – Python REPL session:


>>> x = 5          # fetch & parse
>>> y = x * 2      # parse, evaluate, store
>>> print(y)       # execute built‑in function
10

Diagram placeholder: Insert a flow‑chart showing “Fetch → Parse → Translate → Execute” parallel to the compiler pipeline “Source → Compile → Object → Link → Executable → Run”.

4. Formal Language Description (A‑Level requirement)

4.1 Backus‑Naur Form (BNF)

A concise way to describe the syntax of a programming language.


<assignment> ::= <identifier> "=" <expression>
<expression> ::= <term> { ("+" | "-") <term> }
<term>       ::= <factor> { ("*" | "/") <factor> }
<factor>     ::= <number> | <identifier> | "(" <expression> ")"

4.2 Reverse Polish Notation (RPN)

Many interpreters (e.g., for calculators) evaluate expressions using a stack‑based RPN representation.

Infix: 3 + 4 * 5

RPN: 3 4 5 * +

Evaluation proceeds by pushing operands onto a stack and applying operators as they appear.

5. Benefits & Drawbacks – Compiler vs. Interpreter

Aspect	Compiler	Interpreter
Translation time	Whole program translated before execution (offline)	Translation occurs during execution (online)
Execution speed	Fast – native machine code runs directly on the CPU	Slower – each statement incurs translation overhead
Portability	Needs recompilation for each target platform	Source runs on any platform with a suitable interpreter
Debugging	Errors reported after compilation; mapping to source lines can be indirect	Errors reported at the exact point of execution; natural step‑by‑step debugging
Memory usage	Executable contains only machine code (compact)	Interpreter + intermediate representation must stay in memory
Security	Compiled code runs with full OS privileges unless sandboxed externally	Interpreter can enforce a sandbox, restricting file‑system, network, or OS calls
Typical use cases	System software, games, high‑performance scientific code	Scripting, rapid prototyping, teaching, web‑page scripting

6. Performance Illustration

Assume a program contains n = 1 000 statements.

Compiled version: each statement executes in 1 µs.

Interpreted version: an additional k = 5 µs is spent on translate‑and‑execute per statement.

Version	Total time (µs)
Compiled	1 000 × 1 = 1 000 µs
Interpreted	1 000 × (1 + 5) = 6 000 µs

Both algorithms are O(n), but the interpreter is six times slower because of the per‑statement translation cost.

Extension for AO2 (algorithm analysis): Students should be able to discuss how the overhead changes for more complex algorithms, e.g., a compiled quick‑sort (O(n log n)) versus an interpreted quick‑sort with the same asymptotic complexity but a larger constant factor.

7. Features of a Typical Integrated Development Environment (IDE)

Syntax checking & highlighting – catches many errors before execution (AO1).

Code completion (auto‑complete) – speeds up coding and reduces typographical mistakes.

Integrated debugger – set breakpoints, step through code, inspect variables (AO2).

Project management – organise multiple files, libraries, and build configurations.

Run/Debug console – executes the program via the appropriate translator (compiler or interpreter) and shows output.

Version‑control integration – commit, branch, and merge code directly from the IDE.

Terminal / REPL window – especially useful for interpreted languages (e.g., Python, Ruby).

8. Cross‑Reference to Other Syllabus Sections

5.1 System Software: Translators are a core component of the operating system’s toolchain.

4.2 Data Representation & Register Transfer: The assembler example demonstrates how symbolic instructions map to register‑transfer operations.

9 Algorithm Design: Understanding translation overhead helps students evaluate algorithm efficiency (AO2).

12 Security: Interpreters can provide sandboxed execution environments, an important security consideration.

9. Assessment Objective (AO) Alignment

AO1 – Knowledge: Definitions of assembler, compiler, interpreter, mixed‑mode language, BNF, RPN, and IDE features.

AO2 – Analysis: Compare benefits & drawbacks of each translator type; discuss performance implications and security aspects.

AO3 – Design/Implementation: Choose an appropriate translation strategy for a given problem (e.g., decide between an interpreter and a compiler) and justify the choice using the criteria above.

10. Key Take‑aways

Translators bridge the gap between human‑readable code and machine‑readable instructions.

Assembler: Converts assembly language to machine code; each instruction can be expressed as a register‑transfer operation.

Compiler: Translates an entire high‑level program ahead of time, giving fast execution but requiring recompilation for each platform.

Interpreter: Translates and runs code line‑by‑line, offering rapid feedback, platform independence, dynamic language features, and the possibility of sandboxed execution.

Languages such as Java use a mixed‑mode approach (byte‑code + JIT) to combine portability with speed.

Formal descriptions (BNF) and evaluation techniques (RPN) are part of the A‑Level expectation for describing language syntax and execution.

Modern IDEs provide integrated tools that support writing, testing, debugging, and version control, regardless of whether the underlying translation method is compilation or interpretation.

Show understanding of the need for: an interpreter for translation and execution of a high-level language program

5.2 Language Translators

Learning Objective

1. Why Translation Is Needed

2. Types of Language Translators

2.1 Assembler

2.2 Compiler

2.3 Interpreter

2.4 Partial‑Compilation / Mixed‑Mode Languages (e.g., Java)

3. How an Interpreter Works

4. Formal Language Description (A‑Level requirement)

4.1 Backus‑Naur Form (BNF)

4.2 Reverse Polish Notation (RPN)

5. Benefits & Drawbacks – Compiler vs. Interpreter

6. Performance Illustration

7. Features of a Typical Integrated Development Environment (IDE)

8. Cross‑Reference to Other Syllabus Sections

9. Assessment Objective (AO) Alignment

10. Key Take‑aways