Know and understand characteristics and uses of file formats including css, csv, gif, htm, .jpg, .pdf, .png, .rtf, .txt, .zip, .rar

11. File Management – Characteristics and Uses of Common File Formats

This section fulfils all requirements of Cambridge IGCSE ICT (0417) – Section 11. It explains the purpose of generic file formats, the role of compression/archiving, provides a concise reference table, detailed notes on each format, practical file‑naming advice, cross‑references to other syllabus sections, and a checklist that maps each format to the exact syllabus points.

Why Generic File Formats Matter

  • Cross‑platform compatibility – a file created on Windows can be opened on macOS, Linux, tablets, or smartphones without loss of information.
  • Long‑term stability – formats such as PDF, PNG and CSV are widely supported and are unlikely to become obsolete.
  • Easy data exchange – generic formats enable smooth transfer of data between different applications (e.g., CSV ↔ spreadsheet ↔ database).
  • Reduces vendor lock‑in – using recognised standards ensures that files remain accessible even if the original software is no longer available.

Purpose of File Compression and Archiving

  • Reduce storage space – large collections occupy less disk space when compressed.
  • Speed up transfer – smaller files download or email faster (most school email systems limit attachments to 10 MB).
  • Preserve integrity – lossless compression (ZIP, RAR) keeps the original data unchanged.
  • Group related files – several files/folders can be bundled into a single archive for easier handling.
  • Security – many archive formats allow password‑based encryption, protecting confidential data during transfer.

Summary Table of the 11 Required Formats

ExtensionFile typeTypical use (syllabus example)Key characteristics required by the syllabus
.cssStyle sheetDefines visual presentation of HTML pages (Section 19 – Website authoring)

  • Plain‑text, human‑readable.
  • Rules cascade; can be linked, embedded or inline.
  • Separates design from content.

.csvComma‑separated valuesSimple tabular data for spreadsheets or databases (Section 18 – Data handling)

  • Plain‑text; each line = one record.
  • Fields separated by commas (or semicolons in some locales).
  • No formatting – only raw data.

.gifGraphics Interchange FormatSimple web graphics or short looping animations (Section 19)

  • Lossless LZW compression.
  • Maximum 256 colours.
  • Supports animation (multiple frames).
  • Binary (on/off) transparency only.

.htm / .htmlHyperText Markup LanguageWeb pages displayed in browsers (Section 19)

  • Plain‑text markup using tags (e.g., <h1>, <p>, <a>).
  • Can contain metadata (title, meta‑description, charset) in the <head> section.
  • Links to external CSS, JavaScript, images, video.
  • Browsers render the markup into a visual page.

.jpg / .jpegJoint Photographic Experts Group imagePhotographs and complex images on the web (Section 19)

  • Lossy compression – discards data that the eye is less likely to notice.
  • 24‑bit colour (≈16 million colours).
  • Adjustable compression level (higher = smaller file, lower quality).
  • No transparency or animation support.

.pdfPortable Document FormatRead‑only documents that must retain layout (Section 21 – Document production)

  • Embeds text, raster images, vector graphics, hyperlinks, and interactive forms.
  • Searchable; can be encrypted/password‑protected.
  • Preserves fonts, colours, page layout on any device.

.pngPortable Network GraphicsWeb graphics requiring lossless quality or transparency (Section 19)

  • Lossless compression – original image data unchanged.
  • Supports 8‑bit palette and 24‑bit true‑colour.
  • Alpha channel provides varying levels of transparency.

.rtfRich Text FormatCross‑platform documents with basic formatting (Section 21)

  • Plain‑text with formatting codes (bold, italics, fonts, colours, paragraph alignment).
  • Readable by Word, LibreOffice, Google Docs, and many other editors.
  • Does not support complex layout features such as automatic tables of contents or advanced styles.

.txtPlain textSimple notes, source code, configuration files (Section 8 – e‑Safety & data handling)

  • Contains only characters – no formatting codes.
  • Stored as a binary file whose contents are plain‑text characters.
  • Universal compatibility on all operating systems.
  • Smallest possible file size for textual information.

.zipCompressed archiveBundling multiple files/folders for storage or transfer (Section 8 – Evidence documents)

  • Lossless compression (Deflate algorithm).
  • Supports password‑based AES encryption.
  • Native support in Windows, macOS and most Linux distributions.
  • Extraction: double‑click (Windows/macOS) or right‑click → “Extract Here” (Linux); command‑line unzip archive.zip.

.rarCompressed archive (proprietary)Higher‑ratio compression for large multimedia collections (Section 8)

  • Lossless compression; often achieves better ratios than ZIP, especially for video/audio.
  • Can split archives into volumes (e.g., part1.rar, part2.rar).
  • Proprietary encryption (password‑protected; algorithm not publicly documented).
  • Requires WinRAR, 7‑Zip with RAR plugin, or other compatible software to create/extract.

Detailed Characteristics of Each Format

1. Cascading Style Sheets – .css

  • Plain‑text file containing rule‑sets: selector { property: value; }.
  • Can be linked externally (<link rel="stylesheet">), embedded in <style> tags, or added inline.
  • Facilitates site‑wide design changes – one file can style many HTML pages.

2. Comma‑Separated Values – .csv

  • Each line = one record; fields separated by commas (or semicolons for locales that use commas as decimal separators).
  • Ideal for importing/exporting data between Excel, Google Sheets, and database programs.
  • No formatting – only raw data; can be opened in any text editor.

3. Graphics Interchange Format – .gif

  • Lossless LZW compression; limited to 256 colours.
  • Supports animation – multiple frames stored sequentially and played back by browsers.
  • Binary (on/off) transparency only – a pixel is either fully transparent or fully opaque.
  • Best for simple icons, logos and short looping animations.

4. HyperText Markup Language – .htm / .html

  • Markup language that structures web content with tags.
  • Contains a <head> section for metadata (title, charset, viewport, description, keywords).
  • Can link to external resources: CSS (link), JavaScript (script), images (img).
  • Browsers interpret the markup to render a visual page.

5. JPEG – .jpg / .jpeg

  • Lossy compression – discards data that is less noticeable to the human eye.
  • Suitable for photographs where smooth colour gradients are important.
  • Compression level is adjustable; higher compression = smaller file but lower visual quality.
  • No support for transparency or animation.

6. Portable Document Format – .pdf

  • Preserves exact layout, fonts, colours, and vector graphics on any device.
  • Can embed text, raster images, vector graphics, hyperlinks, and interactive forms (text fields, check boxes).
  • Searchable; can be secured with passwords and permissions (e.g., prevent printing or editing).
  • Widely used for manuals, e‑books, official reports and exam papers.

7. Portable Network Graphics – .png

  • Lossless compression – original image data is retained.
  • Supports both 8‑bit palette (256 colours) and 24‑bit true‑colour modes.
  • Alpha channel provides varying levels of transparency, ideal for overlays and logos.
  • File size larger than JPEG for photographs but smaller for graphics with large uniform areas.

8. Rich Text Format – .rtf

  • Encodes text with simple formatting (bold, italics, fonts, colours, paragraph alignment).
  • Cross‑platform – readable by Microsoft Word, LibreOffice, Google Docs, and many other editors.
  • Does not support advanced layout features such as automatic tables of contents, footnotes, or complex styles.

9. Plain Text – .txt

  • Contains only characters; no formatting codes.
  • Stored as a binary file whose content is plain‑text characters (ASCII or Unicode).
  • Universal compatibility – can be opened on any operating system with any text editor.
  • Smallest possible file size for textual information.

10. ZIP Archive – .zip

  • Combines many files/folders into a single container.
  • Lossless compression (Deflate) keeps original data unchanged.
  • Supports password‑based AES encryption for data protection.
  • Native extraction: double‑click (Windows/macOS) or right‑click → “Extract Here” (Linux); command‑line unzip archive.zip.
  • Widely supported – no additional software required on most platforms.

11. RAR Archive – .rar

  • Proprietary format developed by Eugene Roshal.
  • Often achieves higher compression ratios than ZIP, especially for multimedia files.
  • Can split large archives into smaller volumes (e.g., part1.rar, part2.rar).
  • Proprietary encryption (password‑protected; algorithm not publicly documented).
  • Extraction requires WinRAR, 7‑Zip with RAR plugin, or other compatible software.

Practical Tips for Managing These Files

  1. Back‑up originals before creating archives – compression is lossless, but an uncompressed copy protects against accidental corruption.
  2. Choose the right image format:

    • Use .png for graphics that need lossless quality or transparency.
    • Use .jpg for photographs where a smaller file size is more important than perfect fidelity.

  3. Store tabular data as .csv when you need to import/export between spreadsheets, databases, or statistical software.
  4. Separate design from content by using external .css files – a single change updates the whole site.
  5. Compress before emailing – bundle related files into a .zip to stay under typical school attachment limits (10 MB). For larger collections, split a .rar archive into volumes.
  6. Use .pdf for documents that must retain exact layout, be searchable, or need security (password protection, encryption).
  7. File‑naming & folder‑structure checklist (syllabus requirement):

    • All lower‑case letters; avoid spaces – use underscores (_) or hyphens (-).
    • Include the date in YYYYMMDD format (e.g., report_20231230.pdf).
    • Add a version number if the file is revised (v1, v2, …).
    • Use a logical hierarchy, e.g. Year/Subject/Topic/.
    • Avoid special characters such as * ? & % $ # @ ! that some operating systems treat as commands.

Cross‑Reference to Other Syllabus Sections

  • Section 18 – Data handling: .csv for importing/exporting tables; .txt and .rtf for simple data storage.
  • Section 19 – Website authoring: .html, .css, .gif, .jpg, .png.
  • Section 21 – Document production: .pdf, .rtf, .txt.
  • Section 8 – e‑Safety & evidence documents: .zip and .rar for secure evidence bundles; password protection aligns with data‑protection concepts.

Syllabus Checklist – Mapping Formats to Required Points

FormatSyllabus point(s) satisfied
.css11.1 – purpose of generic file formats; 19.2 – use of external style sheets.
.csv11.2 – data exchange; 18.1 – storing tabular data; 18.3 – importing/exporting between applications.
.gif11.1 – cross‑platform graphics; 19.3 – simple web graphics and animation.
.htm / .html11.1 – generic markup; 19.1 – structure of web pages; 19.4 – inclusion of metadata.
.jpg / .jpeg11.1 – generic image format; 19.2 – photographs on the web; 19.5 – lossy compression.
.pdf11.1 – generic document format; 21.1 – preserving layout; 21.3 – embedding forms and security.
.png11.1 – generic image format; 19.2 – lossless graphics; 19.6 – alpha‑channel transparency.
.rtf11.1 – generic document format; 21.2 – basic formatting across platforms; 21.4 – limitation (no automatic TOC).
.txt11.1 – plain‑text format; 8.2 – e‑safety (no hidden data); 18.2 – simple data storage.
.zip11.2 – purpose of compression; 8.3 – security (password encryption); 8.4 – evidence document bundling.
.rar11.2 – higher compression ratio; 8.3 – security (proprietary encryption); 8.4 – splitting large evidence files.

Additional Practical Notes

  • Typical email attachment limits in schools are 10 MB; a single high‑resolution .jpg may exceed this, so compress with .zip or reduce image resolution.
  • Exam evidence documents are often required to be under 5 MB; using .pdf (which compresses text well) or a .zip archive helps stay within limits.
  • When converting between formats:

    • From .png to .jpg – expect loss of transparency and possible quality loss.
    • From .rtf to .pdf – layout is preserved and the file becomes read‑only.
    • From .csv to .xlsx – data remains, but formatting (colours, borders) can be added.

Suggested diagram: Decision flowchart for selecting a file format based on content type (text, image, data table, archive) and required features (compression, transparency, editability, security).