☰ Show TOC

Part 3: Handling Missing Values

Questions 21 and 22

Choose TWO letters, A–E.
Which TWO opinions about deleting rows with missing values do the students express?

  1. It is only suitable for small datasets.
    B. It can distort results if the missingness is not random.
    C. It is the most transparent method for beginners.
    D. It is better than imputation in most cases.
    E. It saves time but often wastes valuable information.

Questions 23 and 24

Choose TWO letters, A–E.
Which TWO predictions about how companies will handle missing data are the students doubtful about?

  1. Most firms will standardise a single imputation rule across all projects.
    B. More teams will document missing-data assumptions in reports.
    C. Automated tools will remove the need for human judgement.
    D. Multiple imputation will become more common in business analytics.
    E. Missing data will be less of an issue because data collection will improve.

Questions 25–30

What comment do the students make about each technique?
Choose SIX answers from the box and write the correct letter, A–G, next to Questions 25–30.

Comments
A. This method is fast but can seriously shrink variability.
B. It is easy to explain to non-technical stakeholders.
C. It works well for time-ordered data if the gaps are short.
D. It is usually inappropriate when missingness depends on the value itself.
E. It can perform well but becomes slow on large datasets.
F. It is praised because it reflects uncertainty rather than pretending the values are known.
G. It often improves predictive models when missingness itself carries information.

Techniques
25 Mean or median replacement

26 Deleting incomplete records

27 Adding a missing-value flag

28 Interpolating between values

29 Filling from similar cases

30 Creating several filled versions and combining results