Sage Journals: Discover world-class research

Abstract

Background/Aims:

Multiple imputation is often recommended over complete case analysis for handling missing data in clinical trials due to its ability to recover information from participants with incomplete data. While multiple imputation is generally held to be more efficient than complete case analysis, its planned use in clinical trials is typically not considered during sample size estimation. The standard approach of inflating the sample size for anticipated loss to follow-up is applicable for complete case analysis but could lead to excess power and hence inefficient resource use should multiple imputation be planned for analysis. In this article, we systematically reviewed published clinical trials with the aim of quantifying the precision advantages of multiple imputation over complete case analysis in treatment effect estimation, hence informing sample size planning for future trials.

Methods:

We conducted a targeted review of clinical trials published between January 2019 and December 2023 in Lancet, The BMJ, Journal of the American Medical Association and New England Journal of Medicine. Clinical trials were eligible for inclusion if point and variance estimates for the effect of treatment on a primary efficacy or safety outcome could be determined for both multiple imputation and complete case analysis. The design effect due to multiple imputation was calculated as the variance of the treatment effect estimate using multiple imputation divided by the corresponding variance using complete case analysis. As a supplementary analysis, we also conducted an untargeted review of other journals by searching in PubMed for clinical trials with the keywords ‘imputation’ or ‘imputed’ in their title or abstract.

Results:

The targeted search identified 547 articles, of which 59 satisfied eligibility criteria. Included trials tended to be large in size (median 653 participants) and reported a median of 8.6% missing data in the complete case analysis of the primary outcome (range 0.4%–30.5%). Multiple imputation was most frequently applied using chained equations under a missing at random assumption, with auxiliary variables included in the imputation model in most trials. The median design effect due to multiple imputation was 1.00 in both unadjusted (n = 15 trials) and covariate-adjusted analyses (n = 46 trials), suggesting multiple imputation typically was not offering precision advantages over complete case analysis. Similar design effects were observed in the untargeted review (median 0.96 and 1.01 for unadjusted and covariate-adjusted analyses), despite higher rates of missing data overall (median 15.7%, n = 49 trials).

Discussion:

Multiple imputation did not consistently lead to more precise treatment effect estimates than complete case analysis in the trials included in the review. Findings should not be construed as an argument against the use of multiple imputation but suggest the standard approach of inflating the sample size for anticipated loss to follow-up is reasonable when multiple imputation is planned for analysis.

Keywords

Missing data clinical trial multiple imputation sample size power

Get full access to this article

View all access options for this article.

References

Rubin

Multiple imputation for nonresponse in surveys. New York: Wiley & Sons, 1987.

Lee

Carlin

Simpson

, et al. Assumptions and analysis planning in studies with missing data in multiple variables: moving beyond the MCAR/MAR/MNAR classification. Int J Epidemiol 2023; 52: 1268–1275.

European Medicines Agency. ICH E9 (R1) Addendum on estimands and sensitivity analysis in clinical trials to the guideline on statistical principles for clinical trials, 2020, https://www.ema.europa.eu/en/documents/scientific-guideline/ich-e9-r1-addendum-estimands-and-sensitivity-analysis-clinical-trials-guideline-statistical-principles-clinical-trials-step-5_en.pdf

White

Carpenter

Horton

NJ.

Including all individuals is not enough: lessons for intention-to-treat analysis. Clin Trials 2012; 9: 396–407.

Little

D’Agostino

Cohen

, et al. The prevention and treatment of missing data in clinical trials. N Engl J Med 2012; 367: 1355–1360.

Committee for Proprietary Medicinal Products. Guideline on missing data in confirmatory clinical trials. Report no. EMA/CPMP/EWP/1776/99 Rev. 1, 2009, https://www.ema.europa.eu/en/documents/scientific-guideline/guideline-missing-data-confirmatory-clinical-trials_en.pdf

Jakobsen

Gluud

Wetterslev

, et al. When and how should multiple imputation be used for handling missing data in randomised clinical trials – a practical guide with flowcharts. BMC Med Resmethodol 2017; 17: 162.

Hughes

Heron

Sterne

JAC

, et al. Accounting for missing data in statistical analyses: multiple imputation is not always the answer. Int J Epidemiol 2019; 48: 1294–1304.

Sullivan

White

Salter

, et al. Should multiple imputation be the method of choice for handling missing data in randomized trials. Stat Methods Med Res 2018; 27(9): 2610–2626.

10.

Collins

Schafer

Kam

CM.

A comparison of inclusive and restrictive strategies in modern missing data procedures. Psychol Methods 2001; 6(4): 330–351.

11.

Little

Carpenter

Lee

KJ.

A comparison of three popular methods for handling missing data: complete-case analysis, inverse probability weighting, and multiple imputation. Sociol Method Res 2024; 53: 1105–1135.

12.

Madley -Dowd

Hughes

Tilling

, et al. The proportion of missing data should not be used to guide decisions on multiple imputation. J Clin Epidemiol 2019; 110: 63–73.

13.

Hardt

Herke

Leonhart

Auxiliary variables in multiple imputation in regression with missing X: a warning against including too many in small sample research. BMC Med Res Methodol 2012; 12: 184.

14.

Graham

JW.

Missing data: analysis and design. New York: Springer, 2012.

15.

Mainzer

Nguyen

Carlin

, et al. A comparison of strategies for selecting auxiliary variables for multiple imputation. Biom J 2024; 66(1): e2200291.

16.

Zha

Harel

Power calculation in multiply imputed data. Statistical Papers 2021; 62: 533–559.

17.

Bell

Fiero

Horton

, et al. Handling missing data in RCTs; a review of the top medical journals. BMC Med Resmethodol 2014; 14: 118.

18.

Hopewell

Chan

A-W

Collins

, et al. CONSORT 2025 explanation and elaboration: updated guideline for reporting randomised trials. BMJ 2025; 389: e081124.

19.

Page

McKenzie

Bossuyt

, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 2021; 372: n71.

20.

Tan

Cro

Van Vogt

, et al. A review of the use of controlled multiple imputation in randomised controlled trials with missing outcome data. BMC Med Res Methodol 2021; 21: 72.

21.

Hayati Rezvan

Lee

Simpson

. The rise of multiple imputation: a review of the reporting and implementation of the method in medical research. BMC Med Res Methodol 2015; 15: 30.

22.

Wood

White

Thompson

SG.

Are missing outcome data adequately handled? A review of published randomized controlled trials in major medical journals. Clin Trials 2004; 1(4): 368–376.

23.

Higgins

Thomas

Cochrane Handbook for Systematic Reviews of Interventions Version 6.3, 2022, https://training.cochrane.org/handbook/archive/v6.3: The Cochrane Collaboration

24.

White

Royston

Wood

AM.

Multiple imputation using chained equations: issues and guidance for practice. Stat Med 2011; 30: 377–399.

25.

Flum

Davidson

Monsell

, et al. A randomized trial comparing antibiotics with appendectomy for appendicitis. N Engl J Med 2020; 383: 1907–1919.

26.

Fernandez

Achten

Parsons

, et al. Cemented or uncemented hemiarthroplasty for intracapsular hip fracture. N Engl J Med 2022; 386: 521–530.

27.

Palmer

AJR

Gupta

Fernquest

, et al. Arthroscopic hip surgery compared with physiotherapy and activity modification for the treatment of symptomatic femoroacetabular impingement: multicentre randomised controlled trial. BMJ 2019; 364: l185.

28.

van der Vaart

Vollebregt

Milani

, et al. Effect of pessary vs surgery on patient-reported improvement in patients with symptomatic pelvic organ prolapse: a randomized clinical trial. JAMA 2022; 328: 2312–2323.

29.

Antoszyk

Glassman

Beaulieu

, et al. Effect of intravitreous aflibercept vs vitrectomy with panretinal photocoagulation on visual acuity in patients with vitreous hemorrhage from proliferative diabetic retinopathy: a randomized clinical trial. JAMA 2020; 324: 2383–2395.

30.

Butler

Gillespie

White

, et al. C-reactive protein testing to guide antibiotic prescribing for COPD exacerbations. N Engl J Med 2019; 381: 111–120.

31.

Fidler

Stöhr

Pace

, et al. Antiretroviral therapy alone versus antiretroviral therapy with a kick and kill approach, on measures of the HIV reservoir in participants with recent HIV infection (the RIVER trial): a phase 2, randomised trial. Lancet 2020; 395: 888–898.

32.

Finkelstein

Zhou

Taubman

, et al. Health care hotspotting – a randomized, controlled trial. N Engl J Med 2020; 382: 152–162.

Multiple imputation in clinical trials – what difference does it make?: A systematic review of the impact of multiple imputation on treatment effect estimation

Abstract

Background/Aims:

Methods:

Results:

Discussion:

Keywords

Get full access to this article

References