A review of current practice in the design and analysis of extremely small stepped-wedge cluster randomized trials

Abstract

Background/Aims

Stepped-wedge cluster randomized trials tend to require fewer clusters than standard parallel-arm designs due to the switches between control and intervention conditions, but there are no recommendations for the minimum number of clusters. Trials randomizing an extremely small number of clusters are not uncommon, but the justification for small numbers of clusters is often unclear and appropriate analysis is often lacking. In addition, stepped-wedge cluster randomized trials are methodologically more complex due to their longitudinal correlation structure, and ignoring the distinct within- and between-period intracluster correlations can underestimate the sample size in small stepped-wedge cluster randomized trials. We conducted a review of published small stepped-wedge cluster randomized trials to understand how and why they are used, and to characterize approaches used in their design and analysis.
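The effect of ignoring distinct within- and between-period intracluster correlations can be illustrated with a minimal sketch. It assumes a cross-sectional design, a continuous outcome, equal cluster-period sizes, and a linear mixed model with categorical period effects. Under these assumptions the cluster-period means have a covariance matrix of the form aI + bJ within each cluster, so the closed-form variance of Hussey and Hughes applies with the two variance components replaced by a and b. The function names, the four-cluster design, and all numeric inputs below are hypothetical and chosen only for illustration; the power formula uses a normal approximation, which is itself optimistic with very few clusters.

```python
from statistics import NormalDist


def treatment_effect_variance(X, m, icc_within, icc_between, total_var=1.0):
    """Variance of the treatment effect estimator for a stepped-wedge design.

    X is a clusters-by-periods 0/1 matrix of intervention indicators and m is
    the (assumed equal) number of participants per cluster-period. Setting
    icc_between equal to icc_within gives the simple exchangeable case.
    """
    I, T = len(X), len(X[0])
    # Cluster-period means have covariance a*I + b*J within a cluster.
    a = total_var * ((1 - icc_within) / m + icc_within - icc_between)
    b = total_var * icc_between
    U = sum(sum(row) for row in X)
    W = sum(sum(X[i][j] for i in range(I)) ** 2 for j in range(T))
    V = sum(sum(row) ** 2 for row in X)
    numerator = I * a * (a + T * b)
    denominator = (I * U - W) * a + (U ** 2 + I * T * U - T * W - I * V) * b
    return numerator / denominator


def power(effect, variance, alpha=0.05):
    """Two-sided power under a normal approximation (optimistic for few clusters)."""
    z = NormalDist()
    return z.cdf(abs(effect) / variance ** 0.5 - z.inv_cdf(1 - alpha / 2))


# Hypothetical design: 4 clusters, 5 periods, one cluster crossing over per step.
X = [
    [0, 1, 1, 1, 1],
    [0, 0, 1, 1, 1],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 0, 1],
]
v_exch = treatment_effect_variance(X, m=50, icc_within=0.05, icc_between=0.05)
v_block = treatment_effect_variance(X, m=50, icc_within=0.05, icc_between=0.025)
print(f"variance, single ICC:    {v_exch:.4f}, power {power(0.3, v_exch):.2f}")
print(f"variance, distinct ICCs: {v_block:.4f}, power {power(0.3, v_block):.2f}")
```

With these illustrative inputs, halving the between-period correlation roughly doubles the variance of the treatment effect estimator, so a calculation that assumes a single intracluster correlation would overstate power for the same number of clusters.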

Methods

Electronic searches were used to identify primary reports of full-scale stepped-wedge cluster randomized trials published during the period 2016–2022; the subset that randomized two to six clusters was identified. Two reviewers independently extracted information from each report and any available protocol. Disagreements were resolved through discussion.

Results

We identified 61 stepped-wedge cluster randomized trials that randomized two to six clusters, with a median sample size (Q1–Q3) of 1426 (420–7553) participants. Twelve (19.7%) gave some indication that the evaluation was considered a “preliminary” evaluation and 16 (26.2%) recognized the small number of clusters as a limitation. Sixteen (26.2%) provided an explanation for the limited number of clusters: the need to minimize contamination (e.g. by merging adjacent units), limited availability of clusters, and logistical considerations were common explanations. The majority (51, 83.6%) presented sample size or power calculations, but only one assumed distinct within- and between-period intracluster correlations. Few (10, 16.4%) utilized restricted randomization methods; more than half (34, 55.7%) identified baseline imbalances. The most common statistical method for analysis was the generalized linear mixed model (44, 72.1%). Only four trials (6.6%) reported statistical analyses that accounted for the small number of clusters: one used generalized estimating equations with a small-sample correction, two used generalized linear mixed models with a small-sample correction, and one used Bayesian analysis. Another eight (13.1%) used fixed-effects regression, the performance of which requires further evaluation in stepped-wedge cluster randomized trials with small numbers of clusters. None used permutation tests or cluster-period level analysis.

Conclusion

Methods appropriate for the design and analysis of small stepped-wedge cluster randomized trials have not been widely adopted in practice. Greater awareness is required that the use of standard sample size calculation methods can provide spuriously low numbers of required clusters. Methods such as generalized estimating equations or generalized linear mixed models with small-sample corrections, Bayesian approaches, and permutation tests may be more appropriate for the analysis of small stepped-wedge cluster randomized trials. Future research is needed to establish best practices for stepped-wedge cluster randomized trials with a small number of clusters.
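As an illustration of one of the approaches named above, the following is a minimal sketch of an exact permutation test for a stepped-wedge trial with very few clusters. It assumes cluster-period means as input and uses an equally weighted average of within-period differences between intervention and control clusters as the test statistic; this weighting is a simplifying assumption rather than a recommended estimator. The function names and the data are hypothetical.

```python
from itertools import permutations


def within_period_statistic(means, start_period):
    """Equally weighted average of within-period differences in cluster-period means.

    means[i][j] is the mean outcome of cluster i in period j, and cluster i is
    under the intervention in every period j >= start_period[i]. Periods in
    which all clusters share the same condition contribute nothing.
    """
    n_clusters, n_periods = len(means), len(means[0])
    diffs = []
    for j in range(n_periods):
        treated = [means[i][j] for i in range(n_clusters) if j >= start_period[i]]
        control = [means[i][j] for i in range(n_clusters) if j < start_period[i]]
        if treated and control:
            diffs.append(sum(treated) / len(treated) - sum(control) / len(control))
    return sum(diffs) / len(diffs)


def permutation_p_value(means, start_period):
    """Two-sided p-value from enumerating every reassignment of clusters to sequences."""
    observed = within_period_statistic(means, start_period)
    stats = [within_period_statistic(means, list(p)) for p in permutations(start_period)]
    # Small tolerance so the observed allocation always counts itself.
    extreme = sum(abs(s) >= abs(observed) - 1e-12 for s in stats)
    return observed, extreme / len(stats)


# Hypothetical cluster-period means: 4 clusters, 5 periods, one cluster per step.
means = [
    [10.1, 12.0, 12.3, 11.9, 12.4],
    [9.8, 10.2, 12.1, 12.2, 11.8],
    [10.3, 10.0, 10.1, 12.5, 12.0],
    [9.9, 10.1, 9.7, 10.2, 12.3],
]
start_period = [1, 2, 3, 4]
observed, p_value = permutation_p_value(means, start_period)
print(f"observed statistic {observed:.3f}, permutation p-value {p_value:.3f}")
```

With four clusters allocated to four distinct sequences there are only 4! = 24 possible allocations, so the p-value from such a test cannot fall below 1/24 (about 0.042). This floor is one concrete reason why the number of clusters limits what any analysis can show.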

Keywords

Systematic review, stepped-wedge designs, sample size and power calculation, small number of clusters, small-sample correction, small stepped-wedge trials
