Abstract
Background:
Cluster randomised crossover designs (CRXOs) are a powerful type of longitudinal cluster randomised trial in which all participating clusters switch between two treatment conditions. “Multiple-period” CRXOs divide the trial duration into a number of periods of equal length and can allow for multiple crossovers between treatment conditions. It can be assumed that increasing the number of crossovers leads to an increase in statistical power. We investigate whether this is true for standard correlation structures, comparing CRXO designs with equal numbers of clusters, participants, and periods but differing in the number of crossovers, considering continuous outcomes.
Methods:
We consider the formula for the variance of the treatment effect estimator for multiple-period CRXOs under exchangeable and block-exchangeable within-cluster correlation structures, assuming equal cluster-period sizes and different patterns of treatment conditions in the treatment sequences varying in the number of crossovers. We also conduct a simulation study to compare the statistical power between multiple-period CRXO designs with different numbers of crossovers that share the same number and duration of periods and the same number of participants in each cluster period.
Results:
Under exchangeable and block-exchangeable correlation structures and equal cluster-period sizes, the number of crossovers in the treatment sequences does not impact study power, provided the design is balanced in terms of the number of periods and clusters implementing each condition.
Conclusions:
When an exchangeable or block-exchangeable within-cluster correlation structure and a time-invariant effect of treatment are assumed, power calculations for CRXO designs are invariant to the specific ordering of treatment conditions. In particular, a CRXO design with additional crossovers does not lead to increased statistical power compared to a CRXO design with just one crossover for these within-cluster correlation structures. Further work is required to investigate the utility of multiple crossovers in situations where the treatment effect varies over time.
Keywords
Background
Cluster randomised crossover designs (CRXOs) are a type of longitudinal cluster randomised trial in which clusters alternate between two treatment conditions over time; each alternation is called a “crossover”. 1 These designs are suitable for interventions that can be removed once implemented. When suitable, CRXOs are efficient design choices: unlike standard parallel cluster randomised trials, CRXOs enable within-cluster comparisons, permitting more precise treatment effect estimation. CRXOs thus generally require far fewer clusters than parallel cluster randomised trials, 2 potentially making infeasible trials feasible. 3
While there are many variants and generalisations of CRXOs, such as those that include more than two sequences or interventions, 4 we consider two-sequence CRXOs. The most widely used variant is the two-period CRXO, where half of the clusters are randomly selected to commence with condition 1 before switching to condition 2, while the other half start with condition 2 before switching to condition 1; 5 an example is in Figure 1 (Design 0). Multiple-period CRXOs extend the two-period design to more periods, allowing each cluster to switch between treatment conditions multiple times. Here, we consider balanced designs consisting of a pair of dual sequences with an equal number of clusters assigned to each sequence. Examples of multiple-period CRXOs with two treatment sequences are shown in Figure 1, indicating that there may be one or multiple crossovers between treatment conditions in each sequence of a multiple-period CRXO.

Cluster randomised crossover (CRXO) design schematics with varying numbers of periods (rows), with one crossover (left column), two crossovers (middle column) and T-1 crossovers (right column).
Several recent trials have adopted multiple-period CRXO designs involving multiple crossovers. For example, the B-Free trial evaluated whether a restrictive benzodiazepine usage policy during cardiac surgery reduced the incidence of post-surgical delirium. 6 Twenty hospitals were randomised to sequences that switched between the new restrictive and standard policies every 4 weeks, across 18 periods. The CALM-ICU trial was conducted over four 3-month periods in intensive care units, where clusters switched between two different antipsychotic drugs for treating hyperactive delirium among critically ill patients. 7 The Aqueous-PREP trial similarly used a multiple-period CRXO design, with 14 hospitals altering between two preoperative antiseptic solutions every 2 months. 8
There thus may be a perception that in multiple-period CRXOs, statistical power increases with the number of times clusters switch between treatment conditions. The B-Free trial investigators noted: “We incorporated multiple crossovers to gain statistical power,” 6 and similarly, in the Aqueous-PREP trial, the authors stated “performing multiple treatment crossovers was expected to create a marginal increase in statistical power.” 8 Here, we investigate whether increasing the number of crossovers in a multiple-period CRXO leads to gains in statistical power under standard modelling assumptions.
Here, we consider balanced designs consisting of a pair of dual sequences with an equal number of clusters assigned to each sequence and an equal number of participants included per cluster period. We consider a cross-sectional sampling structure where each individual provides a single measurement throughout the trial; cohort sampling structures are not considered. We aim to compare the statistical power of the designs in Figure 1 that are equal in total duration and number of periods but differ in the number of crossovers, for example, Designs 1A versus 1B versus 1C, 2A versus 2B versus 2C, and 3A versus 3B versus 3C.
Methods
Statistical framework for CRXOs
We consider a linear mixed model for
where
where
The power of a design to detect an effect size
where
Results
Impact of the number of crossovers on statistical power
Equation (2) for
This is most obvious when
We emphasise that we should not make comparisons between designs with the same total number of participants in each cluster but differing numbers of study periods; that is, while we can compare Design 1A with 1B, 1A with 1C or 1B with 1C (all with four periods), we cannot compare these designs with Designs 2A, 2B, 2C (six periods) or 3A, 3B, 3C (eight periods). The reason is that the ICC (
Hence, when exchangeable or block-exchangeable correlation structures are assumed, the power of a CRXO does not depend on the number of crossovers in the treatment sequences: for a fixed number of periods, including multiple crossovers offers no statistical advantage.
Simulation study
We conducted a simulation study to illustrate and empirically validate the theoretical findings. We simulated data from Equation (1) for CRXO designs with the same number of clusters, periods, and participants per cluster period but differing in the number of crossovers: comparing Designs 1A to 1C, 2A to 2C, and 3A to 3C. Table 1 summarises the parameter settings. For each combination of parameters, theoretical power was calculated using the Shiny CRT calculator. 10
Simulation settings.
Two thousand datasets were simulated for each combination of parameters (36 combinations): for CRXO designs with
Each simulated dataset was analysed by fitting the linear mixed-effects model with random effects for cluster and cluster period in Equation (1), with parameters estimated via restricted maximum likelihood. When CAC = 1, only the cluster-level random effect was included in the model. Since the objective of the analysis was to compare the empirical power between each pair of designs, we calculated the proportion of simulated datasets in which
Figure 2 displays the empirical and theoretical power of the designs across all parameter settings, with separate panels for designs with different numbers of periods. In all scenarios, the empirical power for each pair of designs is identical up to Monte Carlo error and aligns with theoretical power. This supports our theoretical result: power is not influenced by the number of crossovers when the correlation structure is either block-exchangeable or exchangeable. As expected, the figure shows that power decreases with increasing ICC when the CAC is low (e.g. 0.5) but remains relatively stable as the CAC approaches 1. Power increases as the total number of participants in each cluster increases, regardless of the values of ICC and CAC. Similar results for smaller sample sizes (

Empirical power for CRXO designs with a single crossover (solid line) and CRXO designs with multiple crossovers (dashed line), for designs with four periods (top panel), six periods (middle panel), and eight periods (bottom panel), across all combinations of intra-cluster correlation (ICC) and cluster autocorrelation (CAC). Theoretical power (dotted line) is included for comparison.
Conclusion
In this paper, we have demonstrated, theoretically and via simulation, that when an exchangeable or block-exchangeable within-cluster correlation structure is assumed when planning a cluster randomised crossover trial, all designs with a fixed number of periods of the same duration have the same precision, regardless of the number of crossovers in the treatment sequences. In particular, designs with the maximum number of crossovers confer no additional precision benefit compared to those with the same number of periods and just one crossover. Furthermore, as pointed out by Hemming et al., 1 including multiple crossovers can reduce the practicality of a design by requiring clusters to stop and start interventions multiple times, and when washout periods are required to transition between interventions, it may not be desirable to include a large number of crossovers.
Researchers with access to a dataset to estimate ICCs and CACs when planning a trial may question how to divide time up into periods in this dataset: how many periods should they choose? In general, the length of periods used to estimate ICCs and CACs should match those assumed in the planned design. In many contexts, we would expect that the ICC and CAC will increase as the number of periods increases (due to the shorter duration of time of each period), but further investigation is required.
The results in this paper hold under two commonly used assumptions: (1) the within-cluster correlation structure is exchangeable or block-exchangeable and (2) treatment effects are immediate and constant across time. However, numerous crossovers may be beneficial in other circumstances. First, if a discrete-time or continuous-time decay within-cluster correlation structure is more appropriate, then there will be gains in study power through the inclusion of multiple crossovers.11,16 Second, when the treatment effect is time-varying, a CRXO design with intervention periods of limited duration may provide more power to detect “immediate” or “time-limited” treatment effects; further work is required to investigate this.17,18 If the objective is to estimate carryover effects or period-by-intervention interactions, or there is a need to reduce predictability of treatment assignment, multiple crossovers may be justified. We have also assumed that cluster periods are of equal sizes; the impact of deviations from this assumption requires further work. In addition, when a cohort sampling structure is considered, additional crossovers may increase power, but we expect this will depend on the within-individual correlation structure. Moreover, extending the number or duration of periods could lead to increased power. Increases in power under these alternative scenarios could be explored in future work, and where gains are possible, trialists would need to weigh those benefits against feasibility constraints.
Supplemental Material
sj-pdf-1-ctj-10.1177_17407745261431140 – Supplemental material for Additional crossovers in cluster randomised crossover trials do not always increase statistical power
Supplemental material, sj-pdf-1-ctj-10.1177_17407745261431140 for Additional crossovers in cluster randomised crossover trials do not always increase statistical power by KM Tanvir, Andrew B Forbes, Kelsey L Grantham and Jessica Kasza in Clinical Trials
Footnotes
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: J.K. is supported by an NHMRC Investigator (Leadership 1) Grant (GNT 2033380).
Supplemental material
Supplemental material for this article is available online.
