Spatiotemporal effects on dengue incidence based on a large cluster randomized study

Abstract

A recent large-scale cluster randomized test-negative study assessed the impact of a mosquito-based intervention on the incidence of clinical dengue showing a protective efficacy of 77.1% (95% CI: (65.3%, 84.9%)). While the intervention was randomized at a cluster-level, human and mosquito movement suggest potential violations in assumptions necessary for intention-to-treat analyses to produce accurate estimates of the full intervention effect due to spatial clustering of dengue cases, and/or potential non-independence in the intervention arising from spillover of the intervention (or control) across cluster boundaries. We address these distinct but related effects using two approaches. First, we examine whether a clustering effect exists, that is, whether the presence of a recent dengue case in the sample within a specified distance from a residence raises the risk of dengue. Second, we use cluster reallocation techniques to examine intervention spillover effects. We find strong spatial effects of the presence of dengue cases on the risk of clinical dengue that exhibit both serospecificity and a dose response, more evident in control than intervention clusters at least on an additive scale. Contrarily, there is no evidence of any appreciable local spillover effect from intervention to control clusters, or vice versa, in terms of either the risk of dengue infection or the level of disease clustering.

Keywords

Cluster-randomized trial cluster reallocation dengue disease clustering spatiotemporal point process spillover test-negative studies

1. Introduction

The Applying Wolbachia to Eliminate Dengue (AWED) trial assessed the impact of a community-level intervention on the incidence of dengue in a large city, Yogyakarta, Indonesia, based on the geographic clusters.^1–3 As is standard, clusters were randomized to either receive a community-level intervention or to act as control. Unlike traditional cluster-randomized trials which require the enrollment and intensive longitudinal surveillance of cluster cohorts, the study used test-negative sampling that exploited clinic-based surveillance systems to identify and enroll symptomatic health-care seeking patients. Once enrolled, individuals are tested for the disease of interest. Those who test positive are classified as cases, and those who test negative as controls. The process of sampling and enrolling individuals over the study period thus resembles a variant of a case-control study design superimposed over cluster randomization.

The primary intention-to-treat (ITT) analysis treated the intervention status of cluster of residence as an individual’s exposure. This approach exploited the cluster randomization scheme as the basis for inference, but did not consider the role of other risk factors for dengue incidence (although a constrained randomization approach was used that balanced arms on known risk factors for dengue including age and historical dengue incidence). Thus, spatial effects were not considered in the basic ITT analysis, nor the possibility that estimation of the intervention effect might be affected by spillover of either the intervention or control conditions across cluster boundaries. A further nuance arises since infection with dengue virus arises from four distinct serotypes where transmission is serotype-specific.⁴ That is, an individual infected with the serotype DENV1 cannot give rise in a future chain of transmission to infections other than those associated with serotype DENV1. To account for this, we explore the presence of global, serotype-specific and serotype-discordant disease clustering.

This work characterizes two consequences of the spatiotemporal natural of the intervention itself and the disease of interest: first, the risk of an individual acquiring a new dengue infection associated with the local presence of other dengue cases in the recent past, and second, potential spillover effects. We consider the impact of proximal dengue cases (in time and space) on an individual’s risk of dengue—evidence of disease clustering—with interest in determining whether the relationship differs between intervention and control clusters. This can be achieved by defining a new proximity risk factor and incorporating it into basic logistic regression models that capture intervention efficacy while accounting for this clustering. For spillover, we consider a sensitivity analysis based on cluster reallocation schemes which enlarge or shrink cluster boundaries. Both analyses build on recent work, including a report on the spatiotemporal clustering of dengue that used a different approach, providing insight into the focal transmission of dengue in the intervention and control areas of the AWED study⁵ as well as a spatiotemporally resolved reanalysis of the AWED data, providing evidence of potential underestimation of the intervention effect due to human and mosquito movement within the AWED study.⁶

2. Effect of a proximal dengue case

We first examine the spatial impact of recent prevalent dengue cases on the risk of an individual acquiring a new dengue infection. To explore such effects we use geographic information system (GIS) information to locate all dengue cases in the study and their proximity in both space and time to all other participants, allowing us to create a proximal case indicator. Ignoring serotype, Table 1 provides a classification of dengue cases and test-negatives by whether the residence of any individual was located “close” to a dengue case in the data set, specifically a dengue case that had occurred in the prior 30 days for a participant whose residence was also within 300 m of the individual’s home location, these distances being based on the earlier work.⁵ This indicator variable thereby acts as a proxy for close prevailing dengue infections. Note that this table includes 67 dengue cases for whom the serotype was unknown. Seven individuals were infected by two different dengue serotypes at different points of time and only the first of these infections are included here. The estimated odds ratio (OR) associated with the presence of a proximal case, is 4.98 (95% CI: (3.78, 6.54)), reflecting the impact of local spread of infection (Figure 1); estimation and inference is carried out according to Jewell et al.³; here, variability of estimation of the OR is first measured on the log scale and estimated using the standard robust sandwich estimator that accounts for clustering. This OR does not change materially ( $\hat{O R}$ = 4.65, 95% CI: (3.50, 6.18)) when the 67 unknown serotypes are removed from the analysis. As illustrated in Figure 1, the estimated OR increases when either the time period is reduced from 30 days or the distance is reduced from 300 m, although the uncertainty increases also due to smaller number of exposed when the proximal measure is tightened. For example, if a proximal case is defined as within 7 days and with a residence within 100 m, the estimated OR is 11.2, (95% CI: (6.21, 20.1)). Recall that these ORs reflect estimates of the relative risk ( $R R$ ) of being a test-positive dengue case comparing participants with recent proximal exposure (as defined explicitly) as compared to those without such exposure.³ From this, the efficacy of the intervention is immediately determined by $100 \times (1 - R R)$ .

Figure 1.

The estimated aggregate odds ratio (OR) associated with a proximal case, when the definition of “proximal” changes in space and time.

Table 1.

Effects of intervention and exposure on risk of a DENV, where a participant’s exposure is defined by a dengue infection occurrence (of another participant) within the prior 30 days and with a home location within 300 m of the infected participant. The ORs reported in the bottom half of the table are based on a logistic regression model, using GEEs to account for clustering, with CIs based on the standard sandwich variance estimator.

	Aggregate			Intervention			Control
Proximal case	DENV case	Test negative	OR (CI)	DENV case	Test negative	OR (CI)	DENV case	Test negative	OR (CI)
Exposed	180	888	4.98	13	231	2.72	167	657	4.08
Unexposed	205	5033	(3.78, 6.54)	54	2607	(1.66, 4.46)	151	2426	(3.06, 5.46)

Variable	OR	(95% CI)	p-value
Exposure	4.08	(3.06, 5.46)	0.00	<0.001
Intervention	0.33	(0.23, 0.49)	0.00	<0.001
Exposure $\times$ Intervention	0.67	(0.38, 1.18)	0.16

DDENV: test-positive dengue case; GEEs: generalized estimating equations; ORs: odds ratios; CI: confidence interval.

The estimated ORs associated with a proximal case (with a space-time window of 300 m and 30 days) are 2.72 (95% CI: (1.66, 4.46)), and 4.08 (95% CI: (3.06, 5.46)) in the intervention and control arms, respectively. Note that a participant’s intervention arm is determined by their place of residence but a proximal case may not be in the same arm. Although this difference does not provide strong evidence that the effect of a recent proximal dengue case differs (multiplicatively) between treatment and control clusters (the p-value associated with the interaction term is 0.16), it nevertheless suggests more disease clustering in control clusters than in intervention clusters; further, the findings suggest a degree of additive interaction induced by the greater frequencies of dengue infections in control, as compared to intervention, clusters.

Note that, the reported marginal intervention OR for the trial is 0.23 (95% CI: (0.15, 0.35)),¹ when there is no adjustment for the existence of proximal cases. The data in Table 1 yield an estimated intervention OR of 0.33 (95% CI: (0.23, 0.49)) when there is no proximal case; when a proximal case is present the intervention OR decreases (and thus efficacy increases) slightly to 0.22 (95% CI: (0.11. 0,46)), although this small change is not quite statistically significant (p = 0.16). When the interaction term is omitted, the estimated intervention OR is 0.30 (95% CI: (0.20, 0.46)), reported in Tables 2 to 7. This slight increase in the OR when proximal cases are included as a risk factor (as compared to the original ITT estimate) does not reflect any diminution of the efficacy of the intervention, but suggests that a small part of the success of the intervention in reducing risk is accounted for by a reduction in the number of proximal cases in intervention clusters. In this sense, the proximal risk factor is acting as a mediator of a small component of the intervention effect, so that the OR of 0.30 can then be interpreted as a measure of the direct effect of the intervention.

Table 2.

Effects of intervention and proximal exposures on the risk of infection with dengue of any serotype, where proximal exposure is defined by existence of an infection occurrence within the past 30 days with a home location within 300 m; exposure dose captures the number of proximal DENV cases according to this same definition. The ORs are based on a logistic regression model (without interaction), using GEE to account for clustering, with CIs based on the standard sandwich variance estimator.

Variable	OR	(95% CI)	p-value
Exposure	3.86	(2.98, 5.01)	0.00	<0.001
Intervention	0.300	(0.198, 0.457)	0.00	<0.001
Variable	OR	(95% CI)	p-value
Dose exposure	1.56	(1.36, 1.79)	0.00	<0.001
Intervention	0.294	(0.197, 0.439)	0.00	<0.001

DENV: test-positive dengue case; GEEs: generalized estimating equations; ORs: odds ratios; CI: confidence interval.

Table 3.

Effects of intervention and exposure on infection with dengue serotype DENV1, where a participant’s exposure is defined by a dengue infection occurrence (of another participant) within the prior 30 days and with a home location within 300 m of the infected participant.