Abstract
Background and objectives:
It has been suggested that white matter hyperintensity lesions (WMHs), which typically progress over time, can also regress, and that this might be associated with favorable cognitive performance. We determined the prevalence of WMH regression in patients with cerebral small vessel disease (SVD) and examined which demographic, clinical, and radiological markers were associated with this regression.
Methods:
We used semi-automated lesion marking methods to quantify WMH volume at multiple timepoints in three cohorts with symptomatic SVD; two with moderate-to-severe symptomatic SVD (the SCANS observational cohort and the control arm of the PRESERVE interventional trial) and one with mild-to-moderate SVD (the RUN DMC observational cohort). Mixed-effects ordered logistic regression models were used to test which factors predicted participants to show WMH regression.
Results:
No participants (0/98) in SCANS, 6/42 (14.3%) participants in PRESERVE, and 6/276 (2.2%) in RUN DMC showed WMH regression. On multivariate analysis, only lower WMH volume (OR: 0.36, 95% CI: 0.23–0.56) and better white matter microstructural integrity assessed by fractional anisotropy using diffusion tensor imaging (OR: 1.55, 95% CI: 1.07–2.24) predicted participant classification as regressor versus stable or progressor.
Discussion:
Only a small proportion of participants demonstrated WMH regression across the three cohorts, when a blinded standardized assessment method was used. Subjects who showed regression had less severe imaging markers of disease at baseline. Our results show that lesion regression is uncommon in SVD and unlikely to be a major factor affecting the use of WMH quantification as an outcome for clinical trials.
Introduction
Cerebral small vessel disease (SVD) is the most common cause of vascular cognitive impairment and causes approximately 25% of ischemic strokes. 1 Most cases are sporadic and typically associated with cardiovascular risk factors; 2 however, despite its importance, understanding of the underlying pathophysiology is incomplete and there are few effective disease-modifying treatments. 3
White matter hyperintensities (WMHs) are a radiological hallmark of SVD and independently predict both stroke and dementia. 4 These areas of presumed chronic ischemia are best seen on T2-weighted magnetic resonance imaging (MRI) as high signal in subcortical and brainstem structures. Observational studies in community-dwelling participants have shown that the total volume of such lesions increases over time.5–7 WMH progression has also been assessed in specific patient groups with ischemic stroke, 8 hypertension, 9 and unselected cardiovascular disease10,11 in which longitudinal lesion growth has been demonstrated. WMHs are readily quantifiable and increasingly used as an outcome measure (particularly in SVD trials) to test interventions in smaller sample sizes prior to large clinical trials with the endpoints of stroke and dementia.12–14
Although WMH volume tends to increase over time, there have been reports of WMH regression in a proportion of patients with stroke and SVD,15,16 with regression noted in up to 37% of participants in a population with minor stroke. 17 A recent meta-analysis found that WMH regression was documented explicitly in one-third of MRI studies investigating adults with sporadic SVD. 18 This finding has potentially important implications. If WMHs do regress, then better understanding of this process may inform potential treatment approaches. Second, it adds complexity to power calculations required for trial design and might increase requisite sample sizes for interventional studies. Various reasons have been suggested for observed WMH regression, including radiological and analytical factors. 19
To determine the frequency of WMH lesion volume regression while accounting for possible technical factors, we examined three cohorts of patients all with symptomatic SVD: in two, this was defined as a clinical lacunar stroke syndrome with a corresponding lacunar infarct and confluent WMHs on MRI (St George’s Cognition and Neuroimaging Study (SCANS) and PRESERVE) and in one, as any clinical symptoms compatible with and any radiological evidence of SVD on MRI (the Radboud University Nijmegen Diffusion Tensor and Magnetic Resonance Cohort (RUN DMC)). Using robust analysis techniques blinded to time point of scans, we determined whether regression of lesion load occurs across multiple time points and which patient and imaging factors (including more subtle measurements of white matter microstructural damage, such as diffusion tensor imaging (DTI)) are associated with this regression.
Methods
Study populations
Three cohorts of patients with symptomatic SVD and differing degrees of WMH severity were studied. All participants provided written, informed consent prior to enrollment.
The SCANS 20 was a prospective observational study that recruited patients with symptomatically defined SVD presenting with a lacunar stroke syndrome and had both a compatible lacunar infarct and at least early confluent WMHs on MRI (Fazekas scale 21 score ⩾ 2). MRI scans were performed at least 3 months post-stroke and annually for 3 years afterwards. Patients were recruited from three South London hospitals between March 2007 and October 2010, and the study was approved by the Wandsworth Research Ethics Committee (ukctg.nihr.ac.uk; study ID: 4577).
The PRESERVE study 13 was a multicentre randomized control trial of intensive versus standard blood pressure treatment in SVD. Patients with a lacunar stroke syndrome (and a compatible lacunar infarct on MRI) and WMHs of Fazekas score ⩾ 2 were recruited at least 3 months post-stroke from six UK-wide hospitals between February 2012 and October 2015. MRI was performed at baseline and after 2 years; the trial was approved by the Harrow Research Ethics Committee (reference: 11/LO/0458) and registered with the International Standard Randomized Control Trial Number registry (reference: ISRCTN37694103). For the purposes of this study, we included only participants from the standard treatment arm because WMH change in the intensive arm may be confounded by the effects of intensive antihypertensive treatment.
The RUN DMC 22 was a prospective long-term cohort study that recruited patients with any symptoms compatible with SVD and evidence of either lacune(s) or any WMHs on MRI. Patients were recruited at Radboud University Medical Center, Netherlands and had MRI scans in 2006, 2011, and 2015. Due to equipment upgrade after the first time point in the RUN DMC study, we used images from the second and third time point as baseline and follow-up time points. The study was approved by the Arnhem–Nijmegen Medical Research Ethics Committee (No. 2005/256).
Image acquisition
The three cohorts included participants at least 3 months after stroke, at which point they are expected to be clinically stable 23 and evidence of cavitation is likely to be visible already. 24 MRI scanner and sequence details for the studies above have previously been published20,22,25 and are summarized here in brief and fully in the Supplemental material.
MRI imaging in SCANS was performed on a 1.5 Tesla GE Signa HD MRI scanner at St George’s, University of London and included T1-weighted, fluid-attenuated inversion recovery (FLAIR) and DTIs.
Images in PRESERVE were acquired using the following eight 3.0 Tesla MRI scanners across the six sites: Siemens Prisma, Siemens Magnetom Prisma Fit, Siemens Verio, Philips Ingenia, Philips Achieva, and three Philips Achieva TX. T1-weighted scans were acquired to produce 1 mm3 isotropic resolution, and repetition time (TR) and echo time (TE) were optimized to ensure T1-weighting/tissue contrast was comparable across sites. FLAIR sequences had identical inversion times and were also TE matched with TR long enough to minimize T1-weighting (representative TR/TE/TI 11,000/120/2800 ms, final resolution 0.48 × 0.48 × 3 mm). DTIs were acquired using 32 equally spaced non-collinear diffusion gradients at b = 1000 s/mm2 and eight unweighted images at b = 0 s/mm2 (representative TR = 6850 ms, TE = 75 ms; final resolution is 2.5 mm3 isotropic). T1-weighted, FLAIR, and DTI images were used for this analysis.
In the RUN DMC study, MRI was acquired using a 1.5 Tesla Siemens Magnetom Avanto and included T1-weighted, FLAIR, and DTI sequences.
Image analysis
Two methods of measuring WMH lesion load were used. The analysis pipeline used in SCANS has previously been described. 26 In brief, images were pre-processed using Statistical Parametric Mapping 8 (http://fil.ion.ucl.ac.uk/spm/software/spm8/) to check orientation and co-register to MNI registration in 1 mm isotropic voxels. Group average tissue probability maps were created using a modified multivariate mixture of Gaussians 27 and individual images were then segmented using these custom maps and repaired manually if required using ITK-SNAP (http//www.itksnap.org) including the removal of any lacunes or hemorrhages. Each subject image was then warped to an individual participant midpoint average image to create divergence maps for voxels containing WMHs.
WMH lesion load was determined in PRESERVE and RUN DMC using Jim version 8.0 (http://xinapse.com/j-im-software/), a semi-automated program in which a region of interest is selected by the rater and voxels within this contour delineated. The program was run on a Microsoft® Surface PC and manual adjustment performed using a stylus tool to correct lesion boundaries on the screen. To minimize errors relating to the selection of lesions, images were marked slice by slice on a parallel split screen and the image intensity was matched between scans. To reduce the risk of bias, images were randomly displayed in terms of order of acquisition and the rater was blinded to image time point. Any hemorrhages and isolated lacunes were not included in the lesion mask; in the case of lacunes within areas of confluent WMHs, the entire area of FLAIR high signal was marked and the area of low signal isointense with cerebrospinal fluid was then removed.
We used previously described criteria to determine whether there was progression or regression of WMH lesion load. Participants were categorized as “regressors” if they showed a decrease in the total WMH volume of at least 0.25 cubic centimeters (cc) between any sequential scans. This is the minimum difference between WMH volume that can be appreciated visually and has previously been used to define lesion regression. 28 “Progressors” were defined as participants who showed an increase in greater than one standard deviation of the inter-scan change within each study as has also been used previously. 28
DTI images in SCANS and PRESERVE were pre-processed using the Eddy correct tool from Functional MRI of the Brain’s Diffusion Toolbox (http://fsl.fmrib.ox.ac.uk/fsl/fslwiki/FDTref). In RUN DMC, the DTI images were pre-processed using an in-house iteratively reweighted least squares algorithm. 29 In all three cohorts, fractional anisotropy (FA) and mean diffusivity (MD) histograms were created using the FMRIB’s DTIFit tool. White matter FA median and MD normalized peak height were used for analysis, as these DTI parameters have been shown to be reproducible between sites in SVD patients. 30
Brain volumes in SCANS and RUN DMC were calculated by excluding the WMH lesion maps from the gray and white matter masks derived from SPM8, correcting for intracranial volume using SIENAX (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/SIENA) and adding the lesion masks. Brain volumes in PRESERVE were calculated using applying SIENAX to images that had been intensity non-uniformity corrected by N4ITK and segmented using SPM12 including correction for intracranial volume using SIENAX as above.
A comparison of the three studies, including population and imaging details, is given in Table 1.
Comparison of the three included studies.
Data from the SCANS and PRESERVE studies can be obtained from Hugh Markus on reasonable request. Data from RUN DMC can be obtained from Frank-Erik de Leeuw on reasonable request.
Statistical analysis
Between-category differences in demographic factors, comorbidities, and baseline imaging parameters were tested using the Student’s t-test or one-way between-groups analysis of variance (ANOVA) for continuous variables and Pearson’s chi-square test for categorical variables (or Fisher’s exact tests for cases where a category had fewer than five observations) as appropriate. All analyses were performed in the R project for statistical computing V.3.6.3 (https://www.R-project.org/).
As the PRESERVE and RUN DMC images were analyzed using the same method, individual participant data were pooled and tested again using study site as a covariate. Ordered logistic regression models were performed using the “polr” function in the R package “MASS” (https://www.rdocumentation.org/packages/mass/) to test the association of demographic factors (age, sex, premorbid IQ, and years in education), participant comorbidities (hypertension, hypercholesterolemia, diabetes, and smoking history), and baseline imaging parameters (including MRI hallmarks of SVD, such as lacunar infarcts, cerebral microbleeds, and DTI metrics) with participant category. We used a mixed-effects model with study site as a random factor and other predictors as fixed factors. Selection of predictor variable was conducted using stepwise forward selection of any predictors with p value less than 0.05 on univariate analysis and backward selection of predictors that lost this significance in the multivariate model. We also used this method to test the effect of any change in brain volume on WMH regression.
Results
Proportion of regressors
We included 98 participants from SCANS (mean age 69.0 years, 66.3% male, mean baseline WMH volume 37.3 cc) who had four scans over a 3-year period. Of these, no participants showed lesion regression using the definition of a reduction of 0.25 cc. In total, 42 participants from PRESERVE were included (mean age 68.7 years, 65.4% male, mean WMH volume 31.0 cc) who had repeat scans at 2 years. Of these, 6/42 (14.3%) showed lesion regression. We included 276 participants from RUN DMC (mean age 68.1 years, 57.5% male, mean WMH volume 7.75 cc) who had repeat scans at 5 years; of these, 6/276 (2.2%) showed lesion regression. Images from participants who were identified as showing WMH regression were visually inspected and no pairs of images showed discrepancies in image quality/positioning or artifacts that would explain a reduction in WMH volume. Figure 1 shows the WMH volume over time for each of the three cohorts, stratified by quintile of WMH change over the entire study course.

WMH volume change over study duration.
There was a very high agreement between the WMH lesion volumes calculated for this analysis and the previously marked and published values in RUN DMC 28 (intraclass correlation coefficient for both time points 0.992, 95% CI: 0.990–0.993, Figure 2).

Inter-rater reliability across WMH marking methods.
Effects of brain atrophy on proportion of regressors
To test the effect of brain atrophy on lesion measurement, we also calculated WMH volumes normalized by total brain volume at each time point. Using this method, one participant from PRESERVE was reclassified from “regressor” to “stable” leaving 5/42 patients from PRESERVE and 6/276 from RUN DMC as regressors. In a further exploratory analysis, change in brain volume was not associated with WMH regression on univariate analysis (β-coefficient 0.178, p = 0.12). Figure 3 shows the relationship between change in brain volume and WMH in each study; there were a small number of participants who showed an increase in brain volume which was less than 50 mm3 and this is within the expected tolerance of SIENAX. 31

Degree of atrophy versus measured WMH reduction. Scatter plot showing changes in brain volume and WMH measurements over the duration of the study in (a) SCANS, (b) PRESERVE, and (c) RUN DMC.
Effect of using different thresholds to define regression
We performed a secondary analysis defining “regressors” using a threshold of one standard deviation of the inter-scan WMH change (i.e. the inverse of participants defined as progressors). Using this definition, we found no regressors in any population. Conversely, matching the threshold of WMH growth to define “progressors” to 0.25 cc led to all the participants originally marked as “stable” to be reclassified as “progressors” in all three cohorts, except for one participant in SCANS who showed an WMH increase in less than 0.25 cc between each of the four time points. Differences between participants in binary categories (regressors versus non-regressors) are shown in Supplemental Table 1.
Factors associated with regression
Table 2 shows the number of regressors, stable, and progressor participants in each study and the differences in demographic, risk factor, and imaging variables between groups. In RUN DMC, regression was associated with lower age. More severe imaging markers of SVD tended to be negatively associated with regression; the WMH regressor group had lower baseline lesion volume, higher brain volume, and MD normalized peak height in RUN DMC, and lower lacune count and higher FA median in both studies. Levene’s test of homogeneity was used to ensure that ANOVA assumptions were valid in each cohort.
Comparison of demographics, risk factor, and imaging variables at baseline between regressors, stable, and progressors in each cohort, separated by study.
Values are given as n (%) or M ± SD. Significant values (p < 0.05) are in bold. NART: National Adult Reading Test; WMH: white matter hyperintensities; FA: fractional anisotropy; MD: mean diffusivity.
Using pooled individual participant data, significant predictors for WMH regression on univariate analysis were lower age, absence of hypercholesterolemia, lower baseline WMH volume and lacune count, and higher brain volume, FA median, and MD normalized peak height (see Table 3). On multivariate analysis including variables significant on univariate analysis, only baseline WMH volume (OR: 0.36, 95% CI: 0.23–0.56) and baseline FA median (OR: 1.55, 95% CI: 1.07–2.24) were associated with lesion regression. Figure 4 shows how the probability of lesion regression depended on baseline WMH volume across the three cohorts, with regression being associated with lower baseline WMH volume.

Probability of lesion regression versus baseline WMH volume.
Predictors of WMH regression using ordered logistic regression assessed on both univariate and multivariate analysis.
β: unstandardized β coefficient; SE: standard error; OR: odds ratio; WMH: white matter hyperintensities; FA: fractional anisotropy; MD: mean diffusivity; Significant values (p < 0.05) are in bold.
This multivariate model was re-tested with the participants classified as regressors versus non-regressors (i.e. stable and progressor groups combined). In this model, only baseline brain volume reached statistical significance with minimal contribution to the classification (OR per additional cc of brain volume 1.0007, 95% CI: 1.0002–1.001).
Discussion
In this study, across three prospective cohorts with symptomatic SVD, we found regression of WMH lesion volume in a small proportion of individuals over follow-up periods of between 2 and 5 years. Using a definition of 0.25 cc to define regression, the percentage showing regression was 0%, 2.2%, and 14.3% in SCANS, RUN DMC, and PRESERVE, respectively. When defining “regressors,” using a threshold of one standard deviation of the inter-scan WMH change, we found no regressors in any population. These values are lower than reported in recent studies.16–18
This analysis differs from the most previous studies, in that the three populations included only participants with both neurological symptoms and radiological evidence of SVD, and so our study is likely to have excluded participants with less severe, non-progressive, or incidental pathology. The only factor which consistently predicted regression was a lower severity of SVD defined by other imaging variables, and participants who demonstrated lesion regression had a lower WMH burden at baseline. On multivariate analysis, both WMH volume and white matter ultrastructural integrity, as assessed by FA, were associated with regression. Taken together, these results suggest that reduction in WMH is more likely early in the disease process.
Although imaging data from these cohorts have been published previously,13,20,32 the current study applied a novel image analysis method specifically to look for WMH regression and additionally used logistic regression to test for associations with other demographic and radiological factors. Previous analysis of the RUN DMC cohort found lesion regression in 26 participants (9.4%) between 2006 and 2011 and in five participants (1.8%) between 2011 and 2015. 28 The WMH lesion volumes we calculated, even though obtained by a different method, correlated very highly with previously published values. In our analysis, we only used data from the 2011 and 2015 time points because there was a scanner change between 2006 and 2011, and during this period, our estimate of 2.2% is very similar to the 1.8% previously reported. The much higher proportion of regressors between 2006 and 2011 and 2011 and 2015 might be related to the measurement of WMH on different scanners for the 2006 and 2011 analysis.
In PRESERVE, there were a higher number of regressors. The reason for this is uncertain. Of note, it was a multicentre study with image acquired on multiple scanners, and even in the standard blood pressure treatment arm in PRESERVE, there was a reduction in mean blood pressure reduced by 15.3 mmHg. 13 It is possible that this led to regression in some patients. Blood pressure lowering has been shown to reduce WMH progression in one clinical trial. 14
The regression of WMHs observed in previous studies has several possible explanations. It may represent a true biological process, such as the resolution of edema or contraction of lesioned tissue as part of an inflammatory or scarring process (as has been proposed based on the finding of expanded free-water compartment surrounding WMHs 33 ). Alternatively, it could also be explained by radiological or technical factors causing apparent regression in some subjects. Such factors could include registration of images between scans, equipment changes between scans, and discrepancies between image quality and slice angle. It might also reflect statistical factors, such as the uncertainty caused by partial volume effects (particularly in periventricular areas), regression to the mean in lesion marking techniques, or differential rates of atrophy affecting the mapping of lesioned tissue at follow-up to baseline space. 19
Treatment trials in SVD are increasingly using WMHs as an outcome marker, and the SPRINT-MIND trial was the first to show that intensive blood pressure treatment is associated with slower WMH progression. 14 In this study, the intensive treatment group showed 0.54 cm3 less WMH progression than the control group and this is close to the threshold of 0.25 cc that we used to define regressors. These differences are small and by definition difficult to appreciate visually, though in the context of a trial can be clinically important. WMH regression has not been investigated specifically as an outcome in treatment trials.
Strengths of our study included the use of multiple cohorts which included a range of mild-to-moderate-to-severe SVD, at a time interval after stroke where appearances are likely to be stable. The large sample size allowed associations between demographic risk factors/imaging markers and regression to be examined, and the rater being blinded to time point allowed us to minimize any bias in the application of the semi-automated lesion marking technique that we used.
Limitations of our study include the differing intervals between follow-up imaging in each cohort (as we could not rule out regression and then progression or vice versa, particularly in the 5-year intervals between scans in RUN DMC) and the difference in MRI field strength. Previous meta-analysis found that field strength was not significantly associated with the calculation of WMH progression rate 34 and this might also apply to the assessment of lesion regression; however, further work is required to assess the effect of scanner and field strength on the estimates of lesion progression and regression. There were minor differences in image acquisition between the PRESERVE and RUN DMC studies; however, we harmonized the WMH segmentation as closely as possible by analyzing the raw images using the same lesion marking program/version and on the same PC screen.
We are also limited by the understanding of scan–rescan reliability and though this has been tested in small cohorts with various image analysis techniques,35,36 it is still unclear to what extent small fluctuations in WMHs should be expected from imaging across multiple time points. If the noise in WMH calculation is similar to a definitive estimate of natural WMH variation, then the WMH regression we observed in a small number of participants may be pathophysiologically insignificant. Other authors have found that FLAIR signal can decline within WMHs, 37 and this may indicate that tissue is less severely damaged or less prone to inflammation; our analysis does not account for this phenomenon. It is also possible, particularly in severe cases, that net WMH growth comprises both voxels of de novo WMH in parallel with voxels where WMH revert to normal-appearing white matter. As we focused on whole brain lesion burden, this was not captured in our study. Furthermore, as we identified so few subjects showing regression, it is likely that our study was underpowered to detect other associations with patient and imaging factors.
Our study does not exclude the fact that individual white matter lesions can regress, and this has been clearly demonstrated. 37 However, although our results may not necessarily generalize to asymptomatic or community populations, they does suggest that in symptomatic SVD, it is unusual for total WMH lesion volume to regress. This is reassuring in the use of WMH as a surrogate endpoint in clinical trials for SVD and suggests that previous power calculations performed without considering regressors are likely to be accurate. Future studies testing interventions in SVD or performing longitudinal analysis of white matter damage in other populations (such as participants with cardiovascular risk factors or cognitive impairment) should consider a similar blind-to-time point analysis method.
Supplemental Material
sj-docx-1-wso-10.1177_17474930231169132 – Supplemental material for How often does white matter hyperintensity volume regress in cerebral small vessel disease?
Supplemental material, sj-docx-1-wso-10.1177_17474930231169132 for How often does white matter hyperintensity volume regress in cerebral small vessel disease? by Robin B Brown, Daniel J Tozer, Marco Egle, Anil M Tuladhar, Frank-Erik de Leeuw and Hugh S Markus in International Journal of Stroke
Footnotes
Author contributions
R.B.B. designed the study, marked the lesions in the PRESERVE and RUN DMC cohorts, performed the statistical analysis, and drafted the article. D.J.T. supervised the image acquisition in the PRESERVE study and the image analysis. M.E. coordinated the PRESERVE study data collection and original analysis. A.M.T. supervised the image acquisition in RUN DMC. F-E.D.L. is the senior author of the RUN DMC study and was involved in gaining funding and ethical approval. H.S.M. designed the study, edited the article, and as the senior author of the SCANS and PRESERVE studies was involved in gaining funding and ethical approval. All authors contributed to the study design, provided interpretation of the results and statistical analysis, and were involved in critical editing of the article.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: R.B.B. is supported by an Association of British Neurologists Clinical Research Training Fellowship funded by the Guarantors of Brain. M.E. is supported by a Priority Program Grant from the Stroke Association (PPA 2015/02). A.M.T. is a junior staff member of the Dutch Heart Foundation (Grant No. 2016T044). F-E.D.L. has received a VIDI grant (016-126-351) and the Clinical Established Investigator Dutch Heart Foundation grant (2014T060). H.S.M. is supported by a National Institute for Health Research (NIHR) Senior Investigator Award. This research was supported by the NIHR Cambridge Biomedical Research Center (BRC-1215-20014) and the Cambridge BHF Center of Research Excellence [RE/18/1/34212]. The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
