Abstract
Objectives
To estimate the level of overdiagnosis of all breast cancers and of invasive breast cancers in women aged 40–49 invited to the subsequent screening rounds in the Swedish service-screening programme 1986–2005.
Methods
To estimate the level of overdiagnosis in subsequent screening, the rate ratios (RR) of the breast cancer incidence in the study group (women in areas with screening in ages 40–49) and the control group (women in areas with no screening in ages 40–49) were calculated for all breast cancers and for invasive breast cancers. The RR estimates were adjusted for the prescreening difference in incidence between study and control group and for lead time.
Results
The prescreening incidence rate ratio was estimated at 0.92 (95% confidence interval [CI]: 0.88–0.97). The number of breast cancer cases and person-years were 6047 and 3.8 million, and 7790 and 5.2 million, in the study group and control group respectively during the study period. The RR estimate for all cancers was 1.01 (95% CI: 0.94–1.08) when adjusted for prescreening difference and a lead time of 1.2 years. The corresponding estimate for invasive breast cancers was 0.95 (95% CI: 0.88–1.02).
Conclusions
We found no significant overdiagnosis for women aged 40–49 in the Swedish service screening programme with mammography.
INTRODUCTION
Lower age limit for invitation to screening and period of inclusion into the study and control group in the areas for screening in Sweden for each year of the follow-up 1986 to 2005. n = screening had not yet started at beginning of year. Study group (screening of women age 40–49) = shaded cells with lower invitation age in italics. Control group (no screening of women age 40–49) = shaded cells with lower invitation age in normal or no lower invitation age.
N = Northern, So = Southern, E = Eastern, W = Western
The goal of mammography screening is to detect breast cancers at an early stage, which can save lives. In our previous cohort study, SCRY (SCReening of Young women), we showed that the breast cancer mortality was 26% lower among those invited to screening and 29% lower among those attending screening in the ages 40–49 compared with those not invited in the ages 40–49. 1
The potential harms of mammography screening are radiation exposure, discomfort and anxiety due, for example, to false-positives, false-negatives and unnecessary treatment, for example due to overdiagnosis. Overdiagnosis is the excess of cancers diagnosed with screening compared with without screening that is not due to earlier diagnosis. In other words, overdiagnosis is the cancers detected with screening that would otherwise have remained undiagnosed throughout the women's lifetimes.
Several attempts have been made to estimate the level of overdiagnosis in inviting women aged 40–49 years to screening. Moss concluded that for women aged 40–74 there is a possible shift from invasive to in situ breast cancer but no evidence of overdiagnosis, 2 while Jørgensen and Gøtzsche for the same age interval concluded that one in three breast cancers was overdiagnosed. 3 Biesheuvel concluded that most estimates of overdiagnosis for ages 40–49 were biased, but ranged from −4% to 7% in the studies with the best precision in their estimates. 4 A cohort study of 11 out of 24 Swedish counties followed for on average 12.8 years from screening start to the year 2000 found no excess incidence for ages 40–49. 5
Three methods have been applied in the estimation of overdiagnosis: modelling, comparison of incidence rates, and comparison of cumulative incidences between screened and unscreened. 4 The aim of this study was to estimate the level of overdiagnosis of all breast cancers and of invasive breast cancers in women 40–49 years invited to subsequent screening in the Swedish service screening programme 1986–2005 using incidence rates.
MATERIALS AND METHODS
The SCRY cohort
During the study period 1986–2005 about half of the Swedish counties invited women aged 40–49 to service screening with mammography (Table 1). The SCRY cohort consists of all 40–49-year-old women in Sweden, split into a control group and a study group based on whether the women were invited to screening or not. For the control group (without screening in ages 40–49) the follow-up time period was chosen so that the averages (weighted using population sizes) for follow-up time and mid-calendar year of follow-up corresponded to those of the study group (Table 1). For example, women in the counties of Halland and Jämtland were included in the control group from 1986–2001 and 1990–2005, respectively, to achieve similar follow-up in study and control groups. Halland was only followed through 2001 due to a gradual inclusion of younger women into the screening programme starting in 2001, disqualifying later years from the control group. Later years were not included in the study group due to short follow-up. Only first primary breast cancers were included in the study. Participation rates varied between 80% and 90%. In 1990, there were 620,620 women aged 40–49 years in Sweden.
Information on the service screening programmes, including initiation of the programme, invited age groups and changes during the study period, was collected through a questionnaire to the screening centres. Data on breast cancer cases were retrieved from the Swedish Cancer Registry. Breast cancers were defined as breast cancer according to the International Classification of Diseases for Oncology, Revision Seven [ICD7], code 170. Population data that were used in the calculation of person-years were supplied by Statistics Sweden. The SCRY cohort has previously been described in more detail. 1
In the present study Blekinge county and parts of Örebro county, comprising 10% of the person-years in the original study group, were excluded as only women aged 45–49 were invited to screening, which might bias the overdiagnosis estimates. The control group was identical to the original SCRY cohort control group. Thus, the study includes 23 out of 24 counties in Sweden (Table 1).
Lead time
Breast cancers may be detected earlier with screening, prolonging the elapsed time between diagnosis and death even if the time of death remains the same. The time intervals from when a cancer is detectable with mammography and from when a cancer actually is detected to when it would surface clinically is called sojourn time and lead time, respectively.
Lead time will cause a temporary increase in the incidence during the first, so-called prevalence screening round, due to prevalent ‘future’ cancers being detected and added to the incidence that would have been observed without screening. A similar increase takes place in the youngest ages invited to screening (i.e. ages 40–41). When comparing a screened and an unscreened population this will cause a bias due to lead time. We refer to this bias as prevalence peak bias.
Breast cancer incidence increases with age 6 and may change over time, e.g. an increasing trend. Because a screened population corresponds to the future incidence in an unscreened population, another bias due to lead time may occur. Thus, there may be a bias when comparing the study group with the control group throughout follow-up, even if prevalence screening is excluded. We refer to this bias as trend bias.
The study period of the present study equals that of the original SCRY cohort study but was divided into prevalence screening, i.e. the first three years of each area's follow-up and ages 40 and 41, and subsequent screening. To adjust for prevalence peak bias, prevalence screening was excluded. It should be noted that the first round of screening is assumed to be included in the first three years of screening but does not necessarily take three years. In a similar way, some women may be invited to screening twice in ages 40–41. The weighted average follow-up years and average mid-calendar year of follow-up for subsequent screening were 12.4 and 1999 and 12.7 and 1999 in the study group and control group, respectively.
Overdiagnosis
To estimate the level of overdiagnosis in women aged 40–49 the crude rate ratio (RR) was calculated as the ratio of the breast cancer incidence (all breast cancer, i.e. in situ and invasive) in the study group and the control group. A crude RR between the study and control group was also calculated for a reference period with equal follow-up in all areas, 1970–1985, when screening had not yet begun in most counties (RRref). The corresponding RR and RRref were also calculated for invasive breast cancers. To adjust for the potential difference in baseline incidence between the study and control group the RR estimates of the study period were divided by the RRref estimates.
Five areas, covering 18% of the study population, were excluded from the calculation of the RRref due to ongoing screening activities (randomized controlled trial and pilot project) that took place before the start of their service-screening programmes. To estimate the impact of this exclusion a RR for the study period excluding the same five areas was calculated.
RRs were also estimated for the first three years of screening and the youngest ages invited to screening (40–41 years). These RRs are not estimates of overdiagnosis.
The software programme R was used for statistical analyses (R Foundation for Statistical Computing, Vienna, Austria). 7
Adjustment for trend bias
To adjust for trend bias the RR estimates were divided by an adjustment for the annual relative incidence difference (RRLT).
LT = average population lead time for all breast cancer, CD = relative change in incidence in the control group per calendar year, and AD = relative change in incidence in the control group per year of age.
The calculations of annual change in incidence in the control group (CD and AD) were based on incidence data for women aged 40–49 in the areas included in the control group in 1986–2005. AD is the average relative change in incidence per year of age, i.e. from age 40 to 41, from 41 to 42 and so on. CD is the corresponding average relative change in incidence per calendar year, i.e. from 1986 to 1987, from 1987 to 1988 and so on. The assumed average lead time for the screened population in the study group, including cancers not detected through screening can be calculated using estimates of mean sojourn time (MST) and sensitivity (S) for 40–49-year-olds. The population lead time LT can be calculated based on the lead time and the screening participation rate (PR) in the study group.
8–15
Due to the variation in the estimates of MST and S in the literature a sensitivity analysis was made by calculating RR estimates based on differing LTs.
RESULTS
Baseline incidence difference and trend bias
During the pre-screening reference period 1970–1985 there were 2486 breast cancer cases during 2.5 million person-years in the study group and 3879 breast cancer cases during 3.5 million person-years in the control group resulting in an RR of 0.92 (95% confidence interval [CI]: 0.88–0.97). The estimate was similar for invasive breast cancer (Table 2).
Summary of results for reference period (prescreening period) by diagnosis and age and for prevalence screening, i.e. years and ages excluded in overdiagnosis estimates adjusted for prevalence peak. Numbers of breast cancer cases and person-years in the study group (SG) and the control group (CG), respectively, relative risks (RR) and 95% confidence intervals (CI). preD = prescreening difference
The mean relative increase in incidence in the control group (CD) was 2.45% per calendar year 1985–2005 and 5.07% per one year age group (AD). The participation rate in the study group was 0.8. Assuming a sojourn time of 2.4 years and a sensitivity of 0.6 resulted in a population lead time of 1.2 years and a lead time adjustment (RRLT) of 1.09. Population lead times based on a range of possible sojourn and sensitivity estimates were made resulting in population lead times of up to 1.5 years. The lead time adjustments for a population lead time of 1.0 and 1.5 years were 1.08 and 1.12 respectively.
Overdiagnosis, main results
During the study period 1986–2005 there were 6047 breast cancer cases during 3.8 million person-years in the study group and 7790 breast cancer cases during 5.2 million person-years, resulting in an estimated crude RR of 1.07 (95% CI: 1.03–1.10) (Table 3).
Summary of main results by diagnosis, type of adjustment and ages included. Numbers of breast cancer cases and person-years in the study group (SG) and the control group (CG), respectively, relative risks (RR) and 95% confidence intervals (CI). preD = prescreening difference
When adjusted for the prescreening difference the RR estimate was 1.16 (95% CI: 1.09–1.23) and when also adjusted for prevalence peak bias by excluding prevalence screening and for trend bias the RR estimate was 1.01 (95% CI: 0.94–1.08). Based on a population lead time of 1 year and 1.5 years the adjusted RR estimate was 1.02 and 0.99 respectively. For invasive breast cancers the corresponding adjusted RR estimate, i.e. based on a population lead time of 1.2 years, was 0.95 (95% CI: 0.88–1.01). Five areas could not be included in the calculation of the RR for the reference period. When these areas were excluded from the study period the study period RR estimates were two percentage points higher.
Prevalence screening
The RR estimate for the initial three years of screening and for ages 40–41 were 1.11 (95% CI: 1.02–1.21) and 1.34 (95% CI: 1.19–1.51) respectively when adjusted for the prescreening difference and trend bias but not for prevalence peak bias.
DISCUSSION
During the time period 1986–2005 about half of Sweden's counties invited women aged 40–69 (74) years and half invited women aged 50–69 (74) years. This facilitated a study of the overdiagnosis in the Swedish service screening programme with mammography of women aged below 50 years.
For the subsequent screening rounds the estimated overdiagnosis for all cancers (in situ and invasive) was one percent and non-significant when adjusted for prescreening difference, bias due to lead time such as prevalence peak bias and trend. Thus there was no overdiagnosis in the 40–49 year age group in subsequent screening.
Strengths and limitations
The incidence might have differed between the study and control group independently of screening because the current study was a geographical comparison where exposure was determined by screening policy in each area. This difference was adjusted for by utilizing data from the reference period. A few areas could not be included in the reference period due to variation in invited age groups. When these areas were excluded in the study period this increased the RR estimates, but only by two percentage units. Screening continued in ages above 50, hence no compensatory drop in incidence could be observed in this study. This necessitated adjustment for the potential biases associated with lead time. Trend bias was adjusted for based on population lead time and prevalence peak bias by excluding prevalence screening. There may have been some prevalence screening also in subsequent screening due to women not participating in the first round, leading to an overestimation of overdiagnosis, though the high participation rate indicates that it was small. Population lead time was calculated to be 1.2 years but varying estimates of mean sojourn time and sensitivity have been made for the age group 40–49. 8–15 A sensitivity analysis was therefore conducted. For population lead times of 1.0 and 1.5 years the overdiagnosis estimates were 1.02 and 0.99 respectively and not statistically significant.
Comparison with earlier studies on ages 40–49
Moss estimated that the absolute excess of breast cancers in the NBSS I trial for women 40–49 was 0.25 (95% CI 0.04–0.46) and 0.12 (95% CI −0.08–0.32) corresponding to RR estimates of 1.14 and 1.08 for all and invasive cancers respectively. 2 Gøtzsche estimated it at 1.30 (95% CI: 1.13–1.50) for all cancers, however the Moss study had a longer follow-up. Gøtzsche also estimated the RR in the Göteborg trial for women 39–49 years (RR = 1.13; 95% CI: 0.90–1.41). 16 Biesheuvel noted that Gøtzsches estimates were not adjusted for lead time and are therefore biased upwards. 4 In the current study, which included 23 out of 24 counties, the number of breast cancer cases was 6047 in the study group. The number of breast cancer cases in the NBSS I and Gothenburg trial study groups were 663 and 144 respectively. A previous Swedish cohort study based on 11 out of 24 Swedish counties estimated the overdiagnosis in ages 40–49 to be 0.96 (95% CI: 0.77–1.21). 5
Prevalence screening
Prevalence screening was present in the first three years of screening and the youngest ages invited to screening (40–41). Though the screening round was closer to two years than three it is likely that there were backlogs, i.e. the screening centres could not keep the two-year screening interval. The RR estimates for the first three years of screening are low in comparison with the estimates for the first ages of screening (ages 40 and 41). This may be due to screening starting a few months into the first calendar year of screening so that a short period of non-screening is included. Also, in the areas with screening activities before start of screening the first years of inclusion in the study group were not prevalence screening and therefore diluted the increase of the observed incidence. Notable is that the RR estimates for prevalence screening were not adjusted for prevalence peak bias and therefore not meant as estimates of overdiagnosis.
In situ and invasive cancers
Though the majority of breast cancers are invasive, excluding the in situ cancers reduces the overdiagnosis estimate by six percentage points, hence in situ cancers make out a disproportionately large part of the estimate. For invasive cancers the RR estimate was 0.95 and non-significant. This indicates that fewer invasive cancers were diagnosed in ages 40–49 with screening than without, possibly due to cancers being detected already at the in situ stage. Without screening at age 40–49 pre-clinical cancer can progress to clinical cancer, remain as pre-clinical cancer or regress. A majority of in situ cancers seem to progress to invasive cancers. 17 Cancers that remain as pre-clinical would probably be detected in screening at ages 50 and older because the screening programme in Sweden covers all women from 50 years of age.
CONCLUSION
The current large study of the Swedish service screening programme finds no significant overdiagnosis from subsequent screening for women aged 40–49.
Footnotes
ACKNOWLEDGEMENTS
This study was funded by the Swedish Cancer Society.
