Sage Journals: Discover world-class research

Abstract

Background

Overdiagnosis in breast cancer screening is a topic of debate. Researchers often estimate trends in incidence prior to screening and project these to predict incidence during the screening epoch.

Methods

Data was obtained from the Cancer Registry of Norway and the Norwegian Breast Cancer Screening Programme. Using breast cancer incidence prior to screening in Norway (1976–1995), incidence trends were estimated from age-period and age-cohort models. These estimates were used to predict the incidence of breast cancer in five-year age and period groups in the screening epoch (1996–2009).

Results

Excess numbers of cancers in the screening age range (6,876 cancers), and deficits in women above and below the screening age range (1,947 cancers) were observed. However, only part of the observed differences between the observed and the expected incidence can be explained by screening, as evidenced by numbers of excess cancers greater than the numbers of screen-detected cancers in some age groups and time periods.

Conclusion

There are potential errors in estimation of overdiagnosis from screening if individual data on screening exposure and detection mode are not taken into account. For reliable estimates of overdiagnosis, it is necessary to compare excess incidence in the screening period in those actually screened with the corresponding excess in those not screened. This is the subject of ongoing research.

Keywords

Breast cancer screening overdiagnosis breast cancer incidence

Introduction

Concern is frequently expressed about the risk of overdiagnosis from cancer screening, in particular breast cancer screening.^1,2 In this context, overdiagnosis is defined as the diagnosis of cancer as a result of screening which would not have been diagnosed in the patient’s lifetime had screening not taken place.³ Implicit in this definition is a very long time frame. Breast screening, for example, usually takes place in middle aged women in developed countries, who have many future years of expected life in which a tumour could potentially progress to symptomatic diagnosis. Long term observation is one way of distinguishing excess incidence, due to overdiagnosis, from that due to screening lead time.⁴ Ideally, overdiagnosis could be estimated from a randomized trial of screening, in which the control group was never screened, and for which there is a long period of observation after screening ceased in the intervention group. This is what the recent UK review of breast cancer screening aimed to do,³ although reservations have been expressed about the review’s choice of data sources, and insufficient follow-up time.^4–6

In the absence of appropriate trial data, many attempts have been made to estimate overdiagnosis from trends in observational data on national or regional incidence of breast cancer, in conjunction with the time of introduction of screening.^1,7–10 Commonly, researchers estimate trends in incidence prior to screening, and project these to predict incidence during the screening epoch. An excess of observed incidence over that predicted may be partly attributable to overdiagnosis. Such an excess will also be partly due to lead time, the diagnosis as a result of screening of cancers which would otherwise have been diagnosed symptomatically some years later. This may be evidenced by a ‘compensatory drop’ in cancer incidence above the upper age limit for screening. A recent review by Puliti et al noted that those studies which adequately adjusted for lead time, either by prolonged follow-up after screening ceases or by adjustment for external estimates of lead time, and for changes in incidence unrelated to screening estimated modest rates of overdiagnosis, of the order of 10% or less, whereas those which did not derived much higher estimates.¹¹

Although not invariably acted upon, the significance of lead time and trends in incidence not attributable to screening is well known. What is less appreciated is the value of individual data on exposure to screening, and on whether cancers were screen detected or symptomatic.¹² Clearly, a cancer which was symptomatic rather than screen-detected cannot be overdiagnosed under the definition generally used. In this paper, we use data from the Cancer Registry of Norway and the Norwegian Breast Cancer Screening Programme (NBCSP) to demonstrate the extent to which the individual screening data qualifies interpretation of incidence trends. The aim was not to estimate overdiagnosis at this stage, but to assess the potential for such trend analyses to do so.

Data and methods

The NBCSP was initiated in November 1995 (although only 956 screens took place, and there were only three screen-detected cancers in 1995), offering biennial 2-view mammography to women aged 50–69. The programme began in four counties, and achieved nationwide coverage in 2005.

We obtained data on invasive breast cancers, from the Cancer Registry of Norway, including age at and date of diagnosis, from 1953 to 2009. Data on ductal carcinoma in situ (DCIS) was available from 1993 to 2009. The NBCSP provided data on detection mode (outside of the screening cohort, screen detected, interval cancer, non-attender, not invited due to upper age limit, and not invited as opted out). From the NBCSP, we had data on all screening invitations and attendances from November 1995 to December 2009. We also had tabular data on the resident female population in Norway by age and calendar year, as estimated in January every year. Age was calculated by subtracting the date of birth from the relevant calendar time.

We considered up to 1995 as the pre-screening epoch, and 1996 onwards as the screening epoch. Using poisson regression we estimated three models using the whole country’s incidence data from 1976 to 1995, as we were not confident of projecting trends from before 1976 through to 2009. The three models were:

A discrete age-cohort model using five-year age groups (30–34, 35–39,……, 85–89) and time periods (1976–80, 1981–85, 1986–90, 1991–95);

A discrete age, continuous period trend model; and

A separate period trend for each five-year age group.

We then used the estimates from each model to predict the incidence of breast cancer in five year age groups, and periods in the screening epoch from 1996 onwards (the last period being of four years duration, 2006–09). We compared the predicted numbers of cases with breast cancer cases actually observed. The observed cases were then classified as screen detected or symptomatic, and we also calculated the number of screening episodes for each age group and period, to establish bounds on the extent to which any excess in the screening epoch was actually due to the screening.

Results

Table 1 shows the numbers of breast cancer cases, person-years, and incidence rates by age group and time period, 1976–2009. Below age 60, rates were relatively stable in the first 15 years, then showed an increase in 1991–95. From age 60 upwards, rates were increasing steadily from 1976 onwards, with a reduction in the last, or penultimate period. For the screening ages 50–69, a dramatic increase was seen in 1996–2000, with the onset of the screening programme.

Table 1.

Cases, person years and incidence rates per 100,000 (in that order) by five year age group and calendar period.^a

Age	Period
Age	1976–1980	1981–1985	1986–1990	1991–1995	1996–2000	2001–2005	2006–2009
30–34	134	139	132	167	147	149	148
	742,011	739,460	768,710	782,344	839,328	847,980	625,304
	18.1	18.8	17.2	21.3	17.5	17.6	23.7
35–39	264	349	322	416	410	436	340
	569,388	743,862	742,841	772,768	790,507	852,501	700,872
	46.4	46.9	43.3	53.8	51.9	51.1	48.5
40–44	374	513	649	756	818	844	706
	487,031	568,960	743,529	742,434	776,186	796,744	687,570
	76.9	90.2	87.3	101.8	105.4	105.9	102.7
45–49	602	562	739	1,163	1,309	1,414	1,183
	504,457	484,855	566,288	739,639	741,761	776,658	636,388
	119.3	115.9	130.5	157.2	176.5	185.4	185.9
50–54	715	615	612	879	1,806	1,970	1,639
	562,546	499,167	479,418	560,444	733,226	736,185	617,197
	127.1	123.2	127.7	156.8	246.3	267.6	265.5
55–59	878	740	703	827	1,576	2,408	1,628
	623,820	552,038	489,797	471,498	552,441	721,321	579,386
	140.7	134.0	143.5	175.4	285.3	333.8	281.0
60–64	909	979	913	954	1,316	1,906	1,807
	583,807	605,447	535,334	476,071	460,508	538,061	558,423
	155,7	161.7	170.5	200.4	284.4	354.2	323.6
65–69	947	1,056	1,142	1,003	1,355	1,564	1,372
	534,838	556,290	577,269	511,324	457,065	442,206	402,346
	177.1	189.8	197.8	196.2	296.5	353.7	341.0
70–74	913	1,059	1,142	1,245	1,195	946	730
	459,845	492,645	514,871	535,991	477,081	428,140	329,812
	198.5	215.0	221.8	232.3	250.5	221.0	221.3
75–79	782	964	1,066	1,134	1,200	1,082	747
	366,383	396,622	428,513	450,523	474,740	426,331	312,682
	213.4	243.1	248.8	251.7	252.8	253.8	238.9
80–84	492	659	770	834	902	1,030	766
	239,506	277,702	307,362	335,482	360,633	386,082	286,097
	205.4	237.3	250.5	248.6	250.1	266.8	267.7
85–89	206	323	401	483	499	585	521
	115,181	146,577	175,127	196,787	221,074	243,043	217,832
	178.8	220.4	229.0	245.4	225.7	240.7	239.2

Last period is of four years.

In age-period-cohort modelling of the 1976–95 data, the ‘best’ model was the age-cohort model, because the effect of period was not significant, after adjusting for age and cohort, whereas age and cohort each had significant effects on incidence after adjusting for the other two factors. Table 2 shows the relative rates (RR) and 95% confidence intervals (CI) from the age-cohort model, with cohorts calculated as time period minus age group, and with each cohort labelled as the midpoint year of birth for that cohort. As might be expected, there was increasing incidence with both increasing age and more recent birth cohorts, although the former effect was considerably stronger than the latter. While the size of the increase from cohort to cohort was not exactly constant, the average increase was approximately 8% per five-year cohort.

Table 2.

Relative risks and 95% confidence intervals in the age-cohort model applied to incidence data from 1976–95.

Factor	Category^a	RR (95%CI)
Age	30–34	1.00 (−)
	35–39	2.72 (2.43–3.04)
	40–44	5.52 (4.92–6.18)
	45–49	8.82 (7.84–9.91)
	50–54	9.91 (8.76–11.21)
	55–59	11.89 (10.47–13.50)
	60–64	14.57 (12.79–16.60)
	65–69	17.09 (14.96–19.51)
	70–74	20.23 (17.67–23.17)
	75–79	23.49 (20.46–26.98)
	80–84	24.47 (21.22–28.23)
	85–89	24.73 (21.26–28.78)
Cohort	1891	0.72 (0.61–0.83)
	1896	0.85 (0.78–0.92)
	1901	0.92 (0.87–0.98)
	1906	1.00 (−)
	1911	1.03 (0.98–1.09)
	1916	1.08 (1.02–1.14)
	1921	1.14 (1.07–1.21)
	1926	1.16 (1.09–1.24)
	1931	1.28 (1.19–1.38)
	1936	1.36 (1.25–1.48)
	1941	1.56 (1.42–1.70)
	1946	1.70 (1.54–1.87)
	1951	1.76 (1.57–1.96)
	1956	1.89 (1.65–2.17)
	1961	2.11 (1.72–2.60)

Midpoint year of cohort given

Table 3 shows the overall age-adjusted period trend in RR and 95% CI from model (2), and the individual trends for each age group in model (3). Overall, adjusting for age, there was a 7% increase per five-year period. The trend varied by age group, from a 4% increase for ages 35–39 to an 11% increase per five-year period for ages 45–49.

Table 3.

Age-adjusted overall trend in incidence with calendar time, and age-specific trends during 1976–95, expressed as a relative risk per five-year period.

Trend estimate	Age group	RR (95%CI)
Age-adjusted	30–89	1.07 (1.05–1.08)
Age-specific	30–34	1.05 (0.97–1.12)
	35–39	1.04 (0.99–1.09)
	40–44	1.08 (1.04–1.12)
	45–49	1.11 (1.07–1.14)
	50–54	1.07 (1.04–1.11)
	55–59	1.07 (1.04–1.11)
	60–64	1.08 (1.05–1.11)
	65–69	1.04 (1.01–1.06)
	70–74	1.05 (1.02–1.08)
	75–79	1.05 (1.02–1.08)
	80–84	1.06 (1.02–1.09)
	85–89	1.09 (1.04–1.14)

The observed incidence per 100,000 person-years and that expected from models (1), (2), and (3) are shown by age group and period, for the years 1996–2009 in Table 4. The upper triangle of expected rates using the age-cohort model is not estimable, as we did not use data on cohorts born after 1961. For all three models, observed incidence tended to be higher than expected in the screening age range, and lower than expected outside this range.

Table 4.

Observed incidence per100,000 and projected incidence by the three models, by age and period, from 1996 onwards.

Age	Incidence rate	Period
Age	Incidence rate	1996–2000	2001–05	2006–09
30–34	Observed	17.5	17.6	23.7
	Projected model 1	–	–	–
	Projected model 2	22.0	23.5	25.0
	Projected model 3	21.0	22.0	23.0
35–39	Observed	51.9	51.1	48.5
	Projected model 1	58.1	–	–
	Projected model 2	55.5	59.2	63.1
	Projected model 3	52.9	55.2	57.7
40–44	Observed	105.4	105.9	102.7
	Projected model 1	105.3	117.8	–
	Projected model 2	104.4	111.2	118.6
	Projected model 3	108.3	117.5	127.4
45–49	Observed	176.5	185.4	185.9
	Projected model 1	156.5	168.2	188.2
	Projected model 2	154.6	164.9	175.9
	Projected model 3	168.5	186.8	206.9
50–54	Observed	246.3	267.6	265.5
	Projected model 1	169.8	175.8	189
	Projected model 2	157.2	167.6	178.8
	Projected model 3	159.6	171.3	183.8
55–59	Observed	285.2	333.8	281.0
	Projected model 1	186.9	203.8	211.0
	Projected model 2	173.8	185.3	197.6
	Projected model 3	177.4	190.6	204.9
60–64	Observed	284.4	354.2	323.6
	Projected model 1	199.8	229.0	249.7
	Projected model 2	201.0	214.3	228.5
	Projected model 3	209.9	227.7	247.0
65–69	Observed	296.5	353.7	341
	Projected model 1	221.6	234.3	268.6
	Projected model 2	223.0	237.8	253.6
	Projected model 3	207.5	214.8	222.4
70–74	Observed	250.5	221.0	221.3
	Projected model 1	237.9	262.5	277.5
	Projected model 2	253.8	270.6	288.5
	Projected model 3	245.1	257.5	270.6
75–79	Observed	252.8	253.8	238.9
	Projected model 1	269.8	276.1	304.7
	Projected model 2	279.9	298.4	318.2
	Projected model 3	269.9	283.4	297.6
80–84	Observed	250.1	266.8	267.7
	Projected model 1	266.0	281.1	287.7
	Projected model 2	275.7	294.0	313.4
	Projected model 3	271.3	287.3	304.1
85–89	Observed	225.7	240.7	239.2
	Projected model 1	258.4	268.9	284.1
	Projected model 2	257.6	274.6	292.8
	Projected model 3	272.9	298.7	327.0

Table 5 shows the absolute numbers of cancers observed and expected in 1996–2009 from model (3), the age-specific period effect model. Substantial excess numbers of cancers in the screening age range, small deficits in women above the screening age range, and smaller deficits below the screening age range were observed. We did not interpret these observed excess numbers as estimates of overdiagnosis: at this stage, they constitute excess incidence over that expected, which remains to be explained. Similar results were observed using models (1) and (2), with slightly smaller excesses and deficits observed for model (1) (age-cohort). While this pattern might be expected, in qualitative terms, it is clear that only part of the observed differences between observed and expected incidence could be due to screening. In 1996–2000, in women aged 50–59, the excess number of cancers observed was 34% (319 cases) larger than the total number of screen-detected cancers. In the same period in women aged 75 or older, there was a deficit of 262 cancers in women who were already past the upper age limit when the programme was initiated.

Table 5.

Observed absolute number of cases, expected numbers from model 3, excess cancers (negative numbers indicate a deficit) and number of observed screen detected cancers, by age and period, from 1996 onwards.

Age	Quantity	Period
Age	Quantity	1996–2000	2001–05	2006–09	Total
30–34	Observed cancers	147	149	148	444
	Expected cancers	177	186	144	507
	Excess cancers	−30	−37	4	−63
	Screen-detected cancers	0	0	0	0
35–39	Observed cancers	410	436	340	1,186
	Expected cancers	418	471	404	1,293
	Excess cancers	−8	−35	−64	−107
	Screen-detected cancers	0	0	0	0
40–44	Observed cancers	818	844	706	2,368
	Expected cancers	841	936	876	2,653
	Excess cancers	−23	−92	−170	−285
	Screen-detected cancers	0	0	0	0
45–49	Observed cancers	1,309	1,414	1,183	3,906
	Expected cancers	1,250	1,450	1,317	4,017
	Excess cancers	59	−36	−134	−111
	Screen-detected cancers	31	59	62	152
50–54	Observed cancers	1,806	1,970	1,639	5,415
	Expected cancers	1,171	1,261	1,135	3,567
	Excess cancers	635	709	504	1,848
	Screen-detected cancers	480	969	901	2,350
55–59	Observed cancers	1,576	2,408	1,628	5,612
	Expected cancers	978	1,375	1,187	3,540
	Excess cancers	598	1,033	441	2,072
	Screen-detected cancers	434	1,341	955	2,730
60–64	Observed cancers	1,316	1,906	1,807	5,029
	Expected cancers	967	1,225	1,379	3,571
	Excess cancers	349	681	428	1,458
	Screen-detected cancers	447	1,075	1,172	2,694
65–69	Observed cancers	1,355	1,564	1,372	4,291
	Expected cancers	948	950	895	2,793
	Excess cancers	407	614	477	1,498
	Screen-detected cancers	472	863	905	2,240
70–74	Observed cancers	1,195	946	730	2,871
	Expected cancers	1,169	1103	893	3,165
	Excess cancers	26	−157	−163	−294
	Screen-detected cancers	94	39	41	174
75–79	Observed cancers	1,200	1,082	747	3,029
	Expected cancers	1,282	1,208	931	3,421
	Excess cancers	−82	−126	−184	−392
	Screen-detected cancers	0	0	0	0
80–84	Observed cancers	902	1,030	766	2,698
	Expected cancers	978	1,109	870	2,957
	Excess cancers	−76	−79	−104	−259
	Screen-detected cancers	0	0	0	0
85–89	Observed cancers	499	585	521	1,605
	Expected cancers	603	726	712	2,041
	Excess cancers	−104	−141	−191	−436
	Screen-detected cancers	0	0	0	0

Discussion

Using breast cancer incidence in the two decades prior to screening initiation in Norway, 1976–95, we estimated incidence trends from age–period and age-cohort models. These were then extrapolated to the screening period, 1996–2009, to give expected incidence if the trends continued unchanged. In the screening period, we observed an excess of 6,876 cancers in the screening age range, and a deficit of 1,947 cancers at all other ages. Several observations prohibit use of these figures to estimate overdiagnosis due to screening. These include:

For some age groups and periods, the excess incidence is greater than the number of screen-detected cancers, for example, in ages 50–59 in 1996–2000;

A substantial proportion of the deficit observed in the non-screening ages occurs in women who were never screened, including women too young for the screening programme, and in women already past the upper age limit for screening at the programme’s initiation; for example, women aged 75 or older in 1996–2000.

The first of these implies that some of the observed excess in the screening period is due to symptomatic tumours, and therefore there are changes in the incidence trends between the pre-screening and screening periods, which are not due to screening, and which are not captured by estimation from the pre-screening period alone. The poisson regression models used provide a reasonable fit to the period of estimation (data available from the authors), but cannot be extrapolated to the screening period, because of these unattributed changes in incidence. Further, they cannot provide an estimate of overdiagnosis. This casts doubt on similar past estimates of overdiagnosis from trends in the absence of screening exposure data.^1,7,9,13 Hofvind et al.¹⁴ found that increases in incidence in the screening period in Norway were consistent with changes in use of hormone replacement therapy. There may also be changes in other risk factors, or in breast cancer awareness, which in turn may lead to greater diagnostic or non-programme screening activity. The second implies that deficits observed in the non-screening ages were partly due to changes in incidence trends other than those induced by a compensatory drop in incidence due to lead time.^8,11 As has been previously noted, in order to observe the full compensatory drop, there must be sufficient observation time, of the order of ten years or more, of screened cohorts after they have ceased to be screened.^4,11

Although Table 5 only shows the extrapolation and excess estimates for the age-specific period effect model, qualitatively similar results were obtained for the age-adjusted common period effect, and the discrete age-cohort model (results available from the authors). It could be argued that our excesses are overestimates in any case due to the unavailability of DCIS data in the early pre-screening years. However, restriction of analysis to invasive cancers only yielded the same qualitative results.

There are several conclusions from these results. Firstly, that there are potential errors in estimation of overdiagnosis from screening, in the absence of individual data on screening exposure and detection mode. It is possible that we, and others, have overestimated overdiagnosis in the past, as a result of absence of information on individual exposure to screening and screen-detection of cancers.^1,7,8,13 It would be unwise to present overdiagnosis estimates, based on the balance between observed and expected incidence, unstratified by screening exposure. It may also be necessary to obtain other covariates of risk for reliable prediction of incidence rates from pre-screening data. For reliable estimates, at the very least it will be necessary to compare excess incidence in the screening period in those actually screened , with the corresponding excess (if any) in those not screened. This is the subject of ongoing work.

Footnotes

Acknowledgements

This work was funded by the Research Council of Norway (project number 189520/V50). The funding body had no role in the drafting of this manuscript. No writing assistance was used.

References

Bleyer

Welch

. Effect of three decades of screening mammography on breast-cancer incidence. N Engl J Med 2012; 367: 1998–2005.

Bretthauer

Kalager

. Principles, effectiveness and caveats in screening for cancer. Br J Surg 2013; 100: 55–65.

Independent

. Panel on Breast Cancer Screening. The benefits and harms of breast cancer screening: an independent review. Lancet 2012; 380: 1778–86.

Duffy

Parmar

. Overdiagnosis in cancer screening: the importance of long term observation and lead time. Breast Cancer Research 2013; 15: R41–R41.

Njor SH, Garne JP, Lynge E. Over-diagnosis estimate from the Independent UK Panel on Breast Cancer Screening is based on unsuitable data. J Med Screen 2013;20:104–5.

Duffy

. The benefits and harms of breast cancer screening. Lancet 2013; 381: 802–802.

Duffy

Tabar

Olsen

. Absolute numbers of lives saved and overdiagnosis in breast cancer screening, from a randomized trial and from the breast screening programme in England. J Med Screen 2010; 17: 25–30.

Jørgensen

Gøtzsche

. Overdiagnosis in publicly organised mammography screening programmes: systematic review of incidence trends. BMJ 2009; 339: b2587–b2587.

Puliti

Zappa

Miccinesi

Falini

Crocetti

Paci

. An estimate of overdiagnosis 15 years after the start of mammographic screening in Florence. Eur J Cancer 2009; 45: 3166–71.

10.

Hellquist

Duffy

Nyström

Jonsson

. Overdiagnosis in the population-based service screening programme with mammography for women aged 40-49 in Sweden. J Med Screen 2012; 19: 14–9.

11.

Puliti

Duffy

Miccinesi

de Koning

Lynge

Zappa

Paci

. Euroscreen Working Group. Overdiagnosis in mammographic screening for breast cancer in Europe: a literature review. J Med Screen 2012; 19(S1): 42–56.

12.

Tabar

Dean

Chen

Yen

Chiu

Tot

Smith

Duffy

The impact of mammography screening on the diagnosis and management of early phase breast cancer. In: Francescatti

Silverstein

(eds). Selected Topics in Breast Surgery, New York, NY: Springer, 2013.

13.

Kalager

Adami

Bretthauer

Tamimi

. Overdiagnosis of breast cancer due to mammography screening: results from the Norwegian screening programme. Ann Intern Med 2012; 156: 491–9.

14.

Hofvind

Sakshaug

Ursin

Graff-Iversen

. Breast cancer incidence trends in Norway- explained by hormone therapy or mammographic screening? Int J Cancer 2012; 130: 2930–8.

Trends in aggregate cancer incidence rates in relation to screening and possible overdiagnosis: A word of caution

Abstract

Background

Methods

Results

Conclusion

Keywords

Introduction

Data and methods

Results

Discussion

Footnotes

Acknowledgements

References