Facility Mammography Volume in Relation to Breast Cancer Screening Outcomes

Abstract

Objectives

To clarify the relationship between facility-level mammography interpretive volume and breast cancer screening outcomes.

Methods

We calculated annual mammography interpretive volumes from 2000–2009 for 116 facilities participating in the U.S. Breast Cancer Surveillance Consortium (BCSC). Radiology, pathology, cancer registry, and women’s self-report information were used to determine the indication for each exam, cancer characteristics, and patient characteristics. We examined the effect of annual total volume and percentage of mammograms that were screening on cancer detection rates using multinomial logistic regression adjusting for age, race/ethnicity, time since last mammogram, and BCSC registries. “Good prognosis” tumours were defined as screen-detected invasive cancers that were <15 mm, early stage, and lymph node negative at diagnosis.

Results

From 3,098,481 screening mammograms, 9,899 cancers were screen-detected within one year of the exam. Approximately 80% of facilities had annual total interpretive volumes of >2,000 mammograms, and 42% had >5,000. Higher total volume facilities were significantly more likely to diagnose invasive tumours with good prognoses (odds ratio [OR] 1.32; 95% confidence interval [CI] 1.10–1.60, for total volume of 5,000–10,000/year v. 1,000–2,000/year; p-for-trend <0.001). A concomitant decrease in tumours with poor prognosis was seen (OR 0.78; 95%CI 0.63–0.98 for total volume of 5,000–10,000/year v. 1,000–2,000/year).

Conclusions

Mammography facilities with higher total interpretive volumes detected more good prognosis invasive tumours and fewer poor prognosis invasive tumours, suggesting that women attending these facilities may be more likely to benefit from screening.

Keywords

Cancer detection mammography breast cancer screening volume

Introduction

While there is compelling evidence that mammography leads to a decrease in breast cancer mortality, benefit is largely gained through early detection of some invasive tumours.^1–4 There is widespread agreement that the quality of mammography has to be high for mammography to maximize mortality benefits.⁵ Defining and measuring quality in mammography interpretation are challenging. Mammography quality can be measured using characteristics of the images, or by numerous measures of interpretive performance, such as the proportion of women called back for additional imaging, sensitivity, specificity, and positive predictive value. Another important measure of quality is the rate of detecting early invasive cancer,^4,5 as it is only by diagnosing and treating invasive cancers early, and thereby preventing progression to more advanced disease, that mammography can achieve a mortality benefit.^1–4

Volume-outcome relationships have been identified across broad areas of medicine, including for cancer directed treatments.^6,7 Although there has been considerable work published on variation in breast screening performance by physician volume,^8–15 less has been published on facility volume. Mammography facilities vary significantly in volume of mammograms interpreted,¹⁶ but whether this variation relates to quality is unclear. Some evidence suggests that facility-level mammography volume may be associated with better mammography interpretive performance, although this relationship has not been consistently demonstrated.^15,17,18 Two studies examining facility-level cancer detection – one in the United Kingdom¹⁸ and one in Canada,¹⁵ demonstrated a positive association between facility mammography volume and overall cancer detection rate, even when accounting for radiologist volume.¹⁵

Most analyses of mammography quality focus on measures of sensitivity (the proportion of cancers for which the mammogram detects the cancer), and specificity (the proportion of women without cancer who have a normal test result). However, it is really the detection of good prognosis tumours (small, early stage and, lymph node negative cancers) that would be expected to influence mortality rates and/or excess morbidity, rather that the detection of all cancers. We here assess the association between facility interpretive volume and the rate of detecting good prognosis invasive tumours. We hypothesized that cancer detection rate of more favourable prognosis tumours may be associated with greater facility mammography volume because readers in those facilities would have more experience and skill.

Methods

Study Population and Data Sources

This study included data from 116 facilities participating in one of seven U.S. Breast Cancer Surveillance Consortium (BCSC) breast imaging registries (San Francisco Mammography Registry, Colorado Mammography Advocacy Project, Carolina Mammography Registry, New Hampshire Mammography Network, Vermont Breast Cancer Surveillance Consortium, Group Health Registry, New Mexico Mammography Project). These registries collect information on mammography performed at participating facilities in their defined catchment areas and link this information to state tumour registries or regional Surveillance Epidemiology and End Results programmes, to obtain population-based cancer data.¹⁹ Demographic and breast cancer risk factor data including age and first-degree family history are collected using a self-reported questionnaire completed at each screening mammogram. Time since last mammogram is both self-reported and derived from observed BCSC registry data. Both the facility at which the mammogram is performed and the facility at which it is interpreted are tracked separately through registry protocols. The BCSC registries and Statistical Coordinating Center received institutional review board approval for active or passive consenting processes and a Federal Certificate of Confidentiality and other protections for participating women, physicians, and facilities. All procedures are Health Insurance Portability and Accountability Act compliant.¹⁹

Mammography Data and Mammographic Volume Definition

Interpretive volumes (total and screening) were calculated monthly for facilities from 2000–2009. Mammography exams indicated by the radiologist to be performed for screening and not for the additional evaluation of a prior mammogram, short interval following up, or the evaluation of a breast symptom, were considered screening exams.^20–22 These monthly volume measures were then aggregated to generate time-varying facility measures of annual screening volume, total volume, and percent of total volume that is screening as in prior studies.^13,14,21

The unit of analysis was at the mammogram level. At each mammogram, woman-level data were collected including information on age, race and ethnicity, screening history, and breast cancer risk factors. For measuring our outcomes, we included screening mammograms performed from 2001–2009 among women with no prior history of breast cancer, mastectomy, or breast augmentation. We were primarily interested in whether the interpretive volume at a facility in the year prior to each study screening mammogram was associated with detection of good or poor prognosis invasive tumours (defined below) diagnosed within the year following the screening mammogram, after controlling for important woman-level characteristics.

Breast Cancer Cases and Tumour Characteristics

Women diagnosed with invasive breast carcinoma within one year of a positive screening mammogram and before their next screening mammogram were considered to have screen-detected cancer.^21,23 Tumour characteristics were collected from tumour registries and pathology databases and included: size (small, <10 mm; moderate, 10 -<15mm; large, ≥15 mm), AJCC 6^th edition stage (early, 1, 2a; late, 2b, 3, 4),²⁴ and invasive nodal status (negative, positive).^13,14,21 “Good prognosis” tumours were defined as screen-detected invasive cancers that were: small or moderate size (<15 mm), early stage (stage 1, 2a), and lymph node negative at diagnosis. If a screen-detected tumour was large (≥15 mm), late stage (2b, 3,4), or lymph node positive, we classified it as a poor prognosis tumour.^13,14,21

Analysis

We examined the distribution of mammograms, and overall and screen-detected cancer cases according to the interpreting facility volume measures (annual screening, annual total, and % screening) interpreted at the facility in the year prior to the mammogram.^13,14 We also characterized women’s age, race/ethnicity, and time since prior mammogram in relation to those measures. The distributions of invasive tumour characteristics (size, stage, and nodal status) were calculated within each volume measure level. We also calculated crude, unadjusted rates of detection of cancers with each tumour characteristic within these volume measure categories. To examine the association between facility interpretive volume and detection of cancers with different tumour characteristics, we modeled outcomes (detection of ‘good prognosis invasive cancers’, and ‘poor prognosis invasive cancers’) with multinomial logistic regression. The models were adjusted for the potentially confounding factors of age, race/ethnicity, BCSC registry, and time since prior mammogram.

We used generalized estimating equations, assuming an independent working correlation, and robust standard errors to account for clustering of exams at the reading facility. Separate models were fit for each of the volume measures. We estimated multinomial odds ratios (OR) and 95% confidence intervals (CI) for each volume level, and calculated a test of trend across the volume categories using a Wald test. We performed multiple imputations via chained equations to account for missing data on invasive tumour characteristics, race/ethnicity, and mammography history prior to fitting our outcome models, and we adjusted statistical inference estimates accordingly using Rubin's rules.²⁵

Results

The majority of facilities had total annual interpretive volumes of >2,000–5,000 mammograms (37.9%), and almost 20% had volumes of ≥10,000 (Table 1). From the 3,098,481 screening mammograms performed from 2001–2009 that were included in our outcome analyses, 9,899 invasive cancers were screen-detected within 12 months of these screening mammograms (3.2 cancers/1000 screening mammograms) (Table 1). Overall and screen-detected cancer rates within one year of mammogram were similar across facilities with different volumes (Table 1). Demographic characteristics of women did not vary notably across volume categories, with the exception of race. White, non-Hispanic women were more highly represented in the lower volume facilities (<5000 average annual mammograms), while Black, non-Hispanic women were more highly represented in facilities with greater annual volume (>5000 mammograms). Race and ethnicity did not appear to have a consistent association with percent of volume that is screening. (Appendix 1, available online).

Table 1.

Descriptives of the study population.

	Facilities		Screening mammograms included		Invasive cancers within 1 year of screening mammogram		Screen-detected invasive cancers
	N	%	N	%	N	Cancer rate/ 1,000 screens	N	Detection rate/ 1,000 screens
Total	116		3,098,481		12,064	3.9	9,899	3.2
Interpretative volume of facility at which screening mammogram was interpreted
Annual volume (screening and diagnostic)
(480, 1000]	4	3.4	16,174	0.5	58	3.6	47	2.9
(1000, 2000]	19	16.4	117,295	3.8	432	3.7	363	3.1
(2000, 5000]	44	37.9	633,422	20.4	2,492	3.9	2,097	3.3
(5000, 10000]	26	22.4	715,773	23.1	3,097	4.3	2,552	3.6
10000+	23	19.8	1,615,817	52.1	5,985	3.7	4,840	3.0
% of annual volume that is screening
≤75%	3	2.6	129,477	4.2	656	5.1	562	4.3
(75, 80]	19	16.4	813,911	26.3	3,268	4.0	2,714	3.3
(80, 85]	24	20.7	884,230	28.5	3,323	3.8	2,740	3.1
(85, 90]	35	30.2	619,442	20.0	2,387	3.9	1,925	3.1
(90, 95]	25	21.6	503,150	16.2	1,958	3.9	1,583	3.1
(95, 100]	10	8.6	148,271	4.8	472	3.2	375	2.5

Totals shown at the facility level are based on measuring the average annual volume (and % that is screening) interpreted at the facility over the entire study period. At the mammogram level, however, these are measures based on volume interpreted at the facility in the year prior to the mammogram. Other tables use this latter measure of volume.

Higher rates of detecting small and moderate invasive cancers were seen with increasing total volume and decreasing percent of total volume that is screening (Table 2). This pattern was also observed for early stage and node-negative cancers. The converse may also be noted, with the results suggesting fewer large, late stage and positive nodes and poor prognosis cancers trending with increasing total volume and higher % of total volume that is screening (Table 2).

Table 2.

Unadjusted cancer detection rates per 1,000 screening mammograms.

	Invasive cancer (rate per 1,000 screening mammograms)
	Size			Stage		Nodal status		Overall tumour prognosis*
	Small (<10 mm)	Moderate (10 -<15mm)	Large (≥15 mm)	Early (0,1,2a)	Late (2b,3,4)	Negative	Positive	Good	Not good
Total	0.9	0.8	1.4	2.6	0.4	2.4	0.7	1.4	1.6
Interpretative volume of facility at which screening mammogram was interpreted
Annual volume (screening and diagnostic)
(480, 1000]	0.6	0.4	1.5	1.8	0.6	1.7	0.9	0.7	1.8
(1000, 2000]	0.7	0.6	1.6	2.4	0.5	2.2	0.8	1.1	1.8
(2000, 5000]	0.9	0.8	1.4	2.6	0.4	2.4	0.7	1.4	1.6
(5000, 10000]	1.0	0.9	1.5	2.9	0.5	2.6	0.8	1.6	1.7
10000+	0.9	0.7	1.3	2.5	0.4	2.3	0.7	1.4	1.5
% of annual volume that is screening
≤75%	1.2	1.2	1.8	3.7	0.5	3.3	1.0	2.0	2.1
(75, 80]	1.0	0.8	1.4	2.8	0.4	2.5	0.7	1.6	1.6
(80, 85]	0.9	0.7	1.3	2.6	0.4	2.4	0.6	1.4	1.5
(85, 90]	0.8	0.7	1.4	2.4	0.4	2.2	0.8	1.3	1.6
(90, 95]	0.8	0.8	1.4	2.5	0.5	2.2	0.8	1.2	1.6
(95, 100]	0.7	0.6	1.2	2.1	0.3	1.9	0.5	1.1	1.3

Good prognosis tumours defined as screen-detected invasive cancers that have all three of the following characteristics: small/moderate size (<15 mm), early stage (<2b), and node negative. Not good prognosis tumours defined as screen-detected invasive cancers that have at least one of the following characteristics: large size (≥15 mm), late stage (2b,3,4), or node positive.

580 of 9,899 invasive cancers detected had unknown tumour prognosis. The cancer detection rate of invasive cancers with unknown tumour prognosis was 0.2 per 1,000 screening mammograms overall, and in subgroups defined by volume this rate ranged from 0.1 to 0.5 per 1,000 screening mammograms. Detail on the distribution of tumour characteristics among all screen-detected cancers is shown in Appendix 2, available online.

In adjusted models, higher volume facilities were significantly more likely to diagnose tumours with good prognostic features (Table 3). In comparison with facilities with an average annual volume of >1,000–2,000 mammograms interpreted per year, facilities with an annual volume of 5,000–10,000 per year were 32% more likely to diagnosis a good prognosis tumour (OR 1.32, 95% CI 1.10–1.60, p-for-trend <0.001) (Table 3). After multivariable adjustment, neither volume model measure demonstrated a significant trend with detection of poor prognosis tumours (Table 3). While there was no significant trend, the two highest annual total volume groups had significantly fewer poor prognosis tumours detected relative to the >1,000–2,000 volume referent group (OR 0.78, 95% CI 0.63–0.98 for >5,000–10,000 and OR 0.72, 95% CI 0.57–0.90 for 10,000+). The percent of annual volume that is screening did not appear to be related to a trend in good- v. poor prognosis tumours (Table 3).

Table 3.

Adjusted (multinomial) odds ratios for the association between cancer detection and volume of the interpreting facilities in the year prior to the mammogram*.

	Invasive cancer
	Overall tumour prognosis**
	Good		Not good
Interpretative volume of facility at which screening mammogram was interpreted	OR	95% CI	OR	95% CI
Annual volume (screening and diagnostic)
(480, 1000]	0.71	(0.47, 1.05)	0.84	(0.55, 1.29)
(1000, 2000]	1.00	REF	1.00	REF
(2000, 5000]	1.22	(1.03, 1.46)	0.82	(0.67, 1.01)
(5000, 10000]	1.32	(1.10, 1.60)	0.78	(0.63, 0.98)
10000+	1.27	(1.05, 1.52)	0.72	(0.57, 0.90)
	p-for-trend <0.001		p-for-trend = 0.159
% of annual volume that is screening
≤75%	0.99	(0.83, 1.18)	1.02	(0.84, 1.23)
(75, 80]	1.00	REF	1.00	REF
(80, 85]	0.92	(0.82, 1.04)	0.93	(0.85, 1.02)
(85, 90]	0.89	(0.76, 1.04)	1.04	(0.93, 1.15)
(90, 95]	0.86	(0.74, 1.01)	1.05	(0.92, 1.19)
(95, 100]	0.80	(0.62, 1.03)	0.93	(0.80, 1.08)
	p-for-trend = 0.060		p-for-trend = 0.777

Separate models were estimated for annual total volume and for % that is screening. Models are based on using generalized estimating equations to fit multinomial logistic regression models, accounting for clustering of exams by interpreting facility, and adjusting for mammography registry, age, time since prior mammography, and race/ethnicity. Multinomial odds ratios (OR) with 95% confidence intervals (CI) are shown relative to a reference volume level for each model. In addition to ORs and 95% CIs, we present the p-value for a trend test assessing whether risk of cancer detection tends to increase with increasing volume or percent that is screening. We performed multiple imputation to account for missing data (race, time since prior mammography, and tumour characteristics) prior to fitting our outcome models, and we adjusted statistical inference estimates accordingly using Rubin's rules.

Discussion

Our study is unique in examining interpretive quality as measured by characteristics of screen-detected cancers in relation to facility volume. Detecting invasive cancers that are small, early stage, and node-negative may yield the most mortality reduction – and morbidity benefit – compared with large, late stage, disseminated cancers, the detection of which may not notably improve mortality or morbidity, although we also note the potential for overdiagnosis. We found significant differences in mammographic outcomes by facility interpretive volume. As total volume increased, the rate of detecting “good prognosis” invasive cancers tended to increase. Even after adjusting for potential confounders and possible correlation between exams read at the same facility, the likelihood of detecting “good prognosis” invasive cancers increased significantly with increasing volume.

This study adds important evidence to the literature on mammography performance because few studies have focused on the relationship of facility interpretive volume and interpretive performance, particularly in terms of cancer detection rates and tumour characteristics. High facility volume has been hypothesized to be associated with improved outcomes, such as early detection of invasive cancers, however, the scant literature has been mixed and has measured various outcomes with differing volume measures.^{15–18,23,24} Also using BCSC data, Taplin et al. studied annual facility volume and screening mammography interpretive accuracy and found no association with sensitivity,¹⁷ and Jackson et al. found no relationship with sensitivity or the area under the receiver operating curve of diagnostic mammography with diagnostic mammography facility volume.²⁶ In contrast, two international studies found positive associations of volume with screening mammography interpretive performance, two with cancer detection^15,18 and one with positive predictive value.¹⁸ However, comparability is uncertain, given differing clinical practices outside of the U.S. The significant association we found with volume and detection of invasive cancers with “good” characteristics supports the volume-outcome relationship for mammography at the facility level.

The Mammography Quality Standards Act addresses radiologist, not facility, volume, with a minimum interpretive volume of 960 mammograms every two years. This is 5–10-fold less than for other countries such as the UK, Canada, and Australia.²⁷ Facility volume is not currently assessed for quality assurance. Our results suggest better early invasive cancer detection above 2,000 mammograms interpreted annually at a facility, which is consistent with our prior findings suggesting improved screening performance with 2,000 exams interpreted annually for radiologists.^14,21 Measuring facility interpretive volume may be more practical to carry out, because many radiologists practice at more than one facility, making accurate tracking across multiple facilities challenging. Further, there are many small facilities, which may have more difficulty recruiting highly skilled breast imagers, may not have the most up-to-date equipment or technologists, may not have the resources to allow extensive follow up of abnormal cases as part of ongoing quality control,¹⁷ may not have the resources to permit extensive continuing medical education, and may not provide sufficient volume for radiologists to maintain a high level of clinical skill. Facility level quality is also important, as many patients have some ability to choose which facility they attend. However, our findings are based on the facility at which mammograms were interpreted, which may not be apparent to patients if images taken at a facility are sent to another facility for interpretation.

This study had some limitations, which we addressed to the extent possible, but are noted here. First, 5.9% of the screen-detected invasive cancer cases from the >3 million mammograms followed were missing tumour information, precluding their classification as good or poor prognosis, and thus were imputed during the multiple imputation process. We are not able to study the relative contribution of radiologist-level v facility-level volume influences, given that many radiologists work at multiple facilities, including facilities outside the BCSC. We also cannot be certain that women seen at each facility were at similar risk of breast cancer, although we explored overall cancer rates in Table 1 and adjusted for likely confounders. We chose not to include ductal carcinoma in situ (DCIS) cases because our conceptual framework was based on tumour prognosis in relation to volume as a marker for interpretive quality. In that framework, we hypothesized that higher volume would be associated with detection of more “good prognosis” tumours, which could conceivably yield a mortality benefit from early detection. For early invasive cancers, this argument is easier to make, but for DCIS, the desirability of more detection is in question and much recent debate has arisen regarding detection of DCIS as a “benefit” or a “harm” of breast cancer screening. This is therefore outside of the conceptual framework of this study.²⁸

Conclusion

This study shows a positive association of facility-level interpretive volume with detection of invasive tumours having “good” characteristics, and possibly a concomitant decrease in detection of poor prognosis tumours. Based on these findings, we speculate that facility-level mammography quality monitoring could be useful, and should focus, in part, on tumour characteristics. In addition, volume requirements for facilities may be considered, with at least an average of 2,000 mammograms annually recommended. This may be achievable now that most facilities use digital mammography, enabling small facilities to send their mammograms to be interpreted by larger facilities. Also, studies to isolate the mechanism by which volume affects quality may guide interventions to achieve similar performance gains among smaller volume facilities.

Footnotes

Acknowledgements

This work was supported by the American Cancer Society, made possible by a generous donation from the Longaberger Company's Horizon of Hope®Campaign (SIRSG-07-271, SIRSG-07-272, SIRSG-07-273, SIRSG-07-274, SIRSG-07-275, SIRGS-06-281, SIRSG-09-270, SIRSG-09-271], the Breast Cancer Stamp Fund, and the National Cancer Institute Breast Cancer Surveillance Consortium (HHSN261201100031C]. This study was also supported by the National Cancer Institute R21 CA131698 and K24 CA125036. The collection of cancer and vital status data used in this study was supported in part by several state public health departments and cancer registries throughout the U.S. For a full description of these sources, please see: http://www.breastscreening.cancer.gov/work/acknowledgement.html. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Cancer Institute or the National Institutes of Health. Their work was invaluable to the success of this project. We also thank the participating women, mammography facilities and radiologists for the data they have provided for this study. A list of the BCSC investigators and procedures for requesting BCSC data for research purposes are provided at: .

References

Nyström

Andersson

Bjurstam

Frisell

Nordenskjöld

Rutqvist

. Long-term effects of mammography screening. Lancet. 2002; 359(9310): 909–919.

Gøtzsche PC, Nielsen M. Screening for breast cancer with mammography [update of Cochrane Database Syst Rev. 2001;[4]:CD001877]. Cochrane Database Syst Rev. 2006;[4]:CD001877.

Berry

Cronin

Plevritis

Fryback

Clarke

Zelen

Mandelblatt

Yakovlev

Habbema

JDF

Feuer

. Effect of screening and adjuvant therapy on mortality from breast cancer. NEJM 2005; 353: 1784–1792.

Bleyer

Welch

. Effect of three decades of screening mammography on breast-cancer incidence. N Engl J Med 2012; 367: 1998–2005.

U.S. Food and Drug Administration Mammography Quality Standards Act and Program National Statistics. 2009. [Accessed October 12, 2009]. Available at: http://www.fda.gov/Radiation-EmittingProducts/MammographyQualityStandardsActandProgram/FacilityScorecard/ucm113858.htm.

Begg

Cramer

Hoskins

Brennan

. Impact of hospital volume on operative mortality for major cancer surgery. JAMA 1998; 280: 1747–1751.

Hillner

Smith

Desch

. Hospital and physician volume or specialization and outcomes in cancer treatment: importance in quality of cancer care. J Clin Oncol 2000; 18: 2327–2340.

Elmore

Wells

Lee

Howard

Feinstein

. Variability in radiologists’ interpretations of mammograms. N Engl J Med 1994; 331: 1493–1499.

Ciatto

Houssami

Apruzzese

Bassetti

Brancato

Carozzi

. Reader variability in reporting breast imaging according to BI-RADS® assessment categories [the Florence experience]. The Breast 2006; 15: 44–51.

10.

Berg

Campassi

Langenberg

Sexton

. Breast Imaging Reporting and Data System: Inter- and intraobserver variability in feature analysis and final assessment. Am J Roentgenol 2000; 174: 1769–1777.

11.

Beam

Conant

Sickles

. Association of volume and volume-independent factors with accuracy in screening mammogram interpretation. JNCI 2003; 95: 282–290.

12.

Esserman

Cowley

Eberle

Kirkpatrick

Chang

Berbaum

Gale

. Improving the accuracy of mammography: volume and outcome relationship. JNCI 2002; 94: 369–375.

13.

Haneuse

SJPA

Anderson

Buist

DSM

Sickles

Carney

Onega

Geller

Kerlikowske

Rosenberg

Yankaskas

Elmore

Taplin

Smith

Miglioretti

. Mammographic interpretive volume and diagnostic mammography interpretive performance in community practice. Radiology 2011; 259(1): 72–84.

14.

Buist

DSM

Anderson

Haneuse

SJPA

Sickles

Smith

Carney

Taplin

Rosenberg

Geller

Onega

Monsees

Bassett

Yankaskas

Elmore

Kerlikowske

Miglioretti

. Influence of annual interpretive volume on United States screening mammography performance. Radiology 2011; 259(1): 72–84.

15.

Theberge

Hebert-Croteau

Langlois

Major

Brisson

. Volume of screening mammography and performance in the Quebec population-based Breast Cancer Screening Program. CMAJ 2005; 172: 195–199.

16.

Hèbert-Croteau

Roberge

Brisson

. Provider’s volume and quality of breast cancer detection and treatment. Breast Cancer Res Treat 2007; 105: 117–132.

17.

Taplin

Abraham

Barlow

Fenton

Berns

Carney

Cutter

Sickles

D’Orsi

Elmore

. Mammography facility characteristics associated with interpretive accuracy of screening mammography. JNCI 2008; 100: 876–887.

18.

Blanks

Bennett

Wallis

Moss

. Does individual programme size affect screening performance? Results from the United Kingdom NHS breast screening programme. Journal of Medical Screening 2002; 9: 11–14.

19.

National Cancer Institute. http://breastscreening.cancer.gov.

20.

National Cancer Institute. http://breastscreening.cancer.gov/data/bcsc_data_definitions.pdf.

21.

Buist DSM, Anderson ML, Smith RA, Carney PA, Miglioretti DL, Monsees BS, Sickles EA, Taplin SH, Geller BM, Yankaskas BC, Onega TL. Effect of radiologists’ diagnostic work-up volume on interpretive performance. Radiology. 2014; Ahead of print, 10.1148/radiol.14132806.

22.

Sickles

Miglioretti

Ballard-Barbash

. Performance benchmarks for diagnostic mammography. Radiology 2005; 235: 775–790.

23.

Rosenberg

Yankaskas

Abraham

. Performance benchmarks for screening mammography. Radiology Oct 2006; 241(1): 55–66.

24.

American Joint Committee on Cancer [AJCC], version 7.

25.

Rubin

. Multiple imputation after 18+ years. Journal of the American Statistical Association 1996; 91: 473–489.

26.

Jackson

Taplin

Sickles

. Variability of interpretive accuracy among diagnostic mammography facilities. JNCI 2009; 101: 814–827.

27.

Théberge

Chang

Vandal

Daigle

Guertin

Pelletier

Brisson

. Radiologist interpretive volume and breast cancer screening accuracy in a Canadian organized screening program. JNCI 2014; 106: djt461–djt461.

28.

Mandelblatt

Cronin

Bailey

. Effects of mammography screening under different screening schedules: model estimates of potential benefits and harms. Ann Intern Med 2009; 151: 738–747.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.27 MB