Abstract
Introduction
Breast density is a risk factor for breast cancer and reduces the sensitivity of mammography. Manual breast imaging reporting and data system (BI-RADS) classification remains the clinical standard, but automated methods have been developed to improve reproducibility and efficiency. This review evaluated the concordance between automated/semi-automated measurements and manual assessments of mammographic breast density.
Methods
We systematically searched MEDLINE, Embase, Cochrane Database of Systematic Reviews, CENTRAL, Scopus, and Web of Science (2014 onwards) for studies comparing automated or semi-automated measurement with manual BI-RADS classification on 2D digital mammography. Eligible studies included ≥60% of participants from routine screening populations. Data extraction and risk of bias assessment followed a registered protocol (PROSPERO: CRD42024550250).
Results
There is good concordance between automated/semi-automated measurement and manual assessment of breast density in the 26 included studies. Meta-analysis of 13 Volpara studies showed a tendency to classify mammograms as dense compared with manual assessment, but the difference was not statistically significant and statistical heterogeneity was very high (pooled difference 0.03, 95% CI −0.03 to 0.10; I2 = 98%). Studies of Quantra and other software showed broadly similar findings, but variability in software versions and BI-RADS editions limited comparability. Reporting of participant demographics was poor, thus generalisability is unclear.
Conclusions
Automated breast density software, such as Volpara and Quantra, shows promising concordance with manual BI-RADS assessment and may enhance consistency in screening programmes. Heterogeneity across studies and limited information on representativeness preclude firm conclusions. Large-scale, standardised, and inclusive evaluations are needed to establish clinical utility.
Funding
National Institute for Health and Care Research
Introduction
In the UK, breast cancer is the most common type of cancer among women, accounting for 15% of all new cancer cases. Based on data from 2016 to 2018, there are approximately 56,000 new breast cancer cases in the UK annually, corresponding to more than 150 per day. 1 The UK breast cancer screening programme currently screens all women aged 50–70 years at 3-year intervals with mammography. Although breast cancer screening is highly successful in preventing breast cancer mortality (20-40% reduction in risk),2–4 breast cancer deaths are still not prevented in a substantial proportion of people due to underdiagnosis. 5
In the context of screening, breast density is of concern because women with high breast density have an increased risk of breast cancer compared to those with low breast density, 6 and the sensitivity of mammography screening is lower in women with denser breasts. 7 In clinical practice, the biologic continuum for breast density is categorised into four groups according to the American College of Radiology breast imaging reporting and data system (ACR BI-RADS) atlas, 6 which is used as a clinical reference, with ‘A’ referring to breasts that are entirely fatty, ‘B’ referring to breasts with scattered areas of fibroglandular density, ‘C’ referring to breasts that are heterogeneously dense, and ‘D’ referring to extremely dense breasts. Women with extremely dense and moderately dense breasts (BI-RADS groups D and C) are at particular risk of underdiagnosis and may account for almost half of the screening population. 8
In clinical practice, breast density assessment has traditionally been conducted by subjective manual assessment where the radiologist inspects mammograms to categorise the density of the breast. More recently, automated and semi-automated quantitative methods of breast density assessments have been developed to improve the reproducibility of breast density assessment, and these may improve workflow efficiency. 9
Automated software systems such as Volpara and Quantra provide density measurements that correspond to the four BI-RADS density categories, but a review by Patterson et al. (2019) for the UK National Screening Committee (UK NSC) found that, while the test-retest reliability of automated methods was good, and reliability was better than human readers, there was a paucity of high-quality evidence and the concordance (agreement) between automated methods was variable. 10 They concluded that automated methods cannot be used interchangeably to measure breast density. Since there is yet to be a comprehensive systematic review that evaluates concordance between automated and manual methods, the objective of this review was to determine the agreement (concordance) between automated/semi-automated measurement and manual assessment of mammographic breast density.
Methods
This systematic review was commissioned by the UK NSC via the UK National Institute for Health and Care Research (NIHR) and was conducted in accordance with the recommendations of the Cochrane Handbook for Systematic Reviews of Interventions 11 and reported in adherence with the Preferred Reporting Items for Systematic Reviews guidelines. 12 The methods were pre-specified in a protocol and registered with the PROSPERO International Prospective Register of Systematic Reviews, available from: https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42024550250.
Two patient and public involvement (PPI) partners were part of the project's Advisory Group, which also included academic and clinical experts. One PPI partner has lived experience of undergoing mammography for routine breast screening, and the other has lived experience of breast cancer and undergoing mammography. PPI partners contributed to regular Advisory Group discussions and made recommendations at each stage of the project.
Most people who use the UK's breast screening programme identify as women, though not all do. While using exclusively gender-neutral language can enhance inclusivity, it may also reduce clarity. None of the studies included in our review reported data on non-binary participants. We have, therefore, chosen to use both ‘women’ and gender-neutral language where appropriate. We acknowledge that this is a compromise; however, when we refer to ‘women’, we ask readers to interpret this as including all individuals who use the breast screening service, not only those who identify as women.
Search strategy
Comprehensive search strategies were developed by an information scientist (PM) with input from expert advisors to identify studies of any design that compared manual assessment and automated/semi-automated measurement of breast density. The databases searched were MEDLINE, Embase, Cochrane Database of Systematic Reviews, CENTRAL, Scopus, and Web of Science. There were no restrictions on study type or language at the search stage, but results were limited to articles published in 2014 onwards. This was chosen as the start date to ensure the search was comprehensive and captured recent technological advances. All references were exported to endnote for recording and deduplication. The reference lists of all articles selected for full-text appraisal were screened for additional studies. Details of the search strategies are reported in Appendix 1 (see online Supplementary Material).
Study selection
Full-text articles of published studies were eligible for inclusion if they reported the agreement between automated or semi-automated measurement of breast density and manual (visual) assessment of breast density using the BI-RADS breast density scoring system 6 (editions 3, 13 4, 14 or 5) 7 for 2D digital mammography or they reported the resources required to measure breast density for the two methods, and included at least 60% of participants in their sample who underwent mammography for routine breast cancer screening and had no prior history of breast cancer. Studies of synthetic or spectral mammography were deemed ineligible because 2D digital mammography is the standard imaging modality used in the UK National Breast Screening Programme. We aimed to include mammograms from participants who are representative of the UK general screening population, rather than those undergoing mammography for diagnostic indications or for surveillance for second primary or recurrent breast cancer. Case–control studies, systematic reviews, editorials, conference abstracts, letters, and opinion articles were not eligible for the review.
Two reviewers (CR and SD) independently screened 20% of the titles and abstracts to ensure consistency by comparing their results. The remaining citations were screened by a single reviewer (CR). All potentially relevant full-text articles were retrieved and assessed for inclusion by one reviewer (CR), with a second reviewer (MB, DC or SD) checking all articles labelled as unclear (20%). Any disagreements were resolved through discussion between reviewers.
We attempted to contact the corresponding authors of studies where details of the study population were unclear, provided the study included at least 200 participants (or 200 mammograms if the number of participants was not reported). Due to time constraints, we excluded studies with unclear population details that had fewer than 200 participants or 200 mammograms without attempting to contact the study authors.
Data extraction and risk of bias assessment
A single reviewer (CR) conducted data extraction using a pre-specified data extraction form that was developed with input from the Advisory Group and in accordance with guidance from the PRO-EDI initiative 15 for considering equality, diversity, and inclusion of participant characteristics in evidence syntheses. The same reviewer conducted risk of bias assessment using an adapted version of the review body for interventional procedures (ReBIP) quality assessment tool for non-randomised comparative and case series studies. The ReBIP 18-item checklist was originally developed for National Institute for Health and Care Excellence and was adapted from several quality assessment checklists and guidance documents, including the National Health Service (NHS) Centre for Reviews and Dissemination's guidance, Verhagen and colleagues, Downs and Black, and the generic appraisal tool for epidemiology.16–19 The tool assesses bias and generalisability, sample definition and selection, description of the intervention, outcome assessment, adequacy of follow-up, and performance of the analysis. Individual ReBIP question items 1 to 12 were rated as ‘yes’, ‘no’ or ‘unclear’. A rating of ‘yes’ denoted the optimal rating for methodological quality. Items 13 to 18 of the checklist were considered unsuitable for the scope of the current review. MI conducted a 20% check of the data extraction and risk of bias assessments.
Data analysis
Investigators have used a range of statistical methods to assess agreement between manual assessment and automated/semi-automated measurement including the area under the curve, measure of diagnostic accuracy. These include sensitivity or specificity, Spearman's rank correlation coefficient, Pearson's correlation coefficient, and the kappa statistics. The choice of methods was at the discretion of each study's authors. Among studies reporting the kappa statistics, there was variation in whether the kappa was weighted and, if so, whether linear or quadratic weights were applied. In certain cases, only the kappa value was provided with no accompanying indication of precision. Furthermore, the basis of agreement differed across studies. In some, the agreement was assessed for binary classifications (dense versus non-dense), while in others it was related to the four density categories.
Where studies reported the proportions of participants classified into breast density categories by both automated/semi-automated and manual methods, we analysed these proportions for studies using similar automated/semi-automated software to determine the ability of the software to consistently classify participants as having dense or non-dense breasts compared with manual assessment. For studies reporting numerical data for both automated/semi-automated and manual methods, we conducted a random effects meta-analysis to compare the proportions classified as dense and non-dense. This analysis was possible for studies comparing Volpara to manual assessment. Among these studies, there were four multi-arm studies. To avoid potential bias, we did not split the Volpara group when there were multiple control groups, nor did we combine control groups when these represented different manual assessments of the same participants. For the main meta-analysis, we selected the more recent version of Volpara when two were available, e.g., Gemici et al. (2020), 20 and the findings from breast imaging experts in Eom et al. (2018), 21 randomly selected observer 1 in Singh et al. (2016), 22 and the most experienced radiologist in Rigaud et al. (2022). 23 Sensitivity analyses were also conducted using the excluded groups from the Gemici et al. (2020), Eom et al. (2018), and Singh et al. (2016) studies.
For each study, the proportion classed as dense from using Volpara was compared with the proportion classed as dense by manual measurement. The difference in proportion and the associated standard error is used in a random effects meta-analysis using restricted maximum likelihood. Heterogeneity was assessed using the I2 statistic. The meta-analysis was conducted using the meta suite of Stata19. 24
Results
The literature searches identified 1032 citations, and 215 full-text reports were selected for eligibility assessment. Five reports were unavailable. We attempted to contact the corresponding authors of 28 reports where it was unclear whether the report included: (a) eligible population (n = 25),25–49 (b) eligible mammography (n = 2),50,51 or (c) was a secondary report of a participant sample in another included study (n = 1). 52 We received replies from five authors and were subsequently able to include two studies.28,51
We excluded 11 studies53–63 that used mammograms from databases of digitised film mammography, specifically the digital database of screening mammography 64 and the mammographic image analysis society databases. 65
In total, 26 reports were included. Details of the screening process are presented in Figure 1 and the list of excluded studies with the main reasons for exclusion is presented in Appendix 2.

PRISMA flow diagram of the screening process. PRISMA: Preferred Reporting Items for Systematic Reviews.
Characteristics of the included studies
The characteristics of the included studies are detailed in Appendix 3. The included studies were conducted in Europe (n = 9: Sweden [n = 5],51,66–69 the Netherlands [n = 2],70,71 France [n = 1], 72 Norway [n = 1]); 73 the USA (n = 4);23,28,74,75 the Republic of Korea (n = 4);21,76–78 Peru (n = 2);79,80 and one study each was conducted in Argentina, 81 Australia, 82 Brazil, 83 India, 22 Saudia Arabia, 84 and Turkey. 20 Finally, one study was done in both the UK and USA. 85
The studies varied in whether they reported their units of analysis as the number of participants (total across studies, n = 29,784) or the number of mammograms (total across studies, n = 16,194). The youngest reported mean age of participants was 48.8 years 22 and the oldest was 58.8 years. 70 Only one study reported the ethnicity of the participants. 78 None of the studies reported details of the socioeconomic status or transgender characteristics of the participants.
Five studies51,66–69 obtained mammograms from the Mälmo breast tomosynthesis screening trial. 86 It is unclear whether any participant overlap exists between these studies, but each study evaluated different automated software. Similarly, while it is unclear whether there is participant overlap between two studies conducted in Peru, both studies evaluated different automated software. We believe there are no concerns about duplication in these studies. Four studies explicitly reported evaluating raw (for processing/pre-processed) images;66–68,79 however, most studies did not specify whether they evaluated raw or processed (for presentation) images.
Overall, the included studies were of moderate quality even though many items of the ReBIP checklist items were rated as unclear due to insufficient reporting by study authors. The results of the study-level quality assessment are provided in Appendix 4. The authors of 11 studies either developed the automated software or had associations with the manufacturers of the automated software (see Appendix 3 for details).51,66–71,74,79,80,83
Overall concordance findings
Full-length results tables are provided in Appendix 5. Eight studies evaluated Volpara software,20,21,68,70,75,76,84 two studies evaluated Quantra,73,82 two evaluated Volpara and Quantra in the same study,71,77 and three studies evaluated Volpara and other automated/semi-automated algorithms (Cumulus Hand Delineation and ImageJ software, 85 EfficientNetB0 deep learning software 23 and AI-CAD Lunit INSIGHT MMG). 78 The findings of these studies are reported here. The other 11 studies evaluated various other individual automated software systems, and because of this heterogeneity their findings are not reported here but are summarised in Appendix 6.28,51,66,67,69,72,74,79–81,83
In some studies, more than one agreement statistic was used. Across all the related studies, three general types of agreement were reported: kappa statistics, correlation coefficients, and percentage agreement. Nearly 70% of the reported methods are kappa with over 80% either linear or quadratic weighted Cohen's kappa. All studies included participants with a full range of breast density categories, so low prevalence in either dense or non-dense classifications was not an issue. Even in studies using a four-level density agreement – where categories A and D had fewer participants – there were sufficient participants in categories B and C to avoid the need for a prevalence-adjusted Kappa. We are, therefore, satisfied that the studies used appropriate methods to measure agreement for ordinal rating categories between different reviewers. Where studies had more than two reviewers, Fleis Kappa was used to account for multiple raters.
Volpara software versus manual assessment
The results of the 13 studies that compared Volpara software with manual density assessment using BI-RADs are summarised in Table 1.20–23,68,70,71,75–78,84,85 The studies evaluated 15 versions of Volpara compared against BI-RADS 4th and BI-RADS 5th editions. Two studies did not report the BI-RADS edition.21,23 The different versions of Volpara and BI-RADS editions were considered as suitably similar for combining in our meta-analysis. The studies evaluated mammograms obtained using Hologic (n = 5);20,23,70,71,77 GE Healthcare (n = 4),21,75,76,78 Siemens AG (n = 1), 68 and Phillips (n = 1) 22 systems. Two studies did not report the mammography system.84,85
Summary of results of the studies evaluating Volpara versus manual assessment of mammographic breast density.
Source for the definitions of agreement interpretations: Kappa statistic, Altman (1999); 88 Spearman's correlation coefficient, Cohen (1988); 89 it is unclear whether data are reported for four-way density or non-dense versus dense comparison.
AUC: area under the curve; FFDM: full field digital mammography; κ: Kappa statistic; NR: not reported; ρ: Spearman's rank correlation coefficient; SE: standard error; SNBCSP: Saudi National Breast Cancer Screening Programme; V: Volpara; BI-RADS: breast imaging reporting and data system.
Concordance ranged from Kappa −0.40 to 0.83. Most studies (53.8%)21,22,70,71,75,76,85 showed good agreement between Volpara and manual assessment with BI-RADS, both for categorising mammograms into the four density and non-dense/dense categories, although one of these studies by Alomaim et al. (2020) 85 showed only moderate agreement for mammograms that did not contain image distractors.
The study by Eom et al. (2018) found very good agreement between Volpara Version 1.5.12 and visual assessment by expert radiologists for classifying mammograms into dense and non-dense categories, although this reduced to good agreement for measuring the four density categories, and the agreement between Volpara and general radiologists was good for both the four-way and two-way density classifications in this study. 21
Four studies showed moderate agreement between Volpara and BI-RADS although, of these, the study by Aloufi et al. (2022) found only fair agreement for categorising mammograms into the four density categories compared with moderate agreement for the dense/non-dense categories. 84 Two studies by Rigaud et al. (2022) 23 and Singh et al. (2016) 22 showed only fair agreement and the study by Gemici et al. (2020) showed poor agreement. 20 The version of Volpara software, BI-RADS edition, and the type of mammography system used in the studies were not consistently associated with the strength of agreement between the Volpara and visual density assessments.
The meta-analysis comparing the density categorisation of Volpara and manual assessments is shown in Figure 2. This shows a slightly higher categorisation as dense than non-dense from Volpara in comparison to manual classification. In Gemici et al. (2020), 20 two versions of Volpara were used and the most recent was used in our meta-analysis. In Eom et al. (2018) 21 and Singh et al. (2016), 22 there were two control groups and we chose breast imaging experts as the control group for Eom et al. (2018) and used observer 1 as the control group for Singh et al. (2016). The sensitivity analyses, using the alternative groups from these three studies, are all consistent with overall differences of 0.03 (95% CI −0.04, 0.10), 0.03 (95% CI −0.03, 0.09), and 0.04 (95% CI −0.03, 0.11) in comparison to the overall difference in the meta-analysis of 0.03 (95% CI −0.03, 0.10). There are no dominant or small studies in the meta-analysis, with the weights being between 7.3% and 9.6%. However, the I2 statistic value of 97.8% indicates considerable statistical heterogeneity between the studies.

Meta-analysis of Volpara versus manual assessment of mammographic breast density. CI: confidence interval.
Quantra software versus manual assessment
The results of the four studies that compared Quantra software with manual assessment are summarised in Table 2. All studies used Selenia (Hologic) mammography systems. Concordance ranged from Kappa 0.54 to 0.84. Two studies by van der Waal et al. (2015) 71 and Ekpo et al. (2016) 82 showed excellent or very good agreement between Quantra and BI-RADS density assessments, although this reduced to good agreement for the classification of the four density categories in the latter study. The study by Osteras et al. (2016) 73 showed good agreement, while the study by Youk et al. (2021) 77 showed moderate agreement between Quantra and BI-RADS assessments. As with the evaluations of Volpara software, the different Quantra versions and BI-RADS editions were not consistently associated with the strength of agreement between the different methods of density assessment. We were unable to analyse the proportions of mammograms classified by the different density categories because data for both Quantra and BI-RADs assessments were only available for the study by Youk et al. (2021). 77
Summary of results of the studies evaluating quantra versus manual assessment of mammographic breast density.
Source for the definitions of agreement interpretations: Kappa statistic, Altman (1999); 88 AUC statistic, Hosmer et al. (2013). 90
AUC: area under the curve; FFDM: full field digital mammography; κ: Kappa statistic; NR: not reported; RANZCR: Royal Australian and New Zealand College of Radiology; SE: standard error; BI-RADS: breast imaging reporting and data system.
Discussion
This evidence synthesis includes 26 studies that evaluated the concordance between automated/semi-automated and manual assessment of breast density for 2D digital mammography published during the last decade. However, these findings must be considered against a back-drop of advances in artificial intelligence that are accelerating, leading to automated systems for measuring breast density that are evolving rapidly (see below). 91 This means that some currently used and future software may supersede those examined in this review. Nevertheless, to the best of our knowledge, this review provides the most comprehensive and up-to-date synthesis of available evidence on agreement between automated and manual assessments of breast density in routine screening populations.
Overall, the included studies were of moderate quality, although many of the ReBIP checklist items were rated as unclear due to insufficient reporting in the full-text publications. Our findings show that, overall, there is good concordance between automated and manual assessment of breast density. Nevertheless, there is considerable variation both between automated technologies and within different versions of automated software. Robust conclusions are difficult to draw because of the small number of studies evaluating similar versions of automated software using comparable BI-RADS editions. The largest body of evidence for one type of automated software is from studies evaluating Volpara. This is unsurprising given the widespread use of Volpara in clinical practice. Our meta-analysis indicates that Volpara may be more likely to categorise mammograms as dense compared with manual assessment, but the difference was not statistically significant and the I2 statistic indicated the presence of considerable heterogeneity between studies. It should be noted that Volpara measures the volume of dense tissue, while visual assessment estimates the visible area of dense tissue; therefore, visual assessment may not capture volume information in manual ratings.
Three studies examined the impact of radiologists’ experience levels on agreement.21,70,72 Their findings indicate that the agreement between manual and automated density assessments is consistent regardless of whether the assessments were performed by senior/experienced radiologists or junior/general radiologists. This suggests that radiologists’ experience does not influence the level of agreement between manual and automated density assessments.
One study indicated that agreement between Volpara and manual density assessment is greater for mammograms that contain image distractors. 85 The authors of that study 92 noted that this finding was unexpected and indicates the need for further research to explore the impact of the image quality of mammograms for automated and manual breast density assessment. It should be noted that the results of this study were derived from a small sample of 250 mammograms making it difficult to draw firm conclusions.
A 2024 multi-society paper, presenting the views of Radiology Societies in the USA, Canada, Europe, Australia, and New Zealand, highlights that automation bias (the tendency of humans to favour AI-generated decisions over those made by humans) can lead to errors if the AI system is incorrect. 9 This risk may increase when radiologists are fatigued or when there is limited capacity to supervise or validate the AI output. Automation bias has been evaluated in other areas of clinical decision support and highlights the risk of errors if clinicians are over-reliant or uncritical of automated decisions.92–94 This emphasises the need for specialised training of those who use automated tools as part of their clinical decision-making. Similarly, in scenarios where autonomous AI systems continue to learn and adapt over time, such automated systems require ongoing monitoring to ensure their performance remains satisfactory. This might also apply to situations where new versions of automated software are released. The implementation of automated technology in radiological clinical practice would need to take into account any associated training, strategic, regulatory, performance, technical, IT infrastructure, or economic considerations, including the ability to implement at scale across different NHS trusts and regions.
Limitations
Overall, there is a paucity of evidence evaluating similar versions of automated/semi-automated software against manual breast density assessment using similar editions of BI-RADS, making it difficult to draw firm conclusions on the concordance of automated software with manual assessment, particularly for newer versions or less common software. While we tried to ensure that the included studies are representative of the UK general breast cancer screening population, confirming the eligibility of study populations was challenging because the term ‘screening’ is often used in different ways by study authors. For example, some use it to refer to imaging for breast cancer detection in the general screening population, while others use it when imaging is used for surveillance to detect recurrent or second primary breast cancer. Consequently, it is possible that some included studies may have study populations with fewer than our predefined 60% general screening participants eligibility criterion. It is also possible that some relevant studies have been excluded because we were unable to establish the screening characteristics of their populations.
The applicability of our findings to minority ethnic and other under-served groups is uncertain because of the poor reporting of participants’ ethnic and socioeconomic characteristics in the included studies. This lack of information could be problematic if the automated technologies were trained on datasets that excluded select groups. It is, therefore, unclear whether our findings are truly representative of a broad screening population, highlighting the need for more inclusive research and transparent reporting. Future research studies should provide clearer descriptions of the novel aspects of the considered technologies and characteristics of the participants, as well as consider the broader implications of the introduction of automated tools within the healthcare system.
Implications for policy and practice
Automated breast density measurement tools show promise for integration into breast screening programmes. However, concerns about the generalisability of the current evidence base, together with the practical challenges of implementing new systems in clinical practice, currently preclude firm policy recommendations. The finding that Volpara tended to classify more women as having dense breasts may also have important resource implications if adopted in practice. In the UK, the NSC has established an expert breast cancer working group to consider new and emerging evidence and developments that could improve breast screening programmes. The group has indicated support for future modelling work examining the clinical impact and costs of incorporating breast density into screening pathways, alongside new assessments for breast cancer risk, including AI-based methodologies. Therefore, breast density is being actively considered within ongoing UK NSC work, evaluating its clinical and cost implications as part of future risk-based screening approaches.
Conclusions
Automated breast density measurement tools such as Volpara and Quantra show good agreement with manual BI-RADS classification and hold promise for integration into population breast screening programmes. However, heterogeneity between software types and versions, limited comparability across studies, under-reporting of participant demographics, and the rapid development of AI-based technologies constrain the generalisability of these findings.
Supplemental Material
sj-docx-1-msc-10.1177_09691413261447057 - Supplemental material for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review
Supplemental material, sj-docx-1-msc-10.1177_09691413261447057 for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review by Clare Robertson, David Cooper, Sinéad N Duggan, Paul Manson, Mari Imamura, Rodolfo Hernández, Mike Clarke, Shaun Treweek and Miriam Brazzelli in Journal of Medical Screening
Supplemental Material
sj-docx-2-msc-10.1177_09691413261447057 - Supplemental material for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review
Supplemental material, sj-docx-2-msc-10.1177_09691413261447057 for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review by Clare Robertson, David Cooper, Sinéad N Duggan, Paul Manson, Mari Imamura, Rodolfo Hernández, Mike Clarke, Shaun Treweek and Miriam Brazzelli in Journal of Medical Screening
Supplemental Material
sj-docx-3-msc-10.1177_09691413261447057 - Supplemental material for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review
Supplemental material, sj-docx-3-msc-10.1177_09691413261447057 for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review by Clare Robertson, David Cooper, Sinéad N Duggan, Paul Manson, Mari Imamura, Rodolfo Hernández, Mike Clarke, Shaun Treweek and Miriam Brazzelli in Journal of Medical Screening
Supplemental Material
sj-docx-4-msc-10.1177_09691413261447057 - Supplemental material for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review
Supplemental material, sj-docx-4-msc-10.1177_09691413261447057 for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review by Clare Robertson, David Cooper, Sinéad N Duggan, Paul Manson, Mari Imamura, Rodolfo Hernández, Mike Clarke, Shaun Treweek and Miriam Brazzelli in Journal of Medical Screening
Supplemental Material
sj-docx-5-msc-10.1177_09691413261447057 - Supplemental material for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review
Supplemental material, sj-docx-5-msc-10.1177_09691413261447057 for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review by Clare Robertson, David Cooper, Sinéad N Duggan, Paul Manson, Mari Imamura, Rodolfo Hernández, Mike Clarke, Shaun Treweek and Miriam Brazzelli in Journal of Medical Screening
Supplemental Material
sj-docx-6-msc-10.1177_09691413261447057 - Supplemental material for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review
Supplemental material, sj-docx-6-msc-10.1177_09691413261447057 for Concordance between automated/semi-automated measurement and manual assessment of mammographic breast density in individuals undergoing breast cancer screening: A systematic review by Clare Robertson, David Cooper, Sinéad N Duggan, Paul Manson, Mari Imamura, Rodolfo Hernández, Mike Clarke, Shaun Treweek and Miriam Brazzelli in Journal of Medical Screening
Footnotes
Acknowledgements
We are grateful to the following members of the Advisory group: Lesley Anderson (Aberdeen Centre for Health Data Science, University of Aberdeen), Ruth Burns (Patient Partner), Debra Dulake (Patient Partner), John Marshall (National Screening Committee), Cristina Visintin (National Screening Committee), and Shantini Paranjothy (Public Health, NHS Grampian).
We are grateful to the authors of included studies who kindly responded to our enquiries regarding specific aspects of their work.
ORCID iDs
Author contributions
MB, MC, SD, PM, CR, and ST contributed to the development of the protocol. CR conducted the review (screening of search results, data extraction and quality assessment). SND participated in the screening of search results. MI conducted the data extraction check. DC conducted the statistical analyses. CR drafted the initial version of this manuscript. PM conducted the literature search and contributed to writing the manuscript. All authors contributed to the interpretation of the data, writing the manuscript and had the opportunity to approve the final version manuscript.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This review was funded by the National Institute for Health and Care Research (NIHR) Evidence Synthesis Programme (Project number NIHR164221) and commissioned by the United Kingdom National Screening Committee (UK NSC). The UK NSC provides screening recommendations to the UK government, based on a range of criteria, including a consistent approach to the identification and synthesis of relevant literature. The UK NSC and NIHR have recently agreed on a formal collaboration to ensure the quality of evidence reports. As the commissioners of this work, a report of the findings was submitted to the UK NSC to inform their evidence base. The NIHR Evidence Synthesis Programme had no role in the collection, analysis, or interpretation of the data; in the writing of the report; or in the decision to submit the article for publication. The UK NSC, as commissioners of the work, participated in the Advisory Group and provided expert input on study design, data interpretation, and the current screening pathway. However, they had no role in writing this manuscript, formulating the conclusions, or deciding to submit it for a journal publication.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data availability
All the data upon which the review is based are available in the supplementary files (Appendices 1-6).
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
