Assessment of the value of polygenic risk scores in the prevention of disease

Abstract

It is claimed that polygenic risk scores will transform disease prevention, but a typical polygenic risk score for a common disease only detects 11% of affected individuals at a 5% false positive rate. This level of screening performance is not useful. Claims to the contrary are either due to incorrect interpretation of the data or other influences. Implementation of polygenic risk scores would divert resources from population-wide approaches that address the major disease burden in the average-risk majority to the follow-up of the many false positive results in those designated at high polygenic risk.

Keywords

Prediction screening risk stratification polygenic risk score genomics

Interest in polygenic risk scores

Genome-wide association studies (GWAS) of common diseases such as myocardial infarction, stroke and cancers have identified tens of thousands of common DNA sequence variants (mainly single nucleotide polymorphisms, SNPs) that influence disease risk.¹ Attention has been turning to how these discoveries might be used to improve healthcare.²

One avenue is to map disease-associated SNPs to the causal genes and encoded proteins to cast light on disease mechanisms and inform the development of new medicines.^3–6 Another is to apply the new knowledge to disease prediction and prevention through the calculation of polygenic risk scores.^7–9 A polygenic risk score is calculated for an individual as the weighted sum of independent DNA sequence variants present in their genome that influence the risk of a particular disease. Polygenic risk scores can be readily generated because a person's DNA can be extracted from blood or saliva, and common DNA sequence variation assayed using microarrays or, as costs fall, by whole genome sequencing technology.

The perceived appeal of polygenic risk scores

Academia, industry and policymakers have expressed enthusiasm for the introduction of polygenic risk scores into healthcare. Academics appear motivated by the desire to find applications for genomic discoveries¹⁰; industry by the opportunity to sell polygenic risk score services as consumer tests^2,11,12; and policymakers by the possibility of using polygenic risk scores as a tool for population-wide disease prevention,^13,14 to alleviate the growing demand on health systems from the management of chronic diseases.

Appealing features of polygenic risk scores include ‘one-off’ measurement at any time from conception, since the DNA sequence of the germline does not change; a single test technology to calculate polygenic risk scores for many different diseases; the low cost of microarray-based genotyping; and the potential for polygenic risk scores to provide information that is independent from non-genetic risk factors.² However, these appealing features are incidental unless polygenic risk scores are useful tests.

Construction of polygenic risk scores and their screening performance

Researchers construct polygenic risk scores using SNPs whose disease associations surpass a pre-specified statistical significance threshold.^15,16 Those SNPs that are highly correlated with the most significant (‘lead’) SNP within a region of the genome are either eliminated or down weighted. Depending on the criteria used for SNP selection, a polygenic risk score can include from a few hundred to a few million SNPs.

Screening is defined as ‘the systematic application of a test or enquiry to identify individuals at sufficient risk of a specific disorder to benefit from further investigation or direct preventive action, among persons who have not sought medical attention on account of symptoms of that disorder’.¹⁷ The screening performance of a test is evaluated using the detection rate (DR) and false positive rate (FPR), which are the percentage of people with a test value above a particular cut-off (‘a positive test’) among those who are later affected or unaffected, respectively.¹⁷ A simple summary measure of screening performance that has been described is the DR for a 5% FPR (DR₅), that is, a test cut-off at the 95th percentile of the unaffected distribution.¹⁸ A DR₅ of 5% is useless, a DR₅ of 15% is a poor test and a DR₅ of 80% is a good test, as shown in Figure 1(a). If the distributions of test values in affected and unaffected groups are Gaussian with the same standard deviation (SD), screening performance is determined solely by the difference between the group means expressed in SD units (z-score). Overlapping distributions producing a DR₅ of 80% are separated by +2.5 SD units (Figure 1(b)).

Figure 1.

Overlapping distribution of test results in affected and unaffected individuals for a hypothetical screening test with a DR₅ of 80% (a), showing that the mean value of test results in the two groups are separated by 2.5 SD units (b).

Polygenic risk scores display a Gaussian distribution with the same SD in affected and unaffected groups.¹⁹ Figure 2 shows that the median DR₅ value for polygenic risk scores for 28 common diseases deposited in the PGS Catalog as of April 2022 was 11%, indicative of poor screening.¹⁹ Figure 3 shows that a DR₅ of 11% corresponds to a difference in affected and unaffected group distributions of +0.4SD units, much less than the +2.5SD unit separation seen for a good screening test with a DR₅ of 80%. A polygenic risk score with a DR₅ of 11% misses 89% of affected individuals. When applied to a population with a background odds of disease of 1: 9 (10% risk), such a score yields an odds of becoming affected given a positive result of 0.11 × 1 : 0.05 × 9 = 1 : 4 (a positive predictive value of 20%), which is too low to be useful. It is unlikely that more recent polygenic risk scores deposited in the PGS Catalog data after April 2022 will materially alter estimates of screening performance because the major risk loci of largest effect are discovered early²⁰ and, as exemplified below for coronary artery disease (CAD), results of older and newer polygenic risk score studies provide similar results.

Figure 2.

Screening performance of polygenic risk scores for 28 common diseases (adapted from ref¹⁹). The scores analysed were deposited in the Polygenic Score Catalog up to April 2022. The horizontal line within each box is the median estimated DR₅ (%) and the limits of each box represent interquartile range; n = number of studies for each disease.

Figure 3.

Polygenic risk scores have a Gaussian distribution, with the distribution in affected individuals having the same standard deviation but being shifted to the right with respect to the unaffected individuals. In the typical example shown, the mean value in the affected group lies 0.4SD units to the right of the mean value in the affected group, which produces a DR₅ of 11%.

Misleading screening performance measures

The poor performance of polygenic risk scores in screening is not widely appreciated because the DR and FPR are almost never disclosed in research papers, and nor do they appear in the Polygenic Score Catalog, a regularly updated repository of polygenic risk scores.²¹ Instead, many papers report (a) the odds ratio (OR) for a 1-SD increment in the score (OR/SD), (b) the OR comparing the highest with the lowest 20% of the polygenic risk score distribution, or (c) the area under the receiver operating characteristic curve (AUC). For a test with a DR₅ of 80%, which represents a good screening performance, the corresponding values for these measures are (a) 12, (b) 2284 and (c) 0.96, respectively (as shown in Figure 4). The screening performance of polygenic risk scores falls well short of this. The corresponding values of these measures for polygenic scores with a median DR₅ of 11% are (a) 1.6, (b) 3 and (c) 0.62, respectively (as also shown in Figure 4).

Figure 4.

Relationship between the DR₅ and commonly reported polygenic risk score performance measures: (a) OR per SD; (b) OR comparing the highest and lowest 20% of a polygenic risk score distribution; and (c) AUC. Values of each performance measure are shown that correspond to a DR₅ of 11% (the median DR₅ value of polygenic risk scores in the PGS Catalog as of April 2022) and, as a benchmark, values that correspond to a DR₅ of 80%.

Use of misleading performance measures has also led to misinterpretation of the scale of any performance improvement when a polygenic risk score is updated. For example, a polygenic risk score for CAD reported in 2023 (referred to here as GPS Mult2023) used information from the latest GWAS of >269,000 cases and >1,178,000 controls arising from multiple ancestries, and incorporated SNPs for 10 CAD risk factors including blood pressure and LDL-cholesterol.²² The authors concluded that this 2023 score with an OR/SD of 1.73 exhibited a ‘significant’ improvement over their previously published 2018 score which had an OR/SD of 1.49.²³ Figure 5 shows this difference equates to a trivial improvement in the DR₅ from 11% to 14%. Similarly, Genomics Ltd reported that their ‘enhanced’ polygenic risk scores for common diseases outperformed almost all comparator scores.²⁴ But the reported improvements in OR/SD values equated to negligible improvement in the DR₅ of around 2%, for example from 8% to 10% for cardiovascular disease, and 13% to 15% for breast cancer.

Figure 5.

Negligible improvement in the performance of polygenic risk scores for coronary artery disease between 2018 and 2023. (a) DR₅ = 11% for GPS 2018 based on the reported OR/SD = 1.49 and (b) DR₅ = 14% for GPS Mult 2023 based on the reported OR/SD = 1.73.²²

Misleading graphical displays of screening performance

Common graphical displays seen in research papers exaggerate the screening performance of polygenic risk scores. Figure 6(a) (taken from Patel et al.²²) shows a common display type in which the risk of CAD is plotted on the vertical axis using an arithmetic scale, with the percentile of the polygenic risk score GPS Mult2023 on the horizontal scale. Use of an arithmetic y-axis scale produces a sharp uptick in CAD risk above the 95^th centile of the polygenic risk score distribution, as if identifying a distinct high-risk group. It is claimed that this apparently distinct group have a CAD risk as high as carriers of a mutation causing monogenic familial hypercholesterolaemia (FH).^22,23

Figure 6.

Performance of the same polygenic risk score (GPS Mult2023) depicted using different choices for axes and scales. Image (a) is adapted from the original publication²² and (b) The likelihood ratio and prevalence of CAD using a logarithmic scale on the vertical axis, and the polygenic risk score centile using an SD unit (Z-score) scale for the horizontal axis.

Figure 6(b) also displays the performance of GPS Mult2023 but using the likelihood ratio on a logarithmic (doubling) scale for the vertical axis and an SD (z-score) scale for the horizontal axis. Multiplying the likelihood ratio by the background population odds of disease and converting to the risk scale returns the corresponding disease risk (as shown in Figure 6(b)). This plot produces a straight-line relationship without any indication of an uptick in risk. A logarithmic vertical axis scale is appropriate because disease risks that are, for example, 4 and ¼ times the population average should be located equidistant from the average population risk. The straight-line relationship shows that the same arithmetic difference in the polygenic risk score produces the same proportional difference in disease risk across the whole range of values without a threshold: a log-linear relationship. The apparently sharp rise in risk in the original plot is seen to be an artefact of the choice of axis and scaling: an arithmetic risk scale compresses values <1 and expands values above 1. People in the top 5% of this polygenic risk score distribution do have a 3-fold higher risk of CAD compared to the remainder of the population, but this is much less than the 15-fold relative risk of CAD among FH mutation carriers up to age 50. The claim of a risk equivalence applies only to those FH mutation carriers who have survived beyond age 60 without a prior CAD event.^25,26

Performance of polygenic risk scores in stratification and sequential screening

Stratification is a special case of screening which involves using more than one test cut-off to segment a population into groups of differing risk.²⁷ Such information might help tailor the type or intensity of a preventive intervention, or the timing or frequency of a definitive but costly screening test. Figure 7(a), which is based on a figure on the Genomics Ltd website,¹² shows a typically used display intended to illustrate effective population stratification for a breast cancer polygenic risk score developed by the company.²⁸ The plot shows breast cancer incidence by age for the highest and lowest 3% of a breast cancer polygenic risk score distribution, with the middle 20% of the distribution being used as a reference.

Figure 7.

Proportion of affected individuals by strata of a breast cancer polygenic risk score. (a) The incidence of diagnosed breast cancer by age for the highest and lowest 3% of the polygenic risk score distribution, with the middle 20% of the distribution used as a reference group (modified and redrawn from a figure in ref²⁸). (b) The proportion of affected individuals in each of the strata depicted and omitted from (a). (c) The overlapping distributions and DR₅ value for the same breast cancer polygenic risk score.

This plot omits information on 74% (100 − [3 + 3 + 20] %) of the population. It also provides no information on the proportion of affected individuals in each polygenic risk score category. Figure 7(b) uses the published performance of the Genomics Ltd breast cancer polygenic risk score in Europeans²⁹ to show that the middle 20% of the distribution (the reference group) contribute 17% of affected individuals, more than the highest and lowest 3% of the distribution, which contribute 10% and 1% of affected individuals, respectively. Those with a polygenic risk score between the lowest 3% and the middle 20% contribute 19% of affected individuals, and those with a polygenic risk score between the middle 20% and highest 3% contribute 53% of affected individuals. Neither of the latter two groups, which contribute most of the cases, was shown in the original plot.

Where a risk factor has a Gaussian distribution and displays a log-linear relationship with disease risk, more cases arise among the majority with near average risk factor values than among the few with more extreme values – the ‘prevention paradox’.^30,31 This is made clear in Figure 7(c), which shows that the overlapping distributions for affected and unaffected individuals for the same breast cancer polygenic risk score, corresponding to a DR₅ of 15. This re-analysis shows why polygenic risk scores such as this perform poorly in stratification as well as in screening: high-risk groups contain more false than true positives and the greater proportion of affected individuals occur among those designated as average risk.

The plot in Figure 7(a) has also been used to argue that women with a high polygenic risk score should be offered mammographic screening a decade earlier than is routine because they achieve a similar breast cancer risk at around age 40 as an average woman at age 50, the age at which mammography is currently offered to all women. Table 1 shows data from Huntley et al., who used UK demographic and cause-specific cancer incidence data to estimate the outcome of breast cancer screening in younger women (aged 40–49) who have a breast cancer polygenic risk score in the highest 20% of the distribution.³² There are around 4 million women in the this decade, below the current screening age of 50, among whom 7533 breast cancers were estimated to arise annually with 693 breast cancer deaths. All 4 million women would need to be genotyped to identify the 900,000 or so in the top 20% of the polygenic risk score distribution. Huntley et al. estimated that 2811 cancers would occur in this group of which 1968 would be detected at mammography with an estimated 102 deaths averted by subsequent intervention.³² However, Huntley et al. also showed that an alternative approach of simply screening the highest 20% by age (in effect reducing the universal screening age from 50 to 48 years) would detect a similar number of additional breast cancers and avert a similar number of deaths (Table 1). This simpler alternative removes the need to genotype, analyse and interpret polygenic risk scores from 4 million women while achieving almost the same additional benefit.

Table 1.

Comparison of top PRS quintile-based or top-age quintile-based sequential breast cancer screening for women aged 40–49.

	Population offered screening
Age	40–49
Number	4,369,703
Breast cancers annually	7533
Breast cancer deaths	694

Mammography offered to	PRS top quintile	Oldest quintile (ages 48–49)
Number	873,941	937,850
Cancers in the top quintile	2811	2198
Cancers detected	1968	1538
Deaths averted (intervention)	102	80

Data are taken from Huntley et al.³²

Polygenic risk scores used in conjunction with non-genetic screening tests

Proponents argue that polygenic risk scores should not be used as stand-alone tests but should be incorporated into risk models that include clinical variables, as part of established screening pathways.

A pilot study in the UK of 836 people aged 45–64 years attending an NHS Health Check, funded by Genomics Ltd, evaluated the feasibility of adding a polygenic risk score to the QRISK model based on conventional cardiovascular risk factors that is currently used in primary prevention.³³ Re-analysis of the data in this publication shows that inclusion of a polygenic risk score in the QRISK model made no difference to statin eligibility for 90% of participants evaluated. In 5% of the participants, 10-year risk of cardiovascular disease went from below 10% using QRISK alone to above 10% after the addition of a polygenic risk score, rendering this group eligible for statin treatment. However, the up classification of statin eligibility in this group was exactly offset by a downgrading of risk in the remaining 5% of participants, resulting in no overall change in the proportion of people eligible for statins from the addition of a polygenic risk score to QRISK.

Sun et al. claimed that adding a polygenic risk score to conventional cardiovascular risk factors produced worthwhile improvement in the prediction and prevention of CAD and stroke.³⁴ However, this is not borne out by a re-analysis of their published data.¹⁹ In Sun et al., a conventional risk factor model with a 10% 10-year risk cut-off detected 60% of those later affected by CAD or stroke at a 24% FPR (DR₂₄ = 60%). The addition of a polygenic risk score produced a negligible increase in DR to 61% at a 23% FPR (DR₂₃ = 61%). Assuming statins were prescribed to all those with a 10-year risk exceeding 10%, 100% adherence to treatment, and adopting the authors’ assumption that statins reduce the risk of CAD and stroke by 20%, 974 events would be prevented per 100,000 people screened using a model based on conventional risk factors together with polygenic risk scores instead of 957 using a conventional risk factor model with no genetic information; a gain of 17 cases. This gives a number needed-to-genotype to prevent one additional event of 5882. A re-analysis of similar data published by Genomics Ltd revealed consistent findings.³⁵ The QRISK3 model based on conventional risk factors, using the same 10% 10-year risk cut-off as used by Sun et al., detected 81% of affected individuals at a 42% FPR (DR₄₂ = 81%) in UK Biobank. Addition of their polygenic risk score to the model detected 84% of affected individuals with a 41% FPR (DR₄₁ = 84%). With the assumption that statins reduce CAD and stroke events by 20%, the number needed-to-genotype to prevent one additional event based on this study was 8879.

These results should not be surprising. An independent risk factor that performs poorly on its own will also perform poorly when incorporated into a risk model together with the other risk factors. A simpler approach to cardiovascular prevention is to use age as the sole screening test. Age is the major determinant of CAD and stroke risk and performs about as well as multi-factor risk models that include age.³⁶ Offering statins and low-dose blood pressure lowering medications in combination to all those without contraindications above the age of 50 has been estimated to prevent 60% of heart attacks and strokes assuming complete adherence, without the requirement for risk assessment or genetic testing.^37,38

Polygenic risk scores in individual risk prediction and as direct-to-consumer tests

Some cohort studies (e.g. the EMERGE Consortium in the US and the Our Future Health Study³⁹ in the UK) aim to return polygenic risk score results to study participants. The EMERGE Consortium returns results for eight diseases (asthma, atrial fibrillation, breast cancer in women and prostate cancer in men, coronary heart disease, chronic kidney disease, and type 1 and type 2 diabetes). For six of eight diseases, the EMERGE Consortium genetic report designates a participant as ‘high’ or ‘average’ risk.⁴⁰ The proportion of the population designated high risk differs by disease. For example, 10% of the population are designated high risk for prostate cancer with a 4-fold higher risk than the remaining 90% of the population who are labelled ‘average’ risk. This equates to a test with a 26% DR for a 10% FPR, very poor discrimination. The EMERGE consortium designates only 2% of the population as high risk for type 2 diabetes based on a 4-fold risk compared to the remainder of the population which equates to a DR of 6% for a 2% FPR, missing 94% of cases. No absolute measure of risk is provided to participants for these conditions. Absolute risk estimates are provided to participants only for breast cancer and coronary heart disease as part of an integrated risk model including non-genetic risk factors.

Companies are developing polygenic risk scores as a direct-to-consumer test.¹¹ In the UK, the House of Commons Science and Technology Committee 2021 report entitled Direct-to-Consumer Genetic Testing⁴¹ called for greater regulation of such tests, strict clinical performance requirements, independent validation of test performance, and closer scrutiny of the information provided by companies to consumers. The Committee also raised concerns about increased pressure on the publicly funded National Health Service (NHS) that might be generated by consumers seeking advice and follow-up of private sector tests. Unfounded claims of the accuracy and benefits of direct-to-consumer tests in general have been highlighted by others.^42,43

Since the publication of the House of Commons Committee report, a further concern has emerged about the apparent instability of a polygenic risk score result for an individual. Abramovitz and colleagues tested 46 different polygenic risk scores for CAD in 170,000 participants from the All of Us study.⁴⁴ They found that all the scores tested performed consistently poorly at group level (DR₅ values were in the range 6–10%), but highly inconsistently for any individual. The same individual could have a CAD polygenic risk score result as far apart as the 5^th or 95^th centile, depending on the polygenic risk score used. The source of this inconsistency is unresolved but may relate to the practice of including millions of genetic variants in polygenic risk scores, most with tiny effect sizes and statistical significance below the threshold applied in GWAS to declare aetiological association. The inclusion or exclusion of such SNPs may be close to random, adding more noise than signal that varies from score to score. Whatever the explanation for the within-individual variability of results from different polygenic risk scores for the same disease, the observation undermines confidence in the use of polygenic risk scores for individual risk assessment.

Reflections on the state of the field

Many thousands of research papers have been published on polygenic risk scores in prediction of common diseases. Almost all report the same misleading measures and graphical displays of performance and mistakenly conclude that polygenic risk scores represent an advance in disease prevention.

It is over several decades that the correct analysis of potential screening tests has been described⁴⁵ and over 5 years since this was first applied to polygenic risk scores showing their poor screening performance.⁴⁶ ORs comparing top and bottom quintile groups of 10, 100, 1000 and 10,000 yield DR₅ values of about 20%, 50%, 75% and 90% respectively. Polygenic risk scores do not even achieve an OR of 10. The freely available Risk Screening Converter produces DR and FPR values from OR comparisons for quintile groups, the OR/SD and AUC.^47,48 Yet this knowledge and this resource have largely been ignored by researchers in the field.

Worse, there is evidence of a failure to cite key papers (e.g. references^46,49–53) and a reluctance to publish papers that question the value of polygenic risk scores in prevention. The BMJ Medicine paper¹⁹ that demonstrated the poor performance of over 900 polygenic risk scores for 300 diseases in screening, risk stratification and individual risk prediction has an Altmetric attention score of 1296 (in the top 5% of all research articles) but was rejected by seven journals before eventual publication nearly two years after initial submission. Five rejections were without peer review because the editors did not consider the paper to be a ‘sufficient advance’. Reviewers and editors revealed misconceptions and contradictions in their feedback. The use of polygenic risk scores clearly meets the definition of screening. However, reviewers ‘raised concerns about the premise that polygenic risk scores are screening tests’. In endorsing the use of the AUC (but not the DR₅) as an appropriate performance measure, reviewers appeared to be unaware of the contradiction that the AUC is a measure of screening performance, albeit an unhelpful one, and that the DR₅ is one data point on the ROC curve which is used to derive the AUC. After publication of the BMJ Medicine paper,¹⁹ social media was used in an attempt to diminish the importance of the findings.

It is difficult to escape the conclusion that many researchers are choosing to ignore the evidence that polygenic risk scores lack value in screening and disease prediction, even acting as if such evidence did not exist, and that journal editors and reviewers are playing a part in this.

These actions have had unwarranted consequences. Opinion leaders and expert groups are now arguing for implementation of polygenic risk scores for disease prevention^9,10; over 20 commercial testing and software service providers have been established to sell polygenic risk score tests to consumers and healthcare providers; and some life and health insurers now offer polygenic risk scores to their customers.^9,54,55 In an example of policy leapfrogging evidence,¹⁴ Fit for the Future, the 10-year Health Plan for England, seeks to ‘implement universal newborn genomic testing and population based polygenic risk scoring alongside other emerging diagnostic tools’. It is claimed this will enable ‘early identification and intervention for individuals at high risk of developing common diseases’. All these initiatives are based on the false premise that polygenic risk scores are of benefit in screening, risk stratification and disease prediction despite overwhelming evidence to the contrary.

Several actions are now required. All those engaged in polygenic risk score research should reflect on their responsibilities to the scientific endeavour, when writing, reviewing and publishing papers, or commenting on social media. Policymakers should impose tighter regulation of commercial providers of polygenic risk scores to protect consumers from purchasing unhelpful or misleading genetic tests. Follow-up of ‘high-risk’ (largely false positive) results should be made the responsibility of the companies selling such tests, not already stretched public healthcare systems. Health systems should use the same established structures for evaluating polygenic risk score performance as used for non-genetic screening tests and hold them to the same standards. There should be renewed focus on simple, ‘low-tech’, untargeted, population-wide approaches to prevention that address the greater burden of disease among the average-risk majority.

Conclusion

An assessment of the evidence leads to the inescapable conclusion that polygenic risk scores are not of value in the prevention of disease.

Footnotes

Declaration of conflicting interests

The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Institute for Health and Care Research. Aroon Hingorani is an NIHR Emeritus Senior Investigator.

ORCID iD

Aroon D Hingorani

References

Visscher

Wray

Zhang

, et al.

10 years of GWAS discovery: biology, function, and translation

Am J Hum Genet 2017; 101. DOI: 10.1016/j.ajhg.2017.06.005.

Hingorani

Shah

Kumari

, et al. Science, medicine, and the future: Translating genomics into improved healthcare. BMJ (Online) 2010; 341. DOI: 10.1136/bmj.c5945.

Gill

Georgakis

Walker

, et al. Mendelian randomization for studying the effects of perturbing drug targets. Wellcome Open Res 2021; 6: 16.

Schmidt

Hingorani

Finan

. Human genomics and drug development. Cold Spring Harb Perspect Med 2022; 12. DOI: 10.1101/cshperspect.a039230.

Plenge

Scolnick

Altshuler

. Validating therapeutic targets through human genetics. Nat Rev Drug Discovery 2013. DOI: 10.1038/nrd4051.

Sofat

Hingorani

Smeeth

, et al. Separating the mechanism-based and off-target actions of cholesteryl ester transfer protein inhibitors with CETP gene polymorphisms. Circulation 2010; 121. DOI: 10.1161/CIRCULATIONAHA.109.865444.

Lambert

Abraham

Inouye

. Towards clinical utility of polygenic risk scores. Hum Mol Genet 2019; 28: 2–133.

Holmes

Harrison

Talmud

, et al. Utility of genetic determinants of lipids and cardiovascular events in assessing risk. Nat Rev Cardiol 2011. DOI: 10.1038/nrcardio.2011.6.

Torkamani

Wineinger

Topol

. The personal and clinical utility of polygenic risk scores. Nat Rev Genet 2018. DOI: 10.1038/s41576-018-0018-x.

10.

Knowles

Ashley

. Cardiovascular disease: the rise of the genetic risk score. PLoS Med 2018. DOI: 10.1371/journal.pmed.1002546.

11.

Allelica . https://eu.allelica.com/ (accessed 14 July 2025).

12.

Genomics plc . https://www.genomicsplc.com/ (accessed 5 June 2022).

13.

GOV.UK . Genome UK: the future of healthcare, https://www.gov.uk/government/publications/genome-uk-the-future-of-healthcare (accessed 14 July 2025).

14.

NHS England . Fit for the Future: 10 Year Health Plan for England, https://www.england.nhs.uk/long-term-plan/ (accessed 15 July 2025).

15.

Wray

Lin

Austin

, et al. From basic science to clinical application of polygenic risk scores: a primer. JAMA Psychiatry 2021. DOI: 10.1001/jamapsychiatry.2020.3049.

16.

Choi

Shin

Mak

, et al. A guide to performing polygenic risk score analyses. bioRxiv 2018; 5: 11–13.

17.

Wald

. Guidance on terminology. J Med Screen 2023; 30: 53–54.

18.

Wald

Hackshaw

Frost

. When can a risk factor be used as a worthwhile screening test? Br Med J 1999. DOI: 10.1136/bmj.319.7224.1562.

19.

Hingorani

Gratton

Finan

, et al. Performance of polygenic risk scores in screening, prediction, and risk stratification: secondary analysis of data in the Polygenic Score Catalog. BMJ Med 2023; 2: e000554.

20.

Zhang

Hurson

Zhang

, et al. Assessment of polygenic architecture and risk prediction based on common variants across fourteen cancers. Nat Commun 2020; 11: 3353.

21.

PGS Catalog . The Polygenic Score (PGS) Catalog, https://www.pgscatalog.org/.

22.

Patel

Wang

Ruan

, et al. A multi-ancestry polygenic risk score improves risk prediction for coronary artery disease. Nat Med 2023; 29: 1793–1803.

23.

Khera

Chaffin

Aragam

, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet 2018; 50: 1219–1224.

24.

Thompson

Wells

Selzam

, et al. A systematic evaluation of the performance and properties of the UK Biobank Polygenic Risk Score (PRS) Release. PLoS One 2024; 19: e0307270.

25.

Mundal

Igland

Veierød

, et al. Impact of age on excess risk of coronary heart disease in patients with familial hypercholesterolaemia. Heart 2018; 104: 1600–1607.

26.

Iyen

Qureshi

Weng

, et al. Sex differences in cardiovascular morbidity associated with familial hypercholesterolaemia: a retrospective cohort study of the UK Simon Broome register linked to national hospital records. Atherosclerosis 2020; 315: 131–137.

27.

Wald

Duffy

Hackshaw

. Risk stratification in medical screening. J Med Screen 2024; 31: 119–120.

28.

Polygenic Risk Scores . https://www.genomics.com/newsroom/polygenic-risk-scores.

29.

Chuong

Thompson

Weale

, et al. Preventing premature deaths through polygenic risk scores. medRxiv, 2024: 2024.12.26.24319670.

30.

Holmes

Harrison

Talmud

, et al. Utility of genetic determinants of lipids and cardiovascular events in assessing risk. Nat Rev Cardiol 2011; 8: 207–221.

31.

Rose

. Sick individuals and sick populations. Int J Epidemiol 1985. DOI: 10.1093/ije/14.1.32.

32.

Huntley

Torr

Sud

, et al. Utility of polygenic risk scores in UK cancer screening: a modelling analysis. Lancet Oncol 2023; 24: 658–668.

33.

Fuat

Adlen

Monane

, et al. A polygenic risk score added to a QRISK®2 cardiovascular disease risk calculator demonstrated robust clinical acceptance and clinical utility in the primary care setting. Eur J Prev Cardiol 2024; 31: 716–722.

34.

Sun

Pennells

Kaptoge

, et al. Polygenic risk scores in cardiovascular risk prediction: a cohort study and modelling analyses. PLoS Med 2021. DOI: 10.1371/JOURNAL.PMED.1003498.

35.

Riveros-Mckay

Weale

Moore

, et al. Integrated polygenic tool substantially enhances coronary artery disease prediction. Circ Genom Precis Med 2021. DOI: 10.1161/CIRCGEN.120.003304.

36.

Wald

Simmonds

Morris

. Screening for future cardiovascular disease using age alone compared with multiple risk factors and age. PLoS One 2011. DOI: 10.1371/journal.pone.0018742.

37.

Wald

Hingorani

Vale

, et al. Comparing screening based on the NHS Health Check and Polypill Prevention Programmes in the primary prevention of heart attacks and strokes. J Med Screen 2024; 31: 59–65.

38.

Jordan

Hingorani

Wald

. Primary prevention of heart attacks and strokes: seeking consensus on the polypill approach. Br Med J 2025; 388: r208.

39.

Our Future Health . https://ourfuturehealth.org.uk/ (accessed 20 May 2021).

40.

Lennon

Kottyan

Kachulis

, et al. Selection, optimization and validation of ten chronic disease polygenic risk scores for clinical implementation in diverse US populations. Nat Med 2024; 30: 480–487.

41.

Science and Technology Committee - House of Commons . Direct-to-consumer genomic testing, 2021. https://publications.parliament.uk/pa/cm5802/cmselect/cmsctech/94/9402.htm (accessed 15 July 2025).

42.

Horton

Crawford

Freeman

, et al. Direct-to-consumer genetic testing. Br Med J 2019. DOI: 10.1136/bmj.l5688.

43.

Lancet

. Direct-to-consumer medical testing: an industry built on fear. Lancet 2024; 404: 91.

44.

Abramowitz

Boulier

Keat

, et al. Evaluating performance and agreement of coronary heart disease polygenic risk scores. JAMA 2025; 333: 60–70.

45.

Wald

Leck

. Antenatal and neonatal screening, 2009. DOI: 10.1093/acprof:oso/9780192628268.001.0001.

46.

Wald

Old

. The illusion of polygenic disease risk prediction. Genet Med 2019. DOI: 10.1038/s41436-018-0418-5.

47.

Wald

Duffy

Hackshaw

. The risk-screening converter. J Med Screen 2023; 30: 1–2.

48.

Risk-Screening Converter . https://screening.shinyapps.io/RiskScreeningConverter/ (accessed 15 July 2025).

49.

Sud

Turnbull

Houlston

. Will polygenic risk scores for cancer ever be clinically useful? npj Precis Oncol 2021. DOI: 10.1038/s41698-021-00176-1.

50.

Sud

Horton

Hingorani

, et al. Realistic expectations are key to realising the benefits of polygenic scores. Br Med J 2023; 380. DOI: 10.1136/bmj-2022-073149.

51.

Huntley

Torr

Sud

, et al. Utility of polygenic risk scores in UK cancer screening: a modelling analysis. Lancet Oncol 2023; 24: 658–668.

52.

Groenendyk

Greenland

Khan

. Incremental value of polygenic risk scores in primary prevention of coronary heart disease: a review. JAMA Intern Med 2022. DOI: 10.1001/JAMAINTERNMED.2022.3171.

53.

Mosley

Gupta

Tan

, et al. Predictive accuracy of a polygenic risk score compared with a clinical risk score for incident coronary heart disease. JAMA 2020; 323: 627–635.

54.

MassMutual . Genomics plc and MassMutual’s program enables more policyowners to understand health risks through innovative genetic testing, https://www.massmutual.com/about-us/news-and-press-releases/press-releases/2024/04/genomics-plc-and-massmutuals-program-enables-more-policyowners-to-understand-health (accessed 15 July 2025).

55.

Bupa Group . Bupa becomes first UK private healthcare provider to pilot whole genome sequencing for selected UK customers, https://www.bupa.com/news-and-press/press-releases/2024/bupa-pilots-whole-genome-sequencing-for-selected-uk-customers (accessed 15 July 2025).