Abstract
Background
Tumour biomarkers have become increasingly important in oncology, shaping cancer diagnostics, classification, and patient management. Despite their potential, the use of cancer biomarkers in clinical settings remains limited.
Objective
This paper aims to outline biomarker development, from classical, serum protein markers to emerging tumour biomarkers, including meta-biomarkers, to show their diversity and point out the challenges in their development, reporting, and implementation in clinical practice as well as their relevance in evidence-based pathology and cancer classification.
Methods
A literature-based analysis, incorporating insights from our ongoing research, is presented.
Results
Although numerous potential biomarkers, biomarker signatures, and meta-biomarkers, are being discovered, existing innovations are often not supported by sufficiently rigorous research methodologies and standardised reporting practices to enable their translation into clinical practice.
Conclusions
To ensure that biomarker discoveries are both scientifically sound and clinically useful, improved research and validation methods, along with adherence to established reporting standards, are essential. We propose the use of the Hierarchy of Evidence for Tumour Pathology as a framework to evaluate and map existing evidence and identify knowledge gaps and research priorities.
Keywords
Introduction
Recent methodological advancements and new insights have fuelled significant progress in the development of tumour biomarkers. Biomarkers encompass biomolecules, cells, cellular structures, or bioactivities that can be quantified and evaluated as indicators of cancer risk, presence, characteristics, or progression. They can be evaluated in blood, other body fluids or tissues, and help to detect, diagnose, and monitor cancer, as well as predict and assess the effectiveness or toxicity of a treatment. Tumour biomarkers can be categorised according to various criteria, including their nature (e.g. protein, nucleic acid, metabolite, and radiographic biomarker), their function (e.g. receptor and mutation), the type of sample in which they are measured (e.g. blood-derived), their clinical purpose (e.g. screening, early detection, and recurrence monitoring), or the techniques employed for detecting them (e.g. immunoassays and molecular methods). 1
An ideal biomarker should exhibit high specificity and sensitivity, enable minimally invasive or non-invasive testing, and be consistently and accurately measurable across different laboratory settings, using a simple, cost-effective method. Additionally, it should be related to tumour biology, have a clear biological association with tumour characteristics and/or treatment response mechanisms. 2
Tumour biomarkers are valuable tools, enhancing the accuracy and reliability of cancer diagnosis, prognosis, and treatment decisions. Therefore, it is essential that tumour biomarkers used in clinical practice are those that have been rigorously evaluated, and whose clinical utility has been demonstrated through validation in well-designed studies. Additionally, standardised procedures with rigorous quality control measures, following respective guidelines, are crucial throughout the biomarker development process, from identification and validation to the implementation of appropriate tests in routine clinical practice.3,4
Objective
The paper aims to overview the developments in tumour biomarkers, particularly in their application for tumour classification, diagnosis, prognosis, and treatment decisions (prediction). This work examines both classical (serum protein markers, biochemical markers) and emerging biomarkers that have the potential to improve cancer detection, classification, and patient management. A summary of landmark events in tumour biomarker research is shown in Figure 1. The paper also explores the role of study design and scientific rigour in biomarker validation studies. Timeline – selected landmarks in tumour biomarker field. Abbreviations: RNA-seq: RNA sequencing; “WCT EVI MAP: Mapping the Evidence for the World Health Organization (WHO) Classification of Tumours: a Living Evidence Gap Map by Tumour Type” project
89
; ISOBM: International Society of Oncology and Biomarkers; MCED: multi-cancer early detection tests; LLMs: large language models.
Classical tumour biomarkers
Historical view
The first biochemical marker was a protein discovered by Henry Bence-Jones in 1847 in the urine of a patient suffering from osteomalacia. 5 Over a hundred years later, this protein was recognised as the multiple myeloma-associated immunoglobulin light chain and termed the Bence-Jones protein. In the early 20th century, the concept of tumour markers as substances produced by cancer cells or in response to cancer emerged with the observations of higher concentrations of certain substances in the blood of cancer patients, such as adrenocorticotropic hormone (ACTH) in small cell lung carcinoma, and prostatic acid phosphatase (PAP) in prostate cancer. Extensive research on cancer biomarkers began in the 1960s with the discovery of the first clinically useful human tumour markers, alpha-fetoprotein (AFP) 6 by Abelev, and carcinoembryonic antigen (CEA) discovered by Gold and Freeman. 7 Based on these discoveries, and to promote research in tumour marker development, the International Research Group for Carcinoembryonic Proteins was founded in 1973. In 1978, it was transformed into a society under the name International Society for Oncodevelopmental Biology and Medicine (ISOBM). The society, later renamed the International Society for Oncology and Biomarkers (ISOBM), included Professors Abelev and Gold as honorary members. Professor Hirai, a co-founder of the society, played a key role in establishing the foundations for the use of tumour markers in oncology, making significant contributions to the development and standardisation of tumour markers, particularly in the area of glycoprotein antigens. Notably, he conducted pioneering research on AFP, helping to elucidate its biochemical properties, improve assay methodologies, and establish its clinical relevance as a key marker in the diagnosis and monitoring of hepatocellular carcinoma and germ cell tumours.8–10
Further developments
By the 1990s, the use of tumour markers expanded significantly, and significant improvements were made in the standardisation of tumour marker tests. Serum protein markers, often referred to as ‘classical’ tumour markers, became widely used in clinical practice. Other examples include CEA for colorectal cancer, AFP for hepatocellular carcinoma, CA 15-3 for breast cancer, CA125 for ovarian cancer, prostate-specific antigen (PSA) for prostate cancer, CA19-9 for pancreatic cancer, human chorionic gonadotropin (HCG) for trophoblastic tumours, S100-protein for melanoma, and squamous cell carcinoma antigen (SCCA) for cervical and other squamous cell epithelium-derived cancers. Interestingly, the clinical utility of PSA as a screening tool for prostate cancer has been a subject of significant debate, with concerns regarding overdiagnosis and overtreatment discouraging its routine use as a single marker. 11 However, the advent and integration of multiparametric magnetic resonance imaging into the diagnostic pathway has markedly improved the specificity of PSA screening. This has led to a re-evaluation of PSA’s role, effectively revitalising its value as a frontline marker in prostate cancer detection. 12 Endocrine markers NSE and ProGRP are used for the diagnosis of small cell lung cancer subtypes, while CEA and CYFRA 21-1 – for non-small cell lung cancer.13,14
More recent markers include human epididymis protein 4 (HE4), which, in combination with CA125, forms the ROMA (risk of ovarian malignancy algorithm) index for risk assessment of ovarian cancer in women with pelvic masses, 15 as well as PIVKA-II (Protein Induced by Vitamin K Absence or Antagonist-II), which, in combination with AFP, constitutes the GAAD index (combining Gender, Age, AFP, and DCP (PIVKA-II)) to improve hepatocellular carcinoma detection. 16 The issue of meta-biomarkers, which incorporate multiple different molecules and data into diagnostic algorithms, is explored further in a subsequent section of this article.
Accuracy issues
Although tumour marker concentrations vary significantly between cancer patients and healthy individuals, they usually provide insufficient sensitivity and specificity. For instance, many classical biomarkers can be elevated in non-malignant conditions, leading to false-positive results, and reduced diagnostic specificity. 17 Examples include CA125, extensively studied in ovarian cancer, which can also be elevated in benign conditions, such as endometriosis. The low sensitivity of CA125, which is reported to be elevated in up to 50% of stage I cases, limits its effectiveness as a screening tool for early-stage ovarian cancer. 18 Similarly, PSA levels rise in response to benign prostatic hyperplasia. Evidence from secondary care-based studies showed that although PSA has high sensitivity, it has a low diagnostic specificity of 20% in symptomatic prostate cancer patients. 19 Moreover, although PSA-based screening shows a reduction in prostate-cancer-specific mortality, its low specificity results in overdiagnosis and overtreatment. 11 Further drawbacks of classical tumour markers include individual variability, which is amplified by factors such as race, age, body mass index, and comorbidities, making the establishment of universal cut-off values for cancer detection challenging. Although of limited use for screening and diagnostic purposes, these markers have proved to be successful in several clinical applications, such as monitoring treatment response and disease progression. This is exemplified by both CA125 and PSA, which are clinically used in the follow-up of ovarian and prostate cancer patients, respectively, to assess treatment response and predict disease recurrence,18,20 as well as by CEA, which has been identified as a predictor of disease progression in patients with metastatic colorectal cancer receiving chemotherapy. 21 However, for monitoring of therapy response or surveillance after primary treatment, individual changes in marker levels assessed at defined time intervals are more relevant than absolute cut-off values based on healthy controls or specific cancer cohorts. 22
Novel classes of tumour markers
Molecular histopathological markers
Classical tumour markers remain fundamental to standard clinical practice in oncology. However, advances in biomedical sciences and techniques, such as immunohistochemistry, flow cytometry, polymerase chain reaction, and in situ hybridisation, have introduced a molecular dimension to histopathology, offering new possibilities for determining the origin of a tumour, facilitating differential diagnosis, and improving prediction and prognosis. For example, cytokeratins aid in differentiating carcinomas of various origins, e.g., CK7−/CK20+ is associated with colorectal carcinoma 23 ; WT1 helps identify ovarian cancer origin in metastases 24 ; HepPar-1 and Arginase-1 assist in confirming hepatocellular carcinoma and excluding metastatic adenocarcinomas in the liver 25 ; Ki67, a proliferation marker, is widely used as a prognostic factor in several cancer types, including breast cancer 26 ; HER2 in breast cancer, EGFR in lung cancer, and CD20 in lymphomas hold both prognostic and predictive significance.27–31
Immunotherapeutic-context-related tissue markers
Some novel categories of biomarkers validated for clinical use refer not only to different biological levels, but also to a new immunotherapeutic context. Examples include the predictive value of the expression of the immune checkpoint protein, Programmed Cell Death Ligand 1 (PD-L1), in several tumour types, including non-small cell lung cancer 32 and microsatellite instability (MSI) in different tumour types, including colorectal cancer, 33 where MSI also serves as a well-established diagnostic and prognostic marker.
Apart from protein biomarkers expressed in tumour cells that aid in classification and prognostication, cells characterising the immune microenvironment may also serve as biomarkers of prognostic value in some cancers. For example, tumour infiltrating lymphocytes (TILs), assessed on routine H&E slides, were shown to be associated with improved prognosis in early-stage HER2-positive breast cancer patients. 34
Bacteria-specific markers
Certain tumour markers have recently been associated with bacterial infections, assessed either directly through bacteria-specific molecules or indirectly through host-specific factors triggered by infections. Examples of such markers include gingipains (proteases produced by Porphyromonas gingivalis) and increased MMP-9 levels as host markers in oropharyngeal cancer patients. 35 Bacteria-specific markers include various molecules, such as Fusobacterium nucleatum-specific DNA detected in faecal samples, or proteins, such as anti-F. nucleatum antibodies in the serum and saliva of patients with gastrointestinal malignancies. 36
Markers derived from high-throughput methods
The advent of ‘omics’ methods, including genomics, transcriptomics, proteomics, epigenomics, and metabolomics, has accelerated the discovery of novel biomarkers within diverse molecules. A seminal transcriptomic study by Perou et al. 37 defined molecular subtypes of breast cancer. Whole genome sequencing has led to defining distinct glioma subtypes based on IDH1/2 mutations.38,39 Hybrid capture-based next generation sequencing (NGS) has shown that tumour mutational burden (TMB) predicts pembrolizumab (anti-PD-1 therapy) response across different tumour types. 40 Accumulating high-throughput data hold promise for circulating metabolomic signatures to support diagnosis and prognosis, for example, in pancreatic cancer, 41 as well as for circulating microRNA (miRNA) signatures, for example, in colorectal, lung, and breast carcinomas. miRNA-based clinically approved tests are emerging, exemplified by the CE-IVD-certified M371 and GASTROClear tests for early detection of testicular cancer and gastric cancer, respectively.42,43
Markers generated from single-cell analyses
Novel technologies now enable genomic and transcriptomic characterisation at the single-cell level. This allows the examination of the molecular features of individual cells and the identification of subpopulations with distinct gene expression profiles, genetic alterations, or lineage origins. A seminal example in cancer research is the study by Patel et al., 44 in which single-cell RNA sequencing was used to reveal intratumoral heterogeneity in glioblastoma. This approach is particularly valuable for studying tumour immune microenvironment and detecting rare cell types, such as cancer stem cells or therapy-resistant clones, to inform personalised treatment strategies. Further opportunities for tumour biomarker discovery are provided by single-cell spatial omics, offering valuable spatial insight by integrating omics-level data with tissue architecture. 45
cfDNA/ctDNA and circulating tumour cells as liquid biopsy markers
New classes of circulating tumour markers, at the cellular and molecular levels, referred to as ‘liquid biopsy’, include tumour cells (CTCs) and cell-free tumour DNA (ctDNA) found in the peripheral blood of cancer patients. CTCs are present in very small quantities, even in patients with advanced cancer. They are detected and characterised by surface markers, such as EPCAM or cytokeratins, and have prognostic value, for example, in breast, prostate, and colorectal cancers. 46 However, to date, CTC detection faces significant challenges due to their low abundance, cellular heterogeneity, and issues related to standardisation of detection methods. Thus, CTCs are often considered overvalued in terms of their clinical utility, currently offering limited practical benefit in most cases.47,48
ctDNA constitutes only a small fraction of the total cell-free (cf)DNA circulating in the blood stream, sometimes as little as 0.01 to 1%. ctDNA carries various malignancy-associated alterations, including single mutations, deletions, fusions, and amplifications. 49 These markers can be detected and quantified using highly sensitive technologies such as digital PCR (e.g. for single mutations) or NGS-based deep sequencing (for targeted approaches, i.e. limited to pre-specified genes or untargeted approaches, i.e. genome-/exome-wide).50,51 ctDNA analysis can be useful even when tumour tissue is unavailable for molecular characterisation (a tumour-agnostic approach), enabling patient stratification for certain targeted or immune therapies based on genetic driver mutations or other molecular indicators. Treatment-relevant ctDNA markers are exemplified by mutations of BRAF in melanoma, KRAS, NRAS and BRAF in colorectal cancer, EGFR in lung cancer, and ESR1 and PIK3CA in breast cancer.51–54 Pre-analytical factors, including sample collection, processing, storage, and handling, that profoundly affect biomarker stability, accuracy, and reproducibility in both research and clinical applications, are especially critical in ctDNA analysis, though equally important for all cancer biomarkers. Beyond these pre-analytical considerations, obtaining a sufficient amount of cfDNA and employing a standardised, highly sensitive analytical procedure that meets the quality requirements of international (ISO15189) and national institutions50,55,56 are essential prerequisites for liquid biopsy. Further developments include comprehensive pattern recognition methods for nucleic acid markers in total cfDNA, capturing both tumour characteristics and antitumour responses of immune and other patient cells. These methods encompass epigenetic DNA methylation patterns, DNA fragment patterns, nucleosome positioning patterns, and more. 57 The development and application of highly sophisticated bioinformatic pipelines are essential for deconvoluting the complex information obtained through untargeted sequencing approaches. 58
Multi-omic approaches
High-throughput studies, enabling the investigation of the molecular landscape of multifactorial diseases, such as cancer, have generated vast amounts of data, stored in multiple databases, such as GEO, 59 Ensembl, 60 the Cancer Genome Atlas (TCGA), 61 MetaboLights, 62 and ENCODE, 63 just to name a few. A major value of these data lies in their secondary use driven by recent advancements in data science and artificial intelligence (AI). While single-layer omic approaches provide valuable insights, understanding complex, multilayered biological systems requires the integration of genomic, transcriptomic, proteomic, and other ‘omic’ data into multi-omics approaches, which offer new potential for biomarker discovery and are currently being extensively developed.64–66
The excitement surrounding the discovery of novel markers is tempered by numerous factors that hinder their implementation in clinical practice, and many novel biomarkers still lack sufficient sensitivity and specificity. 1
Improving testing accuracy: Meta-biomarkers
It is unlikely that a single biomarker can reliably support early screening, diagnosis, prognosis, or assess the safety and efficacy of treatment modalities.67,68 Therefore, new opportunities for precision medicine are emerging through the development of meta-biomarkers incorporating multiple different molecules in biomarker panels or integrating tumour biomarkers with data derived from other diagnostic methods (e.g. endoscopy and radiography), clinical records or exposomes.69,70
Panels of classical tumour markers
Combining individual classical markers into panels has been shown to improve their effectiveness. 71 Significant progress has been made in the development of blood and urine biomarker panels, which are critical for the early detection and curative interventions in pancreaticobiliary malignancies. For biliary tract cancer, a combination of serum protein biomarkers (CEA, CA 19-9, CYFRA 21-1, and MMP7) shows diagnostic potential. For pancreatic cancer, combinations like CA 19-9 with CFB or IL-8, IL-1β, and CA 19-9 demonstrate high sensitivity and specificity. A urine three-biomarker panel (LYVE-1, REG1B and TFF1) was developed as a non-invasive test for the early detection of pancreatic ductal adenocarcinoma, and its use in the PancRISK score enables the stratification of patients for appropriate management.72,73 Entire biomarker panels can also be combined. For example, the 4MP protein panel (consisting of the precursor form of surfactant protein B, CA125, CEA and CA 19 fragment) in conjunction with a lung cancer risk prediction model (PLCOm2012) demonstrates high predictive performance for lung cancer screening in high-risk individuals. 74
Multigene diagnostic tests
Similar to classical tumour markers, integrating molecular markers has proven to be effective. For instance, the Oncotype DX assay, a leading genomic biomarker test for breast cancer, evaluates the expression of 21 genes, including 16 cancer-related genes. By incorporating expression levels of these genes into an algorithm, the assay generates a recurrence score that predicts the benefit of adding chemotherapy to endocrine therapy. 75 Widely recommended in international guidelines, including those of the American Society of Clinical Oncology (ASCO) and the National Comprehensive Cancer Network (NCCN), Oncotype DX provides both predictive and prognostic insights, guiding adjuvant chemotherapy decisions for women with early-stage oestrogen receptor (ER)-positive, HER2-negative breast cancer. Other innovations include the ColonES assay, a blood-based multigene cell-free DNA (cfDNA) methylation test, which enables early detection of colorectal cancer and prognosis estimation. 76 Broad implementation of such tests is frequently limited by financial constraints and lack of reimbursement.
Markers combined with clinical data and structural insights
Complementing biomarker-based tests with patient clinical data has a significant potential to improve diagnostic accuracy. For example, combining CA125 and HE4 serum levels with menopausal status resulted in the development of ROMA for ovarian cancer, as mentioned above. 15 Similarly, combining CA 19-9 levels alongside patients’ BMI values into the PancRISK score has been shown to improve its performance. 72
Also, tissue tumour markers can be integrated into a meta-biomarker approach, particularly as it is now possible to capture the spatial organisation and structure of the analysed tissues in detail. 77 Emerging technologies, such as spatial transcriptomics and spatial proteomics, allow researchers to map the relative positions of the investigated molecules within a given sample. This is further enhanced and complemented by the development of digital pathology, image analysis and interpretation, assisted by AI-based tools, such as the FDA-approved Paige AI platform. 78 This integrative approach is termed ‘pathomics’ and encompasses the entirety of a tissue’s pathological characteristics, the ‘pathome’. When combined with radiomics, pathomics becomes part of ‘morphomics’, the study of morphological features across all levels of achievable resolution within a system or organism. ‘Radiomics’ refers to the extraction of quantitative data from radiological medical images, mostly at a macroscopic scale. 77 Future research should complement existing knowledge on tumour biology with morphological insights at the ultrastructural, microscopic, and macroscopic scales.
Multi-cancer early detection panels
Multi-cancer early detection (MCED) tests represent a relatively new generation of cancer screening assays that may incorporate meta-biomarkers. These tests are designed to analyse a single specimen, typically blood, for cancer-related biomarkers, most commonly ctDNA. MCED assays aim to detect multiple cancers simultaneously, helping to identify potential malignancies in asymptomatic individuals. MCED platforms include Galleri®, CancerSEEK, SPOT-MAS™, Trucheck™, and Cancer Differentiation Analysis, all of which generally exhibit high specificity. Yet, the sensitivity of these tests remains highly variable depending on the cancer type and stage. Moreover, sensitivity was shown to be influenced by study design, population, follow-up time and the reference standard test used.79,80 MCED requires further validation in clinical trials before it can be implemented in clinical practice.68,81 The integration of AI with protein biomarkers and/or cfDNA holds promise for earlier cancer detection. 82 Machine learning (ML) and AI-based techniques 83 hold the potential to establish novel meta-biomarkers by integrating multi-omic datasets (such as genomics or radiomics) with real-world data, while enabling biomarker validation across diverse populations beyond controlled clinical trials.84,85 In addition, this approach makes it possible to analyse complex, multidimensional data to identify subtle patterns indicative of early disease stages.
Biomarkers for tumour classification
Tumour classification is a complex process that involves identifying and categorising cancer types based on their tissue of origin, histological and molecular characteristics, and clinical behaviour. Historically, the main criterion for tumour classification was microscopic analysis of slides to identify morphological alterations. In recent decades, technological advancements have enabled more in-depth characterisation of tissue and body fluid samples from patients with various tumour types.
The World Health Organisation (WHO) Classification of Tumours (WCT), published by the International Agency for Research on Cancer (IARC), provides globally recognised standards for tumour diagnosis and practical guidance for pathologists and oncologists worldwide. Presented as a series of volumes, known as the WHO Blue Books, and a website (https://tumourclassification.iarc.who.int/welcome/), the WCT provides a comprehensive and standardised classification system, including characteristics of each tumour type, pathological and clinical features, and associated molecular alterations. IARC acknowledges that ‘new technologies are now transforming the field of pathology more rapidly than at any other time in the past 30 years, and it has become increasingly clear that the traditional approach to cancer classification is insufficient’. 86 Traditionally based on histological and, more recently, molecular characteristics, tumour classification is becoming increasingly multidisciplinary. Currently, the WCT includes additional characteristics, such as imaging and diagnostic molecular pathology data. As a result, the term ‘biomarker’, by referring to diverse data, has gained a broad meaning and has been assigned an increasingly important role in supporting classification. Importantly, while tissue analysis remains essential for tumour classification, chemical pathology (also known as clinical biochemistry), which involves the investigation of bodily fluids, is now reflected in the WCT. Beyond enhancing diagnostic accuracy, biomarkers enable personalised treatment strategies, and the discovery of novel, reliable biomarkers could lead to significant changes in tumour classification. For example, IDH (isocitrate dehydrogenase) mutation and 1p/19q co-deletion status have been integrated into central nervous system tumour classification, refining the categorisation of adult-type diffuse gliomas, to assist in the tailoring of treatment strategies. 87 Biomarkers are expected to play an increasingly significant role in tumour classification by providing molecular and genetic insights that enhance traditional histopathology. It is highly probable that these biomarkers will include meta-type biomarkers identified through metadata analyses. A notable example is the WHO classification of endometrioid carcinoma of the uterine corpus into four molecular subtypes, largely based on findings from The Cancer Genome Atlas (TCGA). 88 Such biomarker-driven cancer classifications will pave the way for better personalised and more effective therapeutic strategies.
Until now, tumour classification has largely relied on the consensus of the opinions from invited experts. However, the data included have not always been informed by the best representation of the available evidence. 86 It has become crucial to synthesise and map evidence to ensure guidance is based on the most reliable evidence. It is also critical for identifying knowledge gaps and pockets of low-level evidence in tumour pathology, to guide research prioritisation. To achieve this, the IARC has launched a project entitled ‘Mapping the Evidence for the World Health Organisation (WHO) Classification of Tumours: a Living Evidence Gap Map by Tumour Type (WCT EVI MAP)’, 89 employing an adapted Evidence Gap Map (EGM) methodology.90,91 One of the project’s key goals is to integrate the results into the strategic planning for the 6th and subsequent editions of the WCT. The resulting EGMs are also expected to accelerate the discovery of novel clinically applicable tumour markers by describing evidence gaps and pinpointing potential areas of interest for further research.
Biomarker development guidelines
When integrating biomarkers into clinical practice, several challenges arise, including the standardisation of assay methods, validation across diverse populations, and the need for robust clinical trials to establish their efficacy and reliability. 92 New biomarkers or combinations of biomarkers intended for clinical use, and ultimately to support tumour classification, must then undergo rigorous regulatory evaluation. Regulatory agencies and organisations, such as the International Organization for Standardisation (ISO), 93 the International Quality Network for Pathology (IQN Path), 94 the U.S. Food and Drug Administration (FDA), 95 the National Institutes of Health (NIH), 96 the European Medicines Agency (EMA), 97 and the WHO, 98 have established guidelines to regulate this process. Biomarkers require validation and must demonstrate their capacity to improve clinical decision-making. Regulatory bodies define the qualification processes for biomarkers, outlining specific performance characteristics, such as precision and quality assurance, that must be met. 99 Guidelines from organisations, such as the Clinical and Laboratory Standards Institute (CLSI), are essential for standardisation across different laboratories, instruments, and populations to ensure consistent measurements. Unfortunately, the use and assessment of most tumour markers, even the classical ones, are not standardised, and the state of harmonisation remains unknown.86,100 Detailed guidelines for the evaluation of biomarker studies and strategies to avoid bias are beyond the scope of this article and are accessible through, for example, the EQUATOR (Enhancing the QUAlity and Transparency Of health Research) Network and the GRADE (Grading of Recommendations, Assessment, Development and Evaluation) approach.101–103
Identification of a tumour biomarker – Study design issues
Study designs according to the WCT EVI MAP framework 89 and their relevance to biomarker research.
Consequently, a new pathology-relevant hierarchy, the Hierarchy of Evidence for Tumour Pathology (HETP), has been developed via a Delphi-informed process to standardise the evaluation of histopathological research, tumour classification and biomarker studies. 107 The HETP categorises studies based on their design and the associated risk of bias, which can influence the reliability of the results. Studies of higher LoE (e.g. systematic reviews and randomised controlled trials) provide stronger evidence than studies of lower LoE (e.g. case reports). To evaluate the evidence for tumour classification, an updated HETP version 108 aligned with the WCT EVI MAP 89 framework has been developed. It includes eight categories (‘characteristics’), clinical features, epidemiology, aetiology, pathogenesis, histological diagnostic features, molecular diagnostic features, prognosis, and prediction, most of which accommodate tumour biomarker studies.
In biomarker research, rigorous validation is crucial to ensure clinical utility. The updated HETP 108 organises study designs into five LoE, labelled P1 through P5 (for Pathology). For all categories, the highest level, P1, comprises systematic reviews, while the lowest level, P5, comprises case reports. Levels P2, P3, and P4 vary between ‘Histological and Molecular diagnostic features’, ‘Aetiology (Risk factors)’, ‘Pathogenesis (Mechanisms and pathways)’, ‘Prognosis’, and ‘Prediction’ in terms of the included study designs. A structured LoE system helps to determine which biomarkers are supported by substantial evidence, meeting the precision and reliability required for clinical diagnostics and patient care.
Studies that may lead to identifying new biomarkers can be carried out in nearly all study designs. To ensure a consistent system for classifying study designs of published studies across diverse disciplines, including pathology, epidemiology, oncology, molecular biology, and public health, a Decision Tree (DT) was developed within the WCT EVI MAP project mentioned above. 109 A special diagnostic branch has been included in the DT to address studies evaluating the accuracy and reproducibility of diagnostic tests.
Study designs contributing to biomarker discovery, with their levels of evidence (LoE) for pathology (P), across key biomarker-related characteristics 108
Taken together, studies at levels P3 through P1, with data of progressively higher reliability, potentially provide reduced risk of false-positive or false-negative results that could misguide diagnoses and treatments. Level P5 and P4 studies are used to identify candidate biomarkers. Level P4 studies refine biomarker assays, ensuring their technical reliability. Level P3 and P2 studies evaluate biomarkers in real-world or experimental settings, assessing their predictive, diagnostic or prognostic value. Level P1 studies synthesise findings across multiple studies, providing high-confidence evidence for clinical guidelines or regulatory approvals.
It has become evident that for diagnosis, prediction, and prognosis, studies at levels P1 and P2 have been particularly considered in the definition of clinical guidelines in oncology. To evaluate the efficacy of cancer treatments, study designs such as systematic reviews or RCTs are required, as they provide the highest level of scientific rigour, data collection, and documentation. Since biomarkers do not directly influence patient outcomes and prospective RCTs are not conducted due to financial constraints, biomarker studies have historically struggled to reach these high levels of evidence and have often been excluded from clinical guidelines. However, in our most recent update of the HETP, ‘Observational longitudinal RCT-derived studies’ – that is, translational studies in which biomarkers were evaluated within originally drug-efficacy-focused RCTs – were also classified as level P2. This development opens up the possibility for high-quality biomarker studies to be considered for future clinical guidelines. Furthermore, biomarker evaluations are categorised into different study types: ‘Diagnostic test agreement/reproducibility studies’ for method or investigator comparisons, ‘Laboratory test validation studies’ for purely analytical evaluations in the laboratory, and ‘Diagnostic test accuracy studies’ to assess the clinical utility of biomarkers, which require specific quality criteria, such as sensitivity, specificity, and predictive values. These new categories provide a more differentiated perspective on studies assessing the value of biomarkers in oncological diagnostics. 109
Those studies are paramount for the analytical and clinical accuracy of tumour marker tests when applied in clinical patient care. It is known that different results may be obtained if a tumour marker is quantitatively assessed in blood serum or plasma using different assays or analytical platforms from different manufacturers. A thorough analytical and pre-analytical evaluation, together with head-to-head method comparison using control materials and real patient samples are essential steps for clinical labs. More advanced requirements are better standardisation and harmonisation of the assays already in the development and optimisation phase of the manufacturers. Later, the regular use of internal controls and participation in external quality controls are mandatory measures for clinical labs. Moreover, clinical studies testing the utility of the biomarkers for all relevant clinical questions are necessary to define appropriate decision thresholds and inform about positive and negative predictive values of the markers. All these study types have been covered by the new HETP. 108
Tumour Biomarker Reporting and Evidence Mapping
Adherence to publishing guidelines for clear and transparent reporting of biomarker performance characteristics, including limitations, is essential. 69 Guidelines, such as STARD (Standards for Reporting Diagnostic Accuracy) 110 and REMARK (Reporting Recommendations for Tumour Marker Prognostic Studies), 111 as well as tumour pathology-specific guidance, 112 recommend adhering to standardised reporting frameworks.
The Evidence Gap Maps by tumour type, being developed within the WCT EVI MAP project, 89 will facilitate evidence-based decision-making in tumour pathology. By integrating multiple published studies, EGMs will provide an overview of the current evidence for biomarkers in cancer classification and help establish research priorities by highlighting areas that require further investigation. With numerous research groups exploring diverse cancer parameters and characteristics for potential biomarker development, the living (regularly updated) EGM tool will dynamically reflect the evolving body of evidence. Designed for open access, this tool will benefit all stakeholders, including the scientific community and diagnostic companies. It will assist researchers, clinicians, and policy makers in identifying biomarkers that meet precision and reliability standards, have high clinical utility, and can be effectively translated into patient care.
Conclusions
Biomarkers hold significant potential for advancing cancer diagnostics and patient management. However, their integration into clinical practice is hindered by scientific, technical, translational, and practical challenges. These difficulties largely stem from the biological complexity of cancer. As a multifactorial and highly heterogeneous disease, cancer is driven by multiple dynamic and context-dependent molecular mechanisms, and influenced by both individual constitution and environmental factors. Our understanding of tumour biology, including the biological role of biomarker molecules and their links to disease mechanisms, is rapidly expanding, driven by advances in molecular profiling technologies at both the tissue and single-cell levels, as well as by the integration of multi-omics approaches supported by data science and artificial intelligence. As a result, countless potential biomarkers, biomarker signatures, and meta-biomarkers are being discovered. It is now possible to develop multiplex panels for multi-cancer early detection, referred to as MCED. Yet, despite this vast potential, a relatively small number of molecular biomarkers and biomarker panels have been successfully translated into clinical practice or incorporated into tumour classification systems. The complexity, inter- and intra-tumour heterogeneity, and the dynamic nature of cancer make the translation of tumour biomarkers into clinical practice exceptionally difficult. First, candidate biomarkers, often identified in small cohorts, may lack reproducibility in independent populations. Second, biomarkers that prove useful in specific contexts may not demonstrate utility across different tumour subtypes or clinical settings. Third, the clinical value of many potential biomarkers is limited by their insufficient specificity and sensitivity, which undermines their diagnostic reliability. This is further confounded by non-biological factors, such as technical variability between biomarker assays, their insufficient sensitivity or inadequate standardisation, and implementation obstacles, such as regulatory and reimbursement barriers. Finally, pre-analytical considerations that affect biomarker stability and reproducibility must not be overlooked. Therefore, to ensure the reliability, reproducibility, and clinical effectiveness of tumour biomarkers, both sample handling and assessment methods must undergo rigorous validation across diverse platforms, patient populations, and clinical contexts. To ensure that biomarker discoveries are scientifically robust, clinically meaningful, and effectively integrated into practice, it is essential to employ study designs appropriate to the specific research objectives, adhere to established reporting and validation guidelines, and adopt evidence-based pathology and the HETP framework. As a result, tumour biomarkers will become increasingly important in cancer classification by enabling more precise stratification of cancer subtypes, which is crucial for personalised treatment approaches.
Footnotes
ORCID iDs
Collaborators
WCT EVI MAP, including: Anne-Sophie Bres, Richard Colling, Beatriz Pérez Gómez, Elena Plans-Beriso, Ester García-Ovejero, Oana M. Craciun, Joanna Didkowska, Christine Giesen, Iciar Indave, Alex Inskip, Ramon Cierco Jimenez, Dilani Lokuhetty, Nur Diyana Md Nasir, Cécile Monnier, Marina Montes-Mota, Marina Pollán Santamaría, and Puay Hoon Tan.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This publication, through the WCT EVI MAP project, is funded by the European Union (HORIZON EUROPE Grant No. 101057127).
Declaration of conflicting interest
The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: SH has received research funding or honoraria from Roche, Sysmex, Thermo, Guardant, Novartis, Trillium, Medica, and Instand and is founder of SFZ BioCoDE and CEBIO. MCh, SH, and MK are editorial board members of Tumour Biology journal but had no involvement in the peer review process of this article. The other authors declared no potential conflicts of interest.
Disclaimer
The content of this article represents the personal views of the authors and does not represent the views of the authors’ employers and associated institutions. Where authors are identified as personnel of the International Agency for Research on Cancer/World Health Organization, the authors alone are responsible for the views expressed in this article, and they do not necessarily represent the decisions, policy, or views of the International Agency for Research on Cancer/World Health Organization.
