Abstract
Objectives:
Biobanks are crucial for clinical biomarker research, but pre-analytical processing can impact sample suitability. Insufficient reporting of pre-analytical methods can lead to invalid conclusions.
Methods:
This study evaluated the pre-analytical reporting quality in 294 clinical biomarker studies using biobank plasma and serum samples published from 2018 to 2023, based on the Sample PREanalytical Code and the Biospecimen Reporting for Improved Study Quality guidelines.
Results:
The analysis revealed frequent incomplete reporting. Key underreported elements included fasting time (reported in 31% of articles), freeze–thaw cycles (22.8%), internal transport (8.5%), and centrifugation settings (20%–35%). Demographic data (96.9%), storage temperatures (81%), and blood tube additives (82.7%) were more consistently reported. No significant correlation was found between reporting quality and journal impact factor.
Conclusions:
Our findings highlight the need for standardized pre-analytical reporting to improve research replicability and biomarker study quality.
Introduction
Replicability is a cornerstone of clinical research, as it ensures that research findings can be consistently reproduced when methodologies are transparently reported. However, Ioannidis 1 observed that over 30% of highly published clinical research articles fail to replicate, yielding less-promising or even contradictory results in subsequent studies. Efforts to reproduce previous preclinical research findings also reveal concerning trends; for instance, Amgen and Bayer independently reported reproducibility rates of just 25% and 11%, respectively.2,3 More recently, the Center for Open Science confirmed these findings, with replication success rates of only 46% for results from high-impact articles. 4
Several factors contribute to this lack of reproducibility, with inadequate documentation and reporting being significant contributors.5–7 For example, Sun et al. 8 identified poor reporting of analytical characteristics of biomarkers used in clinical research. We propose that insufficient characterization and documentation of the pre-analytical phase for biobank biomaterials could be an underappreciated factor influencing research replicability and transparency.9,10
The pre-analytical phase encompasses all procedures conducted prior to the analysis of samples. It comprises factors of biological origin that may occur in vivo or in vitro. In vivo factors include effects of health and disease status and can be modifiable, such as diet and circadian rhythm, or unmodifiable, such as age, gender, race, and genetic background. In vitro factors include samples collection, processing, transport, and storage.
Accurate reporting of pre-analytical factors is essential to verify whether the stored materials accurately represent the biomaterials intended for analysis and to assess their fitness for the intended biomarker measurements. Numerous studies have demonstrated that errors in the pre-analytical phase, such as incorrect additives, delays between sample collection and centrifugation/storage, and suboptimal storage conditions, can significantly impact measurement outcomes.9,10 For instance, results from studies measuring (mi)RNA in plasma have been shown to be influenced by factors such as tube additives and storage temperature.9,11 Additionally, metabolomic profiles in plasma and serum are highly susceptible to pre-analytical variables, including fasting status, collection time, hemolysis, delays in processing, and repeated freeze–thaw cycles.12–15 Therefore, comprehensive reporting of sample collection and handling procedures is critical for ensuring the reliability and replicability of research findings.
Several guidelines have been developed to assist researchers in reporting pre-analytical factors. Notably, the Sample PREanalytical Code (SPREC) and the Biospecimen Reporting for Improved Study Quality (BRISQ) criteria have been established to standardize the reporting of pre-analytical procedures.16–19 SPREC focuses on metadata related to in vitro pre-analytical procedures, including information on sample type, primary containers, centrifugation conditions, and long-term storage protocols. Developed by the International Society for Biological and Environmental Repositories, its primary goal is to ensure that each biobank sample is accompanied by a standardized set of metadata. In contrast, BRISQ offers broader guidance for reporting including both in vivo and in vitro factors such as pre-acquisition procedures, transport parameters, and quality assurance steps relevant to downstream analysis. An analysis of BRISQ’s implementation in high-impact journals revealed that pre-analytical factors were often poorly reported, with little improvement observed following the publication of the guidelines in 2011. 20 These findings suggest that the BRISQ guidelines have not yet been widely adopted and may be overly ambitious as an initial step toward improving pre-analytical reporting in the scientific literature.
In this study, we aim to assess the quality of pre-analytical reporting in recent clinical biomarker research using biobank plasma and serum samples, focusing on studies from the beginning of 2018 to February 2023. We developed a scoring system based on SPREC, supplemented by BRISQ elements, to evaluate the comprehensiveness of reporting. Our ultimate objective is to raise awareness of the importance of the pre-analytical phase and encourage the adoption of standardized reporting practices to mitigate biases that may lead to misleading results and conclusions in biomedical literature.
Methods
Article selection
We included studies that at time of the searches end of February 2023 were officially published or added to PubMed between January 1, 2018, and February 28, 2023, and utilized biobank plasma or serum samples for biomarker analysis. Articles were initially selected through a combination of search terms in the PubMed database, ensuring that the selected articles met the criteria of being published in English and focusing on plasma, serum, or urine samples, as outlined in Supplemental Table SI. Studies were excluded if they met any of the following criteria: (a) focused on the design of future biobanks, (b) provided general information about biobanks without reporting pre-analytical details, (c) examined the effect of pre-analytical factors on measurement outcomes, (d) were review articles, (e) involved animal research, (f) did not use plasma or serum samples, (g) lacked full-text availability, (h) were not published in English, or (i) did not explicitly mention biobanking or sample storage. Studies published before 2018 were also excluded, and for feasibility reasons of our literature study, we excluded studies involving urine as a sample type.
Data extraction
The selected articles were evaluated based on their reporting of various pre-analytical factors. These factors were derived from the SPREC guidelines and supplemented with additional elements from BRISQ. We included all relevant SPREC elements pertaining to plasma and serum as these focused on preanalytical processes for biobank biomaterials, and extended the evaluation by incorporating BRISQ elements such as clinical characteristics, storage duration, transport conditions, fasting status, separation gel use, and freeze–thaw cycles.16–18 To ensure objectivity, the data extraction process was piloted by the authors to minimize inter-observer bias.
Pre-analytical elements from SPREC and BRISQ were identified in the main article, supplementary materials, and references. The elements were categorized and recorded according to their presence and source of reporting. The final list of elements included internal transport, biospecimen type, fasting duration, demographic characteristics (age, sex), blood tube additives, separation gel, delays prior to and after centrifugation, centrifugation parameters (speed, time, temperature), storage conditions, and freeze–thaw cycles. Demographic details were recorded separately from the pre-analytical process. For each article, additional descriptive information was collected, including PubMed article ID, publication year, country of affiliation of the primary authors, journal name, impact factor (obtained from Clarivate), biobank name, use of multiple pre-analytical protocols, presence of external transport, and analysis purpose (Table 1).
Characteristics and Pre-Analytical Variables Analyzed in Selected Articles and Their Description
For the elements, the source of the description was reported (main article, first-line references, or supplement).
Scoring and quality control
Data extraction was performed by a single investigator (J.A.W.J.), with findings discussed in meetings involving all four authors: a master’s student in Medical Biology (J.A.W.J.), a professor of clinical chemistry (D.W.S.), an associate professor in epidemiology (F.V.), and the sample process coordinator from the Radboud Biobank (M.A.v.d.B.). Scoring was initially completed on paper and later transferred into Excel. Quality assessments were conducted in two rounds to minimize subjectivity across the dataset. After the initial review by the main assessor (J.A.W.J.) of 100 articles, 15 randomly selected articles were reviewed by two different other authors. Each of the other authors (D.S, F.V., M.B) re-examined 10 articles. For the second round, 9 additional articles from the remaining 194 articles were reviewed by two other authors, with each of three other authors re-examining 6 articles.
Statistical analysis
Descriptive statistics were performed using SPSS version 28 in all articles and in subgroups (articles on UK Biobank, and articles with specific analysis purposes: miRNA and transcriptomics, metabolites, lipid metabolites and metabolomics measurements). We calculated the number of 14 specific pre-analytical elements reported for each article and presented the results as percentages across different continents, publication years, and biobanks. We also examined correlations between the number of reported elements, journal impact factor, and publication year using Spearman’s correlation coefficient. Inter-rater reliability was assessed using Fleiss’ kappa, and discrepancies in scoring were discussed, after which each individual author could adapt their scoring.
Results
A total of 756 records were identified through the PubMed search. After applying the inclusion and exclusion criteria, 307 articles remained for data extraction. Following a further round of exclusions during data extraction, 294 articles were included in the final analysis (Fig. 1). The 294 articles originated from 30 countries, were published in 194 journals, and involved 153 biobanks. The most frequently used biobank was the UK Biobank, which accounted for 23 (7.8%) of the articles (Table 2). The majority of studies were based in Western nations, with Europe (n = 174, 59.2%), The Netherlands (n = 33, 11.2%) and the United States (14.3%) being the largest contributors (Supplemental Table SII). Among the 294 articles, 55.4% (n = 163) focused on protein biomarkers, 20.4% (n = 60) on metabolites, and 9.5% (n = 28) on miRNA, with the remainder studying transcriptomics (n = 5, 1.7%) or other biomarker types (Supplemental Table SIII). Detailed information on article characteristics is provided in Table 2 and Supplemental Tables SII and SIII.

Flow diagram describing selection of articles. At the end of February 2023 we searched PubMed for articles that made use of biobank serum, plasma, or urine samples, and articles published between January 2018 and February 2023. Exact search terms that resulted in the 756 articles can be found in Supplemental Table SI. During initial selection, we also exclude studies that used urine for biomarker studies and studies that were published before 2018. Subsequent use of selection criteria before and during data extraction resulted in 294 articles included for scoring in the study.
Characteristics of Included Articles That Performed Clinical Biomarker Research on Biobank Plasma or Serum Samples
Data are presented as n (%) or median [25th, 75th percentile].
aContinents of origin of the authors (list of specific countries can be found in Supplement Table SIII). bBiobanks used in more than six articles; Other refers to biobanks used in six articles or fewer. cOne or multiple pre-analysis protocols described in the article. dsame pre-analytical elements reported for the different protocols; edifferent pre-analytical elements reported for the different protocols; fWhether there was external transport from collection location to either a processing or storage location.
Reporting of pre-analytical elements
The average reporting of the pre-analytical elements listed in Table 1 across the 294 articles reviewed was 38.8% (Table 3). The reporting varied, with some elements being reported more frequently than others (Fig. 2). Demographic characteristics, gender and age, were reported most consistently, i.e., in 96.9% of articles. Blood tube additives, long-term storage time, and the inclusion period were documented in 82.7%, 81.0%, and 72.4% of the articles, respectively. In contrast, all other pre-analytical elements were reported in fewer than 40% of articles. Internal transport, blood tube separation gel, pre-centrifugal delays, post-centrifugal delays, freeze-thaw cycles, and the analysis period were mentioned in only 8.5%, 11.9%, 22.4%, 17.0%, 22.8%, and 8.8% of articles, respectively. The reporting of centrifugal conditions ranged between 20% and 35%, with centrifugal temperature, time, and speed reported in 20.7%, 33.0%, and 34.4% of articles, respectively. Fasting time was described in 31.0% of the articles (Fig. 2, Table 3).

Reporting of pre-analytical elements. Reporting is expressed as percentage of the selected 294 articles that did not report (red) or reported pre-analytical elements in the main article text (green), supplement (yellow), or first-line references (blue).
Percentage of Pre-Analytical Elements Reported for All Articles and Subgroups
Data presented as percentage of articles mentioning element in either main article, supplement, or reference.
*For the UK Biobank, we found a relatively high between-article variation in referring to other reports for the pre-analytical elements as well as in the pre-analytical information provided in these articles (data not shown).
Among the reported elements, most were described in the main article (84.2%), while a smaller proportion appeared in the first-line references (12.6%) or the supplement (3.2%). The percentage of reporting in the main article varied by element, with additives, demographics, long-term storage temperature, and inclusion period reported in 76.5%, 89.8%, 68.7%, and 66.3%, respectively (Fig. 2).
To assess whether the percentage of reported pre-analytical elements differed for frequently used biobanks, we analyzed data from the UK Biobank, the most used biobank in our study, and compared these with the overall group. Of the 23 articles utilizing the UK Biobank, the average number of reported elements (46.6%) was higher than that of the entire group (38.8%; Table 3). However, the pre-analytical elements for the UK Biobank were more often cited in references than in the main articles, with centrifugal elements described in a reference in 27.5% of publications, compared with only 4.3% in the main articles. For all other articles, centrifugal elements were reported in 5.9% of the main articles and 22.8% in references (data not shown). Furthermore, there was considerable variability between UK Biobank articles in the pre-analytical information provided in references, with some articles references containing comprehensive pre-analytical details, while references in other articles provided little to no relevant information (data not shown).
We found no correlation between the impact factor of journals and the total number of pre-analytical elements reported per article (Spearman’s correlation coefficient: 0.04). Similarly, there was no significant association between the year of publication and the number of elements reported (Spearman’s correlation coefficient: 0.05) (Supplemental Figs. SIA and S1B and Table 4).
Quality Assessment of Agreement Between Evaluators
Agreement data are presented as Fleiss’ kappa measure (95% confidence interval) between the main assessor (J.A.W.J.) and the three evaluators (M.A.v.d.B., F.d.V., and D.W.S.) for two assessment rounds. Quality assessment 1 was performed after the first scoring round of 100 articles, and assessment 2 was performed after scoring of the final 194 articles. A.D, after discussion: Fleiss’ kappa score after each evaluator was given the option to adapt the scores after discussing the discrepancies.
Given that miRNA, transcriptomics, and metabolites are particularly vulnerable to pre-analytical handling variations, we conducted a separate analysis of these analysis types. Despite the vulnerability of these analyses, pre- and post-centrifugal delays were reported less frequently or similarly for miRNA and transcriptomics (9.1% and 15.2%, respectively), and for metabolites (16.7% and 15.0%), compared with proteins/proteomics (25.1% and 18.9%) and the entire group of articles (22.4% and 17.0%; Table 3). Additionally, fasting time, which has been shown to impact metabolite measurements, was reported in 45% of metabolite articles, a higher percentage compared with articles on proteins (29.4%) and all articles (31.0%). Nevertheless, this still represents a minority of articles.
Quality assessment
The first quality assessment, conducted after the data collection of the first 100 articles, showed substantial to almost perfect agreement between each evaluator and the main assessor (Table 4). Following discussions of each article, overall agreement improved, with Fleiss’ kappa measures ranging between 0.90 and 0.98. The second quality assessment, conducted after reviewing the subsequent 194 articles, demonstrated almost perfect agreement between evaluators and the main assessor. Fleiss’ kappa values were 0.85, 0.83, and 0.80 between the main assessor (J.A.W.J.) and evaluators M.A.v.d.B., F.d.V., and D.W.S., respectively, with post-discussion values of 0.95, 0.88, and 0.94 (Table 4).
Discussion
This study investigated the reporting of pre-analytical procedures for biobank samples used in biomarker research, based on a selection of SPREC and BRISQ criteria, in articles published between the beginning of 2018 and the end of February 2023. Our findings indicate that the reporting of pre-analytical elements is insufficient in the majority of articles, with many elements not reported or only partially described. Demographic characteristics, blood tube additives, long-term storage time, and the inclusion period were relatively well documented, while other elements, such as fasting time, centrifugal conditions, freeze–thaw cycles, transport, and pre- and post-centrifugal delays, were poorly reported. The differences in reporting between different types of analysis were minimal, and no significant correlation was found between impact factor and the number of reported pre-analytical elements.
Our assessment focused on pre-analytical reporting in biobanking, using SPREC as a foundation for the selection of elements, supplemented with relevant BRISQ criteria.16,18 We believe that this framework provided an appropriate list of elements for reporting pre-analytical processes in biobanks. The findings of poor pre-analytical reporting align with those of a previous evaluation conducted a decade ago, which highlighted the limited implementation of BRISQ guidelines and the lack of improvement in reporting biospecimen data in scientific literature since their publication. 20 The authors of this evaluation attribute this low uptake of BRISQ in scientific literature to its broadness, openness for interpretation, and the vague definition of some categories.
Our results demonstrate that, even today, most pre-analytical processes are reported in fewer than 25% of articles, which contrasts sharply with the near-universal reporting of demographic characteristics. This suggests a clear consensus in clinical research on the importance of documenting subject characteristics 21 , yet no similar consensus exists for pre-analytical processes. It is important to note that a lack of information on certain pre-analytical aspects does not necessarily imply that the pre-analysis is unsuitable for its intended purpose, but it does limit the ability to assess the reliability and robustness of the study’s conclusions.
We observed that the reporting of pre-analytical elements was not influenced by the type of analysis. Also articles on metabolites, miRNA, and transcriptomics analyses demonstrated poor reporting of pre-centrifugal delays, despite evidence that delays in sample processing can alter the composition of the sample, potentially affecting these analysis outcomes.12,22–24 This is particularly the case when blood cells have not been separated from plasma or serum. Similarly, the reporting of fasting time was found to be suboptimal even for articles on metabolomic analysis, despite its known impact on these measurements.
Our analysis found no relationship between the impact factor of journals and the completeness of pre-analytical reporting, which is concerning given that higher-impact journals are generally regarded as more influential on clinical decisions and guidelines. This suggests that the quality of reporting on pre-analytical procedures is not necessarily related to journal prestige, a factor that could undermine the credibility of research published in such journals.
Our study also highlighted differences in the reporting of pre-analytical elements for articles using the UK Biobank. Compared with the broader set of articles, for information on a higher percentage of pre-analytical elements, UK Biobank articles referred to the references rather than providing this information in the main text or supplementary materials. Interestingly, the majority of these references contained limited or no useful information about pre-analytical processes, with a minority that comprised all relevant pre-analytical information. These findings suggest that even when relevant information is available for a biobank, researchers may not be aware of its existence or importance.
We argue that complete reporting of pre-analytical procedures is a shared responsibility among the research community, including researchers, biobank organizations, funders, and journal editors. We recommend that authors ensure the adequate reporting of pre-analytical processes and that biobanks make this information readily accessible, for example, through publicly available and regularly maintained websites.
The strength of our study lies in the large number of recent articles included, which were selected through a thorough search strategy. This enabled us to capture a broad picture of pre-analytical reporting in the biomedical field. Our study also comprises several limitations. First, as depicted in the flow diagram describing selection of articles, we slightly changed our study aim after the initial searches in which we also included urine as a biomaterial. Since this resulted in a number of articles, that exceeded our scorings capacity, at second instant, we decided to exclude articles that had urine as a sample type (rather than shortening the inclusion period). Second, by not using the search term “biorepository” as an alternative word for “biobank,” we may have missed articles. Third, our searches did not select articles on analyte or “multi-analyte” (omics) measurements in biobank materials that did NOT mention the term “biomarker.” Using this research strategy we may have missed articles on certain analytes measured in biobank biomaterials, among which are studies on cytokines. For reasons of feasibility of our literature search we also excluded Genome Wide Association Studies (GWAS), while these studies also use measurement results for certain analytes measured in biobank plasma or serum. We believe the abovementioned second and third limitations have not affected our overall conclusion of poor reporting of pre-analysis in biomarker studies using biobank materials. However, they preclude to obtain some specific insights, for instance, on reporting of pre-analysis in GWAS studies and of analytes typically not described by the term “biomarker.” Fourth, all analyses were conducted by a single assessor, although two rounds of quality assessments were carried out to mitigate this interobserver bias.
We envisage the list of pre-analytical elements that should be minimally reported in publications is a trade-off between transparency on how pre-analysis was conducted for elements that makes the sample fit for its intended use and the bureaucratic burden for researchers, funding agencies, and article reviewers. It is beyond the scope of this study to propose a set of pre-analytical characteristics that meets these requirements, but we suggest it consists of a simple checklist approach for elements as outlined by SPREC and BRISQ and that are adapted for differences in intended use.
Better biospecimen data reporting increases transparency that is essential to evaluate credibility of scientific claims. Reporting may be facilitated by making pre-analytical biospecimen data available to the researcher at time of sample issuance. To this end, we encourage biobanks and suppliers to collaborate on inclusion of pre-analytical sample characteristics as described in SPREC and BRISQ standards in their Laboratory Information Management Systems and/or Biobank Information Management Systems.
Improving quality of scientific publications by promoting transparent and accurate reporting is also the mission of the international network for Enhancing the QUAlity and Transparency Of health Research (EQUATOR). Its website showcases reporting guidelines for many study types, among which are CONsolidated Standards Of Reporting Trials (CONSORT) for randomized trials and Standards for the Reporting of Diagnostic Accuracy Studies (STARD) for diagnostic and prognostic studies. Its library also comprises the BRISQ guideline for “biospecimen reporting.” However, to the best of our knowledge, none of the biomedical journals requires adherence to the BRISQ (or SPREC) guideline for reporting pre-analytical variables for biomarker studies. We propose that journals that publish analyte measurement results of biobank samples refer to guidelines that provide guidance on reporting of the pre-analytical characteristics of biobank samples for clinical biomarker research.
Our findings underscore the importance of improving pre-analytical reporting in clinical biomarker research, as inadequate reporting can lead to irreproducible results and faulty conclusions, contributing to research waste. We call on researchers, biobank organizations, academic institutions, policymakers, funders, and journal editors to prioritize transparent and accurate reporting of pre-analytical processes and to collaboratively develop and implement reporting guidelines that address the specific needs of different analysis purposes.
Authors’ Contributions
D.W.S. conceived the study. D.W.S., F.d.V., and M.A.v.d.B. designed the study. All authors contributed critical insights into the search strategy, article selection, data extraction, interpretation, and analysis. J.A.W.J. performed the search and data selection. D.W.S., F.V., and M.A.v.d.B. screened a selection of articles for quality assessment. J.A.W.J. wrote the first draft of the article. D.W.S. wrote subsequent versions of the article with J.A.W.J., F.D., and M.A.v.d.B. providing essential input for revisions. All authors had full access to all data and were responsible for the decision to submit the article for publication.
Footnotes
Author Disclosure Statement
No conflicting financial interests exist.
Funding Information
No funding was received for this article.
Supplemental Material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
