Abstract
Objective:
We aim to evaluate the quality of patient-reported outcomes included in randomized control trials for the treatment of congestive heart failure using the International Society for Quality of Life Research (ISOQOL) checklist, a validated tool for critically appraising the quality of patient-reported outcomes.
Design:
We performed a cross-sectional analysis of 65 randomized control trials with patient-reported outcomes for drug intervention trials for treating congestive heart failure.
Setting:
N/A.
Participants:
N/A.
Main outcome measures:
The primary outcome of this study was to evaluate the reporting completeness of patient-reported outcomes in congestive heart failure clinical trials with drug interventions according to the ISOQOL checklist.
Results:
Our search returned 1114 studies, of which, 65 are included in the analysis. The average completion of the ISOQOL reporting standards was 44.51%. Higher completion of the ISOQOL patient-reported outcome standards was observed in the clinical trials with patient-reported outcomes as primary endpoints compared to the clinical trials with patient-reported outcomes as a secondary endpoint. The multivariable regression model showed that clinical trials with patient-reported outcomes as a primary endpoint had a 21.46% better completion percentage (t = 4.45, p ⩽ 0.001) when controlling for PRO recording duration and trial registration. Eight (8/65, 12.31%) of the clinical trials met the satisfaction criteria of completing two-thirds of the ISOQOL patient-reported outcomes reporting standards. All of these RCTs had a patient-reported outcome as a primary endpoint.
Conclusion:
Our analysis of the reporting of patient-reported outcomes in congestive heart failure clinical trials with drug interventions suggests that the quality of reporting is suboptimal. This evidence of substandard reporting of patient-reported outcomes is disconcerting as it reduces the transparency of randomized control trials, which are considered the foundation of evidenced-based medicine. Inadequate reporting may result in clinicians implementing misrepresented or incomplete evidence into clinical practice. Validated reporting tools, such as the ISOQOL, can be used by trialists and clinicians alike to improve and critically appraise the reporting of patient-reported outcomes in randomized control trials.
Trial registration:
N/A.
Keywords
Introduction
The American Heart Association estimates that 6.2 million American adults have congestive heart failure (CHF). This estimate is projected to increase to over 8 million adults by 2030.1–3 In 2014, 1.1 million emergency department visits and 1 million hospitalizations were attributed to primary CHF with a mean cost of $11,552 per admission and a total estimated cost of greater than $11 billion. 4 Worldwide, it is estimated that more than 64 million people have CHF, affecting 1%–2% of all adults.5,6 CHF may also increase the risk of complications such as arrhythmia, thromboembolism, hepatic dysfunction, muscle wasting, and pulmonary congestion. 7 Given the high prevalence of CHF, accompanied by its escalating societal costs and associated comorbidities, research efforts are essential for investigating therapies that decrease morbidity and mortality. Moreover, because CHF directly affects the quality of life and daily function of patients, equal efforts should focus on quality-of-life improvement and other patient-reported outcomes (PROs). PROs are defined as measurements based on reports directly from patients about their health condition and treatment experiences, without interpretation by clinicians or others. PROs are crucial in assessing the impact of treatments from the patient’s perspective. In CHF trials, commonly used PRO measures include the Kansas City Cardiomyopathy Questionnaire (KCCQ), which assesses the patient’s health status, and the Minnesota Living with Heart Failure Questionnaire (MLHFQ), which measures the impact of heart failure on quality of life.8,9 For investigations of efficacy and quality of life, results from randomized clinical trials (RCTs) are frequently used to explore the methods and strategies for potential improvements in these outcomes.
RCTs are widely regarded as the foundation of evidence-based medicine through their study design that reduces bias, decreases confounders, and allows for the exploration of causal relationships between interventions and outcomes. 10 To illustrate their importance, RCTs are often used as strong supporting evidence for recommendations underpinning clinical practice guideline development. For example, a clinical practice guideline for the management of heart failure recommends the use of ACE inhibitors for patients with prior or current symptoms of chronic heart failure with a reduced ejection fraction based on the work of an RCT. 11 While RCTs are commonly used to investigate therapeutic efficacy, this study design is also effective for examining PROs. A recent systematic review of PROs found that few RCTs examined the quality of life parameters as primary outcomes 12 ; however, when these outcomes are selected as primary endpoints, patients report greater satisfaction with study results. 13 Thus, because of their widespread importance to patient care, RCTs must be clearly and completely reported, carefully outlining their methodology and decision-making.14,15
Still, trialists will intermittently forgo the rigorous study design mandated by RCTs, giving way to variability and insufficient outcome reporting. This deficiency has led to the development of the Consolidated Standards of Reporting Trials (CONSORT) statement. 16 The CONSORT statement created evidenced-based recommendations to be used by authors as a tool to ensure complete and appropriate reporting of trials. Several extensions of the CONSORT statement have been developed to improve different aspects of the original guidelines.17,18 One of those extensions was developed by the International Society for Quality of Life Research (ISOQOL) to improve the reporting of PROs in RCTs. 18 In addition to the ISOQOL guidelines, the CONSORT-PRO (Consolidated Standards of Reporting Trials–Patient-Reported Outcomes) extension provides minimum standards for reporting PROs in clinical trials. While CONSORT-PRO is widely recognized, this study used the ISOQOL checklist due to its comprehensive approach, offering a more detailed assessment of PRO reporting, particularly in the context of CHF trials. In this study, we focus on examining the reporting quality of RCTS that evaluate PROs for individuals with CHF using the ISOQOL screening guideline.
Methods
This methodology review (which used elements of systematic review methodology, when applicable) did not qualify as human research as outlined in the United States Code of Federal Regulations and did not require institutional review board oversight. The writing of this manuscript was directed by a modified reporting guideline of the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA). This guideline was developed by Murad and Wang 19 for the reporting of epidemiological methodology research.
Study objectives
The primary objective of this study was to evaluate the completeness of PRO reporting in CHF studies with drug interventions according to the ISOQOL reporting guidelines. We selected the ISOQOL reporting guidelines as the primary framework for analysis due to its detailed and specific approach to evaluating PRO reporting. While the CONSORT-PRO guidelines establish minimum reporting standards, we believe ISOQOL offers a more comprehensive evaluation of reporting completeness, which is crucial for assessing the quality of PRO reporting in CHF clinical trials. Our secondary objectives were to determine factors that influence the completeness of PRO reporting and how primary or secondary endpoints affect the completeness of PRO reporting. We evaluated the change in completeness of PRO reporting prior to and after the publication of CONSORT-PRO in 2013 (allowing 1 year for guidelines to be implemented).
PubMed search
PubMed, which includes the MEDLINE collection, was searched by DR and JP. Our search string was created using a combination of medical subject heading (MESH) terms, key clinical phrases, and PRO-specific phrases used in a previous systematic review. 20 The following search string was deployed as follows: (“Heart Failure”[Mesh] AND “Heart failure” OR “Congestive Heart failure” OR “CHF” OR “Decompensated heart failure”) AND (“quality of life” OR “health related quality of life” OR “health status” OR “health outcomes” OR “psychosocial” OR “psychological” OR “depression” OR “anxiety” OR “emotional” OR “distress” OR “social functioning” OR “social wellbeing” OR “patient reported symptom” OR “patient reported outcomes” OR pain OR fatigue OR “PRO” OR “PROs” OR “HRQL” OR “QOL” OR “HRQOL” OR “symptom distress” OR “symptom burden” OR “symptom assessment” OR “functional status” OR coping OR socialization). To narrow this search to RCTs, we applied the filters for RCTs. We also limited our search to English-only publications and studies published on or after January 1st, 2010.
Eligibility
Eligible publications in this study included all types of RCTs including, but not limited to, cluster designs, crossover designs, and parallel arm studies with PROs. Other study designs lacking participant randomization including cohort, case–control, case reports, chart review, systematic reviews, meta-analyses, observational studies, animal studies, and letters to the editor were, thus, excluded. We also excluded trials with a sample size of less than 50 participants as these smaller trials are considered inadequately powered for PRO analyses (consistent with previous studies20–23), or trials involving participants with other medical conditions. According to the 2017 CHF clinical practice guidelines, pharmacological interventions are a common treatment modality for CHF. 11 Thus, in this study, only RCTs focused on pharmacological interventions were included. All other intervention modalities were excluded.
Screening
The search string returns were exported into Rayyan, 24 a systematic review screening tool, for the first screening phase, including title and abstract screening. DR independently screened the titles and abstracts of all records twice using the aforementioned criteria. All excluded records were tagged with a reason for exclusion. Any difficult or questionable records were also reviewed by a second investigator (JP) and decisions about inclusion were made jointly through discussion. Following title and abstract screening, all included studies underwent a secondary full-text screening by both investigators (DR and JP) in a masked, independent fashion. Any differences found during the screening were discussed between DR and JP and a mutual decision was made by discussion. Two third-party researchers were available for consultation; however, this process was not needed. A list of included and excluded records can be found on Open Science Framework(https://osf.io/5zyp7/?view_only=a10612e0a0b34c53802e68dad4561d7f)
Data extraction
The investigators (DR and JP) who completed the screening also performed data extraction. Using the ISOQOL reporting guidelines as a reference, 18 a Google form was developed that included all 28 checklist items, as well as the year of publication, country of first author, trial registration status, trial phase, PRO measure (quality of life, anxiety, depression, symptoms, functional status, etc.), length of PRO measure, statistical significance between treatment arms, conflict of interest, and funding statements. The inclusion of these characteristics was based on findings reported in previous studies.20–23 Both researchers completed the form for each publication independently and masked. After completing data extraction, DR and JP reconciled any differences in the extracted data. Two other investigators (MV and CW) were available for consultation; however, it was not needed. The Google form used for data extraction is available at https://osf.io/5zyp7/?view_only=a10612e0a0b34c53802e68dad4561d7f.
Data analysis
For our primary objective, sufficient completion of the ISOQOL reporting standards was determined using two different approaches. As used by Mercieca-Bebber et al., 20 our first approach considered adequate reporting as the completion of two-thirds of the ISOQOL reporting items. The second approach consisted of a tiered system to determine the level of completeness, which was based upon a previous study by Efficace et al. 25 The three tiers included mostly complete (⩾66% item completeness), partially complete (⩾33%, ⩽66% item completeness), and incomplete (⩽33% item completeness). For our secondary objective, we performed a multivariable regression analysis to determine the associations’ percent of PRO reporting completeness and RCTs length of PRO assessment, trial registration, whether PROs were primary or secondary endpoints, and whether the study was published prior to or after 2014.
Results
The initial search string returned 1114 studies. After a primary screening, 1021 studies were excluded. A secondary screening was performed resulting in the exclusion of 28 additional studies. Approximately 65 RCTs met the inclusion criteria and are included in the analysis. A summary of the exclusion process is shown in Figure 1.

Preferred Reporting Items for Systematic Reviews and Meta-Analyses diagram for included studies.
RCT characteristics
RCT characteristics are fully reported in Table 1. A PRO was reported as a primary endpoint in 29 RCTs (44.62%) and reported as a secondary endpoint in 30 RCTs (46.15%). In six of the RCTs (9.23%), the PRO was not listed as a primary or a secondary endpoint. Quality of life was the PRO that was most commonly reported with 62 of the 65 RCTs (95.38%) reporting on it. Symptoms were the second most common PRO reported with 15 RCTs (23.08%) reporting on it. In these studies, 36 RCTs (55.38%) had a statistically significant difference in the PRO. The most common PRO that had a statistical difference between the arms was quality of life (33/65, 50.77%).
Baseline characteristics of randomized controlled trials.
PRO, patient-reported outcome.
PRO measures
Within the 65 RCTs, there were 45 unique PRO measures used. The most common PRO measures used were the KCCQ with 35 RCTs (53.85%), the MLHFQ with 25 RCTs (38.46%), and the EQ-5D with 16 RCTs (24.62%). All of the RCTs reported which PRO measure they used.
Adherence to ISOQOL PRO reporting standards
Table 2 shows the adherence to ISOQOL PRO reporting standards. Of all 65 studies, the average completion of the ISOQOL reporting standards was 44.51% (SD = 17.85%). Items commonly reported include the PRO as an outcome in the abstract (61/65, 93.85%), the schedule for data collection (58/65, 89.23%), and the baseline characteristics (53/65, 81.54%). The items that were reported the least include discussing generalizability issues with the PRO measure (10/65, 15.38%) and discussing the limitations with the PRO measure (12/65, 18.46%).
ISOQOL guideline adherence.
ISOQOL, International Society for Quality of Life Research; PRO, patient-reported outcome; RCT, randomized clinical trial.
All of the 29 RCTs that had a PRO as the primary endpoint provided the PRO measurement tool or the tool was previously published. In addition, 23 of these RCTs (79.31%) specified and justified the window for valid PRO responses, 21 RCTs (72.41%) included a summary of PRO research in the introduction, and 19 RCTs (65.52%) specified which PRO is measured in the title of the paper. Only two of these RCTs (6.90%) provided a hypothesis that specified directionality and timing and four RCTs (13.79%) discussed how they addressed multiple comparisons.
Higher completion of the ISOQOL PRO standards was observed in the RCTs with PROs as primary endpoints compared to the RCTs with PROs as a secondary endpoint. The RCTs with a PRO as a primary endpoint had an average completion rate of 55.54% (SD = 17.11%). This is compared to RCTs with a PRO as a secondary endpoint with an average completion rate of 38.24% (SD = 12.33%). The lowest completion rate was seen in the RCTs that did not specify if the PRO was a primary or secondary endpoint with a completion rate of 22.55% (SD = 5.78). The item with the largest difference between primary and secondary endpoints was providing evidence for the validity and reliability of the PRO measure with 20 RCTs (68.97%) with primary endpoints and 7 RCTs (23.33%) with secondary endpoints reporting it.
Eight (8/65, 12.31%) of the RCTs met the satisfaction criteria of completing two-thirds of the ISOQOL PRO reporting standards. All of these RCTs had a PRO as a primary endpoint. These studies had an average completion rate of 76.79%. Using the tiered completeness approach, 8 (8/65, 12.31%) were mostly complete, 39 (39/65, 60.00%) were partially complete, and 18 (18/65, 27.69%) were incomplete.
Influence on adherence
The multivariable regression model showed that the percentage of completeness of ISOQOL reporting items was not significantly different between RCTs PRO duration (−3.17%, t = −0.68, p = 0.502) or registration status (7.29%, t = 1.63, p = 0.111). However, reporting completeness was significantly different between studies with primary or secondary endpoints—RCTs with PROs as the primary endpoint showed a 21.46% better completion percentage (t = 4.45, p ⩽ 0.001), when controlling for PRO recording duration and trial registration.
Discussion
Our analysis of the reporting of PROs in CHF RCTs with drug interventions suggests that the quality of reporting is suboptimal. This evidence of substandard reporting of PROs is disconcerting as it reduces the transparency of RCTs, which are considered the foundation of evidence-based medicine. 26 Inadequate reporting may result in clinicians implementing misrepresented or incomplete evidence into clinical practice. Poor reporting practices result in the loss of valuable data, diminish the utility of research findings, and ultimately impede the translation of evidence into clinical practice. This notion is supported by Chalmers and Glasziou, who identified avoidable waste in the production and reporting of research evidence, emphasizing that incomplete or unusable reports represent a significant source of inefficiency in biomedical research. Similarly, Glasziou et al. have called for improved reporting standards to reduce waste and enhance the reliability of published research findings. Our study reinforces the need for strict adherence to validated reporting guidelines, such as ISOQOL and CONSORT-PRO, to ensure that PRO data are fully and accurately reported, thus maximizing their value for clinical decision-making and policy development.27,28 Our primary finding—that PRO reporting in CHF RCTs with drug interventions showed suboptimal reporting—is supported by three similar studies across different fields of medicine that similarly used the two-third item completion threshold.20–22 These studies concluded that overall reporting of PRO findings is insufficient, noting that high-quality reporting is crucial for the appropriate interpretation and appraisal of PRO results for use in clinical practice. Therefore, improvement in PRO reporting among RCTs is warranted given their importance in clinical decision-making and healthcare policy development.
In this investigation, we found that publications with PROs as the primary endpoint were associated with a statistically significant increase in adherence to the ISOQOL reporting guidelines. Of note, secondary endpoint PROs did not mirror the statistically significant adherence to ISOQOL reporting seen with PROs as primary endpoints. While our study focused on the ISOQOL guidelines for a detailed evaluation of PRO reporting in CHF trials, we acknowledge the important role of CONSORT-PRO as a widely recognized standard for PRO reporting. Future research could benefit from a comparative analysis using both ISOQOL and CONSORT-PRO guidelines to explore the impact of these reporting standards on the quality and completeness of PRO data. This finding is considerable, as better reporting is associated with transparent and reproducible research practices, decreased bias, and improved generalizability of results. 29 Omission of PROs from clinical trials is regrettably a common occurrence.30–33 The absence of these outcomes decreases patient engagement in healthcare decision-making leading to worsening medical conditions secondary to lack of comfort with available knowledge about a drug treatment. 34 As the adoption of more patient engagement metrics is used by healthcare organizations and governments, so too should a rise be seen in RCTs reporting primary endpoint PROs, strictly adhering to ISOQOL and other PRO reporting guidelines.
Given that RCTs have begun to increasingly use PROs along with safety and efficacy data for drug approval, an expected increase would be seen in the quality of the PROs as more analyses are performed. However, the results of studies examining the quality of PROs over the years are inconclusive. The findings of our study of PROs in RCTs for CHF noted that there were no statistically significant differences in reporting quality over time, mirroring the findings of Mercieca-Bebber et al. 20 Conversely, Rees et al. 23 report that certain criteria (the clinical significance of PRO findings and information about instrument administration) have increased over time. Both studies indicate that the overall quality of reporting of PROs remains inadequate. Both the Federal Drug Administration and the European Medicine Agencies have issued guidance to trialists on the implementation and interpretation of PROs used to endorse drug approval and appropriate pharmaceutical labeling.35,36 Yet, neither of these organizations mentions the use of validated reporting quality guidelines, such as the ISOQOL. We recommend that as PROs become more commonplace, a greater emphasis should be placed on these PROs adhering to externally verifiable reporting guidelines, thus ensuring the reliability of these outcomes on patients and the public.
Lingering concerns for wider adoption of PROs is clinician understanding of the results and integration into clinical decision-making. While PRO measures have been shown to increase clinician time spent discussing these outcomes with patients, 37 studies have failed to show an influence on treatment selection. 38 Reasons for this finding include unfamiliarity with PRO research design, skepticism about the generalizability of the results, and lack of clinician time and resources. These questions have led to numerous published articles meant to aid clinicians in the interpretation of PRO results. Still, this places the burden upon clinicians to familiarize themselves with differing methods of deconstructing PROs to make sense of true clinical benefit, which complicates matters rather than clarifying them. For common diseases, such as CHF, sifting through the multitude of PROs included within RCTs is not a feasible task for the busy clinician. A solution to this quandary is the universal implementation of verified reporting quality guidelines, like the ISOQOL, when submitting a manuscript or drug approval. This mandate would encourage trialists to observe the items underpinning a high-quality PRO, as well as provide a score based on the number of items met that is easily discernible to clinicians.
Strengths and limitations
Strengths of our study include the data extraction method, in which all studies were coded independently and blinded by two researchers, increasing the reproducibility of our results. We selected the ISOQOL guideline as this is an internationally recognized measure of PROs that has been the basis for the development of other PRO reporting standards such as the CONSORT-PRO extension.18,20,39 In addition, we elected to include multiple PRO types rather than focusing only on health-related quality of life. Limitations of our study include the exclusion of non-English RCTs which may fail to account for all RCTs evaluating drug interventions for CHF, but the effect of their absence has been suggested to be non-significant. 40 In addition, the ISOQOL-PRO reporting standards may be a source of subjectivity in the interpretation of the reporting items, such as the reporting of PRO statistical analysis and missing data, which requires some interpretation to determine the quality of reporting. Lastly, we only used PubMed as the source for identifying relevant records. While PubMed is a comprehensive and widely recognized database for biomedical literature, this approach may have excluded relevant studies indexed in other databases such as Embase, Scopus, or the Cochrane Library. This could potentially limit the breadth and diversity of our included studies, leading to a narrower scope of findings.
Conclusion
Overall, PRO reporting in RCTs of pharmaceutical interventions for CHF remains suboptimal. The underreporting of PROs may lead to significant errors in interpreting data from these studies, which ultimately will impact patient care. It is vital that the results of PROs in RCTs are reported at the highest quality for not only patient safety but also for the practice of evidence-based medicine. This problem may be solved by a collective effort from researchers, clinicians, and journal editors to adhere to internally recognized standards for reporting PROs such as the ISOQOL guidelines. Together, the use of PRO quality guidelines optimizes research and positively facilitates the implementation of RCT PROs into clinical practice.
