Sage Journals: Discover world-class research

Abstract

Background/Aims:

The Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE^®) was developed to capture symptomatic adverse events from the patient perspective. We aim to describe statistical properties of PRO-CTCAE items and summary scores and to provide evidence for recommendations regarding PRO-CTCAE administration and reporting.

Methods:

Using data from the PRO-CTCAE validation study (NCT02158637), prevalence, means, and standard deviations of PRO-CTCAE items, composite scores, and mean and maximum scores across attributes (frequency, severity, and/or interference) of symptomatic adverse events were calculated. For each adverse event, correlations and agreement between attributes, correlations between attributes and composite scores, and correlations between composite, mean, and maximum scores were estimated.

Results:

PRO-CTCAE items were completed by 899 patients with various cancer types. Most patients reported experiencing one or more adverse events, with the most prevalent being fatigue (87.7%), sad/unhappy feelings (66.0%), anxiety (63.6%), pain (63.2%), insomnia (61.8%), and dry mouth (60.0%). Attributes were moderately to strongly correlated within an adverse event (r = 0.53 to 0.77, all p < 0.001) but not fully concordant (κ_weighted = 0.26 to 0.60, all p < 0.001), with interference demonstrating lowest mean scores and prevalence among attributes of the same adverse event. Attributes were moderately to strongly correlated with composite scores (r = 0.67 to 0.97, all p < 0.001). Composite scores were moderately to strongly correlated with mean and maximum scores for the same adverse event (r = 0.69 to 0.94, all p < 0.001). Correlations between composite scores of different adverse events varied widely (r = 0.04 to 0.68) but were moderate to strong for conceptually related adverse events.

Conclusions:

Results provide evidence for PRO-CTCAE administration and reporting recommendations that the full complement of attributes be administered for each adverse event, and that attributes as well as summary scores be reported.

Keywords

Patient-reported outcomes adverse events cancer symptoms tolerability

Background/aims

The National Cancer Institute’s (NCI) Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE^®) is a library of 124 items, measuring 78 symptomatic adverse events (AEs) from the patient perspective. The NCI contracted development of this measure as a companion to the Common Terminology Criteria for Adverse Events (CTCAE now in version 5) for use in cancer clinical trials.¹ PRO-CTCAE is a valid and patient-centric method for capturing symptomatic AEs, including those that may be missed or underreported by clinicians.^2,3 Because of the unique structure for grading AEs according to the CTCAE, each PRO-CTCAE symptomatic AE is assessed using one to three self-report items, measuring the attributes of presence, amount, frequency, severity, and/or interference with usual or daily activities. For example, pain is measured using three items for pain frequency, severity, and interference with usual or daily activities. The standard recall period for PRO-CTCAE is the past 7 days. The composite score for each AE can be produced algorithmically based on individual PRO-CTCAE items; this composite scoring algorithm was designed with clinical investigator input and is structured and scaled to resemble how clinicians would grade the same AE.⁴ The composite score is designed to aid analysis and interpretation of PRO-CTCAE by providing a single summary score of each patient-reported symptomatic AE.⁴

This secondary analysis of prospectively collected data was designed to evaluate the statistical properties (including distributional characteristics, correlations, and agreement) of PRO-CTCAE items and summary scores using data from patients with a variety of cancer types, a majority of whom had recently received cancer treatment. Previous work with this dataset has established the validity and reliability of PRO-CTCAE items and response scales, reported how items correlate with anchors from other patient-reported instruments, established mode equivalence, compared recall periods, and developed a rubric for an overall AE composite score.^3–8 In this study we aimed to address questions related to administering, analyzing, and reporting PRO-CTCAE, including: how attribute items relate to each other and the extent to which each attribute contributes unique information; how summary scores relate to individual items; and how symptomatic AEs relate to each other. To address these questions, we investigated the statistical properties of individual PRO-CTCAE item scores and summary scores, including: prevalence of AEs, as determined by attribute score > 0 and by composite score > 0; distributional characteristics of items and summary scores; correlations and agreement between attributes for each AE; correlations between each attribute and the composite score for each AE; correlations between the composite score and other summary scores for each AE; and correlations between AEs.¹ Knowledge of these statistical properties can provide empirical insights to inform trial design, analysis and results reporting in studies that include PRO-CTCAE.

Method

Data used in this study were originally collected at nine centers as part of a large validation study of the PRO-CTCAE (ClinicalTrials.gov: NCT02158637).^3–9 English-speaking adult patients with a solid tumor or hematologic malignancy who were receiving or initiating chemo- and/or radiation therapy were eligible to participate.⁹ This study uses baseline data collected at the first study visit. Most participating patients received prior cancer treatment or were on active therapy at the time of enrollment. Included in the current study are items from a set of 20 symptomatic AEs that were administered to all participants in the validation sample. These 20 symptom terms reflect symptoms that are commonly reported by patients with a variety of cancer sites and treatment types, specifically anxiety, arm or leg swelling, constipation, decreased appetite, dry mouth, fatigue, hair loss, headache, insomnia, diarrhea, mouth or throat sores, nausea, numbness/tingling in hands/feet, pain, concentration problems, taste problems, rash, sad/unhappy feelings, shortness of breath, vomiting, and feeling like nothing could cheer you up. These 20 AEs were measured by one to three attributes, for a total of 41 items. More information about the PRO-CTCAE is available at https://healthcaredelivery.cancer.gov/pro-ctcae/.

Frequency, severity, interference, and amount are measured on a 0 (Never/None/Not at all) to 4 (Almost constantly/Very severe/Very much) scale. Presence is measured on a 0/1 (yes/no) scale. Branching logic was employed so that if a participant indicated they were not experiencing a specific AE, that is, frequency was 0, questions about severity and interference were skipped; a zero value was imputed for the conditionally skipped items. Composite scores were determined based on AE attributes, as previously described.⁴ Composite scores are scaled from 0 to 3, with higher scores representing worse symptoms. Mean and maximum scores across attributes for the same AE were calculated by patient for AEs with two or more attributes. Composite scores, mean scores, and maximum scores were computed for all respondents with complete data for the AE, meaning all items not conditionally skipped were answered.

Item scores and composite scores were descriptively summarized using means and standard deviations. Prevalence of score > 0 was summarized as frequency and percentage. Rash presence is a dichotomous item with no composite score and was descriptively summarized as frequency and percentage of item prevalence; rash was not included in correlation or agreement analyses. For each AE, Pearson correlation coefficients were calculated between attributes (frequency, severity, and/or interference), between composite scores and each attribute (amount, frequency, severity, and/or interference), and between composite score, mean score, and maximum score among patients with complete data for all administered attributes for the AE who had a score > 0 on one or more attribute. As a supplemental analysis, Pearson and Spearman correlation coefficients were calculated among patients with complete data for all administered attributes of the AE, without the limitation of a score > 0 on an attribute. As correlation coefficients are inflated by the inclusion of a large number of 0 scores, this analysis is presented in the supplement and for the primary analysis the sample was restricted to patients with scores > 0 on one or more AE attributes. Agreement between attributes by AE was assessed using weighted kappa coefficients, among patients with a score > 0 on one or more attribute. Coefficients > 0.40 were interpreted as moderate agreement.¹⁰ In addition, Pearson and Spearman correlation coefficients were calculated between composite scores for AEs, and the magnitude of correlation was graphically summarized (Pearson correlation coefficients only). Correlations ≥ 0.50 were considered moderate to strong in magnitude.^11,12

Results

Of the 940 patients enrolled in the validation study, 899 completed one or more PRO-CTCAE items of interest at baseline and were included in the analysis. At the time of survey, most patients (743/899, 82.6%) had an Eastern Cooperative Oncology Group Performance Status (ECOG PS) of 0–1 and most (737/883, 83.5%) had received treatment (chemotherapy, radiation therapy, and/or surgery) in the past 2 weeks (Table 1).

Table 1.

Demographics for N = 899 patients included in the analysis.

Characteristic	n (%)
Age in years, mean (SD)	58.3 (12.86)
Female	509 (56.6%)
Race
White	648 (73.0%)
Black or African American	191 (21.5%)
Native Hawaiian or Other Pacific Islander	5 (0.6%)
Asian	41 (4.6%)
American Indian or other Pacific Islander	2 (0.2%)
Multiple races reported	1 (0.1%)
Unknown/not reported	11
Ethnicity
Hispanic/Latino	53 (6.3%)
Non-Hispanic	795 (93.8%)
Unknown/Not Reported	51
Education level
Less than high school	67 (7.6%)
High school or GED	224 (25.5%)
Some college	195 (22.2%)
College graduate or more	393 (44.7%)
Missing	20
Disease
Breast	242 (26.9%)
Lung/head/neck	321 (35.7%)
Gastrointestinal	88 (9.8%)
Genitourinary or gynecologic	165 (18.4%)
Hematologic	46 (5.1%)
Other/unknown	37 (4.1%)
ECOG performance status 2–4 at Visit 1	156 (17.4%)
Chemotherapy in past 2 weeks
Yes	492 (55.7%)
No	391 (44.3%)
Missing	16
Radiation therapy in past 2 weeks
Yes	418 (47.4%)
No	463 (52.6%)
Missing	18
Surgery in past 2 weeks
Yes	33 (3.7%)
No	848 (96.3%)
Missing	18
Any treatment in past 2 weeks
Yes	737 (83.5%)
No	146 (16.5%)
Missing	16

Table 2 reports attribute-level means, standard deviations, and prevalences (percentage of patients with score > 0) for each of the 20 symptomatic AEs examined in this study. Item means were generally low (across all attributes, 31.7% (13/41) of item means were ≥ 1). For AEs with multiple attributes, interference consistently had the lowest mean score and lowest prevalence among the attributes (Table 2). For AEs measured by frequency and/or severity as well as interference, only rarely were mean interference scores higher than the scores of other attributes. Pain and fatigue were the symptomatic AEs with the greatest proportion of participants who rated interference higher than they did frequency and/or severity, (97/899 [10.8%] and 119/899 [13.2%] of participants, respectively) (Table S1).

Table 2.

PRO-CTCAE attribute scores, composite, mean and maximum scores, and prevalence rates.

Symptomatic adverse event^a	n ^b	Mean (SD)						Prevalence rate (score > 0), n (%)
		Frequency	Severity	Interference	Composite score	Mean score	Maximum score	Frequency	Severity	Interference	Composite Score
Anxiety	837	1.2 (1.1)	1.0 (1.0)	0.7 (1.1)	1.0 (1.0)	1.0 (1.0)	1.3 (1.2)	547 (65.4%)	530 (63.3%)	330 (39.4%)	532 (63.6%)
Arm or leg swelling	827	0.6 (1.1)	0.5 (0.9)	0.3 (0.8)	0.4 (0.8)	0.4 (0.9)	0.6 (1.2)	219 (26.5%)	216 (26.1%)	126 (15.2%)	216 (26.1%)
Constipation	847	—	0.9 (1.1)	—	0.9 (1.1)	—	—	—	421 (49.7%)	—	421 (49.7%)
Decreased appetite	840	—	1.0 (1.1)	0.7 (1.1)	0.8 (0.9)	0.9 (1.1)	1.0 (1.2)	—	427 (50.8%)	309 (36.8%)	427 (50.8%)
Dry mouth	842	—	1.0 (1.1)	—	1.0 (1.0)	—	—	—	—	505 (60.0%)	505 (60.0%)
Fatigue	875	—	1.7 (1.1)	1.6 (1.2)	1.5 (0.9)	1.6 (1.1)	1.8 (1.2)	—	767 (87.7%)	678 (77.5%)	767 (87.7%)
Hair loss^c	815	1.1 (1.5)	—	—	0.6 (0.8)	—	—	344 (42.2%)	—	—	344 (42.2%)
Headache	836	0.7 (0.9)	0.7 (0.9)	0.5 (0.8)	0.7 (0.9)	0.6 (0.8)	0.8 (1.1)	390 (46.7%)	384 (45.9%)	240 (28.7%)	384 (45.9%)
Insomnia	853	—	1.1 (1.1)	0.9 (1.1)	0.9 (0.9)	1.0 (1.1)	1.2 (1.2)	—	527 (61.8%)	413 (48.4%)	527 (61.8%)
Diarrhea	817	0.8 (1.0)	—	—	0.6 (0.7)	—	—	391 (47.9%)	—	—	391 (47.9%)
Mouth or throat sores	822	—	0.4 (0.8)	0.3 (0.8)	0.4 (0.7)	0.3 (0.8)	0.4 (0.9)	—	206 (25.1%)	120 (14.6%)	206 (25.1%)
Nausea	834	0.9 (1.1)	0.8 (1.0)	—	0.7 (0.9)	0.8 (1.0)	0.9 (1.1)	402 (48.2%)	387 (46.4%)	—	402 (48.2%)
Numbness/tingling in hands/feet	846	—	0.6 (0.9)	0.4 (0.9)	0.5 (0.8)	0.5 (0.9)	0.7 (1.0)	—	332 (39.2%)	207 (24.5%)	332 (39.2%)
Pain	848	1.3 (1.3)	1.2 (1.2)	1.0 (1.3)	1.2 (1.1)	1.2 (1.2)	1.5 (1.4)	539 (63.6%)	535 (63.1%)	424 (50.0%)	536 (63.2%)
Concentration problems	837	—	0.7 (0.9)	0.6 (0.9)	0.6 (0.8)	0.6 (0.9)	0.8 (1.0)	—	393 (47.0%)	306 (36.6%)	393 (47.0%)
Taste problems	839	—	0.9 (1.2)	—	0.9 (1.1)	—	—	—	395 (47.1%)	—	395 (47.1%)
Rash^d	821	—	—	—	—	—	—	144 (17.5%)	—	—	—
Sad/unhappy feelings	840	1.1 (1.0)	1.0 (0.9)	0.6 (1.0)	1.0 (0.9)	0.9 (0.9)	1.2 (1.1)	576 (68.6%)	552 (65.7%)	337 (40.1%)	554 (66.0%)
Shortness of breath	832	—	0.7 (0.9)	0.6 (1.0)	0.6 (0.8)	0.6 (0.9)	0.8 (1.0)	—	367 (44.1%)	276 (33.2%)	367 (44.1%)
Vomiting	826	0.3 (0.7)	0.3 (0.7)	—	0.3 (0.6)	0.3 (0.7)	0.3 (0.8)	147 (17.8%)	138 (16.7%)	—	147 (17.8%)
Nothing could cheer you up	815	0.7 (1.0)	0.6 (0.9)	0.5 (0.9)	0.6 (0.9)	0.6 (0.9)	0.7 (1.1)	322 (39.5%)	296 (36.3%)	229 (28.1%)	299 (36.7%)

Dashes are given where an attribute does not exist for the AE.

Range of attributes, mean, and maximum score is 0–4. Range of composite score is 0–3.

AE overall sample size is the number of complete sets of administered attributes. Patients were excluded from summary if they were missing one or more attribute for the AE. A zero value was imputed for conditionally skipped items, thus these conditionally skipped items were not considered missing.

Amount of hair loss.

Presence of rash.

Composite score means ranged from 0.3 to 1.5, with only 5 AEs demonstrating mean composite scores of 1 or greater (Table 2). The prevalence of composite scores > 0 ranged from 17.8% (vomiting) to 87.7% (fatigue), with the most prevalent AEs being fatigue (767/875, 87.7%), sad/unhappy feelings (554/840, 66.0%), anxiety (532/837, 63.6%), pain (536/848, 63.2%), insomnia (527/853, 61.8%), and dry mouth (505/842, 60.0%) (Table 2). Results for mean and maximum scores were similar to those of composite scores (Table 2). Composite scores were more similar to mean scores than to maximum scores. For all AEs, the mean and standard deviation of the composite scores never exceeded those of the maximum scores.

Among patients experiencing an AE (i.e. respondents with at least one PRO-CTCAE attribute score > 0), attributes were all moderately to strongly positively correlated, with correlation coefficients ranging from 0.53 to 0.77 (p < 0.001 for all; Table 3). In AEs with all three attributes, correlations between the interference and severity items were consistently numerically stronger than those between the interference and frequency items (Table 3). Agreement between attributes was fair to moderate, with weighted kappa coefficients ranging from 0.26 to 0.60 (p < 0.001 for all; Table 4). In AEs with three attributes, agreement was consistently numerically highest between frequency and severity and lowest between frequency and interference. Individual attribute scores were moderately to strongly correlated with the composite score (r = 0.67 to 0.97), severity and interference attribute scores demonstrated stronger correlations with the composite score than did frequency attribute scores (Table 3). Composite scores were also moderately to strongly correlated with both mean and maximum scores (r = 0.69 to 0.94); compared to maximum scores, mean scores demonstrated stronger correlation with the composite score (Table 3). Mean and maximum scores also demonstrated strong correlations (r = 0.78 to 0.96) (Table 3). The supplemental analysis based on all patients (including those with all attribute scores equal to 0) yielded uniformly higher correlations, as expected. Pearson and Spearman correlations were similar (Tables S2, S3, and S4).

Table 3.

Correlations between attributes and summary scores (composite, mean, maximum) of adverse events among respondents with any attribute score > 0.

Symptomatic adverse event	n ^a	Correlation^b between attributes			Correlation^b between composite score and frequency, severity, interference			Correlation^b between composite score, mean score, maximum score
		Frequency/severity	Frequency/interference	Severity/interference	Composite/frequency	Composite/severity	Composite/interference	Composite/mean	Composite/maximum	Mean/maximum
Anxiety	547	0.73	0.65	0.68	0.74	0.91	0.79	0.91	0.83	0.92
Arm or leg swelling	219	0.69	0.53	0.59	0.74	0.87	0.76	0.91	0.79	0.89
Constipation	421	—	—	—	—	0.97	—	—	—	—
Decreased appetite	427	—	—	0.62	—	0.77	0.88	0.92	0.82	0.92
Dry mouth	505	—	—	—	—	0.97
Fatigue	767	—	—	0.77	—	0.88	0.89	0.94	0.87	0.96
Hair loss	344	—	—	—	0.93^c	—	—	—	—	—
Headache	390	0.60	0.56	0.60	0.67	0.89	0.75	0.90	0.80	0.90
Insomnia	527	—	—	0.63		0.79	0.87	0.92	0.82	0.93
Diarrhea	391	—	—	—	0.84	—	—	—	—	—
Mouth or throat sores	206	—	—	0.74	—	0.83	0.88	0.92	0.87	0.96
Nausea	403	0.65	—	—	0.77	0.88	—	0.91	0.78	0.95
Numbness/tingling in hands/feet	333	—	—	0.59	—	0.77	0.83	0.90	0.79	0.92
Pain	539	0.75	0.68	0.76	0.77	0.90	0.83	0.92	0.85	0.93
Concentration problems	393	—	—	0.66	—	0.83	0.82	0.90	0.82	0.93
Taste problems	395	—	—	—	—	0.97	—	—	—	—
Sad/unhappy feelings	576	0.71	0.61	0.65	0.73	0.93	0.74	0.90	0.80	0.90
Shortness of breath	367	—	—	0.74	—	0.86	0.87	0.92	0.87	0.94
Vomiting	147	0.68	—	—	0.83	0.85	—	0.92	0.80	0.95
Nothing could cheer you up	322	0.70	0.61	0.67	0.78	0.91	0.89	0.92	0.69	0.78

Some AEs are assessed using a single attribute (constipation, dry mouth, hair loss, diarrhea, taste problems). For these AEs, the only correlation computed is between the single attribute and the composite score. In these cases and others, dashes are given where an attribute or summary score does not exist for the AE. Rash presence is a dichotomous item with no composite score and is omitted from this table.

All participants with one or more attribute score > 0 were included in each analysis.

Pearson correlation coefficient. All correlation coefficients significantly differ from 0 (p < 0.001).

Correlation between composite score and hair loss amount.

Table 4.

Agreement among PRO-CTCAE attribute pairs in respondents with any attribute score > 0.

Symptomatic adverse event	n ^a	Weighted Kappa coefficient (95% CI)^b
		Frequency/severity	Frequency/interference	Severity/interference
Anxiety	547	0.57(0.52, 0.62)	0.30(0.26, 0.34)	0.38(0.34, 0.43)
Arm or leg swelling	219	0.45(0.37, 0.53)	0.26(0.20, 0.33)	0.33(0.26, 0.41)
Decreased appetite	427	—	—	0.39(0.34, 0.44)
Fatigue	767	—	—	0.60(0.56, 0.64)
Headache	390	0.47(0.39, 0.54)	0.26(0.21, 0.31)	0.29(0.23, 0.35)
Insomnia	527	—	—	0.43(0.38, 0.48)
Mouth or throat sores	206	—	—	0.38(0.31, 0.45)
Nausea	403	0.50(0.43, 0.56)	—	—
Numbness/tingling in hands/feet	333	—	—	0.30(0.24, 0.37)
Pain	539	0.60(0.56, 0.65)	0.43(0.39, 0.48)	0.52(0.48, 0.56)
Concentration problems	393	—	—	0.43(0.38, 0.51)
Sad/unhappy feelings	576	0.54(0.48, 0.59)	0.27(0.22, 0.31)	0.36(0.31, 0.41)
Shortness of breath	367	—	—	0.50(0.44, 0.56)
Vomiting	147	0.53(0.43, 0.64)	—	—
Nothing could cheer you up	322	0.54(0.47, 0.61)	0.34(0.28, 0.41)	0.50(0.44, 0.57)

Dashes are given where an attribute does not exist for the AE.

All participants with one or more attribute scores > 0 were included in each analysis.

All Kappa coefficients significantly differ from 0 (p < 0.001).

Figure 1 visualizes correlations between composite scores, which ranged from 0.04 to 0.68 (Table S4). The strongest correlations were seen between conceptually related or mechanistically co-occurring AEs, specifically sad/unhappy feelings and feeling nothing could cheer you up (r = 0.68), anxiety and sad/unhappy feelings (r = 0.65), anxiety and feeling nothing could cheer you up (r = 0.56), nausea and vomiting (r = 0.60), decreased appetite and taste changes (r = 0.55), and fatigue and pain (r = 0.51) (Table S4). Comparable results were seen for correlations between mean scores and between maximum scores.

Figure 1.

Correlations between composite scores. Sample size is the number of non-missing pairs, ranging from n = 805 to 842. Shade and size of circles show strength of correlation, with stronger correlations represented as larger, darker circles and weaker correlations represented as smaller, lighter circles. Table S4 presents values of correlation coefficients visualized in this figure. Rash is a dichotomous AE with no composite score. As such, it was not included in this analysis.

Conclusions

Prior work established the validity and reliability of PRO-CTCAE and developed a composite scoring algorithm to yield a single numerical score for AEs.^3,4 In this report we used data from the PRO-CTCAE validation study to extend those efforts by examining the prevalence, distributional characteristics, correlations, and agreement of attribute and summary scores in a diverse cohort of cancer patients, most of whom were undergoing treatment or had received treatment prior to study enrollment. Results presented here derive from data used to validate the PRO-CTCAE item library and can be used as normative or reference values, with the understanding that data were gathered in the setting of a diverse cancer patient population receiving or initiating chemo- and/or radiation therapy. We have described relationships among AE attributes and relationships between AE attributes and summary scores. These analyses can be applied to inform trial design, analysis and results reporting, and to strengthen the interpretability of PRO-CTCAE as a measure of tolerability.

While summary metrics like the composite score allow for succinct reporting of symptomatic AEs, descriptive analysis of individual attributes in addition to the summary score is also warranted and can provide a granular understanding of the burden of symptomatic AEs, particularly for AEs that are important to understanding the tolerability of a treatment regimen being studied. This recommendation is in line with the United States Food and Drug Administration guidance on PRO reporting, namely that summary scores and individual item scores be reported in tables and visualizations.¹³ The distinct attributes of an AE can be differentially impacted by treatment, a nuance that may be missed when summary scores alone are reported.¹⁴ Researchers should note that prevalence (as determined by score > 0) differs depending on whether prevalence is based on the individual attributes of frequency, severity, interference, or a composite score. An important result to highlight is that while PRO-CTCAE attributes are correlated, they are not identical. This finding supports the recommendation to use the complete set of attributes for an AE (and to conserve respondent burden by using conditional branching logic) rather than administering only a single item for assessment, as each attribute contributes unique information about the symptomatic AE and its contribution to conclusions about tolerability. Clinicians and researchers may be hesitant to include multiple attributes due to concerns about patient burden; however, patient burden can be alleviated by judicious selection of symptomatic AEs for monitoring and thoughtful determination of the intervals for PRO-CTCAE assessments within a given trial design.¹⁵ Selecting for surveillance only those AEs most relevant to the treatment, and scheduling assessments to align with the anticipated onset and offset of symptomatic AEs can help manage patient burden, as assessments that are perceived to be relevant by patients may be less burdensome.¹⁶

Composite scores were strongly correlated with mean scores and maximum scores (all correlations ≥ 0.78); however each of these summary scores (composite score, mean score, maximum score) may have value in analysis and reporting. Further study is needed to understand the strengths and limitations of these different approaches with respect to individual- and group-level interpretability and change over time. Composite scores, mean scores, and maximum scores may each offer distinct advantages as summary measures. As previously described, composite scores were designed to align with CTCAE grades and to have a clinically meaningful interpretation.⁴ For example, in contrast to mean scores, composite scores and maximum scores are always integers, which may be more interpretable. In contrast to mean scores which weight all attributes equally and maximum scores which assign greatest weight to whichever attribute has the highest score, composite scores reflect clinicians’ interpretation of each unique combination of attribute scores. For example, a patient reporting frequency = 4, severity = 1, and interference = 0 would be assigned a composite score = 1, mean score = 1.67, and maximum score = 4. Composite scores for different AEs were all positively correlated, although most correlations were weak and very few pairs were more than moderately correlated, supporting the distinctness of the AEs assessed by PRO-CTCAE. Low correlation between conceptually unrelated AEs was expected. Our results showing that some AEs were comparatively more strongly correlated is intriguing, but is not a robust exploration of symptom clustering, a concept worth further investigation.

Strengths of this study include the large and diverse sample (with respect to demographic and clinical characteristics, cancer type, and treatment modality). The sample included patients with varied previous treatments and was enriched for patients with impaired performance status, meaning we were able to include patients with substantial symptom burden.

A limitation of this study is that analyses addressed only 20 of the 78 symptomatic AEs included in the PRO-CTCAE item library; further study is needed to determine if comparable results are seen with other symptomatic AEs such as cough, urinary urgency and frequency, or fecal incontinence. However, the AEs included in this analysis reflect the most common, cross-cutting AEs reported by patients with a variety of cancer types enrolled in clinical trials.¹⁷ The data used in this analysis were drawn from a study that was designed as a validation study, and as such, in this diverse sample, participants were asked about many AEs that were not necessarily relevant to their treatment experiences. As a result, the proportions of zero scores and low scores were greater than what might be seen in a more homogeneous patient population asked about a set of AEs targeting a specific disease or treatment. Thus, to confirm our observations, these findings should be replicated in different settings. This study made use of only one time point, as such, we have not investigated associations at multiple predefined points during treatment or change over time. Lastly, although PRO-CTCAE is now widely available in more than 50 languages, the validation study from which our data are drawn was conducted only with English-speaking patients.

This study describes the statistical properties of PRO-CTCAE in a large sample of patients undergoing cancer treatment. Knowledge of these properties can be applied in future research to guide study design, analysis, and results reporting. Importantly, these results support current recommendations to include the full complement of attributes for each AE and to report attributes as well as summary scores.

Supplemental Material

sj-docx-1-ctj-10.1177_17407745241286065 – Supplemental material for Statistical properties of items and summary scores from the Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE®) in a diverse cancer sample

Supplemental material, sj-docx-1-ctj-10.1177_17407745241286065 for Statistical properties of items and summary scores from the Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE®) in a diverse cancer sample by Carolyn Mead-Harvey, Ethan Basch, Lauren J Rogak, Blake T Langlais, Gita Thanarajasingam, Brenda F Ginos, Minji K Lee, Claire Yee, Sandra A Mitchell, Lori M Minasian, Tito R Mendoza, Antonia V Bennett, Deborah Schrag, Amylou C Dueck and Gina L Mazza in Clinical Trials

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Cancer Institute of the National Institutes of Health [Award No. U01CA233046, Analyzing and Interpreting PRO-CTCAE with CTCAE and Other Clinical Data to Characterize Drug Tolerability]; data collection for the Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE) validation study was supported by the National Cancer Institute [Contract HHSN261200800043C].

Disclaimer

The opinions expressed in this article are the authors’ own and do not reflect the view of the National Institutes of Health, the US Department of Health and Human Services, or the United States government.

Trial registration number

ClinicalTrials.gov: NCT02158637.

ORCID iDs

Ethan Basch

Lauren J Rogak

Antonia V Bennett

Amylou C Dueck

Supplemental material

Supplemental material for this article is available online.

References

Basch

Reeve

Mitchell

, et al. Development of the National Cancer Institute’s patient-reported outcomes version of the common terminology criteria for adverse events (PRO-CTCAE). J Natl Cancer Inst 2014; 106(9): dju244.

Basch

Jia

Heller

, et al. Adverse symptom event reporting by patients vs clinicians: relationships with clinical outcomes. J Natl Cancer Inst 2009; 101(23): 1624–1632.

Dueck

Mendoza

Mitchell

, et al. Validity and reliability of the US National Cancer Institute’s Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE). JAMA Oncol 2015; 1(8): 1051–1059.

Basch

Becker

Rogak

, et al. Composite grading algorithm for the National Cancer Institute’s Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE). Clin Trials 2021; 18(1): 104–114.

Atkinson

Hay

Dueck

, et al. What do “none,”“mild,”“moderate,”“severe,” and “very severe” mean to patients with cancer? Content validity of PRO-CTCAE™ response scales. J Pain Symptom Manage 2018; 55(3): e3–e6.

Bennett

Dueck

Mitchell

, et al. Mode equivalence and acceptability of tablet computer-, interactive voice response system-, and paper-based administration of the U.S. National Cancer Institute’s Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE). Health Qual Life Outcomes 2016; 14: 24.

Mendoza

Dueck

Bennett

, et al. Evaluation of different recall periods for the US National Cancer Institute’s PRO-CTCAE. Clin Trials 2017; 14(3): 255–263.

Lee

Basch

Mitchell

, et al. Reliability and validity of PRO-CTCAE® daily reporting with a 24-hour recall period. Qual Life Res 2023; 32(7): 2047–2058.

Validation study of the Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE). ClinicalTrials.gov identifier: NCT02158637. https://www.clinicaltrials.gov/study/NCT02158637 (2017, accessed 26 July 2023).

10.

Landis

Koch

GG.

The measurement of observer agreement for categorical data. Biometrics 1977; 33(1): 159–174.

11.

Cohen

Statistical power analysis for the behavioral sciences. New York: Academic Press, 2013.

12.

Moore

Notz

Flinger

. The basic practice of statistics. 6th ed. New York: W. H. Freeman and Company, 2013.

13.

U.S. Food and Drug Association. Submitting patient-reported outcome data in cancer clinical trials: guidance for industry. Silver Spring, MD: U.S. Food and Drug Association, 2023.

14.

Liu

Hillman

Frith

, et al. Randomized phase III trial of eribulin (E) versus standard weekly paclitaxel (P) as first- or second-line therapy for locally recurrent or metastatic breast cancer (MBC). J Clin Oncol 2020; 38(15_suppl): 1016.

15.

U.S. Food and Drug Association. Core patient-reported outcomes in cancer clinical trials: draft guidance for industry. Silver Spring, MD: U.S. Food and Drug Association, 2021.

16.

Aiyegbusi

Cruz Rivera

Roydhouse

, et al. Recommendations to address respondent burden associated with patient-reported outcome assessment. Nat Med 2024; 30(3): 650–659.

17.

Reilly

Bruner

Mitchell

, et al. A literature synthesis of symptom prevalence and severity in persons receiving active cancer treatment. Support Care Cancer 2013; 21(6): 1525–1550.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.04 MB

Statistical properties of items and summary scores from the Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE ® ) in a diverse cancer sample

Abstract

Background/Aims:

Methods:

Results:

Conclusions:

Keywords

Background/aims

Method

Results

Conclusions

Supplemental Material

sj-docx-1-ctj-10.1177_17407745241286065 – Supplemental material for Statistical properties of items and summary scores from the Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE®) in a diverse cancer sample

Footnotes

Declaration of conflicting interests

Funding

Disclaimer

Trial registration number

ORCID iDs

Supplemental material

References

Supplementary Material