Abstract
Introduction:
US National Cancer Institute’s (NCI) Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE®) is a library of 78 symptom terms and 124 items enabling patient reporting of symptomatic adverse events in cancer trials. This multicenter study used mixed methods to develop an Italian language version of this widely accepted measure, and describe the content validity and reliability in a diverse sample of Italian-speaking patients.
Methods:
All PRO-CTCAE items were translated in accordance with international guidelines. Subsequently, the content validity of the PRO-CTCAE-Italian was explored and iteratively refined through cognitive debriefing interviews. Participants (n=96; 52% male; median age 64 years; 26% older adults; 18% lower educational attainment) completed a PRO-CTCAE survey and participated in a semi-structured interview to determine if the translation captured the concepts of the original English language PRO-CTCAE, and to evaluate comprehension, clarity and ease of judgement. Test-retest reliability of the finalized measure was explored in a second sample (n=135).
Results:
Four rounds of cognitive debriefing interviews were conducted. The majority of PRO-CTCAE symptom terms, attributes and associated response choices were well-understood, and respondents found the items easy to judge. To improve comprehension and clarity, the symptom terms for nausea and pain were rephrased and retested in subsequent interview rounds. Test-retest reliability was excellent for 41/49 items (84%); the median intraclass correlation coefficient was 0.83 (range 0.64-0.94).
Discussion:
Results support the semantic, conceptual and pragmatic equivalence of PRO-CTCAE-Italian to the original English version, and provide preliminary descriptive evidence of content validity and reliability.
Keywords
Background
The National Cancer Institutes’s (NCI’s) Common Terminology Criteria for Adverse Events (CTCAE) is an internationally accepted system for clinician grading and reporting of adverse events (AEs) in cancer clinical trials. 1 However, a clinician report may underestimate the prevalence and severity of symptomatic adverse events.2-4 Inclusion of direct patient self-reporting of symptomatic adverse events using patient-reported outcomes (PROs) in oncology has been widely endorsed as complementary to clinician-reporting,5-11 thereby improving precision and reliability in the reporting of symptomatic AEs.12-14
To facilitate patient self-reporting of symptomatic adverse events in cancer clinical trials, NCI has developed the Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE®).15,16 PRO-CTCAE is an item library comprised of 124 self-report items reflecting 78 symptomatic AEs drawn from the CTCAE. Each symptomatic AE is evaluated using a combination of one to three attributes (frequency, amount, severity, interference, presence/absence). A multi-site cognitive interviewing study found the English language PRO-CTCAE items to be well comprehended, acceptable, and meaningful to patients undergoing cancer treatment.17,18 PRO-CTCAE also demonstrates strong measurement properties, including construct validity and test-retest reliability. 19 The validity of the 7-day recall period, 20 the discriminative validity of the PRO-CTCAE response choices, 21 and the measurement equivalence of paper, web and interactive voice response modes of administration 22 have also been confirmed. Translation and cross-cultural adaptation of PRO-CTCAE is being conducted in a number of different languages. 23
It is well recognized that if PRO measures are to be used in different countries and cultures they must demonstrate cultural acceptability, as well as linguistic and conceptual equivalence to the source measure.24 -26 The aim of this study was to develop an Italian language version of PRO-CTCAE, and examine its content validity and test-retest reliability in a diverse sample of Italian-speaking patients undergoing cancer treatment.
Materials and methods
This multisite study recruited participants from 15 cancer treatment centers around Italy and used a mixed methods approach to develop and refine a culturally acceptable and well-comprehended Italian language version of PRO-CTCAE. The study was approved by the Ethics Committees of all participating centers; all participants provided written informed consent.
Translation
We followed internationally-established procedures for the translation and cross-cultural adaptation of PRO instruments.24 -26 Two independent forward-translations of PRO-CTCAE were prepared by Italian-speaking bilingual translators. As recommended by Beaton et al., 25 one translator was aware of the PRO-CTCAE item definitions, while the other was not provided any contextual details about the project, and had no medical background. The translators then produced a single reconciled Italian version. The reconciled PRO-CTCAE instrument then underwent dual independent back-translation into English by two bilingual translators who had no specific background in oncology, were not informed about the specific project aims, and had not seen the English PRO-CTCAE source materials. In preparing the final reconciled version to be advanced for linguistic validation, each PRO-CTCAE-Italian item, including the attributes (e.g. severity), response choices, and phrasing for ‘severity at its worst’ were examined to ensure that the PRO-CTCAE-Italian was equivalent to the PRO-CTCAE-English with respect to semantic (similar meaning), conceptual (similar relevance of the constructs), and pragmatic (similar tone or communicative effect) equivalence.
Cognitive debriefing interviews
Interviews were conducted to explore the comprehensibility, clarity and ease of response of PRO-CTCAE-Italian. Eligible study participants were 18 years of age or older; were receiving (or had received within the past six months) anticancer therapy with curative or palliative intent; and were able to speak, read and comprehend Italian as their primary language. Patients judged by their clinicians as unable to recall symptoms during the past seven days due to cognitive impairment were excluded.
Given the intended context of use across a range of disease sites and treatment types, a diverse sample with respect to cancer site, treatment type, age, educational attainment, and geographic regions in Italy (North, Center and South/Island) was recruited from 15 cancer treatment centers around Italy. To avoid potential biases due to regional literacy patterns, we established an accrual goal that at least 25% of the participants from each geographic area should have lower educational attainment (defined as primary school level or lower, i.e., five or fewer years of schooling).
The interview methodology followed the approach used in the linguistic validation of other PRO-CTCAE language versions.17,27-31 In the first round of interviews (n=47), males were debriefed on all 74 gender-relevant symptom terms (120 items), and females were debriefed on 76 gender-relevant symptom terms (122 items). After a protocol amendment, in Rounds 2 through 4, we employed eight shorter interview schedules to reduce the duration of the interviews and address participant burden. Each of the eight shorter interview schedules (four per gender) contained 29-32 symptom terms for debriefing. Interview schedules were deployed in a manner that ensured that ultimately all PRO-CTCAE items received approximately equivalent attention during the debriefing process.
Participants were first asked to independently complete a series of PRO-CTCAE items. Subsequently, they were debriefed by an Italian-speaking, master’s prepared interviewer with experience interviewing research participants in medical settings. A semi-structured interview guide (see Supplementary Table 1) was used to examine the comprehension, clarity and acceptability of the PRO-CTCAE symptom terms, the attributes (e.g. severity, interference) and the associated response options, as well as the clarity and ease of judgement for ‘severity at its worst’ and the 7-day recall period. 17 For each PRO-CTCAE element, four categories related to cognitive processing were considered: comprehension, memory retrieval, ease of judgment, and response mapping. 31 Cognitive interviews were audio-recorded, and extensive field notes were taken by interviewers. Items with > 10% respondent difficulties were considered for rephrasing and/or retesting in a subsequent interview round. Interviews continued until no additional comprehension difficulties were identified.
As recommended by Beaton et al., 25 following the translation and adaptation process, it is important to explore quantitatively the measurement properties of the newly adapted instrument. Test-retest reliability is a fundamental property of a measure upon which construct validity, responsiveness, and sensitivity to change are based. Reliability may be demonstrated through either internal consistency reliability and/or test–retest reliability methods. 32 Internal consistency reliability is only relevant when items are part of a unidimensional scale or factor. However, for a library of symptom items such as PRO-CTCAE, evidence of consistent scores over time in a population with stable symptoms supports a conclusion that the instrument is reliable. We therefore investigated the test-retest reliability of our cross-culturally adapted instrument in a second sample of adult participants (n=135) who were currently undergoing or had recently completed cancer treatment at one of six participating cancer centers, and were willing to complete a PRO-CTCAE survey on the day of study enrollment and again within 24-36 hours. To limit respondent burden, a subset of 26 symptomatic AEs (49 PRO-CTCAE items) was presented to each participant. To enhance comparability with previously published test-retest reliability coefficients, these were the same 26 symptomatic AEs that were previously evaluated by Dueck et al. 19 in their study to evaluate the measurement properties of PRO-CTCAE in English. Intraclass correlation coefficients (ICC), and their 95% confidence intervals, were calculated based on a two-way mixed-effect analysis of variance model with interaction for the absolute agreement between single scores. 33 ICC values were interpreted as follows: Values less than 0.50, between 0.50 and 0.75, between 0.75 and 0.90, and greater than 0.90 were interpreted as indicating poor, moderate, good, and excellent reliability, respectively.33-35
Results
Translation
Comparison of the two forward-translations revealed semantic equivalence (‘same words’) and conceptual (‘same meaning’) equivalence for 29 of the 78 PRO-CTCAE symptom terms. Translation of the symptom term “fatigue” caused some difficulty, as there is no Italian word that encompassed all of the meanings of the English term, and an explanatory elaboration of the Italian term chosen for fatigue (
Cognitive debriefing interviews
Participants (n=96) were recruited at 15 cancer centers throughout Italy to participate in one of four consecutive rounds of cognitive interviewing. The sample was diverse with respect to age, educational attainment, and geographic region (Table 1). 54% of the sample had completed eight or fewer years of education. Participants enrolled in Round 1 (n=47) completed a survey comprised of all the PRO-CTCAE items (122 for women and 120 for men), followed by cognitive debriefing on all of the relevant symptoms terms. The average time required to complete each PRO-CTCAE item was 14 seconds (SD ±6 seconds). Cognitive debriefing interviews in Round 1 lasted an average of 86 minutes (SD± 27 minutes). Six of 47 participants experienced the length of the survey as taxing, with two respondents unable to complete the survey, one unable to complete the interview, and three requiring a rest break during the study-related procedures.
Sample characteristics.
Note: *Data about disease site was not captured in cognitive interviewing participants.
In Rounds 2 through 4, participants completed a shorter survey comprised of a subset of 29-34 PRO-CTCAE symptom terms, followed by cognitive debriefing only on the items included in that survey. The average time to complete each PRO-CTCAE item in Rounds 2 through 4 was 22 seconds (SD± 8), 14 seconds (SD± 5), and 15 seconds (SD± 6), respectively. The average time for cognitive debriefing interviews in Rounds 2 through 4 was 55 minutes (SD ±18 minutes), 47 minutes (SD ±15 minutes), and 47 minutes (SD ±28 minutes), respectively. In Rounds 2 through 4, all participants were able to complete both the survey and the debriefing interview without becoming unduly fatigued.
As shown in Figure 1, approximately half of all PRO-CTCAE symptom terms posed comprehension difficulties for at least one respondent. Rounds 1 and 2 of cognitive interviews (see Table 2) revealed four PRO-CTCAE-Italian items presenting difficulties in more than 10% of participants: bedsores (9/49, 18.4%), general pain (11/62, 17.7%), flashing lights (7/49, 14.3%), and nausea (8/62, 12.9%). Respondents were uncertain if “Pain”, originally translated as

Waterfall plot of rates of comprehension difficulties with PRO-CTCAE-Italian symptom terms across four rounds of interviews (n=96) and the total number of participants debriefed on each item.
Comprehension difficulties across four rounds of cognitive interviews.
Test-retest reliability
The test-retest reliability substudy enrolled 137 unique participants at 11 cancer centers in Italy; 135 participants had complete data. Participants completed 46 PRO-CTCAE items on two occasions (median interval between two surveys was 1 day; range 1-3 days). Due to rapid accrual, the sample size exceeded the minimum required based on the a priori power analysis. Characteristics of the participants are summarized in Table 1. Median age was 63 years; 42% were female; 58.4% had less than a high school education. A variety of tumor sites were reflected in the sample, including lung (47.4%), genitourinary (29.9%), breast (10.2%), gastrointestinal (9.5%) and other (2.9%). In the 135 respondents who completed PRO-CTCAE on two consecutive business days, the test-retest reliability for the 49 prespecified items ranged from ICC 0.64 to 0.94 (median ICC 0.83) (Supplementary Table 2); 84% of the items examined demonstrated an ICC greater than or equal to 0.75 (Figure 2).

Test-retest reliability of 49 PRO-CTCAE-Italian Items: Intraclass Correlation Coefficient and 95% CI (n=135).
Participants in the test-retest sample were highly symptomatic, endorsing the presence of a median of 20 symptoms (IQR 15-28). 97% of the sample (131/135) reported the presence of five or more concurrent symptoms at the first visit. Of the 49 PRO-CTCAE items evaluated in this study, 48 (98%) were reported as present by at least 10% of respondents at both visits, with 37 of 49 (76%) items having at least 25% prevalence (Supplementary Table 3). The distribution of item scores for the set of 20 commonly occurring symptomatic AEs, as identified by Dueck et al., 19 is depicted in Figure 3.

Distribution of PRO-CTCAE Item Scores for Symptomatic Toxicities Prevalent in ⩾10% of Respondents at Visit 1 (n=135).
Discussion
We report the cross-cultural adaptation, linguistic validity and test-retest reliability of the PRO-CTCAE-Italian. The translation process emphasized the use of plain language phrasing to promote comprehension by diverse respondents, including those with lower educational attainment. Review of the back-translations demonstrated that PRO-CTCAE-Italian was semantically and conceptually consistent with the English source. Four rounds of cognitive interviews were conducted, resulting in adjusted phrasing in two of the PRO-CTCAE symptom terms to achieve satisfactory comprehension. Results of the cognitive testing confirm the comprehensibility, clarity and ease of judgement of the Italian PRO-CTCAE, and support its conceptual equivalence to the English source. PRO-CTCAE-Italian also exhibited excellent test-retest reliability as indicated by a median ICC of 0.83. 19 The absence of prominent floor and ceiling effects is supported by the fact that more than 95% of respondents reported the presence of 5 or more symptoms, and exhibited high scores (>3) for at least one PRO-CTCAE symptom (91/135, 67.4%) at first visit, and 84/135 (62.2%) at second visit. These findings also support a conclusion that reliability coefficients were not erroneously amplified due to low variability in a symptom-free sample. Our test-retest reliability parameters are comparable to those reported by Dueck et al. 19 for the English language version of PRO-CTCAE, and provide encouraging preliminary evidence that PRO-CTCAE-Italian yields reproducible scores.
Both the strengths of this study as well as some limitations should be considered in interpreting our findings. First, because our study was conducted in patients currently receiving cancer treatment, and because some interviews lasted more than an hour, participants had to have sufficient endurance to complete study-related procedures. Thus, our sample may have been biased toward inclusion of participants with better performance status. Although we did not formally collect data to evaluate this possibility, we do note that the majority of our respondents were experiencing disease- or treatment-related symptoms. Second, we did not capture cancer site in the participants in the cognitive debriefing component of the study. While the sample in the test-retest substudy was somewhat diverse with respect to disease site, we acknowledge that almost 50% of the test-retest sample were being treated for lung or head and neck cancers. At the same time, patients with lung and head/neck cancers often have prominent symptom burden,36,37 and the high level of symptom burden observed in our sample helps to mitigate the possibility that our reliability estimates were inflated in the absence of symptoms. 38 We also acknowledge that the small sample used to examine test-retest reliability was drawn from a single center, and as such future study to determine that our reliability estimates are replicable is warranted. Third, comprehension problems for two items, bedsores and flashing lights, persisted in a small proportion of participants; these could not be resolved because no suitable phrasing alternatives could be identified for these symptom terms. Encouragingly, we noted during cognitive interviews that both symptom terms were generally well understood by respondents who had direct experience with these symptoms. Lastly, all of our interviewers were female and there was some hesitancy and embarrassment observed on the part of male respondents during the part of the interview when they were asked to rephrase in their own words symptoms terms reflecting urinary or sexual dysfunction. While this improved in Rounds 3 and 4 with adjustments to the interview probes as recommended by Arthur et al., 39 we cannot exclude the possibility that gender mismatch between interviewers and study participants may have influenced the accuracy and quality of our data. At the same time, strengths of the study include compliance with rigorous procedures for translation and cross-cultural adaptation of PRO instruments. An additional strength was the large and diverse sample, which included approximately 20% of respondents with an extremely low level of educational attainment, a proportion that reflects the Italian general population where approximately 18% have completed only primary school education or less. 40 Lastly, inclusion of the evaluation of test-retest reliability provides supportive evidence that the comprehensibility of PRO-CTCAE-Italian is reproducible. Additional research examining the psychometric properties and responsiveness of PRO-CTCAE-Italian in a large sample of Italian speakers undergoing cancer treatment is planned.
Conclusions
We have developed and content validated an Italian language version of PRO-CTCAE for use in cancer clinical trials. These study results advance the ongoing efforts to make the PRO-CTCAE measurement system available to patients of different linguistic and cultural backgrounds. 23 Currently, PRO-CTCAE is available in more than 35 languages, with additional languages in development and testing. This availability of the PRO-CTCAE measurement system in a wide variety of languages and developed using rigorous procedures for translation and cultural adaptation will allow an expanded number of patients to provide their perspective on the tolerability of cancer treatment, and supports the multinational conduct of trials of new therapies for cancer.
Supplemental Material
sj-pdf-1-tmj-10.1177_03008916221099558 – Supplemental material for Cultural adaptation of the Italian version of the Patient-Reported Outcomes Common Terminology Criteria for Adverse Event (PRO-CTCAE®)
Supplemental material, sj-pdf-1-tmj-10.1177_03008916221099558 for Cultural adaptation of the Italian version of the Patient-Reported Outcomes Common Terminology Criteria for Adverse Event (PRO-CTCAE®) by Caterina Caminiti, Jane Bryce, Silvia Riva, Diane Ng, Francesca Diodati, Elisa Iezzi, Lucia Sparavigna, Silvia Novello, Camillo Porta, Lucia Del Mastro, Giuseppe Procopio, Saverio Cinieri, Amalia Falzetta, Fabio Calabrò, Vito Lorusso, Alessio Aligi Cogoni, Giampaolo Tortora, Marco Maruzzo, Rodolfo Passalacqua, Francesco Cognetti, Vincenzo Adamo, Enrica Capelletto, Alessandra Ferrari, Michela Bagnalasta, Maurizio Bassi, Annalisa Nicelli, Davide De Persis, Alessia D’Acunti, Elisabetta Iannelli (patient), Francesco Perrone and Sandra A. Mitchell in Tumori Journal
Footnotes
Acknowledgements
We gratefully acknowledge the four Italian translators, Enrica Bondavalli, Anne Prudence Collins, Claudia Di Loreto, and Stephanie Johnson. We also thank Luca Gattinoni, MD, Experimental Transplantation and Immunology Branch, National Cancer Institute, for his expert review of the PRO-CTCAE-Italian instrument prior to the cognitive interviews.
Author’s contributions
The authors confirm contributions to the paper as follows:
All authors reviewed and approved the final version of the manuscript.
Availability of data and material
Declaration of conflicting interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article:
SN received payments or honoraria from Eli Lilly, Sanofi, Amgen, Beigene, Roche, Takeda, Pfizer, AstraZeneca, Boeringher Ingelheim, MSD, Novartis.
CP received payments or honoraria from Eisai, MSD, BMS, Astra Zeneca, Pfizer, Novartis, Merck, Janssen, Ipsen, EUSA Pharma, Angelini, General Electric.
LDM received payments or honoraria from Roche, Novartis, Eli Lilly, MSD, Pfizer, Ipsen, Celgene, Genomic Helath, Pierre Fabre, Daiichi Sankyo, Seagen, AstraZeneca, Eisai.
GP received payments or honoraria BMS, MSD, Bayer, Astra Zeneca, Merck, Ipsen, Janssen, Novartis, Pfizer, Sanofi.
SC received payments or honoraria from Eli Lilly.
FCal received payments or honoraria from MSD, BMS, Pfizer, AstraZeneca, Ipsen.
GT received payments or honoraria from BMS, MSD, Servier, Astra Zeneca, Merck.
RP received payments or honoraria from Regeneron, Ipsen, Sanofi-Aventis, Amgen, MSD.
FCog received payments or honoraria from Genomic Helath, Biolitec Pharma, Pfizer, Abbott, Celgene, Glaxo, Roche, Bayer, Novartis, Amgen, Pfizer, AstraZeneca, Eisai, Merck-Serono, Boeringher Ingelheim, MSD, BMS, Takeda, Astella, Eli Lilly, Seagen.
VA received payments or honoraria from Amgen, Astra Zeneca, BMS, Eli Lilly, MSD, Novartis, Pfizer, Roche, Sandoz, Sanofi, Servier, Takeda.
DDP received grants to his institution from Fondazione Smith Kline.
EIan received grants to her institution from Fondazione Smith Kline.
FP received payments or honoraria from Incyte, GSK, Eli Lilly, Ipsen, Astellas, Astra Zeneca, Roche, BMS, Bayer, Clovis, Pierre Fabre; FP also received grants to his institution from Roche, Astra Zeneca, Pfizer, MSD, Bayer, Incyte Taiho, Janssen, Exelixis, Aileron, Daiichi Sankyo.
CC, JB, SR, DN, FD, EIez, LS, AFal, VL, AAC, MM, EC, AFer, MBag, MBas, AN, ADA, SAM did not declare competing interests.
Ethics approval and consent to participate
Study oversight and ethics approval was provided by the Independent Ethics Committee of the National Cancer Institute of Naples and by Ethics Committee at each Italian participating centre. All study participants provided written consent to participate in this study.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: this research was supported in part by an unrestricted grant of the Fondazione Smith-Kline.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
