Sage Journals: Discover world-class research

Abstract

Despite the expanding therapeutic options available to cancer patients, therapeutic resistance, disease recurrence, and metastasis persist as hallmark challenges in the treatment of cancer. The rise to prominence of generative artificial intelligence (GenAI) in many realms of human activities is compelling the consideration of its capabilities as a potential lever to advance the development of effective cancer treatments. This article presents a hypothetical case study on the application of generative pre-trained transformers (GPTs) to the treatment of metastatic prostate cancer (mPC). The case explores the design of GPT-supported adaptive intermittent therapy for mPC. Testosterone and prostate-specific antigen (PSA) are assumed to be repeatedly monitored while treatment may involve a combination of androgen deprivation therapy (ADT), androgen receptor-signalling inhibitors (ARSI), chemotherapy, and radiotherapy. The analysis covers various questions relevant to the configuration, training, and inferencing of GPTs for the case of mPC treatment with a particular attention to risk mitigation regarding the hallucination problem and its implications to clinical integration of GenAI technologies. The case study provides elements of an actionable pathway to the realization of GenAI-assisted adaptive treatment of metastatic prostate cancer. As such, the study is expected to help facilitate the design of clinical trials of GenAI-supported cancer treatments.

Keywords

Generative artificial intelligence cancer treatment prostate cancer adaptive therapy intermittent androgen deprivation therapy deep learning

Introduction

Sustained world-wide efforts in basic and translational cancer research are expanding the repertoire of treatment options available to cancer patients. Notwithstanding the great strides this expanding cancer armamentarium is enabling towards improving patient outcomes, the emergence of therapeutic resistance is often inevitable for most cancer patients¹ and remains the ultimate barrier to achieving cancer cure.² This persisting reality of cancer treatments is perpetuated by the inability of standard of care (SOC) to effectively and fully consider the adaptive complexity of cancer³ and its underlying eco-evolutionary,^4-6 genetic,^7-11 and immunological dimensions.^12-16 Indeed, the effective control of cancer as a time-varying nonlinear dynamical system cannot be achieved with predefined fixed schedules of therapeutic interventions.^17,18 Combination therapies have long been accepted as appropriate strategies to optimize treatment outcome, delay the onset of resistance, and reduce the risk of minimal residual disease.^19,20 On the other hand, adaptive therapy has been championed as an intuitively sound approach to address the evolutionary dynamics underlying resistance.¹⁷,^21-25 Adaptive combination therapy would further expand the horizon of possible improvements in treatment outcomes.^26,27 However, despite the tens of thousands of clinical trials for combination therapy that are currently registered in ClinicalTrials.org, it is not clear how to safely and effectively combine multiple drugs. Furthermore, significant advances are needed to address the feasibility, accuracy, and reliability of continuous disease monitoring and treatment response predictions to support adaptive therapy. Challenges to the development of more effective cancer treatments may however be more amenable to clinically viable resolutions by unlocking the potential of the growing big multimodal data being collected about cancer and the meteoric rise of data-driven generative artificial intelligence (GenAI).²⁸ Indeed, GenAI potential utility in oncology is garnishing increasing attention.^29,30 Notable explorations of GenAI applications to cancer care are many, including the use of large language models (LLM) such as ChatGPT as an assistant that can be queried about cancer by patients and health care practitioners,^31-36 extraction of clinical information from medical reports,^37-39 and clinical decision support for diagnosis and treatment recommendations.^40-44 Many of these studies explore the performance of pre-trained LLMs, such as ChatGPT, and often compare it against benchmarks and human medical expertise.⁴²,^45-47 One particularly promising oncological application of GenAI is radiotherapy (RT) treatment planning, where dedicated LLM-enhanced algorithms have been developed to automatically delineate tumour volume targeted by radiotherapy.^44,48 Given the pathway that has already been chartered towards maturity and clinical adoption of ML/AI (machine learning/artificial intelligence) assisted diagnosis of cancer,⁴⁹ GenAI-augmented cancer diagnosis algorithms are expected to attract heightened research interests.^50-52 On the other hand, assessment of LLMs used for treatment recommendations revealed that they are still no match for human medical experts.⁵³ Most of these explorations were undertaken in the context of standard of care, where traditional fixed schedules of treatments are used. For the case of adaptive cancer therapy, OncoGPT, a transformer-based model, was proposed for the prediction of treatment response.²⁸ The training of OncoGPT is assumed to place patients on a phenotypic space, in accord with their phenotypic similarity. This provides a basis for the plausibility of one-step-ahead predictions of treatment response, by generalizing from the treatment response trajectories of patients who underwent similar treatments.²⁸ Notwithstanding GenAI potential to revolutionize cancer care,^30,52,54 the use of LLMs faces challenges that are typical to ML/AI systems, including the lack of explainability, opacity, ethical concerns, and the need for large training datasets.^55-57 In addition, LLMs have an intrinsic risk for hallucinations,⁵⁸ which in the context of oncology means that they may yield incorrect or clinically implausible outputs such as non-sensical treatment recommendations.

To highlight some concrete aspects of the application of generative artificial intelligence to advance the ongoing progress in adaptive therapy for prostate cancer,^22,24,26,27,^59-62 metastatic castrate-sensitive prostate cancer (mCSPC) is selected as a study case for the use of OncoGPT.²⁸ The proposed analysis spans GenAI model training and inferencing as well as mitigation strategies for the hallucination problem and its implications to clinical validation and integration. The analysis is prefaced with a short overview section on adaptive mCSPC therapy. This serves as a baseline for the proposed GenAI-assisted adaptive therapy of mCSPC, detailed in the subsequent sections. Insights are shared in the discussion section about the potential, challenges, and limitations of the proposed GenAI-assisted approach to the treatment of mCSPC.

Adaptive Therapy of Metastatic Castrate-Sensitive Prostate Cancer

Depending on factors such as disease volume and side effects, mCSPC may be treated with androgen deprivation therapy (ADT), combination of ADT and androgen receptor-signalling inhibitors (ARSI), or triplet therapy that includes ADT, ARSI, and docetaxel or radiotherapy.^63-66 Each one of these treatment modalities may be appropriate to select for a specific class of mCSPC patients based on risk stratification. Ultimately, treatment response will, for all patients, be time-varying and nonlinear, accompanied with an often-inevitable emergence of drug resistance due to the adaptive complexity of cancer and its underlying eco-evolutionary, genetic, and immunological dimensions.^3-16 Adaptive therapy has been argued to be the most sensible approach an oncologist would take to stay one step ahead of cancer’s adaptation and resistance by changing the timing and treatment doses based on continuously monitored biomarkers.^17,22,24,67 Adaptive therapy was explored for metastatic prostate cancer, where treatment response is assumed to be determined by the competition between 3 types of cancer cells: (1) androgen receptor positive (AR+), CYP17+ testosterone-producing cells (TP), (2) AR+, CYP17− cells that require androgens (T+), and (3) AR−, CYP17− androgen independent cells (T-).^24,25,68 Based on this model, monitored prostate-specific antigen (PSA) and testosterone serum levels are used to guide the timing of intermittent ADT therapy in an adaptive androgen deprivation trial for mCSPC.²⁴ Depending on monitored testosterone and PSA levels, luteinizing hormone–releasing hormone (LHRH) antagonist, new hormonal agent (NHA) such as abiraterone, enzalutamide, or apalutamide, or LHRH + NHA were used to target a 50% PSA reduction, after which all treatments are intermittently stopped before they are resumed on PSA or radiographic progression.²⁴ It is presumed that the monitoring of PSA and testosterone provides sufficient information about the fractions of TP, T−, and T+ cells in the tumour to support the evolutionary-based adaptive treatment strategy described above. This model is aligned with the notion that the cell is the correct level of abstraction to consider in understanding function,^69,70 and it effectively integrates relevant genetic information (eg, expressions of AR and CYP17 and their relevant signalling pathways) into observable phenotypes, in a way akin to coarse-graining abstraction of knowledge.⁷¹ The effectiveness of this adaptive treatment approach may be further enhanced by the use of GenAI models trained to learn patient treatment-response maps that would support treatment adaption through the provision of treatment response predictions.

GenAI-Supported Adaptive Androgen Deprivation of mCSPC

The application of GenAI to the prediction of treatment response was explored in the context of a theoretical framework for adaptive cancer therapy.²⁸ The framework views cancer treatment as a problem of controlling cancer as a time-varying nonlinear system whose states are observable through repeated monitoring of treatment response biomarkers covering the genetic, immunological, and eco-evolutionary causal dimensions underlying cancer dynamics. To leverage the learning capabilities of transformers⁷² in adaptive therapy, treatments are represented by a sequence of therapeutic actions (controls) $u_{0}, u_{1}, \dots, u_{i}$ , where $u_{i} \in U$ is the control applied at the discrete instant of time identified by the non-negative integer $i$ . $U$ is a finite set of therapeutic actions defined through the consideration of available drugs, and the quantization of drug doses and time intervals between consecutive therapeutic actions. Likewise, treatment response is also framed to take discrete states $s_{0}, s_{1}, \dots, s_{i}$ , where $s_{i}$ is the treatment response to the control $u_{i}$ and $s_{i} \in S$ , with $S$ being a defined finite set of treatment responses, obtained through the quantization of biomarker signals of treatment response.

Considering the critical importance of clinical feasibility of future GenAI applications to cancer treatment, the Phase 1b Adaptive ADT trial²⁴ is taken as a baseline for the setup of this case study. In particular, the objective of the case study is to demonstrate, at least theoretically, how OncoGPT²⁸ can be used to support adaptive ADT for the treatment of mCSPC. Although there are no limits on the number and type of biomarkers that can be used to define treatment response, for the purpose of anchoring this case study with respect to the mentioned clinical trial, PSA and testosterone are the only response biomarkers required to be monitored in addition to RECIST (response evaluation criteria in solid tumours)⁷³ assessments. Likewise, LHRH agonist/antagonist drugs and NHA are assumed to be the main types of drugs under consideration. To define the set of possible treatment responses (disease states), serum PSA is quantized into 30 possible discrete levels. The PSA range from 0 to 4 ng/mL is quantized into 20 distinct levels each representing a 0.2 ng/mL wide interval. For example, level 1 represents the range (0-0.2 ng/mL) while level 2 represents the range (0.2-0.4 ng/mL) and so on. The range (4-10 ng/mL) is divided into 5 equally wide intervals representing levels 21 to 25, respectively, while the PSA range (10-110 ng/mL) is divided into 4 equally wide intervals representing levels 26 to 29, respectively. PSA values that are higher than 110 ng/mL are represented by level 30. This quantization scheme attempts to achieve a monitoring resolution that recognizes PSA ranges that are clinically relevant for PSA screening.⁷⁴ Testosterone may be quantized into 20 discrete levels representing the intervals between the following possible testosterone values in nmol/L: 0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.7, 0.9, 1.1, 1.3, 1.5, 1.7, 2, 5, 10, 15, 20, 25, 30, and 35 nmol/L. Here too, the quantization scheme is intended to provide a monitoring resolution that reflects clinical practice regarding salient ranges of testosterone levels for patients undergoing ADT.⁷⁵ Considering the 4 possible RECIST disease states leads to 2400 (30 × 20 × 4) possible treatment responses. Advances in the analysis of tumour circulating DNA (ctDNA) using liquid biopsy (LB) are expected to enable the use of additional biomarkers of treatment response,^76,77 further enhancing the resolution of treatment response monitoring. For instance, abundance of ctDNA, AR amplification, and alterations of PTEN, RB1, P53, and DNA damage repair (DDR) genes, including BRCA1/2, ATM, and CDK12, may be monitored using LB to serve as biomarkers of mPC treatment response.^77-81 In addition, cytokines such as interleukin IL-6, IL-5, IL-8, IL-10, IL-23, tumour necrosis factor (TNF)-α, and interferon (INF)-γ may also be monitored using LB⁸² as biomarkers of the immune dimension of treatment response. Given the critical involvement of AR amplification in mCRPC,⁸³ the prognostic nature of ctDNA detection,⁸⁴ and the role of IL-8 in promoting the infiltration of myeloid-derived suppressor cells (MDSCs),⁸⁵ it is also possible to include the abundance of ctDNA, the amplification of AR, and IL-8 level as additional LB monitored biomarkers of mCSPC treatment response. Assuming 3 quantization levels that could be assigned to each one of these potential biomarkers (ie, Low, Medium, and High), this would bring the size of the set of disease states to 21 600.

The second step in defining the GenAI model is the discretization of therapeutic control. Relugolix⁸⁶ and abiraterone with prednisone⁸⁷ will be the example for LHRH and NHA drugs under consideration, respectively. Given the Food and Drug Administration (FDA)-recommended dosage of relugolix,⁸⁸ the dosage of this LHRH antagonist will be discretized into 7 different levels separated by an interval of 60 mg, where doses of 0 and 360 mg are mapped to levels 1 and 7, respectively. Likewise, given FDA abiraterone dosage recommendations,⁸⁹ the dosage of this NHA drug is discretized into 5 levels separated by an interval of 250 mg and were 0 and 1000 mg corresponding to levels 1 and 5, respectively. The dosage of prednisone will be tied to that of abiraterone, where 0, 1.25, 2.5, 3.75, and 5 mg would be administered for dosage levels 1, 2, 3, 4, and 5 of abiraterone, respectively. As a result of the above discretization of LHRH antagonist and NHA drugs, the total number of possible controls is 35 (7 × 5). Notice that no-treatment is represented by a zero value of drug doses.

Although therapeutic control is assumed to be applied daily through the administration of ADT + NHA drugs, the monitoring of PSA and testosterone and the assessment of imaging progression may be done once every few weeks. Blood tests, MRI (magnetic resonance imaging), and CT (computed tomography) routinely used in clinical setting would be the main methods used for biomarker monitoring. However, advances in LB and disease monitoring instrumentation, including portable devices, are expected to enable a more frequent sampling of treatment response. Meanwhile, the discrepancy between the frequency of drug administration and that of disease monitoring can be bridged through the use of estimated treatment responses regularly calibrated with the monitored treatment response as an injection of ground truth as illustrated in Figure 1.

Figure 1.

GenAI-supported adaptive androgen deprivation therapy for metastatic sensitive prostate cancer.

The ground truth injection consists in using the recently monitored treatment response, ie, $s_{i - 1}$ to adjust the estimated treatment response sequence ${\hat{s}}_{i - N}, \dots {\hat{s}}_{i - 2} {\hat{s}}_{i - 1}$ prior to providing it as an input to OncoGPT. $N > 0$ is the length of the treatment response history under consideration. One simple approach to implement this ground truth injection is to augment the elements of the estimated response sequence by $\vec{v} ≜ s_{i - 1} - {\hat{s}}_{i - 1}$ . Note that $\vec{v}$ is a vector whose components are equal to the differences between the corresponding components of treatment response and its estimate. The adjustment moves the sequence of estimated states in the direction of $\vec{v}$ towards the ground truth (ie, the last monitored treatment response) with a displacement equal to the magnitude of $\vec{v}$ . The magnitude of the adjustment may also be modulated using a forgetting function to attenuate the impact of new monitored treatment responses on past, temporally distant estimations of treatment response. The adaptive therapeutic strategy of the proposed GenAI-assisted treatment of mPC may be selected from the many possible adaptive control approaches proposed in the litterature,¹⁸ based on their robustness and the feasibility of their clinical integration. Adaptive PID (proportional-integral-derivative) control embody time-proven, widely used real-world control strategy^90-93 and would hence be an adequate fit for the real-world clinical context. Planning the trajectory of desired treatment response used as the daily setpoint ${\bar{s}}_{i}$ for an adaptive controller would necessarily be informed by expert clinical knowledge about the dynamics of treatment response. Given a desired treatment duration of $M$ days, an initial disease state $s_{0}$ , and a target final disease state ${\bar{s}}_{M}$ , the desired treatment response ${\bar{s}}_{i + 1}$ at time $i + 1$ may be set to the disease state closest, in Euclidean distance, to ${\hat{s}}_{i} + \frac{1}{M - i} ({\bar{s}}_{M} - {\hat{s}}_{i})$ . In other words, since ${\bar{s}}_{M}$ and ${\hat{s}}_{i}$ are vectors, the next desired treatment response is set to be one step worth $\frac{1}{M - i} ({\bar{s}}_{M} - {\hat{s}}_{i})$ closer towards the final desired response. The planning of the desired treatment response trajectory may be improved through the consideration of treatment response dynamics and planning strategies available from other application domains.⁹⁴

OncoGPT Training on mCSPC Treatment Response Data

OncoGPT uses the original encoder-decoder transformer architecture,⁷² with the sequences of controls (treatments) and disease states (treatment responses) as inputs to the encoder and decoder, respectively. The encoder and decoder are constructed using deep neural networks, ie, neural networks with large number of hidden layers.⁹⁵ Both controls and disease states as well as their respective positions within their respective sequences are represented by embeddings, which are vectors of values. Embeddings are ultimately multiplied by weights to be learned through training using a dataset consisting of pairs $d ≜ ({u_{i}}_{i = 0}^{N - 1}, {s_{i}}_{i = 0}^{N - 1})$ of control sequences and corresponding sequences of treatment response. The sequence length N is the duration of one patient-treatment cycle, which would be chosen to account for the longitudinal causal dependencies between treatments and responses. N may be set to 256 days representing a 9-month-long patient-treatment cycle as a clinically plausible cycle duration. The training consists in adjusting the weights so as to minimize a so-called loss function which represent a measure of how far is the output of the transformer from the desired treatment response expected for a treatment provided as input. The weights are adjusted by minimizing the loss function using optimizers such as gradient descent,⁹⁶ and the backpropagation algorithm,⁹⁷ which backpropagates the error between the actual and desired output to the hidden layers of the network to adjust their weights. These weights represent the actual learning achieved by the transformer.

Response to mCSPC treatment is defined based on the 3 required biomarkers, namely, PSA, testosterone, and RECIST, augmented with IL-8 level, AR amplification, and ctDNA abundance as additional biomarkers. The discretization of these biomarkers leads to 21 600 possible disease states. On the other hand, therapeutic control based on the combination of LHRH antagonist (relugolix) and NHA (abiraterone) drugs is discretized into 35 possible controls, which include one no-treatment control. The numbers of possible controls and disease states for the specific application of OncoGPT to prostate cancer is far below the size of the vocabularies supported by LLMs such as CroissantLLM,⁹⁸ suggesting that the required size of the training dataset would be relatively modest. Given the inherent privacy challenges to the collection and curation of clinical data, the study case will explore the implementation and deployment of OncoGPT in the confine of a single health care organization. For example, assuming the average number of prostate cancer patients treated yearly in a major cancer centre to be 15 000, one may estimate that the records for at least 150 000 treatment cycles accumulated over a decade span would be available to train/retrain OncoGPT at any point in time. Treatment data size that could be curated to train OncoGPT for prostate cancer may reach millions of records through data sharing agreements under the umbrellas of consortiums such as the Prostate Cancer Clinical Trial Consortium (PCCTC). In addition, synthetic training data may be generated using clinically parameterized mathematical models of tumour dynamics such as the Lotka-Voltera model.¹⁸ However, an even more promising approach to the curation of training data would be the use of patient-derived xenografts (PDXs), which are known to yield an accurate replication of treatment response.⁹⁹

Given the different types of ADT drugs that have been used in prostate cancer treatment (eg, degarelix, abarelix, relugolix, leuprorelin/leuprolide, goserelin, and triptorelin), the curation of treatment data would involve discretizing the dosage of these drugs into levels as illustrated earlier for the examples of relugolix and abiraterone. Furthermore, as radiotherapy and chemotherapy (eg, docetaxel) may also be used in addition to ADT and NHA,⁶³ their dosage need to be also discretized into appropriate levels as part of the curation of treatment data. On the other hand, data for treatment response biomarkers are expected to be sparse and incomplete in treatment records being collected over long periods of time and across multiple institutions. In such cases, other recorded clinical and genetic treatment response variables may be used to estimate missing biomarker data towards the curation of a reasonably complete dataset for the training of OncoGPT.

Discussion

Predictions of treatment response based on curated treatment datasets are predicated on the assumption that these data embody the phenotypic diversity of the patient population. Furthermore, the tuning of OncoGPT into instances personalized to individual patients or groups of patients will require the curation of additional datasets covering patients that share some degree of phenotypic similarity which may be defined based on genomic, immunological, and eco-evolutionary biomarkers.²⁸ The collection and curation of quality treatment datasets in sufficient quantity to meet the needs of OncoGPT training face numerous challenges related to privacy, consent, and data ownership. These may likely be overcome within the respective confines of clinical and research institutions where compliance with protocols and quality control standards of data collection, curation, and use can be maintained and verifiably assured. Like any other deep-learning-based AI model, OncoGPT has limitations regarding transparency, bias, generalizability across health care communities, and performance drift.¹⁰⁰ In this respect, assuming the existence of data sharing frameworks among multiple cancer centres that enable the access to quality data in sufficient quantity to train OncoGPT, its deployment should be contingent on the local curation of training data to validate and regularly tune it for the target population.¹⁰⁰ Beyond these data-related challenges lies the fundamental question of how to mitigate hallucination, which is an unwelcome and inevitable feature of LLMs.¹⁰¹ Among the many techniques of hallucination mitigation that have been proposed,¹⁰² fine-tuning OncoGPT using a loss function specific to treatment response predictions and a high-quality treatment dataset may be most appropriate as this would be task-specific. However, irrespective of the mitigation method being used, it is critical to have an estimation of the effect of hallucination on the accuracy and reliability of treatment response predictions. Another equally important question is how potential metrics of accuracy and reliability would be used in the clinical validation of GenAI-supported cancer therapy. The accuracy of disease state predictions may be defined using the Euclidian distance between observed and predicted disease states given by $M S E = \frac{1}{N} \sum_{i = 1}^{N} ∥ s_{i} - {\hat{s}}_{i} ∥$ for a disease state trajectory ${s_{i}}_{i = 1}^{N}$ corresponding to one treatment cycle, and where $∥ ∥$ is the Euclidean norm. Disease state trajectories reflect the dynamics of disease progression being controlled through therapy, supporting hence the clinical pertinence of this accuracy metric. In particular, E would quantify the extent to which OncoGPT have learned the dynamics of treatment response for the phenotypic classes of mCSPC patients on which it was trained. Phenotypic classification of training datasets may be carried out based on the status of genetic alterations affecting the PI3K and androgen receptor pathways, and DNA damage repair, given their clinical relevance to the treatment of mPC patients.⁷⁷ As treatment response predictions are generated within the context of an adaptive closed-loop involving the patient, assessing the predictive performance of OncoGPT would be normally carried out as part of the analysis of clinical trial results. The prediction accuracy of OncoGPT may be defined as $A = \frac{1}{M} \sum_{i} A_{i}$ , $A_{i} = \frac{1}{K} \sum_{j} M S E_{i j}$ , where $M$ is the number of patients in the clinical trial, $K$ is the number of treatment cycles per patient while $M S E_{i j}$ is the prediction error for the jth treatment cycle of the ith patient. The reliability of treatment response prediction would also need to be assessed as an essential step towards clinical validation. Given the definition of reliability for measurements,¹⁰³ the reliability of treatment response predictions may be defined as the fraction of variance of disease state predictions that can be ascribed to the true variance of treatment response, namely: $R = \frac{σ^{(s)}}{σ^{(\hat{s})}}$ , where $σ^{(s)}$ , $σ^{(\hat{s})}$ are the variances of monitored and predicted treatment responses across all patients, respectively. As disease states are multi-dimensional variables, the variances $σ^{(s)}$ and $σ^{(\hat{s})}$ may be computed as the means $\frac{1}{L} \sum_{i} σ_{i}^{(s)}$ and $\frac{1}{L} \sum_{i} {σ_{i}}^{(\hat{s})}$ of variances for the $L$ components of monitored and predicted treatment responses, respectively. Accuracy and reliability metrics as defined above would be instrumental in improving trust in the performance of OncoGPT’s support for adaptive therapy to improve treatment outcomes. On the other hand, the guidance issued by the FDA on the clinical evaluation of software as a medical device (SaMD)¹⁰⁴ provides an appropriate 3-pronged framework (ie, valid clinical association, analytical validation, clinical validation) for undertaking the clinical validation of GenAI-supported adaptive therapy systems. In particular, the metrics of accuracy and reliability defined above would be used to establish analytical validity, ie, that the predicted treatment response is what would be technically expected. Ultimately, clinical trials would be needed to establish the validity of clinical association between predicted treatment response and disease state as well as to clinically validate the performance of GenAI-assisted adaptive therapy with respect to clinical end-points such as time to progression, disease free survival, and overall survival.

Conclusions

GenAI-assisted adaptive androgen deprivation therapy is explored for the treatment of metastatic prostate cancer. OncoGPT is integrated in the treatment loop to predict treatment response. Both treatment and treatment response are discretized to yield finite sets of disease states and therapeutic controls to serve as the vocabularies underlying the sequences of controls and disease states used as inputs and outputs of OncoGPT, respectively. Disease states and therapeutic controls are defined based on biomarkers and drugs typically used in the treatment of mPC, respectively. The study case addresses the various steps and issues related formulation, training, integration, and assessment of OncoGPT-tailored application to adaptive androgen deprivation therapy for mPC. These include training data curation, model fine-tuning, and the exploration of treatment response accuracy and reliability as critical instrument in mitigating the risk of hallucination and improving trust in the use of GenAI to assist in improving cancer treatment outcomes.

Footnotes

Funding:

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work and its publication were supported by Toronto Metropolitan University.

Declaration of conflicting interests:

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Author Contributions

The author is the sole contributor to the article.

Data Availability Statement

Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.

ORCID iD

Youcef Derbal

References

Wang

Zhang

Chen

Drug resistance and combating drug resistance in cancer. Cancer Drug Resist. 2019;2:141-160. doi:10.20517/cdr.2019.10

Vasan

Baselga

Hyman

DM.

A view on drug resistance in cancer. Nature. 2019;575:299-309. doi:10.1038/s41586-019-1730-1

Derbal

The adaptive complexity of cancer. Biomed Res Int. 2018;2018:5837235. doi:10.1155/2018/5837235

Nowell

PC.

The clonal evolution of tumor cell populations. Science. 1976;194:23-28. doi:10.1126/science.959840

Merlo

Pepper

Reid

Maley

CC.

Cancer as an evolutionary and ecological process. Nat Rev Cancer. 2006;6:924-935. doi:10.1038/nrc2013

Greaves

Evolutionary determinants of cancer. Cancer Discov. 2015;5:806-820. doi:10.1158/2159-8290.CD-15-0439

Boveri

Concerning the origin of malignant tumours by Theodor Boveri. Translated and Annotated by Henry Harris. J Cell Sci. 2008;121:1-84. doi:10.1242/jcs.025742

Vogt

PK.

Cancer genes. West J Med. 1993;158:273-278.

Hanahan

Weinberg

RA.

The hallmarks of cancer. Cell. 2000;100:57-70.

10.

Hanahan

Weinberg

RA.

Hallmarks of cancer: the next generation. Cell. 2011;144:646-674. doi:10.1016/j.cell.2011.02.013 S0092-8674(11)00127-9 [pii]

11.

Hanahan

Hallmarks of cancer: new dimensions. Cancer Discov. 2022;12:31-46. doi:10.1158/2159-8290.Cd-21-1059

12.

Balkwill

Mantovani

Inflammation and cancer: back to Virchow?

Lancet. 2001;357:539-545. doi:10.1016/s0140-6736(00)04046-0

13.

Dunn

Bruce

Ikeda

Old

Schreiber

RD.

Cancer immunoediting: from immunosurveillance to tumor escape. Nat Immunol. 2002;3:991-998. doi:10.1038/ni1102-991

14.

Schreiber

Old

Smyth

MJ.

Cancer immunoediting: integrating immunity’s roles in cancer suppression and promotion. Science. 2011;331:1565-1570. doi:10.1126/science.1203486

15.

Chen

Mellman

Elements of cancer immunity and the cancer-immune set point. Nature. 2017;541:321-330. doi:10.1038/nature21349

16.

Gonzalez

Hagerling

Werb

Roles of the immune system in cancer: from tumor initiation to metastatic progression. Genes Dev. 2018;32:1267-1284. doi:10.1101/gad.314617.118

17.

Gatenby

Silva

Gillies

Frieden

BR.

Adaptive therapy. Cancer Res. 2009;69:4894-4903. doi:10.1158/0008-5472.CAN-08-3658

18.

Derbal

Adaptive control of tumor growth. Cancer Cont. 2024;31:10732748241230869. doi:10.1177/10732748241230869

19.

Boshuizen

Peeper

DS.

Rational cancer treatment combinations: an urgent clinical need. Mol Cell. 2020;78:1002-1018. doi:10.1016/j.molcel.2020.05.031

20.

Bayat Mokhtari

Homayouni

Baluch

, et al. Combination therapy in combating cancer. Oncotarget. 2017;8:38022-38043. doi:10.18632/oncotarget.16723

21.

West

Newton

PK.

Capitalizing on competition: an evolutionary model of competitive release in metastatic castration resistant prostate cancer treatment. J Theor Biol. 2018;455:249-260. doi:10.1016/j.jtbi.2018.07.028

22.

Zhang

Cunningham

Brown

Gatenby

RA.

Integrating evolutionary dynamics into treatment of metastatic castrate-resistant prostate cancer. Nat Commun. 2017;8:1816. doi:10.1038/s41467-017-01968-5

23.

Gatenby

Brown

JS.

Integrating evolutionary dynamics into cancer therapy. Nat Rev Clin Oncol. 2020;17:675-686. doi:10.1038/s41571-020-0411-1

24.

Zhang

Gallaher

Cunningham

, et al. A phase 1b adaptive androgen deprivation therapy trial in metastatic castration sensitive prostate cancer. Cancers. 2022;14(21):5225. doi:10.3390/cancers14215225

25.

Zhang

Cunningham

Brown

Gatenby

Evolution-based mathematical models significantly prolong response to abiraterone in metastatic castrate-resistant prostate cancer and identify strategies to further improve outcomes. eLife. 2022;11:e76284. doi:10.7554/eLife.76284

26.

West

You

Zhang

, et al. Towards multidrug adaptive therapy. Cancer Res. 2020;80:1578-1589. doi:10.1158/0008-5472.CAN-19-2669

27.

West

Dinh

Brown

Zhang

Anderson

Gatenby

RA.

Multidrug cancer therapy in metastatic castrate-resistant prostate cancer: an evolution-based strategy. Clin Cancer Res. 2019;25:4413-4421. doi:10.1158/1078-0432.Ccr-19-0006

28.

Derbal

Adaptive cancer therapy in the age of generative artificial intelligence. Cancer Control. 2024;31:10732748241264704. doi:10.1177/10732748241264704

29.

Liu

Wang

Liu

Utility of ChatGPT in clinical practice. J Med Inter Res. 2023;25:e48568.

30.

Iannantuono

Bracken-Clarke

Floudas

Roselli

Gulley

Karzai

Applications of large language models in cancer care: current evidence and future perspectives. Front Oncol. 2023;13:1268915. doi:10.3389/fonc.2023.1268915

31.

Park

Lipsitz

Dominici

, et al. Generative artificial intelligence as a source of breast cancer information for patients: proceed with caution [published online ahead of print August 30, 2024]. Cancer. 2024. doi:10.1002/cncr.35521

32.

Kim

, et al. Communicative competence of generative artificial intelligence in responding to patient queries about colorectal cancer surgery. Int J Colorectal Dis. 2024;39:94. doi:10.1007/s00384-024-04670-3

33.

Johnson

King

Warner

Aneja

Kann

Bylund

CL.

Using ChatGPT to evaluate cancer myths and misconceptions: artificial intelligence and cancer information. JNCI Cancer Spectr. 2023;7:pkad015.

34.

Rahsepar

Tavakoli

Kim

GHJ

Hassani

Abtin

Bedayat

How AI responds to common lung cancer questions: ChatGPT versus Google Bard. Radiology. 2023;307:e230922.

35.

Deng

Wang

Zhai

, et al. Evaluation of large language models in breast cancer clinical scenarios: a comparative analysis based on ChatGPT-3.5, ChatGPT-4.0, and Claude2. Int J Sur. 2024;110:1941-1950.

36.

Yeo

Samaan

, et al. Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma. Clin Mol Hepatol. 2023;29:721-732.

37.

Wang

Zhou

Zhang

, et al. Large language model answers medical questions about standard pathology reports. Front Med. 2024;11:1402457. doi:10.3389/fmed.2024.1402457

38.

Choi

Song

Shin

Chang

Jang

BS.

Developing prompts from large language model for extracting clinical information from pathology and ultrasound reports in breast cancer. Radiat Oncol J. 2023;41:209-216. doi:10.3857/roj.2023.00633

39.

Huang

Yang

Rong

, et al. A critical assessment of using ChatGPT for extracting structured data from clinical notes. NPJ Digit Med. 2024;7:106. doi:10.1038/s41746-024-01079-8

40.

Sorin

Barash

Konen

Klang

Large language models for oncological applications. J Cancer Res Clin Oncol. 2023;149:9505-9508.

41.

Haemmerli

Sveikata

Nouri

, et al. ChatGPT in glioma adjuvant therapy decision making: ready to assume the role of a doctor in the tumour board? BMJ Health Care Inform. 2023;30:e100775.

42.

Liu

Shen

Chen

Large language model produces high accurate diagnosis of cancer from end-motif profiles of cell-free DNA. Brief Bioinform. 2024;25(5):bbae430. doi:10.1093/bib/bbae430

43.

Chen

Kann

Foote

, et al. Use of artificial intelligence chatbots for cancer treatment information. JAMA Oncol. 2023;9:1459-1462. doi:10.1001/jamaoncol.2023.2954

44.

Rajendran

Yang

Niedermayr

, et al. Large language model-augmented auto-delineation of treatment target volume in radiation therapy. ArXiv, July 10, 2024.

45.

Chang

Amini

Davis

, et al. ChatGPT4 outperforms endoscopists for determination of postcolonoscopy rescreening and surveillance recommendations. Clin Gastroenterol Hepatol. 2024;22:1917-1925. doi:10.1016/j.cgh.2024.04.022

46.

Zhang

Cheng

Wei

, et al. BertTCR: a Bert-based deep learning framework for predicting cancer-related immune status based on T cell receptor repertoire. Brief Bioinform. 2024;25(5):bbae420. doi:10.1093/bib/bbae420

47.

Marchi

Bellini

Iandelli

Sampieri

Peretti

Exploring the landscape of AI-assisted decision-making in head and neck cancer treatment: a comparative analysis of NCCN guidelines and ChatGPT responses. Eur Arch Otorhinolaryngol. 2024;281:2123-2136. doi:10.1007/s00405-024-08525-z

48.

Liu

Pastor-Serrano

Chen

, et al. Automated radiotherapy treatment planning guided by GPT-4Vision. ArXiv, July 1, 2024.

49.

Swanson

Zhang

Alizadeh

Zou

From patterns to patients: advances in clinical machine learning for cancer diagnosis, prognosis, and treatment. Cell. 2023;186:1772-1791. doi:10.1016/j.cell.2023.01.035

50.

Waqas

Bui

Glassy

, et al. Revolutionizing digital pathology with the power of generative artificial intelligence and foundation models. Lab Invest. 2023;103:100255.

51.

Ghebrehiwet

Zaki

Damseh

Mohamad

MS.

Revolutionizing personalized medicine with generative AI: a systematic review. Artif Intell Rev. 2024;57:1-41.

52.

Singh

Hathaway

Erickson

BJ.

Generative AI in oncological imaging: revolutionizing cancer detection and diagnosis. Oncotarget. 2024;15:607.

53.

Benary

Wang

Schmidt

, et al. Leveraging large language models for decision support in personalized oncology. JAMA Network Open. 2023;6:e2343689.

54.

Uprety

Zhu

West

ChatGPT – A promising generative AI tool and its implications for cancer care. Cancer. 2023;129:2284-2289.

55.

Shreve

Khanani

Haddad

TC.

Artificial intelligence in oncology: current capabilities, future opportunities, and ethical considerations. Am Soc Clin Oncol Educ Book. 2022;42:1-10. doi:10.1200/EDBK_350652

56.

Holzinger

Langs

Denk

Zatloukal

Müller

Causability and explainability of artificial intelligence in medicine. Wiley Interdiscip Rev Data Min Knowl Discov. 2019;9:e1312.

57.

Chen

Lin

Sun

Artificial intelligence for assisting cancer diagnosis and treatment in the era of precision medicine. Cancer Commun. 2021;41:1100-1115. doi:10.1002/cac2.12215

58.

Peng

Narayanan

Papadimitriou

On limitations of the transformer architecture. arXiv preprint arXiv:240208164, 2024.

59.

Cunningham

Brown

Gatenby

Staňková

Optimal control to develop therapeutic strategies for metastatic castrate resistant prostate cancer. J Theor Biol. 2018;459:67-78. doi:10.1016/j.jtbi.2018.09.022

60.

Cunningham

Thuijsman

Peeters

, et al. Optimal control to reach eco-evolutionary stability in metastatic castrate-resistant prostate cancer. PLoS ONE. 2020;15:e0243386. doi:10.1371/journal.pone.0243386

61.

Strobl

MAR

West

Viossat

, et al. Turnover modulates the need for a cost of resistance in adaptive therapy. Cancer Res. 2021;81:1135-1147. doi:10.1158/0008-5472.Can-20-0806

62.

Chahoud

Anderson

ARA

Zhang

Brown

Gatenby

RA.

Evolutionary dynamics and intermittent therapy for metastatic cancers. J Clin Oncol. 2023;41:4469-4471. doi:10.1200/JCO.23.00647

63.

Meagher

Salmasi

Stewart

TF.

Treatment landscape for metastatic castrate-sensitive prostate cancer: a review. Res Rep Urol. 2023;15:509-517. doi:10.2147/rru.S398129

64.

Klotz

The history of intermittent androgen deprivation therapy: a Canadian story. Can Urol Assoc J. 2020;14:159-162. doi:10.5489/cuaj.6601

65.

Virgo

Rumble

Talcott

Initial management of noncastrate advanced, recurrent, or metastatic prostate cancer: ASCO guideline update. J Clin Oncol. 2023;41:3652-3656. doi:10.1200/jco.23.00155

66.

Sayegh

Swami

Agarwal

Recent advances in the management of metastatic prostate cancer. JCO Oncol Prac. 2022;18:45-55. doi:10.1200/op.21.00206

67.

Gatenby

Brown

JS.

The evolution and ecology of resistance in cancer therapy. Cold Spring Harb Perspect Med. 2020;10(11):a033415. doi:10.1101/cshperspect.a040972.

68.

Stankova

Brown

Dalton

Gatenby

RA.

Optimizing cancer treatment using game theory: a review. JAMA Oncol. 2019;5:96-103. doi:10.1001/jamaoncol.2018.3395

69.

Brenner

Sequences and consequences. Philos Trans R Soc Lond B Biol Sci. 2010;365:207-212. doi:10.1098/rstb.2009.0221365/ 1537/207 [pii]

70.

Brenner

Nature’s gift to science. Chembiochem. 2003;4:683-687.

71.

Derbal

On modeling of living organisms using hierarchical coarse-graining abstractions of knowledge. J Biol Syst. 2013;21:1350008.

72.

Vaswani

Shazeer

Parmar

, et al. Attention is all you need. Adv Neur Inf Proc Sys. 2017;30:5998-6008.

73.

Eisenhauer

Therasse

Bogaerts

, et al. New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur J Cancer. 2009;45:228-247. doi:10.1016/j.ejca.2008.10.026

74.

NCI. Prostate-Specific Antigen (PSA) Test. Accessed December 23, 2024. https://www.cancer.gov/types/prostate/psa-fact-sheet

75.

Shayegan

Pouliot

Fernandes

Macri

Testosterone monitoring for men with advanced prostate cancer: review of current practices and a survey of Canadian physicians. Can Urol Assoc J. 2017;11:204-209. doi:10.5489/cuaj.4539

76.

Lone

Nisar

Masoodi

, et al. Liquid biopsy: a step closer to transform diagnosis, prognosis and future of cancer treatments. Mol Cancer. 2022;21:79. doi:10.1186/s12943-022-01543-7

77.

Urabe

Sumiyoshi

Tashiro

Goto

Kimura

Kobayashi

Prostate cancer and liquid biopsies: clinical applications and challenges. Int J Urol. 2024;31:617-626.

78.

Jamaspishvili

Berman

Ross

, et al. Clinical implications of PTEN loss in prostate cancer. Nat Rev Urol. 2018;15:222-234. doi:10.1038/nrurol.2018.9

79.

Nyquist

Corella

Coleman

, et al. Combined TP53 and RB1 loss promotes prostate cancer resistance to a spectrum of therapeutics and confers vulnerability to replication stress. Cell Rep. 2020;31:107669.

80.

Zhang

van Gent

Incrocci

van Weerden

Nonnekens

Role of the DNA damage response in prostate cancer formation, progression and treatment. Prostate Cancer Prostatic Dis. 2020;23:24-37. doi:10.1038/s41391-019-0153-2

81.

Fettke

Dai

Kwan

, et al. BRCA-deficient metastatic prostate cancer has an adverse prognosis and distinct genomic phenotype. EBioMedicine. 2023;95:102728.

82.

Dathathri

Isebia

Abali

, et al. Liquid biopsy based circulating biomarkers in metastatic prostate cancer. Front Oncol. 2022;12:863472. doi:10.3389/fonc.2022.863472

83.

Podolak

Eilers

Newby

, et al. Androgen receptor amplification is concordant between circulating tumor cells and biopsies from men undergoing treatment for metastatic castration resistant prostate cancer. Oncotarget. 2017;8:71447-71455. doi:10.18632/oncotarget.16169

84.

Trujillo

Wetterskog

Attard

Blood-based liquid biopsies for prostate cancer: clinical opportunities and challenges. Br J Cancer. 2022;127:1394-1402. doi:10.1038/s41416-022-01881-9

85.

Lopez-Bujanda

Haffner

Chaimowitz

, et al. Castration-mediated IL-8 promotes myeloid infiltration and prostate cancer progression. Nat Cancer. 2021;2:803-818. doi:10.1038/s43018-021-00227-3

86.

Shore

Saad

Cookson

, et al. Oral relugolix for androgen-deprivation therapy in advanced prostate cancer. N Engl J Med. 2020;382:2187-2196. doi:10.1056/NEJMoa2004325

87.

Fizazi

Tran

Fein

, et al. Abiraterone plus prednisone in metastatic, castration-sensitive prostate cancer. N Engl J Med. 2017;377:352-360. doi:10.1056/NEJMoa1704174

88.

FDA. Orgovyx (relugolix) tablets, for oral use: US prescribing information. https://www.accessdata.fda.gov/drugsatfda_docs/label/2020/214621s000lbl.pdf

89.

FDA. ZYTIGA™ (abiraterone acetate) Tablets For Oral Administration: US prescribing information. https://www.accessdata.fda.gov/drugsatfda_docs/label/2011/202379lbl.pdf

90.

Åström

Hägglund

Advanced PID control. 2005. Accessed December 23, 2024. https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=1580160

91.

Åström

Hägglund

Hang

WK.

Automatic tuning and adaptation for PID controllers: a survey. IFAC Proc Vol. 1992;25:371-376. doi:10.1016/S1474-6670(17)50762-4

92.

Nuella

Cheng

Chiu

M-S.

Adaptive PID controller design for nonlinear systems. Ind Eng Chem Res. 2009;48:4877-4883. doi:10.1021/ie801227d

93.

Shi

Lam

H-K

Xiao

Tsai

S-H.

Adaptive PID controller based on Q -learning algorithm. CAAI Trans Intell Tech. 2018;3:235-244. doi:10.1049/trit.2018.1007

94.

LaValle

SM.

Planning Algorithms. Cambridge University Press; 2006.

95.

LeCun

Bengio

Hinton

Deep learning. Nature. 2015;521:436-444. doi:10.1038/nature14539

96.

Zhang

Gradient descent based optimization algorithms for deep learning models training. arXiv preprint arXiv:190303614, 2019.

97.

Rumelhart

Hinton

Williams

RJ.

Learning representations by back-propagating errors. Nature. 1986;323:533-536.

98.

Faysse

Fernandes

Guerreiro

, et al. CroissantLLM: a truly bilingual French-English language model. arXiv preprint arXiv:240200786, 2024.

99.

Izumchenko

Paz

Ciznadija

, et al. Patient-derived xenografts effectively capture responses to oncology therapy in a heterogeneous cohort of patients with solid tumors. Ann Oncol. 2017;28:2595-2605. doi:10.1093/annonc/mdx416

100.

Kelly

Karthikesalingam

Suleyman

Corrado

King

Key challenges for delivering clinical impact with artificial intelligence. BMC Med. 2019;17:195. doi:10.1186/s12916-019-1426-2.

101.

Jain

Kankanhalli

Hallucination is inevitable: an innate limitation of large language models. arXiv preprint arXiv:240111817, 2024.

102.

Tonmoy

Zaman

Jain

, et al. A comprehensive survey of hallucination mitigation techniques in large language models. arXiv preprint arXiv:240101313, 2024.

103.

Ratitch

Trigg

Majumder

Vlajnic

Rethemeier

Nkulikiyinka

Clinical validation of novel digital measures: statistical methods for reliability evaluation. Digit Biomark. 2023;7:74-91. doi:10.1159/000531054

104.

FDA. Software as a Medical Device (SAMD): clinical evaluation, 2017. Accessed December 23, 2024. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/software-medical-device-samd-clinical-evaluation

Adaptive Treatment of Metastatic Prostate Cancer Using Generative Artificial Intelligence

Abstract

Keywords

Introduction

Adaptive Therapy of Metastatic Castrate-Sensitive Prostate Cancer

GenAI-Supported Adaptive Androgen Deprivation of mCSPC

OncoGPT Training on mCSPC Treatment Response Data

Discussion

Conclusions

Footnotes

Funding:

Declaration of conflicting interests:

Author Contributions

Data Availability Statement

ORCID iD

References