Abstract
Introduction
ChatGPT, as an AI tool, has been introduced in healthcare for various purposes. The objective of the study was to investigate the principal benefits of ChatGPT utilization in healthcare services and to identify potential domains for its expansion within the building blocks of the healthcare industry.
Methods
A comprehensive three-phase study was conducted employing mixed methods. The initial phase comprised a systematic review and thematic analysis of the data. In the subsequent phases, a questionnaire, developed based on the findings from the first phase, was distributed to a sample of eight experts. The objective was to prioritize the benefits and potential expansion domains of ChatGPT in healthcare building blocks, utilizing gray SWARA (Stepwise Weight Assessment Ratio Analysis) and gray MABAC (Multi-Attributive Border Approximation Area Comparison), respectively.
Results
The systematic review yielded 74 studies. A thematic analysis of the data from these studies identified 11 unique themes. In the second phase, employing the gray SWARA method, clinical decision-making (weight: 0.135), medical diagnosis (weight: 0.098), medical procedures (weight: 0.070), and patient-centered care (weight: 0.053) emerged as the most significant benefit of ChatGPT in the healthcare sector. Subsequently, it was determined that ChatGPT demonstrated the highest level of usefulness in the information and infrastructure, information and communication technologies blocks.
Conclusion
The study concluded that, despite the significant benefits of ChatGPT in the clinical domains of healthcare, it exhibits a more pronounced potential for growth within the informational domains of the healthcare industry's building blocks, rather than within the domains of intervention and clinical services.
Introduction
In the wake of the COVID-19 pandemic, there has been a marked acceleration and application of Artificial Intelligence (AI) in diverse fields such as healthcare, public safety, and business operations.1,2 Additionally, the existing evidence emphasizes the transformative potential of AI and advanced technologies on business practices in the post-pandemic world. 3
ChatGPT, a product of OpenAI, is an advanced artificial intelligence (AI) chatbot. It employs natural language processing techniques to facilitate dialogues that mimic human conversation. ChatGPT, which is constructed on the basis of the generative pre-trained transformer (GPT) model, possesses the ability to respond to inquiries and generate an extensive variety of written content. This includes, but is not limited to, articles, updates for social media, essays, code, and emails.4,5 This phenomenon highlights the critical importance of identifying areas with potential for the utilization and expansion of ChatGPT. Such recognition is vital for the formulation of future strategies and interventions aimed at improving services within sectors that stand to benefit from ChatGPT.
ChatGPT is pretrained on an extensive text corpus, which enables it to comprehend context and meaning. During this phase, the model discerns patterns and relationships within the language, thereby facilitating the generation of coherent responses to user inputs. Upon receiving a prompt, ChatGPT tokenizes the text and processes these tokens to predict the subsequent token in the sequence, selecting the one with the highest probability. This iterative process continues until a stopping criterion is met, such as a maximum length or a specific end token. 6 Furthermore, ChatGPT is capable of producing a variety of outputs, including concise answers, essays, and conversational interactions. 6 In this context, the utilization of ChatGPT has been identified as a valuable strategy to enhance healthcare services, offering personalized care, 24/7 availability, cost reduction, improved efficiency, and enhanced communication, education, and diagnosis.7–9
The use of ChatGPT in the field of education solves educational problems in less than a few minutes. 10 In the field of transportation, it can control traffic. 11 It also improves the quality of products during the production process. 12 In the field of business, it improves the quality and effectiveness of services provided to customers. 13 ChatGPT, as an AI application, is utilized in healthcare for various tasks like analyzing literature, functioning as a dialogue agent, aiding in medical education, research, and clinical management. It collects patient data, updates medical professionals with recent advancements, and suggests treatments based on symptoms.14,15 In this regard, as presented in a study, ChatGPT-3.5 and ChatGPT-4 have demonstrated strong performance on fundamental healthcare leadership and management inquiries. Notably, ChatGPT-4 has been presented to outperform its predecessor, making it the preferred choice for leadership and management training as well as for acquiring information on various hospital leadership and management topics. 16
As mentioned, all these articles address specific applications or benefits of ChatGPT in healthcare services. This particular paper, however, employs a systematic review to identify the benefits of ChatGPT in the healthcare sector and then prioritizes the healthcare building blocks based on these identified benefits. This approach aims to provide a comprehensive understanding of the benefits of ChatGPT in enhancing healthcare delivery.
Several review studies have investigated the advantages of employing ChatGPT in healthcare services.9,17,18 Conversely, other publications have underscored the limitations and challenges associated with the use of ChatGPT. One systematic review conducted by Sallam (2023), explored the potential benefits and limitations of ChatGPT in healthcare education, research, and practice. It discussed its applications, such as efficient data analysis, code generation, and personalized medicine, and raised concerns about ethical issues, lack of originality, and inaccurate citations. 18 Another study in the field of medical education has shown that ChatGPT scored near the passing threshold on important medical exams. It has also demonstrated the ability to write scientific abstracts at an acceptable level. However, the results of this paper have shown that the professional use of ChatGPT for academic writing should be approached with caution. 19 Another systematic review conducted by Garg et al. (2023) analyzed 118 articles and found that ChatGPT could assist with patient care and research tasks, but also highlighted concerns about its accuracy, reliability, and ethical implications. The review concluded that while ChatGPT could serve as a clinical assistant and aid in medical research and scholarly writing, it also presented limitations and ethical considerations that needed to be addressed. 17 Bias, transparency, privacy, accountability, equity, trust, and replacement have been identified as major ethical challenges associated with the use of AI in healthcare. 20
Furthermore, Sadaghat (2024) has underscored the challenges associated with the utilization of ChatGPT in medical research. A significant limitation is the risk of presenting incorrect or unverified content, which could undermine credibility. Additionally, the absence of transparent references and the potential for copyright or plagiarism issues present substantial challenges to the reliable use of ChatGPT in professional medical studies. 21 In another work, Sadaghat indicates that the primary challenge of using ChatGPT in daily medical practice is the necessity to carefully validate its outputs and ensure it is employed to complement, rather than replace, physicians’ clinical expertise and decision-making. 22
In this context, it has been reported that the responsibility for patient harm resulting from ChatGPT-generated advice remains unclear, necessitating the establishment of clear legal frameworks to define accountability and protect patient data. The use of ChatGPT may disrupt the traditional physician-patient relationship, potentially undermining essential compassion and trust. Conversely, overreliance on AI could diminish the humanistic aspects of care, leading to reduced patient adherence to treatment plans. Transparency regarding AI-generated content is crucial to maintaining trust and integrity in healthcare. Additionally, algorithmic biases in AI systems, such as those in ChatGPT, can perpetuate existing inequalities in healthcare. Finally, ensuring the accuracy and reliability of AI-generated information requires continuous updates and rigorous validation against clinical standards.23–25
Gala and Makaryus (2023) investigated the benefits and concerns of using ChatGPT in the fields of cardiology and their findings discussed the importance of providing adequate training to healthcare professionals who use artificial intelligence tools. In their research, they found out that the incorrect use of artificial intelligence technology can have negative consequences such as wrong diagnosis or incorrect treatment decisions for the patient. Additionally, if an AI model is not properly trained or validated, it may produce false or misleading results that could lead to unintended consequences. 26 In the course of this research, we employed the framework of building blocks as established by the World Health Organization (WHO) to comprehend the categorized domains within the network of healthcare service delivery. This framework is architecturally designed with six fundamental components: provision of services, healthcare personnel, health data systems, medical commodities, immunizations and technologies, funding, and leadership/administration. The framework offers a collection of metrics and evaluative indicators to gauge the efficacy of these fundamental components and to monitor advancements in fortifying healthcare systems.27,28
As observed, while multiple systematic reviews have explored the advantages of employing ChatGPT in healthcare services, none have aimed to prioritize the most significant benefits. This can be regarded as a significant gap within the literature; Therefore, this study sought to address this gap by providing scientific evidence on the existing benefits of using ChatGPT within the healthcare system. Thus, the ultimate aims of the study are
Explore and prioritize the benefits of utilizing ChatGPT in healthcare services. Determine the potential domains of ChatGPT expansion within the healthcare industry.
This study employs a mixed-methods approach, integrating findings from existing literature with expert opinions in the field through the application of multi-criteria decision-making techniques. This methodology represents a novel contribution to the relevant literature and offers valuable and comprehensive insights for stakeholders regarding the utilization of ChatGPT and its potential applications within the healthcare sector. It provides the findings of other papers within the literature and the views of experts within a single document. The findings can be utilized by managers and policymakers within the healthcare sector, identifying potential areas where ChatGPT can be employed and ultimately achieving the envisioned outcomes through such utilization. Furthermore, the findings of this paper can be utilized by future researchers, highlighting the most important areas for conducting research and offering detailed information on the potential impacts of ChatGPT utilization.
Material and method
Methodology
This research, conducted in 2023, utilized a mixed-methods approach, integrating both qualitative and quantitative methodologies, and was executed in three distinct phases. The primary objective of the study was to investigate and prioritize the advantages of implementing ChatGPT in healthcare services during phases one and two. In phase three, the study aimed to identify potential areas for the expansion of ChatGPT within the healthcare sector, as detailed in the subsequent section on objectives and research questions.
Objectives and research question
Phase one
The objective of the systematic review was to systematically examine the benefits of employing ChatGPT within healthcare services, as documented in the literature. The Joanna Briggs approach was employed for this purpose. 29 Initially, a research question was formulated for this phase of the study in collaboration with the authors, which was: “What are the benefits of utilizing ChatGPT within healthcare services according to the existing literature?”
Phase two
In this phase of the study, we aimed to prioritize ChatGPT benefits utilizing the gray SWARA (Gray Stepwise Weight Assessment Ratio Analysis) methodology. The research question for this phase of the study was formulated as: “How ChatGPT benefits are prioritized in terms of their importance according to the study experts?”
Phase three
In this phase of the study, we aimed to determine the potential building blocks for ChatGPT expansion within healthcare systems using the Gray MABAC (Gray Multi-Attributive Border Approximation Area Comparison) methodology. The research question for this phase of the study was formulated as: “What are the potential building blocks for ChatGPT expansion within healthcare systems according to the study experts?”
Data collection
Phase one
A systematic exploration was undertaken to locate all published articles pertinent to the research topic, within the timeframe of 2000–2023, and exclusively in English. The databases of PubMed, Scopus, Cochrane, and ProQuest were utilized for this search since they are considered the most prominent databases indexing studies regarding the context. 30 Medical Subject Headings (MeSH) terms were employed to classify all keywords into two categories: benefits, and ChatGPT. Synonymous keywords were amalgamated using the logical operator “OR”. Subsequently, the logical operator “AND” was applied to consolidate the first, second, and third groups of keywords. The references were managed using EndNote 21.2 software. The search strategy employed to identify relevant literature is depicted in Table 1, and was registered in GitHub with the following link:
The search strategy utilized within the systematic review.
The inclusion criteria encompassed:
Papers addressing the research question. Papers published within the timeframe from 2000 to 2023. Papers written in English. Brief communications, letters to the editor, and conference papers. Papers identified as duplicates.
The exclusion criteria encompassed:
In this stage of the study, duplicate studies were eliminated, and the remaining ones were screened based on their titles and abstracts. Studies not pertinent to our research objective were discarded, and the full text of the remaining articles was reviewed. Only those that satisfied our eligibility criteria were included in the final analysis. Furthermore, only studies providing data relevant to the research question were included in the study. This entire process was independently executed by two researchers. Data corresponding to the study objective were independently extracted from the final studies by two authors. A third author was consulted in case of any conflict of views between the initial two authors.
Phase two
In this phase of the study, the data was derived from a sample of experts in the medical sciences field who have proficiency in ChatGPT and its utilization in healthcare services. In this regard, eight experts were utilized comprising of physicians, pharmacologists, healthcare practitioners, and researchers. The experts were free to withdraw from the study if they presented their unwillingness to participate in the research (exclusion criteria). The criteria for inclusion in the study were as follows:
General familiarity with ChatGPT and its application in healthcare services. General experience in using ChatGPT in the delivery of healthcare services or conducting healthcare research. Possession of a PhD. degree in a healthcare field or being a practicing physician.
Data collection was conducted using a questionnaire formulated based on the data obtained from the review conducted in the initial phase of the study. During this process, the benefits of ChatGPT derived from the data were integrated into the questionnaire to be assessed and ranked using the gray SWARA (SWARA-G) method. Consequently, data were collected through two different questionnaires. The first questionnaire was designed for the initial ranking of the identified benefits, while the second questionnaire was used for the final weighting of the benefits. These questionnaires were subsequently administered to experts in two consecutive stages. Ultimately, the benefits were weighted and ranked based on their importance according to the opinions of these experts.
Phase three
In this phase of the study, data collection was conducted using the same sample of experts as in phase two. Meanwhile, the inclusion and exclusion criteria remained consistent. The data collection process was executed via a questionnaire. This questionnaire was constructed by leveraging the data obtained from the thematic analysis conducted in the initial phase of the study. Additionally, it incorporated the framework of monitoring the building blocks of health systems, a concept published by the World Health Organization (WHO) in order to prioritize potential healthcare domains for ChatGPT expansion.
27
This study utilized several items from the framework, including the following:
Infrastructure, Information and Communication Technologies, which underscore the need for robust and reliable infrastructure and information technologies in health systems.
27
Health Workforce, focusing on the accessibility, skill diversity, and distribution of the healthcare workforce, representing one of the six fundamental components of the WHO framework for monitoring healthcare system building blocks.
27
Information, highlighting the need for reliable and robust data for effective health system monitoring.
27
Intervention Access and Services Readiness, aligning with the service delivery block of the WHO's building blocks, focusing on the accessibility and readiness of healthcare services for its users.
27
Intervention Quality, Safety, aligning with the service delivery block of the WHO's building blocks, focusing on the quality and safety of services provided to beneficiaries within healthcare systems.
27
Upon integrating the benefits derived from the thematic analysis of the systematic review and the building blocks proposed by the WHO framework into the final questionnaire, the Gray MABAC (Multi-Attribute Border Approximation Area Comparison) methodology was employed to facilitate the ranking and prioritization of the health blocks. Therefore, the required data has been collected through a questionnaire that was designed for this method. At the next step, the questionnaire was disseminated to the experts. Appendix A (Questionnaire) presents the content of the questionnaire.
Data analysis
Phase one
During this phase of the research, the thematic analysis approach formulated by Braun and Clarke was utilized. This approach consists of six stages: data familiarization, data coding, theme generation, theme review, theme definition and naming, and the final report. 31
At the outset, the authors initiated a process of becoming acquainted with the subject matter and the research context by conducting a comprehensive examination of the content pertinent to the topic. Following this, the texts relevant to the research question in the data from the final studies were subjected to coding. In the subsequent stage, sub-themes and themes were derived from the coded data through a process of categorization and grouping of the codes. The authors then undertook multiple reviews of the generated themes to ensure the process's validity and reliability and to avoid any risk of bias. In the next step, the authors delineated and assigned names to the themes and their sub-themes based on their intrinsic and existential attributes. Finally, the authors amalgamated the generated themes, sub-themes, and their respective codes into a unified document. MAXQDA 2020, a prominent software for conducting qualitative analysis, was utilized for the analysis. 32
Phase two
The Swara-G method combines gray theory with the Swara approach to achieve the benefits of this combination. As a valuable decision-making methodology, the Swara-G method proves instrumental in analyzing situations characterized by uncertainty. Also, this approach has a simpler evaluation process than other methods such as analytical hierarchy process (AHP). 33 For instance, in scenarios where there are 11 criteria requiring evaluation, the Swara-G method necessitates a mere 10 pairwise comparisons, whereas gray’s Analytic Hierarchy Process (AHP-G) entails 55 pairwise comparisons by the expert for problem resolution.
Swara's approach incorporates the expert's perspective regarding the validity of weight criteria. In other words, Swara's method provides the possibility to evaluate and determine the weight for the criteria and considers the expert's opinion about the correctness of this weight. Moreover, within this methodology, experts possess the opportunity to engage in collaborative consultations and mutually cooperate, consequently yielding more precise and robust outcomes in contrast to alternative approaches employed in multi-criteria decision-making. 34 In addition, Swara's method requires simple computational steps, which makes it very user-friendly. 35 Moreover, gray theory can model and predict in uncertain and incomplete conditions. 36
Compared to the standard SWARA method, the gray SWARA approach provides a more robust and comprehensive evaluation of the criteria, leading to better-informed decisions. The key steps of gray SWARA are as follows: identify the criteria and have decision-makers provide assessments using gray numbers; rank the criteria from most to least important; determine the comparative importance of the average gray value for each criterion; calculate the gray coefficient and gray weights; and normalize the gray weights to obtain the final prioritization.
This gray SWARA analysis enabled decision-makers to make more informed decisions in a military context, considering the uncertainty and vagueness in the evaluation criteria. The use of gray numbers helps decision-makers better reflect the real-world complexities they face. 36
In our study, we have employed the gray SWARA method to weigh the benefits of ChatGPT in the field of healthcare. So, through the utilization of gray theory, the analysis of uncertain information can be effectively incorporated into modeling processes, enabling the amalgamation of such data with definitive information to yield more precise outcomes.
Steps of Swara-G
Step 1: The first step in the Swara-G method is to identify and define the elements that must be weighted.
Step 2: Ranking factors
In this step, the experts assign ranks to the criteria in descending order of importance, from the most significant criterion to the least significant criterion.
During this stage, decision-makers assess the gray relative importance values. Specifically, the relative significance of each criterion or sub-criterion is evaluated in relation to its preceding criterion or sub-criterion, based on the prevailing research conditions.
Step 4: Calculation of comparative gray coefficients
The initial phase of the mathematical procedure in the Swara-G method involves the computation of gray comparison coefficients utilizing equations (1) and (2):
In this step, using equations (3) and (4), unscaled gray weights of criteria and sub-criteria are obtained.
In this step, the scaled gray weights are calculated using equations (5) and (6).
In this step, the scaled weights are calculated using equation (7).
In this step, Experts’ opinions are integrated using equation (8).
Phase three
The framework for utilizing the MABAC (Multi-Attributive Border Approximation Area Comparison) method involves defining the distance of the criterion function from each alternative, based on the BAA (Boundary Approximation Area). The MABAC-G represents an extended version of the crisp MABAC. The gray MABAC is a multi-criteria decision-making technique used to evaluate and rank a set of alternatives based on various performance criteria. The gray MABAC method involves several steps, including the normalization of the decision matrix, the calculation of the weighted normalized decision matrix, the determination of the ideal and anti-ideal solutions, and the ranking of the alternatives based on their proximity to the ideal solution. It is particularly useful when dealing with uncertain or incomplete information, as it can incorporate gray numbers into the decision-making process. 36
Our study employs the gray MABAC multi-criteria decision-making technique to systematically prioritize the healthcare building blocks according to the potential benefits of integrating ChatGPT. In other words, the gray MABAC analysis determines the healthcare domains where ChatGPT can be most advantageously utilized. By calculating the distance of each healthcare building block from the ideal and anti-ideal solutions, the gray MABAC method identifies and ranks the top-priority domains where ChatGPT can be most beneficially utilized.
The procedure for implementing the MABAC-G, encompasses the following steps 37 :
Step 1: The construction of the aggregate gray decision matrix is the initial step. Given “m” alternatives, “n” criteria, and “k” experts, the aggregate gray decision matrix is derived through the application of equation (8).
Ethical considerations
All procedures were conducted in strict adherence to relevant ethical guidelines and regulations, including the Helsinki Declaration of 1975. 38 In this regard, Informed consent was obtained from all members of the expert panel, who were also given the freedom to withdraw from the study at any time upon request.
Results
The findings of the study are presented in the subsequent sections.
Phase one: identification of ChatGPT benefits through systematic review
As demonstrated in Figure 1, from the total of 1049 studies retrieved from various databases, 86 were identified as duplicates. Upon meticulous screening of the titles, abstracts, and full texts of the remaining manuscripts, a final selection of 74 studies was included in the research. All of the studies were published in 2023 and consisted of both qualitative and quantitative methodologies. Furthermore, the studies were conducted in diverse regions, including the United States, the United Kingdom, Saudi Arabia, and multiple other locations.

PRISMA diagram of the systematic review.
Thematic analysis
As delineated in Table 2, the thematic analysis of the data acquired through the final studies yielded 11 distinct themes. These themes encompassed Medical Documentation and Insights, Healthcare Information and Education, Clinical Decision-Making, Healthcare Research, Healthcare Writing, Medical Diagnosis, Medical Procedures, Healthcare Surveys, Privacy, Patient-Centered Care, and Administrative Tasks.
Thematic analysis on the data acquired from the included studies.
Medical documentation and insights
ChatGPT can quickly summarize the documented medical records, making it easier for doctors to review patient histories and make informed decisions. Furthermore, ChatGPT can assist with report summaries, making it easier for doctors to review and interpret imaging and examination results.39–69
Healthcare information and education
ChatGPT can aid in medical education by providing information, updates, and explanations to healthcare professionals and patients, helping them stay informed about new developments in the professionals` respective fields and various health conditions and treatment options for patients while answering common questions and providing information about procedures and treatments.39,41,43,44,47,48,50–56,58–64,66,67,69–95
Clinical decision-making
ChatGPT can assist healthcare professionals in making clinical decisions by providing them with evidence-based recommendations.26,43,49,52–54,62,67,69,72,74,75,78,80,88,91,96–101
Healthcare research
ChatGPT can support medical students in their learning process by answering questions and providing feedback. It can also assist researchers in analyzing large datasets and generating hypotheses.9,14,40,41,45,53,55,57,58,63,64,70,72,73,76,77,83,86,91,94,96,98,102–104
Healthcare writing
ChatGPT can assist in healthcare writing, making it easier for healthcare professionals to draft reports and other documents.40,41,43,45,51,53,57,58,66,67,72,75,77,96,102,103
Medical diagnosis
ChatGPT has the potential to assist with diagnosis of multiple illnesses, helping doctors to identify the conditions and diseases.14,50–52,54,59–61,70,75,89,90,101,104–107
Medical procedures
ChatGPT can be utilized in clinical procedures, improving patient outcomes through AI technology.39,55,62,70,75,83,85,93,95,104,108,109
Healthcare surveys
ChatGPT can assist in drafting patient surveys, streamlining the process of collecting and analyzing data.40,48,75
Privacy
ChatGPT can assist in deidentifying patient data, ensuring compliance with HIPAA requirements, and maintaining patient privacy and confidentiality.48,60,86
Patient-centered care
ChatGPT can provide personalized and timely responses to patients’ inquiries, support symptom tracking and medication adherence, and offer mental health support through conversational interfaces.41,50,72,83,87–90,92,97,98
Administrative tasks
ChatGPT can automate administrative tasks such as scheduling appointments, managing medical records, and handling insurance.9,41,48,70,85,87,94,98,108
Phase two: weighting of ChatGPT benefits through gray SWARA
In this phase, the gray SWARA method was used to weigh the benefits of ChatGPT in the field of healthcare. Based on the first and second stages of the gray SWARA method, the experts have ranked 11 potential benefits of ChatGPT in healthcare as indicators, ordered from the most important to the least important index. Then, the experts evaluated the relative importance of each indicator compared to the previous indicator. This evaluation was conducted using the linguistic terms of “much less important,” “less important,” “moderately less important,” and “relatively equal importance. Subsequently, the table of the relative importance of the indicators was converted into a table of gray relational importance values according to gray numbers. The final results are presented in Table 3. The average of the weights for each indicator is presented in the last column of this table as the final weights.
Calculation of scaled weights and final weights.
As depicted in Table 4, certain benefits of ChatGPT were accorded significantly greater priority than others. The benefits that were given higher precedence encompassed clinical decision-making (weight: 0.135), medical diagnosis (weight: 0.098), medical procedures (weight: 0.070), and patient-centered care (weight: 0.053). This indicates that ChatGPT appears to confer substantial benefits to the medical domain of healthcare services, far exceeding its impact on other service domains such as administrative tasks, which was ranked as the least prioritized benefit (weight: 0.006). Figure 2 illustrates the final weights of the benefits of using ChatGPT in the healthcare sector. As the figure presents, clinical decision-makings and medical diagnosis significantly surpass the rest of ChatGPT benefits in terms of importance and priority.

Radar chart of the weights of the benefits of using ChatGPT.
Ranking of ChatGPT benefits in terms of importance and priority.
Phase three: prioritization of building blocks according to ChatGPT benefits through MABAC-G
As illustrated in Table 5, the domains of Information and Infrastructure, Information and Communication Technologies were identified as the most critical domains of healthcare system building blocks, possessing the greatest potential for ChatGPT expansion. Conversely, the domain of Intervention Access and Services Readiness was regarded as having the least potential.
Ranking of building blocks according to ChatGPT benefits.
Sensitivity analysis of the results
The key limitation of using the gray MABAC method in this study is the inherent subjectivity involved in the decision-making process. The gray MABAC method used to evaluate the potential expansion domains of ChatGPT is sensitive to the choice of reference points. The selection of these reference points can significantly impact the final rankings, introducing another source of subjectivity that may limit the reliability and replicability of the findings. To address these limitations and to enhance the validity of the results, in this study, the healthcare system building blocks were prioritized using several methods. The results are presented in Table 6.
Sensitivity analysis of option rankings.
According to Table 6, in all three methods, ChatGPT provides the greatest benefit in the information block. Additionally, in all three methods, the facilities and information and communication technology block hold the second position.
Discussion
In this research, the SWARA-G method was utilized to weigh the benefits of using ChatGPT in the health system, and the MABAC-G method was employed to prioritize the health building blocks within these benefits in a gray environment. Additionally, Microsoft Excel was used to implement the SWARA-MABAC method in our study. The mathematical structure of the MABAC approach remains constant, regardless of the number of alternatives and criteria. This approach demonstrates the capability to be applied to a greater multitude of alternatives and criteria, thereby offering a distinct ranking of alternatives accompanied by numerical values, which enhances the comprehensibility of the results. Furthermore, this method can be used for both qualitative and quantitative criteria. 110
The incorporated gray SWARA-MABAC approach has significant advantages over other multi-criteria decision-making methods. In this manner, the relationship between the benefits of ChatGPT within the health blocks can be accurately identified. This methodology holds significant potential in informing managerial and strategic decisions within the healthcare domain, enhancing both accuracy and predictability. By providing decision-makers with a comprehensive understanding of the interrelationships among the investigated factors, it enables more optimal decision-making processes.
In this segment of the study's outcomes, a detailed analysis and discussion will be conducted, specifically focusing on the prioritization of benefits derived from ChatGPT and potential healthcare building blocks for the expansion of ChatGPT.
Review of ChatGPT benefits
As presented within the results section, the thematic analysis of the data acquired through the review yielded 11 themes. The themes included Medical Documentation and Insights, Healthcare Information and Education, Clinical Decision-Making, Healthcare Research, Healthcare Writing, Medical Diagnosis, Medical Procedures, Healthcare Surveys, Privacy, Patient-Centered Care, and Administrative Tasks. Among the themes, Healthcare Information and Education had the highest share of citations among the included studies, being cited by 70% of them. Medical Documentation and Insights was another theme with a significant share of citations, cited by 41% of the studies.
The results corresponding to this section of the study highlighted that ChatGPT's benefits, including clinical decision-making, medical diagnosis, medical procedures, and patient-centered care, were prioritized as the most important. These findings indicated significantly higher potential benefits of ChatGPT in the medical domain of healthcare services, surpassing its benefits in administrative tasks suggesting that policy makers, manufacturers, and researchers in the healthcare industry should focus on enhancing ChatGPT's benefits within clinical domains rather than administrative and nonmedical ones, which can be a significant implication derived from the findings of the study. As the results of the study indicated, the highest potential benefits of ChatGPT within the realm of healthcare services was proposed to be within the scope of clinical domains of healthcare systems. In such context, ChatGPT's capability for continuous clinical decision assistance is demonstrated to possess an overall precision of 71.7% across a variety of clinical scenarios. This indicates that ChatGPT exhibits remarkable accuracy in clinical decision-making, with its proficiency escalating as it acquires more clinical data. 111 Nonetheless, it has been disclosed that healthcare professionals exhibit moderate to low degrees of confidence in ChatGPT's capacity to formulate medical decisions. Moreover, there are abundant apprehensions regarding the accuracy, dependability, and medicolegal consequences of the data provided by ChatGPT. 89
Another prioritized potential benefit of employing ChatGPT within healthcare services was the enhancement of patient-centered care. Patient-centered care is an approach that values patients’ experiences, needs, and preferences and necessitates active understanding from healthcare organizations and professionals; It fosters partnerships among practitioners, patients, and their families, ensuring decisions align with patients’ desires and needs.112,113 In this regard, ChatGPT has the potential to enhance the conveyance of discoveries, foster interdisciplinary cooperation, and streamline workflow, thereby augmenting the efficacy in pinpointing and directing patients in their postoperative care.114,115 Moreover, ChatGPT is suggested as a solution to surmount linguistic obstacles encountered by patients who are not native English speakers, thus promoting communication that is centered around the patient. 116 Furthermore, ChatGPT can provide support to healthcare professionals in addressing patient inquiries, composing medical notes, discharge summaries, and treatment plans, which in turn bolsters the efficiency and precision of healthcare provision. 17
Administrative tasks were regarded as the least significant benefit of ChatGPT within the context of healthcare services. The low prioritization of utilizing chatbots for administrative tasks appears to be associated with the seemingly less significant outcomes of artificial intelligence applications in administrative domains compared to those in clinical settings, as evidenced by the existing literature.117,118 Another factor contributing to the low prioritization of AI in healthcare may be the ethical concerns associated with its implementation. These concerns encompass issues such as data privacy, algorithmic bias, and accountability for decisions made by AI systems.119,120 Furthermore, the potential replacement of humans by AI technology in administrative and decision-making roles is expected to be limited due to these ethical considerations.121,122
Review the prioritization of potential healthcare building blocks for ChatGPT expansion
The study results highlighted that the building blocks of Information, and Infrastructure, Information and Communication Technologies, hold significant potential for ChatGPT expansion. However, the Intervention Access and Services Readiness domain was found to have the least potential.
The findings in this section of the study are supported by multiple researches within the literature.107,123 ChatGPT's application in the domain of medical information is significant, particularly in aiding patients’ health management. It can automate the summarization of patient interactions and medical histories, thereby streamlining recordkeeping for healthcare professionals. By transcribing their notes, clinicians can use ChatGPT to summarize essential details like symptoms, diagnoses, and treatments, and extract pertinent patient record information. It can also facilitate clinical trial recruitment by analyzing patient data to identify eligible individuals. Furthermore, ChatGPT can help patients manage their medications by providing reminders, dosage instructions, and information about potential side effects and drug interactions. As per a recent article, ChatGPT can act as a reliable agent to gather information from patients with various diseases. 14
Another finding of the study within this section was the low priority of Intervention Access and Services Readiness domain for ChatGPT expansion. In this regard, several limitations of ChatGPT have been reported that may hinder its potential for providing immediate and accessible clinical services to users. These limitations encompass a lack of human-like comprehension and a propensity for generating irrelevant or unoriginal text. Moreover, while ChatGPT can contribute to medical education, research, and clinical management, it is not a substitute for human expertise and knowledge, and thus, must be employed judiciously. 15
Limitations and implications
This study presented several limitations and implications that warrant attention. The experts involved in the study were exclusively based in Iran, and we were unable to engage with international experts. Consequently, our findings concerning the prioritization of ChatGPT benefits and potential domains for ChatGPT expansion within healthcare building blocks may be influenced by the socioeconomic factors and other characteristics specific to Iran and its healthcare system. We, therefore, encourage future researchers to conduct similar studies in other countries to gain a more accurate understanding of the topic with reduced bias.
Additionally, there are limitations in the chosen methodology. The gray SWARA and gray MABAC methods rely on subjective weightings and reference points, which could introduce uncertainties and inconsistencies in the prioritization of benefits and expansion domains for ChatGPT in healthcare. To address these limitations, we used several methods as sensitivity analysis of the results. However, future research can employ more diverse decision-making techniques. Despite our efforts, there may still be inherent biases or limitations in the expert selection process that could have influenced the findings. Future researchers can expand the expert panel to include a more diverse range of stakeholders and perspectives.
The study's findings could have substantial implications for healthcare policymakers, manufacturers, and researchers by providing a prioritized list of ChatGPT benefits within healthcare systems and potential domains for ChatGPT expansion. We recommend that future researchers conduct studies connecting the findings of studies similar to this one with those conducted in other sectors beyond the healthcare industry, as such results would be fruitful in providing a more detailed view of the benefits of ChatGPT. Further research is required to deepen and refine our understanding of these findings.
Conclusion
The study was executed in three phases. The first phase involved a systematic review that resulted in 74 studies. A thematic analysis of the data from these studies revealed 11 unique themes, including Medical Documentation and Insights, Healthcare Information and Education, Clinical Decision-Making, Healthcare Research, Healthcare Writing, Medical Diagnosis, Medical Procedures, Healthcare Surveys, Privacy, Patient-Centered Care, and Administrative Tasks. In the second phase, the views of multiple experts were utilized to prioritize the clinical domains of ChatGPT benefits, such as clinical decision-making, medical diagnosis, medical procedures, and patient-centered care, as the most important domains of ChatGPT benefits in healthcare. Concurrently, the study results underscored that the information building blocks of healthcare systems, including Information, Infrastructure, and Information and Communication Technologies, possess substantial potential for ChatGPT expansion. However, the Intervention Access and Services Readiness domain was identified as having the least potential. This finding suggests that ChatGPT has a higher potential for expansion in the information domains of the healthcare industry's building blocks rather than the intervention and clinical service-based domains.
Supplemental Material
sj-docx-1-dhj-10.1177_20552076241297059 - Supplemental material for ChatGPT utilization within the building blocks of the healthcare services: A mixed-methods study
Supplemental material, sj-docx-1-dhj-10.1177_20552076241297059 for ChatGPT utilization within the building blocks of the healthcare services: A mixed-methods study by Payam Shojaei, Mohsen Khosravi, Yalda Jafari and Amir Hossein Mahmoudi, Hadis Hassanipourmahani in DIGITAL HEALTH
Supplemental Material
sj-pdf-2-dhj-10.1177_20552076241297059 - Supplemental material for ChatGPT utilization within the building blocks of the healthcare services: A mixed-methods study
Supplemental material, sj-pdf-2-dhj-10.1177_20552076241297059 for ChatGPT utilization within the building blocks of the healthcare services: A mixed-methods study by Payam Shojaei, Mohsen Khosravi, Yalda Jafari and Amir Hossein Mahmoudi, Hadis Hassanipourmahani in DIGITAL HEALTH
Footnotes
Acknowledgments
We would like to acknowledge the Bing AI chatbot for its contribution to rewriting the manuscript in terms of English grammar and words.
Availability of data
The research data can be accessed by contacting the corresponding author of the research.
Contributorship
PSH initiated the research, conducted the data analysis and revised the manuscript. MKH conducted the review and cooperated in writing of the manuscript. YJ cooperated in conduction of the data analysis and writing of the manuscript. AHM cooperated in conducting the review and analysis. HH cooperated in conducting the review and analysis.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Ethical approval
Not applicable to this methodology of research.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Guarantor
PS.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
