Abstract
Objective
The integration of medical large language models (MLLMs) into healthcare has garnered global interest, however, the determinants of their adoption by medical professionals remain underexplored. This study aims to elucidate the factors influencing doctors’ intention to utilize MLLMs, encompassing both psychological determinants and demographic attributes.
Methods
An extended theoretical model was developed using constructs derived from the Technology Acceptance Model (TAM) and five constructs. A hybrid online and offline survey was conducted from March to December 2023, including 955 Chinese medical practitioners. Structural equation modeling was utilized to test the research hypotheses.
Results
The measurement model exhibited satisfactory reliability and validity, with fit indices meeting scholarly standards. Perceived ease of use emerged as a significant predictor of both perceived usefulness and satisfaction. Content quality was identified as a substantial influence on perceived satisfaction but did not significantly predict perceived usefulness. Technical support and social influence were found to significantly affect perceived usefulness without directly impacting satisfaction. Perceived usefulness positively influenced both satisfaction and usage behavior, while perceived risk had a negative effect. A significant relationship between perceived satisfaction and usage behavior was established, with gender, age, education, and professional title moderating this relationship.
Conclusions
The study provides empirical evidence for understanding the adoption of MLLMs by Chinese doctors, offering management implications for future technical research, development, and implementation in the medical field.
Introduction
The emergence of medical large language models (MLLMs), as sophisticated deep learning-based natural language processing (NLP) tools, has marked a significant advancement in the field of healthcare. These models have demonstrated exceptional capabilities in clinical decision support and patient interaction, showcasing a potential that rivals, and in some instances, surpasses the performance of medical experts in specific medical tasks. 1 For instance, models such as Google's PaLM2 and Hippocratic AI have been recognized for their high accuracy in these domains. 2 As intelligent assistants, MLLMs offer substantial benefits to the medical community by aiding physicians in rapid and precise diagnosis, clinical decision-making, and patient care delivery.3,4 Their integration into healthcare systems promises to enhance the efficiency and accuracy of medical practice, mitigate the occurrence of medical errors, and facilitate improvements in remote consultations and the management of medical documentation. 5 In China, with its extensive population base and abundant medical data, presents a unique landscape for the application of MLLMs.6,7 The deployment of these models in China holds the promise of addressing disparities in the distribution of medical resources, alleviating the workload of healthcare professionals, and catering to the growing demand for efficient healthcare services. Furthermore, MLLMs are expected to contribute significantly to chronic disease management, a critical area within the healthcare sector. 8 The role of MLLMs in the Chinese medical and health sector is set to become increasingly pivotal, heralding a new era of technological integration in healthcare delivery.
Despite the potential benefits, the adoption of MLLMs by doctors is a critical factor determining their practical effectiveness in medicine. While current research predominantly concentrates on the technical aspects of MLLMs, there is a significant gap in understanding the behavioral aspects of their adoption, especially within the Chinese context, where cultural and systemic differences may present unique challenges. This study addresses this research gap by examining the behavioral intentions and influencing factors of Chinese doctors regarding the adoption of MLLMs. Utilizing surveys and interviews, the study will develop a comprehensive model that incorporates individual and psychological factors, shedding light on the mechanisms of adoption. The insights gained from this research will offer tailored recommendations for the Chinese medical context, thereby facilitating the effective integration and utilization of MLLMs in clinical practice.
Methods
Theoretical model
The theoretical model underpinning the study is primarily based on the technology acceptance model (TAM), which traditionally emphasizes perceived usefulness (PU) and perceived ease of use (PEOU) as determinants of technology adoption.9,10 The present study extends it by incorporating additional constructs that emerged as context-specific predictors within the domain of MLLMs. Content quality, another construct added to the model, originates from the information quality famework and is critical in the context of healthcare technology, where the accuracy, relevance, and timeliness of information can significantly impact decision-making.11–13 Technical support was added to account for the facilitating conditions that can influence adoption.11,14 Social influence was included as the interconnectedness of the modern digital age facilitates the rapid dissemination of information and opinions among peers.15,16 Perceived risk was included to account for potential negative implications associated with technology use, such as privacy concerns or data security issues, which have been shown to influence satisfaction and usage behavior.17,18 Perceived satisfaction reflects the affective evaluation of technology use, which is increasingly recognized for its influence on user engagement and long-term adoption.12,19,20 The theoretical model is shown in Figure 1.

Theoretical model.
Hypotheses development
The hypotheses of this study are grounded in a robust theoretical framework and supported by empirical evidence from the literature. Each hypothesis is developed with reference to specific constructs and relationships identified in the existing body of knowledge.
The positive impact of perceived ease of use on both perceived usefulness and perceived satisfaction is posited based on the foundational work of Davis (1989), which established perceived ease of use as a key determinant of perceived usefulness within the technology acceptance model (TAM).
21
Perceived ease of use can lead to a more positive user experience, thereby increasing perceived satisfaction with technologies.
22
Therefore, the following two research hypotheses are proposed:
H1a: Perceived ease of use has a positive impact on doctors’ perceived usefulness in adopting MLLM. H1b: Perceived ease of use has a positive impact on doctors’ perceived satisfaction in adopting MLLM.
The influence of content quality on perceived usefulness and satisfaction is derived from the Information Quality Framework, which emphasizes the importance of accurate and relevant information in user decision-making processes.
23
In the context of healthcare, where the stakes are particularly high, content quality is considered a critical factor in the adoption of MLLMs.24,25 Therefore, the following hypotheses are proposed:
H2a: Content quality has a positive impact on doctors’ perceived usefulness in adopting MLLM. H2b: Content quality has a positive influence on doctors’ perceived satisfaction in adopting MLLM.
Technical support is included as a predictor of perceived usefulness, reflecting the notion that adequate support can facilitate the effective use of technology.
26
The critical role of technical support in addressing user concerns, facilitating the resolution of issues, and thereby contributing to a higher level of user satisfaction.27,28 According to the existing literature, two research hypotheses are proposed:
H3a: Technical support has a positive impact on doctors’ perceived usefulness in adopting MLLM. H3b: Technical support has a positive impact on doctors’ perceived satisfaction in adopting MLLM.
Social influence is widely recognized as a significant factor in the adoption of new technologies, emphasizing subjective norms in shaping behavioral intentions.29,30 The difference between its impact on perceived usefulness and satisfaction is that social influence may initially drive adoption by generating positive perceived usefulness,
30
and it also being initial factor in determining satisfaction in the field of health information technology.
31
Then, two hypotheses are proposed:
H4a: Social influence has a positive influence on doctors’ perceived usefulness in adopting MLLM. H4b: Social influence has a positive impact on doctors’ perceived satisfaction in adopting MLLM.
The hypothesis linking perceived usefulness to perceived satisfaction is based on the cognitive evaluation of technology's utility leading to affective responses.10,32 The perceived usefulness has a positive influence on usage behavior is a core proposition of the TAM, indicating that the perceived benefits of technology drive its adoption and continued use.
22
The following two research hypotheses are proposed:
H5: Perceived usefulness has a positive impact on doctors’ perceived satisfaction in adopting MLLM. H6: Perceived usefulness has a positive influence on doctors’ use behavior in adopting MLLMs.
The perceived satisfaction with technology has been shown to confirm initial expectations, ultimately leading to continued use behavior. When users’ expectations are met or exceeded by technology, they are more likely to continue using it, which aligns with the findings of numerous empirical studies that have demonstrated the positive relationship between satisfaction and continued use behavior.33,34 It is crucial for technology designers and developers to prioritize user satisfaction to foster continued use and ultimately achieve long-term success.
33
H7: Perceived satisfaction has a positive impact on doctors’ use behavior in adopting MLLM.
The negative influence of perceived risk on satisfaction and use behavior aligns with the Protection Motivation Theory, which suggests that potential negative consequences can diminish user satisfaction and act as barriers to technology adoption.17,18 Two research hypotheses are proposed:H8: Perceived risk has a negative impact on doctors’ perceived satisfaction in adopting MLLM.
H9: Perceived risk has a negative influence on doctors’ use behavior in adopting MLLM.
Post hoc analysis led to the examination of demographic variables as potential moderators. Specifically, we examined whether gender, age, education, and professional title might influence the relationships among perceived usefulness, perceived satisfaction, perceived risk, and usage behavior to MLLMs. Although these are not proposed hypotheses, their examination provides a deeper understanding of factors on technology adoption within different demographic subgroups.
Questionnaire design and data collection
The current study is a cross-sectional survey aimed at understanding the factors influencing Chinese doctors’ intention to use MLLMs. The questionnaire utilized in this study was developed based on established constructs from the technology acceptance model (TAM) and literature, as indicated in our theoretical model. The questionnaire consisted of two sections: demographics and 34 items measured on a 7-level Likert scale. References to the specific items of the questionnaire are provided in Table 1, ensuring transparency and replicability of our research methodology. The study received ethical approval with the approval number 2022-S313 from the Institutional Review Board (IRB) of Third Xiangya Hospital, Central South University. Before commencing the survey, written informed consent was obtained from all respondents.
Measurement items of the constructs.
The survey was designed to assess the perceptions of Chinese doctors toward MLLMs. Given the rapidly evolving landscape of AI in medicine, the survey did not specify particular MLLMs but rather focused on the general capabilities and features characteristic of this class of tools. This approach was intended to capture a broad perspective on the potential impact and reception of MLLMs in medical practice. However, it is acknowledged that different MLLMs may offer distinct features and capabilities, which could influence the adoption factors under investigation. For further insights into specific MLLMs and their applications, we refer to recent literature.35–37 To prevent varying understanding of MLLMs among our respondents, we included a standardized definition and a brief educational introduction about MLLMs at the beginning of our survey. This introduction aimed to provide a common baseline of knowledge. Additionally, we assessed respondents’ baseline knowledge of MLLMs with an initial question, allowing us to consider the level of understanding when analyzing responses. Feedback from pilot testing was used to refine our approach, ensuring clarity across all respondents.
Prior to the formal survey, the questionnaire was pilot-tested with 50 doctors to ensure clarity and relevance, and language revision was conducted by an expert to enhance comprehension. The pilot test helped refine the questionnaire for better understanding and reliability. The formal survey was conducted from March to December 2023 in five major cities in China: Beijing, Shanghai, Guangzhou, Changsha, and Chengdu. These cities were chosen for their diverse medical infrastructure and representativeness of the country's medical community. Questionnaires were distributed randomly through medical institutions and online platforms, ensuring anonymity and voluntary participation. The survey was accessible on mobile phones, and respondents were typically able to complete it within 3–5 minutes.
To ensure data accuracy, the survey included trap questions and IP checks. Ultimately, 955 completed surveys were collected, with 126 partial responses excluded due to incomplete data. The study is committed to transparency and reproducibility. The raw data collected are stored securely and are available upon request for researchers wishing to conduct further analyses or reproducibility studies. Access to the data will be granted in accordance with ethical considerations and data protection standards.
Data analysis
We utilized IBM SPSS and AMOS for data analysis in this study. These are industry-standard software packages widely recognized for their role in statistical analysis. The use of these tools complies with the licensing agreements provided by IBM Corporation, ensuring that all terms and conditions related to the software's utilization in research have been adhered to.
We first performed the Harman one-factor test to check for common method bias (CMB), aiming for a CMB index below 40%. 38 Then, we ran descriptive statistics to understand sample characteristics. The scale's structural validity was initially evaluated using exploratory factor analysis (EFA) to explore the underlying factor structure of the items within each construct. After gaining insights from the EFA, the internal consistency of the scale was assessed using comprehensive reliability (CR) with a threshold of 0.7, ensuring that the items within each construct are reliable indicators of the underlying constructs. 39 The validity of convergence was further examined through Cronbach's alpha, with a minimum acceptable value of 0.7, and average variance extracted (AVE > 0.5), ensuring that each item had a substantial shared variance with its corresponding construct. Discriminant validity was evaluated by comparing item cross-loadings with the square root of AVEs, confirming that each item had a stronger association with its intended construct than with others.
The confirmatory factor analysis (CFA) was employed to evaluate the model fit to the proposed theoretical model. The model fit was evaluated using various indicators, including CHI/DF less than 5, goodness-of-fit indexes greater than 0.90, and root mean square error of approximation (RMSEA) less than 0.08, which indicates an acceptable to good fit. The hypotheses were tested by structural equation modeling for the simultaneous examination of multiple relationships within the proposed model.
The moderating effects of demographic variables were investigated by MacKinnon's nonstandardized coefficient method, which involved splitting the sample into two groups based on the demographic characteristic of interest and assessing the significance of the difference in path coefficients using a Z-test with a threshold of |Z| > 1.96. 40
Results
Demographic characteristics
Harman's test showed acceptable common method biases (33.58%<40%). Demographic characteristics of respondents are shown in Table 2. The demographic distribution of respondents closely aligned with the overall doctor population. Roughly two-thirds of survey respondents were male (65.2%) and one-third were female (34.8%). The surveyed doctors’ ages mostly ranged from 31 to 45 years (35.5%), with 20% being under 30 or over 60. Mostly, they held a doctorate degree (57.5%); others had a master's or lower degree. Among them, associate chief doctors/professors made up the largest group (37.0%), while chief doctors/professors were the least (11.0%).
Demographic characteristics of respondents.
Reliability and validity analysis
The results of the reliability and validity assessments provide evidence that the research instrument is robust and suitable for the purpose of examining the factors influencing Chinese doctors’ adoption of MLLMs. The internal consistency and stability were evaluated to ensure reliability. As detailed in Table 3, all items demonstrated high reliability, with Cronbach's alpha values exceeding the threshold of 0.7, indicating strong internal consistency among the items within each construct. Convergence validity was assessed to determine the degree to which the items within a scale converge to represent a single underlying construct. The results were satisfactory, with average variance extracted (AVE) values greater than 0.5 and composite reliability (CR) values exceeding 0.7 for all factors. This suggests that the constructs are well-defined and the scales are reliable in measuring the intended constructs. Each item exhibited strong factor loadings onto its respective factor, with all values greater than the recommended threshold of 0.67. This finding indicates that the measurement instrument is effective in capturing the underlying theoretical constructs it was designed to assess.
Factor loadings, internal consistency, and convergent validity.
Loading: standardized factor loading; bC.R: composite reliability; cAVE: average variance extracted.
The discriminant validity of the constructs was confirmed, as evidenced by the average variance extracted (AVE) square roots exceeding the interconstruct correlations for all pairs. This finding suggests that each construct is distinct from others, reinforcing the distinctiveness of the constructs. The strong results for reliability and validity provide substantial support for the measurement instrument and the research model. The high levels of reliability and validity enhance the credibility of the conclusions drawn regarding the factors influencing the adoption of MLLMs by Chinese doctors. The fit of the theoretical model to the empirical data was evaluated using various indices, yielding satisfactory results. The CHI/DF was 2.62 (<5), NFI (0.94), IFI (0.96), TLI (0.96), CFI (0.92), GFI (0.93), and AGFI (0.92) were above 0.90. RMSEA was 0.04 and below 0.08. The indices of model-data fit collectively indicate that the theoretical model is well-aligned with the sample data, providing a solid foundation for the conclusions of the study.
Hypothesis testing
The results of the hypothesis testing are presented in Table 4. The squared multiple correlation (R2) of perceived usefulness, perceived satisfaction, and usage behavior were 0.440, 0.421, and 0.553, respectively, suggesting the adequate explanatory power of the proposed model. Key findings from the hypothesis testing are as follows: Perceived ease of use (PEOU) significantly predicted both perceived usefulness (β=.42, p < .001) and perceived satisfaction (β=.10, p = .026), supporting hypotheses H1a and H1b. Content quality (CQ) had a significant effect on perceived satisfaction (β=.46, p < .001) but not on perceived usefulness (β=.06, p = .155), thus supporting H2b but not H2a. Technical support (TS) was a significant predictor of perceived usefulness (β=.18, p < .001), supporting H3a, but did not significantly influence perceived satisfaction, failing to support H3b. Similarly, social influence (SI) significantly affected perceived usefulness (β=.17, p < .001), supporting H4a, but had no significant impact on perceived satisfaction, not supporting H4b. Perceived usefulness (PU) positively predicted perceived satisfaction (β=.10, p = .019) and usage behavior (β=.26, p < .001), supporting H5 and H6. Perceived satisfaction (PS) was a significant predictor of usage behavior (β=.38, p < .001), supporting H7. Perceived risk (PR) negatively influenced perceived satisfaction (β=−.14, p < .001) and usage behavior (β=−.34, p < .001), supporting H8 and H9. These results provide valuable insights into the factors influencing Chinese doctors’ adoption of MLLMs, offering a foundation for management strategies and interventions.
Hypotheses testing results based on the structural equation model.
***p < .001, bootstrapping N = 6000.
The squared multiple correlation (R2) of perceived usefulness, perceived satisfaction, and usage behavior were 0.440, 0.421, and 0.553, respectively.
PEOU: perceived ease of use; CQ: content quality; TS: technical support; SI: social influence; PU: perceived usefulness; PS: perceived satisfaction; PR: perceived risk; UB: usage behavior.
Moderating effects
Table 5 presented the results on the moderating effects of demographic variables. The four demographic variables gender, age, education, and title, all exerted moderating effects in the theoretical model along the path between perceived usefulness, perceived satisfaction, perceived risk, and usage behavior, but the path from perceived satisfaction to usage behavior was not affected by the above variables. Gender was found to be a significant control variable, affecting relationship strength on three paths: from perceived usefulness to perceived satisfaction (Z = −2.53), from perceived risk to perceived satisfaction (Z = 2.89), and from perceived risk to usage behavior (Z = −5.84). Age significantly moderated three paths, from perceived usefulness to perceived satisfaction (Z = −4.25), from perceived risk to perceived satisfaction (Z = −2.12), and from perceived usefulness to usage behavior (Z = 5.77). Education demonstrated significant moderating effects on three paths: from perceived risk to perceived satisfaction (Z = −2.99), and from perceived usefulness to usage behavior (Z = −4.08), and from perceived risk to usage behavior (Z = −2.66). Title significantly regulated the path from perceived usefulness to perceived satisfaction (Z = −3.72).
Moderating effects of control variables.
PU: perceived usefulness; PS: perceived satisfaction; PR: perceived risk; UB: usage behavior. e Standardized path coefficients for different demographic groups; f Z represents the significance of the moderating effects among demographic variables. If the absolute value of Z is greater than 1.96, the moderating effects are significant, otherwise not significant.
Discussion
Perceived ease of use was a key factor in determining perceived usefulness and satisfaction, aligning with past studies. Perceived ease of use is critical for medical technology acceptance and satisfaction, as seen in electronic health record (EHR) systems, where complexity hinders adoption despite advanced features. 27 Similarly, an intuitive Picture Archiving and Communication System (PACS) interface boosts radiologists’ satisfaction. In mHealth apps, a simple interface is vital for quick info access in busy settings.34,41 Doctors and healthcare workers need quick access to information in busy environments, and a simple and intuitive mobile app interface is crucial in driving user acceptance and usage behavior. 42 This study underscores perceived ease of use's role in enhancing medical technology's perceived usefulness and satisfaction, emphasizing its importance in designing large-scale medical language models to improve service quality.
In the realm of MLLMs, content quality emerges as a pivotal factor influencing perceived satisfaction, but it does not significantly affect perceived usefulness. This distinction underscores the multifaceted nature of user experience, where satisfaction is tied to the quality of interaction and information delivery, while usefulness is more closely aligned with the model's functionality and performance in specific tasks. MLLMs are responsible for analyzing vast medical datasets, including patient records, research findings, and clinical guidelines. Their crucial role in diagnostics, treatments, and clinical decision-making is crucial and has direct implications for medical productivity and patient safety. 24 The statistical analysis shows that while content quality is highly valued for its contribution to user satisfaction, its impact on the perceived usefulness of MLLMs is less pronounced. This suggests that future development of MLLMs should not only focus on enhancing content quality but also on improving other aspects that contribute to their perceived usefulness.
In the medical technology field, technical support is crucial for acceptance and usage.43,44 However, in the context of MLLMs, technical support predicts perceived usefulness but not satisfaction. The significance of technical support in digital health is recognized. For instance, it aids doctors use electronic medical records and increases satisfaction with smart hospital technologies. 45 Studies show that technical support enhances the perceived usefulness of e-visits, but its impact on satisfaction may not be as significant as other elements. When it comes to MLLMs, technical support is essential for perceived usefulness, but its impact on satisfaction is unclear. This could be due to the complexity and professionalism of these models, which require more comprehensive support than basic online assistance. 46 Doctors may prioritize model performance and output quality over technical support. Therefore, in addition to technical support, factors such as model performance, content quality, and perceived ease of use should be taken into consideration for a comprehensive user experience.
Social influence wields a significant impact on the acceptance of medical technology, particularly in the realm of MLLMs. This study echoes the established notion that the social context and peer opinions significantly shape the perceived usefulness of new technologies among medical professionals. 47 However, this influence does not extend to perceived satisfaction, which appears to be more intrinsically linked to perceived ease of use, content quality, and its capacity to fulfill specific needs. 12 The distinction between the effects of social influence on perceived usefulness versus satisfaction is not a unique phenomenon. It mirrors observations from studies on other medical technologies, such as electronic medical records and telemedicine, where social factors have been shown to play a crucial role in the initial adoption phase. 48 Understanding this multifaceted nature of social influence is vital for the strategic promotion of MLLMs. To effectively leverage social influence, several approaches can be adopted: establishing platforms for open dialogue to encourage sharing experiences and discussion of best practices; 49 presenting endorsements from respected peers to build credibility and trust in technology; 50 integrating MLLMs into professional development to offer hands-on experience, which can enhance perceived usefulness and adoption. 51 These measures, while not guaranteed to significantly boost adoption, could contribute to a more informed and positive perception of MLLMs among doctors.
Doctors may be inclined to utilize an MLLM if they perceive it as a tool that can efficiently aid in processing medical data. In the context of medical information technology applications, perceived usefulness and satisfaction are often regarded as pivotal factors influencing usage behavior. These observations resonate with recent research on large language models tailored for healthcare. Within the medical technology sector, the perception of a technology's usefulness is a key determinant in doctors’ acceptance of novel technologies. For instance, doctors are more likely to adopt an online health community system if they believe it can enhance their productivity. 52 Furthermore, perceived satisfaction plays a significant role in influencing users’ willingness to persist in using healthcare information technology. These insights provide valuable support for promoting the utilization of MLLMs.
Perceived satisfaction plays a crucial role in predicting usage behavior, especially in the context of MLLMs. This finding contrasts interestingly with other research in the medical technology field. For instance, in electronic medical record systems, system functionality, user interface design, user training, and support all significantly impact satisfaction.20,53 This underscores the diversity of satisfaction levels and varying emphases across different technology fields within medical technology. Compared to telemedicine, MLLMs prioritize the accuracy of natural language processing and information extraction, which partially determines user satisfaction with the model's performance. Conversely, telemedicine focuses more on accessibility, convenience, and the quality of doctor-patient interaction. While perceived satisfaction is vital in these areas too, it's influenced by factors like doctors’ responsiveness and service availability. 19 The perceived satisfaction in medical language models bears some similarities to other medical technology research but also has unique characteristics. These differences suggest that when studying user satisfaction in medical technologies, it's essential to consider the specificities of technical features and application scenarios.
The statistical analysis indicates that perceived risk is a significant negative predictor of both perceived satisfaction and usage behavior among Chinese doctors regarding MLLMs. This finding is consistent with previous literature that has identified perceived risk as a barrier to technology adoption in healthcare settings.54,55 To address these perceived risks and enhance the adoption of MLLMs, several strategies can be employed. First, transparency regarding the data sources, training processes, and operational mechanisms of MLLMs can alleviate concerns about privacy and data security. 56 Second, ongoing education and training for healthcare professionals can improve their understanding of MLLMs, reduce anxiety associated with the technology, and increase confidence in its use.57,58 Third, establishing robust legal and ethical frameworks that govern the use of MLLMs can help protect user rights and ensure the responsible application of these technologies. 17 By implementing these strategies to mitigate the negative impact of perceived risks on the adoption of MLLMs and foster a more positive attitude toward their integration into clinical practice.
The statistical analysis revealed that demographic variables such as gender, age, education, and title act as moderators in the relationship between perceived usefulness, perceived satisfaction, perceived risk, and usage behavior within the theoretical model. The practical implications of these moderating effects are noteworthy and suggest that individual differences significantly influence the adoption of MLLMs. Gender was found to significantly influence multiple relationships, indicating that male and female doctors may respond to MLLMs differently, possibly due to varying levels of exposure, interest, or comfort with technology. Age plays a crucial role, with older doctors potentially requiring more support or different considerations in the adoption process compared to their younger counterparts. The level of education can affect their perceived risk and satisfaction, which in turn influences their usage behavior, suggesting that educational background may be linked to their tech-savviness and risk perception. Finally, a doctor's professional title moderates the relationship between perceived usefulness and satisfaction, indicating that the status and experience associated with their title could influence their attitudes and the value they see in MLLMs. These findings align with prior research in medical IT, emphasizing the impact of demographic factors on technology adoption.58,59 However, it is worth noting that different studies may have varying results depending on the specific context and sample characteristics. Adoption of MLLMs by healthcare professionals such as doctors, nurses, and allied health workers may be influenced by different factors. Their professional roles and work environments may result in varying priorities and needs for MLLMs. It is crucial for future research to examine these differences to ensure that technology meets the diverse needs of the broader healthcare community. Although our study concentrated on individual perceptions and attitudes, it is acknowledged that organizational and systemic factors play a crucial role in the adoption of new technologies such as MLLMs. Organizational, legal, ethical, and technological integration issues are also likely to play a crucial role in the adoption process and warrant further investigation. Understanding these factors and their influence on individual adoption behaviors is an important area for future research.
This study provides a perspective on doctors’ adoption of MLLMs, and our model integrates individual psychological and social demographic factors, offering a deeper understanding of this complexity. Based on our findings, we recommend that medical big language model developers, hospitals, and regulators collaborate to enhance training, showcase model benefits, improve technical support, and foster a supportive culture for technological innovations, considering social and cultural factors. It is worth noting that our sample consisted of physicians who had knowledge of MLLMs, which could have influenced their adoption factors. The extent of prior experience with MLLMs among our respondents was not the primary focus of this study; however, it may be a valuable area for future research. Comparative studies that examine the impact of prior experience on the adoption of MLLMs could provide deeper insight into how familiarity with such technologies affects their integration into clinical practice.
This study has some limitations. One of the limitations of this study is the possibility of bias in sample selection, as it was exclusively focused on Chinese doctors. As a result, the generalizability of the findings to other countries or healthcare systems remains unclear. Cultural and systemic differences in healthcare practices, education, and technology adoption may significantly impact the factors that drive the adoption of MLLMs. Further research should examine these factors across diverse contexts to gain a better understanding of their global applicability. Second, the study lacks specificity in terms of the individual MLLMs referenced in the survey questions, and it potentially overlooking the distinct impacts of individual models’ features on adoption factors. Future research should conduct detailed comparative analyses and employ mixed methods to capture specific MLLMs’ reception in varied medical contexts, enhancing the understanding of their integration and adoption.
We acknowledge that different MLLMs possess unique features, capabilities, and limitations that could significantly influence the adoption factors we sought to investigate. Our decision to maintain a broad approach was driven by the desire to understand the overarching perceptions and attitudes toward MLLMs as a collective, rather than focusing on specific models. However, this approach may have implications for the generalizability of our findings to specific MLLMs. Future research should examine the adoption factors of specific MLLMs to provide a more nuanced understanding of their reception and integration into medical practice. Third, it is acknowledged that the lack of a formal validation process for the questionnaire prior to use limits the validity of this study. Although the questionnaire was developed based on well-established theoretical frameworks and a pilot test was conducted with 50 doctors to refine the instrument and ensure its clarity and relevance, the lack of extensive validation may affect the generalizability of our findings. Therefore, future research should implement a more rigorous validation process to enhance the psychometric properties of the questionnaire and broaden the scope of the results. Moreover, the reliance on self-reported survey data, which may be influenced by social desirability bias. Despite potential biases, self-report measures are a common and valuable tool in psychological research, and these measures are often used to gather diagnostic and meaningful information.60,61 Future studies might consider combining self-reported data with objective usage data to provide a more comprehensive understanding of the adoption and usage of MLLMs. Finally, the cross-sectional design restricts our ability to infer causality and monitor changes in adoption intentions over time. The performance of large language models in China is rapidly improving due to the quality of training data, which will change doctors’ attitudes toward MLLMs. These limitations highlight the need for ongoing studies that can provide a more comprehensive understanding of MLLM adoption and performance.
Conclusion
The application of MLLMs marks a digital shift. This study aimed to assess factors influencing Chinese doctors’ adoption intentions. By constructing an extended theoretical framework, the empirical analysis showed that perceived ease of use, content quality, technical support, social influence, perceived usefulness, and satisfaction positively impact adoption, while perceived risk has a negative effect. This highlights that doctors’ positive evaluation of the MLLM and their awareness of improving work efficiency are the key factors driving their adoption behavior. Furthermore, personal traits also affect the adoption intention of doctors to varying degrees, highlighting the external conditions and personal characteristics that need to be considered when promoting the MLLM. Through ongoing research and practical experience, we strive to understand doctors’ motivations for using these models to promote their effective use worldwide.
Supplemental Material
sj-docx-1-dhj-10.1177_20552076241297237 - Supplemental material for Factors influencing Chinese doctors to use medical large language models
Supplemental material, sj-docx-1-dhj-10.1177_20552076241297237 for Factors influencing Chinese doctors to use medical large language models by Shujuan Qu, Lin Liu, Min Zhou, Chuting Zhou and Kathryn S. Campy in DIGITAL HEALTH
Footnotes
Acknowledgments
The authors are deeply grateful to the Chinese doctors who participated in our study. The authors acknowledge the utilization of AI language polishing tools (Kimi and ERNIE Bot) in the refinement of this manuscript. It is important to note that while these AI tools assisted in the editing process, the conceptualization, research content, data analysis, and interpretation of results were original contributions from the authors and were not influenced by AI tools.
Contributorship
Shujuan Qu performed conceptualization, wrote the original draft, implemented the methodology, and acquired funding. Lin Liu conducted investigations, provided resources, curated data, and wrote the original draft. Min Zhou implemented the methodology, conducted investigations, utilized software, conducted validation, and performed formal analysis. Chuting Zhou wrote the original draft and conducted writing reviews and editing. Kathryn S. Campy performed conceptualization, conducted writing reviews and editing, and supervised the process.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Ethical approval
Not required.
Funding
This work was supported by the National Social Science Fund of China (21BGL227), the National Natural Science Foundation of China (82102280), and the Natural Science Foundation of Hunan Province (2023JJ30839, 2024JJ5119, 2022JJ30893).
Guarantor
SQ.
Supplemental material
Supplementary material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
