Abstract
Objectives
Chatbots, transcending the limitations of space and time while reducing user resistance, offer a personalized and autonomous approach potentially enhancing digital health interventions. This systematic review aims to quantitatively evaluate the effectiveness of chatbot-based exercise interventions (EIs) in randomized controlled trials (RCTs), emphasizing physical activity (PA), exercise habits (EH), and sedentary behavior (SB).
Methods
A comprehensive search was carried out across Embase, Web of Science, PubMed, and Cochrane databases for studies published from January 2010 to October 2024. The characteristics of the included studies and the employed chatbots were analyzed. A meta-analysis was performed to synthesize the impact of chatbot-based EIs on PA, EH, and SB. We carried out subgroup analyses (SAs) for investigating potential modifying effects.
Results
This review includes 12 studies conducted between 2013 and 2024, encompassing 2446 participants and 240 parent-child dyads, with participant ages ranging from 8 to 71 years. The chatbot-based EIs significantly enhanced PA compared to control groups (SMD = 0.20, 95% CI = 0.04–0.37,
Conclusion
Chatbot-based EIs appear promising in improving PA, particularly moderate to vigorous PA (MVPA). Nonetheless, further RCTs with diverse chatbot designs, larger sample sizes, low-resource settings, objective measures of SB (e.g., accelerometers), and extended follow-ups are essential to corroborate these findings. Future research should also explore the adaptation of chatbots to various socioeconomic and cultural contexts and the integration of technologies like ChatGPT into EIs.
Introduction
Non-communicable diseases (NCDs) are the predominant causes of mortality within the realm of public health globally, accounting for over 41 million deaths in 2019. 1 These diseases are particularly prevalent in regions with scarce resources and in low- and middle-income countries.2,3 The most prevalent NCDs include cardiovascular and chronic respiratory diseases, type 2 diabetes, and cancer. As the global population ages, the burden of NCDs is increasingly becoming a critical issue worldwide. 4 Regrettably, projections indicate that the burden of NCDs is expected to escalate further in the future. 5 In response, the World Health Organization (WHO) has spearheaded a global action plan aimed at the prevention and control of NCDs. 6 A notable risk factor in the prevention of NCDs is physical inactivity, which is characterized by insufficient physical activity (PA) and excessive sedentary time. It has been estimated that approximately 3.2 million deaths annually can result from physical inactivity. 7 PA involves any movement of the body's muscles that triggers the consumption of energy. 8 It is essential throughout a person's life. It is crucial to enhance PA and reduce sedentary behavior (SB) from infancy to old age. However, merely increasing PA levels is insufficient; a balanced approach that addresses both activity and SB is optimal.
Several types of exercise interventions (EIs) have been developed to target overweight, 9 obesity, 10 and type 2 diabetes. 11 Most research indicates that these interventions positively affect overweight, obesity, and type 2 diabetes.12–14 EIs are an effective and economical approach to addressing the energy imbalance between consumption and expenditure. However, they present several challenges, including difficulty in maintaining sustainable engagement and in meeting the diverse needs of participants.15–18 Existing studies indicate that EI programs are more effective in the short term than in the long term.19,20 This may be because participants initially find the programs engaging, but their interest wanes over time as they become tired of the repetitive nature of the interventions. Consequently, sustainable engagement appears unattainable. Furthermore, individual differences are significant. 21 Often, researchers implement the same intervention across diverse groups. Some participants may like the intervention, others may only slightly like it, and some may dislike it. Those who dislike the intervention often experience a low sense of self-efficacy and merely go through the motions without genuine engagement. The goal of behavioral intervention programs is to establish a habit of exercising. According to Fogg's behavior theory, motivation and competence are crucial factors in habit formation. 22 However, targeted individuals exhibit varying levels of motivation and competence, and a single intervention style does not universally apply. It is widely recognized that a one-size-fits-all approach to interventions is ineffective. 23 Current EIs lack personalized designs meeting diverse participants’ individual needs.
The WHO has emphasized the importance of digital health interventions in public health. 24 Chatbots have become a new trend in the digital environment due to their ability to sustain engagement and meet diverse user requirements. 25 Chatbots, also known as conversational and virtual agents, engage in bilateral dialogue with users based on machine learning (ML), 26 deep learning (DL), 27 and natural language processing (NLP). 28 In these dialogue systems, chatbots can communicate with clients naturally and automatically in the form of text, images, and voice/speech by collecting users’ information, thus meeting the needs of the current context. 29 Chatbots are applied in various settings, including business, 30 education, 31 and healthcare. 32 Behavioral theories such as the theory of planned behavior (TPB) and social cognitive theory (SCT) provide valuable frameworks for designing personalized chatbot interventions. TPB can guide the design of chatbots that target key factors influencing user intentions, such as perceived behavioral control, subjective norms, and attitudes. 33 By addressing specific factors, chatbots can guide users toward adopting more adaptive behaviors and making healthier choices. SCT underscores the significance of observational learning and self-regulation in effecting behavioral change.34,35 In the context of chatbot design, SCT can be operationalized by crafting interactive dialogues that enhance self-efficacy and encourage users to emulate positive behaviors. Furthermore, with the advancement of technology and the increasing number of e-device users, the application of chatbots in EIs attracts rising attentions. Chatbots, by overcoming the constraints of space and time and mitigating user resistance, have become a viable, personalized, and autonomous form of intervention. 36 Theoretically, integrating chatbots with EIs represents a promising synergy.
Numerous chatbot-based EIs have targeted mental health,37,38 and related health behaviors,39,40 encompassing total PA, 41 moderate-to-vigorous PA (MVPA), 42 daily step counts, 41 exercise habits (EHs), 43 and SBs. 44 Despite these developments, a gap persists regarding the effectiveness of AI-driven chatbots in PA interventions. Some researchers contend that it is premature to conclusively determine the efficacy of chatbot interventions on PA,45–47 while other studies highlight the substantial impact of such interventions on improving PA.36,41,48 It is possible that the design of these interventions lacks robustness. Following the Oxford Centre for Evidence-Based Medicine 2011 Levels of Evidence, evidence quality is categorized into four grades, ranging from high to low: randomized controlled trials (RCTs), cohort studies and case-control studies, case series, and expert opinions without explicit critical appraisal. 49 A systematic review and meta-analysis should ideally include only high-quality articles, specifically RCTs. Additionally, although some chatbot-driven EIs have explored outcomes related to EHs and SB, they lack a systematic review and meta-analysis. This review aims at investigating the effectiveness of chatbot-based EIs on PA, EHs, and SBs.
Method
The protocol for this systematic review was registered with PROSPERO (ID: CRD42024609852) and conforms to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. 50 To maintain a high standard of quality, the review process rigorously adhered to established guidelines, encompassing steps involving the strategy for searching, selecting studies, and extracting data. No funding was received for this work.
Information sources
A comprehensive systematic search was conducted from January 2010 to October 2024 across multiple databases. All databases were last searched on 26 October 2024. The scientific literature databases searched included Web of Science, Embase, Cochrane, and PubMed.
Selection criteria
The selection criteria were stringently based on the PICOS framework (study types, outcome, comparison, intervention, population). 51 Population: any demographic group; Intervention: any EI involving chatbots, conversational agents, or virtual assistants. Chatbots, such as Siri and ChatGPT, also known as virtual assistants or conversational agents, are increasingly being integrated into daily life. They can be categorized into three distinct types: (1) commercial voice assistants, encompassing Google Assistant, Alexa (Amazon), and Siri (Apple); (2) custom-designed text- or voice-based chatbots tailored for specific research or clinical interventions, developed using platforms like Dialogflow, Rasa, or Microsoft Bot Framework; and (3) AI-driven conversational agents based on large language models, such as ChatGPT or BERT, characterized by their dynamic, generative capabilities. These agents utilize advanced dialogue algorithms to enable natural language interactions through voice, text, or multimodal channels. The spectrum includes any chatbot form—text-based, voice-based, or both—and may operate as standalone software, within web browsers, on mobile devices, in applications, or as embodied conversational agents (ECAs). A constrained-input chatbot limits user input by various means such as length, format, language, or content. Conversely, unconstrained-input chatbots allow flexible, open-ended dialogues. Intervention settings (e.g., laboratory, hospital, home) and doses (i.e., duration, frequency, intensity) were not restricted. Comparison: studies involving a comparison condition (e.g., intervention group vs. control group or waitlist group or usual care group). Outcomes: eligible studies must measure PA outcomes, including total PA, moderate-to-vigorous PA, vigorous PA, moderate PA, light PA, or daily steps, or assess SB. Study type: only RCTs were included.
Exclusion criteria included duplicate records, article type, language, study design, and PICO relevance. Duplicate records encompassed identical publications from different databases, varying formats of the same study, multiple publications of the same trial, and multilingual versions. Ineligible studies included literature reviews, commentaries, patent literature, protocols, and non-English publications. Studies not meeting the required design criteria—such as observational studies—were excluded. Additionally, studies were excluded if the intervention did not integrate both chatbot-related and exercise-based components. Studies that lacked sufficient statistical data on PA, EH, or SBs (e.g., means, standard deviations, effect sizes, or confidence intervals) and for which data could not be obtained from the authors were also excluded.
Data extraction and management
All studies were imported into Endnote X9, after which two authors eliminated duplicates to prevent data redundancy. The researchers meticulously reviewed the methodology sections of all included studies, focusing on sources of participants, data collection periods, study settings, and sample characteristics to ensure comprehensive understanding. Data extraction was collaboratively undertaken by two authors (WQ and YWX). Disagreements were resolved through discussion until everyone agreed, achieving 100% agreement. In cases where consensus could not be reached, an independent third-party expert was consulted to make the final decision regarding data extraction. Data were selected based on several criteria: (1) study characteristics, such as title, authors, and publication year; (2) characteristics of participants, including age and sample size; (3) types of RCT, e.g., two-arm, three-arm, cluster, stratified, and pilot studies; (4) duration of the intervention; (5) intervention programs; (6) type of chatbot utilized; (7) outcome measures; (8) the mean and standard deviation of outcomes. In instances where different studies utilized identical datasets, these were amalgamated into a singular study for analysis. If a study with incomplete data was identified, the authors endeavored to contact the corresponding author to acquire the necessary data or to calculate the mean and standard deviation using recommended formulas.
Quality of assessment
The risk of bias was independently assessed by two reviewers (YWX and WQ) using the Cochrane Risk of Bias Tool (version 2.0). Any unresolved disagreements were adjudicated by a third, qualified expert. The assessment encompassed seven domains: (1) selective reporting (reporting bias); (2) completeness of outcome data (attrition bias); (3) blinding of outcome assessment (detection bias); (4) blinding of participants and personnel (performance bias); (5) allocation concealment (selection bias); (6) random sequence generation (selection bias); and (7) other potential biases. Each domain was evaluated with three potential outcomes: high risk of bias (-), low risk of bias (+), or not applicable (NA). The reviewers’ assessments were based on these criteria, classifying the quality of each study as low, moderate, or high risk. The visualization of risk of bias was performed using RevMan 5.4 software.
Outcome of interest
The outcomes of interest included: (1) MVPA only; (2) total PA; (3) daily steps, expressed as steps per day or week; (4) EHs, assessed using the Self-Report Habit Index (SRHI) for exercise; (5) SB, quantified as duration of sedentary time, e.g., minutes per day or week.
Statistical analysis
The meta-analysis conducted in this review employed a random-effects model to estimate the degree of heterogeneity among the selected studies. This model posits that variations in the true effect size across studies can result from differences in intervention characteristics, participant demographics, and study methodologies. A random-effects model provides a more conservative estimate than a fixed-effect model and is deemed suitable in scenarios exhibiting moderate to substantial heterogeneity, as evidenced by an I² value exceeding 50%. The primary analysis, which included forest plots, heterogeneity tests, sensitivity analyses, and subgroup analyses (SAs), was facilitated using RevMan software version 5.4. Due to inconsistencies in measurement units, the standard mean difference (SMD) within a 95% confidence interval (CI) was employed to carry out statistical analysis. SAs were stratified following the type of chatbot and intervention duration (eight weeks or fewer versus more than eight weeks). These analyses assessed potential differences in effects across various subgroups. Differences between groups were deemed with statistical significance if the p-value fell below 0.05.
We used
Results
Study selection
Initially, a search of relevant keywords yielded 3508 records. Following a preliminary screening of titles and abstracts, we omitted 3281 studies. Further application of inclusion and exclusion criteria resulted in the elimination of 33 articles. Ultimately, 12 studies were selected for a systematic and quantitative review (Figure 1).

Article selection flow chart.
Study characteristics
This review encompasses 12 studies conducted between 2013 and 2024, involving a total of 2446 participants and 240 parent-child dyads (refer to Table 1).42–44,53–61 Among these studies, ten focused on individual participants, while two examined parent-child dyads. The studies targeted diverse populations, including children and the elderly. All articles were classified as high-quality given their design as RCTs. Specifically, eight studies implemented two-armed RCTs,43,44,54,55,57,59–61 including one pilot study. 60 Additionally, three studies were structured as three-armed RCTs,42,56,58 and one as a four-armed RCT. 53 The two-armed RCTs were more common, whereas the four-armed RCTs were less frequent. The bulk of the research selected has come out recently, with five articles emerging after 2020. Geographically, four studies originated from the USA,53,54,57,58 two from the Netherlands,42,56 and another two from the UK.55,59 The remaining studies, each conducted in a different country, came from Belgium, 44 Japan, 60 Canada, 61 and Korea, 43 respectively. The most extensively studied intervention involving chatbot-based exercise was in the USA. Methodologically, five studies exclusively utilized objective monitoring devices, such as the Actigraph GT3X + and Fitbit Flex 2HR. Four studies employed subjective questionnaires, including the Dutch Short Questionnaire to Assess Health-Enhancing PA and the International PA Questionnaire. Notably, three studies combined both objective measurements and subjective questionnaires.
Characteristics results of a meta-analysis on the intervention.
ST: sedentary time; SQUASH: Dutch Short Questionnaire to Assess Health Enhancing Physical Activity; MVPA: moderate-to-vigorous physical activity; PAQ(FOC): Physical Activity Questionnaire for Older Children, PAQ(SF): Physical Activity Questionnaire Short Form; SBQ: Sedentary Behavior Questionnaire; FEAHQ-R: Family Eating and Activity Habits Questionnaire; SHRI: self-report habit index; IPAQ: International Physical Activity Questionnaire.
The theoretical frameworks utilized across the studies were highly varied. Eleven articles encompassed multiple theories including the health action process approach, elaboration likelihood model, persuasive systems design model, tpb, self-regulation theory, precaution adoption process model, self-determination theory, self-monitoring theory, goal-setting theory, SCT, player experience and need satisfaction theory, agency, challenge, uncertainty, discovery, and outcomes framework, habit formation theory, health-behavior theory, and the transtheoretical model. Notably, the goal-setting theory and SCT were featured in three studies, while two studies concurrently utilized the self-determination theory and health action process approach. The time frame for the interventions extended from 4 weeks to a year. Each intervention featured a specific program involving a chatbot, such as #LIFEGOALS, Intelligent Personal Assistant Project, and Gabby. Outcomes varied; some studies focused on the MVPA of parent-child dyads. Additionally, beyond the MVPA indicator, outcomes included SB, daily step count, and EHs.
Characteristics of chatbot
The chatbots demonstrated considerable variation across the studies in terms of their names, delivery methods, platforms, diversity, input and output modalities, functionalities, and password settings (refer to Table 2). Specifically, four studies employed chatbots with distinct names, such as Aimbot, Echo Dot, Ally, and Alexa. The primary delivery methods were computers and mobile phones: six studies utilized mobile phones, four employed computers, and two integrated both devices. Since 2020, interventions have predominantly utilized mobile phones or a combination of both mobile phones and computers. The delivery platforms encompassed software programs, websites, embodied intelligent personal assistants, and mobile applications. Notably, software-based interventions, which first appeared in 2013, were discontinued subsequently, whereas mobile applications have been increasingly adopted since 2017. Half of the studies featured multi-component interventions, whereas the others relied on single-chatbot interventions. In terms of user interaction, seven studies featured constrained input, wherein participants communicated with the chatbots via pre-programmed routines. Conversely, two studies allowed for unconstrained input, facilitating more flexible and natural interactions. Three studies combined both types of input. All studies provided text-based output, with some also incorporating voice or speech and visual elements. The functionalities of the chatbots were categorized into three groups: delivering personalized, one-on-one behavioral counseling; providing automated goal-setting and feedback; and offering both tangible and psychological rewards. Password usage was infrequently mentioned, with only one study requiring participants to establish a password.
Characteristics results of a meta-analysis on the chatbot.
Risk of bias
In the twelve selected studies, the risk of bias was found to be optimal (refer to Table 3 and Figure 2). Owing to the RCT design, all studies involving randomization exhibited a low risk of selection bias. Furthermore, the majority of these trials provided detailed descriptions of the flow of participant enrolment in their documentation, suggesting a low risk of attrition bias. The researchers in these trials diligently adhered to methodological standards, particularly concerning allocation concealment and the blinding of participants or personnel. Importantly, since the outcomes were predominantly quantified using objective monitoring devices, the likelihood of detection bias was minimal. The researchers also devoted significant attention to mitigating other potential biases, which are elaborated upon subsequently.

Risk of bias graph each risk of bias item presented as percentages.
Risk of bias summary table.
Results of meta-analysis
Results of PA
The chatbot-based EI significantly improved PA compared to the control group, as indicated by a standardized mean difference (SMD = 0.20, 95% CI = 0.04–0.37,

Forest plot of chatbot-based exercise intervention on PA.
Subgroup analysis by duration
In the analysis stratified by duration, the chat-based EI was linked with a modest increase in PA in the short term (SMD = 0.29, 95% CI = 0.08–0.50,

Forest plot of duration of intervention on PA.
Subgroup analysis by type of chatbot
In the SA concerning the chatbot delivery method, children's PA was not notably improved in the mobile phone subgroup (SMD = 0.30, 95% CI = −0.22–0.82,

Forest plot of chatbot type of intervention on PA.
Analysis of chatbot input subgroups
In the SA concerning chatbot inputs, a notable improvement was documented in the group receiving both constrained and unconstrained inputs, with minimal heterogeneity observed (SMD = 0.37, 95% CI = 0.23–0.52,

Forest plot of chatbot input of intervention on PA.
EH results
Chatbot-based interventions did not demonstrate a significant enhancement in PA habits when compared to control groups. The difference observed was not statistically significant (SMD = 0.29, 95% CI = −0.48–1.06,

Forest plot of chatbot-based exercise intervention on exercise habits.
Sedentary time results
The analysis indicated that chatbot-based interventions were not associated with a significant reduction in sedentary time relative to control groups (SMD = 0.32, 95% CI = −0.54–1.19,

Forest plot of chatbot-based exercise intervention on sedentary time.
Discussion
Overall effect of chatbot-based interventions
This study aims at evaluating the efficacy of chatbot-based interventions on PA, EHs, and SB. In assessing the overall impact, synthesizing and quantitatively analyzing twelve RCTs suggests that these interventions could be a promising method for enhancing PA. Notably, there was a significant improvement in MVPA, a finding consistent with other research. 48 However, these interventions did not significantly affect total PA or daily step count. There was a notable discrepancy in outcomes related to daily steps and total PA between this review and prior studies. 48 Furthermore, EHs or SB did not change notably. The absence of meta-analyses focusing on EHs and sedentary time renders this review potentially unique and pioneering. Surprisingly, even as interventions increased MVPA, they did not impact EHs or SB. This review found no correlation between PA, EHs, and sedentary time. Regarding the methodology, our findings build upon previous high-quality research. However, the blinding of participants and personnel was suboptimal. Seven out of the twelve reviewed trials did not implement blinding of personnel, which is considered undesirable, and one trial exhibited a notable likelihood of bias in performance. In terms of measurement, there was no uniform unit of measurement across the studies due to the use of various devices, which poses a challenge for standardization. Despite the powerful capabilities of advanced generative models like ChatGPT, no existing trials have employed such technologies in chatbot interventions. Nonetheless, these models offer substantial promise for enhancing the effectiveness, emotional engagement, and user experience of digital health interventions focused on exercise. The observed heterogeneity (I² = 51%) in the results may result from changes in chatbot types, intervention settings, and participant demographics. This variability underscores the necessity to develop uniform methods for future research and reporting protocols to improve comparability. Due to the variability and limitations in the data, this analysis does not provide conclusive evidence regarding the efficacy of chatbot-based EIs.
Interpretations and suggestions
Interpretations of these review results could fall along a spectrum of possibilities, offering multiple perspectives for future development of chatbot-based EIs aimed at increasing PA, fostering EHs, and reducing SB. SAs have indicated that these interventions are particularly effective in the short term, when implemented via desktop platforms, and when they incorporate both constrained and unconstrained chatbot input modalities. Three key factors influencing the effectiveness of interventions were identified: duration, delivery method, and chatbot input modality (refer to Table 4).
The summary table of subgroup findings.
Intervention duration appears to be a critical influencing factor. Short-term interventions tend to be more effective. Initial user engagement is frequently driven by the novelty or curiosity of the program; however, sustained participation often wanes due to the repetitive nature of the content. This decline in motivation, commonly termed as ‘intervention fatigue’, results from cognitive overload and emotional disconnection, particularly when the intervention lacks variability or fails to customize responses to individual user requirements. In the absence of personalized feedback or dynamic interactions, participants may view the intervention as tedious or ineffectual, which can lead to disengagement or even burnout. Future research should focus on integrating a variety of engaging activities and periodically introducing new themes tailored to user preferences gathered via chatbot interactions. To maintain interest over time, chatbots should actively engage with users by providing personalized and attractive program suggestions. Furthermore, the establishment of consistent EHs is crucial. Embedding theories of goal-setting and habit formation within the chatbot algorithms may enhance long-term commitment. As users develop intrinsic motivation and autonomy in their exercise practices, the reliance on external interventions naturally decreases.
The mode of delivery also emerged as a vital determinant of success. Notably, interventions delivered through computers proved more effective than those administered via mobile devices. Despite the ubiquity of mobile phones, their propensity for interruptions, such as notifications, can significantly detract from user engagement and adherence. Future interventions could address these challenges by incorporating features such as scheduled reminders, disabling notifications during sessions, or restricting access to other distracting applications. Nevertheless, given their convenience, mobile-based interventions warrant further refinement rather than complete abandonment.
Additionally, the modality of chatbot input is crucial. Interventions that combine constrained and unconstrained input modalities enable users to interact more flexibly, catering to a wide range of preferences. A standardized input mode may not suit all users, particularly vulnerable groups. For instance, children might misuse an entirely unrestricted input, whereas older adults may find overly rigid input formats challenging due to sensory impairments. Future studies should aim to develop input modalities that are age-appropriate and accessible, including simplified interfaces for children and enhanced usability features for older adults.
The importance of SB
The efficacy of chatbot-based interventions on SB remains suboptimal. A primary contributing factor is the prevalent research focus on enhancing PA while neglecting SB as a significant outcome. Furthermore, the development of sustained EHs generally requires long-term repetition within stable contexts and involves deep cognitive restructuring, as outlined in theories of habit formation. Chatbots, particularly those that do not track progress or provide long-term reinforcement, often lack the necessary persistence and contextual integration to support the development of these habits. Nevertheless, SB presents considerable risks to both physical and mental health.62–64 Although regular PA yields substantial health benefits, these do not completely mitigate the detrimental effects of extended periods of SB. Surprisingly, some studies included in this review have found that chatbot-based EIs showed a link to an increase in sedentary time. To overcome this issue, effectively reducing SB may require context-specific cues, such as sedentary alerts, which prompt users to take immediate action. To improve the effectiveness of chatbots in reducing SB, future interventions could incorporate passive monitoring tools (e.g., smartphone sensors or wearable devices) and deliver timely, personalized sedentary alerts.
Strengths and limitations
This review possesses some notable strengths. Primarily, it is the first to quantitatively evaluate the effectiveness of chatbot-based EIs on EHs and SB, offering a novel perspective for future research in this domain. Additionally, all included studies were RCTs, adhering to rigorous methodological standards. Consequently, the high quality of the selected articles lends significant credibility to the findings of this review. Furthermore, the studies encompassed participants from a broad age spectrum, ranging from young adults to the elderly. This demographic diversity enhances the generalizability of the results and minimizes potential biases. However, the review also presents certain limitations. Covered studies had small sample sizes, reflecting the nascent stage of research in chatbot-based EIs. Nonetheless, all studies met the minimum required sample sizes. Another limitation is that the included studies were predominantly carried out in high-income countries, such as the United States and the Netherlands, which may limit the applicability of the findings to low-resource settings. Future research should aim to evaluate chatbot-based EIs in a variety of socioeconomic and cultural contexts. Although the number of studies included was limited, this review integrated the most recent and comprehensive research available at the time.
Conclusion
The chatbot-based EI has been shown to significantly improve PA, particularly MVPA. However, it did not exert a significant impact on EH and SB. It is noteworthy that, in SAs, the duration and type of chatbot emerged as significant influencing factors. Subsequent studies can emphasize well-designed and tailored interventions that incorporate engaging and entertaining activities and apply theories related to the formation of EH. Attention should also be directed towards strategies for minimizing distractions during mobile phone use, and exploring the subdivision of diversified input modes for specific demographic groups. Since reducing sedentary time is a viable target in chatbot-based EIs, integration of passive monitoring tools (e.g., smartphone sensors or wearable devices) to deliver timely, personalized sedentary alerts is recommended. Furthermore, more RCTs involving diverse chatbots, conducted on a larger scale, in low-resource settings, and utilizing objective measures of SB (e.g., accelerometers), along with extended follow-up, are necessary to establish definitive conclusions. Future chatbot-based EIs should also aim to design chatbots that are adaptable to varied socioeconomic and cultural contexts and investigate the potential integration of technologies like ChatGPT into EIs.
Footnotes
Acknowledgements
The authors would like to acknowledge Hong Gu, and Yongwei, Li for their insightful advice on the concept analysis and strategic direction of this study as well as their thoughtful consideration in the selection of terms.
Ethical considerations
Ethics committee approval was not needed for this research because it did not engage in clinical trials involving animals or humans.
Author contributions
The team comprising WQ, Lawrence, YWX, and XK contributed to the review by designing its structure, searching for and screening literature, selecting applicable studies, extracting key data, compiling the results, and drafting and revising the manuscript. The final manuscript received the approval of all authors.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data availability statement
The primary results of the study are recorded in both the article and supplementary materials. Contact the corresponding author for more information.
