Abstract
Aim
To synthesize recent research on artificial intelligence (AI) in intensive care unit (ICU) nursing from 2020 to 2025, highlight trends, and outline integration challenges.
Methods
A narrative synthesis approach was used, reviewing English-language studies from PubMed, Web of Science, Scopus, and IEEE Xplore. From 4138 articles, 37 studies were included.
Results
Evidence was international with strong contributions from Asia and North America. Most studies were retrospective and drew on large ICU databases such as MIMIC-III/IV and eICU. Methods were dominated by machine learning, with limited but growing deep learning. Applications clustered around early warning and risk prediction, with additional work on nursing decision support and workload or documentation support. Reported discrimination frequently exceeded AUC 0.80, while calibration, external validation, and human factors evaluation were less often described.
Conclusion
Artificial intelligence shows promise for earlier risk recognition, decision support, and workflow enablement in ICU nursing. Priorities include multicenter prospective evaluation, external validation with calibration, electronic health record-embedded implementation, and nurse codesign to ensure safe, useful, and generalizable tools.
Implications for clinical practice
Thoughtfully integrated AI can support timely decisions and reduce documentation burden when paired with real-time validation and nurse-led workflow adaptation.
Introduction
Artificial intelligence (AI) has rapidly emerged as a transformative force in healthcare, driven by its potential to improve clinical decision-making, efficiency, and patient outcomes. 1 In critical care settings, such as the intensive care unit (ICU), nurses manage vast amounts of complex, time-sensitive patient data and make high-stakes decisions under pressure. 2 Artificial intelligence technologies offer tools to assist in this context—for example, by continuously monitoring patient vitals, detecting subtle trends or deteriorations, and providing decision support that can augment nurses’ clinical judgment. 3 Over the past decade, there has been a marked increase in research exploring AI applications in critical care nursing, reflecting growing enthusiasm for integrating machine intelligence into ICU nursing practice. 4 Publications on this topic have followed an upward trajectory globally, with particularly active contributions from countries such as the United States, China, and the United Kingdom. This trend underscores a broad recognition that AI could play a significant role in optimizing ICU workflows and enhancing the precision and timeliness of nursing care. 5
Current research hotspots illustrate the diverse ways AI is being leveraged to support critical care nursing. Several major domains of AI application in ICU nursing have been intensively analyzed nowadays, including continuous patient monitoring, predictive risk modeling, clinical decision support systems, nursing interventions, documentation automation, and resource allocation. 6 Predictive analytics is especially prominent—AI models have been developed to forecast clinical events such as sepsis onset, pressure injuries, delirium episodes, or unexpected ICU transfers, enabling earlier interventions to prevent complications. 7 These innovations aim to enhance patient safety and quality of care—by improving early problem recognition, supporting complex clinical decisions, streamlining documentation, and optimizing the efficiency of care delivery in the ICU. Early studies have reported promising results, such as high accuracy in predicting adverse events or reductions in nurses’ charting time, suggesting that thoughtfully deployed AI could empower ICU nurses and improve patient outcomes. 8
Despite its promise, the integration of AI into critical care nursing comes with substantial challenges. Intensive care unit nurses and leaders have noted that AI presents both opportunities and difficulties in practice. 9 One major concern is the “black box” nature of many AI algorithms—a lack of transparency in how recommendations are generated can hinder clinicians’ trust in AI tools. 10 Nurses have emphasized that understanding an AI system's reasoning is essential for them to feel confident incorporating its suggestions into patient care. Another practical limitation is output verbosity in generative systems. Simulation work with ICU nurses shows that overly long, detail-heavy suggestions complicate information triage and distract from key clinical signals, underscoring the need for concise, nurse-tailored summaries and controls over level of detail. 11 Relatedly, potential biases in AI models (due to unrepresentative training data or flawed algorithms) raise ethical concerns, as AI could inadvertently perpetuate healthcare disparities or unsafe recommendations if not carefully monitored. 12 Implementation into clinical workflows is another challenge. As critical care is a fast-paced, unpredictable environment, and AI systems must integrate seamlessly with existing electronic health records (EHRs) and nursing routines to be truly useful. Many hospitals lack clear guidelines or standards for introducing AI into nursing practice, making it difficult to scale up successful pilot projects. 13 Additionally, nurses worry about overreliance on AI—there is a concern that if clinicians begin to uncritically depend on algorithm outputs, it could erode their clinical skills and autonomy over time. Maintaining a balance where AI provides support without diminishing the central role of human clinical expertise is therefore critical. 14 Finally, the evidence base for AI in ICU nursing, while rapidly growing, is still evolving. Many published studies are retrospective or proof-of-concept, with heterogeneous methods and endpoints, making it hard to draw definitive conclusions about real-world effectiveness.
In light of this background, the purpose of this review is to synthesize the recent literature on AI applications in critical care nursing and clarify the current state of knowledge in this domain. We specifically focus on ICU settings from a nursing perspective, examining how AI has been applied to support the work of critical care nurses and impact patient care. This review concentrates on the last five years (2020–2025) of published English-language literature, encompassing both original research studies and review articles relevant to AI in ICU nursing. By restricting to this recent time frame and including nursing-focused studies, we aim to capture the contemporary trends, innovations, and challenges that define the intersection of AI and critical care nursing. The review seeks to (1) summarize key areas of application and findings from recent studies, (2) identify prevailing themes or “hot spots” in research (such as common targets for AI such as patient monitoring or risk prediction), and (3) discuss the challenges, knowledge gaps, and implications for nursing practice and future research. Ultimately, this work is intended to provide critical care nurses, nurse leaders, and researchers with a clear overview of how AI is influencing ICU nursing today and guide efforts to harness AI effectively in this high-stakes field.
Methodology
Study design
This review employed a narrative synthesis approach to summarize and categorize original research articles focusing on AI applications in ICU nursing settings. Due to the heterogeneity in study designs, data types, AI technologies used, and outcome measures, a meta-analysis was not feasible. The review focused on empirical studies published in peer-reviewed English-language journals over the past five years.
Inclusion and exclusion criteria
Studies were included if they met the following criteria: (1) focused on the application of AI in ICU nursing practice or within clinical decision support systems that directly involve nurses; (2) comprised either original empirical research—such as primary data collection or secondary analysis of clinical datasets—or peer-reviewed review articles, including systematic reviews, scoping reviews, or narrative reviews; (3) provided a clear description of the AI methodology, including the type of algorithm used, model training, and validation approach, where applicable; and (4) reported measurable outcomes relevant to clinical performance, nursing processes, patient care, or nursing-related knowledge synthesis.
Data sources and search strategy
A comprehensive search was conducted across four major academic databases: PubMed, Web of Science, Scopus, and IEEE Xplore, to identify relevant literature on AI applications in ICU nursing. The following keywords were used in combination with Boolean operators: (“intensive care unit” OR “ICU”) AND (“nursing” OR “nurse”) AND (“artificial intelligence” OR “machine learning” OR “deep learning” OR “natural language processing” OR “predictive model” OR “generative AI” OR “large language model” OR “LLM”). The search covered studies published between 1 January 2020 and 30 October 2025. The search was limited to studies published in English. Both original empirical studies and review articles (including systematic reviews, scoping reviews, and narrative reviews) were considered eligible. Publications such as editorials, opinion pieces, conference abstracts, and nonpeer-reviewed materials were excluded to ensure the inclusion of scientifically rigorous and thematically relevant sources. An initial total of 4138 articles were retrieved.
Study selection and screening
The initial database search yielded a total of 4138 records. All retrieved articles were imported into EndNote reference management software, where duplicate entries were identified and removed using the software's de-duplication function. A manual verification step was then performed to ensure all duplicates, reviews, and clearly irrelevant records were excluded.
Following de-duplication, a two-stage screening process was conducted. In the first stage, titles and abstracts were independently screened by two reviewers to determine potential relevance based on the predefined eligibility criteria. Articles that were clearly unrelated to AI, ICU settings, or nursing practice were excluded at this stage. In the second stage, full-text reviews were carried out for the remaining records to assess whether they met all inclusion criteria. Studies were excluded during this phase for reasons including lack of nursing relevance, absence of AI applications, focus outside the ICU setting, or insufficient methodological detail. Discrepancies between reviewers were resolved through discussion or, when necessary, consultation with a third reviewer.
We conducted and report this review in accordance with PRISMA 2020. After completing the full screening process, 37 articles were included in the final synthesis, comprising original empirical studies of AI applications within ICU nursing practice. The identification, screening, eligibility, and inclusion steps are shown in the PRISMA flow diagram (Figure 1).

PRISMA flow diagram illustrating the literature screening and selection process.
Quality appraisal
The 37 eligible papers were evaluated for quality using the Critical Appraisal Skills Program (CASP) Checklist for Qualitative Research as described by previous analysis. 15 A prominent critical evaluation instrument for evaluating the overall quality of research results reports, particularly qualitative literature reviews, is CASP. Two reviewers independently assessed each study, with disagreements resolved through discussion or consultation with a third reviewer. The CASP scores ranged from 7 to 10 out of a possible 10 points, indicating generally good methodological quality across the included studies. Table 1 illustrates the results of the quality evaluation for each of the 37 papers.
Summary of quality assessment.
Results
Study characteristics
A total of 37 empirical studies on AI in ICU nursing were included, published between 2020 and 2025 (Table 2). The number of publications peaked in 2025, reflecting a heightened global interest in applying AI to critical care nursing in the wake of the COVID-19 pandemic. The research is international in scope, with notably strong representation from Asia and North America. By country, the United States contributed 17 studies; China 12 (with one additional study from Taiwan); South Korea 6; Italy 3; and the Netherlands 3. The United Kingdom and Thailand each contributed two studies, while Australia, Germany, Spain, Switzerland, New Zealand, and Turkey contributed one each. In total, 10 studies were multinational, including collaborations such as China–United States (
Summary of included studies on artificial intelligence applications in ICU nursing.
In terms of study design, the majority of the included studies were quantitative observational analyses. Using overlapping tags, 33 studies employed retrospective cohort or case–control designs, utilizing existing critical care databases or EHR data. These investigations commonly leveraged large publicly available ICU datasets such as MIMIC-III/IV (
Regarding clinical setting, population focus varied. A total of 32 studies targeted adult ICUs or covered mixed populations. In contrast, only five studies addressed pediatric critical care (PICU), typically aiming to predict complications or clinical deterioration in critically ill children.17,19,23,45,51 No studies specifically focused on neonatal ICUs (NICUs), highlighting a clear gap in AI research for neonatal critical care nursing.
Artificial intelligence methods used
Across the studies, a variety of AI and machine learning (ML) techniques were employed, with a clear dominance of data-driven modeling approaches. Traditional ML algorithms were widely used, often for predictive modeling tasks using structured clinical data such as vital signs, lab values, and nursing assessments. In particular, Gradient boosting methods (e.g., XGBoost/GBM/LightGBM/CatBoost) appeared in 15 studies. 17 19–21,26–28,32,39–43,45,49 Random forest was used in 14 studies.16,17,24,26,2730–33,35,37,40,43,50 Logistic regression appeared in 7 studies,17,19,21,26,32,34,36 typically serving as an interpretable baseline for comparison. Other methods were less common: support vector machines in 3,27–29 neural networks/DL in 4,18,29,44,46 decision trees in 1, 36 and k-nearest neighbors in 1. 38 Several papers benchmarked multiple models in parallel, and tree-based ensembles (gradient boosting and random forest) were most frequently reported as top performers for structured ICU data.
Among the four deep-learning (DL) studies, authors used architectures that are well-suited to time-series and high-dimensional ICU data. Two studies built recurrent neural network/long short-term memory models to capture temporal evolution of physiologic signals and ventilatory status for sequential risk prediction—respectively, estimating extubation failure and next-day extubation readiness in critically ill patients.29,46 One study developed a multimodal convolutional neural network (CNN) framework that fused structured clinical variables with other data sources to predict acute kidney injury (AKI) in the ICU, illustrating DL's capacity for feature learning across heterogeneous inputs. 44 Another study reported a prospective, multicenter evaluation of a DL-based early-warning system for clinical deterioration (e.g., cardiac arrest/ICU transfer) in real-world wards, directly informing ICU escalation pathways and nursing workflows. 18
Applications of natural language processing (NLP) to unstructured clinical text were uncommon but present in four studies. These works combined narrative notes with structured EHR variables to support prediction and triage. For example, studies have been analyzed it for incorporating triage free-text or nurses’ narratives alongside vitals and labs for acuity/deterioration risk.37,49 Others used multimodal pipelines that fused text with tabular inputs, including a CNN-based framework for ICU AKI risk, 44 and a language modela languagejury ICU pipelines that fused text with tabular inputs, including a CNNt for incorp. 50 By contrast, the majority of studies relied primarily on structured EHR data, underscoring that text-based ML is emerging but not yet mainstream in ICU nursing research.
Clinical applications
Early warning systems and risk prediction
A total of 30 studies focused on prediction models designed to provide early warnings of patient deterioration or other critical events such as ICU transfer/admission, mortality, delirium, extubation/weaning outcomes, and resource-related endpoints (length of ICU stay or ICU readmission). This represents the most prominent area of AI application in ICU nursing to date. These models predominantly utilized structured clinical data—including vital signs, laboratory results, demographic information, and nurse-entered documentation—to identify high-risk patients and support timely clinical decision-making.
Intensive care unit admission or transfer prediction was a common target. Several studies applied ML models—particularly random forests and logistic regression—to identify patients at risk of clinical deterioration requiring ICU transfer. For instance, Cheng et al. 16 developed a random forest model using routinely collected EHR data from COVID-19 inpatients and achieved an AUC of 0.799 for predicting ICU transfer within 24 h, demonstrating the feasibility of such tools in real-time triage. In the emergency department (ED) setting, Pandey et al. 37 developed ML models to predict ICU admission directly from ED presentations, integrating triage and early clinical information to assist escalation decisions at the front door. Beyond routine tabular data, Zakariaee et al. 38 incorporated chest CT severity scores alongside clinical variables to model ICU admission (and length-of-stay [LOS]) in COVID-19 cohorts, highlighting the value of multimodal inputs for early risk stratification. In addition, a prospective, multicenter evaluation of a DL-based early-warning system demonstrated utility for detecting imminent deterioration with downstream ICU implications in real-world wards, 18 underscoring the breadth of approaches used to anticipate ICU-level care needs.
Another major application was mortality prediction. Several studies developed and validated AI models to estimate ICU or in-hospital mortality, often comparing their performance to traditional scoring systems such as APACHE, SOFA, or MEWS. For example, Pan et al. 21 used XGBoost and logistic regression to predict mortality in COVID-19 ICU patients and achieved an AUC of 0.92, outperforming conventional risk scores. Similarly, Alghatani et al. 34 developed multiple ML classifiers, with the random forest model reaching an accuracy of ∼89% for ICU mortality prediction based on MIMIC-III data. In addition to predictive performance, some tools (e.g., SHAP or LIME) also provided interpretable outputs, highlighting risk factors like lactate, oxygenation, and comorbidities—thus aiding nurses in clinical prioritization and end-of-life planning.
Several studies aimed for ICU readmission and LOS prediction. Some studies targeted operational endpoints central to capacity planning and discharge coordination. Most models drew on structured EHR variables (vital signs, labs, demographics, coded nursing data), sometimes augmented with imaging-derived severity scores. For ICU readmission, Lim et al. 33 developed and multicenter-validated a model to flag patients at risk of readmission within 48 h after ICU discharge, supporting safer step-down and follow-up planning. For LOS, Alghatani et al. 34 trained classifiers to estimate ICU LOS alongside mortality using routine EHR features; Fan et al. 35 identified prolonged ICU stay among patients with spinal cord injury using perioperative and clinical data; and Zakariaee et al. 38 modeled ICU admission and LOS in COVID-19 by combining clinical variables with chest CT severity scores. Collectively, these studies illustrate how ML can underpin operational decision-making in critical care—from discharge timing to bed management.
Nursing decision support
A total of four studies explored the use of AI as a decision aid for nursing-specific tasks. These tools were designed to support clinical judgment, triage assessment, documentation, and care planning—areas where timely and standardized decisions are essential for patient safety in ICU settings.
In the ventilation weaning/extubation domain, Fenske et al. 46 developed models to predict next-day extubation, enabling nurses to plan shift-to-shift readiness checks and coordinate team huddles; Zappalà et al. 47 proposed a real-time weaning-readiness predictor for invasively ventilated patients, offering moment-to-moment guidance that can structure nurse-led assessments. For nursing diagnoses and escalation cues, Cesare et al. 48 used random forest models to rank standardized nursing diagnoses by their predictive relevance for ICU transfer risk across adult and pediatric cohorts, providing data-driven prioritization signals for care plans. Triage decision support was addressed by Sitthiprawiat et al., 49 who integrated nurse-captured triage assessments with structured EHR variables to identify patients at risk of critical outcomes (including ICU admission) and benchmarked the tool against conventional triage rules.
Workload and documentation support
A total of four studies focused on applying AI to support nursing workload management and clinical documentation, areas that directly influence ICU nursing efficiency and staff well-being.50–52 Unlike prediction models targeting patient outcomes, these studies centered on optimizing nurse-centered processes such as staffing, resource allocation, and structured record-keeping.
One study by Fan et al. 50 aligned language model–based methods with critical-care nursing documentation, illustrating how narrative nursing text can be structured and surfaced for downstream decision support without disrupting established charting practices. Moving from documentation to capacity planning, Palmer et al. 51 demonstrated the feasibility of forecasting future critical-care bed availability using routine bed-management data, an approach that can inform staffing and resource allocation at the unit level. Complementing this, Schiele et al. 52 developed neural-network models to predict ICU bed occupancy in support of integrated operating-room scheduling, highlighting how data-driven forecasts can bridge perioperative planning with ICU capacity.
Although these applications are still in early phases, they represent an emerging direction for AI in critical care—one that focuses not just on clinical outcomes but also on improving nurse workflow, documentation accuracy, and administrative efficiency. If successfully integrated into clinical systems, such tools have the potential to reduce clerical burden and free up more time for direct patient care.
Discussion
This narrative review reveals that the application of AI in ICU nursing is an emerging and rapidly evolving field, with most empirical studies published in the last five years. The focus has predominantly been on developing and validating predictive models that leverage large datasets of patient information (vital signs, assessments, EHR data) to assist with clinical predictions or decision support. Key areas of development include early warning scores for patient deterioration, predictive analytics for complications such as delirium or AKI, and decision support systems for nursing tasks such as triage prioritization and care planning. Collectively, the studies demonstrate that AI techniques—from traditional ML to DL—can achieve impressive accuracy in retrospective analyses. There is also a clear trend toward incorporating more sophisticated algorithms. These trends indicate a maturation of the research from purely technical proof-of-concept models toward more context-aware tools that consider usability in the nursing environment.
It is worth noting that implementation considerations for PICUs differ meaningfully from adult ICUs. Children have age-dependent physiologic norms—heart rate, respiratory rate and blood pressure vary widely by age—so alert thresholds, feature sets and model calibration cannot be directly transferred from adult models. Authoritative guidance provides age-specific acceptable ranges for unwell children and emphasizes that pattern of change matter as much as static cut-points, reinforcing the need for pediatric-specific tuning of ML systems. National PEWS programs also embed age-stratified observation charts and thresholds, underlining that pediatric early-warning differs from adult EWS by design.53,54 In evaluation, outcomes also differ. For early-warning systems outside the PICU, mortality is rare, and composite deterioration endpoints (e.g., unplanned PICU admission, urgent interventions) are more appropriate than mortality alone, implicating pediatric-appropriate target selection for model training and validation. 55 Empirically, PICU-focused ML studies illustrate these distinctions. Pediatric AKI can be predicted 24–48 h earlier than guideline thresholds using physiologic time-series, highlighting distinct pediatric phenotypes and the value of sequential data. 23 Pediatric delirium risk models built on PICU cohorts similarly required pediatric variables and workflows, with tools intended for bedside nursing use. 56 Extubation planning in PICU has leveraged expert-augmented ML to encode clinician rules alongside data, reflecting pediatric-specific practice patterns and decision thresholds. 45 Finally, staffing, sedation, and care processes differ (e.g., pediatric-specific sedation protocols; workforce mix, and staffing ratios), which can influence data quality, label definitions, and alert acceptabilitydata, reflectidesign with PICU nurses and local recalibration are critical steps before deployment. 57 Future pediatric work should report age-appropriate calibration and use pediatric-suitable endpoints (e.g., composite clinical deterioration events), incorporate multicenter PICU cohorts, and involve bedside nurses in threshold setting and usability testing prior to EHR integration.
A notable strength in the current body of evidence is the international and interdisciplinary nature of the work. Studies from multiple countries have tackled similar problems, lending a global perspective on ICU nursing challenges that AI can address. Many researchers capitalized on open ICU databases (MIMIC, eICU, etc.) and multicenter cohorts, which increases the sample size and diversity of data used to train models. The result has been robust model performance in many cases, as well as publicly available algorithms or code in a few instances, which can accelerate collective progress. Another strength is the early attention to explainability and user-centered design in some studies—for instance, providing triage nurses with explanations for AI risk scores or designing AI systems explicitly to fit into nursing workflows.58,59 This indicates that some investigators appreciate that an accurate algorithm alone is not enough but it must be interpretable and actionable for frontline nurses. Additionally, a few prospective studies and pilot implementations have been conducted, which is a critical step forward from purely retrospective research. The example of an AI-assisted triage intervention that successfully reduced mistriage in a live ED setting is an encouraging sign that these technologies can deliver real-world improvements when thoughtfully deployed.
Despite promising results, our review underscores significant limitations in the current evidence. Foremost, the level of clinical validation and implementation is limited. The vast majority of studies stopped at model development or retrospective validation stages. Only a couple of prospective trials were identified, meaning there is scant high-level evidence for actual patient outcomes or workflow improvements resulting from AI in ICU nursing. As a recent systematic review noted, the heterogeneity of study designs and lack of rigorous trials make it difficult to draw definitive conclusions about effectiveness. 7 Future research needs to move beyond accuracy metrics and assess impact on clinical outcomes, nursing efficiency, and safety in real settings. Another limitation is that many models are context-specific and may not generalize well. For example, models trained on single-center or single-country data (or on a narrow patient group such as spinal cord injury ICU patients) may perform poorly elsewhere due to differences in patient populations, clinical practices, or data recording. Indeed, several studies themselves cite generalizability as a concern and often did not externally validate their algorithms. Data quality and completeness issues were also common challenges—for example, models relying on nursing documentation noted variability or missing data in those inputs.
Artificial intelligence directions beyond tabular prediction that are directly relevant to ICU nursing. Like previously mentioned, NLP is being used to leverage nursing narratives and triage free text for risk assessment and workflow support, while multimodal/computer visionoadjacent approaches fuse structured variables with image-derived signals (e.g., CT severity indices; CNN-based frameworks) to enhance early recognition of organ dysfunction. Also, AI in nursing education and simulation is gathering evidence for virtual simulation/adaptive tools that improve learning outcomes and support rehearsal of escalation pathways, suggesting a parallel route to uplift ICU nursing skills. 60 And nurse-assistive robotics (cobots) is an emerging strand for logistics and repetitive tasks, it has been supported to emphasize the need for codesign with nurses and implementation studies before routine use in high-acuity settings. 61
To translate these strands into safe routine practice, implementation should follow contemporary reporting/appraisal standards that make external validation and calibration a requirement for transportability (e.g., TRIPOD + AI and PROBAST + AI explicitly extend earlier guidance to ML methods and emphasize transparent reporting, validation, and applicability judgments), rather than relying on internal discrimination alone. 62 In parallel, regulators have converged on lifecycle controls for EHR-embedded deploymentnternal discrimination alonestme data feeds, and monitoring for drift—through the joint FDA/Health Canada/MHRA Good ML Practice principles and their Predetermined Change Control Plan guidance, which tie postdeployment monitoring and change management to safe updates of ML devices. These controls are directly responsive to well-documented risks such as data drift and distribution shift in clinical ML. 63 Nurse-centered human-factors work is equally necessary. User-centered design, threshold setting, and alert-governance are needed to mitigate alarm burden/fatigue repeatedly documented in ICU settings. 64 Also, commissioning frameworks such as NICE's Evidence Standards Framework specify implementation and evidence expectations (including postdeployment monitoring) that nurse-led teams can adopt as a practical checklist for deployment readiness. 65
Limitations
This review has several limitations. First, it only includes studies published in English, which may exclude relevant research published in other languages. Second, the majority of the studies reviewed were retrospective and observational in nature, which limits the ability to draw definitive causal conclusions or assess the real-world impact of AI interventions. Additionally, there is significant methodological heterogeneity across the included studies, with varying AI techniques, datasets, and outcome measures, making it difficult to directly compare results. Another limitation is the lack of prospective validation studies, which are critical for establishing the clinical utility and generalizability of AI models. Finally, while we focused on empirical research, the fast-paced nature of AI advancements means that newer studies may not have been captured, potentially overlooking recent developments in the field.
Conclusion
This review adds a nursing-centered synthesis rather than a model-centered catalog. We curated a contemporary corpus of 37 empirical studies in ICU nursing and mapped applications across three practice domains that match bedside work, namely early warning and risk prediction, nursing decision support, and workload and documentation support. We quantified method use across the corpus, with gradient boosting in 15 studies, random forest in 14, and logistic regression in 7, and we identified four DL implementations with task specific architectures. We characterized settings by population, showing five PICU studies and no NICU specific studies, and by design, showing a predominance of retrospective analyses with limited external validation and calibration. We also provide a lightweight quality appraisal suited to ICU ML studies and convert the synthesis into actionable guidance on model selection for tabular versus temporal or multimodal data, on data preprocessing requirements, and on steps for embedding tools into the EHR with nurse codesign. Together these elements clarify where evidence is mature, where gaps remain, and how nurse led teams can translate current tools into real world workflows.
Artificial intelligence applications in ICU nursing are moving from concept to practice and already show promise in early warning, complication prediction, decision support, and workload streamlining. The evidence base is still largely observational with short follow-up, which limits generalizability and confidence in sustained benefit. To turn promise into dependable practice, use tree-based ensembles as strong baselines for structured EHR data and add recurrent or convolutional models for temporal or multimodal signals only after transparent calibration and external validation. Preprocessing should state how missing data are handled, how class imbalance is addressed, and how feature stability is checked. Models should also provide nurse facing explanations that support prioritization without adding cognitive load. Implementation works best when tools are embedded in the EHR with auditable versioning, clear alert routing and suppression rules, and active monitoring for model drift, and when thresholds, screens, and workflows are co designed with bedside nurses.
The field now needs multicenter prospective studies and pragmatic nurse facing randomized trials using cluster or stepped wedge designs. These studies should report both process outcomes such as documentation time, alarm exposure, escalation timeliness, and adherence to protocols, and patient or operational outcomes such as mortality, delirium, ICU transfer accuracy, LOS, readmissions, and bed flow. Models should undergo geographic and temporal validation, report calibration and fairness across subgroups, preregister analysis plans, and include surveillance after deployment for safety events, performance drift, and workload impact. Important gaps remain in pediatrics, especially NICU settings, and in the availability of international datasets. Future work should also include economic evaluations to inform scale up. The foundation laid by these 37 studies can be strengthened through multidisciplinary collaboration and nurse leadership. With thoughtful integration, AI will not replace critical care nurses but will empower them to deliver smarter, more proactive, and patient-centered care.
Supplemental Material
sj-docx-1-dhj-10.1177_20552076251406302 - Supplemental material for Artificial intelligence applications in intensive care unit nursing: A narrative review (2020–2025)
Supplemental material, sj-docx-1-dhj-10.1177_20552076251406302 for Artificial intelligence applications in intensive care unit nursing: A narrative review (2020–2025) by Aiping Bi, Tie Li, Guohui Cheng and Jing Hu in DIGITAL HEALTH
Footnotes
Contributorship
Aiping Bi: Conceived the study design, conducted the literature review, and wrote the manuscript. Tie Li: Assisted in data collection and analysis and contributed to manuscript revision. Guohui Cheng: Provided critical feedback on the methodology and interpretation of results. Jing Hu: Supervised the study, reviewed the manuscript, and coordinated the research.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data availability
Data supporting the findings of this study are available within the article. All relevant data can be made available upon request to the corresponding author.
Guarantor
JH.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
