Abstract
This research has developed an algorithm to measure behavioral health acuity that relies predominantly on patient-level information from electronic health records (EHRs) but also has nurse-input-driven parameters contributing to the tool, known as the Nurse Effort Assessment Tool (NEAT). The overall objective is to create the NEAT to address the increasing acuity of behavioral patients by moving away from subjective patient assessment to using an automated algorithm that draws predominantly from the rich data in EHRs. Data from over 9,000 patient records provided statistically significant results that the subjective rating both correlates and serves as significant predictors for the NEAT score. Future research looks to expand the algorithm and validate it across multiple clinical sites.
Introduction
The quantity and acuity of U.S. residents experiencing mental health challenges has steadily increased over the past 20 years—accelerating since the COVID-19 pandemic (Goodwin et al., 2022). To exacerbate the issue, inpatient care facilities face dramatic staffing challenges, as many nurses have retired or resigned. A recent study (American Nurses Foundation, 2022) shows that insufficient staffing is the most common reason nurses leave the profession; nine of ten nurses indicate that the hospital where they work is understaffed. In addition, hospital expense growth—of which, labor costs are the biggest contributor—has increased at a rate more than twice the average reimbursement rate (American Hospital Association, 2023). Burgeoning mental health needs combined with critical capacity issues and health system cost structures drive the need to more efficiently deploy direct care staff in the mental health inpatient setting.
In response to the increasing acuity seen in psychiatric facilities, the Harbor Performance Initiative (HPI) started a systematic process of developing an algorithm to measure behavioral health acuity that relies predominantly on patient-level information from EHRs. This paper reports on the experience of one HPI member site in collecting data to produce the NEAT measures of nurse staffing effort.
This research focuses on developing patient handoff metrics in inpatient behavioral health settings. Patient handoff is: “the process of sharing pertinent health information during transitions in patient care” (Keebler et al., 2016). Appropriate levels of nurse-to-patient ratios avoid nursing staff burnout, fatigue, cognitive stress, and psychological stress (American Psychiatric Nurse Association [APNA], 2012; Harper & McCully, 2007). In hospital environments, providing the highest quality care for patients is essential. Knowing that there are tools, processes, and procedures to help facilitate and inform healthcare personnel can assist healthcare workers. The need for appropriate assessment of behavioral health patients continues to grow with the complexity and comorbidities of patients. Mental illness is “a mental state which is characterized by alternation in thinking, mood, or behavior (or some combination thereof) and associated with significant distress and impaired functioning for an extended period” (Canadian Federation of Mental Health Nurses [CFMHN], 2014). Human Factors Engineering (HFE) is positioned to support appropriate staffing measures, environmental design, and tailored intervention approaches to maximize the safety of patients and staff. Barriers to an appropriate handoff process include clinician fatigue, poor teamwork, time constraints, and lack of knowledge of effective processes (Cannon, 2021; Keebler et al., 2016). Currently, staffing loads for nursing are predominantly based on ratios of patients per staff member with nurse assessments of patient care needs, being the most commonly researched approach. However information about the patient’s acuity level can be generated from multiple sources, and more granular ratios can be developed with more precise and complex data.
Related Work
Algorithmic Approaches
Researchers have recently developed algorithmic approaches to estimate patient needs and nurse staffing. A Discrete Event Simulation (DES) model adjusted two proven drivers of workload: (a) nurse workload capacity, and (b) patient acuity based on historical data. This simulation modeled 35 different conditions, each comprising a combination of the different acuity levels and nurse-patient ratios compared to subjective nursing assessment models (Qureshi et al., 2020). Another algorithmic approach utilized EHR abstraction to develop individual patient scores, a product of nine patient need score components (e.g., medications, lines, drains, airways, risks, wounds, orders), to automate the creation of individual patient workload score (Womack et al., 2021). This early-stage algorithm received criticism for arbitrary cut-off points, nurses’ concerns over the number of outliers in pediatric and mental health units, and its failure to calculate the number of positions needed.
Existing Measures
Acute Hospital
Four major approaches to determining staffing of nurses in acute hospitals are: (a) professional adjustment, (b) volume-based methods such as patient ratios, (c) patient prototype classification, and (d) time-task approaches (Griffiths et al, 2020). In the U.S., four popular measures of nurse staffing for intensive care units (ICUs) are: (a) the Therapeutic Intervention Scoring System (TISS), (b) the Nursing Activities Score (NAS), (c) the Subjective Workload Assessment Technique (SWAT), and (d) the National Aeronautics and Space Administration-Task Load Index (NASA-TLX). The TISS quantifies type and number of intensive care treatments for each patient to determine the needed number of nursing staff (Moreno & Morais, 1997). The NAS determines nurse workload by measuring the time to address patient needs through seven major categories and 23 activities (Miranda et al., 2003). The SWAT, which employs a human factors approach, measures three dimensions of nurse workload: cognitive, time, and psychological stress burden (Reid & Nygren, 1988). The NASA-TLX also employs a human factors approach and measures six dimensions of a nurse’s workload perception: cognitive demand, physical demand, time pressure, performance, effort, and frustration (Lebet et al., 2021; Racy et al., 2021). A common weakness of this field is the lack of research determining which approach is more accurate or appropriate in a given circumstance.
Mental Health Inpatient Facilities
Current nursing mental health staffing ratios rely primarily on surveys of nurses about their education and skill set to determine capacity. A systematic review of 20 studies on ratios of mental health nurses to inpatient psychiatric patients determined that most studies followed the same methodology: estimating a nurse-to-patient ratio by surveying nurses’ skill mix and education (Moyo, 2020). Three studies found significant associations between a more highly educated nursing workforce and a lower mortality risk. Patient characteristics in models of nursing-staff needs include primary psychiatric diagnosis, previous psychiatric hospitalization, psychiatric subdiagnoses, procedures, and number of physical subdiagnoses as measured by the Elixhauser Comorbidity Measures (ECM) (American Psychiatric Nurse Association [APNA], 2012; Park et al., 2020).
Measuring Patient Acuity
Accurately measuring patient acuity to determine required nursing effort is essential for estimating optimal nurse staffing needs, but approaches to determining patient acuity vary. A common approach is to rely on patient classification systems based on a nurse assessment (De Groot, 1989; Harper & McCully, 2007). Other approaches rely on the complex and detailed data collected in electronic patient records (Juvé-Udina et al., 2020; Needleman et al., 2011).
Variables of interest. Common measures of adverse outcomes, often referred to as nursing-sensitive indicators, include mortality, nosocomial infections, hospital-acquired pneumonia, unplanned extubation, respiratory failure, cardiac arrest, and failure to rescue (Kane et al., 2007; Needleman et al., 2011; Oner et al., 2021). Patient status variables include previous patient care, medications, activities of daily living, wounds, lines, drains, and airways (Qureshi et al., 2020; Womack et al., 2021). Factors that capture the physical characteristics and operating procedures of a nursing unit include inpatient unit floor plan, task location, walking distance, operating logic, and facility characteristics (Park et al., 2020; Qureshi et al., 2020).
Research Gaps and Open Questions
Concerns have been raised about the potential bias and inaccuracy due to the reliance on subjective ratings of nurse capacity (Lebet et al., 2021). In staffing ratios for inpatient psychiatric patients, the American Psychiatric Nurses Association issued a 2023 position statement noting the scarcity of well-supported nursing-to-patient ratios and a strong understanding of their relation to outcomes developed and evaluated with data (Johnson et al., 2023).
The behavioral health assessment literature lacks empirically developed algorithmic solutions. More specifically, algorithmic solutions that have been verified to meet the needs of practitioners by practitioners and validated through psychometric analyses that individual variables measure the latent constructs, which can lead to improved understanding. An acceptable middle ground can be achieved by combining a data-driven approach and nurse stakeholder input.
This collaboration brought together a group of researchers seeking to draw a connection between data collected and the level of care a patient would need. Further, it would have the capability to provide quantitative justification related to the expense of resources thereby advocating for appropriately adjusted reimbursement rates. The goal is to develop and iteratively prove that our underlying algorithm is usable and extensible, continuing to adapt to meet the needs of patients. The Nurse Effort Assessment Tool (NEAT) is being deployed across multiple hospital systems to validate its effectiveness.
Our two primary research questions were:
How do the subjective scores provided by the nurses associate with the overall NEAT Score?
In terms of predictive variance, how much do the individual nurse scores account for?
These led to the following two hypotheses:
H1: The nurse assessment scores will be positively correlated to the NEAT Score, so as nurse effort increases, so does the overall NEAT Score.
H2: The nurse assessment scores will represent a significant portion of the variance explained by the overall NEAT score.
Methods
Participants
Participants comprised 9,417 de-identified archived patient records captured between November 2023 and January 2024. Records were collected as an initial pilot test of our algorithm through one participating healthcare facility in the southeastern United States. Records contained the aggregate assessment score and subjective inputs from the nursing staff.
Materials
Patient records were outputted into a Microsoft Excel workbook (Version 2312) and then extracted into an IBM SPSS Statistics data file (Version 29.0.2).
Algorithm Variables
The algorithm includes a proprietary set of 12 measures extracted seamlessly from the EHRs, such as fall risk, medication, and recent admissions, and two measures from the nurses delivering the care. The subjective input from the nurses is because those two values are the nurses’ perceived state of the patient: one of which is the current state, and the other is what they believe to be the future state. Subjective data is what one gathers from the patient that you cannot use your five senses to measure (Adam et al., 2023).
The two nursing measures are:
Current State: refers to the nurse effort required for current patient care on a 1 to 4 scale: 1 refers to minimal effort, 2 refers to average care needs, 3 refers to above-average needs, and 4 refers to almost constant levels of effort.
Future State: refers to the nurse effort required for future patient care on the same 1 to 4 scale mentioned above.
Development and Verification
To understand the relationship between the subjective variables from the nurses and how they played into the overall aggregate score, correlation analyses were used first, followed by regression analysis.
Correlation Assumptions and Analysis Strategy
Bivariate correlation analysis assessed the correlation between the nurse-reported inputs and the overall NEAT score. It is acknowledged that scales such as the 4-point scale the nurses were using could be viewed as ordinal, and thereby violate the assumption of continuous variables within correlation. However, the reason we pursued this analysis technique is based on the way that end users suggested these categories. Since patients can be seen as moving within the stages (average to above average and vice versa) from a care perspective, this comparison would still be beneficial and provide insight for our regression analysis. The correlation also violated the normality assumption according to Shapiro-Wilk (p ≤ .05); however, the test was run because correlations are known to be somewhat robust to deviations from normality. The other three assumptions were met (pairs, linearity, and no significant outliers). Two correlations were run: NEAT Score versus Current State, and NEAT Score versus Future State.
Regression Assumptions and Analysis Strategy
Linear Regression was chosen as the analysis method to assess the overall nurse score. It is acknowledged (as above) that the nurse scale violates the assumption of continuous variables. This method was done this way because it is an exploratory analysis that we extend to expand both the nursing score to a scalar format (e.g., moving it from a 1 to 100 scale) and because follow-on analysis is intended to extract each of the 14 variables and look at their overall weighting score. Our data had 20 outliers (out of 9,417 records). Still, we decided to maintain the outliers as we believe they accurately represent patient data and don’t represent significant changes to the overall model. We acknowledge that multicollinearity exists due to the correlation between the two variables. Still, in terms of our research question, we are looking at the variance accounted for by both variables (including overlap) and not looking at individual contributions. The assumptions of linearity, homoscedasticity, and independence were met.
Results
In the first analysis phase, we examined the correlation between the subjective measurements and the overall NEAT score. The Pearson correlations indicate a high and significant relationship of 67% between the NEAT Score and the subjective current measurement of patient need (0.671, p < .001) and the subjective future measurement of patient need (0.672, p < .001). Next, the regression model analysis, which removed duplicates, determined that current and future intervention scores predicted 46% of the NEAT score variance (p < .001). In the regression analysis limited to 142 instances where scores shifted for the current and future intervention, these changes predicted 29% of the NEAT score variance (p < .001). This finding on the change intervention scores verified the robustness and predictability of the NEAT score.
Discussion
Few research studies have examined approaches to optimize the use of extensive data in EHRs to determine nursing staff levels in inpatient mental health facilities. A strength of NEAT is the inclusion of 12 measures derived based on EHR data and two measures based on nurse assessments of patient level of care. The correlation analysis indicated a high degree of relationship (67.1%; 67.2%) between the EHR measure of patient needs with the nurse assessment of current and future care. The regression analysis, which removed duplicate cases, indicated that current and future nurse scores predicted 46% of the NEAT variance. Therefore, NEAT is capturing nearly half of the patient needs that nurse assessments do.
Implications and Applied Use Cases
This study is the first step in incorporating the rich objective measures documented in the EHR with nurse assessments to determine the level of nursing care patients require. Studies on acute hospital care present an initial list of nurse-sensitive conditions that must be modified to mental health conditions (Kane et al., 2007; Needleman et al., 2011; Oner et al., 2021).
Future Work
Future research could examine how well NEAT nursing levels prevent adverse patient outcomes in mortality, morbidity, longer patient stays, and readmissions. To continue its work on inpatient behavioral health staffing, HPI will implement NEAT across participating inpatient facilities to benchmark the labor resources required to provide quality mental health care. The information gained will provide inpatient mental health facilities with evidence-based staffing requirements during payment rate negotiations.
Conclusion
NEAT advances the field of nursing care by developing an algorithmic approach to determine nurse-to-patient staff ratios (Griffiths et al., 2020). By relying on patient-level data in EHRs, NEAT provides a less time-intensive approach to identifying nurse staffing requirements. Measures of the high correlation between the 12 EHR-derived and two nurse-created measures indicate the potential for future versions of NEAT to improve the power of EHR-derived measures to more closely match the nursing measures.
Footnotes
Acknowledgements
We thank the Harbor Performance Initiative (HPI) which launched a 2017 workgroup to study patient acuity measurement in inpatient behavioral health units. This work led to the development of the NEAT
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
