Sage Journals: Discover world-class research

Abstract

The current study evaluated the use of a machine learning model to determine benefit of medical record variables in predicting geriatric clinic communication requirements. Patient behavioral symptoms and global cognition, medical information, and caregiver intake assessments were extracted from 557 patient records. Two independent raters reviewed the subsequent 12 months for documented (1) incoming caregiver contacts, (2) outgoing clinic contacts, and (3) clinic communications. Random forest models’ average explained variance in training sets for incoming, outgoing, and clinic communications were 7.42%, 3.65%, and 6.23%, respectively. Permutation importances revealed the strongest predictors across outcomes were patient neuropsychiatric symptoms, global cognition, and body mass, caregiver burden, and age (caregiver and patient). Average explained variance in out-of-sample test sets for incoming, outgoing, clinic communications were 6.17%, 2.78%, and 4.28%, respectively. Findings suggest patient neuropsychiatric symptoms, caregiver burden, caregiver and patient age, patient body mass index, and global cognition may be useful predictors of communication requirements for patient care in a geriatric clinic. Future studies should consider additional caregiver variables, such as personality characteristics, and explore modifiable factors longitudinally.

Keywords

geriatrics caregiver burden health care utilization

Introduction

Providing direct medical care for patients represents only a portion of the responsibilities of health care providers in geriatric medicine. A considerable amount of time is also spent in communications, such as coordinating care within and outside of the treatment team and returning calls to caregivers. This important responsibility contributes significantly to workload: one study found that, on average, physicians spend over an hour per day responding to phone calls.¹ Longer hours of work may, in turn, contribute to feelings of compassion fatigue, stress, and burnout,^2-4 underscoring the importance of understanding factors that influence communication requirements for health care providers.

Older adults, particularly persons with dementia, are often cared for by informal caregivers,⁵ unpaid family members or friends, who provide valuable information to physicians and advocate on the patient’s behalf. Past work from another population demonstrates an association between caregiver burden and the frequency of their clinic contacts,⁶ suggesting that distress in a caregiver could contribute to workload for the health care provider. More recently, a study drawing from geriatric clinic records showed small but significant correlations between caregiver distress and both outgoing and clinic communications, though these relationships did not consistently remain significant after controlling for the care recipient’s severity of dementia (Martin et al, in preparation). That same study also found that demographic aspects of the caregiver, including younger age and female gender, were associated with a higher number of communications. By examining the factors that drive communication requirements in geriatric clinics, health care providers can better allocate resources to patients and caregivers who are likely to require the most attention. This proactive approach might help ensure that resources are distributed efficiently and effectively. Doing so might improve caregiver and patient outcomes, such as reduced caregiver burden or increased satisfaction with care, while also reducing feelings of overwork in the health care professional.

Prior efforts contribute to our understanding of these issues, but a broader perspective of the factors that predict geriatric clinic communications requirements is needed. Analysis of information collected in this clinical context (i.e., medical record data) using machine learning might prove useful to this end. Machine learning techniques allow for analysis of a very large number of predictors with linear and nonlinear relationships among predictor and outcome variables,⁷ making it ideal for analyses utilizing medical record data. The current study evaluated the use of a machine learning model to determine the most useful variables for predicting communication requirements in a geriatric clinic when managing patient care, while also gauging the use of such a model in predicting clinic communication needs for future patient-caregiver dyads.

Methods

Participants

Data were gathered from 557 patient-caregiver dyads in a clinical registry of an outpatient geriatric clinic that provides specialty services for dementia. All patients presented for their initial evaluation between 04/11/2017 and 06/01/2018. To be included in the present study, patients were required to: (1) have a clinical diagnosis of major or mild neurocognitive disorder after comprehensive evaluation with a geriatrician, (2) have a caregiver who completed clinical caregiver assessments, and (3) remain a patient of the clinic for 12 months after initial intake. Participants were excluded if: (1) the care recipient moved to a structured living facility during the study period (i.e., nursing or assisted living care), (2) records suggested the dyad opted not to use the memory clinic for primary memory care (i.e., indicated they had begun primary memory care elsewhere, moved, or failed to present for scheduled appointments), as these indicators would suggest the dyad is seeking treatment elsewhere, eliminating the need to contact this particular clinic, or (3) the care recipient did not complete a brief measure of global cognition.

Measures

All of the following variables were gathered by medical chart review.

Communication-related outcomes

Incoming contacts were the total number of communications (i.e., calls or emails) originating from informal caregivers to the clinic during the 12 months following the initial appointment. These incoming contacts represented a variety of requests such as medication refills or adjustments, appointment scheduling, and inquiries for information relating to the patient’s disease. Outgoing contacts represented the total number of outgoing communications (i.e., calls or emails) made by clinic staff, including physicians, social workers, and support staff during the 12 months following a patient’s initial appointment. Typical themes of outgoing contacts were answering questions of caregivers, advising next steps or further evaluations, communication of test results, and medication changes. Clinic communications were the total number of intra-clinic messages between staff recorded in the patient’s medical record over the 12 months following the patient’s initial appointment. These were counted when a clinic staff member created a note adding new information, asking a question, making a request, or other messages that required a response from another clinic staff member.

Two trained raters independently classified and counted contacts. Interrater reliability was assessed via an intraclass correlation coefficient 2-way mixed effects model with an absolute agreement definition for each subject. Final agreement across individual cases was .98. Data between raters were averaged together, forming a single variable as done in previous research with continuous data rated by multiple researchers.⁸

Caregiver information

Demographic information for caregivers including age, gender, years of education, and relationship to the care recipient was collected from self-report measures routinely administered by the clinic. Caregivers completed several measures related to their caregiving role: the Zarit Burden Interview (ZBI),⁹ Pearlin Self-Mastery Scale (SMS),¹⁰ and Positive Aspects of Caregiving scale (PAC).¹¹ The ZBI⁹ is comprised of 22 statements that describe how someone may feel while providing care for an individual with an illness (e.g., “Do you feel strained when you are around your relative?” “Do you feel uncomfortable about having friends over because of your relative?”). Participants were asked to indicate how frequently they feel that way ranging from 0 (never) to 4 (nearly always). Scores are calculated for each caregiver by summing all items, with higher scores indicating greater burden. Psychometric properties of the ZBI include strong correlations with other measures of caregiver burden, Cronbach’s alpha ranging from .82-.92.^9,12 The SMS¹⁰ consists of 10 items, each rated on a 5-point Likert scale from 1 (strongly disagree) to 5 (strongly agree), that measures an individual's perceived sense of control over their life and ability to cope with stressors. Sample items from the SMS include “I have little control over the things that happen to me.” and “I can do just about anything I really set my mind to.” Total scores on the SMS range from 10 to 50, with higher scores indicating a greater sense of self-mastery. It has been found to have high internal consistency reliability, with Cronbach's alpha coefficients typically ranging from .75 to .85. Construct validity has been demonstrated through correlations with other measures of psychological well-being and coping.^10,13 The PAC¹¹ contains 9 items that are designed to assess the positive aspects of caregiving, which may include personal growth, improved relationships, and a sense of purpose. Each item is rated on a 5-point Likert scale ranging from strongly disagree (1) to strongly agree (5). Sample items from the PAC include: “I feel that my relationship with the care recipient has grown stronger through my caregiving.” and “I have learned to appreciate the value of life through my caregiving experiences.” Scores on the PAC range from 9 to 45, with higher scores indicating a greater perception of positive aspects of caregiving. The PAC has been found to have good internal consistency reliability, with Cronbach's alpha coefficients typically ranging from .70 to .90 across different samples. Construct validity has been demonstrated through correlations with other measures of psychological well-being and caregiving stress.¹⁴ See Table 1 for demographics and measures.

Table 1.

Patient-Caregiver Dyad Descriptives.

	Mean (SD)		%
Caregiver	Age (years)	62.69 (12.74)	Sex (female)	70.4
	Education (years)	14.31 (2.63)	Relation to patient
	ZBI	28.91 (17.07)	Child	54.5
	SMS	19.82 (3.75)	Spouse	33.3
	PAC – SA	19.68 (5.16)	Friend/Family	9.8
	PAC – OL	7.57 (2.01)	Other	2.4
	CMAI – Agg	10.23 (2.62)
	CMAI – PNA	10.36 (6.11)
	CMAI – VAB	12.39 (6.95)
	BEHAV5	1.87 (1.60)
Patient	Age (years)	79.42 (8.20)	Sex (female)	64.8
	Education (years)	12.62 (2.84)	MMSE or MoCA (MoCA)	81.1
	PHQ-9	3.44 (5.95)	Patient smoking history
	MMSE score^a	12.07 (5.11)	Current	7.2
	MoCA score^b	16.64 (5.56)	Former	40.5
	BMI (kg/m²)	27.19 (5.76)	None	52.3
			Alcohol history
			Current	28.9
			Former Heavy	2.5
			None	68.6
			Marital status
			Married	47.0
			Widowed	37.8
			Divorced	10.8
			Never Married	4.4
			Living arrangement
			With One Other	64.6
			Alone	31.1
			With Multiple Others	4.3
			Race
			White	88.9
			Black	9.8
			Asian	0.7
			Latino	0.3
			Other	0.3

Note: Table values based on available data before multiple imputation.

^aFor patients who completed the MMSE.

^bFor patients who completed the MoCA. Agg – Physically Aggressive Behaviors, BEHAV5 – BEHAV5 Scale, BMI – Body Mass Index, CMAI – Cohen-Mansfield Agitation Inventory, MMSE – Mini-Mental State Examination, MoCA – Montreal Cognitive Assessment, OL – Outlook on Life, PAC – Positive Aspects of Caregiving Scale, PHQ-9 – Patient Health Questionnaire-9, PNA – Physically Non-Aggressive Behaviors, SA – Self-Affirmation, SMS – Self-Mastery Scale, VAB – Verbally Aggressive Behaviors, ZBI – Zarit Burden Interview.

Patient information

Care recipient demographic information included gender, race, education, living arrangement, and marital status. Per HIPAA regulations, patients 90 years of age and over represent a vulnerable, identifiable group, and their specific ages were not made available for analyses; patient age was recorded as continuous data through age 89. Direct patient assessments included cognitive performance and depressive symptoms. Cognition was measured using one of 2 brief measures of global cognition, the Mini-Mental State Examination (MMSE),¹⁵ or the Montreal Cognitive Assessment (MoCA).¹⁶ These measures screen across multiple cognitive domains including memory, orientation, attention, language, and visuospatial functions. The MoCA has test-retest reliability of .92 and internal consistency with a Cronbach’s alpha of .83.¹⁶ The MMSE demonstrates a test-retest reliability between .80 and .95 and a Cronbach’s alpha between .68 and .96.¹⁷ Patients also completed the Patient Health Questionnaire-9 (PHQ-9), a brief screening measure for current depressive symptoms.¹⁸ The questions in the PHQ-9 ask the patient to rate how often they have experienced certain symptoms of depression over the past 2 weeks ranging from 0 (not at all) to 3 (nearly every day). The total score is calculated by summing the scores for each question and ranges from 0 to 27, with higher scores indicating more severe depressive symptoms. The measure has high diagnostic accuracy (sensitivity of 77% and specificity of 89%), good reliability (.86 to .91), and good internal consistency (.89).^18,19 Caregivers also completed 2 measures on their care recipient’s behavior: the Cohen-Mansfield Agitation Inventory (CMAI),²⁰ and BEHAV5+.²¹ The CMAI is a 29-item questionnaire that measures various types of agitated behaviors. The items on the CMAI are rated on a 7-point scale (1 = never to 7 = several times per hour). Items fall into 3 categories including psychically aggressive behaviors (e.g., hitting, scratching), physically non-aggressive behaviors (e.g., pacing, wandering), and verbally aggressive behaviors (e.g., cursing, yelling). The measure demonstrates good test-retest reliability (.95), and good concurrent validity with other measures (.89).^20,22 The BEHAV5+ is a 6-item scale that screens for the following behaviors exhibited by the patient within the past month: agitation, hallucinations, irritability, suspiciousness, indifference, and sleep problems. Caregivers indicate Yes (1) or No (0), and a higher total score suggests greater presence of behavioral symptoms. The measure shows good internal consistency (.77), high test-retest reliability (.88), and good convergent validity with related measures (.81 - .87).²³ See Table 1 for a full list of measures and demographics. Additionally, information regarding the patient’s health profile (i.e., body mass index, diagnoses, medication use, and surgical history) were recorded (supplemental material).

Analyses

Multiple imputation

Of 557 patient-caregiver dyads, only 200 dyads had complete data for all variables under consideration. Given that listwise deletion as a missing data method results in reduced power and can result in inaccurate estimations when data are not missing completely at random (MCAR),²⁴ we utilized multiple imputation to address missing data. Compared to other missing data methods, such as listwise deletion, pairwise deletion, or single (e.g., mean, mode, median) imputation, multiple imputation holds several advantages. It accounts for error in estimation of missing data values by estimating several potential missing values, increases statistical power compared to deletion strategies, and is suitable when data are either missing at random (MAR) or MCAR.²⁴

To conduct multiple imputation, we utilized the fully conditional specification approach via the Multivariate Imputation by Chained Equations (MICE) package,²⁵ in R 4.0.1 (cran.r-project.org), imputing 10 datasets using 15 iterations. To ensure that imputed values for continuous variables were within logical ranges and to increase robustness to violations of normality,²⁶ all continuous values were imputed using predictive mean matching (PMM), utilizing 5 donors. Categorical variables were imputed using multinomial logistic regression. We used an inclusive strategy for selecting predictor variables for multiple imputation.²⁷ Specifically, any variables that demonstrated at least a 1% variance overlap with one another (i.e., Pearson’s r > .1 for 2 continuous variables, η2 > .01 for a categorical and continuous variable, or Cramer’s V >.1 for 2 categorical variables) were used in each other’s imputation, as this represents at least a small effect size in behavioral research.²⁸ Individual questionnaire items were used in imputation models, for instances in which patients or caregivers skipped one or 2 items in a questionnaire while completing all other items.

To account for some patients having completed a MoCA while others completed an MMSE, MoCA and MMSE total scores were aggregated into one “global cognition” column in the dataset, and a supplementary “MMSE or MoCA” categorical variable was added to indicate which test each patient completed. This step prevented the addition of a variable with ∼80% missingness (MMSE score) into multiple imputation and subsequent analyses, as most patients completed a MoCA. To determine whether performance on these 2 tests was differentially associated with other variables, we assessed whether an interaction between global cognition score and test type (MoCA vs MMSE) significantly predicted all other variables in the dataset. In the event of a significant interaction in the prediction of a variable (α = .05), this interaction term and its simple effects were included as predictors for that variable. Throughout multiple imputation, this interaction term was imputed via passive imputation.

Random forests

We utilized random forests to predict the number of incoming contacts, outgoing contacts, and clinic communications using patient and caregiver variables. The random forests algorithm is a nonparametric machine learning algorithm that can handle a very high number of predictors, as well as capture nonlinear relationships among predictors and outcome variables.⁷ It is an extension of classification and regression trees (CART), which utilize a splitting rule to categorize cases based on predictor variable cutpoints that yield the most accurate prediction of outcome variables (e.g., if MoCA score <22, predict 8 incoming calls; if > 22, predict 4 incoming calls). This process can be repeated using multiple variables, as well as the same variable several times (i.e., nonlinear relationship), dividing the sample into smaller groups until a stopping criterion determined by the user is reached.⁷ Compared to the use of only one tree with CART, random forests reduces the likelihood of overfitting in several ways: (1) random forests uses multiple trees and selects a predicted value through a “voting” process, aggregating estimates from several trees to produce a final estimate; (2) each tree can only access a pre-specified number of random predictor variables to make its cutpoints, which can result in detection of relationships that may not have been identified if all predictors were considered simultaneously, given that the best predictor available is always chosen; (3) samples for each tree are bootstrapped with replacement from the study sample, resulting in a slightly different sample for each tree.⁷ These differences increase the likelihood that a random forests model will maintain its predictive accuracy when used in other samples.

To determine the optimal number of minimum samples per leaf (i.e., the minimum number of patient-caregiver dyads required in each resulting group for a split to be made in a regression tree), as well as the optimal number of maximum features (i.e., the number of predictor variables considered in each regression tree), we utilized 10-fold cross-validation: minimum dyad values between 10 and 55, as well as maximum feature values between 5 and 25, were considered. The best combination of minimum dyad size and number of maximum features was determined by selecting the model with the lowest mean squared error. With 10 imputed datasets for multiple imputation and 3 outcomes considered in each dataset, 30 final models were ultimately evaluated. The importance of each predictor variable within final models was assessed using permutation importances.⁷

To determine the predictive accuracy of our trained random forests models for new dyads, we divided each imputed dataset into a training set (80% of total sample) and a test set (20%). Similar outcome variable distributions between test and training sets were obtained by stratifying the outcome variable during splitting, and the dyads comprising training and test sets were kept consistent across all imputed datasets. The aforementioned cross-validation process was completed only using dyads in the training set; optimal models that were produced using dyads in the training set were then used to predict outcome variables for dyads in the test set. Cross-validation and random forest modeling were completed using the scikit-learn library in Python 3.7.3.²⁹ Random forest models were created using the sklearn.ensemble.RandomForestRegressor function with 1000 estimators. While the number of minimum samples per leaf and maximum number of features were decided via cross-validation, all other function arguments remained at their default values. Finally, to describe model performance, each model’s explained variance was calculated.

Results

Descriptive Statistics and Missing Data

See Table 1 for patient-caregiver dyad descriptives. Patient medical history, medication, and surgery descriptives can be found in the supplemental material. The percentage of missing data for each variable used in multiple imputation can also be found in the supplemental material.

Random Forests

See Tables 2 through 4 for results of cross-validation and test set prediction via random forests. Random forest models’ average explained variance in cross-validation training sets for incoming calls, outgoing calls, and clinic communications were 7.42%, 3.65%, and 6.23%, respectively. Permutation importances revealed that the strongest predictors for all 3 outcomes were the BEHAV5, the ZBI, CMAI subscales, caregiver and patient age, patient body mass index, and patient global cognition scores (Figures 1 through 3), with other variables contributing little to nothing to the models. Average explained variance in cross-validation test sets for incoming contacts, outgoing contacts, and clinic communications were 1.79%, .70%, and 1.74%, respectively. However, these estimates underestimated model performance in out-of-sample test sets; average explained variance in out-of-sample test sets were 6.17%, 2.78%, and 4.28%, for incoming contacts, outgoing contacts, and clinic communications, respectively.

Table 2.

Results for Best Incoming Calls Random Forests Models Identified via Cross-Validation.

Imputed dataset #	Max features	Min leaf size	CV train set variance explained (%)	CV test set variance explained (%)	Test set variance explained (%)
1	20	55	5.50	1.50	5.68
2	20	55	6.01	1.84	6.85
3	20	55	5.46	1.26	5.75
4	20	55	5.25	1.30	5.28
5	20	55	5.59	1.37	5.26
6	20	55	5.87	1.88	6.76
7	20	55	5.45	1.35	4.73
8	20	55	6.13	2.35	6.30
9	20	55	5.90	2.39	6.20
10	20	30	11.08	2.68	8.93

Note: CV – Cross-Validation. Models were trained using sklearn.ensemble.RandomForestRegressor from the scikit-learn Python library.

Table 3.

Results for Best Outgoing Calls Random Forests Models Identified via Cross-Validation.

Imputed dataset #	Max features	Min leaf size	CV train set variance explained (%)	CV test set variance explained (%)	Test set variance explained (%)
1	15	55	3.29	.65	2.82
2	25	55	4.10	.94	3.28
3	15	55	3.20	.37	2.36
4	15	55	3.32	.69	2.50
5	15	55	3.26	.30	1.90
6	15	45	4.51	.77	4.35
7	15	55	3.23	.34	1.97
8	15	55	3.45	.85	2.09
9	15	55	3.31	.52	2.44
10	25	50	4.85	1.58	4.05

Note: CV – Cross-Validation. Models were trained using sklearn.ensemble.RandomForestRegressor from the scikit-learn Python library.

Table 4.

Results for Best Clinic Communications Random Forests Models Identified via Cross-Validation.

Imputed dataset #	Max features	Min leaf size	CV train set variance explained (%)	CV test set variance explained (%)	Test set variance explained (%)
1	25	50	5.35	1.75	4.76
2	15	25	9.25	1.68	4.25
3	25	55	4.55	1.23	4.01
4	15	25	9.07	1.94	5.29
5	15	25	9.07	1.50	3.97
6	25	45	6.45	2.22	4.25
7	15	25	9.31	1.41	3.54
8	25	50	5.49	1.68	3.79
9	25	50	5.36	1.49	3.43
10	20	25	10.24	2.50	5.50

Note: CV – Cross-Validation. Models were trained using sklearn.ensemble.RandomForestRegressor from the scikit-learn Python library.

Figure 1.

Box-and-whisker plot of permutation importances from the random forests model for incoming calls in the 10th training set. Agg – Physically Aggressive Behaviors, BEHAV5 – BEHAV5 Scale, BMI – Body Mass Index, CMAI – Cohen-Mansfield Agitation Inventory, Global Cognition – Mini-Mental State Examination or Montreal Cognitive Assessment, OL – Outlook on Life, PAC – Positive Aspects of Caregiving Scale, PNA – Physically Non-Aggressive Behaviors, VAB – Verbally Aggressive Behaviors, ZBI – Zarit Burden Interview.

Figure 2.

Box-and-whisker plot of permutation importances from the random forests model for outgoing calls in the 10th training set. Agg – Physically Aggressive Behaviors, BEHAV5 – BEHAV5 Scale, BMI – Body Mass Index, CMAI – Cohen-Mansfield Agitation Inventory, Global Cognition – Mini-Mental State Examination or Montreal Cognitive Assessment, OL – Outlook on Life, PAC – Positive Aspects of Caregiving Scale, PNA – Physically Non-Aggressive Behaviors, VAB – Verbally Aggressive Behaviors, ZBI – Zarit Burden Interview.

Figure 3.

Box-and-whisker plot of permutation importances from the random forests model for clinic communications in the 10th training set. Agg – Physically Aggressive Behaviors, BEHAV5 – BEHAV5 Scale, BMI – Body Mass Index, CMAI – Cohen-Mansfield Agitation Inventory, Global Cognition – Mini-Mental State Examination or Montreal Cognitive Assessment, Hx – History, OL – Outlook on Life, PAC – Positive Aspects of Caregiving Scale, PNA – Physically Non-Aggressive Behaviors, VAB – Verbally Aggressive Behaviors, ZBI – Zarit Burden Interview.

Discussion

The present study used a machine learning model to uncover the most useful variables for predicting communication requirements in a geriatric clinic specializing in dementia, and tested them to predict needs for future patient-caregiver dyads. Results indicated that patient behavioral symptoms, caregiver burden, caregiver and patient age, patient body mass index, and global cognition may be useful predictors of communication requirements for patient care in this setting. However, much variance remains to be explained in the prediction of communication requirements, suggesting that additional variables should be considered.

The present study expands upon past work investigating predictors of caregiver communications by exploring a large number of variables available within the electronic medical record and using an advanced statistical technique that can make use of all available information. Previous studies have demonstrated a relationship between specific caregiver characteristics (e.g., caregiver burden, caregiver age) and caregiver communications (Martin et al, in preparation); however, these works relied on statistical methods that limited the number of variables that could be considered. The current study demonstrates that variables beyond those previously considered, including patient behavioral symptoms, patient health (particularly pertaining to body mass index), and global cognition are also important factors to examine in the context of caregiver communication needs.

Discovery of these predictors of communication needs has several important implications. The current work made use of information from the medical record that was available at the initial intake; using this information could help administrative decision-making regarding allocation of resources. For example, anticipated workload for each patient-caregiver might be given a weighted caseload estimate to be considered when assigning the treatment team. This could facilitate even distribution of cases that are likely to require greater support. If supported by future work, specific interventions targeting predictors, such as behavioral symptoms of dementia, might also reduce communication needs and ultimately workload for staff, potentially mitigating stress and burnout.

The present study includes strengths and limitations. It is the first known attempt to predict caregiver communication requirements from a broad set of variables available in medical records, and used an advanced statistical technique capable of effectively handling a large number of predictors. This work made use of naturalistic data accessible to clinicians so that findings are likely to generalize to real world settings. In other words, results should be relevant and useful for health care providers who work in the actual clinical environments where patients are treated. However, it is noted that the current work is not theory-based, and the techniques used do not shed light on the specific relationships with predictors, including directionality.⁷ Another limitation of the current work is a lack of ethnic diversity in sample demographics. Ethnicity has been linked to caregiver outcomes,³⁰ making it an important aspect of background to consider – results might have differed in a more diverse sample. Additionally, analyses included communications data from a 1-year period; this timeframe was used to reduce attrition, but does not reflect the entire course of dementia. Particularly given that predictors were retrieved from information available at the time of intake, the study design is not able to identify how these variables might change over time, and what these relationships could look like at later stages. Finally, overall study data and some significant predictors, including the BEHAV5 and ZBI, relied on varying levels of imputed data. While advanced imputation techniques were utilized to reduce risks associated with listwise deletion, the need for future studies to confirm findings is underscored.

Current findings and limitations of this study highlight several areas for future research. Foremost, further work to examine directionality is needed. Intuitively, it would seem that greater behavioral symptoms and caregiver burden, and poorer performance on the measure of global cognition would prompt greater caregiver contact. However, other variables are less clear: does higher BMI connote greater medical risk from obesity-associated disease,³¹ and thus caregiver contacts? Or does lower BMI suggest a decline associated with frailty,³² triggering caregiver contacts? In addition, several predictors of communication requirements, including patient behavioral symptoms, caregiver burden, and patient body mass index, may be modifiable. Once directionality is firmly established, future work could explore the effects of intervention for these predictors and observe any influence on communication requirements. Because the current study suggests that much variance remains to be explained in these outcomes, other predictors should be considered, as well. Traits including personality and health characteristics of the caregiver, as well as caregiver social support and perceived social support, could also be predictive of communication requirements.^33,34 As the current work made use of variables available from existing medical records, future research may benefit from utilizing theoretical models of communication and/or health care utilization to guide further analyses using additional predictors. Further, future work could examine actionable information such as relationships between communication requirements and number of probable office visits, as this type of information in particular would be useful to a clinical audience. Addressing the above questions in next steps will contribute to a conceptual theoretical framework, which might then be explored to more comprehensively understand the nature of these relationships and drivers of communications.

In conclusion, the present study demonstrated that patient behavioral symptoms, caregiver burden, caregiver and patient age, patient body mass index, and global cognition may be useful predictors of communication requirements for patient care in a geriatric clinic providing specialty care for individuals with dementia. Findings provide a foundation for further work to examine how these variables influence caregiver communication needs. Future work is needed to build a more comprehensive framework to understand these relationships and explore additional predictors not included in the medical record (e.g., caregiver traits).

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iDs

John T. Martin

Jason R. Anderson

Kimberly R. Chapman

References

Fernando

Consedine

. Beyond compassion fatigue: The transactional model of physician compassion. J Pain Symptom Manage. 2014;48(2):289-298.

Bakker

Schaufeli

Sixma

, et al.. Patient demands, lack of reciprocity, and burnout: A five-year longitudinal study among general practitioners. J Organ Behav. 2000;21(4):425-441.

Ray

Wong

White

, et al. Compassion satisfaction, compassion fatigue, work life conditions, and burnout among frontline mental health care professionals. Traumatology. 2013;19(4):255-267.

Woodward

Ferrier

Cohen

, et al.

How is family physicians’ work time changing?

Can Fam Physician. 2001;47:1414-1421.

Wolff

Spillman

Freedman

, et al. A national profile of family and unpaid caregivers who assist older adults with health care activities. JAMA Intern Med. 2016;176(3):372-379.

Spitznagel

Cox

Jacobson

, et al. Assessment of caregiver burden and associations with psychosocial function, veterinary service use, and factors related to treatment plan adherence among owners of dogs and cats. J Am Vet Med Assoc. 2019;254(1):124-132.

Strobl

Malley

Tutz

. An introduction to recursive partitioning: Rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychol Methods. 2009;14(4):323-348.

Ross

Girard

Wright

, et al. Momentary patterns of covariation between specific affects and interpersonal behavior: Linking relationship science and personality assessment. Psychol Assess. 2017;29(2):123-134.

Zarit

Reever

Bach-Peterson

. Relatives of the impaired elderly: Correlates of feelings of burden. Gerontol. 1980;20(6):649-655.

10.

Pearlin

Schooler

. The structure of coping. J Health Soc Behav. 1978;19(1):2-21.

11.

Tarlow

Wisniewski

Belle

, et al.. Positive aspects of caregiving: contributions of the REACH project to the development of new measures for Alzheimer’s caregiving. Res Aging. 2004;26(4):429-453.

12.

Hébert

Bravo

Préville

. Reliability, validity and reference values of the Zarit burden Interview for assessing informal caregivers of community-dwelling older persons with dementia. Can J Aging. 2000;19(4):494-507.

13.

Pearlin

Menaghan

Lieberman

, et al. The stress process. J Health Soc Behav. 1981;22(4):337-356.

14.

Schulz

Newson

Mittelmark

, et al. Health effects of caregiving: The caregiver health effects study: An ancillary study of the cardiovascular health study. Ann Behav Med. 1997;19(2):110-116.

15.

Folstein

McHugh

. Mini-mental state. A practical method for grading the cognitive status of patients for the clinician. J Psychiatr Res. 1975;12:189-198.

16.

Nasreddine

Phillips

Bédirian

, et al. The montreal cognitive assessment, MoCA: A brief screening tool for mild cognitive impairment. J Am Geriatr Soc. 2005;53(4):695-699.

17.

Tombaugh

McIntyre

. The mini-mental state examination: A comprehensive review. J Am Geriatr Soc. 1992;40(9):992.

18.

Kroenke

Spitzer

Williams

. The PHQ-9: Validity of a brief depression severity measure. J Gen Intern Med. 2001;16(9):606-613.

19.

Manea

Gilbody

McMillan

. A diagnostic meta-analysis of the Patient Health Questionnaire-9 (PHQ-9) algorithm scoring method as a screen for depression. Can Med Assoc J. 2012;184(3):E191-E196.

20.

Cohen-Mansfield

Marx

Rosenthal

. A description of agitation in a nursing home. J Gerontol. 1989;44(3):M77-M84.

21.

Borson

Scanlan

Sadak

, et al. Dementia services mini-screen: A simple method to identify patients and caregivers in need of enhanced dementia care services. Am J Geriatr Psychiatry. 2014;22(8):746-755.

22.

Livingston

Johnston

Katona

, et al. Systematic review of psychological approaches to the management of neuropsychiatric symptoms of dementia. Am J Psychiatry. 2005;162(11):1996-2021.

23.

Borson

Sadak

. BEHAV5+: A new tool for screening and monitoring behavioral symptoms in dementia. Alzheimers. Dement. 2019;15(7):P1210-P1211.

24.

Van Ginkel

Linting

Rippe

RCA

, et al. Rebutting existing misconceptions about multiple imputation as a method for handling missing data. J Pers Assess. 2019;102(3):297-308.

25.

Van Buuren

Groothuis-Oudshoorn

. Mice: Multivariate imputation by chained equations in R. J Stat Softw. 2011;45(3):1-67.

26.

Marshall

Altman

Royston

, et al. Comparison of techniques for handling missing covariate data within prognostic modelling studies: A simulation study. BMC Med Res Methodol. 2010;10(7):1-16.

27.

Enders

. Applied Missing Data Analysis. New York, NY: Guildford Press; 2010.

28.

Cohen

. A power primer. Psychol Bull. 1992;112(2):155-159.

29.

Pedregosa

Varoquaux

Gramfort

, et al. Scikit-learn: Machine learning in Python. J Mach Learn Res. 2011;12(85):2825-2830.

30.

Siegler

Brummett

Williams

, et al. Caregiving, residence, race, and depressive symptoms. Aging Ment Health. 2010;14(7):771-778.

31.

Villareal

Apovian

Kushner

, et al. Obesity in older adults: Technical review and position statement of the American Society for Nutrition and NAASO, the Obesity Society. Am J Clin Nutr. 2005;82(5):923-934.

32.

Trevisan

Crippa

, et al. Nutritional status, body mass index, and the risk of falls in community-dwelling older adults: a systematic review and meta-analysis. J Am Med Dir Assoc. 2019;20(5):569-582.

33.

Hooker

Frazier

Monahan

. Personality and coping among caregivers of spouses with dementia. Gerontol. 1994;34(3):386-392.

34.

Elliott

Burgio

DeCoster

. Enhancing caregiver health: Findings from the resources for enhancing Alzheimer’s caregiver health II intervention. J Am Geriatr Soc. 2010;58(1):30-37.

Predicting Caregiver Communications in a Geriatric Clinic

Abstract

Keywords

Introduction

Methods

Participants

Measures

Communication-related outcomes

Caregiver information

Patient information

Analyses

Multiple imputation

Random forests

Results

Descriptive Statistics and Missing Data

Random Forests

Discussion

Footnotes

Declaration of Conflicting Interests

Funding

ORCID iDs

References