Sage Journals: Discover world-class research

Abstract

About 27% of patients with diabetes also suffer from depression, and the presence of co-morbid depression could increase the cost of care for diabetes by up to 100%. Several randomized clinical trials have demonstrated that physical and mental health are more likely to improve for diabetes patients suffering from depression when regular treatment for depression is provided in a primary care setting (called Collaborative Care). However, Collaborative Care requires additional resource utilization costs and a separate reimbursement model. When managing Collaborative Care, clinics must balance patient health outcomes with the program’s financial sustainability. Important operational levers in Collaborative Care are allocating care managers’ time to patients based on their requirements and the care managers’ staffing level. This staffing and workload allocation influences the revenue, costs, and patient health outcomes. We present a novel Markov Dynamic Programing model that, unlike existing approaches, jointly optimizes both staffing levels and treatment policies for Collaborative Care programs and quantifies the costs and benefits of collaborative care. Mathematically, we model Collaborative Care management at the clinical level as an infinite-horizon Markov Dynamic Program. The objective is a weighted sum of total patient quality-adjusted life years (QALYs) and the clinic profits. The model incorporates insurance payment, resource utilization costs, and disease progression of co-morbid diabetes and depression. We derive structural properties for the joint optimization of the staffing level and allocating care managers’ time to different patient categories. Using these structural properties, we develop a practical and easy-to-implement policy for staffing level and care managers’ time allocation that performs close to the optimal solution. 
We calibrate the model with data from a large academic medical center and show that our solutions can improve total QALYs and clinic profits compared to current practices. Our analysis also reveals key insights into the effects of payment models on Collaborative Care. Profit under the fixed-fee model responds nonmonotonically to payment rate increases, highlighting complex financial dynamics. Fixed-fee models show a threshold behavior, with high-intensity treatments becoming optimal only above certain payment rates. This threshold varies with the profit-QALY weight balance and is lower under joint optimization than under treatment-only optimization.

Keywords

Chronic Care Management, Depression, Diabetes, Dynamic Programing

1. Introduction

Healthcare spending in the U.S. reached $4.9 trillion in 2023, with approximately 75% of the costs incurred by patients with multiple chronic conditions.¹,² Mental health disorders such as depression commonly co-occur with chronic physical conditions such as cancer, congestive heart failure, diabetes mellitus, and Parkinson’s disease. About one-third of patients with severe chronic illnesses also suffer from depression.³

Patients with multiple chronic conditions, especially mental health issues, often receive fragmented care from different providers without central coordination. This fragmentation frequently causes poor communication, leading to worse outcomes, more emergency visits and hospitalizations, and higher costs (Kern et al., 2018). Patients with co-morbid depression and diabetes exhibit behaviors like reduced physical activity, overeating, inadequate self-care, and medication nonadherence. Consequently, they have poorer health, higher mortality, and cost 50–100% more than patients with only diabetes (Unützer et al., 2009). Despite strong evidence that treating depression improves diabetes outcomes and costs, psychiatric access is limited due to mental health provider shortages (Dampier, 2018), lack of insurance coverage (Bishop et al., 2014), and social stigma (Clement et al., 2015).

To address this, Collaborative Care was designed at the University of Washington (Unützer et al., 2002) to integrate mental and physical health treatment. Collaborative Care is a systematic treatment program for improving access to mental health resources for patients with co-morbid physical and mental health conditions. Under this program, mental health services are provided within the Primary Care Provider’s (PCP’s) office by a care manager who is typically a health professional such as a Licensed Social Care Worker. The care manager regularly consults with the PCP and often with a psychiatric consultant to provide regular follow-up and timely modification to the patient’s care plan, behavioral health therapies, and changes or adjustments to medication.

Over the past 20 years, more than 80 randomized trials have demonstrated Collaborative Care’s efficacy for depression, including the most extensive study to date, which enrolled over 1,800 patients and showed a doubling of depression remission rates (Unützer et al., 2002). Consequently, the U.S. Surgeon General and the President’s Commission have endorsed it as a “best practice” (Unützer and Ratzliff, 2015).

Despite evidence of collaborative care’s effectiveness, adoption has been slow due to high maintenance costs and a lack of clear payer reimbursement pathways to cover implementation costs, resulting in uncertainty about the program’s financial sustainability. While randomized trials have typically focused on the clinical outcomes, the financial ramifications of the Collaborative Care program in practical implementation have not been adequately studied.

We partner with the Departments of Psychiatry and Family Medicine at a large academic medical center in the US to develop a model-driven approach to aid decision-making for managing the Collaborative Care program. We develop and test the Collaborative Care program for patients suffering from depression and diabetes. While Collaborative Care can be delivered to patients for any combination of mental health conditions, such as Generalized Anxiety Disorder, and physical conditions, such as hypertension, we select depression and diabetes as this combination constitutes the largest population of patients with co-morbid physical and mental health conditions and contributes a significant cost to the health system. Our model can easily adapt to other conditions with appropriate parameter value changes.

There are two primary levers for managing resources for Collaborative Care. The first lever is allocating care managers’ time to patients based on their needs. More complex patients require more time from the care manager, which increases the care managers’ workload and, in a fee-for-service payment model, the revenue per patient. The second operational lever is the staffing level of care managers. Clinics often meet workload requirements with regular staffing and more expensive overtime hours. For a given policy for allocating care managers’ time to patients, lower staffing levels lead to higher overtime costs, whereas higher staffing levels lead to higher wages for regular employees. Therefore, operating Collaborative Care efficiently requires jointly optimizing the policy for allocating care managers’ time based on patient needs and the staffing level of care managers. In this paper, we model the Collaborative Care program as an infinite-horizon Markov Dynamic Program whose objective reflects the clinic’s two goals: patient health outcomes, measured in quality-adjusted life years (QALYs) gained, and clinic financial profits. The decision variables are the care manager’s time allocated to each patient type and the number of care managers required to manage the Collaborative Care program. Markovian dynamics are a standard modeling assumption in healthcare, supported by prior literature on both diabetes and depression, and are widely adopted for their analytical tractability and acceptance in the field (Denton, 2018). We then derive structural results for the joint optimization of the allocation of care managers’ time based on patient needs and the number of care managers. 
Using these structural properties of the optimal policy, we develop a computationally efficient and easy-to-implement model-based policy to determine the optimal allocation of care managers’ time for treatment and the staffing level. We validate our model and solution approach by testing on data from a large academic medical center. Our data consists of clinical lab tests and depression treatment over five years for 17,152 unique patients suffering from co-morbid depression and diabetes. We use a prediction-based method to estimate the model parameters from this historical patient data.

Finally, we perform a comprehensive set of numerical simulations to show that the policy recommended by the model can improve patient health and clinic profits compared to current practices and other heuristic approaches. We also derive several managerial insights on the impact of different aspects of collaborative care under different payment policies. These insights can guide clinic managers in managing collaborative care and help policy and insurance firms design payment policies for programs like collaborative care.

In this research, we do not consider granular decisions such as prescribing personalized depression treatment for each patient and scheduling individual appointments with the care managers. While optimizing such decisions could be an interesting problem to consider, it is beyond the scope of the current paper, where we consider the issue from a tactical perspective to set policy on care manager time for treatment across patient categories and overall staff levels.

The main contributions of this work are as follows. Our research is the first to study optimal resource planning and allocation for collaborative care management. We develop properties for optimal treatment plans, considering both the diabetes and depression state of a patient. Using real data and a practical policy, we provide insights on the improvements that a healthcare facility could gain by optimizing treatment and staffing plans in collaborative care. Finally, we provide managerial insights on the impact of arrival rates, insurance plans, and the relative weighting of financial and quality outcomes.

The remainder of the paper is organized as follows. In Section 2, we discuss the literature related to our work. Section 3 describes the care management process in detail. In Section 4, we provide a detailed mathematical formulation of our model. In Section 5, we describe the theoretical results. In Section 6, we describe the estimation of the model parameters. In Section 7, we provide details of our computational analysis, and we follow that up with managerial and policy insights in Section 8. Finally, we conclude with future directions of research in Section 9.

2. Literature Review

Our work is primarily related to capacity management in healthcare operations. Related literature on chronic care management can be divided into four groups.

The first group of literature pertains to managing screening and treatment for a patient or a population of patients. Representative papers in this group are Alagoz et al. (2007), Shechter et al. (2008), and Ayer et al. (2012), which determine optimal patient screening or treatment decisions. The system state in these models is typically the health state of a single patient. Deo et al. (2013), Ayer et al. (2019), Lee et al. (2019), and Adams et al. (2023) model the sequential screening or treatment decision for a population of patients under resource constraints. The system state is typically the vector of the health status of each patient in the system. In these models, the decision-maker chooses which patients to treat or screen each period. An alternate approach is to formulate the system state as a compartmental model. In this formulation, the system state is the vector of the number of patients in each health state. Natarajan and Swaminathan (2017) uses this approach to allocate inventory procured through donor funding to patients in different health states. Ho et al. (2019) also uses the compartmental model for resource allocation for population disease management.

The second group of papers relates to capacity management for healthcare with patient disease dynamics. In the context of primary and chronic care, Liu et al. (2024) studies capacity allocation for the patient’s first appointment in coordinated care networks, where patients must visit multiple providers. Bavafa et al. (2019) investigates the impact of integrating nonphysician providers into primary care, analyzing their effects on patient health dynamics, physician decision-making, and overall healthcare outcomes. They also investigate the impact of physician payment schemes. Diamant (2021) investigates the scheduling practices of multistage outpatient health programs. In the context of behavioral health, Baek et al. (2023) investigates the problem of optimizing personalized patient interventions. In the context of inpatient workload, Liu et al. (2019) and Kamalahmadi et al. (2023) investigate capacity management to deliver care to patients in a hospital. Ahuja et al. (2020) shows empirical evidence that co-location and continuity in care improve health outcomes, and the continuity of care is more critical for more complex patients. They find that maintaining continuity of care provides significant gains for patients suffering from mental health conditions.

Our work is also related to articles such as Simon et al. (2001), Liu et al. (2003), and Yildirim et al. (2021). These papers evaluate the cost-effectiveness of Collaborative Care and depression screening programs through observational or simulation analysis. These papers typically do not optimize resource allocation decisions, and existing literature in this stream has not compared different payment policies.

More broadly, our study corresponds with the broader body of literature advocating for the use of data and analytics to enhance healthcare operations. Dai and Tayur (2020) offers an extensive examination of recent research in healthcare operations, while de Véricourt and Perakis (2020) and Mišić and Perakis (2020) concentrate on data analytics techniques for enhancing operational effectiveness in various organizations.

While there are many interesting papers in the fields listed above, our paper contributes to this literature in four ways. First, ours is the first paper to study capacity and care management for the Collaborative Care model with physical (diabetes) and mental (depression) disease progression. The CMS has identified chronic care management of patients with multiple conditions as a critical component of primary care.⁴ Our modeling approach is general enough to be adapted to other chronic care management programs with the appropriate model parameter values.

Second, right-sizing care management staffing levels and allocating care managers’ time based on patient severity are essential drivers for improving patient health outcomes and ensuring clinic financial sustainability. We contribute to the literature on chronic care management by explicitly capturing this trade-off. We derive structural properties for the joint optimization of staffing levels and treatment policies that determine the time allocated to patients for behavioral health services based on patient needs.

Third, we derive a computationally efficient and easy-to-implement model-based policy for simultaneous optimization of the treatment policy and staffing levels. Lastly, we derive relevant managerial insights on the different aspects of collaborative care under different payment models.

3. Collaborative Care for Depression and Diabetes in Primary Care

In the Usual Care (Unützer et al., 2002) model of treating diabetes patients with co-morbid depression, the PCP treating the patient for the physical health condition refers the patient to a psychiatrist if they think that the patient also requires psychiatric care. The patient is then responsible for setting up an appointment and subsequent follow-up with the psychiatrist. However, as described in Section 1, this Usual Care model has several challenges: Long wait times for appointments due to a shortage of available psychiatrists, failure of patient compliance, lack of regular patient follow-up, and difficulties with insurance coverage.

The Collaborative Care delivery model helps alleviate several of these issues. The principal change from the Usual Care model is the addition of a care manager to the PCP’s office. The care manager provides mental health services to the patient from the PCP’s office in consultation with the PCP and a psychiatric consultant. The primary activities of the care manager include: developing a detailed longitudinal care plan for each patient with benchmarks and goals for improvement; providing psychotherapy if required; counseling using evidence-based techniques such as motivational interviewing, behavioral activation, and problem-solving treatment; performing proactive follow-up; managing medication; managing patient registries for depression; participating in monthly case reviews with the PCP; and submitting monthly time and billing statements for approved reimbursement.

The care manager performs the above actions in person, via computer, or by telephone. While some activities can be automated, those related to interactions with the patient are in person. The above set of activities provides the patient with timely, regular, and systematic care for their depression. This care is often offered at the same location where they get treatment for their physical health condition. Insurance providers such as Medicare, Aetna, Blue Cross Blue Shield, and others have instituted billing codes that reimburse care manager activities as services received within primary care.⁵ Thus, the Collaborative Care program mitigates the challenges related to timely access to professional mental health services, unnecessary use of one-on-one visits with a psychiatrist, insurance coverage, and stigma associated with visiting a psychiatrist.

The most common implementation of collaborative care is the Improving Mood Promoting Access to Collaborative Treatment Trial (IMPACT) model, which was implemented by the University of Washington’s AIMS Center.⁶ We base our modeling assumptions on the IMPACT model. The main steps of the IMPACT model are as follows:

Enrollment: The clinic identifies high-risk patients and offers them enrollment in the Collaborative Care program.

Measurement-Based Care: The care managers routinely follow up and administer tests to measure the severity of mental and physical health conditions. These lab tests determine the current health state of the patient.

Stepped-Care: Patients are divided into two groups based on their current health state. The low-severity group receives maintenance interventions such as counseling, psycho-education, and recommendations for physical exercise. The high-severity group receives more intensive treatments, such as psychotherapy, and more frequent follow-ups than the low-severity group.

Discharge: Patients who have satisfied predefined discharge criteria are discharged from the Collaborative Care program. Some patients drop out of the program on their own, even if they have not met the discharge criteria. These patients may continue to receive their diabetes care at the PCP as before.

The stepped-care plan specifies the time the care manager spends on each patient. Consequently, it determines the future patient health progression, the resource utilization costs, and the fee-for-service revenue. Therefore, managing the stepped-care plan is the primary resource allocation decision of the Collaborative Care model. In the next section, we mathematically model the above characteristics of the Collaborative Care program.

4. Model

We model Collaborative Care for depression and diabetes as a discounted infinite-horizon MDP. A period in our model is a month. There are two reasons for having a month as a period. First, insurance firms pay clinics monthly under existing payment models for Collaborative Care. Second, the Collaborative Care program specifies monthly monitoring, follow-up, and adjustment of stepped care for patients.

Next, we define the state variables, transition probabilities, decision variables, and the objective function of the MDP model. We provide references and estimation results for model parameters in Section 6. A schematic diagram of the steps in the Collaborative Care program appears in Section EC.1 of the Electronic Companion, and a table of the mathematical notation used in the model appears in Section EC.2.

4.1. Patient Health State and System State

The health state of an enrolled patient is defined by a combination of their physical and mental health state. A Patient Health Questionnaire-9 ($PHQ9$) score determines the patient’s mental health state. $PHQ9$ is a nine-item multiple-choice self-reported questionnaire, a standard and validated measure used for depression monitoring (Kroenke and Spitzer, 2002). The patient may take the $PHQ9$ themselves or be administered a $PHQ9$ by the care manager, in person or via telephone. The questionnaire responses result in a $PHQ9$ score ranging from 0 to 27. Consistent with clinical literature (Collins et al., 2009), we divide the $PHQ9$ score into four categories. We show the classification of $PHQ9$ into four categories in Table 1. These four categories define the set of possible mental health states $J$, such that $J = \{1, 2, 3, 4\}$.

Table 1.
Patient Health Questionnaire-9 ($PHQ9$) and depression severity.

Depression health state $(j)$	$PHQ9$ score	Depression severity
1	0–4	None-minimal
2	5–9	Mild
3	10–14	Moderate
4	15+	Severe
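As an illustration of Table 1, the mapping from a raw $PHQ9$ score to the depression health state $j$ can be sketched in Python. This is a minimal sketch, not part of the original model description: the function name `depression_state` is hypothetical, and only the cut-offs are taken from Table 1.

```python
def depression_state(phq9_score: int) -> int:
    """Map a PHQ9 score (0-27) to the depression health state j in {1, 2, 3, 4}.

    Cut-offs follow Table 1; the function name is an illustrative assumption.
    """
    if not 0 <= phq9_score <= 27:
        raise ValueError("PHQ9 scores range from 0 to 27")
    if phq9_score <= 4:
        return 1  # none-minimal
    if phq9_score <= 9:
        return 2  # mild
    if phq9_score <= 14:
        return 3  # moderate
    return 4      # severe (15 and above)
```

For instance, a patient scoring 12 on the questionnaire would be placed in state 3 (moderate).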

The patient’s diabetes severity defines the patient’s physical health state. Consistent with standard clinical practices of monitoring diabetes, we use $HbA1c$ to measure diabetes severity. $HbA1c$ reflects the average blood sugar level over the preceding two to three months and is expressed as a percentage. For diabetes patients, the American Diabetes Association defines an $HbA1c$ of less than 7% as being in glycemic control and an $HbA1c$ of 7% or higher as not in glycemic control.

We define the physical health state of the patient by the level of $HbA1c$. The physical health state of the patient is represented by $k$, where $k \in K$ and $K = \{1\ (HbA1c < 7\%),\ 2\ (HbA1c \geq 7\%)\}$. The tuple $(j, k) \in J \times K$ defines the patient’s physical and mental health. The sets of physical and mental health states, $J$ and $K$, are ordered so that a lower index corresponds to better health. For $(j, k), (j', k') \in J \times K$, we write $(j, k) \leq (j', k')$ if and only if $j \leq j'$ and $k \leq k'$. In this case, we say that $(j, k)$ is no worse than, or equivalently, weakly healthier than $(j', k')$.

We have two additional possible states for a patient: $d$ and $f$. The state $d$ is for patients discharged from the Collaborative Care program based on the predetermined discharge criteria. The state $f$ is for patients who drop out of the program, even if they have not met the discharge criteria. We refer to the health state of the patient as $s \in S$, where $S = (J \times K) \cup \{d, f\}$, and we use the notation $(j, k)$ when we need to refer explicitly to the depression and diabetes health states of the patient.

Then, $s \in S$ denotes the health state of a single patient. However, since we model all patients in the clinical system, we need to define the system state of all patients in the Collaborative Care program. We define $N_{s}$ as the number of patients with health state $s$ . Then, the system state for the entire Collaborative Care program is given by $N^{t} = (N_{s})_{s \in S}$ , a vector of the number of patients in each health state $s \in S$ at time $t$ .
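The state definitions in this subsection can be sketched in Python as follows. This is only an illustration under stated assumptions: the names `diabetes_state`, `no_worse`, and `system_state` are hypothetical, and representing the system state as a dictionary of counts is one convenient choice, not necessarily how the authors implement it.

```python
from collections import Counter
from itertools import product

J = (1, 2, 3, 4)                 # depression states, lower index is healthier
K = (1, 2)                       # diabetes states: 1 if HbA1c < 7%, 2 otherwise
ENROLLED = tuple(product(J, K))  # all (j, k) pairs
S = ENROLLED + ("d", "f")        # plus discharged (d) and drop-out (f) states


def diabetes_state(hba1c_percent: float) -> int:
    """Physical health state k from an HbA1c reading in percent."""
    return 1 if hba1c_percent < 7.0 else 2


def no_worse(s, s_prime) -> bool:
    """True if s = (j, k) is weakly healthier than s_prime = (j', k')."""
    return s[0] <= s_prime[0] and s[1] <= s_prime[1]


def system_state(patient_states):
    """Count vector N = (N_s) over all s in S for a list of patient states."""
    counts = Counter(patient_states)
    return {s: counts.get(s, 0) for s in S}
```

Note that the ordering is only partial: states such as $(1, 2)$ and $(2, 1)$ are not comparable, so `no_worse` returns `False` in both directions for that pair.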

4.2. State Transition Probabilities

We first describe the state transition probability with one patient in the system. We use $e_{s}$ to denote the indicator vector over $S$ whose component for health state $s$ equals 1 and whose other components equal 0. That is, $N^{t} = e_{s}$ means that there is one patient in health state $s$ and no patients in any of the other health states.

Each patient has four types of state transitions: Enrollment, treatment, discharge, and drop-out. These transitions correspond to the steps of the Collaborative Care program described in Section 3.

4.2.1. Enrollment

Estimates of the proportion of patients in each depression $P H Q 9$ category $j \in J$ , and diabetes state $k \in K$ are known from demographic information of primary care patients. We denote these proportions by $η_{s}$ .

Each clinic implementing Collaborative Care typically has predetermined enrollment criteria such that only patients in certain health states are eligible for enrollment in the program. Let this set of eligible states be $E \subset J \times K$ . Not all patients who are offered enrollment accept this offer. We parameterize the probability of acceptance as $ν$ . Then, the enrollment probability of a single patient into health state $s$ is given by:

\begin{aligned} P[N^{t+1} = e_{s} \mid N^{t} = 0] = ν_{s} = \begin{cases} ν η_{s}, & \forall s \in E, \\ 0, & \text{otherwise}. \end{cases} \end{aligned}

(1)
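Equation (1) can be illustrated with a short Python sketch. The function name `enrollment_probabilities` and the numerical values in the usage example are hypothetical and chosen purely for illustration; they are not estimates from the paper.

```python
def enrollment_probabilities(eta, eligible, nu):
    """Return nu_s = nu * eta_s for eligible states and 0 otherwise.

    eta      -- dict mapping health state s to its population share eta_s
    eligible -- set E of health states eligible for enrollment
    nu       -- common probability that an enrollment offer is accepted
    """
    if not 0.0 <= nu <= 1.0:
        raise ValueError("nu must be a probability")
    return {s: (nu * share if s in eligible else 0.0) for s, share in eta.items()}


# Illustrative (made-up) inputs: two eligible states and one ineligible state.
example_eta = {(3, 1): 0.2, (4, 2): 0.1, (1, 1): 0.7}
example_E = {(3, 1), (4, 2)}
example_nu_s = enrollment_probabilities(example_eta, example_E, nu=0.5)
```

Because a single acceptance probability is applied to every eligible state, the relative mix of enrolled patients across eligible states mirrors the mix $η_{s}$ in the eligible panel.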

In modeling patient enrollment, we assume a common acceptance probability $ν$ across eligible health states. This choice is motivated by three considerations. First, clinics generally track overall acceptance rates but do not collect data stratified by patient severity, making state-specific estimates difficult to obtain. Second, a uniform probability yields a parsimonious model that facilitates calibration and interpretation of managerial insights. Third, patients’ depression treatment-seeking behavior is often shaped by factors such as stigma or insurance status (Mojtabai and Olfson, 2006; Tanner, 2024), and care managers typically reach out to all eligible patients using standardized criteria, which reduces systematic variation by health state. To test the impact of this assumption, we performed a robustness analysis allowing for different $ν$ across states. The results show that our policy recommendations and structural insights are qualitatively unchanged under moderate heterogeneity in enrollment behavior. We present these results in Section EC.11.3 of the Electronic Companion.

In our model, the enrollment probability is not a function of the current number of patients in the system. This reflects how Collaborative Care is typically implemented: Clinics maintain a registry of patients diagnosed with depression and have visibility into the broader panel of patients with diabetes in primary care, using that information to enroll patients. It is indeed possible that the intensity of enrollment efforts may sometimes depend on the existing number of patients in Collaborative Care at a facility. Such a model of optimal admission control could be an interesting research problem. However, our focus in this paper is to develop a tactical planning model that provides insights into optimal treatment policy and the staffing levels required to serve the projected number of enrolled patients. As a result, we assume that enrollment probability is independent of the existing number of patients.

Further, the proportions $η_{s}$ refer to the distribution of health states within the clinic-defined patient panel who are eligible to be enrolled and who are currently undergoing diabetes treatment with a PCP in the clinic. Marketing or interclinic competition does not influence enrollment decisions, as each clinic manages enrollment independently.

In line with standard practice, our model does not allow for re-enrollment of patients who have previously been discharged from Collaborative Care. Collaborative Care evaluations have shown that patients who reach remission tend to maintain symptom improvement over time (Gilbody et al., 2006), supporting the assumption that re-enrollment is rare.

4.2.2. Discharge

Discharge is a deterministic process. Certain health states for enrolled patients are determined to be eligible for discharge. We call these sets of states $D$ . If $(j^{d}, k^{d})$ is the defined discharge threshold, then patients with health states $(j^{d}, k^{d})$ or better are discharged.

Therefore, $D = \{(j, k) \in J \times K \mid (j, k) \leq (j^{d}, k^{d})\}$. If the health state of an enrolled patient in the current period is such that they are eligible for leaving the program, then they are moved to the discharged state $d$ in the next period. Mathematically:

\begin{aligned} P[N^{t+1} = e_{d} \mid N^{t} = e_{s}] = \begin{cases} 1, & \forall s \in D, \\ 0, & \text{otherwise}. \end{cases} \end{aligned}

(2)

We define the discharge eligible set $D$ to include all health states $s$ that satisfy the discharge criterion. The model is flexible and can accommodate any criterion of the form $j \leq j^{d}$ and $k \leq k^{d}$ . If only one condition is required to be met, for instance, discharge based solely on depression severity $j \leq j^{d}$ , then the other threshold (e.g., $k^{d}$ ) can be set to the highest possible severity level. In this case, discharge depends exclusively on the depression health state. This structure allows the model to represent clinical protocols in which patients are discharged upon achieving the criterion in either one or both conditions. All structural results presented in Section 5 hold under this general formulation.
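The construction of the discharge-eligible set from a threshold $(j^{d}, k^{d})$ can be sketched as follows. The function name `discharge_set` is a hypothetical helper introduced only for this illustration.

```python
from itertools import product

J = (1, 2, 3, 4)  # depression states
K = (1, 2)        # diabetes states


def discharge_set(j_d: int, k_d: int):
    """States (j, k) that are no worse than the threshold (j_d, k_d)."""
    return {(j, k) for j, k in product(J, K) if j <= j_d and k <= k_d}


# Setting k_d to the highest diabetes severity removes the diabetes condition,
# so discharge depends on depression severity alone.
depression_only = discharge_set(j_d=1, k_d=2)
# Requiring both conditions gives a strictly smaller set.
both_conditions = discharge_set(j_d=1, k_d=1)
```

In this sketch, `depression_only` contains $(1, 1)$ and $(1, 2)$, whereas `both_conditions` contains only $(1, 1)$.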

In our numerical experiments, we adopt the discharge rule from the IMPACT model described in Section 3. Under this protocol, patients are discharged from Collaborative Care after they achieve remission from depression, defined as $PHQ9 < 5$, regardless of their diabetes control status. Patients continue to receive diabetes management through their PCP after discharge from Collaborative Care. Accordingly, in Section 6.7, we set the discharge threshold $j^{d} = 1$, corresponding to $PHQ9 < 5$ (see Table 1), to reflect the discharge criteria and set $k^{d} = 2$, the highest possible diabetes severity level, to reflect the absence of a discharge condition based on diabetes. Discharge decisions are governed by clinical thresholds, such as $PHQ9 < 5$, rather than by the system state or capacity constraints. As a result, the discharge probability depends solely on the patient’s health state and is not influenced by the number of patients currently enrolled in the program.

4.2.3. Drop-Out

Several patients undergoing treatment leave the program even if they have not achieved the discharge criteria. We assume a patient can drop out each month with a probability $ϕ$ . Then, the transition probability of drop-out is given by:

\begin{aligned} P[N^{t+1} = e_{f} \mid N^{t} = e_{s}] = ϕ \quad \forall s \in J \times K,\ s \notin D,\ s \notin \{d, f\} \end{aligned}

(3)

While we assume a common attrition probability $ϕ$ across health states, this assumption reflects the lack of reliable empirical data on state-specific attrition. In the Electronic Companion (Section EC.11.3), we extend the formulation to allow for $ϕ_{s}$ varying by state and show that our results are robust to moderate heterogeneity in attrition rates.

4.2.4. Monitoring and Treatment

Each month, the care manager administers a $P H Q 9$ and an $H b A 1 c$ test to enrolled patients. These lab tests determine the current health state of the enrolled patient.

For all enrolled patients, i.e., $s \in J \times K$, the care manager, based on the patient’s current health state, performs either low-intensity maintenance interventions, such as counseling, psycho-education, and recommending physical exercise, or more intensive treatments, such as antidepressants and psychotherapy. Typically, the higher-intensity intervention requires two follow-up meetings per month (once every two weeks), whereas the less intensive maintenance activities require one follow-up per month (Liu et al., 2007).

We model the two levels of treatment as $T = \{Low (L), High (H)\}$ , where $Low$ denotes the lower-intensity intervention and $High$ denotes the higher-intensity intervention. Then, $a (N) = \{a_{s} (N) \in T, \forall s \in J \times K\}$ represents the treatment policy for a given system state $N$ . The treatment policy determines the level of intervention provided to patients in health state $s$ when the system state is $N$ .

Consider an enrolled patient who is not in a discharge-eligible state, the discharged state ( $d$ ), or the attrition state ( $f$ ), that is, $s \in J \times K$ , $s \notin D$ , and $s \notin \{d, f\}$ . Given the patient’s current health state and the treatment level $a_{s} \in T$ , the patient transitions to health state $s^{'} \in J \times K$ with probability $p_{s, s^{'}}^{a_{s}}$ . Additionally, the patient stays enrolled and does not drop out of the system through attrition with probability $(1 - ϕ)$ . Therefore, the treatment transition probability when there is one enrolled patient in the system is given by:

\begin{aligned} P [N^{t + 1} = e_{s^{'}} \mid N^{t} = e_{s}, a_{s}] = (1 - ϕ) \, p_{s, s^{'}}^{a_{s}} \\ \forall s \in J \times K, \ s^{'} \in J \times K, \ s \notin D, \ s \notin \{d, f\} \end{aligned}

(4)

Enrolled patients could also transition to either the discharged state $d$ or to the attrition state $f$ , as described in Sections 4.2.2 and 4.2.3.

Patients in the attrition state remain in the attrition state, and patients in the discharged state remain in the discharged state; that is, $d$ and $f$ are absorbing states.

Combining equations (2)–(4), we define the transition probabilities of a patient as:

\begin{aligned} & P [N^{t + 1} = e_{s^{'}} \mid N^{t} = e_{s}, a_{s}] \\ & = \begin{cases} (1 - ϕ) \, p_{s, s^{'}}^{a_{s}}, & \text{if } s, s^{'} \in J \times K, \ s \notin D, \ s \notin \{d, f\}, \\ ϕ, & \text{if } s \in J \times K, \ s \notin D, \ s^{'} = f, \\ 1, & \text{if } s \in D, \ s^{'} = d, \\ 1, & \text{if } s = d, \ s^{'} = d, \text{ or } s = f, \ s^{'} = f, \\ 0, & \text{otherwise}. \end{cases} \end{aligned}

(5)

The above expressions describe the transitions of the system state when one patient is in the system. We assume $m$ patients arrive at the primary care clinic each month and are potential patients for enrollment. These are patients identified as potentially appropriate for Collaborative Care based on their current clinical status, for example, a PHQ9 score above 9 and a diagnosis of diabetes. Whether an arriving patient actually enrolls is governed by the enrollment transition probabilities, as defined in Equation (1).

After enrollment, each patient undergoes monitoring, treatment, drop-out, and discharge as described above. We also assume that all patients in a given health state in a given time period are treated identically and their transition probabilities to the new state are identical.

A closed-form representation of the system state transition probability as a function of the individual patient transition probability does not exist. However, given the individual transition probabilities described above, we can simulate the system transition.
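
As a rough illustration of how such a simulation could be set up, the sketch below builds the single-patient transition matrix of equation (5) and then draws next-period patient counts state by state. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the two-state example, and all numerical values are hypothetical, and arrivals and enrollment (Equation (1)) are left out.

```python
import numpy as np


def patient_kernel(p, phi, discharge_eligible):
    """Single-patient transition matrix in the spirit of equation (5).

    p[s, s2] is the health-transition probability from state s to s2 under the
    treatment assigned to s; phi is the monthly drop-out probability;
    discharge_eligible[s] marks membership in D. Index S is d, index S + 1 is f.
    """
    S = p.shape[0]
    d, f = S, S + 1
    P = np.zeros((S + 2, S + 2))
    for s in range(S):
        if discharge_eligible[s]:
            P[s, d] = 1.0                      # s in D: discharged next period
        else:
            P[s, :S] = (1.0 - phi) * p[s]      # stays enrolled and moves to s2
            P[s, f] = phi                      # drops out
    P[d, d] = 1.0                              # d and f are absorbing
    P[f, f] = 1.0
    return P


def simulate_system_step(N, P, rng):
    """Draw next-period counts; patients sharing a state move independently."""
    N_next = np.zeros(len(N), dtype=np.int64)
    for s, n in enumerate(N):
        if n > 0:
            N_next += rng.multinomial(n, P[s])
    return N_next


# Hypothetical two-state example: state 0 is discharge eligible, state 1 is not.
p = np.array([[0.7, 0.3], [0.4, 0.6]])
P = patient_kernel(p, phi=0.05, discharge_eligible=np.array([True, False]))
rng = np.random.default_rng(0)
N = np.array([3, 10, 0, 0])                    # counts for states 0, 1, d, f
N_next = simulate_system_step(N, P, rng)
```

Because patients in the same state are treated identically, one multinomial draw per state is enough to move the whole group, which keeps the simulation cheap even when the number of enrolled patients is large.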

4.3. Objective Function

Clinics face the dual challenge of improving patient health outcomes and ensuring financial profitability. The success and longevity of clinical programs depend not only on delivering quality healthcare services but also on achieving positive financial outcomes. Although clinics are not solely profit-driven entities, financial considerations play a crucial role in sustaining their operations. Effective financial management and favorable financial outcomes are essential for supporting and expanding clinical programs, improving patient access, and fostering innovation in healthcare delivery. In formulating the objective function for the Collaborative Care program, it is necessary to capture the intricate relationship between clinical programs and their financial outcomes. The objective function should reflect the multifaceted considerations that contribute to sustainable operations and strike a balance between fiscal responsibility and the provision of quality patient care (Dai and Tayur, 2020).

Models for chronic care management come in various formulations that balance both financial and QALY factors. For instance, Deo et al. (2013) propose a model that maximizes patient QALYs under a fixed budget constraint, which is a common approach in cost-utility analysis. In contrast, Lee et al. (2008) discuss three different formulations in the context of dialysis therapy: a weighted sum of QALY and cost, a purely QALY-focused objective with cost as a constraint, and a cost-minimizing approach. Meanwhile, several studies, such as Chen et al. (2018), adopt a net monetary benefit model. This framework calculates net benefit by multiplying QALYs by the predetermined “willingness to pay” for each QALY, effectively translating health outcomes into financial terms. This approach is commonly used to assess societal cost-effectiveness, as it combines the financial and health benefits into a single metric for evaluating health interventions from a societal perspective.

However, in the context of a specific clinic that needs to balance patient health outcomes and clinic financial performance, the net monetary benefit model, which relies on societal willingness-to-pay values, may not be suitable. The focus here is not on maximizing social welfare but on optimizing resource allocation within the clinic itself. To capture the clinic-specific trade-offs, we propose using a weighted, normalized sum of QALYs gained and clinic profits. This formulation gives the clinic flexibility to balance patient outcomes with financial sustainability. It does so without relying on societal willingness-to-pay parameters, which are typically used in models that focus on social welfare.

We formulate the per-period reward for a given system state $N^{t}$ at time $t$ , treatment policy $a^{t}$ , and staffing level $g$ as

\begin{aligned} r (N^{t}, a^{t}, g) = (1 - θ) ζ_{Q} Q (N^{t}) + θ ζ_{π} [R (N^{t}, a^{t}) - C (N^{t}, a^{t}, g)] \end{aligned}

(6)

We define $Q (N^{t})$ as the total QALYs accrued in period $t$ , based on the number of patients in each health state and their associated utility weights. This reflects the ongoing value of maintaining or improving health status. $R (N^{t}, a^{t})$ is the system revenue, and $C (N^{t}, a^{t}, g)$ is the system cost. The parameter $θ$ is the weight on clinic profits, and $(1 - θ)$ is the weight on the QALYs. The parameter $θ$ can be varied to generate different Pareto optimal solutions with varying weights on clinic profit and patient outcome. Because of the discrete nature of treatment decisions, the feasible set in the (QALY, Profit) space may not be convex. As a result, varying the weighting parameter $θ$ may not capture all Pareto optimal solutions. Nonetheless, the scalarization approach provides operationally relevant trade-offs that inform practical decision-making. Clinics placing greater emphasis on financial objectives correspond to higher values of $θ$ .

We adopt this weighted-sum formulation because constrained MDPs are considerably more challenging to solve. The current approach provides computational tractability in solving an MDP with a large state space. This efficiency allows us to systematically explore solutions across the spectrum of clinic preferences and to characterize how care intensity and staffing recommendations shift with different emphases on health versus profit. An alternative formulation would be to maximize QALYs subject to a financial constraint. Such a constraint-based approach would not only be computationally challenging but would also limit our ability to derive managerial insights based on a clinic’s relative preference for QALYs over profits.

We note that the two parts of the objective function, QALYs and clinic profits, are quantities of different scales. Therefore, the two parts of the objective function need to be centered and normalized. We follow the standard normalization technique of multiobjective optimization (Mausser, 2006). In this method, each objective is centered and scaled. For example, the scaled and centered objective function for QALYs is $\frac{\sum_{s \in S} u_{s} N_{s}^{t} - Q_{L}}{Q_{U} - Q_{L}}$ , where $Q_{U}$ and $Q_{L}$ are the upper and lower bounds on QALYs. We use a similar approach for profits. This ensures that both parts of the objective function are between 0 and 1. In the Electronic Companion (Section EC.3), we describe the procedure to compute the upper and lower bounds for scaling and the expressions for the normalization parameters $ζ_{π}$ and $ζ_{Q}$ in terms of these bounds. We omit the centering parameters for conciseness, as they constitute a constant term in the objective function. Since the scaling and centering procedure is a linear transformation of the objective, all our results also hold for the scaled and centered version of the objective function.

Given the requirements specific to our context, we have adopted a multiobjective formulation as our modeling approach. This formulation provides a comprehensive framework for addressing the dual objectives of patient health and clinic financial outcomes, ultimately supporting effective decision-making in healthcare management.

Next, we expand the expressions for QALY, revenue, and costs to characterize the reward function.

For a patient in health state $s$ , the Quality of Life (QoL) score is given by $u_{s}$ . We describe the source data for the QoL scores for other health states in Section 6. We note that successfully discharged patients continue to accrue QALY with a QoL score of $u_{d}$ , given by the average QoL of a patient with only diabetes. Patients who drop out of the program through attrition accrue a QALY of zero, therefore $u_{f} = 0$ .

To compare different policies, we compute the QALY gain, which is the difference between the total QALY of enrolled patients and the QALY for the no-treatment policy, that is, $\sum_{s \in S} u_{s} N_{s}^{t} - QALY (NoTreatment)$ . This is a standard approach in the literature to compare QALY between different policies (Ayer et al., 2012). For conciseness, we omit the no-treatment QALY term in the model. Therefore, $Q (N^{t}) = \sum_{s \in S} u_{s} N_{s}^{t}$ .

The revenue is determined by whether the Collaborative Care program follows a fee-for-service payment model ( $β = 1$ ) or a per-member-per-month payment model ( $β = 0$ ). In a fee-for-service model, the revenue is determined by the intensity of treatment provided. As we describe in Section 6, the fee-for-service for Collaborative Care is a tiered model. For the higher-intensity treatment, the payment is $μ_{f}^{H}$ , and for the lower-intensity treatment, the payment is $μ_{f}^{L}$ . We model the fee-for-service revenue from one patient in the health state $s \in J \times K, s \notin D$ as $μ_{f}^{a_{s}}$ . Therefore, the total revenue from all patients in a fee-for-service model is $\sum_{s \in J \times K, s \notin D} N_{s} μ_{f}^{a_{s}}$ . We note that revenue accrues only from enrolled patients; discharged, discharge-eligible, and attrition patients do not generate any revenue.

In a per-member-per-month payment model, the insurance firm pays the healthcare facility a fixed payment ( $μ_{c}$ ) for every enrolled patient, irrespective of the level of treatment. Therefore, in a per-member-per-month model, the total revenue is $\sum_{s \in J \times K, s \notin D} N_{s} μ_{c}$ . Thus, the revenue in each period is given by $R (N^{t}, a^{t}) = β \sum_{s \in J \times K, s \notin D} N_{s}^{t} μ_{f}^{a_{s}^{t}} + (1 - β) μ_{c} \sum_{s \in J \times K, s \notin D} N_{s}^{t}$ .

Each care manager is available for $w_{0}$ hours per month, which defines one Full-Time Equivalent (FTE). The total care manager time available in a month is therefore $w_{0} \times g$ hours. In large healthcare systems, care managers often work across multiple clinical departments, which means fractional FTEs are common. Therefore, we model $g$ as a continuous variable.

The cost of operating the Collaborative Care program is based on the salary and overhead costs for $g$ FTE care managers, which is given by $c_{c m} \times w_{0} \times g$ .

Patients under acute treatment with more frequent follow-ups require $w^{H}$ minutes of care manager time per month, and patients with less intensive follow-ups require $w^{L}$ minutes from the care manager per month. Therefore, if patients in health state $s$ receive treatment $a_{s} \in \{H, L\}$ , and there are $N_{s}$ patients in health state $s$ , then the total workload is $\sum_{s \in J \times K, s \notin D} N_{s}^{t} w^{a_{s}^{t}}$ . We note that discharge-eligible, discharged, and attrition patients do not require care manager time.

Patients enrolled in the Collaborative Care program receive treatment according to the treatment plan in every period. Patients are never turned away for lack of available care manager time. The clinic manages this by acquiring additional care manager hours at an overtime rate when required. The overtime cost of exceeding $w_{0} g$ hours of work is $c_{o} (\sum_{s \in J \times K, s \notin D} N_{s}^{t} w^{a_{s}^{t}} - w_{0} g)^{+}$ , where $(x)^{+} = \max \{0, x\}$ .

Thus, the per-period cost is given by $C (N^{t}, a^{t}, g) = c_{c m} w_{0} g + c_{o} (\sum_{s \in J \times K, s \notin D} N_{s}^{t} w^{a_{s}^{t}} - w_{0} g)^{+}$ .

Substituting the values of $Q$ , $R$ , and $C$ in equation (6), we get the following characterization of the reward function:

\begin{aligned} r (N^{t}, a^{t}, g) = {} & (1 - θ) ζ_{Q} \sum_{s \in S} u_{s} N_{s}^{t} \\ & + θ ζ_{π} \Big[ β \sum_{s \in J \times K, s \notin D} N_{s}^{t} μ_{f}^{a_{s}^{t}} + (1 - β) μ_{c} \sum_{s \in J \times K, s \notin D} N_{s}^{t} \\ & - c_{c m} w_{0} g - c_{o} \Big( \sum_{s \in J \times K, s \notin D} N_{s}^{t} w^{a_{s}^{t}} - w_{0} g \Big)^{+} \Big] \end{aligned}

(7)
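
To make the reward in equation (7) concrete, the following sketch evaluates it for a small hypothetical system. It is an illustration only: the function name, the four-state layout, and every number are assumptions, and the workload parameters are assumed to be expressed in the same time unit as $w_{0}$ .

```python
import numpy as np


def per_period_reward(N, u, treat_high, billable, *, theta, zeta_Q, zeta_pi,
                      beta, mu_f_high, mu_f_low, mu_c, w_high, w_low,
                      w0, g, c_cm, c_o):
    """Weighted QALY-plus-profit reward following the structure of equation (7).

    N[s] is the patient count in state s, u[s] the QoL weight, treat_high[s]
    is True when state s receives High treatment, and billable[s] marks the
    enrolled, non-discharge-eligible states that generate revenue and workload.
    """
    N = np.asarray(N, dtype=float)
    n_bill = np.where(billable, N, 0.0)
    fee = np.where(treat_high, mu_f_high, mu_f_low)
    load = np.where(treat_high, w_high, w_low)

    qaly = np.dot(u, N)
    revenue = beta * np.dot(n_bill, fee) + (1 - beta) * mu_c * n_bill.sum()
    workload = np.dot(n_bill, load)
    cost = c_cm * w0 * g + c_o * max(0.0, workload - w0 * g)  # overtime term
    return (1 - theta) * zeta_Q * qaly + theta * zeta_pi * (revenue - cost)


# Hypothetical inputs: two enrolled states, then d and f.
common = dict(
    N=[10, 5, 20, 3],
    u=np.array([0.6, 0.8, 0.85, 0.0]),
    treat_high=np.array([True, False, False, False]),
    billable=np.array([True, True, False, False]),
    theta=0.5, zeta_Q=1.0, zeta_pi=1.0,
    mu_f_high=150.0, mu_f_low=100.0, mu_c=120.0,
    w_high=2.0, w_low=1.0, w0=20.0, g=1.0, c_cm=50.0, c_o=80.0,
)
r_fee_for_service = per_period_reward(beta=1, **common)
r_per_member = per_period_reward(beta=0, **common)
```

In this example the workload (25 hours) exceeds the staffed capacity (20 hours), so the overtime term is active; raising `g` trades a higher salary cost against a lower overtime cost.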

4.4. Optimality Equation

The optimality equation for the MDP corresponding to the Collaborative Care treatment is then given by:

\begin{aligned} V (N) & = \max_{\substack{a (N) \in A \\ g \geq 0}} \Big[ r (N, a, g) + δ \sum_{N^{'} \in N} P [N^{'} \mid N, a] \, V (N^{'}) \Big] \end{aligned}

(8)

Since we perform steady state analysis, we drop the index $t$ for conciseness. Here, $V (N)$ is the optimal value function when the system is in state $N$ .

The optimal treatment policy is state-dependent, which means it depends on the number of patients in each health state. This is because the workload faced by the clinic, and hence the total overtime cost, which is convex in the total workload, depends on the distribution of patients across health states. As a result, the treatment decision for an individual patient may optimally depend on the current system state $N$ .

The reward function $r (N, a, g)$ is given by equation (7), and $P [N^{'} | N, a]$ is the system transition probability under treatment policy $a$ . The set of feasible treatment policies $A$ is defined as $A = \{(a_{s})_{s \in J \times K} \mid a_{s} \in \{Low, High\}\}$ . In the summation, $N$ denotes the set of feasible system states in the next period, and $δ$ is the discount factor.

5. Structural Properties

To characterize the optimal solution to equation (8), we restrict the model parameters to satisfy the following conditions:

Let $(j, k), (j^{'}, k^{'}) \in J \times K$ be such that $(j, k) \leq (j^{'}, k^{'})$ . Then Conditions 1-3 are given as:

Condition 1
$(1 - θ) ζ_{Q} (u_{(j, k)} - u_{(j^{'}, k^{'})}) \geq θ ζ_{π} β (μ_{f}^{H} - μ_{f}^{L})$
Condition 2

$\begin{aligned} \sum_{j^{″} \geq \hat{j}} P [e_{(j^{″}, \hat{k})} | e_{(j, k)}, a] & \leq \sum_{j^{″} \geq \hat{j}} P [e_{(j^{″}, \hat{k})} | e_{(j^{'}, k)}, a] \\ \forall (\hat{j}, \hat{k}) \in J \times K \end{aligned}$
(9)

$\begin{aligned} \sum_{k^{″} \geq \hat{k}} P [e_{(\hat{j}, k^{″})} | e_{(j, k)}, a] & \leq \sum_{k^{″} \geq \hat{k}} P [e_{(\hat{j}, k^{″})} | e_{(j, k^{'})}, a] \\ \forall (\hat{j}, \hat{k}) \in J \times K \end{aligned}$
(10)

Condition 3

$\begin{aligned} & \sum_{j^{″} \geq \hat{j}} \Big\{ P [e_{(j^{″}, \hat{k})} | e_{(j, k)}, (a_{(j, k)} = H)] - P [e_{(j^{″}, \hat{k})} | e_{(j, k)}, (a_{(j, k)} = L)] \Big\} \\ & \geq \sum_{j^{″} \geq \hat{j}} \Big\{ P [e_{(j^{″}, \hat{k})} | e_{(j^{'}, k)}, (a_{(j^{'}, k)} = H)] - P [e_{(j^{″}, \hat{k})} | e_{(j^{'}, k)}, (a_{(j^{'}, k)} = L)] \Big\} \end{aligned}$
(11)

$\begin{aligned} & \sum_{k^{″} \geq \hat{k}} \Big\{ P [e_{(\hat{j}, k^{″})} | e_{(j, k)}, (a_{(j, k)} = H)] - P [e_{(\hat{j}, k^{″})} | e_{(j, k)}, (a_{(j, k)} = L)] \Big\} \\ & \geq \sum_{k^{″} \geq \hat{k}} \Big\{ P [e_{(\hat{j}, k^{″})} | e_{(j^{'}, k)}, (a_{(j^{'}, k)} = H)] - P [e_{(\hat{j}, k^{″})} | e_{(j^{'}, k)}, (a_{(j^{'}, k)} = L)] \Big\} \end{aligned}$
(12)

The term $\sum_{j^{″} \geq \hat{j}} P [e_{(j^{″}, \hat{k})} | e_{(j, k)}, a]$ in Condition 2 can be interpreted as the probability of transitioning to a mental health state at least as severe as $\hat{j}$ when the patient is currently in health state $(j, k)$ . Then, equations (9) and (10) imply that, for the same treatment level, the probability of transitioning to a worse health state is higher when the patient is currently sicker. In other words, for a given treatment, healthier patients are less likely to end up in the worse health states.

Equation (11) in Condition 3 implies that the marginal difference in probability of improved health after one time period from $H i g h$ treatment over $L o w$ treatment is greater when the starting health state is sicker. This is equivalent to saying that sicker patients benefit more from $H i g h$ treatment than $L o w$ treatment.
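
A clinic wishing to check whether estimated transition matrices satisfy these conditions could do so numerically. The sketch below tests the depression-dimension inequalities, equations (9) and (11), on arrays indexed as `P[j, k, j2, k2]`. It is a hedged illustration: the function names, the array layout, and the toy three-level matrices are assumptions, not values from the paper.

```python
import numpy as np


def tail_over_depression(P):
    """tail[j, k, jh, kh] = sum over j2 >= jh of P[j, k, j2, kh]."""
    return np.flip(np.cumsum(np.flip(P, axis=2), axis=2), axis=2)


def satisfies_eq9(P, tol=1e-12):
    """Equation (9): tail probabilities are nondecreasing in the starting j."""
    tail = tail_over_depression(P)
    return bool(np.all(np.diff(tail, axis=0) >= -tol))


def satisfies_eq11(P_high, P_low, tol=1e-12):
    """Equation (11): the High-minus-Low tail gap is nonincreasing in j."""
    gap = tail_over_depression(P_high) - tail_over_depression(P_low)
    return bool(np.all(np.diff(gap, axis=0) <= tol))


# Toy example with three depression levels and one diabetes level.
P_low = np.array([[0.80, 0.15, 0.05],
                  [0.30, 0.50, 0.20],
                  [0.10, 0.30, 0.60]]).reshape(3, 1, 3, 1)
P_high = np.array([[0.80, 0.15, 0.05],
                   [0.40, 0.45, 0.15],
                   [0.25, 0.35, 0.40]]).reshape(3, 1, 3, 1)

cond2_ok = satisfies_eq9(P_low) and satisfies_eq9(P_high)
cond3_ok = satisfies_eq11(P_high, P_low)
cond2_reversed = satisfies_eq9(P_low[::-1])   # rows reversed: sicker rows first
```

The diabetes-dimension inequalities, equations (10) and (12), could be checked in the same way by taking tail sums along the last axis instead.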

This observation is supported by clinical studies, justifying Condition 3. Prior research, such as Fournier et al. (2010), highlights that the marginal effectiveness of intensive interventions (e.g., Cognitive Behavioral Therapy (CBT) or antidepressants) increases with depression severity. Specifically, patients with mild or moderate depression experience negligible additional benefits from intensive therapies compared to less intensive interventions. Conversely, patients with severe depression demonstrate substantial improvements with higher-intensity interventions, aligning with stepped-care protocols in Collaborative Care models. Furthermore, Schueller et al. (2015) have established PHQ-9 cutoffs that guide the initiation of higher-intensity CBT, indicating that patients with lower PHQ-9 scores derive limited benefit from such treatments, thereby supporting the rationale for allocating higher-intensity interventions to more severe cases.

Additionally, Kirsch et al. (2008), using both unpublished and published data submitted to the FDA for four new-generation antidepressants, found that the overall difference between the drug and placebo was clinically meaningful only in cases of very severe depression. Stepped-care approaches, widely utilized in Collaborative Care models, recommend allocating lower-intensity interventions to patients with milder symptoms to optimize resource use. These findings collectively support the notion that higher-intensity treatments are more beneficial for patients with greater initial severity, thereby validating Condition 3 within the framework of Collaborative Care models.

Next, given these conditions on parameter values, we obtain some structural properties of the model. These properties will help us arrive at a computationally efficient heuristic. All proofs of the propositions are provided in Section EC.8 in the Electronic Companion.
Proposition 1
Let $N^{0}$ be any feasible system state, and let $N$ and $N^{'}$ be two system states such that $N = N^{0} + e_{(j, k)}$ and $N^{'} = N^{0} + e_{(j^{'}, k^{'})}$ . Let $r (\cdot)$ be the per-period reward function as defined in equation (7). Then, for any feasible treatment policy, and under Condition 1:
$\begin{aligned} (j^{'}, k^{'}) \geq (j, k) ⟹ r (N^{'}, a) \leq r (N, a) \end{aligned}$
(13)

The above proposition implies that under Condition 1, the per-period reward is greater if an additional patient is in a healthier health state. In other words, the per-period objective increases when the health of enrolled patients improves.

Additionally, Proposition 1 establishes that, conditional on a given number of patients in the system, the clinic’s reward function is increasing in patient health. The proposition reflects the fact that healthier patients contribute more to QALY gains, and under Condition 1, lead to an increase in the objective function. Importantly, this result holds conditional on a nonempty system and should not be interpreted as suggesting that the clinic’s utility would continue to increase if the patient population vanished. Indeed, in the absence of any patients, both clinical outcomes and profits would be zero. Our model assumes a steady inflow of patients based on a given arrival rate, and the structural results in Proposition 1 are derived within this context.
Proposition 2
If $N$ and $N^{'}$ are as defined in the statement of Proposition 1 and Conditions 1, 2, and 3 hold, then $V (N^{'}) \leq V (N)$ .

The above proposition implies that the value function is greater when an additional patient is healthier.

These propositions lead us to the following result, which characterizes the optimal solution:
Proposition 3
Given Conditions 1-3, the following two statements are true:
(a) There exists an optimal treatment policy $a^{*} (N)$ such that for each $k \in K$ , there is $J (k, N) \in J$ such that $a_{(j, k)}^{*} = High$ for $j \geq J (k, N)$ and $a_{(j, k)}^{*} = Low$ for $j < J (k, N)$ .
(b) $J (k, N)$ is nonincreasing in $k$ .

Proposition 3(a) shows that for each diabetes state $k \in K$ , there is a $PHQ9$ category threshold below which it is optimal to provide all patients with the $Low$ treatment and at or above which it is optimal to give all patients the $High$ treatment. Thus, there is a switching curve in the physical-mental health space. This switching curve depends on the number and distribution of patients in the system, $N$ .

Proposition 3(b) shows that this switching curve is nonincreasing in the physical health state. In other words, if the patient does not have glycemic control, the higher-intensity treatment with more frequent follow-up should begin at a (weakly) lower $PHQ9$ score than for patients with glycemic control.
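
The switching-curve structure can be illustrated with a short sketch that turns a set of thresholds $J (k, N)$ , for one fixed system state $N$ , into a treatment assignment. The function name and the example thresholds are hypothetical and are chosen only to show the shape of the policy; they are not results from the paper.

```python
def threshold_policy(thresholds, num_depression_levels):
    """Build a treatment map from thresholds J(k, N) for one fixed state N.

    thresholds maps each diabetes level k to the smallest depression level j
    that receives High treatment. Levels are numbered from 1.
    """
    ks = sorted(thresholds)
    # Proposition 3(b): the threshold must not increase with k.
    if any(thresholds[a] < thresholds[b] for a, b in zip(ks, ks[1:])):
        raise ValueError("thresholds must be nonincreasing in k")
    return {
        (j, k): "High" if j >= thresholds[k] else "Low"
        for k in ks
        for j in range(1, num_depression_levels + 1)
    }


# Hypothetical thresholds: patients without glycemic control (k = 2) step up
# to High treatment one depression level earlier than those with control.
policy = threshold_policy({1: 4, 2: 3}, num_depression_levels=4)
```

Because the policy is fully described by one threshold per diabetes level, a search over policies only needs to consider nonincreasing threshold vectors rather than every assignment of treatments to health states.
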
6. Model Parameters for Collaborative Care for Depression and Diabetes

6.1. Transition Probabilities

We use a data-based approach to estimate $p_{(j, k) (j^{'}, k^{'})}^{a_{(j, k)}}$ . While there have been studies regarding the estimation of the transition probability of depression and diabetes separately (Huang, 2016; Varshney et al., 2020), to the best of our knowledge, no study has estimated the transition probability of patients suffering simultaneously from diabetes and depression.

6.1.1. Data Description

To estimate the transition probabilities, we use longitudinal EHR (Electronic Health Record) data of patients from a large academic medical center. This data was obtained from the electronic health record system, which contains details on patient visits, services received, and laboratory results such as the $P H Q 9$ and $H b A 1 c$ . We restrict our analysis to patients with depression and diabetes who are receiving psychiatric services and have $P H Q 9$ and $H b A 1 c$ lab tests. Our data consists of 17,152 patients from April 2014 to April 2019. These patients underwent Usual Care treatment, which involves measuring and treating patients at irregular intervals. We use this data to create a prediction model to predict the transition probability of patients undergoing monthly monitoring and treatment according to the Collaborative Care system.

The data consists of the following fields: Patient identifier, Date of PHQ9 test, PHQ9 score (PHQ9), Date of HbA1c test, HbA1c score (HbA1c), Date of depression therapy treatment, Current Procedural Terminology (CPT) code of depression therapy services provided, ED Visit Date, Psychiatry Visit date, Primary care visit date.

We categorize the treatment variable into the following three categories: $NoTreatment$ if no psychiatric treatment is provided in that time period, $High$ treatment if the psychiatric therapy provided in that month exceeds 60 minutes, and $Low$ treatment if the psychiatric therapy provided in that month is less than 60 minutes. This categorization is based on the Collaborative Care treatment thresholds provided by CMS. We discuss this in additional detail in Section 6.3.

We transform the above data to extract the following additional features for each $P H Q 9$ and $H b A 1 c$ lab test: Last observed $P H Q 9$ test value $(P H Q 9_{p r e v})$ , Last observed $H b A 1 c$ test value $(H b A 1 c_{p r e v})$ , Months since last $P H Q 9$ test $(M o n t h s_{P H Q 9, p r e v})$ , Months since last $H b A 1 c$ test $(M o n t h s_{H b A 1 c, p r e v})$ , Level of previous treatment $(T r e a t)$ , Months since last treatment $(M o n t h s_{t r e a t, p r e v})$ . We use this data to estimate the transition probabilities. We discuss the estimation of these transition probabilities in the next section.

We note several challenges with estimating the required transition probabilities from this data.

First, the measurements for depression and diabetes are not observed every month since PHQ9 and HbA1c for patients may be measured at irregular intervals under the usual care treatment. Therefore, there may be months in which only one of the depression or diabetes measurements may be recorded. Second, several months may pass between periods when depression therapy is provided to patients, and therefore, the transition to the next health state in the next period is not observed. To mitigate these challenges, we need two prediction models. First, we need to impute the missing health dimension for periods where only one dimension of the health state is observed. Second, we need a model-based method to predict the transition probability of patients when they are monitored and treated in every period. We discuss these two analyses below.

6.1.2. Imputing Missing Health States

Several approaches have been used to impute missing longitudinal data. The most common approach is linear or nonlinear interpolation: Shechter et al. (2008) use a spline-based interpolation method, and Lee et al. (2019) use a linear interpolation method. In our case, we could not use these interpolation-based methods because of the two-dimensional nature of the health state. Also, since we have other characteristics, such as the level of treatment, a data-driven method for predicting the missing data is more appropriate. We therefore use a regression-based method, which is a common strategy for handling missing data in clinical longitudinal studies (Zhang, 2016).

We create the following two linear regression models for $P H Q 9$ and $H b A 1 c$ .

\begin{aligned} P H Q 9 & = P H Q 9_{p r e v} + H b A 1 c_{p r e v} + T r e a t + M o n t h s_{P H Q 9, p r e v} \\ + M o n t h s_{H b A 1 c, p r e v} + M o n t h s_{t r e a t, p r e v} \end{aligned}

(14)

\begin{aligned} H b A 1 c & = P H Q 9_{p r e v} + H b A 1 c_{p r e v} + T r e a t + M o n t h s_{P H Q 9, p r e v} \\ + M o n t h s_{H b A 1 c, p r e v} + M o n t h s_{t r e a t, p r e v} \end{aligned}

(15)

Then, for months when the $PHQ9$ value is not observed, we predict it using equation (14), and similarly, we use equation (15) for months when the $HbA1c$ value is not observed. We validate the prediction model by comparing the predicted values to the actual values in the data set. We report the results of this analysis in the Electronic Companion in Section EC.6.1.
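
The imputation step can be sketched as an ordinary least-squares fit on the rows where the outcome is observed, followed by prediction on the rows where it is missing. This is a minimal sketch under stated assumptions: the helper names are hypothetical, the six predictors of equations (14) and (15) are assumed to be numerically encoded already (for example, with the treatment level coded as a number or as dummy variables), and the data are synthetic.

```python
import numpy as np


def fit_linear(X, y):
    """Ordinary least squares with an intercept; returns the coefficients."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef


def predict_linear(coef, X):
    """Predictions from coefficients whose first entry is the intercept."""
    A = np.column_stack([np.ones(len(X)), X])
    return A @ coef


def impute(X, y):
    """Fill NaN entries of y using a model fitted on the observed entries."""
    observed = ~np.isnan(y)
    coef = fit_linear(X[observed], y[observed])
    filled = y.copy()
    filled[~observed] = predict_linear(coef, X[~observed])
    return filled


# Synthetic, noiseless data with six predictors, purely for illustration.
rng = np.random.default_rng(1)
X = rng.normal(size=(40, 6))
true_coef = np.array([1.0, 0.5, -0.2, 0.3, 0.1, -0.4, 0.05])  # intercept first
y_full = true_coef[0] + X @ true_coef[1:]
y = y_full.copy()
y[::5] = np.nan                      # pretend every fifth value is missing
y_filled = impute(X, y)
```

With real data the fit would not be exact, which is why the predicted values are compared against held-out observed values as a validation step.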

6.1.3. Estimation of Transition Probability Matrices

After imputing the missing data, we use the completed data set to build a model that estimates the state transition probabilities.

First, we categorized the variables $P H Q 9$ and $H b A 1 c$ based on the categorization discussed in Section 4.1. Then, we formulate the following multinomial logit model for predicting the likelihood of transitioning to a specific health state based on the last known $P H Q 9$ , $H b A 1 c$ , and treatment level.

\begin{aligned} P (P H Q 9_{t} = j | P H Q 9_{p r e v} = j^{'}, H b A 1 c_{p r e v} = k^{'}, T r e a t, X_{p r e v}) \\ = \frac{e^{β_{j}^{T} [j^{'}, k^{'}, T r e a t, X_{p r e v}]}}{\sum_{j^{″} \in J} e^{β_{j^{″}}^{T} [j^{'}, k^{'}, T r e a t, X_{p r e v}]}} \end{aligned}

(16)

\begin{aligned} P (HbA1c_{t} = k \mid PHQ9_{prev} = j^{'}, HbA1c_{prev} = k^{'}, Treat, X_{prev}) \\ = \frac{e^{γ_{k}^{T} [j^{'}, k^{'}, Treat, X_{prev}]}}{\sum_{k^{″} \in K} e^{γ_{k^{″}}^{T} [j^{'}, k^{'}, Treat, X_{prev}]}} \end{aligned}

(17)

where,

\begin{aligned} X_{prev} = [Months_{PHQ9, prev}, Months_{HbA1c, prev}, Months_{treat, prev}] \end{aligned}

The above multinomial logistic regression models for state transitions incorporate both HbA1c and PHQ9 levels as predictors, meaning that the transitions for both health dimensions HbA1c and PHQ9 are interdependent, capturing the joint evolution of these states in response to the clinic’s Collaborative Care program. Under the Collaborative Care model, the depression-focused interventions are designed to complement the standard of care already provided to patients for diabetes management. In practice, this means that the transitions for HbA1c implicitly reflect the effects of diabetes-related treatments, which are provided independently of the Collaborative Care interventions. For example, a patient receiving insulin for glycemic control continues this treatment regardless of the Collaborative Care program, and our model incorporates these dynamics within the estimated transition probabilities. In light of this, the actions in the model focus on interventions related to depression, as this is the primary addition under the Collaborative Care framework. Including HbA1c in the state space ensures that our model captures its influence on the overall patient health trajectory and its interaction with PHQ9 levels. This approach aligns with real-world practices, where PCPs manage glycemic control through a parallel care process.

We obtain the estimates $β$ and $γ$ by performing the above multinomial logistic regressions on 80% of the data set. Using the results of this multinomial regression, we can predict the probability of transition to each health state for the given last known depression state, diabetes state, treatment, and months since these tests and treatment by substituting the appropriate values in the above multinomial regression model. This allows us to model the transition of patients who undergo regular monitoring and treatment.

We validate the estimated model in two ways. First, we report the Likelihood Ratio (LR) test for each variable, demonstrating that the variables improve the fit and are significant. Second, we obtain predictions from the logit model for the remaining 20% of the data, and we report the Mean Absolute Error of the model compared to the actual transitions observed in the data. We note that the model has a low Mean Absolute Error, and the LR test shows that the variables are significant and improve the model’s fit. We present these results in the Electronic Companion in Section EC.6.1.

After we obtain the estimates $β$ and $γ$ , we represent Collaborative Care, in which patients are monitored and treated every month, by substituting $Months_{PHQ9, prev} = 1$ , $Months_{HbA1c, prev} = 1$ , and $Months_{treat, prev} = 1$ . For each treatment level $Treat \in \{NoTreatment, High, Low\}$ , we can then predict the transition probabilities given in equations (16) and (17). We compute the transition probability to state $(j, k)$ starting in state $(j^{'}, k^{'})$ by the following expression:

\begin{aligned} & P (PHQ9_{t + 1} = j, HbA1c_{t + 1} = k \mid PHQ9_{t} = j^{'}, HbA1c_{t} = k^{'}, Treat, X_{t}) \\ & \quad = P (PHQ9_{t + 1} = j \mid PHQ9_{t} = j^{'}, HbA1c_{t} = k^{'}, Treat, X_{t}) \\ & \qquad \times P (HbA1c_{t + 1} = k \mid PHQ9_{t} = j^{'}, HbA1c_{t} = k^{'}, Treat, X_{t}) \end{aligned}

(18)
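As a minimal sketch of equation (18), the joint transition probabilities can be assembled by multiplying the two predicted marginals. The marginal probabilities below are made-up placeholders rather than estimates from the fitted regressions, and the function name `joint_transition` is hypothetical.

```python
from itertools import product


def joint_transition(p_phq9, p_hba1c):
    """Combine marginal next-state probabilities into joint probabilities.

    p_phq9[j]  : P(PHQ9_{t+1} = j | current state, treatment), j = 1..4
    p_hba1c[k] : P(HbA1c_{t+1} = k | current state, treatment), k = 1..2
    Returns a dict mapping (j, k) to the product of the two marginals.
    """
    return {(j, k): p_phq9[j] * p_hba1c[k]
            for j, k in product(p_phq9, p_hba1c)}


# Illustrative marginals for one starting state and one treatment level
# (placeholder numbers, NOT the fitted regression output).
p_phq9 = {1: 0.10, 2: 0.25, 3: 0.40, 4: 0.25}
p_hba1c = {1: 0.30, 2: 0.70}

joint = joint_transition(p_phq9, p_hba1c)
total = sum(joint.values())  # the eight joint probabilities sum to one
```

Repeating this for every starting state $(j^{'}, k^{'})$ and treatment level would fill one row of the transition matrix at a time.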

Other methods for estimating transition probabilities with covariates, such as the approach in Barratt and Boyd (2022), are more computationally demanding. To ensure robustness, we also applied the method from Barratt and Boyd (2022) in our analysis and observed no significant differences in the final recommendations for staffing levels and treatment policies.

6.2. Parameters Related to Enrollment

The remaining transition probability parameters are those for enrollment and attrition. We use $ν = 87\%$ as the acceptance ratio for enrollment, which we take from the proportion of eligible patients who consented to receive Collaborative Care under the IMPACT model (Unützer et al., 2002). Other trials, such as those conducted at Veterans Health (Hedrick et al., 2003), have observed a similar acceptance ratio.

To estimate the proportion of patients in each PHQ9 and HbA1c category in the population, we use published studies regarding the demographic characteristics of primary care patients. Kroenke et al. (2001) analyzed the distribution of PHQ9 scores among primary care patients with major depressive disorder. Menzin et al. (2010) studied the distribution of HbA1c scores in primary care patients. Using these two studies, we arrive at the distribution of patients in each of the eight health states. We provide this distribution in Table 2.

Table 2.
Distribution of patients with co-morbid diabetes and depression.

Health state $(j, k)$	Patient Health Questionnaire-9 (PHQ9) score	HbA1c score	Proportion in primary care population ( $η_{s}$ )
$(1, 1)$	$PHQ9 \leq 5$	$HbA1c \leq 7\%$	0.044
$(2, 1)$	$5 < PHQ9 \leq 9$		0.132
$(3, 1)$	$9 < PHQ9 \leq 14$		0.122
$(4, 1)$	$14 < PHQ9$		0.220
$(1, 2)$	$PHQ9 \leq 5$	$HbA1c > 7\%$	0.041
$(2, 2)$	$5 < PHQ9 \leq 9$		0.123
$(3, 2)$	$9 < PHQ9 \leq 14$		0.114
$(4, 2)$	$14 < PHQ9$		0.204

We model the patient population as being drawn from a distribution of starting health states, which is detailed in Table 2. This aggregate perspective aligns with the operational focus of the study, where the goal is to manage care delivery at the clinic level rather than to provide individualized recommendations for each patient. While we recognize that patient heterogeneity is important for personalized medicine, incorporating such factors into the model would significantly increase its complexity and fall outside the tactical planning scope of our work.

In addition, our data consists only of patients who were enrolled and, therefore, presents a biased sample of the primary care population. Consequently, we use the demographic data from published sources for the incoming distribution of patients rather than relying on our sample.

We have also performed a sensitivity analysis to assess the impact of increasing the percentage of patients in severely depressed states ( $PHQ9 > 14$ ) in Section EC.11.1 of the Electronic Companion.

6.3. Parameters for Time Spent by Care Managers

We use data from Liu et al. (2007), who have conducted a detailed time-motion analysis of the activities of care managers working in Collaborative Care for depression in three Veterans Health Administration clinics. The Collaborative Care process in their study was similar to the standard implementation of Collaborative Care described in Section 3. Care managers conducted follow-ups every two weeks for patients requiring more intensive intervention, and for less intensive intervention, care managers conducted follow-ups every four weeks. Each week, the care manager conducted a case review with a psychiatrist. Each follow-up resulted in a total care manager time of 60 minutes. This time consists of precall preparation, completed calls, postcall documentation, communication with providers, and all the time spent on unsuccessful call attempts. We describe the corresponding parameters in our model in Table 3.

Table 3.
Parameters for time spent by care managers.

Model parameter	Parameter value	Data source
$w^{L}$	60 minutes per month
$w^{H}$	120 minutes per month	Liu et al. (2007)

6.4. Parameters for Revenue

Under a fee-for-service model, the insurance firm pays the clinic every month. The amount paid depends on the minutes of work done by the care manager. We use payment rates published by CMS for Collaborative Care for the fee-for-service payment. The payment has a tiered structure. Time spent up to the tier limit is reimbursed at a particular amount. There is an additional payment fee for enrolling patients. This is the most common implementation of fee-for-service in Collaborative Care. Similar models are in place at other large insurance firms such as Aetna and Blue Cross Blue Shield.⁷

Since fee-for-service models have been identified as a potential implementation hurdle for Collaborative Care (Katon et al., 2010), health insurance providers and researchers have conducted large trials with per-member-per-month payment models. Fee-for-service remains the most common payment model for Collaborative Care in practice (Carlo et al., 2018), including the payment for Collaborative Care under Medicare, but per-member-per-month models for mental health services are expected to become more prevalent in the future. Per-member-per-month payment models fall under the broad category of capitated payment models, where payment rates for a specific fixed duration are set prospectively. There have been some instances of the use of such models in the delivery of mental health services (Bachman et al., 2006). For the per-member-per-month payment model, we use the rates established by the New York State Department of Health for a Collaborative Care initiative conducted at 32 primary care clinics in 2012 (Moise et al., 2018). PCPs were given a $150 per-member-per-month supplemental payment for adult patients receiving depression treatment under the Collaborative Care model. We use an inflation-adjusted rate of $165 for 2024.

We summarize the revenue parameters in Table 4.

Table 4.
Parameters for the fee-for-service and per-member-per-month revenue models.

Parameter	Payment type	Parameter value	Data source
$μ_{f}^{L}$	Fee-for-service	$129.38 for at least 60 minutes of service per month	Carlo et al. (2018)
$μ_{f}^{H}$	Fee-for-service	$196.41 for at least 90 minutes of service per month
$μ_{c}$	Per-member-per-month	$165 per month	Moise et al. (2018)

6.5. Parameters for Costs and Available Time

The program’s primary cost is the care manager’s salary and overtime. We use the cost analysis by Katon et al. (2012), which estimates the cost of care managers’ time spent delivering the services required for Collaborative Care. This cost includes overhead, benefits, and supervision of care managers.

The clinic can meet additional demand by allocating extra care manager time as required to satisfy workload requirements. This extra time will cost the clinic an overtime rate of 1.5 times the regular cost (Table 5).

Table 5.
Parameters for cost and available time.

Model parameter	Parameter value	Data source
$c_{c m}$	$42 per hour per full-time equivalent (FTE)	Hourly pay per care manager (Katon et al., 2012)
$c_{0}$	$63 per hour per FTE	1.5 times the regular hourly cost of care manager
$w_{0}$	160 hours per FTE per month	4 weeks at 40 hours per week per FTE

6.6. Parameters for QALYs

QALYs for all patients are determined by the summation of the utility value or QoL score for each patient. In the case of co-morbid conditions where multiple health conditions determine the QoL, several approaches are suggested for computing the QoL score. One of the most common recommendations is to multiply the QoL score of the individual health conditions (Ara and Brazier, 2017). We use this multiplicative model to compute patients’ QoL scores for a particular health state. The simple multiplicative model for combining health utility scores in patients with co-morbid conditions has received empirical support in several studies. One prominent example is the work by Flanagan et al. (2006), who tested this approach and found it to be robust for deriving utilities from individual condition scores. Although they acknowledged that further research is necessary across a broader range of conditions and populations, their findings strongly endorse the multiplicative method as both practical and reasonably accurate for handling comorbidities.

We present the QoL scores corresponding to the individual conditions in Tables 6 and 7. The composite QoL score computed by multiplying the QoL scores of the individual health scores is presented in Table 8. We use a QoL score of 0 for patients who drop out of the program, as these patients do not contribute to the QALYs of the Collaborative Care program.

Table 6.
Parameters for QoL scores for depression.

Depression health state $j$	PHQ9 score	Depression QoL score	Data source
1	$PHQ9 \leq 5$	0.812	Mann et al. (2009)
2	$5 < PHQ9 \leq 9$	0.689
3	$9 < PHQ9 \leq 14$	0.414
4	$14 < PHQ9$	0.362

PHQ9 = Patient Health Questionnaire-9; QoL = quality of life.

Table 7.

Parameters for quality-of-life (QoL) scores for diabetes.

Diabetes health state $k$	HbA1c score	Diabetes QoL score	Data source
1	$HbA1c \leq 7\%$	0.781	Aro et al. (2017)
2	$HbA1c > 7\%$	0.702

Table 8.

Parameters for QoL scores for patients with comorbid diabetes and depression.

Health state $(j, k)$	PHQ9 score	HbA1c score	QoL score ( $u_{s}$ )
$(1, 1)$	$PHQ9 \leq 5$	$HbA1c \leq 7\%$	0.634
$(2, 1)$	$5 < PHQ9 \leq 9$		0.538
$(3, 1)$	$9 < PHQ9 \leq 14$		0.323
$(4, 1)$	$14 < PHQ9$		0.283
$(1, 2)$	$PHQ9 \leq 5$	$HbA1c > 7\%$	0.570
$(2, 2)$	$5 < PHQ9 \leq 9$		0.484
$(3, 2)$	$9 < PHQ9 \leq 14$		0.291
$(4, 2)$	$14 < PHQ9$		0.254

PHQ9 = Patient Health Questionnaire-9; QoL = quality of life.
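The multiplicative rule can be checked directly against the tabulated values. The following sketch multiplies the depression and diabetes QoL scores from Tables 6 and 7 and rounds to three decimals; the names `composite_qol` and `table8` are hypothetical and used only for illustration.

```python
# QoL scores for the individual conditions (Tables 6 and 7).
DEPRESSION_QOL = {1: 0.812, 2: 0.689, 3: 0.414, 4: 0.362}
DIABETES_QOL = {1: 0.781, 2: 0.702}


def composite_qol(j, k):
    """Multiplicative QoL score for health state (j, k)."""
    return DEPRESSION_QOL[j] * DIABETES_QOL[k]


# Composite scores for all eight health states, rounded as in Table 8.
table8 = {(j, k): round(composite_qol(j, k), 3)
          for k in DIABETES_QOL for j in DEPRESSION_QOL}
```

For example, state $(1, 1)$ gives $0.812 \times 0.781 \approx 0.634$ and state $(4, 2)$ gives $0.362 \times 0.702 \approx 0.254$, matching Table 8.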

6.7. Other Parameters

The remaining parameter is the rate of monthly attrition. We use the attrition rate from the IMPACT randomized controlled trial (Unützer et al., 2002), which gives an average monthly attrition rate of $ϕ = 1.7\%$. From the standard process followed for Collaborative Care (Unützer et al., 2002), we arrive at an enrollment threshold of $j = 3$ and a discharge threshold of $j^{d} = 1$. We use a monthly discount factor of $0.99$.

As discussed in Section 4.2.1 and Section 4.2.2, we assume that the attrition probability and acceptance probability are independent of the health state. However, to demonstrate that our results are robust to this assumption, we perform a sensitivity analysis in Section EC.11.3 of the Electronic Companion, where we vary the attrition and acceptance probabilities based on the health state. We find that there is minimal impact on the overall model outcomes, indicating robustness to these assumptions.

In Section EC.4, we show that Conditions 1 to 3 are applicable for the above parameter values, verifying that the structural properties and, therefore, the solution methodology are appropriate for our problem.

6.8. Model Validation

To validate the model parameters, we conduct three sets of computations.

First, as discussed in Section 6.1, we perform an out-of-sample evaluation of the transition probability estimates. We report these results in Section EC.6.1 of the Electronic Companion. The multinomial logit model for depression transitions has a class prediction accuracy of 61% and is correct within one class 81% of the time. The diabetes transition model has a class prediction accuracy of 90.8%. Therefore, the transition probability estimates predict the transitions of patients in the data set reasonably accurately.

Second, to demonstrate that the transition prediction model can accurately model patients’ transitions under collaborative care, we simulate a hypothetical cohort of 1,000 patients with the treatment policy as used in current practice. We then compare the fraction of patients discharged at six months to the prior study conducted by Angstman et al. (2012). We report this comparison in the Electronic Companion in Section EC.6.1. We find that the percentage of patients discharged at six months based on their starting depression level is close to the model-predicted rates of discharge at six months. This demonstrates that the model can accurately predict the transition of patients under Collaborative Care.

Finally, to test the results’ robustness to the estimation procedure, we perform the analysis with the approach outlined in Barratt and Boyd (2022) and show that the final recommendations of staffing levels and treatment policy do not show any significant difference for a wide range of model parameters. We report the results of this analysis in the Electronic Companion in Section EC.6.1.

7. Solution Methodology and Numerical Analyses

In this section, we first develop a computationally efficient solution methodology that utilizes the structural properties of the model to optimize the treatment and staffing levels jointly. Depending on the organization, jointly optimizing staffing and treatment may not be feasible to implement. Some individual clinics that are part of larger healthcare systems may not have the discretion to change the treatment policy. Alternatively, clinics may face hiring shortages and be unable to adjust staffing levels. We, therefore, compare the results of the joint optimization with those of optimizing treatment alone or staffing alone, where the lever that is not optimized is held at its current-practice level at our partner organization. This also allows us to quantify the value of jointly optimizing treatment and staffing levels. We also compare the results of our model-based policy with the usual care process without a Collaborative Care program, in which patients receive their depression treatment at ad hoc intervals; we call this the baseline policy.

7.1. Baseline Policy: Usual Care

Under the baseline policy, patients receive their depression treatment at ad hoc intervals. We model the time between consecutive mental health treatment visits as an exponentially distributed random variable whose mean is the mean interarrival time obtained from our data. We compute the clinic profits and the QALYs gained by simulating a system in which the random treatment intervals are drawn from this exponential distribution.
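To illustrate the baseline mechanism, the sketch below draws ad hoc treatment times with exponentially distributed gaps using Python's standard library. The mean gap of 3 months is a hypothetical placeholder (the value estimated from the clinic data is not reported in this section), and `sample_visit_months` is an illustrative helper, not part of the authors' simulation.

```python
import random


def sample_visit_months(mean_gap_months, horizon_months, rng):
    """Sample treatment-visit times (in months) over a planning horizon.

    Gaps between consecutive visits are exponential with the given mean.
    """
    times, t = [], 0.0
    while True:
        # expovariate takes the rate, i.e. 1 / mean
        t += rng.expovariate(1.0 / mean_gap_months)
        if t > horizon_months:
            return times
        times.append(t)


rng = random.Random(0)   # fixed seed for reproducibility
MEAN_GAP = 3.0           # hypothetical mean gap, NOT the paper's estimate
visits = sample_visit_months(MEAN_GAP, horizon_months=12_000, rng=rng)

# Recover the gaps and check that their average is close to the mean.
gaps = [cur - prev for prev, cur in zip([0.0] + visits, visits)]
avg_gap = sum(gaps) / len(gaps)
```

In a full simulation, each sampled visit time would trigger a treatment decision and a health-state transition that uses the months elapsed since the previous visit.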

7.2. Current Practice for Collaborative Care

Under the current practice of Collaborative Care, the clinic follows a treatment policy of assigning $High$ treatment to patients with a PHQ9 score greater than 15 and $Low$ treatment to patients with a PHQ9 score less than or equal to 15. This practice corresponds to care managers spending $w^{H}$ time with patients in the first group and $w^{L}$ time with patients in the second group. This is a common strategy followed in implementing Collaborative Care (Liu et al., 2007). The clinic currently uses two care managers for an enrollment rate of 30 patients per month. So, we use $g = 2 w_{0}$ , and we set $m = 30$ patients per month. We simulate the system using these parameters and compute expected profits and QALYs gained.

7.3. Model-Based Joint Optimization of Treatment and Staffing

Proposition 3 shows that, for a given number of care managers, $g$ , the optimal policy is a threshold-type policy in the two-dimensional space $J \times K$ . Patients in health states above the threshold receive $w^{H}$ minutes of Collaborative Care per month, and those below the threshold receive $w^{L}$ . This threshold depends on the state of the system $N$ . To compute an optimal policy, we would need to calculate the threshold $J (k, N)$ for each state of the system $N$ and then subsequently compute the optimal staffing level $(g)$ for this treatment policy. This is computationally intractable and difficult to implement in practice. Therefore, we develop a computationally efficient and practical policy.

We take inspiration from the idea that the optimal policy has a threshold-type structure for each $N$ . In our approximate approach, we do not compute a threshold for each state $N$ , but instead, compute one static threshold $\hat{J} (k)$ as an approximation to $J (k, N)$ . This approach is computationally efficient and easier to implement in practice. With this policy, the decision-maker only needs to compute the threshold once for given problem parameters and not for every system state $N$ . This approach is also fairer for patients because the level of collaborative care services a patient receives does not depend on the health state of other patients. Therefore, this approach will be more acceptable for implementation. The threshold, $\hat{J} (k)$ , while not dependent on the entire system state $N_{s}$ , will depend on other clinical parameters such as monthly enrollments, costs, and payment rates. We show the sensitivity to these parameters in Section 8. From the model objective, we can see that the objective function is concave in the number of care managers $g$ for a given treatment policy. Therefore, a bisection search can determine the optimal number of care managers for a given treatment policy.

We use an infinitesimal perturbation analysis (IPA) approach to search for the threshold $\hat{J} (k)$ . We first assume that the threshold $\hat{J} (k)$ is a straight line parameterized by $ρ$ . For each iteration $i$ of the IPA, for a given threshold $ρ^{i}$ , we compute the gradients of the value function for 200 samples. The sample average of the gradient estimates the expectation of gradients. According to the IPA method, this expectation of gradients can be used to approximate the gradient of the expected value function (Glasserman and Ho, 1991). Using an adaptive step-size method, we update the threshold $ρ^{i + 1}$ . We stop the search if the value function does not show improvement greater than $0.5\%$ over 15 iterations.

Next, we note that the objective function is concave in the number of care managers, $g$ . Therefore, in each iteration of the above IPA method, we run an inner loop that finds the optimal $g$ by a bisection search. We describe the algorithm in detail in the Electronic Companion (EC.10).
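The inner staffing search can be sketched as a bisection on the sign of the objective's slope, which is valid for any concave function of $g$. The example below is a simplified illustration and not the algorithm of EC.10: it assumes a fixed, deterministic monthly workload and an objective made up only of wage and overtime costs from Table 5, and the function names are hypothetical.

```python
C_CM, C_OT, W0 = 42.0, 63.0, 160.0   # Table 5: $/hour, $/hour, hours per FTE


def neg_staffing_cost(g, workload_hours):
    """Negative monthly wage plus overtime cost for g FTEs (concave in g)."""
    overtime = max(0.0, workload_hours - g * W0)
    return -(C_CM * g * W0 + C_OT * overtime)


def bisect_concave_max(f, lo, hi, tol=1e-6, h=1e-7):
    """Maximize a concave f on [lo, hi] by bisecting on the sign of its slope."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid + h) - f(mid - h) > 0:   # still increasing: optimum lies to the right
            lo = mid
        else:                             # flat or decreasing: optimum lies to the left
            hi = mid
    return 0.5 * (lo + hi)


# Assumption: a deterministic workload equal to the current-practice
# steady-state average in Table 12 (467.1 hours per month).
WORKLOAD = 467.1
g_star = bisect_concave_max(lambda g: neg_staffing_cost(g, WORKLOAD), 0.0, 10.0)
```

Under these simplifying assumptions, the search returns the staffing level at which regular hours exactly cover the workload, because overtime costs more per hour than regular time. With a stochastic workload and the full objective, the optimum would generally differ.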

Since we restrict attention to a state-independent threshold, the resulting objective value can be no higher than that of the optimal state-dependent policy. To bound the performance loss of the state-independent policy, we also solve for an upper bound on the objective function. We find that the gap to the upper bound is less than 7.4% for all the scenarios we consider. We describe the upper bound computation in Section 7.6.

We also solve for two restricted versions of the joint-optimization problem. First, we solve for the model-based treatment policy at a fixed staffing level. We use the same algorithm described above, except that we do not optimize the staffing level and keep it fixed at the current practice. Second, we fix the treatment policy to the current practice described in Section 7.2 and solve for the optimal $g$ by the bisection method. This gives us the optimal staffing level for the current treatment policy.

We solve for these restricted versions of the joint optimization because, depending on the organizational restrictions, joint optimization may not be feasible to implement. Even in such a situation, we show that the model-based policy for treatment and staffing levels is superior to the current practice.

7.4. Policy Comparison

To compare the policies described above, we present the objective values in Table 9 and the differences in policy recommendations in Table 10 below. We present the results for monthly arrival rates of 30 and 50 patients and for two values of $θ \in \{0.2, 0.9\}$ . These values were selected to represent two extremes in clinic preferences: $θ = 0.2$ emphasizes patient health, while $θ = 0.9$ emphasizes financial outcomes. In practice, the choice of $θ$ would depend on the relative weights the clinic places on clinical outcomes and profits. For example, nonprofit health centers may operate under mission-driven mandates that emphasize population health. In contrast, private care organizations may focus more heavily on financial performance. These institutional differences motivate the use of a range of $θ$ values to capture the heterogeneity of clinic objectives in real-world settings.

Table 9.
Policy objective value comparison.

$θ$	Monthly arrival rate	Optimization type	Profit	QALYs gained	Profit (% improvement over baseline)	QALYs gained (% improvement over baseline)
0.2	30	Baseline	$752,879	1,780	0.0%	0.0%
		Current practice	$831,156	2,117	10.4%	18.9%
		Joint	$1,245,096	2,545	65.4%	43.0%
		Staffing only	$1,137,983	2,117	51.2%	18.9%
		Treatment only	$1,205,759	2,545	60.2%	43.0%
	50	Baseline	$1,484,798	3,046	0.0%	0.0%
		Current practice	$1,493,509	3,463	0.6%	13.7%
		Joint	$2,075,814	4,308	39.8%	41.4%
		Staffing only	$1,889,762	3,463	27.3%	13.7%
		Treatment only	$1,211,035	4,308	−18.4%	41.4%
0.9	30	Baseline	$752,879	1,780	0.0%	0.0%
		Current practice	$831,156	2,117	10.4%	18.9%
		Joint	$1,245,096	2,545	65.4%	43.0%
		Staffing only	$1,137,983	2,117	51.2%	18.9%
		Treatment only	$1,205,759	2,545	60.2%	43.0%
	50	Baseline	$1,484,798	3,046	0.0%	0.0%
		Current practice	$1,493,509	3,463	0.6%	13.7%
		Joint	$2,075,814	4,308	39.8%	41.4%
		Staffing only	$1,889,762	3,463	27.3%	13.7%
		Treatment only	$1,652,554	3,000	11.3%	−1.5%

QALYs = quality-adjusted life years.

Table 10.

Policy recommendation comparison.

$θ$	Monthly arrival rate	Optimization type	Interarrival period	Number of care managers	Treatment policy
0.2	30	Baseline	Random	2.0	[(L, L), (L, L), (L, L), (H, H)]
		Current practice	Monthly	2.0	[(L, L), (L, L), (L, L), (H, H)]
		Joint	Monthly	2.1	[(L, L), (H, H), (H, H), (H, H)]
		Staffing only	Monthly	1.7	[(L, L), (L, L), (L, L), (H, H)]
		Treatment only	Monthly	2.0	[(L, L), (H, H), (H, H), (H, H)]
	50	Baseline	Random	2.0	[(L, L), (L, L), (L, L), (H, H)]
		Current practice	Monthly	2.0	[(L, L), (L, L), (L, L), (H, H)]
		Joint	Monthly	3.6	[(L, L), (H, H), (H, H), (H, H)]
		Staffing only	Monthly	2.8	[(L, L), (L, L), (L, L), (H, H)]
		Treatment only	Monthly	2.0	[(L, L), (H, H), (H, H), (H, H)]
0.9	30	Baseline	Random	2.0	[(L, L), (L, L), (L, L), (H, H)]
		Current practice	Monthly	2.0	[(L, L), (L, L), (L, L), (H, H)]
		Joint	Monthly	2.1	[(L, L), (H, H), (H, H), (H, H)]
		Staffing only	Monthly	1.7	[(L, L), (L, L), (L, L), (H, H)]
		Treatment only	Monthly	2.0	[(L, L), (H, H), (H, H), (H, H)]
	50	Baseline	Random	2.0	[(L, L), (L, L), (L, L), (H, H)]
		Current practice	Monthly	2.0	[(L, L), (L, L), (L, L), (H, H)]
		Joint	Monthly	3.6	[(L, L), (H, H), (H, H), (H, H)]
		Staffing only	Monthly	2.8	[(L, L), (L, L), (L, L), (H, H)]
		Treatment only	Monthly	2.0	[(L, L), (L, L), (L, L), (L, H)]

To compute the parameters $ζ_{Q}$ and $ζ_{Π}$ , we use the procedure described in Section EC.3. In Table EC.2, we present the values of the scaling parameters for the two monthly arrival rates of 30 and 50 patients per month used in the analysis for Tables 9 and 10.

For low $θ$ and low arrival rates, the current practice of Collaborative Care, in which patients receive the current treatment policy with monthly monitoring, leads to an 18.9% improvement in QALYs and a 10.4% improvement in profits over the baseline. This improvement is due to regular monitoring and monthly adjustment of the required treatment level for the patient. However, current practice is sensitive to arrival rates, and the improvement in QALYs and profits is lower for higher arrival rates.
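The percentage columns in Table 9 follow from the profit and QALY columns. As a small worked check, the sketch below recomputes the current-practice improvements at an arrival rate of 30 patients per month; the helper name `pct_improvement` is hypothetical.

```python
def pct_improvement(value, baseline):
    """Percentage improvement of value over baseline."""
    return 100.0 * (value - baseline) / baseline


# Profit and QALYs gained from Table 9 (monthly arrival rate of 30).
baseline = {"profit": 752_879, "qalys": 1_780}
current = {"profit": 831_156, "qalys": 2_117}

profit_gain = round(pct_improvement(current["profit"], baseline["profit"]), 1)
qaly_gain = round(pct_improvement(current["qalys"], baseline["qalys"]), 1)
```

The same function applied to the treatment-only row at an arrival rate of 50 and $θ = 0.2$ (profit of $1,211,035 against a baseline of $1,484,798) gives a negative value, which is the profit loss reported in the table.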

The joint optimization policy leads to the highest improvement in QALYs and profits for both low and high $θ$ and low and high arrival rates. Joint optimization finds the highest treatment that can be provided and the optimal staffing level to maximize profits. We can observe that joint optimization assigns $High$ treatment to more health states, and to deliver this treatment policy profitably, the optimal staffing level is greater than that obtained from staffing-only optimization. Staffing-only optimization leads to higher profits than current practice, but since treatment is not optimized, it leads to the same QALYs as current practice. For a given arrival rate, staffing-only optimization is not sensitive to $θ$ . Staffing-only optimization continues to improve profits over current practice for higher arrival rates by optimizing the staffing level and, therefore, managing workforce costs better as the arrival rate increases. Treatment-only optimization shows more complex sensitivity to arrival rate and $θ$ . For low $θ$ , as the arrival rate increases from 30 patients per month to 50 patients per month, treatment-only optimization trades off profit to improve QALYs. As a result, profits fall below the baseline, while the QALYs match those of joint optimization. For high $θ$ , treatment-only optimization prioritizes profits, and as the arrival rate increases, the profits increase, but the QALYs decrease. In both cases, staffing remains fixed at the current level while the treatment policy changes the workload, so profits remain below those of joint optimization.

Next, we analyze some other characteristics of the model-based policy in Section 7.5.

7.5. Model-Based Heuristic Policy Dynamics

We provide two analyses to give intuition on how the model-based solution provides benefits. First, we show the path from enrollment to discharge for one representative patient in Table 11. Next, in Table 12, we compare the revenue and care manager workload for the model-based policy and current practice when a larger number of patients are enrolled.

Table 11.
Comparison of total discounted quality-adjusted life years (QALYs) and revenue for a single representative patient.

	Current practice				Joint optimization
Time period	Health state	Treatment	QALYs	Revenue	Health state	Treatment	QALYs	Revenue
0	(4,2)	$High$	0.254	$196.41	(4,2)	$High$	0.254	$196.41
1	(4,2)	$High$	0.254	$196.41	(4,2)	$High$	0.254	$196.41
2	(3,2)	$Low$	0.291	$129.38	(3,2)	$High$	0.291	$196.41
3	(3,2)	$Low$	0.291	$129.38	(2,2)	$High$	0.484	$196.41
4	(2,2)	$Low$	0.484	$129.38	(2,1)	$Low$	0.538	$129.38
5	(2,2)	$Low$	0.484	$129.38	(2,1)	$Low$	0.538	$129.38
6	(2,1)	$Low$	0.538	$129.38	(1,1)	$Low$	0.634	$129.38
7	(2,1)	$Low$	0.538	$129.38	Discharge	–	1	0
8	(1,1)	$Low$	0.634	$129.38	Discharge	–	1	0
9	Discharge	–	1	0	Discharge	–	1	0
Total discounted sum			4.50	$1,252.30			5.65	$1,143.06
% change from current practice							25.51%	−8.72%

Table 12.

Comparison of objective components for $m = 50$ patients per month.

	Current practice	Joint optimization
Number of patients under H	152.3	295.2
Number of patients under L	162.5	0
Total number of patients enrolled	314.8	295.2
Revenue	$50,946.53	$57,984.49
Wage	$13,440.00	$18,950.40
Workload (hrs)	467.1	590.4
Overtime (hrs)	147.2	139.2
Overtime cost	$9,273.60	$8,769.60
Profit	$28,232.93	$30,264.49

Numbers in this table are average steady-state values.

Table 11 shows transitions from health state $(4, 2)$ until discharge for a single patient under current practice and under the model-based policy. We observe that the patient spends one period less in each of the health states $(3, 2)$ and $(2, 2)$ under the model-based policy than under current practice. This corresponds to the $High$ treatment level assigned to health states $(3, 2)$ and $(2, 2)$ under the model-based policy, compared with the $Low$ treatment level assigned under the current policy. As a consequence of the earlier discharge, the total discounted QALYs gained are higher for the model-based policy than for the current practice. Also, while a higher revenue is collected for this patient under the model-based policy in periods 2 and 3, the patient spends less time enrolled in the program and contributes less total revenue. Consequently, for a single patient in this scenario, the model-based policy yields 25.51% more QALYs and 8.72% less revenue. In short, for a single patient, higher treatment leads to earlier discharge, which raises QALYs gained but lowers revenue.
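The totals in Table 11 can be reproduced by discounting each period's QALY and revenue entries. The sketch below assumes that the value 0.99 from Section 6.7 is applied as a per-period discount factor starting at period 0; the function name `discounted_totals` is hypothetical.

```python
DELTA = 0.99  # monthly discount factor (Section 6.7), applied from period 0

# Per-period (QALY entry, revenue) pairs from Table 11; discharge earns 1 and $0.
current_practice = [(0.254, 196.41), (0.254, 196.41), (0.291, 129.38),
                    (0.291, 129.38), (0.484, 129.38), (0.484, 129.38),
                    (0.538, 129.38), (0.538, 129.38), (0.634, 129.38),
                    (1.0, 0.0)]
joint_optimization = [(0.254, 196.41), (0.254, 196.41), (0.291, 196.41),
                      (0.484, 196.41), (0.538, 129.38), (0.538, 129.38),
                      (0.634, 129.38), (1.0, 0.0), (1.0, 0.0), (1.0, 0.0)]


def discounted_totals(path, delta=DELTA):
    """Return (discounted QALY sum, discounted revenue sum) for a path."""
    qalys = sum(q * delta ** t for t, (q, _) in enumerate(path))
    revenue = sum(r * delta ** t for t, (_, r) in enumerate(path))
    return qalys, revenue


q_cur, r_cur = discounted_totals(current_practice)
q_joint, r_joint = discounted_totals(joint_optimization)
qaly_change = 100 * (q_joint / q_cur - 1)      # % change in QALYs
revenue_change = 100 * (r_joint / r_cur - 1)   # % change in revenue
```

Under this assumption, the computed sums agree with the tabulated totals to two decimals, and the percentage changes come out at roughly 25.5% and −8.7%.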

In Table 12, we present the average steady-state values for the per-period number of patients in each treatment level, the total number of patients enrolled, revenue, wage, workload, overtime hours, overtime costs, and profit. We present average steady-state values for ease of interpretation. We can observe that under Joint Optimization, all health states are given $H$ treatment, as there are no patients under $L$ treatment. However, the total number of patients is lower. This is because if more patients receive high treatment, they are discharged more quickly, and therefore, in a steady state, fewer patients are enrolled. Thus, while the workload for joint optimization is higher than that of current practice, it is not significantly higher. Since all patients receive high-level treatment, the revenue for joint optimization is substantially higher than that of the current practice. Also, since joint optimization staffs more care managers, the overtime is lower, while the wage cost is higher. Finally, the profit for joint optimization is higher than that of the current practice.
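The link between patient counts, workload, and costs in Table 12 can be sketched with the parameters from Tables 3 to 5. This is an approximation that plugs the steady-state average patient counts into deterministic formulas, so entries such as revenue and overtime come out close to, but not exactly equal to, the tabulated steady-state averages. The function name `monthly_summary` is hypothetical.

```python
W_L, W_H = 1.0, 2.0                  # hours per patient per month (Table 3)
MU_L, MU_H = 129.38, 196.41          # fee-for-service payments (Table 4)
C_CM, C_OT, W0 = 42.0, 63.0, 160.0   # cost and capacity parameters (Table 5)


def monthly_summary(n_high, n_low, fte):
    """Approximate steady-state monthly revenue, costs, and profit."""
    workload = n_high * W_H + n_low * W_L          # care manager hours needed
    overtime = max(0.0, workload - fte * W0)       # hours beyond regular capacity
    revenue = n_high * MU_H + n_low * MU_L
    wage = C_CM * fte * W0
    overtime_cost = C_OT * overtime
    return {"workload": workload, "overtime": overtime, "revenue": revenue,
            "wage": wage, "overtime_cost": overtime_cost,
            "profit": revenue - wage - overtime_cost}


# Current practice in Table 12: 152.3 High and 162.5 Low patients, 2 FTEs.
current = monthly_summary(n_high=152.3, n_low=162.5, fte=2.0)
```

For current practice, this reproduces the 467.1 workload hours and the $13,440 wage cost in Table 12, and gives about 147.1 overtime hours against the reported 147.2.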

One might think that there might be caps on the total number of $H$ treatments provided by clinics based on the insurance plan. However, the treatment cap is often encoded in terms of how many $H$ or total treatments each patient can receive in a month. In our setting, this was restricted to two follow-ups in a month, which is how we have coded the $H$ treatment. This design ensures that no patient receives more than the reimbursable limit. If there are limits on the total number of treatments in a month that could be reimbursed, then our approach would need to be appropriately adapted.

There are two items we would like to highlight here. First, for the Collaborative Care program, the clinic should adopt a system perspective to determine the optimal allocation of care managers’ time because the trade-offs are different for a single patient compared to a population of enrolled patients. Second, the assignment of treatment levels to patient health states determines the number of patients and their distribution under $High$ and $Low$ treatment. The distribution of patients determines the revenue, workload, and health outcomes. Consequently, clinic profits and patient health outcomes can be improved by better assignment of treatment and staffing levels.

7.6. Model Upper Bound

To test the efficacy of our heuristic solution for the joint optimization, we compare the solution to an upper bound.

Proposition 4
The following linear program is an upper bound to the MDP given in equation (8).
$\begin{aligned} (D) \quad \max \sum_{\substack{N \in \mathcal{N} \\ a \in A,\, w \in W}} θ_{N, a, w} \, r(N, a, g_{w}) \end{aligned}$
(19)
subject to
$\begin{aligned} (1 - δ) \sum_{\substack{N \in \mathcal{N} \\ a \in A,\, w \in W}} θ_{N, a, w} = 1 \end{aligned}$
(20)

$\begin{aligned} \sum_{\substack{N \in \mathcal{N}_{m} \\ a \in A,\, w \in W}} θ_{N, a, w} \Big( N_{s} - δ \sum_{s' \in S} N_{s'} P[N = e_{s'} \mid N = e_{s}, a_{s}] \Big) \\ \leq ν_{s'} m + \sum_{s \in S} N_{s} P[N = e_{s'} \mid N = e_{s}, a_{s}] \quad \forall s \in S \end{aligned}$
(21)

$\begin{aligned} θ_{N, a, w} \geq 0 \quad \forall a \in A,\ N \in \mathcal{N},\ w \in W \end{aligned}$
(22)

$P[N = e_{s'} \mid N = e_{s}, a_{s}]$ is as given in equation (5).

We provide the proof of Proposition 4 in the Electronic Companion EC.8.2. The above linear program has a very large number of variables, so we solve it by column generation, which we describe in the Electronic Companion EC.9.

Next, we present the model upper bound results in Table 13 for monthly arrival rates of 30 and 50 patients and two values of $θ$, namely $θ \in \{0.2, 0.9\}$. We report scaled values of the objective function and of the upper bound.

Table 13. Model gap to upper bound.

Theta	Arrival rate	Model objective value	Upper bound	Gap
0.2	30	0.857	0.9133906	6.58%
0.2	50	0.913	0.9836662	7.74%
0.9	30	0.88	0.9374728	6.53%
0.9	50	0.9027	0.96995115	7.45%

We observe from Table 13 that the gap to the upper bound is below 7.8% in all the scenarios we consider (the largest gap is 7.74%). This shows that the model-based policy performs close to the upper bound.
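
The gaps in Table 13 are consistent with measuring the gap relative to the model objective value, that is, (upper bound − objective) / objective. That reading is inferred from the reported numbers and is not stated explicitly in the text. A short Python check under that assumption, with `gap_pct` as our own helper name:

```python
def gap_pct(objective: float, upper_bound: float) -> float:
    """Gap to the upper bound, as a percentage of the model objective value."""
    return 100.0 * (upper_bound - objective) / objective

# (theta, monthly arrival rate, model objective value, upper bound) from Table 13.
rows = [
    (0.2, 30, 0.857, 0.9133906),
    (0.2, 50, 0.913, 0.9836662),
    (0.9, 30, 0.88, 0.9374728),
    (0.9, 50, 0.9027, 0.96995115),
]

gaps = [round(gap_pct(objective, bound), 2) for _, _, objective, bound in rows]

for (theta, arrivals, _, _), gap in zip(rows, gaps):
    print(f"theta={theta}, arrival rate={arrivals}: gap = {gap:.2f}%")
print(f"largest gap = {max(gaps):.2f}%")
```
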
8. Managerial Insights

In this section, we discuss sensitivity to changes in model parameters and provide managerial insights. First, we look at how the model solution changes for different values of $θ$ . As described in equation (7), $θ$ is the weight on the clinic profits, and $1 - θ$ is the weight on the total QALYs gained. Next, we examine the model’s sensitivity to the number of care managers. Lastly, we examine the model’s sensitivity to various payment models. In the following figures, to make the profit and QALYs gained comparable for different arrival rates, we divide the QALYs gained and profits by $m$ , so that we present the profits and QALYs gained per monthly arrival of patients.
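
The role of $θ$ and the per-arrival normalization can be sketched in a few lines of Python. This is an illustration only: it assumes profit and QALYs have already been scaled to comparable units (the paper reports scaled objective values), the function names are ours, and the numbers in the usage lines are hypothetical.

```python
def weighted_objective(theta: float, profit: float, qalys: float) -> float:
    """Weighted sum with weight theta on profit and (1 - theta) on QALYs gained."""
    if not 0.0 <= theta <= 1.0:
        raise ValueError("theta must lie in [0, 1]")
    return theta * profit + (1.0 - theta) * qalys

def per_arrival(value: float, monthly_arrivals: float) -> float:
    """Normalize a total (profit or QALYs gained) by the monthly arrival rate m."""
    return value / monthly_arrivals

# Hypothetical scaled values, for illustration only.
objective_low_theta = weighted_objective(0.2, profit=1.0, qalys=0.5)
objective_high_theta = weighted_objective(0.9, profit=1.0, qalys=0.5)
profit_per_arrival = per_arrival(60.0, monthly_arrivals=30)

print(objective_low_theta, objective_high_theta, profit_per_arrival)
```

With the same inputs, a larger $θ$ moves the objective toward the profit term, which is why the two values of $θ$ used below can lead to different treatment choices.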

8.1. Sensitivity to Arrival Rate

We perform a sensitivity analysis on the arrival rate of patients. We present results for a range of arrival rates and two values of $θ$, namely $θ \in \{0.2, 0.9\}$. Figures 1 and 2 show the profit and the QALYs gained, respectively. Since total QALYs and profits grow with the arrival rate, we normalize both by dividing by the monthly arrival rate to make the results comparable.

Figure 1. Profit sensitivity to arrival rate for $θ \in \{0.2, 0.9\}$. (a) Profit vs arrival rate for $θ = 0.2$; (b) profit vs arrival rate for $θ = 0.9$.

Figure 2. Quality-adjusted life year (QALY) gained sensitivity to arrival rate for $θ \in \{0.2, 0.9\}$. (a) QALY gained vs arrival rate for $θ = 0.2$; (b) QALY gained vs arrival rate for $θ = 0.9$.

We can observe from Figure 1 that profit with staffing and joint optimization is not sensitive to an increase in arrival rate. This is because, with staffing optimization and joint optimization, the staffing level can be optimally decided to maximize the profit for any arrival rate. However, under treatment-only optimization and current practice, the profit first increases and then decreases. This is because when the arrival rate is low, increasing the arrival rate increases revenue, but there is no overtime. As the arrival rate crosses a particular threshold, any additional increase in arrival rate leads to overtime costs and, therefore, lowers profits.
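
The rise-then-fall pattern under a fixed staffing level can be reproduced with a deliberately simple sketch. It is not the paper's model. It assumes all patients receive $Low$ treatment (60 minutes per month from Table 3, revenue of $129.38 per period from Table 11) and stay enrolled for 9 periods. The number of care managers, their regular hours, and the regular and overtime wage rates are hypothetical values chosen only for illustration.

```python
REVENUE_PER_PATIENT = 129.38      # Low-treatment revenue per period (Table 11)
MINUTES_PER_PATIENT = 60          # w^L, minutes per month (Table 3)
PERIODS_ENROLLED = 9              # assumption: single path from Table 11
CARE_MANAGERS = 2                 # hypothetical fixed staffing level
MINUTES_PER_MANAGER = 160 * 60    # hypothetical regular minutes per month
WAGE_PER_MINUTE = 0.50            # hypothetical regular wage
OVERTIME_PER_MINUTE = 0.75        # hypothetical overtime wage

def per_arrival_profit(monthly_arrivals: int) -> float:
    """Steady-state monthly profit divided by the monthly arrival rate."""
    enrolled = monthly_arrivals * PERIODS_ENROLLED           # Little's law
    capacity = CARE_MANAGERS * MINUTES_PER_MANAGER
    workload = enrolled * MINUTES_PER_PATIENT
    overtime_minutes = max(0, workload - capacity)
    revenue = enrolled * REVENUE_PER_PATIENT
    wages = capacity * WAGE_PER_MINUTE                       # paid regardless of load
    overtime_cost = overtime_minutes * OVERTIME_PER_MINUTE
    return (revenue - wages - overtime_cost) / monthly_arrivals

for arrivals in (10, 20, 30, 35, 40, 50):
    print(arrivals, round(per_arrival_profit(arrivals), 2))
```

In this sketch, per-arrival profit rises while the fixed wage bill is spread over more arrivals and starts to fall once the workload exceeds regular capacity, because each extra minute is then paid at the higher overtime rate.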

We can also observe that for low $θ$ and low arrival rates, treatment-only optimization is better than current practice for both QALY and profit. This is because with small enrollment numbers, there is no possibility of overtime, and thus, treatment-only optimization can provide high treatment to a larger number of patients profitably. However, as the arrival rate increases, to prioritize QALYs, the treatment is kept high, but profitability starts decreasing.

For high $θ$ , we observe that under low arrival rate, as before, treatment-only optimization is better than current practice for both QALY and profit, as high treatment can be given profitably without any possibility of overtime. However, as the arrival rate increases, to prioritize profits, treatment-only optimization lowers the treatment level to improve profits until, finally, for a very high arrival rate, the treatment-only optimization is worse than the current practice for QALY.

These results show that the optimization types respond differently to the arrival rate. Staffing-only optimization and joint optimization behave similarly, with joint optimization slightly better than staffing-only optimization in terms of profits. The response of treatment-only optimization, and whether it performs better than current practice, depends on the value of $θ$.

Since there is no optimization under Current Practice, the response to the arrival rate does not change for different values of $θ$ . However, treatment optimization prioritizes QALYs for low $θ$ values; therefore, as the arrival rate increases, the profit under treatment optimization is lower than that under current practice. On the other hand, for high $θ$ , the treatment-only optimization has better profits than the current practice.

8.2. Sensitivity to Payment Model

Alongside the fee-for-service model developed by CMS, some locations have also implemented a fixed-fee payment model, known as the per-member-per-month model (Moise et al., 2018). In this section, we assess the sensitivity of various optimization models to changes in the fixed-fee payment rate.

For the fixed-fee payment model, we set $β = 0$ in our mathematical model. We analyze the sensitivity of profit and QALYs gained to different payment rates for low and high values of $θ$. In Figures 3 and 4, we present the profit and QALYs gained for different fixed-fee payment rates for two values of $θ$, namely $θ \in \{0.2, 0.9\}$.

Figure 3. Profit sensitivity to fixed-fee payment for $θ \in \{0.2, 0.9\}$. (a) Profit vs fixed-fee payment for $θ = 0.2$; (b) profit vs fixed-fee payment for $θ = 0.9$.

Figure 4. Quality-adjusted life year (QALY) gained sensitivity to fixed-fee payment for $θ \in \{0.2, 0.9\}$. (a) QALY gained vs fixed-fee payment for $θ = 0.2$; (b) QALY gained vs fixed-fee payment for $θ = 0.9$.

In Figure 3(a) and (b), profit increases linearly with the payment rate for the current practice and staffing-only optimization models. This linearity arises because the treatment level is fixed in these models. Since the treatment level determines the discharge rate, it also determines the number of patients in the system at a steady state. Therefore, the number of patients in the system remains constant regardless of the payment rate. Since each patient generates revenue proportional to the payment rate, profits linearly increase as the payment rate increases.
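
The linearity argument can be made concrete with a small sketch. When the treatment level is fixed, the steady-state number of enrolled patients and the operating cost do not depend on the payment rate, so profit is a straight line in the payment rate with slope equal to the number of enrolled patients. The enrolled count and cost below are hypothetical values used only for illustration, and `fixed_fee_profit` is our own helper name.

```python
def fixed_fee_profit(payment_rate: float, enrolled: int, operating_cost: float) -> float:
    """Monthly profit when each enrolled patient pays a fixed fee per month."""
    return payment_rate * enrolled - operating_cost

ENROLLED = 270          # hypothetical steady-state enrolled patients (fixed treatment)
OPERATING_COST = 9600   # hypothetical monthly wage and overtime cost

rates = [50, 100, 150, 200]
profits = [fixed_fee_profit(rate, ENROLLED, OPERATING_COST) for rate in rates]

# Equal steps in the payment rate give equal steps in profit.
increments = [later - earlier for earlier, later in zip(profits, profits[1:])]
print(profits)
print(increments)
```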

In contrast, we observe a nonmonotonic profit response in the joint optimization and treatment-only models. For payment rates below a certain threshold, revenues are insufficient to support higher treatment levels, resulting in low treatment levels across more health states. In this range, profit increases linearly as the payment rate rises. However, once the payment rate exceeds the threshold, the optimization model increases the treatment level, initially decreasing profits. Higher treatment levels lead to faster patient discharges, reducing the number of patients in the system at a steady state and, consequently, lowering revenue. As the payment rate continues to rise beyond the threshold, treatment levels stabilize at a higher level, and profits again increase linearly with the payment rate. Notably, the threshold for this switch is lower when $θ$ is lower, as the model prioritizes QALY improvement, making the benefits of higher treatment more attractive at lower payment rates. Moreover, the threshold for switching to higher treatment is lower for joint optimization than treatment-only optimization, as the joint model simultaneously reduces costs and increases revenue by optimizing both the staffing and treatment levels, facilitating an earlier switch to higher treatment. A similar pattern is observed in the sensitivity of QALYs, as shown in Figure 4. Higher treatment levels become optimal as payment rates increase, leading to greater QALY gains. The transition to higher treatment occurs at lower thresholds for lower $θ$ values. A lower threshold of transition to higher treatment is also observed for joint-optimization as compared to treatment-only optimization.

The managerial implication is that, under a fixed-fee payment model, profit responds nonmonotonically to the payment rate, and QALY gains improve only once the payment rate is high enough. For clinics to improve treatment levels, a fixed-fee payment model requires a payment rate above a certain threshold. This threshold depends on the clinic’s objective weights on QALYs and profit, as well as on the type of optimization performed. A higher weight on profit shifts the switch to higher treatment to higher payment rates, whereas joint staffing and treatment optimization brings the switch to a lower payment rate.

9. Conclusions

Access to mental health services has become increasingly challenging. The need for improved access is particularly high for patients suffering from chronic physical conditions. Treating both the mental and physical health of patients has been shown to increase the likelihood of improving patients’ health outcomes. The Collaborative Care program is a crucial way to increase access to mental health services. However, as we mentioned in Section 1, the adoption of Collaborative Care programs has been slow due to several challenges, such as managing care manager resources and a lack of visibility in insurance payment models.

One of the primary operational levers in managing population-based chronic care programs, such as Collaborative Care, is allocating the time of the care managers to patients depending on their requirements. Depending on disease severity, a patient may need more of the care manager’s time for therapy and treatment. Under standard clinical practice for Collaborative Care, patients can receive two levels of depression care from their care manager: a higher-intensity, more frequent follow-up or a lower-intensity, less frequent follow-up. Insurance revenue payments are typically tied to the time spent by the care managers. Therefore, allocating care managers’ time to patients determines patient health outcomes and clinic profits.

In this paper, we develop a mathematical model for Collaborative Care to optimize the allocation of care managers’ time and offer managerial insights for effective program management. Our model-based approach has a population health perspective, where total revenue, costs, and patient outcomes for all enrolled patients are incorporated. We calibrate our model parameters using patient transition data from a large academic medical center.

From our model, we derive a theoretical result that the optimal allocation of the care managers’ time is a threshold-type policy in the two-dimensional patient health state of diabetes and depression severity. Patients with high diabetes severity should receive more intensive treatment with more frequent follow-up at lower depression severity. This threshold-type policy will lead to improvement in both QALYs and profits. This threshold depends on the distribution of the enrolled patients in the different health states.

The computation of the optimal state-dependent threshold policy is intractable. Additionally, the optimal threshold policy would lead to patients receiving a level of care that depends on the health status of other patients, which is impractical from an implementation perspective. Therefore, we derive a computationally efficient state-independent policy, implementing a static threshold that does not change based on the distribution of enrolled patients. We show that this policy is close to optimal through an upper bound.

Subsequently, we conduct numerical experiments to show that the model-based heuristic policy can dominate current practice for both QALYs and profits. The choice of treatment levels determines how long it takes patients to be discharged. This determines the number of enrolled patients and, consequently, the total system revenue and the total system workload. The model-based heuristic policy determines the treatment levels to balance QALYs, total revenue, and total workload to improve system performance.

Next, we show that for fixed-fee payment models, profit increases linearly with the payment rate in the current practice and staffing-only models due to fixed treatment levels, stabilizing the patient count and revenue. However, joint optimization and treatment-only models show a nonmonotonic profit response, with profits initially rising, then decreasing as treatment levels increase, and rising again once payment rates surpass a threshold. This threshold is lower for models prioritizing QALY and joint optimization, which balances staffing and treatment. Similarly, QALY gains improve with higher treatment levels at lower payment rates. Clinics need payment rates above this threshold, which varies based on the clinic’s focus on profit or QALY, and joint optimization leads to earlier treatment improvements at lower payment rates. Prior literature on fixed-fee payment models has identified that providers may deliver lower than optimal treatment, which is called skimping (Folland et al., 2016). However, we find that in the management of programs such as Collaborative Care, skimping may be avoided if the fixed-fee rate is high enough and by using joint optimization of staffing and treatment levels.

There have been several studies analyzing the relative impact of different payment models. Our study contributes by showing the effect of different payment models in a Collaborative Care context, with chronic care of patients suffering from multiple conditions.

There are several opportunities for future research. First, in our study, we have considered financial and health outcomes observed within primary care. Patients with improved access to mental health services can also be expected to show lower healthcare utilization outside of primary care. Patients under Collaborative Care could also show lower visits to emergency rooms and fewer inpatient admissions. Therefore, one extension of the model could be incorporating the impact of Collaborative Care on the broader healthcare system outside primary care.

We did not adopt a Partially Observable Markov Decision Process approach due to its computational complexity, interpretability challenges, and difficulty in determining the model’s structural properties. However, we perform a sensitivity analysis in the Electronic Companion (EC.12) to assess how policies derived under fully observable states perform when health states are subjected to observation errors. The joint-optimization continues to perform the best for both QALYs and profits under observation errors. However, the sensitivity to the observation errors depends on whether the errors lead to false positive errors or false negative errors.

Finally, this approach does not include patients with other severe mental health conditions such as schizophrenia, anxiety, bipolar disorder, or substance abuse. We chose depression for two reasons. First, it is one of the most common conditions co-morbid with physical conditions. Second, the $PHQ9$ score is a validated quantitative measurement of the intensity of depression, allowing us to model the progression of the disease. Other mental health conditions have less established quantitative measures and thus would present significant modeling challenges.

Supplemental Material

sj-pdf-1-pao-10.1177_10591478251392328 - Supplemental material for Optimal Staffing and Treatment for Collaborative Care of Diabetes and Depression

Supplemental material, sj-pdf-1-pao-10.1177_10591478251392328 for Optimal Staffing and Treatment for Collaborative Care of Diabetes and Depression by Sandeep Rath, Jayashankar M Swaminathan and Charles A Coleman, Jr. in Production and Operations Management

Footnotes

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors received no financial support for the research, authorship and/or publication of this article.

ORCID iDs

Sandeep Rath

Jayashankar M Swaminathan

How to cite this article

Rath S, Swaminathan JM and Coleman CA Jr (2025) Optimal Staffing and Treatment for Collaborative Care of Diabetes and Depression. Production and Operations Management XX(X): 1–24.

References

Adams, Boutilier, Deo, et al. (2023) Planning a community approach to diabetes care in low- and middle-income countries using optimization. arXiv preprint arXiv:2305.06426. Available at: https://arxiv.org/abs/2305.06426.

Ahuja, Alvarez and Staats (2020) Maintaining continuity in service: An empirical examination of primary care physicians. Manufacturing & Service Operations Management 22(5): 1088–1106.

Alagoz, Maillart, Schaefer, et al. (2007) Determining the acceptance of cadaveric livers using an implicit model of the waiting list. Operations Research 55(1): 24–36.

Angstman, Pietruszewski, Rasmussen, et al. (2012) Depression remission after six months of collaborative care management: Role of initial severity of depression in outcome. Mental Health in Family Medicine 9(2): 99.

Ara and Brazier (2017) Estimating health state utility values for comorbidities. PharmacoEconomics 35(1): 89–94.

Aro, Karjalainen, Tiihonen, et al. (2017) Glycemic control and health-related quality of life among older home-dwelling primary care patients with diabetes. Primary Care Diabetes 11(6): 577–582.

Ayer, Alagoz and Stout (2012) OR Forum—A POMDP approach to personalize mammography screening decisions. Operations Research 60(5): 1019–1034.

Ayer, Zhang, Bonifonte, et al. (2019) Prioritizing hepatitis C treatment in US prisons. Operations Research 67(3): 853–873.

Bachman, Pincus, Houtsinger, et al. (2006) Funding mechanisms for depression care management: Opportunities and challenges. General Hospital Psychiatry 28(4): 278–288.

Baek, Boutilier, Farias, et al. (2023) Policy optimization for personalized interventions in behavioral health. arXiv preprint arXiv:2303.12206. Available at: https://arxiv.org/abs/2303.12206.

Barratt and Boyd (2022) Fitting feature-dependent Markov chains. Journal of Global Optimization 87(2): 329–346.

Bavafa, Savin and Terwiesch (2019) Managing patient panels with non-physician providers. Production and Operations Management 28(6): 1577–1593.

Bishop, Press, Keyhani, et al. (2014) Acceptance of insurance by psychiatrists and the implications for access to mental health care. JAMA Psychiatry 71(2): 176–181.

Carlo, Unützer, Ratzliff, et al. (2018) Financing for collaborative care—A narrative review. Current Treatment Options in Psychiatry 5(3): 334–344.

Chen, Ayer and Chhatwal (2018) Optimal M-switch surveillance policies for liver cancer in a hepatitis C–infected population. Operations Research 66(3): 673–696.

Clement, Schauman, Graham, et al. (2015) What is the impact of mental health-related stigma on help-seeking? A systematic review of quantitative and qualitative studies. Psychological Medicine 45(1): 11–27.

Collins, Corcoran and Perry (2009) Anxiety and depression symptoms in patients with diabetes. Diabetic Medicine 26(2): 153–161.

Dai and Tayur (2020) OM Forum—Healthcare operations management: A snapshot of emerging research. Manufacturing & Service Operations Management 22(5): 869–887.

Dampier (2018) Mental health care appointments often come with a long wait. 3 ways to cope while help is delayed. Available at: https://www.chicagotribune.com/lifestyles/sc-fam-mental-health-wait-times-1030-story.html (accessed 09 September 2020).

de Véricourt and Perakis (2020) Frontiers in service science: The management of data analytics services: New challenges and future directions. Service Science 12(4): 121–129.

Denton (2018) Optimization of sequential decision making for chronic diseases: From data to decisions. In: Recent Advances in Optimization and Modeling of Contemporary Problems. INFORMS, pp.316–348.

Deo, Iravani, Jiang, et al. (2013) Improving health outcomes through better capacity allocation in a community-based chronic care model. Operations Research 61(6): 1277–1294.

Diamant (2021) Dynamic multistage scheduling for patient-centered care plans. Health Care Management Science 24(4): 827–844.

Flanagan, McIntosh, Le Petit, et al. (2006) Deriving utility scores for co-morbid conditions: A test of the multiplicative model for combining individual condition scores. Population Health Metrics 4: 1–8.

Folland, Goodman and Stano (2016) The Economics of Health and Health Care: Pearson New International Edition. New York: Routledge.

Fournier, DeRubeis, Hollon, et al. (2010) Antidepressant drug effects and depression severity: A patient-level meta-analysis. JAMA 303(1): 47–53.

Gilbody, Bower, Fletcher, et al. (2006) Collaborative care for depression: A cumulative meta-analysis and review of longer-term outcomes. Archives of Internal Medicine 166(21): 2314–2321.

Glasserman (1991) Gradient Estimation Via Perturbation Analysis, Volume 116. New York: Springer Science & Business Media.

Hedrick, Chaney, Felker, et al. (2003) Effectiveness of collaborative care depression treatment in veterans’ affairs primary care. Journal of General Internal Medicine 18(1): 9–16.

Liu and Zabinsky (2019) A branch and bound algorithm for dynamic resource allocation in population disease management. Operations Research Letters 47(6): 579–586.

Huang (2016) Modeling depression progression dynamics from electronic health record. PhD Thesis, University of Washington.

Kamalahmadi, Bretthauer, Helm, et al. (2023) Mixing it up: Operational impact of hospitalist caseload and case-mix. Management Science 69(1): 283–307.

Katon, Russo, Lin, et al. (2012) Cost-effectiveness of a multicondition collaborative care intervention: A randomized controlled trial. Archives of General Psychiatry 69(5): 506–514.

Katon, Unützer, Wells, et al. (2010) Collaborative depression care: History, evolution and ways to enhance dissemination and sustainability. General Hospital Psychiatry 32(5): 456–464.

Kern, Seirup, Rajan, et al. (2018) Fragmented ambulatory care and subsequent healthcare utilization among Medicare beneficiaries. The American Journal of Managed Care 24(9): e278–e284.

Kirsch, Deacon, Huedo-Medina, et al. (2008) Initial severity and antidepressant benefits: A meta-analysis of data submitted to the Food and Drug Administration. PLoS Medicine 5(2): e45.

Kroenke and Spitzer (2002) The PHQ-9: A new depression diagnostic and severity measure. Psychiatric Annals 32(9): 509–515.

Kroenke, Spitzer and Williams (2001) The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine 16(9): 606–613.

Lee, Chertow and Zenios (2008) Optimal initiation and management of dialysis therapy. Operations Research 56(6): 1428–1449.

Lee, Lavieri and Volk (2019) Optimal screening for hepatocellular carcinoma: A restless bandit model. Manufacturing & Service Operations Management 21(1): 198–212.

Liu, Fortney, Vivell, et al. (2007) Time allocation and caseload capacity in telephone depression care management. American Journal of Managed Care 13(12): 652.

Liu, Hedrick, Chaney, et al. (2003) Cost-effectiveness of collaborative care for depression in a primary care veteran population. Psychiatric Services 54(5): 698–704.

Liu, Truong, Wang, et al. (2019) Integrated scheduling and capacity planning with considerations for patients’ length-of-stays. Production and Operations Management 28(7): 1735–1756.

Liu, Shi, Helm, et al. (2024) An integrated approach to improving itinerary completion in coordinated care networks. Manufacturing & Service Operations Management 27(1): 21–39.

Mann, Gilbody and Richards (2009) Putting the ‘Q’ in depression QALYs: A comparison of utility measurement using EQ-5D and SF-6D health related quality of life measures. Social Psychiatry and Psychiatric Epidemiology 44(7): 569–578.

Mausser (2006) Normalization and other topics in multi-objective optimization. In: Fields-MITACS Industrial Problems Workshop. Citeseer, p.89.

Menzin, Korn, Cohen, et al. (2010) Relationship between glycemic control and diabetes-related hospital costs in patients with type 1 or type 2 diabetes mellitus. Journal of Managed Care Pharmacy 16(4): 264–275.

Mišić and Perakis (2020) Data analytics in operations management: A review. Manufacturing & Service Operations Management 22(1): 158–169.

Moise, Shah, Essock, et al. (2018) Sustainability of collaborative care management for depression in primary care settings with academic affiliations across New York state. Implementation Science 13(1): 128.

Mojtabai and Olfson (2006) Treatment seeking for depression in Canada and the United States. Psychiatric Services 57(5): 631–639.

Natarajan and Swaminathan (2017) Multi-treatment inventory allocation in humanitarian health settings under funding constraints. Production and Operations Management 26(6): 1015–1034.

Schueller, Kwasny, Dear, et al. (2015) Cut points on the patient health questionnaire (PHQ-9) that predict response to cognitive–behavioral treatments for depression. General Hospital Psychiatry 37(5): 470–475.

Shechter, Bailey, Schaefer, et al. (2008) The optimal time to initiate HIV therapy under ordered health states. Operations Research 56(1): 20–33.

Simon, Katon, VonKorff, et al. (2001) Cost-effectiveness of a collaborative care program for primary care patients with persistent depression. American Journal of Psychiatry 158(10): 1638–1644.

Tanner (2024) Collaborative Care Management: The Influence of Location of Care on Mental Health Self-Stigma. PhD Thesis.

Unützer, Katon, Callahan, et al. (2002) Collaborative care management of late-life depression in the primary care setting: A randomized controlled trial. JAMA 288(22): 2836–2845.

Unützer and Ratzliff (2015) Evidence base and core principles. In: Integrated Care: Working at the Interface of Primary Care and Behavioral Health, pp.3–16.

Unützer, Schoenbaum, Katon, et al. (2009) Healthcare costs associated with depression in medically ill fee-for-service Medicare participants. Journal of the American Geriatrics Society 57(3): 506–510.

Varshney, Sharma, Goel, et al. (2020) Estimation of transition probabilities for diabetic patients using hidden Markov model. International Journal of System Assurance Engineering and Management 11(Suppl 2): 329–334.

Yildirim, Gaynes, Keskinocak, et al. (2021) The cost-effectiveness of depression screening for the general adult population. Journal of Affective Disorders 303: 306–314.

Zhang (2016) Missing data imputation: Focusing on single imputation. Annals of Translational Medicine 4(1): 9.

Health state $(j, k)$	Patient Health Questionnaire-9 ($PHQ9$) score	HbA1c score	Proportion in primary care population ($η_{s}$)
$(1, 1)$	$PHQ9 \leq 5$	$HbA1c \leq 7\%$	0.044
$(2, 1)$	$5 < PHQ9 \leq 9$	$HbA1c \leq 7\%$	0.132
$(3, 1)$	$9 < PHQ9 \leq 14$	$HbA1c \leq 7\%$	0.122
$(4, 1)$	$14 < PHQ9$	$HbA1c \leq 7\%$	0.220
$(1, 2)$	$PHQ9 \leq 5$	$HbA1c > 7\%$	0.041
$(2, 2)$	$5 < PHQ9 \leq 9$	$HbA1c > 7\%$	0.123
$(3, 2)$	$9 < PHQ9 \leq 14$	$HbA1c > 7\%$	0.114
$(4, 2)$	$14 < PHQ9$	$HbA1c > 7\%$	0.204

Table 11. Transitions from health state $(4, 2)$ until discharge for a single patient under current practice and joint optimization.

	Current practice				Joint optimization
Time period	Health state	Treatment	QALYs	Revenue	Health state	Treatment	QALYs	Revenue
0	(4,2)	$High$	0.254	$196.41	(4,2)	$High$	0.254	$196.41
1	(4,2)	$High$	0.254	$196.41	(4,2)	$High$	0.254	$196.41
2	(3,2)	$Low$	0.291	$129.38	(3,2)	$High$	0.291	$196.41
3	(3,2)	$Low$	0.291	$129.38	(2,2)	$High$	0.484	$196.41
4	(2,2)	$Low$	0.484	$129.38	(2,1)	$Low$	0.538	$129.38
5	(2,2)	$Low$	0.484	$129.38	(2,1)	$Low$	0.538	$129.38
6	(2,1)	$Low$	0.538	$129.38	(1,1)	$Low$	0.634	$129.38
7	(2,1)	$Low$	0.538	$129.38	Discharge	–	1	0
8	(1,1)	$Low$	0.634	$129.38	Discharge	–	1	0
9	Discharge	–	1	0	Discharge	–	1	0
Total discounted sum			4.50	$1,252.30			5.65	$1,143.06
% change from current practice							25.51%	−8.72%

Table 1. Patient Health Questionnaire-9 ($PHQ9$) and depression severity.

Depression health state $(j)$	$PHQ9$ score	Depression severity
1	0–4	None-minimal
2	5–9	Mild
3	10–14	Moderate
4	15+	Severe

Table 3. Parameters for time spent by care managers.

Model parameter	Parameter value	Data source
$w^{L}$	60 minutes per month	
$w^{H}$	120 minutes per month	Liu et al. (2007)

Table 6. Parameters for QoL scores for depression.

Depression health state $j$	$PHQ9$ score	Depression QoL score	Data source
1	$PHQ9 \leq 5$	0.812	Mann et al. (2009)
2	$5 < PHQ9 \leq 9$	0.689	
3	$9 < PHQ9 \leq 14$	0.414	
4	$14 < PHQ9$	0.362	