Abstract
Study Design
Commentary.
Objective
This commentary aims to discuss the practical applications of a clinical prediction rule (CPR) developed to predict functional status in patients undergoing surgery for the treatment of degenerative cervical myelopathy.
Methods
Clinical cases from the AOSpine CSM-North America study were used to illustrate the application of a prediction rule in a surgical setting and to highlight how this CPR can be used to ultimately enhance patient care.
Results
A CPR combines signs and symptoms, patient characteristics, and other predictive factors to estimate disease probability, treatment prognosis, or risk of complications. These tools can influence allocation of health care resources, inform clinical decision making, and guide the design of future research studies. In a surgical setting, CPRs can be used to (1) manage patients’ expectations of outcome and, in turn, improve overall satisfaction; (2) facilitate shared decision making between patient and physician; (3) identify strategies to optimize surgical results; and (4) reduce heterogeneity of care and align surgeons’ perceptions of outcome with objective evidence.
Conclusions
Valid and clinically-relevant CPRs have tremendous value in a surgical setting.
Keywords
What Is a Clinical Prediction Rule?
A clinical prediction rule (CPR) combines signs and symptoms, patient demographics, and other predictive factors to estimate disease probability, treatment prognosis, or risk of complications. 1 CPRs are valuable in the medical field because they may inform policy changes, influence clinical practice, and guide the design of future research studies. In terms of public policy, prediction models can help develop preventive strategies and outline protocols for the management of high-risk asymptomatic patients. In a clinical setting, CPRs are useful because they can (1) inform patients of their risk of disease development; (2) provide an evidence-based tool to help clinicians make decisions about screening, diagnostic testing, and therapeutic interventions (i.e., timing, type, and intensity); (3) facilitate shared decision making between patients and physicians; (4) help manage patients’ expectations; (5) evaluate the cost-effectiveness of treatment by providing information on prognosis; and (6) reduce heterogeneity of practice across centers. From a research perspective, knowledge of important predictors can help design appropriate inclusion and stratification criteria for future studies and can identify relevant confounders to optimize robust statistical analyses.
The Acute Physiology and Chronic Health Evaluation II (APACHE II) and the model for end-stage liver disease are two examples of common CPRs used in a clinical setting. 2 , 3 The APACHE II is an intensive care unit scoring system that incorporates 12 physiologic measurements, age, and medical history to assess disease severity and risk of mortality. 2 This score is used to identify life-threatening physiologic problems and to guide the development of suitable treatment strategies. Furthermore, this knowledge can inform concerned family members about the risk of subsequent hospital death. On the hand, the model for end-stage liver disease score uses a patient's bilirubin and creatinine levels and the international normalized ratio from prothrombin time to evaluate disease severity, estimate the chance of survival, and prioritize recipients of liver transplants. 3 These two examples demonstrate that CPRs can be used to identify high-risk patients, assess severity of disease, and provide prognostic information to patients and their families.
A Clinical Prediction Rule in a Surgical Setting
There is an increasing need for clinicians to accurately and objectively quantify a patient's likely surgical outcome. A CPR was recently developed to predict functional status in patients undergoing surgery for the treatment of degenerative cervical myelopathy (DCM). 4 The primary objective of this commentary is to describe the clinical application of this CPR and, through the use of examples, explain how it can ultimately improve patient care.
DCM is a progressive, degenerative spine disease and the most common cause of spinal cord dysfunction in adults worldwide.
5
,
6
The term
This CPR was developed to discriminate between patients with mild myelopathy at 1 year following surgery (modified Japanese Orthopaedic Association [mJOA] ≥16) and those with substantial residual neurologic impairment (mJOA < 16). 4 A score of 16 on the mJOA translates to greater social independence, minimal neurologic signs and symptoms, and an ability to perform activities of daily living. The original CPR included six statistically significant clinical variables and one clinically relevant imaging parameter. Based on the model, patients were more likely to achieve a score of ≥16 if they had milder disease preoperatively, had a shorter duration of symptoms, did not smoke, were younger, did not have psychiatric disorders or impaired gait, and had a larger transverse area. Results from a recent systematic review, however, revealed that transverse area is not predictive of outcome; this imaging parameter was consequently removed from the model to increase its relevance in a clinical setting. 11 , 12 The following equation (Eq. 1) summarizes this CPR:
where
Validation of This Clinical Prediction Rule
The original CPR was developed using data on patients enrolled at spine care centers in Canada and the United States. As a result, this model truly reflects the patient demographics, disease causation, management strategies, and health care systems of North America. Unfortunately, several differences may exist across countries, cities, and centers that could decrease the generalizability of our model and render it invalid at sites outside of North America. Some of these include differences in (1) disease definitions and clinical presentation, (2) average demographics, (3) interpretation of predictors or assessment of outcome, (4) access to care, and (5) management strategies.
To assess external validity, we evaluated the discrimination and calibration of our CPR on a data set of surgical patients enrolled at centers across the world. Based on our results, the model displayed good predictive performance and adequate calibration in the international data set. 12
Using Eq. 1 to Predict Outcome
The patient, a 57-year-old nonsmoking woman, presented with severe myelopathy (mJOA = 11) secondary to spondylosis, disk herniation, and hypertrophy of the ligamentum flavum. This patient complained of gait impairment, clumsy hands, and weakness and upon clinical examination displayed evidence of lower limb spasticity, hyperreflexia, broad-based unstable gait, and atrophied intrinsic hand muscles. Her preoperative duration of symptoms was 14 months. The patient also had a heart arrhythmia and moderate hypertension.
The variables relevant to the prediction equation are baseline mJOA, duration of symptoms, impaired gait, smoking status, psychiatric health, and age. Table 1 displays the numbers that must be entered into Eq. 1 to compute the probability of achieving a score ≥16. This probability was calculated as 48.95%.
An example of how to use a clinical prediction equation to evaluate outcome
Abbreviations:
Clinical Application of This Clinical Prediction Rule
In a surgical population, a CPR can be used to manage patients’ expectations, facilitate shared decision making, identify ways to optimize results, and standardize practice across centers. The following explanations and case examples highlight how this CPR can be used in these four ways to improve patient care.
Manage Patients’ Expectations and Improve Overall Satisfaction
In the United States, recent legislative developments require clinicians to report patient satisfaction in their assessment of overall treatment outcomes. 13 The Centers for Medicare and Medicaid Services (CMS), hospitals, and insurance providers have identified patient satisfaction as a major component and predictor of overall quality of care. 13 As part of a CMS hospital inpatient value-based purchasing program, Medicare reimbursements are increasingly linked to patient satisfaction. Furthermore, as of 2013, CMS has provided “value-based” incentive payments to acute care hospitals in part based on results from patient satisfaction surveys. 14
Patient satisfaction is not exclusively based on clinical outcomes and can be strongly influenced by several extraneous factors, including the politeness of the staff, cleanliness of the facilities, surgical wait times, and physician's communication skills. Furthermore, according to a study by Hamilton et al, the extent to which a patient's preoperative expectations are met is also a significant predictor of overall satisfaction of surgery. 15
Expectations are “the yardstick by which our patients measure the course of recovery, occurrence of complications and the outcome.” 16
Patients’ expectations, however, are often influenced by anecdotal evidence provided by friends and family, information obtained from the Internet, and various news sources and past surgical experiences. The attending surgeon must use the consent discussion as an opportunity to outline risks and benefits of the procedure, provide accurate outcome information, and correctly manage patients’ expectations. However, these expectations are also affected by how the patient understands the information provided by the surgeon and how he or she translates it into “actionable beliefs,” 16 which further depends on several factors such as mental state, attitude, and individual or cultural values. As a result, patients may form different expectations than those the surgeon is attempting to communicate.
A quantitative tool used to predict surgical outcome can help a surgeon more effectively convey prognostic information and give the patient a better understanding of how he or she should expect to fare following intervention. Patients’ expectations will therefore be more appropriately managed and, accordingly, overall satisfaction will likely improve.
Facilitate Shared Decision Making and Counsel Concerned Patients as to Potential Treatment Options
Predicting surgical outcome can also facilitate shared decision making between the patient and physician. An estimate of the risks and benefits must be presented to the patient in order for him or her to make informed decisions about the various treatment options. During the consent process, the discussion between the patient and physician should include a prediction of surgical outcome, the consequences of not undergoing surgery, and the probability of achieving social independence. This CPR can provide this information to both the surgeon and the patient and enable shared decision making.
Identify Strategies to Optimize Results
A CPR can also help clinicians identify strategies to optimize surgical results. We must first distinguish between the predictors that clinicians have control over from those that cannot be changed.
Age: A patient's age is unchangeable.
The other five factors are modifiable:
Duration of symptoms: A longer duration of symptoms is associated with a worst postoperative outcome. To improve the prognosis, clinicians can choose to operate earlier on patients with DCM rather than waiting for the disease to progress. This option, however, requires that primary care physicians accurately detect DCM at early disease stages, differentiate between it and mimicking diagnoses, and refer patients immediately for surgical consultation. 17
Baseline severity score: Severer preoperative myelopathy is also predictive of a worse outcome. This point can be changed by earlier detection of DCM and by prioritizing consultation and surgery for these patients.
Smoking status: It is unclear why smokers have a worse postoperative outcome than nonsmokers. Previous studies have suggested that smokers have higher rates of nonfusion and wound infections than nonsmokers. 18 We, however, speculated that smoking is a surrogate for lower socioeconomic status, poorer dietary choices, the presence of comorbidities, and less access to postsurgical care. Further research is required to confirm these hypotheses; however, until this research is done, surgeons should promote smoking cessation prior to surgery to optimize outcomes.
Psychiatric disorders: Patients with depression or bipolar disorder do not perform as well following surgery. Some surgeons will preselect their patients according to their mental state and will not operate on those with psychiatric comorbidities. It is recommended that these patients seek counseling prior to intervention to improve outcomes, recovery, and satisfaction.
Impaired gait: Patients with gait dysfunction typically have more severe spinal cord dysfunction and score lower on a preoperative mJOA assessment. The presence of impaired gait is also predictive of a worse outcome. This factor is related to a patient's duration of symptoms and the baseline severity score: if a patient is operated on earlier and at a milder disease stage, he or she may not have progressed to displaying evidence of gait impairment.
It is important to note that this model is not intended to evaluate surgical candidacy or identify patients who are more likely to benefit from surgical intervention than conservative management. Furthermore, there are only a few situations where our CPR would recommend against surgery: (1) the patient is very old and frail; (2) the patient is at high risk of experiencing a major complication; or (3) the patient has several comorbidities and poor general health status.
Reduce Heterogeneity of Care and Align Surgeons’ Perceptions of Outcome with Objective Evidence
According to a study by Davidson et al, there is substantial variability between clinicians in terms of the information they convey to their patients with spinal cord injuries. 19 In this study, a questionnaire was distributed to spine surgeons across North America who specialized in the management of traumatic spinal cord injury. This survey was designed to evaluate the type of prognostic information surgeons provide to their injured patients and to assess the variability of responses across practices. The questionnaire presented various case vignettes and asked questions such as, “How long will it take this patient to return to work?”; “What are the chances this patient will be free of back pain and stiffness 1 year after surgery?”; and “What do you tell your patient concerning the chances for functional recovery in his or her lower extremities?” Ideally, all surgeons would be well informed of the existing prognostic literature and provide similar responses to these questions. This ideal, however, was not the case; the results indicated substantial variability in the information surgeons provide to their patients about how they are expected to fare following intervention.
Similarly, in DCM, surgeons often have different perceptions of surgical prognosis, which is likely due to the controversy in the literature surrounding the most important clinical and imaging predictors of surgical outcome. Our CPR can be used across centers to objectively quantify a patient's likely outcome, align surgeons’ perceptions with more objective evidence, and ensure that consistent and accurate information is being conveyed to patients.
Example Cases
In this section, we present example cases to illustrate how this CPR can be used to manage patients’ expectations, counsel concerned patients as to the potential treatment options, and align surgeons’ perceptions with objective evidence. Table 2 summarizes how the probability of achieving a score of mJOA ≥16 was calculated.
Case examples and clinical applications of the clinical prediction rule
Abbreviations:
Note:
Manage Patients’ Expectations and Improve Overall Satisfaction
The following two cases demonstrate how predicting outcome before surgery can aid in managing expectations.
Case 1: A 49-year-old nonsmoking man presented with moderate myelopathy (mJOA = 14) secondary to spondylosis, disk herniation, and congenital stenosis. This patient had numb and clumsy hands, muscular weakness, corticospinal motor deficits, hyperreflexia, and upgoing plantar responses. The duration of symptoms was 2 months. The patient also had coexisting moderate hypertension, mild respiratory disease, and mild diabetes.
Case 2: A 69-year-old nonsmoking man presented with moderate myelopathy (mJOA = 13) secondary to spondylosis, disk herniation, and hypertrophied ligamentum flavum. This patient had numb and clumsy hands, an impaired gait, muscular weakness, corticospinal distribution motor deficits, hyperreflexia, a positive Hoffman sign, upgoing plantar responses, and a broad-based unstable gait. The duration of symptoms was 120 months. The patient had a mild stroke (Fig. 1).

Case 2.
Based on estimates computed by Eq. 1, case 1 has a 92.7% chance of improving to an mJOA score ≥16 whereas case 2 only has a 41.0% chance of achieving this outcome. These patients should be managed differently during the surgical consent process. The attending surgeon should inform both patients that they are likely to improve following surgery but should notify case 2 that he will still have substantial residual neurologic deficit and may require assistance with activities of daily living. This information will help manage case 2's expectations of outcome and, in turn, help to improve his overall satisfaction. With respect to observed outcome, case 1 was neurologically normal postoperatively (mJOA = 18), whereas case 2 improved from 13 to 15 but did not reach a score ≥16.
Facilitate Shared Decision Making and Counsel Concerned Patients as to Potential Treatment Options
Case 3 provides an example how a CPR can be used to facilitate shared decision making and to counsel concerned patients as to potential options.
Case 3: A 53-year-old nonsmoking man presented with mild myelopathy (mJOA = 17) secondary to spondylosis and disk herniation. This patient had numb and clumsy hands, bilateral arm paresthesia, muscular weakness, and atrophy of intrinsic hand muscles. The duration of symptoms was 4 months. The patient had unspecified endocrine comorbidities (Fig. 2).

Case 3.
This case is an example of a patient with mild myelopathy and a short preoperative duration of symptoms. This patient has an excellent surgical prognosis; however, he may be reluctant to consent to neurosurgery for such mild upper limb symptoms. This CPR can help surgeons counsel this patient and inform him that if he is operated on early and at his current disease state, he will achieve a better outcome than if he were to wait. An example is provided in Table 2. We assumed that if the patient waits 1 year before surgery, he will exhibit a 2-point decline on his preoperative mJOA score, will be 1 year older, and will have a significantly longer duration of symptoms. As a result, his probability of achieving a score ≥16 on the mJOA decreases from 93.6 to 85.0%. Furthermore, if he starts to exhibit signs and symptoms of gait dysfunction, this estimate further decreases to 69.6%. These figures would be valuable to help clinicians counsel their patients and to enable shared decision making.
Reduce Heterogeneity of Care and Align Surgeons’ Perceptions of Outcome with Objective Evidence
Case 4: A 62-year-old nonsmoking man presented with mild myelopathy (mJOA = 15) secondary to spondylosis, disk herniation, and congenital stenosis. This patient had numb hands, Lhermitte phenomena, weakness, atrophy of intrinsic hand muscles, and a positive Hoffmann sign. The duration of symptoms was 36 months. The patient had coexisting mild gastrointestinal (stomach/intestine) disorders (Fig. 3).

Case 4.
For each participant of the study, the surgeon was asked to predict how the subject would fare following surgical intervention: improve from baseline status, remain the same, or worsen. For case 4, the surgeon believed the subject would be the same as baseline. However, the CPR predicted a 76.4% chance the patient would achieve a score ≥16 and therefore improve by at least 1 point on the mJOA. The patient did indeed improve following surgery and was neurologically normal at 1-year follow-up (mJOA = 18). This example demonstrates how a CPR can help align surgeons’ perceptions with more objective evidence.
Limitations and Future Directions
This model was developed to predict functional status in patients undergoing surgery for the treatment of DCM. Unfortunately, the mJOA/JOA scales are not highly correlated with patient-reported outcomes that assess overall quality of life. This conclusion was determined by several studies: (1) Kato et al reported moderate correlation between the JOA/mJOA and the Short-Form (SF)-12 Physical Component Score and Neck Disability Index and low correlation with the SF-12 Mental Component Score
20
; (2) Kopjar et al also demonstrated poor correlation with the Neck Disability Index (
A randomized controlled trial is required to assess the health care utility and the true application of this CPR in a surgical spine setting. In this trial, we propose that 50% of the patients will have their outcome objectively predicted by our CPR, whereas the other 50% will have a regular consent conversation with their attending surgeon. Patient satisfaction and other health-utility measures will be evaluated and compared between the two groups to ascertain the real value of this CPR.
Conclusions
By using actual cases examples in a DCM population, this commentary highlights the clinical applications of CPRs and their value in a surgical setting. CPRs can be used to predict disease development, define the treatment prognosis, and estimate the risk of complications in various patient populations. In a surgical setting, these tools can facilitate shared decision making between patients and physicians, manage expectations, and help standardize care across centers. Furthermore, a CPR can help identify ways to optimize the results and assist in the evaluation of cost-effectiveness.
Disclosures
Lindsay Tetreault, none
David Le, none
Pierre Côté, Grants: Ontario Ministry of Finance and Financial Services Commission of Ontario, Canadian Institute of Health Research; Speaking and/or teaching arrangements: National Judicial Institute, European Spine Society, Societe des Medecins Experts du Quebec; Travel expenses: European Spine Society
Michael Fehlings, none
For the myelopathy scoring, the formula presented by Fehlings et al, however, entirely relies on discrete variables consisting of the modified Japanese Association score (mJOA), presence or absence of mental health disorders (specifically depression, bipolar disorder), and presence or absence of impaired gait, smoking, and duration of symptoms (split up into 5 discrete variables from 1 to 5). Statistical calculations that rely on discrete values with their categorical increments are very different from calculations that draw upon continuous variables in several ways. For instance, simple calculations of averages and means without representation of distribution have to be interpreted with caution. When discrete variables are used, such as in the ASIA grading system, fractional improvements do not necessarily represent a clinically meaningful change.
An advantage of using discrete variables lies in the option to express distributions in a tabular format, something that one cannot do with continuous variables. Such an arrangement may be seen as a significant advancement for some. For others it may introduce the threat of reducing complex human creatures into a formula with a possibility of encroaching on access to health care.
In this context it is worthwhile to briefly explore the possibility of variabilities or manipulation inherent to a formula-driven medicine. When scoring systems are used for an outcomes or probability formula, who is the best person to perform a rating? A treating physician, an “independent” coder, or the affected patient? In each of the four domains of the mJOA (motor function upper extremities, motor function lower extremities, sensory function, bowel/bladder control) there are subjective ratings involved with some room for interpretation. For those not inherently familiar with the mJOA, EBSJ/GSJ provides this score as reference in Table 1. For instance, a “slight difficulty” and a “marked difficulty” in buttoning a shirt is a difference of a score of 3 or 4 within the upper extremity domain. It stands to reason that for a violinist even a minor difficulty in hand use is a big deal while the same probably does not apply for an elderly long-time sufferer from advanced rheumatoid arthrosis deformans of the hands. For lower extremity functional assessments, how many physicians actually directly observe our affected patients while they climb stairs? It's a safe bet that most clinicians use some form of extrapolation to answer the lower extremity functional question: What are we to do with patients who have a non-neurologic form of gait impairment, such as joint contractures? Someone will need to abstract what the patient could do without a musculoskeletal comorbidity to meet the score. The differences of mild and severe loss of sensation and/or presence of pain are another example of a relatively subjective scoring, as is the determination of “mild or moderate micturition problems” (2 points) versus “marked problems with micturition” (1 point). It is not far-fetched to imagine a swing of 4 points or even more in the interpretation of patient symptoms if interested parties are invited to use the mJOA as a clinical determination tool. EBSJ/GSJ “up-” and “down-coding” have become a reality in health care in areas of reimbursement and risk profiling. Certainly the mJOA could be prone for variances in interpretation, once it is turned from the valued and time honored research tool that it hitherto has been into a key determinant of a health care formula. The other variables used in this formula are also subject to debate. Smoking is addressed as a dichotomous option, yet there are many subplots to this simple seeming category, such as time elapsed from smoking cessation—when does a smoker convert to a nonsmoker? For gait impairment, are orthopaedic lower extremity problems not associated with myelopathy reasons to be rated as “positive” in this category? As to duration of symptoms, this is a notoriously hard question, especially in areas which do not have the benefit of an elaborate primary care network, such as available in Canada. Patients frequently have a hard time providing an accurate personal health history narrative. This leads to the question: what other health dimensions might be missing in this CPR formula that are relevant for CSM patients? Why is there no role for a general health status coefficient or base line activity level? What about the patient reported perspective—shouldn't a CPR that relies on a provider derived scoring include a patient perspective, such as an EQ-5D?
Modified Japanese Orthopaedic Association (mJOA) score
With these questions, EBSJ/GSJ does not wish to criticize the authors for their contribution. The article raises important issues as health care is increasingly immersed into the data gathering and analytic capabilities of the information technology era. Without doubt, algorithm-based medicine is becoming an increasing reality for many. The presented Invited Commentary provides a thought provoking outlook on the potential use for algorithms in spine surgery and challenges us to find ways to become more consistent in our disease interpretation, decision making, and treatment of important and frequently overlooked disorders such as compressive myelopathy. EBSJ/GSJ looks forward to the comments of its readership on this topic.
Footnotes
Acknowledgments
Funding from AOSpine North America and International.
