Abstract
Study Design
Bibliometric analysis.
Objective
To determine trends, frequency, and distribution of patient-reported outcome instruments (PROIs) in degenerative cervical spine surgery literature over the past decade.
Methods
A search was conducted via PubMed from 2004 to 2013 on five journals (
Results
From 19,736 articles published, 241 articles fulfilled our study criteria. Overall, 53 distinct PROIs appeared. The top seven most frequently used PROIs were: Japanese Orthopaedic Association score (104 studies), visual analog scale for pain (100), Neck Disability Index (72), Short Form-36 (38), Nurick score (25), Odom criteria (21), and Oswestry Disability Index (15). Only 11 PROIs were used in 5 or more articles. Thirty-three of the PROIs were appeared in only 1 article. Among the included articles, 16% were of level 1 evidence and 32% were of level 4 evidence.
Conclusion
Numerous PROIs are currently used in degenerative cervical spine surgery. A consensus on which instruments to use for a given diagnosis or procedure is lacking and may be necessary for better communication and comparison, as well as for the accumulation and analysis of vast clinical data across multiple studies.
Keywords
Introduction
Patient-centered health measures have been recognized as valuable and useful tools to report and quantify the efficacy of surgical procedures. 1 , 2 , 3 Patient-centered health measures also enable outcome comparisons of cost-effectiveness across studies. Currently, there are many kinds of patient-reported outcome instruments (PROIs) in the field of cervical spine surgery without an established consensus as to which instrument should be used for a particular diagnosis or procedure. 4 Without a consensus, long-term efficacy or cost-effectiveness of different treatments becomes difficult to assess.
The purpose of this study was to determine the frequency and distribution of various PROIs used in the degenerative cervical spine surgery literature and to identify the trends of use of these instruments over the past decade.
Materials and Methods
A search was conducted on PubMed for publications from 2004 to 2013 in five major orthopedic and spine journals—
All the included articles were then thoroughly reviewed and analyzed for their journal, publication date, study design, level of evidence (LOE), and preoperative and 2-year follow-up scores for the most commonly used instruments. The LOE was assigned based on the definitions by the Oxford Center for Evidence-Based Medicine, and only articles with LOEs of 1 to 4 were included. 5 This study did not need Institutional Review Board approval.
Results
From 19,736 articles published,
5
we identified 1,079 articles reporting the use of PROIs in the field of spine surgery. Of these articles, 241 met our study criteria and were included (Table 1). Articles published per journal in descending order were

The contribution of each journal to the total of included articles. Abbreviations: EuroSpine,
The number of studies that contained patient-reported outcome instruments among the five journals studied
An upward trend in PROI usage was observed over the past decade (Fig. 2). Of note, about one third of these articles (30.7%, 74 articles) were published in the previous 2 years (2012 to 2013).

Overall trend of the use of patient-reported outcome instruments (PROIs) from 2004 to 2013 in the field of degenerative cervical spine surgery.
The five most used outcome measures in descending order were: the Japanese Orthopaedic Association (JOA) score (104 studies, 43.2%), visual analog scale (VAS) for pain (100 studies, 41.5%), Neck Disability Index (NDI; 72 studies, 29.9%), Short Form-36 (SF-36; 38 studies, 15.8%), and Nurick score (25 studies, 10.4%; Fig. 3). 6 , 7 , 8 , 9 , 10 , 11 , 12 Of the 53 total PROIs, only 11 were used in 5 or more articles (Table 2). Thirty-three of the PROIs appeared in only a single article. One third of the articles were of LOE 4 (32.0%), and only 16.4% of all articles were of LOE 1 (Fig. 4).

Frequency of appearance of each patient-reported outcome instrument in articles from 2004 to 2013.

Distribution of the level of evidence.
Most commonly used PROIs
Abbreviations: EQ-5D, Euro-Qol-5D; JOA, Japanese Orthopaedic Association; NDI, Neck Disability Index; ODI, Oswestry Disability Index; PROI, patient-reported outcome instrument; SF-12, Short Form-12; SF-36, Short Form-36; SOQ, Cervical Spine Outcomes Questionnaire; VAS, visual analog scale.
Note: Studies used more than 4 times with the number of appearance and % of appearance among total 241 studies. Besides these PROIs, 2 scoring systems were used 3 times, 6 systems were used twice, and 33 systems were used only once. In total, 465 PROIs appeared in 241 studies.
Discussion
With the advent of evidence-based medicine, research in the field of spine surgery increasingly requires objective statistical analyses of the treatment outcomes. Accordingly, researchers have responded by turning to PROIs to evaluate the health care outcomes. 1 , 2 , 3 PROIs are powerful tools for evaluating clinical results and objectively quantifying outcome success. In this study, the use of PROIs in degenerative cervical spine surgery was found to be increasing in five top-impact orthopedic and spine surgery journals in the past 10 years. Among the PROIs used in these journals, ∼30% occurred in the previous 2 years (2012 to 2013) and ∼41% in the previous 3 years (2011 to 2013). 6 , 7 , 8 , 9 , 10 , 11 , 12 Among the 241 articles, PROIs were used 465 times, a mean of 1.9 PROIs per study. Among the top PROIs, the JOA and NDI were used the most over the past decade. Although increasing, the SF-36 was not utilized significantly over the past decade despite the value of this health-related quality-of-life instrument in calculating quality-adjusted life-years and analyzing cost-effectiveness. Given that cost-effectiveness studies will be of increasing interest in the coming years, investigators should consider using the SF-36 or other validated scales with the capability of providing cost-effectiveness data (Fig. 5).

Trend of the use of each top-five patient-reported outcome instruments: (A) (JOA), (B) visual analog scale (VAS), (C) Neck Disability Index (NDI), (D) Short Form-36 (SF36), (E) Nurick score.
There are many different PROIs available, which often makes the choice of which to use unclear. 4 , 13 , 14 In fact, in the past 10 years of degenerative cervical spine surgery studies, 53 unique PROIs were used. Many of these PROIs (33) were used only once. Two factors are mainly responsible for the diversity of PROIs used. First, cervical spine disease can present with various clinical symptoms ranging from localized neck pain to systemic myelopathy, and therefore a single scoring system may not be sufficient to address all cervical spine outcomes. Second, no recommendation or clear guideline exists for the selection of PROIs in cervical spine studies, leaving investigators to choose from numerous existing PROIs or design their own custom tool. Although PROI use is increasing overall, the wide range of PROIs utilized complicates comparisons that we would hope to draw between studies.
Outcome measures are classified into three general categories: general quality of life, pain, and disease-specific outcomes. 8 Hunt and Hurwit recommended the use of combined disease-specific and general health PROIs. 15 For example, among the top seven PROIs, the SF-36 is a general health measure, the VAS is a pain scale, and the JOA, NDI, Nurick, and ODI are disease-specific measures. As a clinical scoring system that can be utilized widely, the ideal PROI must be valid, reliable, and responsive. 15 , 16 Multiple studies validated these PROIs by demonstrating a correlation with other PROIs—SF-36 in comparison with MDI and VAS 7 ; SF-36 with VAS, NDI, and MDI 17 ; SF-36 with VAS and Nurick 18 , 19 ; SF-12 with SF-36 20 ; Nurick with modified JOA and JOA. 10 , 21 , 22 In addition, interobserver and test–retest reliability of these most frequently used PROIs have been surveyed including for JOA, 23 NDI and VAS, 24 SF-36, 17 , 25 and SF-12. 26
Disease-Specific Outcome Measures
As mentioned previously, cervical spine pathology can present with diverse clinical symptoms and requires the appropriate choice of PROI for each diagnosis. Kalsi-Ryan et al recently published a statement regarding the use of PROIs for cervical spondylotic myelopathy. 27 They recommended that reliable, valid, and responsive PROIs be used, as well as ancillary measures to record and evaluate patients’ function. In particular, the authors recommended that the modified JOA, Nurick grade, NDI, Myelopathy Disability Index (MDI), and 30 m Walk Test should be used with the ancillary measures such as the shortened Disabilities of the Arm, Shoulders and Hand questionnaire; Berg Balance Scale; Graded Redefined of Strength Sensitivity and Prehension; Grip Strength; and GAITRite analysis. 27 Among the disease-specific PROIs, we recommend the JOA scale because it provides comprehensive coverage of myelopathic symptoms including bowel and bladder functions, as opposed to the Nurick, NDI, MDI, and 30 m Walk Test, which only assess upper or lower extremity function. The JOA score was also the most frequently used PROI in this study.
When it comes to cervical radiculopathy, however, the choice may be different from those for cervical myelopathy given the difference in symptoms. The questionnaires should contain items that can distinguish radicular arm pain from axial neck pain. From this perspective, the Nurick score that classifies radicular-only symptom as grade 0 will not be suitable nor sufficient to describe this group of patients well. Although the modified JOA and the European Myelopathy Scale are widely accepted as reliable and responsive instruments, they are designed to assess long tract signs including lower extremity, bowel, and bladder functions more than upper extremity symptoms and thus may not accurately reflect the intensity or change of cervical radiculopathy.
Cleland et al and Young et al studied the usefulness of the NDI, Patient-Specific Functional scale, and Numeric Pain Rating Scales in patients with radiculopathy. 24 , 28 Although they came to different conclusions regarding the test–retest reliability of the Patient-Specific Functional scale, these investigators thought there was fair reliability for both the NDI and Numeric Pain Rating Scales, and adequate responsiveness for all three scales. Among these scales, the NDI is currently widely used, has been proven valid and reliable in multiple studies, and thus may be an ideal choice for cervical radiculopathy. 29 , 30
General Health Questionnaires
Among the most frequent PROIs in this study, the general health questionnaires were the Euro-Qol-5D (EQ-5D), SF-36, and Odom criteria. The Odom criteria focus on the patients’ recovery of their preoperative symptoms following surgical intervention and do not truly assess their general health. 12 In our analysis, we observed a sharp decline in its use, and the Odom criteria may eventually be replaced by another general health PROI such as the EQ-5D.
The EQ-5D is a relatively new five-item scale that assesses health in terms of mobility, self-care, usual activities, pain/discomfort, and anxiety/depression. Its brevity and simplicity are the main advantages. Use of the EQ-5D is currently on the rise, and we may see more of this PROI in the future.
The SF-36 is older than the EQ-5D and is another well-established questionnaire for general health. It consists of 36 items. The SF-36 evaluates functional status (19 items), well-being (11 items), general health perception (5 items), and health change over the past year (1 item). This questionnaire is comprehensive and allows for a sensitive and detailed analysis, but compared with the EQ-5D, it is significantly more complex and time-consuming.
Given the above, the EQ-5D may be a reasonable choice to measure generic health as it covers the most important health-related quality-of-life dimensions for musculoskeletal diseases in five simple questions. 31 However, one weakness of the EQ-5D is the ceiling effect due to its relative simplicity. In one study, 32% of patients reached highest or lowest values at follow-up. 32 The large ceiling effect for the EQ-5D suggests that it would be more suitable for measuring the health of populations with more morbidity, where the distribution of the total score is less skewed. 33 For example, elderly responders tend to have multiple morbidities and more difficulties with answering complicated questions. The EQ-5D may be more appropriate when researchers target this kind of patient group.
To simplify the SF-36 without substantial loss of information, the eight-scale profile was reduced to two summary measures of health status, creating the SF-12. 33 The SF-12 contains questions regarding physical functioning, role-physical, role-emotional, mental health, bodily pain, vitality, social functioning, and general health. The former four have two items each, and the latter four have one item each. Jenkinson et al compared the SF-12 and SF-36 scores in various patient groups and concluded that these two instruments indicated the same magnitude of ill health and degree of change over time and that wherever two summary scores of health status are adequate, the SF-12 may be the instrument of choice. 34 For surveying musculoskeletal diseases including spinal pathologies, the SF-12 may be adequate. In addition, a comparison study by Johnson and Coons showed that the SF-12 was more sensitive than the EQ-5D. They found that for the subjects who reported no problems on the EQ-5D dimensions, the SF-12 component scores were still able to differentiate people with and without specific chronic medical conditions. 33 We believe either SF-12 or EQ-5D can be employed for general health surveys for degenerative cervical spinal pathologies with the understanding of its benefits and limitations.
Our study has a few limitations that the authors would like to acknowledge. The main one is that we did not include neurosurgical journals in our review. However,
Conclusions
The breadth of PROIs in degenerative cervical spine surgery research is extensive. A consensus on which instruments should be used for a given diagnosis or procedure may be necessary for better communication and comparison, as well as for the accumulation and analysis of vast clinical data across multiple studies. Additionally, validity and reliability studies for several of the top PROIs are incomplete and warrant further research as their PROI usage continues to increase. For degenerative cervical spine surgery, we recommend a combination of general health and cervical spine disease-specific PROIs, along with additional scales for pain (e.g., VAS) on an as-needed basis. Moreover, a consensus on PROI use in cervical spine surgery may be necessary to improve consistency and allow for interstudy comparisons in the future.
Disclosures
Haruki Ueda, none
Holt S. Cutler, none
Javier Z. Guzman, none
Samuel K. Cho, Consultant: Stryker
Footnotes
Acknowledgment
This study was sponsored by the AOSpine Research Commission Mentoring Program.
