Abstract
Introduction:
Total knee arthroplasty (TKA) is the preferred surgical treatment for end-stage osteoarthritis. However, substantial numbers of patients still experience poor outcomes. Consequently, it is important to identify which patient characteristics are predictive of outcomes in order to guide clinical decisions. Our hypothesis is that preoperative patient-reported outcome measures and radiographic measures may help to predict TKA outcomes.
Methods:
Using cohort data from the Osteoarthritis Initiative, we studied 172 patients who underwent TKA. For each patient, we compiled pre- and postoperative Western Ontario and McMaster University Arthritis Index (WOMAC) scores. Radiographs were measured for knee joint angles, femorotibial angles, anatomical lateral distal femoral angles, and anatomical medial proximal tibial angles; Kellgren and Lawrence (KL) grades were assigned to each compartment of the knee. All studied measurements were compared to WOMAC outcomes.
Results:
Preoperative WOMAC disability, pain, and total scores were positively associated with postoperative WOMAC total scores (
Conclusions:
All preoperative WOMAC scores demonstrated positive associations with postoperative WOMAC scores, while among the preoperative radiographic measurements only combined KL grades and joint angles showed any correlation with postoperative WOMAC scores. Higher preoperative KL grades and joint angles were associated with better (lower) postoperative WOMAC scores, demonstrating an inverse correlation.
Keywords
Introduction
Total knee arthroplasty (TKA) is an operation that has become the preferred surgical treatment for end-stage osteoarthritis (OA) of the knee. Furthermore, numerous studies have shown TKA to achieve favorable outcomes and functional gains for a great variety of patient groups. 1,2 Total knee arthroplasty has been shown to be highly cost effective, with a recent estimate of an US$11 548 cost per quality-adjusted life-year gained for patients with a mean age of 68 years, 3,4 and has excellent long-term success. Ranawat et al showed a 94.1% survivorship at 15 years for TKA. 5 Furthermore, TKA continues to be the only intervention, with highly significant gains in quality of life for those patients with severe knee OA. 6
With such high rates of efficacy and increasing demand, TKA operations have steadily grown in number to currently more than 600 000 cases per year in the United States, nearly doubling the per capita utilization of the procedure between 1991 and 2010. 7,8 Likewise, the annual number of TKA operations in the United States is expected to increase to nearly 3.5 million by 2030, with a majority of these operations performed for patients less than 65 years of age. 9,10 Consequently, much research has been done to better understand the patient-related factors that contribute to either favorable or poor patient-reported outcomes.
One of the more interesting findings of such recent studies is the overall effectiveness that TKA has had in patient groups that were previously suspected to be at higher risk of poor outcomes. Neither increased body mass index (BMI), increased age, nor gender has consistently proven to be associated with poor TKA outcomes. 2,11,12 However, there are a substantial number of patients who have unfavorable postsurgical function. 13,14 These patients may experience no improvement in symptoms after TKA and may even see a decline in overall knee function. 14,15 Likewise, it has been shown that up to 20% of patients may be dissatisfied with their TKA operation. 16
Although there will always be some unfavorable outcomes with any operation, there is a great interest in being able to preoperatively identify those patients who will have unsatisfactory outcomes after TKA. Many recent studies have sought to predict patient outcomes based on various preoperative patient characteristics, including radiographic findings, knee function scores, mental health, and socioeconomic status. 14,17 -24 Each of these variables has shown to have some predictive value but each has significant limitations. The situation is further complicated by the relatively weak association between symptoms of patients with OA and radiographic findings. 24 -26 Therefore, it is all the more important to determine which of these patient characteristics are comparatively most predictive of patient outcome to help develop clinical pathways to improve patient care and ultimate outcomes.
In order to better understand and delineate the compounding factors in TKA outcome, we decided to compare the predictive abilities of preoperative radiographic studies with those of patient-reported characteristics, such as the Western Ontario and McMaster Universities Arthritis Index (WOMAC), in determining TKA outcome. The purpose of our study is to simultaneously observe and compare preoperative radiological findings, patient-reported knee scores, and baseline characteristics in the context of postoperative outcomes. The study uses data from the Osteoarthritis Initiative (OAI), a National Institutes of Health multicenter cohort study of 4796 persons with or at high risk of developing knee OA. 27 We hypothesize that (1) preoperative patient-reported function scores will have a greater predictive value of surgical outcome than radiological measures and (2) that general radiological measures of OA, such as Kellgren and Lawrence (KL) grades, will be better predictors of outcome than any specific joint angle measurement.
Methods
The OAI is a publicly and privately funded multicenter, prospective, longitudinal observational study of 4796 patients who have either been diagnosed with clinically significant knee OA (progression subcohort), are at high risk of developing clinically significant knee OA (incidence subcohort), or were recruited as healthy controls. 27 All patients enrolled in the study were to attend a baseline visit for clinical evaluation, followed by subsequent annual visits. During each visit, clinical, radiological, and biomarker data were collected. Annual radiological data included a knee magnetic resonance imaging and a fixed flexion posteroanterior (PA) radiograph, among others. All data, including radiographic images, are publicly available on the Osteoarthritis Initiative website. 28
Using the data from the OAI database, our study’s inclusion criteria (Figure 1) included patients who (1) had a confirmed TKA operation (as confirmed through adjudicated medical records) during the course of the OAI study, (2) had OAI x-rays (PA) that confirmed the presence of a TKA, and (3) had at least 1 OAI study visit (baseline or annual visits) before the knee arthroplasty operation. We excluded any patients who (1) had no WOMAC score recorded beyond 6-month postsurgery or (2) had no OAI x-rays taken after the TKA operation. Using these criteria, 172 patients (comprising 174 total knee replacements) were included in our study.

Study inclusions and exclusions demonstrate the criteria used to select patients from the OAI study for our study; 172 people, comprising 174 knees, were included. OAI indicates Osteoarthritis Initiative.
Pre- and postsurgical WOMAC scores were compiled for each patient included in the study. All presurgical scores were obtained from the last possible recorded scores prior to the TKA. All postsurgical scores were taken from the last OAI visit for each individual patient, such that the longest follow-up period possible between surgery and evaluations was observed for each patient. We decided to use WOMAC scores as a measure of patient-reported outcome as this has been used and validated by various investigators in the past for evaluating both patients undergoing TKA and patients with OA in general. 13,17 -21,28 -30 Most comparisons used the WOMAC total score, the WOMAC disability (function) score, and the respective differences between preoperative and postoperative WOMAC scores, as we assessed these to be the most clinically relevant measures.
The WOMAC scores are reported such that higher WOMAC scores (0-96 scale for total) correlate with a more severe combination of OA symptoms. Changes in WOMAC scores are reported as decreases in WOMAC scores—such that a positive change in WOMAC scores represents a decrease in OA symptoms.
For each knee, the preoperative Knee injury and Osteoarthritis Outcome Score (KOOS) Quality Of Life (QOL) score from the last visit prior to TKA was compared to postoperative WOMAC scores. The KOOS QOL score is a patient-reported survey that reflects the perceived quality of life of the patient in regard to the specified knee.
Bilateral PA fixed flexion knee x-ray images were obtained during weight bearing as the patient stood in 20° to 30° of flexion (Rosenberg view), with feet internally rotated 10°. Both right and left knees were imaged together onto a 14 by 17-in film, using a focus-to-film projection of 72 in. The knees and feet were held in flexion and rotation using a plexiglass positioning frame. 27 These x-rays were used to measure pre- and postsurgical joint angle (Figure 2A), anatomical femorotibial angle (Figure 2B), anatomical medial proximal tibial angle (aMPTA; Figure 2C), and anatomical lateral distal femoral angle (aLDFA; Figure 2D) on the affected knee. Preoperative Kellgren-Lawrence (KL) grades were also assigned to medial and lateral compartments of the affected knee. For parts of the analysis, the scores for the medial and lateral KL grades were summed together in order to give an overall grade of radiographic OA in the observed knee. In order to specifically observe the condition of the knees prior to surgery and the immediate physical effect of the surgery, all preoperative images were from the last possible image set for each patient, while all postoperative images were from the first image set after the TKA. All radiological measurements were made by a single musculoskeletal radiologist and were reviewed by the senior author.

Joint line congruency angle: the angle between the distal femur joint orientation line (line drawn between the most distal points of each femoral condyle) and proximal tibia joint orientation line (line drawn along the tibial plateaus). In this example, it measures 0.8°. B, Anatomic femorotibial angle: the angle formed by the lines that pass through the center of the diaphysis along the length of each bone. In this example, it measures 173.5°. C, Anatomic medial proximal tibial angle: the angle between the proximal tibia joint orientation line (line drawn along the tibial plateaus) and the anatomic axis of the tibia. In this example, it measures 86.9°. D, Anatomic lateral distal femoral angle: the angle between the distal femur joint orientation line (line drawn between the most distal points of each femoral condyle) and the anatomic axis of the femur. In this example, it measures 82.2°.
Statistical Analysis
Data analysis was done using IBM SPSS Statistics Version 21 (Armonk, New York). The data were initially studied using Pearson correlation studies to find significant relationships between preoperative clinical measurements and surveys and postoperative outcomes, measured using total WOMAC and WOMAC subset scores.
For each patient, preoperative joint angle, anatomical femorotibial angle, KL grade, aMPTA, and aLDFA were compared to WOMAC outcomes to determine the association between radiographic observations and outcomes. Likewise, postoperative joint angle, anatomical femorotibial angle, aMPTA, and aLDFA were compared to postoperative WOMAC scores.
For the anatomical femorotibial angle, patients were divided into 2 groups depending on whether the angle was varus (angle > 174°) or valgus (<174°). For both the varus and the valgus groups, Pearson correlations were measured when compared to postoperative WOMAC total scores. The patients were also further subdivided into groups with valgus (<173°), varus (>178°), or neutral (>173° and <178°) femorotibial angles. These groups were determined using the work of Fang et al who showed that the standard deviation of postoperative femorotibial angles for 6070 knees with TKA was 2.4° to 7.2° of valgus. 31 Similar grouping was done for both aMPTA and aLDFA measurements. For aMPTA, patients with a less than 90° measurement were grouped as varus and those with above 90° were grouped as valgus. For aLDFA, patients with measurements below 82° were grouped as valgus and those with measurements above 86° were grouped as varus. Patients with aLDFA measurements of 82° to 86° were grouped as neutral. This grouping was done using the work of Ritter et al, which demonstrated significantly worse outcomes for knees with postoperative aLDFA measurements beyond 8° valgus. 32 For pre- and postoperative joint angle measurements, patients were separated into groups of 4° increments for analysis—these intervals simply represented what we considered to be low, middle, and high values. One-way analysis of variance (ANOVA) was used to test for significant differences in postoperative WOMAC scores between groups.
Results
The average preoperative WOMAC total score for the studied patients was 35.0 (range: 0-73 and standard deviation: 17.6) while the average postoperative WOMAC total score was 12.6 (range: 0-51.1 and standard deviation: 13.4), signifying an average change in WOMAC score of 21.5 (
Baseline Characteristics.
Abbreviations: BMI, body mass index; SD, standard deviation; N/A, not applicable; OAI, Osteoarthritis Initiative; preop; preoperative; postop; postoperative; TKA, total knee arthroplasty; WOMAC, Western Ontario and McMaster Universities Arthritis Index. aChange in WOMAC Total or WOMAC Disability is presented as a decrease in WOMAC score - i.e. a positive change would represent an overall decrease in score.
Preoperative Patient-Reported Measures
Among the patient-reported outcome measures (PROMs) used (Table 2), preoperative WOMAC disability, pain, and total scores were significantly associated with postoperative WOMAC total scores (
Associations Between Preoperative Patient Factors and Postoperative WOMAC Scores.
Abbreviations: BMI, body mass index; preop; preoperative; postop; postoperative; WOMAC, Western Ontario and McMaster Universities Arthritis Index.
a Change in WOMAC total or WOMAC disability is presented as a decrease in WOMAC score, that is, a positive change would represent an overall decrease in score. b
The preoperative KOOS QOL measure was negatively correlated with changes in WOMAC total (
Association Analysis of Radiographic Measurements
The preoperative femorotibial angle was not significantly associated with postoperative WOMAC total scores in patients with a varus or valgus femorotibial measurement (Tables 3 and 4). Neither the preoperative aLDFA nor the preoperative aMPTA was associated with postoperative WOMAC total or change in WOMAC score. The preoperative joint angle was positively associated with a change in WOMAC total scores (
Pearson Correlations Between Radiographic Measures and Postoperative WOMAC Scores.
Abbreviations: aLDFA, anatomical lateral distal femoral angle; aMPTA, anatomical medial proximal tibial angle; KL, Kellgren and Lawrence; WOMAC, Western Ontario and McMaster Universities Arthritis Index. a
Pearson Correlations Between Postoperative Radiographic Measures and Postoperative WOMAC Scores.
Abbreviations: aLDFA, anatomical lateral distal femoral angle; aMPTA, anatomical medial proximal tibial angle; WOMAC, Western Ontario and McMaster Universities Arthritis Index.
There were no significant associations between postoperative WOMAC scores and the postoperative joint angle, aLDFA, or aMPTA. The postoperative femorotibial angles of patients with varus or valgus measurements had no significant association with postoperative WOMAC total or change in WOMAC scores (Table 4).
Subgroup Analysis of Radiographic Measures
For patients grouped into 4° intervals of preoperative joint angle (Table 5), there was a significant difference in WOMAC total scores between the groups (
Preoperative Subgroups of Radiographic Measures.
Abbreviations: aLDFA, anatomical lateral distal femoral angle; aMPTA, anatomical medial proximal tibial angle; ANOVA, analysis of variance; KL, Kellgren and Lawrence; WOMAC, Western Ontario and McMaster Universities Arthritis Index. aThe group of knees with 4-8-degrees of joint angle was significantly different compared to the group with 0-4-degrees of joint angle with respect to WOMAC Total (
For postoperative radiographic measurements (Table 6), the 1-way ANOVA test demonstrated no significant difference in postoperative WOMAC scores between subgroups of postoperative femorotibial angle, aLDFA, or aMPTA. All patients had between 0° and 4° of postoperative joint angle so no further analysis was done using this data.
Postoperative Subgroups of Radiographic Measures.
Abbreviations: aLDFA, anatomical lateral distal femoral angle; aMPTA, anatomical medial proximal tibial angle; ANOVA, analysis of variance; WOMAC, Western Ontario and McMaster Universities Arthritis Index.
For patients separated into varus and valgus groups of preoperative femorotibial angle, we found no significant difference in postoperative WOMAC total or change in WOMAC total scores between groups. Similar results were seen with preoperative aMPTA varus and valgus grouping. One-way ANOVA testing did not show any significant difference in postoperative WOMAC total or change in WOMAC scores between aLDFA varus, neutral, or valgus groups.
Preoperative KL Grades
The medial KL grade by itself was not significantly associated with postoperative WOMAC scores although the lateral KL grade was positively associated with decreases in WOMAC total score, meaning that more severe OA in the lateral compartment (as determined by the KL grading scale) was associated with larger changes (or “gains”) for the patient (Table 3). The combined KL grade (formed by simple addition of the medial and lateral KL grade for the operated knee) was not significantly correlated with WOMAC total (
After grouping patients by their combined preoperative KL grade, 1-way ANOVA testing showed a significant difference in overall change in WOMAC total scores between groups (
Discussion
Addressing our first hypothesis, we found that preoperative WOMAC scores (especially the WOMAC total, WOMAC pain, or WOMAC disability scores) had a stronger correlation with postoperative WOMAC scores than any of the observed preoperative radiographic measures. Although some of the preoperative radiographic measures (ie, combined KL grades and joint angles) demonstrated a correlation with postoperative WOMAC scores, the strength of these relationships was substantially less than those seen with preoperative WOMAC scores and were limited to subsets of postoperative WOMAC disability and WOMAC pain (for KL grades) and changes in WOMAC total (KL grades and joint angles). Addressing our second hypothesis, of the preoperative radiographic measurements in our study, we found only KL grades and joint angles to show any correlation with subsets of postoperative WOMAC scores. Therefore, we conclude that these 2 preoperative radiographic measurements show the greatest correlation with postoperative outcomes among the common radiographic measures in our study although neither radiographic measurement was significantly correlated with
As expected, most patients in our study had an overall positive outcome after TKA surgery (average decrease in WOMAC total score was 21.5 points). We also confirmed the results of several previous studies that found preoperative PROMs to be positively associated with postoperative PROMs. 14,19,20,22 The observed relationship between pre- and postoperative PROMs described in this study may be a result of the continuous presence of detrimental factors, such as severe OA in other joints, other lower limb joint pain, and so on.
Slightly less intuitively understood are our results that show a strong association between preoperative WOMAC scores and greater decreases in WOMAC total and disability scores following TKA. Similar results were described by Fortin et al where the decrease between preoperative and postoperative WOMAC scores at 6 months showed a positive correlation with preoperative WOMAC scores. 20 As Fortin and colleagues suggested, these results seem to imply that the patients with worst PROMs have the most function to gain from TKA, since they have such poor pain control and function to begin with. This is not to say that these patients have better absolute outcomes since preoperative WOMAC totals are positively correlated with postoperative WOMAC totals but instead that these patients see the biggest change, or gain, in function. Consequently, these results allow for better clinical judgment in interpreting the worse outcomes associated with poor preoperative PROMs—while a patient with poor preoperative function may never reach the same absolute postoperative “score” as a patient with a higher preoperative function, they may likely experience greater relative gains that may translate to higher relative patient satisfaction and so should not necessarily be deterred from pursuing TKA surgery.
Radiological measurements showed interesting results in terms of their relation to the postoperative WOMAC scores. Larger preoperative joint angles were correlated with higher decreases in WOMAC total, again likely reflecting the possibility for larger improvements that patients with more severe knee OA (or more deformed knee joints) have. When grouped by preoperative joint angle, the patients with 4° to 8° of preoperative joint angle had the lowest postoperative WOMAC total scores (representing the best outcomes). The group of patients with only 0° to 4° of joint angle had the highest postoperative WOMAC scores (representing the worst outcomes). This finding suggests a relationship between preoperative joint angle and postoperative PROMs such that patients with joint angles in the 4° to 8° range typically have better TKA outcomes than patients with joint angles closer to 0. Furthermore, such a finding seems to suggest that patients with greater preoperative joint angles tend to have better TKA outcomes in general. However, the lack of significant differences between the 4° and 8° joint angle group and the 8° to 12° joint angle group suggests that this relationship may be nonlinear.
None of the postoperative radiographic measurements showed any association with the postoperative WOMAC measured outcomes. This may be due to the relatively small variation in postoperative radiographic measures seen between patients following TKA operations. For instance, the largest joint angle measured postoperatively was 2.1°, demonstrating a small range and far less variation. In contrast to our findings, Fang et al showed that patients with postoperative femorotibial angles greater than 7.2° of valgus or less than 2.4° of valgus had significantly worse outcomes than those in between these angles (neutral). 31 Likewise, Ritter et al showed that patients with less than 90° of postoperative aMPTA or greater than 8° valgus of postoperative aLDFA had significantly worse outcomes than patients with more neutral aMPTA and aLDFA measurements. 32 However, both of these studies observed more than 6 000 knees, lending far more power to the analysis. Consequently, the lack of significance in our study regarding these measurements may simply indicate the smaller study sample.
One of the more interesting results of our study was the association between preoperative KL grades and the postoperative WOMAC scores. When using medial or lateral compartment KL grades individually for each patient, only the lateral KL grade was shown to correlate with positive changes in WOMAC total scores. Just as with preoperative PROMs, this could be the result of simply more function to gain for patients with more severe knee OA initially, especially since lateral compartment involvement is generally related to later progression of OA. 24 Furthermore, when the KL grades were combined from each compartment, the results showed significant negative correlations for WOMAC disability and pain scores as well as stronger positive associations between KL grades and overall change in WOMAC total and disability, indicating that patients with more severe preoperative radiographic OA had greater overall improvement as measured by PROMs.
Importantly, the combined KL grades demonstrated an inverse correlation with absolute postoperative WOMAC disability and pain scores, meaning that higher KL grades (representing more severe radiographic OA) were shown to be associated with lower (less symptomatic) absolute postoperative WOMAC scores. Valdez et al described similar results when they compared radiographic severity to postoperative outcome, reporting lower absolute postoperative WOMAC scores when preoperative KL grades were higher. 28 Although this relationship seems to be contradictory to that observed between pre- and postoperative PROMs, the differences may, at the least, reflect the underlying, fundamental differences between radiographic measures of OA and patient-reported symptoms of OA. As Dowsey et al showed with their work comparing preoperative PROMs to radiographic measures of OA severity, there is surprisingly little association between radiographic assessments of OA and symptoms reported by the patient. 24 Therefore, it seems possible that the relationship between radiographic OA severity (as determined by KL grading) and postoperative outcomes might be relatively unrelated to the relationship seen between preoperative PROMs and postoperative outcomes. Further work is needed to more fully understand this relationship.
There are some limitations to this study. First, although the sample size of 172 patients was not insufficient for testing, a larger sample size would have provided more reliable and generalizable results. Second, due to the nature of the OAI cohort data, and our wish to use the longest follow-ups possible for WOMAC measurements, the time between the TKA operation and the postoperative WOMAC score used was highly variable. We considered this variability acceptable since we were judging eventual outcomes and not specific progression; however, a more uniform follow-up time would have provided more precise results. Finally, we realize that the WOMAC score is only one means of measuring patients’ outcomes after TKA; consequently, our results are limited in scope in this regard.
In concurrence with previous works in this area, we found preoperative PROMs, especially preoperative WOMAC scores, to be positively associated with postoperative WOMAC scores. Overall, our findings indicate that preoperative PROMs are more reliably associated with postoperative PROMs than are preoperative radiographic measurements. We also demonstrated that some preoperative radiographic measures, such as KL grades or joint angles, are inversely correlated with postoperative WOMAC scores although these relationships were only found regarding postoperative WOMAC disability and WOMAC pain scores (for KL grades) and overall change in WOMAC total (for KL grades and joint angles). However, the relationship between radiographic measurements and TKA outcomes deserves further investigation.
Footnotes
Acknowledgments
The OAI is a public–private partnership comprised of 5 contracts (N01-AR-2-2258; N01-AR-2-2259; N01-AR-2-2260; N01-AR-2-2261; N01-AR-2-2262) funded by the National Institutes of Health, a branch of the Department of Health and Human Services, and conducted by the OAI Study Investigators. Private funding partners include Merck Research Laboratories; Novartis Pharmaceuticals Corporation, GlaxoSmithKline; and Pfizer, Inc. Private sector funding for the OAI is managed by the Foundation for the National Institutes of Health. This article was prepared using an OAI public use data set and does not necessarily reflect the opinions or views of the OAI investigators, the NIH, or the private funding partners.
Authors’ Note
All authors contributed to the design of the study, the collection of data, the analysis and interpretation of the data, the drafting and revision of the article, and the final approval of the article.
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article. The senior author is on the editorial board for Journal of Arthroplasty, serves on the American Academy of Orthopaedic Surgeons, and is also a paid consultant of Smith and Nephew and holds stocks in Pristine. There are no other potential conflicts of interest.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
