Abstract
Methods:
We investigated 15-year patient-reported outcomes (PROMs) and their predictors in primary total hip arthroplasty (THA) for osteoarthritis using a cemented flanged cup and stem from a regional joint registry in New Zealand. Regional data were collected for all primary THAs with this cemented combination from 1 January 2003 to 30 June 2023 who had recorded PROMs on at least 1 occasion (n = 263). PROMs included Oxford Hip Score, Western Ontario and McMaster Universities Arthritis Index and Veterans Rand-12, evaluated against patient age, ethnicity, sex, body mass index (BMI), funding pathway, and American Society of Anesthesiologists (ASA) rating.
Results:
Significant improvements across preoperative PROMs were noted 1-year post-surgery, with a mean change above 23 in the Oxford Hip Score maintained at 5, 10, and 15 years (p ⩽ 0.001).
Conclusions:
Regression analysis indicated that being female, public funding, and higher BMI were associated with worse preoperative PROMs. Poorer preoperative scores, older age and ASA 3 rating correlated with poorer postoperative outcomes.
Introduction
Surgical success for joint replacement procedures has long been evaluated using implant survivorship models. 1 With current healthcare shifting focus to patient well-being and quality of life, 2 patient-reported outcome measures (PROMs) have become an important metric in quantifying surgical success. PROMs allow surgeons to quantify patients’ health-related quality of life and make decisions regarding optimal patient care, shifting the focus from surgeon-centred to patient-centred models of care.
The cemented, all-polyethylene flanged cup we chose to examine in this study, the Exeter Contemporary Flanged cup (Stryker Orthopaedics, Mahwah, NJ, USA), has been in use since 1991 and was the cemented cup most used in New Zealand (NZ) between 2003 and 2013.3,4 Schmitz et al. 5 reported good results for this cup in THA patients under 50 years with a component survival rate of 90% after 10 years. Furthermore, Maggs et al. 6 identified a 97.8% survivorship at 12.5 years in patients 46 years and above. In environments where THAs are completed in high volumes, the polished, stainless steel, tapered, and collarless cemented stem, the Exeter V40 stem (Stryker Orthopaedics) we are examining in this study, has demonstrated excellent functional outcomes based Western Ontario and McMaster Universities Arthritis Index (WOMAC) scores and 97.8% survivorship at 10 years.7,8 This stem has a reported 15-year survivorship of 96.6% (confidence interval [CI] 95%, 98.3–94.8) and rare periprosthetic fracture rates of 0.6% (CI 95%, 0.3–0.9) at 15 years, 9 although this may be underestimated in studies where periprosthetic fractures requiring alternative reoperations (such as open reduction and internal fixation, and where there is no stem exchange),10–12 have not been reported. In NZ, over 54,000 of these particular stems were used between 1999 and 2022 and currently represent 60% of all stems used in the UK.4,8 This cemented combination maintained a top 10 position of all hip combinations implanted in NZ between 2003 and 2013.
Although studies have reported survivorship of this Exeter Contemporary Flanged cup with Exeter V40 stem (Stryker Orthopaedics) combination, 6 few have examined PROMs. A more thorough investigation of this cemented cup-and-stem combination specific to osteoarthritis that includes short-, medium- and long-term PROMs evaluated against patient characteristics, comorbidities, and funding pathways may identify specific predictors affecting PROMs and populations susceptible to poorer outcomes.
Prospectively gathered data from a regional registry were used to examine preoperative, 1-year, 5-year, 10-year, and 15-year PROMs in primary THA for osteoarthritis with this cup-and-stem cemented combination against patient characteristics age, sex, and body mass index (BMI); comorbidities based on American Society of Anesthesiologists (ASA) rating; and funding pathways. Oxford Hip, 13 WOMAC, and health-related quality of life Veterans Rand 12-item (VR-12) 14 scores were analysed. The VR-12 has been validated by cross-system comparisons. 15
We aimed to report PROMs for this implant combination up to 15 years, establish a benchmark methodology for future studies of other implant combinations in this region, and determine predictors of these PROMs to guide THA management in this region of interest.
Method
Data source and collection
Tauranga Orthopaedic Research (TOR) is a non-profit organisation in the Bay of Plenty region of NZ which maintains a mandatory registry of all regional joint arthroplasty surgeries. The TOR registry is independent of the NZ National Joint Registry and includes procedures from 2 public and 1 private hospital. Data are collected with patient consent and entered into the Standardized Orthopaedic Cartilage Repair and Treatment Evaluation Software (SOCRATES version 3.5.8.26 10150).
The TOR registry collects Oxford, WOMAC, and VR-12 as their routine standard of care for THA preoperatively, at 1 year postoperatively, at 5 years postoperatively, and every 5 years thereafter. The University of Waikato’s Human Research Ethics Committee granted ethical approval to analyse the database for this study (HREC2023#12).
Anonymised prospective observational data from 1 January 2003 to 30 June 2023 were retrieved in July 2023. Data were eligible for analysis when patients underwent primary THA for osteoarthritis and received an Exeter Contemporary Flanged cup with Exeter V40 stem combination within the set observational period. Patient data needed to contain at least 1 of the PROMs at any of the timepoints of interest to meet inclusion. A pairwise deletion approach was implemented to handle missing PROM scores at any timepoint (questionnaires not returned, incomplete questionnaires or withdrawal from the study at that timepoint) so that all available data were used in statistical analysis (Figure 1). 16

Flow chart depicting sample sizes, deaths and revisions at each follow-up period. All unaccounted observations were those lost to follow-up (questionnaires not returned, incomplete questionnaires or withdrawal from the study at that timepoint). *See Supplemental Table 2 for detailed reasons for revision and revision type.
Outcomes
We considered preoperative PROM data as scores recorded at any point preoperatively; 1-year data as scores recorded from 10–14 months postoperatively; 5-year data as scores recorded from 4–6 years postoperatively; 10-year data as scores recorded from 9–11 years postoperatively; and 15-year data as scores recorded from 14–16 years postoperatively. Although 15-year data were available for some patients, not all measures of interest had observations. Consequently, comparison of PROMs up to 10 years postoperatively were included in subgroup analysis.
Predictors
The predictors extracted from the database included patient age, sex, BMI, ethnicity, ASA rating, and funding pathway. Age and BMI were treated as continuous variables. Other predictors were treated as categorical or rank variables, and included sex (female, male), funding pathway (public, private), ethnicity (NZ European, Māori, Other), and ASA rating (a single ASA 4 observation was recoded as ASA 3 for analysis). “Other” ethnicity included “undetermined”, “undisclosed”, and any ethnicity not recorded as Māori or NZ European.
Data analysis
Data were inspected and normality of distribution verified. Descriptive statistics were computed for all variables, including counts, means, standard deviations (SD), and ranges (minimum, maximum). Continuous numeric dependent variables were compared using 2-sample t-tests when normally distributed. Normal sampling mean distribution was assumed in samples ⩾30 according to central limit theorem given parametric tests produce more accurate and precise estimates with higher statistical powers. 17 Where variances were unequal, Welch 2-sample t-tests were performed. Analysis of variance (ANOVA) tests were performed to compare predictors with more than 2 categories, dependent on ANOVA assumptions being met. 18 The Tukey method was used in post-hoc testing.
Multiple linear regression analyses were performed on preoperative and postoperative PROMs to determine if any of the predictors correlated to any of the PROMs. The Bayesian Information Criterion was used to determine best model fit. 19 Estimates and coefficients of determination (R2) were extracted from the regressions to quantify the association between predictors and outcomes, as were standard errors to quantify uncertainty of estimates. Possible predictors of preoperative scores included age, ethnicity, sex, funding pathway, BMI, and ASA rating. Postoperative predictors included the same factors, but also preoperative scores as these have been identified as possible predictors of postoperative scores. 20 Collinearity was investigated with variance inflation factor and condition index. Significance levels were set to p < 0.05 and confidence intervals to 95% (lower, upper). All statistical tests were performed using RStudio 2023.09.1 build 494 “Desert Sunflower” statistical software. 21
Results
A total of 388 THAs met inclusion for the defined period. Observations that did not have at least 1 PROM recorded were excluded, leaving 263 THAs for analysis (Table 1), with follow-up rates of 91%, 68%, 53%, and 26% at 1, 5, 10, and 15 years, respectively. Of the 263 observations with at least 1 PROM recorded, females accounted for 54.3% (n = 143), ASA 2 rated patients for 62% (n = 164) and NZ Europeans for 92.1% (n = 242). The mean age of patients at surgery was 71.4 years (range 46–92) and mean BMI 28.01 kg/m2 (range 18.2–50.4). Five males and seven females received bilateral THAs. The standard 150-mm stem length accounted for 92.3% of all stems implanted, with 125-mm accounting for 4.6% and 205-mm at 0.7%. Stem length was not recorded in 2.4% of observations. No statistically significant differences in mean age, BMI or factor proportions between the total sample of 388 and the evaluated 263 observations were noted (Supplemental Table 1). Seventeen revisions during the observation period were noted. (Supplemental Table 2).
Descriptive statistics of preoperative patient characteristics for observations with at least 1 patient-reported outcome measure.
ASA, American Society of Anesthesiologists Physical Status classification; BMI, body mass index; NZEU, New Zealand European.
The values are given as the mean and the standard deviations.
Values are given as the number of patients, with percentage (number of patients divided by the total) in parentheses. Due to missing values, some categories do not equal the total number of patients for each column.
Patient-reported outcome measures
Descriptive statistics of PROMs, including the subscales of both WOMAC and VR-12, are reported in Table 1 for preoperative, 1-, 5-, 10-, and 15-year timepoints. 1-way ANOVA revealed statistically significant differences in mean Oxford, WOMAC, and VR-12 scores between at least 2 timepoints. Post-hoc testing revealed preoperative PROMs were statistically significantly different to their respective 1-, 5-, 10-and 15-year postoperative PROMs. There were no significant differences in postoperative PROMs between their respective 1-, 5-, 10-, and 15-year timepoints. Mean postoperative VR-12-physical scores for the sample were worse with scores less than the standardised T-score metric mean of 50 (Standard deviation = 10) across timepoints. In contrast, mean postoperative VR-12-mental scores were better, with scores greater than the standardised score of 50.
All mean preoperative PROMs were statistically significantly poorer in females than males (p ⩽ 0.041) (Table 2). Postoperatively, WOMAC-total and function scores for females were also statistically significantly poorer 1-year and 5 years postoperatively, as were WOMAC-motion and VR-12-physical scores 1-year postoperatively. Males and females showed no statistically significant difference in postoperative Oxford, WOMAC-pain, or VR-12-mental scores at 1, 5, and 10 years postoperatively.
Descriptive statistics of patient-reported outcomes.
Diff, difference; PROM, patient-reported outcome measure; CI, confidence interval; WOMAC, Western Ontario and McMaster Universities Arthritis Index; VR-12, Veterans Rand 12-item heath survey.
Oxford Hip Score minimum is 0 and the maximum score is 48 with poorer scores being closer to 0.
WOMAC minimum scores are 0 with 0 being the better score.
VR-12 ideal score is ⩾50 and acceptable is within 1 standard deviation either side.
Note: Post hoc p-value represents the pairwise comparison of the previous timepoint to the current timepoint.
Significance is set at p < 0.05.
Publicly funded patients had statistically significantly poorer preoperative PROMs and poorer postoperative PROMs than privately funded patients across most outcomes. Mean WOMAC-pain at 10 years, VR-12-mental at 1-year, and WOMAC-motion at 1, 5, and 10 years postoperatively were still poorer in publicly funded patients, although not reaching statistical significance (p ⩾ 0.053) (Table 3).
Male versus female patient-reported outcomes.
Diff, difference; PROM, patient-reported outcome measure; WOMAC, Western Ontario and McMaster Universities Arthritis Index; VR-12, Veterans Rand 12-item heath survey.
Oxford Hip Score minimum is 0 and the maximum score is 48 with poorer scores being closer to 0.
WOMAC minimum scores are 0 with 0 being the better score.
VR-12 ideal score is ⩾50 and acceptable is within 1 standard deviation either side.
Significance is set at p < 0.05.
Note: Post hoc p-value represents the pairwise comparison of the previous timepoint to the current timepoint.
Public versus private patient-reported outcomes.
Diff, difference; CI, confidence interval; PROM, patient-reported outcome measure; WOMAC, Western Ontario and McMaster Universities Arthritis Index; VR-12, Veterans Rand 12-item heath survey.
Oxford Hip Score minimum is 0 and the maximum score is 48 with poorer scores being closer to 0.
WOMAC minimum scores are 0 with 0 being the better score.
VR-12 ideal score is ⩾50 and acceptable is within 1 standard deviation either side.
Significance is set at p < 0.05.
Note: Post hoc p-value represents the pairwise comparison of the previous timepoint to the current timepoint.
1-way ANOVA revealed no statistically significant differences in any preoperative PROMs, postoperative WOMAC-pain, or postoperative VR-12-mental scores at any timepoint between ASA ratings. However, postoperative Oxford, WOMAC-function, WOMAC-motion, and VR-12-physical scores were typically poorer with higher ASA ratings at various timepoints. There were no statistically significant differences in any outcomes between ethnicities.
PROM predictors: preoperative
No statistically significant collinearity between predictors was identified. Regression analysis indicated preoperative Oxford and WOMAC scores were negatively influenced when funding pathway was public, patients were females, and BMI was greater (Table 5). Preoperative scores for VR-12 were also negatively influenced when funding pathway was public, and patients were females.
Multiple linear regression results for preoperative patient-reported outcomes.
ASA, American Society of Anesthesiologists physical status classification; PROM, patient-reported outcome measure; STD, standard; WOMAC, Western Ontario and McMaster Universities; Arthritis Index; VR-12, Veterans Rand 12-item health survey.
Significance set at p < 0.05.
PROM predictors: postoperative
As ANOVA testing of PROMs revealed no statistically significant difference in scores from 1-year to 15 years postoperatively, 1-year scores were used as the main outcome in regression analysis given the greater number of observations. As sex, funding pathway, and BMI were significant predictors of preoperative scores, they were removed as postoperative predictors. Therefore, regression models considered preoperative score, age, ethnicity, and ASA rating. Preoperative score was a statistically significant predictor of postoperative score for all PROMs (Table 6). Increased age and ASA 3 category were additional statistically significant predictors of poorer postoperative WOMAC-function and VR-12-physical scores.
Multiple linear regression results for postoperative patient-reported outcomes.
PROM, patient-reported outcome measure; VR-12, Veterans Rand 12-item health survey.
Significance set at p < 0.05.
Discussion
Our results indicate that primary THA with the specified cemented cup-and-stem combination for osteoarthritis improves PROMs in patients at 1-year postoperatively and that improvement is maintained in those without revision to 15 years post-surgery. Patients who were female, in the public system, and who had greater BMIs presented with poorer preoperative PROMs. Poorer preoperative PROMs negatively influenced postoperative scores across timepoints. For some outcomes, postoperative PROMs were negatively affected by higher comorbidity status (based on ASA rating) and increased age at surgery.
The substantial improvement in PROMs seen post-surgery up to 15 years with this cemented cup-and-stem combination sets a benchmark against which PROMs from other cemented or alternative implant combinations in the same or similar populations can be measured. Mean 1-, 5-, and 10-year Oxford scores were over 41, corresponding to a Kalairajah category 1 (excellent) rating. 22 Mean 15-year Oxford scores were 39.44, corresponding to a category 2 (very good) rating. 22 Poor 6-month Oxford scores have been identified as a risk factor for early revision within 2 years; 23 thus, our implant combination’s Oxford score suggests low early-revision risk, which needs confirmation via survivorship analysis. The mean difference in pre- and postoperative Oxford scores (>23) is substantially higher for this implant than the estimated minimally important change value of 11, 24 suggestive of meaningful improvements for patients.
We identified preoperative PROMs as significantly influencing postoperative PROMs, consistent with previous studies.25,26 Poor preoperative scores across multiple PROMs increase likelihood of patient dissatisfaction postoperatively. 26 Our regression models revealed patients who are female, in the public system, and have higher BMIs were associated with poorer preoperative functional and mental scores, which may predispose them to postoperative dissatisfaction. Studies show that females are more likely opposed to surgery due to competing responsibilities, need more information before deciding, and are less likely to be sent to a specialist; therefore, females drop to a lower preoperative functional level than males. 27 These observations align with our findings, wherein females’ preoperative scores were lower than males. Alongside poorer functional and mental preoperative scores, females also tend to have higher rates of depression, lower back pain, and comorbidities than males. 28 In the TOR registry, there was a greater number of females undergoing surgery than males, consistent with national THA surgery data from NZ and other parts of the world. 4 In the USA, not only are more females having THA surgeries, 25 they are also more likely to be older, have higher BMIs, and need preoperative support than males. 29 In a Korean study of over 1.15 million females, reproductive factors (e.g., later onset of menarche, longer time breastfeeding, hormone replacement treatment, and oral contraceptive use) were associated with higher risk for severe osteoarthritis and need for THA. 30 Further investigation into sex differences in THA is required to improve patient care – particularly of females – and may need to be considered in future risk-stratification models, 29 which should include more detailed reproductive history notetaking. Sex disparity in referral for surgery also warrants consideration.
Consistent with our observations, Greene et al. 31 identified that patients using public funding pathways for THA report poorer preoperative and postoperative PROMs than those using private funding. Poorer public outcomes are often attributed to longer wait times; however, literature indicates the effect of increased waiting time on THA outcome is unclear or inconsistent, 32 and that sex, age, and certain modifiable risk factors (e.g., BMI, income level, and comorbidities) impact postoperative outcomes to a greater extent. 33 Higher BMI has also been identified as a risk factor for increased complications and poorer postoperative THA outcomes. 34 Taken together, our findings support existing literature that women, public patients, patients with higher BMI, older patients, and patients with comorbidities (i.e., higher ASA ratings) are at greater risk of poorer postoperative THA outcomes and require greater preoperative and postoperative support. Specifically, these patients may benefit from additional mental and functional prehabilitation and rehabilitation support. Recording comorbidities in more detail than relying on ASA ratings within registry data may also help discern more specifically those conditions that may increase risk of poor THA outcomes.
Although preoperative PROMs, ASA rating, and age were identified as significant predictors of postoperative PROMs, these factors only explained between 7 and 28% of the variance (Table 5). Hence, other factors not included in our analysis likely influence the variation in postoperative scores within our population. One such factor may be smoking status. Yue et al. 35 identified 4 studies where smoking negatively affected postoperative THA PROMs between 6 months and 1−year. Smokers are reported to exhibit lower postoperative Oxford 36 and Harris hip 37 scores and an increased use of painkillers and anti-inflammatories 36 than non-smokers and ex-smokers. In smokers, PROMs were more likely to stagnate initially, and decline over time. 38 It may be beneficial to record smoking status in future THA studies and registries.
Few THA patients in the TOR registry were Māori (4%), with a lower representation than the NZ joint registry at 7%. 4 Considering Māori make up 29% of the Bay of Plenty region, 17% of the NZ population, and 16% of Māori are >55 years, uptake of surgery by Māori is relatively poor. 39 Bourke et al. 40 identified competing responsibilities (prioritising work and family), disrupted mana 1 (feeling disempowered), and systemic abdication (inconsistent information/diagnosis) as contributing factors to healthcare accessibility for Māori, which may explain their under-representation as THA patients. Social deprivation and socio-economic inequities are potential sources of health inequity for Māori, 41 which can extend to THA. Low numbers of Māori and “Other” patients made it challenging to discern differences based on ethnicity, which is further complicated due to the diversity of “Other” ethnicities.
Limitations
Observational studies are limited in their ability to define causation; however, they can identify associations between risk factors and a response variable. Additional variables, such as smoking status, physical activity, analgesic and anti-inflammatory use, rehabilitation programmes, and specific comorbidities were not documented in the registry and therefore not analysed. Older age was identified as a predictor of poorer health scores. However, health scores also tend to deteriorate over time as age increases regardless of surgery and this was not noted most likely due to small sample sizes at 15 years. Two thirds (66%) of all surgeries within the TOR registry were privately funded, which is higher than the 2023 NZ rate of 49%. 4 Hence, generalisation of our findings is not ensured to regions where the proportion of publicly funded surgeries is greater. Further, PROMs recorded by patients are subjective, susceptible to bias, and dependent on accurate recall.
Conclusion
The cemented combination studied demonstrated excellent PROMs out to 15 years with the association of poorer preoperative PROMs in THAs that were publicly funded, in females and those with higher BMIs, and association of poorer preoperative PROMs, older age and ASA 3 rating variables to poorer postoperative PROMs. Patients with poorer preoperative PROMs may require additional preoperative support to improve their postoperative outcomes, encompassing both functional and mental strategies. These patients, as well as those who are older and present with higher ASA ratings, may require additional postoperative support. These associated variables, sex, ethnic disparity and accessibility to surgery need further research in NZ.
Supplemental Material
sj-pdf-1-hpi-10.1177_11207000251371132 – Supplemental material for 15-year patient-reported outcomes of a cemented flanged cup and stem combination in primary total hip arthroplasty: a New Zealand study
Supplemental material, sj-pdf-1-hpi-10.1177_11207000251371132 for 15-year patient-reported outcomes of a cemented flanged cup and stem combination in primary total hip arthroplasty: a New Zealand study by Amy Pearce, Chaitanya Joshi, Georgina Chan, Tony Lamberton, Simon MacLean, Andrew Vane and Kim Hébert-Losier in HIP International
Supplemental Material
sj-pdf-2-hpi-10.1177_11207000251371132 – Supplemental material for 15-year patient-reported outcomes of a cemented flanged cup and stem combination in primary total hip arthroplasty: a New Zealand study
Supplemental material, sj-pdf-2-hpi-10.1177_11207000251371132 for 15-year patient-reported outcomes of a cemented flanged cup and stem combination in primary total hip arthroplasty: a New Zealand study by Amy Pearce, Chaitanya Joshi, Georgina Chan, Tony Lamberton, Simon MacLean, Andrew Vane and Kim Hébert-Losier in HIP International
Supplemental Material
sj-pdf-3-hpi-10.1177_11207000251371132 – Supplemental material for 15-year patient-reported outcomes of a cemented flanged cup and stem combination in primary total hip arthroplasty: a New Zealand study
Supplemental material, sj-pdf-3-hpi-10.1177_11207000251371132 for 15-year patient-reported outcomes of a cemented flanged cup and stem combination in primary total hip arthroplasty: a New Zealand study by Amy Pearce, Chaitanya Joshi, Georgina Chan, Tony Lamberton, Simon MacLean, Andrew Vane and Kim Hébert-Losier in HIP International
Supplemental Material
sj-pdf-4-hpi-10.1177_11207000251371132 – Supplemental material for 15-year patient-reported outcomes of a cemented flanged cup and stem combination in primary total hip arthroplasty: a New Zealand study
Supplemental material, sj-pdf-4-hpi-10.1177_11207000251371132 for 15-year patient-reported outcomes of a cemented flanged cup and stem combination in primary total hip arthroplasty: a New Zealand study by Amy Pearce, Chaitanya Joshi, Georgina Chan, Tony Lamberton, Simon MacLean, Andrew Vane and Kim Hébert-Losier in HIP International
Footnotes
Acknowledgements
Maggie Dee Worthington for her dedication in continuously gathering, updating and maintaining the Tauranga Orthopaedic Research data base.
Tauranga Orthopaedic Research for allowing access to the data and funding for the study.
Waikato University for the PhD scholarship.
Declaration of conflicting interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: TL and AV have disclosed paid presentations for Stryker in 2023: TL has a Training & Education contract for Stryker.
All other authors declare that there is no conflict of interest
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Funding for this study was provided by the Tauranga Orthopaedic Research (TOR) and the University of Waikato.
Supplemental material
Supplemental material is available online.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
