Abstract
Background:
Total ankle arthroplasty has emerged as a treatment to successfully treat ankle arthritis. Recent studies have reported more than 40 000 total ankle arthroplasties (TAAs) being performed between 2009 and 2019 in the United States. Although recent studies have reported favorable patient-reported outcomes at short- and midterm follow-up, there is a paucity of aggregate literature reporting on long-term patient-reported outcomes (PROs) after TAA. The purpose of this review is to report an aggregate of literature on minimum 10-year patient-reported outcomes after TAA.
Methods:
A systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines. PubMed, Scopus, and Cochrane Central Register of Controlled Trials (CENTRAL) were queried in June 2024. Primary research articles were included if they reported minimum 10-year PROs or satisfaction for patients who underwent primary TAA and were written in English. Survivorship was reported based on implant failure, which was determined uniquely by each study.
Results:
Eight studies met the inclusion criteria. A total of 595 ankles with a range of average ages from 51 to 73.7 years were included in the study with follow-up ranging from a minimum of 10 years to a minimum of 20 years. Six of the 8 studies reported average follow-up ranging from 11.9 to 15.8 years. Two of the 8 studies reported significant improvement in PROs following surgery. Survivorship at a minimum of 10-year follow-up ranged from 66% to 94.4%. Average time to implant failure ranged from 4.6 to 13.8 years.
Conclusion:
Patients undergoing primary TAA were reported to have generally improved PROs at minimum 10- year follow-up. However, they demonstrated variable rates of survivorship ranging from 66% to 94.4%. Of those experiencing implant failure, average time to failure ranged from 4.6 to 13.8 years. Survivorship should be interpreted with caution because of varying definitions between studies. Further studies should seek to standardize the definition of survivorship and reporting of PROs to allow for effective analysis of heterogeneity.
Introduction
Ankle arthritis is a debilitating condition that was previously solely surgically treated with ankle arthrodesis. 25 However, total ankle arthroplasty (TAA) has emerged as an alternative that leads to a greater range of motion after surgery while providing similar outcomes when compared to ankle arthrodesis.7,26,27 Recent trends have shown that the use of TAA has increased whereas the use of ankle fusion has decreased in the United States. 24 TAA’s growing popularity is evident with more than 40 000 TAAs performed in the United States from 2009 to 2019. 15
As a result, there has been an increasing amount of literature reporting on patient-reported outcomes (PROs) after TAA and implant survival rates.8,10,20 A review of the literature including more than 1000 ankles with minimum 5-year follow-up demonstrated that TAA has favorable outcomes. 22 This review found the mean difference between preoperative and postoperative American Orthopaedic Foot & Ankle Society (AOFAS) ankle-hindfoot score was 43.60. It also concluded that pooled prosthesis revision rates, excluding polyethylene exchanges, were 12.2% at minimum 5-year follow-up and 20.2% at minimum 10-year follow-up. 22 However, minimum 10-year data is scarce and further reviews are necessary to determine whether there is durability between midterm and long-term outcomes after TAA. The purpose of this review is to report an aggregate of literature on minimum 10-year patient-reported outcomes after TAA. The authors hypothesize that patients who did not undergo revision surgery would experience favorable outcomes and that there would be a moderate rate (20%) of implant failures at long-term follow-up.
Methods
Study Selection
A systematic review was conducted in PubMed, Scopus, and Cochrane Central Register of Controlled Trials (CENTRAL) in June 2024 following the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines. A medical librarian was consulted in developing the search terms and extracted articles from the three databases. The search strategy is detailed in Appendix Table 1. The review was registered in Prospero under id: CRD42023393629. Articles were included if they were primary research articles reporting PROs or satisfaction after primary TAA with minimum 10-year follow-up in English. Articles were excluded if they were animal studies, biomechanical studies, case reports, opinion articles, review articles, technique articles, or did not report postoperative outcomes. Articles underwent title and abstract screening and full-text review by 2 independent reviewers (M.S.L.) and (L.M.). Disagreements were settled by rereview and discussion until reviewers were in unanimous agreement.
Quality Assessment
All studies were graded for quality using the Methodological Index of Non-Randomized Studies (MINORS). 28 Two independent reviewers (M.S.L.) and (L.M.) assessed each article, and disagreements were settled by regrading and discussing the scoring criteria until reviewers were in agreement. Articles that did not report level of evidence were assigned levels of evidence based on the standards previously described by Hohmann et al. 12
Data Extraction and Analysis
Patient-reported outcomes and endpoint and nonendpoint surgery rates were extracted. Additionally, patient demographics, functional outcomes, radiographic findings, and surgical procedures were extracted, if available. Average time to implant failure was calculated by summing up time to endpoint revision surgeries for all patients and dividing by the number of people who had implant failure if studies did not explicitly report average time to implant failure. Average time to specific non-endpoint secondary procedures were calculated using the same method. Forest plots were created for PROs with preoperative and postoperative values for 3 or more studies using Cochrane’s Review Manager program (RevMan Version 5.4; The Nordic Cochrane Center, The Cochrane Collaboration, 2020). The latest postoperative outcome was used when multiple time points were reported over 10-year follow-up. Heterogeneity was assessed with I2 using the Cochrane Handbook cutoffs. The I2 range of 50%-90% “may represent substantial heterogeneity.” 11 Data were not pooled because of low levels of evidence. 5 Statistical significance was defined as P <.05. Survivorship was defined as nonimplant failure and assessed for each cohort using the definitions provided in their respective studies. Time to implant failure and follow-up time were converted to years by dividing the number of months by 12.
Results
The initial query on PubMed, CENTRAL, and Scopus resulted in 3633 articles. There were 2470 articles remaining after duplicates were removed. Title and abstract review of the remaining articles for relevance yielded 42 articles for full-text review. The full text of these articles was reviewed, and 8 of these articles met inclusion criteria and were included in the study.1 -4,6,14,18,23 The article selection process is shown in Figure 1. Seven of the studies included in the systematic review were case series representing Level IV evidence.2 -4,6,14,18,23 One study was a retrospective cohort study representing Level III evidence. 1

Prisma Flowchart for Article Selection.
Demographics
Descriptive article information including study period, number of ankles, sex, average follow-up time, and average age at time of surgery were recorded (Table 1). The 8 studies in this review had study periods ranging from 1984 1 to 2009. 2 This review included a total of 595 ankles, of which 235 ankles had PRO follow-up of at least 10 years. Average age of patients ranged from 51 years 1 to 73.7 years 23 with follow-up ranging from a minimum of 10 years to a minimum of 20 years. Six out of 8 studies reported average follow-up ranging from 11.9 to 15.8 years.
Demographics.
Abbreviations: F, female; LCS, low contact stress; LOE, level of evidence; M, male; NR, not reported; PRO, patient-reported outcome score; STAR, Scandinavian Total Ankle Replacement; TAR, total ankle replacement.
Data are reported as mean ± SD (range) or {95% CI} and median [interquartile range]; underline denotes median.
Range of Motion and PROs
Three of the studies recorded pre- and postoperative ankle range of motion (ROM).2,6,14 These 3 studies all found improvement in dorsiflexion from pre- to postoperative measurements, and one of the studies found a significant improvement in total ROM following surgery. 2 Two studies3,23 reported only postoperative ankle ROM. Six of the 8 studies reported radiographic measurements including alpha, beta, and gamma angles.1 -3,6,14,23 Radiographic measurements and ankle range of motion results were recorded (Appendix Table 2).
All 8 studies reported PROs, with the most common PRO reported being the AOFAS ankle-hindfoot scale, which was utilized in 6 studies.2 -4,14,18,23 The average preoperative AOFAS score ranged from 25 points 3 to 39.6 points 23 of 100. The average postoperative AOFAS score ranged from 61 points 4 to 80.4 points 18 of 100. Of the 6 studies that recorded AOFAS scores, 2 of the studies2,3 reported statistically significant improvements after surgery. The remaining 4 studies4,14,18,23 demonstrated improvements in AOFAS scores postoperatively but without statistical significance. Improvement was calculated as the difference between average postoperative and average preoperative outcome scores. The difference between preoperative and postoperative scores ranged from 32 to 53.9. Additionally, PRO for pain was recorded using the visual analog scale (VAS) in 3 of the studies.2,3,14 Only 2 of these studies2,14 recorded both pre- and postoperative VAS scores, with 1 of the studies demonstrating significant improvement. 2 Exploring heterogeneity for AOFAS score yielded an I2 of 67%, which indicates there could be substantial heterogeneity between the studies reporting AOFAS scores. Additional PRO measures used included the Foot Function Index (FFI), 2 which demonstrated significant improvement following surgery. PROs are recorded in Table 2. Forest plot showing the pre- and postoperative AOFAS is shown in Figure 2.
Patient-Reported Outcomes and Clinical Benefit. a
Abbreviations: AOFAS, American Orthopaedic Foot & Ankle Society; AOS, Ankle Osteoarthritis Scale; FFI, Foot Function Index; NR, not reported; PROs, patient-reported outcome scores; VAS, visual analog scale.
Data are reported as median or average ± SD (range) or {95% CI} or [interquartile range]; underline denotes median.

Forest Plot for American Orthopaedic Foot & Ankle Society (AOFAS) preoperative and postoperative scores.
Revision surgeries and survivorship
Endpoint surgeries and reasons for endpoint revision (Table 3) and survivorship (Table 4) were recorded. The average time to implant failure of the studies ranged from 4.6 years 1 to 13.8 years. 14 Implant failures were most commonly treated with component revision or ankle arthrodesis. Aseptic loosening was the most common reason for implant failure in 3 studies2,4,18 and tied for the most common reason in 2 other studies.6,23 The average survivorship at a minimum of 10 years ranged from 66% 1 to 94.4%. 14
Deaths, Average Time to Implant Failure, Endpoint Revision Procedures, and Reasons for Revision. a
Abbreviation: TAA, total ankle arthroplasty.
Data are reported as n (%) or mean ± SD (range) unless otherwise indicated.
Four patients had revision surgeries at unknown times and were not factored into average time to implant failure.
Unrelated to ankle arthroplasty.
Definitions of Survivorship (Nonimplant Failure) and Survivorship Rates.
At minimum 20 years.
Five studies reported conducting polyethylene exchanges.1,3,6,14,23 One study reported that 17 ankles (22%) underwent open arthrolysis and percutaneous lengthening of the Achilles tendon. 3 Non-endpoint secondary surgeries are listed in Appendix Table 3.
Discussion
The main findings of this review were that (1) patients undergoing TAA with minimum 10-year follow-up showed improved patient-reported outcomes and (2) there are several definitions of survivorship, with studies reporting rates between 66% and 94.4%. Overall, there were a total of 595 ankles, of which 235 ankles had PRO follow-up of at least 10 years.
All 8 studies reported improvement between preoperative and postoperative outcomes after TAA. Improvement in AOFAS scores at long-term follow-up are consistent with the mean improvement after TAA at midterm follow-up of 43.6 (95% CI, 37.51-49.69) reported by Onggo et al. 22 This may show that outcomes from midterm to long-term follow-up after TAA are durable with minimal degradation over time. However, many studies may have omitted PRO data because of endpoint surgeries which could have influenced outcomes. One study reported Kofoed scores with a median improvement of 38 points after surgery and median postoperative score of 89 (interquartile range, 81-94). 6 Previous studies have defined that 89 would be an excellent outcome and the lower bound of the interquartile range of 81 would be a good outcome. 16 These results show that there may be sustained improvement in long-term outcomes after TAA.
It is important to note the high level of heterogeneity in the AOFAS scores. Heterogeneity measures the consistency between preoperative and postoperative AOFAS scores across studies and quantifies how much of the differences between the studies may be due to random chance. It can help determine if one study’s results significantly vary from the expected results. There was an I2 value of 67%, which indicates there could be “substantial heterogeneity.” 11 This may be due to the small sample size of included studies with varying standardized mean difference effect sizes ranging from 2.98 to 4.28. The 4.28 effective size of the Kraal et al 18 study may have increased the heterogeneity due to 93 ankles having preoperative PROs and 17 ankles having postoperative PROs. Although the risk of heterogeneity influencing the current results is high, it is important to note that all 3 studies demonstrated positive standardized mean differences in favor of TAA.2,3,18 This scarcity of studies undergoing I2 analysis are a limitation of the current literature and should not invalidate the improvement TAA can offer patients with arthritis. Future studies should report standardized PROs at preoperative and minimum 10-year follow-up with SD to allow for qualitative analysis of the heterogeneity of the data to validate the findings of the current review.
The survivorship rates of the study should be evaluated with caution as there were multiple definitions of what constituted survivorship. The highest rate of 10-year survival by Jastifer and Coughlin 14 was 94.4% and defined by retaining the original implant. However, it is important to note that 6 of the 18 patients in that study had secondary surgeries to maintain the original implant that were not defined as failure. Five of these patients had polyethylene exchanges. The high rate of implant survival may be due to the exclusion of polyethylene exchanges as 2 other studies that reported survivorship rates of 82.8% 4 and 78%, 6 respectively, included polyethylene exchanges in their survivorship rates. When excluding polyethylene exchanges and cyst fillings, the 10-year survival rate increased from 78% to 90%. 6 The lowest rate of survivorship was reported at 66% and defined as a revision to the tibial, talar, or both components or a conversion to arthrodesis. 2 Furthermore, this cohort had an extremely high satisfaction rate despite the high amount of failures as only 1 of the 34 patients was dissatisfied. 2 The survivorship rate here is lower than previous rates of 97% 21 reported at midterm follow-up and could possibly be due to the high loss of follow-up as 28 patients were deceased (8.7%) or were unable to be contacted (26.2%). 2 The variable definitions of what constituted implant survival should be considered when evaluating literature concerning the survivorship rates after TAA.
Further original research studies and systematic reviews should seek to standardize the definition of survivorship and implant failure to allow for more in-depth analyses. A universal definition of what constitutes survival could eventually lead to pooled data for systematic reviews. This would allow for accurate estimations of the likelihood of avoiding revision surgery after TAA and estimated average time to implant failure. Moreover, future reviews should seek to evaluate the long-term efficacy of TAA in comparison to arthrodesis. Multiple reviews have compared the 2 techniques at shorter follow-up; however, it is necessary to understand the long-term outcomes of both procedures.9,17 Further reviews should evaluate higher-quality evidence with randomization as all studies in the current review were case series with Level IV evidence or retrospective cohort studies with Level III evidence, which prevented data from being pooled.
This article has several strengths. First, the review evaluates patients with minimum 10-year outcomes, which can help assess the durability of outcomes after TAA. These results can build on previous reviews and help orthopaedic surgeons manage patient expectations on the longevity of their TAA. Second, the review uses forest plots to evaluate whether heterogeneity may influence PROs after TAA. This can help determine whether differences from studies may be due to random chance or the methodology and results of a specific study. In the current review, the high heterogeneity led to further analysis of effect sizes and found that lack of postoperative scores may have led to the high effect size. Third, the review provides a comprehensive list of secondary procedures after TAA. This list provides context on possible complications and further procedures patients may experience after TAA excluding secondary TAA or arthrodesis.
This study has limitations that must be acknowledged. All included studies were case series, which introduces considerable heterogeneity and bias into the study. Many patients had additional interventions, before, during, or after the implantation of the prosthesis, which may confound outcomes. Moreover, surgical technique has evolved over the years, and the outcomes of this review may not reflect the efficacy of modern total ankle arthroplasty. Certain implants included in the study may not be used currently. During the study period, the most commonly recorded assessments of the patients were the AOFAS scores. The AOFAS scoring systems are not purely patient-reported as they have aspects of the score completed by the surgeon. Although they have been previously defined as a PRO by multiple studies13,19 and used in ankle osteoarthritis, their utility is suspect and is not equivalent to validated scoring systems. Additionally, the forest plot has a high heterogeneity of I2 = 95%, which must have influenced the outcomes of the study. Also, no pooling of outcomes was performed because of low levels of evidence. Moreover, no subanalysis comparing TAA to arthrodesis was performed because of the novelty of 10-year follow-up and lack of comparative studies in the literature. The sample size of the review is modest, and further studies are needed to validate the results. Readers should cautiously interpret survivorship rates because survivorship definitions varied between studies. Finally, some patients were considered endpoints because of revision surgery or death and were not included in the postoperative patient-reported outcomes, which may have influenced the reported outcomes.
Conclusion
Patients undergoing primary TAA were reported to have generally improved outcomes at minimum 10-year follow-up. However, they demonstrated variable rates of survivorship ranging from 66% to 94.4%. Of those experiencing implant failure, average time to failure ranged from 4.6 to 13.8 years. Survivorship should be interpreted with caution because of the varying definitions between studies. Further studies should seek to standardize the definition of survivorship and reporting of PROs to allow for effective analysis of heterogeneity.
Supplemental Material
sj-pdf-1-fao-10.1177_24730114241294073 – Supplemental material for Long-term Outcomes After Total Ankle Arthroplasty: A Systematic Review
Supplemental material, sj-pdf-1-fao-10.1177_24730114241294073 for Long-term Outcomes After Total Ankle Arthroplasty: A Systematic Review by Michael S. Lee, Lucas Mathson, Clark Andrews, Dylan Wiese, Jessica M. Fritz, Andrew E. Jimenez and Brian Law in Foot & Ankle Orthopaedics
Footnotes
Appendix
Nonendpoint Secondary Procedures.
| Author and Year | Other Secondary Procedures, n (% Total) | Average Time to Secondary Procedures | |
|---|---|---|---|
| Bianchi et al 2 2021 | Debridement of bony impingement | 7 (12.9) | |
| Hardware removal | 6 (11.1) | ||
| Medial malleolus osteotomy | 6 (11.1) | ||
| Subtalar arthrodesis | 5 (9.2) | ||
| Achilles tendon lengthening | 2 (3.7) | ||
| Medial malleolus osteosynthesis | 1 (1.8) | ||
| Midtarsal osteotomy | 1 (1.8) | ||
| Brunner et al 3 2013 | Polyethylene insert exchange | 9 | 6.7 y (0.9-10.4) |
| Medial displacement calcaneal osteotomy | 2 | 3.2 y | |
| Z-shaped calcaneal osteotomy | 1 | 10.1 y | |
| Supramalleolar osteotomy of the tibia | 3 | 1.4 y | |
| Shortening osteotomy of the fibula | 4 | NR | |
| Medial ankle ligament reconstruction | 1 | 2 y | |
| Peroneal tendon transfer | 1 | 10.1 y | |
| Open arthrolysis and percutaneous lengthening of Achilles tendon | 17 | 3.3 y (0.7-6.3) | |
| Open cyst debridement and filling with autologous cancellous bone | 1 | 5.7 y | |
| Clough et al 4 2019 | NR | NR | NR |
| Frigg et al 6 2017 | Subtalar arthrodesis | 6 | |
| Talonavicular arthrodesis | 2 | ||
| Debridement of ankle joint | 24 | ||
| Lateral ligament repair | 1 | ||
| Posterior tibial tendon adhesiolysis | 1 | ||
| Dwyer osteotomy | 2 | ||
| Gastroc-lengthening | 2 | ||
| Polyethylene exchange | 24 | ||
| Jastifer and Coughlin 14 2015 | Ankle debridement, poly exchange, and tendon repair | 1 | 1 y 2 mo |
| Ankle debridement and poly exchange | 3 | 7 y 5 mo | |
| Triple arthrodesis | 1 | 12 y 6 mo | |
| Ankle debridement, poly exchange, and gastrorecession | 1 | 9 y 2 mo | |
| Kraal et al 18 2013 | NR | NR | NR |
| Palanca et al 23 2018 | Removal of bone spurs and TAL | 1 | 11.8 y |
| Bone grafting to the tibia and talus and polyethylene exchange | 1 | 10.8 y | |
| Subtalar fusion | 1 | 13.7 y | |
| Polyethylene replacement and bone grafting of the tibia | 2 | 10.4 y | |
| Calcaneal osteotomy | 2 | 9.4 y | |
| Bone grafting of the tibia, routine polyethylene exchange | 1 | 11.6 y | |
| Removal of the medial malleolus exostosis | 1 | 12.7 y | |
| ORIF | 1 | 44 d | |
| Bone grafting of the tibia and talus with routine polyethylene exchange and TAL | 1 | 11.6 y | |
| Pending polyethylene replacement | 1 | 15.6 y | |
| Bedard et al 1 2021 | NR | ||
Abbreviations: ORIF, open reduction internal fixation; TAL, tendo-Achilles lengthening.
Data are presented as n (range or %).
Acknowledgements
Alexandria Brackett, MA, MLIS, for formulating the search strategy and pulling references for screening.
Ethical Approval
This study was performed in accordance with the ethical standards in the 1964 Declaration of Helsinki.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article. Disclosure forms for all authors are available online.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
