Abstract
Background:
Interest in robotic-assisted total hip arthroplasty (RA-THA) continues to grow, despite inconsistent evidence on its effectiveness.
Purpose:
We sought to assess for the presence of spin bias in abstracts of systematic reviews and meta-analyses (SRMAs) comparing RA-THA to conventional total hip arthroplasty (C-THA).
Methods:
We conducted a systematic review of studies identified in a comprehensive search of MEDLINE, EMBASE, and the Cochrane Database of Systematic Reviews from each database’s inception to June 23, 2025. Search terms included “robotic,” “hip,” “arthroplasty,” “systematic review,” and “meta-analysis.” Inclusion criteria were (1) SRMAs evaluating outcomes of RA-THA, (2) studies comparing RA-THA to conventional techniques, and (3) publications in English. Studies were excluded if they (1) assessed RA-THA with other robotic-assisted joint arthroplasty procedure, or (2) assessed RA-THA surgery alongside other technologies. Abstracts were assessed for the 15 most severe forms of spin as defined by Yavchitz et al. Methodological quality was assessed using the AMSTAR 2 (A Measurement Tool to Assess Systematic Reviews-2) tool.
Results:
Twelve SRMAs were eligible for inclusion. Eleven SRMAs (91.7%) contained at least 1 of the 15 most severe forms of spin. Type 3 spin (selective reporting of efficacy outcomes favoring the intervention) was the most common (58%), followed by type 5 (conclusion claims the beneficial effect of the experimental intervention despite high risk of bias in primary studies; 50%), and type 11 (conclusion focuses selectively on statistically significant efficacy outcome; 33.3%). No differences in study characteristics were found between SRMAs with and without the aforementioned types of spin. All SRMAs were rated as “critically low” in methodological quality, with most not reporting funding source of included studies (91.7%).
Conclusion:
This systematic review found there is a high prevalence of spin bias in abstracts of SRMAs evaluating RA-THA. The most common type of spin involved selective reporting of outcomes favoring RA-THA (type 3). Future SRMAs can minimize spin by presenting balanced reporting of efficacy, or lack thereof, in their abstracts and strengthen methodological rigor by consistently reporting study funding sources.
Level of Evidence
Level IV, Systematic review of level-I to level-IV studies.
Plain Language Summary
Keywords
Introduction
Conventional total hip arthroplasty (C-THA) has long been the gold standard for treating end-stage hip osteoarthritis. 1 Recently, robotic-assisted total hip arthroplasty (RA-THA) has garnered interest as an innovation promising improvements in implant positioning and patient-reported outcome measures (PROMs). Nonetheless, evidence remains conflicted on the efficacy of RA-THA compared that of C-THA.2,3 There is ongoing debate on whether RA-THA results in significant improvements in PROMs, 3 clinically relevant optimization of implant positioning, 4 and/or reductions in complications (eg, dislocations). 2
Despite inconsistent evidence, public interest in robotic-assisted total joint arthroplasty (TJA) continues to grow. 5 When faced with fragmented and conflicting evidence, systematic reviews and meta-analyses (SRMAs) may offer clinicians an up-to-date consolidation and critical appraisal of the evidence. Nevertheless, SRMAs are prone to their own biases, most notable of which is “spin” bias, in which results are framed more favorably than the evidence suggests. As described by Yavchitz et al, 6 spin bias can be present in abstracts in various forms, broadly categorized as misleading reporting, misleading interpretation, and inappropriate extrapolation of results. For example, a common spin bias identified in orthopedic surgery SRMAs is the selective reporting or overemphasis of beneficial outcomes, which is known as “type 3 spin.”7,8 Spin bias can significantly influence readers’ interpretation of results, particularly in abstracts, which present the first impression of a review and may be the only accessible element of a manuscript. 9 Recent studies have identified spin bias in SRMAs evaluating joint interventions such as platelet-rich therapy for hip osteoarthritis and stemless total shoulder arthroplasty.10,11 However, none have investigated spin bias in the context of RA-THA SRMAs.
The primary aim of this study was to evaluate the presence of spin bias in abstracts of SRMAs comparing RA-THA to C-THA and to examine study characteristics that are associated with increased spin bias. 6 The secondary objective was to assess the methodological quality of the included studies using a validated quality assessment tool. 12 We hypothesized that, consistent with prior orthopedic literature, spin bias would be prevalent and may exaggerate the perceived benefits of RA-THA, potentially influencing findings in favor of robotics.
Methods
This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines (http://www.prisma-statement.org/). No institutional review board approval was required.
We conducted a comprehensive search of MEDLINE, EMBASE, and the Cochrane Database of Systematic Reviews databases from each databases’ inception to June 23, 2025. Search terms included “robotic,” “hip,” “arthroplasty,” “systematic review,” and “meta-analysis” (Supplemental Table 1).
The inclusion criteria were as follows: (1) SRMAs evaluating outcomes of RA-THA, (2) studies comparing RA-THA to conventional techniques, and (3) publications in English. SRMAs of all levels of evidence were included to achieve full comprehensiveness. Studies were excluded if they (1) assessed RA-THA with other robotic-assisted joint arthroplasty procedures, or (2) assessed RA-THA surgery alongside other technologies (eg navigation or computer assistance).
Two authors (A.M.A. and J.A.) screened the titles and abstracts of the retrieved studies in duplicate. Conflicts during title/abstract screening were advanced to the full-text review stage to avoid premature exclusion. Full-texts were then screened according to the aforementioned inclusion and exclusion criteria. Disagreements in full-text screening were resolved through consensus.
Two authors (A.M.A. and J.A.) independently extracted relevant study characteristics in duplicate using a premade template (Covidence, Veritas Health Innovation). Citation counts for each included SRMA were obtained from Web of Science on July 6, 2025. For each SRMA, all included primary studies were abstracted to identify the number of unique studies. Level of evidence was classified based on the guidelines by the Journal of Bone and Joint Surgery. 13 Discrepancies during data abstraction were resolved in consultation with a senior author (H.A.K.).
To assess the presence of spin bias, the full text of each included SRMA was reviewed and compared with the results and conclusions presented in its abstracts. Each abstract was then assessed for spin bias using the 15 common domains described by Yavchitz et al, 6 which are grouped into (1) misleading reporting, (2) misleading interpretation, and (3) inappropriate extrapolation. This assessment was performed independently and in duplicate by 2 reviewers (A.M.A. and J.A.) with prior training in assessment of spin bias. Conflicts were resolved through discussion with a senior author with graduate-level training in health research methodology (H.A.K.).
Quality assessment of the SRMAs included in our review was performed independently and in duplicate by 2 reviewers (A.M.A. and J.A.) using A Measurement Tool to Assess Systematic Reviews (AMSTAR-2). 12 This tool is composed of 16 items, 7 of which are considered “critical” domains. Overall confidence in study quality is determined based on the number of critical and noncritical domains and is categorized as “high,” “moderate,” “low,” or “critically low.” 12 Disagreements during the assessment were resolved in consultation with a senior author (H.A.K.).
Statistical Analysis
Study summary characteristics were presented as descriptive statistics, including medians and interquartile range (IQR) or weighted means and standard deviations (SD), as appropriate. The frequency of unique studies as well as their level of evidence was also tabulated. Frequencies of spin bias domains and fulfilled AMSTAR-2 items were presented as counts and percentages. The frequencies of both the 9 (Spin-9) and 15 (Spin-15) most severe types of spin bias were tabulated to allow for comparisons against prior studies.8,10 Given that prior studies lacked adequate power for multivariable regression, study characteristics were compared with and without the most common types of spin bias using the Mann–Whitney U test for nonparametric data and the independent samples t-test for parametric data.7,14 Statistical analyses were performed using SPSS software, with significance set at P < .05.
Results
A total of 12 studies were included in the final analysis15-26 (Figure 1). The median sample size was 2828 hips (IQR, 1297-4243). The mean number of co-authors was 6.1 ± 1.6 (Table 1). The median number of citations per SRMA was 17.5 (IQR, 6.3 to 49.8) with a mean citation density of 8.8 (SD, 5.5). Across all SRMAs, 68 unique primary studies were included (Supplemental Table 2); 35 (51.5%) were cited in more than 1 SRMA. A randomized controlled trial (RCT) by Lim et al 27 was the most frequently cited, appearing in 9 studies (75%); 6 other studies were included in 8 (66.7%) SRMAs.28-33

PRISMA. PRISMA, Preferred Reporting Items for Systematic Reviews and Meta-Analyses.
Study Characteristics.
Abbreviations: LOE, Level of evidence; MA, Meta-analysis; SR, Systematic review; Spin-9 and Spin-15 represent the presence of ≥1 of the 9 and 15 most common spin domains described by Yavchitz et al, respectively.
The median journal impact factor was 2.8 (IQR, 2.2 to 3.0). The United States and China were the most common countries of origin, each publishing 3 studies. Most SRMAs were of Level III evidence (n = 10, 83.3%); only 1 systematic review did not also include a meta-analysis (Table 2). Nine (75%) of the SRMAs included at least 1 RCT15-17,21-26; collectively, there were 10 unique RCTs included across all eligible SRMAs.27,31,32,34-40 Most SRMAs were not funded (n = 7, 58.3%). Objective outcomes (n = 3, 25%) and resource utilization (n = 3, 25%) both being the least reported outcomes. Four studies (33.3%) specified robotic-specific eligibility criteria.18-20,23
Frequencies of Study Characteristics.
Abbreviations: LOE, Level of evidence; MA, Meta-analysis; SR, Systematic review.
At least 1 of the 15 most severe forms of spin bias was present in the abstracts of 11 SRMAs (91.7%) (Table 1).15,16,18-26 The mean count of spin bias per study was 2 (SD, 1; range, 0-5). No differences in study characteristics were found across studies with 2 or less types of spin bias versus those with more than 2 (Supplemental Table 3). Ten studies (83.3%) presented with at least 1 form of the 9 most severe types of spin.15,16,18-25
Type 3 spin (selective reporting in favor of the intervention) was the most frequently occurring (n = 7, 58%), followed by type 5 (n = 6, 50%), and type 11 (n = 4, 33%) (Table 3). Misleading reporting and misleading interpretation spin categories were each present in 9 studies (n = 9, 75%). No study was found to have bias pertaining to inappropriate extrapolation. No significant differences in study characteristics were found between SRMA characteristics and the presence of the 3 most common types of spin bias (Type 3, 5, and 11; P > .05) (Supplemental Tables 4–6). While SRMAs with type 3 spin showed higher total citation counts (49.0 vs 13.0; P = .167), this difference became less pronounced when adjusting for publication year using citation density (9.6 vs 7.5; P = .575).
Spin Classification.
All included SRMAs were rated as “critically low” confidence according to AMSTAR-2. Common methodological limitations included failure to provide a list of excluded studies with reasons (n = 12, 100%), lack of funding reporting within included studies (item 10; n = 11, 91.7%), and poor meta-analysis rigor (items 11, 12, and 13; n = 10, 90.9%) (Table 4). All studies reported inclusion criteria with Patient/Population, Intervention, Comparison, and Outcome components (item 1; n = 12, 100%), and most disclosed potential conflicts of interest (item 16; n = 10, 83.3%) (Supplemental Table 4).
AMSTAR-2 Quality Assessment.
Abbreviations: PICO, Patient/Population, Intervention, Comparison, and Outcome; RoB, Risk of bias.
“-” denotes response not applicable to domain.
Denominator of studies adjusted to 11 as 1 study did not perform formal meta-analysis.
Discussion
This study is the first to systematically evaluate the presence of spin bias in the abstracts of SRMAs on RA-THA. Importantly, 91.7% of included studies (n = 11) contained at least 1 of the 15 most severe spin types. Type 3 spin was most common (n = 7, 58%), followed by type 5 (n = 6, 50%) and type 11 (n = 4, 33%). On average, SRMAs presented with 2 counts of spin bias (range, 0-5). There were no differences in journal impact factor, publication year, citation counts or density, number of co-authors, number of hips included as well as number of studies included between SRMAs with <2 types of spin bias versus ≥2. Similarly, no differences in study characteristics were identified across SRMAs with and without the 3 most common types of spin bias. All included SRMAs were rated as “critically low” in methodological quality based on the AMSTAR-2 assessment.
This study has its limitations. First, the small number of eligible SRMAs combined with the high rates of spin bias precluded the performance of a multivariable logistic regression due to limited sample size and event rates. Nonetheless, we compared baseline study characteristics across SRMAs with and without the most common types of spin bias, in effort to identify possible drivers of this phenomenon. Second, AMSTAR-2 scoring is stringent and may disproportionately penalize older reviews that were less familiar with the tool, as key reporting standards (such as protocol registration and heterogeneity discussion) were less consistently applied.22-24 Finally, heterogeneity in eligibility criteria and inconsistent labeling of robotic systems rendered it impossible to compare semi-active with fully active systems, which may differ in outcome profiles but highlights the spin bias present in many of these studies. Nonetheless, this review highlighted important methodological considerations in RA-THA literature. Further, it assessed the 15 most severe forms of spin per Yavchitz et al, rather than the traditional 9.6,8 Ultimately, the results of this study highlight the importance of transparency in reporting as the adoption and evaluation of robotic technology in arthroplasty continues to grow.
The high prevalence of spin bias in our study is consistent with prior SRMAs on novel orthopedic interventions.11,41 For example, Biedermann et al 11 reported spin bias in 83% of SRMAs on stemless shoulder arthroplasty, while Hwang et al 41 found a rate of 87% in SRMAs on anterior cruciate ligament repair. These findings highlight that abstract-level bias can be common when evaluating orthopedic interventions. Type 3 spin, defined as “selective reporting of or overemphasis on efficacy outcomes or analysis favoring the beneficial effect of the experimental intervention,” was the most frequent form observed (56%). This aligns with prior orthopedic studies7,8,42 that have reported rates between 16% to 56%. Concerningly, our study’s prevalence of type 5 spin (50%)—claims of beneficial effects of the experimental treatment despite high risk of bias in the primary studies—was higher than the majority of prior studies,7,8,14,42 which ranged from 10% to 38%. The comparatively higher rate of type 5 spin in RA-THA SRMAs may reflect the limited evidence base, with most relying on a narrow pool of Level III studies sharing similar methodological limitations. Moreover, only 10 unique RCTs were collectively included across all eligible SRMAs,27,31,32,34-40 with only 3 published in the last 5 years.37-39 These findings underscore the need for future RA-THA SRMAs to present conclusions with greater caution and qualification, while also highlighting the need for stronger evidence through prospective RCTs.
Our analysis did not identify any study characteristics significantly associated with the presence of common spin bias types. However, a notable trend with types 3 and 11 spin bias was the selective emphasis on radiographic outcomes that favor RA-THA, particularly the proportion of acetabular components positioned within the Lewinnek safe zones. More specifically, our review often identified type 3 spin bias in the context of authors concluding that RA-THA improved radiological outcomes relative C-THA, citing significant results in parameters such as the Lewinnek safe zone, while omitting nonsignificant results in arguably more clinically relevant measures like cup inclination, stem positioning, and hip stability.16,21,22,25 This is of particular importance considering that the clinical relevance of Lewinnek safe zones has come into recent question. 43 Given that most surgeons cite increased precision as the primary reason for adopting robotic-assisted technologies in arthroplasty, 44 we hypothesize that the frequent overemphasis on component positioning may reflect an attempt to further align with this rationale for adoption. There may also be additional positive externalities to the adoption of robotic-assisted technologies in TJA, particularly in the physically demanding THA, 45 that are not directly evaluated in studies to date, but may sway surgeon bias to favoring adoption of this technology. These include decreased surgeon strain 45 as well as improved intraoperative ergonomics. 46 Further, RA-THA may provide certain advantages in challenging patient populations, such as severely dysplastic hips, wherein the surgeon cannot rely on traditional intraoperative landmarks due to the patient’s abnormal anatomy. 47 As such, spin bias identified by our review is limited to outcomes that have been amalgamated by SRMAs to date, which may not capture seldom reported surgeon-centric outcomes or outcomes in unique patient cohorts that may particularly benefit from this technology.
Another important caveat to consider is the heterogeneity in the robotic platforms that were evaluated across studies. Not all robotic systems function in the same way, yet this is often unclear in abstracts and conclusions. Only a quarter of included SRMAs assessed a predefined robotic system,18-20 specifically, the semi-active MAKO system that primarily assists with acetabular component positioning. 19 Importantly, these studies avoided the third category of spin bias (inappropriate extrapolation) where authors might be tempted to generalize findings from MAKO to all robot systems. In contrast, first-generation, fully active systems such as ROBODOC target femoral stem preparation and positioning, and its results are reported across RA-THA literature.31,34 When SRMAs pool outcomes from both types of robots without clarifying these differences, particularly regarding stem positioning, conclusions about radiographic superiority can be misleading. Future RA-THA SRMAs should aim to minimize this heterogeneity and report system-specific analyses in their abstracts.
RA-THA SRMAs demonstrated critical methodological weaknesses, with all rated as “critically low” quality by AMSTAR-2 assessment. Unfortunately, critically low-quality SRMAs have commonly been identified across other orthopedic subspecialities.7,8,14 Of relevance is the lack of reporting on funding source, the second most common methodological limitation (n = 11, 91.6%). As demonstrated by Cavinatto et al, 48 the rates of industry-funded studies were significantly higher in robotic-assisted unicompartmental knee arthroplasty studies compared to manual technique (51 vs 29%, P < .01). Concerningly, studies on robotic-assisted TJA with relevant author disclosures have been found to more likely present favorable results, 49 with this trend corroborated across findings of health economic analyses of new technologies in TJA. 50 Given these trends, it’s critical that future SRMAs on RA-THA present the funding sources of included studies as well as relevant author disclosures.
In conclusion, there is a high prevalence of spin bias in abstracts of SRMAs evaluating RA-THA. The most common type of spin bias involved selective reporting of outcomes favoring RA-THA (type 3). Future SRMAs can minimize spin bias by presenting balanced reporting of both efficacy or lack thereof in their abstracts and strengthen methodological rigor by consistently reporting study funding sources.
Supplemental Material
sj-docx-2-hss-10.1177_15563316261430336 – Supplemental material for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review
Supplemental material, sj-docx-2-hss-10.1177_15563316261430336 for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review by Ayomide Michael Ade-Conde, Hassaan Abdel Khalik, James Abesteh, Wayne E. Moschetti and Olufemi R. Ayeni in HSS Journal®
Supplemental Material
sj-docx-3-hss-10.1177_15563316261430336 – Supplemental material for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review
Supplemental material, sj-docx-3-hss-10.1177_15563316261430336 for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review by Ayomide Michael Ade-Conde, Hassaan Abdel Khalik, James Abesteh, Wayne E. Moschetti and Olufemi R. Ayeni in HSS Journal®
Supplemental Material
sj-docx-4-hss-10.1177_15563316261430336 – Supplemental material for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review
Supplemental material, sj-docx-4-hss-10.1177_15563316261430336 for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review by Ayomide Michael Ade-Conde, Hassaan Abdel Khalik, James Abesteh, Wayne E. Moschetti and Olufemi R. Ayeni in HSS Journal®
Supplemental Material
sj-docx-5-hss-10.1177_15563316261430336 – Supplemental material for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review
Supplemental material, sj-docx-5-hss-10.1177_15563316261430336 for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review by Ayomide Michael Ade-Conde, Hassaan Abdel Khalik, James Abesteh, Wayne E. Moschetti and Olufemi R. Ayeni in HSS Journal®
Supplemental Material
sj-docx-6-hss-10.1177_15563316261430336 – Supplemental material for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review
Supplemental material, sj-docx-6-hss-10.1177_15563316261430336 for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review by Ayomide Michael Ade-Conde, Hassaan Abdel Khalik, James Abesteh, Wayne E. Moschetti and Olufemi R. Ayeni in HSS Journal®
Supplemental Material
sj-pdf-1-hss-10.1177_15563316261430336 – Supplemental material for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review
Supplemental material, sj-pdf-1-hss-10.1177_15563316261430336 for Evaluating Spin Bias in the Abstracts of Systematic Reviews and Meta-Analyses on Robotic-Assisted Total Hip Arthroplasty: A Systematic Review by Ayomide Michael Ade-Conde, Hassaan Abdel Khalik, James Abesteh, Wayne E. Moschetti and Olufemi R. Ayeni in HSS Journal®
Footnotes
Ethical Considerations
All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration.
Consent to Participate
Informed consent was not required for this systematic review.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of Conflicting Interests
The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Olufemi R. Ayeni, MD, PhD, FRCSC, declares relationships with Stryker Canada, the Canadian Orthopaedic Association, and Notch Academy. The other authors declared no potential conflicts of interest.
Required Author Forms
Disclosure forms provided by the authors are available with the online version of this article as supplemental material.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
