Abstract
Background:
Healthcare providers need reliable evidence for supporting the adoption of new interventions, of which the source of evidence often originates from systematic reviews (SRs). However, little assessment on the rigor of SRs related to osteoarthritis interventions has been conducted. This cross-sectional study aimed to evaluate the methodological quality and predictors among SRs on osteoarthritis interventions.
Methods:
Four electronic databases (Cochrane Database of Systematic Reviews, MEDLINE, Embase, and PsycINFO) were searched, from 1 January 2008 to 10 October 2019. An SR was eligible if it focused on osteoarthritis interventions, and we performed at least one meta-analysis. Methodological quality was assessed using the validated AMSTAR 2 instrument. Multivariate regression analyses were conducted to assess predictors of methodological quality.
Results:
In total, 167 SRs were included. The most SRs were non-Cochrane reviews (88.6%), and 54.5% investigated non-pharmacological interventions. Only seven (4.2%) had high methodological quality. Respectively, eight (4.8%), 25 (15.0%), and 127 (76.0%) SRs had moderate, low, and critically low quality. Main methodological weaknesses were as follows: only 16.8% registered protocol a priori, 4.2% searched literature comprehensively, 25.7% included lists of excluded studies with justifications, and 30.5% assessed risk of bias appropriately by considering allocation concealment, blinding of patients and assessors, random sequence generation and selective reported outcomes. Cochrane reviews [adjusted odds ratio (AOR) 251.5, 95% confidence interval (CI) 35.5–1782.6], being updates of previous SRs (AOR 3.9, 95% CI 1.1–13.7), and SRs published after 2017 (AOR 7.7, 95% CI 2.8–21.5) were positively related to higher methodological quality.
Conclusion:
Despite signs of improvement in recent years, most of the SRs on osteoarthritis interventions have critically low methodological quality, especially among non-Cochrane reviews. Future SRs should be improved by conducting comprehensive literature search, justifying excluded studies, publishing a protocol, and assessing the risk of bias of included studies appropriately.
Keywords
Introduction
Osteoarthritis is a common disease and affects approximately 240 million people, as estimated from global disease burden studies. 1 It is one of the top 10 disabling diseases among developed countries. 2 Knee and hip osteoarthritis, the two most common types of osteoarthritis, account for 2.4% of all years lived with disability (YLDs). 1 As a result, osteoarthritis has been consistently ranked on the lists of leading contributors to global YLDs. 1 Age and obesity are two important risk factors for the development of osteoarthritis,3–6 which suggests that aging and the increasing obesity population will further contribute to the rapid increase of osteoarthritis prevalence. This will increase the burden of osteoarthritis and lead to a significant challenge to public health and health care system. 7
Pain and function impairment are two major symptoms of osteoarthritis. 7 Relieving pain and improving function restriction are the key goals for initiating intervention strategies. 7 Various strategies are available for the management of osteoarthritis. These include non-pharmacological options such as education, exercise, and weight control, 8 as well as pharmacological interventions. 7 Surgery is often reserved for end-stage osteoarthritis. 9
In evidence-based practice, systematic reviews (SRs) are considered as the highest level of evidence for supporting or refuting the efficacy or effectiveness of an osteoarthritis intervention. 10 The trustworthiness of a SR is dependent on its methodological quality. To ensure quality decision making, it is worthwhile to popularize rigorously conducted SRs, which might in turn lead evidence users to identify SRs being appraised as high methodological quality, or to assess the methodological quality solely prior to applying the SR results. The methodological quality of an SR can be appraised by the A MeaSurement Tool to Assess systematic Reviews 2 (AMSTAR 2) tool. 11 As a validated and reliable tool, 11 AMSTAR 2 has been applied to assess the methodological quality of SRs on robotic surgery, 12 overweight and obesity interventions, 13 mental disorders, 14 acute dental pain, 15 and polycystic ovarian syndrome. 16 These appraisals showed that the majority of SRs have poor rigor, with very few attaining high overall methodological quality.12–16 However, it is unclear whether such observations would be applicable to SRs on osteoarthritis interventions.
As the methodological quality of SRs on osteoarthritis interventions has not been assessed using AMSTAR 2, we conducted a cross-sectional study to (i) describe the bibliographical characteristics of an up-to-date sample of SRs on osteoarthritis interventions, (ii) evaluate the methodological quality of SRs with AMSTAR 2, and (iii) explore the potential predictors associated with the methodological quality. This study will help identify the potential factors associated with SRs of low methodological quality or having poor adherence to AMSTAR 2 items, so that targeted recommendations can be developed to help improve the quality of future SRs.
Methods
This is a cross-sectional study. Since all the assessment and analyses were performed based on the data retrieved from existing systematic reviews, it was not necessary to obtain ethics approval and informed consent.
Eligibility criteria
The Cochrane Library defines an SR as an “attempt to identify, appraise, and synthesize all the empirical evidence that meets prespecified eligibility criteria to answer a specific research question.” 17 Based on this definition, we judged a publication as an SR if it answered a research question by searching at least two electronic databases. 18 To be included in this cross-sectional study, SRs published from 1 January 2008 to 10 October 2019 that included only randomized controlled trials (RCTs) and applied at least one pairwise meta-analysis to pool the effect of any interventions for osteoarthritis were eligible. We focused on more recent SRs, as older reviews are less likely to be influential in clinical decision making. SRs required synthesized RCTs which employed established osteoarthritis diagnostic criteria for recruitment, based on the NICE clinical guidelines from the UK National Institute for Health and Care Excellence. 19 Any types of interventions, pharmacological and non-pharmacological, were eligible. There were no restrictions on the types of outcomes reported. For reviews that had one or more updates, the latest updated version was included. Overviews of SRs, SRs without meta-analysis, network meta-analyses, SRs focused on the diagnosis or risk factors of osteoarthritis, protocols, SRs published in languages other than English and Chinese, or SRs published in an abbreviated form such as letters for editors or conference abstracts were excluded.
Literature search
We searched four international databases (the Cochrane Database of Systematic Reviews, MEDLINE, Embase, and PsycINFO) to identify potentially eligible SRs from 1 January 2008 to 10 October 2019 (Supplemental Material Appendix 1 online). Specialized search filters for SR20–22 were used to identify non-Cochrane reviews via the OVID platform for the last three databases.
Literature selection, data extraction, and methodological quality assessment
The titles and abstracts of retrieved citations were screened, and full texts of potentially eligible publications were further assessed. Bibliographical characteristics of included SRs (e.g. publication year, number of review authors, number of included primary studies, etc.) were then extracted using a piloted data extraction form (Supplemental Appendix 2). The processes of SR selection and data extraction were completed by two researchers independently (LZ and HW), with disagreements being settled by consensus, or by consulting a senior researcher (VC) when disagreements persisted.
The methodological quality of the included SRs was assessed with AMSTAR 2, which included 16 items with seven being critical methodological items (items 2, 4, 7, 9, 11, 13, 15). 11 The outcome variables of items 1, 3, 5–6, and 10–16 were categorized as “yes” or “no” while outcome variables of items 2, 4, and 7–9 were categorized as “yes”, “partial yes”, or “no.” 11 Assessments were based on information reported in the full text of the SRs. 11 Each SR was classified into four levels (high, moderate, low, or critically low) of overall methodological quality, based on their performance on the 16 individual appraisal items. 11 The process of methodological quality assessment was also completed by two reviewers independently (LZ and HW), with disagreements being settled by consensus, or by consulting a senior researcher (CW) when disagreements persisted. Detailed operational guidelines for AMSTAR 2 are available online. 11
Data analysis
Results on methodological quality assessments and bibliographical characteristics were reported as frequencies with percentages or medians with ranges, as appropriate. Differences in overall methodological quality among SRs with different bibliographical characteristics were assessed with Kruskal–Wallis rank tests, with p < 0.05 indicating statistically significant differences.
Potential predictors associated with methodological quality of SRs were explored with multi-ordinal regression analyses. In these analyses, seven potential predictors (Cochrane review, pharmacological interventions, updated review, publication year, Journal Impact Factor (JIF) in the year before SR publication, number of authors, and continent of the corresponding author’s affiliation) were used as independent variables, while the overall methodological quality of the SRs was considered as a dependent variable. The Pearson test and Deviance test were used for assessing model fitting, with p > 0.1 indicating adequate fit.
Potential predictors related to the adherence of each AMSTAR 2 item were explored with either binary logistic regression (items 1, 3, 5–6, and 10–16) or multinomial logistic regression (items 2, 4, and 7–9). Model fittings of binary logistic regression and multinomial logistic regression were assessed with Hosmer–Lemeshow tests (with p > 0.1 indicating adequate fit) and likelihood ratio tests (with p < 0.5 indicating adequate fit), respectively. Associations between the predictors and AMSTAR 2 appraisal results were quantified with adjusted odds ratios (AORs), with p < 0.05 considered statistically significant. The Statistical Package for Social Sciences version 25 (IBM Corp., Armonk, NY, USA) was applied for conducting the analyses.
Results
SR search and selection
A total of 2309 records were retrieved through a database search; 557 duplicates were found and excluded; and a further 1285 citations were excluded based on title and abstract screening. Another 300 records were excluded after full-text assessments (Supplemental Appendix 3). Finally, 167 SRs met the eligibility criteria and were included in the final sample (Figure 1 and Supplemental Appendix 4).

Preferred Reporting Items for Systematic Reviews and Meta-analysis flowchart of the literature selection of systematic reviews on osteoarthritis interventions.
Bibliographical characteristics
Details of bibliographical characteristics are reported in Table 1. The 167 SRs synthesized a total of 2142 RCTs, which recruited 326,273 participants with osteoarthritis. They were published between 2008 and 2019, with a median publication year of 2016. Most of the SRs (88.6%) were non-Cochrane reviews. Only 14 (8.4%) were an updated version of previous SRs. Over half of the SRs (54.5%) focused on non-pharmacological interventions. Most of the pharmacological intervention SRs (85.5%) reported harms, while only 58.2% of SRs on non-pharmacological interventions did so. Around half of the SRs (50.9%) were conducted by corresponding authors from Asia, followed by those from Europe (24.6%), America (15.6%), and Oceania (9.0%).
Bibliographical characteristics of 167 included systematic reviews on osteoarthritis interventions.
Values are n (%), or median (range).
The percentages were calculated by using the total number of the categories as the denominator.
PRISMA, the Preferred Reporting Items for Systematic Reviews and Meta-analysis; SR, systematic review.
Twenty-one SRs (12.6%) did not report any details on literature search. Only 82 (49.1%) searched non-English databases, and 59 (35.3%) considered articles published only in English. Fourteen SRs (8.4%) did not report whether language restrictions were applied during literature selection. Over half of the SRs (61.7%) used the Cochrane Risk of Bias Tool for critical appraisal, followed by the Jadad scale (14.4%), and the Pedro Scale (9.0%).
Methodological quality
Overall methodological quality and associated factors
Only seven SRs (4.2%) had high methodological quality,23–29 of which all were Cochrane reviews. Eight (4.8%) had moderate quality, 25 (15.0%) had low quality, while the remaining 127 (76.0%) had critically low quality. Among the remaining 12 Cochrane reviews, three (1.8%) were of moderate quality, seven (4.2%) were of low quality, and two (1.2%) were of critically low quality. For non-Cochrane reviews, 125 (84.5%) were judged as having critically low quality. Studies that were a Cochrane review (p < 0.001), an update of a previous SR (p < 0.001), and published recently (p = 0.002) appeared to have higher methodological quality. Meanwhile, the use of the Jadad scale (p < 0.001) and lack of search strategies (p < 0.001) were associated with critically low quality. Details can be found in Table 2. Detailed assessment results of AMSTAR 2 individual and overall items of the 167 included SRs can be found in Supplemental Appendix 5.
Methodological quality of the 167 systematic reviews on osteoarthritis interventions.
Other approaches including other scales (e.g. Pedro Scale) and checklists (e.g. Delphi list).
Results from multi-ordinal logistic regression analyses indicated that the following factors were associated with higher overall methodological quality (Table 3): a Cochrane review (AOR 251.5, 95% confidence interval (CI) 35.5–1782.6), an updated SR (AOR 3.9, 95% CI 1.1–13.7), and published after 2017 (AOR 7.7, 95% CI 2.8–21.5).
Association between characteristics of systematic reviews on osteoarthritis interventions and methodological quality: multi-ordinal logistic regression analyses.
The p values of the Pearson Chi-Square and Deviance were >0.1, indicating good model fit for the multi-ordinal logistic regression analysis.
Non-Cochrane review was used as the reference.
Original systematic review was used as the reference.
Year of publication was divided into two groups (after 2017 and 2008 to 2017); the 2008 to 2017 subgroup was used as the reference.
Non-pharmacological intervention was used as the reference.
Corresponding author from Europe was used as the reference.
AOR, adjusted odds ratio; CI, confidence interval.
Adherence to each AMSTAR 2 item and associated factors
The assessment results indicated that most sampled SRs performed poorly on critical methodological items (Table 4). Only 30.5% assessed risk of bias appropriately in individual RCTs from (i) unconcealed allocation, (ii) lack of blinding of patients and assessors when assessing outcomes, (iii) allocation sequence that was not truly random and (iv) selective reported outcome from among multiple measurements; 4.2% had conducted a comprehensive literature search, 74.3% did not include lists of excluded studies with justifications, and 58.7% did not register protocol a priori or investigate publication bias. Two relatively better performing items were appropriate application of meta-analysis (89.8%) and accounting for risk of bias when interpreting results (73.1%).
Appraisal results on each AMSTAR 2 item among the 167 systematic reviews on osteoarthritis interventions.
Items were regarded as critical methodological items.
AMSTAR, Assessing the Methodological Quality of Systematic Reviews; N/A, not applicable; PICO, patients, intervention, comparison, and outcomes; RoB, risk of bias.
Performance on the following non-critical items was satisfactory. The majority of the sampled SRs included Problem/Patient/Population, Intervention/Indicator, Comparison, and Outcome (PICO) components in the research question (96.4%); performed duplicate data extraction (88.6%); literature selection (73.1%); discussed the observed heterogeneity (82.0%); and reported conflicts of interest (82.0%).
Binary logistic regression analyses showed that SRs published in journals with a higher JIF had better performance in presenting the research question with PICO components (AOR 2.4, 95% CI 1.1–5.5) and considering risk of bias during meta-analysis (AOR 1.4, 95% CI 1.1–1.8). SRs published more recently were more likely to account for risk of bias when interpreting results (AOR 1.3, 95% CI 1.1–1.5). SRs on pharmacological interventions showed higher quality in reporting source of funding among included studies (AOR 9.5, 95% CI 1.1–79.8) (Table 5). Results from the remaining binary logistic regression analyses and all the multinomial logistic regression analyses were not reported due to poor model fitting.
Association between characteristics of osteoarthritis interventions systematic reviews and methodological quality on individual AMSTAR 2 item: binary logistic regression analyses.
The p values of all Hosmer–Lemeshow tests were >0.1, indicating good model fit for all logistic regression analyses.
Non-pharmacological intervention was used as the reference.
AMSTAR, Assessing the Methodological Quality of Systematic Reviews; AOR, adjusted odds ratio; CI, confidence interval; PICO, patients, intervention, comparison and outcomes; RoB, risk of bias.
Discussion
This study assessed the methodological quality of an up-to-date sample of 167 SRs on osteoarthritis interventions published between 2008 and 2019. Our results indicated that the rigor of SRs in the field is disappointing, with 127 SRs (76.0%) having critically low quality. Cochrane reviews, being updates of previous SRs, and SRs published after 2017 were positively related to higher methodological quality. Unlike previous studies which showed that a higher JIF is associated with better methodological quality, 12 our results show that it is only predictive of better performance in including PICO components when formulating the research question and in considering risk of bias when interpreting results. For both SR authors and journal editors, it is advised to choose the Cochrane Handbook as a key resource for methodological standards.
While we observed methodological improvement among SRs published more recently, major limitations in methodology require urgent attention in critical aspects. Less than 50% of the appraised SRs satisfied these requirements: (i) registered a protocol a priori; (ii) assessed risk of bias for included studies appropriately by considering allocation concealment; blinding of patients and assessors, random sequence generation, and selective reported outcomes;, (iii) conducted a comprehensive literature search; and (iv) assessed the presence and potential impact of publication bias.
Assessing risk of bias with appropriate tools
The risk of bias of primary studies included in an SR is a key determinant for the validity of the evidence generated by the SR. Critical appraisal of included studies with a validated instrument is an important step in allowing the reviewers to judge how much confidence should be placed on the conclusion. The Cochrane Handbook explicitly discourages the use of scoring scales for assessing risk of bias because of lack of validity and reliability. 30 For instance, the Jadad scale was not recommended, as it emphasizes reporting rather than implementation of bias reduction measures, and does not assess important methodological aspects such as allocation concealment. 30 Unfortunately, 24 of our sampled SRs applied the Jadad scale, and overall they had critically low quality. It is important for future SR authors to use updated tools like the Cochrane Risk of Bias Tool 2, which allows an overall judgment on the risk of bias for each trial. This recent methodological development will facilitate stratified meta-analysis based on risk of bias. 31
Conducting comprehensive literature search and assessing publication bias
A thorough, objective, and reproducible search of multiple sources is required for ensuring comprehensiveness of an SR, 18 but most of the sampled SRs did not fulfill the comprehensiveness criteria. A search that covers only major bibliographic databases (e.g. MEDLINE, EMBASE, and the Cochrane Central Register of Controlled Trials) is not considered comprehensive according to the Cochrane Handbook 18 and the AMSTAR 2. 11 Identifying sources of unpublished evidence and including or consulting content experts in the field are two essential steps to ensuring comprehensiveness, as they reduce publication bias. 18 As indicated in an evaluation on the impact of including unpublished evidence in SRs, failure to include gray literature might lead to significant underestimation or overestimation of intervention effect, causing misuse of effective interventions.32,33 Although there is no perfect strategy to remove publication bias if identified, it is important to investigate the presence or absence thereof. The presence of publication bias often indicates an overestimation of intervention effect, of which evidence users should take into account when applying evidence in practice. 34 However, more than half of the sampled SRs did not carry out an adequate investigation of publication bias and discuss the likely impact on the results of the review, which calls for improvement of future SRs on osteoarthritis interventions.
Reducing selective outcome reporting
Like randomized trials, selective reporting of positive outcomes is common among SRs. 35 The dissemination of a well-developed protocol prior to conducting a SR would reduce such potential bias, 36 but this was not routinely practiced among our SR samples. Future SRs are suggested to register their protocol on PROSPERO, 37 while authors who conduct their reviews within Cochrane Review Groups are suggested to register their protocol in the Cochrane Database of Systematic Reviews. 17
Strengths and limitations
This study has some strengths. This study assessed an up-to-date sample of Cochrane and non-Cochrane SRs on osteoarthritis interventions using AMSTAR 2, 11 which is the latest validated instrument for appraising SRs. In addition, we focused on more recent SRs as they are more likely to be influential in clinical decision making.
There are also some limitations. First, factors associated with the compliance of 12 individual AMSTAR 2 items were not assessed due to poor model fitting in the regression analyses. Second, due to limited available resources, only SRs of RCTs and which applied at least one pairwise meta-analysis to pool the effect of any intervention for osteoarthritis were considered as eligible in this study. Our conclusions may not apply to SRs related to osteoarthritis interventions including non-RCTs, SRs without meta-analysis, and SRs beyond the search period. In addition, 24 SRs were excluded from our cross-sectional study because of inaccessible full text. They contribute approximately 14% of the included SRs. However, as the majority (76.0%) of the included SRs on osteoarthritis interventions were found to have critically low methodological quality, it is not likely that these 24 excluded SRs would pose a significant impact on our findings. Finally, our assessments solely relied on published information, which may be influenced by reporting quality. 12
Conclusion
The methodological quality of SRs on osteoarthritis interventions published in the last decade is disappointing, especially among non-Cochrane reviews. Joint efforts from SR authors, journal editors, and peer reviewers are required to improve the rigor of SRs in this field, especially in areas of disseminating SR protocol; assessing risk of bias for included studies with appropriate approach; conducting a comprehensive literature search; and assessing the presence and potential impact of publication bias.
Supplemental Material
Appendix_5_The_details_of_AMSTAR_2_assessment – Supplemental material for Methodological quality of systematic reviews on interventions for osteoarthritis: a cross-sectional study
Supplemental material, Appendix_5_The_details_of_AMSTAR_2_assessment for Methodological quality of systematic reviews on interventions for osteoarthritis: a cross-sectional study by Irene XY Wu, Huan Wang, Lin Zhu, Yancong Chen, Charlene HL Wong, Chen Mao and Vincent CH Chung in Therapeutic Advances in Musculoskeletal Disease
Supplemental Material
OA_Amstar_2_Appendix1to4_0908 – Supplemental material for Methodological quality of systematic reviews on interventions for osteoarthritis: a cross-sectional study
Supplemental material, OA_Amstar_2_Appendix1to4_0908 for Methodological quality of systematic reviews on interventions for osteoarthritis: a cross-sectional study by Irene XY Wu, Huan Wang, Lin Zhu, Yancong Chen, Charlene HL Wong, Chen Mao and Vincent CH Chung in Therapeutic Advances in Musculoskeletal Disease
Footnotes
Author contributions
VC and CW were involved in the study concept and design. HW and LZ performed the literature search. HW, LZ, and YC conducted literature selection, data extraction and methodological quality assessments. IW, HW, and LZ performed the analysis and interpretation of data. IW, CW, and VC were involved in drafting the manuscript. VC, CW, CM, and IW performed critical revision of the manuscript for important intellectual content. All authors read and approved the final manuscript. CW (email:
Conflict of interest statement
The authors declare that there is no conflict of interest.
Ethics statement
All data for this study were obtained from existing publications and ethical approval was not required for this research.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the High-level Talents Introduction Plan from Central South University (No. 502045003) and the National Natural Science Foundation of China (No. 81973709).
Informed consent
Not required.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
