Abstract
Background:
Controversy exists concerning whether tenotomy or tenodesis is the optimal surgical treatment option for proximal biceps tendon lesions.
Purpose:
To evaluate the clinical outcomes after arthroscopic tenodesis and tenotomy in the treatment of long head of the biceps tendon (LHBT) lesions.
Study Design:
Systematic review; Level of evidence, 4.
Methods:
A systematic review was performed by searching PubMed, the Cochrane Library, Web of Science, and Embase to identify randomized controlled trials (RCTs) and cohort studies that compared the clinical efficacy of tenotomy with that of tenodesis for LHBT lesions. A standardized data extraction form was predesigned to obtain bibliographic information of the study as well as patient, intervention, comparison, and outcome data. A random-effects model was used to pool quantitative data from the primary outcomes.
Results:
A total of 21 eligible studies were separated into 3 methodological groups: (1) 4 RCTs with level 1 evidence, (2) 3 RCTs and 4 prospective cohort studies with level 2 evidence, and (3) 10 retrospective cohort studies with level 3 to 4 evidence. Analysis of the 3 groups demonstrated a significantly higher risk of the Popeye sign after tenotomy versus tenodesis (group 1: risk ratio [RR], 3.29 [95% CI, 1.92-5.49]; group 2: RR, 2.35 [95% CI, 1.43-3.85]; and group 3: RR, 2.57 [95% CI, 1.33-4.98]). Arm cramping pain remained significantly higher after tenotomy only in the retrospective cohort group (RR, 2.17 [95% CI, 1.20-3.95]). The Constant score for tenotomy was significantly worse than that for tenodesis in the prospective cohort group (standardized mean difference [SMD], –0.47 [95% CI, –0.73 to –0.21]), as were the forearm supination strength index (SMD, –0.75 [95% CI, –1.28 to –0.21]) and the Simple Shoulder Test (SST) score (SMD, –0.60 [95% CI, –0.94 to –0.27]).
Conclusion:
The results demonstrated that compared with tenodesis, tenotomy had a higher risk of a Popeye deformity in all 3 study groups; worse functional outcomes in terms of the Constant score, forearm supination strength index, and SST score according to prospective cohort studies; and a higher incidence of arm cramping pain according to retrospective cohort studies.
Lesions of the long head of the biceps tendon (LHBT) are a common cause of shoulder pain and disorders in patients with rotator cuff tears. 11,32,37 Biceps tenotomy and tenodesis are well-established treatment options for addressing LHBT lesions when nonoperative management options have failed, but there is no consensus as to the superiority of either technique. 24,25,29,33
Tenotomy might be a relatively easier technical procedure that prompts an earlier return to activity, faster rehabilitation, and fewer restrictions after the procedure. 2,13 It also may adversely lead to functional deterioration of the biceps brachii such as strength of elbow flexion and forearm supination as well as the cosmetic deformity known as the Popeye sign. 38 Conversely, tenodesis may maintain the LHBT with fewer cosmetic problems, preservation of elbow flexion and supination strength, alleviation of pain, and improvement of functional scores, 34,39,40 but it may require a more difficult technique as well as longer operation and rehabilitation times 7,23 with a higher complication rate (humeral fracture, incorrect muscle-tendon length, failure of tenodesis).
The methodology of a systematic review with meta-analysis allows the comparison of tenotomy and tenodesis across several outcome fields. Previous systematic reviews have analyzed small numbers of studies, using data only from prospective and retrospective studies, to investigate tenotomy or tenodesis. Although observational data provide some support for a comparison of 2 procedures, good-quality randomized controlled trials (RCTs) are limited. However, several RCTs 3,6,19,27,28 and cohort studies 20,29 not included in previous meta-analyses have recently been published. Including these studies may change the interpretation of existing data.
The purpose of this review and meta-analysis was to assess and compare the clinical effectiveness of arthroscopic tenodesis versus tenotomy in the treatment of LHBT lesions, as reported in RCTs and cohort studies. Our hypothesis was that arthroscopic tenodesis would have fewer complications and better functional outcomes than would arthroscopic tenotomy. To test this hypothesis, we established 3 groups of studies: group 1 included data from RCTs, group 2 included data from prospective cohort studies or low-quality RCTs, and group 3 included data from retrospective cohort studies. Then, given that the inclusion of low-evidence studies in the meta-analysis introduced higher levels of heterogeneity, we present 3 separate analyses for each outcome for a comparison where possible.
Methods
We performed a systematic review and meta-analysis in accordance with the MOOSE (Meta-analysis of Observational Studies in Epidemiology)
36
and PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines.
20
There were 2 authors (H.L. and X.S.) who independently searched PubMed, the Cochrane Library, Web of Science, and Embase for relevant articles without language restrictions between inception and April 30, 2020. The electronic search strategy used was as follows: Participants: patients undergoing arthroscopic shoulder surgery for LHBT lesions Intervention: tenotomy Comparator: tenodesis Outcomes: clinical efficacy and adverse events Study designs: RCTs, prospective cohort studies, and retrospective cohort studies
Exclusion criteria included all other study types (case reports and case series) and studies that did not meet the aforementioned inclusion criteria.
Data extraction for each study was performed by 2 authors (H.L. and X.S.) independently and then reviewed by a third author (Q.Z.). There was no need for funding or a third party to obtain any of the collected data. A standardized predesigned data extraction form was used to acquire the relevant data from each study including the author, publication year, baseline patient characteristics, numbers of participants enrolled and randomized, allocation concealment, blinding, interventions, outcomes of interest, and follow-up duration. The most up-to-date or comprehensive information was obtained if multiple publications involved the same study. Potential sources of bias in RCTs were assessed according to the Cochrane Collaboration risk of bias tool, 17 stratifying the risk as high, unclear, or low risk in a traffic light configuration for random sequence generation, allocation concealment, blinding of participants, blinding of outcomes, and attrition bias. For study groups 2 and 3, the Newcastle-Ottawa scale 39 was used to assess studies in 3 key areas including selection and comparability of cohorts as well as outcome.
Reporting Outcomes
We focused on important outcomes: (1) the composition of complications, including all incidences of a Popeye deformity, arm cramping pain, and retears of the rotator cuff; (2) functional outcomes, mainly consisting of the Constant score, visual analog scale (VAS) for pain score, American Shoulder and Elbow Surgeons (ASES) score, elbow flexion strength index, forearm supination strength index, Simple Shoulder Test (SST) score, and University of California, Los Angeles (UCLA) shoulder score; and (3) range of motion, mainly measuring forward flexion, external rotation with the arm at the side, and abduction.
Study Methodology Assessment
The modified Coleman Methodology Score (MCMS) 9 was used by 2 reviewers (Q.Z. and W.G.) to assess the methodologic quality of the selected studies. The primary outcomes assessed via the MCMS are study size and type, follow-up time, attrition rate, number of interventions per group, and proper description of the study methodology. Scores range from 0 to 100 and are categorized as excellent (85-100), good (70-84), fair (55-69), and poor (<55).
Statistical Analysis
The overall summary estimates were calculated using inverse variance–weighted random-effects meta-analysis.
18
Individual relative risk estimates and summary estimates were presented graphically in forest plots. The
Results
A total of 485 potentially relevant citations were initially identified using our search strategy; 4 additional reports were found during the manual search of references. There were 214 duplicate studies excluded using EndNote software (Version X9; Clarivate Analytics). An additional 248 studies were removed after screening the title and abstract. Then, 27 articles remained for a full-text assessment, and 6 were subsequently excluded. Thus, 21 studies ** reporting on 1753 participants (n = 958 tenotomy; n = 795 tenodesis) were included in the final analysis. The flowchart of selecting relevant articles is shown in Figure 1. The main characteristics of the included studies are listed in Table 1. The characteristics of study patients are shown in Table 2. LHBT and rotator cuff injury types and tenodesis methods of the included studies are presented in Table 3.

Flowchart of the number of studies identified and included in this meta-analysis based on PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines.
Characteristics of Included Studies
Characteristics of Study Patients
LHBT Injury Types, Rotator Cuff Injury Types, and Tenodesis Methods
Study Groups
Overall, 4 RCTs, 6,26,27,44 each with an evidence level of 1, were included in group 1. This allowed an analysis of 448 participants (n = 221 tenotomy; n = 227 tenodesis) from the data. Group 2 included 7 publications 3,10,19,24,25,28,31 of level 2 evidence, with 184 patients undergoing biceps tenotomy and 189 patients undergoing biceps tenodesis; 3 studies were low-quality RCTs, and 4 were prospective cohort studies. Group 3 included a further 10 retrospective cohort studies †† with a total of 932 patients (n = 553 tenotomy; n = 379 tenodesis). Of these retrospective studies, all were rated as level 3 evidence, except 2 publications of level 4 evidence. One written in Chinese 43 and another written in German 22 were included in this group.
Methodological Quality Assessment
The risk of bias of the included studies is shown in Tables 4 and 5. In group 1, the study by Castricini et al, 6 which did not assess supination strength, and the study by Lee et al, 26 which did not report the SD of the VAS pain score and Constant score at the last follow-up, were judged as having a high risk of attrition bias. In groups 2 and 3, there were 7 studies 3,8,10,19,24,25,29 that were awarded 8 stars, 4 studies 1,21,28,43 that were given 7 stars, and 6 studies 4,5,22,31,33,41 that were granted 6 stars.
Risk of Bias for Randomized Clinical Trials Using the Cochrane Collaboration Tool
Risk of Bias for Cohort Studies Using the Newcastle-Ottawa Scale
Modified Coleman Methodology Score
Table 1 shows the MCMS scores from the 21 included studies (mean, 81.7 ± 8.8). There were 9 studies 6,10,19,24 –28,44 that received excellent scores, 11 studies ‡‡ that received good scores, and 1 study 41 that received a fair score.
Outcomes
A summary of all data analyses is presented in Table 6. For ease of comparisons, data from study groups 1 to 3 are presented separately for each outcome measure. For ease of interpretation, the effect of treatment type (tenotomy or tenodesis) on the incidence of each outcome measure was summarized as forest plots (Figures 2 -13).
Tenotomy Versus Tenodesis Outcomes by Study Group
Popeye Deformity
A total of 20 studies §§ involving 948 tenotomy and 785 tenodesis procedures reported results for the incidence of a Popeye deformity. There was a significantly higher rate of a Popeye deformity after tenotomy versus tenodesis in groups 1, 2, and 3 (Figure 2). In group 1, data from the 4 RCTs showed a significantly higher incidence of a Popeye deformity after tenotomy (risk ratio [RR], 3.29 [95% CI, 1.92-5.49]). 6,26,27,44 Data from group 2 included 7 studies, with a significant RR of 2.35 (95% CI, 1.43-3.85). 3,10,19,24,25,28,31 Data from 9 retrospective cohort studies were included in group 3, which showed a significant RR of 2.57 (95% CI, 1.33-4.98). 1,4,8,21,22,29,41,43

Forest plot displaying the risk ratio and 95% CI for the effect of treatment using tenotomy or tenodesis on the incidence of a Popeye deformity.
Arm Cramping Pain
A total of 12 studies ∥∥ specifically reported arm cramping pain as an outcome measure. Data were included from 453 tenotomy and 436 tenodesis procedures in this analysis. There was no significant difference in groups 1 and 2 but a significantly higher rate of arm cramping pain after tenotomy than tenodesis in group 3 (Figure 3). In group 1, of 2 RCTs available, 1 reporting no incidence of arm cramping pain was excluded, so data from only 1 RCT showed an RR of 1.73 (95% CI, 0.61-4.92). 6,44 In group 2, data from 4 of 6 studies showed an RR of 3.16 (95% CI, 0.99-10.15); the other 2 studies showing no result were excluded. 3,19,24,25,28,31 In group 3, with the exclusion of 1 study with no result, data from 4 cohort studies showed an RR of 2.17 (95% CI, 1.20-3.95). 1,5,29,41

Forest plot displaying the risk ratio and 95% CI for the effect of treatment using tenotomy or tenodesis on the incidence of arm cramping pain.
Rotator Cuff Retear
A total of 3 studies 6,26,31 were included with 114 tenotomy and 127 tenodesis procedures. We found no significant difference in the risk of rotator cuff retears between tenotomy and tenodesis (Figure 4). In group 1, there were 2 RCTs that reported an RR of 1.03 (95% CI, 0.47-2.23). 6,26 Data from 1 study were included in group 2, showing a corresponding RR of 2.87 (95% CI, 0.61-13.61). 31 In group 3, no study reported the rotator cuff retear rate, so it could not be included in the analysis.

Forest plot displaying the risk ratio and 95% CI for the effect of treatment using tenotomy or tenodesis on the incidence of rotator cuff retears.
Constant Score
A total of 11 studies ¶¶ including 374 tenotomy and 376 tenodesis procedures specifically reported Constant scores. A higher score was related to a better clinical outcome. In this analysis, significantly higher functional Constant scores were achieved after tenodesis than after tenotomy in group 2, but the difference was not significant in groups 1 and 3 (Figure 5). In group 1, data were presented from 2 RCTs, 6,44 which showed a standardized mean difference (SMD) of –0.16 (95% CI, –0.57 to 0.24). In group 2, data from 4 studies 10,19,24,28 showed an SMD of –0.47 (95% CI, –0.73 to –0.21). In group 3, data from 5 retrospective studies 5,8,22,29,33 showed an overall SMD of –0.26 (95% CI, –0.60 to 0.08).

Forest plot displaying the standardized mean difference (SMD) and 95% CI for the effect of treatment using tenotomy or tenodesis on the Constant score.
VAS Pain Score
A total of 6 studies 3,6,27,31,43,44 reported VAS pain scores in 230 tenotomy and 222 tenodesis procedures. VAS scores were found to be equivocal after tenodesis compared with tenotomy in all study groups (Figure 6). In group 1, data were presented from 3 RCTs (SMD, 0.04 [95% CI, –0.18 to 0.26]). 6,27,44 In group 2, data from 2 prospective cohort studies showed an SMD of –0.36 (95% CI, –1.33 to 0.60). 3,31 Data from 1 retrospective cohort study in group 3 showed an SMD of 0.26 (95% CI, –0.37 to 0.89). 43

Forest plot displaying the standardized mean difference (SMD) and 95% CI for the effect of treatment using tenotomy or tenodesis on the visual analog scale pain score.
ASES Score
The ASES score was reported in 5 studies, 19,24,27,31,41 allowing an analysis of 155 tenotomy and 156 tenodesis procedures. In all levels of analysis, no significantly better ASES score was recorded for tenodesis or tenotomy (Figure 7). In group 1, data presented from 1 RCT demonstrated an SMD of 0.15 (95% CI, –0.22 to 0.51). 27 In group 2, data from 3 prospective studies showed an SMD of –0.39 (95% CI, –0.78 to 0.00). 19,25,31 In group 3, data from 1 retrospective study showed a corresponding SMD of 0.26 (95% CI, –0.41 to 0.92). 41

Forest plot displaying the standardized mean difference (SMD) and 95% CI for the effect of treatment using tenotomy or tenodesis on the American Shoulder and Elbow Surgeons score.
Elbow Flexion Strength Index
Data were presented in 3 studies 24,31,44 regarding elbow flexion strength index and included 145 tenotomy and 148 tenodesis procedures. The analysis showed no significantly different result of elbow flexion strength index in any level (Figure 8). In group 1, there was 1 RCT that presented an SMD of 0.00 (95% CI, –0.32 to 0.32). 44 In group 2, there were 2 studies presenting an SMD of –0.02 (95% CI, –0.35 to 0.31). 24,31 No data were available for inclusion from retrospective cohort studies in group 3.

Forest plot displaying the standardized mean difference (SMD) and 95% CI for the effect of treatment using tenotomy or tenodesis on elbow flexion strength index.
Forearm Supination Strength Index
Data from only 2 studies 31,44 were included regarding forearm supination strength index, including 104 tenotomy and 105 tenodesis procedures. In the analysis of all groups, only 1 prospective study from group 2 showed a significant trend that forearm supination strength index was higher after tenodesis than tenotomy (Figure 9). 44 The other study, from group 1, presented an SMD of 0.00 (95% CI, –0.32 to 0.32). 31 Group 3 studies presented no relevant data.

Forest plot displaying the standardized mean difference (SMD) and 95% CI for the effect of treatment using tenotomy or tenodesis on forearm supination strength index.
SST Score
A total of 3 studies 10,19,28 reported the SST score after tenotomy or tenodesis, including data from 70 tenotomy and 77 tenodesis procedures (Figure 10). In group 2, data were presented from 3 prospective cohort studies, which showed a significant increase in the SST score after tenodesis, with an SMD of –0.60 (95% CI, –0.94 to –0.27). No data were available in groups 1 and 3.

Forest plot displaying the standardized mean difference (SMD) and 95% CI for the effect of treatment using tenotomy or tenodesis on the Simple Shoulder Test score.
UCLA Score
Overall, 3 studies 8,33,43 reported the UCLA score for 69 tenotomy and 74 tenodesis procedures (Figure 11). Studies in groups 1 and 2 presented no relevant data. Data from 3 studies in group 3 showed no significantly different result, with an SMD of –0.18 (95% CI, –0.55 to 0.19).

Forest plot displaying the standardized mean difference (SMD) and 95% CI for the effect of treatment using tenotomy or tenodesis on the University of California, Los Angeles score.
Forward Flexion and External Rotation at the Side
A total of 2 studies 5,31 reported postoperative range of motion for 66 tenotomy and 64 tenodesis procedures (Figures 12 and 13). The studies showed no significant change in forward flexion or external rotation at the side. Studies in group 1 presented no relevant data. In group 2, there was 1 prospective cohort study that showed forward flexion or external rotation at the side, with an SMD of 0.06 (95% CI, –0.45 to 0.58) or 0.08 (95% CI, –0.43 to 0.60), respectively. 31 In group 3, data from 1 retrospective study were presented with an SMD of –0.39 (95% CI, –0.85 to 0.08) or –0.57 (95% CI, –1.04 to –0.10), respectively. 5

Forest plot displaying the standardized mean difference (SMD) and 95% CI for the effect of treatment using tenotomy or tenodesis on forward flexion.

Forest plot displaying the standardized mean difference (SMD) and 95% CI for the effect of treatment using tenotomy or tenodesis on external rotation at the side.
Discussion
The most important findings of this study were that patients undergoing tenodesis had a significantly lower risk of a cosmetic Popeye deformity than did those undergoing tenotomy in group 1 consisting of level 1 studies; that a lower incidence of a Popeye deformity as well as a better Constant score, forearm supination strength index, and SST score were seen in patients undergoing tenodesis than in those undergoing tenotomy from group 2 comprising level 2 studies; and that group 3 composed of level 3 or 4 studies showed a lower risk of a Popeye deformity and arm cramping pain after tenodesis. Otherwise, no significant difference was found in rotator cuff retears, VAS pain score, ASES score, elbow flexion strength index, UCLA score, and range of motion.
A residual aesthetic deformity after 2 potential procedures for the repair of LHBT injuries is a concern for surgeons and patients. A Popeye deformity often occurs in patients undergoing tenotomy, and some studies have shown a significant difference compared with tenodesis. The occurrence of the Popeye sign after tenotomy in our present study was also significantly higher in all study groups. In 2005, Wolf et al 42 performed a biomechanical analysis of the biceps and found that the occurrence of the Popeye sign was associated with a significant risk of distal biceps tendon migration and a lower load to failure after tenotomy. MacDonald et al 27 revealed that the contributing factors to the occurrence of a Popeye deformity in those undergoing tenodesis may have been undertensioning of the biceps tendon or anchoring of the biceps tendon in a place where there was too much slack in the tendon. The recent prospective RCT by MacDonald et al 27 also showed a 3.5-times higher risk of a cosmetic Popeye deformity after tenotomy compared with tenodesis. Several meta-analyses 30,34 have shown a similar result. In a review of the literature, Frost et al 13 found that the incidence of a Popeye deformity after tenotomy ranged from 3% to 70%. Otherwise, the literature available regarding the incidence of a Popeye deformity after tenodesis has been divided: some studies have shown no deformity after tenodesis, 10,29,41 while others have revealed that rates varied from 5.5% to as high as 24%. 6,12,24,26,44 Interestingly, a recent retrospective study by Godenèche et al 15 reported that no deformity at all was found after biceps tenodesis or tenotomy at 10-year follow-up.
Biceps pain as a result of cramping is a potential drawback of biceps tenotomy. Even if a cosmetic deformity is present, it rarely causes problems and is often even ignored by patients themselves, 5 but related arm cramping pain is noticed and might compromise patient satisfaction. Cramping and cramp-like arm pain have been revealed in anywhere between 8% and 40% of patients undergoing tenotomy. 41 Cramping pain in the bicipital groove was more frequently observed in patients undergoing tenotomy. In our retrospective cohort study group, a significantly higher risk of cramping pain in patients after tenotomy was also found, consistent with the findings of Ge et al 14 and Gurnani et al. 16
The Constant score is most frequently utilized as a tool for the assessment of the shoulder joint. Recently, a series of systematic reviews and meta-analyses 14,30,44 found a significantly higher Constant score after tenodesis than tenotomy, although an earlier meta-analysis derived from RCTs and cohort studies showed no significant difference in the Constant score between the 2 groups. 16 In the present study, the pooled Constant scores in the prospective cohort study group were significant after tenodesis, consistent with the results reported by Na et al, 30 Shang et al, 34 and Ge et al. 14
The researchers have reported decreases of 20% in forearm supination strength and 8% to 20% in elbow flexion strength after spontaneous LHBT ruptures. 29,35 We found no significant difference in terms of elbow flexion strength index. Because of different kinds of assessment tools and scales regarding forearm supination strength, we had to analyze 2 studies using forearm supination strength index as an outcome measure. Also, only a prospective cohort study by Oh et al 31 demonstrated a significantly higher forearm supination strength index after tenodesis. In addition, an RCT by Lee et al 26 found greater forearm supination power in the tenodesis group, and a level 3 study by Wittstein et al 41 reported that the tenodesis group had significantly higher peak supination torque than did the tenotomy group.
The SST is also a widely used measure that assesses functional limitations of the affected shoulder. To our knowledge, the present study is the first to report a higher SST score in patients undergoing tenodesis compared with tenotomy; it was a meaningful finding statistically.
Strengths and Limitations
We performed a detailed and robust search spanning multiple databases without language restrictions, which allowed us to include studies from all over the world, thus improving the generalizability of our findings. The application of trial data and cohort studies allowed the inclusion of as many data as possible, such as good-quality research data especially from retrospective studies that can often be omitted in systematic reviews, therefore providing the most comprehensive update on clinical outcomes after tenotomy and tenodesis.
There were several limitations. The analysis was limited by the few relevant RCTs 6,26,27,44 that have been published and small sample sizes, given the outcomes of interest. The inclusion of multiple study types inevitably results in significant heterogeneity, which therefore compromised the robustness of this study’s conclusions. In the retrospective cohort study group, with the inclusion of 2 level 4 studies, 29,33 the pooled results of a Popeye deformity, arm cramping pain, the Constant score, and the UCLA score as relevant outcome measures had high heterogeneity. Furthermore, there was variability in the populations assessed (age, sex, LHBT injury type, rotator cuff injury type, sedentary patient vs manual laborer, dominant vs nondominant extremity), the comparator (biceps tenodesis methods or materials), the reporting of outcomes (including different kinds of assessment tools and scales), and the follow-up duration.
Conclusion
This meta-analysis demonstrated that compared with tenodesis, tenotomy had a higher risk of a Popeye deformity in all study groups; worse functional outcomes in terms of the Constant score, forearm supination strength index, and SST score in the prospective cohort study group; and a higher incidence of arm cramping pain in the retrospective cohort study group.
Footnotes
Notes
Final revision submitted September 16, 2020; accepted November 11, 2020.
One or more of the authors has declared the following potential conflict of interest or source of funding: This study was funded by the Beijing Municipal Science and Technology Commission (grant Z171100001017209), National Natural Science Foundation of China (grants 81972130, 81703896, 81972107, 81902203, and 82072494), National Key Research and Development Program of China (grant 2017YFC0108102), and Capital Health Research and Development of Special Fund (grant 2020-2-4067). AOSSM checks author disclosures against the Open Payments Database (OPD). AOSSM has not conducted an independent investigation on the OPD and disclaims any liability or responsibility relating thereto.
