Published August 02, 2023
The Jobe test is commonly used to diagnose full-thickness (FT) supraspinatus (SSP) tendon tear. The original Jobe test used single-arm testing, although the double-arm Jobe test has also been used in clinical practice.
To evaluate the reliability, accuracy, and diagnostic value of the single-arm and double-arm Jobe test for diagnosis of FT SSP tear.
Cohort study (diagnosis); Level of evidence, 2.
Patients with shoulder pain requiring magnetic resonance imaging (MRI) of the shoulder between March 1, 2021, and March 31, 2022, were enrolled. Each patient underwent both single-arm and double-arm Jobe tests by 2 orthopaedic surgeons independently, and the presence of pain, weakness, or both during the test was documented. Diagnosis of FT SSP tear on MRI scan was used as the gold standard to compare the diagnostic value of the Jobe test. The interrater reliability of the Jobe test, and the inter- and intrarater reliability of the MRI evaluation (patients with vs without FT SSP tear) was performed using the kappa (κ) coefficient.
A total of 80 patients (57 females [71%]; mean age, 61.46 ± 9.61 years) were included. MRI scans revealed FT SSP tears in 32 (40%) of the patients. Both single- and double-arm Jobe tests had low diagnostic values (accuracy, 46.25% to 60%; sensitivity, 46.9% to 84.4%; specificity, 25% to 66.7%). The single-arm test with weakness revealed the highest sensitivity (84.4%). The double-arm test with weakness plus pain revealed the highest specificity (66.7%). Double-arm testing with pain had the highest accuracy (60%), with the highest positive likelihood ratio (1.5). The interrater reliability of the Jobe test indicated substantial agreement (double-arm vs single-arm, κ = 0.771 and 0.716, respectively, agreement 85%; P < .05). The interrater reliability of MRI scan evaluation of the FT SSP tear indicated substantial agreement (κ = 0.750, agreement 85%; P < .05), while the intrarater reliability indicated almost perfect agreement (κ = 0.917, agreement 96%; P < .05).
The Jobe test, either single- or double-arm, had low accuracy and diagnostic value in diagnosing FT SSP tear. The concern with a single-arm examination for weakness is that it may be an inappropriate diagnostic test for ruling out FT SSP with 84% sensitivity, while a double-arm examination provides a higher specificity.
The Jobe test is commonly used to diagnose full-thickness (FT) supraspinatus (SSP) tendon tear. The original Jobe test used single-arm testing, although the double-arm Jobe test has also been used in clinical practice.
To evaluate the reliability, accuracy, and diagnostic value of the single-arm and double-arm Jobe test for diagnosis of FT SSP tear.
Cohort study (diagnosis); Level of evidence, 2.
Patients with shoulder pain requiring magnetic resonance imaging (MRI) of the shoulder between March 1, 2021, and March 31, 2022, were enrolled. Each patient underwent both single-arm and double-arm Jobe tests by 2 orthopaedic surgeons independently, and the presence of pain, weakness, or both during the test was documented. Diagnosis of FT SSP tear on MRI scan was used as the gold standard to compare the diagnostic value of the Jobe test. The interrater reliability of the Jobe test, and the inter- and intrarater reliability of the MRI evaluation (patients with vs without FT SSP tear) was performed using the kappa (κ) coefficient.
A total of 80 patients (57 females [71%]; mean age, 61.46 ± 9.61 years) were included. MRI scans revealed FT SSP tears in 32 (40%) of the patients. Both single- and double-arm Jobe tests had low diagnostic values (accuracy, 46.25% to 60%; sensitivity, 46.9% to 84.4%; specificity, 25% to 66.7%). The single-arm test with weakness revealed the highest sensitivity (84.4%). The double-arm test with weakness plus pain revealed the highest specificity (66.7%). Double-arm testing with pain had the highest accuracy (60%), with the highest positive likelihood ratio (1.5). The interrater reliability of the Jobe test indicated substantial agreement (double-arm vs single-arm, κ = 0.771 and 0.716, respectively, agreement 85%; P < .05). The interrater reliability of MRI scan evaluation of the FT SSP tear indicated substantial agreement (κ = 0.750, agreement 85%; P < .05), while the intrarater reliability indicated almost perfect agreement (κ = 0.917, agreement 96%; P < .05).
The Jobe test, either single- or double-arm, had low accuracy and diagnostic value in diagnosing FT SSP tear. The concern with a single-arm examination for weakness is that it may be an inappropriate diagnostic test for ruling out FT SSP with 84% sensitivity, while a double-arm examination provides a higher specificity.
The rotator cuff (RC) plays a fundamental role in physiological shoulder motion. RC tears may cause weakness or pain in the shoulder but may also remain completely asymptomatic. 7 Tears of the superior-posterior RC (supraspinatus [SSP] and infraspinatus) are more common compared with tears of the anterior-superior RC (subscapularis and SSP). A variety of clinical tests have been specifically developed for SSP tendon tear. With the Jobe test, as described by Jobe and Moynes in 1982, 13 the determined activity of the SSP muscle can be isolated to some degree with the arm at 90° of abduction, 30° of horizontal flexion, and full internal rotation. Electromyographic records obtained with patients in this position indicate that the SSP is the dominant RC muscle firing during the maneuver. 13
The Jobe or empty-can test is a common physical test for evaluating SSP tear or pathology. The original Jobe test used single-arm testing, with pain during the test suggestive of RC pathology. 13 The Jobe test can be performed with a single-arm or double-arm maneuver. 3,8,12,17 However, studies have no evidence comparing the diagnostic value of single-arm Jobe test and double-arm maneuvers. 3,12,17
The purpose of this study was to evaluate the reliability, accuracy, and diagnostic value of the single-arm and the double-arm Jobe test for diagnosis of full-thickness (FT) SSP tear. Our hypothesis was that the double-arm Jobe test would be more accurate, as the imbalance of the periscapular muscle during the single-arm test may result in a false-positive (FP) result.
This was a cross-sectional study of 80 consecutive shoulder pain patients prospectively enrolled from a single hospital between March 1, 2021, and March 31, 2022. The study was conducted after obtaining approval from the Human Research Ethics Committee of Thammasat University, and informed consent was obtained from all participants. Inclusion criteria included patients over 45 years of age with more than 4 weeks of symptoms (shoulder pain or weakness), positive subacromial impingement sign on examination (Neer impingement sign or Hawkins-Kennedy test), no significant shoulder stiffness (passive forward elevation >120°, external rotation >45°), and shoulder magnetic resonance imaging (MRI) scan. Excluded were patients with current fractures around the upper extremity, previous shoulder surgery, bilateral shoulder pain, calcific tendinitis of the shoulder on imaging, significant weakness or pain from cervical spine pathologies, presence of pseudoparalysis, and patients who had received shoulder or subacromial injection within 3 months.
Data Collection and Jobe Test
We collected baseline patient characteristics (age, sex, body mass index), as well as symptom duration, affected side, dominant arm, pain visual analog scale score, and Western Ontario Rotator Cuff Index score. All patients underwent axial, coronal, and sagittal T2-weighted fat-saturated imaging performed on 3-T MRI to evaluate the pathology of the affected shoulder. Finally, all patients were examined in our outpatient orthopedic clinic by 2 orthopedic surgeons (A.A. and P.B.). The 2 examiners, who were blinded to the MRI results, independently examined all patients in 5- to 10-minute intervals. During the examination, conducted with the patient in the sitting position, the SSP of the affected shoulder was evaluated using the Jobe test in a fixed pattern: first the double-arm test, followed by the single-arm test. For each test, muscle strength was measured by manual compression (Figure 1). The presence of weakness, pain, or weakness plus pain during the test was documented. The interrater reliability of the examiner findings was calculated.
MRI Evaluation
The process of reviewing the MRI scans required a picture archiving and communication system workstation. MRI was used as the gold standard for definite diagnosis of FT SSP tear by 2 reviewers, a fellowship-trained musculoskeletal radiologist (W.S.B.) and an orthopaedic surgeon (A.A.), who independently categorized the patients into those with and those without FT SSP. The interrater and intrarater reliability of the MRI interpretations was assessed. To evaluate intrarater reliability, the orthopaedic surgeon (A.A.) re-evaluated the patients after 2 weeks.
The integrity of the RC was also evaluated on MRI scans and was categorized as RC syndrome/bursitis, partial-thickness RC tear (including low-grade tear [≤6 mm] and high-grade tear [>6 mm]), 5 and FT RC tear. 10 Some MRI scans from the study population are shown in Figure 2.
Statistical Analysis
The sample size was calculated a priori by assuming a sensitivity and specificity of at least 0.85 and a CI of 95%, resulting in a sample size of at least 53 patients with a power of 0.9 according to the McNemar method. An adjusted sample size with a 70% incidence of no FT SSP tear resulted in a study sample of at least 76 patients.
All statistical analyses were performed using the STATA software for Windows (Version 17; Stata Corp LP). To evaluate the diagnostic value of the Jobe test, the results of the clinical examinations were compared with the MRI scan results. Diagnostic value was evaluated in terms of test type (single-arm, double-arm, or combined single- and double-arm) and presence of weakness, pain, or weakness plus pain. The statistical analysis included the determination of the sensitivity, specificity, accuracy, positive likelihood ratio (LR+), negative likelihood ratio (LR-), positive predictive value (PPV), negative predictive value (NPV), and the area under the receiver operating characteristic (ROC) curve (AUC). 3 These parameters were calculated as follows:
The interrater reliability of the Jobe tests and the inter- and intrarater reliabilities of the MRI evaluations were calculated using the kappa (κ) coefficient, 15 in which κ < 0.00 was considered poor strength of agreement; 0.00 to 0.20, slight; 0.21 to 0.40, fair; 0.41 to 0.60, moderate; 0.61 to 0.80, substantial; and 0.81 to 1.00, almost perfect. 15 The percentage agreement between the 2 raters was also calculated.
A total of 80 patients were included in the study (71% female patients; mean age, 61.46 ± 9.61 years). The mean duration of shoulder pain was 9 ± 8.55 months. FT SSP tear on MRI scan was detected in 32 patients (40%). The patient characteristics are shown in Table 1.
Overall, the sensitivity of each type of Jobe test ranged from 46.9% to 84.4%, with a specificity from 25% to 66.7%. The accuracy of the Jobe test, either single-arm or double-arm, with weakness, pain, or weakness and pain, ranged from 46.25% to 60%. The single-arm test with weakness had the highest sensitivity, followed by single-arm test with pain and single-arm test with weakness plus pain (84.4%, 78.1%, and 78.1%, respectively). The double-arm test with weakness plus pain had the highest specificity, followed by the double-arm test with weakness (66.7% and 64.6, respectively). Double-arm testing with pain had the highest accuracy of 60% with the highest LR+ of 1.5. For all values, the combined single- and double-arm Jobe test showed the same results as for the double-arm test alone. The summarized diagnostic values of the Jobe tests are shown in Table 2.
The interrater reliability of the Jobe test indicated substantial agreement for both the double-arm test (κ = 0.771) and the single-arm test (κ = 0.716) (85% agreement; P < .05). The interrater reliability of MRI scan evaluation of the FT SSP tear indicated substantial agreement (κ = 0.750, 85% agreement; P < .05), while intrarater reliability was almost perfect (κ = 0.917, 96% agreement; P < .05).
The most important finding of the present study was that single-arm Jobe test with weakness revealed the highest sensitivity of 84.4%. Double-arm Jobe test with weakness plus pain revealed the highest specificity of 66.7%. Double-arm testing with pain had the highest accuracy of 60% with the highest LR+ of 1.5. The overall accuracy of the Jobe test in the diagnosis of FT SSP tear was 46.25% to 60%.
This study revealed the low diagnostic values of the Jobe test in FT SSP tear. Physical examination, augmented by MRI or ultrasound scans, certainly plays an important role in the diagnosis of RC tears. Various clinical tests have been developed to assess the SSP tendon tear. Several studies have investigated the diagnostic value of these tests, although different symptoms, such as pain, weakness, or both, were used to interpret the results. It is still unclear which of these symptoms is more accurate. 1,2,8 –11
The diagnosis of SSP tendon tear has involved many diagnostic physical tests. The Jobe, or empty can, test is a common physical test evaluating the SSP tear or pathology. The original Jobe test used single-arm testing, while pain during the test suggested RC pathology. 13 Some reported the positive test with pain or weakness indicating SSP pathology. 4,8 The Jobe test can be performed with a single-arm or double-arm maneuver, with reported a sensitivity of 54% to 96% and a specificity of 46% to 68%. 3,8,12,17
Itoi et al 8 determined the clinical usefulness of the full can (arm in 90° of elevation in the scapular plane and 45° of external rotation) and empty can (arm in 90° of elevation in the scapular plane and full internal rotation) tests for detecting of SSP tear in 143 shoulders. The empty can test with pain, weakness, and both pain and weakness showed a sensitivity of 63%, 77%, and 89%, respectively, whereas the specificity was 55%, 68%, and 50%, respectively. The authors concluded that muscle weakness was indicative of SSP tear in the empty can test. Sgroi et al 17 analyzed the diagnostic value of 7 clinical tests for the diagnosis of SSP tears in 115 patients. The results of the empty can test with pain, weakness, and both pain and weakness showed a sensitivity of 54%, 90%, and 96%, respectively, whereas the specificity was 61%, 46%, and 31%, respectively. A systematic review revealed the pooled estimated sensitivity of the Jobe test was 0.77 (CI, 0.67-0.85) and the specificity was 0.67 (CI, 0.59-0.73) in 5 eligible studies. 14 This finding was similar to our study and was a confirmation of the low diagnostic accuracy of the Jobe test alone in the diagnosis of posterosuperior RC tears. 6,12,14,18
The detection of RC tears affects treatment strategies (ie, physical therapy, injection, surgical intervention). Due to the low diagnostic values of the Jobe test and the other clinical tests in FT SSP tear, shoulder pain patients older than 50 years without significant shoulder stiffness may require a high level of suspicion or definitive determination of an RC tear. Then, other diagnostic methods including plain radiographs, ultrasound, or MRI scan should be considered.
According to our hypothesis, the imbalance of the periscapular muscle during performance of the single-arm Jobe test may result in an FP of the test, while the double-arm Jobe test may be more accurate. The single-arm testing had higher sensitivity compared with double-arm testing in all tests (weakness, pain, or weakness and pain), while having lower specificity, accuracy, and LR+. This finding supported the view that the single-arm Jobe test may result in an FP result and may lead to overestimation of the SSP tear.
The diagnostic accuracy of the combined single- and double-arm Jobe test revealed the same results as double-arm testing alone. This finding supported our hypothesis that periscapular muscle balance in the double-arm test would lead to more accuracy than the single-arm test; thus, if the double-arm test revealed a positive result, the single-arm test would also have a positive result. The combination of single- and double-arm tests did not increase the yield of diagnosis of FT SSP tear.
The interrater reliability κ statistic was 0.716 for the single-arm test and 0.771 for the double-arm test, which was in the range of substantial agreement (0.61-0.80), thus we might require a better clinical test in the future.
Limitations
Our study has some limitations. First, selection bias must be considered because the prevalence of FT SSP tears in our investigating center (positioning of tertiary care and shoulder center in our region) was high, at up to 40%. Second, clinical examinations were not sequential random between single-arm and double-arm tests. We started with a double-arm and then a single-arm test in all participants, this may create an FP for later tests after the pain was aggravated (although we waited about 5 minutes). Third, we classified the patients into only 2 groups (FT SSP tear and no FT SSP tear) and did not differentiate among low-grade and high-grade partial-thickness SSP tear groups. The high-grade SSP tear, which has clinical meaning, was recruited in the no FT SSP tear group, which may alter the results. Fourth, the sample size in our study compared with previous studies was relatively low. Fifth, MRI scan was used as the gold standard in RC tear diagnosis with substantial interrater agreement and almost perfect agreement of intrarater reliability. Diagnostic arthroscopy is still the gold standard for making the diagnosis. Sixth, the Jobe test was conducted in a sitting rather than a standing position as in the original test. We chose the sitting position, which might have more balance and stability between the left and right upper limbs during a test in elderly patients. Seventh, we did not evaluate the intrarater reliability of the Jobe test. The patients received treatment (such as injection, physical therapy, or medications) after the test, so the results of Jobe test might be different in the subsequent weeks. Last, combining the single-arm and double-arm Jobe test did not improve diagnostic values compared with the single clinical test.
Jobe test, either single- or double-arm, had low accuracy and diagnostic value in diagnosing superior-posterior RC FT tear. Better clinical testing or other diagnostic methods such as ultrasound or MRI scans should be considered with a high suspicion of the RC tear. The concern with a single-arm examination for weakness is that it may be an inappropriate diagnostic test for ruling out FT SSP with 84% sensitivity, while a double-arm examination provides higher specificity.
The authors thank the Department of Orthopaedics, Faculty of Medicine, Thammasat University and Thammasat University Hospital for their kind support.
Surangkana Katepun, Phanuwat Boonsun, Waraporn Srikhum Boonsaeng, Adinun Apivatgaroon
Orthopaedic Journal of Sports Medicine
Vol 2023, Issue 8, pp. -
Issue published date: August-01-2023
10.1177/23259671231187631