Sage Journals: Discover world-class research

Abstract

BACKGROUND:

The legitimacy of manual muscle testing (MMT) is dependent in part on the reliability of assessments obtained using the procedure.

OBJECTIVE:

The purpose of this review, therefore, was to consolidate findings regarding the test-retest and inter-rater reliability of MMT from studies meeting inclusion and exclusion criteria.

METHODS:

An electronic search of PubMed, Scopus, and CINAHL databases and a hand search were conducted to identify articles addressing the test-retest or inter-rater reliability of MMT. Data on participants, testing specifics, and findings regarding reliability were extracted.

RESULTS:

Of 189 unique articles identified, 9 were found to meet inclusion/exclusion criteria. The studies were highly variable in regard to the population tested, MMT procedure and scoring, and findings. Nevertheless, based on pairwise comparisons, substantial or almost perfect test-retest and inter-rater agreement was demonstrated for most muscle actions tested.

CONCLUSIONS:

Reliable assessments of strength may be obtained by MMT but not assumed. Further research is required to address the reliability of MMT across pathologies, muscle groups, and test procedures.

Keywords

Muscle strength measurement clinimetrics

1. Introduction

Manual muscle testing (MMT) has a long history as a clinical procedure for grading muscle strength [1]. Individuals conducting the procedure use observation of muscles’ ability to create movement and respond to manual resistance to assign ordinal scores. The validity of MMT is well established relative to other measures of muscle strength [2, 3, 4] and function [4, 5], even though it is notably lacking in sensitivity [6, 7, 8].

The reliability of MMT has been examined extensively but not adequately summarized [9]. The purpose of this systematic review, therefore, was to remedy this shortcoming by consolidating the findings of studies describing the test-retest and inter-rater reliability of MMT.

Table 1
Quality checklist applied to 9 articles included in systematic review*

Study	Study period	Inclusion/	MMT procedures	MMT grading	Blinding	Reliability % agreement	Total (9)
	and setting (2)	Exclusion (2)	described (1)	scale described (1)	addressed (1)	and $K_{w}$ (2)
Brandsma et al.	1	2	1	1	1	1	7
Florence et al.	1	2	0	1	1	1	6
Frese et al.	1	2	1	1	1	2	8
Hough et al.	2	2	0	1	1	2	8
Paternostro-Sluga et al.	1	2	1	1	0	1	6
Personius et al.	1	1	1	1	1	1	6
Pfister et al.	2	2	1	1	0	1	7
Savic et al.	1	1	1	1	0	2	6
Tan et al.	1	2	1	1	1	1	7

*Study period $=$ time over which study was conducted (e.g., January–April, 2017), Setting $=$ actual location of testing (e.g., “single academic county hospital”), inclusion and exclusion criteria $=$ one or both explicitly noted, MMT procedures described $=$ reference to specific procedures (e.g., Daniels and Worthingham), MMT grading scale $=$ number of levels with scores for each, Blinding addressed $=$ explicitly stated or clearly implied, Reliability $=$ description of % agreement and weighted kappa.

Figure 1.

PRISMA flowchart illustrating how the final sample of relevant articles was determined.

2. Method

Relevant research was identified by searches of the PubMed (since 1950), Scopus (since 1950), and CINAHL (since 1986) databases on April 19, 2018. The search string used for PubMed was: “manual muscle test*” AND (reliability or reproducibility). A hand search followed. Inclusion required that an article addressed the test-retest or inter-rater reliability of MMT and was written in English. Articles were excluded if they were reviews [9], addressed only composite scores rather than scores for individual muscles or actions [10], used MMT to simply identify the presence of weakness rather than gradations of strength [11], used inappropriate statistics to characterize reliability (e.g., Pearson or intraclass correlation coefficients) [12], or focused on tests scored in a unique manner (e.g., heel-raise test) [13]. For studies describing test-retest reliability, papers were excluded that addressed reliability determined during proximate sessions on the same day [14].

Articles retained on the basis of inclusion and exclusion criteria were abstracted by the author for information on participants (number, country of residence, medical condition), muscle test method, grading scale and range of grades assessed (based on Medical Research Council grades), muscles/actions tested, and findings relative to reliability (% agreement or weighted kappa ( $K_{w}$ )). For studies addressing test-retest reliability, the time between tests was delineated. For studies addressing inter-rater reliability the number and profession (e.g., therapists) of testers was recorded.

The quality of articles in this systematic review was scored using a 6 item, 9 point assessment (Table 1) similar to one described by Bohannon and Glenney [15]. Scores depended on the identification of the study period and setting, inclusion and exclusion criteria, MMT procedures, blinding, and reliability using % agreement and weighted kappa. Quality was not a criterion for inclusion.

3. Results

The database search identified 189 unique articles (Fig. 1). Three additional articles were identified by a hand searches. After application of inclusionary and exclusionary criteria, 9 relevant articles [16, 17, 18, 19, 20, 21, 22, 23, 24] remained to contribute data to this systematic review.

Table 2 summarizes information abstracted from 5 articles addressing test-retest reliability. The studies involved residents of 4 different countries with 5 different neuromuscular conditions. Test-retest intervals ranged from 1 day to 1 week. The MMT test methods were either not stated, self-described, Medical Research Council [25], Daniels and Worthingham [26] or MMT 8 [27] Grading scales consisted of 6 to 13 levels. Based on the Medical Research Council grading scale, studies indicating the range of grades assigned typically included muscles with grades throughout most of the 0 to 5 scale. The number of muscles/actions tested ranged from 3 to 18. The reliability of 31 different muscles/actions described using $K_{w}$ ranged from 0.35 to 0.98. Of the 59 $K_{w}$ reported, 28.8 % were between 0.61 and 0.80 (substantial) and 66.1% were between 0.81 and 1.00 (almost perfect) [28]. The $K_{w}$ for patients with myopathy (0.35 to 0.69) was lowest [21].

Table 2
Summary of 5 articles describing the test-retest reliability of manual muscle testing

Study	Participants and testing hiatus	Muscle test method andgrading (grade range)	Muscles/actions and findings
Brandsma et al.(1995)	28 Thai patients with leprosyneuropathyTested within 3 days	Self described6 level scale (notindicated)	Abduction of little finger: $K_{w}=$ 0.96Adduction of little finger: $K_{w}=$ 0.72Abduction of index finger: $K_{w}=$ 0.80Abduction of thumb: $K_{w}=$ 0.96Opposition of thumb: $K_{w}=$ 0.90Intrinsic plus of index finger: $K_{w}=$ 0.80Intrinsic plus of middle finger: $K_{w}=$ 0.85Intrinsic plus of ring finger: $K_{w}=$ 0.71Intrinsic plus of little finger: $K_{w}=$ 0.83
Florence et al.(1992)	102 American boys withDuchenne dystrophyTested 5 days apart	Not stated11 level ordinalscale (0–5)	Neck flexors: $K_{w}=$ 0.85Neck extensors: $K_{w}=$ 0.84Left and right shoulder abductors: $K_{w}=$ 0.87Left and right shoulder external rotators: $K_{w}=$ 0.89Left and right elbow flexors: $K_{w}=$ 0.82Left and right elbow extensors: $K_{w}=$ 0.86Left and right wrist flexors: $K_{w}=$ 0.65Left and right wrist extensors: $K_{w}=$ 0.69Left and right thumb abductors: $K_{w}=$ 0.71Left and right hip flexors: $K_{w}=$ 0.90Left and right hip extensors: $K_{w}=$ 0.88Left and right hip abductors: $K_{w}=$ 0.89Left and right knee flexors: $K_{w}=$ 0.79Left and right knee extensors: $K_{w}=$ 0.84Left and right ankle dorsiflexors: $K_{w}=$ 0.81Left and right ankle plantarflexors: $K_{w}=$ 0.71Left and right ankle everters: $K_{w}=$ 0.73Left and right ankle inverters: $K_{w}=$ 0.72
Paternostro-Slugaet al. (2008)	22 Austrian patients with paresisof radially innervated musclesTested 7 days (median) apart	Medical Research Council6 level scale (0–5)	Wrist extension: $K_{w}=$ 0.82Finger extension: $K_{w}=$ 0.86Grip: $K_{w}=$ 0.84
		9 level scale (0–5)	Wrist extension: $K_{w}=$ 0.81Finger extension: $K_{w}=$ 0.84Grip: $K_{w}=$ 0.88
Personius et al.(1994)	32 American patients withfacioscapular dystrophyTested 1 day apart	Daniels and Worthingham13 level ordinal scale(not indicated)	Right ankle dorsiflexors: $K_{w}=$ 0.90Left ankle dorsiflexors: $K_{w}=$ 0.92Right shoulder abductors: $K_{w}=$ 0.94Left shoulder abductors: $K_{w}=$ 0.89Right elbow extensors: $K_{w}=$ 0.92Left elbow extensors: $K_{w}=$ 0.93Right elbow flexors: $K_{w}=$ 0.98Left elbow flexors $K_{w}=$ 0.97Right shoulder external rotators: $K_{w}=$ 0.79Left shoulder external rotators: $K_{w}=$ 0.82Right shoulder horizontal abductors: $K_{w}=$ 0.92Left shoulder horizontal abductors: $K_{w}=$ 0.81Right shoulder horizontal adductors: $K_{w}=$ 0.87Left shoulder horizontal adductors: $K_{w}=$ 0.90Right knee extensors: $K_{w}=$ 0.92Left knee extensors: $K_{w}=$ 0.83Right knee flexors: $K_{w}=$ 0.85Left knee flexors: $K_{w}=$ 0.89
Pfister et al.(2018)	46 Swiss patients withmyopathyTested 1 week apart	MMT 811 level ordinal scale(1–5)	Shoulder abduction: $K_{w}=$ 0.64Elbow flexion: $K_{w}=$ 0.66Wrist extension: $K_{w}=$ 0.53Knee extension: $K_{w}=$ 0.49Ankle dorsiflexion: $K_{w}=$ 0.35Neck flexion: $K_{w}=$ 0.64Hip abduction: $K_{w}=$ 0.66Hip extension: $K_{w}=$ 0.69

Table 3

Summary of 8 articles describing the inter-rater reliability of manual muscle testing

Study	Participants and testsers	Muscle test method andgrading (grade range)	Muscles/actions and findings
Brandsma et al.(1995)	28 Thai patients with leprosy andneuropathy2 therapists	Self described6 level scale (not indicated)	Abduction of little finger: Pairwise $K_{w}=$ 0.79Adduction of little finger: Pairwise $K_{w}=$ 0.72Abduction of index finger: Pairwise $K_{w}=$ 0.74Abduction of thumb: Pairwise $K_{w}=$ 0.80Opposition of thumb: Pairwise $K_{w}=$ 0.81Intrinsic plus of index finger: Pairwise $K_{w}=$ 0.78Intrinsic plus of middle finger: Pairwise $K_{w}=$ 0.77Intrinsic plus of ring finger: Pairwise $K_{w}=$ 0.75Intrinsic plus of little finger: Pairwise $K_{w}=$ 0.93
Frese et al.(1987)	110 American patients referredfor therapy for musculoskeletal orneurological disorders11 therapists	Kendall and McCreary orDaniels and Worthingham13 level scale (2–5)	Right middle trapezius: Overall agreement $=$ 28%, $K_{w}=$ 0.58; Pairwise $K_{w}=$ 0.06–0.62†Left middle trapezius: Overall agreement $=$ 29%, $K_{w}=$ 0.29; Pairwise $K_{w}=$ 0.04–0.63†Right gluteus medius: Overall agreement $=$ 47%, $K_{w}=$ 0.25; Pairwise $K_{w}=$ 0.08–0.66†Left gluteus medius: Overall agreement $=$ 45%, $K_{w}=$ 0.11; Pairwise $K_{w}=$ 0.11–0.58†
		6 level compressed scale (2–5)	Right middle trapezius: Overall $K_{w}=$ 0.26Left middle trapezius: Overall $K_{w}=$ 0.26Right gluteus medius: Overall $K_{w}=$ 0.30Left gluteus medius: Overall $K_{w}=$ 0.42
Hough et al.(2011)	30 American patients withcritical illness2 physicians	Not stated6 level scale (not indicated,but median 4–5)	Right shoulder abduction: Pairwise agreement $=$ 57%, $K_{w}=$ 0.51Left shoulder abduction: Pairwise agreement $=$ 47%, $K_{w}=$ 0.36Right elbow flexion: Pairwise agreement $=$ 57%, $K_{w}=$ 0.35Left elbow flexion: Pairwise agreement $=$ 60%, $K_{w}=$ 0.23Right wrist extension: Pairwise agreement $=$ 80%, $K_{w}=$ 0.56Left wrist extension: Pairwise agreement $=$ 73%, $K_{w}=$ 0.44Right hip flexion: Pairwise agreement $=$ 53%, $K_{w}=$ 0.47Left hip flexion: Pairwise agreement $=$ 40%, $K_{w}=$ 0.32Right knee extension: Pairwise agreement $=$ 60%, $K_{w}=$ 0.29Left knee extension: Pairwise agreement $=$ 60%, $K_{w}=$ 0.29Right ankle dorsiflexion: Pairwise agreement $=$ 80%, $K_{w}=$ 0.64Left ankle dorsiflexion: Pairwise agreement $=$ 40%, $K_{w}=$ 0.32
Paternostro-Slugaet al. (2008)	31 Austrian patients with paresisof radially innervated muscles5 specialists in physical medicine and rehabilitation	Medical Research Council6 level scale (0–5)	Wrist extension: Mean $K_{w}=$ 0.78†; Pairwise $K_{w}=$ 0.67–0.90Finger extension: Mean $K_{w}=$ 0.77†; Pairwise $K_{w}=$ 0.64–0.93Grip: Mean $K_{w}=$ 0.78†; Pairwise $K_{w}=$ 0.64–0.88
		9 level scale (0–5)	Wrist extension: Mean $K_{w}=$ 0.78†; Pairwise agreement $=$ 51.6%, $K_{w}=$ 0.69, 0.70, 0.75, 0.77, 0.78, 0.79, 0.79, 0.81, 0.83, 0.89Finger extension: Mean $K_{w}=$ 0.81†; Pairwise $K_{w}=$ 0.72–0.92Grip: Mean $K_{w}=$ 0.81†; Pairwise $K_{w}=$ 0.74–0.86

Table 3, continued
Study	Participants and testsers	Muscle test method andgrading (grade range)	Muscles/actions and findings
Personiuset al. (1994)	6 American patients withfacioscapular dystrophy2 therapists	Daniels and Worthingham13 level scale (not indicated)	Right ankle dorsiflexors: Pairwise $K_{w}=$ 0.84Left ankle dorsiflexors: Pairwise $K_{w}=$ 0.57Right shoulder abductors: Pairwise $K_{w}=$ 0.72Left shoulder abductors: Pairwise $K_{w}=$ 0.72Right elbow extensors: Pairwise $K_{w}$ = 1.00Left elbow extensors: Pairwise $K_{w}$ = 1.00Right elbow flexors: Pairwise $K_{w}$ = 1.00Left elbow flexors Pairwise $K_{w}=$ 0.83Right shoulder external rotators: Pairwise $K_{w}=$ 0.67Left shoulder external rotators: Pairwise $K_{w}=$ 0.50Right shoulder horizontal abductors Pairwise: $K_{w}=$ 0.81Left shoulder horizontal abductors: Pairwise $K_{w}=$ 0.70Right shoulder horizontal adductors: Pairwise $K_{w}=$ 0.61Left shoulder horizontal adductors: Pairwise $K_{w}=$ 0.73Right knee extensors: Pairwise $K_{w}$ = 1.00Left knee extensors: Pairwise $K_{w}$ = 1.00Right knee flexors: Pairwise $K_{w}=$ 0.62Left knee extensors: Pairwise $K_{w}=$ 0.90
Pfisteret al. (2018)	46 Swiss patients with myopathy2 therapists	MMT 811 level scale (1–5)	Shoulder abduction: Pairwise $K_{w}=$ 0.33Elbow flexion: Pairwise $K_{w}=$ 0.30Wrist extension: Pairwise $K_{w}=$ 0.24Knee extension: Pairwise $K_{w}=$ 0.08Ankle dorsiflexion: Pairwise $K_{w}=$ 0.20Neck flexion: Pairwise $K_{w}=$ 0.54Hip abduction: Pairwise $K_{w}=$ 0.44Hip extension: $K_{w}=$ 0.58
Savicet al. (2007)	22 English patients with spinalcord injury1 clinical scientist and 1 therapist	ASIA6 level scale (0–5)	Right biceps brachii and brachialis: Pairwise agreement $=$ 91%, $K_{w}=$ 0.69Left biceps brachii and brachialis: Pairwise agreement $=$ 91%, $K_{w}=$ 0.65Right extensor carpi radialis: Pairwise agreement $=$ 86%, $K_{w}=$ 0.93Left extensor carpi radialis: Pairwise agreement $=$ 91%, $K_{w}=$ 0.97Right triceps brachii: Pairwise agreement $=$ 86%, $K_{w}=$ 0.97Let triceps brachii: Pairwise agreement $=$ 86%, $K_{w}=$ 0.97Right flexor digititorm profundus: Pairwise agreement $=$ 82%, $K_{w}=$ 0.97Left flexor digitorum profundus: Pairwise agreement $=$ 73%, $K_{w}=$ 0.95Right abductor digiti minimi: Pairwise agreement $=$ 71%, $K_{w}=$ 0.97Left abductor digiti minimi: Pairwise agreement $=$ 77%, $K_{w}=$ 0.96Right iliopsoas: Pairwise agreement $=$ 73%, $K_{w}=$ 0.96Left iliopsoas: Pairwise agreement $=$ 91%, $K_{w}=$ 0.99Right quadriceps femoris: Pairwise agreement $=$ 95%, $K_{w}=$ 0.99Left quadriceps femoris: Pairwise agreement $=$ 91%, $K_{w}=$ 0.99Right tibialis anterior: Pairwise agreement $=$ 77%, $K_{w}=$ 0.97Left tibialis anterior: Pairwise agreement $=$ 82%, $K_{w}=$ 0.95Right extensor hallucis longus: Pairwise agreement $=$ 77%, $K_{w}=$ 0.95Left extensor hallucis longus: Pairwise agreement $=$ 77%, $K_{w}=$ 0.96Right gastrocnemius and soleus: Pairwise agreement $=$ 77%, $K_{w}=$ 0.94Left gastrocnemius and soleus: Pairwise agreement $=$ 73%, $K_{w}=$ 0.95

Table 3, continued
Study	Participants and testsers	Muscle test method andgrading (grade range)	Muscles/actions and findings
Tan et al.(2017)	13 Australian patients with spina bifida1 of 2 experienced and 1 of 5 novice therapists	Daniels and Worthingham 6 level scale (0–5)	Quadratus lumborum: Pairwise agreement $=$ 75.0%Iliopsoas: Pairwise agreement agreement $=$ 69.2%Sartorius: Pairwise agreement agreement $=$ 65.4%Hip adductors: Pairwise agreement agreement $=$ 57.7%Gluteus medius: Pairwise agreement agreement $=$ 57.7%Gluteus maximus: Pairwise agreement agreement $=$ 65.4%Quadriceps: Pairwise agreement agreement $=$ 76.9%Medial hamstrings: Pairwise agreement agreement $=$ 61.5%Lateral hamstrings: Pairwise agreement agreement $=$ 50.0%Tibialis anterior: Pairwise agreement agreement $=$ 63.6%Tibialis posterior: Pairwise agreement agreement $=$ 69.2%Peroneus longus/brevis: Pairwise agreement agreement $=$ 73.1%Peroneus tertius: Pairwise agreement agreement $=$ 80.8%Extensor halluces longus: Pairwise agreement $=$ 81.8%Toe extensors: Pairwise agreement $=$ 80.8%Flexor halluces longus: Pairwise agreement $=$ 95.5%Toe flexors: Pairwise agreement $=$ 88.5%

*Of every therapist with every other therapist, †Of each therapist with each other therapist.

Table 3 summarizes information gleaned from 8 articles addressing inter-rater reliability. The studies involved residents of 6 different countries with 8 different conditions. Most studies used 2 raters, but 5, 7, and 11 raters were each used in 1 study. The MMT test methods were either not stated, self-described, Medical Research Council [25], Daniels and Worthingham [26], MMT 8 [27], ASIA [29], or Kendall [30]. The MMT grading scales consisted of 6 to 13 levels. Based on the Medical Research Council grading scale, studies indicating the range of grades assigned typically included muscles with grades throughout most of the 0 to 5 scale. The number of muscles/actions tested ranged from 3 to 20. The reliability of 34 different muscles/actions was described using percentage agreement or $K_{w}$ . In some cases the description related to overall (or mean) percentage agreement or $K_{w}$ . In other cases the description related to pairwise percentage agreement or $K_{w}$ . Pairwise agreement ranged from 40% to 95.5%. Pairwise $K_{w}$ ranged from 0.04–1.00. Of 75 specific pairwise $K_{w}$ reported, 30.7 % were between 0.61 and 0.80 (substantial) and 41.3% were between 0.81 and 1.00 (almost perfect) [28]. The $K_{w}$ for patients with myopathy (0.08 to 0.54) was lowest [21].

Scores on the quality checklist ranged from 6/10 to 8/10. The factors most often contributing to reduced scores were a failure to identify the period of time over which the study was conducted or the failure to describe both percentage agreement and $K_{w}$ .

4. Discussion

Considerable research has been published that focuses on the test-retest or inter-rater reliability of MMT. The majority of such research was excluded from this review because MMT was only used to dichotomously characterize individuals (e.g., weak versus not weak), the reliability of MMT was described using correlations (e.g., intraclass correlation coefficients) not suited for characterizing agreement between ordinal scores, or reliability of MMT was focused on composite scores of multiple muscle actions rather than individual muscle actions.

The results of studies summarized in this review were variable in how they were reported. For the sake of cogency, results discussed hereafter are focused on simple pairwise comparisons (e.g., test-retest by a single tester or single tests by one pair of testers). The $K_{w}$ for the majority (94.9%) of pairwise test-retest comparisons was either substantial or nearly perfect. The $K_{w}$ for the inter-rater comparisons were lower, but the majority (72.0%) was still either substantial or nearly perfect. Together these findings show that assessments of muscle strength obtained by MMT can be reliable, but that acceptable reliability cannot be assumed. The reliability of testers, therefore, should be assessed before their assessments are used to make clinical judgements regarding status or change. The findings of this review also suggest that, when possible, the same tester should be responsible for obtaining repeated MMT measures over time.

Factors limiting the reliability of MMT are well established. Chief among such factors are the subjectivity of force application by testers [31]. While this factor is not a problem when weakness is so great that the application of manual force is not necessary, it is a potential issue with higher MMT grades. Another major factor is tester strength. Weaker testers are able to apply less force. This is particularly problematic when testing muscle actions such as knee extension which can produce particularly high forces [32]. It is interesting to note that the two studies reporting the lowest inter-rater reliability had a large proportion of participants with maximum or near maximum strength scores. Over 65 percent of the patients whose left and right gluteus medius were tested by Frese et al received a Medical Research Council score of 5/5 [18]; the median Medical Research Council score assigned to all muscles in the study of Hough et al was 4/5 or more [19].

This study had several limitations. First, only one individual was involved in selecting and abstracting articles. Thus, while any reader is free to conduct the same searches and examination of the literature described herein, there is no internal confirmation of findings. Second, the consolidation of findings was limited by the very small number of studies included, different populations tested, array of MMT procedures and grading scales used, different muscles tested, and inconsistency in how reliability was reported. Further research addressing these variables is clearly needed.

5. Conclusion

Research reviewed herein indicates that it is possible to obtain reliable assesments of strength with MMT in the cohorts quoted. Test-retest reliability tends to be greater than inter-rater reliability. Nevertheless, the reliability of measures obtained in specific clinical and research settings cannot be assumed; rather it should be confirmed before conducted repeatedly over time or by different testers.

Footnotes

Conflict of interest

None to report.

References

Lovett

Martin

. Certain aspects of infantile paralysis with a description of a method of muscle testing. JAMA 1916; LXVI(10): 729-733.

Andres

Skerry

Thornell

Portney

Finison

Munsat

. A comparison of three measures of disease progression in ALS. J Neurol Sci 1996; 139(Suppl): 64-70.

Bohannon

. Measuring knee extensor muscle strength. Am J Phys Med Rehabil 2001; 80(1): 13-18.

Bohannon

. How informative are manual muscle test scores obtained from home-care patients? Isokinet Exerc Sci 2009; 17(1): 15-17.

Eriksrud

Bohannon

. Relationship of knee extension force to sit-to-stand performance in patients receiving acute rehabilitation. Phys Ther 2003; 83(6): 544-551.

Beasley

. Influence of method on estimates of normal knee extensor force among normal and post polio children. Phys Ther Rev 1956; 36(1): 21-41.

Bohannon

. Manual muscle testing: Does it meet the standards of an adequate screening test? Clin Rehabil 2005; 19(6): 662-667.

Dvir

. Grade 4 in manual muscle testing: The problem with submaximal strength assessment. Clin Rehabil 1997; 11(1): 36-41.

Cuthbert

Goodheart

. On the reliability and validity of manual muscle testing: A literature review. Chiropract Osteopathy 2007; 15: 4.

10.

Parry

Berney

Granger

Dunlop

Murphy

El-Ansary

, et al. A new two-tier strength assessment approach to the diagnosis of weakness in intensive care: An observational study. Critical Care 2016; 19: 52.

11.

Jepsen

. Can testing of six individual muscles represent a screening approach to upper limb neuropthic conditins. BMC Neurology 2014; 14: 90.

12.

Klingels

DeCock

Molenaers

Desloovere

Huenaerts

Jaspers

, et al. Upper limb motor and sensory impairments in children with hemiplegic cerebral palsy. Can they be measured reliably? Disabil Rehabil 2010; 32(5): 409-416.

13.

Harris-Love

Shrader

Davenport

Joe

Rakocevic

McElroy

, et al. Are repeated single-limb heel raises and manual muscle testing associated with peak plantar-flexor force in people with inclusion body myositis? Phys Ther 2014; 94(4): 543-552.

14.

Connolly

Malkus

Mendell

Flanagan

Miller

Schierbecker

, et al. Outcome reliability in nonambulatory boys/men with Duchenne muscular dystrophy. Muscle Nerve 2015; 51(4): 522-532.

15.

Bohannon

Glenney

. Minimal clinically important difference for change in comfortable gait speed of adults with pathology: A systematic review. J Eval Clin Pract 2014; 20(4): 295-300.

16.

Brandsma

Schreuders

TQR

Birke

Piefer

Oostendorp

. Manual muscle strength testing: Intraobserver and interobserver reliabilities for the intrinsic muscles of the hand. J Hand Ther 1995; 8(3): 185-190.

17.

Florence

Pandya

King

Robison

Baty

Miller

, et al. Intrarater reliability and manual muscle test (Medical Research Council Scale) grades in Duchenne’s muscular dystrophy. Phys Ther 1992; 72(2): 115-122.

18.

Frese

Brwon

Norton

. Clinical reliability of manual muscle testing. Middle trapezius and gluteus medius muscles. Phys Ther 1987; 67(7): 1072-1076.

19.

Hough

Lieu

Caldwell

. Manual muscle strength testing of critically ill patients: Feasibility and interobserver agreement. Critical Care 2011; 15R43.

20.

Paternostro-Sluga

Grim-Stieger

Posch

Schuhfried

Vacariu

Mittermaier

, et al. Reliability and validity of the Medical Research Council (MRC) scale and a modified scale for testing muscle strength in patients with radial palsy. J Rehabil Med 2008; 40(8): 665-671.

21.

Personius

Pandya

King

Tawll

McDermott

. Fascioscapulohumeral dystrophy natural history study: Standardization of testing procedures and reliability of assessments. Phys Ther 1994; 74(3): 253-263.

22.

Pfister

deBruin

Dterkele

Maurer

deBrie

Knols

. Manual muscle testing and hand-held dynamometry in people with inflammatory myopathy: An intra-and interrater reliability and validity study. PLOS One 2018; 13: 3.

23.

Savic

Bergström

EMK

Frankel

Jamous

Jones

. Inter-rater reliability of motor and sensory examinations performed according to American Spinal Injury Association standards. Spinal Cord 2007; 45(6): 444-451.

24.

Tan

Thomas

Johnston

. Reproducibility of muscle strength testing for children with spina bifida. Phys Occup Ther Pediatr 2017; 37(4): 362-373.

25.

O’Brien

. Aids to the Examination of the Peripheral Nervous System. Edinburgh: Saunders; 2010.

26.

Avers

Brown

. Daniels and Worthingham’s Muscle Testing: Techniques of Manual Examination and Performance Testing. 10

{}^{\text{th}}

edition. St Louis: Elsevier; 2018.

27.

Manual Muscle Testing Procedures for MMT8 Testing(June 18, 2007) https//www.google.com/search?q=MMT+8+procedures&rlz=1C1GGRV_enUS751US751&oq=MMT+8+procedures&aqs=chrome..69i57.6535j1j8&sourceid=chrome&ie=UTF-8 Accessed July 23, 2018.

28.

Landis

Koch

. The measurement of observer agreement for categorical data. Biometrics 1977; 33(1): 159-174.

29.

International Standards for the Classification of Spinal Cord Injury. Motor Exam Guide (June 2008) http://asia-spinalinjuryorg/wp-content/uploads/2016/02/Motor_Exam_Guide.pdf Accessed July 23, 2018.

30.

Kendall

McCreary

Provance

Rodgers

Romani

. Muscles: Testing and Function, with Posture and Pain. 5th edition. Philadelphia: Lippincott Williams and Wilkins; 2005.

31.

Knepler

Bohannon

. Subjectivity of forces associated with manual-muscle test grades of 3+, 4-, and 4. Percept Mot Skills 1998; 87(3): 1123-1128.

32.

Mulroy

Lassen

Chambers

Perry

. The ability of male and female clinicians to effectively test knee extension strength using manual muscle testing. J Orthop Sports Phys Ther 1997; 26(4): 192-197.

Reliability of manual muscle testing: A systematic review

Abstract

BACKGROUND:

OBJECTIVE:

METHODS:

RESULTS:

CONCLUSIONS:

Keywords

1. Introduction

Table 1 Quality checklist applied to 9 articles included in systematic review*

3. Results

Table 2 Summary of 5 articles describing the test-retest reliability of manual muscle testing

5. Conclusion

Footnotes

Conflict of interest

References

Table 1
Quality checklist applied to 9 articles included in systematic review*

Table 2
Summary of 5 articles describing the test-retest reliability of manual muscle testing