Abstract
This study investigated the validity and reliability of fixed strain gauge measurements of isometric quadriceps force in patients with chronic obstructive pulmonary disease (COPD). A total cohort of 138 patients with COPD was assessed. To determine validity, maximal volitional quadriceps force was evaluated during an isometric maximal voluntary contraction (MVC) manoeuvre via a fixed strain gauge dynamometer and compared to (a) potentiated non-volitional quadriceps force obtained via magnetic stimulation of the femoral nerve (twitch (Tw); n = 92) and (b) volitional computerized dynamometry (Biodex; n = 46), and analysed via correlation coefficients. Test–retest and absolute reliability were determined via calculations of intra-class correlation coefficients (ICCs), smallest real differences (SRDs) and standard errors of measurement (SEMs). For this, MVC recordings on each device were performed across two test sessions separated by a period of 7 days (n = 46). Strain gauge measures of MVC demonstrated very large correlations with Tw and Biodex results (r = 0.86 and 0.88, respectively, both p < 0.0001). ICC, SEM and SRD were numerically comparable between strain gauge and Biodex devices (ICC = 0.96 vs. 0.93; SEM = 8.50 vs. 10.54 N·m and SRD = 23.59 vs. 29.22 N·m, respectively). The results support the conclusion that strain gauge measures of quadriceps force are valid and reliable in patients with COPD.
Introduction
Chronic obstructive pulmonary disease (COPD) is associated with systemic manifestations and comorbidities that impact functional capacity, health-related quality of life and prognosis. 1 Peripheral muscle weakness, particularly of the large quadriceps muscles, is highly prevalent in patients with COPD 2,3 and is an important target of comprehensive disease management due to its vital role in activities of daily living, its contribution to exercise intolerance, 4 known dysfunction compared to healthy controls 5–7 and remediable nature. 8 Addressing this dysfunction is a key aim of the exercise training component of pulmonary rehabilitation. 3,9 Interestingly, little is known about the psychometric properties of strength measures in an elderly population in general and in patients with COPD in particular. Since the latter have altered structural and metabolic properties of their skeletal muscles, 6 it seems important to specifically validate techniques to assess muscle strength in this population.
Measurement of peripheral muscle force is typically simple and feasible for most patients with COPD. Common manoeuvres used to measure volitional muscle force include isometric, isotonic or isodynamic maximum voluntary contractions (MVCs) 10 or dynamic one-repetition maximum contraction (1RM). 11 Common equipment used for this purpose includes handheld dynamometry, seated strain gauge 12 or computerized dynamometry. 13 The choice of technique usually depends upon the desired level of accuracy and clinical indication(s). All tests suffer from potential error related to central fatigue, poor motivation or variability induced by the assessor. Non-volitional assessment of muscle force is performed via electrical or magnetic stimulation of a peripheral motor nerve to derive a measure of muscle twitch (Tw) force. While excellent correlations have been demonstrated between Tw with MVC force in healthy controls 14 and patients with COPD, 15 such measures are not routinely performed in clinical practice due to the high equipment costs and necessity for examiner skill. They remain, however, a reference method in research settings to answer specific physiologic questions.
In patients with COPD, quadriceps MVC manoeuvres are frequently performed via isometric contraction. 16 Isometric MVCs consist of maximal contractions conducted against a resistance at a fixed joint angle. 7 They are easily implemented into clinical practice and provide reliable and reproducible measures of muscle force. 6 Measurement of isometric quadriceps force is often performed via commercially available computerized dynamometers; however, despite their reputation as a ‘reference method’ 18 for volitional muscle force testing, their use in clinical practice is impeded by high equipment costs and large space requirements. The fixed strain gauge offers simple and fast user applicability at considerably lower cost than computerized dynamometry and was recommended as a ‘low implementation cost’ technique to measure isometric force in the recent American Thoracic Society (ATS)/European Respiratory Society (ERS) statement on limb muscle dysfunction in COPD. 6 A review by Robles and colleagues 16 highlighted its increasing use in COPD research but cited a lack of COPD-specific reliability data as an important area for future research. This knowledge gap underpins the relevance of the present research.
The primary aim of this study was therefore to determine the validity (how well an instrument measures what it purports to measure), 19 test–retest reliability (the magnitude of the error in observed measurements relative to the inherent variability between subjects) 20 and agreement (how close two measurements from the same subject are) 20 of fixed strain gauge measures of quadriceps muscle force in patients with COPD. The secondary aims were to determine the presence of (1) test fatigue (defined by a decreased repeated force measurement during a single visit), (2) a learning effect (defined by an increased muscle force measurement during the second visit compared to the first visit, with 7 days in between), and (3) any true absolute difference between quadriceps force measurements obtained from the strain gauge and Biodex devices across both visits.
Methods
Test procedures
Data from a convenience sample of 138 individuals participating in previous 21–23 or current (NCT02113748) clinical trials at UZ Gasthuisberg, Leuven (Belgium) were included in this combined retrospective/prospective study. All studies were approved by the ethics committee of University Hospital Leuven, and written informed consent was obtained from all patients in accordance with the Declaration of Helsinki. Inclusion criteria comprised diagnosis of COPD according to Global Initiative for Chronic Obstructive Lung Disease (GOLD) recommendations, 1 age ≥40 years and smoking history ≥10 pack-years. Patients were ineligible for inclusion if they had a primary respiratory disease other than COPD (e.g. asthma) documented in their medical record, impairment of normal biomechanical movement (e.g. significant coexisting orthopaedic, neurological or other condition) or significant cognitive impairment, as judged by study investigators.
In one cohort of 92 patients, peak volitional contractile quadriceps force was assessed during an isometric MVC manoeuvre via a fixed strain gauge dynamometer with signal analogue force transducer (546QD; CDS Milan, Italy) and amplifier (Biopac MP150; Biopac Systems, Goleta, California, USA). Peak volitional force was compared with non-volitional Tw force obtained via magnetic stimulation of the femoral nerve at 100% power output of a Magstim stimulator (Magstim Co Ltd, Whitland, UK) 3 seconds post-MVC (in the passive, relaxed state). Maximality of the non-volitional contraction was ensured by increasing the power output of the magnetic stimulator and ensuring that the Tw force did not further increase between 90% and 100% of the power output (supramaximal stimulation). These measurements were performed during a single visit, with patients seated in a semi-reclined chair that provided 90° knee flexion and 120° hip flexion to optimize the stimulation of the femoral nerve, in accordance with previously published data. 21 Isometric quadriceps MVCs were sustained for 3 seconds and repeated a total of five times, with 30-second rest intervals between contractions. These data were retrospectively collected from patients’ records in the aforementioned studies.
An independent, second cohort of 46 patients was prospectively assigned to undergo repeated assessments conducted over two visits, separated by 1 week. In this group, measures of peak isometric quadriceps force and torque were obtained from both the fixed strain gauge and a computerized dynamometer (Biodex system 4 pro – Enraf Nonius; Delft, the Netherlands) with a minimum of 30 minutes rest between test procedures. Device sequence (strain gauge/Biodex or Biodex/strain gauge) was determined via random allocation and kept constant across visits (Figure 1).

Figure 1. Overview of data collection design for validity and reliability study (n = 46).
While measures of peak isometric quadriceps force were obtained from both methods, slight differences existed between the test procedures. In accordance with conventional procedures, MVCs for the Biodex were performed over four manoeuvres of 6 seconds duration with 20-second rest intervals and a knee position of 60° flexion. Quadriceps force was expressed both in absolute terms and as a percentage of predicted normal values. 24 Strain gauge measures were obtained over five MVC manoeuvres of 5 seconds duration with 30-second rest intervals. As this cohort did not need to perform non-volitional (Magstim) procedures, both the hip and knee joints were positioned in 90° flexion (conventional test position for strain gauge). In order to compare data between the strain gauge (expressed as force, in Newtons [N]) and Biodex (expressed as torque, in Newton metres [N·m]) in this cohort, leg length was measured from the middle of the fibula head (axis of rotation) to the top of the malleolus (fixed point where the force was applied), and strain gauge torque was calculated using the formula: torque [N·m] = force [N] × leg length [m].
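The force-to-torque conversion described above can be sketched in a few lines of Python. This is only an illustration: the function name and the example values (350 N, 0.38 m) are hypothetical and do not come from the study data.

```python
def strain_gauge_torque(force_newtons: float, leg_length_m: float) -> float:
    """Convert a strain gauge force reading (N) into torque (N*m).

    leg_length_m is the lever arm: the distance from the fibula head
    (axis of rotation) to the point where the force is applied.
    """
    if force_newtons < 0 or leg_length_m <= 0:
        raise ValueError("force must be >= 0 and leg length must be > 0")
    return force_newtons * leg_length_m


# Hypothetical patient: 350 N peak force, 0.38 m lever arm.
torque = strain_gauge_torque(350.0, 0.38)
print(f"{torque:.1f} N*m")
```

Because the lever arm differs between patients, the same force in Newtons can correspond to different torques, which is why the conversion is needed before comparing with Biodex output.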
All data measurements were recorded after one practice trial on each device, and all patients received maximal encouragement by the investigator during MVC manoeuvres, including provision of visual feedback on a computer screen. Final test results were not disclosed to patients until completion of the last test procedure. All test procedures were conducted by the same assessor for each patient, and the assessment was standardized to the right leg. The highest (peak) value of three reproducible manoeuvres from five attempts (allowing no more than 5% variance) was used for analysis.
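The selection rule for the analysed value can be illustrated as follows. This is a sketch under an explicit assumption: 'no more than 5% variance' is interpreted here as the spread (highest minus lowest) of three attempts being at most 5% of the highest of the three. The source does not define the criterion more precisely, and the function name and example values are hypothetical.

```python
from itertools import combinations


def peak_of_reproducible(attempts, n_required=3, tolerance=0.05):
    """Return the highest value among any n_required attempts that agree
    within `tolerance` (relative to the highest of them), or None if no
    such set of attempts exists.

    Assumption: reproducibility = (max - min) / max <= tolerance.
    """
    best = None
    for combo in combinations(attempts, n_required):
        highest, lowest = max(combo), min(combo)
        if highest <= 0:
            continue  # skip non-physical readings
        if (highest - lowest) / highest <= tolerance:
            if best is None or highest > best:
                best = highest
    return best


# Hypothetical five attempts (N); 300, 305 and 310 agree within 5%.
print(peak_of_reproducible([300, 310, 305, 250, 260]))
```

Returning None when no three attempts agree makes it explicit that such a session would need further attempts or exclusion, rather than silently reporting an unreproducible peak.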
All participants underwent detailed lung function and functional exercise capacity (6-minute walk test) assessments according to ERS standards 25,26 for purposes of characterization.
Analysis
Statistical analyses were performed with SAS 9.4 (SAS Institute Inc., Cary, North Carolina, USA). Data are presented as mean ± SD. Statistical significance was denoted by p < 0.05 for all statistical tests.
Validity was investigated via two methods: inspection of the relationship between peak volitional strain gauge quadriceps force (Newton [N]) and Tw (N), in the cohort of 92 patients, and Biodex measures (N·m), in the cohort of 46 patients. Pearson correlation coefficients were calculated, with r values in the range of 0.0–0.1 considered trivial, 0.1–0.3 small, 0.3–0.5 moderate, 0.5–0.7 large, 0.7–0.9 very large and 0.9–1.0 extremely large. 27
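As an illustration of this step, the sketch below computes a Pearson coefficient and maps it to the magnitude bands listed above. The analyses in the study were run in SAS; this Python version is not the authors' code. Because the published bands share their boundary values, the sketch assumes that a boundary value (e.g. exactly 0.7) belongs to the higher band.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


def magnitude_label(r):
    """Map |r| to the qualitative bands used in the text.

    Assumption: a value lying exactly on a boundary is assigned
    to the higher band.
    """
    bands = [(0.1, "trivial"), (0.3, "small"), (0.5, "moderate"),
             (0.7, "large"), (0.9, "very large")]
    for upper, label in bands:
        if abs(r) < upper:
            return label
    return "extremely large"


print(magnitude_label(0.86))  # very large
print(magnitude_label(0.88))  # very large
```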
Strain gauge and Biodex test–retest reliability (n = 46) across the two clinical visits were determined via calculation of intra-class correlation coefficients (ICCs) using the formula $\mathrm{ICC} = S_B^2 / (S_B^2 + S_W^2)$, where $S_B^2$ is the between-subject variance and $S_W^2$ is the within-subject variance. ICC values were interpreted as <0.4 poor, 0.4–0.75 fair to good and >0.75 excellent. 28 Absolute reliability was evaluated by the standard error of measurement (SEM), calculated as $\mathrm{SEM} = S_x \times \sqrt{1 - \mathrm{ICC}}$, where $S_x$ is the standard deviation of the baseline measurement. The smallest real difference (SRD), indicating a 95% confidence interval around the SEM measurement, was calculated from the formula $\mathrm{SRD} = 1.96 \times \sqrt{2} \times \mathrm{SEM}$. 29 The percentage was calculated as $\mathrm{SRD\%} = (\mathrm{SRD} / \text{mean}) \times 100$.
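The reliability formulas above can be sketched in Python as follows. The text does not state how the variance components were estimated, so the sketch assumes a one-way random-effects ANOVA with two measurements per patient; the function names and the example data are hypothetical.

```python
import math
from statistics import mean


def icc_oneway(visit1, visit2):
    """ICC = S_B^2 / (S_B^2 + S_W^2) for two visits per subject.

    Assumption: variance components come from a one-way random-effects
    ANOVA (the estimator is not specified in the source).
    """
    n = len(visit1)
    if n != len(visit2) or n < 2:
        raise ValueError("need paired measurements for >= 2 subjects")
    k = 2  # measurements per subject
    grand_mean = mean(list(visit1) + list(visit2))
    subject_means = [(a + b) / 2 for a, b in zip(visit1, visit2)]
    ms_between = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)
    ms_within = sum((a - m) ** 2 + (b - m) ** 2
                    for a, b, m in zip(visit1, visit2, subject_means)) / (n * (k - 1))
    var_within = ms_within
    var_between = max((ms_between - ms_within) / k, 0.0)
    total = var_between + var_within
    if total == 0:
        raise ValueError("ICC is undefined when all values are identical")
    return var_between / total


def sem(sd_baseline, icc):
    """Standard error of measurement: S_x * sqrt(1 - ICC)."""
    return sd_baseline * math.sqrt(1 - icc)


def srd(sem_value):
    """Smallest real difference: 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2) * sem_value


def srd_percent(srd_value, mean_value):
    """SRD expressed as a percentage of the mean."""
    return srd_value / mean_value * 100


# Hypothetical torques (N*m) for four patients at two visits.
v1 = [100, 120, 140, 160]
v2 = [104, 118, 143, 158]
print(round(icc_oneway(v1, v2), 3))
```

As a plausibility check, applying `srd` to the SEM values reported in the abstract (8.50 and 10.54 N·m) gives roughly 23.56 and 29.22 N·m, close to the reported 23.59 and 29.22 N·m; the small gap for the strain gauge presumably reflects rounding of the SEM.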
Test–retest agreement of volitional torque measures for both devices during the two clinic visits (n = 46) was also investigated via Bland–Altman plots (mean difference vs. average of the two visits) for each device (separately) using GraphPad Prism 5.0 and mean difference and limits of agreement reported. Repeatability was reported via the coefficient of repeatability and its precision, as described by Bland et al. 30
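A minimal sketch of the Bland–Altman quantities follows. The study produced its plots in GraphPad Prism; this Python version is illustrative only. It assumes that differences are taken as visit 2 minus visit 1 (the direction is not stated in the text) and that the coefficient of repeatability equals 1.96 times the standard deviation of the differences.

```python
from statistics import mean, stdev


def bland_altman(visit1, visit2):
    """Return bias, 95% limits of agreement and coefficient of repeatability.

    Assumptions: difference = visit2 - visit1, and the coefficient of
    repeatability is taken as 1.96 * SD of the paired differences.
    """
    if len(visit1) != len(visit2) or len(visit1) < 2:
        raise ValueError("need paired measurements for >= 2 subjects")
    diffs = [b - a for a, b in zip(visit1, visit2)]
    bias = mean(diffs)
    repeatability = 1.96 * stdev(diffs)
    return {
        "bias": bias,
        "lower_loa": bias - repeatability,
        "upper_loa": bias + repeatability,
        "repeatability": repeatability,
    }


# Hypothetical torques (N*m) for four patients at two visits.
result = bland_altman([100, 120, 140, 160], [104, 118, 143, 158])
print(result["bias"], round(result["repeatability"], 2))
```

Under this definition the limits of agreement are simply the bias plus or minus the coefficient of repeatability, so a plot's dispersion can be read directly from the repeatability value.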
The secondary aims (n = 46) were addressed via linear mixed models with quadriceps torque as the dependent variable. Class variables were order of measurement (first or second assessment, indicative of ‘fatigue’), visit (first or second, indicative of ‘learning’), device (strain gauge or Biodex, indicative of absolute difference between both devices) and patient identification. An interaction factor (device × visit) was included to investigate any differences in learning effects attributable to device.
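One way to write the model described above is sketched below. The notation is introduced here for illustration and is not taken from the source. The sketch assumes that patient identification entered as a random intercept, whereas the text lists it only as a class variable without stating whether it was treated as fixed or random.

```latex
% T_{ijkl}: quadriceps torque for patient l, measured at order i,
%           visit j, on device k (all symbols are illustrative)
\[
T_{ijkl} = \mu + \alpha_i + \beta_j + \gamma_k + (\beta\gamma)_{jk} + u_l + \varepsilon_{ijkl},
\qquad u_l \sim \mathcal{N}(0, \sigma_u^2), \quad
\varepsilon_{ijkl} \sim \mathcal{N}(0, \sigma^2)
\]
% \alpha_i: order of measurement (first/second test within a visit; 'fatigue')
% \beta_j:  visit (first/second; 'learning')
% \gamma_k: device (strain gauge/Biodex; absolute difference between devices)
% (\beta\gamma)_{jk}: device x visit interaction (device-specific learning)
```

In this form, a significant order effect corresponds to fatigue within a visit, a significant visit effect to learning between visits, and a significant interaction term to learning that differs by device.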
Results
Baseline characteristics of both cohorts from the study are presented in Table 1. One patient from the first cohort and three from the second did not perform the 6-minute walk test (6MWT). In two patients, one force measurement on one of the devices was missing, so these data were not included in the test–retest analyses, resulting in n = 45 and n = 44 for strain gauge and Biodex, respectively.
Table 1. Participants’ characteristics (a).
BMI: body mass index; FEV1: forced expiratory volume in 1 second; FVC: forced vital capacity; 6MWT: 6-minute walk test.
(a) Data are presented as mean ± SD. One patient from the first cohort and three from the second did not perform 6MWT. One patient from the second cohort did not perform muscle force assessment during visit 1 due to leg pain after completion of the 6MWT.
(b) Percentage of predicted was calculated for quadriceps force measured by Biodex.
Validity
A very large correlation was evident between strain gauge measures of peak quadriceps force and non-volitional Tw force (r = 0.86, p < 0.001; Figure 2), independent of gender (n = 92). In the cohort of 46 patients, a very large correlation was also evident between MVC recorded from strain gauge (torque calculated from the original force measures) and MVC from Biodex (r = 0.88, p < 0.0001), as assessed during the first visit.

Figure 2. Correlation between non-volitional and maximal voluntary quadriceps force measured by the strain gauge (n = 92).
Test–retest and absolute reliability
A summary of peak volitional quadriceps force measures and reliability estimates (ICC, SEM, SRD, SRD%) obtained during the two visits for both devices is presented in Table 2.
Table 2. Quadriceps force and reliability estimates obtained from visits 1 and 2 for strain gauge and Biodex (based on the cohort of 46 patients).
V1: first visit for measurements; N·m: Newton meter; V2: second visit for measurements; ICC: intra-class correlation coefficient; SEM: standard error of measurement; SRD: smallest real difference; %: percentage.
Test–retest Bland–Altman analyses for strain gauge (Figure 3) and Biodex (Figure 4) revealed good mean agreement and narrow limits of agreement across the two visits. For the strain gauge, mean difference was 3.74 N·m and limits of agreement −17.68 N·m to 25.15 N·m. For Biodex, mean difference was −1.67 N·m and limits of agreement −31.74 N·m to 28.41 N·m. The coefficients of repeatability were ±21.42 and ±30.07 N·m for strain gauge and Biodex, respectively.

Figure 3. Bland–Altman plot of test–retest agreement across visits 1 and 2, strain gauge (n = 44).

Figure 4. Bland–Altman plot of test–retest agreement across visits 1 and 2, Biodex (n = 45).
Further explorations of test performance (secondary study aims)
Mean muscle force significantly decreased (−6%) from the first to second test of each visit (136 ± 40 N·m vs. 128 ± 40 N·m, respectively; p < 0.001). No differences existed between mean muscle force measurements at visits 1 and 2 (132 ± 40 N·m vs. 133 ± 40 N·m, respectively; p = 0.53), and no learning effects were detected for either device (interaction device × visit, p = 0.18). Overall mean muscle force measures did not differ between strain gauge and Biodex (133 ± 40 N·m and 132 ± 40 N·m, respectively; p = 0.64).
Discussion
The present study findings are novel and relevant in supporting the validity of the strain gauge to measure MVCs in patients with COPD. They also indicate that measurements obtained with this device are at least as reliable and reproducible as those obtained via computerized dynamometry, considered the ‘gold standard’ for MVC measures. 18 As such, these data strongly support the recommendations of the ATS/ERS regarding assessment of quadriceps force using the strain gauge in patients with COPD. 6 This is important because robust computerized dynamometers, while commonly used to assess isometric force in COPD research, 17,31 are not easily available within the clinical environment. A recent international survey reported that the evaluation of lower limb muscle force, upper limb force, lung function and body composition (pooled response option) occurs in only 20% of pulmonary rehabilitation programs, 32 potentially due to limitations such as the availability of appropriate equipment. Our findings add to this scant literature to support the use of a strain gauge as a simple but equally robust measure of quadriceps muscle force in patients with COPD.
A strength of the present study was validation of the strain gauge against both volitional and non-volitional quadriceps contractions. The very large relations between these measures in our data (p < 0.0001, $r^2$ = 0.76) are in line with those observed by Polkey et al. 14 who reported on the use of magnetic femoral nerve stimulation in healthy subjects and those with suspected muscle weakness (p < 0.0001, $r^2$ = 0.83). Validation of the strain gauge against the Biodex system enabled comparison with gold standard dynamometry for assessment of muscle function. The very large correlation between isometric MVC measures from the strain gauge and Biodex reinforces the validity of this technique.
To the best of our knowledge, test–retest reliability of the fixed strain gauge has not been previously reported, nor has a direct comparison been made with the Biodex system 4 pro used in isometric mode. Test–retest reliability of the strain gauge was confirmed in our study through the verified ICC, SEM and SRD estimates. The results slightly favoured the strain gauge over Biodex (lower SEM and SRD values); however, the very small magnitude of difference is of questionable clinical relevance. These outcomes demonstrate high precision of the measurement 29 to discriminate small differences upon measurement. 33 No pattern of systematic over- or underestimation was observed for the strain gauge in the Bland–Altman plot, and dispersion around the mean was less than that of the Biodex. Taken in consideration with the small mean differences of each device (3.74 and −1.67 N·m for strain gauge and Biodex, respectively) and the acceptable repeatability coefficients (±21.42 N·m for strain gauge and ±30.07 N·m for Biodex), we feel this supports the strain gauge as an adequate method for assessing quadriceps force compared to Biodex. Our test–retest reliability estimates for the Biodex system compare favourably with those of other studies performed in patients with COPD, 34 healthy subjects 35,36 and people with late effects of polio, 37 strengthening the external validity of our findings. While the SRD for both devices was relatively large in our study, this appears consistent with the findings from Flansbjer and Lexell 37 derived from Biodex measures of isometric knee extension (SRD% = 17.8 in the less affected limb).
We described slight differences in the testing protocols between the two devices across the different patient cohorts, attributed primarily to positioning of the hip and knee joints. The increased hip extension with the strain gauge was necessary in order to provide effective femoral nerve stimulation with the magnetic stimulator, and the decreased knee flexion with the Biodex was used in accordance with previous research in the COPD patient group that allows comparison with predicted values. 24 Position variation may have influenced the generation of torque due to changes in neural activation, the muscle fibre length–tension relationship (with 60° typically considered ‘ideal’), and/or the complex force transmission through the knee joint. 38,39 Early data from Knapik et al. 40 indicated that isometric peak torque of the leg extensors was greater at 60° than 90° flexion, and Hahn 41 more recently verified that the isometric multi-joint leg extension torque-generating capacity also differs according to knee angle in young healthy men. Data from Krishnan and Williams 39 and Herzog et al., 42 however, contradict this, showing an absence of difference in isometric torque generated at either 60° or 90° knee flexion. In consideration of this information, and the very comparable data pertaining to absolute force and ICCs from both devices in the present study, we do not suspect knee position to have adversely affected our findings. Interestingly, the detection of fatigue in our sample suggests that 30 minutes may not be enough for complete muscle recovery. This did not, however, prevent clear analysis of results regarding the validity and reliability of the strain gauge, as the randomized device order enabled device-specific analysis to proceed with confidence.
Future research may be indicated to develop predictive normal values for strain gauge measures of quadriceps strength. Early research in this area 43 suggests normal values for adults’ MVC are approximately 75% of body weight. While body weight is an important factor, at a minimum, age and gender should also be taken into account. These aspects were considered in the prediction formula later described by Seymour et al. 44 However, the need for fat-free mass measurement limits its applicability in clinical practice. An updated, robust but simple estimate would be of importance to the future clinical implementation of this technique – an integral outcome of the present research.
In summary, this study provides evidence that the fixed strain gauge method to measure quadriceps muscle strength, as proposed in the consensus statement of the ATS and ERS, 6 is valid and reliable for the measurement of isometric quadriceps force in patients with COPD.
Footnotes
Acknowledgements
The authors would like to thank the staff of the Pulmonary Function Department at Gasthuisberg University Hospital (Leuven, Belgium) and the clinical trial unit for their help with the clinical assessments of patients enrolled in this study.
Author contribution
Fernanda Machado Rodrigues and Heleen Demeyer have contributed equally. Thierry Troosters and Christian Osadnik supervised equally.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Flemish Research Foundation (FWO #G.0871.13) and PROactive IMI-JU 115011. FMMR and CAC are funded by The National Council for Scientific and Technological Development (CNPq), Brazil (249579/2013-8 and 202425/2011-8, respectively). CO was the recipient of a long-term European Respiratory Society Fellowship (LTRF 2014 – 3132). CB was a doctoral fellow of Research Foundation Flanders at the time of data collection. HD is the recipient of a joint ERS/SEPAR long-term fellowship (LTRF 2015).
