Abstract
We investigated whether the differences in exercise limitation between patients with chronic obstructive pulmonary disease (COPD) or chronic heart failure (CHF) affect the repeatability or responsiveness of incremental exercise tests. Patients with COPD (Medical Research Council dyspnoea grade 2–5) and patients with CHF (New York Heart Association class II–IV) performed two incremental shuttle walk tests (ISWT) following familiarisation and two incremental cycle ergometer tests (ICE) within 2 weeks. Both tests were repeated on completion of a pulmonary rehabilitation (PR) programme. One hundred and twelve patients were recruited. In response to exercise, patients with COPD were more likely than patients with CHF to have a ventilatory limitation (
Introduction
Cardiovascular disease and chronic obstructive pulmonary disease (COPD) are priority non-communicable diseases. 1 Individuals with either chronic heart failure (CHF) or COPD suffer from similar symptoms of exertional dyspnoea and fatigue. The relationship between worsening primary organ impairment and worsening exercise capacity weakens with increasing severity of the primary organ impairment as deconditioning from physical inactivity becomes a greater determinant. 2,3 Exercise training is an effective therapy for CHF 4 and COPD 5 by at least partially reversing the skeletal muscle impairments 6,7 leading to improvements in exercise performance. International guidance for the management of both CHF and COPD recommend the use of exercise-based interventions to improve exercise performance, dyspnoea, health-related quality of life and reduce healthcare utilisation. 8 –10
Patients with COPD and CHF can train effectively together using the same symptom-based exercise programme rather than the traditional disease-specific models of cardiac rehabilitation (CR) and pulmonary rehabilitation (PR). 11 There is interest in implementing combined exercise programmes, 12 and therefore a need to understand the measurement properties of any outcome measures between the populations. Both the incremental shuttle walking test (ISWT) and incremental cycle ergometry test (ICE) are validated measures of peak exercise capacity in COPD 13 and CHF. 14 Although both patients with COPD and CHF suffer from deconditioning and resultant skeletal muscle impairment, the precise limitation to exercise can be distinct. Patients with COPD frequently have a ventilatory limitation to exercise caused by a mechanical limitation through increasing operating lung volumes. This causes an abrupt halt at peak performance as inspiratory reserve volume is reached and intolerable tachypnoea and dyspnoea ensues. Peak performance may therefore be less variable. In contrast, the cardiovascular limitation to exercise seen in patients with CHF does not involve an increase in operating lung volumes, rather it is similar to the limitation seen in health (a further increase or sustained heart rate (HR) is unable to be tolerated) but at a lower level of peak oxygen uptake (VO2Peak). It could be postulated that individual tolerance of symptoms may result in greater variability of peak performance. However, it is currently unknown if the measurement properties of the tests differ between the two populations.
The overall aim was therefore to compare the repeatability and responsiveness (measurement properties) of the ISWT and ICE between patients with COPD and CHF. We aimed to compare the exercise responses between COPD and CHF, and to describe the mechanisms of exercise limitation, that is, cardiovascular or ventilatory for each patient. We aimed to compare the measurement properties of the ISWT and ICE between the two different diseases and by the type of limitation to exercise. We hypothesised that there would be no differences in the measurement properties of the ICE and the ISWT between the two chronic diseases despite any differences in the mechanism of limitation to exercise.
Materials and methods
Study design
The study protocol was approved by the Leicestershire Research Ethics Council (National Research Registry N0123134233). The results of the main study demonstrating that combined COPD and CHF exercise rehabilitation was feasible and effective are previously published. 15 Data generated from this study were utilised to evaluate the repeatability and responsiveness of commonly used incremental exercise tests between patients with COPD and CHF.
Participants
Patients with a clinical diagnosis of COPD, supporting spirometry of a forced expiratory volume in 1 second/forced vital capacity (FEV1/FVC) <70% and an FEV1 <80% predicted and Medical Research Council dyspnoea scale 2–5 were eligible to participate. Patients with a clinical diagnosis of CHF were eligible to participate, New York Heart Association class II–IV and evidence of left ventricular dysfunction (left ventricular ejection fraction <40% on echocardiography). Patients with combined COPD and CHF were excluded after clinical assessment with spirometry and N-terminal brain natriuretic peptide (BNP) level, respectively. Where the latter was higher than the normal age and gender predicted range an echocardiogram was requested. 11 Other exclusion criteria were any significant co-morbidity severely limiting the ability to exercise or not being test naive which may affect repeatability. 16 Written consent was obtained prospectively.
Pulmonary rehabilitation
Patients underwent a 7-week outpatient PR programme, involving twice weekly supervised sessions of exercise training (individually prescribed aerobic training predominantly by walking and at home) and education, previously described and in accordance with international guidance. 11 All patients with COPD were allocated to receive PR. 17 The patients with CHF were randomised to PR or usual care; the baseline data for both groups have been combined for the repeatability analysis. The data from those who were randomised to PR were included in the responsiveness analysis.
Outcomes
For baseline measures, patients completed the schedule of testing over four visits with at least 48 hours between visits; visit 1: baseline ICE1, visit 2: familiarisation ISWT and ISWT1, visit 3: second baseline ICE2 and visit 4: ISWT2. After a 7-week programme of PR, patients completed another ICE and ISWT on separate days. Randomisation of tests performed was not conducted as the completion of an ICE was required for patient safety and eligibility for inclusion in the study, and was therefore always the first test performed. Patients were excluded if they were already familiar with either test or they were unable to perform cycle ergometry due to musculoskeletal problems.
Patients performed a symptom-limited, maximal, ICE test on an electronically braked cycle tests ergometer with expired gas analysis using a step 10 W/minute protocol (Zan-680 Ergo Test; Zan Messgeraete GMbH, Oberthulba, Germany) to recommended guidelines. 16 Non-invasive electrocardiography monitoring and pulse oximetry were continuously monitored. Measures of blood pressure were recorded before and after each test and at 2-minute intervals during the test alternately with the Borg scale of breathlessness (BS) and perceived exertion (PE) scores.
Patients were asked to avoid eating 4 hours before testing. The tests were performed at the same time of day and by the same operator. Patients were asked to cycle between 40 revolutions/minute and 80 revolutions/minute. Data were analysed using breath-by-breath analysis for outcomes of the VO2Peak, peak ventilation (VEPeak) and respiratory exchange ratio. Patients were defined as ventilatory limited if the ratio of VEPeak divided by maximum voluntary ventilation (MVV) was >0.85, where MVV = FEV1 (L) × 37.5. 18 Cardiovascular limitation was defined as peak HR >80% of maximal predicted HR. 19 Chronotropic incompetence occurs for patients prescribed a beta blocker 20 and for these patients HRPeak was calculated as recommended. 21
The ISWT is a maximal, externally paced, symptom-limited test conducted along a 10 m course. 22 The ISWT has been validated both in patients with CHF 14 and with COPD 13 and is reproducible after one practice test (familiarisation). 23 A familiarisation test was performed as per recommendations. 22 Distance walked was recorded to the last 10 m completed along with the reason for termination. Measures of pulse oximetry and HR were recorded before and after each test along with the BS and PE scores.
Statistical analysis
Baseline demographic and exercise data were described as mean (standard deviation (SD)) or median (interquartile range) for normally and non-normally distributed continuous data, respectively. The comparison between patients with COPD and CHF of baseline demographic and exercise data was described using an independent
Results
The demographics between patients with CHF (
Baseline demographics for patients with CHF and COPD.a
BMI: body mass index; LVEF: left ventricular ejection fraction; NYHA: New York Heart Association; FEV1/FVC: forced expiratory volume in 1 second/forced vital capacity; BNP: N-terminal brain natriuretic peptide; MRC: Medical Research Council; COPD: chronic obstructive pulmonary disease; CHF: chronic heart failure; SD: standard deviation; IQR: interquartile range.
amedian (IQR).
Comparison of the baseline exercise test results between COPD and CHF
The results of the ISWT and ICE between COPD and CHF are shown in Table 2. Peak performance was similar between the patients with COPD and CHF. Patients with COPD more frequently had a ventilatory limitation to exercise than patients with CHF (57% vs. 2%,
A comparison of the physiological responses to the ISWT and ICE between patients with COPD and CHF.a
SpO2: oxygen pulse oximetry; BS: Borg breathlessness score; PE: perceived exertion score; VO2Peak: peak oxygen uptake; VCO2Peak: peak carbon dioxide production; VEPeak: peak ventilation; VE/MVV: peak ventilation/maximum voluntary ventilation; ventilatory limitation = >0.8 VE/MVV; RER: respiratory exchange ratio; ISWT2: baseline incremental shuttle walk test 2; ICE2: baseline incremental cycle ergometer test 2; COPD: chronic obstructive pulmonary disease; CHF: chronic heart failure; SD: standard deviation; IQR: interquartile range; HR: heart rate; CVS: cardiovascular.
a Data displayed as mean (SD) or median (IQR).
b Median (IQR).
Repeatability of ISWT and ICE
There was a mean (SD) increase of 20 (4) m between ISWT familiarisation and ISWT1 (

Repeatability of ICE VO2Peak and ISWT distance for patients with COPD and CHF. The solid line represents the mean difference between the two tests, the dashed line represents the coefficient of repeatability and the dotted line shows the level of precision (95% CI) of the coefficient of repeatability. Disease is represented by triangles for CHF and circles for COPD. CR: coefficient of repeatability; ISWT: incremental shuttle walk test; ICE: incremental cycle ergometer test; VO2Peak: peak oxygen uptake; ISWT1: Baseline incremental shuttle walk test 1; ISWT2: second ISWT; ICE1: baseline ICE test; ICE2: second ICE; COPD: chronic obstructive pulmonary disease; CHF: chronic heart failure.
Physiological response to exercise testing at baseline.
HR: heart rate; SpO2: oxygen pulse oximetry; BS: Borg breathlessness score; PE: perceived exertion score; VO2Peak: peak oxygen uptake; VCO2Peak: peak carbon dioxide production; VEPeak: peak ventilation; RER: respiratory exchange ratio; ISWT: baseline incremental shuttle walk test; ICE: baseline incremental cycle ergometer test; SD: standard deviation; IQR: interquartile range.
a Median (IQR).
b Difference between ISWT2 and ICE2,
response of ISWT and ICE to PR
The mean change in ISWT for both COPD and CHF following PR has previously been reported
11
and a large ES observed. There was no difference in responsiveness between COPD (

Responsiveness of the ISWT distance (m) and ICE VO2Peak (L/minute) to PR between patients with COPD or heart failure. Disease is represented by a solid line with circles for COPD and squares with a dashed line for CHF. # represents intragroup difference and * represents intergroup difference. SEM: standard error mean; ISWT: incremental shuttle walk test; ICE: incremental cycle ergometer test; VO2Peak: peak oxygen uptake; PR: pulmonary rehabilitation; COPD: chronic obstructive pulmonary disease; CHF: chronic heart failure.
Discussion
We compared the measurement properties of the ISWT and the incremental cardiopulmonary exercise test performed on a cycle ergometer (ICE) between patients with symptomatic COPD or CHF. Despite differences in exercise responses, namely increased frequency of oxygen desaturation and ventilatory limitation to exercise in COPD group, the ISWT distance and ICE VO2Peak were similarly repeatable in both populations. Our data showed no difference in the responsiveness to the intervention between either disease or the type of exercise limitation to exercise.
We report, in over a hundred patients, stability of ICE VO2Peak over 2 weeks in both COPD and CHF supported by high ICCs. Previous small studies have shown conflicting results regarding the repeatability of ICE VO2Peak in COPD or CHF.
26
Part of a systematic review of exercise tests in patients with COPD
27
described four studies assessing the repeatability of ICE VO2Peak over two tests (
In smaller studies in specific diseases, the time between testing varied from same day, 28 consecutive day 30 –32 or up to 2 weeks. 29,34 –36 Protocols also varied between an individualised approach of either 5 or 10 W, 30 and others employing a standard increment of between 10 W and 30 W. Only one study reported using a step protocol 30 and more often the type of protocol: step or ramp was not reported. We confirm stability of ICE VO2Peak over 2 weeks using a step protocol. In our cohort, the specific disease did not alter the repeatability, and we report for the first time that ICE VO2Peak is stable whether there is either a ventilatory or cardiac limitation to exercise despite the different mechanisms causing exercise termination. Overall, where VO2Peak derived on a cycle ergometer is an outcome measure, full familiarisation testing is not required for patients with COPD or CHF even when cycling is an unfamiliar exercise.
We report the ISWT distance (after a single familiarisation test) to be repeatable with high reliability for patients with either COPD or CHF. Our data support international guidelines for COPD which recommend the use of a familiarisation test for the ISWT 37 , as we found a significant increase from the familiarisation test to test 1 (20 ± 4 m) consistent with other studies either in CHF or COPD. 23,38,39 Similar results with a significant learning effect of 29.5 m have been reported for a CR population. 40 Following an initial increase after familiarisation, the ISWT is similarly repeatable in both COPD and CHF 22,39,41 and we report stability over 2 weeks. The stability of the ISWT distance was unaltered by disease or presence of either a ventilatory or cardiac limitation.
Our data highlight that the mean ISWT and ICE VO2Peak are stable over time in patients with COPD or CHF. However, care is needed when interpreting data for a single individual in clinical practice. We report significant individual variability over time (without an intervention) for both the ISWT (60 m) and ICE VO2Peak, (0.27 L/minute). The coefficient of repeatability for the 6-minute walk test (6MWT) is at least the same magnitude previously described at 82 m. 25 The individual variability is particularly relevant when individual responses to interventions are being used to directly inform clinical decisions, for example, response to pharmacological therapy.
We report that the magnitude of responsiveness of both tests is similar in both COPD and CHF tested under the same conditions by the same operators. There is little published data reporting the responsiveness ISWT in the heart failure population but it is responsive to CR in a typical cohort. 42 A small study suggested significant improvements of 60 m in the ISWT after heart failure rehabilitation post-hospitalisation. 43 While systematic reviews have reported a positive response in ICE testing and the 6MWT, the ISWT has not been included. 44 However, the ISWT distance has been widely used in patients with COPD undergoing PR and average improvements reported between 36 m and 61 m. 27 We describe the MID for a group of patients with either COPD or CHF undergoing PR to be 30 m. The estimation of this MID, while not applicable for use at an individual level, is a useful statistical threshold for use when powering studies and reviewing service performance.
The ICE VO2Peak has been more widely used in CHF rehabilitation than the ISWT and a recent systematic review demonstrated an overall response of 2.16 mL/kg/minute compared to usual care. 44 A systematic review and meta-analysis of PR versus usual care in COPD reported an increase in peak watts but did not report the effects on VO2Peak. 5 In our study, there was only a very small increase in ICE VO2Peak following rehabilitation with a low ES in both populations. The differences in response of the ISWT distance and ICE VO2Peak in our study may be explained by the difference in test platform compared to the modality of training.
We have specifically evaluated the peak measurements derived from the ISWT and a cardiopulmonary exercise test on a cycle ergometer as these are the most widely used outcome measures. We did not perform expiratory gas analysis for the ISWT; however, the validity of the ISWT distance compared to laboratory CPET (cardiopulmonary exercise test) has been described previously for both populations although on a treadmill rather than a cycle ergometer. 39,45,46
The possibility of a type II error cannot be excluded; however, for the majority of comparisons, the numerical values were similar for repeatability and responsiveness for both tests regardless of disease or precise limitation to exercise. It is therefore less likely that we are missing a meaningful difference. We have also provided the precision of the coefficient of repeatability which shows narrow 95% confidence intervals.
We provide novel information on two commonly used incremental exercise tests to help clinicians and researchers accurately interpret the results of these tests after an intervention to understand if a true change has occurred both in an individual patient and for a group. Healthcare provision is starting to be organised around symptoms for example breathlessness rather than specific diseases and will be aided by having well-validated outcome measures across different disease populations. Although we describe the MID of the ISWT by distribution method, the relatively small sample size is noted.
Conclusion
Our data demonstrate that the ISWT distance (after familiarisation) and ICE VO2Peak are similarly repeatable, reliable and responsive measures for patients with COPD or CHF despite their different exercise responses. A change of 60 m in the ISWT distance or 0.270 L/minute in ICE VO2Peak is required in clinical practice to be 95% certain that a true change has occurred within an individual patient. The ISWT distance was more responsive than the ICE VO2Peak in both conditions. For a group of patients with either COPD or CHF undergoing PR, the MID in ISWT distance is estimated to be 30 m.
Footnotes
Acknowledgements
RAE serves as the guarantor of the paper as a whole and takes responsibility for ensuring integrity of the data collected and along with TCHD, responsible for analysing, interpreting the data, and drafting and critically revising the manuscript. RAE, MDM and SJS contributed to the initial study design. MDM, SJS and MCS contributed to data interpretation and critically revising the manuscript.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship and/or publication of this article: Rachael A Evans is currently funded by an NIHR clinician scientist fellowship (CS-2016-020-16). The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health.
