Abstract

Pain intensity is the most clinically relevant dimension of nearly all headache attacks. Accurate, reliable measurement of pain is therefore critical to the evaluation of outcomes in clinical trials of headache treatments, but pain is inherently subjective and difficult to measure. A number of pain scales have been developed and are commonly used in clinical practice and research. Several of these are depicted in Figure 1.

Figure 1.

Commonly used pain scales – the VAS, the 4-point VRS, the 6-point VRS, the NRS and the Faces Pain Rating Scale (revised).

In headache research, an important precedent was established by the early triptan trials, which used a 4-point verbal rating scale (VRS) to measure pain intensity. International Headache Society guidelines for the conduct of controlled trials of headache treatments recommend the use of this scale or the 100 mm visual analogue scale (VAS). The 4-point VRS has the virtue of simplicity, but has been criticised for statistical reasons and because the relatively small number of categories may not adequately discriminate among clinically relevant changes in pain intensity (1,2). Although the use of the 4-point VRS is an established precedent, might there be better ways to measure pain in headache trials? Somewhat surprisingly, there is a paucity of research regarding the use of pain rating scales in headache, although there are many studies evaluating their performance in other types of acute and chronic pain. Studies in headache populations are particularly important in view of previous research suggesting that the interchangeability of pain rating scales may differ based on pain aetiology (3).

The study by Aicher et al. in this issue of the journal is therefore a welcome addition to the headache literature (4). The authors used information collected during a German clinical trial of an over-the-counter combination medication for acute treatment of headache. Subjects were given a 100 mm VAS and asked to mark on it the point representing one of the categories of a 6-point VRS, chosen at random. The VRS categories were given in German as follows (English translation in parentheses): kein Schmerz (no pain); leichter Schmerz (mild pain); maessiger Schmerz (moderate pain); starker Schmerz (severe pain); ueberaus starker Schmerz (very severe pain); and staerkster vorstellbarer Schmerz (most severe pain imaginable). This was repeated with a fresh, unmarked VAS until all six VRS categories had been assessed. The same procedure was repeated at the end of the study, providing an opportunity to assess reliability. Data were analysed from 1457 subjects with a median age of 38, three-quarters of whom were women.

The goals of this portion of the study, as described by the authors, were to assess both the VAS and 6-point VRS with respect to consistency of category rank order; to determine cut-off points on the VAS corresponding to the VRS categories; to evaluate how the categories of the VRS are represented on the VAS; and to assess test–retest reliability after repetition of the complete training procedure at study conclusion. Results showed that roughly three-quarters of subjects rated the six VRS categories in the same order on the VAS at the first and fourth (final) study visits. The most common inconsistencies in order (that is, categories marked on the VAS in reverse order from the VRS) were observed between mild and moderate pain (12.6% and 13.6% at visits 1 and 4), and severe and very severe pain (9.1% and 6.7% at visits 1 and 4).
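The rank-order check described above can be made concrete with a short sketch. It is an illustration only, not the authors' analysis code: the function name `order_inversions`, the example marks, and the decision not to count ties as inversions are all assumptions introduced here.

```python
from typing import Dict, List, Tuple

# The six VRS categories in ascending order of intensity (English translations).
VRS_ORDER = [
    "no pain",
    "mild pain",
    "moderate pain",
    "severe pain",
    "very severe pain",
    "most severe pain imaginable",
]


def order_inversions(marks: Dict[str, float]) -> List[Tuple[str, str]]:
    """Return adjacent VRS category pairs whose VAS marks are in reverse order.

    `marks` maps each VRS category to the VAS position (mm) one subject chose.
    ASSUMPTION: equal marks (ties) are not counted as an inversion.
    """
    inversions = []
    for lower, higher in zip(VRS_ORDER, VRS_ORDER[1:]):
        if marks[lower] > marks[higher]:
            inversions.append((lower, higher))
    return inversions


# Hypothetical subject who marked "mild" higher on the VAS than "moderate".
example_marks = {
    "no pain": 0.0,
    "mild pain": 25.0,
    "moderate pain": 20.0,
    "severe pain": 60.0,
    "very severe pain": 85.0,
    "most severe pain imaginable": 100.0,
}

print(order_inversions(example_marks))
```

A subject with an empty result would count among the roughly three-quarters who ordered all six categories consistently; the pair returned for the hypothetical subject is the mild/moderate inversion, the one reported most often.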

Receiver operating characteristic (ROC) curves were used to determine the cut-off points for VAS values that best fit the VRS categories. A non-equidistant scaling was found to be the best match, with the smallest range of VAS ratings corresponding to the extreme categories of the VRS (0–2 mm for no pain and 96–100 mm for most severe pain imaginable). A broader range of VAS scores corresponded to intermediate VRS categories (for example, 17–47 mm for moderate and 47–77 mm for severe pain). The ability of the VAS to accurately distinguish between two VRS pain categories (sensitivity) ranged from 76.6% to 98%, depending on the categories in question. Test–retest agreement was high. The authors conclude that ‘… VRS categories cannot be presented in an equidistant manner on the VAS, and that against previous assumptions, the pain intensity descriptors are less clear and can have different meanings in different languages.’ Perhaps more controversially, they suggest that ‘both in the ICHD-III and in the guidelines for clinical trials of patients with headache illnesses, rather than a 4-grade VRS, a 6-grade or higher level VRS or a VAS should be recommended, with correspondingly broadly defined anchor points’.
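To make the non-equidistant calibration tangible, the reported cut-offs can be written as a simple lookup. This is a minimal sketch under stated assumptions: only the ranges for no pain (0–2 mm), moderate (17–47 mm), severe (47–77 mm) and most severe pain imaginable (96–100 mm) are quoted above; the ranges for mild and very severe pain are inferred here from the gaps, and the handling of shared boundary values such as 47 mm is a choice made for illustration. The function name `vas_to_vrs` is hypothetical.

```python
def vas_to_vrs(vas_mm: float) -> str:
    """Map a VAS mark (0-100 mm) to a 6-point VRS category.

    Cut-offs at 2, 17, 47, 77 and 96 mm come from the ranges quoted in the text.
    ASSUMPTION: "mild" (2-17 mm) and "very severe" (77-96 mm) fill the gaps
    between the reported ranges; they are not quoted in the text.
    ASSUMPTION: a value on a shared boundary (17, 47, 77, 96) is assigned to
    the higher category, except that 2 mm stays in "no pain".
    """
    if not 0 <= vas_mm <= 100:
        raise ValueError("VAS mark must lie between 0 and 100 mm")
    if vas_mm <= 2:
        return "no pain"
    if vas_mm < 17:
        return "mild pain"  # inferred range
    if vas_mm < 47:
        return "moderate pain"
    if vas_mm < 77:
        return "severe pain"
    if vas_mm < 96:
        return "very severe pain"  # inferred range
    return "most severe pain imaginable"


# Illustrative marks only; these are not study data.
for mark in (1, 30, 60, 98):
    print(mark, "mm ->", vas_to_vrs(mark))
```

Written this way, the unequal spacing is easy to see: the two extreme categories occupy 2 mm and 4 mm of the line, whereas moderate and severe pain each span 30 mm.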

A number of the study findings are noteworthy. The authors showed ingenuity in using routine clinical trial data to examine the relative performance and calibration of two pain intensity scales. Some study results, however, may reflect the study methodology, rather than ambiguities or translational instability of the anchor labels. With regard to the first study objective, evaluation of the consistency of category order, the incongruities were mainly noted at the extremes of the scale. This may be due to the fact that anchor labels were presented in random order; thus, some subjects may have been asked to supply a VAS rating for ‘mild’ or ‘very severe’ pain without knowing they would subsequently be asked to rate more extreme categories of ‘no’ or ‘most severe pain imaginable’. The study design prevented them from changing their answers when they did understand the full range of categories. In the future, researchers may wish to make subjects aware of the range of categories (and anchors) that they will be asked to rate ahead of time.

On the other hand, these findings are consistent with a recent systematic review of pain rating scales, which concluded that ‘it seems likely that the labels influence the responses, maybe even more at the upper end of the scale than at the lower end, particularly so in different languages and cultures’ (5). In any case, the possible effect of anchor terms on responses to rating scales in headache trials certainly deserves more attention than it has received.

The finding of non-equidistant scaling of the VRS categories on the VAS is not surprising. Unlike the phrases describing intermediate levels of pain, there is very little ambiguity in either English or German about the phrases ‘no pain’ and ‘most severe pain imaginable’ (‘kein Schmerz’ and ‘staerkster vorstellbarer Schmerz’). Given this, the unexpected finding is that there was any range at all for these categories on the VAS. It is possible this is also an artefact of the study design just discussed.

The calibration of VRS categories with the VAS will certainly be of use to future researchers. Some caution is in order, however, in assuming these findings are generalisable to other headache studies. These data come from a mixed headache population (tension-type and migraine headaches) who typically use over-the-counter medication. They are likely to have milder, more treatment-responsive headaches than subjects in the majority of clinical trials of headache treatments. The study also compared paper versions of these pain rating scales, rather than the computer-administered versions that are increasingly used in clinical trials. It is possible that paper and computer-administered versions, even of the same scale, capture information differently, another matter that deserves future study. It is also plausible there are meaningful sex differences in the performance of various rating scales in headache sufferers, but this has not been systematically studied. Finally, the findings of this study are limited to a comparison of the VAS with a 6-point VRS. It is unclear how these scales would calibrate against other commonly used scales, in particular the 4-point VRS used in most triptan trials. A previous study that compared VAS and 4-point VRS in migraine patients addressed only the matter of statistical power, and concluded that the two measures were ‘approximately equal’. That study did not, however, evaluate other aspects of the scales such as ease of use, compliance, or responsiveness to change, which remain topics that deserve the attention of researchers. Table 1 summarises recommendations for future research.

Table 1.

Recommendations for future research on pain intensity scales in headache

• Continue to investigate the influence of anchor labels and language on pain ratings.

• Compare the VRS and VAS to other pain rating scales, particularly the NRS.

• Study different populations (e.g. those with severe headaches, those with cluster headache, etc.) during headaches and in the pain-free state.

• Investigate sex differences in pain ratings.

• Compare computer with paper versions of the scales.

Several other pain rating scales may deserve consideration for use in certain situations. For example, the Faces Pain Rating Scale can be used in children or non-verbal populations and also performs well in ordinary adults (6). Numerical rating scales (NRS) probably deserve particular attention, given several potential advantages in comparison with the VRS and VAS (7). The 11-point (0–10) NRS is ‘preferred by the majority of patients in different cultures’, according to the findings of a recent systematic literature review (5). The authors of that review identified 54 studies that compared NRS, VRS and VAS for unidimensional self-report of pain intensity. Most studied postoperative pain intensity. Eight versions of the NRS (NRS-6 to NRS-101) were tested, with 15 different descriptors used to anchor the NRS. The authors concluded that compliance with the NRS was superior to that with the VAS and VRS, and that the NRS was also more responsive to change and easier to use. They noted that although ‘many studies showed wide distributions of NRS scores within each category of the VRSs…’ in general the correspondence between these measures was good. Another study found that the VAS ‘tends to have higher failure rates than the NRS or VRS, probably because both the NRS and the VRS are very easy to understand and complete by patients’ (8). Finally, a recent study that compared all four of these pain scales in a population of volunteer university students concluded that there were only small differences in responsiveness among them, but that ‘most support emerged for the NRS as being both most responsive and able to detect sex differences in pain intensity’ (9).

Even if another pain rating scale is shown to be superior, the traditional 4-point VRS will continue to be relevant for historical reasons. It will always be desirable to compare the performance of newer drugs with older ones. Head-to-head trials are the gold standard for such comparisons but are not always feasible. Meta-analyses will be needed, and their findings will be most valid if included studies have used the same pain rating scales. Thus, it will remain important to continue to collect information using the traditional 4-point VRS, at least as a secondary outcome of headache treatment trials.

In conclusion, this study advances our knowledge of pain assessment in patients with headache, and provides researchers and trialists with valuable information to facilitate the design of future studies. It seems premature, though, to conclude that the guidelines for controlled trials in headache should be changed to recommend the 6-point VRS instead of the 4-point VRS. Instead, additional study is needed because there is a dearth of studies that directly compare these other measures in a wide range of headache populations.

References

1. Price, Bush, Long. A comparison of pain measurement characteristics of mechanical visual analogue and simple numerical rating scales. Pain 1994; 56: 217–226.

2. Jensen. Pain assessment in clinical trials. In: Wittink, Carr (eds) Pain Management: Evidence, Outcomes and Quality of Life in Pain Treatment. Amsterdam: Elsevier, 2008; 57–58.

3. Lund, Lundeberg, Sandberg. Lack of interchangeability between visual analogue and verbal rating pain scales: a cross sectional description of pain etiology groups. BMC Med Res Methodol 2005; 5: 31.

4. Aicher et al. Pain measurement: Visual Analogue Scale (VAS) and Verbal Rating Scale (VRS) in clinical trials with OTC analgesics in headache. Cephalalgia 13 December 2011, DOI: 10.1177/0333102411430856.

5. Hjermstad, Fayers, Haugen. Studies comparing Numerical Rating Scales, Verbal Rating Scales, and Visual Analogue Scales for assessment of pain intensity in adults: a systematic literature review. J Pain Symptom Manage 2011; 41: 1073–1093.

6. Hicks, von Baeyer, Spafford. The Faces Pain Scale – Revised: Toward a common metric in pediatric pain measurement. Pain 2001; 93: 173–183.

7. Bergh, Sjostrom, Oden. An application of pain rating scales in geriatric patients. Aging 2000; 12: 380–387.

8. Dijkers. Comparing quantification of pain severity by verbal rating and numeric rating scales. J Spinal Cord Med 2010; 33: 232–242.

9. Ferreira-Valente, Pais-Ribeiro, Jensen. Validity of four pain intensity rating scales. Pain 2011; 152: 2399–2404.

Measuring pain intensity in headache trials: which scale to use?
