Abstract
Normal heat pain threshold (HPT) and cold pain threshold (CPT) repeatability should be estimated in order to identify thermal allodynia in longitudinal studies, but such data are scarce in the literature. The aim of our study was to estimate normal HPT and CPT repeatability in the face, forehead, neck and hand. In addition, we reviewed briefly normative studies of thermal pain thresholds relevant for headache research. Thermal pain thresholds were measured on three different days in 31 healthy headache-free subjects. Coefficients of repeatability and normal limits were calculated. HPT and CPT were lowest in the face. Pooled across regions, the lower repeatability limit for the test/retest ratio was 63% for HPT and 55% for CPT. The upper normal CPT limit varied between 24.5°C and 29.7°C. Lower HPT limits ranged between 35.5°C and 40.8°C. Quantitative sensory methods provide useful information about headache and pain pathophysiology, and it is important to estimate the normal test/retest repeatability range in follow-up studies.
Keywords
Introduction
Thermal quantitative sensory thresholds (QSTs) are useful in detecting peripheral neuropathy in diabetes (1) and for quantification of thermal allodynia in painful conditions like migraine (2,3) or other hyperalgesic conditions (4). In order to measure significant clinical disease- or condition-related changes (e.g., the onset of thermal allodynia), it is important to know the normal day-to-day variability of HPT and CPT. Moreover, it is mandatory to report reference limits (‘the normal range’) for this variability. One of the most useful measures of the normal day-to-day variability range is the coefficient of repeatability (CR) (5). The heat pain threshold (HPT) CR has been quantified as 5.9°C in the hand (6) while data on cold pain threshold (CPT) repeatability is particularly scarce in the literature. Even the most recent papers on thermal test repeatability in healthy subjects do not report CR (7,8). In order to use thermal pain thresholds in headache research, it is also mandatory to quantify normal repeatability in the face and neck regions. The present study on healthy subjects was part of a larger study to investigate thermal QST in migraineurs (9). The main aim of our analysis was to estimate HPT and CPT repeatability in the face, forehead, neck and hand in healthy subjects.
Subjects
Thirty-one healthy controls (28 women and 3 men), with a mean age of 40 years (standard deviation [SD] 11.4, range 19–64 years), mean height 168.5 (SD 7.4) cm, weight 70.8 (SD 10.2) kg and days from the last menstruation 14.3 (SD 5.9), were recruited from a local workplace and among blood donors. They were interviewed by an experienced research nurse to exclude headache, neurological disorders, psychiatric disorders, significant painful conditions, traumatic injury to the central or peripheral nervous system, cancer, arthritis, cardiovascular disease and chronic medication use. The age and sex distribution reflect that these subjects were recruited as controls for the migraine study (9). The test was repeated on three different visit days (days A, B and C) at the same time of the day. The interval between consecutive tests was 3 to 10 days (77% of intervals were between 6 and 8 days). Caffeine-containing beverages and tobacco were not allowed after midnight on the day of the investigation. Subjects signed a written informed consent form and received the equivalent of 150 US dollars after completing the three sessions to cover expenses. The study was approved by the Regional Committee for Research Ethics and Norwegian Social Science Data Services.
Methods
HPT and CPT were measured on SOMEDIC SenseLab equipment (Somedic Sales AB, Stockholm). This device uses the method of limits (ML) to determine pain thresholds (10,11). Although using ML to measure pain thresholds depends on reaction time, as opposed to the method of levels, ML is faster, widely used in research, and fully adequate for clinical use. Four sites were stimulated with a hand-held, rectangular 25 × 50 mm Peltier element thermode (Somedic Sales AB, Stockholm): the forehead (frontal region above the eyebrows aligned with the inner canthus), face (maxilla 1 cm below the lower orbital rim close to the nasal border), neck (at the insertion of the sternocleidomastoid muscle just below the lower margin of the processus mastoideus) and hand (thenar eminence overlying the abductor pollicis brevis muscle). The maximal temperature range was 5°C–50°C and the rate of change was 1°C per second. The baseline thermode temperature was set to 32°C.
The subject was lying supine on a couch. Three heat stimuli were followed by three cold stimuli with a random interstimulus interval of between 4 and 6 seconds. Stimuli were applied consecutively to the left and right hand, left and right forehead, left and right face, and left and right neck. This fixed order was used in all patients to reduce inter-individual variability. The subject was instructed to press the ‘stop’ button immediately when pain appeared. The button was held in the right hand, except for during the right thenar examination, when the left hand was used.
Data analysis
Mean (N = 3 stimuli per site within the session) HPT and CPT were calculated as the average of right- and left-sided thresholds. In subjects who did not report pain within the 5°C–50°C range, we defined HPT as 50°C (in 13% of measurements) and CPT as 5°C (in 27% of measurements).
Day A/day B and day B/day C test/retest differences for HPT and CPT, and the standard deviation (SD) of these differences (SDdif) were calculated. The day-to day difference was assessed by Wilcoxon’s test. A clinically useful measure of the normal day-to-day variability range is the CR, defined as 2 × SDdif (5). In addition, we calculated the difference thresholds (HPTd =HPT − 32 and CPTd = 32 − CPT) in order to detemine the ln(PTddayA/PTddayB) and ln(PTddayB/PTddayC) test/retest ratios (12,13). Ratios were calculated with random denominator, either ln(A/B) or ln(B/A) and either ln(B/C) or ln(C/B) to control for possible order-effects. ln-transformed variables are preferred when distributions are skewed; for example, when test/retest differences increase as the pain thresholds increase. Since ln(A/B) = ln(A)−ln(B), it is easy to realize that SD of an ln-transformed ratio is equivalent to SDdif. Mean and SD were calculated for each ln-transformed ratio, and repeatability limits (mean ± 2SD) were retransformed back to the ratio scale and tabulated as a percentage. These retransformed limits are analogous with ±2SDdif = ±CR. SYSTAT version 11 (Systat Software Inc, Chicago) and SPSS version 13 (SPSS Inc, Chicago) were used for statistical analysis.
Results
Heat and cold pain thresholds and repeatability in 31 headache-free controls
SD = standard deviation. PT = pain threshold. R = right. L = left. 1R + L average (°C) thresholds were determined on days A, B and C, 3–10 days apart. 2Coefficient of repeatability (CR) = 2 SDdif = 2 × SD of day A/B and day B/C differences. 3PT = thermal pain threshold on Celsius scale. 4PTd: PT difference (HPTd = HPT−32°C. CPTd = 32°C−CPT. 5LN(PTddayA/PTddayB) and LN(PTddayB/PTddayC) ratios: upper limit = mean + 2SD, lower limit = mean−2SD (retransformed to ratio scale; ratio = 1 is reported as 100%). 6SD of individual R-L differences. 7One outlying asymmetry (14°C) is excluded.
Pooled day A thresholds were slightly lower than day B thresholds (HPT mean 45.0°C [SD 3.8] and 46.1°C [SD 3.4]; Wilcoxon p < .0005 and CPT mean 12.6°C [7.6] and 14.5°C [7.6]; Wilcoxon p = .02). Pain thresholds did not differ on day B and day C.
Repeatability for CPT was lower in the forehead and the neck compared to the face and the hand. In the forehead and neck, HPT repeatability was slightly better than CPT repeatability. The lowest HPT and the highest CPT values were found in the face.
Review on normal arm and head thermal pain thresholds with methods of limits
BSS = between subjects sum-of-square. CPT = cold pain threshold in °C. CR = coefficient of repeatability. ESS = Error sum-of-square. HPT = Heat pain threshold in °C. ICC = intraclass correlation coefficient. LL = lower limit (reported, calculated or estimated from figures). M = mean. ML = method of limits. NR = no response. p = percentile. RT = reaction time. SD = standard deviation. UL = upper limit (reported, calculated or estimated from figures). aSubjects: number of healthy control subjects (F = female; G = girls; M = men). b Baseline: neutral starting temperature for the thermode. cMedian. d10 centers. eSites not specified further.
Discussion
In order to determine the normal range for day-to-day change in HPT and CPT, the CR variable is recommended (5). However, most authors use neither this measure nor its ratio-scale analogue. For example, Burstein et al. (2) did not report data on normal repeatability in their migraine study, and they defined heat allodynia as a change larger than or equal to the between-subjects SD (i.e., ≥3.8°C HPT and ≥6.8°C CPT) change. The definition of limits will obviously affect the prevalence of allodynia. Previously, the heat pain repeatability limit has been determined as 5.9°C in the hand (6). In the present study we extend these measurements to include both heat and cold pain thresholds at several body sites and we report CR for the hand, face, forehead and neck. An individual decrease in forehead HPT greater than 5.6°C or a relative decrease in HPTd to 67% or less would accordingly suggest definite thermal hyperalgesia (or allodynia) with our method. We recommend the use of such limits to define the development of hyper- or hypoalgesia in future longitudinal clinical studies.
Thermal allodynia in migraine has also been defined in other ways—for example, as absolute thresholds below 40°C or above 20°C (14) or below 42°C and above 18°C (15), or just by reporting the statistical group differences (16). It is apparently justified to score heat allodynia for thresholds below 40°C, because it is consistent with the observation that C polymodal nociceptors normally do not fire until a temperature of about 42°C–47°C has been reached (17). However, it has repeatedly been demonstrated (Table 2) that the lower normal HPT limit can be below 40°C. Even the most recent attempt to narrow the normal range for HPT and CPT by training sessions and elimination of expectation failed in this respect (7).
In addition, it is important to assess the influence of methodologically important details such as baseline temperature, thermode area, location, rate of change, ML calculation and substitution of floor- and ceiling-effect values (Table 2). An increasing rate of change may, for example, reduce pain thresholds with a reaction time (RT)-independent method (18), but this effect is counteracted by the RT-induced threshold elevation by the ML. It is accordingly recommended that each laboratory construct its own repeatability coefficients, particularly if methodological details differ from those used in published studies.
Repeatability between and within raters for cephalic thermal detection thresholds has previously been quantified and reported to be adequate (13). Reliability data on brush allodynia, also reported to be abnormal in migraine (19), is apparently unavailable, and repeatability data for mechanical punctate allodynia are also suprisingly sparse when we consider the reputed value of the latter method in pain research. Indeed, only a site × side × monofilament parameter seems to have been formally evaluated (20).
Test/retest correlation was reported to be similar in healthy subjects and nociceptive pain patients (8), although this method is inappropriate as a measure of repeatability (5). In addition, one study suggested that the chronic pain intensity level does not affect patients’ HPT and CPT (8). Our data suggest that pain thresholds habituate slightly when repeated after about one week; pain threshold habituation has also been found by others (21–24).
Our results are representative for women, and the data we present were collected as a normal range for a migraine study. This could be a limitation, but inconsistent data about possible sex difference in thermal thresholds have been reported in the past. Rolke et al. (25) found significantly lower HPT and CPT in women (effect size not reported), whereas other researchers did not report sex differences for thresholds obtained with ML (6,26).
We conclude that it is important to use appropriate methods to quantify normal test/retest reliability both for thermal testing and for the other sensory thresholds. Such information is mandatory to define cold and heat hyperalgesia or allodynia in the individual patient. Sensory thresholds cannot distinguish between central and peripheral hyperalgesia (or hypoalgesia). These measurements are nevertheless very useful in pain and headache research as well as for various clinical differential-diagnostic work-ups (27,28). More research is needed (29) in order to increase the reliability and specificity of sensory system quantification methods.
Footnotes
Acknowledgements
Marit Stjern, Grethe Helde, Gøril Gravdahl and Nikita Zhitniy provided technical assistance.
