Lighting for pedestrians: Does multi-tasking affect the performance of typical pedestrian tasks?

Abstract

Obstacle detection and facial emotion recognition are two critical visual tasks for pedestrians. In previous studies, the effect of changes in lighting was tested for these as individual tasks, where the task to be performed next in a sequence was known. In natural situations, a pedestrian is required to attend to multiple tasks, perhaps simultaneously, or at least does not know which of several possible tasks would next require their attention. This multi-tasking might impair performance on any one task and affect evaluation of optimal lighting conditions. In two experiments, obstacle detection and facial emotion recognition tasks were performed in parallel under different illuminances. Comparison of these results with previous studies, where these same tasks were performed individually, suggests that multi-tasking impaired performance on the peripheral detection task but not the on-axis facial emotion recognition task.

1. Introduction

Lighting for subsidiary roads is designed to meet the safety and perceived safety requirements of pedestrians.¹ Caminada and van Bommel² suggested that obstacle detection and evaluation of other people are critical tasks for pedestrians, and this is supported by a study using eye tracking to record pedestrians’ gaze behaviour.^3,4 Experimental research has therefore been carried out to investigate how the performances of these tasks are affected by changes in road lighting characteristics.

Table 1 shows four studies^5–8 which investigated the effect of changes in illuminance, spectral power distribution (SPD) and observer age on the detection of pavement surface obstacles in peripheral vision. The results of these experiments show that performance follows a plateau–escarpment relationship.⁹ At low illuminance, performance approaches the chance level: an increase in illuminance increases performance (as measured by detection rate or reaction time to detection), until a point is reached beyond which further increase in illuminance brings a negligible increase in performance. The transition point in these studies is at approximately 2.0 lx, as measured on the horizontal plane of the obstacle. Furthermore, observer age and SPD (as characterised using the scotopic/photopic (S/P) luminance ratio) affected detection only at illuminances lower than about 0.6 lx. A further review suggested the optimal illuminance to be 1.0 lx, this determined for a critical size of an obstacle (a change in vertical size of 10 mm), detected at a distance ahead of 3.4 m.¹⁰

Table 1

Past studies investigating the effects of illuminance and SPD on the detection of peripheral obstacles

Study	Experimental design				Sample	Lighting conditions
	Task	Observation period	Participant’s position in trials	Obstacle configuration		Target illuminances (horizontal)	SPD
Fotios and Cheal⁵	Forced-choice; which of six obstacles was raised.^a	300 ms	Seated	Six locations, raised, eight obstacle sizes (0.40–7.94 mm)	11 young, 10 old	0.2, 2.0, 20.0 lx	Three types of lamp (HPS and two types of MH lamp) (S/P = 0.6, 1.2 and 1.8)
Fotios and Cheal⁶	Forced-choice; which of four obstacles was raised^a	300 ms	Seated	Four locations, raised, eight obstacle sizes (0.40–6.31 mm)	4 young	0.20, 0.63, 2.00, 6.32, 20.0 lx	One HPS lamp (S/P = 0.6)
Uttley et al.⁷	Detection rate and detection height of a slowly rising obstacle	Continuous	Walking on a treadmill	One location, raised, seven obstacle sizes (0.5–28.4 mm)	15 young, 15 old	0.2, 0.6, 2.0, 6.3, 20.0 lx	Three S/P ratios (S/P = 1.2, 1.6 and 2.0)
Fotios et al.⁸	Forced-choice; which of four obstacles was raised/lowered	500 ms	Seated	Four locations, raised and lowered, five obstacle sizes for both directions (0.9–8.5 mm)	20 young	1.0 lx	One S/P ratio (S/P = 1.6)

In these tests, the task response was primarily yes/no (is there an obstacle) with location used to indicate a correct response rather than a false positive.

It has been proposed that facial emotion recognition (FER) is a suitable proxy for evaluating the intentions of other pedestrians.¹¹ This is operationalized as the ability to correctly identify the emotion portrayed by facial expression. Review of eye tracking data suggests pedestrians tend to fixate upon other people for a duration of about 500 ms and at a distance of 15 m.^12,13 Table 2 shows studies investigating the effects of changes in light level and SPD on performance of a FER task.^14–17 The results of these studies suggest significant effects of luminance and target distance on correct identification of facial expression but do not suggest an effect of SPD.

Table 2

Past studies investigating the effects of luminance and SPD on facial emotion recognition (FER)

Study	Experimental design				Sample	Lighting conditions
	Task	Observation period	Target distance	Facial expression^a		Luminances (measured on the face)	SPD
Fotios et al. 14	Forced-choice judgements of emotion and gaze direction	1000 ms	4, 10 and 15 m	6 emotions ^a	15 young 15 old	0.01, 0.1, 1.0 cd/m²	Two types of lamp (HPS and MH) (S/P = 0.6 and 1.8)
Yang and Fotios¹⁵	Forced-choice judgements of emotion	500 and 1000 ms	4 and 15 m	6 emotions ^a	20 young	0.01, 0.03, 0.1, 0.33, 1.0, 3.3 cd/m²	Two types of lamp (one HPS and two types of MH) (S/P = 0.6, 1.2 and 1.8)
Fotios et al.¹⁶	Forced-choice judgements of emotion	500 ms	4 and 15 m	6 emotions ^a ; in grey scale and colour versions	18 young	0.1, 0.33, 1.0 cd/m²	Two types of lamp (HPS and MH) (S/P = 0.6 and 1.8)
Li and Yang¹⁷	Forced-choice judgements of emotion	4000 ms	4 m	4 emotions b ; terracotta head models	30 young	0.33, 1.0, 2.0, 3.0, 10.0, 30.0 lx^c	Three types of lamp (HPS, MH and LED) (S/P = 0.6, 1.8 and 25.13)

In all cases, the six emotions were anger, disgust, fear, happiness, neutrality and sadness, displayed by 2D photographs.

In Li and Yang,¹⁷ the four emotions were happy, angry, sad and surprise, displayed by 3D terracotta head models.

Li and Yang reported illuminance rather than luminance.

While the results of FER studies tend to follow an escarpment–plateau relationship, for the typical interpersonal evaluation distance of 15 m performance at 3.33 cd/m², the highest luminance used in these trials,¹⁵ did not suggest the plateau had been reached. Whilst this, or higher luminance on the face might improve FER performance, it also raises the likelihood of glare, and thus optimal FER performance may be an unrealistic expectation. Therefore, an optimum luminance of 1.0 cd/m² was proposed for 50% correct identification rate when observing at 15 m distance.¹³ For FER at 10 m distance, a luminance of 1.0 cd/m² was suggested to be optimal,¹⁴ and at 4 m distance this was 0.33 cd/m².¹⁵

There are at least two limitations in these past studies. One limitation, associated with the FER studies, is that the targets being evaluated were 2D images of faces, these being photographs of actors displayed on a screen. Recognition accuracy is expected to increase when 3D information is available.¹⁸ However, merely exchanging a 2D image for a 3D model is unlikely to be of benefit as a static 3D face observed from a fixed viewpoint presents the same visual target as a 2D image of that same scene with the same lighting.

The second limitation is that past studies measured performance whilst instructing test participants to focus on one specific task. In natural situations, pedestrians are involved in multiple parallel tasks which reduces their attention toward any one task. Attention is the information processing capacity of an individual: attention capacity is limited and each task being performed requires a proportion of that capacity¹⁹ which means that when two tasks are performed concurrently, the performance on one or both tasks is therefore expected to be reduced if the available attention is insufficient.²⁰ Multi-tasking has been used in two studies related to driving. Bullough and Rea²¹ considered peripheral target detection in parallel with performance on a video driving game: they comment on the effect of changes in lighting but not the effect of multi-tasking on individual task performance. Fotios et al.²² considered detection of peripheral targets with simultaneous distraction tasks: they found a significant increase in reaction time to detection and a significant increase in missed targets in trials with distraction compared with a distraction-free control trial, but did not consider the effect of changes in lighting. The effect of multi-tasking on those tasks pertinent to pedestrians is unknown.

This paper reports two experiments carried out to investigate the implications of multi-tasking for two typical visual tasks of pedestrians, FER and obstacle detection. 3D face models were used to promote ecological validity but their static position and the absence of variation in light source position means that their use is not expected to be of significant advantage over 2D targets. The two experiments followed a similar procedure but with variations in levels of the independent variables.

2. Method

2.1 Apparatus

The test booth (Figure 1) was that used in previous work investigating obstacle detection.⁸ The floor contained a series of cylinders, normally flush with the floor surface, which could be raised or lowered to represent a pavement obstacle. Faces were presented above the rear wall at eye level.

Figure 1

Side section through the apparatus. Note, LED1 not used in the current work

The visible space inside the booth was of dimensions 1200 mm deep, 1200 mm wide, and 1200 mm in height, constructed from medium-density fibreboard (MDF). Visible vertical surfaces (side and rear walls) were matt black. The floor surface, upper and sides of the obstacles and inner surfaces of the tubular housing of each obstacle (which became visible when an obstacle lowered) were matt grey (Munsell N5, reflectance 0.2).

The faces were cast models of human faces.²³ These were fixed to a wheel (diameter = 800 mm) with 16 posts, installed behind the rear wall of the booth. The wheel was rotated by a servo motor, with rotation controlled to present a specific (or no) face for a given trial. The rear wall ensured that the observer could see only one target face, with the remainder hidden. The horizontal distance from the observation point to the face was 1290 mm.

The light sources were tuneable arrays of RGBW LEDs, identical to those used in previous work⁸ installed along the central line at three positions (Figure 2). In the current experiments, only LED2 and LED3 were used, and were used simultaneously in all trials. A vertical black screen above the participants’ eyes blocked direct view of these light sources from the observation position.

Figure 2

Plan view of apparatus. Note: (i) In experiment 1 all four obstacles were used; in experiment 2, obstacle 4 was not used. (ii) LED1 was not used in the current work but labelled here for consistency with previous work.⁸

Light source SPD was not varied in the current work. According to previous studies, it was expected that variation in S/P ratio would influence obstacle detection at horizontal illuminances ≤0.2 lx,^5–7 but variation in SPD would not affect recognition of facial expression at any light level.^14–16 The SPD used in this work had an S/P ratio of 1.6 (correlated colour temperature (CCT) = 2750 K, chromaticity x = 0.47, y = 0.41), chosen as the middle of the three levels used in previous research on obstacle detection.⁷

In trials the scene was observed for 500 ms, this being controlled using a pair of visual occlusion spectacles. 500 ms is the typical duration of fixation on other people.^11–13 Rather than investigate the effect of changes in observation duration, this single period was chosen to provide a degree of ecological validity. It was also the duration used in some previous studies of FER (see Table 2) and obstacle detection (Table 1) which aids comparison with those studies. In the open state, the spectacles allowed participants to look into the interior of the booth as if wearing normal clear glasses. In the closed state, details of observed scene could not be resolved but the lenses still transmit light as frosted glass.

The faces were 1:6 scale models of human heads, cast in light flesh-coloured resin. The face models have a luminance reflectance of 0.78 (see below for discussion of variation in skin tone). The visibility of facial features, facial contrast, is typically characterised using Michelson contrast for the mouth, brow and/or eye regions.^24,25 The current models exhibited a mouth contrast against the chin of 0.10; this is similar to the mean Michelson contrast of 0.12 calculated for the 151 Caucasian faces used by Russell.²⁵ However, note that luminance contrast for these models was a function of illumination geometry rather than variation in the reflectances of facial features. The vertical height of the face models from chin to the top of the head was approximately 36 mm and was viewed from a distance of 1290 mm. This configuration resembled a viewing distance of 10 m for a real-size head of height 216 mm, shorter than the suggested distance of 15 m due to apparatus constraints,¹³ which was also one of the distances used in previous work.¹⁴ When a face was rotated to the exposed position, 12 o’clock on the wheel, it was at the same height as the observer’s eyes. A chin rest was used to maintain a constant viewing position.

There were 11 different face models, varying by the emotion portrayed by facial expression (4 neutral, 4 happy, 1 sad and 2 angry) as shown in Figure 3. The models were fixed on radial posts of the wheel, positioned to face directly towards the observer during trials. Five posts of the wheel were left empty and were used for null condition trials.

Figure 3

Photographs of the 11 face models. These photographs were taken with the models in the apparatus in the position where they were exposed to observation during trials

The floor of the test booth simulated a pavement surface. The floor includes an array of 12 vertical cylinders (100 mm diameter) which were normally flush with the floor (Figure 2). Four of these (obstacles 1–4) were used in the current experiment. Using a servo-motor, it was possible to raise or lower individual cylinders by up to 25 mm in either direction. For the current experiment, the obstacles were only lowered, representing potholes; previous work demonstrated similar detection rates for raised and lowered objects of the same size.⁸

For the two experiments, obstacle 1 was the main target while obstacles 2–4 were used as distractors. The obstacles were intended to be detected in peripheral vision, with foveal fixation maintained towards the face targets. The distractor targets were used to avoid promoting a focus of attention to just one obstacle location.

Obstacles 1 and 4 were located on the centre line of the booth, directly ahead of the observer, with obstacle 1 furthest away from (1220 mm), and obstacle 4 nearest to (640 mm), the participant. Obstacles 2 and 3 were symmetrically located to the left and right of the centre line, at a horizontal distance of 1010 mm from the observer’s eyes. Visual angles to each obstacle, assuming the participant was looking directly at the face model presented at the back, are given in Table 3.

Table 3

Obstacle locations relative to fixation point

Target	Angular deviation of obstacle from fixation point (degrees)
Target	Down	Left/Right	Central angle
Obstacle 1	19.7	0	19.7
Obstacle 2 & 3	23.0	24.3	33.0
Obstacle 4	33.7	0	33.7

Note that only obstacles 1, 2 and 3 were used in experiment 2.

Between each trial, a masking noise was added to eliminate audible cues which might help participants to judge whether an obstacle appears or not. This masking noise was generated by an electric motor hidden beneath the obstacle field that switched on for two seconds coinciding with the resetting of the obstacle conditions (whether or not this actually involved a moving obstacle).

2.2 Test variables: Experiment 1

Four independent variables were involved in experiment 1: the location of the obstacle; depth of a pothole; light level; and emotion portrayed by facial expression.

Four obstacle locations were used in experiment 1 (Figure 2). Each obstacle was presented at each of five different depths, these following a geometric progression ratio of 1.58 (0.2 log unit steps) based on the Bailey–Lovie acuity chart,²⁶ and chosen to bracket detection performance from near 0% to near 100%. The sizes used in the apparatus (Table 4) were scaled to subtend the same visual angle as pothole depths of 4.0, 6.3, 10.0, 15.9 and 25.1 mm when observed 3.4 m ahead, with an eye height of 1.5 m above ground. The middle size, 10 mm, is suggested to be a critical size for trip hazards.¹⁰

Table 4

Size (height and depth) of the obstacles used in the experiment

Target	Depth of simulated pothole (mm)	Target size (min. arc)	Solid angle (sr)	Horizontal distance from eye to front edge of obstacle (mm)	Depth of test pothole (mm)
4 2 & 3 1	4.0	3.37	0.0002	640	0.9
			0.0001	1010	1.2
			0.0001	1220	1.3
4 2 & 3 1	6.3	5.34	0.0003	640	1.4
			0.0002	1010	1.9
			0.0001	1220	2.1
4 2 & 3 1	10.0	8.47	0.0006	640	2.3
			0.0003	1010	2.9
			0.0002	1220	3.4
4 2 & 3 1	15.9	13.44	0.0009	640	3.6
			0.0005	1010	4.7
			0.0004	1220	5.4
4 2 & 3 1	25.1	21.32	0.0014	640	5.7
			0.0007	1010	7.4
			0.0006	1220	8.5
Face model	n/a	72.84 (height)	0.0006	1290	n/a

In experiment 1, all 11 face models were available, of which nine were used in a test session, three positive (happy), three neutral and three negative (angry or sad). The three faces displaying positive and neutral emotion were randomly picked from the available four.

The test booth was lit from above by both LED2 and LED3. Two illuminances were used, 1.0 lx and 10.0 lx as measured on the top horizontal surface of obstacle 1 when flush with the surrounding pavement (Appendix 1). The average horizontal illuminances currently recommended for pedestrians and minor roads range from 2.0 to 15 lx.^1,27 For trials at 10 lx, vertical illuminance measured at the eye was 0.23 lx. An illuminance of 1.0 lx was suggested to be optimal for detection of pavement obstacles,^10,28 with negligible increase in detection with higher illuminances.⁷ The higher illuminance, being one log unit greater, was used to prompt an increase in performance if it were the case that previous work had underestimated the optimal illuminance.

At these illuminances, the luminances of the front of the face models were 0.16 and 1.65 cd/m², respectively, which brackets the suggested luminance (1.0 cd/m²) for optimal FER performance at 10 m.¹⁴ The effect of change in light level on task performance can be predicted using Relative Visual Performance (RVP).²⁹ Consider a young, female, Caucasian face,²⁴ with facial contrast averaged across the mouth, eye and brow regions of 0.314 (Weber contrast), subtending a target of 0.0006 steradians (in this apparatus, that simulated a distance of 9.2 m) to an observer age of 25 years. Adaptation luminance was estimated as the road surface luminance as recommended.³⁰ Figure 4 shows the change in RVP for road surface illuminances of 0.33, 1.0, 3.3, 10.0 and 33.3 lx, the extended range of illuminances used in experiment 2, the assumed diffuse reflectance of 0.2 giving adaptation luminances of 0.02, 0.06, 0.21, 0.64 and 2.12 cd/m², respectively. Figure 4 shows that for an adaptation luminance of about 0.21 cd/m² (3.3 lx) or above, further increase in adaptation luminance brings negligible increase in performance, whilst for lower luminances there is a rapid decline in performance. For experiment 1, it was therefore expected that performance on the FER task would be greater at 10 lx than at 1 lx.

Figure 4

Relative Visual Performance plotted against adaptation luminance for a facial contrast of 0.314, subtending a solid angle of 0.0006 steradians and an observer of age 25 years

These conditions are described using photopic measures, this being the manner in which lighting recommendations are given.^1,27 The FER task, being the fixation point, is a foveal task for which the photopic luminous efficiency function is appropriate. Given the low light levels and its peripheral location, it is more appropriate to define the obstacle task using the mesopic luminous efficiency function.³¹ Appendix 1 therefore also shows mesopic luminances for the obstacle, as calculated from the photopic luminances.³² Appendix 1 shows scalar and vector illuminances measured at the location where face models were presented, determined according to Cuttle³³: the vector/scalar ratio was about 3.3 in each case. The average luminance contrast of the target obstacle against its surround area was approximately 0.82.

2.3 Test variables: Experiment 2

Experiment 2 followed the same procedure as experiment 1 but with an extended range of light levels. The two light levels of experiment 1 were increased to five to better characterise the relationship between performance and light level. Specifically, 0.5 log unit steps were introduced below, in-between and above the two levels used in experiment 1 (Appendix 1).

Three further changes were made. Two changes were made to balance the trials and maintain a reasonable test session duration: obstacle 4 was excluded from the detection task and the number of face trials was reduced from nine to six. Three categories of facial expression were still presented to participants, reduced to two positive (happiness), two negative (one each, anger and sadness) and two neutral. The specific face models chosen were those achieving the highest rates of correct detection in experiment 1. The third change was that an additional small sample of faces were shown rotated on the vertical axis by 45° to either the left or right in addition to the straightforward position. The results of these rotated faces are not analysed in the current paper.

2.4 Test procedure

For each experiment, 30 participants were recruited from the students in the School of Architecture of the University of Sheffield. For experiment 1, they were aged 18–32 years, and in experiment 2 they were aged 17–31 years. An equal balance of male and female was used in both experiments. They received a small payment for taking part. Before starting the test, each participant was given an information sheet describing the experiment: if willing to proceed, a consent form was signed. Normal acuity (wearing corrective lenses if normally worn) and colour vision were confirmed using a Landolt-ring acuity test and the Ishihara colour test plates under a simulated daylight source (Verivide D65).

Digital photographs of each face were shown to the participant on a computer screen, one by one, with these photographs stating also the emotion conveyed by expression. Recognition of the emotion was then checked by showing the same images again, in a random order, but without the emotion being stated. The participant was then instructed to sit facing into the test booth and placed their head upon the chin rest. They put on the occlusion spectacles, which could be worn over their normal lenses.

The laboratory lighting was then switched off so that only the apparatus lighting was in use. A period of 20 min was allowed for adaptation to low light level. In this period, the experimenter first described the test procedure and then demonstrated the locations of the obstacles (four in experiment 1, three in experiment 2) and the corresponding response button to use when that specific obstacle was detected.

The participant then completed a practice session to confirm familiarity with the face expressions, conducted with the illuminance set to 10 lx. For experiment 1 there were 22 trials, this being each of the 11 faces presented twice, and for experiment 2 there were 12 trials, each of the six faces being presented twice. The faces were observed in random order. For these practise trials, the occlusion glasses were retained in the open state, i.e. the practice trials were not time limited. In test trials, the glasses opened for only 500 ms. Therefore, the final two practise trials allowed only a 500 ms exposure.

For a given trial, there were four steps. (1) With the occlusion spectacles in the closed state, the obstacle and/or chosen face was moved to the test position. (2) After a beep sound was played, the occlusion spectacles opened for 500 ms. During or immediately after this 500 ms period, the participant responded according to which target they had seen. To indicate the presence of an obstacle, a button was pressed (the button box had one button for each obstacle). To indicate a face had been seen, the participant stated aloud which expression it was. If neither a face nor obstacle was seen, the participant did not respond. (3) The spectacles then closed for 4 seconds, during which time the obstacle and/or face wheel moved back to the default position (no target displayed) and the light level was changed to that of the next trial. (4) The spectacles opened for 4 seconds, to help participants relocate the fixation point (face model position) and adapt to the new light setting. The spectacles then closed to initiate the next trial.

Each experiment included four types of target event (see Table 5). These were trials in which the target revealed was either a face only, an obstacle only, both a face and an obstacle, or neither – null condition trials in which neither a face nor an obstacle was presented.

Table 5

Summary of target presentations

	Target presented	Number of trials	Description
Experiment 1	Obstacle-only	25	Obstacle 1: five heights, each repeated twice Obstacles 2 to 4: five heights, each once only
	Face-only	27	Nine faces, each repeated three times
	Obstacle and face	25	Randomly picked 25 from 27 faces, and paired with 25 obstacle heights
	Null	23	No obstacle or face appeared
Experiment 2	Obstacle-only	20	Obstacle 1: five heights, each repeated twice Obstacle 2 and 3: five heights, each once only
	Face-only	18	Facing forward: six faces, each repeated twice. Facing 45°: six faces, once each in left or right directions.
	Obstacle and face	12	Six faces paired with obstacle 1: six faces paired with obstacle 2 or 3; no repeated trials. These dual task conditions always used the forward-facing face.
	Null	12	No obstacle or face appeared

In experiment 1 there were 200 trials, which included the four obstacle locations, each at the five pothole depths, and nine face models. The 100 trials shown in Table 5 were each repeated at two light levels. In experiment 2, the 310 trials included combinations of three obstacle locations, five pothole depths, six face models and null conditions. The 62 trials shown in Table 5 were each repeated at all five light levels. The sequential order of these trials (200 for experiment 1, 310 for experiment 2) was randomised. To reduce participants’ fatigue, a 5-minute break was offered after 100 trials (which took approximately 20 minutes to complete). Overall, the experiment took approximately 60 minutes (experiment 1) and 150 minutes (experiment 2) to complete for each participant, including the introduction, adaptation, practice trials and testing.

2.5 Data analysis

In these two experiments, the participants were asked to respond to two tasks, obstacle detection and FER. For the obstacle detection task, there were four within-subjects factors – obstacle position, obstacle depth, illuminance and task condition (single task or dual task). For the FER task, there were three within-subjects factors – facial emotion, illuminance and task condition (single task or dual task). The dependent variables are rates of correct identification of facial emotion and correct detection of obstacle position. For an obstacle detection to be correct, participants had to respond with the correct position: responses of the wrong position or a false alarm in null trials were both counted as incorrect responses.

The data analysed were the proportion of correct responses for each test participant. The normality of data distributions was checked by visual inspection of the distribution (histogram and box plot), checking skewness and kurtosis, and using the Shapiro-Wilk test. This suggested the data tended to be normally distributed and hence statistical analyses were carried out using parametric tests. An alpha level of 0.05 was chosen for all statistical tests.

3. Results: Experiment 1

3.1 Null condition

Null trials were those where there was neither a lowered obstacle nor face model when the occlusion spectacles opened. They were used to assess response bias, the tendency to say yes or no when unsure about stimulus detection (face and obstacle in this experiment), or random responding. False alarms within null condition trials could include obstacle response, a face response, or both (but this possibility did not happen in any trial).

In experiment 1, each test participant observed 27 null condition trials per illuminance, giving 1380 null condition trials in total (27 × 2 illuminances × 30 participants). Correct reactions to null condition trials (i.e. no response) were given in 1145 (83%) of these trials (Table 6). In 17% of trials, an obstacle false alarm was raised: this is a similar rate to the percentage of false alarms (13.7–24.8%) found in previous obstacle detection studies.^5,6,8 In four trials (0.003%), a face false alarm was raised.

Table 6

Responses in null condition trials in experiments 1 and 2

Experiment	Total number of null condition trials	Correct rejection	False alarms
Experiment	Total number of null condition trials	Correct rejection	Obstacle response	Face response
1	1380	1145 (83%)	235 (17.03%)	4 (0.003%)
2	1800	1611 (89.5%)	189 (10.5%)	4 (0.002%)

Note: These trials were repeated for each light level.

The sensitivity index (d′) is used to analyse how well the signal can be distinguished.³⁴ A higher d′ value indicates that the signal can be more readily detected while near zero suggests the performance was in a chance level, which might indicate that the participants did not concentrate on the task or the experimental design was not appropriate. Only the results of obstacle detection trials were used to calculate the d′ because the false alarm rate of facial emotion recognition task was extremely small. In experiment 1, a lowered pothole was correctly identified in 1998 (66.6%) of the 3000 trials in which it was presented. The average d′ score for all test participants was 1.44, which is within the range of previous work (1.06–3.28).^6,8 These data suggest that participants tended to report detection only when an obstacle was present and not respond when obstacles were absent.

3.2 Obstacle detection

A four-way repeat measures ANOVA was carried out with four independent variables being illuminance (two levels), task condition (two levels: single and dual), obstacle location (4 levels: back, left, right and front) and obstacle depth (5 levels: simulating 4.0, 6.3, 10.0, 15.9 and 25.1 mm) with obstacle detection rate as the dependent variable. The p-values produced from the ANOVA were corrected by Holm-Bonferroni adjustment to counteract the error of multiple comparisons.³⁵ The ANOVA results are shown in Appendix 2. If the ANOVA test revealed a statistically significant main effect or interaction, post hoc paired comparisons t-tests with Holm-Bonferroni correction were applied to assess the differences between levels on each variable.

The results suggested that task condition (p < 0.001), obstacle location (p = 0.004) and obstacle size (p < 0.001) have significant effects. The results do not suggest a significant effect of illuminance (Figure 5): the detection rates for 1 lx and 10 lx were similar (1 lx: mean = 65%, SD = 2.2%; 10 lx: mean = 66%, SD = 2.6%; p = 0.264).

Figure 5

The effects of illuminance and obstacle size on detection rate in experiment 1. Error bars show 95% confidence interval

Detection performance in single-task trials (74% correct, SD = 2.2%) was significantly better (p < 0.001) than performance in dual-task trials (58% correct, SD = 3.2%).

Four different obstacle locations were presented to participants. Obstacle 1 had the highest detection rate (70%, SD = 2.3%) while the obstacle 4 was the worst (60%, SD = 3.1%): the difference in detection rates between obstacles 1 and 4 was significant (p < 0.001). Obstacle 2 and 3 did not show a significant difference in performance (Obstacle 2: mean = 67%, SD = 3.1%; obstacle 3: mean = 67%, SD = 2.9%; p = 0.77) which validated the findings of previous work.⁸ After combining the results of obstacles 2 and 3, the difference between mid-distance obstacles (obstacle 2 and 3) and obstacle 4 was suggested to be significant (p = 0.016) but the difference with obstacle 1 was not significant (p = 0.23).

Five different obstacle depths were used in this experiment. Obstacle detection rate increased as the obstacle depth became larger, ranging from 28% (SD = 3.1%) for the smallest obstacle depth, which is about chance level (25%) to over 80% for the largest obstacle depth (Figure 5). Paired t-tests with Holm-Bonferroni correction suggested the differences in detection performance between successive increases in obstacle depth were significant (p < 0.002 in all cases).

One significant interaction was between illuminance and obstacle location (p = 0.001) (Appendix 2). For obstacle 2, the difference between two illuminances was suggested to be significant (p = 0.001), but the effect of illuminance was not significant for the other three obstacle locations.

Another significant interaction suggested in Appendix 2 was between task condition and obstacle size (depth) (p = 0.001). Figure 6 shows the detection rates increase as the obstacle depth became larger, from chance level to around 80% for both task conditions. The difference between obstacle depth and task condition was not suggested to be significant at the smallest obstacle depth (p = 0.493) but was significant for the larger four depths (p ≤ 0.001) with higher detection rates for the single task than the dual task.

Figure 6

Mean obstacle detection rates plotted against obstacle size for single-task and dual-task conditions in experiment 1. Error bar: 95% confidence interval

3.3 Facial emotion recognition task

Three variables were examined: face luminance (two levels: 0.16 cd/m² and 1.65 cd/m²), task condition (two levels: single and dual) and facial emotion (four levels: happiness, sadness, anger and neutral). The ANOVA results are shown in Appendix 3. The higher luminance had a significantly (p < 0.001) higher rate of correct facial emotion recognition (1.65 cd/m²: mean = 74.3%, SD = 2.46%, 0.16 cd/m²: mean = 61.2%, SD = 1.71%) as was predicted for a typical situation (Section 2.2). The statistical analysis did not suggest a significant effect of task condition nor facial emotion type.

All 11 face models (Figure 3) were used (four happiness, one sadness, two anger and four neutral) in experiment 1. Appendix 4 shows the recognition rates for each individual face model. The ANOVA test suggests a significant difference among the four happiness faces (p = 0.008) and among the two angry faces (p = 0.004) but did not suggest a significant difference among the four neutral faces (p = 0.709). Paired t-tests with Holm-Bonferroni correction suggested significant differences in recognition rates between happiness-1 and happiness-2 (p = 0.027), happiness-2 and happiness-4 (p = 0.021), anger-1 and anger-2 (p = 0.043).

3.4 Discussion

Experiment 1 investigated the effect of changes in illuminance (1 lx and 10 lx) on the performance of obstacle detection and FER tasks, and the impact of making both assessments simultaneously. For the obstacle detection task, there was no effect of light level. This suggests that performance reached a plateau before 1.0 lx, which agrees with previous work.^8,10 For the FER task, there was a significant effect of light level with better performance at the higher light level, which agrees with past work.^14,15 Regarding task condition, the effect was significant for obstacle detection but was not suggested to be significant for FER.

For obstacle detection, the results suggest performance is already at the plateau level and the data do not reveal the optimal illuminance: trials conducted at lower illuminance would explore this. The FER data do not suggest the optimal luminance has been reached – trials conducted at a higher level would explore this. Therefore, a second experiment was conducted using an expanded range of light levels (see Appendix 1).

4. Results: Experiment 2

4.1 Null condition

In experiment 2, each test participant observed 12 null condition trials per illuminance, giving 1800 null condition trials in total (12 × 5 illuminances × 30 participants).

False alarms where participants incorrectly reported an emotion were recorded in only four trials (false alarm rate = 0.002%). Correct rejection to null condition trials were 1611 (89.5%) in total (Table 6). False alarms where participants incorrectly responded detection of an obstacle occurred in 10.50% of these trials. This is lower than experiment 1 (17%) and also lower than previous studies (13.7–24.8%).^5,6,8

As with experiment 1, d′ can only be calculated for obstacle detection task as the false alarm rate of the FER task was near zero. Among the 4650 obstacle detection trials, the hit rate was 73.94% and the average d′ score was 1.82. This is similar to experiment 1 (1.44) and previous work (1.06–3.28).^5,6,8

4.2 Obstacle detection

Two three-way repeated measures ANOVAs were implemented in this analysis with three independent variables each. This was done instead of a four-way ANOVA because in the dual-task condition, obstacles were randomly paired with faces so that not every participant saw the same combinations. Two of the three variables in ANOVAs were the same, which were illuminance (5 levels: 0.33 lx, 1.0 lx, 3.3 lx, 10.0 lx and 33.3 lx) and obstacle size (5 levels: simulating 4.0, 6.3, 10.0, 15.9 and 25.1 mm). Additional variables in the ANOVAs were obstacle location (three levels: front, left and right) and task condition (two levels: single and dual). The Holm-Bonferroni correction and post-hoc paired-comparisons were applied to the results of both ANOVAs. The overall results were shown in Tables 7 and 8. Illuminance, obstacle size and task condition all revealed a significant difference while obstacle location not. The detection rates of all obstacle locations were nearly equal (mean = 75.8%, 75.7% and 74.0%, SD = 2.17%, 2.37% and 2.40%, for obstacles 1, 2 and 3, respectively). The difference between each location was not significant (p = 0.503).

Table 7

Results of three-way repeated-measures ANOVA in experiment 2, with illuminance, obstacle location and obstacle size as independent variables and detection rate as the dependent variable

Variable(s)	F-statistic (df)	p-value	Holm-Bonferroni corrected p-value threshold	Significant difference^a
Illuminance	12.113 (4, 116)	<0.001	0.007	Yes
Obstacle location	0.695 (2, 58)	0.503	0.05	No
Obstacle size	155.231 (4, 116)	<0.001	0.007	Yes
Illuminance × Obstacle location	3.513 (8, 232)	0.001	0.0125	Yes
Illuminance × Obstacle size	3.383 (16, 464)	<0.001	0.007	Yes
Obstacle location × Obstacle size	1.676 (8, 232)	0.105	0.025	No
Illuminance × Obstacle location × Obstacle size	1.869 (32, 928)	0.003	0.016	Yes

^aResult suggested to be statistically significant (p<0.05) according to a threshold corrected using Holm-Bonferroni.

Table 8

Results of three-way repeated-measures ANOVA in experiment 2, with illuminance, task condition and obstacle size as independent variables and detection rate as the dependent variable

Variable(s)	F-statistic (df)	p-value	Holm-Bonferroni corrected p-value threshold	Significant difference^a
Illuminance	10.303 (4, 116)	<0.001	0.007	Yes
Task condition	8.278 (1, 29)	0.007	0.017	Yes
Obstacle size	135.685 (4, 116)	<0.001	0.007	Yes
Illuminance × Task condition	5.438 (4, 116)	<0.001	0.007	Yes
Illuminance × Obstacle size	1.707 (16, 464)	0.042	0.025	No
Task condition × Obstacle size	5.947 (4, 116)	<0.001	0.007	Yes
Illuminance × Task condition × Obstacle size	1.425 (16, 464)	0.125	0.05	No

^aResult suggested to be statistically significant (p<0.05) according to a threshold corrected using Holm-Bonferroni.

Detection rates for five illuminances used in experiment 2 increased from 68.1% (SD = 2.45%) at 0.33 lx to 79.8% (SD = 2.23%) at 10.0 lx (Figure 7). However, at the highest illuminance (33.3 lx) performance dropped slightly to 75.3% (SD = 2.17%). ANOVA suggested that illuminance has significant effect on detection rate (p < 0.001). Thus, post hoc t-tests were carried out. Table 9 suggested that the performance at 0.33 lx differed significantly from 3.3 lx, 10.0 lx and 33.3 lx (p ≤ 0.001) but is not different to performance at 1.0 lx. For illuminances of 1.0 lx and above, the data do not suggest a significant difference, which suggests that the optimal illuminance is in the region of 1.0 lx.

Figure 7

The effects of illuminance and obstacle size on detection rate in experiment 2. Error bars show 95% confidence interval

Table 9

Post hoc paired sample t-test with Holm–Bonferroni correction for obstacle detection task under all illuminances in experiment 2

Horizontal illuminance (lx)	Horizontal illuminance (lx)
Horizontal illuminance (lx)	1.0	3.3	10.0	33.0
0.33	0.025	<0.001^a	<0.001^a	0.001^a
1.0		0.018	0.008	0.445
3.3			0.581	0.01
10.0				0.002^a

^aResult suggested to be statistically significant (p<0.05) according to a threshold corrected using Holm-Bonferroni.

Detection rate according to obstacle height ranged from 33.6% (SD = 4.01%) to 95.7% (SD = 0.84%) (Figure 7), similar to experiment 1. A series of paired t-tests with Holm–Bonferroni correction suggests that the differences between each obstacle size were significant (p < 0.001) except between 15.85 mm and 25.12 mm (p = 0.377).

As with experiment 1, the results of experiment 2 also suggest a significant difference between performance in the single task and dual task trials (p = 0.007). The detection rate in single task trials was higher (mean = 77.1%, SD = 1.98%) than in dual task trials (mean = 70.5%, SD = 2.99%).

There is one apparent anomaly in these data: performance at 33.3 lx is significantly lower (p = 0.002) than at 10.0 lx. The decline in performance was consistent for all three obstacle locations. Figure 8 shows performance on the single-task and dual-task conditions separately and shows that the decline in performance at 33.3 lx occurred with single-task trials but not with dual task trials.

Figure 8

Mean obstacle detection rates plotted against illuminance for single task and dual task conditions

4.3 Facial emotion recognition task

In experiment 2, a three-way ANOVA was performed with three independent variables: luminance (five levels: 0.05, 0.16, 0.53, 1.65 and 5.63 cd/m²), task condition (2 levels: single task and dual task) and facial emotion type (four levels: happiness, anger, sadness and neutral). Identification rate was the dependent variable. As above, the Holm-Bonferroni correction threshold was applied in the analysis. The results are shown in Table 10.

Table 10

Results of three-way repeated-measures ANOVA, with luminance, task condition and facial emotion type as independent variables and identification rate as the dependent variable

Variable(s)	F-statistic (df)	p-value	Holm-Bonferroni corrected p-value threshold	Significant difference^a
Luminance	56.655 (4, 116)	<0.001	0.007	Yes
Task condition	20.662 (1, 29)	<0.001	0.007	Yes
Emotion	37.968 (3, 87)	<0.001	0.007	Yes
Luminance × Task condition	1.510 (4, 116)	0.204	0.017	No
Luminance × Emotion	4.737 (12, 348)	<0.001	0.007	Yes
Task condition × Emotion	0.292 (3, 87)	0.831	0.05	No
Luminance × Task condition × Emotion	0.739 (12, 348)	0.713	0.025	No

^aResult suggested to be statistically significant (p<0.05) according to a threshold corrected using Holm-Bonferroni.

As shown in Figure 9, luminance, task condition and facial emotion type all revealed a significant difference. The rate of correct expression identification increased with increasing luminance, from 55.4% (SD = 2.15%) at 0.33 lx to 85.4% (SD = 1.93%) at 33.3 lx (p < 0.001) (Table 10).

Figure 9

The effects of luminance on identification rate in the second experiment. Error bars show 95% confidence interval

A series of t-tests were conducted to compare the luminance pairs (Table 11). These suggested that performance at 0.05 cd/m² is significantly lower than at higher luminance (p < 0.001); also, performance at 0.16 cd/m² is significantly lower than at higher luminance (p < 0.001) (the difference in performance at 0.16 cd/m² and 1.65 cd/m² confirms the result of experiment 1). However, at 0.53 cd/m² performance is not suggested to be different from under the higher luminance.

Table 11

Post hoc paired sample t-test with Holm–Bonferroni correction for FER under all luminance in experiment 2

Luminance on the face (cd/m²)	Luminance on the face(cd/m²)
Luminance on the face (cd/m²)	0.16	0.53	1.65	5.63
0.05	<0.001^a	<0.001^a	<0.001^a	<0.001^a
0.16		<0.001^a	<0.001^a	<0.001^a
0.53			0.188	0.026
1.65				0.135

^aResult suggested to be statistically significant (p<0.05) according to a threshold corrected using Holm-Bonferroni.

There was a significant effect of task condition on facial emotion recognition with a significantly higher (p < 0.001) percentage of correct identification of facial expression in the dual task (mean = 77.3%, SD = 1.64%) than in the single task (mean = 72.6%, SD = 1.67%). This suggests that during dual task trials, test participants tended to focus more attention onto the FER task at the expense of performance on the detection task.

Identification rates of all types of facial expression were above chance level, although the sad expression (mean = 56.0%, SD = 1.85%) was slightly lower than for the other expressions (happy: mean = 74.0%, SD = 1.79%; angry: mean = 77.3%, SD = 3.36%; neutral: mean = 82.7%, SD = 1.67%). Paired-sample t-tests suggested a significant difference between each type of emotion (p ≤ 0.01) except that the difference between happiness and anger was not suggested to be significant (p = 0.153).

5. Discussion

5.1 Summary of results

Two experiments were conducted to measure the detection of pavement obstacles and identification of emotion conveyed by facial expression under changes in light level. This extended previous work by conducting both tasks in parallel trials rather than as separate experiments: the next trial in a sequence could be obstacle detection, FER, both or neither. This was done to investigate the proposal that multi-tasking would reduce task performance and thus the extent to which this would affect the optimal light level determined from the data.

Experiment 1 used two light levels, photopic illuminances of 1.0 lx and 10 lx as measured at obstacle 1 (see Appendix 1). This did not lead to a significant difference in obstacle detection but, as predicted above using RVP, led to higher FER performance at the higher light level. The effect of task condition was significant for the obstacle detection task with lower detection rate in those trials when both tasks required a response than in those trials where only an obstacle was presented. For FER, there was no effect of task condition. This may be because the FER task was the fixation point.

Experiment 2 used five light levels (0.33 to 33.3 lx at obstacle 1, see Appendix 1). The change in illuminance led to a significant effect on obstacle detection, with a lower rate of detection at 0.33 lx than at higher illuminances. Note that the difference in performance between 1.0 lx and 10.0 lx was not suggested to be significant, which confirms the finding of experiment 1. Alongside with Fotios and Uttley¹⁰ and Boyce,²⁸ these results suggest that 1.0 lx is sufficient for pedestrians to detect trip hazards.

For FER, the effect of change in light level was also significant, with a progressive increase in identification rate at higher light levels. The results of experiment 2 suggested differences in FER performance for luminances in the range of 0.05–0.53 cd/m² were suggested to be significant, but not for luminances of 0.53 cd/m² or more. This is as expected according to RVP (see Figure 4). This optimal luminance of 0.53 cd/m² is slightly lower than that reported previously¹⁴ of 1.0 cd/m² for faces at a distance of 10 m. This may be a result of stimulus selection: in experiment 2 the five levels of face luminance did not include 1.0 cd/m² but stepped from 0.53 to 1.65 cd/m², while the previous study¹⁴ used only three levels of face luminance (0.01, 0.1 and 1.0 cd/m²) and thus offers a less precise estimate of the optimum.

The effect of task condition was significant for both the obstacle detection and FER tasks. For obstacle detection, performance was better when only an obstacle was presented, but FER performance was better in those trials where a face and an obstacle were presented simultaneously.

5.2 Multi-tasking and task performance

In typical laboratory trials (including those studies in Tables 1 and 2), the observer is required to focus on only one task, such as obstacle detection or FER, but not both. In natural situations, a pedestrian is required to multi-task to attend to multiple tasks, perhaps simultaneously, or at least does not know which of several possible tasks would next require their attention. The current work was designed to better resemble the natural situation, with responses required to one, both or neither of two tasks in a randomised order. The effect of multi-tasking on the performance of the individual tasks was determined by comparing results from the current work (single task trials) with those of previous studies (which were single task by default).

For obstacle detection, Figure 10 shows the results of experiments 1 and 2 along with three previous studies.^5–7 The results have been converted into visual angle (min arc) subtended at the observation point as these experiments used different apparatus and settings. Data used in this comparison (Table 12) were for obstacles in a similar location to obstacle 1 in the current work. For this comparison, only results from single-task trials in the current work are used (i.e. obstacle-only trials).

Figure 10

Obstacle height for 50% detection rate (in visual angle subtended at the eye) plotted against illuminance for three previous studies and the two current experiments (single task condition only). The conditions used for this comparison are shown in Table 12

Table 12

Conditions compared for five experiments of obstacle detection

Study	Light condition	Detection target	Fixation target	Obstacle configuration
Fotios and Cheal⁵	0.2, 2.0 and 20.0 lx (S/P = 1.8)	Obstacle 1 (10.5° off-axis at the centre line)	Static mark	Raised
Fotios and Cheal⁶	0.20, 0.63, 2.0, 6.32 and 20 lx (S/P = 0.6)	Obstacle 1 (10.5° off-axis at the centre line)	Static mark	Raised
Uttley et al.⁷	0.2, 0.6, 2.0, 6.3, 20.0 lx (S/P = 1.6)	1 (only one obstacle used)	Dynamic fixation target	Raised
Experiment 1	1 and 10 lx (S/P = 1.6)	Obstacle 1 (19.7° off-axis at the centre line)	3D face model	Lowered
Experiment 2	0.33, 1.0, 3.3, 10.0 and 33.3 lx (S/P = 1.6)	Obstacle 1 (19.7° off-axis at the centre line)	3D face model	Lowered

Note: Participants were all in young age (between 16 and 35 years old).

For the single task trials in the current study, test participants did not know, until the moment the occlusion spectacles opened, which of the two tasks they would be expected to undertake. As expected due to reduced attention, this led to impaired performance compared with previous studies where the task was known.

In the three previous studies,^5–7 detection performance increased as illuminance became higher and reached a performance plateau at around 0.63 lx. While performance in the two current experiments was impaired at all illuminances, requiring a larger obstacle size for 50% detection rate than the previous studies, it still suggests a performance plateau is reached at about 1.0 lx.

Figure 11 compares the current FER results with those of similar conditions (target distance and S/P ratio) in a previous study.¹⁴ The conditions compared are shown in Table 13. For this comparison, only results from single task trials in the current work are used (i.e. face-only trials). There is little difference between the current results and those of Fotios et al.¹⁴ for targets of similar luminance which suggests that the potential need to conduct an alternative or additional task did not affect performance.

Figure 11

Identification rate plotted against target luminance for two previous studies and current two experiments (only used single task condition data). The condition used to compare are listed in Table 13

Table 13

Conditions compared in Figure 11 for three FER experiments. The experiments simulated interpersonal distances of 10 m (Fotios et al.¹⁴) and 9.2 m (current work)

Study	Light condition	Fixation target
Fotios et al.¹⁴	0.01, 0.1 and 1.0 cd/m² (S/P = 1.8)	2D photographs
Experiment 1	0.16 and 1.65 cd/m² (S/P = 1.6)	3D model
Experiment 2	0.05, 0.16, 0.53, 1.65 and 5.63 cd/m² (S/P = 1.6)	3D model

Note: Observation durations were 1000 ms in Fotios et al.¹⁴ and 500 ms in the two current experiments.

The effect of multi-tasking as revealed by comparing performance on a task when only that task was conducted to performance on that task when a second task was also likely is suggested to be impaired performance on one task (peripheral detection task) but not the other (FER task). A similar conclusion was reached above in analysis of the task condition variable (single task versus dual task) using results of the current study only. The datum for both approaches to analysis is performance on a single, specific task; the difference is the comparator. In the former approach, it remains the same, single task, but with the uncertainty that it would be that task in the imminent experimental trial. In the latter it was performance of that task in the same 500 ms observation period as a second task.

The reduction in attention available to perform each task due to multi-tasking appears to have impaired one task but not the other. This may be a result of task priority. In experimental trials, this priority may be instructed by the experimenter, with the risk that participants do not follow such instruction,³⁶ while in natural settings the self-selected priority may depend on the consequences of impaired performance on each task. Attention is prioritised to stimuli which are threatening or feared.³⁷ If the unknown intentions of other people represent a greater threat or fear than does tripping, then priority attention would be devoted to the FER rather than the detection task leading to greater impairment on the detection task than on the FER task, as is seen in the results.

Rather than allocating the impairment of multi-tasking to a specific task type, an alternative explanation can be offered for task location: multi-tasking impaired performance of the off-axis but did not impair performance of the task located at the fixation point. If instead obstacle detection had been the task located at the fixation point, then task impairment may have changed. Specifically, this may have led instead to impairment of the FER task rather than the detection task. The approach used in the current work was intended to follow the typical experimental design of previous studies (FER being and on-axis task and obstacle detection being an off-axis task) and is suggested by eye tracking to be ecologically valid: if there is another person in the visual field, there is a tendency to look towards them.^3,4

5.3 Tripping risk

Evidence of gaze behaviour using eye tracking suggests a typical tendency to fixate on other people for about 500 ms. With gaze and attention focused on that person, there may be a risk of tripping over an unseen pavement obstacle. To successfully modify gait pattern and safely negotiate a detected hazard requires that it is seen at least two steps ahead, about 800–1000 ms.³⁸ The typical walking speed of a pedestrian varies with age, ranging from 1.25 m/s for a person aged 14–64 years, reducing to 0.97 m/s for people aged 65 and older.³⁹ Consider an obstacle located 3.4 m ahead, the typical distance for detecting hazards¹⁰ but which is not yet detected, and that the pedestrian spends the next 500 ms fixating another person. In that period they would typically walk distances of 0.62 m (younger) or 0.48 m (older). To walk the remaining 2.78 m (younger) or 2.92 m (older) would take a further 2.2 s (younger) or 3.0 s (older) which remains a longer duration than that needed to modify gait.

6. Limitations

The face models used in this study (Figure 3) comprised only male Caucasian faces. This was not a purposeful decision but a consequence of availability – attempts at 3D printing face models from a validated database did not produce models of sufficient resolution. This sample does not, therefore, represent female faces or non-Caucasian faces.

This raises the question as to whether gender and ethnicity matter for facial expression discrimination. If different skin tones lead to differences in facial contrast, then this may lead to differences in the ability to recognise facial expressions, with greater contrast leading to more rapid recognition. For the current work, an optimal luminance determined for one facial contrast may be suboptimal for a face with lower facial contrast. This was examined by comparing the RVP²⁹ for facial contrasts associated with different skin tones.

Facial contrast is characterised by the contrast of the lips, eyebrows and eyes against the skin immediately surrounding these features.^24,25 Note that while others report facial contrast as a Michelson contrast, here we use Weber contrast as is required to determine RVP. We used the young female faces reported by Porcheron et al.,²⁴ specifically the Caucasian and South African faces, which correspond approximately to types II and VI of the Fitzpatrick Scale.⁴⁰ Facial contrast was determined separately for each facial feature (eyes, eyebrows, mouth) and then averaged, leading to facial contrasts of 0.314 (Caucasian face) and 0.138 (South African face). The adaptation luminance was taken as the average the lit surface³⁰: an adaptation luminance of 0.6 cd/m² represents a road lit to an average illuminance of approximately 10 lx. To determine RVP, we assumed an observer age of 25 years and a target which subtended 0.0006 steradians, simulating an interpersonal distance of 9.2 m. RVP reduced from 0.94 for the Caucasian face to 0.87 for the South African face. In other words, the ability to discriminate the facial expression of a South African face at 10 lx is similar to that for a Caucasian face but at an illuminance of approximately 3.3 lx (Figure 4).

Gender is expected to influence facial emotion recognition because females tend to have higher facial contrast than males.²⁵ Balanced numbers of male and female targets were used in previous FER studies,^14–16 but did not report whether this influenced the results.

One limitation of the models (Figure 3) is that while the anger, sadness and neutral faces all had swept-back hair, this was not the same for all of the happiness faces: happiness face 4 had swept back hair but the other three have hair combed to the side. It may therefore be the case that discriminations were based on hairstyle rather than facial expression. If that were the case, happiness 4 would be more easily confused with the other facial expressions, and we would expect a higher error rate for happiness 4 than for happiness 1 to 3. This is not supported by the results: in experiment 1 the error rates for happiness faces 1 to 4 were 32%, 61%,42% and 38% (1.0 lx), and 10%, 37%, 27% and 15% (10.0 lx). Appendix 4 shows the recognition rates for the faces used in Experiment 1. As noted above, statistical analysis of correct recognition rates does not suggest a consistent difference between happiness 4 and the other three happiness faces.

Some pedestrians may choose to wear a hat, as protection against the weather or as a choice of style: none of the targets used in the current work wore hats or other head covering. A hat may influence perception of facial configuration, especially in the forehead region; hats may lead to an impairment in facial recognition⁴¹ which is why the current work sought recognition of facial expression rather than identity. A hat with a brim above the face has two implications for face evaluations under road lighting, both of which reduce the ability to recognise facial expressions: it may lower the overall luminance of the face, and it may reduce the luminance contrast of facial features and their shadows. Facial details may also be obscured by glasses and hands placed in front of the face.^42,43 Further work is required to identify the more critical of these possibilities hence to consider the impact of changes in lighting.

The accuracy of facial identification is maintained across a wide range of lighting directions but can be reduced by lighting from extreme directions.⁴⁴ In the current experiment, the light direction was fixed, from near-overhead sources (Figure 1). The vector/scalar ratio was about 3.3 for all cases (Appendix 1). Field measurements conducted in a subsidiary road to gain an idea of the typical range suggested vector/scalar ratios of about 3.5 for measurement underneath a lamp post, reducing to 1.0 when located midway between two successive lamp posts. Hence, the current experiment resembled observation of a face when that person was standing nearby a lamp post.

The face models used in this experiment were not designed nor validated for research purposes. This leads to two questions. First, were the different expressions repeatable? Table 14 shows the proportions of correct identification in different works. Ebner et al.⁴⁵ introduced the FACES database, photographs of actors portraying different expressions. In their evaluations, carried out under good lighting and without duration limit, the proportions of correct responses ranged from 0.68 to 0.96. A sample of the FACES images were used in a later study to compare expression recognition under different combinations of luminance and S/P ratio.¹⁵ Table 14 shows the correct response proportions for those trials with a face luminance of 0.33 cd/m², and averaged across the three types of lamp used. For trials simulating a 4 m distance, recognition accuracy (0.65 to 0.96) was similar to that reported by Ebner et al., but was greatly reduced (0.12 to 0.60) in those trials simulating a 15 m distance. In the current work, which simulated a distance of 9.2 m, the correct recognition proportions ranged from 0.63 to 0.83 which is between the rates found in previous work for evaluations simulating 4 m and 15 m.

Table 14

Proportion of correct identification of unique facial expressions as reported by Ebner et al.⁴⁵ and Yang and Fotios.¹⁵

Expression	Proportion of correct identification
Expression	Ebner et al.⁴⁵	Yang and Fotios¹⁵ (4 m)	Yang and Fotios¹⁵ (15 m)	Current work Exp. 1 (9.2 m)	Current work Exp. 2 (9.2 m)
Happy	0.96	0.96	0.58	0.68	0.74
Neutral	0.87	0.96	0.60	0.70	0.83
Angry	0.81	0.81	0.29	0.63	0.77
Fear	0.81	0.65	0.21	–	–
Sad	0.73	0.77	0.12	0.68	0.56
Disgust	0.68	0.71	0.17	–	–

Note: For Yang and Fotios these are data for face luminance of 0.33 cd/m², averaged across with targets scaled to represent interpersonal distances of 4 m and 15 m. For the current work the data are averaged across all combinations of light level and task condition. The expressions are listed in descending order as defined by the results of Ebner et al.⁴⁵

A second question about the validity of the face models is the degree to which they were confused with other expressions. Table 15 shows that for each expression presented in experiments 1 and 2, the correct response was given more frequently than incorrect responses. The data also suggest a response bias, where an incorrect response was given, and this was more likely to be the neutral expression than either happy, sad or angry.

Table 15

Proportions of responses given for each type of expression

Response	Proportions of responses given
	Experiment 1				Experiment 2
	Happy	Angry	Sad	Neutral	Happy	Angry	Sad	Neutral
Happy	0.68	0.07	0.03	0.04	0.74	0.06	0.01	0.02
Angry	0.11	0.63	0.03	0.10	0.09	0.77	0.03	0.02
Sad	0.05	0.04	0.68	0.13	0.04	0.04	0.56	0.11
Neutral	0.17	0.24	0.24	0.70	0.12	0.11	0.37	0.83

Note: Columns do not add to 100% due to misses – no response given after onset of target.

The tendency to say ‘neutral’ when unsure is a possible reason why the neutral expression received the greater proportion of correct responses in both experiments. In further work, this should be controlled for, either through the choice of visual target or by the frequency of presentation for each type of expression.

In this work, the obstacles were located in one of four locations. Whilst the order in which locations were used was randomised, the locations would have become familiar after a few trials. The effect of location familiarity can be seen in a study of intruder detection.⁴⁶ In their experiment 1, intruders were required to walk along the centre line of the test environment (an open field with fences to act as barriers for hiding behind) towards the observers; in their experiment 2, intruders were instructed to traverse the test environment in any manner they deemed likely to avoid detection. The results (their Table 18) show that detection distances were greater for the known (e.g. 86.8 m, HPS flood lighting) than unknown (60.4 m, HPS flood lighting) intruder routes. This suggests that obstacles are more easily detected in known or expected locations. Further work is required to determine whether this affects determination of optimal lighting for pedestrians.

This experiment used only a single lighting geometry. Variation in the relative locations of the obstacle, the lighting and the observer will change target contrast and shadow pattern. Previous work⁸ suggests this can lead to significant differences on detection rate, with a higher detection rate found when the light source was overhead and a lower detection rate when located in front of the obstacle. The current study used light sources at both locations (Figure 2) to average the differences.

Eye movements are proactive, seeking out the information needed for a task in the moments before that task is carried out.⁴⁷ In these experiments, test participants were required to fixate toward the location of a face model. It may be the case that they chose instead to fixate towards the obstacle rather than the face model, in particular on those trials where the face model was absent. Gaze behaviour was not measured in the current study: In previous work,⁴⁸ investigating gaze behaviour during peripheral obstacle detection it was demonstrated using eye tracking that when instructed to fixate towards a fixation mark, test participants tended to do so. However, that was for an experiment with only one task and with the fixation mark being present in all trials: further work is required to determine if this tendency to maintain fixation as instructed is maintained in trials involving different tasks at two locations or when the fixation mark (here, the face model) is absent. Gaze behaviour may further be affected by limitations on observation duration. Mean fixation durations are in the order of 200 ms to 500 ms,⁴⁹ but vary with task characteristics,⁵⁰ increasing in duration as task difficulty increases,⁴⁹ and can be as short as 120 ms.⁴⁷ In the current work, the observation duration was 500 ms. Shorter or longer observation durations may lead to changes in gaze behaviour. The results of one study using a search task⁴⁹ suggest that reductions in observation duration (from 3.0s, to 2.25 or 1.5s) did not reduce fixation durations. Test participants are capable of very brief fixation durations but may not do so as continuous maximum performance leads to stress.⁵⁰

Finally, while this article is phrased in terms of multi-tasking, only two tasks were considered. The need to attend to, or expect to attend to, more than two tasks would further reduce the attention for any one task and the expectation of the next task in a series, and in doing so may further impair task performance.

7. Conclusion

This paper describes two experiments set up to examine the performance of obstacle detection and FER tasks under different light levels. This extended previous work by the requirement for observers to consider both tasks in parallel rather than as individual tasks in separate experiments, thus better resembling the multi-tasking of natural pedestrian situations. To promote ecological validity, the faces used in this work were 3D models rather than 2D images.

Performance of the on-axis FER task followed prediction using RVP. At lower adaptation luminances, an increase in luminance increased task performance, but from an adaptation luminance of 0.21 cd/m² (a road surface illuminance of about 3.3 lx), further increase in luminance led to negligible increase in task performance. In the current work, this was a face luminance of 0.53 cd/m². Performance of the off-axis detection task followed that predicted in previous work with a lower rate of detection at 0.33 lx than at the four higher illuminances.

It was found that the potential need to carry out two tasks led to a reduction in performance of the peripheral detection task, but did not impair the foveal FER task. This was established by comparing the results of the current work with those of previous studies where each task was investigated in separate experiments. Despite the impaired detection performance, the current results reveal the same optimal illuminance as found in previous work, a horizontal illuminance of about 1.0 lx.

Footnotes

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was carried out with support from the Engineering and Physical Sciences Research Council (EPSRC) (grant number EP/M02900X/1).

ORCID iD

S Fotios

References

British Standards Institution. BS 5489-1:2013. Code of Practice for the Design of Road Lighting Part 1: Lighting of Roads and Public Amenity Areas, London: BSI, 2012.

Caminada

van Bommel

. New lighting considerations for residential areas. Journal of the Illuminating Engineering Society 1984; 13: 350–358.

Fotios

Uttley

Cheal

Hara

. Using eye-tracking to identify pedestrians’ critical visual tasks. Part 1. Dual task approach. Lighting Research and Technology 2015; 47: 133–148.

Fotios

Uttley

Yang

. Using eye-tracking to identify pedestrians’ critical visual tasks. Part 2. Fixation on pedestrians. Lighting Research and Technology 2015; 47: 149–160.

Fotios

Cheal

. Obstacle detection: A pilot study investigating the effects of lamp type, illuminance and age. Lighting Research and Technology 2009; 41: 321–342.

Fotios

Cheal

. Using obstacle detection to identify appropriate illuminances for lighting in residential roads. Lighting Research and Technology 2013; 45: 362–376.

Uttley

Fotios

Cheal

. Effect of illuminance and spectrum on peripheral obstacle detection by pedestrians. Lighting Research and Technology 2017; 49: 211–227.

Fotios

Mao

Uttley

Cheal

. Road lighting for pedestrians: Effects of luminaire position on the detection of raised and lowered trip hazards. Lighting Research and Technology 2020; 52: 79–93.

Boyce

Rea

. Plateau and escarpment: the shape of visual performance: Proceedings of the 21st Session of the CIE, Venice. CIE Publication 71, Vienna: CIE, 1987.

10.

Fotios

Uttley

. Illuminance required to detect a pavement obstacle of critical size. Lighting Research and Technology 2018; 50: 390–404.

11.

Fotios

Johansson

. Appraising the intention of other people: Ecological validity and procedures for investigating effects of lighting for pedestrians. Lighting Research and Technology 2019; 51: 111–130.

12.

Fotios

Yang

Uttley

. Observing other pedestrians: Investigating the typical distance and duration of fixation. Lighting Research and Technology 2015; 47: 548–564.

13.

Fotios

Uttley

Fox

. Exploring the nature of visual fixations on other pedestrians. Lighting Research and Technology 2018; 50: 511–521.

14.

Fotios

Yang

Cheal

. Effects of outdoor lighting on judgements of emotion and gaze direction. Lighting Research and Technology 2015; 47: 301–315.

15.

Yang

Fotios

. Lighting and recognition of emotion conveyed by facial expressions. Lighting Research and Technology 2015; 47: 964–975.

16.

Fotios

Castleton

Cheal

Yang

. Investigating the chromatic contribution to recognition of facial expression. Lighting Research and Technology 2017; 49: 243–258.

17.

Li T, Yang B. New empirical data for pedestrian lighting: effect on recognition ability on real 3D facial expression: Proceedings of CIE 2018 Topical Conference on Smart Lighting, Tapei, 26–27 April: 20182018: 106-111.

18.

Chelnokova

Laeng

. Three-dimensional information in face recognition: An eye tracking study. Journal of Vision 2011; 11: 1–15.

19.

Woollacott

Shumway-Cook

. Attention and the control of posture and gait: A review of an emerging area of research. Gait and Posture 2002; 16: 1–14.

20.

Pashler

. Dual-task interference in simple tasks: Data and theory. Psychological Bulletin 1994; 116: 220–244.

21.

Bullough

Rea

. Simulated driving performance and peripheral detection at mesopic and low photopic light levels. Lighting Research and Technology 2000; 32: 194–198.

22.

Fotios S, Robbins CJ, Fox SR, Cheal C, Rowe R. The effect of distraction, response mode and age on peripheral target detection to inform studies of lighting for driving. Lighting Research and Technology. First published 10 December 2020. DOI 10.1177/1477153520979011.

23.

Antheads Catalogue. Retrieved 17 December 2019 from http://www.antheads.co.uk/catguide/heads.

24.

Porcheron

Mauger

Soppelsa

Liu

Pascalis

Russell

Morizot

. Facial contrast is a cross-cultural cue for perceiving age. Frontiers in Psychology 2017; 8: 1–9.

25.

Russell

. A sex difference in facial contrast and its exaggeration by cosmetics. Perception 2009; 38: 1211–1219.

26.

Bailey

Lovie

. New design principles for visual acuity letter charts. American Journal of Optometry and Physiological Optics 1976; 53: 740–745.

27.

Commission Internationale de l’Éclairage. Lighting of Roads for Motor and Pedestrian Traffic. CIE 115:2010, Vienna: CIE, 2010.

28.

Boyce

. Movement under emergency lighting: the effect of illuminance. Lighting Research and Technology 1985; 17: 51–71.

29.

Rea

Ouellette

. Relative visual performance: A basis for application. Lighting Research and Technology 1991; 23: 135–144.

30.

Commission International de L’Éclairage. Interim Recommendation for Practical Application of the CIE System for Mesopic Photometry in Outdoor Lighting. CIE TN 007:2017, Vienna: CIE, 2017.

31.

Commission International de L’Éclairage. Recommended System for Visual Performance Based Mesopic Photometry. CIE 191:2010, Vienna: CIE, 2010.

32.

Yao

Fotios

. Effectiveness of an alternative model for establishing mesopic luminance. Lighting Research and Technology 2019; 51: 900–909.

33.

Cuttle

. Cubic illumination. Lighting Research and Technology 1997; 29: 1–14.

34.

Stanislaw

Todorov

. Calculation of signal detection theory measures. Behavior Research Methods, Instruments, and Computers 1999; 31: 137–149.

35.

Holm

. A simple sequential rejective multiple test procedure. Scandinavian Journal of Statistics 1979; 6: 65–70.

36.

Ranney

Mazzae

Garrott

Barickman

. Development of a test protocol to demonstrate the effects of secondary tasks on closed-course driving performance. Proceedings of the Human Factors and Ergonomics Society Annual Meeting 2001; 45: 1581–1585.

37.

Maratos

Pessoa

. What drives prioritized visual processing? A motivational relevance account. Progress in Brain Research 2019; 247: 111–148.

38.

Patla

Vickers

. How far ahead do we look when required to step on specific locations in the travel path during locomotion? Experimental Brain Research 2003; 148: 133–138.

39.

Knoblauch

Pietrucha

Nitzburg

. Field studies of pedestrian walking speed and start-up time. Transportation Research Record 1996; 1538: 27–38.

40.

Australian Radiation Protection and Nuclear Safety Agency (ARPANSA). Fitzpatrick skin phototype. Retrieved 3 July 2020, from https://www.arpansa.gov.au/sites/g/files/net3086/f/legacy/pubs/RadiationProtection/FitzpatrickSkinType.pdf.

41.

Freire

Lee

. Face recognition in 4- to 7-year-olds: Processing of configural, featural, and paraphernalia information. Journal of Experimental Child Psychology 2001; 80: 347–371.

42.

Drira

Ben Amor

Srivastava

Daoudi

Slama

. 3D face recognition under expressions, occlusions, and pose variations. IEEE Transactions on Pattern Analysis and Machine Intelligence 2013; 35: 2270–2283.

43.

Gros

Straub

. Human face images from multiple perspectives with lighting from multiple directions with no occlusion, glasses and hat. Data in Brief 2019; 22: 522–529.

44.

Liu

Collin

Burton

Chaudhuri

. Lighting direction affects recognition of untextured faces in photographic positive and negative. Vision Research 1999; 39: 4003–4009.

45.

Ebner

Riediger

Lindenberger

. FACES – a database of facial expressions in young, middle-aged, and older women and men: development and validation. Behavior Research Methods 2010; 42: 351–362.

46.

Boyce

Rea

. Security lighting: effects of illuminance and light source on the capabilities of guards and intruders. Lighting Research and Technology 1990; 22: 57–79.

47.

Land

. Eye movements and the control of actions in everyday life. Progress in Retinal and Eye Research 2006; 25: 296–324.

48.

Fotios

Uttley

Cheal

. Maintaining foveal fixation during a peripheral detection task. Lighting Research and Technology 2016; 48: 898–909.

49.

Hooge

ITC

Erkelens

. Adjustment of fixation duration in visual search. Vision Research 1998; 38: 1295–1302.

50.

Salthouse

Ellis

. Determinants of eye-fixation duration. The American Journal of Psychology 1980; 93: 207–234.