Abstract
Using a single human to supervise multiple robots helps to address manpower constraints while deriving the benefits of multiple-robot deployment such as efficiency and improved system reliability. However, it can also induce high supervisor workload and diminish situation awareness. This article explains workload and situation awareness. It reviews various studies related to human–robot systems to illustrate the effects and causes of workload and diminished situation awareness in such systems. The article reviews and discusses the application of automation to address workload and situation-awareness concerns. It also presents the issues that the use of automation can cause, highlighting that automation must be applied with care. The article advocates the consideration of sliding autonomy for four aspects of task execution: information acquisition, information analysis, decision selection and action implementation. It additionally encourages the appreciation for recognized methods of applying and triggering automation. The hope is for robots to be equipped with adjustable autonomy across multiple aspects of task performance to create robotic systems with highly flexible autonomy configurations. While robots from such systems may have the flexibility to deal with numerous situation requirements, the research challenge is understanding if and how such flexibility will affect human workload.
Introduction
This article explores the considerations of workload, situation awareness and automation when designing a system for single human supervision of multiple robots. A framework is proposed to assist system designers in recognizing possible automation types, how the automation may be applied and when the automation may be invoked.
In the first section the article explains the rationale behind such systems. The second and third sections discuss the related workload and situation awareness challenges, respectively. In the fourth section, it presents the complications and proposed methods of applying automation to address workload and situation-awareness pitfalls. In the fifth section, an existing single human multiple-robot system is discussed as as a case study to provide examples of the automation application concepts presented in this article. The last section is on conclusion.
Robots have often been suggested to be suitable for performing dirty, dull and dangerous work in place of humans. Such work may refer to toxic waste cleanup, surveillance and underwater or space exploration. Applications like these can benefit from the deployment not only of a single robot but also of a group of robots. The use of many robots brings about a number of benefits such as the ability to derive efficiency, 1 –3 improved system reliability, 4 the execution of tasks that are impossible with only a single robot 1 or simply an improvement in system performance. 5 –7
Robots can be imbued with autonomy. The Roomba vacuum cleaner robot 8 from the iRobot Corporation, for instance, is equipped with autonomy to sweep a floor space while avoiding obstacles. Current levels of artificial intelligence allow robots to perform impressive tasks such as intercontinental flight. The Northrop Grumman RQ-4 Global Hawk unmanned aerial vehicle (UAV) is an example of such a robot. 9 However, a robot even employing the current state-of-the-art technology has to be supervised. The current sentiment towards technology is that it is unable to replace the role of the human supervisor entirely 2 and that no system is flawless. At some point, a system is bound to perform erroneously. 10 Researchers such as Cummings and Guerlain, 11 Fong and Thorpe 12 as well as Sheridan 13 have all suggested that human supervision is necessary regardless of the level of autonomy in unmanned systems. In this article, robots are regarded also as unmanned systems.
The rationale behind single-human supervision of multiple robots
A robot or group of robots may be supervised by a group of people. For example, the General Atomics Aeronautical Systems MQ-9 Reaper UAV requires a pilot and a payload operator. Alternatively, robots can be supervised by a single human. In their analysis of the multiple unmanned vehicle deployment, Riley and Endsley 14 have suggested that the reduction in the ratio of human supervisors to unmanned vehicles deployed is a likely solution to performing functions such as reconnaissance and surveillance. This is because using a smaller human to robots ratio can help to make efficient use of possibly limited human resources. This addresses the motivation to reduce the number of personnel and the labour costs in work related to unmanned ground and aerial robots. 15 In some applications, human supervisors can be still exposed to the dangers of the robots’ mission environment despite a standoff distance between robots and supervisors. The development of a multiple-robot system that requires only a single human for robot supervision can help to reduce the number of people exposed to such dangers. For instance, in the description of their experience at ground zero in the aftermath of the September 2001 attacks, Murphy et al. 16 revealed that fire chiefs preferred as few as possible people in the disaster environment. Presumably, this was to prevent more casualties if the rubble collapsed further or if fires in the rubble resulted in explosions.
The cognitive challenges in multiple-robot supervision
While there are benefits in single human supervision of multiple autonomous robots, some drawbacks do exist. Supervising a single robot can already be challenging. The addition of more robots under the human’s supervision is likely to heighten this challenge. Consequently, the supervisor would experience more workload with the addition of each robot. 14,17 –20 There is then a greater propensity for the supervisor to fail to perform critical tasks. 21 Awareness for the actions of robots, the status of each robot as well as their progress in fulfilling the mission objective is critical when deploying robots. 22 However, maintaining the necessary situation awareness for every robot may be difficult as each robot competes for the supervisor’s limited attention resources. 2 Furthermore, the supervisor has to periodically switch attention from one robot to another. As attention is shifted, effort is needed to acquire awareness about the situation of the robot being attended to. This constant effort to realign the supervisor’s understanding of the robot’s situation requires time and may contribute to stress during time-critical deployments. That effort, in addition to maintaining a mental model of each robot’s status, progress and situation, distinguishes multiple-robot supervision from that of a single robot. The intensification of cognitive workload and competition of supervision attention resources can lead to a reduction in the supervisor’s level of awareness. 21 Impaired situation awareness in turn heightens the possibility for errors. 23
Workload
This section provides a brief introduction of mental and physical workload. It then presents the effects of high workload and also discusses possible causes of high workload during multiple-robot supervision.
Mental and physical workload
Workload is sometimes referred to as cognitive workload in order to highlight emphasis on mental exertion. Quite simply, it is the degree of being busy. 21 Although various attempts have been made to describe it, 24,25,26 there is yet to be a consensus in terms of a definition for cognitive or mental workload. 27 However, definitions of workload are predicated on the assumption of limited mental capacity for perception and response selection. As this finite supply of mental resources is depleted, mental workload increases. 21
Yet, workload not only pertains to mental activity but is related also to physical exertions. Some degree of physical activity is required even when using automated systems. Such activity may not be highly laborious but can nonetheless require movements of some complexity and time to execute. Therefore, neglecting the effects of physical effort even for tasks which are mostly cognitive, can be detrimental to system design.
In his review of studies on combining physical and mental workload, Tomporowski 28 indicated that aerobic activity in the low-to-moderate-intensity range can enhance information processing. This boost in mental performance may be seen in simpler tasks involving simple detection, visual search and discriminative choice response. 29 Alertness may be temporarily heightened following short-duration, submaximal physical exercises. However, it remains unclear if mental performance can consistently be boosted with the same physical exertions as the system user becomes increasingly experienced. In addition, physical activity has not been definitively found to benefit task aspects which are more complex, such as problem-solving and response selection. 28,30 Designers may wish to hold off on specifically incorporating physical tasks into the system in an effort to better cognitive performance. They may, however, feel less apprehensive about physical effort, especially if the activity is light and not prolonged.
The effects of high workload
Wickens and colleagues
21
presented four strategies that system users may adopt when too much workload is experienced: System performance may simply be allowed to degrade without any coping strategies. System performance is intentionally lowered so that experienced workload also decreases.
25
One way is to trade optimal approaches with satisficing techniques. Satisficing refers to the on-the-fly adoption of the most attractive plan even though the plan may not be the best in the long run. Humans may adopt this technique during problem-solving as continued evaluation of multiple plans can itself be costly and detrimental to task performance. It may instead be better to adopt a plan that is likely to work rather than incur the increasing (time and mental) costs of considering multiple plans. Satisficing was demonstrated in the multiple-missile control experiment by Cummings and Guerlain.
4
In that experiment, a single supervisor was tasked to monitor and assign missiles to targets. Participants when working under time pressure, directed missiles in a suboptimal fashion but were still able to meet mission objectives. Subtasks can be abandoned such that only those of high priority are performed. Subtasks may be randomly abandoned so that even those of high priority are neglected.
The degradation of system performance when workload exceeds capacity has been reflected in studies of human–robot systems. For example, heightened workload has been linked to longer mission completion times, increased number of collisions and supervisor errors in a multiple mobile robot deployment study. 17 In the study where robots were deployed to search for victims in an urban search and rescue scenario, the number of victims has been found to decrease as the workload increased. 31
The causes of workload in multiple-robot supervision
A number of factors have been suggested to affect supervisor workload within the domain of human-supervised robot deployment. These include task complexity, interface design and the levels of available autonomy. However, when a single human supervises a group of robots, it is the size of that group (or group size) that typically has the largest effect on workload. The following paragraphs discuss these factors.
First, in a study on multiple UAV deployment, high levels of task complexity (due to the processing of more targets) have been noted to increase supervisor workload. 32 Poor interface design can also contribute to increased workload. Chen et al. 19 suggested that in terms of the supervisory control of multiple robots, the effectiveness and usability of the interface can affect supervisor workload and system performance. For example, a study on soldier–robotic swarm interactions suggested that use of multimodal (visual, auditory and tactile) display techniques influences the effectiveness and efficiency of supervisors. 33
The level of autonomy available in the robotic system can affect supervisor workload as well. 2,34 Automation is often employed to address the problem of high workload. The use of automation to aid the human supervisor during multiple-robot deployment has been reported by Lif et al. 5 who provided some anecdotal evidence that participants in their experiment found robot autonomy useful. Wickens et al. 21 pointed out that the appeal of automation in high-time stress situations or in settings where there is a need to simultaneously handle multiple tasks is its potential in reducing workload.
The focus amongst researchers in terms of contributors to workload is the number of robots that a single-human supervisor can simultaneously deploy. 32 Studies which have attempted to relate group size to supervisor workload are numerous. For instance, Ruff et al. 35 examined the interaction between a human supervisor and up to four UAVs. They found that as more UAVs are deployed, perceived workload increases as well. Adams 17 presented a study in which participants deployed one, two and four mobile robots in a task to transport materials and saw that workload was significantly higher when four robots were deployed. In another effort, Trouvain and Wolf 20 examined workload as participants were tasked to deploy two, four and eight robots in separate runs to inspect areas within a given map. They found that workload increased significantly when participants deployed eight robots. Furthermore, the rise in workload when group size increases is expected to be exponential especially when it is necessary for the human to coordinate robot activities. 31
These studies are just a few examples of the works comparing workload levels as the human supervisor is tasked to deploy various numbers of robots. Many other research efforts 15,18,31,32,36,37 have reported similar results to the ones presented above, reinforcing the link between increased group size and heightened workload. Given that workload can only be increased up to a limit, it becomes evident that only a limited number of robots can be deployed by a single human supervisor. 26 The link between increased group size and added workload also suggests that solutions are required to address the desire to deploy more robots to boost system capabilities while preventing the workload demands induced by each robot from depleting the human supervisor’s cognitive resources.
The second section has introduced the concept of workload, pointing out that it consists of a mental and physical component. The section has also listed the behaviours that system users may adopt when faced with high-workload situations. These behaviours may not necessarily be elegant coping mechanisms and system performance will likely deteriorate should workload remain high. To help users manage workload, system designers should consider aspects such as task complexity, interface design and the available levels of autonomy in the system. More importantly, however, they should limit group size as its role in affecting workload is significant.
Situation awareness
This section first explains situation awareness and presents the effects of poor situation awareness. It then lists and discusses the causes of diminished situation awareness.
Defining situation awareness
Together with workload, situation awareness is also critical in influencing the ratio of human supervisors to robots. 38 According to Endsley, 39 situation awareness consists of three levels: perception, comprehension and projection. Cues are first perceived with human senses. These cues are used to comprehend the situation. This understanding of the situation is then employed to project what will happen in the near future.
Endsley’s definition of situation awareness is highly popular amongst researchers and is suitable for various applications. However, it does not point specifically to what the human supervisors of robots should be aware of. Drury et al. 22 presented a framework to define awareness for human–robot systems. The framework consists of five types of awareness, but only two are directly related to awareness required by the single human supervisor of multiple robots. The other three types of awareness include awareness associated with human and human collaboration, robot awareness of human commands and awareness that robots require to coordinate amongst themselves. The first of these two related types of awareness is human awareness of the robots. Drury and colleagues suggest that this awareness refers to knowledge of robot locations, identities, activities, statuses and surroundings. The second type of awareness is the knowledge required for the overall mission. This refers to the understanding of the goals of the mission as well as the progress made towards the goals. The framework presented by Drury and colleagues helps define situation awareness within the multiple-robot deployment and supervision domain.
The effects of poor situation awareness
Diminished situation awareness can result in a few problems such as poor decision-making, errors as well as slow response to the situation. The paragraphs below discusses these possible effects.
Situation awareness has been identified as ‘the major factor driving the quality of the decision process’. 40 The comprehension of a particular situation is the basis of an effective choice. 41 Its absence can result in poor decision-making and subsequently, erroneous task execution. The study by Yanco et al. 23 on four teams participating in a rescue robotics competition provides an example. One of the human supervisors in that event was found to have had poor awareness of the robot’s orientation, resulting in more robot collisions. The errors were due to the lack of awareness about the robot’s on-board camera’s orientation. Unbeknownst to the operator, the camera had been pointed off the centre. The supervisor’s lack of awareness resulted in wastage of more than half the team’s runtime and also fewer victims found. At the same event, the inability of another robot to detect the presence of Plexiglass resulted in poor situation awareness for the robot’s supervisor. This resulted in the robot being commanded to charge through the Plexiglass panel.
The possibility for errors is especially high when workload exceeds a critical level. 40 Endsley and Jones proposed that this is because the supervisor can only attend to a portion of the pertinent information. Alternatively, it may also be due to flawed or imperfect information perception and assimilation. Measures are necessary to prevent cognitive workload from exceeding the critical level. Errors have also been known to be closely related to stress, 21 which itself is a contributor to diminished situation awareness. The link between stress and impaired situation awareness is presented later in the section on the causes of poor situation awareness.
In their discussion of the situation-awareness requirements for deploying multiple collaborating robots, Riley and Endsley 14 pointed out that the lack of situation awareness can also retard intervention response. System problems may be less quickly detected and remedied as system users use more time to become mentally realigned with the situation to establish the nature of the problem. Burke and Murphy 42 reported that operators with good situation awareness were nine times more likely to spot victims during an urban search and rescue (USAR) task.
The causes of poor situation awareness
There are many factors that can contribute to decreased situation awareness. Endsley 39 suggested that such factors can include interface design, system capability, stress, workload, complexity and automation. In addition, Jones and Endsley 43 found that factors like inadequate mental models and inexperience can hinder situation awareness. This suggests that a lack of adequate training is also detrimental. The above-mentioned factors are discussed here. Some of these factors are interrelated and so these correlations are also pointed out within this discussion.
Interface design
Interface design directly affects information volume and compatibility. 39 An effective interface (combined with adequate training) can support situation awareness during routine system work as well as during instances of automation failure. 44 An interface that provides too little information starves the system user of the necessary information. 45 Yet, too much information presented could mean that only a small portion of the presented information is integrated to become situation awareness. This is because the integration of the presented information can be curtailed by how much the human supervisor can actually perceive and remember in the short-term. 37
The interface can also control how palatable or compatible the presented information is. For example, presenting remaining fuel of a UAV, in terms of remaining flight time or flight range is more compatible to situation awareness than presenting fuel in terms of weight. 39 Yanco et al. 23 found examples of interfaces with poor compatibility within two teams in a rescue robotics competition which placed video and laser or sonar ranging information on opposite sides of their interfaces. Poor fusion of the information on the part of interface design resulted in the system user having to mentally combine the provided information rather than have the interface provide the integrated picture. The interface should facilitate the situation-awareness acquisition with minimal workload. 39,46
System capability
Endsley’s 39 version of system capability refers to how well the system is able to accurately capture the true situation. At best, the system is only as capable as the system designer intended it to be. A system that is limited in capability may not capture the information required for later assimilation as situation awareness. A team in the rescue robotics competition demonstrated this when their robot was not able to inform the human about what was behind the robot. 23 This was due to the absence of the required sensors at the robot’s rear. This limitation resulted in collisions when the robot was commanded to reverse.
Limited system capability can also increase workload required to acquire the adequate awareness. Consider, for instance, the added hassle of having to instruct a robot without a rear-facing camera to do an about turn so that the forward-facing camera can view what’s behind the robot. The aim of this discussion is not to suggest that the robot should be equipped with all sorts of imaginable sensors. Rather, the aim is to point out that the responsibility is the system designer’s to ascertain how best to equip the robot with the sensors needed for the task while still staying within payload and size limits.
Stress
Stress is a common phenomenon associated with mental strain. The deployment of multiple robots in situations such as combat 47 or search and rescue 48,49 will certainly be stressful for human supervisors. Endsley 39 listed numerous causes of stress. These stressors can be divided into two groups: physical stressors and social physiological stressors. Physical stressors include noise, heat (or the lack of), poor or excessive lighting, atmospheric conditions (such as rain), boredom and fatigue. Social psychological stressors can include fear, uncertainty, the importance or consequences of events, cognitive workload, time limitations and frustration. 21
Endsley claimed that some stress can be beneficial by boosting alertness or vigilance for critical situation elements. Still, exceedingly high stress levels can highly detrimental. For instance, stress can result in the tunnelling of attention where focus is neglected for situation aspects which are less salient 21,40 but not necessarily less important. Stress can also cause people to jump to conclusions, 40,50 especially in the case of time-related stress. 21 Riley et al. 50 add that in USAR situations, stress can lead to cues pointing to the presence of victims being ignored.
Many of the physical and social psychological stressors listed by Endsley have been claimed to exist in USAR. Murphy 51 described her experience of using rescue robots at ground zero in New York as stressful due to time pressure, cognitively demanding as well as fatiguing and emotional. She also pointed out that the consequences of an error such as failing to spot a victim could be equivalent to condemning a survivor to death.
It is evident that the use of robot systems should not be a cause of stress to the robot supervisor. Although the system may not be able to reduce all the physical or social psychological stressors, it should be designed to prevent workload from becoming excessive, thereby limiting its effects as a stressor.
Workload
Endsley 52 has proposed that unless workload accumulates beyond a critical amount, situation awareness is largely independent of workload. Beyond the maximum workload capacity, situation awareness becomes increasingly hampered.
Since excessive levels of workload can limit the situation awareness that the human supervisor can acquire, the factors causing increased workload may also contribute to decreased situation awareness. Earlier in the section on the causes of workload in multiple-robot supervision, a number of contributing factors to workload have been presented. Most of these factors (such as interface design and the use of automation) are also presented here as factors which affect situation awareness. However, the earlier discussion on contributors to workload has also revealed the immense amount of research linking increases in robot group size to rising workload levels. This suggests that if group size is increased to a point where workload becomes excessive, situation awareness can also be negatively affected. For example, in a multiple-robot deployment study, 15 participants reported not only a higher level of workload when the number of robots was doubled from four to eight but also poorer situation awareness when supervising eight robots.
Complexity
Endsley 39 indicated that system complexity affects both workload and situation awareness. Complexity is related to the number of components in the system, the scale of interactions amongst the components and the pace at which changes occur in the components. A person’s task increases in complexity with the number of tasks to be performed as well as with the number of decisions required. As complexity increases, so too does the demand on cognitive resources required to maintain adequate situation awareness. When examining interface issues with UAV deployment, Olson and Wuennenberg 46 suggested that increased complexity can mask the logic behind system behaviours.
Inadequate training and experience
Exposure to system use is crucial. Jones and Endsley 43 reported instances of diminished situation awareness due to the lack of or incomplete mental models. Due to this, inexperienced users can misperform a task. Investigation into the 2009 loss of Air France Flight 447 has revealed that one of the less experienced pilots may not have had the correct mental model of the aircraft’s operation mode and wrongly assumed that the on-board automation would perpetually keep the aircraft within its flight envelope. Tragically, the Airbus A330 stalled and crashed into the Atlantic.
Users who are less familiar with the system may additionally require more steps and time to acquire the requisite awareness, thus contributing to workload and time stress. It is important to recognize also that qualified users may be complacent. They can additionally be overly reliant on default procedures or values, thereby missing cues when changes occur. For novice and expert users, training and regular refresher sessions are important to maintain high operator readiness levels.
Other factors
Situation awareness can be compromised due to some other factors such as fatigue or disruptions 43 which are less obvious. First, the user may be fatigued and be less attentive to cues. Fatigue can also result in memory lapses. Second, interruptions or distractions that prevent smooth execution can add to time stress. Proper reacquisition of the situation after the interruption can be difficult. If such interruptions are unavoidable, then lowered system complexity may be an important design consideration so that system users can call upon reserve attention resources and time, should interruptions occur.
The third section has introduced the concept of situation awareness within the domain of multiple-robot supervision. It has also shown the possible effects of diminished situation awareness, such as errors and slow response. To prevent these effects, a number of sources to impaired situation awareness have been identified and discussed in the preceding paragraphs. While automation has been identified as a factor influencing situation awareness by Endsley, 39 its role is highly complex and thus warrants a discussion of greater detail. Automation and its use in human-robot interaction (HRI) are presented in the fourth section.
Applying automation in human-supervised robot deployment
This section describes the use of automation to reduce human workload and to mitigate the loss of situation awareness. It also introduces the automated evaluation of the human–robot system based on a priori established models of such systems. Additionally, the section presents some issues associated with using automation. The latter part of this section presents sliding autonomy in further detail and provides a summary of what, how and when to automate.
Using automation to address workload and the loss of situation awareness
The use of automation can mitigate workload by performing certain functions for humans. 21,53 Traditionally, it is thought that the more automation is used, the lower is the level of workload. 34 This may not necessarily imply that automation is good only for replacing the human entirely or that automation is used only for jobs that humans cannot perform at all. Rather, automation can serve to assist (instead of replace) humans and also to address tasks that humans can do well only at the expense of increased workload. 21 Squire and Parasuraman 15 suggested that supervising multiple robots will ‘consequently increase the cognitive workload demands on operators and automation will be needed to support effective and safe human-system performance’.
Implementing automation is thought to be a commonly used approach to allow the deployment of greater numbers of robots without overloading the supervisor’s attention resources. 54 This is because the supervisor is relieved of mentally processing aspects of the task handled by the automation. Automation also replaces the supervisor’s need to select the relevant response for the robot, 2 reducing the number of physical actions that must be performed (i.e. the physical component of workload). If the number of responses that the supervisor has to perform is reduced, then it could be possible to increase the number of robots that can be supervised by a single human. 17
Workload can also be reduced with other methods besides the removal of some aspects of mental processing or response requirements from the human. A multiple-resource model to mental processing 21 suggests that perceptual cues related to task execution may be provided along more than one sensory modality or channel. In such a manner, information can be processed mentally in parallel rather than in a serial manner when information is restricted to being presented along only one sensory modality. An example of information presentation with multiple-sensory modalities can be found in military aviation. Successful missile lock on an enemy target is indicated to the pilot via an audio tone in the cockpit. That event is thus communicated to the pilot via the auditory channel, leaving the visual channel free to process other aspects of flying the aircraft.
Haptic displays are also relevant when considering the multiple-resource theory. Rösch et al. 55 presented a set-up for using a force-feedback joystick to control a mobile robot via the Internet. The joystick vibrated according to the magnitude of the force acting on the robot’s wheels, giving the remote operator information regarding the resistance faced by the robot. The multiple-resource model has been tested within the domain of multiple-robot supervision. An experiment conducted by Dixon et al. 56 showed that participants in a dual-UAV supervision task experienced less workload when information was presented via the auditory channel in addition to the typical visual sensory channel. However, it is important to note that cues presented in parallel even via different sensory modalities may not share attention resources perfectly. Tasks which are high in complexity can negate the benefits touted by the multiple-resource model. 21 Furthermore, the selection of responses to two competing tasks occurs only serially. The response to one of these competing tasks is inevitably delayed, but the selection of a response may still occur in parallel with perception. 57
The reduction of workload through automation use can also assist in mitigating workload. As described earlier in the section on the effects of poor situation awareness, excessive workload can limit situation awareness. Therefore, the prevention of excessive workload through automation can assist also with mitigating losses in situation awareness. There are, however, also automation techniques more specifically targeted at achieving good situation awareness.
For instance, Calhoun et al. 58 introduced automation to enhance the human supervisor’s situation awareness of the mission space while deploying multiple UAVs. Their effort produced automation to address the problem of disorientation when the human switches from interacting with one UAV to another. The resulting change in UAV camera imagery may cause confusion regarding the newly selected UAV’s position and heading. What Calhoun and colleagues did to address such confusion was to display the disengagement of interaction from one UAV by zooming out from a computer animation of the UAV. This would create a view in the animation, showing the deselected UAV’s position in the mission space. The animation would then zoom in towards the newly selected UAV, informing the human about this UAV’s position and heading relative to the previously selected UAV and to the mission space. This animation aid was found to have assisted in enhancing spatial awareness and reduced the time needed by experiment participants to acquire desired views of the ground.
Another example of automation aimed at boosting situation awareness sought to support airspace monitoring involving multiple combat UAVs. In that effort, Fern and Shively 59 found that a graphical map overlay on the tactical situation display significantly assisted in promoting situation awareness of the airspace compared to a text-based Internet chat room style interaction.
Model-based approaches to automated evaluation of human–robot systems
This review of relevant work on human–robot interaction has so far focused on empirical studies. However, computational studies also contribute to understanding human–robot systems by revealing subtle interactions between factors that may not be as easily detected with empirical techniques. Some of these computational studies have been geared towards the use of mathematical models. These models may be proposed based on the understanding of operator behaviour and making some assumptions about interaction episodes. It is also possible to derive models based on sets of data recorded from empirical studies. The resultant model (either proposed or derived) can then be used to evaluate the correctness of operator actions as well as the performance of the human–robot system.
An early example of a proposed model is that by Crandall et al. 60 for predicting the number of robots that a single human can supervise. Crandall’s team termed this number ‘fan-out’ and has claimed that it depends on neglect time (NT) and interaction time (IT). NT is defined as ‘the expected amount of time that a robot can be ignored before its performance drops below a threshold’ while IT means ‘the expected amount of time that a human must interact with a robot to bring it to peak performance’. The model assumed that the human interacted with the robots one at a time in sequence.
Breslow et al. 61 extended upon this model by considering the possibility of a highly dynamic mission environment. They postulated that instances where multiple robots will simultaneously require the human’s attention are likely. This leads to temporary overloading and can result in erroneous response from the human. The proposed model from Breslow’s team considers the available time which the human has for remedying a robot’s problem, the IT needed to make the necessary corrective actions, the time taken to realize that a particular robot requires attention as well as the time spent on manual operations not directly related to the robot in focus. The experiment conducted showed that their model could accurately predict instances where multiple robots would suddenly require interaction with the human. This model may thus be a useful tool for providing early notification of converging attention demands.
Another example of the model-based approach is the study by Boussemart and Cummings. 62 The resulting model was derived from the data recorded from an earlier empirical study. The model used hidden semi-Markov models (HSMM) to generate a predictor of operator actions in the domain of human-supervised automated systems. The HSMM predictor is also able to model the duration of a certain behaviour, thereby additionally evaluating the timeliness of operator actions. The system would score poorly in terms of model accuracy if consecutive operator actions and the timeliness of those actions do not match well with the predicted response. Consequently, suitable automation can be invoked to alert or assist the human when there is a mismatch between anticipated and actual responses.
Such system models can represent individual preferences and nuances so that automation can even be designed with the ability to compensate for individual weaknesses. If the design of automated system software also considers the models for operator behaviour, then such software may achieve validation from users more readily. 63
Still, it should be noted that such computational models may not yet represent the full myriad of (beneficial and improper) operator responses. It is thus possible for a model to score some responses poorly even when those responses are appropriate. The onus to ensure that operator actions are correct should not at this time, fall solely upon such models.
When comparing model-based approaches with empirical methods in human–robot interaction, the latter appears to be more common. This is likely due to the difficulty in producing an accurate and robust model of human interaction with automation. However, in addition to attempting to model the deployment on a macro scale, the modelling approach can also be applied to specific techniques such as that by Calhoun et al. 58 (described in the section on using automation to address workload and the loss of situation awareness). The automation aid by Calhoun’s team may then be implemented to occur at different speeds according to the model of a particular user’s interaction style for instance.
Issues with automation use
Despite the potential of automation for reliving workload, supporting situation awareness and evaluating operator actions during runtime, it can sometimes instead impose workload and degrade situation awareness. When it is improperly applied, automation can, in particular, result in diminished situation awareness. 40 Highly automated systems can also cause problems such as the reduction of user vigilance, 14 complacency, 64,65 being out of the loop 45 as well as skill loss or degradation. 19
The decrease in user vigilance that stems from automation-induced prolonged inactivity can result in users being disinterested in task monitoring. Events and changes go undetected because of a lack of direct task control. 64 This relationship lends further support to the work mentioned in subsection on mental and physical workload which reported improved cognitive performance that accompanies low to medium physical workload.
Complacency or overreliance results from a sense of overtrust in the automation’s ability to perform the task flawlessly. Users of highly reliable automation may be prone to being complacent. 64 Yet, the failure of such reliable systems is always possible. According to Wickens et al., 21 ‘an operator who expects that automation is doing its job will be less likely to monitor the job it is doing – losing awareness of the evolving state of, or surrounding, the automated system’. A study by Chen and Joyner 66 found that participants in a target detection task relied too much on an automatic aid for target detection, scoring more poorly than when the automatic target recognition feature was unavailable.
When automated systems are working, they may inhibit users from practicing the skills required to perform their tasks manually. This results in skill degradation over time and the inability to apply the required know-how when automation fails. This know-how may include the procedure to efficiently call up system information and acquire situation awareness. When users lack the skills to address automation failures, the consequences can be more severe than if no automation had been available in the first place. 67
In the domain of USAR, the out-of-the-loop syndrome is, in particular, one of the most pertinent effects of situation-awareness reduction that stems from automation use. For example, Riley and Strater 44 found that significantly higher level 3 situation-awareness scores were seen when participants deployed two robots using only teleoperation than when the robots were deployed with more autonomy. They explained that being out of the loop when more autonomy was used, may have lowered supervisor attention for the robots. This in turn degraded perception and comprehension necessary to select suitable robot actions. Case in point, automating the robot’s navigation such that the human is monitoring robot movements instead of driving the robot remotely, can diminish awareness about where the robot has been or how it arrived at the current location. 50 Even if the human supervisor remains vigilant throughout the deployment, sufficiently high levels of automation can hide the cause of some autonomous behaviours. 19 This leaves the supervisor confused when the robotic system exhibits unexpected behaviour. 68
The effects of the out-of-the-loop syndrome are thought to be at their most extreme when system designers implement automation to allow technology to handle as much of the system’s objective(s) as possible. Kaber et al. 53 have noted that designers of many robotic systems tended to adopt the belief that maximum autonomy is the most appealing approach. Such designers do not realize that this kind of implementation can render human intervention impossible and demotes the role of the human to a mere observer instead of a supervisor or an operator. 2 Furthermore, Kaber and Endsley 69 recommended that human–machine systems employed for dynamic applications will benefit from periods of manual control (i.e. temporarily scaling the level of autonomy to no autonomy at all). Such instances of manual control may assist in preventing skill degradation.
Perhaps another method for combating complacency and subsequently being out-of-the-loop is the use of psychophysiological measures 70 such as eye tracking to ascertain the level of human engagement with the task. In the event that the psychophysiological data suggests that the human’s level of engagement is below an acceptable threshold, the system can provide prompts or scale autonomy to a lower level to encourage re-engagement.
Maximizing autonomy may not even be worth the engineering effort. Wilson and Neal 71 conducted an experiment to compare the effort of implementing levels of autonomy with increases in efficiency. The experiment involved tasking a human to act as a ‘shepard’ to direct the robotic ‘sheepdog’ to herd a robotic ‘sheep’. Wilson and Neal claimed that the reduction in human interaction with the sheepdog did not warrant the effort in developing very high levels of autonomy.
Sliding autonomy
An increasingly popular method for applying autonomy in supervised robotic systems is sliding autonomy. Sheridan and Verplank 72 have proposed a spectrum of autonomy based on 10 possible autonomy levels. These autonomy levels begin from the lowest extreme where the human performs the whole task manually (level 1), to autonomy at a middle level where the system executes an action upon human approval (level 5), to the highest autonomy level where the automation handles the whole task (level 10). A human–machine system can be designed such that its level of autonomy is fixed. Such fixed autonomy is also referred to as static autonomy. With sliding autonomy, however, the system’s user can dynamically adjust its level of autonomy according to the needs of the situation. 73
Parasuraman et al. 74 compared the use of sliding autonomy, static autonomy and no autonomy in an experiment involving the supervision of unmanned ground and aerial vehicles. In the static autonomy condition, the UAV constantly assisted the human with an automatic target recognition function while that function was only invoked in the sliding autonomy condition when system performance fell below a set threshold. Their study showed that sliding autonomy assisted in promoting situation awareness and was also the most helpful condition for detecting changes and mitigating workload.
A benefit of such adjustable autonomy is that a high degree of automation can be present in the system to assist the human in coping with high-workload situations. However, that level of automation can be scaled back when possible, for the human to re-engage with the task. Such scaling back of autonomy and re-engagement of the human with the task may lower the risk of the human becoming out of the loop. 70
Sliding autonomy is useful for supervised robot deployment because there will very likely be situations where a robot’s on-board intelligence or sensors are insufficient to handle the task at hand. 75 Such instances are ideal for the scaling back of robot autonomy to allow for manual control of the robot(s) by the human. Take, for instance, navigation by a ground robot through an area with many large holes that the robot can fall into. The deployed robot may be equipped to detect obstacles visible from the ground up but not depressions on the ground. The human, at this point, can suspend autonomous robot navigation and select to teleoperate the robot through the challenging terrain.
Regardless of robotic system’s intended application, designing for the system’s autonomy to be dynamically adjustable is advocated because it can be difficult if not impossible to anticipate failure of the system’s automation. 75 Should the function for the scaling back of autonomy still be available despite the failure of higher level autonomy features, the robot can be saved from incapacitation. It may even be possible to continue with the mission, albeit possibly with some handicap due to the loss of high-level autonomy features.
Deployments of multiple robots can benefit from sliding autonomy as the level of autonomy on all the robots do not have to be the same. For instance, the human may lower the autonomy on one robot individual to teleoperate it out of an impasse, while the rest of the robots explore the mission environment autonomously.
It is worth noting that while it is beneficial for autonomy levels to be adjustable, the levels of autonomy which are available within the system could be much more important. Kaber and Endsley 69 described their study on the effects of static autonomy levels and automation allocation cycle time (AACT). They defined AACT as a ratio between the duration in which a preset level of autonomy is available and the duration where autonomy was not. To ascertain their effects in a simulated multitask scenario, the two factors were examined independently and in conjunction with each other. Kaber and Endsley noted that the batch processing level of autonomy may facilitate good system performance, improve situation awareness and moderate workload in static autonomy systems. Under that level of automation, the options to be performed are generated and selected by the human while the actual implementation is done by the automation. However, that level of autonomy did not offer the best combination with an AACT that has also enabled optimal results; that is, the benefits are not ‘additive’. Thus, Kaber and Endsley suggest that the identification of the appropriate autonomy level is crucial even for systems capable of sliding autonomy.
What, how and when to automate
The designer of a robot system or any system involving automation will have to consider what aspects of the task to automate, how automation can be implemented to assist and also when the installed automation feature should be invoked. This section explains the importance of these concerns and describes them in greater detail.
What to automate
Increasing automation levels did not guarantee improved routine performance. 44 Parasuraman and Riley 76 have added that instead of reducing the amount of work to be done, automation changes the nature of that work, possibly placing more workload on the human rather than decreasing it. Wickens et al. 44 explained that this can be due to the added effort when configuring powerful automation and the need for increased supervision. Therefore, it is also crucial to consider which aspects of the overall task to automate; in other words, what to automate.
Parasuraman et al. 77 proposed a model involving four aspects of task performance which are present when a human interacts with automation. These aspects are (1) information acquisition, (2) information analysis, (3) decision selection and (4) action implementation. That model also considers the degree at which automation can be applied to each of the four proposed aspects. For each of these four aspects, automation can be implemented at different degrees based on two sets of criteria.
First, the choice for the appropriate level of automation and the aspect of the task to automate should be based primarily on its impact on the following factors: human workload, situation awareness, complacency levels and the possibility of skill degradation. For the secondary set of criteria, Parasuraman and colleagues suggested that system designers should consider how reliable the applied automation is likely to be. They should consider also the costs or significance associated with the outcome of the decision or action. Highly unreliable automation can demand more rather than reduce workload and result in inferior situation awareness instead of promoting it. If the implications of automation failure are small, then a high level of automation may be justifiable. Powerful automation may also be acceptable when the human is under significant time stress. However, high levels of automation are not recommended when manual control of the system is needed in the event of anomalies. Manzey et al. 78 simulated the supervisory control of spacecraft life support systems and have revealed skill loss during ‘return to manual’ task performance after exposure to the highest level of automation in their experiment.
How to automate
Even for the same task, different types of automation can be applied to seek the same benefits. The designer has to consider the various automation types to ensure that the right kind is implemented to derive the most benefit. Feigh et al. 79 presented a useful model to facilitate such consideration.
The first part of this two-part framework deals with how a system may provide adaptive automation. It is based on four general automation techniques: (1) the modification of function allocation, (2) the modification of task scheduling, (3) the modification of interaction and (4) the modification of content. These techniques are described briefly in the following paragraph.
A human–robot system can be adaptive via the modification of function allocation. This means that the system adopts or relinquishes tasks during runtime. A system can adapt also through the modification of task scheduling by altering the time and duration which tasks are performed. Additionally, a system may be adaptive through the modification of interaction, which can entail altering the layout of the graphical interface or the synchronicity of information display. The modification of interaction can include as well, the changing of the sensory modality through which information is presented. The fourth type of adaptation is the modification of content. That pertains to the selection of the type of information provided and the level of detail at which it is presented.
When to automate
It is also important to recognize the opportunities that occur during robot deployment that automation can be invoked. The circumstances behind these opportunities can thus become automation triggers. This article uses the second part of Feight et al.’s 79 framework to present when automation can be activated. Five types of automation triggers are possible: (1) operator based, (2) system based, (3) environment based, (4) task based and mission based and (5) spatiotemporal.
First, a trigger may be classed as operator based because of the user’s activation of the automation. It is also considered operator based if automation is initiated due to the system’s recognition of the operator’s performance or workload. Byrne and Parasuraman 70 pointed out that system evaluation of operator workload is possible via psychophysiological measurements such as electroencephalogram, heart rate variability and palmar skin conductance. However, such measures should be used in conjunction with other methods like those for assessing operator performance as environmental factors and individual tendencies can affect psychophysiological data.
The second type of automation activation is system-based triggers. With this type of trigger, automation is applied by the system because of certain system states or modes. For instance, if a robot within a multiple-robot system becomes incapacitated, the remaining robots can coordinate amongst themselves to formulate a new plan which takes into account the system’s new handicapped state.
Third, triggers can be environment based because they are brought on by changes like surrounding light, temperature or humidity levels. Environment-based triggers also include events that occur within the system’s workspace.
Next, triggers can be task or mission based if automation is engaged because the mission has reached a particular phase. For instance, the return-to-deployment coordinates behaviour on multiple-robot system can also be triggered once the robots have completed their mission objectives. The fifth type of trigger in the framework of Feigh et al. is that of the spatiotemporal variety. Such triggers pertain to automation state that is altered because of changes in the system’s location or simply because the time calls for it.
Within the system, multiple triggers can be present. The activation of each trigger does not have to be simplistic but can depend on the status of other triggers as well. In a system of multiple autonomously coordinating robots, for instance, an incapacitated robot may serve as the criteria to trigger an automatic response from other robots to take over its unfinished task. If, however, the trigger for ‘return to base’ has been set, then remaining robots can ignore the trigger to assist the incapacitated individual.
The considerations for what, how and when to automate have been summarized in Table 1.
A summary of what, how and when to automate.
This section has introduced the application of automation to decrease workload and assist in acquiring situation awareness via empirical techniques as well as model-based approaches. It has also presented possible issues such as decreased user vigilance and being out of the loop when automation is misused. To avoid such potential pitfalls, the designer may consider imbuing the system with the ability for adjustable autonomy. The designer must also be aware of the types or classes of automation available, how such automation can be implemented and when during runtime the implemented automation should be invoked.
Case study
A single-human multiple-robot system (Figure 1a) previously presented by Wong et al. 80 is used here as a case study. This system serves as a real-world example of the aspects considered in the what, how and when of implementing automation within a human–robot system. It also features some functions which are examples of the low, middle and high autonomy levels.

(a) The deployed robots exploring the mission area and (b) the system’s graphical user interface.
A video of this system and its user interface can be found at http://www.youtube.com/watch?v=sHGP0jniz3g (accessed on 20 May 2016). This system deploys up to three mobile robots to search for victims in the aftermath of USAR scenarios. A human supervisor remotely interacts with the robots via a control station. The graphical user interface (Figure 1b) at the control station displays a video feed from each robot as well as information such as robot locations and status. Manual control of each robot using the interface is also possible.
Table 2 lists some relevant examples of automation features in the described system to demonstrate what, how and when to automate.
Example automation features found in the single-human multiple-robot supervision system described by Wong et al. 80
USAR: urban search and rescue.
The described system also serves as an example of sliding autonomy as many functions such as the modification of content to highlight notable features in the video feed can be toggled on and off, depending on preference. By toggling this ‘highlight’ feature off, ‘raw’ data are preserved, as advocated by Parasuraman et al. 77
The system features automation at the low, middle and high levels of the autonomy spectrum that Sheridan and Verplank 72 have proposed. For example, the system information analysis is automated when notable features in each robot’s camera feed are judged against a preset criteria which triggers an alert if the noted features are of a large enough size. This may be considered a low level of autonomy since it is entirely up to the human to decide if any action was to be adopted based on the robot’s suggestion. This automation feature, however, may not always be beneficial. If false detection occurs too frequently, the system user will disregard the feature. Overreliance on it may also cause a victim of the disaster to be missed by the robot. To really reduce workload and facilitate effective supervision of multiple robots by a single human, such a feature has to be very reliable.
The system has a feature that may be considered in the middle range of Sheridan and Verplank’s 72 proposed autonomy spectrum. This feature refers to the system’s ability to autonomously generate a coordinated plan for the deployed robots based on the latest situation in the mission space. Such autonomous coordination reduces human workload. Attention resources can thus be directed towards looking out for targets or hazards in the robots’ environment. The generated plan may not necessarily be accepted as the human can instruct the robots to explore areas other than the ones proposed by the system.
A higher level of autonomy may be seen in each robot’s ability to autonomously avoid obstacles during navigation. While this feature can be toggled off, its execution is wholly determined by each robot’s on-board algorithm. This feature also frees up the human supervisor’s attention resources to focus on the visual search task.
The single-human multiple-robot system described above has demonstrated the versatility of the sliding autonomy concept by describing instances of autonomy being adjustable for multiple aspects of task performance on one system. Designing sliding autonomy into these aspects of task performance can facilitate the creation of highly flexible systems with a myriad of automation configurations to suit various situations. Figure 2 illustrates the case of a system with different levels of autonomy for each of the four aspects to task performance. The autonomy configuration at time t 1 is different from that at time t 2 to produce an autonomy configuration suitable for the situation at different instances.

A system with dynamically adjustable autonomy level for four aspects of task performance.
Conclusion
The search for the right kind of automation to address workload and situation awareness is perpetual as new technology and techniques (empirical and model based) are constantly generated. Despite the new methods, fundamental concerns regarding automation implementation remain relevant. This article has drawn from existing literature to propose a framework in Table 1 to encourage appreciation for what to automation, how to automate and when to automate. In addition, the article points out to system designers, the various degrees of automation. It also shows the benefits of adjustable automation levels.
Automation can be applied elegantly such that robot autonomy is adjustable for the four aspects of task execution: information acquisition, information analysis, decision selection and action implementation. This facilitates the deployment of robots with automation that is highly reconfigurable to suit the needs of the robot’s situation. The deployment of more than one such robot forms a versatile robot group capable of having each robot working at its own automation configuration. Figure 3 illustrates such a robot group. Future research may be channelled to establishing if the presentation of such flexibility can be beneficial to performance as well as whether the presence of such a myriad of automation choices to the human supervisor would adversely affect workload.

A supervised multiple-robot group with each robot at its unique automation configuration.
Footnotes
Acknowledgements
This research is supported by the BeingTogether Centre, a collaboration between Nanyang Technological University (NTU) Singapore and University of North Carolina (UNC) at Chapel Hill. The BeingTogether Centre is supported by the National Research Foundation, Prime Minister’s Office, Singapore under its International Research Centers in Singapore Funding Initiative.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
