Abstract
Many applications for robots require them to work alongside people as capable members of human-robot teams and to collaborate in order to perform tasks and achieve common goals. These tasks can induce strain on the human due to time constraints. Additionally, humans can become highly stressed due to fatigue, resulting in decreased efficiency. The contribution of this paper is in the introduction of a human fatigue model and the application of this model to a mixed team coordination framework in order to predict team performance given the constraints of human fatigue. The human fatigue model - namely a FAtigue Prediction (FAP) model - is used to conduct numerical simulations that predict mixed team performances. Specifically, extensive simulations are performed to determine how human fatigue influences the choice of the number of agents for a given number of tasks. The novel mixed team coordination framework is a Stochastic Clustering Auction (SCA), which is based on a modification of the Swendsen-Wang method, called SW2SCA. It enables complex and efficient movement between clusters by connecting tasks that appear to be synergistic and then stochastically reassigning these connected tasks. In SW2SCA, the auctioneer makes stochastic movements with homogeneous or heterogeneous agents. The final discussion outlines a systematic procedure to predict the performance of human-robot systems with the FAP model in SCA.
Introduction
Many applications require robots to work alongside humans as capable members of mixed human-robot teams and to collaborate in order to perform tasks and to achieve common goals. Such applications have evolved from collaborative exploration and the military [9], space [15] and medical industries [16]. These tasks are often performed under strain due to time constraints. It is important to determine how human fatigue influences the choice of the number of agents utilized for such tasks. This paper introduces a human fatigue model into the mixed team coordination framework and uses it to predict team performance given the constraints of human fatigue. In this framework, humans and robots treat each other as partners, as opposed to a supervisor and subordinate relationship. There are three stages in predicting the performance of human-robot team collaboration while considering human fatigue. First, semi-structured questionnaires are conducted through personal digital assistants (PDAs) to collect information of the physiological condition of the human subjects so that the modelling parameters can be identified for the two-process model. Next, the human fatigue model is used to predict the task capabilities for the human-robot team. Afterwards, the task capabilities are used as constraints in coordinating the human-robot team.
In the first stage, the PDAs [9] are assumed to be already engaged in a dialogue between humans and robots, following which they then collect semi-structured questionnaires. These questionnaires are used to bring up new questions during the interview [11] to identify the modelling parameters.
In the second stage, our fatigue model is constructed based on the seminal two-process model of sleep regulation [3]. This model postulates that wakefulness and sleep are governed by two primary biological mechanisms: a homeostatic process and a circular process. The homeostatic process builds pressure for sleep during wakefulness and dissipates this pressure during sleep [2], while the circadian process modulates sleep pressure as a function of the time of day [8]. The two-process model has been successful in predicting individualized performance impairment for human fatigue [7, 12].
In the third stage, humans and robots treat each other as partners in the collaborative human-robot team. Existing approaches to human-robot team collaboration can be assigned according to three categories: fully centralized approaches [4], completely distributed approaches [14] and auction algorithms [5, 6, 17–22]. Dias et al. presented the seminal works on auctioning methods [5, 6]. SCAs [17–22] constitute a class of cooperative auction methods that enable the improvement of the global cost of the task allocations obtained using fast, greedy algorithms. SCAs fall within the class of Markov chain Monte Carlo methods [13] and optimize using simulated annealing [10]. By rearranging the clusters via transfer moves or swap moves, and by allowing both downhill and uphill movements, SCA algorithms provide some ability for avoiding local minima. SCAs provide an efficient framework for determining how human fatigue can influence the choice of the number of agents for a given number of tasks.
Related Works
Only a few studies have been found which cover mixed human-robot team coordination with human fatigue. A two-process model is proposed to characterize human fatigue and the cost function is constrained by the fatigue model [23]. A further study implements a similar fatigue model, but on a weighted cost function over boredom, forgetfulness, fatigue and situational awareness [24]. The two studies, however, have completely framed in the fully centralized approaches because a human operator or robot acts as a central allocator to determine the task distribution for the team [21]. There are three limitations to the fully centralized approaches. First, and foremost, fully centralized approaches require complete communication between the central allocator and each team member, which is not always feasible. Second, they require the central allocator to have substantial internal knowledge of each team member and to keep track of changes in the team members as the mission progresses, which may not be practical. Third, they are vulnerable to a single point failure.
The major contribution of this work lies in the development of a human fatigue model which is integrated in the context of human-robot interaction, whereby auction-based task allocation in a human-robot team is reformulated as an optimization problem.
Outline
The remainder of this paper is organized as follows. Section 2 formulates the basic optimization problem for task allocations in a collaborative human-robot team, introduces the bio-mathematical two-process model for human fatigue, provides a description of SCA, and discusses how SCA may be used for human-robot teams. Section 3 integrates the bio-mathematical two-process model for human fatigue with SCA in a mixed human-robot team, and it is seen that the integration of SCA with the bio-mathematical model can predict the performance of the mixed team in terms of cost. Section 4 summarizes a systematic engineering approach to human-robot system task allocation with the FAP model in SCA. Finally, Section 5 concludes the results.
Preliminaries
This section first presents the basic problem statement. Then, it introduces the bio-mathematical two-process model for human fatigue. After that, the SCA algorithm as constrained by human fatigue is presented and discussed.
Task Allocation Problem Statement
Let ℋ denote a set of k heterogeneous robots or humans, and let 𝒯 denote a set of n tasks, i.e., ℋ = h1, h2, …, hk, 𝒯 = t1, t2, …, tn. The tasks can be sequenced in any order and are categorized as “goto points”. Also, heterogeneity does not necessarily imply that the agents perform different tasks. It implies that they do not perform each task with the same efficiency. For example, the agents may move at different speeds, preventing them from reaching a destination (i.e., perform a task) with equal efficiency, even when starting from the same spatial location.
Let
Let 𝒞𝒜𝒫 denote the task capability of the human-robot team:
where
where cs(as) is the minimum cost for agent hs to complete the set of tasks as. The individual cost function cs(·) is based on the characteristics of each agent (e.g., the dynamic model of the agent, the state of the market, current task commitments and/or a human-inspired measure). The problem is to solve the optimization min 𝒞(𝒜). In practice, the cost function in (2) might be used to represent the maximum time taken to accomplish a task.
This section introduces a FAP model. FAP uses a bio-mathematical two-process model to predict how long a human can handle a task in hours under the influence of fatigue in a mixed human-robot team. The two processes that govern the regulation of the sleep-wake cycle are a sleep/wake-dependent homeostatic process (Process SL) and a sleep/wake-independent circadian process (Process CI) [2, 12]. Process SL provides a sleep/wake pattern whereby sleep intensity is governed by a homeostatic mechanism, while Process CI regulates the period of the pattern, whereby the timing of sleep is controlled by the circadian clock. As the bio-mathematical model builds upon this work, the mathematical basis is briefly reiterated here.
Process SL, which is dependent on the duration of prior sleep and waking, is typically represented as a pair of difference equations [2, 7, 12].
where
Process CI is independent of any sleep/wake history and represents a self-sustaining oscillator with a 24-hour period. The definition of
where
Let
where α and
The challenge is to correctly estimate the five unknown model parameters α,
With (5) and the known model parameters, the task capability
Steps 1 and 10 takes into account the task capability 𝒞𝒜𝒫 from the FAP model at time
The auctioneer partitions 𝒯 into k clusters to form an initial allocation A(0) = a1(0), a2(0), …, ak(0) with the task capability 𝒞𝒜𝒫(0), where each cluster as(0) is an unordered subset of 𝒯. Let 𝒜 = 𝒜(0) and 𝒜* = 𝒜(0) (𝒜 is the current algorithm allocation, while 𝒜* is the allocation that has the lowest cost).
Each agent hp ∊ ℋ (p = 1, 2, …, k) uses a “constrained Prim's Algorithm” (a greedy algorithm) to efficiently approximate the cost cp(ap) and submits its cost to the auctioneer. During this bid valuation stage, each cluster ap becomes an ordered subset of 𝒯. The auctioneer computes the global cost 𝒞(𝒜) using (2) and sets the temperature 𝒯 to a high initial value T0.
The auctioneer randomly selects two agents hs and ht for either transfers or swaps and lets ℋ = hs, ht.
Each agent hp ∊ ℋ constructs an adjacency graph 𝒢(ap) = 〈(ap), ε(ap)〉 for each cluster ap, where ε(ap) is the edge set of 𝒯(ap).
Each agent hp ∊ ℋ submits lmin(ap) to the auctioneer, where
For each e ∊ ε(ap) (p = s, t), the agent turns the edge e off (such that the tasks at the end of the edges are no longer considered connected) with a probability 1 − pe, e.g.,
The auctioneer collects all the connected components in the set 𝒞𝒫.
With a 50%-50% probability, the auctioneer selects either a transfer or a swap.
Algorithm 1 describes the mechanism of SCAs within the framework of FAP.
For the non-greedy SCA, the auctioneer accepts the proposal with probability α S (𝒜 → 𝒜i(s,t)) or α 𝒟 (𝒜 → 𝒜i,j(s,t)) or 𝒫𝒮(i, s, t, 𝒯) or 𝒫𝒟(i, j, s, t, 𝒯), while for the greedy SCA the auctioneer accepts the proposal if 𝒞(𝒜i(s,t)) or 𝒞(𝒜i,j(s,t)) is less than 𝒞(𝒜*). Then, 𝒜 is updated and the cost 𝒞(𝒜) is updated and stored. Otherwise, the auctioneer declines the proposal and the auctioneer keeps the current configuration.
If the auction evolution termination criterion is satisfied (i.e., 𝒯 < 𝒯cut, where 𝒯cut is some threshold temperature), then the auction is terminated and the final allocation is 𝒜* with final cost 𝒞(𝒜*) ≤ 𝒞(𝒜(0)). If the criterion is not satisfied, reduce 𝒯, using 𝒯 ← 𝒯/β where β > 1, and go to Step 3.
Simulation Results of a SCA for FAP
This section of the paper presents a simulation study for the verification of the SCA within the framework of FAP. First, the simulation setup and the setup metrics are introduced for FAP and a SCA. Then, Section 3.1 presents the simulation results before and after the total sleep deprivation. Afterwards, Section 3.2 concludes the results during the total sleep deprivation.
The simulated data set is generated by running the FAP of (5) with fixed and known parameters and superimposing random noise to emulate the variability observed in human performance, which is assumed to be obtained through PDAs. The five model parameters are configured as α = 30.30,
The measure of the SNVR is performed to compare the level of a desired signal to the level of background noise. The definition of the SNVR is as the reciprocal of the coefficient of variation introduced in Definition 4:
These variances correspond to signal-to-noise ratios of 59.10 (59.10/1), 14.77(59.10/4) and 3.7(59.10/16), where 59.10 is the variance of the performance data signal simulated in (5) with the above mentioned model parameters for 82 hours 1 [12].
𝒞𝒜𝒫 is utilized as a constraint for the allocation of tasks in a cooperative human-robot team. The simulation setup of the SCA is described as follows. For each simulation of a SCA, the stochastic random scenario appears in a 10000m × 10000m area involving 1,000 random scenarios for a given number of tasks and agents. The initial agent positions were evenly distributed along one edge of the area (when the initial agent positions are distributed differently, e.g., randomly, the qualitative results given below remain the same). Furthermore, for each simulation, the following SCA parameters were used: initial temperature, T = 1000; termination temperature, Tcut = 20; small cooling schedule ratio β = 1.001 with slow annealing, where T and Tcut are a priori knowledge regarding the scenario scale such that they are scenario-dependent. However, they are always the same for different agents, different tasks and different missions in a fixed scenario scale.
The concept of an agent type vector (ATV) is introduced in Definition 5 to measure the different types of agents that constitute a given human-robot team:
The cost function is a M
Assumed ATV, as the number of agents varies from 2 to 10 and at(1) is human occupied vehicle
Assumed ATV, as the number of agents varies from 2 to 10 and at(1) is human occupied vehicle
The next section will illustrate the costs of SCA before and after total sleep deprivation by prediction in the FAP model.
In the subsequent simulations, the number of agents was variable. In particular, 1,000 random scenarios were again studied for a given number of agents and tasks, with the number of agents now ranging from two to 10 and the number of tasks ranging from 10 to 260 in increments of 10.
Figure 1 displays the costs of the SCA as a function of the number of agents and tasks when human fatigue is considered and not considered. It shows that, where human fatigue is considered, an additional number of agents is required to achieve the same level of overall performance. For example, in Figure 1(c) (which shows the costs for 200 tasks), if human fatigue is not taken into account, the cost corresponding to two agents is 217 hours, while similar costs are required for four agents to achieve the same level of mixed team performance if human fatigue is considered after total sleep deprivation. In contrast, consider Figure 1(d): two agents are required for the team performance cost in 100 hours if human fatigue is not considered, while three agents are required to maintain the same level of team performance if human fatigue is considered. The reason is that the sacrifice of adding robotic agents leverages the impaired individuals. In general, for a fixed number of tasks, the corresponding “slice” of a 3-D curve, such as in Figures 1(c) and 1(d), may be used to determine how human fatigue influences the choice of the number of agents in ATVs, and hence provides a guideline for choosing the number of agents in ATVs for the mission.

Mean cost vs. the number of agents and the number of tasks for the SCA with FAP and the ATVs in Table 1
Comparison of the predicted costs of Figure 1 reveals that, when the number of tasks is large, the FAP model has a substantial impact on the performance of the mixed team. For example, in Figure 1(c) (which shows the costs for 200 tasks), the cost ranges in [0.7,217.5] hours if human fatigue is not considered, while the cost ranges in [1.1,523.4] hours if human fatigue is considered. This is a cost increase of [30.5%,140.6%], due to the number of tasks being far beyond the task capability of impaired individuals at the specified speed at(1). It should also be noted that the FAP model presented here can be used to predict mixed team performances based on fatigued individuals for other task and agent numbers. This capability is displayed in the next section, which shows the costs of the SCA as a function of the sleep deprivation time.
Given specific ATVs and numbers of tasks, the costs of the SCA are a function of the sleep deprivation time. Figure 2 shows the simulated task capability profiles for the group and the various individuals at specified ATVs for 200 tasks. They are collected every two hours during 82 hours of total sleep loss for 200 tasks and 1,000 random scenarios. The simulated performance is represented with a signal-to-noise ratio of 14.77. The group profile is averaged for 11 people and the individual profile is a representative, single individual.

Task capability of FAP in a group and for an individual collected every two hours during 82 hours of total sleep loss for 200 tasks and 1,000 random scenarios
The trend transitions of cost with respect to TIME AWAKE can be observed from Figure 2, such that cost increases as TIME AWAKE increases. Since the cost represents the task capability of a given human agent, it reveals that a human agent's capability degenerates as TIME AWAKE increases. This trend is characterized by four major phases, corresponding to three transition stages and four levels. A transition stage represents a time segment where the task capability rapidly degenerates, while a level represents a relatively stable range of magnitude for human task capability.
Phase I: The task starts at level I, and transition I takes place from hour 12 to hour 14. At hour 12, it sees a distinct degeneration as the cost rapidly increases to a higher level, called ‘level II’. This first transition happens within a very short time (i.e., about two hours), until hour 14. During the transition, cost jumps from 224 to 228 for M(2), 118 to 134 for M(3), and 1.38 to 1.52 for M(8), as shown in Figure 3.

The costs of the SCA as a function of sleep deprivation time collected every two hours during 82 hours of total sleep loss for 200 tasks and 1,000 random scenarios
Phase II: Level II starts around hour 14, followed by transition II from hour 30 to hour 40. During transition II, the cost increases from 234 to 266 for M(2), 124 to 138 for M(3), and 1.46 to 1.64 for M(8).
Phase III: Level III starts at hour 40 and lasts until hour 48, followed by transition III from hour 48 to hour 56. During the transition, the cost jumps from 268 to 298 for M(2), 136 to 166 for M(3), and 1.64 to 1.65 for M(8).
Phase IV: For the remainder, the task capability is unable to maintain a relatively smooth pattern, oscillating randomly for the rest of the hours. The cost oscillates throughout as well.
We generalize the above observations with three main points: 1) the total TIME AWAKE has a negative effect on human agent task capability throughout. 2) TIME AWAKE's effect on human agent task capability can be classified in in terms of three stages with four levels. 3) TIME AWAKE's effect on human agent task capability takes conforms to two patterns: a rapid transient pattern and a gradual transition pattern. 4) When TIME AWAKE exceeds a limit (80 hours in a one hour simulation), the human agent is allocated few (or no) tasks and the SCA cost approaches a constant value.
From Figure 3, it is of interest to predict the performance of the SCA under the constraints of FAP model. There are two critical performance times: 20 hours and 52 hours. Before 20 hours, the FAP model has only a slight limitation effect on the costs of the SCA, while after 52 hours the FAP model has a great impact on the costs of the SCA because the human's capacity to handle tasks deteriorates as the sleep deprivation time increases.
Based on the results of this paper and the simulation gained in developing these results, we propose a systematic engineering approach to human-robot system task allocation using a SCA under FAP. It is assumed that a mission is given in which the physical scale of the distribution of tasks is known, along with the range of the number of tasks. It is also assumed that a set of candidate agents is potentially available for the mission, and guidelines are provided as to how human fatigue influences the choice of the number of agents. The approach is presented below in six steps.
Semi-structured Questionnaires to Derive the Five Unknown Model Parameters. Semi-structured questionnaires are designed, and the humans are subject to the semi-structured questionnaires via a PDA [9] following certain periods of total sleep deprivation, with the data from the subjects in this study used to derive the five unknown model parameters in FAP. Further research is needed to determine what kind of simulations are necessary in questionnaires to be mapped to the parameters.
Task Capability to Predict and Update. With the known model parameters obtained from Step 1 in FAP, the task capability is predicted and updated in the task capability. It is utilized as a constraint to bound the proposed movement of tasks allocated in a cooperative human-robot team.
Simulations to Determine Annealing Parameters. The humans and robots treat each other as partners in a collaborative human-robot team. For the human-robot task allocation problem considered in both this and prior SCA research, it has been observed that the choice of the annealing suite {T0,Tcut,β} depends largely on the physical scale of the area over which the tasks are distributed, rather than on the number of tasks or the number of agents (it is revealed that different types of task allocation missions will have similar invariance [22]). Hence, T0, Tcut and β may be chosen based on a relatively small number of randomly distributed tasks and - initially - randomly distributed agents, as long as the area of the task distribution is as that envisioned for the actual mission. For this problem, it feasible to solve for an optimal or nearly optimal task allocation using a combinatorial algorithm. Using non-greedy centralized SCA, this knowledge is helpful in using trial and error to choose T0 and Tcut and determining values of β that correspond to slow and fast annealing. Slow annealing should yield a near optimal task allocation, while fast annealing should converge quickly (though with noticeable improvement over the initial task allocation, which will typically be chosen to be the best solution resulting from a suite of fast, greedy auctioning algorithms). T0 and Tcut are not expected to change in the following steps, but β may be chosen for slow annealing, fast annealing or something in between.
Simulations to Determine the Mission Agents. Because the exact tasks to be performed are not known a priori, stochastic simulations are chosen to represent the possible task distributions and the number of tasks in the actual mission. The numbers of the different types of agents (and possibly their initial locations) are systematically varied in simulation. A non-greedy centralized SCA, using β chosen for slow annealing (or faster annealing if time does not permit), is then used to compute a (multi-dimensional) table that shows the expected cost vs. distribution of the numbers of each agent-type.
Task Allocation at the Beginning of the Mission. The agent team was determined in Step 4. At the beginning of an actual mission, it is assumed that a set of tasks are actually known and need to be assigned to the agents. Assuming full communication between agents, we can choose a non-greedy centralized SCA with β chosen for slow annealing (or faster annealing if time does not permit) to develop the initial task allocation under the constraints of task capability in FAP.
Task Allocation During the Mission. Once the mission commences, distributed auctioning using a greedy SCA with β chosen for fast annealing (or slower annealing if the time and communication are available) is used to reallocate tasks under the constraints of task capability in FAP, as the tasks and costs change during the mission.
Final Discussion
This paper first integrates a human fatigue model in a novel, mixed team coordination framework, and then uses this model to predict human performance under time constraints. The human fatigue model is a FAP model, which uses a bio-mathematical two-process model to predict the task capability and performance for human individuals and human-robot teams. A SW2SCA, based on a modified Swendsen-Wang method in a suite of SCAs, is utilized in the mixed team coordination framework to allocate and reassign tasks. It enables complex and efficient movements between clusters by connecting tasks that appear to be synergistic, and then stochastically reassigning these connected tasks. In the SW2SCA, the auctioneer makes stochastic movements with either homogeneous or heterogeneous agents.
The FAP model is validated in the presence of degradation in human capacity and provides a guideline for fatigue prediction in actual field operations relating to a mixed team's performance. The random simulation for a centralized SCA shows how human fatigue influences the choice of the number of agents. This enables an appropriate choice of the number of agents for a given mission that has an expected number of tasks when human fatigue is considered.
Footnotes
6.
This work was sponsored by the Army Research Office under grant W911NF-09-1-0568.
1
82 hours is chosen in consideration of extreme conditions of deprivation of sleep for humans.
