Sage Journals: Discover world-class research

Abstract

This work addresses the issue of intelligent robot–human‐coordinated parts‐to‐picker order fulfillment carried out in a human‐friendly manner. One unique feature of the proposed approach involves integrating a real‐time data‐driven stochastic‐dynamic model with a fatigue accumulation function. The optimal solutions help achieve coordination between human pickers and robots, such that robots can agilely adapt to the coordinated pickers’ efficiency and fatigue conditions. Specifically, the proposed method estimates human pickers’ instantaneous performance and robot queue lengths, which are then fed back in real time as indexes to adjust robots’ speeds of handling racks and moving them to human pickers. Using data that are provided by a giant electronic commerce (e‐commerce) company, our analyses demonstrate that the proposed robot–picker coordination system permits alleviating a picker's fatigue without much influence on picking efficiency. In particular, a picker's accumulated fatigue can be reduced by 53.74% at the expense of lowering picker efficiency by 14.79% if the proposed robot–picker coordination system is applied in the focal firm of the study case. Through our scenario design and sensitivity analysis, additional findings and managerial insights, including the rules of human‐friendly robot behaviors for coordination with human pickers in different operational scenarios are provided. They facilitate the development of “human‐friendly” intelligent robot‐human‐coordinated order‐fulfillment systems for intelligent logistics operations.

Keywords

automated warehousing system data‐driven stochastic optimal control intelligent logistics mobile rack robot‐human coordination

INTRODUCTION

Business operations have entered the Industry 4.0 era in which robotics are widely applied in the production, logistics, and e‐commerce companies (Choi et al., 2022; Loffler et al., 2022; Olsen & Tomlin 2020; Rai et al., 2021). This is evidenced by the claims made by Amazon.com:

It's better for everybody…Workers no longer would have to walk massive warehouse floors to find the right power drill–instead, robots would bring the drill directly to them… The hour or more it took to process a package had been shaved down to as little as 15 minutes…Robots, Amazon insists, are good for workers… “They make the job safer,”… (Evans, 2020)

However, picking three times faster also implies more wear and tear due to repetitive motion and working faster at lifting and handling products… So along with the drive to automate more warehouse tasks comes much higher expectations for workers…The robots have raised the average picker's productivity from around 100 items per hour to what Mr. Long and others have said is a target of around 300 or 400, though the numbers vary across teams and facilities… (Del Rey, 2019).

Indeed, there is no doubt that advancements in such disruptive technologies as robotics and related applications (Azadeh et al., 2019; Chung, 2021; Perera et al., 2020; Shi et al., 2021; Wang et al., 2022) have brought new benefits to electronic commerce (e‐commerce) companies (such as Amazon, Alibaba, and Jingdong) by increasing the efficiency of order fulfillment. However, they are simultaneously creating new safety challenges.

Order picking is a critical step in customer order fulfillment (Batt & Gallino, 2019; de Koster et al., 2007; de Vries et al., 2016; Frazelle, 2002). In a typical warehouse, it is not only costly and time‐consuming but also labor‐intensive. In general, the cost of order picking is estimated to account for more than 50% of the total warehousing cost (Tompkins et al., 2010). Furthermore, order picking typically relies on manual labor to retrieve multiple items from storage, sort them, and then package them to fulfill customer orders (C.‐M. Chen et al., 2010; de Koster et al., 2007; de Vries et al., 2016). Owing to the large variety of customer orders and stock‐keeping units (SKUs) in most e‐commerce operations, each order‐picking task for a customer order is difficult to replicate. The ordering picking's efficiency and accuracy depend mainly on the human pickers’ experience (Batt & Gallino, 2019) and performance (de Vries et al., 2016). Moreover, in e‐commerce, agile order picking can be more challenging when the ordered products are diverse, demand is fluctuating, customer returns are increasing and customers expect more flexible and expeditious logistics services (Frazelle, 2002). Briefly, order picking, which depends on intensive labor and pickers’ performance, remains crucial to the success of fulfilling customer orders with high service quality, regardless of the methods/technologies that are deployed in a warehouse.

Despite a variety of innovative automation technologies, such as autonomous mobile robots, have been increasingly introduced to carry out order fulfillment jointly with human pickers in warehouses (Banker, 2016; Tobe, 2018; Wang et al., 2022), “parts‐to‐picker” order fulfillment systems raise several new issues on humanity and safety. This creates a challenge on robot–human coordination. A certain number of robots that carry racks automatically from the storage area to the order picking area may be deployed to facilitate the picking tasks of human pickers. Poor robot–human coordination typically increases the number of robots that carry, queue and await handling by pickers in the order‐picking area. Note that a human picker's efficiency is not as controllable as that of a robot and may vary with time and psychophysical condition. Thus, an unexpected delay in picking items may inevitably occur if robots just continue to carry racks to her and are unable to respond to her performance. More seriously, recent reports have revealed a substantial rising injury rate in automated warehouses (Al Elew & Oh, 2020; Doll, 2020; Evans, 2020). For example, Amazon's warehouse injury rates have increased every year since 2016 after they began using robots in parts‐to‐picker order fulfillment. The serious injury rate in 2019 was 7.7 injuries per 100 employees, which was 33% higher than in 2016 and nearly double the corresponding recent industry standard (Evans, 2020). Order picking often requires pickers to perform repetitive tasks in awkward postures for a whole day, easily causing musculoskeletal disorders (Grosse et al., 2015; Lavender et al., 2012), which accounts for up to 33% of all injuries and illnesses in the United States in 2013 (Bureau of Labor Statistics, 2014).

Motivated by the aforementioned robot–human coordination issues in parts‐to‐picker order fulfillment practices, this paper aims to answer the following questions.

How should human pickers and coordinated robots react and adapt to each other to fulfill orders in a human‐friendly manner considering robot–human collaboration?

How does such a human‐friendly robot–human coordination scheme affect the performance of an order fulfillment system in a parts‐to‐picker order fulfillment center? Specifically, what are the critical factors and how do they influence the systems’ performance from humanity and safety perspectives?

To address the above research questions, we build a stochastic optimal control‐based model and develop a data‐driven solution approach. We propose a robot–picker‐coordinated order fulfillment mechanism under which the robot and picker efficiencies have the following characteristics: (i) The robot and picker efficiencies can be effectively controlled and coordinated if real‐time data concerning newly arriving racks are available and used. (ii) The robot efficiency heavily depends on real‐time data concerning newly arriving racks, whereas picker efficiency highly depends on the length of the queue of unprocessed racks. (iii) The picker‐to‐robot relative performance (RP) from the perspective of either efficiency or effectiveness during peak hours is less than that during either normal or off‐peak hours. The picker‐to‐robot RP is best in the “off‐peak” scenario. Moreover, we find that the robot efficiency is not the most important metric. Rather, human efficiency dominates the efficiency of a parts‐to‐picker order fulfillment system, particularly during peak hours. Accordingly, controlling the number of queued racks by slowing down their delivery by robots to the picking–packaging area is in fact a more efficient measure than slowing down the rack‐processing of a picker (in order to alleviate that picker's fatigue); this is especially prominent during a peak‐hour period. Last but not least, employing data that were provided by a well‐established e‐commerce company, our analyses demonstrate that a picker's accumulated fatigue can be reduced by over 50% at the expense of lowering picker efficiency by around 15% if the proposed robot–picker coordination system is applied.

The rest of this paper is organized as follows. Section 2 reviews the relevant literature to elucidate its contribution to the field of innovative logistics. Section 3 describes the problem and characterizes the proposed robot–picker‐coordinated order fulfillment system. Section 4 presents a proposed stochastic‐dynamic optimal control model. Section 5 develops a real‐time data‐driven approach to estimating and controlling the state variables of the proposed robot–picker‐coordinated order fulfillment system. Section 6 presents the computational results of a practice‐based real data analysis. Section 7 draws conclusions, provides managerial implications, and discusses future research. To enhance exposition, all technical proofs are put in an Online Appendix.

LITERATURE REVIEW

Recently, in the Industry 4.0 era, robotic technologies (such as Amazon robotics and Jingdong robots) that are used by e‐commerce companies (Luo & Choi, 2022) for parts‐to‐picker order fulfillment have emerged (Banker, 2016; Shi et al., 2021; Tam, 2014; Tobe, 2018). They have revolutionalized the way of order fulfillment operated in warehouses. Most of the extant literature on parts‐to‐picker order fulfillment systems focuses mainly on increasing the warehousing efficiency utilizing diverse methodologies. For example, Enright and Wurman (2011) identified several parts‐to‐picker order fulfillment‐related resource allocation problems, including order allocation and robot allocation problems. Recently, Boysen et al. (2017) proposed a mixed integer programming model to minimize the number of rack visits to a stationary picking station in a mobile robot‐based order‐picking problem. The authors solved the optimization problem by first decomposing the problem into two subproblems of rack sequencing (for a given order sequence) and order sequencing (for a given rack sequence). By contrast, Lamballais et al. (2017) proposed four queuing network models associated with various warehousing layouts and robot zoning strategies to analyze the performance of an automated storage and parts‐to‐picker system with the objectives of optimizing robot utilization, maximizing order throughput, and minimizing order cycle time. Different from the above optimization‐based analytical models, Bozer and Aldarondo (2018) utilized a simulation‐based method to evaluate and compare the performances of two types of parts‐to‐picker picking systems (called “miniload” and “Kiva” systems) in the aspects of expected throughputs and expected container retrieval times in order processing. Some of the subsequent literature, including Weidinger et al. (2018) and Yuan et al. (2019), aimed at exploring the problem of pod/rack storage, which is the antecedent of parts‐to‐picker order fulfillment, where the inventory of items is stored and spread over multiple mobile pods that are carried by robots moving between storage and picking (stowing) zones. More recently, Wang et al. (2022) utilized an “approximate dynamic programming” based branch‐and‐price approach to solve the optimal robot scheduling problem for parts‐to‐picker order fulfillment systems. Different from the above literature, which solely focuses on efficiency, Wang et al. (2022) incorporated fluctuations of the working states of human pickers into the model, and hence the human factor was explored. However, human safety was not yet examined by Wang et al. (2022). In addition, Boldrer et al. (2022) proposed a hierarchical framework to address the problem of multi‐robot navigation in human‐shared working environments. Even though the approach of Boldrer et al. (2022) does not focus on parts‐to‐picker order fulfillment systems, the authors’ idea of conceptualizing a safe and socially aware navigation in their proposed methodological framework is noteworthy and consistent with our research goal.

Despite remarkable advances made to increase efficiency in parts‐to‐picker order fulfillment, issues such as safety and harmony in the human–robot interactions remain under‐explored in relevant intralogistics operations and related areas (Azadeh et al., 2019; Chen et al., 2022; Enright & Wurman, 2011). As argued by Enright and Wurman (2011), systems optimization at the high level, considering dual objective functions for both workers and robots, remains challenging as these two objectives may not be compatible. Details of the operational features and challenges of parts‐to‐picker order fulfillment can also be found in Enright and Wurman (2011). Drawing from experimental results, Chen et al. (2022) further suggested the urgent necessity of developing novel robotic motion methods to ensure human safety for applications of human—robot coordination in warehouses. Note that Azadeh et al. (2019) comprehensively reviewed the literature on robotized and automated warehouse systems. They pointed out that issues related to human—machine interaction in automated warehousing environments have not been fully addressed.

Another stream of relevant literature concerns the association of human factors with order‐picking performance (Batt & Gallino, 2019; Grosse et al., 2015, 2017; Loske, 2022; de Vries et al., 2016). As argued by Grosse et al. (2017), human factors determine the performance of an order‐picking system, which is time‐consuming and labor‐intensive, but they are often ignored in management‐oriented research. Incorporating human factors into order‐picking models for intralogistics operations and warehouse management is, thus, indispensable—particularly with respect to production, operations, and logistics management (de Koster et al., 2007). Observe that the related literature can be divided into two categories: the first focuses on relevant aspects of human cognition (e.g., human learning) to improve productivity/efficiency, and the second cares about the welfare and health of workers. The corresponding literature reviews are detailed below.

Prior studies on cognition have investigated its impact on order‐picking performance. Some of them (Grosse & Glock, 2015) integrated the concept of human learning (characterized by learning curves) with mathematical models to study how pickers’ performance can be improved by learning from experience. de Vries et al. (2016) conducted a field experiment to examine three types of picker‐to‐parts order‐picking methods under two incentive schemes for pickers with different regulatory foci (“prevention focus” vs. “promotion focus”). The work of de Vries et al. (2016) is pioneering in “elaborately” aligning order‐picking methods, incentive mechanisms, and regulatory focus to bridge the gaps between organizational behavior theories and warehousing practices. Batt and Gallino (2019) empirically estimated the effects of several factors, including a picker's learning from the experience of walking and searching processes, on picking time for order fulfillment in online women's apparel retailing. One remarkable feature of their work is the empirical demonstration of the effects of pickers’ heterogeneous learning capabilities, characterized by different learning curves, on order‐picking performance. Rather recently, Loske (2022) combined parametric and nonparametric approaches to analyze how the interactions between humans, machines, and intelligent software impact human learning and perception of work characteristics in the transition to an automated order‐picking system. The author's empirical findings verified that the real‐time feedback provided by the order‐picking system can facilitate human learning by doing tasks in the perception–cognition–motor–action cycle, thus clarifying the need for a human‐centered work system design.

The literature on the welfare and health of order‐picking workers aims to develop integrated models that consider both ergonomic (such as human energy expenditure and fatigue) and economic (such as picking time) performances in order picking (Grosse et al., 2015, 2017). For example, Grosse et al. (2015) proposed a conceptual framework that considers human factors in four critical categories (viz., the “perceptual, mental, physical, and psychosocial”). The authors suggested that human factors can be incorporated into the planning models to improve the performance of order‐picking systems. More recently, Glock, Grosse, Abedinnia, and Emde (2019) proposed an integrated model to determine the optimal processing sequence for orders, pallet rotation, and picker routing to minimize total picking effort. The authors considered the spinal loads on pickers and consequent risks of injury. Similar efforts had been made previously (Glock, Grosse, Kim, et al., 2019; Larco et al., 2017).

Despite the fact that the idea of integrating human factors into order‐picking decision support systems is promising and necessary for human wellbeing, the above‐cited order‐picking studies on human factors are concerned mostly with manual order picking (picker‐to‐parts order picking) rather than parts‐to‐picker order‐picking systems, which are the focus of our study. To be specific, a parts‐to‐picker order‐picking system has the following operational features that differentiate it from a traditional picker‐to‐parts order‐picking system, which relies mainly on manpower.

First, a parts‐to‐picker ordering picking system relies substantially on the coordination between pickers and robots to fulfill customer orders. Mobile robots lift racks that store SKUs and transport them from the storage area to stationary pickers to facilitate subsequent picking and packaging tasks by those pickers. In such parts‐to‐picker picking environments, robots must move forward and backward between the storage area and stationary pickers, forming closed loops between the storage area and the locations of the coordinated pickers. Accordingly, the efficiency of order fulfillment in a parts‐to‐picker picking system is contingent jointly upon the coordination between robots and pickers in carrying out the two consecutive tasks of handling racks (by mobile robots) and packaging (by pickers). Nevertheless, human–robot coordination often generates coordinating complexity and uncertainty as a result of the difficulty of sharing information, communication, and mutual adaption by the coordinated dyad (Faraj & Xiao, 2006).

Second, from a managerial perspective, an environment that includes working robots/machines is likely to differ from the one that includes only human labor. Psychological effects (e.g., stress and strains) have attracted attention since computer‐integrated/computer‐aided/robot‐aided manufacturing technologies were introduced for factory automation (Karuppan & Schniederjans, 1995; Rosenthal, 1984). As argued by Rosenthal (1984), the most difficult problems in automating a factory are managerial rather than technical. Olsen and Tomlin (2020) further highlighted the issue of managing worker–machine interfaces when robotics and artificial intelligence were introduced to convert manual operations into lean, digitized, and highly automated operations. Therefore, the issues that are raised by working with robots/machines seem not to be limited to those around technology and productivity. Instead, in the post‐automation era, new psychological (Karuppan & Schniederjans, 1995), behavioral (Hinds, 2004), and organization‐behavioral (Barrett et al., 2012; Beane & Orlikowski, 2015) issues should be crucial for operations managers.

Considering the above research gaps in the literature on parts‐to‐picker order fulfillment and intralogistics operations, this work contributes to the field of intelligent logistics operations in the following three ways.

First, motivated by the sociotechnical theory (Cooper & Foster, 1971; Mumford, 2006), this work urges that the harmony of interrelations between human workers and robots in a parts‐to‐picker order fulfillment center should be considered in realizing intelligent logistics systems. In particular, this work promotes the development of adaptive human‐friendly robotics to assist pickers in fulfilling orders with the fewest human–robot coordination conflicts. On the way toward “perfect automation,” the coexistence of robots and humans in task environments is inevitable, requiring carefully planned mechanisms for agile robot—human coordination that are humanity‐oriented and machine‐assisted.

Second, the proposed methodology is novel in conceptualizing the aforementioned philosophy of machine‐assisted humanity orientation into modeling and analysis to address the issue of robot–human coordination for parts‐to‐picker order fulfillment systems. Methodologically, this work integrates a discrete‐time nonlinear dynamic stochastic model with a time‐varying fatigue accumulation function to characterize and estimate a human picker's fatigue‐dependent working state. Then, this work develops a stochastic optimal control‐based data‐driven solution approach, which permits not only estimating human pickers’ instantaneous performance and robot queue lengths but also feeding back the aforementioned human picker's fatigue‐dependent working states in real time to the coordinated robots to adjust their efficiencies in handling and distributing mobile racks to the picker. Such a real‐time data‐driven stochastic optimal control‐based approach has its unique features and relative merits in addressing the critical yet under‐explored issue of real‐time robot–human coordination for machine‐assisted order fulfillment systems.

Third, this work is innovative in the realization of ideas from occupational psychology (psychology), organizational behavior (sociology), and robotics (engineering) to address picker–robot coordination for intelligent logistics in contemporary operations (management). Psychological factors are considered in the development of a novel robot–human‐coordinated mechanism for use in an automated parts‐to‐picker order fulfillment system. Through normative analysis and real‐data‐based empirical studies, the above concept (i.e., machine‐assisted humanity orientation) and proposed robot–picker‐coordinated order fulfillment mechanism are confirmed to be theoretically reasonable, scientifically solid and demonstrated to be of practical value in resolving the issue of interest.

PROBLEM DESCRIPTION

Typically, with our discussions with managers from a well‐established e‐commerce company in mainland China and referring to the problem settings of Wang et al. (2022), the workflow of mobile robot‐based parts‐to‐picker order fulfillment operations investigated in this paper contains the following steps. First, SKUs are received in the receiving area of a warehouse/distribution center; meanwhile, mobile robots that carry empty racks are dispatched to load the received SKUs. This is followed by the step of storage assignment, where mobile robots move loaded racks to the storage area (Li et al., 2020). If the SKUs in the racks are required in an order processing list, then the mobile robots move the racks with those required SKUs from the storage area to the picking–packaging area (termed as the step of rack handling and distribution). Then, stationary pickers in the picking–packaging area pick out the SKUs that are identified in the picking list and package them into cardboard boxes to fulfill customer orders (termed as the step of picking and packaging). This step is followed by automatically moving the completed cardboard boxes by means of conveyers to the outbound area for vehicular loading and dispatching. Mobile robots that have completed the delivery of the racks with the required SKUs to the stationary pickers return from the picking area to the storage area for repeated execution of the step of rack handling and distribution, to facilitate pickers’ carrying out picking and packaging for order fulfillment. Similar mobile robot‐based parts‐to‐picker order fulfillment operations can also be readily found in numerous practical cases, including Amazon.com (Demaitre, 2019; Gharehgozli & Zaerpour, 2020; Garland, 2022).

This work focuses on coordinating the step of rack handling and distribution (executed by robots) as well as the step of picking and packaging (executed by pickers). Specifically, the performance of mobile robots in handling and moving racks (number of racks handled and moved per robot per time unit) significantly influences the associated picker's performance in the subsequent picking and packaging tasks. The effect of psychological and behavioral uncertainties, which may be evident in the interaction between pickers and robots, on an associated picker's performance is considered herein. Accordingly, the problem of interest and goals of this work are as follows.

Consider a few‐to‐one parts‐to‐picker picking system, in which a few mobile robots work collaboratively with a human picker. A set of C customer orders, involving S SKUs, and being stored in N racks must be handled and moved by M mobile robots from the storage area to the picking area. Then, they have to be picked and packaged by one associated picker in a given working period T (e.g., 3 hours) without the intervention of scheduled breaks. A shared storage (scattered storage) policy, which is typically applied to mobile robot‐based parts‐to‐picker picking systems (Weidinger et al., 2018), is adopted. Different SKUs may be stored in a rack, and each rack can be associated with one or more customer orders. Two goals must be achieved. The first is to complete the picking and packaging of the aforementioned S SKUs that are stored and scattered over R racks by smoothly coordinating the operations of M mobile robots and the associated picker during a working period T. This goal must be achieved under the condition that the picker's fatigue increment (

Δ ρ_{k}

) in any time interval k (

k = {1, 2, 3 … K}

;

T = K \cdot \bar{t}

) during

T

should not exceed a predetermined fatigue degree (

\bar{ρ}

), such that the accumulated fatigue degree (

ρ_{K}

) in the last time interval K does not exceed its upper bound (ρ_max), where

\bar{t}

is the length of a time interval. Meanwhile, in the few‐to‐one parts‐to‐picker picking system, the second goal is that the processed racks in the picking area are efficiently returned to the storage area by mobile robots for the next round of order processing.

Based on the above problem description and defined goals, this work proposes a real‐time robot–picker‐coordinated order fulfillment system that comprises the following four subsystems: (1) robot‐based rack handling (Subsystem 1), (2) mobile robot‐carried rack movement (Subsystem 2), (3) human‐based picking and packaging (Subsystem 3), and (4) robot‐based rack return (Subsystem 4), which are depicted in Figure 1. The number of newly arriving (

A_{k}

) racks that are loaded with SKUs from the receiving area to the storage area is treated as an exogenous variable, which is given in each time interval k (

\forall k

). Subsystems 2 and 4 are virtual as they are specified to determine the time‐varying distribution rates (

μ_{k}

and

ν_{k}

) of forward and reverse logistics for mobile racks that move between the storage (Subsystem 1) and picking–packaging (Subsystem 3) areas in any time interval k (

\forall k

). Subsystems 1 and 3 correspond to task operations that are executed by robots and associated pickers in the storage and picking–packaging areas, respectively. The outputs of Subsystems 1 and 3 are the number of racks (

X_{k}

) that are handled by mobile robots and the number of racks (

Y_{k}

) that are processed by a picker in each time interval k (

\forall k

). Both

X_{k}

and

Y_{k}

can be treated as metrics that are evaluated by contemporary detection technologies, such as image processing‐based detectors.

FIGURE 1

The proposed real‐time robot–picker‐coordinated order fulfillment system

The proposed robot–picker‐coordinated order fulfillment system has three distinctive features. First, the rack‐based order processing cycle, which integrates the forward and reverse logistical flows of mobile racks between the storage and picking–packaging areas, is considered. A closed‐loop rack reuse process facilitates the corresponding logistical resource management in the system. Second, the interactions between robots and associated pickers are considered, and they are characterized by the instantaneous inputs (

Q_{k}

and

Q_{k | k}

), which are defined as the perceived queue lengths of unprocessed racks at the beginning and end of time interval k, respectively, and output (

Y_{k}

, which is the number of racks that are processed by a picker in time interval k).

Q_{k}

and

Q_{k | k}

are given by:

\{\begin{matrix} Q_{k} = Q_{k - 1 | k - 1} + {\tilde{X}}_{k} \\ Q_{k | k} = Q_{k} - Y_{k} \end{matrix}, \forall k,

where

Q_{k - 1 | k - 1}

(

Q_{k - 1 | k - 1} = Q_{k - 1} - Y_{k - 1}

) represents the number of unprocessed racks in the queue in the picking–packaging area (Subsystem 3) at the end of time interval

k - 1

; and

{\tilde{X}}_{k}

represents the number of unprocessed racks that arrive at the picking–packaging area (Subsystem 3) in time interval k. Third, the picker's psychological and physical conditions, characterized by the accumulated fatigue degree (

ρ_{k}

) and its association with

Y_{k}

and

Q_{k}

, are considered and fed back to adjust the handling, transportation, and return decisions of the robots that are executed in Subsystems 1, 2, and 4, respectively. Hence, the picking and packaging tasks can be completed in a manner that includes critical human factors.

MODELING

This section proposes a discrete‐time nonlinear dynamic stochastic model to formulate and solve the above problem. Specifically, Section 4.1 defines the variables of system states (including state variables and control variables), and Section 4.2 explicates the proposed dynamic stochastic model.

Specification of system states

The state variables of Subsystem 3 are specified to define the expected performance of a picker in coordination with mobile robots. Then, the state variables for Subsystem 1 are specified to characterize the performance of the coordinated robots. Thereafter, the state variables of Subsystems 2 and 4 are specified to characterize the forward and reverse logistical flows of mobile racks that are required for coordination between Subsystems 1 and 3 (robots’ and human pickers’ actions).

States of human picker

Consider the states that are ideally associated with a picker's performance (Subsystem 3). Ideally, the picking and packaging tasks of a picker are carried out efficiently by seamlessly receiving and processing the loaded racks that are delivered by mobile robots. However, such an ideal state of Subsystem 3 is obtained at the cost of the picker's accumulated fatigue (

ρ_{k}

). Worker fatigue is multidimensional and often increases with time (Glock, Grosse, Kim, et al., 2019; Jaber et al., 2013), stress, and workload (Do et al., 2018; Kc & Terwiesch, 2009; MacDonald, 2003), particularly when a picker must perform repetitive picking and packaging tasks. Based on the learning–forgetting–fatigue–recovery models established in the ergonomics‐related literature (Glock, Grosse, Kim, et al., 2019; Jaber et al., 2013), the commonly used deterministic exponential form of accumulated fatigue is extended herein into a dynamic stochastic form. Specifically, the fatigue growth rate (

λ_{k}

) that determines the speed of fatigue accumulation is associated with the perceived queue length of unprocessed racks (

Q_{k}

) and work rate (

Y_{k}

). Therein,

Q_{k}

and

Y_{k}

can be treated, respectively, as the instantaneous mental and physical workloads that have been empirically verified to have an adverse impact on fatigue (Tan & Netessine, 2014). This model utilizes

Q_{k}

and

Y_{k}

to capture how both the instantaneous mental and physical workloads affect a picker's accumulated fatigue in any given time interval k (

\forall k

). Thus, the picker's accumulated fatigue (

ρ_{k}

) in time interval k (

\forall k

) varies with time and can be expressed in a recursive form given by:

ρ_{k} = ρ_{k - 1} + Δ ρ_{k}, \forall k,

where

ρ_{k - 1}

is the picker's accumulated fatigue in time interval

k - 1

, and

Δ ρ_{k}

represents the fatigue increment in time interval k and equals

1 - e^{- λ_{k}}

(

Δ ρ_{k} = 1 - e^{- λ_{k}}

). According to the empirical results of MacDonald (2003) from multiple regression analyses, both workloads (

Q_{k}

) and work rates (

Y_{k}

) have amplified impact on fatigue, inferring that

λ_{k}

can be characterized by

Q_{k}

and

Y_{k}

in a simple linear manner as:

λ_{k} = β_{0} + β_{1} Y_{k} + β_{2} Q_{k}, \forall k,

where β₀ (

β_{0} \geq 0

) is the intrinsic fatigue growth rate; β₁ (

β_{1} \geq 0

) and β₂ (

β_{2} \geq 0

) are the coefficients that capture the associations of

Y_{k}

and

Q_{k}

with

λ_{k}

, respectively. The following condition on the picker's fatigue that accumulates in any time interval (Equation 4) must hold to ensure the wellbeing of the human worker):

ρ_{k} \leq ρ_{\max}, \forall k .

The accumulated fatigue (

ρ_{k}

) in time interval k may have a negative ergonomic effect on a picker's work rate (

Y_{k + 1}

) in the next time interval

k + 1

(Jaber et al., 2013; MacDonald, 2003): a higher

ρ_{k}

often corresponds to a lower

Y_{k + 1}

. Similarly, as shown by Kc and Terwiesch (2009), the overwork that is associated with accumulated fatigue has been demonstrated to affect a worker's performance (e.g., service rate). Do et al. (2018) assumed that the service rate has a multiplicative form under the effect of overwork. With reference to the fatigue‐recovery model of Jaber et al. (2013) and the above literature, in this work, a simple negative exponential function (

f (ρ_{k})

f (ρ_{k}) \equiv e^{- ρ_{k}}

) is used to capture the moderating effect of

ρ_{k}

, which can be expressed in a simple multiplicative form as shown in Equation (5):

Y_{k + 1} = f (ρ_{k}) Y_{k} = e^{- ρ_{k}} Y_{k}, \forall k .

Accordingly, Equation (5) can be used to elucidate how the accumulated fatigue in any given time interval k influences a picker's instantaneous picking and packaging performance in the next time interval

k + 1

. Moreover, the system states and control variables of robots in response to a picker's fatigue‐dependent performance can also be determined.

In addition to the above concern about accumulated fatigue, the efficiency of picking and packaging tasks at each time interval is an important issue. Ideally, a picker processes all loaded racks as they arrive to the picking–packaging area from the storage area so that no racks remain in a queue in any time interval (

Q_{k} = Y_{k}

\forall k

). Let

γ_{k}

be the picker‐efficiency state, which is defined as

γ_{k} \equiv \frac{Y_{k}}{Q_{k}} = \frac{Y_{k}}{Q_{k - 1} + {\tilde{X}}_{k}}

(Equation 1), and

\bar{γ}

(

\bar{γ} = 1, \forall k

) be the ideal state of

γ_{k}

. Then,

γ_{k}

is expected to remain stable in each time interval (

γ_{k + 1} = γ_{k}

\forall k

) under the ideal condition. If it does so, then

γ_{k + 1} = γ_{k} = \bar{γ} = 1

in all time intervals (

\forall k

), including the initial time interval (

k = 0

). In practice,

γ_{k}

may change over time and exhibit stochastic features under the influence of other system states (e.g., accumulated fatigue

ρ_{k}

) and exogenous variables (e.g.,

A_{k}

\forall k

). According to Equation (5),

Y_{k + 1}

, which is one component of

γ_{k + 1}

, can be influenced by

f (ρ_{k})

owing to the moderating effect of accumulated fatigue. Given

Q_{k}

and

{\tilde{X}}_{k + 1}

, which are exogenous variables to this subsystem (Subsystem 3),

γ_{k + 1}

can be associated with

γ_{k}

in a simple deterministic form as

γ_{k + 1} = e^{- ρ_{k}} γ_{k}

to indicate the effect of accumulated fatigue on worker performance (Jaber et al., 2013). If the stochastic features of the state variables are further considered, the above deterministic form can be extended into a recursive form with a Gaussian white noise term (

w_{γ_{k}}

), as shown in Equation (6), to capture its dynamic and stochastic features:

\{\begin{matrix} γ_{k + 1} = e^{- ρ_{k}} γ_{k} + w_{γ_{k}}, \forall k (k > 0), \\ γ_{0} = \bar{γ} = 1, \end{matrix}

where

w_{γ_{k}}

is a Gaussian white noise term of

γ_{k}

. Then, the ideal state of

γ_{k}

(

\bar{γ}

) is

\bar{γ} = 1

(

\forall k

), which means that no racks are queuing in the picking–packaging area in any time interval.

States of mobile robots

In the ideal state, robots handle SKUs and racks in the storage area (Subsystem 1) in a way that maintains equal volumes of inbound and outbound rack flows of Subsystem 1 (

X_{k} = A_{k} + {\tilde{Y}}_{k}

\forall k

). Let

δ_{k}

be the robot‐efficiency rate, which is defined as

δ_{k} \equiv \frac{X_{k}}{A_{k} + {\tilde{Y}}_{k}}

, and

\bar{δ}

(

\bar{δ} = 1

\forall k

) be the ideal state of

δ_{k}

. Then,

δ_{k}

should remain the same in all time intervals under the ideal condition, such that

δ_{k + 1} = δ_{k} = \bar{δ} = 1

(

\forall k

). Despite the fact that fatigue accumulation does not apply to Subsystem 1, involving robots’ operations, a robot‐efficiency state (

δ_{k}

) can be influenced by the exogenous variables (e.g.,

A_{k}

\forall k

) and

{\tilde{Y}}_{k},

which is contingent upon the picker's performance. Like

γ_{k}

δ_{k}

has dynamic and stochastic features. Thus,

δ_{k}

can be expressed in the recursive form with a Gaussian white noise term (

w_{δ_{k}}

) as follows:¹

\{\begin{matrix} δ_{k + 1} = δ_{k} + w_{δ_{k}}, \forall k (k > 0), \\ δ_{0} = \bar{δ} = 1 . \end{matrix}

Control variables for forward and reverse logistics flows of racks

Subsystems 2 and 4 are designed to regulate the forward and reverse logistical distribution flows of racks between the storage area and picking–packaging area. Let

μ_{k}

and

ν_{k}

be the rack distribution rates that are associated with Subsystems 2 and 4 (Figure 1) and defined as

μ_{k} \equiv \frac{{\tilde{X}}_{k}}{X_{k}}

and

v_{k} \equiv \frac{{\tilde{Y}}_{k}}{Y_{k}}

, respectively. Then,

μ_{k}

and

ν_{k}

can be used as control variables to regulate the numbers of unprocessed (

{\tilde{X}}_{k}

) and processed (

{\tilde{Y}}_{k}

) racks that arrive at Subsystems 3 and 1 in each given time interval k by setting

{\tilde{X}}_{k} = u_{k} X_{k}

and

{\tilde{Y}}_{k} = ν_{k} Y_{k}

, respectively, such that the system states (

γ_{k}

and

δ_{k}

) that are associated with the human picker and robots become:

γ_{k} = \frac{Y_{k}}{Q_{k - 1} + μ_{k} X_{k}}, \forall k,

δ_{k} = \frac{X_{k}}{A_{k} + ν_{k} Y_{k}}, \forall k .

Based on the above defined efficiency‐related states (

γ_{k}

and

δ_{k}

) and control variables (

μ_{k}

and

ν_{k}

), and given the number of newly arriving racks (

A_{k}

) and the measured previous queue length of racks (

Q_{k - 1}

), the following analytical results (Theorem 4.1) regarding the outputs

{\hat{X}}_{k}

and

{\hat{Y}}_{k}

(estimates of instantaneous efficiency rates associated with coordinated robots and picker) from Subsystems 1 and 3, respectively, can be derived. Note that all technical proofs are provided in Online Appendix A. Theorem 4.1

Given

γ_{k}

δ_{k}

μ_{k}

ν_{k}

A_{k}

Q_{k - 1}

and

Ω_{k} \equiv δ_{k} μ_{k} ν_{k} γ_{k}

, the estimates of instantaneous projections of

X_{k}

and

Y_{k}

(denoted by

{\hat{X}}_{k}

and

{\hat{Y}}_{k}

) are obtained as:

{\hat{X}}_{k} = \frac{1}{1 - Ω_{k}} (δ_{k} A_{k} + \frac{Ω_{k} Q_{k - 1}}{μ_{k}}), \forall k,

{\hat{Y}}_{k} = \frac{1}{1 - Ω_{k}} (\frac{Ω_{k} A_{k}}{ν_{k}} + γ_{k} Q_{k - 1}), \forall k .

In Equations (10) and (11) of Theorem 4.1,

Ω_{k}

is the product of the subsystem states (

γ_{k}

δ_{k}

) and control variables (

μ_{k}

ν_{k}

);

A_{k}

can be treated as an exogenous variable of the system; and

Q_{k - 1}

refers to queuing racks in the previous time interval

k - 1

. Both

A_{k}

and

Q_{k - 1}

are predetermined; however,

γ_{k}

δ_{k}

μ_{k}

, and

ν_{k}

(

\forall k

) are unknown and must be estimated (as detailed in Section 4.2).

Based on Theorem 4.1, Corollary 4.1 is also used to characterize and estimate the time‐varying fatigue growth rate (

λ_{k}

) and fatigue increment (

Δ ρ_{k}

) in any time interval k (

\forall k

). Corollary 4.1

Given Theorem 4.1 , estimates of

λ_{k}

and

Δ ρ_{k}

(denoted by

{\hat{λ}}_{k}

and

Δ {\hat{ρ}}_{k}

) are obtained as:

{\hat{λ}}_{k} = β_{0} + (β_{1} γ_{k} + β_{2}) (\frac{δ_{k} μ_{k} A_{k} + Q_{k - 1}}{1 - Ω_{k}}), \forall k,

Δ {\hat{ρ}}_{k} = 1 - e^{- (β_{0} + (β_{1} γ_{k} + β_{2}) (\frac{δ_{k} μ_{k} A_{k} + Q_{k - 1}}{1 - Ω_{k}}))}, \forall k .

Based on Corollary 4.1, substituting Equation (13) into Equation (2) yields

ρ_{k} = ρ_{k - 1} + 1 - e^{- (β_{0} + (β_{1} γ_{k} + β_{2}) (\frac{δ_{k} μ_{k} A_{k} + Q_{k - 1}}{1 - Ω_{k}}))}, \forall k .

Using the correlations between the system states and control variables specified above, the human picker and robots can be readily coordinated for superior performance in an order fulfillment center once the optimal values of the state and control variables can be determined. It is noted that the proposed model is data‐driven in which the input data required (including

Y_{k}

and

A_{k}

) are collected in each time interval k (

\forall k

) for estimating efficiency‐related states (

γ_{k}

and

δ_{k}

) and control variables (

μ_{k}

and

ν_{k}

) such that the state variables (

γ_{k}

and

δ_{k}

) can move toward the associated ideal values. Therein, the data used are collected in each time interval, which should be expressed in a discrete‐time manner rather than a continuous time manner for dynamically estimating and controlling state variables. Thus, the problem is formulated in a discrete‐time rather than a continuous‐time manner as presented in the next Section 4.2.

Model

This subsection proposes a discrete‐time nonlinear dynamic stochastic model that is characterized by recursive and measurement equations subject to boundary constraints (Anderson, Jr. et al., 2006; Santina et al., 1994; Sheu, 2002). Specifically, the time‐varying relationships among state variables, control variables, and measurement data defined in the previous section (Section 3), can be characterized in three forms: (1) recursive equations, (2) measurement equations, and (3) boundary constraints, as follows.

The recursive equations specify time‐varying relationships between the states of a dynamic and stochastic system in the next and current time intervals (

k + 1

and k,

\forall k

) . Among such system states,

γ_{k}

, and

δ_{k}

are treated as independent states, whereas

ρ_{k}

is a state that depends on

γ_{k}

and

δ_{k}

, according to Corollary 4.1 and Equation (14). In the proposed model, only the time‐varying relationships of independent states (

γ_{k}

, and

δ_{k}

) must be formulated recursively; then, the dependent state

ρ_{k}

can be readily determined. Based on Equations (6) and (7), the recursive equations that are associated with

γ_{k}

and

δ_{k}

can be expressed in a generalized vector form as in Equation (15):

Φ_{k + 1} = F_{k} + W_{k}, \forall k,

where

Φ_{k + 1}

F_{k}

and

W_{k}

are 2 × 1 time‐varying state vectors and can be further expressed as:

Φ_{k + 1} = [\begin{matrix} γ_{k + 1} \\ δ_{k + 1} \end{matrix}], \forall k,

F_{k} = [\begin{matrix} e^{- ρ_{k}} γ_{k} \\ δ_{k} \end{matrix}], \forall k,

W_{k} = [\begin{matrix} w_{γ_{k}} \\ w_{δ_{k}} \end{matrix}], \forall k,

where

ρ_{k} = ρ_{k - 1} + 1 - e^{- (β_{0} + (β_{1} γ_{k} + β_{2}) (\frac{δ_{k} μ_{k} A_{k} + Q_{k - 1}}{1 - Ω_{k}}))}

as in Equation (14) of Corollary 4.1.

Accordingly, the recursive equations (Equations 15 –18) reveal that the system states

γ_{k}

and

δ_{k}

change over time, following the standard Gaussian–Markov processes and can be used to project

γ_{k + 1}

, and

δ_{k + 1}

one‐time interval ahead, based on

γ_{k}

and

δ_{k}

in any time interval k(

\forall k

The measurement equations characterize time‐varying relationships between measurements and system states. In this work,

Y_{k}

(the number of mobile racks that are processed by a picker in each given time interval k) is used as the measurement variable because it can be detected/measured readily using various detection technologies (such as image processing and automated counters). Based on Equation (11) in Theorem 4.1, we can further postulate the deterministic form of time‐varying relationships between the measurement (

Y_{k}

) and system states as

Y_{k} = \frac{1}{1 - Ω_{k}} (\frac{Ω_{k} A_{k}}{ν_{k}} + γ_{k} Q_{k - 1})

. Owing to the measurement error in

Y_{k}

in any given time interval k (

\forall k

), a Gaussian white noise term

ε_{k}

is used to characterize this property. Since only one measurement variable (

Y_{k}

) is involved, one measurement equation is associated with

Y_{k}

in the proposed model. The generalized form of the measurement equation is given by:

Y_{k} = H_{k} + ε_{k}, \forall k,

where

Y_{k}

H_{k}

and

ε_{k}

are 1 × 1 time‐varying state vectors, which can be further expressed as:

Y_{k} = Y_{k}, \forall k,

H_{k} = [\frac{1}{1 - Ω_{k}} (\frac{Ω_{k} A_{k}}{ν_{k}} + γ_{k} Q_{k - 1})], \forall k,

ε_{k} = ε_{k}, \forall k .

Additionally, the picker's accumulated fatigue (

ρ_{k}

\forall k

) in any time interval should not exceed a preset upper bound (

ρ_{\max}

), as indicated in Equation (4). This constraint is incorporated into the proposed model.

REAL‐TIME DATA‐DRIVEN STATE ESTIMATION

This section presents a stochastic optimal control‐based method for the real‐time estimation of system states (

γ_{k}

and

δ_{k}

) and control variables (

μ_{k}

and

ν_{k}

), given information about the number of newly arriving racks (

A_{k}

) and the number of racks processed by the picker,

Y_{k}

, in each time interval (

\forall k

). Based on the principles of optimality that are applied in stochastic optimal control theory (Santina et al., 1994), the proposed algorithm searches for the optimal solutions for

γ_{k}

δ_{k}

μ_{k}

and

ν_{k},

which are updated using measurements of

Y_{k}

and the measurement equation (Equation 19) in each time interval k (

\forall k

) and then fed back as inputs into the recursive equation (Equation 15) such that the objective function ξ (Equation 23) is minimized:

\begin{matrix} ξ & = & \min E \{\sum_{k = 1}^{K} {({\hat{Φ}}_{k} - Φ_{k}^{*})}^{T} Ψ_{k}^{Φ} ({\hat{Φ}}_{k} - Φ_{k}^{*}) \\ + {({\hat{U}}_{k} - U_{k}^{*})}^{T} Ψ_{k}^{U} ({\hat{U}}_{k} - U_{k}^{*})\}, \end{matrix}

where

{\hat{Φ}}_{k}

and

{\hat{U}}_{k}

are 2 × 1 vectors that contain the estimated system states (

γ_{k}

and

δ_{k}

) and estimated control variables (

μ_{k}

and

ν_{k}

) in time interval k, respectively;

Φ_{k}^{*}

and

U_{k}^{*}

are 2 × 1 vectors that contain the ideal values of system states and control variables that are associated with

{\hat{Φ}}_{k}

and

{\hat{U}}_{k}

, respectively; and

Ψ_{k}^{Φ}

and

Ψ_{k}^{U}

represent 2 × 2 time‐varying diagonal, positive‐definite weighting matrixes that are associated with

{\hat{Φ}}_{k}

and

{\hat{U}}_{k}

, respectively. The objective function ξ in Equation (23) is the cost function, which is a scalar quadratic performance measure that indicates the deviation of the estimated system states and control variables from their ideal values in the order processing period.

To estimate the aforementioned system states and control variables in real time, a stochastic optimal control‐based method is developed using an extended Kalman filter. Kalman filtering is a well‐known statistical method for the linear‐quadratic estimation of system states of dynamic and stochastic systems. It has been successfully and extensively utilized in solving diverse estimation and control problems in various fields, such as the tracking and navigation of different sorts of vehicles and traffic signal control (Sheu, 2002). Whereas the basic Kalman filter applies to linear problems, the extended Kalman filter has been developed particularly for nonlinear problems that are characterized by either recursive or measurement equations in dynamic and stochastic models (Santina et al., 1994). The primary computational steps in the proposed estimation method are summarized below.

Initialization

This step initializes system states and all of the inputs that are required to trigger the subsequent computational steps. Let

k = 0

and

Δ Φ_{k} \equiv Φ_{k} - {\hat{Φ}}_{k}

(

\forall k

). Then, preset the initial estimates of the 2 × 1 state vector (

{\hat{Φ}}_{0}

), the 2 × 1 control variable vector (

{\hat{U}}_{0}

), the number of queued racks (Q ₀), and the 2 × 2 covariance matrix of the state estimation error (

{\hat{P}}_{0 | 0}

), where

Q_{0} = 0

;

{\hat{Φ}}_{0}

{\hat{U}}_{0}

and

{\hat{P}}_{0 | 0}

are given by:

\{\begin{matrix} {\hat{Φ}}_{0} \equiv [\begin{matrix} {\hat{γ}}_{0} \\ {\hat{δ}}_{0} \end{matrix}] = [\begin{matrix} \bar{γ} \\ \bar{δ} \end{matrix}] \\ {\hat{U}}_{0} \equiv [\begin{matrix} {\hat{μ}}_{0} \\ {\hat{ν}}_{0} \end{matrix}] = [\begin{matrix} 1 \\ 1 \end{matrix}] \end{matrix} (when k = 0),

{\hat{P}}_{0 | 0} \equiv E [Δ Φ_{0} Δ Φ_{0}^{T}] = P_{0}, (where k = 0) .

Prior prediction of system states

In this step, the state vector

Φ_{k + 1}

and covariance matrix of the state estimation error

P_{k + 1}

in time interval

k + 1

are predicted using the estimates of system states and measurements made in time interval k (

\forall k

). The prior prediction of

Φ_{k + 1}

(

{\hat{Φ}}_{k + 1 | k}

) will be updated for the linear minimum mean square (LMMS) estimation of

Φ_{k + 1}

using the extended Kalman filter in the next stage; and the prior prediction of

P_{k + 1}

(

{\hat{P}}_{k + 1 | k}

) is a prerequisite for the aforementioned state estimation. Using Equation (15), Corollaries 5.1 and 5.2 are provided to facilitate the prior predictions of

{\hat{Φ}}_{k + 1 | k}

and

{\hat{P}}_{k + 1 | k}

, respectively. Corollary 5.1

Given Equation ( 15 ) (recursive equation) and its characteristics, let

{\hat{F}}_{k} \equiv [\begin{matrix} e^{- {\hat{ρ}}_{k}} {\hat{γ}}_{k} \\ {\hat{δ}}_{k} \end{matrix}]

, where

{\hat{γ}}_{k}

and

{\hat{δ}}_{k}

are the LMMS estimates of

γ_{k}

and

δ_{k}

that are based on the data regarding

A_{k}

and

Y_{k}

collected in time interval k . Then, the prior prediction of the LMMS estimate of

Φ_{k + 1}

(

{\hat{Φ}}_{k + 1 | k}

) that is based on the data regarding

A_{k}

and

Y_{k}

that were collected in time interval k is given by:

{\hat{Φ}}_{k + 1 | k} = {\hat{F}}_{k}, \forall k .

The prior prediction (

{\hat{P}}_{k + 1 | k}

) of the covariance matrix of the state estimation error

P_{k + 1}

as well as

{\hat{Φ}}_{k + 1 | k}

must be determined in this stage. Using Equations (15) and (26) (Corollary 5.1), Corollary 5.2 is obtained to determine

{\hat{P}}_{k + 1 | k}

. Corollary 5.2

Given Equation ( 15 ) (the recursive equations) and Corollary 5.1 , let

{\hat{F}}^{'}_{k} = {\frac{\partial F_{k}}{\partial Φ_{k}} |}_{Φ_{k} = {\hat{Φ}}_{k}}

based on the data regarding

A_{k}

and

Y_{k}

that were collected in time interval k, and let

R_{k} \equiv E [W_{k} W_{k}^{T}]

. Then, the prior prediction of

P_{k + 1}

(

{\hat{P}}_{k + 1 | k}

) that is based on the data regarding

A_{k}

and

Y_{k}

that were collected in time interval k is given by:

{\hat{P}}_{k + 1 | k} = {\hat{F}}^{'}_{k} {\hat{P}}_{k | k} {\hat{F}}^{' T}_{k} + R_{k}, \forall k .

Correction of prior predictions

In this stage, the prior predictions

{\hat{Φ}}_{k + 1 | k}

and

{\hat{P}}_{k + 1 | k}

are corrected using the Kalman gain (

G_{k + 1}

) and the measurements made through time interval

k + 1

(

\forall k

). To achieve this purpose, Corollary 5.3, which associates the corrected state vector (

{\hat{Φ}}_{k + 1}

) with its prior prediction (

{\hat{Φ}}_{k + 1 | k}

), the Kalman gain (

G_{k + 1}

) and the measurements made through time interval

k + 1

(

\forall k

), is provided. Corollary 5.3

Given Equations ( 5 ) (fatigue prediction function) and ( 19 ) (measurement equation), let

Δ y_{k + 1 | k}

be the measurement residual in time interval

k + 1

, defined as

Δ y_{k + 1 | k} \equiv Y_{k + 1} - {\hat{Y}}_{k + 1 | k} = Y_{k + 1} - e^{- {\hat{ρ}}_{k}} {\hat{H}}_{k}

(

\forall k

), where

{\hat{H}}_{k} = [\frac{1}{1 - {\hat{Ω}}_{k}} (\frac{{\hat{Ω}}_{k} A_{k}}{{\hat{ν}}_{k}} + {\hat{γ}}_{k} Q_{k - 1})]

(by Equation 21 ), and let

Δ Y_{k}

be a

k \times 1

residual vector of the measurement residuals through time interval k that is defined as

Δ Y_{k} \equiv [\begin{matrix} Δ y_{1 | 0} \\ Δ y_{2 | 1} \\ ⋮ \\ Δ y_{k | k - 1} \end{matrix}]

. Then, the LMMS estimate of

Φ_{k + 1}

(

{\hat{Φ}}_{k + 1}

), which is based on the data that were collected through time interval

k + 1

is given by:

{\hat{Φ}}_{k + 1} = {\hat{Φ}}_{k + 1 | k} + G_{k + 1} \cdot Δ y_{k + 1 | k} + m_{Φ}, \forall k,

where m _Φ is the mean of

Φ_{k + 1}

, and

G_{k + 1}

is the Kalman gain, which is defined as:

G_{k + 1} \equiv E [Φ_{k + 1} Δ y_{k + 1 | k}^{T}] \cdot {\{E [Δ y_{k + 1 | k} Δ y_{k + 1 | k}^{T}]\}}^{- 1}, \forall k .

As seen in Equations (28) and (29) of Corollary 5.3, the LMMS estimate of

Φ_{k + 1}

(

{\hat{Φ}}_{k + 1}

) given the measurements made through time interval

k + 1

can be derived following the determination of the Kalman gain

G_{k + 1}

as in Equation (29). Therefore, Corollary 5.4 is provided to determine the Kalman gain

G_{k + 1}

. Corollary 5.4

Given Corollaries 5.2 and 5.3, and Equation ( 19 ) (measurement equation), let

{\hat{H}}^{'}_{k + 1} = {\frac{\partial H_{k + 1}}{\partial Φ_{k + 1}} |}_{Φ_{k + 1} = {\hat{Φ}}_{k + 1 | k}}

and

Λ_{k + 1} \equiv E [ε_{k} ε_{k}^{T}]

. Then, the Kalman gain

G_{k + 1}

is given by:

\begin{matrix} G_{k + 1} = {\hat{P}}_{k + 1 | k} {\hat{H}}^{' T}_{k + 1} {({\hat{H}}^{'}_{k + 1} {\hat{P}}_{k + 1 | k} {\hat{H}}^{' T}_{k + 1} + Λ_{k + 1})}^{- 1}, \forall k . \end{matrix}

Based on Corollaries 5.3 and 5.4, the corrected state vector

{\hat{Φ}}_{k + 1}

can be obtained using Equations (28) and (30). Alternatively,

{\hat{Φ}}_{k + 1}

can also be expressed as:

\begin{matrix} {\hat{Φ}}_{k + 1} & = & {\hat{Φ}}_{k + 1 | k} + {\hat{P}}_{k + 1 | k} {\hat{H}}^{' T}_{k + 1} {({\hat{H}}^{'}_{k + 1} {\hat{P}}_{k + 1 | k} {\hat{H}}^{' T}_{k + 1} + Λ_{k + 1})}^{- 1} \\ \times Δ y_{k + 1 | k} + m_{Φ}, \forall k . \end{matrix}

Finally, the recursive equation for correcting the prior prediction of the covariance matrix of state estimation error

{\hat{P}}_{k + 1 | k}

into

{\hat{P}}_{k + 1 | k + 1}

must be derived for the recursive estimation of system states in the next time interval

k + 1

. To achieve this purpose, Corollary 5.5 is provided. Corollary 5.5

Given Corollaries 5.2 to 5.4, let the corrected covariance matrix of state estimation error

{\hat{P}}_{k + 1 | k + 1}

be defined as

{\hat{P}}_{k + 1 | k + 1} \equiv E [Δ Φ_{k + 1 | k + 1} Δ Φ_{k + 1 | k + 1}^{T}]

, where

Δ Φ_{k + 1 | k + 1} \equiv Φ_{k + 1} - {\hat{Φ}}_{k + 1}

. Then,

{\hat{P}}_{k + 1 | k + 1}

is given by:

{\hat{P}}_{k + 1 | k + 1} = (I - G_{k + 1} {\hat{H}}^{'}_{k + 1}) {\hat{P}}_{k + 1 | k} + m_{Φ} m_{Φ}^{T}, \forall k .

The result of Corollary 5.5 in (32) is important for the estimation of the control variable vector as shown in the next subsection.

Estimation of control variable vector

Using the corrected state vector

{\hat{Φ}}_{k + 1}

(Equation 31) and the covariance matrix of the state estimation error

{\hat{P}}_{k + 1 | k + 1}

(Equation 32), the control variable vector

{\hat{U}}_{k + 1}

(

{\hat{U}}_{k + 1} \equiv [\begin{matrix} {\hat{μ}}_{k + 1} \\ {\hat{ν}}_{k + 1} \end{matrix}]

\forall k

) can be estimated. The principles of stochastic optimal control theory and existing algorithms (Santina et al., 1994; Sheu, 2002) are applied here to estimate

{\hat{U}}_{k + 1}

. Consistent with the principles of stochastic optimal control, the corrected state vector

{\hat{Φ}}_{k + 1}

is fed back through the optimal control gain matrix

Θ_{k + 1}

to determine the control variable vector

{\hat{U}}_{k + 1}

{\hat{U}}_{k + 1} = - Θ_{k + 1} {\hat{Φ}}_{k + 1} + η_{k + 1}, \forall k .

In Equation (33),

Θ_{k + 1}

and

η_{k + 1}

are given by:

\begin{matrix} Θ_{k + 1} = {[B_{k + 1}^{T} Γ_{k + 2} B_{k + 1} + Ψ_{k + 1}^{U}]}^{- 1} B_{k + 1}^{T} Γ_{k + 2} {\hat{F^{'}}}_{k + 1}, \forall k, \end{matrix}

\begin{matrix} η_{k + 1} & = & {[B_{k + 1}^{T} Γ_{k + 2} B_{k + 1} + Ψ_{k + 1}^{U}]}^{- 1} \\ \times [B_{k + 1} Ψ_{k + 1}^{Φ} Φ_{k + 1}^{*} + Ψ_{k + 1}^{U} U_{k + 1}^{*}], \forall k, \end{matrix}

where

B_{k + 1} = {\frac{\partial F_{k + 1}}{\partial U_{k + 1}} |}_{U_{k + 1} = {\hat{U}}_{k | k}}

(

\forall k

), and matrix

Γ_{k + 2}

should satisfy the Riccati equation as follows:

\begin{matrix} Γ_{k + 1} = Ψ_{k + 1}^{Φ} + {F^{'}}_{k + 1}^{T} Γ_{k + 2} {F^{'}}_{k + 1} - {F^{'}}_{k + 1}^{T} Γ_{k + 2} B_{k + 1} Θ_{k + 1}, \forall k . \end{matrix}

All of the system states that are characterized by the corrected state vector (

{\hat{Φ}}_{k + 1}

) and the control variable vector (

{\hat{U}}_{k + 1}

) in time interval

k + 1

can be estimated using the aforementioned approach and the data that are collected through time interval

k + 1

Utilizing the estimates of

{\hat{Φ}}_{k + 1}

and

{\hat{U}}_{k + 1}

, the state‐dependent variables, including

{\hat{X}}_{k + 1}

and

{\hat{Y}}_{k + 1}

(by Theorem 4.1), together with

Q_{k + 1}

(by Equation 1), can be updated and used as indices of the instantaneous performance of the system in terms of the efficiencies of the coordinated robots and the picker. Moreover, a picker's fatigue degree (

{\hat{ρ}}_{k + 1}

) accumulated to time interval

k + 1

can be estimated (by Corollary 4.1 and Equation 14) in real time to determine whether it satisfies the boundary constraint (Equation 4) that protects the safety of the workers.

The proposed data‐driven real‐time state estimation‐control approach can be coded in any appropriate computer programming language. This can not only facilitate the recursive calculation and optimal control of system states but also monitor system performance in real time. Figure 2 summarizes the primary algorithmic computational procedures based on the proposed approach.

FIGURE 2

Algorithmic procedures for data‐driven real‐time state estimation‐control

REAL‐PRACTICE‐BASED ANALYSIS

To demonstrate the validity and applicability of the proposed real‐time data driven state estimation and control approach for robot–picker‐coordinated order fulfillment, data were collected from a dominant e‐commerce company in mainland China that uses robots for parts‐to‐picker order fulfillment. The data were obtained from a warehouse of that company, which is a 2500m² robot‐carried mobile‐rack warehouse that stores small cosmetic items, in which 11 human pickers work with mobile robots to pick out customer orders. The raw data provide details about the pickers’ picking and packaging tasks at the study site over 6 days in March 2019. Data concerning the number of racks (

A_{k}

) that carry newly arriving SKUs to the storage area and the number of racks (

Y_{k}

) that are processed by each picker in the picking–packaging area at each time interval are used.

A total of 21 datasets that contain a total of 756 data points are generated from the collected raw data to facilitate real‐time state estimation and control in this empirical study. Each dataset contains 18 pairs of data points

A_{k}

and

Y_{k}

, measured every 10 min in a 3‐hour (3‐h) task period. The data points

A_{k}

are inputs to and

Y_{k}

are the measurements for the proposed approach. The generated 21 datasets are classified into three groups, associated with peak‐hour, normal, and off‐peak operational scenarios to demonstrate the applicability of the proposed approach to diverse operational scenarios in order fulfillment centers. Specifically, seven, eight, and six datasets are obtained for Scenarios I (peak hour), II (normal), and III (off‐peak), respectively. Using the 21 datasets, the robot–picker‐coordinated order fulfillment performance and accumulated fatigue of the pickers in peak‐hour, normal, and off‐peak operational scenarios (Scenarios I, II, and III) are comparatively analyzed.

Given the aforementioned 21 datasets (756 data points), the real‐practice‐based empirical analysis is carried out through two phases to demonstrate the relative performance (RP) of the proposed robot–picker coordination mechanism for parts‐to‐picker order fulfillment.

The first phase of the empirical analysis aims to compare the results generated using the proposed approach with the measurements gained from the 21 datasets in the aspects of picker efficiency (the average of

Y_{k}

values) and accumulated fatigue (the average of ρ _K values). The comparative results are presented in Figure 3, which help assess the RP of the proposed approach against the current parts‐to‐picker order fulfillment operations of the focal firm in the aspects of picker efficiency and fatigue alleviation, respectively.

FIGURE 3

Empirical results of phase I (robot–picker coordination relative to real operations). Scenarios: I, II, and III represent “peak,” “normal,” and “off‐peak” scenarios; Indexes: a and b represent the scenarios “with” and “without” considering robot–picker coordination; ARP, average of relative performance; RP, relative performance

Overall, the empirical results yielded in the first phase provide the following findings as summarized in Observation 6.1. Observation 6.1

The proposed robot–picker coordination system permits alleviating a picker's fatigue in order fulfillment without much influence on picking efficiency, compared with the current performance of the focal firm in the study case—As can be seen in Figure 3, a picker's accumulated fatigue can be reduced by 53.74% if the proposed robot–picker coordination system is applied in the focal firm of the study case. Such a fatigue alleviation, however, is carried out at the expense of lowering picker efficiency by 14.79%. Nevertheless, the above observation is encouraging for the applicability of the proposed method in practical cases of parts‐to‐picker order fulfillment systems as it implies that a more harmonious and enjoyable robot–human co‐working environment can be created to facilitate the development of intelligent and human‐friendly robot–human‐coordinated order fulfillment systems.

In the second phase of the empirical analysis, this work aims to demonstrate the RP of robot–picker coordination in contrast with the case without considering the robot–picker coordination for gaining more managerial insights. Therein, empirical results that are obtained with and without consideration of the robot–picker coordination mechanism are compared. Without robot–picker coordination, both control variables

μ_{k}

and

ν_{k}

are set to 1 (

μ_{k} = ν_{k} = 1

) to mimic the current parts‐to‐picker order fulfillment operations of the focal firm of the case studied.

Figure 4 plots the empirical results concerning practice‐based empirical analysis, including the average values of scenario‐based state variables, control variables, system performance, and associated RP, as determined by comparing the outputs with and without robot–picker coordination. The findings and managerial implications of Figure 4 are discussed below.

FIGURE 4

Empirical results of phase II (robot–picker coordination performance). Scenarios: I, II, and III represent “peak,” “normal,” and “off‐peak” scenarios; indexes: a and b represent the scenarios “with” and “without” considering robot–picker coordination; RP, relative performance

Observation 6.2

The proposed robot–picker coordination system outperforms the system without coordination not only in efficiency but also the fatigue of human pickers—As seen in Figure 4, the RP values that are associated with

γ_{k}

in all three scenarios (peak‐hour, normal, and off‐peak scenarios) are positive, indicating that the pickers’ working performance with robot–picker coordination is more efficient than without coordination. This higher efficiency is attributed to shorter queue racks (

Q_{k}

), owing to adjustments of the number of mobile racks that arrive at the picking and packaging area in each time interval based on estimates of the control variable μ _k in real time using the proposed approach. Additionally, the values of accumulated fatigue ( ρ _k) that are found in all three scenarios with robot–picker coordination are lower than the scenarios without coordination. This observation is highly encouraging as it implies that customer orders can be efficiently fulfilled without the need for pickers to expend more energy and make more efforts while operating under the proposed robot–picker coordination, relative to the situation without it.

Observation 6.3

Robot–picker coordination improves system performance more in the “normal” scenario (Scenario II) than in the “peak‐hour” and “off‐peak” scenarios (Scenarios I and III)—This inference is drawn by comparing the RP values, particularly those associated with

Q_{k}

and

ρ_{k}

in the three scenarios. The “minus” signs of the RP values that are associated with

Q_{k}

and

ρ_{k}

imply that the proposed robot–picker coordination system alleviates pickers’ accumulated fatigue (

ρ_{k} ↓

) by reducing the number of racks that queue (

Q_{k} ↓

) in the picking–packaging area. Such an effect of robot–picker coordination on either

Q_{k}

ρ_{k}

is highly significant under normal conditions (Scenario II) and is reasonable as the length of queues of racks (

Q_{k}

) a key factor that is related to the stress and workloads of pickers as captured by Equation (3) in the proposed model.

Observation 6.4

(Human‐friendly robot behavior for coordination with human pickers in different operational scenarios). Under the proposed robot–picker coordination scheme, robots can exhibit different coordination behaviors in response to the instantaneous performance of coordinated pickers in different scenarios, as follows.

Under peak‐hour conditions (Scenario I)—Robots tend to handle racks with high efficiency (by increasing

X_{k}

value) in the storage area, and this efficiency almost matches the coordinated picker's efficiency in processing racks (

X_{k} \approx Y_{k}

). Moreover, robots tend to stabilize the volume of racks that are delivered to the coordinated pickers (by stabilizing

μ_{k}

value); meanwhile, the rack volume that is returned to the storage area is adjusted (by reducing

ν_{k}

value) based on the coordinated pickers’ instantaneous efficiency (by estimating

Y_{k}

value). Consequently, the accumulated number of queuing racks in the picking—packaging area can be well controlled (by reducing the

Q_{k}

value) to mitigate the accumulated stress and fatigue of coordinated human pickers during peak hours.

Under normal conditions (Scenario II)—Robots adjust their efficiency of handling racks in the storage area to keep the number of racks that are moved to the picking–packaging area almost equal to the number of racks that are processed by the coordinated pickers (

X_{k} \approx Y_{k}

) per unit time such that the accumulated number of racks that queue in the picking–packaging area remains stable. (

Q_{k}

remains approximately double the value of

Y_{k}

in this case). Meanwhile, the volumes of mobile racks (carried by robots) that move between the storage and picking–packaging areas are controlled to be almost equal to each other (

μ_{k} \approx ν_{k}

Under off‐peak conditions (Scenario III)—Robots handle racks in the storage area more slowly (by reducing

X_{k}

value) to keep their efficiency a little lower than the coordinated pickers’ efficiency in processing racks (

X_{k} \leq Y_{k}

). Meanwhile, the volumes of racks that are moved between the storage area and the picking–packaging area are adjusted in response to the coordinated picker's processing of racks. Specifically, robots tend to move racks forward fast (by increasing

μ_{k}

value) and to return racks from the picking–packaging area more slowly (by reducing

ν_{k}

value) during off‐peak hours.

Three empirical examples, based on three selected datasets, are further used to present graphically the aforementioned characteristics of robot–picker coordination behavior that is typically exhibited during “peak,” “normal,” and “off‐peak” hours as shown in Figures 5 to 7. Briefly, the coordinated robots adjust their efficiency (

X_{k}

) in handling racks in the storage area and controlling the volumes of racks (by

μ_{k}

and

ν_{k}

) that move between the storage area and the picking–packaging area in response to the coordinated pickers’ efficiency in processing racks (by

Y_{k}

) and accumulated fatigue (

ρ_{k}

). Under such a proposed robot–picker coordination scheme, the lengths of the queues of racks in the picking–packaging area can also be controlled.

FIGURE 5

Empirical example (1)—system output during peak hours (Scenario I)

FIGURE 6

Empirical example (2)—system output during normal hours (Scenario II)

FIGURE 7

Empirical example (3)—system output during off‐peak hours (Scenario III)

The results of empirical example 1 (Figure 5) reveal a large difference between accumulated fatigue (

ρ_{k}

) in a 3‐h peak period in Scenario I and those shown in the other two scenarios. Thus, Scenario I is used to conduct sensitivity analysis on two key parameters (β₁, and β₂), which determine the effects of picker efficiency (

Y_{k}

) and rack queue length (

Q_{k}

), respectively, on the fatigue growth rate (

λ_{k}

) as captured by Equation (3). The parameter settings (

β_{1} = 0.00005

and

β_{2} = 0.00005

) and dataset that are used in empirical example 1 provide the baseline in the sensitivity analysis. Figure 8 plots the empirical results of the sensitivity analysis of the relationships among

ρ_{k}

, β₁, and β₂, where the value of ρ is the average of the

ρ_{k}

values that are estimated during the 3‐h peak period. Figure 8 has managerial implications concerning the psychological and physical effects of workload on human pickers, which are determined by the perceived rack queue lengths and the picking efficiency of pickers in coordination with robots during peak hours.

FIGURE 8

Sensitivity analysis of ρ (with respect to β₁ and β₂) in peak‐hour scenario

Observation 6.5

A picker's accumulated fatigue (ρ) increases significantly with either the picker's efficiency (

Y_{k}

) or perceived rack queue length (

Q_{k}

) during peak hours; the increase in ρ with

Q_{k}

exceeds that with

Y_{k}

During peak hours, a human picker is likely to be more sensitive to the perceived length of the queue of unprocessed mobile racks than to the number of racks processed. Therefore, perceiving any anomalous increase in the length of the queue of mobile racks would easily have a negative effect on the picker's psychological and behavioral responses (such as stress, anxiety, and panic).

CONCLUSIONS, MANAGERIAL IMPLICATIONS, AND RECOMMENDATIONS

Conclusion

Motivated by the introduction of robots and advanced technologies into intelligent logistics operations and the impact thereof on worker safety and wellbeing, this work has presented a real‐time data‐driven stochastic optimal control‐based approach to address the challenges of robot–picker‐coordinated order fulfillment. Specifically, the proposed approach combines a discrete‐time nonlinear stochastic‐dynamic model with a fatigue accumulation function (Section 4) into a real‐time system state estimation and control method (Section 5), to find the optimal solutions for the coordination between human pickers and robots. Theoretically and in practice, the coordinated robots can react and adapt to picker efficiency in real time, such that the coordinated robots and pickers can jointly fulfill orders in collaboration and in a human‐friendly manner. Sections 4 and 5 have presented the methodological characteristics and novelty of the proposed approach, which help address the first research question raised in Section 1. To answer the second question (Section 1), an empirical analysis has been performed using actual data that were collected from a dominant e‐commerce company in China that uses robots for “parts‐to‐picker” order fulfillment. Our empirical study has been conducted to generate various insights (called “observations”), which help answer the second question, concerning how the proposed human‐friendly, robot–human coordination system improves the performance of a parts‐to‐picker order fulfillment center in terms of both safety and human wellbeing.

Managerial implications

The analytical results that have been obtained using the proposed model (Sections 4 and 5) and the empirical results of the empirical analysis of actual data (Section 6) provide the following managerial implications.

Implication 1. Under the proposed robot–picker‐coordinated order fulfillment mechanism, the robot and picker efficiencies (

X_{k}

and

Y_{k}

) have the following characteristics.

The robot and picker efficiencies (

X_{k}

and

Y_{k}

) can be effectively controlled and coordinated if real‐time data concerning newly arriving racks (

A_{k}

) that are loaded with SKUs are obtained.

Robot efficiency (

X_{k}

) depends strongly on

A_{k}

, whereas picker efficiency (

Y_{k}

) depends strongly on the length of the queue of unprocessed racks (

Q_{k}

The above managerial insights are derived from Theorem 4.1 and Observations 6.2 and 6.5. The effect of the perceived length of the queue (

Q_{k}

) of unprocessed racks on systems performance is emphasized and has implications for operations managers (e.g., in warehousing), enabling them better to manage the number of racks in queues by appropriately controlling the speeds with which robots deliver racks to the picking–packaging area.

Implication 2. Under the proposed robot–picker‐coordinated order fulfillment mechanism, the picker‐to‐robot RP in the aspect of either efficiency (

Y_{k} / X_{k}

) or effectiveness (

γ_{k} / δ_{k}

) in peak hours is less than that in either normal or off‐peak hours. The picker‐to‐robot RP is greatest in the “off‐peak” scenario. This finding is drawn from a comparison of empirical results (Figure 4) for

Y_{k} / X_{k}

and

γ_{k} / δ_{k}

among the three scenarios. It may help operations managers to determine appropriate criteria and standards for evaluating the efficiency and effectiveness of pickers in different scenarios.

Implication 3. Robot efficiency is not the most important metric. Rather, human efficiency dominates the efficiency of a parts‐to‐picker order fulfillment system, particularly during peak hours. Accordingly, controlling the number of queued racks by slowing down their delivery by robots to the picking–packaging area is more efficient than slowing down the rack‐processing of a picker to alleviate that picker's fatigue, particularly during a peak‐hour period. This implication is drawn from the empirical results (Figure 8) and is consistent with Observation 6.5 and Implication 1(b). It suggests the importance of accounting for human wellbeing and safety in the design and implementation of a human‐friendly intelligent robot–human‐coordinated order fulfillment system. In such a system, humans are the key element, and robots should be sufficiently intelligent and accommodating to assist humans in the joint intelligent logistics task of order fulfillment.

Recommendations for future research

Despite the novelty of the proposed methodology and its advantages in realizing a human‐friendly intelligent robot–human‐coordinated order fulfillment system, this research has several limitations. First, the proposed model has not incorporated factors related to worker psychology (e.g., perceived job worth, job attitude, and emotional involvement) and behavior (e.g., body posture) toward coordination and cooperation with robots, and these factors should be further explored and empirically verified in the future (MacDonald, 2003; Tan & Netessine, 2014; Tan et al., 2022). For example, body posture is regarded as a kind of physical demand, which may contribute to psychosocial hazards (i.e., negative effects on employees such as stress and health problems), as claimed in MacDonald (2003). Thus, body posture can be another issue to improve employees’ stress and health problems in human–robot collaborative work settings. Other related tasks that are undertaken in inbound logistics (such as storage assignment) and outbound logistics (such as resource/fleet management, vehicular loading, dispatching, and routing) have not yet been considered. They may deserve further investigation. Extending the proposed model for the cases of multiple working periods intervened by scheduled breaks is analytically challenging yet meaningful for future research. Last but not least, considering how robotics can be used to help cope with capacity bottlenecks and supply chain disruptions due to manpower shortage under/after the Coronavirus disease (COVID‐19) pandemic (Gupta et al., 2022) will also be interesting to study in the future.

Footnotes

ACKNOWLEDGMENT

This study is partially supported by MOST project from Ministry of Science and Technology of Taiwan, R.O.C. (code: MOST 109‐2410‐H‐002‐076‐MY3).

1

We let δ ₀ (initial value of δ) start from the ideal value, meaning that its initial state starts from an ideal condition.

ORCID iD

Tsan‐Ming Choi

References

Al Elew

(2020, September 29). What are injury rates like at Amazon warehouses. Reveal News. https://revealnews.org/article/amazon‐injury‐rates/ (accessed 1 February, 2022)

Anderson

E. G.

Jr. Morrice

D. J.

Lundeen

(2006). Stochastic optimal control for staffing and backlog policies in a two‐stage customized service supply chain. Production and Operations Management, 15(2), 262–278.

Azadeh

De Koster

Roy

(2019). Robotized and automated warehouse systems: Review and recent developments. Transportation Science, 53(4), 917–945.

Banker

(2016, January 11). Robots in the warehouse: It's not just Amazon. Forbes. https://www.forbes.com/sites/stevebanker/2016/01/11/robots‐in‐the‐warehouse‐its‐not‐just‐amazon/#1bd3d6e540b8 (accessed 1 February, 2022)

Barrett

Oborn

Orlikowski

W. J.

Yates

(2012). Reconfiguring boundary relations: Robotic innovations in pharmacy work. Organization Science, 23(5), 1448–1466.

Batt

R. J.

Gallino

(2019). Finding a needle in a haystack: The effects of searching and learning on pick‐worker performance. Management Science, 65(6), 2624–2645.

Beane

Orlikowski

W. J.

(2015). What difference does a robot make? The material enactment of distributed coordination. Organization Science, 26(6), 1553–1573.

Boldrer

Antonucci

Bevilacqua

Palopoli

Fontanelli

(2022). Multi‐agent navigation in human‐shared environments: A safe and socially‐aware approach. Robotics and Autonomous Systems, 149, 103979.

Boysen

Briskorn

Emde

(2017). Parts‐to‐picker based order processing in a rack‐moving mobile robots environment. European Journal of Operational Research, 262, 550–562.

10.

Bozer

Y. A.

Aldarondo

F. J.

(2018). A simulation‐based comparison of two goods‐to‐person order picking systems in an online retail setting. International Journal of Production Research, 56(11), 3838–3858.

11.

Bureau of Labor Statistics . (2014). Nonfatal occupational injuries and illnesses requiring days away from work, 2013 . USDL‐14‐2246. United States Department of Labor.

12.

Chen

C.‐M.

Gong

deKoster

R. B. M.

vanNunen

J. A. E. E.

(2010). A flexible evaluative framework for order picking systems. Production and Operations Management, 19(1), 70–82.

13.

Chen

Yang

(2022). Influence of mobile robots on human safety perception and system productivity in wholesale and retail trade environments: A pilot study. IEEE Transactions on Human‐Machine Systems, 52(4), 624–635.

14.

Choi

T. M.

Kumar

Yue

Chan

H. L.

(2022). Disruptive technologies and operations management in the industry 4.0 era and beyond. Production and Operations Management, 31(1), 9–31.

15.

Chung

S. H.

(2021). Applications of smart technologies in logistics and transport: A review. Transportation Research Part E, 153, 102455.

16.

Cooper

Foster

(1971). Sociotechnical systems. American Psychologist, 26(5), 467–474.

17.

Del Rey

(2019, December 11). How robots are transforming Amazon warehouse jobs—for better and worse. Vox. https://www.vox.com/recode/2019/12/11/20982652/robots‐amazon‐warehouse‐jobs‐automation (accessed 1 February, 2022)

18.

De Koster

Le‐Duc

Roodbergen

K. J.

(2007). Design and control of warehouse order picking: A literature review. European Journal of Operational Research, 182(2), 481–501.

19.

De Vries

deKoster

Stam

(2016). Aligning order picking methods, incentive systems, and regulatory focus to increase performance. Production and Operations Management, 25(8), 1363–1376.

20.

Demaitre

(2019, July 11). Geek+ raises another $150M for global expansion of autonomous mobile robots. The Robot Report. https://www.therobotreport.com/geek‐raises‐funds‐autonomous‐mobile‐robots/ (accessed 1 February, 2022)

21.

H. T.

Shunko

Lucas

M. T.

Novak

D. C.

(2018). Impact of behavioral factors on performance of multi‐server queueing systems. Production and Operations Management, 27(8), 1553–1573.

22.

Doll

(2020, September 29). Amazon's automated warehouses have reportedly led to more worker injuries. Screen Rant. https://screenrant.com/amazon‐automated‐warehouses‐worker‐injuries‐increase‐report/ (accessed 1 February, 2022)

23.

Enright

Wurman

(2011). Optimization and coordinated autonomy in mobile fulfillment systems. Workshops at the Twenty‐Fifth AAAI Conference on Artificial Intelligence , 2011, San Francisco, CA.

24.

Evans

(2020, September 29). How Amazon hid its safety crisis. Reveal News. https://revealnews.org/article/how‐amazon‐hid‐its‐safety‐crisis/ (accessed 1 February, 2022)

25.

Faraj

Xiao

(2006). Coordination in fast‐response organizations. Management Science, 52(8), 1155–1169.

26.

Frazelle

E. A.

(2002). World‐class Warehousing and Material Handling, McGraw‐Hill.

27.

Garland . (2022, June 28). Amazon unveils first fully autonomous mobile robot. Supplychain Dive. https://www.supplychaindive.com/news/amazon‐unveils‐first‐fully‐autonomous‐mobile‐robot‐proteus‐warehouse/626164/ (accessed 1 February, 2022)

28.

Gharehgozli

Zaerpour

(2020). Robot scheduling for pod retrieval in a robotic mobile fulfillment system. Transportation Research Part E, 142, 102087.

29.

Glock

C. H.

Grosse

E. H.

Abedinnia

Emde

(2019). An integrated model to improve ergonomic and economic performance in order picking by rotating pallets. European Journal of Operational Research, 273(2), 516–534.

30.

Glock

C. H.

Grosse

E. H.

Kim

Neumann

W. P.

Sobhani

(2019). An integrated cost and worker fatigue evaluation model of a packaging process. International Journal of Production Economics, 207, 107–124.

31.

Grosse

E. H.

Glock

C. H.

(2015). The effect of worker learning on manual order picking processes. International Journal of Production Economics, 170, 882–890.

32.

Grosse

E. H.

Glock

C. H.

Jaber

M. Y.

Neumann

W. P.

(2015). Incorporating human factors in order picking planning models: Framework and research opportunities. International Journal of Production Research, 53(3), 695–717.

33.

Grosse

E. H.

Glock

C. H.

Neumann

W. P.

(2017). Human factors in order picking: A content analysis of the literature. International Journal of Production Research, 55(5), 1260–1276.

34.

Gupta

Starr

M. K.

Farahani

R. Z

Asgari

& N.

(2022). Pandemics/epidemics: Challenges and opportunities for operations management research. Manufacturing & Service Operations Management, 24(1), 1–23.

35.

Hinds

P. J.

Roberts

T. L.

Jones

(2004). Whose job is it anyway? A study of human‐robot interaction in a collaborative task. Human‐Computer Interaction, 19(1), 151–181.

36.

Jaber

M. Y.

Givi

Z. S.

Neumann

W. P.

(2013). Incorporating human fatigue and recovery into the learning–forgetting process. Applied Mathematical Modelling, 37, 7287–7299.

37.

Karuppan

C. M.

Schniederjans

M. J.

(1995). Sources of stress in an automated plant. Production and Operations Management, 4(2), 108–126.

38.

D. S.

Terwiesch

(2009). Impact of workload on service time and patient safety: An econometric analysis of hospital operations. Management Science, 55(9), 1486–1498.

39.

Lamballais

Roy

deKoster

M. B. M.

(2017). Estimating performance in a Robotic Mobile Fulfillment System. European Journal of Operational Research, 256, 976–990.

40.

Larco

J. A.

deKoster

Roodbergen

K. J.

Dul

(2017). Managing warehouse efficiency and worker discomfort through enhanced storage assignment decisions. International Journal of Production Research, 55(21), 6407–6422.

41.

Lavender

S. A.

Marras

W. S.

Ferguson

S. A.

Splittstoesser

R. E.

Yang

(2012). Developing physical exposure‐based back injury risk models applicable to manual handling jobs in distribution centers. Journal of Occupational and Environmental Hygiene, 9(7), 450–459.

42.

Hua

Huang

Sheu

J. ‐B.

Cheng

T. C. E.

Huang

(2020). Storage assignment policy with awareness of energy consumption in the Kiva mobile fulfilment system. Transportation Research Part E, 144, 102158.

43.

Loffler

Boysen

Schneider

(2022). Picker routing in AGV‐assisted order picking systems. INFORMS Journal on Computing, 34(1), 440–462.

44.

Loske

(2022). Empirical evidence on human learning and work characteristics in the transition to automated order picking. Journal of Business Logistics, 43(3), 302–342.

45.

Luo

Choi

T. M.

(2022). E‐commerce supply chains with considerations of cyber‐security: Should governments play a role? Production and Operations Management, 31(5), 2107–2126.

46.

MacDonald

(2003). The impact of job demands and workload on stress and fatigue. Australian Psychologist, 38(2), 102–117.

47.

Mumford

(2006). The story of socio‐technical design: reflections on its successes, failures and potential. Information Systems Journal, 16(4), 317–342.

48.

Olsen

T. L.

Tomlin

(2020). Industry 4.0: Opportunities and challenges for operations management. Manufacturing and Service Operations Management, 22(1), 113–122.

49.

Perera

Dawande

Janakiraman

Mookerjee

(2020). Retail deliveries by drones: How will logistics networks change? Production and Operations Management, 29(9), 2019–2034.

50.

Rai

Tiwari

M. K.

Ivanov

Dolgui

(2021). Machine learning in manufacturing and industry 4.0 applications. International Journal of Production Research, 59(16), 4773–4778.

51.

Rosenthal

S. R.

(1984). Progress toward the factory of the future. Journal of Operations Management, 4(3), 203–229.

52.

Santina

M. S.

Stubberud

A. R.

Hostetter

G. H.

(1994). Stochastic systems and recursive estimation. In Santina

M. S.

Stubberud

A. R.

Hostetter

G. H.

(Eds.), Digital Control System Design (pp. 580–679). Saunders College Publishing, Harcourt Brace College Publishers.

53.

Sheu

J. ‐B.

(2002). A stochastic optimal control approach to real‐time incident‐responsive traffic signal control at isolated intersections. Transportation Science, 36(4), 418–434.

54.

Shi

Yue

(2021). Analytics for IoT‐enabled human–robot hybrid sortation: an online optimization approach. Production and Operations Management. Advance online publication. https://doi.org/10.1111/poms.13626 (accessed 1 February, 2022)

55.

Tam

(2014, November 30). Meet Amazon's busiest employee—the Kiva robot. CNET. http://www.cnet.com/news/meet‐amazons‐busiest‐employee‐the‐kiva‐robot/ (accessed 1 February, 2022)

56.

Tan

P. M.

Koopman

McClean

S. T.

Zhang

J. H.

C. H.

De Cremer

C. T. S.

(2022). When conscientious employees meet intelligent machines: An integrative approach inspired by complementarity theory and role theory. Academy of Management Journal, 65(3), 1019–1054.

57.

Tan

T. F.

Netessine

(2014). When does the devil make work? An empirical study of the impact of workload on worker productivity. Management Science, 60(6), 1574–1593.

58.

Tobe

(2018, May 6). Warehousing, fulfillment and DC transformation trends. The Robot Report. https://www.therobotreport.com/warehousing‐fulfillment‐and‐dc‐transformation‐trends/ (accessed 1 February, 2022)

59.

Tompkins

J. A.

White

J. A.

Bozer

Y. A.

Tanchoco

J. M. A.

(2010). Facilities planning. John Wiley & Sons.

60.

Wang

Sheu

J.‐B.

Teo

C.‐P.

Xue

(2022). Robot scheduling for mobile‐rack warehouses: Human‐robot coordinated order picking systems. Production and Operations Management, 31(1), 98–116.

61.

Weidinger

Boysen

Briskornb

(2018). Storage assignment with rack‐moving mobile robots in KIVA warehouses. Transportation Science, 52(6), 1479–1495.

62.

Yuan

Graves

S. C.

Cezik

(2019). Velocity‐based storage assignment in semi‐automated storage systems. Production and Operations Management, 28(2), 354–373.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.22 MB

Can we work more safely and healthily with robot partners? A human‐friendly robot–human‐coordinated order fulfillment scheme

Abstract

Keywords

INTRODUCTION

LITERATURE REVIEW

PROBLEM DESCRIPTION

MODELING

Specification of system states

States of human picker

States of mobile robots

Control variables for forward and reverse logistics flows of racks

Model

REAL‐TIME DATA‐DRIVEN STATE ESTIMATION

Initialization

Prior prediction of system states

Correction of prior predictions

Estimation of control variable vector

REAL‐PRACTICE‐BASED ANALYSIS

CONCLUSIONS, MANAGERIAL IMPLICATIONS, AND RECOMMENDATIONS

Conclusion

Managerial implications

Recommendations for future research

Footnotes

ACKNOWLEDGMENT

1

ORCID iD

References

Supplementary Material