Abstract
This work addresses the issue of intelligent robot–human‐coordinated parts‐to‐picker order fulfillment carried out in a human‐friendly manner. One unique feature of the proposed approach involves integrating a real‐time data‐driven stochastic‐dynamic model with a fatigue accumulation function. The optimal solutions help achieve coordination between human pickers and robots, such that robots can agilely adapt to the coordinated pickers’ efficiency and fatigue conditions. Specifically, the proposed method estimates human pickers’ instantaneous performance and robot queue lengths, which are then fed back in real time as indexes to adjust robots’ speeds of handling racks and moving them to human pickers. Using data that are provided by a giant electronic commerce (e‐commerce) company, our analyses demonstrate that the proposed robot–picker coordination system permits alleviating a picker's fatigue without much influence on picking efficiency. In particular, a picker's accumulated fatigue can be reduced by 53.74% at the expense of lowering picker efficiency by 14.79% if the proposed robot–picker coordination system is applied in the focal firm of the study case. Through our scenario design and sensitivity analysis, additional findings and managerial insights, including the rules of human‐friendly robot behaviors for coordination with human pickers in different operational scenarios are provided. They facilitate the development of “human‐friendly” intelligent robot‐human‐coordinated order‐fulfillment systems for intelligent logistics operations.
Keywords
INTRODUCTION
Business operations have entered the Industry 4.0 era in which robotics are widely applied in the production, logistics, and e‐commerce companies (Choi et al., 2022; Loffler et al., 2022; Olsen & Tomlin 2020; Rai et al., 2021). This is evidenced by the claims made by Amazon.com: It's better for everybody…Workers no longer would have to walk massive warehouse floors to find the right power drill–instead, robots would bring the drill directly to them… The hour or more it took to process a package had been shaved down to as little as 15 minutes…Robots, Amazon insists, are good for workers… “They make the job safer,”… (Evans, 2020) However, picking three times faster also implies more wear and tear due to repetitive motion and working faster at lifting and handling products… So along with the drive to automate more warehouse tasks comes much higher expectations for workers…The robots have raised the average picker's productivity from around 100 items per hour to what Mr. Long and others have said is a target of around 300 or 400, though the numbers vary across teams and facilities… (Del Rey, 2019).
Indeed, there is no doubt that advancements in such disruptive technologies as robotics and related applications (Azadeh et al., 2019; Chung, 2021; Perera et al., 2020; Shi et al., 2021; Wang et al., 2022) have brought new benefits to electronic commerce (e‐commerce) companies (such as Amazon, Alibaba, and Jingdong) by increasing the efficiency of order fulfillment. However, they are simultaneously creating new safety challenges.
Order picking is a critical step in customer order fulfillment (Batt & Gallino, 2019; de Koster et al., 2007; de Vries et al., 2016; Frazelle, 2002). In a typical warehouse, it is not only costly and time‐consuming but also labor‐intensive. In general, the cost of order picking is estimated to account for more than 50% of the total warehousing cost (Tompkins et al., 2010). Furthermore, order picking typically relies on manual labor to retrieve multiple items from storage, sort them, and then package them to fulfill customer orders (C.‐M. Chen et al., 2010; de Koster et al., 2007; de Vries et al., 2016). Owing to the large variety of customer orders and stock‐keeping units (SKUs) in most e‐commerce operations, each order‐picking task for a customer order is difficult to replicate. The ordering picking's efficiency and accuracy depend mainly on the human pickers’ experience (Batt & Gallino, 2019) and performance (de Vries et al., 2016). Moreover, in e‐commerce, agile order picking can be more challenging when the ordered products are diverse, demand is fluctuating, customer returns are increasing and customers expect more flexible and expeditious logistics services (Frazelle, 2002). Briefly, order picking, which depends on intensive labor and pickers’ performance, remains crucial to the success of fulfilling customer orders with high service quality, regardless of the methods/technologies that are deployed in a warehouse.
Despite a variety of innovative automation technologies, such as autonomous mobile robots, have been increasingly introduced to carry out order fulfillment jointly with human pickers in warehouses (Banker, 2016; Tobe, 2018; Wang et al., 2022), “parts‐to‐picker” order fulfillment systems raise several new issues on humanity and safety. This creates a challenge on robot–human coordination. A certain number of robots that carry racks automatically from the storage area to the order picking area may be deployed to facilitate the picking tasks of human pickers. Poor robot–human coordination typically increases the number of robots that carry, queue and await handling by pickers in the order‐picking area. Note that a human picker's efficiency is not as controllable as that of a robot and may vary with time and psychophysical condition. Thus, an unexpected delay in picking items may inevitably occur if robots just continue to carry racks to her and are unable to respond to her performance. More seriously, recent reports have revealed a substantial rising injury rate in automated warehouses (Al Elew & Oh, 2020; Doll, 2020; Evans, 2020). For example, Amazon's warehouse injury rates have increased every year since 2016 after they began using robots in parts‐to‐picker order fulfillment. The serious injury rate in 2019 was 7.7 injuries per 100 employees, which was 33% higher than in 2016 and nearly double the corresponding recent industry standard (Evans, 2020). Order picking often requires pickers to perform repetitive tasks in awkward postures for a whole day, easily causing musculoskeletal disorders (Grosse et al., 2015; Lavender et al., 2012), which accounts for up to 33% of all injuries and illnesses in the United States in 2013 (Bureau of Labor Statistics, 2014).
Motivated by the aforementioned robot–human coordination issues in parts‐to‐picker order fulfillment practices, this paper aims to answer the following questions. How should human pickers and coordinated robots react and adapt to each other to fulfill orders in a human‐friendly manner considering robot–human collaboration? How does such a human‐friendly robot–human coordination scheme affect the performance of an order fulfillment system in a parts‐to‐picker order fulfillment center? Specifically, what are the critical factors and how do they influence the systems’ performance from humanity and safety perspectives?
To address the above research questions, we build a stochastic optimal control‐based model and develop a data‐driven solution approach. We propose a robot–picker‐coordinated order fulfillment mechanism under which the robot and picker efficiencies have the following characteristics: (i) The robot and picker efficiencies can be effectively controlled and coordinated if real‐time data concerning newly arriving racks are available and used. (ii) The robot efficiency heavily depends on real‐time data concerning newly arriving racks, whereas picker efficiency highly depends on the length of the queue of unprocessed racks. (iii) The picker‐to‐robot relative performance (RP) from the perspective of either efficiency or effectiveness during peak hours is less than that during either normal or off‐peak hours. The picker‐to‐robot RP is best in the “off‐peak” scenario. Moreover, we find that the robot efficiency is not the most important metric. Rather, human efficiency dominates the efficiency of a parts‐to‐picker order fulfillment system, particularly during peak hours. Accordingly, controlling the number of queued racks by slowing down their delivery by robots to the picking–packaging area is in fact a more efficient measure than slowing down the rack‐processing of a picker (in order to alleviate that picker's fatigue); this is especially prominent during a peak‐hour period. Last but not least, employing data that were provided by a well‐established e‐commerce company, our analyses demonstrate that a picker's accumulated fatigue can be reduced by over 50% at the expense of lowering picker efficiency by around 15% if the proposed robot–picker coordination system is applied.
The rest of this paper is organized as follows. Section 2 reviews the relevant literature to elucidate its contribution to the field of innovative logistics. Section 3 describes the problem and characterizes the proposed robot–picker‐coordinated order fulfillment system. Section 4 presents a proposed stochastic‐dynamic optimal control model. Section 5 develops a real‐time data‐driven approach to estimating and controlling the state variables of the proposed robot–picker‐coordinated order fulfillment system. Section 6 presents the computational results of a practice‐based real data analysis. Section 7 draws conclusions, provides managerial implications, and discusses future research. To enhance exposition, all technical proofs are put in an Online Appendix.
LITERATURE REVIEW
Recently, in the Industry 4.0 era, robotic technologies (such as Amazon robotics and Jingdong robots) that are used by e‐commerce companies (Luo & Choi, 2022) for parts‐to‐picker order fulfillment have emerged (Banker, 2016; Shi et al., 2021; Tam, 2014; Tobe, 2018). They have revolutionalized the way of order fulfillment operated in warehouses. Most of the extant literature on parts‐to‐picker order fulfillment systems focuses mainly on increasing the warehousing efficiency utilizing diverse methodologies. For example, Enright and Wurman (2011) identified several parts‐to‐picker order fulfillment‐related resource allocation problems, including order allocation and robot allocation problems. Recently, Boysen et al. (2017) proposed a mixed integer programming model to minimize the number of rack visits to a stationary picking station in a mobile robot‐based order‐picking problem. The authors solved the optimization problem by first decomposing the problem into two subproblems of rack sequencing (for a given order sequence) and order sequencing (for a given rack sequence). By contrast, Lamballais et al. (2017) proposed four queuing network models associated with various warehousing layouts and robot zoning strategies to analyze the performance of an automated storage and parts‐to‐picker system with the objectives of optimizing robot utilization, maximizing order throughput, and minimizing order cycle time. Different from the above optimization‐based analytical models, Bozer and Aldarondo (2018) utilized a simulation‐based method to evaluate and compare the performances of two types of parts‐to‐picker picking systems (called “miniload” and “Kiva” systems) in the aspects of expected throughputs and expected container retrieval times in order processing. Some of the subsequent literature, including Weidinger et al. (2018) and Yuan et al. (2019), aimed at exploring the problem of pod/rack storage, which is the antecedent of parts‐to‐picker order fulfillment, where the inventory of items is stored and spread over multiple mobile pods that are carried by robots moving between storage and picking (stowing) zones. More recently, Wang et al. (2022) utilized an “approximate dynamic programming” based branch‐and‐price approach to solve the optimal robot scheduling problem for parts‐to‐picker order fulfillment systems. Different from the above literature, which solely focuses on efficiency, Wang et al. (2022) incorporated fluctuations of the working states of human pickers into the model, and hence the human factor was explored. However, human safety was not yet examined by Wang et al. (2022). In addition, Boldrer et al. (2022) proposed a hierarchical framework to address the problem of multi‐robot navigation in human‐shared working environments. Even though the approach of Boldrer et al. (2022) does not focus on parts‐to‐picker order fulfillment systems, the authors’ idea of conceptualizing a safe and socially aware navigation in their proposed methodological framework is noteworthy and consistent with our research goal.
Despite remarkable advances made to increase efficiency in parts‐to‐picker order fulfillment, issues such as safety and harmony in the human–robot interactions remain under‐explored in relevant intralogistics operations and related areas (Azadeh et al., 2019; Chen et al., 2022; Enright & Wurman, 2011). As argued by Enright and Wurman (2011), systems optimization at the high level, considering dual objective functions for both workers and robots, remains challenging as these two objectives may not be compatible. Details of the operational features and challenges of parts‐to‐picker order fulfillment can also be found in Enright and Wurman (2011). Drawing from experimental results, Chen et al. (2022) further suggested the urgent necessity of developing novel robotic motion methods to ensure human safety for applications of human—robot coordination in warehouses. Note that Azadeh et al. (2019) comprehensively reviewed the literature on robotized and automated warehouse systems. They pointed out that issues related to human—machine interaction in automated warehousing environments have not been fully addressed.
Another stream of relevant literature concerns the association of human factors with order‐picking performance (Batt & Gallino, 2019; Grosse et al., 2015, 2017; Loske, 2022; de Vries et al., 2016). As argued by Grosse et al. (2017), human factors determine the performance of an order‐picking system, which is time‐consuming and labor‐intensive, but they are often ignored in management‐oriented research. Incorporating human factors into order‐picking models for intralogistics operations and warehouse management is, thus, indispensable—particularly with respect to production, operations, and logistics management (de Koster et al., 2007). Observe that the related literature can be divided into two categories: the first focuses on relevant aspects of human cognition (e.g., human learning) to improve productivity/efficiency, and the second cares about the welfare and health of workers. The corresponding literature reviews are detailed below.
Prior studies on cognition have investigated its impact on order‐picking performance. Some of them (Grosse & Glock, 2015) integrated the concept of human learning (characterized by learning curves) with mathematical models to study how pickers’ performance can be improved by learning from experience. de Vries et al. (2016) conducted a field experiment to examine three types of picker‐to‐parts order‐picking methods under two incentive schemes for pickers with different regulatory foci (“prevention focus” vs. “promotion focus”). The work of de Vries et al. (2016) is pioneering in “elaborately” aligning order‐picking methods, incentive mechanisms, and regulatory focus to bridge the gaps between organizational behavior theories and warehousing practices. Batt and Gallino (2019) empirically estimated the effects of several factors, including a picker's learning from the experience of walking and searching processes, on picking time for order fulfillment in online women's apparel retailing. One remarkable feature of their work is the empirical demonstration of the effects of pickers’ heterogeneous learning capabilities, characterized by different learning curves, on order‐picking performance. Rather recently, Loske (2022) combined parametric and nonparametric approaches to analyze how the interactions between humans, machines, and intelligent software impact human learning and perception of work characteristics in the transition to an automated order‐picking system. The author's empirical findings verified that the real‐time feedback provided by the order‐picking system can facilitate human learning by doing tasks in the perception–cognition–motor–action cycle, thus clarifying the need for a human‐centered work system design.
The literature on the welfare and health of order‐picking workers aims to develop integrated models that consider both ergonomic (such as human energy expenditure and fatigue) and economic (such as picking time) performances in order picking (Grosse et al., 2015, 2017). For example, Grosse et al. (2015) proposed a conceptual framework that considers human factors in four critical categories (viz., the “perceptual, mental, physical, and psychosocial”). The authors suggested that human factors can be incorporated into the planning models to improve the performance of order‐picking systems. More recently, Glock, Grosse, Abedinnia, and Emde (2019) proposed an integrated model to determine the optimal processing sequence for orders, pallet rotation, and picker routing to minimize total picking effort. The authors considered the spinal loads on pickers and consequent risks of injury. Similar efforts had been made previously (Glock, Grosse, Kim, et al., 2019; Larco et al., 2017).
Despite the fact that the idea of integrating human factors into order‐picking decision support systems is promising and necessary for human wellbeing, the above‐cited order‐picking studies on human factors are concerned mostly with manual order picking (picker‐to‐parts order picking) rather than parts‐to‐picker order‐picking systems, which are the focus of our study. To be specific, a parts‐to‐picker order‐picking system has the following operational features that differentiate it from a traditional picker‐to‐parts order‐picking system, which relies mainly on manpower.
First, a parts‐to‐picker ordering picking system relies substantially on the coordination between pickers and robots to fulfill customer orders. Mobile robots lift racks that store SKUs and transport them from the storage area to stationary pickers to facilitate subsequent picking and packaging tasks by those pickers. In such parts‐to‐picker picking environments, robots must move forward and backward between the storage area and stationary pickers, forming closed loops between the storage area and the locations of the coordinated pickers. Accordingly, the efficiency of order fulfillment in a parts‐to‐picker picking system is contingent jointly upon the coordination between robots and pickers in carrying out the two consecutive tasks of handling racks (by mobile robots) and packaging (by pickers). Nevertheless, human–robot coordination often generates coordinating complexity and uncertainty as a result of the difficulty of sharing information, communication, and mutual adaption by the coordinated dyad (Faraj & Xiao, 2006).
Second, from a managerial perspective, an environment that includes working robots/machines is likely to differ from the one that includes only human labor. Psychological effects (e.g., stress and strains) have attracted attention since computer‐integrated/computer‐aided/robot‐aided manufacturing technologies were introduced for factory automation (Karuppan & Schniederjans, 1995; Rosenthal, 1984). As argued by Rosenthal (1984), the most difficult problems in automating a factory are managerial rather than technical. Olsen and Tomlin (2020) further highlighted the issue of managing worker–machine interfaces when robotics and artificial intelligence were introduced to convert manual operations into lean, digitized, and highly automated operations. Therefore, the issues that are raised by working with robots/machines seem not to be limited to those around technology and productivity. Instead, in the post‐automation era, new psychological (Karuppan & Schniederjans, 1995), behavioral (Hinds, 2004), and organization‐behavioral (Barrett et al., 2012; Beane & Orlikowski, 2015) issues should be crucial for operations managers.
Considering the above research gaps in the literature on parts‐to‐picker order fulfillment and intralogistics operations, this work contributes to the field of intelligent logistics operations in the following three ways.
First, motivated by the sociotechnical theory (Cooper & Foster, 1971; Mumford, 2006), this work urges that the harmony of interrelations between human workers and robots in a parts‐to‐picker order fulfillment center should be considered in realizing intelligent logistics systems. In particular, this work promotes the development of adaptive human‐friendly robotics to assist pickers in fulfilling orders with the fewest human–robot coordination conflicts. On the way toward “perfect automation,” the coexistence of robots and humans in task environments is inevitable, requiring carefully planned mechanisms for agile robot—human coordination that are humanity‐oriented and machine‐assisted.
Second, the proposed methodology is novel in conceptualizing the aforementioned philosophy of machine‐assisted humanity orientation into modeling and analysis to address the issue of robot–human coordination for parts‐to‐picker order fulfillment systems. Methodologically, this work integrates a discrete‐time nonlinear dynamic stochastic model with a time‐varying fatigue accumulation function to characterize and estimate a human picker's fatigue‐dependent working state. Then, this work develops a stochastic optimal control‐based data‐driven solution approach, which permits not only estimating human pickers’ instantaneous performance and robot queue lengths but also feeding back the aforementioned human picker's fatigue‐dependent working states in real time to the coordinated robots to adjust their efficiencies in handling and distributing mobile racks to the picker. Such a real‐time data‐driven stochastic optimal control‐based approach has its unique features and relative merits in addressing the critical yet under‐explored issue of real‐time robot–human coordination for machine‐assisted order fulfillment systems.
Third, this work is innovative in the realization of ideas from occupational psychology (psychology), organizational behavior (sociology), and robotics (engineering) to address picker–robot coordination for intelligent logistics in contemporary operations (management). Psychological factors are considered in the development of a novel robot–human‐coordinated mechanism for use in an automated parts‐to‐picker order fulfillment system. Through normative analysis and real‐data‐based empirical studies, the above concept (i.e., machine‐assisted humanity orientation) and proposed robot–picker‐coordinated order fulfillment mechanism are confirmed to be theoretically reasonable, scientifically solid and demonstrated to be of practical value in resolving the issue of interest.
PROBLEM DESCRIPTION
Typically, with our discussions with managers from a well‐established e‐commerce company in mainland China and referring to the problem settings of Wang et al. (2022), the workflow of mobile robot‐based parts‐to‐picker order fulfillment operations investigated in this paper contains the following steps. First, SKUs are received in the receiving area of a warehouse/distribution center; meanwhile, mobile robots that carry empty racks are dispatched to load the received SKUs. This is followed by the step of storage assignment, where mobile robots move loaded racks to the storage area (Li et al., 2020). If the SKUs in the racks are required in an order processing list, then the mobile robots move the racks with those required SKUs from the storage area to the picking–packaging area (termed as the step of rack handling and distribution). Then, stationary pickers in the picking–packaging area pick out the SKUs that are identified in the picking list and package them into cardboard boxes to fulfill customer orders (termed as the step of picking and packaging). This step is followed by automatically moving the completed cardboard boxes by means of conveyers to the outbound area for vehicular loading and dispatching. Mobile robots that have completed the delivery of the racks with the required SKUs to the stationary pickers return from the picking area to the storage area for repeated execution of the step of rack handling and distribution, to facilitate pickers’ carrying out picking and packaging for order fulfillment. Similar mobile robot‐based parts‐to‐picker order fulfillment operations can also be readily found in numerous practical cases, including Amazon.com (Demaitre, 2019; Gharehgozli & Zaerpour, 2020; Garland, 2022).
This work focuses on coordinating the step of rack handling and distribution (executed by robots) as well as the step of picking and packaging (executed by pickers). Specifically, the performance of mobile robots in handling and moving racks (number of racks handled and moved per robot per time unit) significantly influences the associated picker's performance in the subsequent picking and packaging tasks. The effect of psychological and behavioral uncertainties, which may be evident in the interaction between pickers and robots, on an associated picker's performance is considered herein. Accordingly, the problem of interest and goals of this work are as follows.
Consider a few‐to‐one parts‐to‐picker picking system, in which a few mobile robots work collaboratively with a human picker. A set of C customer orders, involving S SKUs, and being stored in N racks must be handled and moved by M mobile robots from the storage area to the picking area. Then, they have to be picked and packaged by one associated picker in a given working period T (e.g., 3 hours) without the intervention of scheduled breaks. A shared storage (scattered storage) policy, which is typically applied to mobile robot‐based parts‐to‐picker picking systems (Weidinger et al., 2018), is adopted. Different SKUs may be stored in a rack, and each rack can be associated with one or more customer orders. Two goals must be achieved. The first is to complete the picking and packaging of the aforementioned S SKUs that are stored and scattered over R racks by smoothly coordinating the operations of M mobile robots and the associated picker during a working period T. This goal must be achieved under the condition that the picker's fatigue increment (
Based on the above problem description and defined goals, this work proposes a real‐time robot–picker‐coordinated order fulfillment system that comprises the following four subsystems: (1) robot‐based rack handling (Subsystem 1), (2) mobile robot‐carried rack movement (Subsystem 2), (3) human‐based picking and packaging (Subsystem 3), and (4) robot‐based rack return (Subsystem 4), which are depicted in Figure 1. The number of newly arriving (

The proposed real‐time robot–picker‐coordinated order fulfillment system
The proposed robot–picker‐coordinated order fulfillment system has three distinctive features. First, the rack‐based order processing cycle, which integrates the forward and reverse logistical flows of mobile racks between the storage and picking–packaging areas, is considered. A closed‐loop rack reuse process facilitates the corresponding logistical resource management in the system. Second, the interactions between robots and associated pickers are considered, and they are characterized by the instantaneous inputs (
MODELING
This section proposes a discrete‐time nonlinear dynamic stochastic model to formulate and solve the above problem. Specifically, Section 4.1 defines the variables of system states (including state variables and control variables), and Section 4.2 explicates the proposed dynamic stochastic model.
Specification of system states
The state variables of Subsystem 3 are specified to define the expected performance of a picker in coordination with mobile robots. Then, the state variables for Subsystem 1 are specified to characterize the performance of the coordinated robots. Thereafter, the state variables of Subsystems 2 and 4 are specified to characterize the forward and reverse logistical flows of mobile racks that are required for coordination between Subsystems 1 and 3 (robots’ and human pickers’ actions).
States of human picker
Consider the states that are ideally associated with a picker's performance (Subsystem 3). Ideally, the picking and packaging tasks of a picker are carried out efficiently by seamlessly receiving and processing the loaded racks that are delivered by mobile robots. However, such an ideal state of Subsystem 3 is obtained at the cost of the picker's accumulated fatigue (
The accumulated fatigue (
Accordingly, Equation (5) can be used to elucidate how the accumulated fatigue in any given time interval k influences a picker's instantaneous picking and packaging performance in the next time interval
In addition to the above concern about accumulated fatigue, the efficiency of picking and packaging tasks at each time interval is an important issue. Ideally, a picker processes all loaded racks as they arrive to the picking–packaging area from the storage area so that no racks remain in a queue in any time interval (
States of mobile robots
In the ideal state, robots handle SKUs and racks in the storage area (Subsystem 1) in a way that maintains equal volumes of inbound and outbound rack flows of Subsystem 1 (
Control variables for forward and reverse logistics flows of racks
Subsystems 2 and 4 are designed to regulate the forward and reverse logistical distribution flows of racks between the storage area and picking–packaging area. Let
Based on the above defined efficiency‐related states (
Given
In Equations (10) and (11) of Theorem 4.1,
Based on Theorem 4.1, Corollary 4.1 is also used to characterize and estimate the time‐varying fatigue growth rate (
Given Theorem
4.1
, estimates of
Based on Corollary 4.1, substituting Equation (13) into Equation (2) yields
Using the correlations between the system states and control variables specified above, the human picker and robots can be readily coordinated for superior performance in an order fulfillment center once the optimal values of the state and control variables can be determined. It is noted that the proposed model is data‐driven in which the input data required (including
Model
This subsection proposes a discrete‐time nonlinear dynamic stochastic model that is characterized by recursive and measurement equations subject to boundary constraints (Anderson, Jr. et al., 2006; Santina et al., 1994; Sheu, 2002). Specifically, the time‐varying relationships among state variables, control variables, and measurement data defined in the previous section (Section 3), can be characterized in three forms: (1) recursive equations, (2) measurement equations, and (3) boundary constraints, as follows.
The recursive equations specify time‐varying relationships between the states of a dynamic and stochastic system in the next and current time intervals (
Accordingly, the recursive equations (Equations 15
–18) reveal that the system states
The measurement equations characterize time‐varying relationships between measurements and system states. In this work,
Additionally, the picker's accumulated fatigue (
REAL‐TIME DATA‐DRIVEN STATE ESTIMATION
This section presents a stochastic optimal control‐based method for the real‐time estimation of system states (
To estimate the aforementioned system states and control variables in real time, a stochastic optimal control‐based method is developed using an extended Kalman filter. Kalman filtering is a well‐known statistical method for the linear‐quadratic estimation of system states of dynamic and stochastic systems. It has been successfully and extensively utilized in solving diverse estimation and control problems in various fields, such as the tracking and navigation of different sorts of vehicles and traffic signal control (Sheu, 2002). Whereas the basic Kalman filter applies to linear problems, the extended Kalman filter has been developed particularly for nonlinear problems that are characterized by either recursive or measurement equations in dynamic and stochastic models (Santina et al., 1994). The primary computational steps in the proposed estimation method are summarized below.
Initialization
This step initializes system states and all of the inputs that are required to trigger the subsequent computational steps. Let
Prior prediction of system states
In this step, the state vector
Given Equation (
15
) (recursive equation) and its characteristics, let
The prior prediction (
Given Equation (
15
) (the recursive equations) and Corollary
5.1
, let
Correction of prior predictions
In this stage, the prior predictions
Given Equations (
5
) (fatigue prediction function) and (
19
) (measurement equation), let
where
As seen in Equations (28) and (29) of Corollary 5.3, the LMMS estimate of
Given Corollaries 5.2 and 5.3, and Equation (
19
) (measurement equation), let
Based on Corollaries 5.3 and 5.4, the corrected state vector
Finally, the recursive equation for correcting the prior prediction of the covariance matrix of state estimation error
Given Corollaries 5.2 to 5.4, let the corrected covariance matrix of state estimation error
The result of Corollary 5.5 in (32) is important for the estimation of the control variable vector as shown in the next subsection.
Estimation of control variable vector
Using the corrected state vector
In Equation (33),
All of the system states that are characterized by the corrected state vector (
Utilizing the estimates of
The proposed data‐driven real‐time state estimation‐control approach can be coded in any appropriate computer programming language. This can not only facilitate the recursive calculation and optimal control of system states but also monitor system performance in real time. Figure 2 summarizes the primary algorithmic computational procedures based on the proposed approach.

Algorithmic procedures for data‐driven real‐time state estimation‐control
REAL‐PRACTICE‐BASED ANALYSIS
To demonstrate the validity and applicability of the proposed real‐time data driven state estimation and control approach for robot–picker‐coordinated order fulfillment, data were collected from a dominant e‐commerce company in mainland China that uses robots for parts‐to‐picker order fulfillment. The data were obtained from a warehouse of that company, which is a 2500m2 robot‐carried mobile‐rack warehouse that stores small cosmetic items, in which 11 human pickers work with mobile robots to pick out customer orders. The raw data provide details about the pickers’ picking and packaging tasks at the study site over 6 days in March 2019. Data concerning the number of racks (
A total of 21 datasets that contain a total of 756 data points are generated from the collected raw data to facilitate real‐time state estimation and control in this empirical study. Each dataset contains 18 pairs of data points
Given the aforementioned 21 datasets (756 data points), the real‐practice‐based empirical analysis is carried out through two phases to demonstrate the relative performance (RP) of the proposed robot–picker coordination mechanism for parts‐to‐picker order fulfillment.
The first phase of the empirical analysis aims to compare the results generated using the proposed approach with the measurements gained from the 21 datasets in the aspects of picker efficiency (the average of

Empirical results of phase I (robot–picker coordination relative to real operations). Scenarios: I, II, and III represent “peak,” “normal,” and “off‐peak” scenarios; Indexes: a and b represent the scenarios “with” and “without” considering robot–picker coordination; ARP, average of relative performance; RP, relative performance
Overall, the empirical results yielded in the first phase provide the following findings as summarized in Observation 6.1.
The proposed robot–picker coordination system permits alleviating a picker's fatigue in order fulfillment without much influence on picking efficiency, compared with the current performance of the focal firm in the study case—As can be seen in Figure 3, a picker's accumulated fatigue can be reduced by 53.74% if the proposed robot–picker coordination system is applied in the focal firm of the study case. Such a fatigue alleviation, however, is carried out at the expense of lowering picker efficiency by 14.79%. Nevertheless, the above observation is encouraging for the applicability of the proposed method in practical cases of parts‐to‐picker order fulfillment systems as it implies that a more harmonious and enjoyable robot–human co‐working environment can be created to facilitate the development of intelligent and human‐friendly robot–human‐coordinated order fulfillment systems.
In the second phase of the empirical analysis, this work aims to demonstrate the RP of robot–picker coordination in contrast with the case without considering the robot–picker coordination for gaining more managerial insights. Therein, empirical results that are obtained with and without consideration of the robot–picker coordination mechanism are compared. Without robot–picker coordination, both control variables
Figure 4 plots the empirical results concerning practice‐based empirical analysis, including the average values of scenario‐based state variables, control variables, system performance, and associated RP, as determined by comparing the outputs with and without robot–picker coordination. The findings and managerial implications of Figure 4 are discussed below.

Empirical results of phase II (robot–picker coordination performance). Scenarios: I, II, and III represent “peak,” “normal,” and “off‐peak” scenarios; indexes: a and b represent the scenarios “with” and “without” considering robot–picker coordination; RP, relative performance
The proposed robot–picker coordination system outperforms the system without coordination not only in efficiency but also the fatigue of human pickers—As seen in Figure 4, the RP values that are associated with
Robot–picker coordination improves system performance more in the “normal” scenario (Scenario II) than in the “peak‐hour” and “off‐peak” scenarios (Scenarios I and III)—This inference is drawn by comparing the RP values, particularly those associated with (Human‐friendly robot behavior for coordination with human pickers in different operational scenarios). Under the proposed robot–picker coordination scheme, robots can exhibit different coordination behaviors in response to the instantaneous performance of coordinated pickers in different scenarios, as follows. Under peak‐hour conditions (Scenario I)—Robots tend to handle racks with high efficiency (by increasing Under normal conditions (Scenario II)—Robots adjust their efficiency of handling racks in the storage area to keep the number of racks that are moved to the picking–packaging area almost equal to the number of racks that are processed by the coordinated pickers ( Under off‐peak conditions (Scenario III)—Robots handle racks in the storage area more slowly (by reducing
Three empirical examples, based on three selected datasets, are further used to present graphically the aforementioned characteristics of robot–picker coordination behavior that is typically exhibited during “peak,” “normal,” and “off‐peak” hours as shown in Figures 5
to 7. Briefly, the coordinated robots adjust their efficiency (

Empirical example (1)—system output during peak hours (Scenario I)

Empirical example (2)—system output during normal hours (Scenario II)

Empirical example (3)—system output during off‐peak hours (Scenario III)
The results of empirical example 1 (Figure 5) reveal a large difference between accumulated fatigue (

Sensitivity analysis of ρ (with respect to β1 and β2) in peak‐hour scenario
A picker's accumulated fatigue (ρ) increases significantly with either the picker's efficiency (
During peak hours, a human picker is likely to be more sensitive to the perceived length of the queue of unprocessed mobile racks than to the number of racks processed. Therefore, perceiving any anomalous increase in the length of the queue of mobile racks would easily have a negative effect on the picker's psychological and behavioral responses (such as stress, anxiety, and panic).
CONCLUSIONS, MANAGERIAL IMPLICATIONS, AND RECOMMENDATIONS
Conclusion
Motivated by the introduction of robots and advanced technologies into intelligent logistics operations and the impact thereof on worker safety and wellbeing, this work has presented a real‐time data‐driven stochastic optimal control‐based approach to address the challenges of robot–picker‐coordinated order fulfillment. Specifically, the proposed approach combines a discrete‐time nonlinear stochastic‐dynamic model with a fatigue accumulation function (Section 4) into a real‐time system state estimation and control method (Section 5), to find the optimal solutions for the coordination between human pickers and robots. Theoretically and in practice, the coordinated robots can react and adapt to picker efficiency in real time, such that the coordinated robots and pickers can jointly fulfill orders in collaboration and in a human‐friendly manner. Sections 4 and 5 have presented the methodological characteristics and novelty of the proposed approach, which help address the first research question raised in Section 1. To answer the second question (Section 1), an empirical analysis has been performed using actual data that were collected from a dominant e‐commerce company in China that uses robots for “parts‐to‐picker” order fulfillment. Our empirical study has been conducted to generate various insights (called “observations”), which help answer the second question, concerning how the proposed human‐friendly, robot–human coordination system improves the performance of a parts‐to‐picker order fulfillment center in terms of both safety and human wellbeing.
Managerial implications
The analytical results that have been obtained using the proposed model (Sections 4 and 5) and the empirical results of the empirical analysis of actual data (Section 6) provide the following managerial implications.
Implication 1. Under the proposed robot–picker‐coordinated order fulfillment mechanism, the robot and picker efficiencies ( The robot and picker efficiencies ( Robot efficiency (
The above managerial insights are derived from Theorem 4.1 and Observations 6.2 and 6.5. The effect of the perceived length of the queue (
Implication 2. Under the proposed robot–picker‐coordinated order fulfillment mechanism, the picker‐to‐robot RP in the aspect of either efficiency (
Implication 3. Robot efficiency is not the most important metric. Rather, human efficiency dominates the efficiency of a parts‐to‐picker order fulfillment system, particularly during peak hours. Accordingly, controlling the number of queued racks by slowing down their delivery by robots to the picking–packaging area is more efficient than slowing down the rack‐processing of a picker to alleviate that picker's fatigue, particularly during a peak‐hour period. This implication is drawn from the empirical results (Figure 8) and is consistent with Observation 6.5 and Implication 1(b). It suggests the importance of accounting for human wellbeing and safety in the design and implementation of a human‐friendly intelligent robot–human‐coordinated order fulfillment system. In such a system, humans are the key element, and robots should be sufficiently intelligent and accommodating to assist humans in the joint intelligent logistics task of order fulfillment.
Recommendations for future research
Despite the novelty of the proposed methodology and its advantages in realizing a human‐friendly intelligent robot–human‐coordinated order fulfillment system, this research has several limitations. First, the proposed model has not incorporated factors related to worker psychology (e.g., perceived job worth, job attitude, and emotional involvement) and behavior (e.g., body posture) toward coordination and cooperation with robots, and these factors should be further explored and empirically verified in the future (MacDonald, 2003; Tan & Netessine, 2014; Tan et al., 2022). For example, body posture is regarded as a kind of physical demand, which may contribute to psychosocial hazards (i.e., negative effects on employees such as stress and health problems), as claimed in MacDonald (2003). Thus, body posture can be another issue to improve employees’ stress and health problems in human–robot collaborative work settings. Other related tasks that are undertaken in inbound logistics (such as storage assignment) and outbound logistics (such as resource/fleet management, vehicular loading, dispatching, and routing) have not yet been considered. They may deserve further investigation. Extending the proposed model for the cases of multiple working periods intervened by scheduled breaks is analytically challenging yet meaningful for future research. Last but not least, considering how robotics can be used to help cope with capacity bottlenecks and supply chain disruptions due to manpower shortage under/after the Coronavirus disease (COVID‐19) pandemic (Gupta et al., 2022) will also be interesting to study in the future.
Footnotes
ACKNOWLEDGMENT
This study is partially supported by MOST project from Ministry of Science and Technology of Taiwan, R.O.C. (code: MOST 109‐2410‐H‐002‐076‐MY3).
1
We let δ 0 (initial value of δ) start from the ideal value, meaning that its initial state starts from an ideal condition.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
