Abstract
Order splitting is one of the key issues in the e-commerce order fulfillment process. It increases operational costs, elevates carbon emissions, and compromises customer satisfaction. This article focuses on determining the product assortments to store within the multi-warehouse logistics network to minimize the total number of split orders subject to cardinality constraints. We show that this minimizing split orders (MSO) problem is NP-hard and demonstrate that even finding an optimal order fulfillment strategy with a given assortment selection is NP-hard. To further analyze the MSO problem, we introduce a concept termed the second-order dominant indexing rule. This indexing rule corresponds to a group of demand distributions, under which we are able to characterize the structure of the optimal assortment selection for various scenarios. In particular, when assortment overlapping is prohibited, the optimal selection can be explicitly derived. When the demand exhibits a total nested structure, an optimal selection is non-overlapping with more popular products allocated to larger warehouses. We also bridge the two-warehouse order splitting minimization problem with the single-warehouse assortment selection problem in the literature. Building upon this connection, we propose an extended marginal choice indexing (MCI) policy, which is proven to achieve optimality when the demand has a second-order dominant MCI. In addition, we propose an Iterative Improvement Heuristic that refines any existing assortment selection. The efficiency of the proposed heuristics is validated by extensive numerical experiments, demonstrating that the extended MCI policy performs near-optimally even when customer demand is not ideal, and both heuristics outperform the best benchmark in existing literature. Additional experiments on real-world data further confirm their effectiveness and scalability. Finally, we extend our findings to a two-tier multi-warehouse scenario with a back-end warehouse.
Keywords
Introduction
Online shopping has become an indispensable part of our daily lives. According to a recent report by Boston Consulting Group (Barthel et al., 2023), the compound annual growth rate (CAGR) for e-commerce is anticipated to be 9% through 2027, expecting it to account for 41% of global retail sales by 2027, a significant rise from just 18% in 2017. This growth reflects a significant shift in consumer preferences toward online shopping. In response, many e-commerce companies are expanding their product assortments to attract a broader customer base, boost sales, and enhance customer satisfaction (Oboloo, 2021). Additionally, companies like JD and Freshippo have implemented free shipping for orders exceeding a certain value, encouraging customers to purchase more in a single transaction, reducing shipping costs per unit, and increasing the perceived value of purchases.
However, the growing diversity of product offerings and the emergence of large orders with distinct items present significant challenges on how to efficiently fulfill customer orders. On one hand, the increasing variety of products requires multiple warehouses to store the products since storing all the products in a single warehouse is usually too costly, if feasible. In reality, each warehouse is limited to storing only some of the products, necessitating e-commerce companies to operate multiple warehouses. On the other hand, when no single warehouse contains all the products, order splitting occurs. A single order is then split into multiple suborders that are handled by different warehouses. An increase in suborders implies the engagement of more manpower in the fulfillment process, thus raising operational costs. In addition, suborders also result in extra packing materials and additional shipments, higher material and energy consumption, and subsequently, greater carbon emissions (Zhang et al., 2021).
To overcome the ever-growing challenge of split orders, this article aims to minimize the expected split orders by selecting appropriate product assortments at warehouses in a logistic network. By examining this problem, we aim to provide valuable insights for companies in designing warehouse assortments in logistics networks and promoting more efficient, responsive, yet sustainable supply chain solutions. In general, the multi-warehouse assortment selection problem is notoriously challenging due to its combinatorial nature. To focus on the assortment selection problem, we adopt a common approach in the literature that ignores the inventory decisions. In other words, we assume that if a product is allocated to a warehouse, it is always available for fulfillment. For each warehouse, the warehouse capacity in our model is defined by the number of distinct stock-keeping units (SKUs) that can be stored in that warehouse. This constraint is based on the understanding that managing an increased variety of products within the limited space of a warehouse requires substantial organizational effort and affects operational efficiency (Alfaro and Corbett, 2003; Lopienski, 2021).
We have made the following key contributions. First, we present a broader framework for the multi-warehouse assortment selection problem than those in the existing literature. Our objective is to reduce the total number of split orders, which proves to be NP-hard even with just two warehouses. Additionally, we establish that determining an optimal fulfillment strategy to minimize split orders for any specified warehouse assortment selection is already NP-hard. To address these challenges, we present mixed-integer linear programming (MILP) formulations for both problems.
Second, we introduce a novel concept termed second-order dominant indexing rule, which aids in identifying optimal assortment selections. This rule highlights a class of demand distributions with distinct structural properties that inform optimal decisions.
Third, we explore three distinct scenarios where the optimal assortment selection can be directly obtained when a second-order dominant indexing can be found regarding the demand distribution: (i) non-overlapping assortment restrictions, (ii) demands with nested structures, and (iii) systems with two warehouses. In these cases, the optimal strategy involves allocating more popular products to larger warehouses. Specifically, when assortments cannot overlap, the optimal solution is explicitly determined. For nested demand structures, an optimal non-overlapping assortment assigns popular products to larger warehouses. For two-warehouse systems, we propose an extended marginal choice indexing (MCI) policy that achieves optimality. Additionally, we establish a connection between the single-warehouse assortment problem and the split-order minimization problem in two-warehouse systems, leveraging this insight to develop an iterative improvement heuristic (IIH) that refines any initial assortment selection for any demand distribution.
Fourth, through extensive numerical experiments, we demonstrate the efficacy of the proposed heuristics. The results confirm that the extended MCI policy attains optimal solutions when demand is independent and achieves near-optimal performance under general multi-purchase choice models, such as the multi-purchase multinomial logit model. While not explicitly designed as a stand-alone solver, the IIH consistently outperforms existing benchmarks, even when initialized with arbitrary assortment selections, offering robust and computationally efficient solutions. Furthermore, experiments on real-world data further validate the effectiveness and scalability of the proposed methods.
Lastly, we extend our insights into multi-tier multi-warehouse scenarios, featuring a back-end warehouse that consistently supports multiple front-end warehouses. Inspired by our findings from single-tier cases, we reveal the optimal assortment selection under the non-overlapping constraint when a second-order dominant MCI is identifiable. For scenarios involving two warehouses, we introduce a polynomial-time solvable heuristic that enumerates less than the number of times equal to the smaller capacity of the two warehouses. This heuristic achieves optimality when the demand has a second-order dominant MCI. For general cases involving multiple front-end warehouses, we introduce a greedy algorithm and another iterative heuristic to address the problem.
The remainder of the article is structured as follows. Section 2 reviews relevant literature. Section 3 formulates the problem and introduces the second-order dominant indexing rule. Section 4 examines optimal assortments under specific scenarios. Section 5 extends the analysis to multi-tier networks. Finally, Section 6 synthesizes the findings. Proofs are available in the Section EC.2.
Literature Review
This study intersects with existing research on warehouse assortment selection and product allocation in supply chain management. It also relates to multi-location assortment optimization in revenue management.
Warehouse Assortment Selection and Product Allocation
We contribute to the literature on warehouse assortment selection problems. Catalán and Fisher (2012) first formally discuss the multi-warehouse assortment selection problem, the goal of which is to minimize split orders. The authors prove the problem is NP-hard even if there are only two warehouses and provide four heuristics for solving the problem. Our problem setting is similar, with the difference that we model customer demand through a probability distribution function, whereas they consider each customer order individually. This distinction enables a more thorough analysis of the impacts stemming from different customer choice behaviors, thereby making it possible to identify certain structural properties of the optimal selection. In this article, we further demonstrate that, given an assortment selection, determining an optimal order fulfillment policy that minimizes split orders is NP-hard.
Zhu et al. (2021) study a similar problem without allowing assortments to overlap between different warehouses and propose a K-links heuristic clustering algorithm, which is based on the distribution of multi-item orders. Nonetheless, this heuristic lacks a performance guarantee. In contrast, we revisit this special case, delineating the optimal selection when the demand exhibiting particular patterns. Söylemez (2021) examines another similar problem, focusing solely on the occurrence of order splitting, instead of the total number of split orders. Bonekamp (2019) develops an optimization model to maximize the sum of cross-selling factors of products while minimizing split shipments to solve the multi-warehouse assortment allocation problem. The model is an extension of the Quadratic Multiple Knapsack Problem (QMKP), which accounts for product overlapping and load balancing among warehouses. To solve the problem, an extended greedy heuristic and a genetic algorithm are proposed, which are shown to find near optimal solutions in a real-world case study.
Our study is also related to Li et al. (2024), where the authors study the single-warehouse assortment selection problem under a cardinality constraint. They examine two distinct types of cost functions, associated with order splitting cost and spillover fulfillment cost, and focus on minimizing the fulfillment cost with respect to a single warehouse. Although the problem setting is quite different, we connect our problem to theirs in scenarios involving only two warehouses. Leveraging this connection, we present an iterative heuristic to improve any given two-warehouse assortment selection. Moreover, inspired by their introduction of the dominant indexing rule for analyzing the single-warehouse assortment selection problem, we innovatively introduce a concept termed second-order dominant indexing rule that assists us in identifying the structure of the optimal selection in various scenarios for our problem.
From another perspective, Li et al. (2019) investigate product allocation across multiple warehouses in the context of large-scale e-commerce, focusing on allocation density, defined as the ratio of the total number of allocated products to the total number of possible allocations. They find that shipping costs decrease significantly at low allocation densities, whereas at high densities, increases in allocation density result in only marginal cost reductions. Note that they consider product shipment in an aggregated manner rather than at the individual customer order level.
Multi-Location Assortment Optimization
This work is also related to multi-location assortment optimization. Unlike assortment optimization in the classic single-location setting (see e.g., Gallego and Topaloglu, 2019) and the omni-channel setting (see e.g., Dzyabura and Jagabathula, 2018), in the multi-location context, if a requested product is not available in the assortment of one location, it can be transshipped from other locations, incurring additional shipment costs. If none of the locations stocks the product, the customer may consider a substitutable product. The goal is to maximize expected profit by determining the optimal assortment of products to offer at each location.
To our best knowledge, very limited research studies the multi-location assortment optimization problem. Fisher and Vaidyanathan (2014) and Corsten et al. (2018) study similar problems without considering transshipment between different locations. Bebitoglu et al. (2018) and Çömez-Dolgan et al. (2022) study the capacitated multi-location assortment optimization problem with transshipment among different locations allowed, where each location has its own capacity limit, and both of them show the problem is NP-complete. Bebitoglu et al. (2018) assume that different location serves a separate geographical region whose customers’ demand is governed by a separate multinomial logit model. The authors propose a conic quadratic mixed integer programming formulation to solve it. Through numerical experiments, they show that their approach outperforms the mixed integer linear programming formulation. Çömez-Dolgan et al. (2022) assume customers’ demand are exogenous or independent for each location and if both the first and second choice of a customer cannot be satisfied by any location, the demand will be lost. The authors provide structural properties of the optimal assortments to simplify and speed up the search for the optimal solution. The authors also provide an upper threshold for the probability of customer substitution at each location, which ensures that the location’s capacity is fully utilized at optimal solutions if the substitution probability is below this threshold. Additionally, the authors demonstrate that when all locations have the same capacity, using them to the maximum can decrease expensive transshipment occurrences.
Contrasting with our setting, these studies concentrate solely on single-purchase scenarios where order splitting is not an issue, with the main goal being revenue maximization instead of minimizing split orders.
Problem Setting and Second-Order Dominant Indexing Rule
In this section, we begin by providing a formal definition of the assortment selection problem within a multi-warehouse logistics network, focusing particularly on minimizing split orders. Then, we discuss the complexity of this problem. Lastly, we introduce a novel concept pertaining to demand distribution that enables us to identify structural properties of the optimal assortment selection.
Problem of Minimizing Split Orders and Its Complexity
We consider an e-commerce company offering
We assume the e-company operates
All orders are required to be fulfilled, but no individual warehouse can store all products.
Assumption 1 implies that
To mitigate such instances and contribute to environmental sustainability, the company aims to optimize the assortment of each warehouse to minimize the total number of suborders. We present the minimizing split order (MSO) problem as follows:
For a given
Given a multi-warehouse system with predefined assortments, it is NP-hard to determine a fulfillment policy that minimize the number of suborders for an arbitrary customer order.
Proposition 1 is proved through a reduction from the Set Cover problem. It underscores the challenge of identifying an optimal fulfillment policy but it does not directly translate to the complexity of solving the MSO problem. In fact, solving MSO is also NP-hard, as formally stated in the following proposition:
The problem of Minimizing Split Orders is NP-hard even when limited to just two warehouses.
Note that Catalán and Fisher (2012) propose an MILP to solve the MSO problem. For completeness, we present the MILP reformulation for (MSO) as follows:
While using commercial solvers for MILP (2) is a valid approach for addressing the MSO problem, this approach is not scalable and provides limited insights into assortment decisions in response to different customer demands. Subsequently, we will introduce a novel concept termed second-order dominant indexing rule. This rule pertains to a specific category of demand distributions. Within this category, we are able to characterize numerous structural properties of the optimal assortment selection, offering deeper understanding and guidance for strategic decision-making in planning warehouse storage.
Solving the problem of minimizing split orders is generally NP-hard. When customers purchase at most one product, the demand simplifies to traditional single-purchase discrete choice models (DCMs), where order splitting does not occur, and any feasible assortment is optimal. Therefore, we focus on scenarios involving multiple purchases. In this subsection, we introduce a special class of demand distributions, which exhibits beneficial properties for addressing multi-warehouse assortment selection problems.
In what follows, we first introduce several preliminary concepts before formally defining this family of demand distributions. An indexing rule
Introduced by Li et al. (2024), the (First-Order) Dominant Indexing rule is defined as follows: an indexing rule
However, the dominant indexing rule alone is insufficient to ensure the structure of the optimal assortments in multi-warehouse settings, even for just two warehouses, as we must determine which subset combinations minimize order splitting. As such, we introduce a new concept after exploring the bundle multivariate logit (BundleMVL) Model (Russell and Petersen, 2000; Tulabandhula et al., 2023), a multi-purchase choice model that captures substitution and complementarity effects among products. In a BundleMVL-
(An Example of BundleMVL Model with Four Products)
Consider four products, indexed as
Following Example 1, we observe that the marginal choice probabilities satisfy
However, in multi-warehouse settings, the task of identifying optimal assortments is more complex than simply prioritizing a single subset of products based on their individual popularity or demand. Specifically, it requires determining combinations of product subsets that minimize the likelihood of order splitting while ensuring that all products remain available across multiple warehouses to meet customer demand. From the above example, we observe that prioritizing popular customer orders and products can reduce order splitting. This motivates us to investigate demand distributions where such prioritization strategies can be leveraged effectively. While the dominant indexing rule provides insights into preferences among subsets of products, it does not account for the interactions between combinations of subsets, which are critical in multi-warehouse assortment selection. To address this limitation, we introduce the concept of the Second-Order Dominant Indexing Rule, which identifies a class of demand distributions where prioritizing the allocation of popular products leads to solutions that effectively minimize order splitting.
(Second-Order Dominant Indexing Rule w.r.t. Demand Function
)
Consider a universal set
Condition (i) means that for two subsets of the same size, the subset containing smaller indexed products is more popular, which is exactly the definition of first-order dominant indexing. Condition (ii) further posits that adding one more product to each subset would result in a larger total market share if the more popular product is added to the more popular subset, rather than to the less popular subset. Referring to Example 1, this condition can be verified for subsets
For a given demand function
In what follows, we present useful structural properties of demand functions characterized by first-order or second-order dominant indexing rules.
Consider a given demand function if if
This proposition essentially extends the conditions in the definitions of first-order and second-order dominant indexing rules to situations with several subsets, which in turn play a crucial role in helping us identify optimal assortment selections, as will be illustrated in the next section. While we use the BundleMVL model as an illustrative example, second-order dominant MCI rules are not identifiable for all cases within this model. Here, we present two classes of multi-purchase choice models for which second-order dominant MCI rules can be identified.
The marginal choice indexing rule of the following two multi-purchase choice models are second-order dominant:
Note that while an MCI is always second-order dominant for any ICM, this does not necessarily hold for the BundleMVL model. Only a specific subclass meets these conditions, with Example 1 being one such instance. Specifically, the first half of the conditions in Proposition 5 (2) ensures condition (i) in Definition 1 by preserving the ranking where lower-indexed products have higher purchase probabilities. The second half guarantees condition (ii) by ensuring that adding a more popular product to a more popular subset yields a greater demand increase than alternative pairings.
To simplify analysis, we denote a second-order dominant marginal choice indexing rule as 2-MCI and assume that if a demand function has a 2-MCI
In this section, we demonstrate how to derive the optimal warehouse assortment when demands are characterized by a 2-MCI. Starting with the general MSO problem without additional restrictions, we show that under specific conditions, popular products should be allocated to larger warehouses. To illustrate, we first introduce a key definition.
Given a demand function
Building on this definition, we can now present the following theorem that illustrates how we should allocate products that are not repeatedly stored in multiple warehouses.
Given a demand distribution
Theorem 1 provides useful insights into the structure of the optimal assortment for the general MSO problem. Roughly speaking, when demand features a 2-MCI, products of similar popularity should be stored together, and the more popular products should be stored in larger warehouses. Specifically, Theorem 1 implies that if a larger warehouse stocks more popular products, it should also exclusively house some relatively popular products. Conversely, less popular products, intended to be stored in only one warehouse, should be allocated to smaller warehouses alongside similarly less popular items. Indeed, this theorem is particularly useful when the logistics network is almost established, with only a few products remaining, each awaiting allocation to a single warehouse.
However, Theorem 1 does not provide a straightforward method for determining the optimal selection. In the remainder of this section, we will explore three specific scenarios: settings with a non-overlapping assortment restriction, demands with nested structures, and networks with only two warehouses. These scenarios will allow us to directly derive the optimal decision.
Recently, Zhu et al. (2021) consider the assortment allocation problem for multi-warehouse systems with the constraint that assortments for different warehouses are non-overlapping. While they propose a K-links heuristic clustering algorithm, the algorithm lacks theoretical guarantees. In contrast, we identify the optimal assortment selection under the same restriction when the demand has a 2-MCI.
By default, we let
When the assortments are not allowed to overlap, if the demand distribution
Theorem 2 implies that when assortments are not allowed to overlap, we can fully characterize the optimal warehouse assortment if the demand distribution has a 2-MCI. Under the optimal warehouse assortment, larger warehouses store more popular products, and products with similar popularity levels are stored in the same warehouse.
The insights derived from Theorems 1 and 2 lead us to a distinct characterization of the optimal product allocation for items stored exclusively in one location, as elaborated in the subsequent corollary.
Given a 2-MCI
This corollary delineates the optimal assortment for two warehouses with uniquely stored products, which can be easily extended to multiple warehouses under the same storage condition. Beyond Corollary 1, Theorem 2 also inspires an investigation into whether certain demand distributions inherently lead to non-overlapping optimal assortments without imposing it as a constraint. The answer is affirmative, and the next subsection explores one such class of distributions.
Now, we explore the MSO problem when the demand distribution features some nested structures. We start with the most fundamental nested structures in product demand distribution.
Total Nested Structure
A demand distribution
Definition 3 allows for a natural hierarchy of products based on their demand, with the most popular product indexed by
If a demand distribution has the total nested structure, then the corresponding MCI is second-order dominant.
Given that the MCI of a total nested demand distribution is second-order dominant, Theorem 2 can be applied to find the optimal selection under a non-overlapping restriction. Moreover, in this special scenario, we can delve deeper. The following theorem characterizes the optimal warehouse assortment selection for MSO when the demand distribution exhibits a total nested structure.
If the demand distribution
Theorem 3 indicates that larger warehouses should store more popular products, while smaller warehouses should house less popular ones. Within each warehouse, products should have similar popularity levels, with roughly equivalent selection probabilities. Furthermore, Theorem 3 provides a stronger result than Theorem 2 for MSO under totally nested demand distributions. Specifically, the given demand pattern ensures that allocating products repeatedly to the remaining capacity does not reduce order splitting, naturally leading to a non-overlapping optimal assortment.
Yet, these insights are specific to totally nested demand distributions. For a broader range of nested structures, although the corresponding MCI may not be second-order dominant, our observations from the totally nested patter can still help us identify structures of the optimal assortments. Next, we will examine two other demand functions showcasing more general nested structures.
A demand distribution
It is worth noting that the total nested structure is a special case of a nestable structure with
(Example of Nestable Structure in Demand)
Consider a universal set
According to Theorem 3 and leveraging the relationship between the total nested structure and the nestable structure, we can further gain intuition on the optimal assortment selection for MSO if the demand distribution is nestable.
If the demand distribution
Corollary 2 provides an important insight into the structure of the optimal solution for MSO when the demand distribution is nestable. Once the number of products allocated from each subset
Beyond the total nested and nestable structures, certain settings exhibit more complex relationships. In particular, a product may serve as both an add-on for one product and a main product for others, resulting in a continuously nested demand structure with add-ons, as defined below.
A demand distribution
Example 3 provides an illustration of the nested-nested structure in customer demand.
(Example of Nested-Nested Structure in Demand)
Consider a universal set
It’s noteworthy that the total nested demand structure can also be seen as a special case of the nested-nested demand structure, which is not defined on a collection of nested sets. In the nested-nested demand structure, the indexing defines a sequence of product subsets
If the demand distribution
Corollary 3 infers that an optimal assortment selection at warehouse
Although perfectly nested structures are rare in practice, more flexible variations, such as nestable and nested-nested structures, are common. Even without strict nesting, high co-purchase likelihood with primary products is frequently observed. For example, in a university textbook series like Calculus I, Calculus II, and Calculus III, purchasing the first textbook often leads to the purchase of subsequent ones to continue learning. Similarly, in the luggage industry, buying a large suitcase typically creates demand for related items like carry-ons or toiletry bags. In toy sales, the initial purchase of a Barbie doll often drives demand for compatible accessories. In home appliances, products like coffee machines or blenders frequently lead to the purchase of related items such as filters or extra blades. Lastly, in the furniture sector, IKEA illustrates how demand for core products like beds or sofas generates demand for complementary items like mattresses and cushions. These examples highlight the prevalence of nested demand structures in real-world e-commerce, where customer purchases are strongly influenced by primary products. Understanding these patterns is crucial for developing effective assortment selection strategies.
According to previous discussions, when assortments are restricted to be non-overlapping, we are able to discern useful structures of the optimal selection. However, finding the optimal allocation for products that can be stored in multiple warehouses is much more challenging, as calculating the minimum number of split orders is generally NP-hard. Nonetheless, this problem becomes more manageable in scenarios involving only two warehouses. We will focus on this special case, referred to as MSO2, in this subsection. Initially, we propose an extended marginal choice indexing policy, which achieves optimality when the demand has a 2-MCI. Following this, we explore the relationship between solving the MSO problem and the single-warehouse assortment selection (SWAS) problem, and propose an innovative iterative heuristic designed to improve any given two-warehouse assortment selection. Furthermore, we validate the effectiveness of these two proposed heuristics through extensive numerical experiments.
When there are only two warehouses, a product is either stored in only one warehouse or in both. Although addressing the MSO problem remains NP-hard, as indicated by Proposition 2, it is not necessary to solve MILP (1) to determine the optimal fulfillment strategy for each order. To clarify, for any given assortment across the two warehouses, if an order cannot be satisfied by either warehouse alone, it will be split into two suborders, with each warehouse fulfilling one. According to these observations and acknowledging that fulfilling each order requires at least one shipment for fulfillment, we can reformulate Problem (MSO) for two warehouses, denoted as (MSO2), as follows:
Extended Marginal Choice Indexing Policy
Revisit Example 1 and consider a scenario with two warehouses, where

Venn diagram representation of the extended MCI policy.
It worth noting that the extended MCI Policy builds on the MCI policy introduced in Li et al. (2024), which was originally designed for single-warehouse assortment selection. The MCI policy selects products with the highest marginal choice probabilities for storage and achieves optimality under mild assumptions, demonstrating efficiency in practical applications. In extending the MCI policy to two-warehouse systems, the most popular products are first allocated to the larger warehouse, with the remaining products assigned to the smaller one to ensure all orders can be fulfilled. The spare capacity of the smaller warehouse is then supplemented with the most popular products, ensuring an efficient allocation. In both strategies, the marginal choice probabilities of products play a crucial role in allocation. This is supported by evidence that, for certain demand distributions, a corresponding MCI demonstrates compelling properties and helps to determine the optimal assortment selection for a single warehouse. We will show that under certain mild conditions, the extended MCI policy guarantees an optimal solution for the MSO2 problem.
Under Assumption 1, for a given demand function
Theorem 4 highlights the critical role of a second-order dominant indexing rule for establishing the optimality of the extended MCI policy in solving MSO within two-warehouse systems. Additionally, this confirms Theorem 1 in the context of two-warehouse systems. As illustrated in Figure 1, to obtain a lower
Clearly, if the underlying demand distribution follows those presented in Proposition 5, the extended MCI policy achieves optimality. Although Theorem 4 provides us with critical insights into the optimality of the extended MCI policy for certain demand distributions, it is important to recognize that not all demand has a second-order dominant MCI. Consequently, the application of the extended MCI policy does not universally guarantee an optimal solution.
Despite that many distributions may not exhibit second-order dominant preferences among products’ demand, or such characteristics are only present within restrictive subclasses of the choice model, such as the multi-choice random utility model described in Example EC.1 (due to its complexity, this example is excluded from the main text; see Lin et al., 2025 for reference), we will demonstrate in later numerical experiments that the extended MCI policy yields near-optimal solutions. Moreover, in the next subsection, we will introduce the Iterative Improvement Heuristic, which can be applied to the extended MCI policy to help achieve even better performance.
While extending the MCI policy to a two-warehouse setting is intuitive, applying it to multi-warehouse settings presents significant challenges, especially when aiming to achieve similar optimality as stated in Theorem 4. The complexity arises from the need to account for products that may be stored in multiple warehouses with varying frequencies. Despite this, we offer one possible version in Section EC.5 for consideration, acknowledging that this variation may not always maintain such good property.
Tackling the multi-warehouse assortment selection problem is generally challenging because assortments across various warehouses need to be jointly optimized. Now, consider the instance in which the assortment for one of the two warehouses, denoted by
Leveraging this insight, we present the following proposition, which establishes the equivalence between Problem (4) and an OFRM problem.
In the two-warehouse assortment selection problem, if one of the assortments is fixed, then the problem reduces to an order fill rate maximization problem for a single warehouse.
To see this, we define
Subsequently, we will present a heuristic designed to improve any two-warehouse assortment selection strategy, which we refer to as the Iterative Improvement Heuristic (IIH). One of the most important elements in the IIH is to solve Problem (4) or (5). One can always update
The Iterative Improvement Heuristic incrementally enhances the current two-warehouse assortment selection at each step and converges as the number of iterations becomes sufficiently large.
Second, it may not be necessary to calculate the objective value
Although two-warehouse assortment selection closely relates to the order fill rate maximization problem, it is worth noting that such an observation cannot be easily found in general multi-warehouse settings. This is because, in a two-warehouse system, an order will be split into two suborders if neither warehouse can fulfill it independently. In this context, if an assortment is fixed, improving the order fill rate of another warehouse can enhance the efficiency of the whole system. However, this is not necessarily the case when it comes to more than two warehouses, since the improvement of the order fill rate for one warehouse does not necessarily lower the chance that an order be split into two or more suborders. What’s worse, even checking whether a new selection reduces the number of split orders is NP-hard.
In this subsection, we numerically evaluate the performance of the extended MCI policy and the Iterative Improvement Heuristic by applying them to a two-warehouse assortment selection scenario aimed at minimizing split orders.
For our comparative benchmark, we choose the Bestsellers heuristic, originally proposed in Catalán and Fisher (2012). Among the four greedy-based heuristics explored in their study, this heuristic has been shown to perform the best. The Bestsellers heuristic initially assigns the top
In the following, we conduct two sets of synthetic experiments. The first is based on the independent choice model (ICM), where the choice of each product is independent of others (Lin et al., 2025). The second simulates the multi-purchase multinomial logit (MP-MNL) Model, which captures interactions among multiple products within a single purchase (Bai et al., 2023). Across each experimental set, we conduct trials involving varying numbers of products and diverse warehouse capacities. Specifically, we examine cases with
Experiments Based on the Independent Choice Model
In this section, we simulate customer choices based on the Independent Choice Model to conduct experiments and analyze outcomes. The ICM assumes that the selection of each product is independent of others, leading to the choice probability of any subset
As per the ICM’s definition, once individual product choice probabilities are identified, we can accurately calculate the choice probability for all subsets within the universal set. Accordingly, we randomly sample
The results of the comparisons are summarized in Table 1, with visualizations for each sub-case shown in Figures 2 to 7 (additional details are provided in Section EC.4.1). In the table, the “opt ratio” represents the optimality ratio, calculated as one minus the relative difference in

ICM with 6 items (1).

ICM with 8 items (1).

ICM with 10 items (1).

ICM with 6 items (2).

ICM with 8 items (2).

ICM with 10 items (2).
Comparison of different policies under the ICM simulations.
These results validate Theorem 4, demonstrating that the extended MCI policy achieves optimality when the demand distribution follows the ICM. As illustrated in Figures 2 to 7, the OPT and eMCI lines overlap, consistent with the theoretical guarantee provided by the theorem. In general, the extended MCI policy outperforms the IIH when starting with a randomly assigned initial assortment, both proposed heuristics surpass the performance of the Bestsellers heuristic. Surprisingly, although the IIH is intended to enhance any existing assortment selection without guaranteed performance, it exceeds the benchmark in all instances, no matter what initial assortment is selected. Furthermore, when the assortment of one warehouse is fixed, an increase in the capacity of the other warehouse results in a reduction of the total
Furthermore, we extend our experiments to scenarios involving a larger number of products. As the number of products increases, the exponential growth in possible customer orders (
In the following experiments, we simulate customer choices using the Multi-Purchase Multinomial Logit model. In this model, customers have a random intended purchase quantity (IPQ), denoted as
Comparison of different policies under the ICM model for larger product sets.
Comparison of different policies under the ICM model for larger product sets.
It can be verified that the MP-MNL model does not meet the sufficient conditions for the extended MCI policy to achieve optimality. In light of this, we apply the IIH to the selection obtained by the extended MCI policy, specifically by setting the initial assortment for warehouse
The results of the comparisons are summarized in Table 3, with accompanying visualizations for each sub-case depicted in Figures 8 to 13 (more details can be found in Section EC.4.2). Notably, despite the optimal condition from Theorem 4 not being met in this scenario, the performance of the MCI policy still approaches optimality. The strong performance of the extended MCI policy may be attributed to the presence of a first-order dominant MCI for the MP-MNL model, as established in Corollary 2 of Li et al. (2024). Although the MCI of MP-MNL models does not fully satisfy condition (ii) in Definition 1, our numerical tests show that fewer than 5% of all valid tuples

MP-MNL with 6 items (1).

MP-MNL with 8 items (1).

MP-MNL with 10 items (1).

MP-MNL with 6 items (2).

MP-MNL with 8 items (2).

MP-MNL with 10 items (2).
Comparison of different policies under the MP-MNL model simulations.
For all examined scenarios, the two proposed heuristics greatly outperform the Bestsellers heuristic. Importantly, even when the demand distribution fails to meet optimal conditions, the extended MCI policy still provides near-optimal solutions. Moreover, further application of the IIH can enhance its performance and reduce split orders.
Similar to the experiments conducted under the ICM, we extend our analysis to scenarios involving a larger number of products using the MP-MNL model. In these experiments, we assume that a customer’s IPQ can take values up to 5, each with equal probability. This assumption imposes an upper bound on the number of potential customer orders, allowing us to evaluate cases with a substantially larger number of products, up to 1,000. The results are presented in Table 4. We observe that the proposed extended MCI policy and the IIH consistently outperform the Bestseller heuristic across all tested cases. Specifically, the extended MCI policy exceeds the performance of the Bestseller heuristic by at least 4% in all instances. When comparing these results to those in Table 2, a notable difference emerges in Table 4. Unlike the ICM, where customers can purchase up to
Besides synthetic experiments, we also conduct numerical experiments using real-world data from RiRiShun Logistics to further validate the effectiveness and scalability of the proposed methods. Due to space limitations, the details are provided in Section EC.4.3.
In conclusion, the proposed Extended MCI policy and Iterative Improvement Heuristic (IIH) exhibit both efficiency and scalability across various demand structures, including independent product choices and cases with product interdependencies. Despite its computational cost, the MILP formulation (2) remains a valuable tool, particularly when warehouse assortment decisions are not time-sensitive and demand patterns are uncertain, as it provides an exact optimal solution. In contrast, the two proposed algorithms offer efficient and practical alternatives, especially when specific demand patterns can be identified, providing high-quality solutions in a timely manner.
Comparison of different policies under the MP-MNL model for larger product sets.
Until now, we have discussed the multi-warehouse assortment selection problem to minimize order splitting under Assumption 1, which requires all products to be stored within a single-tier logistics network. However, in practical scenarios, many e-commerce logistics companies, such as JD.com and Ririshun Logistics, actually operate multi-tier logistics networks. In multi-tier logistics networks, back-end warehouses typically feature larger capacities and offer essential support to front-end warehouses. Apparently, warehouse assortment selection becomes more complicated in this context. To distinguish this situation, we refer to it as multi-tier multi-warehouse assortment selection (MMWAS), in contrast to the previously discussed single-tier multi-warehouse assortment selection (MWAS).
It is noteworthy that if every front-end warehouse can receive support from any back-end warehouse, determining assortment selections for back-end warehouses simplifies to the MWAS problem. Also, if each front-end warehouse has its own service region without any overlap, determining the assortment selection for a single front-end warehouse reduces to the single-warehouse assortment selection discussed in Li et al. (2024). Conversely, without these conditions, new challenges such as overlapping service regions, evaluating spillover fulfillment strategies (i.e., fulfilling a customer order directly from a back-end warehouse rather than from front-end warehouses), and the joint allocation of products within the multi-tier system must be considered to address the MMWAS problem.
Building on our findings from MWAS, we extend our analysis to the MMWAS problem. In this section, we focus on a two-tier system with a single back-end warehouse storing all product varieties, aiming to determine the assortment selections for multiple front-end warehouses. For simplicity, we retain the previously used notation.
To avoid triviality, we posit that the fulfillment cost incurred by the back-end warehouse exceeds that of any front-end warehouse. This assumption guarantees that relying solely on the back-end warehouse for order fulfillment is not always the optimal strategy. However, if the spillover fulfillment cost from the back-end warehouse is extremely high, all products will be forced to be stored within front-end warehouses, reducing the problem to MWAS. Thus, we make the following assumption:
While fulfilling an order,
Assumption 2 is based on the observation that the strategic placement of front-end warehouses closer to customers leads to reduced transportation expenses and quicker delivery times, thereby making order fulfillment from these locations typically more cost-efficient than from the back-end warehouse. However, splitting orders into multiple suborders incurs additional handling, transport, and operational costs, outweighing the singular higher cost of spillover fulfillment. This assumption highlights the trade-offs between warehousing expenses, fulfillment efficiency, and customer satisfaction. Notably, this setting can also be regarded as if no single front-end warehouse can fully fulfill an order, then the order is considered a lost sale. Additionally, the requirement that
Under Assumption 2, we have the following two observations:
To solve the MSO-S problem, if assortment overlapping is not allowed among front-end warehouses, the following theorem characterizes an optimal warehouse assortment allocation when a second-order dominant indexing rule exists for the demand function
Under Assumption 2, if the demand distribution
Similar to Theorem 2, the optimal assortment selection at the front-end warehouses also involves storing more popular products in larger warehouses, with products of similar popularity levels being stored together in the same warehouse. If the front-end warehouses cannot handle all the products, then the least popular
To address MSO-S without the restriction on non-overlapping assortments among front-end warehouses, we begin with the simplest scenario involving just two front-end warehouses, denoted as MSO2-S. By comparing the MSO2-S problem with the MSO2 problem, we establish the following proposition about the optimal solution structure for MSO2-S.
Under Assumption 2, if the demand distribution
Although Proposition 9 does not directly provide an optimal policy for solving the MSO2-S problem when the demand distribution has a second-order dominant MCI, the insights it provides are immensely valuable. Given that the value of
For general cases involving multiple front-end warehouses, it is intuitive to come up with a greedy algorithm that reduces the chance of fulfilling an arbitrary order from the back-end warehouse step by step. Consider any given set of assortments
Building on the similarity between MSO-S and MSO2, we develop an iterative algorithm (Algorithm 4) similar to Algorithm 1 to refine any existing assortment selection, including solutions from the greedy algorithm. The convergence of Algorithm 4 is established in Proposition 10.
Algorithm 4 converges when the number of iterations is sufficiently large.
It is essential to note that Algorithm 3 and 4 cannot be directly applied to solve the general MSO problem without significant adjustments. This limitation stems from the requirement for a single-tier multi-warehouse system to store all products. Indeed, there may be scenarios where including some of the least popular products has a minimal impact on reducing the objective value, especially when compared to the inclusion of more popular items. Such less popular products might be ignored by these two algorithms, thus failing to meet the requirement to fulfill all orders as required by the MSO problem.
To sum up, we have expanded upon our findings from single-tier multi-warehouse assortment selection to address the multi-tier multi-warehouse assortment selection. Note that this represents a preliminary exploration into a specific subclass of the MMWAS problem. We believe this direction is important, rich in potential, and holds great promise for future research.
This article investigates the multi-warehouse assortment selection problem, with a primary focus on minimizing split orders, a task that proves to be NP-hard. To address this challenging problem, we provide an MILP formulation that can be solved using commercial solvers. However, such an approach, while computationally viable, does not yield structural insights into how products should be optimally allocated across multiple warehouses. To bridge this gap, we introduce the concept of the second-order dominant indexing rule, which facilitates the characterization of optimal assortment structures under in various scenarios. Specifically, when a 2-MCI exists for the demand distribution, we can explicitly determine the optimal non-overlapping assortment. For total nested demand structures, the optimal assortment allocates more popular products to larger warehouses. In two-warehouse contexts, we develop the extended MCI policy, which is proven optimal under demand distributions with 2-MCI. Additionally, we establish a link between the MSO
As for future research directions, several aspects of multi-warehouse assortment selection could be further explored. First, expanding the focus beyond order-splitting minimization to incorporate fulfillment cost trade-offs and more complex multi-warehouse configurations would provide valuable insights. Investigating multi-tier systems where limited splits are preferable to back-end fulfillment also presents a promising direction. Additionally, examining the interplay between fulfillment policies and assortment decisions could offer practical implications for optimizing multi-warehouse logistics. Beyond its application in warehouse assortment selection, the second-order dominant indexing rule has potential applications in various business environments. For instance, in inventory management, firms can leverage the structured demand pattern to optimize stock levels by prioritizing high-demand products, improving inventory efficiency and service levels. Specifically, integrating assortment selection with inventory planning while considering volume and quantity constraints could yield deeper insights.
Supplemental Material
sj-pdf-1-pao-10.1177_10591478251365581 - Supplemental material for Multi-Warehouse Assortment Selection: Minimizing Order Splitting in E-Commerce Logistics
Supplemental material, sj-pdf-1-pao-10.1177_10591478251365581 for Multi-Warehouse Assortment Selection: Minimizing Order Splitting in E-Commerce Logistics by Hongyuan Lin, Xiaobo Li and Fang Liu in Production and Operations Management
Footnotes
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received the following financial support for the research, authorship and/or publication of this article: The work of Xiaobo Li was supported in part by the National Natural Science Foundation of China (Grant 72171156), and by the Singapore Ministry of Education Academic Research Fund [Tier 1 Grants 23-0619-P0001 and 24-0500-A0001; Tier 3 Grant MOE-2019-T3-1-010]. The work of Fang Liu was supported by the Major Program of the National Natural Science Foundation of China (Grants 72192843 and 72192840).
Notes
How to cite this article
Lin H, Li X and Liu F (2025) Multi-Warehouse Assortment Selection: Minimizing Order Splitting in E-Commerce Logistics. Production and Operations Management xx(x): 1–21.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
