Abstract
Recent advancements in Information Technology have provided an opportunity to significantly improve the effectiveness of inventory systems. The use of in‐cycle demand information enables faster reaction to demand fluctuations. In particular, for the newsvendor (NV) system, we exploit the newly available data to perform an additional review (AR) of inventory at an endogenously determined, a priori set time during the sales period, and perform an additional replenishment if necessary. We implemented our innovative model at a market‐leading media group. The results of the initial pilot were dramatic, indicating that the proposed model achieves an increase of 4%–24% in profits compared to the policy before implementation. As a result, the company started following the proposed model for all their printed magazines and observed a significant reduction in operational costs. In a generalized setting, we provide a tractable search‐based optimization algorithm, based on the problem's structural properties, for determining the optimal initial quantity, AR timing, and quantity to restock at that time. Based on these theoretical results, we propose a simple heuristic that can be used for many practical situations including our implementation at Yedioth. Through a computational experiment, we show that our algorithm finds the optimal solution quickly and that the proposed heuristic performs well. We also provide additional insights into the problem—for instance, that our system exhibits properties similar to inventory pooling, provided that the demand rate is large enough.
Introduction
Two major changes have affected supply chain management in recent years. The first is increased product variety accompanied by a shortening of product lifecycles. This combination has led to short‐term, highly uncertain demand profiles, making single‐period inventory models particularly relevant and widely used.
The second change is the recent advancements in Information Technology, such as Electronic Data Interchange (EDI) systems and Radio Frequency Identification (RFID) tags, which provide decision makers with extensive, accurate, and often real‐time, data. Wisely used, this newly available data can assist in improving the performance of the supply chain by reducing operational costs. For single‐period systems, such as the newsvendor (NV) system, one way to leverage this new information is to perform an additional review (AR) and possibly restock at an endogenously determined, a priori set time during the sales period, essentially creating two phases. Our goal is to leverage the available in‐cycle sales information to react to and address demand fluctuations. Whenever the words “replenishment” or “lotsizing” are used in this work, they equally refer to production or procurement. To clarify the problem setting, consider the following example.
A representative example of the print industry, and the motivating real‐world business case for this work, is Yedioth Group, the largest media group in Israel. Its distribution channel that is relevant to the current work includes around 8000 competing retailers. According to a common policy in the print industry, all lotsizing decisions are made solely by Yedioth and not by the retailers. This is due to the contractual mechanism and power relationship between the two parties. At the end of the sales period, excess copies are collected and retailers are fully refunded for unsold copies. In this way, all the surplus risk is borne by Yedioth, and the trade‐off faced by Yedioth for each retailer independently is captured by the standard NV problem. Yedioth uses an EDI system as well as RFID technology to monitor the sales of its print products at the points‐of‐sale (see, e.g., Avrahami et al. 2014). Thus, in‐cycle demand information is available at Yedioth virtually at any time. Moreover, it is possible to have an additional printing run during the sales period, provided that capacity is reserved well in advance. Once capacity has been reserved for the production of a certain print product, the actual quantity to be printed is flexible. Dynamic capacity allocation is not possible due to the printing facilities being shared by multiple products. Moreover, dynamic capacity allocation would have limited or no benefit due to the need of Yedioth's salespersons to commit to a schedule to visit the retailers at the beginning of the month.
Yedioth's situation is typical in the print industry where companies face ever‐growing competition from Internet‐based content. To survive and make a profit, the industry must streamline and reduce operational costs associated with print products. Thus, before the sales period begins, a printing house can decide that at a certain point during the period, after obtaining partial sales information, it may print and distribute another batch of the same product to meet the expected demand during the rest of the period. Due to the nature and sales profile of these products, the print industry fits very well into the newsvendor framework. Unmet demand is lost, surplus inventory is scrapped at the end of the period or is redirected to a secondary sales channel, and there is no holding cost during the period. In case of double‐printing and distribution, the most substantial part of the fixed production cost, that is, preparing the specially produced aluminum printing plates, is not incurred for the second production since these remain from the initial production. Because the printing process is considerably faster than the demand rate, the lead time is considered negligible and additional review and production are easily implementable. The costs associated with the distribution include mainly fuel, but the sales agents typically visit the retailers once more during the sales period anyway for purposes of promotion, advertisement, receiving payment, negotiations, and so on. The days of the additional visit are usually flexible, but must be prearranged for a few reasons. First, the retailers expect to know when the sales agent is coming. Second, the sales agents need to plan these visits among their other activities. Finally, for logistical efficiency, these additional visits need to be on the same day for geographically close retailers.
We model the problem as a stochastic planning problem with recourse and simultaneously optimize the initial order quantity, the timing of the AR, and the quantity to restock at that time. As verified numerous times in the literature, a correctly selected AR timing positively affects system performance compared to an arbitrarily selected one (Erkip 1984, van der Heijden 1999, Jönsson and Silver 1987, McGavin et al. 1993, Smirnov et al. 2021). These studies provide motivation for finding the optimal AR timing in the current work. As discussed in section 4, optimization over time is a complicating factor.
The AR timing in this work is endogenous but must be set before the start of the period. The order quantity at the time of the AR, however, is determined based on in‐cycle demand information. The reason for investigating this type of policy is two‐fold: first, in many industries, production facilities and resources are shared by multiple products. In this way, while the timing of the AR and production is potentially flexible, one must reserve production capacity in advance. Otherwise, the resources are likely to be occupied at the desired time of the additional replenishment. Second, as explained earlier, visits for an additional distribution generally cannot be carried out spontaneously due to other commitments of the sales agents.
The remainder of this study is organized as follows. In section 2, we review the relevant literature. In section 3, we provide the problem description and introduce the notation. In section 4, we analyze the problem and derive structural properties. In section 5, we develop our solution algorithm and section 5.1 presents a simple and practical heuristic. In section 6, we report on a computational experiment and discuss its results. In section 7, we report on the implementation of our model at Yedioth and the savings achieved. We conclude the work in section 8. All proofs appear in the online appendix.
Literature Review
Our work is related to two branches of research, namely, the two‐phase single‐retailer NV problem (e.g., Bulinskaya 1964), and determining the optimal review timing in a stochastic inventory system.
The two‐phase single‐retailer NV problem has been examined in the literature in several contexts such as operational constraints (e.g., Fisher and Raman 1996), forecast updating (e.g., Cachon and Swinney 2009, 2011, Choi et al. 2003), and supply chain coordination (e.g., Donohue 2000, Linh and Hong 2009). These studies utilize the fact that at the point of the AR, demand has been partially revealed. A more general setting includes multiple phases with a replenishment opportunity before each phase begins, and is not limited to a single retailer. The approach usually applied to this kind of problem is stochastic dynamic programming (e.g., Avrahami et al. 2014, Crowston et al. 1973, Nambiar et al. 2020), sometimes combined with Bayesian information updating (Eppen and Iyer 1997, Murray Jr. and Silver 1966). Most of the studies perform forecast updating, an issue not addressed in the current work. In contrast, our work optimizes the AR timing, an issue not considered in these papers.
Determining the optimal review or reorder timing in a stochastic inventory system is a problem addressed in various settings, including multiple periods, echelons or retailers (e.g., Liu and Song 2012, Rao 2003, Shang et al. 2015, Wang 2013, Wang and Axsäter 2013, Wang and Tomlin 2009). Some of the above studies involve lotsizing decisions in addition to timing decisions, and in others, optimization over timing is numerical. Our study, in contrast, simultaneously addresses both lotsizing and AR timing decisions in a single‐period setting with two ordering opportunities.
Closely related to our work is that of Milner and Kouvelis (2005), who study the value of order quantity and reorder timing flexibility in a single‐retailer, single‐period setting with two ordering opportunities. Aside from differences in the cost structure and demand processes, our work differs in several meaningful ways. We analyze the structure of the objective function in all decision variables. As a result, our algorithm always terminates with an optimal solution. In several cases, Milner and Kouvelis (2005) rely on a numerical search in a space whose structure is unknown. Milner and Kouvelis (2005) consider four ordering policies. We consider mainly one ordering policy that is similar to Milner and Kouvelis (2005)'s “quantity flexible” ordering policy (i.e., the reorder timing is fixed and the quantity of the second order is determined using in‐cycle demand information). Moreover, we propose a simple and practical heuristic that performs very well and numerically investigate alternate ordering policies. Finally, we justify our model by a real‐world business case and report on its implementation at a large printing house, including handling real sales data.
Problem Statement
Our focus in this study is Yedioth. In order to address Yedioth's printing decisions, however, we first model a slightly more general problem which lends itself to analysis and the development of effective and efficient algorithms for obtaining solutions. After computationally investigating our algorithms, we return to the case of Yedioth and show how our work has had a meaningful impact.
Planning Horizon and Demand Process
We consider a finite planning horizon, time‐scaled to the interval [0, 1]. The horizon consists of a single sales period, long enough to enable a meaningful division into two parts, [0, t] and [t,1], which we refer to as phases. Product units are discrete and demand follows a homogeneous Poisson counting process, N
t
≥ 0, with a constant arrival rate λ, an assumption that both reflects typical customer arrival and is standard in the literature. We define
By definition, demand in the first phase follows a Poisson distribution with parameter λt, while demand in the second phase follows a Poisson distribution with parameter λ(1−t). It should be noted that for the example of Yedioth, the company experts found no correlation between the demand at the beginning of the sales period and the demand at its end. We confirm this by our own correlation analysis; see section 7.2.
Order of Events and Cost Structure
Prior to describing the order of events and cost structure, we introduce some basic notation. We define z
+≡max{z,0} and z
−≡max{−z,0}. We use Δ to denote a difference and z
* to denote the optimal value of z. We use
Before the initial ordering decision, the stock level at the single retailer we consider is zero. Lead time and fixed replenishment costs are negligible.
At time 0, the system orders a quantity Q
0 at a unit cost of c
1, thus incurring an initial variable ordering cost of c
1
Q
0, and items are received. Subsequently, the sales period starts and demand gradually unfolds. V is the unit selling price. Contrary to the standard NV problem where no AR takes place, the stock level is reviewed at an endogenously determined a priori set time t, and an additional replenishment is (possibly) performed at a unit cost of c
2 ≥ c
1. The AR can be performed at any moment throughout the sales period. We remind the reader, however, that the AR timing must be determined before the period begins, as the salespersons’ visits to the retailers must be scheduled in advance. By time t, the system receives a revenue of
Depending on the realization of
After the AR, sales continue and result in a revenue of

Order of Events in the Proposed System
We assume c 2 ≥ c 1 as we expect this relation to hold for most practical situations. We conjecture that without this assumption, all results hold or can be adjusted with minor modifications. Moreover, in most cases, including in the bakery and printing house examples, the relation is c 1 = c 2. A situation of c 2 > c 1 might occur if, for instance, outsourcing or emergency production is used for the additional replenishment. To avoid uninteresting situations, we make the standard assumptions p+V > c 2 and c 1 > −h.
Analysis
Suppose for now that t is fixed and consider the problem faced at time t. This problem is a building block in the problem for the entire period.
Given t ∈ [0, 1], if Q
0 units are ordered at time 0 and there are n arrivals before time t, the expected profit of the second phase, [t,1], is the standard NV expected profit, that is,
Observation 1 implies the following result (
Given t ∈ [0,1], the expected profit of the second phase is discrete concave in S(t) and it holds that
Constraining S(t) = 0, or setting t = 0 or t = 1 reduces the problem to the simple NV system. The proposed system clearly has an expected profit no less than a simple NV system solved at time 0. Since both probabilities
For all
Objective Function
As mentioned in section 1, optimization with respect to t is a complicating factor for the model. It means adding a continuous decision variable to a problem in which the other decision variables are discrete, thus creating a mixed discrete‐continuous optimization problem. Additionally, and possibly for the same reason, having timing as a decision variable is nonstandard in the literature of stochastic planning problems with recourse.
In its most general form, our objective—the expected profit for the period (π 0)—is a function of three decision variables, namely the quantity to order at time 0 (Q 0), the AR timing (t) and the order‐up‐to level at the time of the AR (S(t)).
Conditioning on the number of arrivals in the first phase and on the number of arrivals in the second phase, and removing the dependence between the AR timing and the order‐up‐to level at the time of the AR, the expected profit for the period is:
To gain insight into the problem, let us examine Figure 2a, which, for a particular problem instance, depicts π
0 as a function of t for a fixed Q
0 = 16 for various order‐up‐to levels. Each curve corresponds to an order‐up‐to level x (

Expected Profit as a Function of t for a Fixed Q
0 = 16 [Color figure can be viewed at
Proposition 1 established that the optimal order‐up‐to level at time t depends only on t; thus, the dimension of the problem can be reduced from three variables to two variables by defining
In the problem we investigate, however, Q
0 is not fixed. We now formulate our objective function as a stochastic planning problem with recourse:
Breakpoints
Let us consider the timing of the AR. As the AR is performed later in the planning horizon, less inventory will be needed in the second phase (Observation 2). When the AR is performed close to time 1, one would prefer not to restock. Since demand is integral, S *(t) is a decreasing step function of t with a step height of one. For an illustrative example of the impact of t on S *(t), see Figure 3.

Illustration of the Impact of t on S
*(t) When
A breakpoint
For all
The sequence of breakpoints is thus
When
Bounds on the Initial Order Quantity,
In this section, we establish upper and lower bounds on the optimal quantity to order at time 0, when the AR timing lies between
Given t ∈ [0,1],
Defining
An upper bound and lower bound on
Analysis with Respect to the Additional Review Timing
In this section, we provide an analysis of the expected profit as a function of the AR timing (t) for given Q 0 and x. The results of this analysis are used later in our solution algorithm (section 5). The following lemma is useful for identifying structural properties of the problem such as unimodaity between breakpoints and breakpoints not being optimal, as well as for developing a bound on the expected profit between breakpoints:
Given Q
0 and x, the derivative of the expected profit for the period with respect to t is
Returning to the pseudo‐cloud‐shaped function (see Figure 2b), we note that for AR timings other than the breakpoints, it holds that
No breakpoint
A key result that is used in our solution algorithm (section 5) concerns the function's piecewise behavior. Despite the fact that
Given Q
0 and x, Q
0 ≥ x,
Theorem 3 implies that
For a given Q
0, if
Given Q
0 and x, we derive an upper bound on the expected profit in the interval

Illustration of the Upper Bound on the Expected Profit from Theorem 4 [Color figure can be viewed at
Given Q
0 and x, an upper bound on the expected profit over
The reader is referred to Appendix B for expressions for
Solution Algorithm
Our optimization algorithm (see Algorithm 1) begins by finding an initial candidate solution. Subsequently, it checks interval–initial quantity (IQ) pairs,
Note that at the end of the execution of Algorithm 1,
For each IQ pair
If an IQ pair is not eliminated, the first‐order condition is applied in that interval (Line 10) using Newton's method. Since
As stated above, Algorithm 1 begins by finding an initial candidate solution. The algorithm terminates with the optimal solution no matter what initial solution is used. This solution, however, does affect the execution time. Depending on the value of λ, we apply two different heuristics.
When λ is small (λ < 16), the optimization algorithm is extremely fast, hence we do not invest significant time in finding the initial solution. In this case, we use a heuristic rule inspired by McGavin et al. (1993), who, although addressing a different problem, heuristically divided the period into two phases of equal length. Thus, we set the initial AR timing to be 0.5 and optimize with respect to Q 0 by enumerating all Q 0's starting with S *(0.5) until we find one for which Equation (6) is nonpositive (see Lemma 1).
For larger λ, it becomes worthwhile to invest effort in finding a good initial solution. Although the optimal solution is not found at a breakpoint (Theorem 2), for λ ≥ 16, we find a two‐dimensional locally optimal breakpoint as the initial candidate solution. We use alternating optimization, that is, starting with
Discrete‐Time Heuristic
In many practical situations, there is a finite set of possible AR times. For such cases, we present a simple and useful heuristic, whose principle is enumerate the possible AR timings and, for each possible AR timing, optimize the initial quantity and the optimal order‐up‐to level. We define
Computational Experiments
In this section, we report numerical results that provide insight beyond what can be seen from the analytical results, thus completing the picture of our system with an AR. Moreover, we demonstrate the speed of our algorithm.
We now examine what we believe to be the most interesting case, that is, c 1 = c 2≡c. Assuming the same unit ordering cost in both phases also enables us to reduce the number of parameters in our experiment to just two.
The expected profit for the period (Equation 5) for the case c
1 = c
2≡c can be rewritten as:
Examining Equation (10), we get the following result.
If the unit ordering cost is identical in both phases, the cost parameters (V, c, h and p) influence the optimal solution
By Proposition 3, each problem instance examined in the computational experiment is characterized by a combination of λ and a critical ratio, (p+V−c)/(p+V+h). Note that this result is similar to the result regarding the optimal order quantity in the standard NV problem.
Due to the fact that the expected profits of both the system with an AR and the NV system depend on V, c, h, and p (and not only on the critical ratio), we decided to compare problem instances using a unitless measure that depends solely on λ and the critical ratio. This measure, which we call PRUC (Percent Reduction in Uncertainty Cost), is defined as the portion of the costs due to uncertainty reduced by applying our AR model, relative to a system with perfect information.
We conducted a full‐factorial experiment involving ten different values of λ and seven different values of the critical ratio, 70 instances in all. Specifically, we used λ = 2 k , k ∈ {0, 1, 2, 3, 4, 5, 6, 7, 8, 9} and (p+V−c)/(p+V+h) ∈ {0.05, 0.1, 0.3, 0.5, 0.7, 0.9, 0.95}.
In our implementation of the calculation of the objective function, we replaced the limits of the infinite sums (e.g.,
Our results are reported in Table 1, we exclude critical ratios of 0.05 and 0.1 for conciseness as they do not affect our observations and conclusions. The first two columns of both sides show the parameters, λ and the CR. The next three columns present the optimal solutions, that is, providing the optimal initial order quantity, the optimal AR timing, and the optimal order‐up‐to level at the time of the AR. The sixth column contains the expected quantity ordered (EQO) for both phases (i.e., the optimal initial quantity plus the expected quantity replenished at the time of the AR). This quantity is given by
Results of the Computational Study and a Comparison to the Newsvendor Model
The impact of the CR on

Impact of the CR on

Impact of λ on
The optimal AR timing can be either smaller or larger than 0.5, regardless of the CR. We note that For a given λ, the optimal EQO is increasing in the CR. For a given CR, the EQO ordered is increasing in λ. Both trends are in line with intuition and parallel NV problem behavior. The relation between the EQO in our system and In most cases, Similarly, in most cases, One can observe a general trend whereby the PRUC of the optimal two‐phase increases in λ and in the CR. For λ ≥ 8, the PRUC often seems to converge for high CRs. This behavior can be partly attributed to the apparent convergence in
The run times for Algorithm 1 depend mainly on λ. Table 2 shows the average (over the CRs) minimum and maximum run times for the different demand rates. Note that the range between the minimum and the maximum times is small.
Average, Minimum and Maximum Run Times for Algorithm 1 for Different Values of λ
Table 3 shows the performance of our discrete‐time heuristic for several combinations of λ and the CR and a number of equally spaced AR timings. The combinations of λ and the CR belong to a subset of the original full‐factorial experiment. We chose the number of feasible AR timings to be a power‐of‐two minus one, because in this case, points are added to the existing set of feasible solutions. Thus, increasing the number of points necessarily improves the objective function.
The Percent Reduction in Uncertainty Cost of the Discrete‐time Heuristic vs. the Optimal Two‐phase
Table 3 suggests that our discrete‐time heuristic performs very well, almost as well as the optimal algorithm, even when the number of candidate AR timings is small. Whereas many viewing this result may choose to use this heuristic solution, we feel that the optimal solution, which is not exceedingly complex, provides additional value.
Implementation at Yedioth Group
Our field study was conducted on a monthly print health magazine called Menta®. Typically, Menta® is distributed to retailers on or around the first Sunday of each month by a team of sales agents, each responsible for supplying several periodicals to several retailers. The distribution day is the same for all retailers and is set by Yedioth's marketing department exogenously to our model.
The production planning and logistics department in Yedioth, internally called the Research Department (RD), is responsible for all decisions regarding Menta's® printing and distribution quantities. Before our involvement, Yedioth employed a two‐stage decision process. Using historical data, first the RD decided on the total number of copies to print for a certain issue and then decided how many copies to supply to each retailer. The RD hoped to supply the correct number of copies to match predicted demand. Occasionally, Yedioth would monitor the real‐time inventory levels at the retailers, and would perform a resupply if a large retailer faced a stockout early in the month. The resupply procedure, however, was not part of Yedioth's standard business practices.
The implementation of our model at Yedioth is related to Avrahami et al. (2014), in which the retailers are supplied twice a period, with the second supply coming from pooled undistributed stock held back at an exogenously determined time. Avrahami et al. (2014) employed simulation‐based optimization with convergence guarantees. We, in contrast, propose two‐phased production and endogenously combine jointly optimal lotsizing and timing decisions. Note that our policy is guaranteed to yield a higher expected profit and is thus preferable in situations that allow the implementation of either model. Moreover, we compare the performance of our policy with that of Avrahami et al. (2014) and verify that our policy performs better for all months of implementation.
We begin by comparing our results in Table 1 with the algorithm of Avrahami et al. (2014). We note that whereas we assume demand is discrete arising from a Poisson distribution, Avrahami et al. (2014) assumes that demand is continuous. Thus, we use the normal approximation to the Poisson distribution to make our comparison. This approximation is most reliable for large values of λ and thus we perform the comparison for λ = 512. The PRUC for our algorithm is based on a single retailer and does not change when considering multiple identical retailers. In contrast, the PRUC for Avrahami et al. (2014)'s algorithm does increases in the number of retailers; we consider 2, 8, 32, and 128 retailers.
The PRUC for our algorithm is 59.7%±0.5% for the various critical ratios (see Table 1). For all number of retailers considered, the PRUC for Avrahami et al. (2014)'s is nearly identical for all critical ratios: For two retailers the PRUC is 13.6%±0.3%, for 8 retailers the PRUC is 25.0%±0.1%, for 32 retailers the PRUC is 28.2%±0.0%, and for 128 retailers the PRUC is 29.0%±0.0%. Even with a large number of retailers our model and algorithm is significantly better. The phenomenon that the PRUC is insensitive to the critical ratio appears in Table 1, particularly for larger values of λ and requires further research.
Adapting the Model to Yedioth's Case
Implementing our model at Yedioth required several modifications. For each individual retailer, the problem's structure is similar to the one discussed previously. From Yedioth's perspective, however, all retailers are considered jointly as a system. We add the index j (j = 1, …, J) to denote the retailer, with J being the number of retailers.
Although inventory levels can be reviewed at any time, the number of possibilities for an additional visit is finite. In practice, a sales agent can resupply a retailer any morning of a given month, that is, around 28–35 days. Thus, we treat time as a discrete variable with a limited state space. Applying a discrete version of our AR model (see section 5.1) to each retailer separately would most likely result in several different additional production/visit days, but the company is unwilling to engage in more than one additional production run and distribution per magazine title. 1 Moreover, the sales agents can visit the retailers at most once during the month without altering the usual business practices and incurring additional costs. Hence, the day of the additional visit is constrained to be the same for all the retailers.
Unlike in the basic AR model, demand at each retailer is assumed inhomogeneous Poisson. To be able to use our previous results, we perform time rescaling and transform the demand process into a homogeneous one. Let T be the number of possible additional visit days in a month, s be the joint AR day index (s ∈ {1, …, T}),
We use vector notation for the inventory decisions. Our problem is to decide upon the initially supplied quantities
Given the AR day, we can also determine the optimal initial quantities based on Lemma 3. Thus, the optimization problem becomes a function of a single decision variable:
Pilot Study Details and Results
We selected 174 large retailers for the pilot field study, all connected to an EDI system and all associated with the same supermarket chain. The field study was performed during five months in the second half of 2015, which contained no special events such as holidays or price discounts.
We gathered two types of data from Yedioth. The first, obtained from Yedioth's information system, is monthly aggregate historical data, which included, for every issue, the quantities supplied to each retailer, the monthly sales, the number of returns and the number of additional copies supplied in emergencies, if any. The second, obtained from the EDI system, represented daily sales data for each retailer. The two sources gave consistent information in all but a few cases.
As mentioned earlier, we assume that the individual retailer demands originate from a Poisson distribution. This is consistent with the how demand presents itself, that is, a large population with a small proportion choosing to buy the magazine. Estimating demand parameters from sales information required developing a special procedure, due to censored demand. This procedure is referred to as “demand uncensoring” hereafter. The essence of our procedure is finding the maximal likelihood estimator (MLE) for the mean demand (λ), based on the censored sales observations and the supplied quantities. For each retailer, the likelihood function (denoted by L) is a modification of the one found in Conrad (1976) for the case of time‐varying quantities. Details are provided in Appendix A. Our procedure uses only first‐order information to find the MLE and its correctness is based on the following result.
The likelihood function, L(λ), is unimodal in λ.
Menta® is a low‐demand item. Historically, sales have never exceeded 37 for an individual retailer, and were typically well below 20. Moreover, we observed that daily sales peak before weekends (especially on Thursdays) and decline to almost zero on Saturdays 2 , and that daily sales decline throughout the month. This behavior is illustrated in Figure 7, which shows the aggregate sales of the 174 participating retailers for the June 2015 issue. According to Yedioth's RD, this pattern has been consistent for many years. We verified this by examining the sales profile of an issue from the past (August 2011) and observed a very similar pattern.

Total Daily Sales of the June 2015 Issue of Menta® [Color figure can be viewed at
After examining the sales profile of the individual retailers and consulting Yedioth's experts, we made the assumption that all retailers have the same the sales profile, and that for a given issue, a retailer's daily sales profile is the same as the aggregate daily sales profile. This assumption is reasonable since all the retailers in the field study belong to the same supermarket chain.
We used the data from the EDI system to obtain an estimate of the daily sales distribution in each month. First, we neutralized the effect of the weekend by ignoring the observations from Thursdays and Saturdays, and fit a logarithmic decaying curve to the remaining data. This was done for three different issues (June 2015, R 2 = 0.96; July 2015, R 2 = 0.85; and August 2015, R 2 = 0.91). Next, for the same issues, we obtained the Thursday and Saturday coefficients by calculating the ratio of the real daily sales to the predicted daily sales without the weekend effect. The coefficients of the other days of the week were set to one. We multiplied the daily sales predicted by the logarithmic curve, by the coefficients (shifted to match the varying days of the week in each issue), and normalized the total monthly sales to one, to obtain estimates for the proportions of the daily sales in future months. We also accounted for the varying number of days on the shelf of different issues. For each issue, we relied on different fitted curves and coefficients, depending on the available data at the time of decision. Table 4 provides a summary of information used in the field study while Figure 8 summarizes the order of actions in a typical month.
Summary of Information Used in the Field Study

Actions Taken in a Typical Month
For the pilot field study, we relied on Yedioth's experts’ analysis as to the lack of demand correlation over time, but after the completion of the pilot we used the full EDI data to test this. We calculated the correlation coefficient based on a total of 870 pairs of observations. Each pair consisted of (uncensored) demand before and after the AR in each month of implementation. The resulting coefficient is ρ = 0.271, which implies a positive but weak correlation between sales over time. The significance of the correlation is p v < 0.001 based on a standard t‐test (see Appendix C for details). This implies that Yedioth can draw little or no insight from demand forecast updating.
Although the ultimate aim was to perform a second printing for Menta® based on the suggestions of our model, during the pilot field study, a convenient and readily available source of magazines for the additional replenishment was a central excess stock of Menta® magazines, which Yedioth prints and keeps to hedge against uncertain yields (a common practice in the print industry). This option required almost no logistical modifications while enabling us to convince Yedioth's decision makers about the importance of our proposed model. There was always enough excess stock to accomplish the replenishment.
The unit cost information was obtained as follows. The unit selling price is known (V = 21.2 NIS). The unit production cost was obtained from the production department (c = 2.54 NIS) and is the same for both production runs. The unit disposal cost, also obtained from Yedioth, was estimated as h = −1 NIS. This cost is negative due to a secondary use of the returned copies (selling to secondary markets at a reduced price, using as samples in selling subscriptions, handing out to visitors, etc.). The unit penalty cost, estimated by the marketing department as p = 2 NIS, incorporates the loss of potential readers and the loss of exposure to advertisement. This means a high critical ratio of 0.93. To avoid the fixed visit cost, we asked the sales agents to perform their standard additional visit at the timing suggested by our model.
During the pilot field study we encountered and addressed several challenges. Lag in obtaining the EDI data. This lag was generally around 48 hours, which meant that our decisions regarding the additional shipment quantities were based on slightly less information than desired. Nevertheless, we increased Yedioth's profits, and expect even better results when this lag is eliminated. Lack of timely information regarding the number of days an issue will be on the shelves. Due to last‐minute changes common in the print industry, we had to make appropriate assumptions. The assumed number of days later turned out to deviate from the actual number of days by no more than two. Recalling the decline in the sales throughout the month, we believe that this mismatch had very little impact. Decline in sales over the past years. As with many other print products around the world, Menta® is experiencing an overall decline in sales. The decaying sales pattern meant that some historical data did not predict future demand. After consulting Yedioth's experts, only relevant observations were used in estimating the mean demands. It should be pointed out, however, that not all the 174 retailers that participated in the study experienced decaying sales. Some showed no trend, and a minority even showed an increase over the past years. Censored demand, as detailed in Appendix A. We addressed this issue by implementing the uncensoring procedure described earlier in this section, which did not change estimates much. Skepticism of the RD. We compared the initial quantities obtained from our two‐phase model to those obtained by Yedioth's RD for each retailer. The company's quantities were often lower than the quantities we suggested. This is possibly because the in‐house model did not consider properly the penalty cost and the revenue loss from a lost sales. We discussed the quantities with Yedioth each time a large difference occurred. While Yedioth's RD partly accepted our suggestions, it did make minor manual adjustments to them. Nonetheless, our model managed to perform better than Yedioth's RD even after these adjustments. As we demonstrate later, our model without adjustments would have performed even better. Moreover, we compared the quantities suggested by our model to NV quantities. Lack of cooperation on the part of the sales agents and the retailers. Due to the magnitude of demand, the quantities to be supplied during the additional visit were sometimes as low as a single copy. Additionally, our model can sometimes suggest performing an additional visit when the on‐hand stock level is positive. Due to various objections from the sales agents, we surrendered the idea of one‐copy replenishing for a retailer who had positive on‐hand stock. Our model outperformed Yedioth's model even with these changes.
Table 5 contains the PRUC for the system of 174 retailers for each of the five months under consideration. The profits in Equation (11) are aggregated over the 174 retailers. The results point to a significant savings provided by our model. On average, 19.35% of the gap between a system of separate NVs and a system with perfect demand information can be overcome with the aid of our model, which is considered a particularly high savings for the print industry. Table 5 also reports the PRUC when we relax the constraint that the AR day to be the same for all retailers. For each month, we observe a minor increase ( < 1% in all cases) in the PRUC. We note that the optimal unconstrained AR days are scattered around the optimal constrained AR day. More importantly, we emphasize that this unconstrained solution information is presented for illustrative purposes as this policy is not implementable. Logistical constraints dictate that geographically close retailers have the same AR day.
PRUC, Second Half of 2015
We performed a counterfactual analysis 3 to compare the performance of the various models. In August and December, Yedioth followed our proposed policy. In the other months, they followed their own policy. Upon obtaining complete sales data, the policies were compared. Thus, for July, October and November, the analysis of Yedioth's policy is real while the analysis of our policy is counterfactual. For August and December, the opposite is true. For each month, we also examined what would have happened had the standard NV model or Avrahami et al. (2014)'s model been followed. We remind the reader that the essence of Avrahami et al. (2014)'s model is that the second supply comes from pooled undistributed stock held back at an exogenously determined time, as opposed to our model which allows for a second production at an endogenously determined time. Since manual adjustments to our suggested decisions were applied by Yedioth, for August and December we also provide a counterfactual analysis of what would have happened if our model had been used as is.
To evaluate the difference in the models’ performance, we needed to estimate the lost sales at retailers with a stockout. The number of lost sales was estimated using the EDI data to determine when the last copy was sold. We then approximated the lost sales to be the expected demand until the end of the period.
Table 6 presents the comparison (see Table 7 for the results of statistical tests), with headers indicating which model was used. Note that the actual sales, quantity produced, returns, and lost sales in July, October and November appear in the second column, while the actual numbers in August and December appear in the fifth column. These numbers are in bold for easy reference. Our model tended to suggest that an additional visit should be carried out on average after 55% of the monthly predicted demand has been observed.
Counterfactual Analysis, Second Half of 2015
Pairwise Comparison of Policies (percentage)
One can observe that our two‐phase model, even when manually adjusted, always gives better results than Yedioth's model. While increasing the total production quantity and the number of returns, it also reduces the number of lost sales (which is the same as increasing the number of sales). In November, the demand in the first phase happened to be larger than the mean demand, and our model compensated for that by suggesting producing more in the second phase, such that the total quantity produced exceeded the one in the NV model. Compared to the standard NV model and Avrahami et al. (2014)'s model, our two‐phase adjusted model is superior in four out of five months. Moreover, in all five months, the two‐phase model without adjustments is superior to all others. While a considerable portion of the improvement over Yedioth's policy can be achieved by simply implementing a standard NV model, our two‐phase model can increase profits further. The reason is that the two‐phase model with an optimally timed additional visit captures the benefits of risk pooling over time and enables the use of in‐cycle sales information. We attribute the reduced difference between Two‐Phase and Two‐Phase Adjusted in December compared to August to the trust that we built with Yedioth which meant that they were less likely to “adjust.”
Table 7 details the pairwise percentage increase in the profit (relative to the latter policy in each pair) for each month. As expected, Yedioth gave the lowest profit followed by NV, Avrahami et al. (2014), and Two‐Phase, which gave the highest profit. Moreover, Two‐Phase gave higher profit than Two‐Phase Adjusted. We verified the significance of these differences through a series of one‐tailed Wilcoxon signed‐rank tests at the 5% significance level. We used this non‐parametric test rather than a paired t‐test because our data was not normally distributed. The stars in the first column indicate that the result is significant for the overall profit across the five months of implementation. The stars in the other columns indicate that the result is significant for the individual retailer profit for the month indicated. The plus signs in the first column indicate that the result is significant for the individual retailer profit for all the months for which data was available. Based on these last tests our policy (Two‐Phase) performs significantly better than all the other policies.
Expanded Implementation at Yedioth
After the successful completion of the pilot field study, Yedioth's management was ready to fully implement the proposed two‐phase model and change the way they make lotsizing decisions. In fact, Yedioth scaled our model to all its printed magazine titles and started routinely following it. Initial shipment quantities are determined accounting for the additional distribution later in the sales period. For simplicity, Yedioth decided to determine these quantities based on a heuristic whereby a NV problem is solved for the first phase alone, and the resupply quantities are determined by our algorithm. Since multiple magazine titles are involved, Yedioth examines several practical options for the AR day and selects the best one subject to the availability of the salesperson. Given that most Yedioth magazines are weekly (in fact, only Menta® is monthly), there are only a few options for the AR day. Moreover, Yedioth resolved the issue that had caused the lag in obtaining the EDI data.
Contrary to Yedioth's initial willingness to engage in another printing run in the case of a successful pilot, the company eventually decided not to perform a second printing. Rather, it continues to use the additional copies that are printed as safety stock against uncertain yield. Yedioth's experts estimate the savings from the changes to be 250,000 USD per year, net of the logistics cost.
Concluding Remarks
In this work, we presented a model and solution that we implemented at a major media group in Israel, Yedioth. We observed a considerable increase in profits compared to the existing policy. Our model is also shown to be superior to the standard NV model and Avrahami et al. (2014)'s model. Recalling that Yedioth has as many as 8000 points‐of‐sale, the increase in profits due to our two‐phase model is substantial. Moreover, Yedioth now successfully follows our two‐phase model for all their printed magazines, thus achieving a competitive advantage through a considerable cost saving, even if logistics costs, which do not exceed 2% in the case of Yedioth, are taken into account. The implementation at Yedioth shows robustness of our model. The savings from our model which assumes Poisson demand are considerable, even if demand may not be Poisson.
We analyzed a two‐phase NV problem with a possibility for an AR and replenishment. The novelty of our approach is in simultaneously determining the optimal initial order quantity, the optimal AR timing and the optimal order‐up‐to level at the time of the AR. We identify the problem's structural properties and propose an exact and tractable solution algorithm. The algorithm scans
We also developed and tested a simple heuristic solution that works exceedingly well. Nonetheless, we prefer the slightly more involved optimal algorithm due to the substantial costs being considered. As retail costs are significant in the total value chain of a product, the savings enabled by our model are important for decision makers. Our model is general enough to be applied to other printing houses and industries that have similar setup conditions and operate within the NV framework. This includes bakeries (some of which routinely practice double‐baking with the second batch being based on observed demand), other food industries, and the apparel industry.
A possible direction for future developments is to consider demand forecast update at the time of the AR, which would enable decision makers to obtain improved insights. In the case of Yedioth, however, this is not crucial as demand exhibits very low correlation over time.
Footnotes
Acknowledgment
The authors express their gratitude to Yedioth Group for providing the necessary data and running the pilot field study.
The weekend in Israel is on Friday and Saturday, and on Saturdays most shops are closed.
Ideally, we would have randomly picked a subset of retailers to follow our proposed policy while leaving the others to perform the current policy. Such an experiment was not allowed by Yedioth.
