Abstract
In this paper, we study the effectiveness of incentives on delivery service time slot choices. In particular, we focus on the use of green labels that specify time slots as environmentally friendly and that intrinsically motivate customers to choose a specific delivery time slot in lieu of price incentives based on extrinsic motivation. We argue that this is important since green labels’ intrinsic nature affects costumer choice in fundamentally different ways than price incentives. We conduct two experiments and two simulation studies to study the effects of using green labels. Our experimental findings suggest that: (i) green labels are an effective tool to steer shoppers toward a certain delivery option, (ii) green labels are more effective for people who are more eco‐conscious, (iii) green labels remain effective in the presence of price incentives, while price incentives offer little added value beyond that of just green labels, and (iv) the effectiveness of green labels vs. price discounts remains high when time slots are less appealing (i.e., longer). Our simulation findings suggest that green slots, compared to price incentives or no incentives, offer providers a way to effectively steer consumer time slot choices to yield shorter routes, fewer delivery vehicles used, and more per‐customer revenue. We thus conclude that steering individuals to select delivery time slots through intrinsic motivation via green labels may be a promising, no‐cost direction for (online) retailers and an important topic for further research.
Introduction
The 2020 coronavirus pandemic has boosted the demand for online grocery delivery services across the globe. International grocery retailer Ahold Delhaize, for example, has seen an increase in online grocery sales of 77.6% in the second quarter of 2020 as compared to 2019 (Ahold Delhaize 2020). The lockdowns and safety concerns have pushed many new consumers to the online sales channel. However, while demand is surging, e‐grocery service providers are still struggling with the logistical challenges of effectively and efficiently providing home delivery. This is particularly the case since customers must be present to accept the provided services. Here, it is common to offer customers a choice of delivery time slots via an online booking system.
In this context, customer time‐slot choices directly impact the operational efficiencies of service providers (Agatz et al. 2008, Boyer et al. 2009). For example, a delivery truck may have to visit the same street multiple times daily to serve customers with different delivery times. Thus, attended service may generate twice the vehicle miles as unattended home delivery systems (Agatz et al. 2011, Lin and Mahmassani 2002, Punakivi and Tanskanen 2002).
Inefficiencies in the last‐mile not only yield higher costs for the service provider, but also impose substantial societal costs such as carbon emissions, air pollution, and traffic congestion. These are important issues since air pollution is associated with millions of premature deaths across the globe (WHO 2016), and traffic congestion costs hundreds of billions of dollars in lost productivity and fuel waste (INRIX 2018).
Companies have typically focused on reducing inefficiencies through supply‐side process improvements related to network design (de Vries et al. 2020), inventory deployment, and route‐dispatch decisions (Dayarian and Savelsbergh 2020, Emadikhiav et al. 2020). Another approach, at least as powerful, focuses on influencing demand as a way to enhance system performance. Regarding attended home delivery, such demand‐side management can help steer customers toward service options that are expected to cut system‐wide vehicle miles.
The prevailing demand‐management paradigm uses pricing to steer behavior (Amornpetchkul et al. 2018, Ceryan et al. 2018, Shen and Yu 2019). For instance, some service providers enlist price incentives to encourage customers to select delivery time slots that facilitate efficient route plans. Such incentives are typically presented as discounts on the delivery fee. The question of how to dynamically offer such price incentives to maximize profits has been much studied in recent years (Campbell and Savelsbergh 2006, Yang and Strauss 2017, Yang et al. 2014). However, price incentives pose several major drawbacks. First, price incentives kill margins. In low‐margin business models, such as delivered groceries, small discounts can greatly harm profitability (Srinivasan et al. 2002). Second, customers may come to expect lower prices (DelVecchio et al. 2007) and delay purchases hoping for more discounts (Zhang et al. 2019). This type of strategic behavior is a known bane to effectiveness in dynamic pricing policies (Liu and Zhang 2013, Su 2007). Third, dynamic pricing can be perceived as unfair by the customer, undermining trust in the online provider (Weisstein et al. 2013, Xia et al. 2004).
Alternatively, one can manage demand based on differentiating the product or service offering (Agatz et al. 2013). For example, service providers can limit the number of time slot options in a given geographical area to cluster customer orders and reduce the distance traveled per order. Several papers have focused on developing prescriptive methods to decide which slots to open and in which areas, both static, prior to the order in‐take (Agatz et al. 2011) and dynamic, based on already accepted orders (Campbell and Savelsbergh 2005, Ehmke and Campbell 2014, Köhler et al. 2019). The disadvantage of slot‐based approaches is that they limit the number of time slots choices for the customer which may make the service proposition less attractive.
Fortunately, there are other ways to steer demand that are not based on pricing and slotting. Specifically, we argue that companies can also encourage customers to choose delivery times that reduce routing distance by featuring environmental benefits. Several online grocery retailers, including Ocado and Peapod, are appealing to the customer environmental concerns by indicating which delivery times are associated with lower fuel consumption and emissions. We refer to such non‐price incentives as green labels where time slots are green and indicate their associated environmental savings (see Table 1).
The Use of Green Labels at UK‐based E‐grocer Ocado.
Green labels fundamentally depart from price incentives in how they steer customer behavior. While price incentives are extrinsic motivators offering financial reward, green labels are intrinsic motivators featuring an alternative that feels “right” attuned to customers’ values. Green labels thus steer customers into specific slots free of price incentive problems. This makes green labels appealing for service providers. Beyond being an alternative to price discounts, green labels also exert very different influences on customer behavior due to their intrinsic nature. As such, well‐known banes linked to price incentives do not apply to green labels. Although online retailers in practice are starting to use green labels as part of their service offerings, this topic remains largely unexamined in the literature. This study aims to fill this void. As far as we know, this is the first study to scrutinize the effectiveness of green labels in the area of last‐mile delivery.
In this study, we focus on several key questions related to the use of green labels in this context. In our experimental studies, we investigate some of these questions: First and foremost, we examine the effectiveness of green labels in steering customers toward more sustainable service options compared to using price incentive or using no incentives. Second, to demonstrate that green labels are intrinsic motivators, we study whether green labels especially appeal to customers that value the natural environment. Third, we test the impact of combining green labels with price incentives to argue, based on the intrinsic nature of green labels, that price incentives offer no extra value. Fourth, we investigate whether the effects of green slots persist when targeted options become less convenient (i.e., longer windows vs. other available slots) to argue, again based on the intrinsic nature of green labels, that green label outperform price incentives.
Finally, we simulate the impact of green‐label steering customer behavior on the efficiency of the last‐mile delivery system: are green labels truly pro‐environment, and how do they affect delivery as a whole? In particular, these simulations consider two different demand‐management settings where green labels can be used in last‐mile delivery. In the dynamic setting, customers choose from a set of time slots equal in length, for example, 1‐hour time slots. The service provider dynamically determines, as each customer engages, delivery time slots likely associated with the least routing distance adding that customer to the given schedule. Here, we investigate the effects of green labels vs. price incentives. In the static setting, the customer can choose among time slots of different lengths, for example, 1‐ or 5‐hour slots. Longer time slots offer more routing flexibility and less travel (Köhler et al. 2019, Strauss et al. 2020). Here, longer slots are incentivized for all customers before routing. In this setting, too, we investigate the effects of green labels vs. price incentives.
Our main contributions to the Operations Management (OM) literature can be stated as follows. First, we debut empirically testing of green‐label effectiveness in steering service‐choice behavior. We provide evidence that green labels intrinsically motivate to work in fundamentally different ways than commonly studied in extrinsic motivators such as price promotions. Moreover, we show that green labels and price promotions/price incentives do not work well together. While prior OM literature (Chen and Chen 2015) has primarily focused on price‐based approaches, we show that green labels can be equally or more effective (albeit different) in steering behavior. We hope to spark future work in this area. Second, we pair a simulation model based on state‐of‐the‐art time slotting and routing heuristics with our empirically validated customer‐choice model to quantify the potential impact of steering demand. This means we complement extant studies that explore the impact of demand management approaches in the last‐mile based on artificial and stylized customer demand data (Campbell and Savelsbergh 2006, Ehmke and Campbell 2014) or those chiefly based on price incentives (Yang et al. 2014).
Finally, our study merges the advantages of experimental study and simulation to overcome some of the limitations these methods have in isolation to complement existing frameworks that combine surveys or case studies with simulation. Specifically, Chandrasekaran et al. (2018) overview the latest possible uses for the combining empirical and simulation methods. Absent from their model is our use of experiments offering causal evidence of effects joined with simulation that quantify these effects on delivery operations systems (Chandrasekaran et al. 2018). We believe this mode of merging experiment with simulation is widely applicable and useful in operations management.
The remainder of this study is organized as follows: section 2 provides our theoretical foundation and the hypotheses; section 3 and 4 feature results from empirical plus simulation study of the dynamic setting for equal length slots; sections 5 and 6 discuss results from empirical plus simulation study of the static setting for slots of different lengths; and section 7 concludes.
Theory and Hypotheses
The Effect of Green Labels
The core premise of this research is that the use of green labels in last‐mile delivery is an effective tool to steer demand. The assumption behind this premise is that people prefer “green” services. There are several streams of literature that testify to this idea.
First, we present evidence concerning “green” consumption. Over recent decades, companies have increasingly developed and produced eco‐sustainable products and services to reduce their environmental impact while appealing to the burgeoning number of eco‐conscious consumers. Organizations and individuals concerned about the environment are willing to take steps toward eco‐protection (Peattie and Ratnayaka 1992). Research suggests that consumers generally prefer “green” products (Ottman 1993, Rana and Paul 2017, Salzman 1991). In a recent European study, three‐quarters of participants claimed willingness to pay up for eco‐friendly products. In a similar vein, surveys in the United States, Japan, and India show that consumers are willing to pay more for “green” electricity (Nomura and Akai 2004, Rana and Paul 2017, Roe et al. 2001, Yadav and Pathak 2017).
Second, apart from “green” consumption, there is also a large body of empirical research on stimulating people to display pro‐environmental behavior, that is, behavior that incurs little harm to the environment, or even benefits it (Steg and Vlek 2009). More than three decades of research suggest that (near) real‐time feedback can change household‐level behavior in terms of energy and water consumption. However, recent meta‐studies show that the results and effect sizes differ widely across and even within studies (Delmas et al. 2013, Vine et al. 2013). Moreover, much debate remains about the long‐term effectiveness of feedback interventions (Buchanan et al. 2015, Lynham et al. 2016). Interestingly, several studies based on field experiments show that listing environmental information can more effectively elicit conservation behavior than offering mere financial information. Jain et al. (2013), for example, saw tallying environmental externality units (e.g., the number of “trees needed to offset emissions” of customer consumed energy) to be more effective in cutting energy use than other information strategies, both short‐ and long‐term.
To summarize, providing green information works. It may, in some circumstances, even be more effective than the alternatives since green information may intrinsically motivate people to buy a product, to display a behavior, or in our study, to choose a certain delivery service option. Ryan and Deci (2000) defined intrinsic motivation as the motivation to conduct an activity simply for the enjoyment, pleasure, or satisfaction of the activity itself. Extrinsic motivation refers to acts seeking some separable outcomes, for example, perceived utility, getting rewards, and avoiding penalties. Providing green information about certain activities may help customers to see the value of these and, to the extent that sustainability fits their value systems, intrinsically motivate them to pursue these actions. As a consequence, the consumer may thus perceive the buying of green products to be more satisfying or the using of less energy to be more important.
We argue that the same holds for green delivery services that provide attended home delivery. Using green labels to highlight the sustainable delivery times can allow customers to opt for slots that better align with their values. Consequently, we argue that the use of green labels are effective in steering customers toward these specific slots. This leads to the following hypothesis:
Delivery time slots featuring a green label are more likely to be selected than those without a green label.
One important premise underlying green labels is that they work through intrinsic motivation. This, we propose, has several important consequences for the effectiveness and use of green labels. Specifically, extending prior research on intrinsic motivation, we propose that green labels, due to their intrinsic nature, will (i) be more effective for eco‐conscious customers, (ii) not synergize with price incentives, and (iii) be less sensitive to the degree of inconvenience (length) in slot options. The following sections explain these ideas.
Green Labels and Eco‐consciousness
The notion that green labels work through intrinsic motivation also implies varying effectiveness across different customer segments. The idea behind the intrinsic value of green labels is that they match the value system of customers to create intrinsic motivation. Indeed, values and norms are widely known as important drivers of behavior. For instance, the theory of planned behavior posits that individual behavior is influenced by attitudes toward it, by subjective norms, and by perceived control (Ajzen 1991). Wilson and Dowlatabadi (2007) have argued that behavior is determined by psychological factors such as values, attitudes, and personal norms, as well as by contextual factors such as available choices, economic incentives, and social norms.
Buying green products or displaying green behavior is altruism influenced by personal norms (Schwartz 1977). More specifically, one important factor in green behavior is eco‐consciousness (Gleim et al. 2013, Laroche et al. 2001, Rana and Paul 2017). Eco‐consciousness is an individual’s stable inclination to favor sustainability and value the natural environment. Research suggests that eco‐conscious customers tend to buy green products (Kim and Chung 2011). For instance, a survey on consumers in Iran has shown a positive correlation between consumer eco‐concern and green purchase intention (Vazifehdoust et al. 2013). An empirical study by Suki (2016) showed similar results. Green labels thus stimulate eco‐conscious customers to act with environmental responsibility (D’Souza et al. 2007, Leonidou and Leonidou 2011, Menon and Menon 1997, Yazdanifard and Mercy 2011).
We argue that green labels work best when they align with customer values and concerns that are pro‐environment. Indeed, according to Steg et al. (2016), obligation‐based intrinsic motivation underlies pro‐environment behavior. This implies that green labels may heighten intrinsic motivation in customers who care more about the environment. This leads to the next hypothesis:
Customers who are more eco‐conscious are more likely to choose a delivery slot featuring a green label.
Green Labels and Price Incentives
We earlier suggested that the use of green labels benefit companies. Given the predominance of price incentives, this begs the question of how green labels and price incentives may interact.
Specifically, price incentives are known to be an effective way to increase customer purchase intention and choices (Chen et al. 1998, Duvvuri et al. 2007). Price incentives can be effective in shaping customer time slot choices as well (Yang et al. 2014). Some companies, for example, Albert Heijn Online and Peapod, now merge price incentives with green labels. Yet, it is unclear how a combination of price incentives with green labels affects slot choices: Is the combination of these parts equal to, bigger or smaller than its summation? Research on extrinsic and intrinsic motivation speaks to this issue. Prior research has predominantly implied that external rewards may actually suppress intrinsic motivation (Deci 1971, Lepper et al. 1973). When extrinsically motivated tasks are linked with external rewards, touting external rewards may curb people’s intrinsic motivation for the task (Schwartz et al. 2015). This is often referred to as the “crowding‐out” effect (Lepper et al. 1973).
The idea behind the crowding‐out effect is that feelings of control and autonomy induce more intrinsic motivations (Ryan and Deci 2000). For instance, autonomy‐supportive teachers yield more intrinsic‐motivated students (Ryan and Grolnick 1986). External rewards, though, are often perceived as autonomy‐diminishing or arm‐twisting, thus desensitizing any intrinsic motivation (Bowles 2008). Meta‐analyses show rewards attenuating intrinsic motivations (Deci et al. 1999).
When a price incentive is offered for a behavior, it implies that conduct can be bought and sold, a commodity view rarely associated with social norms. In this way, price incentives reduce the intrinsic motivation for a specific behavior (Gneezy et al. 2011). For instance, a penalty on parents who pick up children late at the day‐care center can actually increase the number of late pickups (Gneezy and Rustichini 2000). A price incentive can be perceived as distrust, even disrespect. That is, an individual’s intrinsic motivation is deemed too weak to undertake the task without a price incentive. Since people exert more pro‐social efforts when they feel trusted and respected (Ellingsen and Johannesson 2007), the price incentive crowds out intrinsic motivation.
This suggests that the impact of simultaneously pairing a price incentive with a green label may underperform their separate effects. The price incentives (as a form of external rewards) may undermine (crowd out) the intrinsic motivation triggered by the green labels. Customers may feel that choosing a delivery option that has both a price incentive and a green label may not constitute a behavior in tune with their values toward sustainability. However, this occurs only when price incentives are seen as “controlling” by customers, that is, when shoppers feel that companies are steering them toward certain slots via discounts. Assuming that choosy customers understand the value of demand management for companies, we believe the crowding‐out effect really occurs. This analysis leads to our third hypothesis:
The effect of the green label is less strong if the same slots are also associated with a price incentive.
Green Labels and Longer Time Slots
Another consequence of the intrinsic motivation from green labels is that they may still work even when the green service is not appealing. This is important in the area of last‐mile delivery since slot length is often crucial to both the costs and effectiveness of demand management systems (Campbell and Savelsbergh 2005, Solomon 1987). Longer time windows, for example, 5 hours vs. 1 hour, are much less appealing for customers waiting at home for an uncertain delivery time. Encouraging customers to choose longer delivery slots via price incentives may prove difficult and costly. Indeed, when price incentives are used to prompt certain behavior, an individual’s effort positively correlates to compensation amount (Heyman and Ariely 2004). In other words, price incentives trigger an economic transaction rationale where cost and benefit are weighted. In order to steer customers to longer time slots, companies must offer greater price incentives or, put differently, the same price incentive will be less effective for longer time slots.
This may be less so for green labels and their intrinsic motivating function. There is evidence that intrinsic motivation may be more resilient than extrinsic motivation in the face of obstacles or difficulties in the transaction. For instance, Amabile and Pratt (2016) have noted that intrinsic motivation is a stronger predictor of creative behavior than extrinsic motivation, eliciting greater persistence in the face of obstacles in creative tasks. Persistence correlates positively to creativity (Nijstad et al. 2010). Moreover, when customers are intrinsically motivated to pro‐environmental action, they willingly exert more effort than the extrinsically motivated ones (Heyman and Ariely 2004). In other words, intrinsic motivation may not trigger an economic rationale, and it remains more motivating even for less appealing time slots. Therefore, we propose the following hypothesis:
The longer the time slot, the stronger the effect of green labels as compared to the effects of financial incentives.
Overview Studies
We conducted two experimental studies and two simulation studies to test our hypotheses and examine their impacts on the operational system in last‐mile delivery. The first empirical experiment tests the first three hypotheses in a setting where participants select from time slots of equal duration, some of which are incentivized. We used the effects found in this study in a simulation experiment enlisting a dynamic approach to incentivize time slots and assess impacts on the operating system. A second empirical experiment aimed to replicate the effects found in the first experiment and also test Hypothesis 4 in a setting where participants opt between short and longer, incentivized time slots. A second simulation translates these findings into effects on the operating system.
Empirical Study 1—Incentivizing Identical Time Slots
Experimental Design
In this study, we tested our first three hypotheses: whether adding green labels makes time slots more likely to be selected (Hypothesis 1); whether participants who are more eco‐conscious are more likely to select a green‐label time slot (Hypothesis 2); and whether the effect of the green label fades when merged with a price incentive (Hypothesis 3). We operationalized the “green label” by using a green‐colored label accompanied by the following explanation: “In the green time slots a delivery van will already be in your area. So, booking it will help to save fuel and reduce emissions.”
We designed the study for high external validity and encouraged participants to reveal their actual views. We staged our study as a validation study of a new survey (for the eco‐consciousness survey, see below). Before showing participants the survey, we told them that by participating in the study, they also enter a lottery where they may win an actual home delivery in the time slot of their choice. The time‐slot selection embodied the actual experiment. We emphasized that they must be at home during the entire time slot to receive the goods. To make the incentives as realistic as possible, participants could win $100 of groceries. When selecting a discounted slot, the discount would be added to the shopping voucher. This ensured that the study concerned choosing the delivery time for an actual (probabilistic) delivery.
In particular, we used the following text: “Thank you for filling out our survey. As a token of our appreciation, you will have the chance to win a $100,‐ voucher to shop for groceries at a major online grocery retailer in your area. To schedule the delivery of your groceries in case you win, we now ask you to select a day (from Monday to Friday) and a one‐hour time window from a menu (from 10am to 8pm). Note that if you win, the groceries of your choice will be delivered in this time window so make sure that you are home then. A standard delivery fee ($8) will be paid from your voucher.”
Participants had to select one time slot from a set of ten non‐overlapping 1‐hour time slots for delivery from 10 am to 8 pm. This stylized, discrete‐choice experiment reflects the main service platform of an online grocery retailer. We designed different experimental conditions where we randomly assign labels or price incentives for two of the ten time slots. We refer to these manipulated slots as our focal slots. The other time slots displayed no labels or price incentives. We then recorded the number of participants who selected one of the focal slots in each condition. Figure 1 shows an example of the time‐slot menu where the two focal slots feature green labels.

Example for Condition "Green, Green"
In this study, we enlisted nine different experimental conditions. To avoid carryover effects, we used a between‐subjects design with each subject randomly assigned to one of the conditions. In the “No Label” control condition, no slots displayed a label or a price incentive. Both focal slots featured a green label in the “Green, Green” condition. We used three discount conditions offering a price incentive on the deliver fee of $2, $5, or $8. The $2 (25%) discount is common in practice. Since the default delivery fee in practice is about $8, we added the “‐$8, ‐$8” condition to represent a setting with free delivery on the focal slots (at a very high discount). We also add a discount condition between them. We further included a condition where both focal slots combine the green label and a price incentive of $2 (Green‐$2) to investigate any synergy between these types of incentives. Note that in these six conditions, all focal slots have the same label or price incentive. However, given that actual consumers often have a choice among various incentives, we added three more conditions for two focal slots featuring different mixes of green labels and $2 price incentives (since $2 price incentives are most common, and further discounts would have created a large, unwieldy study). Table 2 presents an overview of the conditions array.
Overview of conditions for study 1.
To avoid sampling bias in setting the focal slots, we randomly chose two out of ten slots as focal slots for each participant. For each condition, we have either 45 different focal slot settings when participants were offered with focal slots of the same label or 90 different focal slot settings when participants saw focal slots with two different labels.
At the end of the survey, participants were asked to fill out a questionnaire to assess their eco‐consciousness. The questionnaire comprised nine questions based on related measurement methods in Rivera‐Becerra and Lin (1999) and Alsmadi (2007). Participants were asked to rate the extent of their care for the environment on a 5‐point scale, that is, “strongly agree” (5) to “strongly disagree” (1).
Participants
We operated our experiment in Qualtrics (
Study 1—Sample Composition per Condition
Results
To test our hypotheses, we conducted two types of tests. First, we directly compared the percentages of participants choosing focal slots among the various conditions. The number of participants opting for one of the focal slots in the different conditions is shown in Figure 2. At first glance, it appears that green incentives are quite effective and more so than monetary incentives. Interestingly, it also appears that participants reacted on monetary incentives, but that the size of the discount did not matter as “‐$2” incentives appear as effective as “‐$5” and “‐$8” incentives.

Number of Participants that Selected a Focal Slot
Second, to study the choice behavior in more detail, we ran a multinomial (MNL) regression analysis to estimate the utilities for the different slots (
Table 4 presents results for three distinct MNL model specifications. In Model 1, we used a binary variable
*p<0.05; **p<0.01; ***p<0.001
aThis model is based on the data of the condition “Green, Green” and condition “−$2, −$2”.
bIt includes −$2,−$5, and −$8 discounted labels.

Eco‐consciousness and Participant Choice
For ease of exposition, we later discuss the results for the different conditions associated with the different hypotheses in separate subsections.
The Effect of the Green Label
Figure 2 shows that substantially more participants selected a focal slot with a green label (81.4%) vs. no label (22.6%). This result is statistically significant (
The results from the MNL model are similar. Even while explicitly controlling for the popularity differences among slots (Models 1 and 2), we see that the green labels yielded a significant uplift in the utility of a time slot. This again supports the notion that green labels raise the likelihood of slot selection. Moreover, we observe that the utility of a green label outperforms that of a price incentive. These results strongly support Hypotheses 1.
The Effect of the Combined Green Label and Price Incentive
Figure 2 shows that fewer participants selected a focal slot in the “Green−$2, Green−$2” condition (63.6%) than in the “Green, Green” condition (81.4%) under statistical significance
Notably, when participants can choose between a combined slot and a slot with a green label or price incentive (Green−$2, Green vs. Green−$2, −$2), we observe more participants choosing the combined slot
Eco‐consciousness
We next investigated the impact of eco‐consciousness on participant time slot choice. We grouped the participants into a high eco‐conscious group (comprising those outscoring the median eco‐scores) and a low eco‐conscious group (comprising those scoring lower than the median in eco‐scores). We present participant choices for the green label, the $2 price incentive, and the “Green−$2” label under the six conditions in Figure 3.
Figure 3 depicts $2 price incentives as chosen significantly more often by low eco‐conscious participants (71.7%) than by highly eco‐conscious (54.7%)
Discussion
The findings of Study 1 highlight the potential influence of green labels and attest their relation to intrinsic motivation. First, the results show green labels exerting significant impact on slot choice. Adding green labels not only greatly boosts slot appeal over those without incentive; green labels also outperform very substantial price incentives! These results demonstrate Hypothesis 1. Second, analysis of the effects of eco‐consciousness identifies highly eco‐conscious participants as more prone to select green‐labelled slots. These results support Hypothesis 2, further suggesting, as outlined in the theory section, that the effects of green labels may work via intrinsic motivation for people having strong intrinsic motivation to improve the environment.
Third, we find there is little to no added value in merging green labels with price incentives. Adding price incentives to one or both of the green labelled slots does not encourage more participants to select a focal slot, thus confirming Hypothesis 3. Interestingly, even though the condition with one green label and one green label with price incentive lead to less people choosing focal slots than the condition with two green labels, in the prior condition a majority of participants choose the green label with price incentives over the green label. This suggests that although adding price incentives to green labels will not persuade more people to choose a focal slot, people already preferring a focal slot will be more likely to go for the combined green label—price incentivized slot. As a side‐note, we observe that higher discounts do not lead to a stronger impact on choice behavior. This suggests that the participants that are susceptible to extrinsic price incentives will already change their behavior provided with a “−$2” discount. This means that there is little value in providing higher discounts.
Two things are noteworthy. First, the advantage of green labels over price incentives is that they are cost‐free to the provider, while a two‐ to eight‐euro price incentive evaporates small profit margins. This advises that, given their influence on outcomes, it may be better for companies to “go green.” Before making definite conclusions, though, we need more comprehensive insight into the effects of green labels on the delivery system. The next section thus details a simulation to input findings from this experiment that mimics a delivery system to test the potential effects of using green labels. Second, if intrinsic motivation is the driving force behind green labels, we may expect their effects (vs. extrinsic motivation via price incentives) to hold even when slots become less attractive (see Hypothesis 4). We test this idea in a second experiment.
Simulation Study 1—Dynamic Time Slots of Equal Length
Here, we use a simulation experiment to study the impact of the green slots and price incentives that dynamically steer customers to reduce system‐wide vehicle usage for last‐mile delivery operations. We draw a customer time slot choice model based on Model 3 (see Table 4) to study the effect of using the green label or the $2 price incentive vs. no incentives in reducing distances of the last‐mile delivery. We choose these two incentive types since they act as the single best incentives (pairing them seems clearly less beneficial than using green labels alone).
Dynamically Incentivizing Time Slots
In this section, we describe how to dynamically determine which slots to incentivize based on already accepted orders. The key idea is to steer customers toward the (focal) delivery time slots to help minimize the system‐wide vehicle kilometers that trigger proportional variable costs (e.g., labor, maintenance, and fuel) and environmental impact (e.g., emissions). Based on the tentative route plans for the already accepted orders, we feature for each new customer time slots associated with the smallest detour. This means that the set of focal slots differs per customer and over time.
Customers continuously book their delivery time slots. During a booking period, we maintain a tentative route plan with vehicles serving prior‐scheduled orders within their selected time windows. When a new customer selects a time slot, we know the delivery location and shopping basket size. To quickly assess the feasibility and costs of accepting a customer in each of the different time windows, we use cheapest insertion (Campbell and Savelsbergh 2004). Since customers do not like waiting and this approach is very fast, it is often used to dynamically solve the underlying routing problems in real‐time (Campbell and Savelsbergh 2005, 2006, Ehmke and Campbell 2014).
We determined the focal slots by using the insertion approach as follows. Assign
The route plan is optimized each iteration by re‐planning all routes using greedy cheapest insertion after each customer has selected a certain time slot (Algorithm 1). When it is impossible to find a new feasible schedule in this way, we inserted the customer into the existing route plan.
Simulation Setup
Inspired by real‐world e‐grocery order data, we randomly generated 100 base instances, each having 400 customers uniformly distributed over an area of 30 km × 30 km (900
We varied the customer densities (low, medium, high) in the instances by selecting different subsets of customers from the base instances. The highest density used all customers in the base instances. The medium density used the first 200 customers, and the low case used the first 100 customers. Here, we reduced variance among the different experiments since the set of customers (and associated preferences) of the lower density instances was a subset of the higher density instances. This effectively means that the first 100 customers were the same in each density scenario.
For each instance, we ran three computational experiments using green labels, $2 price incentives, or no labels on 2 of the 10 one‐hour time slots between 10 am and 8 pm. Customers selected one time slot from the 10 options, consistent with the setting used in our empirical studies.
Next, we ran computational experiments on labels for different effects in driving customers toward focal slots. We tallied a ratio of customers selecting labeled slots to indicate the effect level. Effect ratios for green labels, $2 price incentives, and no labels are 81%, 62%, and 20%, respectively. In order to gain insights into additional levels of incentive effectiveness, we also ran experiments featuring extremely high effect level (100%) and low effect level (40%). We enlisted the 100 medium customer density setting for these experiments.
Each vehicle’s delivery capacity was 20 customers traveling at constant speed of 30 km/hour. Vehicles were deployed in two shifts: an early shift (from 9 am to 4 pm) and a late shift (2 pm–9pm). The service time for each customer was 10 minutes.
Results
We rated the final route plans using the distance per customer and the number of vehicles used. We compared the percentage savings to the reference “no label” setting. Table 5 shows the average results for the different customer densities and labels over 100 random instances.
Simulation Results for Different Densities and Incentives
athe number of customers in each instance
bthe travel distance per customer on average
cthe average travel distance savings per customer (taking the results of “No Label, No Label” as the benchmark)
dthe percentage of travel distance saving (taking the results of “No Label, No Label” instances as the benchmark)
ethe number of vehicles used in instances on average
fvehicle savings of instances on average
gthe percentage of vehicle savings on average
Results clearly show that steering customers helped to reduce both travel distances and number of vehicles used. As more customers chose focal slots, more distance savings accrued. We also note that green labels outperformed price incentives—the added influence on customer behavior from green labels pays off! Distance savings likely translates to costs savings in terms of fuel and labor. The green label thus yields greater cost savings without any loss in revenue using price incentives. Offering $2 price incentives cuts revenues by $1.12‐$1.15 per customer on average.
Results suggest that dynamically steered time‐slot choices lead to more benefits when densities are low. For green labels, savings are higher (12.6%) for low density than for high‐density instances (8.9%). A similar pattern is observed for price incentives. One explanation for this is that the average intra‐customer distances (and savings) fall with density. The marginal gain abates when increasing the spatial and temporal clustering of customers where distances between them are already low.
While Table 5 shows the average savings, Figure 4 specifies per‐instance savings for the different runs. At one end of the spectrum, savings reach 400 km (30%) for low densities. On the other end, some cases of customer steering can actually raise the total system‐wide travel distances, especially when label effect levels weaken. To understand this, we must realize that we have to make slotting decisions for each individual customer arrival without knowing all future demand. This creates uncertainty about the impact of certain decisions. We still observe, given green label’s higher effectiveness, the negative impacts were less frequent (and even non‐existent in high‐density conditions) with green labels than with price incentives. Overall, the simulation provides strong support for the positive effect of green labels on the operational performance.

System‐wide Savings in Vehicle Distance (km) per Instance
To test how sensitive results are to the incentive efficacy, we ran another simulation study for medium‐density instances. Figure 5 shows results from four different effect‐sizes: 62% corresponding to the effectiveness of the price incentive, 81% to green‐label effectiveness, 100% level for all customers selecting the incentivized focal slot, and the less‐strong 40% effect. As expected, we see that savings increase with incentive effective size. However, results further suggest that savings can still be achieved even when the effect is modest (i.e., 40%).

System‐wide Distance Savings for Different Incentive Effects
Empirical Study 2: Incentivizing Time Slot with Different Lengths
Experiment Design
Study 1 tested the effect of green‐labeled delivery time slots of equal length. Study 2 investigates the effect of using green labels to steer people to longer, less convenient time slots. More specifically, we study whether green labels would be more effective than price incentives in drawing participants to longer time slots. We use the same design as in Study 1 except for the time slot manipulation. In this study, we offer participants the choice of unincentivized slots of 1 hour vs. incentivized slots of varying lengths (2‐, 3‐, or 5‐hour options). We predict that the longer the incentivized time slot offered, the less strong the effectiveness of price incentives vs. green labels (Hypothesis 4). In other words, “green labels” will outperform discounts for longer time slots.
As with Study 1, we enlisted a between‐subjects design to avoid carryover effects in this study. We asked each participant to choose one from a menu of slots for receiving groceries. The menu featured 10 one‐hour slots (as in Study 1) and a number of overlapping longer slots (2‐, 3‐, or 5‐hour; see Table 6). We next manipulated the slot length of incentivized time slots for each participant. Referencing a standard delivery fee of € 8, we offered long slots using either a green label or a € 2 price incentive. Overall, six different conditions were featured. Figure 6 shows examples of time slot menus used in this study.
Study 2–Overview of Conditions

Example of Conditions with Longer Slots
As in Study 1, we presented a green‐colored label explained this study:“By choosing a longer time window, you provide the retailer with more routing flexibility which allows them to save fuel and reduce emissions.” At the end of the survey, we asked participants to fill out the same questionnaire as in Study 1 to assess levels of eco‐consciousness.
Participants
We implemented our experiments in Qualtrics. (
Study 2—Sample Composition per Instance
Results
Figure 7 shows the booked time slot choices. We see that both green labels and price incentives helped to steer participants toward the longer slots. The effect decreased when slots became longer and less convenient.

Number of Participants that Selected a Longer Slot
Importantly, we see varied effectiveness of the incentive options over different slot widths. While green labels and price incentives worked equally well in steering participants toward 2‐hour slots (price incentives did even slightly better), we see that green labels outperform price incentives in steering participants to 3‐hour and 5‐hour slots. In general, the pattern suggests that the effectiveness of green labels vs. price incentives is less influenced by slot length, in line with hypothesis 4. This effect is most manifest when comparing differences in the effectiveness for green labels vs. price incentives for the 2‐hour and 3‐hour conditions. Green labels were as effective in promoting 3‐hour slots (81.2%) as 2‐hour slots (85.5%;
Next, we present MNL models of participant choice for different labels and varied slot lengths in Table 8. For 2‐, 3‐, and 5‐hour slots, different MNL models were enlisted for estimating different label utilities. The MNL models for 2‐hour slots are quite similar, and we forged a joint MNL model for green labels vs. price incentives in Table A1. Similar to Study 1 (section 3), we can model the utility of slot s by
aSee the indexes of time slots in Table 6
p‐value<0.05.
p‐value<0.01.
p‐value<0.001.
We note that participants disclosed different preferences for specific long slots. The latest slots in the afternoon especially (including slot 15 among 2‐hour options, slot 16 among 3‐hour options, and slot 12 among the 5‐hour choices) proved most popular. Accordingly, we favored estimating separate label utilities for each of the longer slots.
From the MNL models in Table 8, we see that the two‐hour utilities as a whole generally exceeded those of the 3‐ and 5‐hour slots, as expected. Green label utility mostly prevailed over price‐incentive utility for 3‐ and 5‐hour slots. These results confirm those in Figure 7.
Eco‐consciousness
To show the impact of eco‐consciousness on participant choice, we grouped participants into high eco‐conscious (ranking above the median) and low eco‐conscious groups (ranking below median) according to their eco‐scores. We present slot choices of the two groups in Figure 8.

Eco‐consciousness and Participant Choice
The results for 2‐hour slots show no significant difference in choices of high eco‐conscious vs. low eco‐conscious participants, either for the green label or discount incentive.
Again, however, this pattern changed for the 3‐hour slot. Here, green labels significantly steered high eco‐conscious (93.9%) over low eco‐conscious participants (69.4%) to choose the 3‐hour slots
We did not detect significant differences in longer slot (5‐hour) choices between high and low eco‐conscious participants, either for green labels or price discounts.
In sum, 2‐hour slots were indifferent to incentive types. The 3‐hour slots clearly favored green labels, while price steered notably fewer low eco‐conscious shoppers to tolerate the 5‐hour slot. Together, these results provide some support for Hypothesis 4.
Discussion
In Study 2, we investigated how effective green labels and price incentives were in steering people to pick 2‐, 3‐, or 5‐hour slots over available 1‐hour slots. Results show notable insights. First, we find green labels and price incentives are both effective in driving participants to longer time slots, but that incentives abate for longer slots. Second, we find that this pattern of lower effectiveness for longer slots to diverge for green labels vs. price incentives. Specifically, green labels remained relatively more effective than discounts when incentivized time slots lengthened. However, this green label advantage vanished in the longest (5‐hour) time slots. This incentive pattern supports hypothesis 4, albeit with the caveat that the effect disappears with very long time slots. A reason is that waiting at home for five hours is too inconvenient that many people cannot make it.
Our analysis of eco‐consciousness reveals that participants high in eco‐consciousness respond better to green labels, but only in the 3‐hour time slot condition. This offers some, but not full support for Hypothesis 2. Our study reveals green labels to be useful, in some cases more so than using price incentives, in steering people toward longer time slots—an effect highly relevant in practice. In order to assess the practical impact of these effects in a delivery system, we finally conducted a second simulation experiment described in section 4.
Simulation Study 2: Static Time Slot with Different Lengths
In this section, we simulate customer time slot choices of different widths based on MNL models in Table 8 to study the impact of green labels and price incentives. The goal remains to minimize system‐wide vehicle kilometers for last‐mile delivery.
Longer time slots offer more planning flexibility and thus outperform the tighter time slots. Contrasting the dynamic labeling policy in section 4, we used static labeling in this experiment. Only after all customers have selected a time slot do we finalize a route plan attaining the cheapest insertion heuristic (Algorithm 2). As with the prior study, each participant again received a time slot menu with one‐hour slots amid longer slots incentivized for all customers.
Simulation Setup
We ran 100 instances with medium customer density (200 customers in a 30 km × 30 km area) from section 4 using the slot settings of Study 2 (including 1‐, 2‐, 3‐, and 5‐hour slots). Customer slot choice models were based on the MNL models in Table 8. As with section 4, we used Gumbel random variables to simulate the random utilities of slots to customers.
We ran six computational conditions: (1) a joint experiment for 2‐hour slots to simulate both green labels and price incentives (based on the joint MNL model in Table 11 due to similar effects); (2) using only green labels for 3‐hour slots, (3) using only price incentives for 3‐hour slots; (4) using only green labels for 5‐hour slots;, (5) using only price incentives for 5‐hour slots, with all five of these against; and (6) unlabeled 1‐hour slots serving as a control reference for comparison.
Parameters were the same per section 4: every vehicle (either in the first shift from 9 am to 4 pm, or in the late shift from 2 pm to 9 pm) served at most 20 customers with 10 minutes as a fixed service time for each customer, traveling at a constant speed of 30 km/hour.
Results
We use the same metrics per section 4 to assess the performance of different labels on long slots (vs. the controlled only one‐hour slot benchmark). Table 9 lists the average results from different labels for long slots of different widths across 100 random instances.
Simulation Results for Different Time Slot Lengths and Incentives
aThe same customer choice model based on the MNL model in Appendix B is used for the instances in “Green, 2‐Hour” and “−€ 2, 2‐Hour” due to similar effects between the two.
As expected, when more participants selected a longer slot, this improved the performance of the system in terms of the total travel distances. However, slightly more vehicles were needed for the 2‐hour case vs. the control group, since offering 2‐hour slots clustered a large portion of customers into a few time slots, for example, 32% of the customers booked time slots between 10:00 and 12:00 (including Slot 1, Slot 2, and Slot 11). Here, more vehicles were needed than where time slots might have been more equally selected by customers.
Results show that featuring the green label on 3‐ and 5‐hour slots enabled shorter distances than using the price incentive. We find that using the green label reduced travel distance for 3‐hour slots 7.1% more (20.4% savings) than the price incentive (13.3%). Using the green label for 5‐hour slots cut travel distance by 3.3% more (21.8%) than the price incentive (18.5%). In contrast, the € 2 price incentives to customers choosing longer slots lost revenues of € 1.72 for the 2‐hour option, € 1.43 for the 3‐hour case, and € 1.22 for the 5‐hour instance, on average.
Discussion and Conclusions
In this study, we describe two experimental studies and two simulation studies investigating the role of green labels in steering demand in last‐mile delivery. Our studies provide five main findings. First, green labels are an effective tool to manage demand and work as well as commonly used price incentives. Second, eco‐conscious customers are more responsive to green labels. Third, combining green labels and price incentives yields little added effect over green labels alone. Fourth, green labels work better than price incentives in steering people toward longer time slots. Fifth, the varied effects of green labels exert quite strong impact on downstream operational system performance. These results contribute in several important ways.
First, the research offers a theoretical contribution toward the field of demand management in operations management and beyond. This is, to our knowledge, the first study to demonstrate that green labels are effective in steering demand in last‐mile delivery and strongly impacts the operating system of home delivery. This is notable amid all the research on the customer side of demand management currently focused on price incentives. Our studies demonstrate that there are alternatives to price incentives that are just as effective. Equally important, however, are the findings that pertain to the underlying processes at work as green labels steer customers toward specific time slots. Green labels strongly engage more eco‐conscious customers, demonstrating a value‐alignment process that intrinsically motivates. This has several key implications, prompting us to suggest, for instance, that price incentives and green labels would not synergize, and that green labels would be especially effective in steering toward “inconvenient” (longer) time slots. In this sense, our studies more broadly contribute to general literature on incentives that steer customer behavior.
In pondering and testing the nature of green labels, our studies more broadly contribute to general literature on incentives that steer customer behavior. Although extant literature casts green incentives as more embedded, our categorization of incentives as intrinsic vs. extrinsic motivators helps us to better understand different incentives. In particular, our research findings provide a strong contribution to this broad field by suggesting (with supporting evidence) that (i) incentives that steer customer behavior can be categorized as extrinsic motivators vs. intrinsic motivators, (ii) intrinsic and extrinsic motivators do not mix, and (iii) intrinsic motivators outperform extrinsic motivators in steering customer behavior especially as customer business costs rise.
Our second contribution is more methodological in terms of modeling and research strategy. In terms of modeling, we designed a simulation model based on state‐of‐the‐art time slotting and routing heuristics for an empirically validated customer choice model that quantified potential impacts in steering demand on the operating system. This means that we complemented extant studies exploring the impact of demand management approaches in the last‐mile based on artificial and stylized customer demand data (Campbell and Savelsbergh 2006, Ehmke and Campbell 2014). We also note that our use of incentivization in our choice experiments, although not completely novel (see Ding et al. 2005), is a creative example of how experimental work can create incentivization without reducing the internal validity of the study. Given the benefits of adding incentives for behavioral work, we stimulate other researchers to follow this example.
In terms of research strategy, our set of studies also contributes. Indeed, there has been recent emphasis in operations management on merging empirical studies with simulation in research. Specifically, Chandrasekaran et al. (2018) have developed an overview of how simulations and empirical studies can synergize. In our type of research, theory testing research, Chandrasekaran et al. (2018) discuss how simulation can tackle issues of endogeneity in empirical (survey) studies. Our study combines the advantages of experimental study and simulation to thus overcome limitations of these methods in isolation and complement existing frameworks that blend surveys or case studies with simulation studies. While experiments do not suffer endogeneity issues, they can incur validity problems in large, complex systems. This is precisely where simulation can benefit. We thus recommend the complementary use of experimental and simulation studies in future operations management papers.
A third contribution is practical in nature. Operations management practitioners can well exploit our findings. Indeed, our research strongly advises that using green slots either dynamically or by incentivizing longer slots yields strong positive effects on last‐mile delivery operations in terms of costs and environmental criteria. Our simulation findings further caution that there may be instances where green labeling actually incurs more costs and emissions, but these cases are rare. Overall, using green slots is beneficial. Our empirical results suggest that green slots especially work well with more eco‐consciousness customers. This insight, together with insights from the research on the relationship between eco‐consciousness and demographic and cultural characteristics (Golob and Kronegger 2019, Gray et al. 2019), may help practitioners to predict the effect of green labels for specific customer populations. The way companies implement green slots, beyond dynamic or to incentivized longer slots, may also be important. When going “green” to steer demand to improve operations, we show that it is not beneficial to combine green slots and price incentives for the same slot. However, this does not imply that we advise against using pricing altogether. From a marketing perspective, pricing is still a powerful tool to exploit differences in willingness between different market segments increasing revenues. The design of the pricing and green incentives for last‐mile delivery should take these different considerations into account.
In sum, the current research investigated the effects of green labels on customer slot choice for last‐mile delivery. We find strong effects from green slots, providing a theoretical model and research results that suggest interesting nuances as to these effects: they work better for eco‐friendly customer, they do not synergize with discounts, and they work especially well when the costs of slot choice rise to real impact on operating systems in last‐mile delivery. We hope that this theory and these results not only provide important insights in the field of demand management in last‐mile delivery, and the wider field of green incentives, but will also stimulate other scholars to continue this line of research to unleash the full potential of green labels or incentives.
As this is the first paper that studies the use of green labels in last‐mile delivery operations, we see several directions for future research at the interface between marketing and operations management. First, we would welcome any field research that would replicate our findings. Although the strength of experimental designs such as ours is that they yield causal evidence of effects, the downside may invoke validity issues. Using a field experiment to study the impact of green labels provides various methodological challenges given the many confounding factors that are associated with a dynamic and fast growing business such as online grocery. Second, combining green labels and price discounts asks for new dynamic slotting methods that explicitly incorporate the complex interactions of the different incentives on slot choice behavior. Third, the advent of omnichannel retailing provides relevant new applications for using green labels to steer customers between sales channels. This gives rise to new research questions on how to design and dynamically execute these incentives across different channels while taking into account both marketing and operational considerations. Fourth, we believe that the general idea of our paper—steering customers by providing more insight into the environmental consequences of their choices—may provide fertile ground for future research in many Operations Management applications.
Acknowledgment
We thank the anonymous reviewers for their constructive feedback and efforts toward improving our paper. The co‐author Yingjie Fan was funded by NWO as part of the Complexity in Transport and Logistics Programme.
