Abstract
This study addresses the multi-product capacitated lot-sizing problem (CLSP) under stochastic demand, with the objective of minimizing the total cost, comprising production, inventory, setup, and shortage costs. To this end, it proposes an optimization algorithm based on deep reinforcement learning (DRL) that employs proximal policy optimization (PPO) over a continuous action space. The problem is first described in detail and then formulated as a reinforcement learning model via a Markov decision process (MDP). In addition, a new state space and a continuous-to-discrete action conversion process are designed to handle demand uncertainty and incorporated into the PPO algorithm. The proposed algorithm is verified by experiments, whose results demonstrate that the improved PPO algorithm minimizes the total cost and is applicable to instances under various conditions. The proposed algorithm is also compared with the aggregate modified base-stock (AMBS) heuristic, and the comparison confirms its effectiveness on instances of different scales.
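The abstract does not specify how the continuous PPO action is converted to discrete lot sizes. As a purely illustrative sketch (the function name, clip-scale-round scheme, and proportional capacity trimming are all assumptions, not the paper's actual method), one common approach maps each component of a continuous action vector in [-1, 1] to an integer production quantity per product and enforces the shared capacity:

```python
import numpy as np

def continuous_to_lot_sizes(action, capacity, max_lot):
    """Hypothetical continuous-to-discrete conversion: map a PPO
    action vector in [-1, 1]^n to integer lot sizes for n products,
    then trim proportionally if the shared capacity is exceeded."""
    action = np.clip(np.asarray(action, dtype=float), -1.0, 1.0)
    # Rescale [-1, 1] -> [0, max_lot] and round to integer lots.
    lots = np.rint((action + 1.0) / 2.0 * max_lot).astype(int)
    total = lots.sum()
    if total > capacity:
        # Scale all lots down proportionally so the total fits capacity.
        lots = np.floor(lots * capacity / total).astype(int)
    return lots
```

For example, with `max_lot=50` and `capacity=100`, the action `[1.0, -1.0, 0.0]` would yield lots `[50, 0, 25]`; if the rounded lots exceeded capacity, each would be scaled down before the shortage cost is assessed in the environment.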
