Enhanced PV power prediction using LSTM-integrated soft actor–critic model based on long short-term memory

Abstract

Accurate photovoltaic (PV) power prediction is crucial for the efficient operation of intelligent power grid systems. Data-driven approaches have shown high performance in predictive tasks. Deep reinforcement learning (DRL) combines deep learning with reinforcement learning and has been widely studied for optimization problems in various fields. However, little research has applied DRL to ultra-short-term PV power prediction. Hence, a soft actor–critic (SAC) model using long short-term memory (LSTM) is proposed for predicting PV power. First, the PV power prediction problem is modeled as a Markov decision process with historical weather data and PV power data as state inputs. Then, LSTM is integrated into the critic network of SAC to enhance its memory capability, thus improving prediction accuracy. Finally, the agent interacts with the environment to solve the resulting optimization problem. Experimental results indicate that the proposed model attains greater prediction accuracy. This study explores the potential of DRL for PV power prediction, and the proposed method can be extended to other prediction fields, including grid prediction and wind power prediction.

Keywords

Deep reinforcement learning; PV power prediction; soft actor–critic; long short-term memory
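
The Markov decision process formulation mentioned in the abstract can be illustrated with a minimal sketch. The abstract only states that historical weather data and PV power data form the state; everything else here is an assumption made for illustration: the class name `PVForecastEnv`, the look-back `window`, the choice of the predicted PV power as the action, the negative absolute error as the reward, and the convention that the last feature column holds normalized PV power. It is not the authors' implementation, and the SAC agent and LSTM critic are omitted.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class PVForecastEnv:
    """Toy MDP around a PV time series (hypothetical, for illustration only)."""

    features: Sequence[Sequence[float]]  # per-step weather readings plus PV power, normalized
    power: Sequence[float]               # normalized PV power to be predicted at each step
    window: int = 4                      # assumed look-back length
    t: int = 0                           # index of the step currently being predicted

    def _state(self) -> List[List[float]]:
        # State: the last `window` rows of historical weather and PV power.
        return [list(row) for row in self.features[self.t - self.window:self.t]]

    def reset(self) -> List[List[float]]:
        # Start at the first step that has a full history window behind it.
        self.t = self.window
        return self._state()

    def step(self, action: float) -> Tuple[List[List[float]], float, bool]:
        # Action (assumed): the agent's prediction of PV power at step t.
        # Reward (assumed): negative absolute prediction error.
        reward = -abs(action - self.power[self.t])
        self.t += 1
        done = self.t >= len(self.power)
        return self._state(), reward, done


def run_persistence(env: PVForecastEnv) -> float:
    """Roll out a persistence baseline: predict the last observed PV power."""
    state = env.reset()
    total, done = 0.0, False
    while not done:
        action = state[-1][-1]  # last column of the newest row is assumed to be PV power
        state, reward, done = env.step(action)
        total += reward
    return total
```

In a full system, the persistence rule in `run_persistence` would be replaced by a learned policy such as SAC, and the windowed state is the kind of sequence input that a recurrent critic could consume.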

