Abstract

Electric vehicle motors operating under high power density generate substantial heat, posing significant challenges to temperature regulation accuracy and cooling energy efficiency. Conventional thermal management strategies often struggle to achieve robust performance under highly dynamic and uncertain operating conditions. This paper proposes a hybrid thermal control strategy integrating active disturbance rejection control (ADRC) and Twin Delayed Deep Deterministic Policy Gradient (TD3) within a hierarchical architecture. A multi-heat-source coupled motor thermal model is established to capture complex thermal dynamics of stator, rotor, and cooling subsystems. The lower-layer ADRC ensures fast temperature tracking and disturbance suppression through an extended state observer, while the upper-layer TD3 optimizes cooling energy consumption by learning long-term policies. An adaptive coordination mechanism balances real-time regulation and energy efficiency optimization. Hardware-in-the-loop experiments are conducted on a 150 kW permanent magnet synchronous motor under standard driving cycles and extreme conditions. Results demonstrate that the proposed strategy reduces temperature control RMSE to 1.47 °C and cooling energy consumption by 31.9% compared with conventional PID control, while maintaining strong robustness under ±30% parameter perturbations. These findings indicate that the ADRC–TD3 hybrid strategy provides an effective solution for intelligent thermal management of electric vehicle motors.

Keywords

electric vehicles; permanent magnet synchronous motor; thermal management; active disturbance rejection control; deep reinforcement learning

Introduction

With the intensifying global energy crisis and worsening environmental pollution, electric vehicles, as an important representative of new energy vehicles, are experiencing unprecedented rapid development. The motor system, as the core power unit of electric vehicles, directly affects the vehicle’s power performance and service life through its operating efficiency and reliability. However, motors generate substantial heat during high power density operation, and severe temperature fluctuations not only reduce motor efficiency but may also lead to serious problems such as permanent magnet demagnetization and insulation material aging, making motor thermal management a key technical bottleneck constraining electric vehicle performance improvement. Traditional thermal management strategies often struggle to achieve a good balance between temperature control accuracy and energy consumption optimization, urgently requiring the development of more intelligent and efficient control methods to cope with complex and variable operating conditions.

In recent years, researchers have studied electric vehicle motor thermal management extensively. Among traditional control methods, active disturbance rejection control (ADRC) has been widely applied in motor control owing to its strong robustness and disturbance rejection capability.^1,2 The ADRC framework proposed by Zhang et al. effectively suppresses torque ripple in permanent magnet synchronous motors by actively compensating for internal and external disturbances.³ Meanwhile, the rapid development of reinforcement learning has opened new approaches to thermal management control. The Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm, a representative deep reinforcement learning method, has demonstrated excellent performance in electric vehicle energy management.^4,5 Wang et al. used model predictive control to achieve energy-efficient operation of thermal management systems, validating the potential of intelligent control methods to reduce system energy consumption.⁶ Combining deep reinforcement learning with fuzzy logic control further enhanced the adaptability and robustness of control strategies.^7,8 In permanent magnet synchronous motor control, the TD3 algorithm has been applied successfully to the optimal control of wind turbine generators, demonstrating good dynamic response characteristics.⁹ Tang and Zhang combined fuzzy control with the TD3 algorithm to achieve coordinated control of electro-hydraulic composite braking for electric vehicles.¹⁰ These studies provide important references for intelligent motor thermal management, but existing methods still suffer from slow response and large overshoot when handling strongly coupled, nonlinear thermal dynamics.

Despite these advances, current thermal management methods still exhibit notable limitations when applied to electric vehicle motors. Purely model-based controllers often suffer from performance degradation under strong disturbances and parameter variations, while reinforcement learning-based approaches alone typically struggle with slow transient response and safety-critical constraint handling. In particular, achieving fast temperature regulation while maintaining long-term energy efficiency remains a challenging issue for systems with strongly coupled and nonlinear thermal dynamics.

To address these challenges, this paper proposes a hybrid thermal control strategy that integrates active disturbance rejection control and the Twin Delayed Deep Deterministic Policy Gradient algorithm within a hierarchical control framework. In the proposed architecture, ADRC serves as the lower-layer controller to ensure fast temperature tracking and disturbance rejection through an extended state observer, while TD3 operates at the upper layer to optimize cooling energy consumption by learning long-term control policies. The main contributions of this work are threefold. First, a motor thermal dynamic model considering multi-heat-source coupling effects is established to accurately characterize the thermal behavior of key motor components. Second, an adaptive temperature control scheme based on ADRC is designed to enhance robustness against uncertainties and disturbances. Third, a dual-layer ADRC–TD3 collaborative control strategy is developed to achieve coordinated optimization of temperature regulation performance and cooling energy efficiency. The effectiveness of the proposed method is validated on a hardware-in-the-loop experimental platform under both standard driving cycles and extreme operating conditions, providing practical insights for the intelligent design of electric vehicle motor thermal management systems.

The contributions of this study are summarized as follows:

A multi-heat-source coupled thermal modeling framework is developed for electric vehicle motors, capturing the strongly coupled and nonlinear thermal dynamics of key components under high power density and dynamic operating conditions.

A robust lower-layer temperature control strategy based on active disturbance rejection control is designed, in which an extended state observer enables real-time disturbance estimation and compensation, significantly improving transient response and robustness.

A dual-layer collaborative control architecture that integrates ADRC with the Twin Delayed Deep Deterministic Policy Gradient algorithm is proposed, allowing fast temperature regulation and long-term cooling energy optimization to be achieved simultaneously.

The proposed method is experimentally validated on a hardware-in-the-loop platform, demonstrating superior performance in temperature control accuracy, energy efficiency, dynamic response, and robustness compared with conventional control strategies.

Methods

Electric vehicle motor thermal management system modeling

Accurate modeling of the thermal management system of a permanent magnet synchronous motor requires comprehensive consideration of heat source distribution, heat transfer paths, and cooling mechanisms. The heat generated during motor operation comes mainly from stator copper loss, iron core loss, and rotor eddy current loss, and the spatial distribution and time-varying characteristics of these heat sources determine the dynamic evolution of the motor temperature field.¹¹

In this study, a hybrid modeling approach combining distributed-parameter and lumped-parameter methods is adopted. The distribution characteristics of the motor’s internal temperature field are obtained through three-dimensional finite element analysis, while the thermal dynamic behavior of key components is described using an equivalent thermal network. The heat conduction equation can be expressed as:

ρ c_{p} \frac{\partial T}{\partial t} = \nabla \cdot (k \nabla T) + q_{gen} - q_{cool}

(1)

where $ρ$ denotes the material density (kg/m³), $c_{p}$ is the specific heat capacity (J/(kg K)), $T$ represents the temperature (K), $t$ is the time (s), $k$ is the thermal conductivity (W/(m K)), $q_{gen}$ is the volumetric heat generation rate (W/m³), and $q_{cool}$ is the volumetric heat removal rate due to cooling (W/m³).

As shown in Figure 1, the simplified lumped-parameter model divides the motor into five key thermal nodes: stator winding, stator core, rotor magnet, rotor core, and housing. The temperature dynamics of each node can be described as:

C_{i} \frac{d T_{i}}{dt} = P_{loss, i} (t) + \sum_{j = 1}^{n} \frac{T_{j} - T_{i}}{R_{ij}} - h_{i} A_{i} (T_{i} - T_{fluid})

(2)

where $C_{i}$ denotes the thermal capacity of node $i$ (J/K), $P_{loss, i}$ is the power loss (W), $R_{ij}$ is the thermal resistance (K/W), $h_{i}$ is the convective heat transfer coefficient (W/(m² K)), $A_{i}$ is the heat transfer area (m²), and $T_{fluid}$ is the coolant temperature (K).
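As a minimal sketch of how Eq. (2) can be integrated numerically, the following Python snippet advances a lumped thermal network by one forward-Euler step. The function name `thermal_step`, the two-node example, and every numerical value are illustrative assumptions, not parameters identified in this study.

```python
import numpy as np

def thermal_step(T, P_loss, C, R, hA, T_fluid, dt):
    """Advance all node temperatures by one forward-Euler step of Eq. (2).

    T, P_loss, C, hA : length-n arrays (K, W, J/K, W/K); hA is the product h_i * A_i.
    R : n x n array of thermal resistances R_ij (K/W); np.inf marks "no path".
    """
    T = np.asarray(T, dtype=float)
    R = np.asarray(R, dtype=float)
    diff = T[None, :] - T[:, None]               # diff[i, j] = T_j - T_i
    conduction = np.sum(diff / R, axis=1)        # sum_j (T_j - T_i) / R_ij
    convection = np.asarray(hA) * (T - T_fluid)  # h_i A_i (T_i - T_fluid)
    dT_dt = (np.asarray(P_loss) + conduction - convection) / np.asarray(C)
    return T + dt * dT_dt

# Illustrative two-node example (placeholder values): a winding node heated by
# 500 W, coupled through 0.05 K/W to a liquid-cooled housing node.
R = np.array([[np.inf, 0.05], [0.05, np.inf]])
T_next = thermal_step(T=[360.0, 340.0], P_loss=[500.0, 0.0], C=[2000.0, 8000.0],
                      R=R, hA=[0.0, 40.0], T_fluid=330.0, dt=0.1)
print(T_next)
```

Setting the diagonal of `R` to infinity removes the self-coupling term, so the sum in Eq. (2) effectively runs over $j \neq i$. For stiff networks an implicit integrator would be a safer choice than forward Euler.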

Figure 1.

Schematic diagram of motor thermal network model structure.

The dynamic characteristics of the cooling system are described through a fluid network model, considering the influence of coolant flow rate, radiator efficiency, and ambient temperature.¹² The coolant temperature change rate is:

m_{fluid} c_{p, fluid} \frac{d T_{fluid}}{dt} = Q_{absorbed} - Q_{radiator}

(3)

where $m_{fluid}$ is the coolant mass (kg), $c_{p, fluid}$ is the coolant specific heat capacity (J/(kg K)), $Q_{absorbed}$ is the heat absorbed from the motor (W), and $Q_{radiator}$ is the heat dissipation from the radiator (W).
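The coolant energy balance of Eq. (3) can be stepped in the same way. The sketch below assumes that $Q_{absorbed}$ equals the sum of the convective terms in Eq. (2), which the text does not state explicitly, and treats $Q_{radiator}$ as a given input. The helper names and all numbers are hypothetical.

```python
def absorbed_heat(T_nodes, hA, T_fluid):
    """Heat picked up by the coolant (W): sum of h_i A_i (T_i - T_fluid).

    Assumption: this links Eq. (2) and Eq. (3) by energy conservation.
    """
    return sum(g * (T - T_fluid) for g, T in zip(hA, T_nodes))

def coolant_step(T_fluid, Q_absorbed, Q_radiator, m_fluid, cp_fluid, dt):
    """One forward-Euler step of Eq. (3)."""
    dT_dt = (Q_absorbed - Q_radiator) / (m_fluid * cp_fluid)
    return T_fluid + dt * dT_dt

# Placeholder values: 5 kg of coolant with cp = 3500 J/(kg K).
Q_abs = absorbed_heat(T_nodes=[360.0, 340.0], hA=[0.0, 40.0], T_fluid=330.0)
T_fluid_next = coolant_step(330.0, Q_abs, Q_radiator=50.0,
                            m_fluid=5.0, cp_fluid=3500.0, dt=1.0)
print(Q_abs, T_fluid_next)
```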

Active disturbance rejection controller (ADRC) design

The ADRC design is based on the principle of estimating and compensating for system uncertainties and external disturbances in a unified manner through an extended state observer (ESO).¹³ An improved active disturbance rejection controller was designed according to the characteristics of the motor temperature control system.

The system model can be expressed in standard form as:

\ddot{y} = f (y, \dot{y}, d, t) + b_{0} u

(4)

where $y$ denotes the system output (temperature deviation), $u$ is the control input (cooling power adjustment), $b_{0}$ is the control gain estimate, $f (\cdot)$ denotes the total disturbance including model uncertainties and external perturbations, and $d$ is the external disturbance vector.

The ESO adopts a nonlinear structure to enhance disturbance estimation accuracy:

\begin{cases} e = z_{1} - y \\ \dot{z}_{1} = z_{2} - β_{1} fal (e, α_{1}, δ_{1}) \\ \dot{z}_{2} = z_{3} - β_{2} fal (e, α_{2}, δ_{2}) + b_{0} u \\ \dot{z}_{3} = - β_{3} fal (e, α_{3}, δ_{3}) \end{cases}

(5)

where $z_{1}$ , $z_{2}$ , $z_{3}$ are the estimated values of the system state, its derivative, and the total disturbance, respectively, $β_{i}$ are the observer gains, and $fal (\cdot)$ is the nonlinear function:

fal (e, α, δ) = \begin{cases} e / δ^{1 - α}, & | e | \leq δ \\ | e |^{α} \cdot sign (e), & | e | > δ \end{cases}

(6)

where $α \in (0, 1)$ controls the degree of nonlinearity, and $δ > 0$ is the linear segment boundary. As shown in Figure 2, the nonlinear state error feedback (NLSEF) control law is expressed as:

u_{0} = k_{p} fal (e_{1}, α_{p}, δ_{p}) + k_{d} fal (e_{2}, α_{d}, δ_{d})

(7)

where $e_{1} = v - z_{1}$ is the tracking error, $e_{2} = \dot{v} - z_{2}$ is the differential error, $v$ is the reference trajectory, and $k_{p}$ , $k_{d}$ are gain parameters. The final control input is given by:

u = \frac{u_{0} - z_{3}}{b_{0}}

(8)
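The ESO, the NLSEF law, and the disturbance compensation of Eqs. (5) to (8) can be sketched in discrete time as follows. This is a minimal illustration using forward-Euler discretisation. The class name `ADRC`, the observer gains, the exponents, and $δ$ are placeholder assumptions; only $b_{0} = 0.5$ is taken from the ADRC-only entry in Table 4.

```python
import math

def fal(e, alpha, delta):
    """Eq. (6): linear inside |e| <= delta, power law outside."""
    if abs(e) <= delta:
        return e / delta ** (1.0 - alpha)
    return math.copysign(abs(e) ** alpha, e)

class ADRC:
    def __init__(self, beta, b0, kp, kd, dt,
                 alpha_eso=(0.9, 0.5, 0.25), alpha_p=0.75, alpha_d=0.9, delta=0.01):
        self.beta, self.b0, self.kp, self.kd, self.dt = beta, b0, kp, kd, dt
        self.alpha_eso, self.alpha_p, self.alpha_d = alpha_eso, alpha_p, alpha_d
        self.delta = delta
        self.z = [0.0, 0.0, 0.0]   # estimates of y, dy/dt, and total disturbance
        self.u = 0.0               # last applied control input

    def step(self, y, v, v_dot=0.0):
        z1, z2, z3 = self.z
        b1, b2, b3 = self.beta
        a1, a2, a3 = self.alpha_eso
        e = z1 - y
        # ESO, Eq. (5), discretised with forward Euler
        dz1 = z2 - b1 * fal(e, a1, self.delta)
        dz2 = z3 - b2 * fal(e, a2, self.delta) + self.b0 * self.u
        dz3 = -b3 * fal(e, a3, self.delta)
        z1, z2, z3 = z1 + self.dt * dz1, z2 + self.dt * dz2, z3 + self.dt * dz3
        self.z = [z1, z2, z3]
        # NLSEF, Eq. (7), followed by disturbance compensation, Eq. (8)
        u0 = (self.kp * fal(v - z1, self.alpha_p, self.delta)
              + self.kd * fal(v_dot - z2, self.alpha_d, self.delta))
        self.u = (u0 - z3) / self.b0
        return self.u

# Placeholder gains; one step from rest with a unit reference.
ctrl = ADRC(beta=(90.0, 2700.0, 27000.0), b0=0.5, kp=100.0, kd=20.0, dt=0.01)
u_first = ctrl.step(y=0.0, v=1.0)
print(u_first)
```

In practice the output `u` would also be saturated to the actuator range before being fed back into the observer. That step is omitted here for brevity.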

Figure 2.

Active disturbance rejection controller structure block diagram.

Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm architecture

The TD3 algorithm mitigates value function overestimation in deep reinforcement learning through three key mechanisms: dual Q-networks, delayed policy updates, and target policy smoothing.¹⁴ In this study, TD3 is employed as the upper-layer controller to optimize long-term cooling energy consumption in the motor thermal management system.
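Two of the three mechanisms, the dual Q-networks and target policy smoothing, appear together in the critic target. The sketch below shows that computation with plain callables standing in for the target networks. The function name and the smoothing parameters `sigma` and `noise_clip` are assumptions (commonly used TD3 defaults), not settings reported in this paper, while `gamma = 0.99` matches Table 4.

```python
import numpy as np

def td3_target(r, s_next, done, actor_target, q1_target, q2_target,
               gamma=0.99, sigma=0.2, noise_clip=0.5,
               a_low=0.0, a_high=1.0, rng=None):
    """Critic target y = r + gamma * (1 - done) * min(Q1', Q2')(s', a')."""
    rng = np.random.default_rng() if rng is None else rng
    a_next = np.asarray(actor_target(s_next), dtype=float)
    # Target policy smoothing: clipped Gaussian noise on the target action
    noise = np.clip(rng.normal(0.0, sigma, size=a_next.shape), -noise_clip, noise_clip)
    a_next = np.clip(a_next + noise, a_low, a_high)
    # Clipped double-Q: take the smaller of the two target critics
    q_min = min(q1_target(s_next, a_next), q2_target(s_next, a_next))
    return r + gamma * (1.0 - done) * q_min

# Toy stand-ins for the target networks (hypothetical constants).
y = td3_target(r=1.0, s_next=np.zeros(9), done=0.0,
               actor_target=lambda s: np.full(3, 0.5),
               q1_target=lambda s, a: 10.0,
               q2_target=lambda s, a: 8.0,
               sigma=0.0)
print(y)
```

The third mechanism, delayed policy updates, would amount to updating the actor and the target networks only once every few critic updates.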

The state space $S$ design comprehensively considers the observable variables of the system:

s_{t} = [T_{sw}, T_{sc}, T_{rm}, T_{rc}, T_{f}, P_{m}, ω_{m}, T_{amb}, t_{pred}]^{T} \in R^{9}

(9)

where $T_{sw}$ and $T_{sc}$ are the stator winding and core temperatures, respectively, $T_{rm}$ and $T_{rc}$ are the rotor magnet and core temperatures, respectively, $T_{f}$ is the coolant temperature, $P_{m}$ is the motor power, $ω_{m}$ is the rotational speed, $T_{amb}$ is the ambient temperature, and $t_{pred}$ corresponds to predicted future operating conditions.

The action vector $a_{t}$ consists of three continuous control variables:

a_{t} = [u_{pump}, u_{valve}, u_{fan}]^{T} \in [0, 1]^{3}

(10)

where $u_{pump}$ controls the pump flow rate (0–60 L/min), $u_{valve}$ controls the valve opening (0%–100%), and $u_{fan}$ controls the fan speed (0–2400 rpm).
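A minimal sketch of mapping the normalised action of Eq. (10) onto the actuator ranges quoted above is given below. The linear scaling and the function name `scale_action` are assumptions, because the text does not specify the mapping.

```python
import numpy as np

# Upper limits of the physical actuator ranges quoted in the text
PUMP_MAX_LPM = 60.0
VALVE_MAX_PCT = 100.0
FAN_MAX_RPM = 2400.0

def scale_action(a):
    """Map a = [u_pump, u_valve, u_fan] in [0, 1]^3 to physical commands.

    Assumption: each command scales linearly with its normalised action.
    """
    a = np.clip(np.asarray(a, dtype=float), 0.0, 1.0)  # enforce the action bounds
    pump, valve, fan = a * np.array([PUMP_MAX_LPM, VALVE_MAX_PCT, FAN_MAX_RPM])
    return {"pump_lpm": float(pump), "valve_pct": float(valve), "fan_rpm": float(fan)}

cmd = scale_action([0.5, 0.25, 1.2])   # the out-of-range fan action is clipped to 1
print(cmd)
```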

The reward function design needs to balance multiple objectives¹⁵:

\begin{aligned} r_{t} = {} & - w_{1} | T_{t} - T_{ref} |^{2} - w_{2} P_{cool, t} \\ & - w_{3} \max (0, T_{\max, t} - T_{\lim})^{2} - w_{4} \| Δ u_{t} \|_{2}^{2} \end{aligned}

(11)

where $w_{i}$ are the weight coefficients, $T_{ref}$ is the target temperature, $P_{cool, t}$ is the cooling power consumption, $T_{\max, t}$ is the maximum monitored temperature at step $t$, $T_{\lim}$ is the safety limit, and $Δ u_{t}$ is the control increment.
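The four terms of Eq. (11) can be evaluated as in the sketch below. The weight values are hypothetical placeholders, since the paper does not list $w_{1}$ to $w_{4}$ in this section, and the sample inputs are chosen only to make each term easy to follow.

```python
import numpy as np

def reward(T, T_ref, P_cool, T_max, T_lim, du, w=(1.0, 0.1, 10.0, 0.01)):
    """Eq. (11): tracking, energy, safety-limit, and smoothness penalties."""
    w1, w2, w3, w4 = w                            # placeholder weights
    tracking = w1 * (T - T_ref) ** 2
    energy = w2 * P_cool
    safety = w3 * max(0.0, T_max - T_lim) ** 2    # active only above the limit
    smooth = w4 * float(np.dot(du, du))           # squared 2-norm of the increment
    return -(tracking + energy + safety + smooth)

# 2 °C tracking error, 2 kW cooling power, 1 °C above the limit, small increment
r = reward(T=87.0, T_ref=85.0, P_cool=2.0, T_max=96.0, T_lim=95.0,
           du=[0.1, 0.0, -0.2])
print(r)
```

With these placeholder weights the safety term dominates as soon as the limit is exceeded, which is the usual intent of a one-sided quadratic penalty.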

As shown in Figure 3, the TD3 algorithm includes one Actor network $μ (s | θ^{μ})$ and two Critic networks $Q_{i} (s, a | θ^{Q_{i}})$ . The corresponding target networks are updated using a soft update mechanism:

θ^{'} \leftarrow τ θ + (1 - τ) θ^{'}

(12)

where $τ \ll 1$ is the soft update rate.
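The soft update of Eq. (12) is a per-parameter exponential moving average. A minimal sketch over lists of NumPy arrays follows. The default `tau = 0.005` is an assumed, commonly used value rather than one reported here.

```python
import numpy as np

def soft_update(target_params, online_params, tau=0.005):
    """Eq. (12): theta' <- tau * theta + (1 - tau) * theta', per parameter array."""
    return [tau * p + (1.0 - tau) * p_t
            for p, p_t in zip(online_params, target_params)]

# Toy example: the target moves 10% of the way toward the online parameters.
new_target = soft_update([np.zeros(2)], [np.ones(2)], tau=0.1)
print(new_target[0])
```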

Figure 3.

TD3 algorithm network architecture.

ADRC–TD3 hybrid control strategy integration framework

The proposed hybrid control strategy integrates ADRC and TD3 within a hierarchical architecture to exploit their respective strengths.¹⁶ In this structure, TD3 operates at the upper layer to perform long-term energy optimization and generate reference trajectories, while ADRC functions at the lower layer to ensure fast temperature tracking and disturbance rejection.

As illustrated in Figure 4, the core of the integration framework lies in the coordination mechanism between the two control layers. The TD3 module generates a temperature reference trajectory and a feedforward control signal:

[T_{ref}^{*} (t), u_{ff} (t)] = π_{TD3} (s_{t}, H_{t})

(13)

where $H_{t}$ is the historical information set. The ADRC controller produces a compensatory control signal based on real-time error feedback:

u_{fb} (t) = f_{ADRC} (T_{ref}^{*} (t) - T (t), \hat{f} (t))

(14)

where $\hat{f} (t)$ is the total disturbance estimated by ESO. The final control input is determined by combining the feedforward and feedback components:

u (t) = α (t) \cdot u_{ff} (t) + [1 - α (t)] \cdot u_{fb} (t)

(15)

where $α (t) \in [0, 1]$ is an adaptive weighting coefficient that is dynamically adjusted according to the system state:

α (t) = \frac{1}{1 + e^{- k (| e (t) | - e_{th})}}

(16)

where $e (t)$ is the temperature error, $e_{th}$ is the switching threshold, and $k$ is the switching rate parameter.
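A minimal sketch of the blending in Eqs. (15) and (16) is given below. The values of $e_{th}$ and $k$ are hypothetical, since the paper does not report them in this section.

```python
import math

def blend_weight(e, e_th, k):
    """Eq. (16): logistic weight in the absolute temperature error."""
    return 1.0 / (1.0 + math.exp(-k * (abs(e) - e_th)))

def blend_control(u_ff, u_fb, e, e_th=2.0, k=2.0):
    """Eq. (15): convex combination of feedforward and feedback signals."""
    alpha = blend_weight(e, e_th, k)
    return alpha * u_ff + (1.0 - alpha) * u_fb

# At |e| = e_th the two signals are weighted equally.
print(blend_weight(2.0, e_th=2.0, k=2.0))
print(blend_control(u_ff=1.0, u_fb=0.0, e=2.0))
```

With Eq. (16) as written, $α$ approaches 1 when $| e (t) |$ is well above $e_{th}$ and approaches 0 when it is well below, and $k$ sets how sharp the transition is.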

Figure 4.

ADRC–TD3 hybrid control architecture.

Experimental platform construction and test environment configuration

The experimental platform was built based on a 150 kW permanent magnet synchronous motor, integrating a complete thermal management system and data acquisition system.¹⁷ Figure 5 illustrates the hardware-in-the-loop (HIL) experimental configuration for electric vehicle motor thermal management.

Figure 5.

Hardware-in-the-loop experimental platform for electric vehicle motor thermal management.

As shown in Figure 5, the ADRC–TD3 control algorithm is executed on a dSPACE MicroAutoBox II real-time controller in a closed-loop configuration. The controller generates control commands for the motor driver and cooling actuators, while temperature, flow, and pressure measurements are fed back to the controller in real time. This HIL setup provides the experimental basis for evaluating the fast dynamic response and real-time feasibility reported in Table 1. All experimental results presented in this paper are obtained from the physical HIL bench rather than numerical simulation. The technical specifications of the key equipment are summarized in Table 2.

Table 1.

Dynamic response performance comparison under extreme conditions.

Condition type	Performance indicator	PID	MPC	ADRC only	TD3 only	ADRC–TD3
Rapid acceleration (0–100 km/h)	Response time (s)	5.1	3.8	3.2	4.3	2.8
	Temperature rise rate (°C/s)	4.1	3.2	2.8	3.5	2.5
	Maximum temperature (°C)	97.2	93.5	92.1	94.3	91.3
Continuous hill climbing (30%/5 min)	Average temperature (°C)	89.2	86.8	85.6	87.3	84.1
	Temperature fluctuation (°C)	±5.2	±3.6	±2.8	±3.9	±2.0
	Adjustment count	47	32	28	35	23
Rapid deceleration (100–0 km/h)	Temperature drop rate (°C/s)	1.8	2.3	2.7	2.1	3.0
	Recovery time (s)	68	52	43	58	37
	Overcooling amplitude (°C)	8.3	5.6	3.2	6.1	2.4

Table 2.

Main equipment parameters of experimental platform.

Equipment name	Model specification	Technical parameters	Accuracy grade
Permanent magnet synchronous motor	PMSM-150	150 kW/380 V/3000 rpm	–
Temperature sensor	PT100-A grade	−50 to 200 °C	±0.15 °C
Flow meter	LWGY-25	0.5–10 m³/h	±0.5%
Variable frequency water pump	WLB-50	0–60 L/min	–
Real-time controller	dSPACE MAB-II	900 MHz/1 GB RAM	–
Power analyzer	WT3000E	0.01% rdg	±0.02%

The test environment covers standard and extreme operating conditions. Standard conditions use the NEDC, WLTC, and CLTC cycles; extreme conditions include rapid acceleration (0–100 km/h), continuous hill climbing (30% gradient), and a high-temperature environment (45 °C).^18,19 The test condition design drew on recent energy management strategy research.^20,21 Because energy management of hybrid electric vehicles involves coordinated control of multiple energy storage systems, this paper also drew on the associated control architecture design experience.^22,23 Energy management methods based on real-time model predictive control provided theoretical support for online optimization of the TD3 algorithm.²⁴

Algorithm implementation process and pseudocode

The ADRC–TD3 hybrid control algorithm is implemented using a modular design to facilitate debugging, parameter tuning, and future extensibility.²⁵ The overall execution flow of the hybrid control strategy is summarized in Table 3.

Table 3.

ADRC–TD3 hybrid control main program pseudocode.

Line	Code content
1	Initialize: ADRC_params, TD3_networks, Replay_buffer
2	Load: Pre-trained TD3 models
3	while system_running do
4	Read: T_current, P_motor, ω_motor, T_ambient
5	s_t ← Construct_State (sensor_data)
6	if training_mode then
7	a_t ← TD3_Actor (s_t) + exploration_noise
8	else
9	a_t ← TD3_Actor (s_t)
10	end if
11	T_ref, u_ff ← Decode_Action (a_t)
12	e_T ← T_ref - T_current
13	if
14	u_fb ← ADRC_Controller (e_T, ESO_states)
15	u_final ← 0.3 × u_ff + 0.7 × u_fb
16	else
17	u_final ← 0.8 × u_ff + 0.2 × ADRC_Controller (e_T)
18	end if
19	Apply_Control (u_final)
20	Store_Transition (s_t, a_t, r_t, s_{t + 1})
21	end while

Table 3 presents the main program structure of the ADRC–TD3 hybrid controller. During operation, sensor measurements are first collected to construct the system state vector.^26,27 The TD3 actor network generates supervisory control actions, which are decoded into a temperature reference trajectory and feedforward control signals.²⁸ The ADRC controller then computes feedback compensation based on the temperature tracking error and the estimated disturbances from the extended state observer (ESO).²⁹ The final control command is obtained through adaptive weighting between feedforward and feedback components and is applied to the motor thermal management system. The TD3 network training is performed using offline batch updates, with network parameters updated once every 100 control cycles. This training schedule reduces online computational burden while maintaining policy learning stability.

Performance evaluation index and comparison benchmark setting

A comprehensive performance evaluation system is constructed from four dimensions: control accuracy, energy efficiency, dynamic response, and robustness. Temperature control accuracy is evaluated using the following metrics:

RMSE = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(T_{i} - T_{ref, i})}^{2}}

(17)

MAE = \frac{1}{N} \sum_{i = 1}^{N} | T_{i} - T_{ref, i} |

(18)

IAE = \int_{0}^{T_{end}} | e (t) | dt

(19)
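The three accuracy metrics of Eqs. (17) to (19) can be computed from sampled data as sketched below. The IAE integral is approximated with the trapezoidal rule on a uniform grid, which is an implementation choice not specified in the text, and the sample temperatures are made up for illustration.

```python
import numpy as np

def rmse(T, T_ref):
    """Eq. (17): root-mean-square tracking error."""
    err = np.asarray(T, dtype=float) - T_ref
    return float(np.sqrt(np.mean(err ** 2)))

def mae(T, T_ref):
    """Eq. (18): mean absolute tracking error."""
    err = np.asarray(T, dtype=float) - T_ref
    return float(np.mean(np.abs(err)))

def iae(e, dt):
    """Eq. (19): integral of |e(t)|, trapezoidal rule with uniform step dt."""
    abs_e = np.abs(np.asarray(e, dtype=float))
    return float(dt * (abs_e.sum() - 0.5 * (abs_e[0] + abs_e[-1])))

T_meas = [84.0, 86.0, 85.0, 88.0]    # made-up samples, 1 s apart
T_set = 85.0
errors = np.asarray(T_meas) - T_set
print(rmse(T_meas, T_set), mae(T_meas, T_set), iae(errors, dt=1.0))
```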

Energy efficiency indicators are defined as:

η_{thermal} = \frac{Q_{removed}}{W_{cooling}} \times 100 \%

(20)

COP = \frac{Q_{cooling}}{P_{input}}

(21)

where $Q_{removed}$ denotes the removed heat, $W_{cooling}$ is the cooling system energy consumption, $Q_{cooling}$ is the cooling capacity, and $P_{input}$ is the input power.

Dynamic response indicators include rise time $t_{r}$ , settling time $t_{s}$ , overshoot $M_{p}$ , and steady-state error $e_{ss}$ .
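The paper does not state the exact definitions used for these indicators. The sketch below assumes common conventions: a 10% to 90% rise time, a ±2% settling band, and overshoot expressed as a percentage of the step size. The function name and the test signal are illustrative.

```python
import numpy as np

def step_metrics(t, y, y_ref, y0, band=0.02):
    """Rise time, settling time, overshoot (%), and steady-state error.

    Assumed conventions: 10-90% rise time and a +/- `band` settling band,
    both relative to the step size y_ref - y0.
    """
    t, y = np.asarray(t, dtype=float), np.asarray(y, dtype=float)
    frac = (y - y0) / (y_ref - y0)               # normalised response, 0 -> 1
    t10 = t[np.argmax(frac >= 0.1)]              # first sample at or above 10%
    t90 = t[np.argmax(frac >= 0.9)]              # first sample at or above 90%
    overshoot = max(0.0, float(frac.max()) - 1.0) * 100.0
    outside = np.nonzero(np.abs(frac - 1.0) > band)[0]
    # Settling time: first sample after the last excursion outside the band
    t_s = t[0] if outside.size == 0 else t[min(outside[-1] + 1, len(t) - 1)]
    return {"t_r": float(t90 - t10), "t_s": float(t_s),
            "M_p": overshoot, "e_ss": float(y_ref - y[-1])}

# First-order test signal with unit time constant (no overshoot expected).
t = np.linspace(0.0, 10.0, 1001)
m = step_metrics(t, 1.0 - np.exp(-t), y_ref=1.0, y0=0.0)
print(m)
```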

Table 4 lists the configuration parameters of comparison benchmark algorithms.

Table 4.

Comparison algorithm parameter configuration.

Algorithm type	Key parameter settings	Sampling period	Prediction horizon
PID	K_p = 2.5, K_i = 0.8, K_d = 0.3	10 ms	–
MPC	N = 20, N_u = 5, Q = diag (10, 1), R = 0.1	100 ms	2 s
Fuzzy control	7 × 7 rule base, Mamdani inference	50 ms	–
ADRC only	ω_o = 30, ω_c = 10, b_0 = 0.5	10 ms	–
TD3 only	lr = 3e-4, γ = 0.99, batch = 128	100 ms	–

Robustness is evaluated by introducing parameter perturbations and external disturbances. The parameter perturbation range is set to ±30% of the nominal value, and external disturbances include step load changes, random noise interference, and sensor faults. The performance degradation rate is defined as:

Δ P = \frac{P_{nominal} - P_{perturbed}}{P_{nominal}} \times 100 \%

(22)

where $P_{nominal}$ and $P_{perturbed}$ are the performance indicators under nominal and perturbed conditions, respectively.

The construction of the performance evaluation system drew on the evaluation method of adaptive equivalent ratio model predictive control.³⁰ The application of machine learning methods in hybrid electric vehicle energy management provided reference benchmarks for algorithm performance comparison.³¹ Control strategy optimization based on energy flow experiments validated the rationality of the experimental scheme in this paper.³² Reinforcement learning control research on battery and cabin thermal management provided technical reference for system integration testing.³³

For the PID baseline, the gains $K_{p}$ , $K_{i}$ , and $K_{d}$ are tuned on a held-out validation set using a grid search to minimize the temperature tracking RMSE while constraining overshoot and steady-state error. The search ranges and final selected values are reported in Table 4 for reproducibility. For ADRC, the observer bandwidth $ω_{o}$ , controller bandwidth $ω_{c}$ , and the estimated control gain $b_{0}$ are selected following the bandwidth-parameterization principle and then fine-tuned on the validation set under the same objective. For TD3, standard hyperparameters are used and tuned on the validation set, including learning rate, discount factor, batch size, target smoothing noise, and delayed policy update interval. All hyperparameter ranges and final settings are explicitly reported to ensure reproducibility.
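For orientation, the bandwidth-parameterization principle in its standard linear-ADRC form places all observer poles at $- ω_{o}$ and all controller poles at $- ω_{c}$. The sketch below computes the resulting gains for a third-order linear ESO and a PD-type control law. It is an assumption that these would serve as starting values here, because the controller described in this paper uses the nonlinear $fal$-based structure, and the function name is hypothetical.

```python
def bandwidth_gains(omega_o, omega_c):
    """Linear-ADRC gains from observer and controller bandwidths.

    Observer: (s + omega_o)^3 = s^3 + 3 omega_o s^2 + 3 omega_o^2 s + omega_o^3
    Controller: (s + omega_c)^2 = s^2 + 2 omega_c s + omega_c^2
    """
    beta = (3.0 * omega_o, 3.0 * omega_o ** 2, omega_o ** 3)
    kp, kd = omega_c ** 2, 2.0 * omega_c
    return beta, kp, kd

# Bandwidths of the ADRC-only baseline in Table 4
beta, kp, kd = bandwidth_gains(omega_o=30.0, omega_c=10.0)
print(beta, kp, kd)
```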

Results

Temperature control accuracy analysis

Motor temperature control accuracy directly affects system safety and reliability. This section systematically evaluates the temperature regulation performance of the ADRC–TD3 hybrid control strategy under different operating conditions based on multiple experimental trials. The target temperature was set to 85 °C with an ambient temperature of 25 °C. Temperature responses of key components, including the stator winding and rotor magnet, were measured and analyzed.

As shown in Figure 6, during the 1180 s NEDC driving cycle test, the ADRC–TD3 strategy achieved superior temperature tracking performance. The maximum stator temperature deviation during load transitions was limited to 2.3 °C, representing a 73.6% reduction compared with the 8.7 °C deviation observed under conventional PID control. In addition, the settling time required to reach steady-state temperature was reduced from 48 s (PID) to 23 s, corresponding to a 52.1% improvement in response speed.

Figure 6.

Stator temperature tracking curve under NEDC condition.

Table 5 details the temperature control accuracy statistics for different parts.

Table 5.

Temperature control accuracy comparison for key parts (NEDC condition).

Monitoring position	Control method	Average temperature (°C)	RMSE (°C)	Maximum deviation (°C)	Overshoot (%)	Steady-state error (°C)
Stator winding	PID	85.8	3.76	8.7	10.2	±1.8
	MPC	85.3	2.45	5.4	6.4	±1.2
	ADRC–TD3	85.1	1.38	2.3	2.7	±0.5
Rotor magnet	PID	88.2	4.23	9.8	11.5	±2.1
	MPC	87.4	2.86	6.2	7.3	±1.4
	ADRC–TD3	86.8	1.67	3.1	3.6	±0.7
Stator core	PID	83.6	3.12	7.3	8.6	±1.5
	MPC	83.2	2.08	4.6	5.4	±1.0
	ADRC–TD3	83.0	1.15	2.0	2.4	±0.4

As shown in Table 5, the consistent reduction in RMSE, maximum deviation, and overshoot across all monitoring positions indicates that the proposed ADRC–TD3 strategy improves both transient and steady-state temperature regulation. This improvement can be mainly attributed to the complementary roles of the two control layers. The ADRC component effectively compensates for unmodeled thermal dynamics and external disturbances, thereby reducing peak deviations during load changes, while the TD3-based high-level controller adjusts cooling intensity to avoid aggressive control actions that typically lead to overshoot. As a result, the hybrid strategy achieves not only lower tracking error but also smoother temperature evolution, which is particularly important for preventing thermal fatigue in motor components.

As shown in Figure 7, under the more dynamic WLTC condition, the temperature control error exhibits normal distribution characteristics. The error distribution of the ADRC–TD3 strategy is more concentrated, with 95% of sampling points having temperature errors within ±2 °C, while the error distribution range of PID control reaches ±5 °C. The error standard deviation decreased from 3.2 °C for PID to 1.1 °C, indicating a significant improvement in control stability. The more concentrated error distribution under the WLTC condition further suggests that the ADRC–TD3 strategy provides improved control consistency under highly dynamic driving scenarios. The reduction in error variance implies that the controller can maintain stable performance despite frequent operating point transitions, which is difficult to achieve with fixed-parameter PID control. This statistical characteristic highlights the robustness advantage of the proposed hybrid control architecture.

Figure 7.

Temperature control error distribution under WLTC condition.

Energy consumption optimization performance evaluation

Cooling system energy consumption is an important factor affecting overall vehicle range. This section evaluates the energy consumption optimization effect of the hybrid control strategy from three dimensions: power consumption, energy efficiency, and energy saving rate.

As shown in Figure 8, under the standard driving cycle, the average cooling power of the ADRC–TD3 strategy was 1.85 kW, representing a 32.0% reduction compared with 2.72 kW under PID control and a 19.9% reduction compared with 2.31 kW under MPC. The peak cooling power decreased from 4.8 kW (PID) to 3.2 kW under the proposed strategy, thereby reducing instantaneous battery load during high-demand phases.
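For reference, the quoted reductions follow directly from the reported power values. The third ratio, for peak power, is not stated in the text and is derived here from the two peak values.

```latex
\frac{2.72 - 1.85}{2.72} \approx 0.320, \qquad
\frac{2.31 - 1.85}{2.31} \approx 0.199, \qquad
\frac{4.8 - 3.2}{4.8} \approx 0.333
```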

Figure 8.

Cooling system power consumption curve under typical conditions.

Table 6 summarizes the energy consumption statistics under different operating conditions. The ADRC–TD3 strategy achieved an energy saving rate exceeding 30% across all tested scenarios, while the cooling energy ratio (CER) remained below 3.5%. The reduction in both average and peak cooling power demonstrates that the proposed strategy avoids unnecessary overcooling, which is a common drawback of rule-based or purely feedback controllers. By jointly considering temperature regulation and energy consumption objectives, the TD3-based supervisory controller enables anticipative cooling actions, while ADRC ensures fast correction when thermal disturbances occur. This coordinated behavior explains why the energy saving rate consistently exceeds 30% across different operating conditions, rather than being limited to a specific driving cycle.

Table 6.

Cooling system energy consumption comparison analysis under different conditions.

Test condition	Duration (min)	PID energy consumption (kWh)	PID CER (%)	MPC energy consumption (kWh)	MPC CER (%)	ADRC–TD3	Energy saving rate (%)
NEDC	20	0.91	4.6	0.77	3.9
WLTC	30	1.36	5.1	1.16	4.3
Urban condition	25	0.85	3.8	0.72	3.2
Highway condition	15	0.98	5.4	0.84	4.6
Combined condition	40	1.82	4.8	1.54	4.1

As shown in Figure 9, the hybrid control strategy optimizes energy consumption by coordinating the working states of the pump and fan. In the low-load phase (0–300 s), the pump flow rate is held at 18 L/min and the fan power is only 120 W; in the high-load phase (600–900 s), the system responds quickly by increasing the flow rate to 48 L/min and the fan power to 680 W; in the steady-state phase, precise control avoids unnecessary overcooling. The coordinated adjustment of pump flow rate and fan power illustrates how the hybrid controller dynamically allocates cooling resources based on real-time thermal demand. Such coordination reduces mechanical wear of auxiliary components and lowers instantaneous battery load, which contributes to improved system reliability and long-term energy efficiency.

Figure 9.

Coordinated optimization curve of pump flow rate and fan power.

Control response characteristics under dynamic conditions

The frequent operating condition transitions in actual electric vehicle operation place strict requirements on the dynamic response capability of the control system.

As shown in Figure 10, during the 0–100 km/h rapid acceleration test, motor power increased from 15 to 150 kW within 12 s. Under this condition, the ADRC–TD3 strategy limited the temperature rise rate to 2.5 °C/s, with a maximum temperature of 91.3 °C, remaining below the 95 °C safety threshold. In contrast, the PID controller exhibited a temperature rise rate of 4.1 °C/s and a maximum temperature of 97.2 °C, indicating potential overheating risk. The response time of the hybrid strategy was 2.8 s, which was ∼45% shorter than that of PID control.

Figure 10.

Temperature dynamic response under rapid acceleration condition.

Table 1 summarizes the dynamic response metrics under extreme operating conditions. Compared with the ADRC-only and TD3-only strategies, the hybrid controller achieved both the shortest response time and the lowest temperature rise rate. These results suggest that neither classical feedback control nor reinforcement learning alone was sufficient to manage rapid power transitions effectively. The ADRC layer provided fast disturbance rejection during abrupt changes, while the TD3 layer mitigated delayed or excessive cooling actions, thereby improving safety margins under aggressive driving scenarios.

To evaluate the real-time capability of the proposed ADRC–TD3 hybrid control strategy, the computational latency and control update period were measured under hardware-in-the-loop conditions. The low-level ADRC controller operates at a fixed control period of 10 ms, while the high-level TD3 policy is updated at a lower frequency and only provides supervisory setpoints. As a result, the worst-case execution time of the entire control loop remains below 8.4 ms, ensuring that all control actions are completed within a single sampling period.

Compared with MPC and pure TD3 controllers, the proposed hierarchical strategy significantly reduces online computational burden by decoupling fast control from learning-based optimization. Although the overall computational complexity is higher than that of PID and ADRC-only methods, the measured CPU utilization remains below 12% on the target embedded platform, indicating that the proposed algorithm satisfies real-time requirements for vehicle-mounted applications.
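
The decoupling of the fast ADRC loop from the slower TD3 supervisory layer can be illustrated with a minimal two-rate scheduling sketch. Only the 10 ms low-level period is taken from the text; the 1 s supervisory period, the function names, and the toy controller and plant models are assumptions introduced for illustration and do not represent the authors' implementation.

```python
from typing import Callable, List

ADRC_PERIOD_MS = 10     # low-level control period stated in the text
TD3_PERIOD_MS = 1000    # hypothetical supervisory period; the text only says "lower frequency"


def run_two_rate_loop(
    duration_ms: int,
    supervisor: Callable[[float], float],
    fast_controller: Callable[[float, float], float],
    plant_step: Callable[[float, float], float],
    initial_temp: float,
) -> List[float]:
    """Step the fast loop every ADRC_PERIOD_MS; refresh the setpoint every TD3_PERIOD_MS."""
    temp = initial_temp
    setpoint = initial_temp  # overwritten at t = 0 by the first supervisory update
    history: List[float] = []
    for t_ms in range(0, duration_ms, ADRC_PERIOD_MS):
        if t_ms % TD3_PERIOD_MS == 0:
            setpoint = supervisor(temp)            # slow layer: new temperature setpoint
        cooling = fast_controller(setpoint, temp)  # fast layer: runs every period
        temp = plant_step(temp, cooling)
        history.append(temp)
    return history


# Toy stand-ins (all hypothetical): constant setpoint, proportional cooling, linear plant.
supervisor_calls: List[float] = []


def toy_supervisor(temp: float) -> float:
    supervisor_calls.append(temp)
    return 80.0


def toy_fast_controller(setpoint: float, temp: float) -> float:
    return 0.5 * (temp - setpoint)


def toy_plant_step(temp: float, cooling: float) -> float:
    return temp - 0.1 * cooling


history = run_two_rate_loop(3000, toy_supervisor, toy_fast_controller, toy_plant_step, 90.0)
print(len(history), len(supervisor_calls), round(history[-1], 3))
```

In this sketch the supervisory function is evaluated only once per 100 fast steps, so its cost is amortized over many control periods, which is the structural reason a hierarchical scheme can keep the per-period execution time bounded.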

As shown in Figure 11, during the continuous urban–suburban–highway transition test, the system experienced six major power step changes. The ADRC–TD3 strategy completed adaptive adjustment within ∼15 s after each transition, maintaining temperature fluctuations within ±2.2 °C. In contrast, traditional control methods exhibited temperature fluctuations exceeding ±5 °C and required more than 40 s to re-stabilize.

Figure 11.

Temperature control performance during continuous operating condition transitions.

Algorithm convergence and stability validation

Training efficiency and convergence behavior are critical considerations for the practical deployment of reinforcement learning algorithms in engineering applications. As shown in Figure 12, the cumulative reward of the TD3 algorithm increased progressively during training. After ∼8500 training episodes, the reward had risen from an initial value of −420 and converged at around −128, with post-convergence fluctuations remaining within ±15. In comparison, the standard DDPG algorithm required ∼16,000 episodes to reach convergence, so TD3 needed ∼46.9% fewer episodes under the same training conditions.

Figure 12.

TD3 algorithm training process convergence curve.
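
The quoted convergence gain follows directly from the two episode counts. Writing N_DDPG and N_TD3 for the episodes to convergence (notation introduced here for illustration only), the relative reduction is:

```latex
\frac{N_{\mathrm{DDPG}} - N_{\mathrm{TD3}}}{N_{\mathrm{DDPG}}}
  = \frac{16000 - 8500}{16000}
  = 0.46875 \approx 46.9\%
```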

Table 7 presents the convergence performance under different initialization strategies. The results show that pre-training initialization significantly accelerated convergence, reducing the required training episodes by 52.7% relative to random initialization (from 11,200 to 5300) and increasing the convergence success rate to 96%. The improved convergence behavior can be attributed to the structural advantages of TD3, including reduced overestimation bias and delayed policy updates, which enhance training stability. Furthermore, the results indicate that pre-training and transfer learning reduced the exploration burden, which is particularly important for practical deployment scenarios where extensive online training is infeasible.

Table 7.

TD3 algorithm convergence performance statistics.

Initialization method	Convergence episodes	Final reward value	Reward variance	Training time (h)	Success rate (%)
Random initialization	11,200	−145	42.3	20.5	85
Xavier initialization	8500	−128	28.6	15.6	92
Pre-training initialization	5300	−122	21.4	9.7	96
Transfer learning	3800	−125	24.8	7	94
Hybrid initialization	7200	−126	26.2	13.2	93
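
The episode reductions implied by Table 7 can be checked with a short calculation. The sketch below uses only the convergence-episode column of the table and takes random initialization as the baseline, which is an assumption consistent with the 52.7% figure quoted for pre-training; the variable and function names are illustrative.

```python
# Convergence episodes copied from Table 7.
episodes = {
    "Random initialization": 11200,
    "Xavier initialization": 8500,
    "Pre-training initialization": 5300,
    "Transfer learning": 3800,
    "Hybrid initialization": 7200,
}


def reduction_vs_baseline(counts, baseline="Random initialization"):
    """Percentage reduction in episodes relative to the baseline method."""
    base = counts[baseline]
    return {
        name: round(100.0 * (base - n) / base, 1)
        for name, n in counts.items()
        if name != baseline
    }


reductions = reduction_vs_baseline(episodes)
for name, pct in reductions.items():
    print(f"{name}: {pct}% fewer episodes than random initialization")
```

On this basis transfer learning needs the fewest episodes, while pre-training initialization retains the highest success rate in the table (96% versus 94%).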

These results demonstrate that the proposed learning framework achieves stable convergence while maintaining practical training efficiency.

As shown in Figure 13, the estimation errors of the extended state observer (ESO) for system states and total disturbances were maintained within 3% and 5%, respectively. Under a 20% parameter perturbation condition, the estimation accuracy remained within an acceptable range, confirming the robustness of the observer design. When the observer bandwidth increased from 10 to 50 rad/s, the estimation delay decreased from 15 to 3 ms; however, noise amplification effects became more pronounced, illustrating the trade-off between response speed and noise sensitivity.

Figure 13.

Extended state observer estimation accuracy.
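
The effect of observer bandwidth on estimation speed can be illustrated with a minimal linear ESO sketch. The observer order, the bandwidth parameterization of the gains (β1 = 2ω_o, β2 = ω_o²), the integrator-type plant, and the unit step disturbance are all assumptions chosen for illustration; this section does not give the authors' ESO structure or gains, and measurement noise is not modelled here.

```python
def simulate_leso(omega_o, dt=0.001, t_end=1.0, b0=1.0):
    """Estimate a step disturbance with a second-order linear ESO.

    Returns the final disturbance estimate and the time at which the
    estimate first reaches 90% of the true disturbance.
    """
    beta1, beta2 = 2.0 * omega_o, omega_o ** 2  # assumed bandwidth parameterization
    x = 0.0             # true plant state (e.g. a temperature deviation)
    z1, z2 = 0.0, 0.0   # estimates of the state and of the total disturbance
    d_true = 1.0        # illustrative unit step disturbance
    u = 0.0             # control input held at zero to isolate the observer
    t90 = None
    for k in range(int(t_end / dt)):
        x += dt * (d_true + b0 * u)            # plant: x' = d + b0 * u
        e = z1 - x                             # output estimation error
        z1 += dt * (z2 + b0 * u - beta1 * e)   # state estimate update
        z2 += dt * (-beta2 * e)                # disturbance estimate update
        if t90 is None and z2 >= 0.9 * d_true:
            t90 = (k + 1) * dt
    return z2, t90


z2_slow, t90_slow = simulate_leso(omega_o=10.0)
z2_fast, t90_fast = simulate_leso(omega_o=50.0)
print(f"10 rad/s: estimate {z2_slow:.4f}, 90% time {t90_slow:.3f} s")
print(f"50 rad/s: estimate {z2_fast:.4f}, 90% time {t90_fast:.3f} s")
```

In this idealized setting the time to reach 90% of the disturbance scales roughly with 1/ω_o, which matches the direction of the reported reduction in estimation delay. The same higher gains would also multiply any measurement noise entering through the error term, which is the other side of the trade-off.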

Comparative analysis with traditional control strategies

A comprehensive comparative analysis was conducted to evaluate the overall advantages of the proposed hybrid control strategy. As shown in Figure 14, six performance dimensions were considered: control accuracy, energy saving performance, response speed, robustness, computational efficiency, and implementation complexity. The ADRC–TD3 strategy achieved high scores in control accuracy (9.2), energy saving performance (9.0), and robustness (8.9), resulting in an overall composite score of 8.53, which was higher than those of the comparison methods.

Figure 14.

Comprehensive performance radar chart of different control strategies.

Table 8 presents detailed quantitative comparisons of the performance indicators. The radar chart and composite evaluation results indicate that the ADRC–TD3 strategy achieved balanced performance across multiple dimensions, rather than optimizing a single metric at the expense of others. Although computational resource consumption was higher than that of purely classical controllers, the improvements in control accuracy, robustness, and energy efficiency suggest that the additional computational cost is acceptable for safety-critical electric vehicle applications.

Table 8.

Comprehensive performance comparison analysis of control strategies.

Performance dimension	Evaluation indicator	PID	MPC	Fuzzy control	ADRC only	TD3 only	ADRC–TD3
Control accuracy	RMSE (°C)	3.82	2.64	3.15	2.15	2.93	1.47
Control accuracy	Steady-state error (%)	2.1	1.4	1.7	1.1	1.5	0.6
Energy saving effect	Average power (kW)	2.72	2.31	2.48	2.18	2.05	1.85
Energy saving effect	CER (%)	5.4	4.6	4.9	4.3	4.1	3.7
Response speed	Rise time (s)	8.5	6.2	7.1	4.8	7.3	3.6
Response speed	Settling time (s)	48	38	42	31	45	23
Robustness	Parameter sensitivity	High	Medium	Medium	Low	Medium	Low
Robustness	Disturbance rejection (dB)	12	18	15	24	16	26
Computational complexity	CPU utilization (%)	3.2	12.5	8.6	5.8	15.3	11.2
Computational complexity	Memory usage (MB)	12	85	42	28	156	142
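
The relative improvements of the hybrid strategy over PID can be derived from Table 8 with a short calculation. The sketch below copies four "lower is better" rows from the table; the dictionary layout and function name are illustrative choices.

```python
# Selected "lower is better" rows copied from Table 8.
table8 = {
    "RMSE (degC)": {"PID": 3.82, "MPC": 2.64, "ADRC only": 2.15, "TD3 only": 2.93, "ADRC-TD3": 1.47},
    "Average power (kW)": {"PID": 2.72, "MPC": 2.31, "ADRC only": 2.18, "TD3 only": 2.05, "ADRC-TD3": 1.85},
    "Rise time (s)": {"PID": 8.5, "MPC": 6.2, "ADRC only": 4.8, "TD3 only": 7.3, "ADRC-TD3": 3.6},
    "Settling time (s)": {"PID": 48, "MPC": 38, "ADRC only": 31, "TD3 only": 45, "ADRC-TD3": 23},
}


def reduction_percent(reference: float, value: float) -> float:
    """Percentage by which `value` is lower than `reference`."""
    return 100.0 * (reference - value) / reference


improvement_vs_pid = {
    metric: round(reduction_percent(values["PID"], values["ADRC-TD3"]), 1)
    for metric, values in table8.items()
}
for metric, pct in improvement_vs_pid.items():
    print(f"{metric}: {pct}% lower than PID")
```

With these table values the hybrid strategy shows about 61.5% lower RMSE and about 32% lower average cooling power than PID, the latter being close to the 31.9% energy saving quoted in the abstract.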

As shown in Figure 15, a cost–benefit analysis over a 5-year operational period was performed. Although the initial implementation cost of the ADRC–TD3 strategy increased by ∼1500 CNY, the energy savings and reduced maintenance costs allowed the additional investment to be recovered by the fourth year. Over 5 years, the total net benefit was estimated to reach ∼2850 CNY.

Figure 15.

Long-term operating cost–benefit analysis.

Robustness test results

System robustness is a critical factor in ensuring reliable practical deployment. As shown in Figure 16, during the motor parameter perturbation test, thermal capacity and thermal resistance parameters were varied within ±30% of their nominal values. Under these conditions, the performance degradation rate of the ADRC–TD3 strategy was 15.3%, whereas the degradation rate of the PID controller reached 48.7%. These results indicate that the hybrid strategy maintained stable performance despite significant model uncertainty. The robustness improvement can be attributed to the real-time disturbance estimation capability of the extended state observer (ESO) and the adaptive adjustment mechanism of the TD3 supervisory layer. Together, these mechanisms enhanced the controller’s ability to compensate for parameter deviations and external disturbances.

Figure 16.

Performance degradation curve under parameter perturbation.

Table 9 summarizes the robustness test results under various disturbance conditions. The performance degradation rate of the ADRC–TD3 strategy remained below 20% in all six tested disturbance scenarios (8.7% to 18.5%), and its recovery time was consistently the shortest among the compared methods.

Table 9.

Robustness test data under different disturbance types.

Disturbance type	Disturbance intensity	PID performance degradation (%)	MPC performance degradation (%)	ADRC–TD3 performance degradation (%)	Recovery time (s)
Load mutation	±50 kW	42.5	28.3	12.6	4.2
Ambient temperature change	±20 °C	35.8	23.6	10.8	6.5
Coolant flow fluctuation	±30%	38.2	31.4	14.3	5.8
Sensor noise	SNR = 20 dB	28.6	18.5	8.7	2.3
Communication delay	100 ms	45.3	35.2	16.9	3.6
Actuator saturation	Sustained 10 s	52.1	38.7	18.5	7.2
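
The claims drawn from Table 9 can be verified directly from the tabulated values. The sketch below copies the degradation columns and checks the 20% bound and the per-scenario ordering of the three controllers; the tuple layout and variable names are illustrative.

```python
# (disturbance, PID %, MPC %, ADRC-TD3 %, recovery time in s) copied from Table 9.
rows = [
    ("Load mutation", 42.5, 28.3, 12.6, 4.2),
    ("Ambient temperature change", 35.8, 23.6, 10.8, 6.5),
    ("Coolant flow fluctuation", 38.2, 31.4, 14.3, 5.8),
    ("Sensor noise", 28.6, 18.5, 8.7, 2.3),
    ("Communication delay", 45.3, 35.2, 16.9, 3.6),
    ("Actuator saturation", 52.1, 38.7, 18.5, 7.2),
]

pid = [r[1] for r in rows]
mpc = [r[2] for r in rows]
hybrid = [r[3] for r in rows]

worst_hybrid = max(hybrid)
hybrid_always_best = all(h < m < p for p, m, h in zip(pid, mpc, hybrid))
mean_degradation = {
    "PID": sum(pid) / len(pid),
    "MPC": sum(mpc) / len(mpc),
    "ADRC-TD3": sum(hybrid) / len(hybrid),
}

print(f"Worst-case ADRC-TD3 degradation: {worst_hybrid}%")
print(f"ADRC-TD3 < MPC < PID in every scenario: {hybrid_always_best}")
for name, value in mean_degradation.items():
    print(f"Mean degradation, {name}: {value:.1f}%")
```

Averaged over the six scenarios, the degradation is roughly 40.4% for PID, 29.3% for MPC, and 13.6% for ADRC–TD3, with actuator saturation being the most demanding case for all three controllers.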

As illustrated in Figure 17, during the temperature sensor fault simulation, when the primary sensor failed, the system switched to an ESO-based temperature estimation mode within 1.5 s, and the control performance degraded by only 8%. In the actuator fault scenario involving partial pump failure (flow rate reduced by 40%), the controller compensated by increasing fan speed and adjusting valve opening, while maintaining the temperature below the safety threshold.

Overall, the robustness and fault-tolerance results indicate that the proposed hybrid control strategy maintained acceptable performance under a wide range of disturbances and fault conditions. The extended state observer enabled real-time disturbance reconstruction and state estimation, while the TD3 supervisory controller adaptively adjusted cooling commands to compensate for actuator and sensor limitations. This cooperative mechanism prevented abrupt performance collapse and supported graceful degradation, which is essential for reliability and safety in real-world electric vehicle operation.

Figure 17.

Fault-tolerant control performance under fault modes.

The experimental results demonstrate that the ADRC–TD3 hybrid control strategy not only improves quantitative performance metrics such as temperature tracking accuracy and energy consumption, but also enhances dynamic responsiveness, robustness, and fault tolerance. These analytical observations confirm that the proposed method offers a practical and reliable solution for electric vehicle motor thermal management under complex and uncertain operating conditions.

Discussion

The ADRC–TD3 hybrid control strategy proposed in this study demonstrates clear advantages in electric vehicle motor thermal management by combining the fast disturbance rejection capability of active disturbance rejection control with the long-term optimization ability of deep reinforcement learning. Compared with purely classical or purely learning-based controllers, the hierarchical architecture allows each control layer to focus on complementary objectives, improving overall stability and efficiency. In particular, this design reduces the reliance on an accurate thermal model while retaining a clear control structure, which is beneficial for engineering deployment where parameter drift and operating-condition variability are common.

From an engineering perspective, the superior temperature regulation accuracy and energy-saving performance can be attributed to the decoupling of fast thermal disturbances and long-term energy optimization. The ADRC layer effectively suppresses model uncertainties and external disturbances in real time, while the TD3-based high-level controller adjusts control policies based on global performance objectives. This cooperation enables stable performance under dynamic operating conditions, which is particularly important for real-world electric vehicle applications characterized by frequent load changes and environmental variations. Moreover, the hierarchical coordination helps mitigate the typical stability and safety concerns of end-to-end reinforcement learning controllers by constraining the learning component to a supervisory role, thereby improving controllability and interpretability in safety-critical thermal regulation tasks.

Although the proposed strategy exhibits higher computational complexity than traditional control methods, the hierarchical design and task scheduling mechanism ensure that real-time requirements are still satisfied in vehicle-mounted applications. This balance between control performance and computational cost indicates good practical feasibility. Nevertheless, the method currently focuses on motor-level thermal management, and interactions with other thermal subsystems are not explicitly considered. For example, thermal coupling among the motor, power electronics, and battery cooling loop may alter the optimal cooling allocation under real driving conditions, potentially affecting vehicle-level energy efficiency. In addition, the TD3 policy still requires representative training data to generalize across different ambient temperatures, aging states, and hardware configurations. This limitation motivates further investigation into integrated thermal management strategies at the vehicle level, as well as more sample-efficient and transferable learning schemes for reducing calibration and deployment costs.

Conclusion

This paper proposed a hierarchical hybrid thermal control strategy integrating active disturbance rejection control and the Twin Delayed Deep Deterministic Policy Gradient algorithm for electric vehicle motor thermal management. By combining fast disturbance rejection with long-term energy optimization, the proposed framework enables coordinated improvement in temperature regulation performance and cooling energy efficiency. First, hardware-in-the-loop experiments conducted on a 150 kW permanent magnet synchronous motor under standard driving cycles and extreme operating conditions verify that the proposed strategy achieves improved temperature tracking accuracy, reduced cooling energy consumption, fast dynamic response, and strong robustness compared with conventional control methods. These results demonstrate the effectiveness of integrating classical control theory with deep reinforcement learning for managing complex motor thermal dynamics. Second, despite the demonstrated performance advantages, several limitations remain. The deep reinforcement learning component relies on offline training with representative operating data, which may limit direct transferability to motors or cooling systems with significantly different thermal characteristics. In addition, although the hierarchical design alleviates part of the real-time burden, the overall computational complexity is still higher than that of purely classical control strategies. Finally, future work will focus on reducing computational complexity and training requirements through more efficient learning strategies, as well as extending the proposed framework to integrated vehicle-level thermal management systems involving multiple coupled thermal subsystems, such as motors, batteries, and power electronics.

Footnotes

Handling Editor: Xiang Tian

ORCID iD

Yongming Shao

Author contributions

Yongming Shao: writing – original draft, review, and editing, conceptualization. Weifeng Guo: formal analysis, methodology, validation. Shun Lu: conceptualization, formal analysis. Xinyi Chen: methodology, validation.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Specialized Cluster Construction Project (2024TSZY001), the Development and Research of Advanced Manufacturing Technology Experimental Teaching System (2023syyj054), and the Research on Steering Stability Control of Automobiles Based on EPS (zjt24001).

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Data availability statement

The data are available from the corresponding author on reasonable request.
