Abstract
Improving the path tracking performance of two-wheel differential mobile robots is very important, especially in problems that consider dynamic nonlinearities and motor torque constraints. This paper proposes a hybrid controller that combines backstepping control (BSC) with fractional order PID controller (FOPID). The parameters of BSC after being proven globally stable by Lyapunov will be determined by the hedge algebra method. The parameters of the FOPID controller are optimized using an improved metaheuristic algorithm, which is a combination of the wolf optimizer (GWO) algorithm with the slime molding algorithm. This is intended to improve trajectory tracking accuracy. In this work, the optimization process for FOPID is based on a cost function consisting of Integral absolute error and integral squared error, which helps to reduce the position and velocity errors. The controller performance is validated through MATLAB-Simulink with various trajectory scenarios and compared with conventional optimization methods such as standard PSO and GWO. Simulation results show superior trajectory tracking, error reduction and improved control performance.
Keywords
Introduction
In robotic fields, differential wheeled mobile robots (DWMRs) are widely deployed due to their simple structures and practical performance with different types of complex navigation tasks. Different from omnidirectional robots that require sophisticated mechanical systems, DWMRs operate based on a minimalist configuration with two independently driven wheels and one passive caster wheel. 1
Improving trajectory tracking capabilities is important for the DWMR in various applications. For example, autonomous cleaning robots may follow predefined paths precisely to ensure that the area in dynamic environments is covered. 2 Similarly, in leader-follower control scenarios, the robot follows behind a human or another robot. And precise path tracking becomes necessary.3,4 These practical applications usually involve nonlinear and dynamic trajectories. Hence, which necessitate advanced control strategies to ensure stable and accurate tracking performance.
The performance of DWMRs performance is constrained by actuator limitations, especially in terms of velocity and torque that result in nonlinear system dynamics.5,6 As a result, many different control techniques have been proposed to deal with these challenges. In some related work, Li et al. 7 provided an adaptive neural network (NN)-based controller tailored for trajectory tracking in DWMRs with full-state constraints. Another study demonstrated the effectiveness of the backstepping control (BSC) technique in considering nonlinearities in DWMR dynamic control, improving the overall system performance.8–11
Fuzzy set theory has proven effective in constructing flexible, highly adaptable control systems, especially in uncertain environments. However, some limitations remain, such as the heavy reliance on expert experience in establishing control laws, a lack of accuracy in handling large and complex systems, and limited scalability. Classical control techniques, despite their solid mathematical foundation, often require precise models, making them difficult to apply to real-world systems with significant disturbances or nonlinearities. Hedge algebra (HA) is a useful mathematical tool for modeling the linguistic values of a linguistic variable. These values are quantified by real numbers from 0 to 1. Therefore, HA has many advantages over traditional fuzzy set theory (FC) based controllers in terms of setup steps, control performance, computation time and optimization. 12 In Mac et al., 13 the authors used HA as a position controller for the trajectory tracking problem of non-uniform mobile robots. Where HA is combined with improved PSO (APSO) resulting in an optimized version due to better tuned HA. However, the application of HA as a solution to optimize parameters for other controllers has not been mentioned by many studies.
Beyond kinematic control, velocity control plays a critical role in overall trajectory tracking performance. Several studies have focused on improving conventional PID controllers for DWMRs. For instance, Khan et al. 14 optimized a traditional PID controller to enhance velocity tracking, while Ahmad 15 applied the H∞ control technique to improve PID robustness against parameter variations caused by power fluctuations. Although there are many improvements proposed, conventional PID controllers are facing inherent limitations in handling nonlinear system dynamics and transient behavior, primarily due to their reliance on integer-order derivatives. To solve further problems, many researchers introduced controllers based on fractional-order calculus. Abed et al. 16 demonstrated that a neural network-based FOPID controller achieves better velocity tracking than its integer-order counterpart. In addition, Gheisarnejad and Khooban 17 provided the practical implementation of FOPID controllers in mobile robots.
A key challenge in both PID and FOPID controllers is the choice of optimal control parameters. Both manual random tuning methods often result in suboptimal system responses. To solve this problem, there are some methods that use bio-inspired swarm intelligence algorithms that have gained popularity. The methods require only the evaluation of a cost function rather than analytical gradients or integrals. 18 Several papers have focused on the problem of intelligent control and optimization for systems with nonlinear, uncertain, and time-varying characteristics. To reduce dependence on model inaccuracies, studies often use non-model-based or semi-model-based intelligent controllers, such as BELBIC, fuzzy logic, and FOPID.19,20 These are then combined with modern metaheuristic optimization algorithms such as NSGA-III, GA, COA, MOQSOA, etc., to tune controller parameters, aiming to balance global exploration and local exploitation.21–23
In the context, some work used Particle Swarm Optimization (PSO) to enhance path planning for DWMR under velocity constraints. 24 This work proposed by Aner et al. 25 presents a PSO-optimized PID controller that overcomes a PSO-optimized fuzzy logic controller (FLC) in trajectory tracking. Moreover, grey wolf optimizer (GWO) has been used to tune adaptive fractional-order fuzzy PID controllers, 26 and hybrid schemes combining GWO with LQR and FOPID have also been reported to enhance DWMR performance. 27
Although numerous studies have investigated either kinematic controllers (such as backstepping controller (BSC)) or dynamic controllers (such as PID/FOPID) separately, integrating a kinematic controller with PID or FOPID controllers significantly enlarges the parameter space to be optimized. This results in greater computational demands. As noted by the authors, the optimization of such hybrid control architecture remains an underexplored area in existing literature. Based on the existing limitations as presented, our research focuses on developing a more effective control strategy, with highlights including. • A hybrid optimization strategy combining the algorithm with the Slime Mushroom (SMA) algorithm is developed for 10 parameters of the FOPID controller of the dynamic element. • A cost function integrating integral absolute error (IAE) and integral squared error (ISE) is proposed. The weights of the cost function consider the effects of positional and angular errors. • Hedge algebra is designed and used to determine the parameters of the BSC. • A closed-loop control architecture integrating the BSC with a fractional order PID controller (FOPID) is proposed, where the FOPID parameters are optimized with velocity constraints. • Simulations were performed with complex trajectories, including Lemniscate, Cloverleaf and Sinusoidal trajectories. Data showed that the new optimization techniques improved over other conventional methods such as PSO
25
and GWO
28
in terms of accuracy and cost function minimum target.
The rest of this paper is organized as follows. The background and research methods including Monitoring Control System, Backstepping based Trajectory Controller and FOPID based speed controller are provided in Section 2. In section 3, the combination of GWO and SMA is addressed. Section 4 provides the Improved GWO optimization algorithm. Simulation results are discussed in Section 5. Finally, Section 6 provides conclusions and future work.
Background and research methods
The DWMR is equipped with two independently driven rear wheels and a passive front wheel that provides balance, as depicted in Figure 1. In this figure, Q represents the midpoint of the wheel axis, G denotes the center of mass, and b is the distance between points G and B. Point Q = [xQ, yQ]T, lies on the XY plane. TWMR mobile robot.
The standard kinematic model describing the relationship between the velocity (u, ω) and state (x, y, Ψ) of a differential robot is expressed as equation (1):
Additionally, a denotes the wheelbase of the robot, Ψ represents its posture (orientation), and γ corresponds to the angular velocity. The mathematical model of the DWMR is defined as follows
29
:
The above DWMR kinematic and the dynamic model only show the motion and speed characteristics of the mobile robot under ideal conditions. It does not consider the practical limitations due to mechanical constraints. The inputs of the linear velocity and angular velocity of the vehicle for the dynamic model are limited to the following ranges:
Monitoring Control System
In this study, a closed-loop control structure is proposed to control the motion of the DWMR. The inner loop uses a FOPID controller for angular velocity and linear velocity. Given the current velocity of the robot
The outer loop is the BSC controller for the kinematic model of the mobile robot. The BSC ensures stability of the position and attitude tracking of the mobile robot. When the reference trajectories for the robot are set to
The Saturation blocks shown in Figure 2 are used to constrain the velocity outputs from the FOPID controller. This reflects practical limitations, as motors and actuators are subject to torque constraints, which in turn restrict both angular and linear velocities. Overview of the cascade closed-loop control structure.
Backstepping based Trajectory Controller
Signals from the reference orbit, in addition to providing information about the desired position and attitude coordinates, also provide the desired linear and angular velocities:
BSC is a nonlinear control technique based on Lyapunov theory. It allows to construct control laws from Lyapunov functions. For the position and attitude of the DWMR, the position error transformation is expressed as follows:
To ensure the stability of the learning controller, the Lyapunov function is chosen as follows:
Determine two variables
Substitute equation (10) into equation (9) and then simplify as follows:
To determine the three parameters
The HA calculation process is carried out in six main steps including: • Determine three input variables which are position and direction errors ( • Normalize the input variables in the domain [-1,1] and determine the semantic domain for the output variables in [0,1]. • Divide the semantic domain into 9 linguistic value levels corresponding to the fuzzy levels “very low”, “low”, “slightly low”, “medium”, “slightly high”, “high”, “very high”, etc. Each level is quantified by the SQM value φ_i∈{0.0,”“ 0.125,”“ 0.250,”“ 0.375,”“ 0.500,”“ 0.625,”“ 0.750,”“ 0.875,”“ 1.000}. • Construct functions belonging to triangle • Establish quantitative semantic mapping for each control parameter through the relationship in formula (12): • Solve the approximate reasoning problem to determine the output quantitative semantic value, from which interpolate the quantitative semantic curve and determine the actual control value
Reference semantic quantitative metrics (SQM) for 9 levels.
HA rule basis for 2 variables e_x and e_y.
FOPID based speed controller
Two FOPID controllers are designed to control the dynamics of the DWMR. The linear velocity and angular velocity are controlled separately.
The transfer function of the FOPID controller is (15):
The FOPID controller consists of five parameters
The schematic diagram of the FOPID controller is shown in Figure 3. Internal configuration of the loop speed control.
The outer-loop recursive filter is given by :
The transfer functions of the linear velocity controller and the angular velocity controller based on FOPID for mobile robots can be expressed by the following equations:
The velocity saturation components are limited by the following piecewise function:
FOPID velocity controller under the velocity constraints as above. The DWMR closed-loop system is represented by state-space equations:
Therefore, optimizing the parameters of the velocity controller is essential to ensure accurate trajectory tracking by the robot. To this end, we propose a hybrid optimization strategy that integrates the Grey Wolf Optimizer (GWO) with the Slime Mold Algorithm (SMA).
Optimal combination of GWO and SMA
GWO and SMA are two commonly used hyper-visual optimization algorithms in solving engineering problems. However, they operate on different search mechanisms. SMA is inspired by the behavior of slime molds in nature, while GWO is inspired by the social structure and hunting patterns of gray wolves. Although both are effective, GWO usually has good sensitivity and early convergence parameters. SMA strikes a balance between exploration and exploitation. In the context of trajectory tracking for DWMR, the optimization task becomes more complicated. This study combines GWO and SMA into a hybrid optimization method to improve the trajectory tracking performance of DWMR.
The Grey Wolf Optimizer (GWO) was first introduced by Mirjalili et al.
30
and has since been widely adopted in various scientific and engineering optimization problems. In the algorithm, each grey wolf represents a possible solution in a D-dimensional searching space. There are top three fittest wolves, denoted as α (alpha), β (beta), and δ (delta), identified during each iteration and responsible for guiding the rest of the pack towards the optimal solution. In the following equations, the position of the i-th wolf
The SMA provides the foraging behavior of slime molds, balancing between exploration and exploitation.
31
The updating position mechanism of the SMA is managed by adaptive weights. The adaptive weights are calculated based on the weight vector W, which affects the search direction and the intensity:
Initially, the agents adjust their positions based on the adaptive weights to explore the search space effectively and move toward promising regions. They then proceed to search for optimal paths leading to the food source. Once potential food sources are located, agents surround the best one and update their positions accordingly. This process ensures a balance between exploration and exploitation, improving convergence towards the optimal solution. The position update equation is as follows:
Improved GWO optimization algorithm
To effectively solve the optimization problem presented in Part 3. The operating mechanism of the hybrid algorithm as well as the objective function will be presented in this section. To improve the optimization process, the search space
The pseudocode for the hybrid optimization technique is shown in Algorithm 1:
The SMA is carried out in
In order to validate the effectiveness of the proposed method, the hybrid method for BSC-FOPID optimization, there is a comparison with PSO,
25
and GWO.
28
The maximum number of iterations and the population size are set to be uniform to 25 and 60, respectively. The parameters of the DWMR system are:
The actual position of the DWMR in the global coordinate system is
In order to demonstrate the generalization ability of the optimized controller and ensure objective evaluation, the Cloverleaf and the Sin trajectory is added to validate the controller’s performance. This trajectory is mathematically described as follows
33
:
Results and discussion
Setting initial values and limits of FOPID parameters.
The controller simulation diagram is built on Simulink as shown in Figures 4 and 5. Simulation diagram on Simulink. Building cost function in Simulink.

The surface has a “symmetrical fold” shape, showing a clear change of Semantic aspect of k1 according to e_x and e_y when determined by HA.
The surface has a symmetrical “V” shaped fold, similar to Semantic aspect of k2 according to e_x and e_y when determined by HA.
The 3D surface is smoothly curved and symmetrical, showing a soft nonlinear relationship between angular velocity and deviations. When the deviation Backstepping output angular velocity when determining parameters using HA.
The smooth, continuous surface form demonstrates the smooth interpolation capability of the HA system during the determination of the Backstepping output velocity when parameterizing with HA.
From the trajectory responses of DWMR and reference trajectory, the position and angle errors are calculated and used to build the cost function for the optimization process. Details are shown in Figures 10,11, and 12. In all three cases, our solution for robot trajectory closely followed the reference trajectory better than GWO and PSO, especially in areas with large curvature and rapid turns. Compared to the other two methods, GWO-SMA reduced positional error, limited phase lag and amplitude oscillation, and improved the system’s convergence speed. Lemniscate orbit response of optimization techniques. Cloverleaf orbit response of optimization techniques. Sin orbit response of optimization techniques.


The cost function, in addition to combining the two indexes ISE and IAE to evaluate the optimization process as presented in Part II, the author also builds the ITAE index to be able to expand the evaluation and build new cost functions for subsequent studies.
The trajectory tracking simulation results show that the optimal solution with improved GWO (red line) tracks the set trajectory better than GWO (blue line) and PSO (yellow line). Especially in the Sin trajectory, the obtained trajectory has high agreement with the set trajectory.
The x- and y-axis deviation results are shown and compared as in Figure 13(a),(b). It can be seen that the GWO-SMA method shows superior performance compared to PSO and GWO in all aspects. Specifically, the positional error along the x and y axes of GWO-SMA is mainly in the range of ±0.03–0.05 m, while GWO and PSO achieve approximately ±0.10–0.12 m and ±0.18–0.22 m respectively, representing a reduction in error of up to 70–80% compared to PSO. Although there is a fluctuation of the deviation, the convergence speed to zero of the y-axis deviation is under 10 s. Error results along x-axis, y-axis and theta angle deviation comparison. a) Error results along x-axis b) Error results along y-axis c) Theta angle deviation comparison.
While the pure GWO optimization and PSO have much slower convergence speeds.
Comparing the Theta angle error (Figure 13(c)) shows that our method, although having a larger initial error than GWO (approximately 0.75 rad), has the fastest convergence speed (7.02 s). PSO has the largest error and convergence speed among the three methods.
The angular orientation error is also significantly reduced, from approximately ±1.0 rad (PSO) and ±0.5 rad (GWO) to ±0.15–0.2 rad with GWO-SMA. Regarding velocity error, the maximum amplitudes of linear and angular velocities decreased from 1.1 to 1.2 m/s and ±0.8–1.0 rad/s (PSO), respectively, to approximately 0.1–0.15 m/s and ±0.1–0.15 rad/s when using GWO-SMA (Figure 14). The inertial velocity and angular velocity errors show that the improved GWO gives better results than GWO and PSO; especially the angular velocity error shows superiority. Although the inertial velocity error does not show a clear improvement, the oscillation amplitude and the state transition in the trajectory sections with sharp turns show smoother results than the pure PSO and GWO. Comparison of linear velocity and angular velocity between optimization methods. a) Linear velocity error b) Angular velocity error.
Setting parameters for each optimization algorithm.
After the first 4 iterations, the objective function has changed significantly, and the final Best Fitness is 23.1754 (Figure 15). The parameters of the 2 FOPID sets controlling the linear velocity and the obtained angular velocity are: [kp1, ki1, kd1, λ1, μ1, kp2, ki2, kd2, λ2, μ2] = [15.8028, 29.9719, 1.2855, 0.9937, 0.4726, 16.2090, 7.7351, 9.0467, 0.3469, 0.9689]. Change of objective function over 25 iterations with improved GWO - SMA optimization technique.
GWO optimization shows that the objective function starts to decrease sharply from iteration 15 onwards. However, the final Best Cost value obtained after 25 iterations is still 73.391023 (Figure 16). The parameters of the final FOPID obtained are: [kp
1
, ki
1
, kd
1
, λ
1
, μ
1
, kp
2
, ki
2
, kd
2
, λ
2
, μ
2
] = [7.2634, 4.5317, 19.7268, 0.3842, 0.9968, 0.0491, 0.0752, 0.7005, 0.0621, 0.9125]. Change of objective function over 25 iterations with GWO optimization technique.
The PSO technique shows that the objective function has a much smaller change than GWO and improved GWO. This may be because the number of iterations 25 is small compared to the PSO technique while the optimization space and the number of parameters of the optimization process are large (Figure 17). The final Global Best Cost of PSO is 41.0424 and the FOPID parameters obtained after optimization are. The PSRs (Probability of Random Strategy Switching) parameter we chose is 0.34. Change of objective function over 25 iterations with PSO optimization technique.
[kp 1 , ki 1 , kd 1 , λ 1 , μ 1 , kp 2 , ki 2 , kd 2 , λ 2 , μ 2 ] = [23.4820, 18.2709, 14.3248 0.6651, 0.7969, 14.9474, 13.5385, 8.0807, 0.3314, 0.7090].
Summary comparison of performance indicators of optimization methods.
Conclusions and future work
This study proposes an innovative solution for trajectory tracking of DWMRs by integrating a Backstepping Controller (BSC) with a Fractional Order PID (FOPID) controller within a closed-loop control system. We have applied hedge algebra to determine the parameters for the Backstepping controller. The improvement of an enhanced Grey Wolf Optimization (GWO) strategy, in integration with the Slime Mold Algorithm (SMA), proven that it leads to improved trajectory tracking performance. By integrating both ISE and IAE into the cost function with carefully selected weights, the proposed method achieves better balance between exploration and exploitation while compared to conventional optimization techniques such as PSO and standard GWO. The results are verified with the efficiency of the proposed BSC-FOPID control approaches. Despite its contributions, our BSC–FOPID method, combined with HA-GWO–SMA optimization techniques, still has some limitations. The selection of weights in the ISE–IAE objective function is still based on experience, and we have only compared and evaluated it with other techniques based on the objective function we developed.
For future work, there are potential research directions that can expand upon these findings. The improved GWO optimization framework could be deployed to other types of mobile robots or robotic systems to evaluate the adaptability and the effectiveness across different applications. In addition, further improvement of the weighting parameters or integration of alternative metrics could reduce late-stage errors in the optimization process and enhance system performance in complex environments. Additionally, the improved GWO against other nature-inspired algorithms such as the Whale Optimization Algorithm (WOA) and Ant Colony Optimization (ACO) could be further exploited. Finally, we could integrate machine learning techniques to detect and mitigate wheel slip in real-time to increase the robustness and adaptability of DWMRs in unpredictable and dynamic settings.
Footnotes
Acknowledgements
The authors would like to thank Thai Nguyen University of Technology (TNUT), Viet Nam, for the support.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
