Abstract
Safe and efficient navigation for multi-robot systems in cluttered, dynamic environments remains challenging, primarily due to uncertainties from internal disturbances and dynamic external conditions. While existing methods enable real-time navigation in static or sparse settings, they often fail to respond effectively in environments dense with both static and moving obstacles. To address this limitation, this article proposes a novel safety probability field-based extended state model predictive control (SPF-EMPC) planner. The framework first introduces a safety probability field to model dynamic obstacles and integrates it with an unconstrained optimization approach for online generation of collision-free trajectories. Subsequently, an extended state model predictive controller ensures accurate trajectory tracking by explicitly accounting for robot model constraints and state perturbations, thereby guaranteeing practical feasibility. Both simulations and physical experiments demonstrate that the proposed method reliably prevents inter-robot and robot–obstacle collisions, even under significant motion and control uncertainties.
Introduction
The deployment of autonomous multi-robot systems in real-world applications is rapidly expanding, spanning areas such as post-disaster search and rescue, 1 forest fire detection, 2 logistics and warehousing, 3 and infrastructure damage inspection. 4 Such scenarios require systems that can not only navigate agilely through cluttered, unstructured environments but also maintain robustness against both internal disturbances and dynamic external conditions. However, achieving simultaneous resilience to these dual challenges remains an open problem in current research.
In cluttered and dynamic environments, mission failures in large-scale multi-robot deployments typically stem from two interrelated sources of uncertainty: environmental uncertainty and robot-induced uncertainty. Environmental uncertainty refers to limitations or inaccuracies in perception, which hinder robotic swarms from reliably detecting and fully avoiding dynamic obstacles. This perceptual gap creates significant collision risks, especially in densely populated and highly mobile scenarios. Velocity Obstacle (VO) and its variants, such as Reciprocal Velocity Obstacle and Hybrid Reciprocal Velocity Obstacle,5–7 are widely used to address dynamic obstacles in such settings. However, these methods remain limited in handling prediction uncertainty: errors in position and velocity measurements directly influence the computation of the VO cone, potentially introducing collision risks. Meanwhile, ego-series algorithms,8–10 which perform well in dense static environments, generally assume known or constant obstacle trajectories when dealing with dynamic obstacles—an assumption that often does not hold in practice, where trajectories are typically unknown and highly variable. Robot-induced uncertainty, on the other hand, arises from inaccuracies in the robots’ physical models (e.g. dynamics parameters) or their localization systems. These errors introduce internal perturbations that can degrade control performance, resulting in inter-robot collisions or significantly prolonged mission times. To explicitly address internal uncertainties, PUMA 11 addresses internal uncertainty by integrating uncertainty propagation into the optimization constraints, dynamically adjusting collision avoidance bounds and velocity limits based on predicted state covariances. Similarly, SWIFT 12 handles internal uncertainties (e.g. sensor noise and motion drift) through a confidence-guided cross-attention mechanism to selectively weight neighbor interactions. Despite these advances, simultaneously handling unpredictable dynamic obstacles and internal system uncertainties remains a daunting challenge.
To solve these problems, we propose a two-layer robust framework for multi-robot motion planning: the safety probability field-based extended state model predictive control (
Further, we perform a distributed deployment in a real-world environment as shown in Figure 1. The environment is unknown in advance, and robots need to reach their destinations safely without formation in a cluttered environment, in which the irregular movement of dynamic obstacles and imprecise control of the robots pose potential collision risks. The main contributions are summarized below:

Illustration of multi-robot navigation in a cluttered environment using the proposed safety probability field-based extended state model predictive control (SPF-EMPC) framework. (Top) The three-wheeled omnidirectional mobile robot platform used for real-world experiments. (Bottom) Physical navigation scenario involving static obstacles, a moving obstacle (highlighted with a green marker within a red circle), and multiple robots. Colored markers beneath obstacles represent perception data from different robots, demonstrating the distributed perception and real-time planning capabilities of the system.
Online trajectory generator for dynamic obstacle avoidance with uncertainties: We construct a safety probability field for obstacles that takes into account obstacle velocity and position perturbations, and apply an unconstrained optimization method to generate trajectories online for multiple robots in cluttered environments. Robust trajectory tracker for precise robot following considering perturbations: We design extended state model predictive control to filter robot localization and model perturbations, and accurately track the generated trajectories using a temporal self-sampling method. Simulation and physical validation: We conducted multi-robot traversal experiments in both simulation and physical environments to validate real-time performance and effectiveness of the proposed method.
Related work
In recent years, significant progress has been made in trajectory planning for obstacle avoidance using voxel maps13,14 and optimization methods.15,16 The space occupied by the obstacle is usually represented as a set of constraints on the free space, and the selection of constraint type determines the performance of the optimization problem. In deterministic environments, some work, such as ego-serials,8–10 achieve real-time navigation in complex obstacle environments, but treat dynamic obstacles as instantaneous static obstacles or cooperative robots, and the voxel maps they rely on do not adequately characterize dynamic obstacles. For obstacles with motion perturbations, there are two main strategies in the literature to characterize them: using differentiable surfaces such as ellipsoids, 17 or using convex polyhedra such as rectangles.18,19
The obstacle can be described by a differentiable surface as a nonlinear constraint on the optimization problem, which can be efficiently solved by a gradient-based solver. 15 To facilitate real-time processing, conservative estimates of obstacles with motion uncertainty, such as sigma hulls based on confidence levels. 20 Although the method does not guarantee global optimality, its reduced computational cost makes it widely adopted for most time-critical motion planning tasks. 21 The overly conservative obstacle models guarantee collision-free behavior, they greatly consume the remaining free space, thus further diminishing the feasibility of time-critical navigation tasks in cluttered environments containing multiple robots and obstacles. 17 In addition, existing methods only make conservative estimates on the obstacle volume and do not fully utilize other information of dynamic obstacles.
Alternatively, obstacles can be characterized by convex polyhedra occupied in the space, using the occupied space as a hard constraint to be avoided in the optimization problem. Xu et al. 19 use the convex polyhedral space of obstacles to construct safe flight corridors to ensure collision-free trajectories for static obstacles. In recent years, model predictive control (MPC) methods22–24 have emerged actively as a dynamic obstacle avoidance method. MPC can be combined with dynamic obstacle avoidance to generate and track trajectories that satisfy the robot’s motion and actuation constraints. In the work, 25 moving obstacles are packed into ellipses, and spatial constraints are imposed as optimization metrics. To further consider obstacles perturbations, Jian et al. 26 integrate a control barrier function within the MPC framework to avoid pedestrians. There are also several work27,28 that utilize geometric representations and MPC to probabilistically avoid dynamic obstacles. Recent efforts have made some progress in computational efficiency,17,27 but their computational cost grows exponentially with the number of obstacles and robots, and they are unable to meet real-time requirements.
In this article, we combine the advantages of efficient computation of the gradient-based planner and robust control of the MPC to construct a two-layer framework that is easy to extend. In order to prevent cooperative robots from becoming dynamic obstacles due to control errors from localization and model perturbations, this article integrates the interference between obstacle motion and robot control in cluttered environments, which is different from previous work shown in Table 1. The designed safety probability field can fully utilize the size and velocity information of dynamic obstacles, and the extended state observer can utilize the historical information of robot states. The proposed method can effectively cope with perturbations, and generate and track safe trajectories for multi-robot in cluttered environments online.
Comparison of recent motion planning methods for dynamic obstacle avoidance based on key attributes in the table.
EGO: ESDF-free Gradient-based lOcal; MADER: Multiagent Trajectory Planner Robust to Communication Delay in Dynamic Environments; RMADER: Robust Multiagent Trajectory Planner Robust to Communication Delay in Dynamic Environments; DPMPC: Dynamic Polynomial-based Model Predictive Control; SPF-EMPC: safety probability field-based extended state model predictive control.
Methodology
System framework overview
This work introduces a multi-robot distributed trajectory planning framework encompassing three principal components: the sensing and position module, the local trajectory planning module, and a trajectory tracking module, as shown in Figure 2.

Schematic overview of the proposed SPF-EMPC framework. The system operates in a closed-loop manner: upon local sensing of a dynamic obstacle, a real-time SPF is generated. This SPF, along with the local static map, informs an unconstrained local trajectory planner to iteratively compute a collision-free path. The resulting time-parameterized trajectory is then accurately tracked by an EMPC, which compensates for robot state and model perturbations. Throughout this process, perception and localization modules provide continuous state feedback and collision checking. SPF-EMPC: safety probability field-based extended state model predictive control; SPF: safety probability field; EMPC: extended state model predictive controller.
Each robot has a distinct sensing range within this distributed system for its planning and control processes. And trajectories are communicated through a prioritization mechanism (This article sets a fixed priority according to the robot number), robots with a lower priority need to consider obstacle avoidance with other robots with a higher priority. The whole system faces disruptions from moving obstacles, robot’s localization and model.
Safety probability field
In contrast to the fixed nature of static obstacles, the future state of dynamic obstacles similar to pedestrians cannot be accurately predicted, making it difficult to effectively generate safety trajectories. Therefore, this section constructs a safety probability field to describe dynamic obstacles with motion perturbations.
This work enhances the real-time performance and effectiveness of obstacle avoidance by referring to studies on pedestrian prediction10,31 and introducing an elliptic dynamic probability field with a binary Gaussian distribution, as shown in Figure 3. This elliptical safety probability field with motion bias describes the moving intention of the obstacle and degenerates into a general circular field when the velocity is zero (static obstacle), providing accurate guidance for the robot to avoid obstacle. By avoiding regions with high collision probability, safe trajectories can be generated that consider finite future states and avoid being overly conservative.

The adaptive safety probability field (SPF) around a dynamic obstacle. The elliptical field, shaped by the obstacle’s current velocity and size, visualizes collision risk (color gradient). The dashed circle indicates the obstacle’s predicted position at the next time step.
The probability field is designed concerning the Gaussian distribution as follows:

Safety probability field illustrations for varying obstacle velocities. The green circle indicates the robot’s current position.
For the covariance matrix
Assuming that the dynamic obstacle can be wrapped by a minimum outer circle with radius
With the real-time motion of dynamic obstacles, the safety probability field is dynamically updated according to their latest position, velocity, and dimensional information. This real-time adaptive mechanism enables the robot to consistently assess the threat intensity imposed by nearby moving obstacles
Multi-robot trajectory generation
In cluttered environments with multiple static and moving obstacles, the rapid generation of safe trajectories is essential for robots to respond to environmental changes on time and navigate safely. In this section, an unconstrained optimization method based on safety probability field is used to study the multi-robot trajectory generation problem.
Trajectory representation
For the following system:
According to the differential flatness property, the MINCO (Minimum Control) trajectory parameterization technology
33
can simplify high-dimensional trajectories into a series of waypoints
Optimization problem construction
According to equations (14) and (15), the omnidirectional mobile robot studied in this paper has a differential flatness property similar to the quadrotor when the flat output
The safety probability field is employed to find guidance points for estimating the collision cost and gradient required by the gradient-based optimizer. The process of setting specific guidance points is illustrated in Figure 5. SPF can calculate the probability that a collision occurs at a given point, and a collision probability exceeding the prescribed threshold indicates a potential collision. When a collision risk is detected for the given waypoints within the current trajectory, the guide points will be sought according to the safe gradients of the waypoints to generate a collision-free trajectory. To ensure completeness of the trajectory solution, an alternative trajectory is obtained on the other side according to the symmetry line. Ultimately, the optimal trajectory is selected based on a comparison of the costs of the trajectories. It should be noted that the optimization-based approach itself is susceptible to oscillations due to the dynamic nature of the environment. However, engineering settings can improve this limitation, for example, by automatically adjusting the threshold based on the number of oscillations.

Trajectory optimization using the safety probability field (SPF). The initial trajectory (green curve) enters the SPF of a dynamic obstacle. A risky waypoint (light green dot) is identified and projected along the collision probability gradient (black arrow) to a safe guidance point (blue dot). An alternative trajectory (blue dashed curve) is then generated on the obstacle’s opposite side, guided by a symmetry line (purple dashed) through the risk zone.
According to the SPF, each dangerous waypoint
The gradient of the collision cost concerning the trajectory polynomial coefficients
Trajectory tracking control
In practice, robots are subject to both internal and external disturbances. External disturbances introduce localization errors, which compromise the accuracy of control force computation, while internal disturbances cause model mismatches that degrade the application of control forces. Together, these perturbations induce control uncertainties, resulting in trajectory tracking errors and reduced navigation efficiency. Consequently, robust controllers capable of explicitly accounting for such disturbances are essential for safe and reliable operation. Since both localization inaccuracies and model deviations manifest as state uncertainties, this section presents an extended state observer designed to estimate these compounded perturbations. Combined with a self-sampling model predictive control scheme, the proposed approach enables precise and robust tracking of iteratively planned trajectories.
Robot model
The three-wheeled omnidirectional mobile robot employed in this article is characterized by its flexible motion capabilities, as depicted in Figure 6.

Schematic diagram of an omnidirectional mobile robot model.
The positive direction of the robot body coordinate system’s x-axis corresponds to the wheel axis direction, and the wheels are evenly distributed at intervals of
Extended state MPC
The effects caused by model error and localization noise can be jointly attributed to the uncertainties of the robot states. According to equations (14) to (16), the discrete state transfer equation can be expressed as follows:
After obtaining the estimated state, this work fits a polynomial trajectory using the positions, velocities, and timestamps of adjacent waypoints from the reference trajectory
Results and discussion
To further assess the efficacy of the proposed method, we perform a series of experiments in both simulation and real-world scenarios. The L-BFGS algorithm 35 and the CasADi toolkit 36 are utilized to address unconstrained trajectory optimization and optimal control problems, respectively. These algorithms are integrated into the ROS environment through C++ code. Separate ROS nodes have been engineered to allow a transition between simulation and physical application without altering the codebase, operating in a distributed manner on an Intel i9-13900K. The full experimental video is available at https://www.bilibili.com/video/BV1xpmNYJE5F/.
Figure 7 shows the execution time of the trajectory generator and the tracking controller, demonstrating the superior real-time performance of combining the two parts. The trajectory generator is capable of avoiding obstacles in cluttered environments in real time. Although it requires additional time to construct the SPF when encountering dynamic obstacles with motion perturbations, the average optimization time is still within 2 ms. The tracking controller focuses on the robot’s control problem and can compute control quantities that meet the robot’s constraints within 8 ms, taking into account state perturbations induced by localization and model. The total duration of the overall framework is <10 ms, allowing for a timely response to dynamic changes in the environment.

Algorithm running time, where
Experimental setup
For trajectory planning, we set the attention in equation (2)
To describe the perturbations in the environment, we assume that the state of the
Simulation experiment
A multi-robot simulation is conducted in a complex dynamic environment, as depicted in Figure 8. Each robot is capable of independently avoiding obstacles and safely reaching its destination. Their target positions are set in opposing configurations to ensure a complete interaction among the robots.

Multi-robot navigation in cluttered dynamic environments using SPF-EMPC. Upside: Twelve robots navigate among static (squares) and dynamic (circled) obstacles with distributed perception (colored markers). Downside: An individual robot tracks iterative trajectories via extended state MPC while accounting for estimated state uncertainty (green region) and a dynamic obstacle (purple circle), demonstrating simultaneous handling of internal and external uncertainties. SPF-EMPC: safety probability field-based extended state model predictive control; MPC: model predictive control.
Due to the lack of source code for related work and differences in scenario configurations, this paper compares the trajectory generation aspects with the current state-of-the-art method and performs an ablation study of the components of the overall framework. These evaluations are intended to demonstrate the necessity and superiority of the proposed framework.
Comparison of trajectory generation
The local planner EGO-Swarm2 37 and its variant with a accurate prediction of dynamic obstacles serve as benchmarks. The accurate prediction variant treats obstacles as collaborative robots that share their future accurate trajectories at all times, which is too ideal to be realized in reality, but is used here as a comparison to validate the performance of the proposed method. The comparative data at varying levels of perturbation are presented in Table 2. Among the performance metrics, the overtime rate indicates the likelihood that the robot is trapped in the environment, failing to reach its target. The number of iterations refers to the mean number of optimizations required for a single robot within a single experimental trial.
Quantitative comparison under motion uncertainty.
SPF-EMPC: safety probability field-based extended state model predictive control; SR: success rate; OR: overtime rate.
Performance of the proposed SPF-EMPC is benchmarked against EGO-Swarm2 (ES2) and its ideal variant with perfect prediction (ES2*). Results are averaged over 50 Monte Carlo trials conducted in a cluttered randomized environment (five robots, four dynamic, and six static obstacles) under three perturbation levels (0.25
We conduct the simulations in a spatially limited environment containing multiple irregular moving obstacles with speeds close to the robot’s, making it difficult to achieve full obstacle avoidance. As illustrated in Table 2, the success rate of the three methods declines gradually with increasing perturbation level. Notably, ES2 exhibits the lowest overtime rate and requires the shortest distance and time to navigate. However, ES2 perceives dynamic obstacles as static in real-time and does not account for motion perturbations, resulting in the lowest success rate and most iterations. The variant ES2*, which assumes that obstacles follow perfectly predictable trajectories, markedly enhances the success rate and iteration count compared to ES2. Nevertheless, it tends to induce oscillations when confronted with uncertain obstacles, leading to a higher overtime rate. Moreover, its ideal prediction model is challenging to implement in practical scenarios. In contrast, the proposed method achieves a considerably higher success rate than ES2 and nearly matches the ideal ES2* performance, with the fewest iterations and dynamic distance. The result indicates that the SPF-based unconstrained planning approach converges rapidly and avoids excessive conservatism in generating safe trajectories.
Ablation comparison
The superiority of the proposed two-layer framework in real-time is shown in Figure 7. To confirm the efficacy of each component of our proposed approach in terms of obstacle avoidance success rate under consideration of perturbations, we performed an ablation comparison, as shown in Table 3. Specifically, in the variant lacking self-sampling (SS), the trajectory referenced in equation (19) is substituted with a generic MPC based on the initial discrete trajectory. For the version without SPF, the speed of the safety probability field is set to zero, degenerating into a circular field with the minimum safe distance.
Performance comparison of the proposed SPF-EMPC with other versions at different perturbation levels (
SPF-EMPC: safety probability field-based extended state model predictive control; SR: success rate; OR: overtime rate; TT: travel time; SS: self-sampling; SPF: safety probability field; ESO: extended state observer.
The data in the table show that even under ideal localization conditions (
Enhanced performance validation
In this section, we further validate the performance of the proposed algorithm in both narrow environment and dynamic obstacle environment. To evaluate the collaborative obstacle avoidance capability of the proposed algorithm in constrained spaces, we constructed a narrow passage crossing scenario involving eight robots, as illustrated in Figure 9. The passage width was configured at 1.2 times the robot diameter, with initial robot positions symmetrically distributed at both ends of the passage and target positions set on the opposite side. Experimental results demonstrate that the robots initiated coordinated avoidance maneuvers at 16 seconds, formed an orderly queue by 23 seconds, and successfully completed the passage crossing within 30 seconds, exhibiting continuous trajectory curvature without any deadlock situations. To address the motion uncertainty of dynamic obstacles, the position deviation of the obstacle is set as

Coordinated navigation of eight robots through a constrained narrow passage. The sequence demonstrates the efficacy of the proposed safety probability field-based extended state model predictive control (SPF-EMPC) framework in dense, structured environments: At
For clarity, dynamic obstacles are displayed in a circular configuration, as shown in Figure 10. The robot avoids obstacles in real time by leveraging the safety probability field generated by dynamic obstacles. At t=16 seconds and t=20 seconds, the robot proactively avoids the obstacles by predicting their movement trends based on the probability field.

Eight robots navigate through a dense environment with two dynamic obstacles. (a)
Physical experiment
In a real-world scenario, a scaled-down platform,
38
measuring
Figure 11 shows an example of actual running. In the experiment, several static obstacles and one dynamic obstacle are set up to hinder the movement of the robots. The motion of the dynamic obstacle, the interference at the platform seams, and the robots’ traveling all pose challenges. Robots need to utilize their limited perception and communication capabilities to avoid various obstacles and other robots, and ultimately reach the goal point. Practical experiments demonstrate that the proposed method can enable the safe navigation of multiple robots in a dynamic environment. In this process, the results of the velocities returned by the robot are shown in Figure 12. The profiles confirm that all control commands remained within actuator limits, validating the dynamic feasibility and constraint compliance of the proposed trajectory tracking controller.

Physical validation in a dynamic environment. (a) Setup with eight robots, eight static obstacles, and a dynamic obstacle. (b) A robot adjusts its trajectory in real-time to handle the irregular motion of the dynamic obstacle and localization inaccuracies at platform seams. (c) All robots reach their target positions collision-free, demonstrating the robustness of the safety probability field-based extended state model predictive control (SPF-EMPC) framework under real-world uncertainties.

Wheel speed profiles from physical experiment, showing the measured linear velocities
Discussion
In this work, we propose SFC-EMPC, a novel method that demonstrates superior performance to the EGO-Swarm2 baseline in multi-robot cooperative motion planning under uncertain dynamic obstacles. Specifically, it achieves a 40% higher success rate and a 75% improvement in optimization efficiency in such challenging environments. Compared to the conventional MPC method, the proposed approach not only achieves the shortest path time in the presence of state perturbations, but also improves the collision avoidance success rate by 46% while tracking time-varying trajectories under such perturbed conditions.
The proposed method achieves higher success rates and optimization efficiency against uncertain dynamic obstacles due to two key mechanisms. First, a Gaussian-based safety probability field (SPF) models obstacle motion intention, providing a predictive ‘‘look-ahead” capability. By integrating this probabilistic forecast with the MINCO spatio-temporal joint trajectory optimization in a distributed manner, the planner can anticipate dynamic obstacle behavior in advance, thereby significantly improving navigation success. The SPF is essentially a normalized 2D Gaussian function; when multiple obstacles fall within the robot’s planning horizon, their individual fields can be efficiently fused into a Gaussian mixture-like collective SPF. This formulation is computationally lightweight, contributing to high efficiency during joint optimization. Second, the extended state observer (ESO) embedded in the MPC framework continuously estimates and compensates for both internal and external disturbances in real time, enabling safe and robust tracking of the planned trajectory under practical uncertainties.
Despite its advantages, our approach has several limitations. First, the coordination among multiple robots heavily relies on communication, and communication delays or packet loss will inevitably lead to collisions. Second, The MINCO optimizer utilized in SFC-MPC necessitates robots with differential flatness properties, leaving it unsuitable for other robot types. Third, when multiple robots navigate through narrow passages, deadlocks may arise, especially as the number of robots increases significantly.
Future work will focus on enhancing system robustness and generality by addressing three critical challenges: improving robustness to communication delays through asynchronous, event-driven planning; reducing dependency on specific dynamic models (e.g. differentially flat systems) by developing adaptive, platform-agnostic frameworks such as primitive-based approaches; and resolving deadlocks in confined spaces via spatio-temporal corridor scheduling to pre-allocate passage rights.
Conclusion
This article proposes SPF-EMPC, a multi-robot motion planner robust to dynamic obstacles and system uncertainties. Its key contributions are asfollows:
Introduces a safety probability field for efficient, iterative online trajectory generation via unconstrained optimization. Employs polynomial fitting and an extended state observer to ensure accurate trajectory tracking under disturbances. In simulation, matches the performance of an ideal baseline with significantly fewer optimization iterations. Hardware experiments with eight robots demonstrate real-time capability and reliability in practice.
Footnotes
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data availability section
None.
