SPF-EMPC planner: A real-time motion planner for multi-robot in cluttered environments with dynamic obstacles and uncertainties

Abstract

Safe and efficient navigation for multi-robot systems in cluttered, dynamic environments remains challenging, primarily due to uncertainties from internal disturbances and dynamic external conditions. While existing methods enable real-time navigation in static or sparse settings, they often fail to respond effectively in environments dense with both static and moving obstacles. To address this limitation, this article proposes a novel safety probability field-based extended state model predictive control (SPF-EMPC) planner. The framework first introduces a safety probability field to model dynamic obstacles and integrates it with an unconstrained optimization approach for online generation of collision-free trajectories. Subsequently, an extended state model predictive controller ensures accurate trajectory tracking by explicitly accounting for robot model constraints and state perturbations, thereby guaranteeing practical feasibility. Both simulations and physical experiments demonstrate that the proposed method reliably prevents inter-robot and robot–obstacle collisions, even under significant motion and control uncertainties.

Keywords

Multi-robot collision avoidance motion planning cluttered environment

Introduction

The deployment of autonomous multi-robot systems in real-world applications is rapidly expanding, spanning areas such as post-disaster search and rescue,¹ forest fire detection,² logistics and warehousing,³ and infrastructure damage inspection.⁴ Such scenarios require systems that can not only navigate agilely through cluttered, unstructured environments but also maintain robustness against both internal disturbances and dynamic external conditions. However, achieving simultaneous resilience to these dual challenges remains an open problem in current research.

In cluttered and dynamic environments, mission failures in large-scale multi-robot deployments typically stem from two interrelated sources of uncertainty: environmental uncertainty and robot-induced uncertainty. Environmental uncertainty refers to limitations or inaccuracies in perception, which hinder robotic swarms from reliably detecting and fully avoiding dynamic obstacles. This perceptual gap creates significant collision risks, especially in densely populated and highly mobile scenarios. Velocity Obstacle (VO) and its variants, such as Reciprocal Velocity Obstacle and Hybrid Reciprocal Velocity Obstacle,^5–7 are widely used to address dynamic obstacles in such settings. However, these methods remain limited in handling prediction uncertainty: errors in position and velocity measurements directly influence the computation of the VO cone, potentially introducing collision risks. Meanwhile, ego-series algorithms,^8–10 which perform well in dense static environments, generally assume known or constant obstacle trajectories when dealing with dynamic obstacles—an assumption that often does not hold in practice, where trajectories are typically unknown and highly variable. Robot-induced uncertainty, on the other hand, arises from inaccuracies in the robots’ physical models (e.g. dynamics parameters) or their localization systems. These errors introduce internal perturbations that can degrade control performance, resulting in inter-robot collisions or significantly prolonged mission times. To explicitly address internal uncertainties, PUMA¹¹ addresses internal uncertainty by integrating uncertainty propagation into the optimization constraints, dynamically adjusting collision avoidance bounds and velocity limits based on predicted state covariances. Similarly, SWIFT¹² handles internal uncertainties (e.g. sensor noise and motion drift) through a confidence-guided cross-attention mechanism to selectively weight neighbor interactions. Despite these advances, simultaneously handling unpredictable dynamic obstacles and internal system uncertainties remains a daunting challenge.

To solve these problems, we propose a two-layer robust framework for multi-robot motion planning: the safety probability field-based extended state model predictive control (SPF-EMPC) planner, which combines a lightweight local trajectory planner with an iterative trajectory tracker. In trajectory planning, a safety probability field is constructed to model dynamic obstacles with motion perturbations. Requirements such as obstacle avoidance and smoothness are integrated into an efficient unconstrained optimized trajectory generation, thus reducing the computational load. In trajectory tracking, we explicitly account for state localization and model errors. Model predictive control using an extended state observer and a temporal self-sampling approach enables the robot to accurately track targets with time demands, ensuring the effectiveness of the planned trajectory and the robustness of the whole system. The entire framework successfully scales to multi-robot applications with significant advantages in terms of success rate and number of iterations, avoiding the over-conservatism of existing real-time methods while providing security at a low computational cost.

Further, we perform a distributed deployment in a real-world environment as shown in Figure 1. The environment is unknown in advance, and robots need to reach their destinations safely without formation in a cluttered environment, in which the irregular movement of dynamic obstacles and imprecise control of the robots pose potential collision risks. The main contributions are summarized below:

Figure 1.

Illustration of multi-robot navigation in a cluttered environment using the proposed safety probability field-based extended state model predictive control (SPF-EMPC) framework. (Top) The three-wheeled omnidirectional mobile robot platform used for real-world experiments. (Bottom) Physical navigation scenario involving static obstacles, a moving obstacle (highlighted with a green marker within a red circle), and multiple robots. Colored markers beneath obstacles represent perception data from different robots, demonstrating the distributed perception and real-time planning capabilities of the system.

Online trajectory generator for dynamic obstacle avoidance with uncertainties: We construct a safety probability field for obstacles that takes into account obstacle velocity and position perturbations, and apply an unconstrained optimization method to generate trajectories online for multiple robots in cluttered environments.

Robust trajectory tracker for precise robot following considering perturbations: We design extended state model predictive control to filter robot localization and model perturbations, and accurately track the generated trajectories using a temporal self-sampling method.

Simulation and physical validation: We conducted multi-robot traversal experiments in both simulation and physical environments to validate real-time performance and effectiveness of the proposed method.

Related work

In recent years, significant progress has been made in trajectory planning for obstacle avoidance using voxel maps^13,14 and optimization methods.^15,16 The space occupied by the obstacle is usually represented as a set of constraints on the free space, and the selection of constraint type determines the performance of the optimization problem. In deterministic environments, some work, such as ego-serials,^8–10 achieve real-time navigation in complex obstacle environments, but treat dynamic obstacles as instantaneous static obstacles or cooperative robots, and the voxel maps they rely on do not adequately characterize dynamic obstacles. For obstacles with motion perturbations, there are two main strategies in the literature to characterize them: using differentiable surfaces such as ellipsoids,¹⁷ or using convex polyhedra such as rectangles.^18,19

The obstacle can be described by a differentiable surface as a nonlinear constraint on the optimization problem, which can be efficiently solved by a gradient-based solver.¹⁵ To facilitate real-time processing, conservative estimates of obstacles with motion uncertainty, such as sigma hulls based on confidence levels.²⁰ Although the method does not guarantee global optimality, its reduced computational cost makes it widely adopted for most time-critical motion planning tasks.²¹ The overly conservative obstacle models guarantee collision-free behavior, they greatly consume the remaining free space, thus further diminishing the feasibility of time-critical navigation tasks in cluttered environments containing multiple robots and obstacles.¹⁷ In addition, existing methods only make conservative estimates on the obstacle volume and do not fully utilize other information of dynamic obstacles.

Alternatively, obstacles can be characterized by convex polyhedra occupied in the space, using the occupied space as a hard constraint to be avoided in the optimization problem. Xu et al.¹⁹ use the convex polyhedral space of obstacles to construct safe flight corridors to ensure collision-free trajectories for static obstacles. In recent years, model predictive control (MPC) methods^22–24 have emerged actively as a dynamic obstacle avoidance method. MPC can be combined with dynamic obstacle avoidance to generate and track trajectories that satisfy the robot’s motion and actuation constraints. In the work,²⁵ moving obstacles are packed into ellipses, and spatial constraints are imposed as optimization metrics. To further consider obstacles perturbations, Jian et al.²⁶ integrate a control barrier function within the MPC framework to avoid pedestrians. There are also several work^27,28 that utilize geometric representations and MPC to probabilistically avoid dynamic obstacles. Recent efforts have made some progress in computational efficiency,^17,27 but their computational cost grows exponentially with the number of obstacles and robots, and they are unable to meet real-time requirements.

In this article, we combine the advantages of efficient computation of the gradient-based planner and robust control of the MPC to construct a two-layer framework that is easy to extend. In order to prevent cooperative robots from becoming dynamic obstacles due to control errors from localization and model perturbations, this article integrates the interference between obstacle motion and robot control in cluttered environments, which is different from previous work shown in Table 1. The designed safety probability field can fully utilize the size and velocity information of dynamic obstacles, and the extended state observer can utilize the historical information of robot states. The proposed method can effectively cope with perturbations, and generate and track safe trajectories for multi-robot in cluttered environments online.

Table 1.

Comparison of recent motion planning methods for dynamic obstacle avoidance based on key attributes in the table.

Method	Group scale	Static obstacles	Perturbation consideration	Verification
EGO-Swarm⁹	Multiple	Yes	Not considered	Hardware
EGO-Swarm2¹⁰	Multiple	Yes	Not considered	Hardware
MADER²⁹	Multiple	Yes	Not considered	Simulation
RMADER³⁰	Multiple	Yes	Not considered	Hardware
DPMPC²⁵	Single	Yes	obstacles	Simulation
VIGO¹⁶	Single	Yes	Not considered	Hardware
CCNMPC¹⁷	Multiple	No	Obstacles	Hardware
SPF-EMPC (Ours)	Multiple	Yes	Obstacles and robots	Hardware

EGO: ESDF-free Gradient-based lOcal; MADER: Multiagent Trajectory Planner Robust to Communication Delay in Dynamic Environments; RMADER: Robust Multiagent Trajectory Planner Robust to Communication Delay in Dynamic Environments; DPMPC: Dynamic Polynomial-based Model Predictive Control; SPF-EMPC: safety probability field-based extended state model predictive control.

Methodology

System framework overview

This work introduces a multi-robot distributed trajectory planning framework encompassing three principal components: the sensing and position module, the local trajectory planning module, and a trajectory tracking module, as shown in Figure 2.

Figure 2.

Schematic overview of the proposed SPF-EMPC framework. The system operates in a closed-loop manner: upon local sensing of a dynamic obstacle, a real-time SPF is generated. This SPF, along with the local static map, informs an unconstrained local trajectory planner to iteratively compute a collision-free path. The resulting time-parameterized trajectory is then accurately tracked by an EMPC, which compensates for robot state and model perturbations. Throughout this process, perception and localization modules provide continuous state feedback and collision checking. SPF-EMPC: safety probability field-based extended state model predictive control; SPF: safety probability field; EMPC: extended state model predictive controller.

Each robot has a distinct sensing range within this distributed system for its planning and control processes. And trajectories are communicated through a prioritization mechanism (This article sets a fixed priority according to the robot number), robots with a lower priority need to consider obstacle avoidance with other robots with a higher priority. The whole system faces disruptions from moving obstacles, robot’s localization and model.

Safety probability field

In contrast to the fixed nature of static obstacles, the future state of dynamic obstacles similar to pedestrians cannot be accurately predicted, making it difficult to effectively generate safety trajectories. Therefore, this section constructs a safety probability field to describe dynamic obstacles with motion perturbations.

This work enhances the real-time performance and effectiveness of obstacle avoidance by referring to studies on pedestrian prediction^10,31 and introducing an elliptic dynamic probability field with a binary Gaussian distribution, as shown in Figure 3. This elliptical safety probability field with motion bias describes the moving intention of the obstacle and degenerates into a general circular field when the velocity is zero (static obstacle), providing accurate guidance for the robot to avoid obstacle. By avoiding regions with high collision probability, safe trajectories can be generated that consider finite future states and avoid being overly conservative.

Figure 3.

The adaptive safety probability field (SPF) around a dynamic obstacle. The elliptical field, shaped by the obstacle’s current velocity and size, visualizes collision risk (color gradient). The dashed circle indicates the obstacle’s predicted position at the next time step.

The probability field is designed concerning the Gaussian distribution as follows:

U (p_{i}^{o}, v_{i}^{o}, p) = \frac{\exp (- \frac{1}{2} {(p - p_{c_{i}}^{o})}^{T} Σ^{- 1} (p - p_{c_{i}}^{o}))}{2 π | Σ |^{1 / 2}}

(1)

p_{c_{i}}^{o} = p_{i}^{o} + (1 - λ) v_{i}^{o} d t, λ \in [0, 1]

(2)

where

p_{i}^{o}

and

v_{i}^{o}

denote the current position and velocity of the

i th

obstacle;

p

is the query point;

p_{c_{i}}^{o}

is the center of the probability field;

d t

is the sampling interval;

λ \in [0, 1]

weights the focus on the current position; and

Σ

is the covariance matrix. The function

U (p_{i}^{o}, v_{i}^{o}, p)

outputs the likelihood of the dynamic obstacle being located at

p

, as shown in Figure 4. The figure illustrates the spatially varying collision probability, which decays radially outward from each ellipse’s center—higher values correspond to greater risk. A circular field indicates a stationary obstacle (modeled as i.i.d.), while an elliptical one reflects velocity-driven uncertainty. By circumventing high-risk zones, our approach generates safe trajectories that incorporate the obstacle’s near-future motion, thereby reducing computational cost and avoiding excessive conservatism in planning.

Figure 4.

Safety probability field illustrations for varying obstacle velocities. The green circle indicates the robot’s current position.

For the covariance matrix $Σ$ , we set $θ_{i} = \arctan (V_{i y}^{o} / V_{i x}^{o})$ so that the rotation matrix $R$ and deflation matrix $Λ$ are related to the obstacle’s size and velocity:

\begin{matrix} Σ = R Λ R^{T} \end{matrix} = [\begin{matrix} \cos θ_{i} & - \sin θ_{i} \\ \sin θ_{i} & \cos θ_{i} \end{matrix}] [\begin{matrix} {a_{i}}^{2} & 0 \\ 0 & {b_{i}}^{2} \end{matrix}] {[\begin{matrix} \cos θ_{i} & - \sin θ_{i} \\ \sin θ_{i} & \cos θ_{i} \end{matrix}]}^{T}

(3)

Assuming that the dynamic obstacle can be wrapped by a minimum outer circle with radius $r_{i}^{o}$ , the parameters $a_{i} = r_{i}^{o} + v_{i}^{o} t / 2$ and $b_{i} = r_{i}^{o}$ are defined for the inner elliptical circle in Figure 3. According to the above formulas, the probability field for the moving obstacle is established, and its partial derivative concerning the position can be calculated as follows:

\frac{\partial U (p_{i}^{o}, v_{i}^{o}, p)}{\partial p} = - U Σ^{- 1} (p - p_{c_{i}}^{o})

(4)

With the real-time motion of dynamic obstacles, the safety probability field is dynamically updated according to their latest position, velocity, and dimensional information. This real-time adaptive mechanism enables the robot to consistently assess the threat intensity imposed by nearby moving obstacles

Multi-robot trajectory generation

In cluttered environments with multiple static and moving obstacles, the rapid generation of safe trajectories is essential for robots to respond to environmental changes on time and navigate safely. In this section, an unconstrained optimization method based on safety probability field is used to study the multi-robot trajectory generation problem.

Trajectory representation

For the following system:

\dot{x} = f (x, u)

(5)

where

f

is the state transition function,

x

is the state variable, and

u

is the input. The system is said to be differentially flat³² if

x

and

u

can be represented by an analytic expression with a flat output

z

and its finite derivatives. In this case, only the higher-order differentiable trajectory of the center of mass and its yaw angle need to be planned for the trajectory planning of the system.

According to the differential flatness property, the MINCO (Minimum Control) trajectory parameterization technology³³ can simplify high-dimensional trajectories into a series of waypoints ${p_{11}^{r}, \dots, p_{1 k}^{r}, \dots, p_{i j}^{r}, \dots, p_{N k}^{r}, p_{e}^{r}}$ and the corresponding time $t_{i j}$ to flexibly respond to complex and changing environments, where $p_{11}^{r}$ and $p_{e}^{r}$ represent the starting position and the ending position, respectively. The notation $p_{i j}^{r}$ denotes the $j$ th waypoint of the $i$ th segment within the trajectory. The variable $N$ is the total number of segments, while $k$ indicates the number of waypoints per segment. For notational convenience, this article adopts the shorthand $p_{i j}^{r} \equiv p_{(i - 1) \times k + j}^{r}$ . Consequently, the set of waypoints can be represented as follows:

S = {p_{1}^{r}, \dots, p_{i}^{r}, \dots, p_{N \times k}^{r}, p_{N \times k + 1}^{r}}, p_{i}^{r} \in R^{n}

(6)

where

n

is the dimension of the motion space. For the cost function

G (P, T) = H (C (P, T), T)

G

and

H

are the same costs in two different forms. The partial derivatives of the cost concerning the waypoints

p_{i}

and the unconstrained time

T_{i}

are derived from the subsequent equations:

\begin{aligned} \frac{\partial G}{\partial p_{i}} & = tr {{(\frac{\partial C}{\partial p_{i}})}^{T} \frac{\partial H}{\partial C}} \\ = tr {{(L^{- 1} \frac{\partial (L C)}{\partial p_{i}})}^{T} \frac{\partial H}{\partial C}} \end{aligned}

(7)

\begin{aligned} \frac{\partial G}{\partial T_{i}} & = \frac{\partial H}{\partial T_{i}} + tr {{(\frac{\partial C}{\partial T_{i}})}^{T} \frac{\partial H}{\partial C}} \\ = \frac{\partial H}{\partial T_{i}} - tr {{(\frac{\partial L}{\partial T_{i}} C)}^{T} L^{- T} \frac{\partial H}{\partial C}} \end{aligned}

(8)

where

C

is the coefficient matrix,

L

is the condition matrix linking the trajectory segments, and

tr {\cdot}

denotes the operation of obtaining the trace of the matrix. The specific settings can be related to the work.³³

Optimization problem construction

According to equations (14) and (15), the omnidirectional mobile robot studied in this paper has a differential flatness property similar to the quadrotor when the flat output $z = [x_{w}, y_{w}, θ_{bw}]$ is set. Combining this differential flatness with the MINCO trajectory parameterization method transforms the trajectory optimization problem into an unconstrained problem to speed up the optimization process. The overall cost function is defined as follows:

C_{tot} = C_{ego} + ω_{obs} \cdot C_{obs}

(9)

where

C_{ego}

is the weighted cost combination of smoothness, dynamic feasibility, time and mutual avoidance. The

C_{obs}

and

ω_{obs}

are the cost and weight of static and dynamic obstacle avoidance, respectively. In this article, we specifically discuss the obstacle avoidance part and refer to previous work¹⁰ to set

C_{ego}

The safety probability field is employed to find guidance points for estimating the collision cost and gradient required by the gradient-based optimizer. The process of setting specific guidance points is illustrated in Figure 5. SPF can calculate the probability that a collision occurs at a given point, and a collision probability exceeding the prescribed threshold indicates a potential collision. When a collision risk is detected for the given waypoints within the current trajectory, the guide points will be sought according to the safe gradients of the waypoints to generate a collision-free trajectory. To ensure completeness of the trajectory solution, an alternative trajectory is obtained on the other side according to the symmetry line. Ultimately, the optimal trajectory is selected based on a comparison of the costs of the trajectories. It should be noted that the optimization-based approach itself is susceptible to oscillations due to the dynamic nature of the environment. However, engineering settings can improve this limitation, for example, by automatically adjusting the threshold based on the number of oscillations.

Figure 5.

Trajectory optimization using the safety probability field (SPF). The initial trajectory (green curve) enters the SPF of a dynamic obstacle. A risky waypoint (light green dot) is identified and projected along the collision probability gradient (black arrow) to a safe guidance point (blue dot). An alternative trajectory (blue dashed curve) is then generated on the obstacle’s opposite side, guided by a symmetry line (purple dashed) through the risk zone.

According to the SPF, each dangerous waypoint $p_{i}^{r}$ for dynamic obstacles $p_{j}^{o}$ is paired with a corresponding guidance point $p_{i j}^{g}$ . Subsequently, using the predefined safety threshold $d_{saf}$ , the collision cost and gradient at the collision control point can be calculated as follows:

C_{dyn} = \sum_{i} \sum_{j} λ_{i} {(d_{saf} - (p_{i}^{r} - p_{i j}^{g}) \cdot \frac{\nabla g_{i j}}{‖ \nabla g_{i j} ‖})}^{3}

(10)

\nabla g_{i j} = U (p_{j}^{o}, v_{j}, p_{i}^{r}) Σ^{- 1} (p_{i}^{r} - p_{c_{j}}^{o})

(11)

where

i

and

j

denote the path point and obstacle serial numbers, respectively, the weight assigned to the path point is

λ_{i}

. The direction of the negative gradient of the SPF, denoted by

\nabla g_{i j}

, can influence the trajectory by guiding waypoints towards a safe region.

The gradient of the collision cost concerning the trajectory polynomial coefficients $C$ and time $T$ are calculated using the chain rule as follows:

\begin{aligned} \frac{\partial C_{dyn}}{\partial C} = & - 3 \sum_{i} \sum_{j} λ_{i} β (t_{i}) {(\frac{\nabla g_{i j}}{‖ \nabla g_{i j} ‖})}^{T} \\ \cdot {(d_{saf} - (p_{i}^{r} - p_{i j}^{g}) \frac{\nabla g_{i j}}{‖ \nabla g_{i j} ‖})}^{2} \end{aligned}

(12)

\begin{aligned} \frac{\partial C_{dyn}}{\partial T} = & - 3 \sum_{i} \sum_{j} λ_{i} v_{i} α_{i} {(\frac{\nabla g_{i j}}{‖ \nabla g_{i j} ‖})}^{T} \\ \cdot {(d_{saf} - (p_{i}^{r} - p_{i j}^{g}) \frac{\nabla g_{i j}}{‖ \nabla g_{i j} ‖})}^{2} \end{aligned}

(13)

where

β (t_{i}) = [1, t_{i}, {t_{i}}^{2}, {t_{i}}^{3}, {t_{i}}^{4}, {t_{i}}^{5}]

is the basis function of the trajectory segment,

α_{i} = i / k

is the scaling factor of the trajectory segment, and

v_{i}

is the required speed corresponding to the waypoint. According to equations (7) and (8), the derivatives of the dynamic collision cost concerning the waypoints and the unconstrained time can be calculated. After that, unconstrained optimization will accelerate the trajectory generation process.

Trajectory tracking control

In practice, robots are subject to both internal and external disturbances. External disturbances introduce localization errors, which compromise the accuracy of control force computation, while internal disturbances cause model mismatches that degrade the application of control forces. Together, these perturbations induce control uncertainties, resulting in trajectory tracking errors and reduced navigation efficiency. Consequently, robust controllers capable of explicitly accounting for such disturbances are essential for safe and reliable operation. Since both localization inaccuracies and model deviations manifest as state uncertainties, this section presents an extended state observer designed to estimate these compounded perturbations. Combined with a self-sampling model predictive control scheme, the proposed approach enables precise and robust tracking of iteratively planned trajectories.

Robot model

The three-wheeled omnidirectional mobile robot employed in this article is characterized by its flexible motion capabilities, as depicted in Figure 6.

Figure 6.

Schematic diagram of an omnidirectional mobile robot model.

The positive direction of the robot body coordinate system’s x-axis corresponds to the wheel axis direction, and the wheels are evenly distributed at intervals of $120^{\circ}$ . The kinematic and kinetic equations of the robot are as follows:

\dot{z} = [\begin{matrix} v_{w x} \\ v_{w y} \\ ω \end{matrix}] = A ({[\begin{matrix} 0 & - 1 & - L \\ \cos 60 \circ & \sin 60 \circ & - L \\ - \cos 60 \circ & \sin 60 \circ & - L \end{matrix}]}^{- 1} + Δ_{1}) [\begin{matrix} v_{1} \\ v_{2} \\ v_{3} \end{matrix}] + w_{v}

(14)

\ddot{z} = [\begin{matrix} {\dot{v}}_{w x} \\ {\dot{v}}_{w y} \\ \dot{ω} \end{matrix}] = A ([\begin{matrix} - \frac{\sqrt{3}}{2 MR} & - \frac{\sqrt{3}}{2 MR} & 0 \\ - \frac{1}{2 MR} & - \frac{1}{2 MR} & \frac{1}{MR} \\ - \frac{L}{IR} & - \frac{L}{IR} & - \frac{L}{IR} \end{matrix}] + Δ_{2}) [\begin{matrix} τ_{1} \\ τ_{2} \\ τ_{3} \end{matrix}] + w_{a}

(15)

A = [\begin{matrix} \cos θ_{bw} & - \sin θ_{bw} & 0 \\ \sin θ_{bw} & \cos θ_{bw} & 0 \\ 0 & 0 & 1 \end{matrix}]

(16)

where

A

is the coordinate transformation matrix;

v_{i}

and

τ_{i}

denote the wheels’ linear velocities and control torques, respectively;

ω

is the robot’s angular velocity;

θ_{bw}

defines the body-to-world orientation;

v_{b x}

and

v_{w x}

are the robot’s velocities expressed in the body and world frames. Physical parameters include the platform radius

L = 0.04 m

, mass

M

, wheel radius

R

, and moment of inertia

I

. The terms

w_{v}

and

w_{a}

represent measurement noise in the kinematic and kinetic states, while

Δ_{1}

and

Δ_{2}

correspond to model errors.

Extended state MPC

The effects caused by model error and localization noise can be jointly attributed to the uncertainties of the robot states. According to equations (14) to (16), the discrete state transfer equation can be expressed as follows:

{\hat{z}}_{k} = f ({\hat{z}}_{k - 1}, u_{k - 1}) + w_{k}

(17)

where

{\hat{z}}_{k}

is the current measured state,

w_{k}

is the total perturbation, and

f

is the state transfer function. For each state variable, this paper employs a unified extended state observer (ESO)³⁴ to monitor the extended perturbed state and provide compensation for the actual measured state as follows:

[\begin{matrix} {\dot{\bar{z}}}_{k} \\ {\dot{d}}_{k} \end{matrix}] = [\begin{matrix} - l_{1} & 1 \\ - l_{2} & 0 \end{matrix}] [\begin{matrix} {\bar{z}}_{k} \\ d_{k} \end{matrix}] + [\begin{matrix} b & l_{1} \\ 0 & l_{2} \end{matrix}] [\begin{matrix} u_{k} \\ {\hat{z}}_{k} \end{matrix}]

(18)

where

{\bar{z}}_{k}

is the state estimation and

d_{k}

is the perturbation estimate. According to the kinematic model, the control gain is set to

b = 1

, and the ESO gains are

l_{1} = 2 w

and

l_{2} = w^{2}

, with the poles of the system uniformly configured at

- w

After obtaining the estimated state, this work fits a polynomial trajectory using the positions, velocities, and timestamps of adjacent waypoints from the reference trajectory ${t r a j}_{ref, k}$ . This efficiently transforms the future waypoint data into trajectory parameters, enabling accurate tracking of the continuously iterated and optimized local trajectories from the planner.

{t r a j}_{ref, k} = t r a j ({\bar{z}}_{k}, t_{0} + (k - 1) d t), \forall k = 1, 2, \dots, N

(19)

where

{\bar{z}}_{k}

is the current estimated state after ESO processing. Adaptive sampling is performed based on the current time

t_{0}

, the time horizon

H

, the sampling interval

d t

, and the information about the trajectory parameters. Finally, the optimal control problem is formulated as follows:

min_{z_{k}, u_{k}} \sum_{k = 0}^{H - 1} (λ_{p} {‖ p_{k} - {t r a j}_{ref, k} ‖}^{2} + λ_{u} {‖ u_{k} ‖}^{2})

(20\rm a)

s . t . {\bar{z}}_{k} = eso ({\hat{z}}_{k}, u_{k})

(20\rm b)

{\hat{z}}_{k + 1} = f ({\bar{z}}_{k}, u_{k}) + w_{k + 1}

(20\rm c)

z_{0} = z_{init}

(20\rm d)

z_{lb} \leq z_{k} \leq z_{ub}

(20\rm e)

u_{lb} \leq u_{k} \leq u_{ub}

(20\rm f)

where the state variables are represented by the vector

z = [x_{w}, y_{w}, θ_{bw}]

, and the control inputs are represented by the vector

u = [v_{1}, v_{2}, v_{3}]

. The

w_{k + 1}

denotes the total noise. The non-zero track error factor

λ_{p}

and the control input factor

λ_{u}

are designed to limit the solution space to balance tracking accuracy with the smoothness of the control input.

Results and discussion

To further assess the efficacy of the proposed method, we perform a series of experiments in both simulation and real-world scenarios. The L-BFGS algorithm³⁵ and the CasADi toolkit³⁶ are utilized to address unconstrained trajectory optimization and optimal control problems, respectively. These algorithms are integrated into the ROS environment through C++ code. Separate ROS nodes have been engineered to allow a transition between simulation and physical application without altering the codebase, operating in a distributed manner on an Intel i9-13900K. The full experimental video is available at https://www.bilibili.com/video/BV1xpmNYJE5F/.

Figure 7 shows the execution time of the trajectory generator and the tracking controller, demonstrating the superior real-time performance of combining the two parts. The trajectory generator is capable of avoiding obstacles in cluttered environments in real time. Although it requires additional time to construct the SPF when encountering dynamic obstacles with motion perturbations, the average optimization time is still within 2 ms. The tracking controller focuses on the robot’s control problem and can compute control quantities that meet the robot’s constraints within 8 ms, taking into account state perturbations induced by localization and model. The total duration of the overall framework is <10 ms, allowing for a timely response to dynamic changes in the environment.

Figure 7.

Algorithm running time, where $μ$ denotes the mean execution time. Left: The execution time of trajectory generation, with average times of 0.63, 0.67, and 1.34 ms in obstacle-free, static obstacle, and dynamic obstacle scenarios, respectively. Right: The execution time of extended state model predictive controller (EMPC) for trajectory tracking, with an average of 8 ms. The total algorithm runtime remains within 10 ms, enabling real-time operation.

Experimental setup

For trajectory planning, we set the attention in equation (2) $λ = 0.5$ in the safety probability field and select a fifth-order polynomial for trajectory parameterization in unconstrained optimization, achieving a balance between the algorithm’s real-time performance and trajectory smoothness. In trajectory tracking, the ESO gains in equation (18) are set to $l_{1} = 1$ and $l_{2} = 0.25$ . The MPC domain in equation (19) is configured with $H = 5$ , $N = 15$ , using a time step of $d t = 0.1 s$ .

To describe the perturbations in the environment, we assume that the state of the $i$ th obstacle at the next time step conforms to $p_{i, t_{k + 1}}^{o} = p_{i, t_{k}}^{o} + v_{i, t_{k}}^{o^{'}} (t_{k + 1} - t_{k}) + p_{i e}^{o}$ and $v_{i, t_{k}}^{o^{'}} = v_{i, t_{k}}^{o} + v_{i e}^{o}$ . Both the position deviation $p_{i e}^{o}$ and the velocity deviation $v_{i e}^{o}$ of the obstacles adhere to a Gaussian distribution. Furthermore, the state deviations (1 m deviation between the x and y states, $0.1 \circ$ deviation in $θ$ ) are introduced for the robot in the simulation, thereby simulating the measured state.

Simulation experiment

A multi-robot simulation is conducted in a complex dynamic environment, as depicted in Figure 8. Each robot is capable of independently avoiding obstacles and safely reaching its destination. Their target positions are set in opposing configurations to ensure a complete interaction among the robots.

Figure 8.

Multi-robot navigation in cluttered dynamic environments using SPF-EMPC. Upside: Twelve robots navigate among static (squares) and dynamic (circled) obstacles with distributed perception (colored markers). Downside: An individual robot tracks iterative trajectories via extended state MPC while accounting for estimated state uncertainty (green region) and a dynamic obstacle (purple circle), demonstrating simultaneous handling of internal and external uncertainties. SPF-EMPC: safety probability field-based extended state model predictive control; MPC: model predictive control.

Due to the lack of source code for related work and differences in scenario configurations, this paper compares the trajectory generation aspects with the current state-of-the-art method and performs an ablation study of the components of the overall framework. These evaluations are intended to demonstrate the necessity and superiority of the proposed framework.

Comparison of trajectory generation

The local planner EGO-Swarm2³⁷ and its variant with a accurate prediction of dynamic obstacles serve as benchmarks. The accurate prediction variant treats obstacles as collaborative robots that share their future accurate trajectories at all times, which is too ideal to be realized in reality, but is used here as a comparison to validate the performance of the proposed method. The comparative data at varying levels of perturbation are presented in Table 2. Among the performance metrics, the overtime rate indicates the likelihood that the robot is trapped in the environment, failing to reach its target. The number of iterations refers to the mean number of optimizations required for a single robot within a single experimental trial.

Table 2.

Quantitative comparison under motion uncertainty.

Perturbation level	Methods	SR (%)	OR (%)	Dynamic distance (m)	Path length (m)	Travel time (seconds)	Iterations
0.25 $Σ$	ES2	14	0	2.30 $\|$ 0.67 $\|$ 1.70	25.43 $\|$ 25.13 $\|$ 0.44	34.22 $\|$ 32.00 $\|$ 4.15	253.74
	ES2*	76	10	1.76 $\|$ 0.97 $\|$ 1.11	26.41 $\|$ 25.15 $\|$ 1.29	35.30 $\|$ 31.01 $\|$ 2.71	69.92
	Ours	72	2	1.52 $\|$ 0.62 $\|$ 0.85	27.17 $\|$ 25.14 $\|$ 2.42	38.83 $\|$ 32.00 $\|$ 7.02	58.4
1 $Σ$	ES2	10	0	1.50 $\|$ 0.73 $\|$ 0.79	25.62 $\|$ 25.15 $\|$ 0.44	31.80 $\|$ 31.00 $\|$ 0.57	270.16
	ES2*	62	4	1.41 $\|$ 0.85 $\|$ 3.41	26.53 $\|$ 25.28 $\|$ 1.33	36.08 $\|$ 31.50 $\|$ 3.59	92.52
	Ours	56	2	1.38 $\|$ 0.69 $\|$ 0.62	29.64 $\|$ 25.18 $\|$ 4.44	39.34 $\|$ 33.50 $\|$ 5.10	55.78
2 $Σ$	ES2	12	2	1.39 $\|$ 0.57 $\|$ 0.68	26.21 $\|$ 25.20 $\|$ 1.28	33.33 $\|$ 31.50 $\|$ 1.57	300.96
	ES2*	52	4	1.39 $\|$ 0.57 $\|$ 0.68	27.37 $\|$ 25.46 $\|$ 1.88	38.33 $\|$ 33.00 $\|$ 5.37	101.74
	Ours	40	2	1.47 $\|$ 0.53 $\|$ 1.01	29.35 $\|$ 25.17 $\|$ 3.71	39.60 $\|$ 31.50 $\|$ 5.35	60.36

SPF-EMPC: safety probability field-based extended state model predictive control; SR: success rate; OR: overtime rate.

Performance of the proposed SPF-EMPC is benchmarked against EGO-Swarm2 (ES2) and its ideal variant with perfect prediction (ES2*). Results are averaged over 50 Monte Carlo trials conducted in a cluttered randomized environment (five robots, four dynamic, and six static obstacles) under three perturbation levels (0.25 $Σ$ , 1 $Σ$ , 2 $Σ$ ; where (1 $Σ : {σ_{p}}^{o} = 1 \times 10^{- 4}, {σ_{v}}^{o} = 1 \times 10^{- 2}$ )). Reported metrics include SR, OR, dynamic distance, path length, travel time, and optimization iterations. For each metric, values are shown as mean $|$ min $|$ std, with the best performer per row highlighted in bold.

We conduct the simulations in a spatially limited environment containing multiple irregular moving obstacles with speeds close to the robot’s, making it difficult to achieve full obstacle avoidance. As illustrated in Table 2, the success rate of the three methods declines gradually with increasing perturbation level. Notably, ES2 exhibits the lowest overtime rate and requires the shortest distance and time to navigate. However, ES2 perceives dynamic obstacles as static in real-time and does not account for motion perturbations, resulting in the lowest success rate and most iterations. The variant ES2*, which assumes that obstacles follow perfectly predictable trajectories, markedly enhances the success rate and iteration count compared to ES2. Nevertheless, it tends to induce oscillations when confronted with uncertain obstacles, leading to a higher overtime rate. Moreover, its ideal prediction model is challenging to implement in practical scenarios. In contrast, the proposed method achieves a considerably higher success rate than ES2 and nearly matches the ideal ES2* performance, with the fewest iterations and dynamic distance. The result indicates that the SPF-based unconstrained planning approach converges rapidly and avoids excessive conservatism in generating safe trajectories.

Ablation comparison

The superiority of the proposed two-layer framework in real-time is shown in Figure 7. To confirm the efficacy of each component of our proposed approach in terms of obstacle avoidance success rate under consideration of perturbations, we performed an ablation comparison, as shown in Table 3. Specifically, in the variant lacking self-sampling (SS), the trajectory referenced in equation (19) is substituted with a generic MPC based on the initial discrete trajectory. For the version without SPF, the speed of the safety probability field is set to zero, degenerating into a circular field with the minimum safe distance.

Table 3.

Performance comparison of the proposed SPF-EMPC with other versions at different perturbation levels ( $1 Σ^{'}$ : 1 m deviation between the x and y states, $0.1 \circ$ deviation in $θ$ ) in a simulated environment, based on SR, OR, and TT.

Perturbation level	Methods	SR (%)	OR (%)	TT (seconds)
$0 Σ^{'}$	SPF-EMPC w/o SS	48	0	47.50
	SPF-EMPC w/o SPF	56	4	30.00
	Ours	86	2	33.12
$1 Σ^{'}$	SPF-EMPC w/o ESO	42	2	76.43
	Ours	88	0	33.42

SPF-EMPC: safety probability field-based extended state model predictive control; SR: success rate; OR: overtime rate; TT: travel time; SS: self-sampling; SPF: safety probability field; ESO: extended state observer.

The data in the table show that even under ideal localization conditions ( $0 Σ^{'}$ ), the SPF-EMPC cannot accurately follow the iterative trajectory without SS, ultimately resulting in the lowest recorded success rate. Furthermore, the absence of the SPF severely affects the system’s ability to avoid dynamic obstacles, resulting in a significant decrease in success rate. Considering the inherent deviation in the robot’s state, the lack of ESO complicates the controller’s ability to monitor the state, potentially issuing erroneous commands and increasing the complexity of real-time optimization. As a result, this leads to a decrease in success rate and a significant increase in travel time. These findings demonstrate the effectiveness and necessity of integrated components in our proposed approach, especially when facing irregularly moving obstacles and imprecise robot state.

Enhanced performance validation

In this section, we further validate the performance of the proposed algorithm in both narrow environment and dynamic obstacle environment. To evaluate the collaborative obstacle avoidance capability of the proposed algorithm in constrained spaces, we constructed a narrow passage crossing scenario involving eight robots, as illustrated in Figure 9. The passage width was configured at 1.2 times the robot diameter, with initial robot positions symmetrically distributed at both ends of the passage and target positions set on the opposite side. Experimental results demonstrate that the robots initiated coordinated avoidance maneuvers at 16 seconds, formed an orderly queue by 23 seconds, and successfully completed the passage crossing within 30 seconds, exhibiting continuous trajectory curvature without any deadlock situations. To address the motion uncertainty of dynamic obstacles, the position deviation of the obstacle is set as $v_{i e}^{o} \sim N (μ p, Σ)$ , and the velocity deviation as $p {i e}^{o} \sim N (μ_{v}, Σ)$ . Here, $μ p$ and $μ v$ represent the position and velocity of the obstacle at the previous time step, respectively. The deviation for the position component is $σ p = 1 \times 10^{- 4}$ , and for the velocity component, it is $σ v = 1 \times 10^{- 2}$ . The maximum speed of the obstacle matches the robot’s speed.

Figure 9.

Coordinated navigation of eight robots through a constrained narrow passage. The sequence demonstrates the efficacy of the proposed safety probability field-based extended state model predictive control (SPF-EMPC) framework in dense, structured environments: At $t = 16$ seconds, robots initiate distributed collision-avoidance maneuvers. By $t = 23$ seconds, they self-organize into a collision-free queue. At $t = 30$ seconds, all robots successfully traverse the passage with smooth, non-deadlocking trajectories, validating the method’s capability for cooperative motion in tight spaces.

For clarity, dynamic obstacles are displayed in a circular configuration, as shown in Figure 10. The robot avoids obstacles in real time by leveraging the safety probability field generated by dynamic obstacles. At t=16 seconds and t=20 seconds, the robot proactively avoids the obstacles by predicting their movement trends based on the probability field.

Figure 10.

Eight robots navigate through a dense environment with two dynamic obstacles. (a) $t$ =17 seconds and (b) $t$ =20 seconds.

Physical experiment

In a real-world scenario, a scaled-down platform,³⁸ measuring $1.374 \times 2.432 m$ , was constructed for conducting experiments. A three-wheeled omnidirectional robot was designed for physical testing, as shown in Figure 1.

Figure 11 shows an example of actual running. In the experiment, several static obstacles and one dynamic obstacle are set up to hinder the movement of the robots. The motion of the dynamic obstacle, the interference at the platform seams, and the robots’ traveling all pose challenges. Robots need to utilize their limited perception and communication capabilities to avoid various obstacles and other robots, and ultimately reach the goal point. Practical experiments demonstrate that the proposed method can enable the safe navigation of multiple robots in a dynamic environment. In this process, the results of the velocities returned by the robot are shown in Figure 12. The profiles confirm that all control commands remained within actuator limits, validating the dynamic feasibility and constraint compliance of the proposed trajectory tracking controller.

Figure 11.

Physical validation in a dynamic environment. (a) Setup with eight robots, eight static obstacles, and a dynamic obstacle. (b) A robot adjusts its trajectory in real-time to handle the irregular motion of the dynamic obstacle and localization inaccuracies at platform seams. (c) All robots reach their target positions collision-free, demonstrating the robustness of the safety probability field-based extended state model predictive control (SPF-EMPC) framework under real-world uncertainties.

Figure 12.

Wheel speed profiles from physical experiment, showing the measured linear velocities $(v_{1}, v_{2}, v_{3})$ via onboard encoders.

Discussion

In this work, we propose SFC-EMPC, a novel method that demonstrates superior performance to the EGO-Swarm2 baseline in multi-robot cooperative motion planning under uncertain dynamic obstacles. Specifically, it achieves a 40% higher success rate and a 75% improvement in optimization efficiency in such challenging environments. Compared to the conventional MPC method, the proposed approach not only achieves the shortest path time in the presence of state perturbations, but also improves the collision avoidance success rate by 46% while tracking time-varying trajectories under such perturbed conditions.

The proposed method achieves higher success rates and optimization efficiency against uncertain dynamic obstacles due to two key mechanisms. First, a Gaussian-based safety probability field (SPF) models obstacle motion intention, providing a predictive ‘‘look-ahead” capability. By integrating this probabilistic forecast with the MINCO spatio-temporal joint trajectory optimization in a distributed manner, the planner can anticipate dynamic obstacle behavior in advance, thereby significantly improving navigation success. The SPF is essentially a normalized 2D Gaussian function; when multiple obstacles fall within the robot’s planning horizon, their individual fields can be efficiently fused into a Gaussian mixture-like collective SPF. This formulation is computationally lightweight, contributing to high efficiency during joint optimization. Second, the extended state observer (ESO) embedded in the MPC framework continuously estimates and compensates for both internal and external disturbances in real time, enabling safe and robust tracking of the planned trajectory under practical uncertainties.

Computational complexity. The computational efficiency of the SPF-EMPC framework stems from its distributed nature. The trajectory generator’s per-robot cost is dominated by the gradient-based optimizer, whose complexity scales with the number of local waypoints and obstacles within the SPF. Crucially, this cost is independent of the total number of robots, as each agent performs its own local planning. The extended state MPC solves a fixed-horizon problem, resulting in a constant per-step computation that does not scale with team size. This structure yields an overall per-robot complexity of $O (1)$ with respect to the number of robots, which is empirically validated by the consistent sub-10 ms execution times measured in our multi-robot experiments (Figure 8).

Despite its advantages, our approach has several limitations. First, the coordination among multiple robots heavily relies on communication, and communication delays or packet loss will inevitably lead to collisions. Second, The MINCO optimizer utilized in SFC-MPC necessitates robots with differential flatness properties, leaving it unsuitable for other robot types. Third, when multiple robots navigate through narrow passages, deadlocks may arise, especially as the number of robots increases significantly.

Future work will focus on enhancing system robustness and generality by addressing three critical challenges: improving robustness to communication delays through asynchronous, event-driven planning; reducing dependency on specific dynamic models (e.g. differentially flat systems) by developing adaptive, platform-agnostic frameworks such as primitive-based approaches; and resolving deadlocks in confined spaces via spatio-temporal corridor scheduling to pre-allocate passage rights.

Conclusion

This article proposes SPF-EMPC, a multi-robot motion planner robust to dynamic obstacles and system uncertainties. Its key contributions are asfollows:

Introduces a safety probability field for efficient, iterative online trajectory generation via unconstrained optimization.

Employs polynomial fitting and an extended state observer to ensure accurate trajectory tracking under disturbances.

In simulation, matches the performance of an ideal baseline with significantly fewer optimization iterations.

Hardware experiments with eight robots demonstrate real-time capability and reliability in practice.

Footnotes

ORCID iD

Zhiwen Zeng

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Data availability section

None.

References

Simon

Baldissera

de Queiroz

, et al. Multi-robots coordination system for urban search and rescue assistance based on supervisory control theory. J Control, Autom Elect Syst 2023; 34: 484–495.

John

Harikumar

Senthilnath

, et al. An efficient approach with dynamic multiswarm of UAVs for forest firefighting. IEEE Trans Syst Man, Cybernet: Syst 2024; 54: 2860–2871.

Krnjaic

Steleac

Thomas

, et al. Scalable multi-agent reinforcement learning for warehouse logistics with robotic and human co-workers. In: 2024 IEEE/RSJ International conference on intelligent robots and systems (IROS), 2024, pp.677–684. IEEE.

Halder

Afsari

. Robots in inspection and monitoring of buildings and infrastructure: a systematic review. Appl Sci 2023; 13: 2304.

Huang

Zeng

Chi

, et al. Velocity obstacle for polytopic collision avoidance for distributed multi-robot systems. IEEE Robot Autom Lett 2023; 8: 3502–3509.

Arul

Manocha

. V-RVO: decentralized multi-agent collision avoidance using voronoi diagrams and reciprocal velocity obstacles. In: 2021 IEEE/RSJ International conference on intelligent robots and systems (IROS), 2021, pp.8097–8104.

Wang

Jiang

, et al. Multi-robot smooth path planning considering local motion control based on TEB-VO. Rob Auton Syst 2025; 197: 105304.

Zhou

Wang

, et al. EGO-planner: an ESDF-free gradient-based local planner for quadrotors. IEEE Robot Autom Lett 2020; 6: 478–485.

Zhou

Zhu

Zhou

, et al. EGO-swarm: a fully autonomous and decentralized quadrotor swarm system in cluttered environments. In: 2021 IEEE international conference on robotics and automation (ICRA), 2021, pp.4101–4107. IEEE.

10.

Zhou

Wen

Wang

, et al. Swarm of micro flying robots in the wild. Sci Robot 2022; 7: eabm5954.

11.

Kondo

Tewari

Peterson

, et al. PUMA: fully decentralized uncertainty-aware multiagent trajectory planner with real-time image segmentation-based frame alignment. In: 2024 IEEE International conference on robotics and automation (ICRA), 2024, pp.13961–13967. IEEE.

12.

Wang

Zhang

Sun

, et al. Swift: a distributed one-stage planner for efficient multi-quadrotor trajectory optimization. IEEE Trans Autom Sci Eng 2025; 22: 20951–20965.

13.

Chen

Wan

, et al. Octocache: caching voxels for accelerating 3D occupancy mapping in autonomous systems. In: Proceedings of the 30th ACM international conference on architectural support for programming languages and operating systems, Volume 2, 2025, pp.704–718.

14.

Han

Gao

Zhou

, et al. Fiesta: fast incremental Euclidean distance fields for online motion planning of aerial robots. In: 2019 IEEE/RSJ International conference on intelligent robots and systems (IROS), 2019, pp.4423–4430, IEEE.

15.

Zhou

Gao

Wang

, et al. Robust and efficient quadrotor trajectory generation for fast autonomous flight. IEEE Robot Autom Lett 2019; 4: 3529–3536.

16.

Xiu

Zhan

, et al. Vision-aided UAV navigation and dynamic obstacle avoidance using gradient-based b-spline trajectory optimization. In: 2023 IEEE International conference on robotics and automation (ICRA), 2023, pp.1214–1220. IEEE.

17.

Zhu

Alonso-Mora

. Chance-constrained collision avoidance for MAVs in dynamic environments. IEEE Robot Autom Lett 2019; 4: 776–783.

18.

Lefkopoulos

Kamgarpour

. Using uncertainty data in chance-constrained trajectory planning. In: 2019 18th European control conference (ECC), 2019, pp.2264–2269. IEEE.

19.

Şenbaşlar

Hönig

Ayanian

. RLSS: real-time, decentralized, cooperative, networkless multi-robot trajectory planning using linear spatial separations. Autonom Robot 2023; 47: 921–946.

20.

Lee

Duan

Patil

, et al. Sigma Hulls for Gaussian belief space planning for imprecise articulated robots amid obstacles. In: 2013 IEEE/RSJ International conference on intelligent robots and systems, 2013, pp.5660–5667. IEEE.

21.

Castillo-Lopez

Sajadi-Alamdari

Sanchez-Lopez

, et al. Model predictive control for aerial collision avoidance in dynamic environments. In: 2018 26th Mediterranean conference on control and automation (MED), 2018, pp.1–6. IEEE.

22.

Sun

Dai

Wang

. An efficient moving obstacle avoidance scheme for UAVs via output feedback robust MPC. IEEE Trans Aeros Electr Syst 2024; 60: 6199–6212.

23.

Zhang

Wang

, et al. Model predictive formation tracking-containment control for multi-UAVs with obstacle avoidance. IEEE Trans Syst Man, Cybernet: Syst 2024; 54: 3404–3414.

24.

Zhang

Pan

, et al. Toward scalable multirobot control: fast policy learning in distributed MPC. IEEE Trans Robot 2025; 41: 1491–1512.

25.

Deng

Dong

, et al. DPMPC-planner: a real-time UAV trajectory planning framework for complex static environments with dynamic obstacles. In: 2022 International conference on robotics and automation (ICRA), 2022, pp.250–256. IEEE.

26.

Jian

Yan

Lei

, et al. Dynamic control barrier function-based model predictive control to safety-critical obstacle-avoidance of mobile robot. In: 2023 IEEE International conference on robotics and automation (ICRA), 2023, pp.3679–3685. IEEE.

27.

Lin

Zhu

Alonso-Mora

. Robust vision-based obstacle avoidance for micro aerial vehicles in dynamic environments. In: 2020 IEEE International conference on robotics and automation (ICRA), 2020, pp.2682–2688, IEEE.

28.

Castillo-Lopez

Ludivig

Sajadi-Alamdari

, et al. A real-time approach for chance-constrained motion planning with dynamic obstacles. IEEE Robot Autom Lett 2020; 5: 3620–3625.

29.

Tordesillas

How

. MADER: trajectory planner in multiagent and dynamic environments. IEEE Trans Robot 2021; 38: 463–476.

30.

Kondo

Tordesillas

Figueroa

, et al. Robust MADER: decentralized and asynchronous multiagent trajectory planner robust to communication delay. In: 2023 IEEE International conference on robotics and automation (ICRA), 2023, pp.1687–1693. IEEE.

31.

Bhatt

Khajepour

Hashemi

. MPC-PF: socially and spatially aware object trajectory prediction for autonomous driving systems using potential fields. IEEE Trans Intell Transport Syst 2023; 24: 5351–5361.

32.

Fliess

Lévine

Martin

, et al. Flatness and defect of non-linear systems: introductory theory and examples. Int J Control 1995; 61: 1327–1361.

33.

Wang

Zhou

, et al. Geometrically constrained trajectory optimization for multicopters. IEEE Trans Robot 2022; 38: 3259–3278.

34.

Han

. From PID to active disturbance rejection control. IEEE Trans Indust Electron 2009; 56: 900–906.

35.

Liu

Nocedal

. On the limited memory BFGS method for large scale optimization. Mathemat Program 1989; 45: 503–528.

36.

Andersson

Gillis

Horn

, et al. Casadi a software framework for nonlinear optimization and optimal control. Mathemat Program Comput 2019; 11: 1–36.

37.

Zhou

Miao

Chen

, et al. Human-behaviour-based social locomotion model improves the humanization of social robots. Nature Mach Intell 2022; 4: 1040–1052.

38.

Zhu

Zeng

Yao

, et al. DVRP-MHSI: dynamic visualization research platform for multimodal human-swarm interaction, 2025.