Abstract
Cooperative path planning for unmanned aerial vehicle (UAV) systems is challenged by complex kinematic constraints, coordination constraints, and maintaining optimized formations. This work characterizes mission-oriented performance constrained formation and integrates them into cooperative path planning under multiple constraints. A mission-oriented performance constrained formation model covering reconnaissance, penetration, damage and communication capabilities is proposed, and the simulated annealing particle swarm optimization algorithm (SAPSO) is used for optimization to achieve the desired formation. A cooperative path planning algorithm based on the radau pseudospectral method (RPM) is proposed, by converting the optimal control problem into a nonlinear programming problem, an cooperative path planning method under multiple constraints is realized. Simulations of a UAV formation validate that the proposed approach generates smooth, collision-free trajectories that maintain the optimized formation while satisfying all kinematic, performance, and cooperative constraints.
Keywords
Introduction
Multi-UAV Systems can be applied to cooperative path planning in complex environments to accomplish missions that are difficult for a single UAV to complete.1–3 Multi-UAV cooperative path planning is a research field that has emerged in recent years, involving the collaboration between multiple UAVs to achieve path planning under multiple constraints. The multi-UAV path planning problem is characterized by a high degree of complexity and numerous challenges, primarily manifested in environmental complexity and the need for cooperative operations. These challenges require ensuring that the UAVs can coordinate with each other, avoid collisions during flight, and maintain a desired formation to maximize operational effectiveness.4–8
For single-UAV path planning, several classic algorithms have been proposed.9–11Compared to the single-UAV case, the cooperative path planning of a multi-UAV system must satisfy additional constraints such as communication topology, collision avoidance, and formation maintenance, which poses significant challenges. In recent years, several methods for multi-UAV cooperative path planning have been introduced, with common approaches including the artificial potential field method,12–14 graph search methods,15–17 heuristic algorithms,18–21 reinforcement learning methods,22,23 and model predictive control.24,25 However, the aforementioned methods for formation flight path planning, obstacle avoidance, and cooperative control often model the UAVs as point masses. This simplification ignores the 6 degree-of-freedom(6-DOF) dynamic characteristics of the UAVs as well as the effects of unavoidable lumped disturbances, leading to a certain degree of deviation from the complexities of real-world multi-UAV formation flight scenarios.
The pseudospectral method is an effective numerical technique for solving optimal control problems. 26 Due to its advantages, such as convenient constraint handling, rapid convergence, high accuracy, and relatively low sensitivity to initial values,27,28 it has been widely applied in the trajectory optimization of hypersonic vehicles during their climb, glide, and re-entry phases,29–31 the trajectory optimization of combined-cycle powered vehicles,32,33 and missile guidance trajectories. 34 It is evident that the pseudospectral method has achieved significant success in the field of vehicle trajectory optimization, but it has not yet been applied to solve multi-UAV cooperative trajectory optimization problems. Current research using pseudospectral methods for multi-UAV trajectory optimization has not yet considered key challenges such as cooperative constraints, obstacle avoidance strategies, and formation keeping, making it difficult to meet the collaborative mission requirements of practical flight. Regarding combat formations, traditional design approaches primarily evaluate formation metrics using methods like bi-level programming models35,36 and potential field models. 37 These methods rely on inter-UAV distances and situational assessment for formation selection. Although their models are simple and easy to implement, they fail to effectively integrate weapon system performance with operational effectiveness, making it difficult to guarantee the optimality of the formation for mission efficiency and safety.
To address the above challenges, this paper investigates multi-UAV cooperative path planning using the RPM integrated with a formation optimized for mission-oriented formation selection. To overcome the multi-constraint challenges posed by complex dynamics and cooperative relationships, a multi-objective path planning approach based on RPM is proposed, which transforms the problem into a nonlinear programming formulation under multiple constraints and achieves both high solution accuracy and computational efficiency. In addition, an evaluation model is established to assess formation performance under mission-oriented constraints, and the SAPSO algorithm is employed to solve the formation optimization problem, thereby enhancing cooperative combat capability. Simulation results demonstrate that the proposed method, RPM based cooperative path planning with mission-oriented formation optimization, can simultaneously satisfy dynamic and cooperative constraints, while effectively designing and maintaining formations with superior operational effectiveness, thereby validating the methodology.
The innovations of this paper include:
Aiming at the key performance indicators of multi-UAV cooperative operations,a formation optimization modeling method based on mission performance constraints is proposed, and the SAPSO algorithm is used to achieve the desired formation design, providing a theoretical basis for the formation design of multi-UAVs. To address the inadequacy of existing nonlinear programming models in representing multi-UAV cooperation, a model capable of characterizing multi-UAV cooperative relationships is established, which provides a complete representation of cooperative constraints such as synchronous arrival, formation keeping, and inter-agent collision avoidance. In order to overcome the insufficient representation of the real 6-DOF dynamics of UAV modeling in multi-UAV path planning, a multi-constraint RPM is proposed, which comprehensively considers dynamic equations constraints, cooperative relationships, and the mission-oriented formation keeping.
The content of this paper is arranged as follows. Section 2 introduces the UAV and formation modeling and the related problems. The mission-oriented performance constrained formation design using the SAPSO optimization algorithm is proposed in Section 3. Section 4 presents the multi-UAV cooperative path planning algorithm based on the RPM. Simulation studies are demonstrated in Section 5. The conclusion and future prospects are discussed in Section 6.
Problem statement
UAV and formation modeling
The dynamic modeling of UAV primarily describes its navigational motion in three-dimensional space, represented by changes in position, velocity, and orientation. Let
where the subscript i and c denote the i -th UAV within a multi-UAV formation and the reference commands. The variables x , y , and z constitute the position vector of the UAV relative to an inertial ground frame. V represents the velocity.
The aerodynamic forces can be expressed as:
The modeling of a multi-UAV formation is primarily developed based on the relative kinematic relationships between any two aircraft within the formation. This paper adopts the leader-follower structure for formation modeling due to its advantages, which include a simple architecture and strong robustness, as well as its ability to simplify control design based on the positional relationships among the UAVs. A leader-follower formation model is typically realized by establishing the relationship between a single leader aircraft and multiple follower aircraft. The leader is responsible for guiding the formation's overall motion, while the followers maintain the formation by tracking the leader's trajectory. In this configuration, the tracking model for the leader is consistent across all followers. Consequently, the collective motion of the multi-UAV formation can be analyzed by examining the relative motion between a single follower and the leader. The leader-follower structure of the multi-UAV formation is shown in Figure 1.

Leader-follower formation structure.
In the figure, the subscripts L and F denote the leader and follower UAVs, respectively. The
Based on the geometric relationships shown in the figure, the transformation of the relative longitudinal and lateral positions,
Simultaneously considering the relative altitude relationship between the UAVs in the formation, let
Mission-oriented performance constrained formation framework
Multi-UAV formation design refers to the selection of an appropriate formation configuration to ensure mission efficiency and safety. Traditional approaches utilize inter-UAV distance and situational assessment as the basis for formation selection, are characterized by their simplicity and ease of implementation. However, they are deficient in effectively integrating weapon systems’ performance with operational effectiveness, thereby struggling to guarantee the optimality of the formation when complex mission-oriented formation selection is required.
This paper proposes a multi-UAV formation model based on mission-oriented formation selection. The model encompasses four key aspects: cooperative detection capability, cooperative maneuvering penetration capability, target destruction capability, and communication command capability. The specific components of this formation evaluation model are detailed as follows:
The cooperative detection capability: It refers to the capacity for comprehensive and precise awareness, ensuring the UAV formation can accurately ascertain the distribution of targets, threat status, and dynamic changes within the operational area. The cooperative maneuvering penetration capability: It represents the formation's ability to evade enemy defense systems through agile maneuvering, ensuring the multi-UAV system can adapt to complex operational environments and enhance its survivability. The target destruction capability: It describes the capacity for precision strikes against designated targets, ensuring the combat effectiveness of the multi-UAV formation during the execution of coordinated strike missions. The communication command capability: It signifies the ability of individual UAVs within the formation to maintain real-time, reliable communication, which serves as the foundation for the entire formation to sustain cohesive and coordinated flight.
Based on the preceding analysis, the structure of the mission-oriented performance constrained UAV formation evaluation model is illustrated in Figure 2.

The mission-oriented performance constrained UAV formation evaluation model.
Multi-UAV cooperative path planning framework
Cooperative path planning for multi-UAV systems aims to enable formation flight while adhering to a set of constraints, including dynamic characteristics, performance limitations, and no-fly zones. To ensure the formation exhibits high performance and efficiency throughout the path planning process, specific optimization objectives must be satisfied. Furthermore, to achieve synchronous and safe arrival in a designated formation, the cooperative relationships among the UAVs must be maintained.
Therefore, multi-UAV cooperative path planning must satisfy three fundamental requirements: safety, efficiency, and coordination. Safety, the paramount consideration, involves respecting the UAVs’ dynamic and performance constraints to operate within safe operational envelopes, adhering to no-fly zone restrictions, and ensuring continuous collision avoidance between the formation and any obstacles. Efficiency aims to enhance overall operational effectiveness by optimizing for metrics such as minimum flight distance or minimal control effort, identifying optimal paths that minimize flight time and expenditure while adhering to all safety criteria and mission objectives. Coordination demands that all UAVs in the formation cooperate to prevent inter-agent collisions and to arrive at the target location synchronously in the desired formation.
To fulfill these requirements, the modeling of multi-UAV cooperative path planning must comprehensively account for the dynamic characteristics, performance limitations, and no-fly zone constraints of each formation member, as well as the optimization objectives and cooperative relationships of the group.
Formation design for mission-oriented formation selection using the SAPSO optimization algorithm
This section, guided by actual mission requirements, establishes a comprehensive evaluation model for multi-UAV combat formations based on mission-oriented formation selection. This model incorporates four key capabilities: cooperative reconnaissance, maneuver and penetration, target destruction, and command and communication.
Subsequently, the section introduces the SAPSO optimization algorithm. This algorithm is then integrated with the formation evaluation model to solve for and determine the desired formation layout. This approach facilitates a multi-UAV formation design that is explicitly optimized for mission-oriented formation selection.
Modeling of the mission-oriented performance constrained formation
This subsection proposes an evaluation model for multi-UAV formations based on mission-oriented formation selection. The model comprises four key aspects: cooperative reconnaissance capability, maneuver and penetration capability, target destruction capability, and command and communication capability.
Building upon the relative positions
Radar detection width
Radar detection width refers to the maximum lateral distance at which a multi-UAV formation can detect a target. A greater detection width indicates that the radar system can cover a larger area, thus possessing a greater capability to detect and identify more targets. The detection volume of a radar is typically modeled as a conical region in three-dimensional space, where the maximum detection range is represented by the slant height of the cone

Schematic diagram of radar detection width.
To maximize the detection width and avoid significant blind zones, the multi-UAV formation should adopt a more concentrated or compact pattern, ensuring that the distances between members are not too dispersed. Therefore, the evaluation model for radar detection width can be expressed as
According to this radar detection width model, it is necessary to prevent the spacing between formation members from becoming so large that discontinuous blind zones appear in the detection coverage. From Equation (11), it follows that if the lateral distance between the leader and follower exceeds the sum of the individual members’ detection widths, the value of the radar detection width modeling equation is set to 1, according to the normalization method.
Radar detection depth
Radar detection depth signifies the maximum longitudinal distance at which the multi-UAV formation can detect a target. A greater detection depth implies that the radar system can cover a larger area, thus possessing a greater capability to detect and identify more targets. Following the same modeling approach as for the radar detection width, the evaluation model for radar detection depth can be expressed as
In the process of modeling both radar detection width and depth, the slant height of the UAV's conical radar detection area is equivalent to the maximum radar detection range,
While Equation (13) analyzes the impact of various radar system parameters on the maximum search radius, the formation evaluation model designed in this paper primarily focuses on analyzing the effect of this maximum search radius on the multi-UAV formation's geometry. Therefore, the evaluation models designed in Equation (12) and previously in Equation (11) represent a further stage of research that commences after the maximum radar search radius has already been determined from considerations of factors such as system delay time and the parameters in the radar range equation.
Maneuvering safety distance
In multi-UAV formation operations, the relative distance between members is critically important, as it directly affects whether their maneuvers interfere with one another. When a formation executes an obstacle avoidance task, it often requires agile, high-g maneuvering. Therefore, it is essential to ensure that the relative distance between any two members in the formation is greater than the UAV's minimum maneuvering radius. This guarantees that members do not impede each other during maneuvers, effectively preventing inter-agent collisions and ensuring flight safety. The evaluation model for maneuvering safety distance can be expressed as
Effective coverage area
The methods employed by UAVs to neutralize targets are not limited to traditional munition delivery and missile interception. Low-cost, direct-impact kinetic strikes can also be used to inflict direct physical damage and achieve high kill effectiveness. Therefore, in a departure from considerations for traditional fighter aircraft, a multi-UAV formation in a cooperative strike mission must consider the total size of the area it can effectively cover, rather than focusing solely on weapon factors like missile range and attack precision. A larger coverage area corresponds to a stronger capability to destroy targets. By evaluating this effective kill area, the attack capability of the multi-UAV formation and its potential for target destruction can be quantified.
Therefore, the evaluation model for the weapon's effective coverage area can be expressed as
Communication response time
Communication response time is a critical factor for the command and communication capability of a multi-UAV formation. A long response time leads to communication latency, which can cause a lack of information synchronization between the leader and follower aircraft. This asynchrony may result in the formation losing control or becoming unstable, and could even lead to collisions between UAVs.
For a given multi-UAV formation, the relationship between its command and communication capability and its geometric parameters primarily depends on the communication distance between the leader and the farthest follower. The shorter this distance, the shorter the response time and the stronger the communication link for the followers tracking the leader. The evaluation model for communication response time can be expressed as
Formation design based on the SAPSO optimization algorithm
The SAPSO algorithm employed in this paper is an adaptive optimization method based on Particle Swarm Optimization (PSO) and Simulated Annealing (SA), designed to address the inherent problems of using either PSO or SA alone. Traditional PSO algorithms are prone to premature convergence, often resulting in a local optimum. Conversely, the precision and speed of the SA algorithm are highly dependent on the initial temperature setting and the cooling schedule, leading to poor optimization time performance. To overcome these issues, this paper utilizes the SAPSO algorithm, which combines PSO and SA.
The objective of the SAPSO algorithm is to leverage the global search capabilities of PSO and the local search capabilities of SA, allowing them to complement each other to enhance both the accuracy and speed of the optimization. In this manner, the SAPSO algorithm overcomes the respective limitations of standalone PSO and SA, offering a more effective solution for optimization problems.
A critical step in the SAPSO algorithm is the use of the Metropolis acceptance criterion to determine whether to accept the iterative solution for each particle in each generation. This criterion governs the probability of accepting a new solution, thereby facilitating a more effective search of the global solution space. When a new solution is superior to the current one, it is always accepted. When a new solution is inferior, it still has a certain probability of being accepted according to the Metropolis criterion, which helps the algorithm avoid becoming trapped in a local optimum. If a new solution is not accepted, the particle is iterated again until the algorithm terminates, yielding the optimal solution.
Additionally, a constriction factor is introduced in the SAPSO algorithm. Its function is to eliminate the need for boundary limits on velocity (velocity clamping), permitting particles a greater range of movement during the search process. This helps to augment the exploration of the search space and improves the algorithm's global search capability.
To ensure the algorithm's convergence, it is necessary to select appropriate parameters. This includes adjusting the acceptance probability within the Metropolis criterion and controlling the degree of influence of the constriction factor. Through rational parameter selection, a balance between the accuracy and speed of the SAPSO algorithm can be achieved during the search process, and the need for boundary limits on velocity can be eliminated.
First, the update equations for the position and velocity of the PSO algorithm incorporating a constriction factor are introduced as follows:
where
As can be seen in the velocity update Equation (17), the standard algorithm uses the global best position,
Therefore, the velocity update Equation (17) can be transformed to
By incorporating the SA algorithm, the personal best position (
Finally, the selection of
Having detailed the design of the SAPSO algorithm, the establishment of the optimization algorithm's objective function will now be specified. The design and performance of an optimization algorithm are directly influenced by the characteristics and requirements of its objective function, making the objective function a core component of the algorithm. A well-designed objective function can provide effective guidance for the optimization, enabling the algorithm to converge more rapidly to the optimal solution.
The optimization objective of this paper is to design a multi-UAV formation that maximizes operational effectiveness. Assuming a multi-UAV formation composed of N aircraft, and based on the relative positions
Using the optimization parameters defined above, and in conjunction with the mission-oriented performance constrained formation model proposed in Section 3.1, the objective function for the formation optimization algorithm is established as
To conclude the discussion of the SAPSO optimization algorithm design, Figure 4 provides a visual depiction of the SAPSO algorithm's workflow, summarizing the entire process.

Flowchart of the SAPSO algorithm.
Multi-UAV cooperative path planning algorithm based on the RPM
Cooperative path planning for multiple UAVs must account for a variety of factors, including performance constraints, synchronous arrival, obstacle avoidance, and performance optimization. Traditional path planning methods are often unable to find optimal solutions due to the problem's complexity and computational inefficiency. The RPM, however, is a numerical technique designed for solving nonlinear dynamic systems that can effectively address the nonlinear, multi-constraint, and multi-objective optimization challenges inherent in path planning.
This method is well-suited for the demands of multi-UAV cooperative path planning, such as handling dynamic equation constraints, path planning with obstacle avoidance, and ensuring simultaneous arrival, which enhances the adaptability of the path planning to various scenarios. Furthermore, the RPM utilizes high-order polynomials to approximate the path curves, yielding smoother and more continuous flight trajectories that are better aligned with the mission requirements of UAVs.
The fundamental concept of the RPM is to discretize the state and control variables at a set of Legendre-Gauss-Radau (LGR) collocation points. These discrete points are then used as nodes to construct Lagrange interpolation polynomials that approximate the state and control trajectories. By differentiating the global interpolation polynomial for the state, the time derivative of the state is approximated, thereby transforming the system's differential equations into algebraic constraints.
Concurrently, integral terms within the performance index or control effort are calculated using Gauss-Legendre quadrature, and the terminal state is determined through the integration of the system dynamics from the initial state.
Through this transformation, the multi-UAV cooperative path planning problem is converted into a parameter optimization problem subject to a series of algebraic constraints, which can be solved as a Nonlinear Programming (NLP) problem.
Modeling for multi-UAV cooperative path planning
The modeling of multi-UAV cooperative path planning requires consideration of the dynamic characteristics, performance constraints, no-fly zone constraints, optimization objectives, and cooperative relationships for each member of the formation. These components are specified as follows:
Dynamic Equations Constraints
The dynamic characteristics of the multi-UAV formation are described by the dynamic model presented previously in Equations.(1)-(6). This ensures that the generated trajectories are dynamically feasible and continuously adhere to the inherent flight dynamics of the UAVs. By manipulating inputs such as the aerodynamic angles, which include the angle of attack and sideslip angle, and engine thrust, the changes in each UAV's position, velocity, and orientation in three-dimensional space are controlled.
Performance constraints
To account for the structural integrity and performance stability of the UAV, its performance constraints typically include load factor constraints and control input constraints.
The structural strength of a UAV directly impacts its flight safety and durability, particularly during agile, high-g maneuvers which cause an increase in the load factor. If the aircraft's structural strength is insufficient to withstand these high loads, it can lead to structural damage, fracture, or failure, potentially resulting in accidents. Therefore, including a load factor constraint is a critical element for ensuring flight safety. The constraint on the load factor n can be expressed as:
The UAV's angle of attack, sideslip angle, and flight path bank angle serve as the control command inputs for trajectory planning and directly influence the aircraft's stability. The angle of attack command affects the UAV's lift and drag; an excessive angle of attack can lead to an aerodynamic stall. Similarly, changes in the sideslip angle impact the UAV's lateral stability, and an excessive sideslip angle can cause a lateral loss of control. The flight path bank angle command input ensures stable turns and good maneuverability. Therefore, the constraints on these control inputs and their rates of change can be expressed as:
No-fly zone constraints
In complex operational environments, it is imperative to fully consider constraints imposed by various no-fly zones to ensure the safety of the UAVs and the success rate of the mission. These primarily include constraints related to physical obstacles and enemy radar and artillery threats.
Physical flight obstacles are typically modeled as infinitely tall cylinders. The constraint for avoiding such an obstacle can be expressed as:
Radar and artillery systems constitute primary threats from enemy weaponry. Specifically, enemy radar systems can detect friendly UAVs, while artillery systems can engage them. Therefore, it is necessary to consider these threats comprehensively to ensure the UAVs can effectively evade enemy detection and attack. These radar and artillery threats are modeled as hemispherical zones, and the avoidance constraint can be expressed as:
Optimization objective
In trajectory planning, the selection of an optimization objective depends on the specific mission requirements and performance indices. Common objectives include minimizing path length, minimizing flight time, and minimizing control effort. Considering the advantages of minimizing control effort in cooperative multi-UAV flight, such as extending flight endurance, reducing operational costs, this study adopts control effort minimization as the primary optimization objective.
Therefore, the objective function is formulated as a quadratic form with respect to the control variable u, can be expressed as:
Cooperative constraints
In multi-UAV path planning, cooperative constraints typically include safety distance constraints, formation-keeping constraints, and synchronous arrival requirements. Unlike single-vehicle flight, a primary concern in cooperative multi-UAV path planning is maintaining a safe separation distance between vehicles to ensure flight safety. To address this, a safety distance constraint is formulated to guarantee that the distance between any two UAVs remains greater than a prescribed minimum safety threshold throughout the trajectory. This constraint effectively ensures collision-free cooperative flight. The multi-UAV safety distance constraint can be expressed as:
In cooperative multi-UAV trajectory planning, the desired formation selected based on mission requirements ensures both efficiency and safety. The design of this formation, based on mission-oriented formation selection, was completed in Section 3. Therefore, the path planning will be based on this optimal geometry, and a formation keeping constraint is imposed to ensure the stability and accuracy of the formation throughout the flight. Based on the previously established formation model, the formation keeping constraint can be expressed as:
where the subscript L denotes the leader aircraft, and
Synchronous arrival requires that all UAVs reach their designated target points at the same location and time, which is critical for mission coordination and maximizing operational effectiveness. To achieve this, the cooperative strategy begins with a time prediction step, in which the feasible arrival times of each UAV at its target position are estimated. Through optimization, the minimum and maximum feasible flight times of the i-th UAV, denoted as
Following this, a time coordination step uses these individual feasible intervals
Finally, in the cooperative trajectory optimization phase, a single terminal time,
Next, building upon the established model for multi-UAV cooperative path planning, a study of the trajectory optimization method using the RPM will be conducted.
Optimal control discretization via the RPM
The RPM employs LGR collocation points to discretize the optimal control problem into a NLP problem. For multi-UAV cooperative path planning, the problem can thus be reformulated as an optimal control problem with multiple constraints. Accordingly, the procedure for transforming the path planning problem into an NLP is as follows.
First, collocation points are selected. In RPM, optimization is performed using LGR points. Suppose that Z collocation points are chosen; then, the LGR points are defined as the roots of the polynomial
According to the flight conditions in trajectory optimization, let
As described above, the path planning time interval is
In RPM, the state variable
It should be noted that the basis polynomials
By differentiating the state approximation with respect to
In summary, to handle the dynamic equations constraints, the differential equations governing the dynamics are transformed into a set of discrete algebraic equality constraints, represented at each collocation point as
Path constraints are conditions that restrict the behavior of the multi-UAV system during its trajectory. The performance constraints, no-fly zone constraints, and safety distance constraints, which were discussed in detail previously, all fall into the category of inequality path constraints. These constraints can be approximated in the following general form, which must hold at each collocation point:
Boundary constraints impose specific requirements on the initial and terminal points of the trajectory to ensure feasibility and to meet mission or system objectives. The synchronous arrival requirement, as discussed previously, is enforced using boundary constraints. This can be expressed in a general form as:
The optimization objective designed previously, after discretization using the pseudospectral method, can be approximated as:
In summary, the workflow of the multi-UAV cooperative path planning algorithm based on the RPM is depicted in the flowchart in Figure 5. The algorithm follows a three-stage process. The first stage involves initializing the problem constraints, such as the initial positions and the dynamic models. In the second stage, the algorithm solves for the optimal trajectory of each UAV individually, subject to its respective dynamic, no-fly zone, and performance constraints, while optimizing for the primary objective. In the final stage, cooperative constraints—including safety distances, formation keeping, and synchronous arrival—are introduced to refine the individual paths into a set of multi-constraint, coordinated trajectories for the entire formation.

Flowchart of the cooperative path planning algorithm using the RPM.
Mathematical simulation
Numerical implementation
The multi-UAV cooperative path planning problem as described in the previous sections is solved using RPM using the numerical values for the parameters shown in Table 1. The optimization is transcribed into a large-scale nonlinear programming (NLP) problem via the RPM integration method, and is subsequently solved using the NLP solver IPOPT. The simulations are executed on a standard personal computer equipped with an [Intel Core i7-12700 K @ 3.60 GHz] and [32 GB RAM].
Parameters used in the simulation.
For the pseudospectral discretization setting, a dynamic hp-adaptive mesh refinement strategy is adopted,
38
with a relative error tolerance of
For the numerical optimization settings, the NLP error tolerance is set to
The initial positions of the UAV formation are fixed, but in the cooperative formation trajectory planning problem, the terminal position of each UAV is uncertain; only the terminal position of the UAV at the center of the formation is determined by the target position. The initial and terminal positions for the multi-UAV cooperative path planning scenario are specified in Table 2. Simultaneously, the initial velocity of each UAV in the formation is set to 200 m/s, while the terminal velocity is uncertain. For the remaining states, specifically the climb angle
The initial and terminal positions for multi-UAV trajectory planning.
The initial guesses for each UAV.
Simulation results and analysis of the formation optimalization
To validate the effectiveness of the multi-UAV formation design, which is based on mission-oriented formation selection, a five-UAV formation flight scenario was designed.
The simulation parameters were set as follows: a radar detection radius of
Using the mission-oriented formation selection model and the SAPSO optimization algorithm, the desired formation relative positions were obtained and are presented in Table 4. In this formation, UAV1 is designated as the leader, while UAV2 through UAV5 are designated as the followers.
Optimized relative positions for the desired multi-UAV formation.
The response of the SAPSO optimization fitness function and the resulting optimized formation are shown below.
As can be seen from Figure 6.(a), the optimization algorithm converges after 50 iterations, which indicates that the formation optimized by SAPSO, based on the mission-oriented performance constrained model, has achieved maximum operational effectiveness. Based on the relative positions of the leader and followers in the formation, as shown in Table 4, and in conjunction with the leader-follower multi-UAV formation structure from Figure 1, the resulting optimized UAV formation is depicted in Figure 6.(b). In this figure, the dashed lines outline the shape of the multi-UAV formation. It is evident that this diamond-shaped UAV formation is the optimal configuration for a reconnaissance and strike scenario. Subsequently, this optimized diamond formation will be used as the input for future research on cooperative guidance and control.

Formation optimization simulation results: (a) fitness function curve; (b) formation illustration.
Path planning simulation results and analysis
The coordinates of the radar and artillery threats for the simulation are provided in Table 5.
Location of radar and artillery threats.
The coordinates of the flight obstacle zone are given in Table 6.
Location of the flight obstacle zone.
In the simulation scenario, UAV1 is designated as the leader, with the remaining aircraft acting as followers. The formation constraint is defined by the desired relative positions from the desired formation shown in Table 4. The simulation results for the multi-UAV cooperative path planning are presented below.
As illustrated in Figure 7, which show the 3D and X-Y plane trajectories for the multi-UAV cooperative path planning simulation, respectively, all members of the formation are able to arrive at the designated location in the specified formation despite starting from different initial states. The results demonstrate that the formation effectively avoids the flight obstacle zone and the radar/artillery threats. Furthermore, the trajectories are smooth and do not violate the safety distance constraints, which validates the effectiveness of the cooperative trajectory planning, obstacle avoidance, and overall cooperative design.

Position trajectories of the cooperative path planning with five UAVs: (a) three-dimensional; (b) X–Y plane.
Figure 8 shows the response curves for the position states x,

Position states of the cooperative path planning with five UAVs: (a) x; (b) y; (c) z.
Figure 9 depict the response curves for each UAV's velocity V, climb angle

Intermediate states of the cooperative path planning with five UAVs: (a) v; (b)
The responses of the control inputs and constraints for the multi-UAV cooperative path planning are shown in Figure 10–11. These results show that the control commands for each UAV in the cooperative path planning scenario exhibit a rapid response. The state responses for the flight path bank angle

Control inputs of the cooperative path planning with five UAVs: (a)

Constrains of the cooperative path planning with five UAVs: (a)
Figure 11. (d) illustrates the load factor n response for each member of the multi-UAV formation. As can be seen, the load factor throughout the planned trajectories does not exceed the constraint value, indicating a good dynamic response and ensuring the safety of the cooperative flight. Meanwhile, the oscillation period of the load shown in the figure is usually around 3–5 s, which is less than 1 Hz, completely within the response bandwidth of the actuator, and will not damage the structure of the UAV. This demonstrates the effectiveness of the performance constraint design within the cooperative path planning framework.
Figure 12 present the relative distances between each UAV and all other members of the formation throughout the cooperative path planning simulation. As shown in the plots, each member of the multi-UAV formation successfully maintains the minimum safe separation distance from all other aircraft during the entire trajectory, thereby satisfying the flight safety distance constraint.

Relative distance between the specific and all other formation members:
To quantitatively evaluate the effectiveness and rigorous constraints satisfaction of the proposed multi-UAV cooperative path planning, several key numerical metrics are extracted from the optimization results and summarized in Table 7.
Quantitative evaluation metrics for each UAV.
As shown in Table 7, the uniform flight times clearly verify the successful satisfaction of the simultaneous arrival constraint. Although the entire mission involved maneuvering around no-fly zones, the minimum inter-UAV distance recorded during the flight was 39.78 m, strictly satisfying the 30 m collision avoidance threshold. Similarly, the clearance distances to physical obstacles and radar/artillery threat areas remained sufficiently large, ensuring the safety of the formation flight. These quantitative indicators comprehensively demonstrate the feasibility and cooperative performance of the proposed algorithm.
In conclusion, the simulation results verify that the proposed approach, based on the RPM, can simultaneously satisfy multiple objectives and constraints. These include the dynamic equations constraints, performance limitations, obstacle avoidance, minimum control effort, safety distance requirements, and formation keeping constraints. Ultimately, the method successfully achieves synchronous arrival at the destination, which demonstrates the effectiveness of the multi-objective cooperative path planning methodology designed in this paper.
To demonstrate the algorithmic superiority, the proposed RPM is compared against Particle Swarm Optimization (PSO) 20 and Model Predictive Control (MPC) 24 under identical dynamic models and constraints. It is important to emphasize that the cited PSO algorithm integrates a violation function into its cost evaluation to effectively handle constraints, such as dynamic equations and performance boundaries, thus ensuring a fair comparison. The simulation results are illustrated in Figures 13–15.

3D trajectories of the cooperative path planning with five UAVs using different methods:


As Shown in Figures 13–15, while the PSO algorithm maintains geometric avoidance boundaries, the angle of attack exhibits severe high-frequency chattering, yielding an unsmooth flight trajectory. Although the MPC generates smooth trajectories and chatter-free states, its flight path bank angle varies considerably, resulting in aggressive maneuvering. In contrast, the proposed RPM effectively executes collision avoidance while strictly adhering to both the dynamic equations and performance constraints. The resulting trajectories and states remain perfectly smooth, strictly conforming to principles of actual flight.
To quantitatively demonstrate the algorithm's advantages, Table 8 summarizes the relevant state indices of different methods. The analysis results show that, compared with the PSO and MPC methods, the proposed RPM algorithm achieves a shorter total flight time and a lower average flight time for formation members. These quantitative indices fully demonstrate the effectiveness and feasibility of the proposed RPM algorithm.
Comparison of metrics from different algorithms.
To further evaluate the practical applicability of the proposed RPM-based multi-UAV cooperative path planning algorithm, a computation efficiency and scalability analysis is conducted. The empirical computation times for different UAV formation scales are summarized in Table 9.
The empirical computation times for different UAV formation scales.
As illustrated in Table 9, the computation time exhibits an upward trend with the increasing number of UAVs. The fundamental bottleneck regarding the scalability of this method is the combinatorial explosion of constraints associated with the growing number of UAVs. For example, regarding the cooperative safety distance constraint, according to Equation (30), the collision avoidance requirement between any two distinct UAVs (i.e.,
Conclusion and future work
This paper studies the multi-UAV cooperative path planning that integrates formation design optimized for mission-oriented performance constraints and multi-constraint in detail. A mission-oriented performance constrained formation model was established, accounting for cooperative detection, cooperative maneuvering penetration, target destruction, and communication command capability, and the SAPSO algorithm was employed to derive desired formation. Building on this foundation, a cooperative path planning algorithm based on the RPM was proposed, transforming a complex optimal control problem with 6-DOF dynamics, performance bounds, no-fly zones, formation maintenance, and synchronous arrival into a nonlinear programming problem. Simulations verified that the proposed approach generates smooth, safe trajectories satisfying all dynamic and cooperative constraints, while consistently preserving optimized formation geometry and ensuring synchronous arrival. Future research will extend the approach to more dynamic scenarios with moving obstacles, real-time replanning, and heterogeneous swarms, with practicality and robustness validated through hardware-in-the-loop simulations and outdoor multi-UAV flight experiments.
Footnotes
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data availability statement
The data that support the findings of this study are available from the corresponding author upon reasonable request.
