Abstract
Cooperative path planning of multiple unmanned aerial vehicles is a complex task. The collision avoidance and coordination between multiple unmanned aerial vehicles is a global optimal issue. This research addresses the path planning of multi-colonies with multiple unmanned aerial vehicles in dynamic environment. To observe the model of whole scenario, we combine maximum–minimum ant colony optimization and differential evolution to make metaheuristic optimization algorithm. Our designed algorithm, controls the deficiencies of present classical ant colony optimization and maximum–minimum ant colony optimization, has the contradiction among the excessive information and global optimization. Moreover, in our proposed algorithm, maximum–minimum ant colony optimization is used to lemmatize the pheromone and only best ant of each colony is able to construct the path. However, the path escape by maximum–minimum ant colony optimization and it treated as the object for differential evolution constraints. Now, it is ensuring to find the best global colony, which provides optimal solution for the entire colony. Furthermore, the proposed approach has an ability to increase the robustness while preserving the global convergence speed. Finally, the simulation experiment results are performed under the rough dynamic environment containing some high peaks and mountains.
Keywords
Introduction
From the last few decades, the advancement in the field of aeronautics and astronautics has widely increased. The unmanned aerial vehicle (UAV) is the best invention of this domain.1–3 In this era, UAVs have been widely used for the military missions but it can widely use for the applications in rescue, surveillance and mapping scenarios. Moreover, we can also use UAVs for long missions, such as in remote sensing and dangerous areas, where accurate and flexible maneuvering is required. 4 In order to enhance the performance of combat scenarios and reduce the overall mission accomplishment time, a swarm of UAVs takes part. 5 The path-planning process is an important area of interest in the usage of multiple UAVs (M-UAVs). 6 It requires an optimal path or route from the initial position (base station) to the targeted position while avoiding obstacles and consuming minimum fuel. 7 Optimization issues are encountered to real-world problems in different areas such as mathematics, engineering, science and economics. Scientific algorithms are used to resolve these optimizations issues; it may require more extensive calculations as the problem size gets bigger. Thus, we need optimization techniques that require less memory and computational power but give better outcomes. From the last three decades, scientist made stochastic-based different bio-inspired optimization algorithms which are more accurate and efficient to compare with the analytical methods.8–12
The autonomous flight control is one of the most essential feature in modern control UAVs to adapt the dynamic environment and to find the most suitable and optimal route for the mission. 13 When using M-UAVs, we also have to deal with additional problems like simultaneous arrival and collision avoidance and so on. One way to solve these problems is to use classical algorithms because they are quite efficient, but they require a lot of time and calculations. 14 On the other hand, many scientists have made evolutionary bio-inspired algorithms such as ant colony optimization (ACO) by Colorni et al., 15 particle swarm optimization (PSO) by Kennedy and Eberhart, 16 differential evolution (DE) by Storn and Price 17 and genetic algorithm (GA) by John Holland. 18 These algorithms are quite efficient as well as reliable and quicker in finding the optimal solution to complete the mission requirement robustly. The combination of these methods has emerged to have quicker convergence speed and superior solutions. Examples of these hybrid algorithms include those between PSO and GA,19,20 PSO and DE21–23 and ACO and DE.24–26
However, in this research, our main concern is on ACO; it follows the hunting behavior of actual ant colonies to solve the optimization problems. It consists of synthetic ant colonies which work together to find the best possible solution. The ants convey the information to each other using virtual pheromones. While ACO is a very effective method, proven by its widespread usage, it has some disadvantages, too. Its convergence speed is slow, and it can fall into local optimum. Occasionally, it comes up with identical outcomes that lower the probability of getting the best result.
Cekmez et al. 27 present a multi-colony ACO to counter the previous aforementioned issues of slow convergence speed and local optimum. In contrast to normal ACO, the authors suggest to use multiple ant colonies. They utilize different pheromone tables for every colony to explore the area completely. However, in every iteration, each colony provides varied outcomes. After that, all colonies interchange their optimal solution with the surrounding colonies. Then, every colony modifies their pheromone tables with the acquired data. Shao et al. 28 also discuss a similar path-planning problem concerning M-UAVs. They explore a distributed cooperative particle swarm optimization (DCPSO) method intended for secure navigation of each UAV.
Dewangan et al. 29 used a grey wolf optimization (GWO) technique to resolve the three-dimensional (3D) path-planning problem of UAV. In addition, the major task is to search the most reasonable path and to avoid collision and other obstacles. In Gu et al., 30 M-UAVs trajectory planning is done using PSO with (receding horizon) framework. Moreover, obstacle avoidance–based approach is developed to eliminate collision between UAVs. Furthermore, a decentralized control hierarchy for an individual UAV is proposed to control individual UAV. An intelligent Bézier curve–based model for path planning is proposed in Tharwat et al. 31 The main aim of the designed algorithm is to search the shortest and smooth path between the start point and the targeted point. Beside this, chaotic particle swarm optimization (CPSO) strategy has also been applied to enhance the control points of the Bézier curve.
The inspiration of this research work has been taken from the work in the previous literature;26,27 the presented study states the issues of UAV path planning by taking the benefit of smart bio-inspired optimization algorithms, in the presence of dynamic environment. In this research, we design a novel maximum–minimum ant colony optimization (MMACO) with DE to make a metaheuristic hybrid algorithm. Although combining these two algorithms increases the complexity of the system, but it will surely provide a globally optimal solution for the path planning of multi-colonies with M-UAVs.
The primary achievements of this research are as follows:
A multi-colonies optimization having different sub–ant colonies exists in the entire ant colony, where each sub-colony performed independently to explore the overall area.
Three different colonies have been made, and each colony is independent to find the optimal or shortest distance to reach the targeted area.
To enhance the performance of the entire colony, each colony needs to share its knowledge with others.
To find the best and the shortest route from the initial point to the targeted area from a colony in which the upgradation of route and lemmatized the number of participants are done by MMACO.
Finally, DE will globally determine which is the most suitable sub-colony to reach the targeted area in optimal time.
The reminder of this manuscript is structured as follows. In section “Problem statement,” we define the problem statement and mission requirement. Cooperative path planning, collision avoidance and coordination between UAVs are discussed in section “Cooperative path planning.” It is followed by section “Hybrid MMACO-DE algorithm,” in which our designed hybrid MMACO-DE algorithm is discussed. Section “Simulation results and discussions” presents the simulation results and discussion. Finally, the overall manuscript is concluded in section “Conclusion.”
Problem statement
The M-UAVs path-planning problem is shown in Figure 1. To consider three UAVs take off from three different locations of an entire colony. The termed different locations are called as sub-colonies, that is, 1, 2 and 3, target the designated area. Moreover, there are some obstacles in the form of mountains and high rocks in the route, due to these aforementioned threats in the fly zone need to escape from them robustly. Furthermore, during the whole scenario, UAVs are required to maintain safe distance to avoid collision and arrive at their targeted area by time synchronization to adjust the length and the route of the complete tour.

Multiple UAVs path-planning scenario.
Mission requirement
Our main concern is to find which colony is most suitable to target the designated area optimally.
Cooperative path planning
To compare with the path planning of individual UAV, M-UAVs required perfect coordination between neighbor UAVs, to meet with the specific point or area without any collision. The requirements to achieve the mission and physical parameters are different from a single UAV. The cooperation and coordination between several UAVs bring different co-constraints, that is, timing coordination and avoidance to collide with each other. In this study, we mainly focus on collision avoidance and coordination between UAVs.
Collision avoidance
An additional requirement of path planning is to avoid collision and to minimize the threat of collision, which is also termed as air coordination or formation. In our scenario, the 3D map is used and we must ensure that all UAVs’ altitude must change according to the given condition of the unavoidable mountain peaks in the route. Due to the above route, we must maintain safe distance between the UAVs and apply some proper coordination strategy.
13
The path-planning constraints of M-UAVs also maintain the time and space correlation between all UAVs route. We consider short safe distance
where
where
Coordination between UAVs
To meet with the spatial and temporal requirements for the group of UAVs, we introduce two coordinate coefficients that must add in each UAV as a function of its path or route. It is expressed as
Therefore,
where

Path-planning coordination between UAVs.
For the path-planning scenario, each UAV communicates with other UAV and calculates its spatial and temporal coordination coefficient. Finally, the proposed algorithm will select the most suitable path, which will not only meet the fitness but also our mission requirements.
Hybrid MMACO-DE algorithm
In this section, we design an improved metaheuristic hybrid algorithm to resolve the path-planning issues in a 3D environment. The metaheuristic algorithm consists of MMACO and DE. The main aim is to design this algorithm and to apply it in the scenario of multi-colonies path planning of M-UAVs. In order to enhance the effectiveness of path planning, UAVs must arrive at the targeted area using the shortest path, complete the mission and improve the reliability of the whole scenario. First, we apply the classical model of ACO and after that speed up the whole strategy by applying our hybrid metaheuristic algorithm. Finally, only the most optimal colony will be selected to complete the mission requirement.
As shown in Figure 3, the entire ant colony divided into three sub-colonies, where each sub-colony is restricted only best ant to construct and update the route. Moreover, each colony executes as an independent agent, to fulfill path-planning task more robustly. The main objective is to find the most optimal distance from the entire colony, that is, which sub-colony is most suitable to reach the targeted area. Figure 3 represents the block of MMACO-DE strategy; it also shows the knowledge sharing between multi-colonies path planning of M-UAVs.

Block diagram of MMACO-DE algorithm.
Basic ACO algorithm
The natural representation of basic ACO algorithm depends on ant colonies. Real ants are proficient to find the shortest path from a food source to their colony. They hunt the food without any visual clues by manipulating and updating the pheromone data. Ants walk on the earth, deposit the pheromone and track in probability the pheromone earlier dropped by the former ants. 32 The advantage of the algorithm is that in the end all ants follow the shortest path. A procedure where ants utilize their pheromones to search for a shortest path between colony and food source is presented in Figure 4 below.

Real ant’s food hunting procedure.
The action of real ants is inspired by the ACO, where a set of virtual ants collaborate with each other. For the solution of problem by interchanging the information through pheromones placed on the edges of the grid boundaries.
Improved ACO algorithm
The parallel process of ACO in general comprises two essential practices: adaptability and collaboration. In adaptability, the contender solutions are carried out to readapt their organizations on the source of collecting the information. On the other hand, in collaboration stage, the contender offer interchanges the information to offer the best possible solution. The first ACO strategy applied to the traveling salesperson problem (TSP) is to discover the minimum closed-loop path between two cities or node. Even though the UAV path planning is to discover the best optimistic flight path, UAV is able to complete the desired task and avoid the unwanted threats, that is, mountain peaks and different threats. 33
ACO provides flexible route to solve M-UAVs path-planning issue under hazardous battlefield environment. The scenario is designed for the
where the term
The sum of pheromone by the
The sum of pheromone trail
where
where
Now,
where
maximum–minimum ACO
To increase the performance of classical ant system, improve the issues related to the initial stagnation. In order to fasten the convergence speed, the following enhancement in the algorithm is established in the ACO strategy.
32
Now, the
If and only if the route cost of the
DE
DE is the best and versatile optimization method; it formerly designed as an algorithm for the global continuous optimization. It consists of three basic constraints named as mutation, crossover and selection operator. Initially, it generates some random solution to search the target. After that, it enlarges the alteration vector concerning two population vectors. First, it generates the trail mutation and hereinafter it joins the trail mutation and the target ones to produce an updated individual. If an updated individual has good fitness results, it will accept and replace it with the previous individual. 33
In this research, we check the pheromone on the route that left by the ants in MMACO as the objective in DE. To solve the path-planning issues, the best ant of each colony represents the colony and only the best ant will be able to update the pheromone trails for the whole scenario. Some modifications are done in the model of MMACO strategy to divide the whole colony into three different colonies, and the colony number is termed as
where
where
For the 3D path planning of UAVs, all the ants in each colony develop their own route using the transition probability
where the original pheromone left by the

Flowchart of the designed hybrid algorithm.
Following are the steps of our proposed hybrid algorithm:
Step 1: Initiate all the constraints of ACO, that is, set the number of and maximum number of iterations along with the maximum number of ants utilize to explore the path-planning scenario.
Step 2: Divide the entire ant colony into three different located ant colonies or sub-colonies. However, the number of ants in each colony logged
Step 3: Initialize the edge pheromone for every sub-colony path where ants are allowed to construct the route and to move to the next node.
Step 4: Check whether the ants reached at the target node or not; if no, repeat step 3; if yes, proceed to the next step.
Step 5: Now calculate the route covered by the ants and making condition best ant update the route for every colony. Also calculate the cost from city
Step 6: Mutation and crossover process is applied on the original trail pheromone
Step 7: Set
Step 8: Now, separate each ant of the
Step 9: Compare
Step 10: Two conditions are there; if in the previous step, condition is yes, it will allow to explore new path and update the pheromone; otherwise, do not need it. Selection process of DE will select the route.
Step 11: Return to Step 6 till
Step 12: The designed proposed algorithm gives the best optimal route in 3D environment.
Simulation results and discussions
The simulations of path planning via multi-colonies with M-UAVs using MMACO-DE algorithm are divided into two different scenarios. Now, complete the overall scenario constraints of each UAV including the length or distance of flight, attitude angle, altitude and 3D land environment. When UAV fly from one point to the nearest way point, it requires some time to fine tune its attitude and altitude to reach at the nearest node. The length between two nodes or way points are calculated using the formula
In equation (17),
where
In order to find the possibility and efficiency of the designed hybrid algorithm for path planning of multi-colonies with M-UAVs, two different simulation scenarios were simulated. Both these simulations were performed in Simulink MATLAB 2016, along with the programming on an Intel Core i7 8th generation. The initial parameters for both scenarios are set to be
Constraints of dynamic environment.
Scenario 1
In order to verify the effectiveness of our prosed algorithm first, we compare our proposed hybrid algorithm with classical ACO. In this case, two different algorithms are initiated from the same base station and to reach at the designated area by applying cooperative path-planning methodology of UAV. The path-planning optimization results are shown in Figure 6(a) and (b), respectively, and it is clearly evident that our designed algorithm reaches at the destination using the optimal route. Figure 7 shows the estimation costs of both the algorithms, and Table 2 defines the overall comparison of these strategies.

(a and b) Different views of path-planning scenario by comparing ACO and MMACO-DE.

Estimation costs of ACO and MMACO-DE.
Comparison of ACO and our proposed algorithm.
ACO: ant colony optimization; MMACO-DE: maximum–minimum ant colony optimization and differential evolution.
Scenario 2
In this case, three best (ants) UAVs that belong to three different colonies for an aerial combat formation of path planning are assigned to reach at the targeted area simultaneously. All the initial parameters in this case are same except the number of colonies, that is,

(a and b) Different views of path-planning scenario using multi-colonies with multiple UAVs.

Estimation costs of different colonies.
Comparison of proposed algorithm using three different takeoff points.
UAV: unmanned aerial vehicle.

The best-optimized route determined by our designed algorithm.
Conclusion
This study uses hybrid metaheuristic algorithm obtained via MMACO in combination with DE strategy. The proposed algorithm provides optimal 3D route for the path planning of M-UAVs. As seen in the simulation, the ant colony is further divided into small colonies to explore optimal colony for mission completion. The above designed hybrid algorithm offers a platform for multi-colonies path-planning concept that is implemented for the real-world scenario.
Footnotes
Acknowledgements
The authors thank the reviewers and editors for giving valuable comments, which are very helpful for improving this manuscript.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship and/or publication of this article: This research is supported by China National Nature Science Foundation with grant number 61374165.
