Abstract
The problem of three-dimensional path planning in obstacle-crowded environments is a challenge (an NP-hard problem), which becomes even more complex when considering environmental uncertainty and system control. Int this paper, we mainly focused on more challenging problem, that is, path planning in obstacle-crowded environments, and we try to find the relation between contact information and obstacle modeling. We proposed a newactive exploring sampling-based algorithm based on rapidly exploring random tree (RRT), namely, guiding attraction–based random tree (GART). GART introduces bidirectional potential field to redistribute each newly sampled state, such that the in-collision samples can be redistributed for extension. Furthermore, dynamic constraints are deployed to establish forward extending region by GART. Thus, GART can ensure kinodynamic reachability as well as smoothness. Theoretical analysis demonstrate that GART is probabilistic complete, and it obtains faster convergence rate because of its redistribution ability. In addition to theoretical analysis, this article provides comparative simulations as well as experiments under typical situations. Results demonstrate that GART has a much better time-efficiency performance than RRT*, retraction-based RRT, and other referred algorithms when applying redistribution and dynamic constraints on random exploration.
Keywords
Introduction
Robots path planning in cluttered environment has been one of the most significant elements of mission definition, and it is widely discussed in all kinds of areas related to planning and navigation. Generally, coverage path planning aims to generate a real-time global path to target obstacle avoidance and considers kinodynamic (kinematic and dynamic) constraints. However, this kind of algorithms explores under a static situation and also they are not able to do online processing. We aim to solve the problem to ensure fast exploring and obstacle avoiding in obstacle-crowded environments under dynamics constraints.
Nowadays, sampling-based exploring algorithms such as probabilistic roadmap 1 and rapidly exploring random tree (RRT) 2 attract more and more interests, which are capable of solving the path planning (or motion planning) problem in high dimension. Compared to numerical methods (such as convex optimization 3 that relies on accurate modeling of both environments and system) and heuristic algorithms (such as A* 4 ), sampling-based algorithms (SBAs) are flexible in obtaining kinodynamic constrains, 5,6 uncertain factors, 7 and system dynamics. 6
RRT is currently the most successful single-query method proposed to generate probabilistic complete solution in various environments. RRT samples randomly from the entire configuration space to ensure generating possible solution. RRT* and rapidly exploring random graph were proposed in the study by Karaman and Frazzoli, 8 which are asymptotically optimal compared to RRT because they introduce “rewire” to update new connection with lower cost around the newly sampled state. Our work is based on RRT*, and we aim to solve three most heated discussion problems: (1) how to increase the extending rate for random sampling to extend without fail in obstacle-crowded environments, (2) how to obtain the cost minimal neighbor choosing metric without useless collision check, and (3) how to represent uncertainty to enable safe tracking.
Two strategies were proposed for solving the first problem; filtering techniques such as Gaussian sampling, which increase the sampling probability near obstacle regions, 9 and retraction-based techniques that use the contact information 10 or bridge test 11 to support new connections. There are heated discussions on how to solve the second problem, 12 the third problem, handling uncertain factors. 6,13,14 However, each problem is partly solved, and the relationship between system dynamics and environmental information is not well constructed.
Our approach is inspired by artificial potential filed, which introduces bidirectional potential field (BPF) for exploration. It is an extension and improved work of our previous work, 15 where we elaborately discussed the algorithm with improvement in dynamics consideration and BPF redistribution. We concentrated on the problem that robots (including unmanned aerial vehicle (UAVs) and this article focused on tackling UAV system kinodynamic constraints) cannot reach any state by random sampling of RRT*, and environmental information is abundant while exploring. Then we proposed guiding attraction–based random tree (GART). The method checks collision state of each newly sampled state and then redistributes it to enable re-extending toward the narrow region or wide free region based on BPF. Redistribution process can generate one or two states under two situations, and it enables every sampling works with the best location. We define this as active exploring as the redistribution process relies on obstacles’ information and can explore the gap between any obstacles. To decrease the waste of time in collision checking of general RRTs, 8 we proposed a forward reachable region metric based on system dynamics constraints, which defines only the dynamically feasible state, can be selected. Furthermore, we employ the eclipse description method 13 to model the sensing of uncertainty and control uncertainty.
Demonstration of outstanding asymptotic optimal property has been provided by the given theoretical analysis and conclusion demonstrates that the redistribution heuristic optimized the performance of Monte Carlo method. 16 Simulation experiments are carried out with the following cases: (1) 2-D environment with convex or nonconvex obstacles; (2) 3-D environment mountain-like terrene; (3) S-like tunnel environment to compare narrow corridor performance; and (4) Bug-trap-like box environment to compare narrow corridor performance.
Related works
Since first work proposed in last century, RRT,2 which presents a single-query way of exploring without post evaluation has been drawing attention among many researchers. Then the improved version called RRT* 8 is introduced to find the optimal solution, thus overcoming the weakness of RRT. It achieves the optimal solution can be smoothed to be post-smoothed by numerical algorithm, 17 RRT* only achieves optimal solution using geometric metric, which is not sufficient for stable following by robots, let alone the optimal ability.
Several attempts have been made to ensure kinodynamic adaptivity with asymptotically optimal. Lee et al. 18 introduce differential constraints to optimize the nearest neighbor and distance metric, thus enabling high-speed maneuvering by considering high-dimensional dynamical system. In the study by Jaillet et al., infinite-horizon linear-quadratic regulator (LQR) was introduced to calculate the local extension between two states, 19 where in each step a forward simulation to extent toward the newly sampled state is executed to test dynamics feasibility. Fixed final state and fixed final time optimal control problems are solved and embedded in RRT* by Webb and van den Berg. 5 The method is called kinodymamic RRT* that includes control effort in the cost principle and can output the global optimal solution.
Ability to handle extreme condition is the next problem after generating optimal solution that fulfills system constraints, especially tackling the narrow corridors between obstacles, with places having low extending efficiency. 9 A solution is provided by introducing the Gaussian sampling, which, unlike uniformly sampling based Monte Carlo method, applies random sampling principle. The method changes the sampling distribution by testing whether the robot is intersected with obstacles or not. Hsu et al. 11 proposed bridge test for boosting the sampling density in the narrow corridor by supporting new feasible connections. Lee et al. 20 improved the algorithm by combining noncolliding line test and contact information. 10 The method is proposed based on retraction-based RRT (RRRT), which tested to be efficient than dynamic-domain RRT (DDRRT). 21 The study by Cheng 22 solved narrow passage problem with uncertainty using environment-guided RRT. Environment-guided RRT integrates the merits of reachability-guided RRT, 23 which records the failures of exploration to increase the change in narrow passage, and resolution-complete RRT, 24 which can obtain system dynamics to avoid useless extension.
To solve uncertain control and sensing problem while exploring, various efforts are carried out. Particle RRT 14 was introduced to solve uncertain factors during navigation, where the method was inspired by particle filter to decrease the behavior uncertainty of the robot. Luders et al. proposed chance-constrained RRT 25 to overcome environmental uncertainty. The method represents the uncertainty as Gaussian noise, thus enables collision free by providing maximum dangerous region for each state. A probabilistic description was used to avoid spatially varying disturbance forces as shown by Desaraju and Michael, 26 which generates dynamically feasible paths. Rodriguez et al. 27 solve the uncertainty problem using forward prediction and then slightly adjusting the state according to the predicted results. LQG-MP 13 further introduces a state prediction method based on LQG controller, which solves the uncertainty involved as well as the dynamically feasible paths.
Obstacle information guiding or environmental information guiding is another perspective to achieve fast exploration, 28 employing triangles to describe the obstacles, and then considering each triangle to support the six obstacle vectors for extending. Qureshi et al. 29 proposed a direct shifting way to the newly sampled state by employing artificial potential field. Our work can be a combination of the above two works but only with near concepts. We apply BPF that obtains repulsion in the inner part and attraction in the outer region.
Article organization
The remainder of this article is organized as follows. The section on “Description of problem” outlines the basic concept of path planning and presents the problem of planning under uncertainty. The concept of dynamic concerned optimization is proposed to ensure the applicability of the path and accelerates the process of nearest-neighbor choosing. The proposed algorithm, that is, improved GART is analyzed in detail in section “Algorithm,” where the pseudocode is also provided. In section “Algorithm comparison and analysis,” we theoretically demonstrate that GART is more efficient than general RRT* in finding a feasible path and finally converging to the optimal one. Section “Experiment results” provides the comparative simulations as well as experiment with AscTec Pelican (Krailling, Germany) that demonstrates the efficiency of our algorithm. Finally, section “Conclusion” provides conclusions and perspectives for future work.
Description of problem
Problem with uncertainty
For path planning problems, imagine there exists a workspace set W ∈ Rn, where
L denotes the generated path. Sensors such as laser range finders and cameras vary with respect to their accuracy under changing states. Thus, the configuration of the robot will often be imperfect (see Figure 1(a)), which means that the accurate boundaries of Robs and Rfree are unknown. In addition, due to the disturbance and model errors, the generated path cannot be accurately followed, which is called control uncertainty (see Figure 1(b)). As the general path planning method leaves out these uncertainties, it cannot generate a safe path.

Two main uncertain representation of UAV. (b) Dotted line is the planned path and solid line is UAV real path.
Let S be the state of the UAV and Φ be the unstable set corresponding to each state. The policy π : Φ → S is a map that prescribes an uncertain sensing set φi for the current state si. The control uncertainty corresponding to the current state si is Δ ⋅ yi. Here, we address path planning under uncertainty and regard the sensing and control uncertainty combination as the obstacle region. The obstacle region can be described as Robs + (φi + Δ ⋅ yi), and the free region is Rfree − (φi + Δ ⋅ yi). For path planning with uncertainty, the first goal of planning under uncertainty is to find an obstacle-free path Puncertain from Xstart to Xgoal, where Puncertain ∈ (Rfree − (φi + Δ ⋅ yi)) and Puncertain ∩ (Robs + (φi + Δ ⋅ yi)) = ⊘
Dynamic constrained optimization
The cost principle to extend the exploring tree directly influences the efficiency of the whole exploring process. SBAs extend the exploring tree by solving the nearest-neighbor problem, where the nearest neighbors are nodes that connect the newly sampled nodes. Classical metrics of selecting nearest nodes, particularly the parent node, adopt the integral of the cost along the path (i.e. length), time, energy, and so on.
5,12
For RRT*, in order to guarantee asymptotic optimal, for each iteration the method selects the neighbor by applying
This article tries to increase the convergence speed from the dynamic reachability point of view, that is, the UAV reachability region is constrained by the steering constraints (i.e. dynamic constraints) of the vehicle. It is illustrated in Figure 2(a) that the UAV can only reach the nodes within steering reachability. Thus, for red newly sampled node in Figure 2(b), only X2 and X3 are possible nodes to connect, and X1 is excluded because of its inability to reach Xnew. This is caused by the fact that the actuators have limited ability, that is, Steering constraints of UAV, which decide the extending region of each state.
Algorithms
The metric of choosing the appropriate neighbors to connect from is the bottleneck of RRTs, 12 which must able to handle huge amount of data within limited time. The random sampling attempts are especially difficult when the environment is crowded with obstacles. 11,20
To overcome the low exploring efficiency, we propose the dynamic concerned exploration, which is inspired by kinodynamic planning 5 and control effort evaluation. 19 The dynamic concerned exploration facilitates all choosing neighbors to be connectable without complex cost comparison. The redistribution method BPF is also proposed to extend with maximum amount of samplings.
Illustration of proposed GART
Pseudocode of GART is presented in algorithm 1, given the preliminary conditions: Control bounds U, initial and goal position, and initial heading at vroot, GART first samples a random state vs (line 3), selects the possible k-nearest parents VNeg to connect from under dynamic constraints R+ (line 4), vs is redistributed under R+ of vnear (lines5 and 6), this article focuses using BPF to solve three events (discussed in section “Guiding steer algorithm”) and redistributes vs or v′s (lines 8–14) with total re-distribution vector Fv. Finally, states after re-distributed Vnew will be added to the tree (line 14) with rewire (lines15 and 16). The whole process stops if the goal is reached (line 2).
GART improves general RRT* 8 in several ways, including the metric based on dynamics cost (lines 4, 6, and 15); the obstacles around each candidate are selected to support the BPF to redistribute the newly sampled states to reasonable position (lines 8 and 11); the final node is added to the tree, which is redistributed for safe and fast convergence (line 14). These main topics will be discussed in the following sections, and we also provide an LQG-MP inspired method to handle uncertain factors.
Obstacle modeling with uncertainty
For path planning, the first problem is how to reconstruct the environment online with the appropriate model. Two types of methods are commonly used to model the obstacles. One is based on detailed polynomial mathematical equation to rebuild the obstacle region. 30 This method requires sufficient and complex resources to describe the obstacles with much computational resources. The other methods model the obstacle region with occupancy grid, 31 which represents the environment in detail using large amount of cells. Ning 32 proposed a promising method called fuzzy boundaries to handle nonlinear boundaries, which is quite promising. Both are widely implemented for UAV and many other robots; however, they need more storage and time to be explored, particularly in cluttered environment with uncertainty.
This article introduces ellipse (and ellipsoid in 3-D) to model the obstacles with some promotion, which is quite commonly used and tested in real environments. 13,33,34 We are concerned about the uncertainty of the sensors, which also affected the uncertainty of the control phase. 13 This article regards the sensor detection results obey Gaussian distribution (Figure 3(a)), that is

On board sensing noise is described as Gaussian noise in (a). Then inspired by LQG-MP 13 to calculate the margin factor of each sensed node to represent the obstacle, which are the red ellipses ELi in (b).
μg denotes the center position of each sensed node or along a certain axis, δg is the deviate error, and px.y.z is the position of the node. We are inspired by LQG-MP 13 to calculate the distribution bound of obstacle region under uncertainty using probability. Onboard sensors are supposed to obtain Gaussian noise (equation (3)), and the probability of any node on the obstacle within Os is
Here, d is the dimension of Gaussian distribution. In order to ensure safe modeling of the obstacle, we define that for any sensed node vKF by Kalman-filter
13
probability bound
Dynamic concerned exploration
To enable the post processing of the planned path, the path planners are required: The connection between any two states must meet the dynamic constraints. Any local connections between joints should be kept as smooth as possible.
We are inspired by numerous research works, which aim to find the most platform-adaptive and implementable extending metrics for RRTs. The LQR cost evaluated the neighbor-chosen metric, 19 expended control, and time combined cost-bounded extending, 5 they share the same merits of obtaining dynamic reachability. Our method also obtains a reachable region, that is, R+ denotes the feasible forward extending region (FER) (the illustrated region in Figure 3(a))
θm denotes the maximum reaching angle range at each v ∈ V in 2-D (which can be easily extended to 3-D) in δt time, J(α) is farthest distance can be reached, where
ui(t + Δt) is described in section “Dynamic constrained optimization,” which obtains a bound in any time interval. R+ is a varying bound that is related to (state, control) := (X, U). For GART, unlike r = γ(logn/n)(1/d) radius-bounded RRTs 5,8 or k-nearest RRT, 8 which are analyzed in geometric aspect to obtain asymptotic optimality, it first selects k dynamic feasible parents nearest(k) based on equation (6) for newly sampled vs. Then, we introduce the cost metric
φ(v, vs − v) returns the angle between v and vs − v. The metric is associated with control effort, where v ∈ Nearest(k). GART chose only the minimal cost vertex to connect from. From Figure 3(b), for example, vroot, v0, v2, and v5 can be the parent of vs if set k ≥ 4. Then, v2 is chosen as the parent vertex to connect from by applying equation (5).
Guiding steer algorithm
Guiding steer is used to calculate the redistribution vector to redistribute the newly sampled vertex and thus enables the efficient extending, which is used in algorithm 1 at line 5. To discuss the algorithm, we explain this with two parts, which are BPF and redistribution vector.
BPF
BPF improved a lot of random SBAs by reusing the obstacle-collided samplings and also enables the best redistribution for nodes, which will lead to stop in front of the edge (nodes that are ended without extending region in front of obstacle region edges). In order to make BPF works, a preliminary work, which is obstacle detection and counting should be explained first.
Obstacle detection and counting
Before applying the BPF, let us first discuss the obstacle detection (i.e. environment modeling). Onboard sensors have limited sensing area, thus only obstacles within the detection range can be taken into consideration. In section Obstacle modeling with uncertainty,” we talked about using ellipse to describe the uncertainty of sensing, and these ellipse cannot cover if there exist a gap. This article defines that nodes detected are represented with (angle and position), that is, (θ(v), v). Thus, for a set of continuous nodes

The obstacle detection algorithm regards there exists an obstacle if a continuous edge is detected. It cannot distinguish the gap within a obstacle and regards them as two obstacles.
BPF does not simply introduce the PF to slightly adjust the position of the sampling as employed in the study by Qureshi et al. 29 and it tries to obtain active exploring ability for algorithms. Based on the work of Qureshi et al, 29 it just simply introduces the obstacle support repulsion and the goal support attraction, that is
As J(vs, vOi) < Jdm, where Jdm denotes the maximum detection range, thus repulsion OUrep existed for detected obstacles confronted. Fusing this with RRT seems promising, but there are two problems: (1) The newly sampled vs always get repulsion from the nearby obstacles, thus if there is a narrow corridor, the chance of going through decreases. From Figure 4(b), for example, if there exist a corridor between cl1 and cl2, method adpoted by Qureshi et al. 29 fails. (2) The method still neglects the nodes in collision, which thus waste more time in collision detection and sampling.
As illustrated in Figure 5(a), the PF is easy to achieve. But the repulsion of obstacles provides the force to move the sampling away. BPF introduces bidirectional potential for obstacles, that is, the inner involves repulsion to make the newly sampling out, and the outer involves attraction to support active exploring in crowded. It can be represented as

BPF is introduced unlike general APF (a), which makes the inner with repulsion and outer with attraction (b). BPF: bidirectional potential field; APF: artificial potential field.
Redistribution vector
BPF introduces When newly sampled state vs (the red node) locates in dangerous region, this attempt to find the state in same direction is inappropriate even if Every time a new state vs is sampled, which is located in the collision-free region and can be connected, the neighbor obstacles are detected and counted by applying BPF (described in section “BPF.”). If the number of the obstacles,
O
N, exceeds one
O
N ≥ 1 (see Figure 6(b)). vs tries to find the nearest states (
O
va and
O
vb in Figure 6(b)) on the nearby obstacles, respectively. For this situation, it is called attraction. A special case of the second event is that
O
N = 1, the method in event 1 is still applicable. Attraction is reversed to become repulsion and thus enables the sampled or redistributed states generated by event 1 to stay in collision-free and re-extendable state.

When any new sampled state located in dangerous region, two reachable states, which are collision free, are chosen as candidates (a). Then for all states in safe region with multi-obstacles crowded are redistributed by using attracting force (b).
We hereafter discuss the process of generating redistribution vector in detail. Consider vs is sampled randomly, the previous whole state tree T := {vi, i = 1, 2, ... }, where v1 = vroot. vs tries to find the nearest state vnear among all the reachable neighbors
Then let us consider event 1, if vs is in collision as illustrated in Figure 6(a), the extending along the same direction is presented as the blue state ∈ R+(vnear), which is generally adopted or just resampled. Discussion and comparison will be given later. We note that every sampling is useful and extendable, two new states (av1 and av2) that are out of collision or with maximum turning angle are selected
Here, the second constraint is to define that the state should be collision free, and the second constraint defines that the turning angle is the maximum, maxφ = θ, if none or only one collision free can be found. J(avi) = maxJ(θ) denotes the maximum distance can be reached. Thus, the redistribution with repulsion is
Still for event 1, if vs can connect to vnear successfully without collision, we only execute reachable extension in the same direction vnear → vs within R+(vnear). The newly generated candidate is
However, vf3 is not repulsion as vf1 or vf2, which is the accessible state with fixed direction within R+. Now, the possible candidate states are selected that subject to dynamical constraints. Guiding attraction, which is exactly the main method proposed by this article to enable active exploration through obstacle-crowded environment. Two situations are discussed separately in events 2 and 3, let us first discuss event 2 where multi-obstacles exist (a simple illustration is given in Figure 6(b)). We define that the obstacles support attraction to the state, for each obstacle, the nearest node Ovi margin node is selected among all the margin nodes (the red nodes) within detection range. Again, we define the attraction is proportional to the L 2 norm of Ovi−avj, which is
Thus, the total attraction is
Here, k1 is the same as the scale of the magnitude of the attraction for all candidate states. af denotes the total forces to redistribute the candidate avj. But if only one obstacle is nearby, the attraction is reversed to become repulsion avj, we define the magnitude is reverse to the distance to the nearest margin node Ovi. Thus, the repulsion vector is
Here
Finally, we still apply the goal attraction
Here, Fv denotes the final redistributed nodes set, i ∈ {1, 2, 3}, which is discussed in equations (12) and (13), j ∈ {a, r} that is discussed in equations (15) and (16).
Algorithm comparison and analysis
In this section, we demonstrate that GART has better performance than RRT and RRT* in completeness and convergence ability. The two most important properties of GART are completeness and convergence ability. We analyze the probability completeness of GART, which guarantees whether the final feasible path can be achieved. Asymptotic convergence ability is also analyzed and compared with RRT*.
For further discussion, the following notations are presented based on the work of Karaman: (1) The RRT algorithm is not asymptotically optimal and (2) RRT* and RRT represent probabilistic completeness. In this article, in order to keep the flow of the presentation, the proofs of important thorems are presented in the Appendix .
Convergence analysis
GART is based on Monte Carlo sampling method and has its own special properties. Thus, its convergence ability must be analyzed to prove that GART can guarantee global convergence.
Probabilistic completeness
LaValle et al. proved that the probability that RRT can find a feasible path from start to goal approaches 1 when the number of iterations approaches infinity. GART is based on RRT*, and outperformed RRT* using samples in collision to generate extra extendable samples, which ensures higher probability to expand the tree.
Theorem 4.1 (completeness of GART)
Given initial condition (xstart, xgoal, and W), GART is able to solve the path planning problem. When there exists a b > 0 and n0inN, the probability is such that
Here b and n0 are constant that dependent only on Rfree and Rgoal (the goal region). For proof of this theorem, it can be found in Appendix 1.
Asymptotic optimal
For RRT*, it has to choose the best parent within the appropriate ball range γRRT*, that is, RRT* must have a new node that is no longer than γRRT* away for extending. RRT* guarantees convergence to the asymptotic optimal with
GART obtains change in the sampling distribution, while in order to analyze asymptotically optimal, let us first discuss the following notations: (1) notice the reality for the optimal path that is based on the minimal cost exist, and let Sδ denotes the length of the optimal path δ, γGART denotes the neighbour chosen radius; (2) to guarantee optimal, the optimal path δ must cover a number of d-dimension balls B (with γδ radius), and each neighbor balls are assumed to have a overlapping range αγδ; and (3) the path obtains γ clearance for all balls, and αγδ := γ/1 + ′α. Thus, the number of balls B can be computed as follows
The optimal path convergence problem then can be regarded as that there exists m ∈ {1, 2, ... M} d-dimensional balls. If there exists at least a pair of nodes
Theorem 4.2 (Optimality of GART)
GART is able to redistribute the samples within the consideration of increasing the convergence speed, it is asymptotically optimal for any path planning problem (xstart, xgoal, and W). It means that the cost Cn of the path of GART will finally approach an optimal solution cfinite with probability 1
The asymptotic condition is as follows. Detailed proof of Theorem 4.2 can be found in Appendix 2.
When compared with RRT*, the result shows
As this article models the obstacle region as ellipse in 2-D (cylindroid or ellipsoid in 3-D), that is, μ(Robstacle) ≤ μ(ellipse), thus
Substituting equation (22) into equation (23), the result shows
The result indicates that GART minimizes the neighbor search distance of finding best parent node (see line 11 of algorithm 1) and thus decreases much resources to execute cost comparison and collision detection of those parent nodes. This is why our GART has better performance than RRT and its variants in speeding up the convergence rate.
Computational complexity
GART holds true to the general process of RRT*, meanwhile, it introduces BPF and dynamic concerned exploration for fast convergence. According to algorithm 1, GART applies neighbor search, which obtains dynamic concerned exploration, “guiding steer,” and “nearest” search to generate the path. Based on Karaman’s work, 8 we may find that RRT and RRT* share the same processing time complexity O(nlogn) and the same query time complexity O(n).
Nearest searches through all n states to find k-nearest states. The dynamic constrained cost function only with the time complexity of O(1). Thus, the total time complexity is still O(logn). Guiding steer procedure is concerned with the obstacles, where the obstacle number m results the function time complexity of O(2 log d m). Neighbor searches all n nodes to pick our the nodes within certain distance, then its time complexity is O(logn). Thus, the total processing time is O(n(2 logn + 2 log d m)), which does not waste any more time than RRT*.
Experiment results
Requirements of the planner are 2-D and 3-D space with detectable obstacles. First, we illustrated the time and obstacle tackling efficiency of our active pursuing planner by providing comparisons. This can be shown by comparing the first-directive-connect (FDC) time and first-glance-connect (FGC) time. Also, we analyze the relationship between the environment scale and the exploring ability for the planner. For all simulations, they were carried out in Matlab 2013B with a Core I5 processor and 2G RAM, and the real experiment was carried out using robot operating system (ROS).
Exploring performance in cluttered environments
We design a representative map with obstacles (see Figure 7), the comparisons are carried out between RRT* and our GART. The setup is the following: we want the planner to explore in 110 × 110 space, from starting location to goal location, and we employ the unmanned ground vehicle model to represent the 2-D system model of UAV, that is

Evaluation of FDC of comparative algorithms. FDC: first-directive-connect.
We define v = 5 m/s, ϖ = 0.2 ⋅ π, Δt = 1, Δt defines the forward extending duration for taking into consideration. Therefore,
Let us first evaluate the performance under two requests: FDC (see Figure 7(a)) and FGC (see Figure 8(a)). These two kinds of situations indicate the obstacle-crowded environment adaptive ability and exploring efficiency. Following indicators such as iterations (I), number of vertex in the tree (n), cost (C), time (t), and percentage of useful samples (POUS) are discussed in this article. FDC results of GART (Figure 7(a)), APGD-RRT* (Figure 7(b)), and RRT* (Figure 7(c)) are illustrated in Figure 7. GART introduces the BPF to redistribute sampling in FER, it enables generating of more re-extendable samples to achieve dynamic feasibility. In Figure 7(a), the blue tree is generated without using rewire as general RRT*, which is almost smooth as RRT*. The most exciting result is that GART can generate 452 states within 360 iterations, a collision state can be redistributed as one or two collision-free states toward the best extending direction and thus enables more efficient exploring. FGC results are illustrated in Figure 8(a) to 8(c). Although GART outputs with large cost path than APGD-RRT* and RRT*, it can generate a much more diverse tree with almost the same iteration, that is, the algorithm enables a higher probabilistic of completeness as well as convergence. We note that Figure 8(a) is the worst case of GART in finding FGC, and the average performance is better than other algorithms.

Evaluation of FGC of comparative algorithms. FGC: first-glance-connect.
For FDC and FGC, Table 1 summarizes the main indications of each algorithm both in 2-D and in 3-D. For cluttered environments, both RRT* and APGD-RRT* are unable to fully and reasonably sample within the configuration space, this is caused by collision connect within the states. Thus, RRT* and APGD-RRT* only obtains 71% and 73.4% of useful states (POUS), while our GART can generate 127% POUS with iterations. According to this, it is also rational that GART can find the goal with average 473 iterations for FDC and 195 iterations for FGC. Time complexity are tested, we record the time for each iteration of each method, then we calculate the multiple by executing tGART/tRRT* and tAPGD-RRT*/tRRT*. Result is illustrated in Figure 12, where we can deduce GART to obtain a higher time complexity than APGD-RRT* and RRT*. The medium value of time complexity of GART compared to RRT* is 1.23, while the exploration time will not increase much as the iteration efficiency is much higher than other algorithms.
FDC and FGC comparison between RRT*, APGD-RRT*, and GART.
GART: guiding attraction-based random tree; RRT: rapidly exploring random tree; FDC: first-directive-connect; FGC: first-glance-connect.
Comparisons are also implemented in 3-D environment (see Figures 9 and 10). We test FDC for GART, APGD-RRT*, and RRT* and results are illustrated in Table 1 and Figure 9(a) to 9(c). The terrain-like environment contains four mountains, GART can find a way out easily by redistributing the states in danger, and thus it can reache the goal within 580 iterations for average. However, both APGD-RRT* and RRT* obtain no collision process method, they need almost 2 and 4 times time to find the path. Another advantage of the environmental information-based redistribution is that the cost can be much more smaller even at the the first time of reaching the goal, that is, c = 93.77 of GART compared with c = 97.99 of APGD-RRT*, and c = 104.22 of RRT*. We apparently see that the red path is smoother and without much turning in Figure 9(a) than Figure 9(b) and Figure 9(c).

Illustration of the first arriving to the goal of GART, APGD-RRT*, and RRT*. GART: guiding attraction based random tree; RRT: rapidly exploring random tree.

3-D results after 10,000 iterations, black lines indicate the tree and red line indicates the path.
As discussed earlier, the POUS is very important to indicate the efficiency of obstacle handling, here we also discuss another indicator, that is, percentage of rewire (POR). POR reveals the dispersion (or reasonable sampling rate) of the algorithm. For the 3-D environment (Figure 9), we make the methods to explore for 10,000 iterations and record the POUS and POR for each 1000 iterations. Results are illustrated in Figure 11, GART obtains an average 100% POUS (Figure 11(a)), while RRT* and APGD-RRT* obtain low efficiency for exploration. For POR, GART obtains a high percentage at the beginning (Figure 11(b)) and can also obtain a high POR at the end. Higher POR means smoother, as GART introduces the dynamic constraints and thus keeps a rational distribution of states as well as shorter path can be found for FDC. The 3-D step time is also recorded and illustrated in Figure 12, the variation between 2D and 3D is caused by the distribution of the obstacles. The time average time of GART compared to RRT* is no more than 1.5 times, while iteration efficiency is four times more. The final results after 10,000 iterations of each algorithm is illustrated in Figure 10.

3-D comparison of FDC between GART, APGD-RRT*, and RRT*. FDC: first-directive-connect; GART: guiding attraction-based random tree; RRT: rapidly exploring random tree.

Time multiples between GART, APGD-RRT*, and RRT*. GART: guiding attraction-based random tree; RRT: rapidly exploring random tree.
Efficiency of handling narrow corridors
Narrow corridor tests are implemented in two kinds of representative benchmarks, which are S-like tunnel (Figure 13(a)) and Bug-trap-like box (Figure 13(b)). The S-like tunnel is consisted of rectangles, which constrains the robot to pass within the corridor, and this article tries to test the efficiency of the proposed algorithm by scaling the width of this S-like tunnel. S-tunnel 1.0 means that the corridor is wide enough for robot to pass through, S-tunnel 0.5 is the moderate level of width for robot, and S-tunnel 0.3 with tight corridor is hard for robot to pass. Bug-trap-like box is also designed for robot to find a way out of the box that has a narrow corridor directly to the inner region, the width of the corridor is also scaled with the above definition with Bug-trap 1.0, 0.6, and 0.4.

Two representative benchmarks that have narrow corridors, which are challenges for algorithms to find a path quickly.
Comparisons with RRT, DDRRT and RRRT are implemented within the above designed narrow corridor environments to demonstrate the ability of our algorithm. RRRT and DDRRT are all proved to be more efficient than RRT in most situations. 20 For all the benchmarks, our GART spends less than the average time to find a collision-free path which at most needs 5 s to find a feasible path.
For the easy S-like tunnel, GART shows relative better performance, which are 13.1% and 6% times faster than basic RRT and RRRT. But with over 10 times faster than DDRRT (Table 2). The performance increases when the corridor is with moderate difficulty for GART, where the time decreases to find a path. In the moderate situation, the multiple of RRRT increases with 42% and more than 100% for DDRRT. For the hard-type S-tunnel, our method shows higher performance than RRT, that is, 53.8% faster than RRT. GART is also 5.8% faster to explore in the narrow situation when compared with RRRT. For all the benchmarks, DDRRT does not show good performance, which is due to the fact that DDRRT is sensitive with the radius of the obstacle-contact circles.
Performance comparisons with RRT, (DDRRT, and RRRT under representative environments with different difficulty level.
RRRT: retraction-based RRT; DDRRT: dynamic-domain RRT; RRT: rapidly exploring random tree; GART: guiding attraction-based random tree.
The performance in Bug-trap boxes is illustrated in Table 2. For RRT, DDRRT, and RRRT, their time increases almost linear with difficulty, that is, the harder the narrow corridor is, the more time consumes. Time needed for GART with moderate environment increases 20.3% when compared with easy one, but only with 65% more when compared to hard environment. However, RRT and RRRT increase with 100% more time, which is even more for DDRRT. Both results illustrate that our method is not sensitive with the difficulty of the environments, particularly even faster when the environments become tighter to support more guiding. The results prove that GART can successfully tackle narrow corridor with relative good performance.
Experimental demonstration with unmanned aerial vehicle
Previous works of The City College of New York (CCNY) robotics lab have demonstrated that the control system 35 is stable in indoor environment, and visual localization method 36 is able to do real-time control. Here, we set three obstacles (illustrated in Figure 14(a)) in a narrow experiment space (around 2000 × 2500mm2), where the distance between each obstacle is able to let the drone to fly through. For navigation, we do not directly use the discrete nodes output by GART, that is, we use a path smoother 37 to generate a smooth path with yaw continuity. Beside the obstacles, we set the quadrotor to fly around a constant speed of 0.5 m/s and the height is maintained at a constant of 330 mm, where final pose array can be found in Figure 14(a).

Experiments with AscTec Pelican using visual simultaneous localization and mapping (SLAM) method to do localization.
For path planner, one of the most important issues is how to represent the environment, which directly results whether a method can be used to solve the problem. Since we use visual SLAM to do localization, we use the point clouds to act as perception. GART regards that if a cluster of clouds is confronted within safe distance, then we try to calculate the area of the obstacle to model with the method proposed in section “Obstacle modeling with uncertainty.” Thus, according to the point clouds, it is easy to get the environment information (where both orientation and position are supported). The result of our path planner is illustrated in Figure 15(a), and the tracking error is also provided in Figure 15(b) (the unit of the error is meter) with a mean error 0.06 m.

Illustration of planned path using GART and its tracking error. GART: guiding attraction based random tree.
Conclusion
We propose an algorithm called GART for UAV on-line path planning under obstacle-crowded environments. GART introduces environmental information to guide the general random process of RRT*. Environmental information is in the form called “attraction,” which is inspired by the potential of artificial potential field. The algorithm based on the consumption that the obstacles can support the direction toward safety when confronted. Our algorithm focuses on UAV path planning, taking the control and sensing uncertainty into consideration. Despite the safety, GART also includes the dynamic constraints of heading punishment to guarantee smoothness. This article also provides theoretical analysis to prove the faster convergence ability of GART over RRT* and the ability of dealing with narrow corridor.
Comparisons were performed to prove searching, convergence, and narrow corridor passing ability in 2-D and 3-D environments. Results shows that GART outperforms RRT* with respect to searching and smoothing, and the 3-D comparisons illustrate that GART performs well in obstacle-filled situations. Narrow corridor passing efficiency is also tested with two kinds of benchmarks, and the results prove that GART has good performance in managing narrow corridors as well as RRRT. Finally, we did an experiment on UAV, which is much faster in flying through narrow corridor. However, as GART is based on local information, it does not offer any advantages if there are no obstacles.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported in part by National Natural Science Foundation of China under grant #$61503369A and #$61528303, Priority Academic Program Development of Jiangsu Higer Education Institutions(PAPD), Jiangsu Collaborative Innovation Center on Atmospheric Environment and Equipment Technology(CICAEET).
