
Abstract

This article proposes and fully describes a modification of the Hybrid A* method used for navigation of a non-holonomic wheeled mobile robot. Our modification allows straightforward multi-criteria adjustment of the algorithm according to the desired behavior, considering not only the traveled distance but also time, changes of direction, stopping, and going backwards while avoiding obstacles. The obstacle avoidance algorithm evaluates the danger of collision smoothly (not as a binary value) using danger fields. Such behavior reflects human-like sensing of danger: the closer the robot is to an obstacle, the higher the danger of collision. A modified uniform state expansion method is used to cover the state space of the robot more uniformly, which enables precise near-target navigation. A greed factor is introduced to decrease the computational time and improve the real-time performance of the algorithm.

Keywords

Non-holonomic robot, heuristics, navigation, path planning, obstacle avoidance

Introduction

The navigation of a mobile robot is a dynamic control process during which the control algorithm computes actions that lead the robot to the target. The positions of the robot and of the target can each be defined by three degrees of freedom (3 DoF): Cartesian coordinates x, y, and direction ψ. A non-holonomic robot has a non-zero minimal turning radius $R_{min}$ , so its 3 DoF are not fully controllable. Because of this physical constraint, the robot generally cannot travel straight to the target; the current position and the target must be connected by a physically achievable trajectory. More constraints are introduced by obstacles. To take all these factors into account, it is possible to use a form of a Model Predictive Controller (MPC).¹ This type of controller can also be used in many non-robotic applications (see e.g. Hoy et al.²). To intercept the paths of moving obstacles (e.g. humans in the working area), a Bayesian representation of the obstacles has been used.³

One of the well-established navigation methods for wheeled mobile robots is the Vector Field Histogram (VFH) combined with the A* search algorithm (abbreviated as VFH*). The method was originally proposed by Ulrich and Borenstein.⁴ The concept of the method is described by the following steps:

The current position of the robot is expanded by admissible actions into new projected positions.

Each new position is evaluated with respect to (w.r.t.) the target and the previous position. A position that is closer to the target and requires less maneuvering has a lower cost. If the new position collides with an obstacle, the position is rejected. The total cost of the position is the sum of the previously spent cost and the estimated further cost.

Positions are recursively explored up to a given depth or until the target is reached. The A* search algorithm is used to achieve optimal behavior. The search is optimal and complete if the estimated cost never exceeds the true cost.

The first action along the found trajectory is physically applied to the robot and the whole process repeats. The navigation is local (does not consider the whole world but only limited surroundings) and real-time.

The original paper⁴ proposed the following cost function for the first expansion

g (r_{1}) = μ_{1} \cdot Δ (ψ_{1}, ψ_{t}) + μ_{2} \cdot Δ (ψ_{1}, ψ_{0}) + μ_{3} \cdot Δ (ψ_{1}, ψ_{d, n - 1})

and for deeper expansions

g (r_{i}) = λ^{i - 1} [{μ'}_{1} \cdot max (Δ (ψ_{i}, ψ_{t}), Δ (ψ_{e}, ψ_{t})) + {μ'}_{2} \cdot Δ (ψ_{i}, ψ_{0}) + {μ'}_{3} \cdot Δ (ψ_{i}, ψ_{i - 1})]

where i is the depth of the expansion, $ψ_{i}$ is the direction at position $r_{i}$ , $ψ_{t}$ is the direction to the target, $ψ_{0}$ is the current direction, $ψ_{d, n - 1}$ is the previously decided direction, $ψ_{e}$ is the effective direction

ψ_{e} = atan2 (y_{i} - y_{i - 1}, x_{i} - x_{i - 1})

The delta operator over directions is defined by formula

Δ (ψ_{a}, ψ_{b}) = min (| ψ_{a} - ψ_{b} |, | ψ_{a} - ψ_{b} - 2 π |, | ψ_{a} - ψ_{b} + 2 π |)
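The delta operator can be sketched in a few lines of Python. This is only an illustration; the function name `angle_delta` is hypothetical, and it assumes both directions are given in radians within one period (so that their raw difference lies in (−2π, 2π)).

```python
import math

def angle_delta(psi_a: float, psi_b: float) -> float:
    """Smallest absolute difference between two directions in radians.

    Assumes both inputs lie within one period, e.g. [0, 2*pi).
    """
    diff = psi_a - psi_b
    # Compare the raw difference with the differences shifted by one period.
    return min(abs(diff), abs(diff - 2.0 * math.pi), abs(diff + 2.0 * math.pi))
```

For example, directions of 350° and 10° differ by 20° under this operator, not by 340°.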

A heuristic estimation of the future cost is

h (r_{i}) = λ^{i} [{μ'}_{1} \cdot Δ (ψ_{e}, ψ_{t}) + {μ'}_{2} \cdot Δ (ψ_{t}, ψ_{A}) + {μ'}_{3} \cdot Δ (ψ_{t}, ψ_{i - 1})]

Coefficient $μ_{1}$ increases orientation of the robot to the target, coefficients $μ_{2}$ and $μ_{3}$ increase the smoothness of the trajectory. Coefficient λ is the discount factor and improves the overall behavior of the searching algorithm.⁴

The main drawbacks of the original VFH* algorithm are:

Within the method, the target is defined only by its direction (as if the target were at infinity).

The original cost function does not consider the distance travelled by the robot.

Each state used in prediction is expanded only to different directions, but not to different distances (see Figure 1). Therefore, the state space is not covered uniformly.

Figure 1.

Exploration of the nearby robot’s positions.

Other navigation methods are mainly based on grids. Chung and Huang described a navigation algorithm which represents the world as a 2-D grid of occupied or non-occupied cells. The occupancy grid can be built continuously from the readings of the robot’s sensors.⁵ Many different sensors can be used to detect the surrounding obstacles and the robot’s own position (e.g. encoders, ultrasonic sensors, electronic compass, etc.).⁶ Szabo⁷ described a method for integration of the sensors’ readings by an event representation. The final path obtained by A* searching contains only eight possible directions, along the grid axes or diagonally.⁸ Daniel et al. introduced the Theta* algorithm, which also uses a grid but applies an any-angle search that allows more distant grid vertices to be connected by a straight line (line of sight). The obtained trajectory is smoother and more efficient than an 8-directional trajectory.⁹ Yap et al. proposed the Block A* method, which utilizes a database of local distances between blocks. The database contains more than eight nearest blocks; therefore, it is more efficient than A* or Theta*.¹⁰

To overcome the fact that planned grid trajectories are not smooth and therefore not directly drivable by a non-holonomic vehicle, the Stanford University team building the robot Junior for the DARPA Urban Challenge¹¹ developed an algorithm named Hybrid A*. The algorithm uses continuous states within a 4-D state space (two horizontal coordinates, orientation, and direction of motion, i.e. forward or reverse). The surroundings of the best node are explored in six defined directions (combinations of forward/backward throttle and left/straight/right steering), which results in six child states. Each child state is assigned to one horizontal grid cell. To improve the efficiency of the expansion, two different heuristic functions are used to evaluate each state: the first considers the non-holonomic nature of the robot and the second considers obstacles. As a result, fewer nodes need to be explored during navigation. The resultant path is smoothed by Conjugate Gradient filtering.¹²

Uniform expansion method

With the original version of VFH*, local maneuvering of the robot to a nearby target defined by all three coordinates does not work reliably. One of the reasons is that the expansion uses a constant travel distance (the original histogram used in VFH considered only directions). If the target is closer than the expansion distance, the robot will oscillate. The Hybrid A* algorithm is a good starting point for solving this problem.

To cover the state space $〈 x, y, ψ 〉$ more uniformly, we discretize the state space into a grid. The grid size corresponds to the tolerance of the whole navigation algorithm. The position of the robot, when snapped to the grid, is

[x, y, ψ] = [n_{x} G_{x y}, n_{y} G_{x y}, n_{ψ} G_{ψ}]

where $G_{x y}$ is a translational grid cell size, $G_{ψ}$ is a rotational grid cell size, and $n_{x}$ , $n_{y}$ , $n_{ψ}$ are integers. Introducing the grid also enables fast searching for already visited positions within the A* searching algorithm, which avoids oscillations and loops during the search.
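A minimal sketch of the snapping step follows. The function name `snap_to_grid` is hypothetical, and the wrap of the rotational index assumes that $G_{ψ}$ divides the full turn evenly (e.g. 5°), which the text does not state explicitly.

```python
import math

def snap_to_grid(x, y, psi, g_xy, g_psi):
    """Return integer grid indices (n_x, n_y, n_psi) of a continuous pose.

    The heading index is wrapped so that psi and psi + 2*pi share one cell
    (assumption: g_psi divides 2*pi evenly).
    """
    cells_per_turn = round(2.0 * math.pi / g_psi)
    n_x = round(x / g_xy)
    n_y = round(y / g_xy)
    n_psi = round(psi / g_psi) % cells_per_turn
    return n_x, n_y, n_psi

# The index triple is hashable, so visited poses can be kept in a set.
visited = set()
visited.add(snap_to_grid(23.0, -4.0, math.radians(358), 10.0, math.radians(5)))
```

Storing the integer triple rather than the continuous pose is what makes the lookup for already visited positions a constant-time set membership test.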

The exploration of nearby positions around the currently best position is done in reverse: first, we set the desired position and then we compute the action required to reach that position. We assume that the robot’s steering control does not need to be continuous (the curvature of the trajectory may change rapidly). The scenario is illustrated in Figure 2. Note that direction (angle) $ψ_{i - 1}$ is represented by a directional arrow.

Figure 2.

Approaching the nearby position in the ith expansion depth.

The length and the direction of the distance vector $P_{i - 1} P_{i}$ are (in that order)

d_{i} = | d_{i} | = \sqrt{{(x_{i} - x_{i - 1})}^{2} + {(y_{i} - y_{i - 1})}^{2}}

ψ_{d i} = atan2 (y_{i} - y_{i - 1}, x_{i} - x_{i - 1})

The angle $δ_{i}$ is equal to

δ_{i} = (ψ_{d i} - ψ_{i - 1}) - 2 π ‖ \frac{ψ_{d i} - ψ_{i - 1}}{2 π} ‖

where $‖ x ‖$ denotes the nearest integer to a given real number x. The distance $d_{i}$ is:

d_{i} = 2 R_{i} sin δ_{i}

where $R_{i}$ is the turning radius. Then we obtain

R_{i} = \frac{d_{i}}{2 sin δ_{i}}

We assume that the vehicle has two control variables, throttle T and steering S, both within the range $〈 - 1, 1 〉$ . The steering is then

S_{i} = \frac{R_{min}}{R_{i}} = \frac{2 R_{min} sin δ_{i}}{d_{i}}

where $R_{min}$ is the minimal turning radius of the robot. The length of the trajectory is

L_{i} = \begin{cases} d_{i} & if | δ_{i} | < ε, \\ d_{i} \frac{| δ_{i} |}{sin | δ_{i} |} & if ε \leq | δ_{i} | \leq \frac{π}{2}, \\ - d_{i} \frac{(π - | δ_{i} |)}{sin (π - | δ_{i} |)} & if \frac{π}{2} < | δ_{i} | \leq (π - ε), \\ - d_{i} & if | δ_{i} | > (π - ε) \end{cases}

where ε is the numerical precision of the data type used in the implementation (e.g. $ε = 10^{- 6}$ ). A negative length $L_{i}$ means going backwards.

Within one step of the algorithm, it is reasonable to explore up to the maximal achievable distance, which corresponds to full throttle

L_{max} = v_{max} Δ t

where $v_{max}$ is the maximal speed of the robot and $Δ t$ is the time period of one exploration step. The throttle is then

T_{i} = \frac{L_{i}}{L_{max}}

The final direction of the robot after the turn is

ψ_{i} = ψ_{i - 1} + \frac{L_{i}}{R_{i}} = ψ_{i - 1} + \frac{2 L_{i} sin δ_{i}}{d_{i}}

If the computed throttle $T_{i}$ or steering $S_{i}$ is outside the $〈 - 1, 1 〉$ boundary, the point is unreachable. Figure 3 shows reachable positions within distance $L_{max} = \frac{π}{2} R_{min}$ (quarter turn). The horizontal grid cell size was set to $G_{x y} = L_{max} / 10$ .

Figure 3.

Reachable positions from the current position within one exploration step.

To speed up the calculation, it is possible to precompute a set of possible actions $[T, S]$ that will later be used for exploration.
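The reverse computation of an action from a desired position, and the precomputation of the action set, can be sketched as follows. This is an illustrative reading of the formulas in this section, not the authors' implementation; the function names `action_to` and `precompute_actions` are hypothetical, and treating a zero displacement as "no action" is an assumption.

```python
import math

EPS = 1e-6  # numerical tolerance, as suggested in the text

def action_to(prev_pose, target_xy, r_min, l_max):
    """Return (throttle, steering, new_heading), or None if unreachable."""
    x0, y0, psi0 = prev_pose
    dx, dy = target_xy[0] - x0, target_xy[1] - y0
    d = math.hypot(dx, dy)
    if d < EPS:
        return None  # assumption: staying in place is not an action
    psi_d = math.atan2(dy, dx)
    raw = psi_d - psi0
    # Wrap the angle between heading and chord into [-pi, pi].
    delta = raw - 2.0 * math.pi * round(raw / (2.0 * math.pi))
    a = abs(delta)
    # Signed arc length; a negative value means going backwards.
    if a < EPS:
        length = d
    elif a <= math.pi / 2:
        length = d * a / math.sin(a)
    elif a <= math.pi - EPS:
        length = -d * (math.pi - a) / math.sin(math.pi - a)
    else:
        length = -d
    steering = 2.0 * r_min * math.sin(delta) / d
    throttle = length / l_max
    if abs(throttle) > 1.0 or abs(steering) > 1.0:
        return None  # outside the <-1, 1> control range
    new_heading = psi0 + 2.0 * length * math.sin(delta) / d
    return throttle, steering, new_heading

def precompute_actions(r_min, l_max, g_xy):
    """Collect [T, S] pairs for all reachable grid offsets around the origin."""
    n = int(l_max // g_xy)
    actions = []
    for nx in range(-n, n + 1):
        for ny in range(-n, n + 1):
            act = action_to((0.0, 0.0, 0.0), (nx * g_xy, ny * g_xy), r_min, l_max)
            if act is not None:
                actions.append(act[:2])
    return actions
```

Because the action depends only on the target position relative to the current pose, the set can be computed once for the pose (0, 0, 0) and reused for every expanded node.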

Distance between the robot and the obstacle

Evaluation of collisions requires the distance between the robot’s projection into a given position $P_{i}$ and the surrounding obstacles. In many cases, the robot and the obstacles have a rectangular horizontal footprint. More complex shapes can be composed as a combination of rectangles. The problem of finding the distance to the nearest obstacle is therefore transformed into computing the distance between two arbitrarily rotated rectangles. The distance between the rectangles is the minimum of the distances between all corners of one rectangle and the sides of the other rectangle. Figure 4 illustrates the whole situation.

Figure 4.

Minimal distance between the robot and the obstacle.

Position of the robot’s origin is shifted from the rectangle’s centrum $C_{i}$ to the centrum of the non-controlled axle $P_{i}$ by dislocation $e_{0}$ . Dimensions of the robot are $a_{0} \times b_{0}$ , dimensions of the kth obstacle are $a_{k} \times b_{k}$ . The position of the robot’s geometrical centrum is

C_{i} = P_{i} + e_{0} \cdot {[cos ψ_{i} sin ψ_{i}]}^{T}

Positions of the obstacle’s corners are

M_{k, n} = C_{k} + \frac{1}{2} [\begin{matrix} cos ψ_{k} & - sin ψ_{k} \\ sin ψ_{k} & cos ψ_{k} \end{matrix}] \cdot [\begin{matrix} \pm a_{k} \\ \pm b_{k} \end{matrix}]

where $ψ_{k}$ is the orientation (angle of rotation) of the obstacle and $n = 1, 2, \dots 4$ is the index of the corner. The positions of the obstacle’s corners in robot’s local coordinate system $〈 x^{'}, y^{'} 〉$ are

{M'}_{k, n} = [\begin{matrix} {x^{'}}_{k, n} \\ {y^{'}}_{k, n} \end{matrix}] = [\begin{matrix} cos ψ_{i} & sin ψ_{i} \\ - sin ψ_{i} & cos ψ_{i} \end{matrix}] \cdot (M_{k, n} - P_{i})

The distances between the obstacle’s corner and the robot are

d_{k, n} = min (| {x^{'}}_{k, n} - e_{0} | - a_{0}, | {y^{'}}_{k, n} | - b_{0})

Using similar approach, we obtain positions of the robot’s corners $N_{i, n}$

N_{i, n} = C_{i} + \frac{1}{2} [\begin{matrix} cos ψ_{i} & - sin ψ_{i} \\ sin ψ_{i} & cos ψ_{i} \end{matrix}] \cdot [\begin{matrix} \pm a_{0} \\ \pm b_{0} \end{matrix}]

The positions of the robot’s corners w.r.t. the obstacle are

{N^{'}}_{i, n} = [\begin{matrix} {x^{'}}_{i, n} \\ {y^{'}}_{i, n} \end{matrix}] = [\begin{matrix} cos ψ_{k} & sin ψ_{k} \\ - sin ψ_{k} & cos ψ_{k} \end{matrix}] \cdot (N_{i, n} - C_{k})

The distances $d_{i, n}$ between the robot’s corners and the obstacle are

d_{i, n} = min (| {x^{'}}_{i, n} | - a_{k}, | {y^{'}}_{i, n} | - b_{k})

The distance between the ith robot’s projection and the kth obstacle is the minimum of all eight distances

c_{i, k} = min (d_{k, 1}, d_{k, 2}, d_{k, 3}, d_{k, 4}, d_{i, 1}, d_{i, 2}, d_{i, 3}, d_{i, 4})

Clearance $c_{i}$ is the minimum of $c_{i, k}$ across all obstacles k.

If the obstacle or robot is significantly larger than the grid cell, the proposed method provides significant speedup of the clearance calculation compared to widely used computation of clearance using distances between occupied cells. It also allows more precise maneuvering, since each object may occupy its border cells only partially and the robot and the obstacle may occupy the same cell safely without collision. If the true horizontal footprint of the robot is not rectangular, we use minimal area bounding rectangle.
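The corner-to-rectangle idea can be sketched as below. Note that this sketch swaps in the standard Euclidean point-to-rectangle distance (with half-dimensions) rather than reproducing the expressions above term by term; all function names are hypothetical, and a rectangle is represented as a tuple (centre x, centre y, orientation, length a, width b).

```python
import math

def rect_corners(cx, cy, psi, a, b):
    """World coordinates of the four corners of a rotated rectangle."""
    c, s = math.cos(psi), math.sin(psi)
    return [(cx + 0.5 * (sa * a * c - sb * b * s),
             cy + 0.5 * (sa * a * s + sb * b * c))
            for sa in (1, -1) for sb in (1, -1)]

def point_rect_distance(px, py, cx, cy, psi, a, b):
    """Distance from a point to a rotated rectangle (negative if inside)."""
    c, s = math.cos(psi), math.sin(psi)
    # Transform the point into the rectangle's local coordinate system.
    lx = c * (px - cx) + s * (py - cy)
    ly = -s * (px - cx) + c * (py - cy)
    ox, oy = abs(lx) - a / 2.0, abs(ly) - b / 2.0
    if ox <= 0.0 and oy <= 0.0:
        return max(ox, oy)  # point lies inside the rectangle
    return math.hypot(max(ox, 0.0), max(oy, 0.0))

def rect_clearance(rect_a, rect_b):
    """Minimum over all eight corner-to-rectangle distances."""
    d_ab = min(point_rect_distance(px, py, *rect_b)
               for px, py in rect_corners(*rect_a))
    d_ba = min(point_rect_distance(px, py, *rect_a)
               for px, py in rect_corners(*rect_b))
    return min(d_ab, d_ba)

def robot_rect(px, py, psi, e0, a0, b0):
    """Robot rectangle whose centre is shifted by e0 along the heading."""
    return (px + e0 * math.cos(psi), py + e0 * math.sin(psi), psi, a0, b0)
```

For two disjoint rectangles the minimum is always attained at a corner of one of them, so eight evaluations suffice. Two rectangles that cross without any corner lying inside the other would still report a positive value, so a separate intersection test may be needed in that case.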

The reliability and safety of the proposed method depend on how the position of the obstacle is obtained. The obstacle may be detected by the robot’s local sensors or, in the case of multiple robots, the information may be provided by communication among them. In that case, the most critical factor is the safety of the communication, which has to be evaluated separately.¹³

The cost of single exploration step

To find the optimal trajectory, the A* method expands the not-yet-expanded node with the smallest (best) total cost during each iteration. The total cost $f_{i}$ is the sum of the cost $g_{i}$ spent on reaching the ith node and the estimated future cost $h_{i}$ that has to be spent to reach the target. The spent cost $g_{i}$ increases with distance, changes of direction, changes of speed, and going backwards.

The cost of distance

The cost of distance is proportional to the length of the trajectory

g_{L i} = K_{L} \cdot | L_{i} |, \quad [K_{L}] = m^{- 1}

where $K_{L}$ is the (constant) cost per meter.

The cost of changing the direction

If it is required to maintain a smooth trajectory, the cost of changing the heading may be introduced. The cost is proportional to the arc angle

g_{S i} = K_{S} \cdot | ψ_{i} - ψ_{i - 1} | = K_{S} \cdot | \frac{2 L_{i} sin δ_{i}}{d_{i}} |, \quad [K_{S}] = {rad}^{- 1}

where $K_{S}$ is the (constant) cost per radian.

The cost of stopping the robot

To avoid too many reversal points (point where the vehicle has to stop and change the polarity of throttle), we introduce the cost of stopping

g_{T i} = K_{T} \cdot neg (T_{i - 1} \cdot T_{i}), \quad [K_{T}] = 1

where $T_{i - 1}$ is the previous throttle, $K_{T}$ is a fixed cost per one reversal point and $neg (x)$ is the following function

neg (x) = \begin{cases} 1 & if x < 0 \\ 0 & if x \geq 0 \end{cases}

The cost of going backwards

For certain mobile robots, it is not convenient to go backwards (e.g. due to worse maneuvering abilities). The cost of going backwards is

g_{b i} = - K_{b} \cdot neg (L_{i}) \cdot L_{i}, \quad [K_{b}] = m^{- 1}

where $K_{b}$ is the (constant) cost per one meter of going backwards.

The cost of danger

The most crucial part of the navigation is avoiding collisions with obstacles. Chen and Zhang¹⁴ proposed a method for estimation of the distance between moving non-holonomic robots. Since the estimation of the position is not absolutely precise, collision detection should not be binary (the real position of the robot could collide with an obstacle while the estimated position does not collide). Therefore, we have introduced a danger field, which is derived from the potential field navigation method. The danger field defines a dimensionless danger of collision as a function of the robot’s position $[x, y, ψ]$ . The function is as follows

D_{i} = {[\frac{c_{1} - c_{0}}{max (c_{i} - c_{0}, ε)}]}^{κ}

where $c_{i}$ is a clearance (minimal distance) between the robot projected to position $P_{i}$ and the surrounding obstacles, $c_{0}$ is a safety border in meters around each obstacle, $c_{1}$ is a normalization constant which corresponds to the distance at which the danger is equal to 1. Coefficient κ is the steepness of the danger function (we use $κ = 2$ ) and ε is a very small positive number (prevents division by zero). If any obstacle moves, the danger field function also changes due to the changes of clearance.

The cost of danger is then

g_{D i} = K_{D} \cdot D_{i}, \quad [K_{D}] = 1

where $K_{D}$ is the weight (dimensionless constant) of the danger. If the $K_{D}$ is higher, the robot is “more afraid” of collision.
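The danger function is simple enough to sketch directly. The function name `danger` and the default value of `eps` are assumptions made for illustration.

```python
def danger(clearance, c0, c1, kappa=2.0, eps=1e-9):
    """Dimensionless danger of collision for a given clearance.

    c0 is the safety border, c1 the clearance at which the danger equals 1,
    kappa the steepness; eps only guards against division by zero.
    """
    return ((c1 - c0) / max(clearance - c0, eps)) ** kappa
```

With $c_{0} = 5$ and $c_{1} = 15$ , a clearance of 15 gives a danger of 1, a clearance of 25 gives 0.25, and a clearance of 10 gives 4; inside the safety border the value becomes very large, which effectively forbids such positions.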

The cost of time

The same trajectory can be achieved by a smaller number of large steps (higher speed) or by a larger number of small steps (small speed). It is desirable to select the faster trajectory, so we have introduced the cost of time. Each time step has the cost proportional to parameter $K_{t}$

g_{t i} = K_{t} Δ t, \quad [K_{t}] = s^{- 1}

The overall spent cost is then given by the sum of all partial costs and the cost $g_{i - 1}$ spent on reaching the position $P_{i - 1}$

g_{i} = g_{i - 1} + g_{L i} + g_{S i} + g_{T i} + g_{b i} + g_{D i} + g_{t i}
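The accumulation of the partial costs for one exploration step might look as follows. This is a sketch only: the names `CostWeights` and `step_cost` are hypothetical, and the default weights are taken from the values reported in the experimental section.

```python
from dataclasses import dataclass

@dataclass
class CostWeights:
    k_l: float = 1.0      # cost per unit of distance
    k_s: float = 20.0     # cost per radian of heading change
    k_stop: float = 100.0 # cost per reversal point
    k_b: float = 0.3      # extra cost per unit of distance going backwards
    k_d: float = 200.0    # weight of the danger
    k_time: float = 1.0   # cost per second

def neg(x):
    """1 for negative arguments, 0 otherwise."""
    return 1 if x < 0 else 0

def step_cost(w, g_prev, length, dpsi, throttle, prev_throttle, danger, dt):
    """Spent cost g_i after one exploration step."""
    g_l = w.k_l * abs(length)
    g_s = w.k_s * abs(dpsi)
    g_stop = w.k_stop * neg(prev_throttle * throttle)
    g_b = -w.k_b * neg(length) * length  # length < 0 when reversing
    g_d = w.k_d * danger
    g_time = w.k_time * dt
    return g_prev + g_l + g_s + g_stop + g_b + g_d + g_time
```

For instance, a forward step of length 48 with a heading change of 0.1 rad, danger 0.25, and a time step of 0.48 s costs 48 + 2 + 0 + 0 + 50 + 0.48 = 100.48 with these weights.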

Estimation of future cost

In this section, we solve the problem of the minimal distance that a non-holonomic wheeled robot has to travel to move from one position to another. The space of all possible trajectories is too large to search and contains discontinuities; therefore, it is necessary to choose a certain trajectory pattern.

Possible trajectories

We have chosen the trajectory pattern shown in Figure 5, which consists of an initial arc $P X$ , followed by a straight line $X Y$ , and finished by another arc $Y B$ . To keep the trajectory length minimal, the radius of the initial and the final arc is $R_{min}$ . Dubins proved that such a trajectory is the shortest.¹⁵ These trajectories are commonly referred to as Dubins paths. For motion planning purposes, reversing should also be considered. By modifying the Dubins principle, Reeds and Shepp¹⁶ introduced a more complex method using at most five segments of arcs and/or straight lines. This article proposes the use of a three-segment curve that also considers reversing.

Figure 5.

Trajectory pattern, forward trajectory.

First, we define process variables (we omit the index i of the position $P_{i}$ and of other variables):

$P = [x_{P}, y_{P}]$ the evaluated horizontal position,

$ψ_{P}$ the evaluated direction,

$B = [x_{B}, y_{B}]$ the target horizontal position,

$ψ_{B}$ the target direction,

$R_{1}$ the radial vector of the initial turn (perpendicular to the $ψ_{P}$ ),

$R_{3}$ the radial vector of the final turn (perpendicular to the target $ψ_{B}$ ),

$d$ the distance vector between the centrums of the turns,

$ψ_{d}$ the direction of vector $d$ ,

$L_{1}$ the length of the initial arc $P X$ ,

$L_{2}$ the length of the straight line $X Y$ ,

$L_{3}$ the length of the final arc $Y B$ ,

$α_{1}$ the initial arc angle,

$α_{3}$ the final arc angle,

$s_{1}$ the polarity of the initial arc (1 = counterclockwise, −1 = clockwise),

$s_{3}$ the polarity of the final arc (1 = counterclockwise, −1 = clockwise).

According to Figure 5, the centrum of the initial arc is

C_{1} = P + s_{1} R_{1} = P + s_{1} R_{min} [- sin ψ_{P}, cos ψ_{P}]

analogically, the centrum of the final turn is

C_{3} = B + s_{3} R_{min} [- sin ψ_{B}, cos ψ_{B}]

The distance vector between the centrum of the turns is then

d = C_{3} - C_{1} = [d_{x}, d_{y}]

and its direction is

ψ_{d} = atan2 (d_{y}, d_{x})

In the following sections, we will compute all possible trajectories matching the given trajectory pattern.

The same polarity of turns

First, we will discuss the case when both turns have the same polarity

s_{1} = s_{3}

The arc angles are then (according to Figure 5)

α_{1 +} = s_{1} (ψ_{d} - ψ_{P})

α_{3 +} = s_{3} (ψ_{B} - ψ_{d})

The positive sign in the indices denotes that the straight segment of the trajectory is traveled forward. Polarities $s_{1}$ and $s_{3}$ were introduced because we want to avoid negative angles. To keep the arc angles within the interval $[0, 2 π)$ , we remove the period

α_{k} \leftarrow α_{k} - 2 π ⌊ \frac{α_{k}}{2 π} ⌋

where $⌊ x ⌋$ denotes the largest integer not exceeding a given number x (the floor function). A similar relation is valid for all angles used. Positions $X$ and $Y$ can be reached not just by turning by angles $α_{1 +}$ and $α_{3 +}$ , respectively, but also by the complementary angles $(2 π - α_{1 +})$ and $(2 π - α_{3 +})$ in reverse (displayed by the dashed line).

There is also a possibility of travelling the straight line reversely (see Figure 6).

Figure 6.

The same polarity of turns, reverse trajectory.

The arc angles in the reverse case are

α_{1 -} = s_{1} (ψ_{d} + π - ψ_{P})

α_{3 -} = s_{3} (ψ_{B} - ψ_{d} - π)

Operation (41) has to be applied to both angles (42) and (43) to keep them within the interval $[0, 2 π)$ . The length of the straight element is

d = \sqrt{d_{x}^{2} + d_{y}^{2}}

To denote all possible trajectories in a compact way, we will denote each trajectory as a vector $[L_{1}, L_{2}, L_{3}]$ . The negative sign will mean that the robot should travel backwards. Using this notation, the possible trajectories for the case of the same polarity of turns are

\begin{matrix} [R_{min} α_{1 +}, d, R_{min} α_{3 +}] \\ [- R_{min} (2 π - α_{1 +}), d, R_{min} α_{3 +}] \\ [R_{min} α_{1 +}, d, - R_{min} (2 π - α_{3 +})] \\ [- R_{min} (2 π - α_{1 +}), d, - R_{min} (2 π - α_{3 +})] \\ [R_{min} α_{1 -}, - d, R_{min} α_{3 -}] \\ [- R_{min} (2 π - α_{1 -}), - d, R_{min} α_{3 -}] \\ [R_{min} α_{1 -}, - d, - R_{min} (2 π - α_{3 -})] \\ [- R_{min} (2 π - α_{1 -}), - d, - R_{min} (2 π - α_{3 -})] \end{matrix}
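The eight candidates for one same-polarity combination can be enumerated as sketched below. The function names are hypothetical and the code is an illustrative reading of the formulas in this section, with points given as (x, y) tuples and s equal to +1 (counterclockwise) or −1 (clockwise) for both turns.

```python
import math

TWO_PI = 2.0 * math.pi

def wrap_2pi(angle):
    """Remove the period so that the angle lies in [0, 2*pi)."""
    return angle - TWO_PI * math.floor(angle / TWO_PI)

def turn_centre(x, y, psi, s, r_min):
    """Centre of a turn of polarity s starting at pose (x, y, psi)."""
    return (x - s * r_min * math.sin(psi), y + s * r_min * math.cos(psi))

def same_polarity_trajectories(p, psi_p, b, psi_b, s, r_min):
    """Return eight [L1, L2, L3] candidates; negative means backwards."""
    c1 = turn_centre(p[0], p[1], psi_p, s, r_min)
    c3 = turn_centre(b[0], b[1], psi_b, s, r_min)
    dx, dy = c3[0] - c1[0], c3[1] - c1[1]
    d = math.hypot(dx, dy)
    psi_d = math.atan2(dy, dx)
    result = []
    # sign = +1: straight segment forward; sign = -1: straight segment reversed.
    for sign, offset in ((1, 0.0), (-1, math.pi)):
        a1 = wrap_2pi(s * (psi_d + offset - psi_p))
        a3 = wrap_2pi(s * (psi_b - psi_d - offset))
        for l3 in (r_min * a3, -r_min * (TWO_PI - a3)):
            for l1 in (r_min * a1, -r_min * (TWO_PI - a1)):
                result.append((l1, sign * d, l3))
    return result
```

As a sanity check, a target 100 units straight ahead with the same heading yields (0, 100, 0) as the first candidate, i.e. a pure straight line.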

The opposite polarity of turns

The case when initial and final turns have opposite polarities is shown in Figure 7, the reverse variant is in Figure 8.

Figure 7.

Opposite direction of turns, forward trajectory.

Figure 8.

Opposite direction of turns, reverse trajectory.

The arc angles are

β_{1 +} = s_{1} (ψ_{d} - ψ_{P}) + δ

β_{3 +} = s_{3} (ψ_{B} - ψ_{d}) + δ

where the angle δ (according to the figure) is

δ = asin (\frac{R_{min}}{d / 2}) = asin (\frac{2 R_{min}}{d})

The complementary arc angles (dashed arcs) are computed in the same way as in the previous case. From equation (48), it is clear that the trajectory exists only if

R_{min} < \frac{d}{2}

The length of the straight section of the trajectory is

s = \sqrt{d_{x}^{2} + d_{y}^{2} - 4 R_{min}^{2}}
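The existence condition, the angle δ, and the straight-section length for opposite polarities can be checked with a small helper. The function name is hypothetical; the strict inequality follows the existence condition stated above.

```python
import math

def opposite_polarity_geometry(d, r_min):
    """Return (delta, s) for centre distance d, or None if no trajectory exists."""
    if d <= 2.0 * r_min:
        return None  # the turning circles are too close for a crossing tangent
    delta = math.asin(2.0 * r_min / d)
    s = math.sqrt(d * d - 4.0 * r_min * r_min)
    return delta, s
```

For example, with $R_{min} = 50$ and a centre distance of 200, the angle δ is π/6 and the straight section is about 173.2 long, while a centre distance of 100 yields no valid trajectory.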

Like in the case when both circles have the same polarity, there is also the possibility to travel reversely (see Figure 8).

The corresponding arc angles are

β_{1 -} = s_{1} (ψ_{d} + π - ψ_{P}) - δ

β_{3 -} = s_{3} (ψ_{B} - ψ_{d} - π) - δ

After removing the period from all arc angles using equation (41), we obtain another set of possible trajectories (valid for the case of opposite arc polarities)

\begin{matrix} [R_{min} β_{1 +}, s, R_{min} β_{3 +}] \\ [- R_{min} (2 π - β_{1 +}), s, R_{min} β_{3 +}] \\ [R_{min} β_{1 +}, s, - R_{min} (2 π - β_{3 +})] \\ [- R_{min} (2 π - β_{1 +}), s, - R_{min} (2 π - β_{3 +})] \\ [R_{min} β_{1 -}, - s, R_{min} β_{3 -}] \\ [- R_{min} (2 π - β_{1 -}), - s, R_{min} β_{3 -}] \\ [R_{min} β_{1 -}, - s, - R_{min} (2 π - β_{3 -})] \\ [- R_{min} (2 π - β_{1 -}), - s, - R_{min} (2 π - β_{3 -})] \end{matrix}

Computing the minimal cost of travelling to the target

There are 4 possible combinations of arc polarities

[s_{1}, s_{3}] \in {[- 1, - 1], [- 1, 1], [1, - 1], [1, 1]}

For each combination, we obtain eight possible trajectories (using equation (45) for the same polarities and equation (53) for the opposite polarities). The full set contains 4 × 8 = 32 possible trajectories matching the defined pattern. Each trajectory has its cost, which reflects the requirements of the application. Generally, the cost increases with the distance, changing of the heading, stopping the robot, or going backwards. Note that we do not evaluate danger (since it must be evaluated at each point of the trajectory). The formulas for predicted future costs are similar to the formulas for the spent cost (equations (25) to (33))

h_{L i} = K_{L} (| L_{1 i} | + | L_{2 i} | + | L_{3 i} |)

h_{S i} = K_{S} \frac{(| L_{1 i} | + | L_{3 i} |)}{R_{min}}

h_{T i} = K_{T} [neg (T_{i} \cdot L_{1 i}) + neg (L_{1 i} \cdot L_{2 i}) + neg (L_{2 i} \cdot L_{3 i})]

h_{b i} = - K_{b} [neg (L_{1 i}) \cdot L_{1 i} + neg (L_{2 i}) \cdot L_{2 i} + neg (L_{3 i}) \cdot L_{3 i}]

h_{t i} = K_{t} \frac{(| L_{1 i} | + | L_{2 i} | + | L_{3 i} |)}{v_{max}}

where $T_{i}$ is the throttle which was used to reach position $P_{i}$ (corresponds to the forward speed of the robot). The cost of time is computed as if the speed was maximal during the whole trajectory (minimal time). The overall estimated future cost is then given by the sum of all costs

h_{i} = h_{L i} + h_{S i} + h_{T i} + h_{b i} + h_{t i}

The estimated future cost is computed for each of the 32 possible trajectories and the best trajectory is selected.
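Evaluating the candidates and picking the cheapest one can be sketched as follows. The function names are hypothetical; the backward-travel term is written with a leading minus sign so that the penalty is positive, in line with the definition of the spent cost of going backwards.

```python
def neg(x):
    """1 for negative arguments, 0 otherwise."""
    return 1 if x < 0 else 0

def future_cost(traj, throttle, r_min, v_max, k_l, k_s, k_t, k_b, k_time):
    """Estimated cost of one [L1, L2, L3] candidate (danger is not included)."""
    l1, l2, l3 = traj
    total = abs(l1) + abs(l2) + abs(l3)
    h_l = k_l * total
    h_s = k_s * (abs(l1) + abs(l3)) / r_min
    # Each sign change between consecutive segments is one reversal point.
    h_t = k_t * (neg(throttle * l1) + neg(l1 * l2) + neg(l2 * l3))
    h_b = -k_b * (neg(l1) * l1 + neg(l2) * l2 + neg(l3) * l3)
    h_time = k_time * total / v_max  # as if travelling at maximal speed
    return h_l + h_s + h_t + h_b + h_time

def best_future_cost(trajs, throttle, r_min, v_max, k_l, k_s, k_t, k_b, k_time):
    """Minimum estimated cost over all candidate trajectories."""
    return min(future_cost(t, throttle, r_min, v_max, k_l, k_s, k_t, k_b, k_time)
               for t in trajs)
```

With the weights from the experimental section, a straight forward segment of length 100 costs 100 + 1 = 101, whereas the same segment driven backwards costs 131, so the forward variant is selected.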

The greed factor

If the original A* searching algorithm is used, the total cost of any predicted position $P_{i}$ is

f_{i} = g_{i} + h_{i}

The navigation algorithm finds the optimal trajectory, but it examines too many nodes, which results in poor real-time performance. To speed up the search, we have introduced the greed factor γ. It sacrifices a small portion of optimality (minimal cost) to decrease the number of nodes that need to be expanded during the search. The modified total cost function replaces equation (61)

f_{i} = (1 - γ) g_{i} + h_{i}

When the greed factor is equal to 1, the search does not consider the spent cost $g_{i}$ at all, which results in a purely greedy search.

Each predicted state of the robot will be represented by the structure Node :

structure Node

. [x, y, ψ] -- position

. [T, S] -- action

. parent -- previous node

. g -- spent cost

. h -- estimated future cost

. f -- total cost

end
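One possible Python counterpart of this structure is a dataclass, shown here only as an illustration. The field names mirror the pseudocode; the helper `path_to`, which recovers the node sequence by following parent links, is hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class Node:
    x: float                          # position
    y: float
    psi: float                        # heading
    throttle: float                   # action used to reach this node
    steering: float
    parent: Optional["Node"] = None   # previous node
    g: float = 0.0                    # spent cost
    h: float = 0.0                    # estimated future cost
    f: float = 0.0                    # total cost

def path_to(node: Node) -> List[Node]:
    """Follow parent links back to the start and return the path in order."""
    path = []
    current: Optional[Node] = node
    while current is not None:
        path.append(current)
        current = current.parent
    return path[::-1]
```

Keeping the action in the node means that, once the best leaf is known, the first action along the path can be read off directly and applied to the robot.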

The cost spent from the starting state to the predicted state is computed using the function spent_cost shown in Algorithm 1. The estimated cost from the predicted state to the target is computed by the function future_cost described in Algorithm 2:

Algorithm 1.

function spent_cost (node, obstacles)

Algorithm 2.

function future_cost (node, target)

Exploration of the child states is accomplished by the function expand shown in Algorithm 3:

Algorithm 3.

function expand(node, target, obstacles)

Finally, the function navigate implements A* searching with greed factor (see Algorithm 4):

Algorithm 4.

function navigate (from, target, obstacles, T ₀ , S ₀)
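Only the signatures of Algorithms 1 to 4 appear above, so the following is not the authors' listing. It is a generic sketch of a best-first search with the modified total cost $f = (1 - γ) g + h$ , run on a toy 4-connected grid with unit step cost and a Manhattan heuristic instead of the robot's state space. The function name `greedy_astar` and the `passable` callback are hypothetical.

```python
import heapq

def greedy_astar(start, goal, passable, gamma=0.0):
    """Return (path, expanded_count); path is None if the goal is unreachable."""
    def h(cell):
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]   # entries are (f, g, cell)
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    expanded = 0
    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        expanded += 1
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], expanded
        for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (cell[0] + dx, cell[1] + dy)
            if nxt in closed or not passable(nxt):
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                # Greed factor: the spent cost is discounted by (1 - gamma).
                heapq.heappush(open_heap, ((1.0 - gamma) * ng + h(nxt), ng, nxt))
    return None, expanded
```

With `gamma=0.0` this reduces to plain A*; with `gamma=1.0` the ordering depends on the heuristic alone, which typically expands far fewer nodes at the price of possibly suboptimal paths once obstacles are present.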

Experimental results

The proposed system has many adjustable parameters. To obtain results that are comparable across experiments, we used a simulated world (see Figure 9). The parameters of the robot were as follows: $v_{max} = 100$ pixels s⁻¹, $R_{min} = 50 pixels$ , the tolerance of reaching the target (also the grid cell size): 10 pixels/5°. The resolution of the visualization was set to 640 × 480 pixels. The simulation was implemented in the OpenCV framework using the C++ programming language. The algorithm was evaluated on a standard PC with a dual-core Intel Pentium G870 3.10 GHz processor.

Figure 9.

The simulated world and the optimal trajectory found (300,260 nodes have been explored).

The world contains 12 static obstacles which makes it difficult for the robot to choose the optimal trajectory (many trajectories have the same cost).

We have evaluated the cost of the found trajectory, the computation time, and the count of explored nodes as a function of the greed factor. All the experiments were conducted with the same pseudorandom set of 1000 targets and 20 different settings of the greed factor (20,000 experiments). The cost constants which describe the properties of the optimal trajectory were set to the following values: the prediction step $Δ t = 480 ms$ , the cost of distance $K_{L} = 1 / pixel$ , the cost of steering $K_{S} = 20 {rad}^{- 1}$ , the cost of stopping $K_{T} = 100$ , the cost of going backwards $K_{b} = 0.3$ , the cost of danger $K_{D} = 200$ , the cost of time $K_{t} = 1 s^{- 1}$ . The parameters of the danger function were as follows: the safety border $c_{0} = 5 pixels$ , the unit distance $c_{1} = 15 pixels$ , the exponent $κ = 2$ (Figure 10 shows a map of danger around the obstacles and the robot itself).

Figure 10.

Map of danger.

To limit the simulation time, each experiment was limited to 1 s of computational time. If the algorithm did not find a solution within the given time limit, the experiment was considered unsuccessful. Such targets were removed from the set of targets. The remaining experiments were averaged for the same setting of the greed factor across all targets.

To demonstrate the importance of the uniform expansion method, we compare its performance with the state expansion in six directions only, which was proposed by the original Hybrid A* method.

As predicted, a higher greed factor allows the method to find the trajectory faster but the obtained trajectory is slightly suboptimal (compare Figures 9 and 11). Figure 12 shows the relation between the average cost per one target and the greed factor. With the greed factor from the range of 0 to 0.8, the cost of the trajectory is increasing only slightly (the greed factor $γ = 0.8$ caused an average increase of the cost by 35%). Further increase of the greed factor causes rapid increase of the cost (highly suboptimal behavior).

Figure 11.

A suboptimal path computed using a greed factor γ = 0.4 (only 49,240 nodes have been explored).

Figure 12.

Average cost of the trajectory versus greed factor.

Figure 13 shows the average count of nodes which had to be explored to find the solution. With the greed factor from 0 to 0.3, the average count of nodes decreases almost linearly, and for a greed factor above 0.6, it is almost constant.

Figure 13.

Average count of explored nodes versus greed factor.

Computational time is closely related to the count of explored nodes, therefore the relation in Figure 14 is similar to the one in Figure 13. A higher greed factor means lower computational time. For a greed factor above 0.5, the average computational time was below 100 ms.

Figure 14.

Average computation time versus greed factor.

Since we require the algorithm to operate in real time, the computation time was limited to 1 s. Figure 15 shows the probability of finding the trajectory within the given time limit. Without the greed factor, solutions for only 20% of the targets were found in time. For a greed factor above 0.5, more than 70% of the solutions were found in time. Note that using only six directions for expansion significantly increases the count of explored nodes, so the search times out in many cases. Figure 14 shows the average over successful searches only; therefore, it does not reflect the low probability of finding a solution with the six-directional expansion.

Figure 15.

Probability of finding the trajectory within timeout (1 s) versus greed factor.
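
The distinction between Figures 14 and 15 can be made concrete with a small calculation. The sketch below assumes each search is logged as its run time in seconds, or as `None` if it hit the time limit; the function name `summarize_runs` and the sample values are hypothetical and are not measurements from this article.

```python
def summarize_runs(times_s, timeout_s=1.0):
    """Return (success probability, mean time of successful runs).

    times_s holds one entry per search: the run time in seconds,
    or None when the search was aborted by the timeout.
    """
    if not times_s:
        raise ValueError("no runs to summarize")
    # A run counts as successful only if it finished within the limit.
    ok = [t for t in times_s if t is not None and t <= timeout_s]
    success_rate = len(ok) / len(times_s)
    # The mean is taken over successful runs only; None if there are none.
    mean_ok = sum(ok) / len(ok) if ok else None
    return success_rate, mean_ok


# Made-up log: two fast successes and two timeouts.
rate, mean_time = summarize_runs([0.2, None, 0.4, None])
print(rate, mean_time)
```

In this made-up log the mean over successful runs stays low even though half of the searches failed, which is why an average-time plot alone can hide a poor success rate.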

The algorithm was evaluated in the real world using an e-puck robot (a small mobile robot developed by GCtronic).^17,18 The testing environment with the e-puck robot inside is shown in Figure 16. The environment was also modeled in the control program (see Figure 17). The robot successfully followed the planned trajectory while its position was estimated using its onboard odometers.

Figure 16.

Testing environment with e-puck robot.

Figure 17.

Designed trajectory.
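
Position estimation from wheel odometry can be sketched as follows. This is a generic dead-reckoning update for a differential-drive robot such as the e-puck, not the code used in the experiment; the function name `update_pose`, the midpoint integration rule, and the wheel separation value in the example are assumptions.

```python
import math


def update_pose(x, y, psi, d_left, d_right, wheel_base):
    """Advance the pose (x, y, psi) by one odometry step.

    d_left and d_right are the distances travelled by the left and
    right wheel since the last update, in the same unit as wheel_base.
    """
    d_center = (d_left + d_right) / 2.0      # travel of the robot center
    d_psi = (d_right - d_left) / wheel_base  # change of heading in radians
    # Midpoint rule: move along the average heading of this step.
    x += d_center * math.cos(psi + d_psi / 2.0)
    y += d_center * math.sin(psi + d_psi / 2.0)
    # Wrap the heading to the interval [-pi, pi).
    psi = (psi + d_psi + math.pi) % (2.0 * math.pi) - math.pi
    return x, y, psi


# Illustrative wheel separation in meters; check the robot's datasheet.
WHEEL_BASE_M = 0.053

pose = (0.0, 0.0, 0.0)
pose = update_pose(*pose, 0.10, 0.10, WHEEL_BASE_M)  # straight segment
pose = update_pose(*pose, 0.00, 0.02, WHEEL_BASE_M)  # gentle left turn
print(pose)
```

Because each step integrates wheel increments, errors accumulate over the trajectory, which is consistent with evaluating the danger of collision smoothly rather than binarily.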

Conclusion

Our proposed method improves the evaluation method of the Hybrid A* algorithm. First, we introduced a uniform state expansion method, which speeds up the search and decreases the number of nodes that need to be explored. Then, we modified and simplified the heuristic used to estimate the future cost required to reach a given target. The navigation algorithm considers not only the length of the trajectory but also reversing, going backwards, changing direction, the danger of collision, and the time of travel. Each feature is penalized by a separate parameter, which allows simple adjustment of the algorithm's behavior to the requirements of a given application. The evaluation of danger is not binary, as in many implementations of Hybrid A* (e.g. studies by Kurzer^1,19), but smooth, which reflects the limited precision of the robot’s localization system.

To speed up the real-time computations, we used the greed factor, which allows us to set the balance between the required computational power and the cost of the projected trajectory. Our experiments show that a greed factor around 0.5 increases the cost of the obtained trajectory by only 16% but reduces the computational time fivefold, which greatly improves the real-time performance of the algorithm.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work has been supported by the Educational Grant Agency of the Slovak Republic KEGA, within the projects 014ŽU-4/2018 and 016ŽU-4/2018.

ORCID iD

Marián Hruboš

References

1. Kurzer. Hybrid A* Path Planner for the KTH Research Concept Vehicle, 2015. https://github.com/karlkurzer/path_planner (accessed 29 May 2018). GitHub.

2. Hoy, Matveev, Savkin. Algorithms for collision-free navigation of mobile robots in complex cluttered environments: a survey. Robotica 2015; 33(3): 463–497.

3. Hrbček, Šimák. Implementation of multi-dimensional model predictive control for critical process with stochastic behavior. In: Zheng (ed) Advanced model predictive control. Rijeka, Croatia: IntechOpen, 2011, p. 19.

4. Ulrich, Borenstein. VFH*: local obstacle avoidance with look-ahead verification. In: Proceedings 2000 ICRA. Millennium conference. IEEE International conference on robotics and automation. Symposia Proceedings, 24–28 April 2000, San Francisco, CA, USA, pp. 2505–2511. San Francisco, CA, USA: IEEE.

5. Chung, Huang. Predictive navigation by understanding human motion patterns. Int J Adv Robot Syst 2011; 8(1): 13.

6. Gomez, Hernandez, Crespo. A topological navigation system for indoor environments based on perception events. Int J Adv Robot Syst 2017; 14(1): 12.

7. Szabo. Topological navigation of simulated robots using occupancy grid. Int J Adv Robot Syst 2004; 1(4): 6.

8. Choset, Lynch, Hutchinson. Principles of robot motion: theory, algorithms, and implementations. Cambridge Center, Cambridge, MA: MIT Press, 2005.

9. Daniel, Nash, Koenig. Theta*: any-angle path planning on grids. J Artif Int Res 2010; 39: 533–579.

10. Yap, Burch, Holte. Block A*: database-driven search with applications in any-angle path-planning. In: AAAI’11 proceedings of the twenty-fifth AAAI conference on artificial intelligence, 07–11 August 2011, San Francisco, California. pp. 120–125. San Francisco, California: AAAI Press.

11. Buehler, Iagnemma, Singh (eds). Junior: the Stanford entry in the urban challenge. In: The 2005 DARPA grand challenge: the great robot race. Germany: Springer-Verlag Berlin Heidelberg, 2006, p. 31. DOI: 10.1002/rob.20258.

12. Dolgov, Thrun, Montemerlo. Path planning for autonomous vehicles in unknown semi-structured environments. Int J Robot Res 2010; 29(5): 485–501.

13. Franeková, Rástočný. Safety evaluation of fail-safe fieldbus in safety related control system. J Elect Eng 2011; 61(6): 350–356.

14. Chen, Zhang. An indoor mobile robot navigation technique using odometry and electronic compass. Int J Adv Robot Syst 2017; 14(3): 15.

15. Dubins. On curves of minimal length with a constraint on average curvature, and with prescribed initial and terminal positions and tangents. Am J Math 1957; 79(3): 497–516.

16. Reeds, Shepp. Optimal paths for a car that goes both forwards and backwards. Pacific J Math 1990; 145(2): 367–393.

17. Bernabeu, Valera, Gomez-Moreno. Distance computation between non-holonomic motions with constant accelerations. Int J Adv Robot Syst 2013; 10(9): 15.

18. Nolfi, Mirolli (eds) Evolution of communication and language in embodied agents. New York: Springer, 2009, p. 4.

19.