Abstract
Wheeled mobile robots are widely utilized for environment-exploring tasks both on earth and in space. As a basis for global path planning tasks for wheeled mobile robots, in this study we propose a method for establishing an energy-based cost map. Then, we utilize an improved dual covariant Hamiltonian optimization for motion planning method, to perform point-to-region path planning in energy-based maps. The method is capable of efficiently handling high-dimensional path planning tasks with non-convex cost functions through applying a robust active set algorithm, that is, non-monotone gradient projection algorithm. To solve the problem that the path planning process is locked in weak minima or non-convergence, we propose a randomized variant of the improved dual covariant Hamiltonian optimization for motion planning based on simulated annealing and Hamiltonian Monte Carlo methods. The results of simulations demonstrate that the final paths generated can be time efficient, energy efficient and smooth. And the probabilistic completeness of the method is guaranteed.
Keywords
Introduction
An important technique for the exploration of extraterrestrial planets during recent years is path planning. 1 However, there hardly exists a single path planning approach that can achieve all the exploration objectives or fulfill all the actual constraints. The main reason is that the tasks are performed in uncertain and complex environments and affected by additional hazards, such as communication time delays and limited daytime for exploiting solar power. 2 Taking the task of looking for signs of past water on Mars 3 as an example, one can assume that the objective of exploration is to navigate wheeled mobile robots (WMRs) to the region of interest for further investigation while avoiding unnecessary detours. This mission-oriented characteristic makes point-to-region (PTR) path planners suitable for planetary explorations.
Point-to-point (PTP) path planning methods, such as cell decomposition methods, 4 potential-field-based methods 5 and intelligence methods, 6 have long been studied. Some researchers have realized that PTR problems can potentially be too complicated for PTP planners, when there are multiple goals and final states, or the goal region covers a large area.7,8 Thus, research on PTR planners has practical significance. The research in this paper intends to develop a path planning approach for WMRs performing mission-oriented exploration tasks in extraterrestrial planets.
In general, path planning methods based on optimization methods9,10 and probability theories such as rapidly exploring random tree (RRT)-based methods11,12 have been shown to be more suitable for PTR problems in the environment previously mentioned. Peynot et al. 7 have constructed a control policy involving mobility prediction on different terrains to navigate a robot to a square goal region. This approach emphasizes the traversability of the path more than the path quality such as smoothness. Persson and Sharf 8 regard path planning toward a goal region as a sampling-based optimization problem, so that probabilistic completeness and fast convergence are guaranteed within a binary map. However, the method cannot work directly in non-binary maps which usually contain more information. Davoodi et al. 13 have developed an evolutionary algorithm to solve multi-objective path planning problems by utilizing a geometrical smoothing technique. But the smoothness is not inherent, which may affect the actual feasibility of the smooth path in real environment. Choudhury and Scherer 14 have derived a dual form of the original covariant Hamiltonian optimization for motion planning (CHOMP) method by adding a box constraint, 15 which can solve linear problems for constrained goal regions or confine the path in a specific linear search region. However, this method does not work well with non-convex cost functions, and it lacks probabilistic completeness.
The main contributions of this study are as follows:
We improve dual covariant Hamiltonian optimization for motion planning (Dual CHOMP) 14 to function well in ill-distributed energy-based cost maps by implementing simplified non-monotone gradient projection algorithm (NGPA). 16
We propose a new variant of the improved Dual CHOMP with probabilistic completeness.
By combining it with an energy-based cost map, the method can guide a WMR through a complicated environment while maintaining energy efficiency.
With these improvements, this novel CHOMP-based method is capable of overcoming more challenging cost maps, which are hard for the methods described in Choudhury and Scherer 14 and Zucker et al. 15 A series of simulations demonstrate that the new algorithm is efficient and that the quality of the path is much better than those obtained by other approaches in many situations.
The remainder of this paper is organized as follows. In section “Representation of the environment,” we propose a procedure for establishing an energy-based map (EMAP). In section “The definition of PTR path planning,” we define the path planning problem in the EMAP. In section “Improvement of the Dual CHOMP method,” we propose the improved Dual CHOMP method. In section “Simulation experiments for path planning tasks,” we describe the simulation experiments performed to demonstrate the performances of the new method. Finally, in section “Conclusion,” we present our conclusions.
Representation of the environment
For mobile robots such as legged robots 17 and WMRs, 18 it is hard to designate some regions to be definitely unreachable or untraversable in a certain map. Usually cost maps are capable of solving this problem. By penalizing the traversing costs, a WMR can traverse regions with relatively high elevation, which are usually considered as obstacles in binary maps, as illustrated in Figure 1.

(a) The WMR navigates around a region considered as an obstacle in a binary map. (b) The WMR traverses through the smooth and low-cost side of the same region, while keeping away from the steep side with a high cost in a cost map.
The WMR is assumed to move uniformly in this paper, in order to focus the discussion on the tendency of traverse energy change brought by the different elevations of continuous terrains. The energy consumption required to traverse a given region is obviously positively associated with the slope angle of that region, whether the WMR with the uniform velocity goes uphill or downhill. Thus, we propose the following procedure to establish an EMAP:
Step 1. Compute the slope angles from the elevations of adjacent nodes along the positive directions of the X- and Y-axes. The process is shown in Figure 2 taking X-axis as an example.
Step 2. Compute the energy-based changes corresponding to the slopes by
where m is the mass of the WMR,
Step 3. Let the traversing energy at the grid with the coordinate
Step 4. By comparing
where

Let
This procedure is proposed according to our preliminary research about the total traversing cost change while a real six-wheeled rover is climbing different slopes.18,19 For a path of a given length, the lower the total energy-based cost accumulated along the path, the less the actual energy consumed by a real WMR.
There are some regions which are impossible or unallowable to be traversed in a real map. These obstacle regions can be from certain rules of constructing a cost map or manual interference to designate regions to be untraversable. We set these regions as containing high costs in EMAPs. The cost function of an obstacle region is defined as
where x is any possible position of the WMR in the map. The function
The overall cost map

(a) The original digital elevation map (DEM) of a given terrain. The color indicates the elevations in meters. (b) The energy-based cost map obtained from (a) before weighting. (c) A binary representation of artificial obstacle regions and regions with high-cost grids filled by penalized high slope angles. (d) The complete EMAP
The definition of PTR path planning
The purpose of path planning in this paper is to safely navigate a WMR to a target region
We consider that the goal region T is a square to simplify the discussion. Once an optimal path connecting
Improvement of the Dual CHOMP method
In this section, we improve the original Dual CHOMP method to solve path planning problems in general cost maps which usually involve non-convex cost functions. A new variant based on the improved deterministic Dual CHOMP which can guarantee probabilistic completeness is proposed.
Implementing NGPA to improve Dual CHOMP
The objective functional is defined as follows 15
where
Let
Let Hessian
The continuous obstacle functional can be written as
where
A linear inequality constraint, representing the constraint of the goal region T, to the original optimization problem in iteration k is written as 14
where
Letting u be a Lagrange multiplier, we can obtain the primary solution from the Lagrange dual function
When applying the original Dual CHOMP on an EMAP, the Hessian
We simplify the NGPA here to improve computational efficiency. Let
The complete deterministic variant of the improved Dual CHOMP for path planning is shown in Algorithm 2, where
Applying simulated annealing and Hamiltonian Monte Carlo to Dual CHOMP
As pointed out in Zucker et al., 15 the initial configuration of the trajectory can greatly affect the performance of CHOMP, which also happens while applying the dual solution. We chose Hamiltonian Monte Carlo (HMC)15,20 to modify the initial path and simulated annealing 21 to handle global optimization. Meanwhile, the improved deterministic Dual CHOMP still functions as a key step.
Algorithm 3 shows the simulated annealing HMC Dual CHOMP (SHD CHOMP). RRT-connect with a large step size is performed ahead of Dual CHOMP for efficiently generating an initial rough configuration
Let
where
Thus, the kth step of the common leapfrog scheme in the metric space A based on equation (11) can be written as
where
The HMC procedure can encumber the computational efficiency when the path has too many nodes. A practical approach to boost the computational efficiency is to uniformly sample some nodes from the previously invalid path at the first iteration in Algorithm 3, until a rough path is determined. Then, after performing the leapfrog step with the rough path, the current path is interpolated for enough nodes before step 6 in Algorithm 3.
Simulation experiments for path planning tasks
The virtual simulation workspace is based on a square piece of the satellite map of Devon Island (around
Comparison of the deterministic Dual CHOMP and RRT-connect
We tested the improved deterministic Dual CHOMP against RRT-connect for comparison, referring to the comparisons of the original CHOMP. 15 These methods are popular and practical techniques for high-dimensional path planning problems. What we are trying to show is that Dual CHOMP can have significantly better performance under some circumstances, which provides us another choice dealing with practical problems.
The goal of RRT-connect was selected randomly in
It is important to inspect the performances without applying parameter sets specifically trained in advance for solving each problem for better outputs, as done in Zucker et al. 15 This means quite a high bias toward the randomized planner in our simulations. Optimization-based methods tend to be locked in local minima if the parameters and testing problems remain unchanged, which does not happen to randomized methods. Table 1 lists the test parameters for each algorithm. The improved Dual CHOMP and the original CHOMP shared the same parameters. The untraversable regions were dilated with 2 grid points around their outline for a safer planning. The total cost was accumulated according to the costs of grids traversed in the EMAP. The path length was a scalar value when the side length of the square cost map was set at 1. The time budget was set at 10 s.
The parameters for the algorithms.
EMAP: energy-based map; CHOMP: covariant Hamiltonian optimization for motion planning; SHD CHOMP: simulated annealing HMC Dual CHOMP.
Figure 4 shows the final results of the comparison of all the three methods mentioned above. For PTP planners, the original CHOMP does outperform RRT-connect when it succeeds. As a PTR planner, the improved Dual CHOMP achieves a success rate nearly two times higher than the original CHOMP, given the nature of being region constrained and robust to ill-distributed costs.

The comparison results of the three methods. From left to right: the success rates of three methods within a time limit of 10 s, the average running time, the average path cost, the average path length of the three methods considering only successful runs and the time distributions of successful runs with improved Dual CHOMP and RRT-connect only.
For CHOMP-based methods, the optimization-driven characteristic and the untrained parameters limit the performance in success rate. When the instant success rate is a prior issue, that could be a potential problem. And that is why an approach with probabilistic completeness is necessary, which will be tested and discussed in the next section. Nevertheless, the improved Dual CHOMP significantly outperforms the randomized planner in all other aspects we are interested in. Figure 5 shows an illustration of two successful runs of Dual CHOMP and RRT-connect in the same test problem in the EMAP and the real word map. The original CHOMP fails to find a feasible path in this case. As we can see, Dual CHOMP can usually promise a shorter and smoother path, while RRT-connect usually generates some unnecessary detours encumbering its performance.

(a) The final paths of two successful runs with improved Dual CHOMP (green) and RRT-connect (magenta). (b) A 3D illustration of the paths in the real world. The landscape color indicates the elevation (unit: m).
Another prominent advantage of CHOMP-based methods is that the number of trajectory nodes can be controlled without significant impact on the performance. The WMR can simply follow the path without further interpolation which may cause path distortion. Table 2 presents a comparison to show how much the number of nodes affects the performances of different methods. In contrast, after the step size was down-sized 10 times as well, RRT-connect failed to find a any feasible path within the time budget in nearly all conditions.
Comparison of the three methods.
CHOMP: covariant Hamiltonian optimization for motion planning.
Comparison of SHD CHOMP and RRT*
We tested RRT*, 24 an optimization-driven variant of RRT, against SHD CHOMP to make a fair comparison, since they are both approaches of combining optimization and randomization. During the tests, we implemented this method using the same parameters as for the deterministic version. Table 1 lists the parameters we used for simulated annealing. The step size for RRT* was 2 grid spacings.
The two methods can both yield feasible and optimal results if kept running. The comparison here is mainly focused on evaluating the performance of the first feasible path. We used them to test 255 problems which were previously unsolvable for improved deterministic Dual CHOMP. The time budget was set at 10 s. The results shown in Table 3 are from 1275 runs with each algorithm. Figure 6 shows an example of two successful runs of SHD CHOMP and RRT* for the same test problem in an EMAP.
Simulation results of previously unsolvable problems.
SHD CHOMP: simulated annealing HMC Dual CHOMP; RRT: rapidly exploring random tree.

(a) The final paths of two successful runs with SHD CHOMP (green) and RRT* (magenta). (b) A 3D illustration of the paths in the real world. The landscape color indicates the elevation (unit: m).
Through the implementation of simulated annealing and HMC, SHD CHOMP significantly outperforms RRT* in success rate and average planning time. If a larger step size is chosen for initial path generator, that is, RRT-connect, the run time can be shortened while maintaining a high success rate according to our research (with a step size of 4 grid spacings, the success rate is 89.65% and the mean run time is 0.93 s). Smoothness is not well guaranteed by SHD CHOMP in the current tests. The primary reason is that the parameters of improved Dual CHOMP have remained unchanged in our study as mentioned in the previous section. This limits its flexibility to balance the smoothness and obstacle cost functions, leading to weak minima, that is, non-smooth paths.
Conclusion
In this paper, we propose the improved Dual CHOMP method, which is an optimization-driven PTR path planning method. It is designed to smoothly navigate a planetary WMR from its current position to a region of research interests from a global view. The method is composed of two variants: the improved deterministic Dual CHOMP can achieve a path of quality efficiently; SHD CHOMP can guarantee the probabilistic completeness on the basis of the deterministic variant. By switching to the randomized variant when the deterministic one fails, we can achieve an overall success rate of 97.68% in our problem set, without training different preset sets of parameters for each problem in advance or online adjusting parameters specifically.
By establishing an energy-based cost map with estimated energy consumption, the optimal result from the improved Dual CHOMP method can be energy saving. The generated path is of high quality for mission-oriented planetary exploration using a WMR. By designing other types of cost maps, the method can solve similar optimization problems.
The main weakness of the method is that the deterministic variant has a relatively lower success rate against other randomized planners. This makes the approach more suitable for planning tasks placing a priority on path quality instead of an instant success. Also it cannot deal with path planning problems in maze-like maps directly. One has to segment maze-like maps before applying the method. SHD CHOMP, the randomized variant, suffers from bad smoothness on account of the unchanged parameters in this research. On the other hand, this indicates that it can function better in our further research of developing a task-directed parameter adjustment technique with a real WMR. The method proposed is not designed to replace other path planning approaches in all practical situations, but rather provide a better alternative when available.
Footnotes
Acknowledgements
The authors would like to thank Sanjiban Choudhury and Sebastian Scherer for their deduction and open source code of the original Dual CHOMP.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
This study was supported by the National Natural Science Foundation of China (Grant No. 51822502) and the National Basic Research Program of China (Grant No. 2013CB035502).
