Abstract
We propose a new local path planning approach based on optimization methods with probabilistic completeness in this article. This approach adds a linear constraint to the original covariant Hamiltonian optimization for motion planning problem with a new cost function. By deducing the dual form, the path planning problem is described as a box-constrained quadratic programming problem. The nonmonotone gradient projection algorithm is introduced to solve the dual problem, which makes the algorithm adaptable to non-convex cost functions. In order to prevent early convergence at local minima that can occur when applying optimization methods, this article introduces Hamiltonian Monte Carlo to the modification, which constantly forces the initial path to jump out of the local extremum, thus improving the robustness and success rate of the path planning approach. Compared with other methods through simulations, this approach is proven to provide balanced planning efficiency and path quality. The feasibility in a real environment is experimentally validated by applying the approach to a wheeled mobile robot.
Keywords
Introduction
Path planning has long been a research topic in the development of mobile robots. It is highly related to robot autonomy, as well as their performance and completion of motion tasks. In the field of space exploration, almost all missions involving exploration of other planets use wheeled mobile robots (WMRs).1,2 For WMRs with limited steering ability such as planetary rovers, the path cannot be well tracked if not smooth enough. Meanwhile, planetary exploration missions usually involve regions of research interest instead of a single target position. Therefore, research into smooth and mission-directed path planning methods is practically significant for these situations.
Some researchers consider general path planning research to involve researching point-to-point (PTP) locomotion mostly, because a single complicated path planning problem can be decomposed into a series of continuous PTP tasks.3,4 For low-dimensional path planning problems, there are some popular approaches: cell decomposition methods like A*-based 5 and D*-based 6 methods, potential field methods like the artificial potential field method and its variants, 7 graphical methods like the Voronoi diagram, 8 and other intelligent methods. 9 These methods stress the instant success prior to quality of paths. Effective methods for solving high-dimensional path planning problems are randomization-based and optimization-based approaches, such as rapidly-exploring random trees (RRT), 10 covariant Hamiltonian optimization for motion planning (CHOMP), 11 and stochastic trajectory optimization for motion planning (STOMP). 12 Optimization-based methods can usually maintain a balance between the success rate and path quality.
In recent years, researchers have realized that point-to-region (PTR) path planning problems are difficult to solve with traditional PTP planners. Optimization and probabilistic methods were shown to be more suitable for solving PTR problems such as mission-directed explorations previously mentioned, due to their efficiency in addressing high-dimensional tasks. T Peynot et al. 13 constructed a probability-based control policy involving mobility prediction in various terrain types for robot navigation toward a square goal region, which emphasizes path traversability rather than path quality. S Persson and I Sharf 14 regard path planning toward a goal region as a sampling-based optimal problem, thus probabilistic completeness and fast convergence are guaranteed. However, the path obviously lacks smoothness. M Davoodia et al. 15 used an evolutionary algorithm to solve multi-objective path planning problems using a geometric smoothing technique. However, the post-smoothing technique can actually distort the original paths to some extent, leading to uncertainty regarding the feasibility of a forced-smoothed path in a real environment. S Choudhury and S Scherer 16 derived the dual form of the original CHOMP problem by adding a box constraint to the original CHOMP problem, 11 which can effectively solve linear goal region–constrained problems. However, this method and the original CHOMP do not work well with non-convex cost functions.
In this article, we take a step further to improve dual CHOMP by redesigning the cost function and introducing a nonmonotonic gradient searching approach. After these improvements, it becomes capable of local PTR path planning missions within ill-distributed cost maps, which are difficult for traditional CHOMP-based methods. We propose using the Hamiltonian Monte Carlo (HMC) method to improve the dual CHOMP method in order to guarantee the probabilistic completeness. Thus, it can solve local minima problems frequently encountered in the practical utilization of CHOMP-based method.
The remainder of this article is organized as follows. The map representation of environment and the kinematic model for a WMR are presented in “Map representation and kinematic model for a WMR” section. In “The path planning method based on improved dual CHOMP” section, the original dual CHOMP method is improved by modifying the cost function and introducing a nonmonotonic gradient searching approach for path planning missions in ill-distributed cost maps. The modified method based on HMC in Riemannian space is applied for probabilistic completeness. In “Simulations and analysis” section, simulations are performed to demonstrate the performance of the method. In “Experiment in a real environment” section, an experiment with our planetary rover in a real environment is performed to test the actual feasibility of the method. Finally, conclusions are presented in the “Conclusion” section.
Map representation and kinematic model for a WMR
A grid cost map is established to represent obstacles that may interfere the locomotion of a WMR, such as areas occupied by walls and rocks. We assume the obstacles are stationary relative to the WMR from a local view within limited time. The following cost of a grid is defined to prevent a WMR from being too close to obstacles
where x is any grid in the map,
A WMR system can generally be classified as a nonholonomic system, which has under-actuated characteristics. In order to focus the discussion on path planning problems, the locomotion of a WMR is assumed to be performed in a two-dimensional space. The model of the six-wheel robot we are using is simplified to a two-wheel kinematic model regardless of slip and skid, as shown in Figure 1. The pose of the entire WMR in a global coordinate system is defined as

The kinematic model of the wheeled mobile robot.
The pose error of the WMR can be written as
Thus, the following differential equation for the pose error can be derived
We can see that the connection between the control input
The path planning method based on improved dual CHOMP
CHOMP-based methods treat an overall path
where
Improved dual CHOMP
The start and goal points in a two-dimensional space can be seen as the locations of WMR at times
where
The smoothness functional we propose in this article is different from that used in the original CHOMP method. 11 Intuitively, the virtual Coriolis force applied to the current path node changes along with the steering angle and linear velocity while the WMR follows the planned path. This accounts for radial and tangential changes. On the contrary, the original CHOMP method only considers changes in the tangential value along the path, which we find will cause the planning procedure to be too sensitive to the smoothness functional during our research. After applying this modification, the value of the smoothness cost functional will change gently even when the nodes of path is sparse. On the contrary, the original CHOMP method tends to either diverge quickly when path nodes become sparse or converge too fast due to the fast growth of the virtual kinetic energy. The following theorem and proof are given regarding this point.
Theorem
The change in kinetic energy between two nodes of the path calculated from the original CHOMP will be larger than defined in equation (5) if the two path nodes are far away enough and the curvature of the path during a time interval
Proof
Assume the time interval
Considering an extreme situation where
where the curvature is
Here, we consider two different extreme conditions when the virtual acceleration
If
If
Thus,
Under the above assumptions, we know that the inequality
Normally,
After applying a first-order Taylor expansion to the cost functional
where
The equation (10) is actually a non-convex programming problem without constraints, which usually must be solved by deducing its dual form. Therefore, referring to the literature,
16
a general linear constraint
The primal solution is recovered from equation (11) and is written as
The dual factor u can be updated by iteratively applying equation (11). The original problem can then be solved. This equation by nature is a box-constrained quadratic convex programming problem. However, during daily exploration missions, we do not usually apply strong interpolation to the map in case unpredictable map distortion occurs. Therefore, the original dual CHOMP method cannot work well in a non-smooth cost map by generating non-convex cost functions.
We introduce nonmonotone gradient projection algorithm (NPGA) to amend this weak point brought by equation (11) and solve the dual problem. We simplify NGPA for an easy application in a real robot computing platform. Let

Gradient projection step.
The path planning problem can now be divided into two major steps. First, the dual factor
The complete improved dual CHOMP method for non-convex cost functions is shown in Algorithm 1, where
HMC dual CHOMP algorithm
The improved dual CHOMP uses gradient descent scheme to find minima through iterations. This may also make the algorithm be locked to local minima causing failures in path planning, which commonly occur in optimization-based path planners. Therefore, it is necessary to find an approach that guarantees probabilistic completeness of the method in this article, thus increasing the success rate of finding valid paths.
The initial path chosen for path searching has a great influence on the result when other parameters are held constant. Here, we propose introducing HMC 19 to improve dual CHOMP in order to increase the path searching ability autonomously without requiring training in advance as done in He et al. 11
When the algorithm detects that the final path generated by Algorithm 1 is locked to a local minimum, that is, obstacles interfere with the path, the algorithm will determine a new initial path using HMC in the metric space M. After repeating this procedure, the algorithm will eventually find an available path for a WMR.
The key step of HMC is the leapfrog procedure. According to the theory presented in He et al.,
11
we consider that the path
Thus, the leapfrog scheme of HMC in metric space M can be rewritten as
where
Since we treat the path as a high-dimensional point, the kinetic energy of WMR in HMC is set as
The entire algorithm is shown in Algorithm 2, where
Thus, the entire path searching progress functions via the following two major steps:
Perform Algorithm 1 to determine an optimal path for WMR for the first time.
Perform Algorithm 2 when Algorithm 1 fails and repeat Algorithm 2 until a feasible path is found.
A practical approach to boost the computational efficiency is to uniformly sample from the previously invalid path at the first iteration in Algorithm 2 until a rough path is determined. Then, after the leapfrog step with the rough path, we interpolate it linearly with enough nodes just before performing Algorithm 1.
Simulations and analysis
We performed a series of path planning simulations that consider problems which may occur in real environments. To test the performance of the HMC dual CHOMP method, we compared it with the original CHOMP and RRT* algorithm. 20 We compared HMC dual CHOMP with RRT* because they are integrated path planning methods that perform both random searching and optimization-based searching.
The method in this article and RRT* both have probabilistic completeness, which means they can autonomously find feasible solutions after enough attempts and suitable parameter adjustments under most situations. Therefore, we test their performances on the following strict conditions:
The preset parameters remain unchanged in all simulations.
We only compare the performance of the first feasible solution regardless of all the other feasible solutions that may be found after further attempts.
Solutions must be found within a certain time budget, or else, the solutions are seen as invalid.
Algorithm 2 will uniformly sample 10 nodes out of the 100 nodes from the current path
Although RRT* is a PTP planner, it can also be seen as a PTR path planner by sampling from the goal region in the HMC dual CHOMP method (Table 1). We let RRT* sample five available goal points from the goal region and calculated the average performance for comparison. All the initial points of the three path planners were the same and the time budget for path searching was set to 20 s.
HMC dual CHOMP and original CHOMP parameters.
HMC: Hamiltonian Monte Carlo; CHOMP: covariant Hamiltonian optimization for motion planning.
We performed simulations with the three methods in the same preset distance field cost map on the same computer with a G4600 CPU. The start points were selected from the left edge of the map. The linear goal regions were selected from the right edge of the map. This yielded a total of 90 sets of start–goal combinations. We performed five simulations for each start-goal set with the three path planning methods. The results from the simulations are shown in Table 2. The length of each path is a scalar value when the scalar length of the map border is set to
The comparison of simulation results from the three algorithms.
HMC: Hamiltonian Monte Carlo; CHOMP: covariant Hamiltonian optimization for motion planning.
The simulation results show that the overall performance of the HMC dual CHOMP method significantly outperforms the other methods in most aspects. One should note that we found
The number of path nodes from CHOMP-based methods were preset, while that from RRT* were post-calculated. Normally, the number of path nodes cannot be accurately controlled while performing RRT-based methods. The computational burden will grow exponentially with RRT-based methods when the size of the searching step decreases, while it only grows linearly with the CHOMP-based method. In a practical application, one can directly use a path with enough nodes as the tracking target for a WMR. Post interpolation may cause great distortion of the original path if a low number of path nodes is used.
The paired successful path searching results from HMC dual CHOMP and RRT* are shown in Figure 3. The cost value of the cost field map is shown in grayscale. The starting point is marked with a circle on the left edge. The final reached goal points are marked with a star or a triangle. The target goal region is confined by a dashed line. Obviously, the path generated with HMC dual CHOMP is smoother and the WMR can track it more easily. On the contrary, the RRT* path has sharp turns that may impede smooth locomotion.

HMC dual CHOMP and RRT* results in the same path planning problem.
The searching history of the HMC dual CHOMP method when Algorithm 1 is successful is shown in Figure 4. The overall cost of the path will push the trajectory away from the high cost regions as they are inhibited by obstacles. The changing history of the overall path cost is shown in Figure 5. The obstacle cost functional initially drives the path away from the obstacles. Then, the smoothness cost functional plays the major role in smoothing the path until the path searching terminal conditions are fulfilled.

Searching history with the HMC dual CHOMP method in a distance field map.

Overall path cost history.
The progress of the HMC dual CHOMP method as it drives the path away from local minima, that is, encountering collisions, is shown in Figure 6. We can see that Algorithm 2 will continuously search for valid paths after adding random momentum to the path when Algorithm 1 fails.

Progress of the algorithm as it jumps out of local minima.
It should be mentioned that HMC dual CHOMP, which places greater emphasis on optimization, is shown to be an effective approach for daily exploration tasks while maintaining high path qualities that we are interested in. However, the RRT-based method and other methods, which place greater emphasis on randomization, will obviously have more advantages for the maze-like map. By segmenting large maps into smaller local maps, the method proposed in this article can still intuitively function well. This is also the reason we emphasize that HMC dual CHOMP is a strong candidate for addressing local path planning problems.
Experiment in a real environment
We built an experimental environment for a planetary WMR in order to verify the feasibility of the algorithm in this study in a real environment. The hardware configuration of the entire system is shown in Figure 7. The host PC is responsible for managing the visual measurement information of OptiTrack, which contains the positions of obstacles and current locomotive status of the WMR. With this information, the host PC can perform path planning with the HMC dual CHOMP. The slave PC is in charge of commanding the WMR to track the planned path.

Hardware configuration in the experiment.
The layout of the experimental site is shown in Figure 8. We chose the far corner region as the target region. The WMR moves with a uniform linear velocity of

Experimental site.

Comparison of the planned path and the actual path (unit: mm).
Conclusion
We propose using a new PTR path planning method based on the original dual CHOMP method to address local exploration tasks for a WMR. After improving the structure of the cost functions and introducing a simplified NGPA, the method can now sufficiently handle ill-distributed cost maps containing non-convex cost functions. We went a step further toward enhancing the improved dual CHOMP method with probabilistic completeness by proposing the HMC dual CHOMP method. The method can provide an inherently smooth path for WMRs with limited steering ability such as a planetary rover performing local exploration missions. The generated path can usually be directly used to guide a WMR. The success rate is significantly higher than that provided by other algorithms. The path quality is also guaranteed by emphasizing optimization in our method, for example, the path is smoother and the path length is reduced by 22.4% at most. The average planning time in the simulations is 0.246 s with a CPU of low frequency, which can be used for real-time path planning in a real environment.
This method currently cannot sufficiently address path planning problems in large maps and maze-like maps. Although the simulations have produced some positive results, more thorough and complicated experiments in a real complex environment should be performed in future because the size of the experimental site is currently confined. The method can be modified to provide better performance for our WMR by specifically considering the structure of the planetary six-wheel robot we are using to meet the nonholonomic constrain. The real-time performance can be boosted by improving the computational efficiency with state-of-the-art nonlinear fractional calculus methods21,22 in our further work. The results from the simulation and experiment indicate that the method presented in this article can be modified specifically and applied to other common mobile robots when performing PTR path planning tasks.
Footnotes
Acknowledgements
The authors would like to thank Sanjiban Choudhury and Sebastian Scherer for their deduction and open source code of the original dual CHOMP.
Handling Editor: James Baldwin
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported by the National Natural Science Foundation of China (Grant No. 51822502) and National Basic Research Program of China (Grant No. 2013CB035502).
