Determining a globally optimal solution of belief space planning (BSP) in high-dimensional state spaces directly is computationally expensive, as it involves belief propagation and objective function evaluation for each candidate action. However, many problems of interest, such as active SLAM, exhibit structure that can be harnessed to expedite planning. Also, in order to choose an optimal action, an exact value of the objective function is not required as long as the actions can be sorted in the same way. In this paper we leverage these two key aspects and present the topological belief space planning (t-bsp) concept that uses topological signatures to perform this ranking for information-theoretic cost functions, considering only topologies of factor graphs that correspond to future posterior beliefs. In particular, we propose a highly efficient topological signature based on the von Neumann graph entropy that is a function of graph node degrees and supports an incremental update. We analyze it in the context of active pose SLAM and derive error bounds between the proposed topological signature and the original information-theoretic cost function. These bounds are then used to provide performance guarantees for t-bsp with respect to a given solver of the original information-theoretic BSP problem. Realistic and synthetic simulations demonstrate drastic speed-up of the proposed method with respect to the state-of-the-art methods while retaining the ability to select a near-optimal solution. A proof of concept of t-bsp is given in a small-scale real-world active SLAM experiment.
Decision-making under uncertainty in partially observable domains is inescapable for autonomous systems operating in the real world. An autonomous agent (robot) needs to make predictions about its own state and state of the world given often partial and uncertain data and decide its own actions accordingly. In principle, it has to perform inference about unknown quantities and evaluate the task-related objective on available actions. These operations can be quite expensive, especially in high-dimensional state spaces and with information-theoretic cost functions, such as differential entropy. Online decision-making under uncertainty in such a setting is essential in numerous problems in robotics, such as single and multi-robot active SLAM, sensor deployment, search and rescue, and autonomous environmental monitoring.
However, many problems of interest, for example, active SLAM, exhibit structure. In such problems, only subsets of variables interact directly with each other. It means there exist conditional independencies between variables and it is possible to simplify the model of their joint probability distribution. One way to represent certain dependencies or causality among variables of a stochastic system is via a graphical model of their joint probability distribution (Koller and Friedman 2009), for example, a factor graph and a Markov Random Field.
A graphical model approach involves expressing the joint probability density/mass function (belief) as a product of local functions of its variables. A structure of this model, described by a topology induced from the connectivity of the variables and factors they belong to, reveals probabilistic and algorithmic features of the model (e.g., independence, learning complexity) often indicating how to efficiently perform marginalization and conditioning operations on that model using only local computations (Lauritzen and Spiegelhalter 1988).
Moreover, sometimes it is possible to make decisions without explicitly performing the computationally expensive inference of future beliefs and cost function calculation by looking at some other features of the system highly related to the objective. Using this strategy, we can focus on important features first and leave the details for later if necessary, or when resources become available. The difficulty within this approach lies in identifying such features, that, on top of being discriminative, are less expensive to compute than solving the original optimization problem explicitly, and in understanding how close the solution based on them is to optimal.
In this work, we identify and study such features relevant to information-theoretic decision-making under uncertainty induced from structural information of the corresponding posterior probability distributions. Our approach is applicable to high-dimensional Gaussian belief space planning (BSP) problems that exhibit the specific structure of pairwise potentials, that is, each factor involves only two variables. It therefore particularly supports variants of the active SLAM problem (see, e.g., a recent survey (Placed et al. 2023)), that exhibit such a structure.
To get the intuition behind this idea, see Figure 1. Different actions generally lead to factor graphs with different topologies and more complex graphs tend to produce more accurate estimates, with smaller entropy. The idea is then to develop a computationally lightweight topological signature, that is, a function of the graph topology, that is highly correlated with the original information-theoretic cost, and to utilize that signature for decision-making. Such a concept is therefore expected to be especially beneficial in decision-making problems with large graphs, such as in active SLAM.
Conceptual illustration of topological belief space planning. Given that the objective is uncertainty minimization of the trajectory, an optimal action can be often deduced solely from posterior’s topology (middle column), that is, without predicting the values of future measurements and performing belief propagation (right column), but only by predicting that the trajectory a1 will have loop closures due to re-observing previously mapped area, while a2 won’t as it leads to an unknown (gray shaded) area (left column).
The general BSP problem with Markovian models can be naturally represented by a Partially Observable Markov Decision Process (POMDP). Determining globally optimal solutions to the POMDP problem is computationally intractable for most but the simplest real-world problems due to both the curse of dimensionality and the curse of history (see, e.g., Pineau et al. 2006). Computational complexity remains an issue even with simplifying assumptions on the probability distribution of the states, finite planning horizons, discrete states, actions or observations (Papadimitriou and Tsitsiklis 1987). As a result, a large subset of prior work has focused on approximately solving the POMDP problem to provide better scalability. Some examples include using sampling-based motion planners (e.g., Kavraki et al. 1996; Prentice and Roy 2009; Agha-Mohammadi et al. 2014; Davidson and Hutchinson 2009), local optimization methods for continuous state spaces (e.g., Indelman et al. 2015; Platt et al. 2010; Van Den Berg et al. 2012), or point-based value iteration (e.g., Pineau et al. 2006; Porta et al. 2006; Kurniawati et al. 2008).
The paradigm proposed herein is related to computationally efficient information-theoretic BSP in the context of active pose SLAM. We formulate the optimization problem in a topological space where it is more easily solved, yet its solution is close to a solution of the original BSP optimization problem. Unlike existing topological methods in robot motion planning that consider a topology of the configuration space of the robot to generate or topologically classify paths (Kim et al. 2013; Bhattacharya et al. 2015) or to create and use topological maps of the environment (Ranganathan and Dellaert 2011; Choset and Nagatani 2001; Angeli et al. 2009; Paul and Newman 2010; Blöchliger et al. 2018), topology in our work refers to the structure of the belief, which encodes dependency relationships among states, and is used in planning.
An information-theoretic cost in an objective function of BSP is some measure of system uncertainty (information), typically (conditional) entropy or mutual information. Evaluating it requires determining the expected posterior belief of a candidate action. For Gaussian distributions the corresponding computations usually involve calculating a determinant of a posteriori covariance (information) matrix whose complexity is O(n3) in the general case, where n is the state dimension. Moreover, these calculations need to be performed for each candidate action. In Indelman et al. (2015) this challenge was addressed by resorting to the information form and utilizing sparsity; however, calculations still involve expensive access to marginal probability distributions. The rAMDL approach (Kopitkov and Indelman 2017, 2019) performs a one-time calculation of marginal covariance of variables involved in the candidate actions, and then applies an augmented matrix determinant lemma (AMDL) to efficiently evaluate the information-theoretic cost for each candidate action. Nevertheless, that approach still requires recovery of appropriate marginal covariances, the complexity of which depends on state-dimensionality and system sparsity.
Our work is motivated by the results of Khosoussi et al. (2019) in the context of measurement selection and pose-graph pruning problems in SLAM that characterizes the impact of the graph topology of SLAM (described by weighted tree-connectivity) on estimate reliability. In Chen et al. (2021), the authors extend these results to a 3D SLAM with relative pose measurements and make a comparison between the T-optimality and the D-optimality metrics.
1.2. Contributions
The main purpose of this work is to set theoretical foundations of the topological BSP (t-bsp) concept introduced in our preliminary conference publication (Kitanov and Indelman 2018), extending the main results from (Khosoussi et al. 2019) to BSP. In this paper we further develop the potential of t-bsp as an approximate method to efficiently solve BSP in high-dimensional state spaces in problems that exhibit certain structure and develop performance guarantees, as discussed below. Specifically, we consider the structure of pairwise potentials, which is often the case in active SLAM problems. The main contributions of this paper are as follows.
First, we propose new topological t-bsp signatures: one is based on the number of spanning trees of the topological posterior factor graph representation, and the other on its von Neumann graph entropy (Mowshowitz and Dehmer 2012; Passerini and Severini 2009; Petz 2001). Both signatures have a strong correlation with the information-theoretic cost, but are computationally easier to calculate.
Second, we give a definition of an action consistent objective to formalize the equivalence of two criteria (functions) in terms of action ranking which extends previously defined action consistent state approximations proposed in (Elimelech and Indelman 2017b, 2017a, 2017c), building upon (Indelman, 2015, 2016). Definition 1 defines an approximation error in decision-making between any two functions (possibly on different domains) besides of the usual looking at the values of the same function for two different arguments (states or beliefs). This enables us to quantify an error between different cost functions and establish optimality guarantees. Specifically, we express the error of t-bsp via bounds on the information-theoretic cost (Corollary 1), and derive the bounds analytically for an active 2D pose SLAM, considering the joint differential entropy as the cost function. These bounds are a function of topological terms, measurement noise density and an a-priori state estimate. A special emphasis is given to efficiency in calculation of error bounds to make our method appropriate for online use. Using these bounds, we are able to provide online performance guarantees of t-bsp, with any of the topological signatures. We emphasize these performance guarantees are with respect to a given solver (i.e., estimator) of the original information-theoretic BSP problem (6), and not with respect to the corresponding theoretical BSP problem (1). In this work we consider the solver to utilize maximum likelihood observations (Platt et al. 2010), although the bounds are applicable also to additional solvers.
Third, we propose a new method to calculate the number of spanning trees of a graph efficiently via a convergent power series. This is a result of reformulating Kirchoff’s matrix tree theorem via eigenvalues of the normalized Laplacian matrix and graph node degrees (Theorem 1). The power series reveals the connection between our proposed topological signatures and the information-theoretic cost in the limit.
Forth, we develop an incremental variant of t-bsp based on the von Neumann graph entropy that, under certain conditions, which are often met in operating conditions (e.g., SLAM), achieves near constant-time complexity.
We extensively evaluate our batch and incremental t-bsp algorithms in realistic Gazebo/ROS simulations of active 2D pose SLAM and in optimization of synthetic pose graphs. Additionally, we provide a proof of concept of t-bsp in a small-scale real-world active SLAM experiment.
2. Notations and problem formulation
Decision-making under uncertainty can be formulated as a solution to BSP or stochastic control problem where the optimal non-myopic control action at planning time k is found with respect to the objective function J related to the design task as
The BSP problem with Markovian models is an instance of POMDP. The state of the system X is not directly observable by the controller, but through a set of stochastic measurements from which the future posterior beliefs must be inferred upon optimization. The expectation in (1) is taken with respect to future (unknown) observations . The optimal solution of BSP provides a control strategy for L look-ahead steps, but L can generally vary among control actions or planning sessions. We use and to denote the objective and the optimal control, respectively, in a given planning session (at time k). The objective function in its general form reflects the design task through immediate cost functions cl, which depend on the belief evolution (to be defined) and a control action applied at time k + l, and through a final cost cL. For example, the cost functions can be chosen to minimize trajectory uncertainty, time or energy required to reach a goal, state uncertainty of variables of interest at some specific time instant etc. In information-theoretic BSP, one is interested in state uncertainty minimization which can be expressed through some information-theoretic cost c(·) (see, e.g., Carrillo et al. 2012). This type of cost function is usually computationally the most expensive to optimize in many BSP problems in robotics.
The given notation abstracts the details of the controller design for the purpose of generality of the formulation. The basic problem we aim to solve is evaluating the objective function J, which is belief dependent (POMDP setting), for a finite sequence of controls (L look-ahead steps). This can be used in any POMDP solver no matter whether the control is closed-loop or open-loop. In this paper we implement both open-loop control (action sequences) and a Model Predictive Control (MPC) type planner and assume that a low-level controller can stabilize the system to follow the nominal path between two planning sessions using feedback based on the expected value of the state. This is similar to an LQG approach and is justified for systems that are either linear or locally well approximated by their linearization (Van Den Berg et al. 2011).
Solving the optimization problem (1) explicitly would require belief propagation for a given control and evaluating an information-theoretic objective which we want to avoid and use a topological approach instead. Let us first look at how the belief evolves over time by separating controls and observations to those obtained by planning time k and to future controls and observations after L look-ahead steps.
2.1. Belief propagation
Let represent the posterior probability density function (pdf) at planning time k over states of interest Xk of the robot. In the pose SLAM framework states of interest are the robot’s current and past poses, that is, Xk = {x0, x1, …, xk}. History contains all observations and controls by time k. Consider conventional state transition and observation models
with zero-mean Gaussian process and measurement noise and , and with known information matrices Ωw and Ωvij. The belief can be written as
Let the future sampled states of the robot along one of its candidate paths generated at planning time k be {xk+1, …, xk+L}. The future posterior belief that would be obtained by following the path , can be written in terms of the belief at planning time and the corresponding state transition and observation models as
where and represent controls and (unknown) observations, respectively, to be acquired by following the path .
2.2. Assumptions
The Topological BSP concept is general as an idea of introducing a new cost function, a function of the belief’s topological features, to approximate actions ranking with respect to information-theoretic cost. However, identifying relevant topological representations and signatures in general BSP is an open problem. The approach we develop aims at BSP in problems that exhibit certain structure, such as active SLAM. In this work, our definition of topological space applies to all beliefs which can be factorized to a product of pairwise factors associated with a set of independent measurements. We develop this concept for a special class of Gaussian BSP, targeting efficient decision-making in high-dimensional state spaces, with a joint differential entropy cost function and propose two topological signatures for the case of block isotropic measurement noise (5). For clarity here we only consider the final state cost term , that is, the joint entropy at the end of the planning horizon. The immediate information-theoretic cost functions can be treated in a similar way.
In summary, the assumptions used in this paper to develop t-bsp are:
(A1) the belief is modeled by a multivariate Gaussian distribution and can be factorized to a product of pairwise factors,
(A2) the measurement noise covariance can be written in the form
for some κ blocks and m measurements with variances ,
(A3) taking maximum likelihood observations (Platt et al. 2010), and
(A4) the information-theoretic cost is the joint differential entropy at the end of the planning horizon. Under (A3), the objective function becomes
where I(Xk+L) denotes the estimated Fisher information matrix of the robot’s belief b[Xk+L], and N represents the dimension of the state Xk+L.
Note that t-bsp signatures designed to be action consistent under assumption (A4), will also be consistent with respect to an information-gain reward since all actions have the same prior entropy. Specifically, in this paper we demonstrate our approach in an active 2D pose SLAM problem in which all relative positions and orientations are normally distributed with variances and , respectively. Measurement noise covariance Σ then can be written as a diagonal matrix , with m being the number of relative pose measurements.
Another application of t-bsp are linear systems with relative measurements zi,j = xi − xj + vi,j such as: active pose SLAM in which the robot’s orientation is known (e.g., from a compass), sensor network localization, clock synchronization and motion consensus (Barooah and Hespanha 2007) in the context of measurement selection problem etc. For those types of problems under the assumptions (A1)-(A4), t-bsp is always action consistent according to the Definition 1 since the determinant of the posterior information matrix is given by , where t is the number of spanning trees of a topological graph associated to a posterior factor graph, so negative equals the topological signature based on the number of spanning trees.
An extension to a feature-based SLAM and map entropy is possible under the proposed framework as long as the measurements can be expressed in the form of pairwise relations between states. This is quite common since the robot’s landmark measurements are often given relative to the robot’s pose, for example, as range or bearing measurements. The Fisher information matrix of the joint state of feature-based SLAM under the ML framework with isotropic measurement noise is derived in Chen et al. (2018) in terms of its graph topology and our results apply to it.
3. General approach in topological belief space planning
The main idea of topological BSP is ranking candidate actions based on structural information (topology) of their associated beliefs. Under the t-bsp concept, the original problem (1) is reformulated in a topological space, where it can be solved more easily, and then its solution is related to the solution of the original problem. The importance of the topology and a quality of the solution based on it will clearly be dependent on the optimization objective, the influence of non-topological information (noise distribution, geometric realization) and diversity of candidate actions. As will be shown in this work for the information-theoretic BSP, the belief topology is of great importance and can be used to improve efficiency of decision-making.
3.1. Topological representation of beliefs
The structure of the probability density function (pdf) is determined from conditional independences present in it. In this paper, we use a factor graph probabilistic graphical model (Frey et al. 1997) to represent them explicitly. A factor graph defines two types of nodes: the nodes over state variables X, and the factor nodes such that . Variable nodes and factor nodes are independent sets in a bipartite graph. Edges go between factors and variables that those factors depend on. One of the reasons factor graphs are important is the existence of efficient inference algorithms that operate on them, for example, the sum-product message passing algorithm (Kschischang et al. 2001), which allow recursive computation of marginal distributions conditioned on observed data for both Maximum Likelihood (ML) and Maximum-A-Posteriori (MAP) estimation frameworks.
As will be shown later on, factor graphs are also a suitable graphical representation in information-theoretic BSP because of their connection to the objective function under the ML estimation framework for Gaussian systems.
Under assumption (A1), F contains only pairwise factors. The topological representation of the posterior belief is defined as a topological graph G associated to a posterior factor graph FG of a candidate action (e.g., see Figure 2), which is a graph isomorphic to the posterior Markov Random Field (MRF), that is, G ≃ MRF(b[Xk+L]). Different candidate paths typically yield different factor graphs. A graph G herein is understood to be simple (without self-loops or parallel edges), connected, and undirected. Each graph G corresponds to a topological space called the geometric realization (Archdeacon 1996). In the standard sense of topology, such a graph is a simplicial 1-complex (Gross and Tucker 1987). Therefore, a node in the graph, a point in its geometric realization and the corresponding point when embedding the graph in a topological space are often used interchangeably.
Each candidate action corresponds to posterior belief b[Xk+L], that is, trajectory uncertainty (a) which can be represented with a factor graph (b) and assigned a topology. In the case of active pose SLAM, topology can be defined with a simple undirected graph such that graph nodes V represent robot’s poses and edges E pose constraints between them (c). Topological BSP aims to determine a graph invariant topological signature which is highly correlated with the information-theoretic cost and maintains action consistent decision-making.
A topological signature is some graph invariant, that is, a function that assigns to G a number calculated from its structure according to a well-defined procedure, whose value is independent of how the graph is drawn or how its nodes are numbered.
3.2. Decision-making via t-bsp
Generally, we aim to find a topological representation G of the posterior belief corresponding to a control action and a function , which we call topological signature, such that decision-making using the function s does not (or minimally) change the optimal action according to a BSP objective . In other words, suppose we can express the problem of minimizing the objective and its solution on the given set/interval of actions as maximizing the topological signature s on the same actions set where each action induces some topology, that is,
We say that decision-making is action consistent if and only if . Moreover, if this is true for any subset of actions, a topological signature s preserves action order too. This is a much stronger condition than action consistency, and as we will see, it will not always be required when we are looking only for the best action, which greatly simplifies design of a topological signature so that we can focus more on computational complexity.
Figure 3 shows all possible situations regarding the relation between the topological signature and objective function in t-bsp. In Figure 3(a), a signature is action consistent on the whole domain of control actions, and therefore the solution of t-bsp is the same as BSP. However, although ordering of actions is not kept in the whole domain in the case shown in Figure 3(b), the solution is still optimal. Figure 3(c) shows a case where an action ordering given by a signature influences the optimal action.
Different topological signatures and their influence on decision-making. The graphs show topological signature and negative objective as functions of control action.
Clearly, topological signatures should be designed to exhibit strong correlation with the objective, but in the general case, the obtained best action may be somewhat different than the optimal action from (1), leading to some error in the quality of solution.
We adopt the definition of action consistent state approximations proposed in Elimelech and Indelman (2017b) and modify it to support t-bsp and action consistent objective approximations in the following way.
Denote by the optimal solution of t-bsp and by the optimal solution of BSP problem (1). Let be a family of monotone decreasing real functions and such that and , where the functional . Then, the error of t-bsp is
and γf the action ordering error of the signature s.
In particular, ϵ(J, s) = 0 corresponds to t-bsp being action consistent, that is, , and when also f exists for which γf = 0, simplified representation preserves action order too and we call it action ordering (ranking) consistent. Essentially, f is a mapping from the signature image to the objective function image and is required for defining action consistency of objective functions.
Yet, as in Elimelech and Indelman (2017b), calculating ϵ(J, s) is essentially equivalent to solving the original problem. Therefore, a key aspect will be to provide online performance guarantees by developing bounds on ϵ(J, s) that can be evaluated online. One can then resort to topological BSP to drastically reduce computational cost while carefully monitoring a conservative estimate on the sacrifice in performance, that would be provided by the bound on ϵ(J, s). Another perspective of using this bound is to guarantee global optimality of the anytime t-bsp algorithm we proposed in Kitanov and Indelman (2018). In that approach, actions were ranked by a topological signature and the objective function was evaluated sequentially from best to worst. Having bounds on t-bsp error would provide a stopping condition for action consistent t-bsp, similar to the action elimination scheme proposed in Elimelech and Indelman (2017b) and its application to belief sparsification. The optimal solution is then obtained by evaluating only a subset of candidate actions (for sufficiently tight bounds, empty set), while discarding the rest. Doing so will generally reduce the number of variables for which the marginal covariance needs to be recovered.
4. Information-theoretic topological BSP
We propose two topological signatures for solving information-theoretic BSP with uncertainty of all states considered. Both signatures are based on the spectrum of a graph (normalized) Laplacian associated to the posterior factor graph: a normalized number of spanning trees of a graph, sST(G), motivated by the SLAM topological metric (Khosoussi et al. 2015) and adjusted for BSP, and a von Neumann graph entropy, sVN(G).
A Laplacian matrix of a graph G = (V, E) is by definition L(G) = D(G) − A(G), where A(G) is its |V|×|V| adjacency matrix with elements
and D(G) its node degree matrix defined as a diagonal matrix with graph node degrees on its main diagonal, that is, D(i, i) = d(i) = ∑j∈V A(i, j). Therefore,
A normalized Laplacian matrix is defined as which can be also written in the form
Both L and are symmetric and positive semi-definite so all their eigenvalues are real and non-negative. Hence, they can be ordered. For a given connected graph G with n = |V| nodes, let λ1 ≤ λ2 ≤ ⋯ ≤ λn be the eigenvalues of L, and , the eigenvalues of . The smallest eigenvalues . Also, and, since G has no isolated nodes, (for proof see Chung 1997).
4.1. Normalized number of spanning trees
The number of spanning trees of a graph t(G) can be determined using Kirchhoff’s Matrix tree theorem (Biggs 1993, Theorem 6.3) which involves calculating any cofactor of a graph Laplacian, or equivalently the product of its non-zero eigenvalues, that is, . Here, denotes L with the r-th row and column removed for some r. Typically, this number gets quite large for factor graphs in BSP so we use its logarithm in a topological signature instead to avoid arithmetic overflow in computer implementations, τ(G) = ln t(G). As has been shown in Khosoussi et al. (2015), there exists a strong positive correlation between D-optimality criterion and τ(G) in general SLAM ML optimization while in some problems with linear measurement models, D-optimality criterion is solely characterized by τ(G). In Kitanov and Indelman (2018), we have proposed a normalization of τ(G) based on its relation to the optimization objective in BSP which gives this BSP topological signature
normalized in such a way to account for different path lengths n and dimension of the robot’s pose κ (e.g., κ = 3 in 2D active pose SLAM) that maintains a high correlation with the information-theoretic cost (6).
Determining sST is an operation with cubic complexity in the number of nodes for general dense graphs. Because SLAM graphs typically have a small number of edges due to a limited sensor range and field of view, their graph Laplacian is sparse and state-of-the-art algorithms use this fact to efficiently calculate τ(G) by performing a sparse matrix factorization. For example, let R be the lower triangular Cholesky factor of a reduced Laplacian of graph G, that is, , then
This approach, to be efficient, requires finding a good fill-reducing permutation of such that the Cholseky factor of is sparser than the Cholseky factor of . The problem of finding the best ordering is an NP-complete problem (Yannakakis 1981) and is thus intractable, so heuristic methods are used instead. Overall, the computational time for sST depends on the state dimension and system sparsity.
4.2. Von Neumann graph entropy
The second topological signature we propose for information-theoretic BSP is the von Neumann entropy of G, sVN(G), and its simplification by node degrees d of G.
The von Neumann entropy sVN(G) of graph G is the Shannon entropy associated with its normalized Laplacian’s eigenvalues and was introduced in Passerini and Severini (2009). In this work we use the following definition
Using Han’s quadratic approximation (Han et al. 2012) and given that , it can be simplified to
It can be easily seen that from which the final expression for our approximated von Neumann graph entropy follows which we use as a topological signature in BSP
Notice that its computation depends only on graph node degrees and, in the general case, has quadratic complexity in the number of nodes, O(n2) but in the case of BSP, where is sparse, it depends only on the small number of non-zero elements of A(G), that is, the number of edges |E| in the graph G. Also, the expression (13) can be computed incrementally, as new edges (measurements) are added to the factor graph as the robot explores the environment or re-plans its path. This effectively improves computational complexity of the t-bsp algorithm to nearly O(1) since the number of measurements between two planning sessions is limited.
4.2.1. Incremental mode
Let Gk = (Vk, Ek) be a topological graph of a prior belief b[Xk], a prior graph at planning time k for short, and Gk+L = (Vk+L, Ek+L) the posterior topological graph corresponding to b[Xk+L] after obtaining future predicted observations associated to a control action , a posterior graph for short. Their corresponding graph signatures, and , are
By subtracting equation (14) from (15) we obtain the recursive update rule for the posterior graph signature
Now, the key observation is that in order to calculate Δq we do not need to iterate over all edges of the corresponding graphs, but only over those whose node degrees changed or are associated to the new states added to the prior graph, that is,
In equation (18), ΔE = Ek+L − Ek represents newly added edges to the prior graph Gk due to the control action yielding posterior graph Gk+L, VI nodes incident to ΔE without new nodes and Eδ edges from Ek+L incident with VI that belong also to Ek. An example illustrating the incremental t-bsp concept is given in Figure 4.
Incremental t-bsp: Consider the planning session starting at time k=4 when the robot’s current pose is x4, and a control action that generates a candidate path from which states (marked with red circles) are sampled and added to the prior graph together with new edges marked with blue dashed lines as a result of predicted relative pose measurements. In that case, involved states are (marked with blue circles), and affected edges in the prior graph .
The complexity of updating graph signature per candidate action is O(|ΔE| + |MB(VI)|), where MB(VI) denotes the Markov blanket in Gk of the variables involved and is usually bounded owing to limits on sensor range. Since the number of actions per planning session is also bounded, for sparse graphs this means that incremental t-bsp algorithm has effective complexity O(1).
4.3. Maximum likelihood estimation
In this work, we investigate error bounds of t-bsp on the active 2D pose SLAM problem. We first examine results on passive SLAM reliability regarding its graph structure described by a topological metric τ(G)≐ln(t(G)) proposed in Khosoussi et al. (2015) and then, in the next section, we show how these results can be extended to a BSP problem to provide online performance guarantees of t-bsp. In that section, it will also be clear how we chose the topological signature sST.
In the context of Maximum Likelihood Estimation (MLE) in pose SLAM, the optimal set of poses Xk for which the belief (3) is maximized can be obtained by fixing one of the poses, for example, x0, and treating the rest as unknown (or with an uninformative prior) while minimizing the sum of weighted squared errors between predicted and measured relative poses,1 that is,
In this formulation, a measurement Δk represents a vector of m stacked relative pose measurements from motion and observation model (2) at time k with m = |E|, the number of edges in the topological graph of the belief (3). Relative pose measurements in pose SLAM resulting from state transitions can be obtained by the motion composition zi+1,i(xi, xi+1) = ⊖xi ⊕ xi+1 = ⊖xi ⊕ f(xi, ui, wi). In this work, we assume independent relative pose measurements with additive noises
For simplicity, we also assume a 2D pose SLAM setting in which all relative positions and orientations between poses xi and xj have equal variance, and , respectively, that is, . Measurement noise covariance Σ in that case can be written as a diagonal matrix by reordering elements of Δk.
The information matrix I(Xk) of the MLE is I(Xk) = HTΣ−1H (Sorenson 1980), where H = ∂h/∂Xk. is a measurement Jacobian. I(Xk) evaluated at the true value of Xk is known as Fisher Information Matrix (FIM) and its inverse the Cramér-Rao lower bound (CRLB). Commonly, FIM is approximated with . In Khosoussi et al. (2015) bounds of the determinant of I(Xk) are expressed in terms of pose SLAM graph topology, geometry and noise as
where is a reduced Laplacian of a graph G obtained by removing an arbitrary r-th row and r-th column of the graph Laplacian L, Io(Xk) estimated information matrix based on the odometry measurements Δk = {zi+1,i, i < k}, and where ξ = σθ / σp, and . Notice that generally Ψ depends on the noise variances, geometry, and topology of the SLAM graph. In Khosoussi et al. (2015) these bounds are not used except to prove the limiting case, Ψ → 0. Therefore, for small values of Ψ a good approximation of ln detI(Xk) is its bound that depends on τ(G), that is, lower and upper bounds become tight. Even when it is not negligible, if there is only one path realization to consider as in passive SLAM, information gain of adding relative pose measurements with a constant noise distribution to a SLAM odometry graph is solely characterized by graph G topology, that is, Ψ = Ψ(G). Similar logic applies to graph pruning and the measurement selection problem (Khosoussi et al. 2019) where all graphs are only subgraphs of the original graph with the same embedding in metric space. To demonstrate why we cannot use the same metric τ(G) in BSP problems nor guarantee optimality using the bounds on I(Xk), consider the example given in Figure 5.
BSP vs. the measurement selection problem: In measurement selection problems, the robot considers a single path and its factor graph (e.g., marked with blue color) at planning time k and which subset of measurements (marked with blue dashed lines) to take. Pose samples Xk+L are fixed and minimizing entropy (6) by a measurements subset (action) is the same as maximizing ln |I(Xk+L)| − ln |Io(Xk+L)|. Its bounds as can be seen from (21) depend only on the topology and ξ, that is, all actions share the same pose variables Xk+L which can be considered constant in optimization. However, in BSP where a robot needs to compare different paths corresponding to different factor graphs (e.g., red and green), this is not true anymore. To determine the best action according to (6), one has to account additionally for different path geometries and path lengths and larger variety of topologies.
So, in BSP we need to consider different path realizations Xk+L and therefore , in contrast to graph pruning and measurement selection. Notice that we do not need to propagate belief in planning for determining Ψ under the ML observations assumption, that is, where future sampled poses from the path corresponding to action are added to the prior state estimate which is the same for all actions.
4.4. Entropy bounds for active SLAM
In BSP, we have to consider multiple path realizations from different controls, with greater variety in both topology of factor graphs and other non-topological factors that influence estimation accuracy, for example, non-fixed geometry and different path lengths. In Kitanov and Indelman (2018), we developed a topological signature sST (9) for t-bsp. Here, we show its derivation and provide global optimality guarantees of t-bsp for any signature via estimated entropy bounds that are functions of sST for active SLAM and related problems as described in the Section 2.2.
For a control action corresponding to robot’s poses Xk+L = {x1, …, xk+L} each of dimension κ, the entropy of the future posterior belief b[Xk+L] can be written using equation (6) as
Notice that the number of graph nodes |V| = n = k + L + 1. Using Lemma 2 and Lemma 3 from Khosoussi et al. (2015), for posterior belief corresponding to an odometry factor graph whose topological graph Go is a tree, it follows
Inequalities (21) and equations (22) and (23) give the entropy bounds
Similarly, for the lower bound we get
In equation (26) we used the Matrix tree theorem that states (recall τ(G) = ln t(G)), and in equation (27) Shur’s determinant lemma. From equation (27) we see that when Ψ → 0, the entropy goes to the BSP topological signature and solely depends on the belief’s topology and path length. Otherwise, we have to account for the graph embedding in the metric space as well, as it appears in the scalar function Ψ of the second term in the above equations. Now, given all candidate control actions, the following corollary provides the bounds on the t-bsp error as defined by Definition 1.
The upper bound of the entropy is already determined by the topological signature sST as can be seen from equation (24). Moreover, is the minimum upper bound of all actions since we select action in t-bsp according to equation (7). However, calculating the lower bound requires an additional cost due to the second term in equations (25)-(27). Calculating (25)-(26) requires evaluating the determinant of a sparse matrix , whereas calculating (27) is even more complex because it requires inverting and finding a determinant of a dense matrix.
Another idea is to find some fast method for limiting the determinant of M from above to replace the second term in equation (25) that will not introduce a big difference in tightness of the lower bound.
4.4.2. Exact lower bound
A direct approach performs from scratch some sparse matrix factorization of M, for example, Cholesky factorization M = RRT where R is the lower triangular matrix, from which then it is easy to calculate its determinant. However, this approach, to be efficient, still requires finding a good fill-reducing permutation PM of M, but we notice that some calculations from the topological signature can be re-used. In particular, since M differs from only in diagonal elements, they both have the same sparsity pattern. Therefore, if is the best fill-reducing permutation of (already found for determining ), it can be re-used for calculation of the lower bound, that is, .
4.4.3. Hadamard bound
Since is a reduced graph Laplacian, it is symmetric positive definite (SPD), and because also Ψ > 0, the matrix M is SPD. For large values of Ψ, the matrix M becomes diagonally dominant, in fact strongly so, and Hadamard inequality gives a good approximation of its determinant, that is,
Calculation of the right side of inequality (28) requires only multiplication of node degrees with added value of Ψ. Applying (28) to equation (25), we can get a somewhat more conservative but faster to compute lower bound
Algorithm 1.t-bsp with performance guarantees
4.5. t-bsp algorithm
Algorithm 1 summarizes our proposed method and highlights its possible uses in BSP regarding desired performance specifications. Performance guarantees can be either in the form of selected solution’s entropy upper bound, that is, guarantees on the accuracy, or in bounding the error of t-bsp with respect to the optimal solution.
The first form can be used when the maximum admissible path uncertainty is known at planning time, for example, for obstacle avoidance. In that case, one can get an answer if a t-bsp solution satisfies the specification by ranking actions using the very efficient topological signature and calculating only its entropy’s upper bound (Alg. 1, line 7). If global optimality guarantees are required, the lower entropy bound needs to be calculated for all actions. Although in equation (29) it is given as a function of sST, in the next section we will see how it can be expressed via . In the first usage, if uncertainty specification is not met by t-bsp one can still use its ranked actions set A in anytime algorithm as we proposed in Kitanov and Indelman (2018). Also, if an optimal solution needs to be found, we can use t-bsp to eliminate suboptimal actions.
5. Von Neumann entropy based t-bsp with performance guarantees
In this section we develop a mechanism to provide global optimality guarantees as functions of the von Neumann entropy topological signature (13). The key idea is to make a connection of to the normalized number of spanning trees topological signature sST based on which we have already established t-bsp error bounds (see Figure 6). At the same time, we want to preserve computational efficiency. Theorem 1 is central for this, and has also much wider applicability than in the context of BSP as a reformulation of the Matrix tree theorem.
Schematic overview: Topological belief space planning based on approx. von Neumann entropy with performance guarantees.
5.1. Taylor series for τ(G)
First, we provide a method for calculating the number of spanning trees of a graph via a convergent Taylor series expansion as a function of only the graph’s node degrees, which avoids matrix factorization, the current state-of-the-art.
Recall that and representing it with a Taylor power series is possible only for all inside the interval of convergence of the logarithm function, around point λi0 = 1. However, the spectrum of the graph Laplacian can be outside this interval. We know that the largest Laplacian eigenvalue is bounded by the number of non-isolated vertices, that is, λn⩽n (Mohar 1991, Theorem 2.2). On the other hand, the normalized graph Laplacian’s spectrum without zero lies inside for every graph G (Wilson and Zhu 2008) and therefore the von Neumann graph entropy is well-defined via its Taylor series. This motivates us to look for an alternative representation of τ(G) that involves eigenvalues of . The following theorem gives the required relation.
Let G = (V, E) be a connected, undirected graph with n nodes, and let t(G) be the number of spanning trees of G. Then,
whereare non-zero eigenvalues of the normalized Laplacian matrixof G, and d1, d2…, dnnode degrees of G.
We will often use the logarithm of t(G) in the following study
Let us denote by the Taylor series expansion of the function f(x) = ln x centered around the point x0. Then, on the interval of convergence , where the r-th partial sum of the Taylor series is given by
Considering function f defined over the non-zero normalized graph Laplacian’s eigenvalues and the linearization point x0 = 1, we obtain the following r-th order Taylor series approximation of
Calculating the topological signature sST directly using the Matrix tree theorem, using Theorem 1 or via a Taylor series approximation using equation (32) would require determining the graph spectrum of the associated graph matrix, which is not desirable, from a computational perspective, in high-dimensional state spaces. Similarly, applying equation (10) requires matrix factorization, whose time complexity is only by a constant scaling factor smaller than the time complexity of calculating the information-theoretic cost.
In this paper, we show how this can be avoided by developing the power series of τ(G) and expressing it with sums of traces of powers of normalized graph Laplacian which can be more easily calculated because they involve multiplication of sparse matrices. Importantly, for low order Taylor series approximations, they can be given in closed-form as simple functions of graph node degrees.
Denote the k-th term in the sum given by equation (32) by
After expanding a power using Binomial theorem we can write
Observe that in equation (34), akl is defined by the horizontal underbrace in equation (33).
By convention and given that
For the Taylor polynomial , such that a remainder term is
where for and for .
Finally, using Theorem 1 and equation (35) we obtain
is the approximation of τ(G) by the r-th order Taylor polynomial and can be used to calculate a topological signature of t-bsp with reduced complexity compared to calculating it by matrix factorization as in (10), by an operation which involves sparse matrix multiplications instead.
5.2. Performance guarantees based on
The next proposition reveals a property of the remainder term which can be used to provide performance guarantees of t-bsp efficiently, from the approximated number of spanning trees of the topological graph by its Taylor polynomial, without the need to know the remainder of all, but only the selected control action.
If the logarithm of the number of spanning trees of the graph G is approximated by the r-th order Taylor polynomial, its approximation errorhas the following properties:
Figure 7 illustrates the properties of the remainder stated by the Proposition 1.
Taylor series approximation error of the logarithm of the number of spanning trees for three different graphs: isosahedral, antenna, and Foster026. For odd polynomial orders, this error is either zero or negative.
The question that naturally arises is whether we can use the approximated logarithm of the number of spanning trees to provide performance guarantees if we choose such an order r that makes . From equation (29) it is easy to see that the lower bound of the entropy calculated from will remain the lower bound, but the upper bound might be compromised depending on the approximation error ɛ(G, r). So, ΔJmax needs to account for that change as well. Let’s denote by the t-bsp error bound calculated by using instead of the true τ(G). Then,
where .
5.3. Relation between topological signatures
By establishing a relation between sST and it would be possible to use benefits from both worlds, speed of sVN with optimality guarantees for sST.
A relation between two proposed information-theoretic BSP topological signatures sST and can be derived in the limit r → ∞ using Theorem 1 and Taylor series expansion of the logarithms of the normalized graph Laplacian’s eigenvalues, which appear in both of them (see Appendix E):
is a harmonic number of r. In practice, we can often get very good approximation for finite small orders r.
For r = 3, using equation (61) we find a simple closed-form expression given by equation (41), that does not involve traces of matrix powers, and in combination with Proposition 1 gives a way to formulate t-bsp in terms of the approximated von Neumann entropy and lower bound the information-theoretic cost for all actions .
For the logarithm of the number of spanning trees of a graph G corresponding to a control action approximated by the third order polynomial we can write
In equation (41) which means that the right-hand side is purely a function of nodes’ degrees and a number of graph nodes, both of which are easy to maintain for graph operations of adding or deleting an edge encountered in BSP or graph pruning problems.
Substituting (41) into (9), we obtain the third order relation between the topological signatures
Here, we used the shorter notation . Since r is odd, after determining the lower bounds of the entropy from for all actions, in order to provide performance guarantees for t-bsp, we have to find an upper entropy bound for only the selected action , for example, by finding or bounding the Taylor remainder term according to equation (38). As we can see, the relation (42) enables finding without doing expensive matrix factorizations which means that performance guarantees for t-bsp can be established in a very efficient way.
Equation (42) reveals also how to improve normalization of for different candidate path lengths n which corresponds to its scaled third term, that is,
In that way, an action ranking of follows the action ranking of given that the node degree distribution corresponding to the second term in equation (42) does not change much and is already captured by .
5.3.1. Keeping action consistency
In general, given two different candidate actions and , when can we say that ordering them by one topological signature is the same as ordering by another, that is, ? For any r and
where αr, βr, and γr are some functions that depend on the polynomial order r used for approximating τ(G) that follow from equation (40). For a given r, αr is a constant, βr is a function of graph’s node degrees, and γr an affine function of the number of its nodes.
The difference of topological signatures of order r between graphs G1 and G2 with n1 = |V1| and n2 = |V2| can be written
Then, for any two graphs G1 and G2 from the control actions set, if and only if
This is another way of formulating the relation between topological signatures in terms of action consistency between them. If the normalization is applied to using (43), then Δγr(n1, n2) = 0.
6. Results
We evaluated our approach in Gazebo simulations of a single-robot active pose SLAM and in optimization of synthetic pose graphs. t-bsp was compared to the standard BSP with the ML inference problem solved using GTSAM library (Dellaert 2012). We studied empirically different topological signatures and their correspondences to information-theoretic cost (6) together with a time required for decision-making.
In active pose SLAM simulations, the robot performs LIDAR-based 2D SLAM assuming perfect data association. A probabilistic roadmap (PRM) (Kavraki et al. 1996) is used to discretize the environment and generate the roadmap while k-diverse shortest path algorithm (Voss et al. 2015) is used to generate topologically diverse candidate paths over it. The action generation process uses a ground truth map to validate actions and avoid possible re-planning stages, since they do not contribute to the goal of BSP analysis, which are a result of incremental map building. Local path corrections due to localization errors or environment changes are handled by the low-level controller and obstacle avoidance algorithm. Of course, in the deployment of active SLAM, an inferred map will be used in generation of candidate paths using the same principles.
Topological properties of candidate actions are determined by the configuration of obstacles in the environment and by the parameters of the PRM algorithm (e.g., number of samples) and k-diverse shortest path algorithm (e.g., k, path length, paths’ distance) which control the number and diversity of candidate actions with respect to their geometrical properties and homology class. Topological graphs corresponding to these candidate actions depend on the robot’s trajectory and sensor’s characteristics and pre-processing (SLAM front-end). In particular, we assume that a link between certain trajectory samples will exist whenever the Euclidean distance between them is below a threshold δ and relative orientation below ϑ, which model the range and the field of view with high reliability of ICP method used for matching LIDAR scans. In all simulations when predicting future measurements, we assume that the robot can follow the candidate path perfectly and that trajectory pose samples with LIDAR scans are taken after each robot’s relative displacement of 1 m and at the desired predefined goal poses.
6.1. Small-scale realistic simulation
First we validated our theoretical results in a small-scale scenario with the main goal to demonstrate properties of t-bsp in elementary planning modes (exploration and exploitation) under different probabilistic models of relative pose measurements.
The robot was performing two planning sessions (S1 and S2) with both exploration (in S1) and exploitation trajectories (in S2) considered to show influence of different candidate path topology and geometry in t-bsp. Figure 8 shows the considered scenario in Gazebo and the generated candidate paths for the robot in two planning sessions S1 (Figure 8a) and S2 (Figure 8b). To demonstrate the effect of noise level on t-bsp, we simulated three cases. All relative pose measurements had standard deviation of orientation error σθ either 0.01, 0.035, or 0.085 rad while we kept the same standard deviation of the position error σp = 0.1 m. This corresponds to three different values of ξ ∈ {0.1, 0.35, 0.85} for each planning session. The parameter δ was set to 2 m and ϑ to 2π radians, that is, without the constraint on relative orientation in topology prediction model.
Gazebo scenario with robot’s start pose marked with and goal pose with in each planning session.
During planning session S1, the robot is performing mainly exploration as the environment was completely unknown in the beginning and its first goal is set far in the unknown area so its future posterior beliefs have topologies that resemble tree graphs, with loop closing edges between poses nearby in time. In planning session S2, the robot was instructed to return to the previously mapped area causing larger diversity among candidate actions. Topologies of the most and least complex graph of each planning session are shown in Figure 9 together with their corresponding posterior belief. In both planning sessions t-bsp based on all proposed signatures was action consistent, that is, correctly identified the best action which can be seen from Figure 11g-11l. A maximum of a topological signature is indeed a minimum of the joint entropy. Notice that it does not imply necessarily that the pose uncertainty will be the lowest at the goal or that the shortest trajectory will be selected as is evident form S1’s solution (Figure 9(a)-9(b)) since currently the entropy/uncertainty of the entire system is considered. This opens one other relevant research direction where uncertainty of a subset of state variables might be considered and its relation to topological information. Among exploitation trajectories, t-bsp prefers the one with larger loop closings leading to highest information gain (see Figure 9(c)-9(d)).
Trajectory uncertainty after optimization and topologies of the worst/best actions calculated by topological BSP.
Our results in a realistic simulation show the promise of the proposed approach for efficiently solving BSP. All topological signatures of the posterior factor graphs are strongly correlated with the information-theoretic cost (Figure 11(g)-11(l)); yet, t-bsp based on an exact von Neumann entropy (sVN) and the number of spanning trees (sST) of a graph outperforms standard BSP by an order of magnitude in terms of time complexity as shown in Figure 10. This is due to operation in a topological (less dimensional) space instead of a metric state space. The dimension of a graph Laplacian matrix in this problem is κ = 3 times smaller than the dimension of an information matrix because every 2D pose in the state vector corresponds to one node in the graph. The relative speed of sVN and sST is similar as both of them require determining a graph spectrum of the associated Laplacian or its cofactor. On the other hand, the advantage of using over sVN and sST is that it is much faster, O(|E|), with possible incremental calculation that depends only on the node degrees which we investigate in the large-scale simulation in Subsection 6.4. It effectively enables real-time performance of t-bsp given that typically at each measurement update a limited number of factors is added to the graph. Also visible from Figure 10 is that t-bsp error bounds calculation adds a small, especially in the case with Hadamard bound, additional cost.
Computation time per candidate action in each planning session (circle represents the mean, “x” the median and line interval one σ confidence region) for standard BSP, t-bsp by signatures based on the number of spanning trees (ST), the von Neumann entropy (VN exact and approx.) and for bounds determination, exact (equation (25)) and Hadamard (equation (29)). Notice that computation time of the standard BSP approach has been scaled by 10.
While t-bsp was action consistent in all sessions for all proposed topological signatures (ϵ(J, s) = 0), action ordering consistency (γf = 0) was kept only for sST in all cases, while only in the first planning session for sVN and . can happen when topologies among actions are very similar and then other factors, for example, noise or geometry, determine the solution. However, γf was still small. As we already stated, the determined bounds of the entropy can always be used to provide a globally optimal solution but at the cost of evaluating the objective function of actions inside them, that is, the ones whose lower bound is below an upper bound of the selected action. From Figures 11(a)–11(f) we can see that a negative topological signature sST is a very good approximation of the estimated posterior entropy even with large orientation noise, that is, large ξ values. On the other hand, its determined lower bound is very sensitive to it. For larger ξ values, bounds are less tight and therefore our performance guarantees are more conservative. In practice, however, more important is that the upper bound is close to the real entropy since we make decisions based on it. We can also notice that for higher ξ, Hadamard approximation of the lower bound is getting better, meaning that we practically have very small computational cost in determining the bounds accurately.
Estimated entropy of posterior belief and its bounds determined by t-bsp (top) and correlation of t-bsp and standard BSP (bottom) for three different values of ξ in planning sessions S1 and S2. Notice different y-axis on the correlation plots since only action ordering consistency is important in BSP.
6.2. Real-world experiment
The experiment was performed on a real robot using a fixed measurement noise model in inference and planning, considering a similar scenario as in the simulation from Section 6.1. The results are presented in Extension 1 as a proof of concept of our method.
6.3. Synthetic simulation
Apart from the SLAM setting, next we consider general pose graphs randomly created and optimized by the Maximum Likelihood Estimator (MLE) and their relation to the topological signatures developed in this paper. We show that good action consistency is still kept even when comparing significantly varying pose graphs. We generated pose graphs with varying number of nodes n and edges m of which correspond to odometry and the rest to LC = m − n + 1 loop closing factors. We choose , causing average node degree of a graph increasing in steps of from a given odometry graph of size n (whose average node degree is approx. 2). For each pair (n, m) we create a group of 10 random topologies, calculate their topological signatures sST and , and measure the average time for their calculation, tST and tVN, respectively. The results of this simulation are given in Figure 12. We can see that correlation with the posterior entropy of FIM of MLE is still high for sST on the whole range of topologies while follows the overall trend and is high for graphs with the same n, but breaks the action consistency with jumps of n (Figure 12(b), green curve). This suggests that a normalization of should be applied according to its relation to the objective according to the Definition 1. As explained in Section 5.3, after applying correction term given by the equation (43), we can see that actions consistency improves significantly (Figure 12(b), black curve).
Performance on simulated random graphs.
Although less accurate, the plot in Figure 12(a) demonstrates why might still be interesting to use for fast decision-making in some situations. Clearly, the gain in speed is significant in high-dimensional BSP problems.
6.4. Large-scale realistic simulation
A large-scale scenario tests t-bsp performance in a complex environment depicted in Figure 13 with larger number of planning sessions. Its main goal is to investigate the proposed approach in the settings closer to active SLAM application domains. It serves also as a test bed for comparing the batch and incremental BSP algorithms.
Gazebo simulation environment with Pioneer-3AT ground robot marked with white bounding box. Grid cell size is 1 m x 1 m.
For the second purpose, we design two experiments which mainly differ in the planning horizon length L considered:
(a) a goal-to-goal planning, and
(b) Model Predictive Control (MPC) style planning.
The experiment a) was primarily designed to test the execution time of the batch BSP algorithms whereas the experiment b) for their incremental counterparts. Since action consistency does not depend on the particular operation mode, it is enough to show it only once. In the first experiment, the robot is instructed to visit ten predefined goals in a given order and to plan with a horizon sufficient to connect consecutive goals. The second considers the same set of goals with the difference that re-planning is done also on several intermediate waypoints on the selected route between goals choosing the next waypoint as a sub-goal. We call this MPC mode, because the robot considers changing its immediate control action on a way. In this simulation, however, for ensuring similar conditions in both experiments, the robot does not take any detours from the original route. This mode would make sense in a dynamic environment or if the inferred map is used by the action generator method due to partial map availability and possibility that a better global path exists.
Consequentially, in goal-to-goal planning, the robot executes 10 planning sessions and in each, many measurements are added during the motion and prediction steps. Contrary to that, MPC mode has more planning sessions with fewer new measurements acquired between them. In both, the topology prediction parameter δ was set to 2 m and ϑ to 1 rad. Standard deviations of relative orientation and position measurement noise were kept constant at σp = 0.1 m and σθ = 0.01 rad.
6.4.1. Batch BSP algorithms
The standard batch BSP inference was solved by Levenberg-Marquardt algorithm from GTSAM library with the information-theoretic cost (6) calculated exactly using Cholesky factorization of the information matrix of the posterior belief required for determining the determinant. The same Cholesky method was used to factorize the reduced Laplacian matrix of the topological posterior graph needed for t-bsp based on the number of spanning trees topological signature sST. Also, in every planning session, the approximated von Neumann graph entropy was calculated from scratch using the expression (13) and then normalized using (43).
As expected, the run-time of the t-bsp based on sST is by a constant multiplicative factor lower than the standard BSP (Figure 14) since the sparsity pattern of both information matrix and reduced Laplacian matrix is the same, whereas the latter’s dimension is κ times reduced, κ being the dimension of the robot’s pose. On the other hand, the proposed t-bsp based on shows almost linear-time complexity on average planning time per candidate action and achieves by three orders of magnitude higher speed by the last planning session.
Run-time comparison of batch BSP algorithms. Standard BSP is based on Levenberg-Marquardt optimization and explicit calculation of objective function (6) by doing Cholesky factorization of information matrix. t-bsp based on sST uses the same method of Cholesky factorization to calculate the number of spanning trees of a topological graph.
Importantly, an optimal action was selected in all planning sessions by the proposed t-bsp based on as can be seen from Figure 15. Its only limitation is that optimality cannot always be guaranteed using topological entropy bounds. Information gain of candidate actions, in those planning sessions where it is the case, is indeed very similar and then non-topological factors determine the solution.
Action consistency of t-bsp based on approx. von Neumann entropy and optimality guarantees in relation to information gain of candidate actions (each dot of the same color represents one candidate action in a single planning session). Optimal action is selected by t-bsp in all planning sessions, but its optimality can be guaranteed in sessions 1, 2, 5, 7, and 10 where one action is clearly distinctive in information gain.
Let us have a closer look into two extreme cases: one in which the optimal action can be guaranteed and the other with a small percentage of suboptimal actions detected by t-bsp. The representative of the first case is the planning session 5 (S5) with relevant plots shown in Figure 16, and of the second, planning session 8 (S8) analyzed in Figure 17. The whole simulation and analysis of the rest of the planning sessions can be seen in Extension 2.
Planning session 5.
Planning session 8.
S5’s candidate paths together with the inferred map and the past robot’s trajectory are shown in Figures 16(a)–16(b). It can be seen that there exists a single candidate path (marked with light green color) which leads to the previously mapped area and from the information-theoretic perspective is optimal as it significantly reduces uncertainty of the robot’s trajectory as shown in Figure 16(f). For illustration, we also show the uncertainty of the worst action in that session in Figure 16(e) where ellipses represent the marginal pose’s uncertainty in both plots. In this case, topology solely is enough to disqualify other actions (Figure 16(d)).
S8’s candidate paths together with the inferred map and the past robot’s trajectory are shown in Figures 17(a)–17(b). First we notice that in that session there was a smaller number of candidate paths (only 4) which belong to two homology classes, either going left or right around the wall obstacle. Three of them are quite similar in the geometrical shape and all of them passing through already mapped areas. Therefore, it makes sense that the expected accuracy (path entropy) obtained by following either path will roughly be the same as can be seen in Figures 17(e)–17(f) for the worst and best actions from that set. Even in that unfavorable situation for t-bsp, it still managed to find an optimal action (Figure 17(c)) where maximum of corresponds to the minimum of the joint entropy, but not to guarantee its optimality. Two other actions need to be inspected further for that, which is visible from Figure 17(d). We can use an upper topological bound which gives an uncertainty margin for deciding that. We argue that in most practical applications this difference in information gain will be insignificant and t-bsp is good for coarse-level planning especially, when a loss of precision up to the model approximation gap is higher than a difference between actions’ objective values.
6.4.2. Incremental BSP algorithms
In the experiment b) we compare running times of incremental BSP algorithms. Standard incremental BSP was realized by employing state-of-the-art incremental smoothing and mapping algorithm ISAM2 from GTSAM library. It updates Cholesky factorization of an information matrix incrementally using operations on a graphical model called Bayes tree (Kaess et al. 2010). For the purpose of this analysis, we measure the run-time of only updating the Bayes tree of posterior beliefs corresponding to the candidate actions, since after obtaining this form calculating the information-theoretic cost is a simple operation of calculating a determinant of a triangular matrix. Topological BSP is based on the approximated von Neumann entropy signature calculated incrementally using the formula (16). The cumulative planning time across all planning sessions in MPC mode is shown in Figure 18(a) for standard BSP and in Figure 18(b) for the t-bsp based on . Again, t-bsp outperforms standard BSP (by two orders of magnitude). For reference, we also show non-incremental t-bsp based on which is also much faster then ISAM2 BSP. The graph in Figure 18c presents planning time per candidate action with the change of state dimension over time. As the theoretical analysis suggests, time complexity is near O(1) for the incremental t-bsp and near O(m) for its batch equivalent, where m is the number of edges in the topological graph, which in this case increases almost linearly with the state dimension.
Run-time comparison of incremental BSP algorithms in MPC mode. Standard BSP is based on ISAM2 algorithm for belief propagation and information matrix factorization (Figure 18(a)). Topological BSP is based on approximated von Neumann entropy signature (Figure 18(b)). Topological signature update per candidate action demonstrates almost constant-time complexity of incremental t-bsp and near linear-time of its batch equivalent (Figure 18(c)). The plot in Figure 18(b) shows cumulative planning time, that is, summation of planning times over the sessions and candidate actions from Figure 18(c).
7. Conclusions
This work provides theoretical foundations for information-theoretic topological belief space planning (t-bsp), a novel paradigm for highly efficient decision-making under uncertainty. The main idea consists of transforming the original optimization problem to a topological space where a solution is easier to find and then establish the error bounds with respect to the optimal solution. Doing so, we avoid belief propagation and explicitly calculating an objective function which offers significant speed-up to decision-making. Very importantly, even when using such approximations, an optimal BSP solution can still be obtained in cases when two optimization problems are action consistent, a concept introduced in this work for objective function approximations. In the general case, one can resort to t-bsp to drastically reduce computational cost while carefully monitoring a conservative estimate on the sacrifice in performance, that would be provided by the error bounds, or to find globally optimal solution by evaluating generally a much smaller number of candidate actions.
Topological signatures are defined on topologies of factor graphs that correspond to future posterior beliefs associated to candidate actions and their design depends on the objective function. In this work, we consider information-theoretic objective functions and pairwise factors whose topologies can be represented with simple graphs. Two topological signatures are proposed: normalized number of spanning trees of a graph and approximation of its von Neumann entropy by a function of node degrees. The first shows better action consistency, but the special focus in this paper is given to the second signature as it enables real-time performance of BSP while still offering good action consistency. It can be used in either batch or incremental mode. For both signatures, we derived error bounds of t-bsp that depend on topological parameters, measurement noise, and prior state estimate under maximum likelihood estimation framework and the bounds can be calculated with a small additional computational cost.
We presented an evaluation of our approach in realistic active SLAM simulations and in random pose-graph optimizations. Results show t-bsp was optimal in all planning sessions, but optimality could not always had been guaranteed depending on the measurement noise characteristics and diversity of candidate actions (topological and geometrical). However, in those sessions candidate paths carry very similar information gain and which action will be chosen is often irrelevant in practice, but when it is important, t-bsp can be used as a pre-processing tool to reduce the action search space. Also, decision-making is done by the upper entropy bound which is much more resilient to noise than the lower bound needed for the guarantees. Time performance improved drastically compared to state-of-the-art. Incremental t-bsp based on von Neumann entropy exhibits O(1) asymptotic complexity in active SLAM MPC-style planning. Also, update of the topological signature is very intuitive and its complexity is independent on which variables loop closure affects.
This work opens several research directions which might be interesting in the context of simplifications approaches for BSP. One is to investigate if a wider class of objective functions to which t-bsp is applicable exists, that is, where the solution is strongly characterized by the belief topology. Also, we might consider relaxing the assumptions on Gaussian beliefs, isotropic measurements noise or pairwise relations in measurement model used in this work. A first work in that direction is (Shienman et al. 2021) where uncertainty minimization of a predefined subset (focused set) of variables and uncorrelated differently weighted measurements are addressed, building upon the t-bsp approach for the unfocused case introduced here. Improving tightness of the error bounds would be of great interest for guarantees in fine-level planning, when control actions are topologically similar. Finally, topology prediction models could be further improved to increase prediction accuracy, for example, by taking into account probabilities of data associations. Overall, the presented approach contains both fundamental and practical contributions to belief space planning and we expect it will motivate further research and find diverse applications.
Supplemental Material
Supplemental Material
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was partially supported by the Israel Science Foundation (grant No. 371/20).
ORCID iD
Andrej Kitanov
Supplemental Material
Supplemental material for this article is available online.
Note
Appendix
References
1.
Agha-MohammadiAAChakravortySAmatoNM (2014) FIRM: sampling-based feedback motion planning under motion uncertainty and imperfect measurements. Intl J of Robotics Research33(2): 268–304.
2.
AngeliADoncieuxSMeyerJ, et al. (2009) Visual topological SLAM and global localization. In IEEE Intl. Conference On Robotics and Automation (ICRA),Kobe, Japan, 2009. pp. 4300–4305.
3.
ArchdeaconD (1996) Topological graph theory: a survey. In Congressus Numerantium. Utilitas Mathematica Pub. Incorporated, Volume 115, pp. 5–54.
4.
AtanasovNNyJLDaniilidisK, et al. (2015) Decentralized active information acquisition: theory and application to multi-robot SLAM. In IEEE Intl Conf On Robotics and Automation (ICRA),Seattle, WA, USA, 2015, pp. 4775–4782.
5.
BajcsyR (1988) Active perception. Proceedings of the IEEE76(8): 996–1005.
6.
BarooahPHespanhaJP (2007) Estimation on graphs from relative measurements. IEEE Control Systems Magazine27(4): 57–74.
7.
BhattacharyaSGhristRKumarV (2015) Persistent homology for path planning in uncertain environments. IEEE Trans Robotics31(3): 578–590.
8.
BianFKempeDGovindanR (2006) Utility based sensor selection. In Proceedings of the 5th Intl Conference On Information Processing in Sensor Networks,Nashville, TN, USA, 2006, pp. 11–18.
9.
BiggsN (1993) Algebraic Graph Theory. 2nd edition. Cambridge University Press.
10.
BlöchligerFFehrMDymczykM, et al. (2018) Topomap: topological mapping and navigation based on visual SLAM maps. In IEEE Intl Conf On Robotics and Automation (ICRA),Brisbane, QLD, Australia, 2018, pp. 3818–3825.
11.
BrouwerAEHaemersWH (2011) Spectra of Graphs. Springer.
12.
Carlevaris-BiancoNKaessMEusticeRM (2014) Generic node removal for factor-graph SLAM. IEEE Trans Robotics30(6): 1371–1385.
13.
CarrilloHReidICastellanosJ (2012) On the comparison of uncertainty criteria for active SLAM. In IEEE Intl. Conf. On Robotics and Automation (ICRA),Saint Paul, MN, USA, 2012, pp. 2080–2087.
14.
ChenYHuangSFitchR, et al. (2018) Efficient active SLAM based on submap joining, graph topology and convex optimization. In IEEE Intl. Conf. On Robotics and Automation (ICRA),Brisbane, QLD, Australia, 2018, pp. 5159–5166.
15.
ChenYHuangSFitchR, et al. (2019) On-line 3D active pose-graph SLAM based on key poses using graph topology and sub-maps. In IEEE Intl Conf On Robotics and Automation (ICRA),Montreal, QC, Canada, 2019, pp. 169–175.
16.
ChenYHuangSZhaoL, et al. (2021) Cramér–rao bounds and optimal design metrics for pose-graph SLAM. IEEE Transactions on Robotics37: 627–641. DOI: 10.1109/TRO.2020.3001718.
17.
ChosetHNagataniK (2001) Topological simultaneous localization and mapping (SLAM): toward exact localization without explicit localization. IEEE Trans Robot Automat17(2): 125–137.
18.
ChungFR (1997) Spectral Graph Theory. American Mathematical Soc.
19.
DavidsonJCHutchinsonSA (2009) A sampling hyperbelief optimization technique for stochastic systems. In Intl. Workshop on the Algorithmic Foundations of Robotics (WAFR). Springer, pp. 217–231.
20.
DellaertF (2012) Factor graphs and GTSAM: a hands-on introduction. In Technical Report GT-RIM-CP&R-2012-002. Georgia Institute of Technology.
21.
DellaertFKaessM (2017) Factor graphs for robot perception. Foundations and Trends in Robotics6(1–2): 1–139.
22.
DuJCarloneLNgMK, et al. (2011) A comparative study on active SLAM and autonomous exploration with particle filters. In Proc. of the IEEE/ASME Int. Conference on Advanced Intelligent Mechatronics, Budapest, Hungary, 03-07 July 2011.
23.
ElimelechKIndelmanV (2017a) Consistent sparsification for efficient decision making under uncertainty in high dimensional state spaces. In IEEE Intl. Conference On Robotics and Automation (ICRA), Singapore, 29 May 2017 - 03 June 2017, pp. 3786–3791.
24.
ElimelechKIndelmanV (2017b) Fast action elimination for efficient decision making and belief space planning using bounded approximations. In Proc Of the Intl Symp of Robotics Research (ISRR). Puerto Varas, Chile, 2017.
25.
ElimelechKIndelmanV (2017c) Scalable sparsification for efficient decision making under uncertainty in high dimensional state spaces. In IEEE/RSJ Intl. Conf. On Intelligent Robots and Systems (IROS), Vancouver, BC, 24-28 September 2017, pp. 5668–5673.
26.
FolkessonJChristensenH (2004) Graphical SLAM – a self-correcting map. In IEEE Intl. Conf. on Robotics and Automation ICRA), New Orleans, LA, 26 April 2004 - 01 May 2004, pp. 383–390.
27.
FreseU (2006) Treemap: an O(log n) algorithm for indoor simultaneous localization and mapping. Autonomous Robots21(2): 103–122.
28.
FreyBKschischangFLoeligerHA, et al. (1997) Factor graphs and algorithms. In Proc 35th Allerton Conf Communications, Control,and Computing, University of Illinois, 1997, pp. 666–680
29.
GrossJLTuckerTW (1987) Topological Graph Theory. New York, NY: Wiley.
30.
HanLEscolanoFHancockER, et al. (2012) Graph characterizations from von Neumann entropy. Pattern Recognition Letters33(15): 1958–1967.
31.
HornRAJohnsonCR (2013) Matrix Analysis. 2nd edition. Cambridge University Press.
32.
HovlandGEMcCarragherBJ (1997) Dynamic sensor selection for robotic systems. IEEE Intl. Conf. on Robotics and Automation (ICRA)Leuven, Belgium, May 16-20, 1998.
33.
HuangSKwokNDissanayakeG, et al. (2005) Multi-step look-ahead trajectory planning in SLAM: possibility and necessity. In IEEE Intl. Conf. On Robotics and Automation (ICRA),Barcelona, Spain, 2005, pp. 1091–1096.
34.
IndelmanV (2015) Towards information-theoretic decision making in a conservative information space. In American Control Conference, Chicago, IL, USA, 2015, pp. 2420–2426.
35.
IndelmanV (2016) No correlations involved: decision making under uncertainty in a conservative sparse information space. IEEE Robotics and Automation Letters1(1): 407–414.
36.
IndelmanVCarloneLDellaertF (2015) Planning in the continuous domain: a generalized belief space approach for autonomous navigation in unknown environments. Intl J of Robotics Research34(7): 849–882.
37.
JoshiSBoydS (2009) Sensor selection via convex optimization. IEEE Transactions on Signal Processing57(2): 451–462.
38.
KaessMRanganathanADellaertF (2008) iSAM: incremental smoothing and mapping. IEEE Trans Robotics24(6): 1365–1378.
39.
KaessMIlaVRobertsR, et al. (2010) The Bayes tree: an algorithmic foundation for probabilistic robot mapping. In Intl Workshop on the Algorithmic Foundations of Robotics. Springer.
40.
KaessMJohannssonHRobertsR, et al. (2012) iSAM2: incremental smoothing and mapping using the Bayes tree. Intl J of Robotics Research31(2): 217–236.
41.
KavrakiLSvestkaPLatombeJC, et al. (1996) Probabilistic roadmaps for path planning in high-dimensional configuration spaces. IEEE Trans. Robot Automat12(4): 566–580.
42.
KhosoussiKHuangSDissanayakeG (2015) Good, bad and ugly graphs for SLAM. In Robotics: Science and Systems (RSS). Workshop on the Problem of Mobile Sensors. Springer.
43.
KhosoussiKGiamouMSukhatmeGS, et al. (2019) Reliable graph topologies for SLAM. Intl J of Robotics Research38(2–3): 260–298.
44.
KimAEusticeRM (2014) Active visual SLAM for robotic area coverage: theory and experiment. Intl J of Robotics Research34(4–5): 457–475.
45.
KimSBhattacharyaSGhristR, et al. (2013) Topological exploration of unknown and partially known environments. In IEEE/RSJ Intl Conf On Intelligent Robots and Systems (IROS), Tokyo, Japan, 03-07 November 2013, pp. 3851–3858.
46.
KitanovAIndelmanV (2018) Topological multi-robot belief space planning in unknown environments, In IEEE Intl Conf On Robotics and Automation (ICRA), Brisbane, QLD, 21-25 May 2018, pp. 5726–5732.
47.
KollerDFriedmanN (2009) Probabilistic Graphical Models: Principles and Techniques. The MIT Press.
48.
KopitkovDIndelmanV (2017) No belief propagation required: belief space planning in high-dimensional state spaces via factor graphs, matrix determinant lemma and re-use of calculation. Intl J of Robotics Research36(10): 1088–1130.
49.
KopitkovDIndelmanV (2019) General purpose incremental covariance update and efficient belief space planning via factor-graph propagation action tree. Intl J of Robotics Research38(14): 1644–1673.
50.
KretzschmarHStachnissC (2012) Information-theoretic compression of pose graphs for laser-based SLAM. Intl J of Robotics Research31(11): 1219–1230.
51.
KschischangFFreyBLoeligerHA (2001) Factor graphs and the sum-product algorithm. IEEE Trans Inform Theory47(2): 498–519.
52.
KurniawatiHHsuDLeeWS (2008) SARSOP: efficient point-based POMDP planning by approximating optimally reachable belief spaces. In Robotics: Science and Systems (RSS). Singapore, 2008.
53.
LauritzenSLSpiegelhalterDJ (1988) Local computations with probabilities on graphical structures and their application to expert systems. Journal of the Royal Statistical Society. Series B (Methodological)50(2): 157–224.
54.
MoharB (1991) The Laplacian spectrum of graphs. In Graph Theory, Combinatorics, and Applications. Wiley, pp. 871–898.
55.
MowshowitzADehmerM (2012) Entropy and the complexity of graphs revisited. Entropy14(3): 559–570.
56.
PapadimitriouCTsitsiklisJ (1987) The complexity of Markov decision processes. Mathematics of operations research12(3): 441–450.
57.
PaskinM (2003) Thin junction tree filters for simultaneous localization and mapping. In Intl Joint Conf OnAI (IJCAI). San Francisco, CA, USA, August2003, pp. 1157–1164.
58.
PasseriniFSeveriniS (2009) Quantifying complexity in networks: the von Neumann entropy. International Journal of Agent Technologies and Systems (IJATS)1(4): 58–67.
59.
PaulRNewmanP (2010) Fab-map 3d: topological mapping with spatial and visual appearance. In IEEE Intl Conf On Robotics and Automation (ICRA),Anchorage, AK, USA, 2010, pp. 2649–2656.
60.
PetzD (2001) Entropy, von Neumann and the von Neumann entropy. In John Von Neumann and the Foundations of Quantum Physics. Springer, pp. 83–96.
61.
PineauJGordonGJThrunS (2006) Anytime point-based approximations for large POMDPs. Journal of Artificial Intelligence Research27: 335–380.
62.
PlacedJAStraderJCarrilloH, et al. (2023) A survey on active simultaneous localization and mapping: state of the art and new frontiers. IEEE Trans Robotics39(3): 1686–1705.
63.
PlattRTedrakeRKaelblingL, et al. (2010) Belief space planning assuming maximum likelihood observations. In Robotics: Science and Systems (RSS). Zaragoza, Spain: MIT, pp. 587–593.
64.
PortaJMVlassisNSpaanMT, et al. (2006) Point-based value iteration for continuous POMDPs. Journal of Machine Learning Research7: 2329–2367.
65.
PrenticeSRoyN (2009) The belief roadmap: efficient planning in belief space by factoring the covariance. Intl. J. of Robotics Research28(11–12): 1448–1465.
RegevTIndelmanV (2017) Decentralized multi-robot belief space planning in unknown environments via efficient re-evaluation of impacted paths. In Autonomous Robots Special Issue on Online Decision Making in Multi-Robot Coordination. Springer.
68.
ShienmanMKitanovAIndelmanV (2021) FT-BSP: focused topological belief space planning. IEEE Robotics and Automation Letters6(3): 4744–4751.
69.
SorensonHW (1980) Parameter Estimation: Principles and Problems. M. Dekker, Volume 9.
70.
StachnissCHaehnelDBurgardW (2004) Exploration with active loop-closing for FastSLAM. In IEEE/RSJ Intl Conf on Intelligent Robots and Systems IROS), Sendai, Japan, 28 September 2004 - 02 October 2004, pp. 1505–1510.
71.
SucarLE (2015) Probabilistic Graphical Models Principles and Applications. London: Springer.
72.
ThrunSLeonardJJ (2008) Simultaneous localization and mapping. In Springer Handbook of Robotics. Springer Verlag, pp. 871–889.
73.
ValenciaRMiróJVDissanayakeG, et al. (2012) Active pose SLAM. In: IEEE/RSJ Intl Conf On Intelligent Robots and Systems (IROS).Vilamoura-Algarve, Portugal, 2012, pp. 1885-1891.
74.
Van Den BergJAbbeelPGoldbergK (2011) LQG-MP: optimized path planning for robots with motion uncertainty and imperfect state information. The International Journal of Robotics Research30(7): 895–913.
75.
Van Den BergJPatilSAlterovitzR (2012) Motion planning under uncertainty using iterative local optimization in belief space. Intl J of Robotics Research31(11): 1263–1278.
76.
VossCMollMKavrakiLE (2015) A heuristic approach to finding diverse short paths. In IEEE Intl Conf On Robotics and Automation (ICRA),Seattle, WA, USA, 2015, pp. 4173–4179.
77.
WilsonRCZhuP (2008) A study of graph spectra for comparing graphs and trees. Pattern Recognition41(9): 2833–2841.
78.
YannakakisM (1981) Computing the minimum fill-in is NP-complete. SIAM Journal on Algebraic Discrete Methods2(1): 77–79.
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.