Abstract

We present novel, convex relaxations for rotation and pose estimation problems that can a posteriori guarantee global optimality for practical measurement noise levels. Some such relaxations exist in the literature for specific problem setups that assume the matrix von Mises-Fisher distribution (a.k.a., matrix Langevin distribution or chordal distance) for isotropic rotational uncertainty. However, another common way to represent uncertainty for rotations and poses is to define anisotropic noise in the associated Lie algebra. Starting from a noise model based on the Cayley map, we define our estimation problems, convert them to Quadratically Constrained Quadratic Programs (QCQPs), then relax them to Semidefinite Programs (SDPs), which can be solved using standard interior-point optimization methods; global optimality follows from Lagrangian strong duality. We first show how to carry out basic rotation and pose averaging. We then turn to the more complex problem of trajectory estimation, which involves many pose variables with both individual and inter-pose measurements (or motion priors). Our contribution is to formulate SDP relaxations for all these problems based on the Cayley map (including the identification of redundant constraints) and to show them working in practical settings. We hope our results can add to the catalogue of useful estimation problems whose solutions can be a posteriori guaranteed to be globally optimal.

Keywords

Rotation estimation, pose estimation, quadratically constrained quadratic program, semi-definite program, Lagrangian duality, Cayley map

1. Introduction

State estimation is concerned with fusing several noisy measurements (and possibly a prior model) into a less noisy estimate of the state (e.g., position, velocity, orientation) of a vehicle, robot, or other object of interest. Real-world state estimation problems often involve measurement functions and motion models that are nonlinear with respect to the state. Alternatively, the state itself may not be an element of a vector space, such as the rotation of a rigid body. These challenging aspects typically mean that when we set up our estimator as an optimization, it is a nonconvex problem; the cost function, the feasible set, or both are not convex. Nonconvex optimization problems are in general much harder to solve than convex ones because they can have local minima and common gradient-based optimizers can easily become trapped therein.

For example, we might have a generic nonlinear least-squares problem such as

\min_{x} \sum_{m = 1}^{M} {(y_{m} - g (x))}^{T} (y_{m} - g (x)),

(1)

where x is the unknown state, y_m are noisy measurements, and g(⋅) is a measurement function. If g(x) is linear in x, then this problem is convex, but otherwise it often is not. Using gradient descent or Gauss-Newton to solve this problem means we usually require a good initial guess for x to arrive at the global minimum. What if such an initial guess is not available? Could we solve a problem such as this one globally without such a guess? It turns out the answer may be yes, depending on the specific problem to be solved.

There has been quite a bit of work in robotics and computer vision aimed at the idea of solving estimation problems globally. Most of these works employ sophisticated tools from the optimization literature to achieve this. In particular, Lagrangian duality is used to derive convex relaxations, which can be solved globally. Boyd and Vandenberghe (2004, §5) provide the necessary background on duality theory. We will be following a common pathway where we first convert our nonconvex optimization problem into another nonconvex form called a Quadratically Constrained Quadratic Program (QCQP); from here we relax this to a (convex) Semi-definite Program (SDP), amenable to off-the-shelf solvers (e.g., the interior-point-based SDP solver in mosek (ApS, 2019)). This last step is known as Shor’s relaxation (Shor, 1987). Theoretically, SDPs admit polynomial-time solutions because they are convex; in practice, modern solvers can handle SDPs with a few thousand variables in reasonable time but struggle to scale beyond this. Our contribution in this paper is to show that we can solve a set of estimation problems involving rotations and poses globally, using these convex relaxation tools; the novelty lies in the fact that our particular problems (formulated using the Cayley map) have not been examined in this globally optimal framework before.

As mentioned, the convex relaxation procedure we employ has been well known in the optimization community for some time. In particular, it has been used for polynomial optimization (Parrilo, 2003) and in various combinatorial optimization problems, such as quadratic assignment (Nesterov et al., 2000) and max-cut graph partitioning (Anjos and Wolkowicz, 2002). To the authors’ knowledge, the first use of SDP relaxations in the robotics community was by Liu et al. (2012) for planar Simultaneous Localization and Mapping (SLAM), though their application in computer vision (Kahl and Henrion, 2005) and signal processing (Krislock and Wolkowicz, 2010; Luo et al., 2010) occurred earlier. More generally, Cifuentes et al. (2022) provide a nice overview of some common problems in computer vision and robotics where these tools have been applied before, as well as a rationale for why they are so effective. One of the most commonly investigated relaxations in the robotics and vision communities is rotation synchronization¹ (Bandeira et al., 2017; Dellaert et al., 2020; Eriksson et al., 2018, 2021); here several rotations are linked through noisy relative rotation measurements. Rotation synchronization turns out to be the nucleus of several of the other problems under study including pose-graph optimization (Briales and Gonzalez-Jimenez, 2017a; Carlone and Calafiore, 2018; Carlone et al., 2016; Rosen et al., 2019; Tian et al., 2021), point-set registration (Chaudhury et al., 2015; Iglesias et al., 2020; Yang et al., 2021), calibration (Giamou et al., 2019; Wise et al., 2020), mutual localization (Wang et al., 2022b), and landmark-based SLAM (Holmes and Barfoot, 2023). Rotation synchronization and its cousins have been shown to admit fast solutions through low-rank factorizations (Rosen et al., 2019). More recently, other measurement models such as range sensing have also been incorporated into globally certifiable problems (Dümbgen et al., 2023; Papalia et al., 2023).

It is worth also mentioning the works of Forbes and De Ruiter (2015); Horowitz et al. (2014) on globally optimal pointcloud alignment, which also employ convex relaxation, but the route to the SDP is not Shor’s relaxation; instead, Lie group optimization variables are relaxed to live in a convex set. On the surface, this approach is not applicable to the problems considered herein; for example, our cost functions are not always initially convex.

A common thread that ties most of the existing literature together is that the chordal distance is used to construct the terms of the cost function that involve rotation variables. Viewed through a probabilistic lens, the chordal distance is related to the matrix Langevin or matrix von Mises-Fisher distribution, whose density can be written in the form

p (C) \propto \exp (- \frac{1}{2} σ^{- 1} tr ((C - \bar{C}) {(C - \bar{C})}^{T})),

(2)

where C is a rotation matrix, $\bar{C}$ is the mode, and σ > 0 is a scalar concentration parameter; this distribution is isotropic, which is one limitation we aim to overcome in this work. This is not the only way to represent rotational uncertainty. Another common way is to use exponential coordinates (e.g., Barfoot and Furgale (2014); Long et al. (2013)), where a rotational distribution can have a density of the form

p (C) \propto \exp (- \frac{1}{2} \ln {(C {\bar{C}}^{T})}^{\lor^{T}} Σ^{- 1} \ln {(C {\bar{C}}^{T})}^{\lor}),

(3)

where C is a member of the matrix Lie group SO(3), $\bar{C}$ is also a member of SO(3) and represents the mean, and Σ is an anisotropic matrix covariance. We also have exp(⋅) as the matrix exponential, ln(⋅) the matrix logarithm, and ∨ a Lie algebra operator detailed a bit later in the paper. Here we are essentially defining a Gaussian distribution in the vector space of the Lie algebra associated with SO(3) and then mapping the uncertainty to the Lie group through the matrix exponential. This allows for anisotropic distributions and the same approach can be easily extended to any matrix Lie group, such as the special Euclidean group SE(3) that represents poses (see, e.g., Barfoot and Furgale (2014); Long et al. (2013)). Our aim in this paper is to present some novel convex relaxations where rotational uncertainty is defined closer to this exponential coordinate model; to achieve this, we use the Cayley map, which is very close to the exponential map for small-to-moderate rotational uncertainty. Our Cayley distributions will have the form

p (C) \propto \exp (- \frac{1}{2} {cay}^{- 1} {(C {\bar{C}}^{T})}^{\lor^{T}} Σ^{- 1} {cay}^{- 1} {(C {\bar{C}}^{T})}^{\lor}),

(4)

where cay(⋅) is the Cayley map. This also allows us to define our optimization problems directly on SE(3) rather than $S O (3) \times R^{3}$ when poses are involved.

To our knowledge, the examination of global optimality for state estimation problems where rotational (and pose) uncertainty is defined in this way has not been explored previously in the literature. Our novel contribution is therefore a family of specific convex relaxations of rotation and pose estimation problems formulated using the Cayley map (including redundant constraints needed to make them work in practice); this is important as it opens the door to providing certification for a broad class of state estimation problems used in practice.

This paper is organized as follows. In Section 2, we review the relevant mathematical background including Lie groups, the Cayley map, and the convex relaxation procedure that we will employ. Section 3 presents the basic problems of averaging a number of noisy rotation or pose measurements. In Section 4, we expand the method to include discrete-time trajectory estimation of several poses based on individual and inter-pose measurements. Section 5 expands this to include so-called continuous-time trajectory estimation where we have a smoothing assumption on the trajectory and estimate both pose and twist at each state. In each of Sections 3 to 5, we provide experimental results that demonstrate the viability of our convex relaxations to find globally optimal solutions. Section 6 concludes the paper. Appendix A discusses the similarity between distributions defined using the exponential and Cayley maps while Appendix B presents the baseline local solvers to which we compare our new global estimates.

2. Mathematical background

We begin by reviewing the relevant background concepts for the paper including Lie groups, the Cayley map, and convex relaxations of nonconvex optimization problems via Lagrangian duality.

2.1. Lie groups for rotations and poses

The special orthogonal group, representing rotations, is the set of valid rotation matrices:

S O (3) = \{C \in R^{3 \times 3} | C C^{T} = I, det (C) = 1\},

(5)

where I is the identity matrix. It is common to map a vector, $ϕ \in R^{3}$, to a rotation matrix, C, through the matrix exponential,

C (ϕ) = \exp (ϕ^{\land}),

(6)

where (⋅)^∧ is the skew-symmetric operator,

ϕ^{\land} = {[\begin{matrix} ϕ_{1} \\ ϕ_{2} \\ ϕ_{3} \end{matrix}]}^{\land} = [\begin{matrix} 0 & - ϕ_{3} & ϕ_{2} \\ ϕ_{3} & 0 & - ϕ_{1} \\ - ϕ_{2} & ϕ_{1} & 0 \end{matrix}],

(7)

and $ϕ = φ a \in R^{3}$, the product of the angle and unit axis of rotation. The mapping is surjective-only, meaning every C can be produced by many different values for ϕ.

The special Euclidean group, representing poses (i.e., translation and rotation), is the set of valid transformation matrices:

S E (3) = \{T = [\begin{matrix} C & r \\ 0^{T} & 1 \end{matrix}] \in R^{4 \times 4} | C \in S O (3), r \in R^{3}\} .

(8)

It is again common to map a vector, $ξ \in R^{6}$ , to a transformation matrix, T ∈ SE(3), through the matrix exponential,

T (ξ) = \exp (ξ^{\land}),

(9)

where we have overloaded the ∧ operator as

ξ^{\land} = {[\begin{matrix} ρ \\ ϕ \end{matrix}]}^{\land} = [\begin{matrix} ϕ^{\land} & ρ \\ 0^{T} & 0 \end{matrix}] .

(10)

Notationally, we will use ∨ to mean the inverse operation of ∧. As is common practice (Barfoot, 2024), we have broken the pose vector, ξ , into a translational component, ρ , and a rotational component, ϕ . The mapping is also surjective-only, meaning every T can be produced by many different values for ξ .

Finally, the adjoint of a pose is given by

\mathcal{T} (ξ) = Ad (T) = [\begin{matrix} C & r^{\land} C \\ 0 & C \end{matrix}],

(11)

which is now 6 × 6. We will refer to the set of adjoints as Ad(SE(3)). We can map a vector, $ξ \in R^{6}$ , to an adjoint transformation matrix again through the matrix exponential map:

\mathcal{T} (ξ) = \exp (ξ^{⋏}),

(12)

where

ξ^{⋏} = {[\begin{matrix} ρ \\ ϕ \end{matrix}]}^{⋏} = [\begin{matrix} ϕ^{\land} & ρ^{\land} \\ 0 & ϕ^{\land} \end{matrix}] .

(13)

Notationally, we will use ⋎ to mean the inverse operation of ⋏. The mapping is again surjective-only, meaning every adjoint matrix $\mathcal{T}$ can be produced by many different values for ξ.

2.2. Cayley map

While the exponential map is the canonical way to map from a Lie algebra (vector space) to a Lie group, it is not the only possibility. There are in fact infinitely many such vectorial mappings for SO(3) (Bauchau and Trainelli, 2003), SE(3) (Barfoot et al., 2022), and Ad(SE(3)) (Bauchau, 2011; Bauchau and Choi, 2003).

In particular, it is well known that for the Cayley-Gibbs-Rodrigues parameterization of rotation we can write the rotation matrix in terms of the Cayley map, $cay (A) = {(I - \frac{1}{2} A)}^{- 1} (I + \frac{1}{2} A)$ , according to Bauchau and Trainelli (2003); Borri et al. (2000):

\begin{align} C (ϕ) & = cay (ϕ^{\land}) = {(I - \frac{1}{2} ϕ^{\land})}^{- 1} (I + \frac{1}{2} ϕ^{\land}), \end{align}

(14a)

\begin{align} ϕ & = {cay}^{- 1} {(C)}^{\lor} = {(2 (C - I) {(C + I)}^{- 1})}^{\lor}, \end{align}

(14b)

for some $ϕ = 2 \tan \frac{φ}{2} a \in R^{3}$ with φ the rotation angle and a the unit axis. Borri et al. (2000) and later Selig (2007) demonstrated that the Cayley map can also be used to map pose vectors to SE(3) according to

\begin{align} T (ξ) & = cay (ξ^{\land}) = {(I - \frac{1}{2} ξ^{\land})}^{- 1} (I + \frac{1}{2} ξ^{\land}), \end{align}

(15a)

\begin{align} ξ & = {cay}^{- 1} {(T)}^{\lor} = {(2 (T - I) {(T + I)}^{- 1})}^{\lor}, \end{align}

(15b)

for some $ξ \in R^{6}$ . Although we will not need it, the Cayley map can be used to map pose vectors to Ad(SE(3)) according to

\begin{align} \mathcal{T} (ξ) & = cay (ξ^{⋏}) = {(I - \frac{1}{2} ξ^{⋏})}^{- 1} (I + \frac{1}{2} ξ^{⋏}), \end{align}

(16a)

\begin{align} ξ & = {cay}^{- 1} {(\mathcal{T})}^{⋎} = {(2 (\mathcal{T} - I) {(\mathcal{T} + I)}^{- 1})}^{⋎} . \end{align}

(16b)

However, Selig (2007) demonstrates that starting from the same ξ and applying (15a), (15b), (16a) and (16b) does not result in an equivalent transformation, that is, $\mathcal{T} (ξ) \neq Ad (T (ξ))$, where $\mathcal{T} (ξ)$ is the 6 × 6 matrix produced by (16a) and $T (ξ)$ is the 4 × 4 matrix produced by (15a); the commutative property for adjoints does not hold. Nevertheless, we shall not require this property here.
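As a quick numerical sanity check of the rotation case, (14a) and (14b), the following is a minimal sketch in pure Python. The helper names (`hat`, `vee`, `cay`, `cay_inv`, `rodrigues`) and the test values are our own assumptions for illustration and do not come from any library. It builds a rotation with the Cayley map from $ϕ = 2 \tan \frac{φ}{2} a$ and compares it with the exponential map of $φ a$, evaluated through Rodrigues' formula; under these assumptions the two should agree up to round-off, and the inverse map should recover ϕ.

```python
import math

I3 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

def hat(v):
    """Skew-symmetric matrix of a 3-vector, as in (7)."""
    x, y, z = v
    return [[0.0, -z, y], [z, 0.0, -x], [-y, x, 0.0]]

def vee(m):
    """Inverse of hat: read the 3-vector out of a skew-symmetric matrix."""
    return [m[2][1], m[0][2], m[1][0]]

def matmul(a, b):
    """Product of two 3x3 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def add(a, b, s=1.0):
    """Return a + s * b for 3x3 matrices."""
    return [[a[i][j] + s * b[i][j] for j in range(3)] for i in range(3)]

def inv3(m):
    """Inverse of a 3x3 matrix via the adjugate."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    adj = [[e * i - f * h, c * h - b * i, b * f - c * e],
           [f * g - d * i, a * i - c * g, c * d - a * f],
           [d * h - e * g, b * g - a * h, a * e - b * d]]
    return [[x / det for x in row] for row in adj]

def cay(phi):
    """Cayley map (14a): C = (I - phi^/2)^{-1} (I + phi^/2)."""
    P = hat(phi)
    return matmul(inv3(add(I3, P, -0.5)), add(I3, P, 0.5))

def cay_inv(C):
    """Inverse Cayley map (14b): phi = (2 (C - I) (C + I)^{-1})^vee."""
    M = matmul(add(C, I3, -1.0), inv3(add(C, I3, 1.0)))
    return [2.0 * x for x in vee(M)]

def rodrigues(angle, axis):
    """Exponential map exp(angle * axis^) via Rodrigues' formula."""
    A = hat(axis)
    return add(add(I3, A, math.sin(angle)), matmul(A, A), 1.0 - math.cos(angle))

# Hypothetical test values: a unit axis and a moderate rotation angle [rad].
axis = [1.0 / 3.0, 2.0 / 3.0, 2.0 / 3.0]
angle = 0.5
phi = [2.0 * math.tan(angle / 2.0) * a for a in axis]

C = cay(phi)
R = rodrigues(angle, axis)
print(max(abs(C[i][j] - R[i][j]) for i in range(3) for j in range(3)))
print(cay_inv(C), phi)
```

If the same vector is fed to both maps without the $2 \tan \frac{φ}{2}$ rescaling, the two rotations share an axis and their angles differ only at third order in φ, which is consistent with the close agreement for small-to-moderate rotations noted in the text.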

Figure 1 provides examples comparing two rotational distributions derived from the exponential and Cayley maps that have approximately the same variance; we can see that even with quite large rotational uncertainty they match quite closely. Appendix A provides some further discussion on how closely these distributions can be made to match. The key idea of the paper will be to replace instances of the exponential map with the Cayley map, which we will see is more amenable to producing polynomial optimization problems, a key prerequisite to our route to global optimality.

Figure 1.

Comparison of uncertainty on rotation angle, φ, for the exponential and Cayley maps, where the variances have been approximately matched (see Appendix A for further discussion of how this was done). (left) Standard deviation of rotational uncertainty is σ = 0.2 [rad]. (right) σ = 0.5 [rad]. The match is good in both cases with more divergence as rotational uncertainty increases.

The Cayley map has been used in the past for rotation, pose, and trajectory estimation (Alismail et al., 2014; Barfoot et al., 2022; Junkins et al., 2011; Majji et al., 2011; Mortari et al., 2007; Qian et al., 2020; Wong et al., 2018; Wong and Majji, 2016), typically to parameterize rotations or poses thereby creating a simpler unconstrained quadratic optimization problem. The drawback of these approaches is that they are still subject to singularities and local minima. The Cayley map has also been used to achieve global optimality in the perspective-n-point (PnP) problem (Nakano, 2015; Wang et al., 2022a); we take a quite different approach, however, through the use of convex relaxations. Additionally, the Cayley map has found application in parametrizing lines in structure from motion, as an unconstrained alternative to parametrizations such as Plücker coordinates (Zhang and Koch, 2014).

The Cayley map has also been employed in areas of robotics other than estimation. In Kobilarov (2014); Kobilarov and Marsden (2011), for example, the authors suggest using the Cayley map for rotation parametrization in the context of optimal control of mechanical systems on Lie groups. The authors observe that, compared with the exponential map, the Cayley map is computationally more efficient because of its simpler structure, in particular as it circumvents trigonometric functions. It is also noted that the Cayley map has no singularities in its gradients, which is of advantage for the numerical stability of commonly used solvers (Kobilarov and Marsden, 2011). In Solo and Wang (2019), the Cayley map is employed for simulating stochastic differential equations that evolve on Stiefel manifolds, which is subsequently used in Wang and Solo (2020) for a novel particle filter variant. It is possible that our global optimality approach to using the Cayley map could be employed within some of these applications, but we leave this investigation for future work.

2.3. Convex relaxations

We next summarize the key optimization tools that we will use. Boyd and Vandenberghe (2004) provide the appropriate background. Suppose that we have a nonconvex optimization problem of the form

\begin{matrix} \min & f (z) \\ \text{w.r.t.} & z \\ \text{s.t.} & g_{i} (z) = 0 \; (\forall i) . \end{matrix}

(17)

We attempt to introduce appropriate nonlinear substitution variables, x, to replace z so that both the objective and the constraints can be written in a standard, homogeneous, quadratic form:

\begin{matrix} \min & x^{T} Q x \\ \text{w.r.t.} & x \\ \text{s.t.} & x^{T} A_{0} x = 1 \\ & x^{T} A_{i} x = 0 \; (\forall i \neq 0) . \end{matrix}

(18)

This problem is a Quadratically Constrained Quadratic Program (QCQP), which is still nonconvex and typically of higher dimension than the original problem, but possesses more exploitable structure. Next, by defining X = xx^T, we rewrite this problem exactly as

\begin{matrix} \min & tr (Q X) \\ \text{w.r.t.} & X \\ \text{s.t.} & X ⪰ 0 \\ & rank (X) = 1 \\ & tr (A_{0} X) = 1 \\ & tr (A_{i} X) = 0 \; (\forall i \neq 0) . \end{matrix}

(19)

Finally, if we drop the rank(X) = 1 constraint, we have a convex relaxation of the problem in the form of a Semidefinite Program (SDP):

\begin{matrix} \min & tr (Q X) \\ \text{w.r.t.} & X \\ \text{s.t.} & X ⪰ 0 \\ & tr (A_{0} X) = 1 \\ & tr (A_{i} X) = 0 \; (\forall i \neq 0) . \end{matrix}

(20)

This is known as Shor’s relaxation (Shor, 1987). If the solution to this problem happens to result in rank(X) = 1, then we have an a posteriori² guarantee that we also have a global solution to the original problem, x. Since SDPs are convex problems, we can attempt to use standard solvers, such as interior-point methods, to solve them numerically. While there are tractability issues to be addressed to scale up to very large problem instances, we will see that for the problems in this paper, this approach is viable for nontrivial problem sizes.
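To make the pipeline (18)–(20) concrete, consider a deliberately simple instance that is not one of the problems studied in this paper: a symmetric Q with the single constraint $A_{0} = I$ and no other $A_{i}$. We include it only as an illustration of what tightness means.

```latex
\begin{align}
\text{QCQP:} \quad & \min_{x} \; x^{T} Q x \quad \text{s.t.} \quad x^{T} x = 1, \\
\text{SDP:} \quad & \min_{X} \; \mathrm{tr}(Q X) \quad \text{s.t.} \quad X \succeq 0, \; \mathrm{tr}(X) = 1 .
\end{align}
```

Here the QCQP is the minimum-eigenvalue problem: its optimal cost is $λ_{\min}(Q)$, attained at a unit eigenvector v. For any feasible X of the SDP we have $tr (Q X) \geq λ_{\min}(Q) \, tr (X) = λ_{\min}(Q)$, and $X = v v^{T}$ attains this bound, so the relaxation is tight and a rank-1 optimal solution exists. If $λ_{\min}(Q)$ is repeated, higher-rank optimal X also exist, which is one reason the rank of a solver's output has to be checked a posteriori.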

Unfortunately, for most problems in this paper, Shor’s relaxation is not tight out of the box. In this paper, tightness means rank(X) = 1 (and therefore that the optimal cost matches that of the original problem). However, there is still a way forward. We can attempt to introduce additional so-called redundant constraints to tighten up the relaxation. These constraints do not affect the feasible set of the original optimization problem, but they do reduce the feasible set of the SDP in order to tighten it.

The technique of adding redundant constraints to improve the tightness of a given SDP relaxation has been known for some time in the optimization literature (Anstreicher and Wolkowicz, 2000; Nesterov et al., 2000). More recently, there have been several cases in which redundant constraints have been used in the robotics (Yang et al., 2021; Yang and Carlone, 2023; Giamou et al., 2019; Wang et al., 2022b) and machine vision (Briales et al., 2018; Briales and Gonzalez-Jimenez, 2017b; Garcia-Salguero et al., 2022; Kezurer et al., 2015) literature. With redundant constraints, our problem becomes

\begin{matrix} \min & tr (Q X) \\ \text{w.r.t.} & X \\ \text{s.t.} & X ⪰ 0 \\ & tr (A_{0} X) = 1 \\ & tr (A_{i} X) = 0 \; (\forall i \neq 0) \\ \text{(redundant)} & tr (B_{j} X) = 0 \; (\forall j \neq 0), \end{matrix}

(21)

where the B_j encapsulate these additional redundant constraints. In theory, Lasserre (2001) tells us how to tighten our SDP, if possible, by adding a progression of variables and constraints, but adding too many constraints can be computationally expensive and in practice not necessary for tightness.³ On the other hand, devising a sufficient set of constraints can be challenging by trial and error. In our concurrent work, we have been developing a tool to automatically find such constraints (Duembgen et al., 2023), which we used to help identify some of the constraints reported in this paper. In all of the pose estimation problems to follow, we do require redundant constraints and we will be explicit in enumerating ones that in practice result in tight SDP relaxations of our problems. This could be viewed as the core contribution of the paper.

3. Averaging

We will build up our optimization problems gradually starting with simply ‘averaging’ several noisy estimates of rotation or pose.

3.1. Rotation averaging

In order to average M rotations, we could set up an optimization problem as

\begin{matrix} \min & \sum_{m = 1}^{M} \ln {(C {\tilde{C}}_{m}^{T})}^{\lor^{T}} W_{m} \ln {(C {\tilde{C}}_{m}^{T})}^{\lor} \\ \text{w.r.t.} & C \\ \text{s.t.} & C \in S O (3), \end{matrix}

(22)

where ${\tilde{C}}_{m} \in S O (3)$ are the noisy rotations to be averaged and W_m is a matrix weight. This type of cost function is used frequently in rotational estimation problems (Barfoot, 2024) and can represent a maximum-likelihood problem when the generative model for the noisy measurements is of the form

{\tilde{C}}_{m} = \exp (ϕ_{m}^{\land}) C, ϕ_{m} \sim N (0, W_{m}^{- 1}) .

(23)

Alternatively, we can view our cost as the negative log-likelihood of the joint distribution of the measurements if each obeys (3). The trouble is that the matrix exponential and logarithm are difficult expressions to manipulate into the QCQP form we seek.

This is where the main insight of the paper comes in. We can substitute the Cayley map for the exponential map without too much effect on the stated problem (see Figure 1). With this substitution, our generative model for noisy rotations becomes

{\tilde{C}}_{m} = cay (ϕ_{m}^{\land}) C, ϕ_{m} \sim N (0, W_{m}^{- 1}),

(24)

and so our optimization problem can be restated as

\begin{matrix} \min & \sum_{m = 1}^{M} {cay}^{- 1} {(C {\tilde{C}}_{m}^{T})}^{\lor^{T}} W_{m} {cay}^{- 1} {(C {\tilde{C}}_{m}^{T})}^{\lor} \\ \text{w.r.t.} & C \\ \text{s.t.} & C \in S O (3) . \end{matrix}

(25)

Now our cost represents the negative log-likelihood of the joint distribution of the measurements, assuming each obeys (4). Turning this into a QCQP is then fairly easy:

\begin{matrix} \min & \sum_{m = 1}^{M} ϕ_{m}^{T} W_{m} ϕ_{m} \\ \text{w.r.t.} & c_{1}, c_{2}, c_{3}, ϕ_{m} \; (\forall m) \\ \text{s.t.} & c_{i}^{T} c_{j} = δ_{i j} \; (\forall i, j) \\ & (I - \frac{1}{2} ϕ_{m}^{\land}) c_{i} = (I + \frac{1}{2} ϕ_{m}^{\land}) {\tilde{c}}_{m, i} \; (\forall i, m), \end{matrix}

(26)

where δ_ij is the Kronecker delta and

C = [\begin{matrix} c_{1} & c_{2} & c_{3} \end{matrix}], {\tilde{C}}_{m} = [\begin{matrix} {\tilde{c}}_{m, 1} & {\tilde{c}}_{m, 2} & {\tilde{c}}_{m, 3} \end{matrix}] .

(27)

We have essentially introduced variables, ϕ_m, for the residual errors of each term in the cost and used these to connect C to each ${\tilde{C}}_{m}$ through the Cayley map; by bringing the inverse factor of the Cayley map to the other side, this becomes a quadratic constraint. Thus we have both a quadratic cost and quadratic constraints and hence a QCQP. The dimension of the problem is now higher since we must now optimize over C and all the ϕ_m; however, we can follow the approach of Section 2.3 to produce a SDP relaxation of the problem. Note, we have quietly dropped the det(C) = 1 constraint on the rotation and will simply check it at the end⁵; our optimization then only guarantees C ∈ O(3) not SO(3). We leave the details of manipulating (26) into the standard form of (20) to the reader. We did not find any redundant constraints were necessary to tighten this relaxation; the SDP solution produced remains rank 1 in practice for reasonably high amounts of noise.
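For completeness, the step from the residual definition $ϕ_{m} = {cay}^{- 1} {(C {\tilde{C}}_{m}^{T})}^{\lor}$ to the last constraint in (26) can be spelled out as follows; this short derivation uses only the definition (14a) and the orthogonality of ${\tilde{C}}_{m}$.

```latex
\begin{align}
C \tilde{C}_{m}^{T} &= \mathrm{cay}(\phi_{m}^{\wedge})
  = (I - \tfrac{1}{2} \phi_{m}^{\wedge})^{-1} (I + \tfrac{1}{2} \phi_{m}^{\wedge}) \\
\Rightarrow \quad (I - \tfrac{1}{2} \phi_{m}^{\wedge}) \, C
  &= (I + \tfrac{1}{2} \phi_{m}^{\wedge}) \, \tilde{C}_{m} .
\end{align}
```

The second line follows by left-multiplying by $(I - \frac{1}{2} ϕ_{m}^{\land})$ and right-multiplying by ${\tilde{C}}_{m}$, using ${\tilde{C}}_{m}^{T} {\tilde{C}}_{m} = I$. Reading off column i gives the constraint in (26). The only products of unknowns are between ϕ_m and c_i, so each scalar equation is quadratic, while the measured columns ${\tilde{c}}_{m, i}$ enter as known data.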

The details of a baseline local solver can be found in the appendix. For the global (SDP) solver we used cvx in Matlab with mosek (ApS, 2019). The solution costs of the global and local solvers agree to high precision⁶ if a good initial guess is given to the local solver. Figure 2 provides an example where the local solver converges from a poor initial guess to a local minimum, while the global solver finds the optimal solution near the groundtruth. Figure 3 provides a quantitative study of the tightness of the SDP solution with increasing measurement noise; we selected the measurement covariance as $W_{m}^{- 1} = σ^{2} I$ , with σ increasing. To gauge numerically whether the SDP solution, X, is rank 1, we define the logarithmic Singular Value Ratio (SVR) as the base-10 logarithm of the ratio of the largest to second-largest singular values of X; we consider a log SVR of at least 5 to represent rank 1. We see there is a large range for the noise over which the local solver can become trapped in a local minimum while the global solver remains rank 1. With M = 10 rotations to be averaged, the local solver took on average 0.0012s while the SDP solver took on average 0.3486s.

Figure 2.

Rotation Averaging: An example of noisy rotation averaging where the randomly initialized local solver (dotted) becomes trapped in a poor local minimum while the global solver (dashed) finds the correct global solution, which is closer to the groundtruth rotation (solid).

Figure 3.

Rotation Averaging: A quantitative evaluation of the tightness of the rotation averaging problem with increasing measurement noise level, σ. At each noise level, we conducted 1000 trials of averaging 10 noisy rotations. (left) We see that the local solver (randomly initialized) finds the global minimum with decreasing frequency (green) as the measurement noise is increased, while the SDP solver (blue) successfully produces rank-1 solutions (we consider log SVR of at least 5 to be rank 1) to much higher noise levels. For completeness, we also show how frequently the local solver converges to any minimum (red). (right) Boxplots of the log SVR of the SDP solution show that the global solution remains highly rank 1 over a wide range of measurement noise values.

3.2. Pose averaging

Pose averaging follows a very similar approach to the previous section.⁴ An optimization problem based on the Cayley map can be stated as

\begin{matrix} \min & \sum_{m = 1}^{M} {cay}^{- 1} {(T {\tilde{T}}_{m}^{- 1})}^{\lor^{T}} W_{m} {cay}^{- 1} {(T {\tilde{T}}_{m}^{- 1})}^{\lor} \\ \text{w.r.t.} & T \\ \text{s.t.} & T \in S E (3), \end{matrix}

(28)

where ${\tilde{T}}_{m}$ are noisy pose measurements with matrix weights, W_m.

We convert the residual pose errors, $ξ_{m} = {cay}^{- 1} {(T {\tilde{T}}_{m}^{- 1})}^{\lor}$ , to variables and now our optimization can be stated as

\begin{matrix} \min & \sum_{m = 1}^{M} ξ_{m}^{T} W_{m} ξ_{m} \\ \text{w.r.t.} & T, ξ_{m} \; (\forall m) \\ \text{s.t.} & C^{T} C = I \\ & (I - \frac{1}{2} ξ_{m}^{\land}) T = (I + \frac{1}{2} ξ_{m}^{\land}) {\tilde{T}}_{m} \; (\forall m), \end{matrix}

(29)

where we have again dropped the det(C) = 1 constraint. Since the bottom row of a transformation matrix is constant, we can parameterize it in the following way

\begin{align} T & = [\begin{matrix} C & r \\ 0^{T} & 1 \end{matrix}] = [\begin{matrix} c_{1} & c_{2} & c_{3} & r \\ 0 & 0 & 0 & 1 \end{matrix}], \end{align}

(30a)

\begin{align} {\tilde{T}}_{m} & = [\begin{matrix} {\tilde{C}}_{m} & {\tilde{r}}_{m} \\ 0^{T} & 1 \end{matrix}] = [\begin{matrix} {\tilde{c}}_{m, 1} & {\tilde{c}}_{m, 2} & {\tilde{c}}_{m, 3} & {\tilde{r}}_{m} \\ 0 & 0 & 0 & 1 \end{matrix}], \end{align}

(30b)

\begin{align} ξ_{m} & = [\begin{matrix} ρ_{m} \\ ϕ_{m} \end{matrix}], \end{align}

(30c)

and then rewrite the optimization problem using the reduced set of variables as

\begin{matrix} \min & \sum_{m = 1}^{M} ξ_{m}^{T} W_{m} ξ_{m} \\ \text{w.r.t.} & c_{i}, r, ρ_{m}, ϕ_{m} \; (\forall i, m) \\ \text{s.t.} & c_{i}^{T} c_{j} = δ_{i j} \; (\forall i, j) \\ & (I - \frac{1}{2} ϕ_{m}^{\land}) c_{i} = (I + \frac{1}{2} ϕ_{m}^{\land}) {\tilde{c}}_{m, i} \; (\forall i, m) \\ & (I - \frac{1}{2} ϕ_{m}^{\land}) r = (I + \frac{1}{2} ϕ_{m}^{\land}) {\tilde{r}}_{m} + ρ_{m} \; (\forall m) . \end{matrix}

(31)

This is now a QCQP, but unfortunately when we relax to a SDP, it is not always tight even for low-noise levels. We found that introducing specific redundant constraints for each m tightens the problem nicely for practical noise levels. One such useful constraint can be found by premultiplying the last constraint in (31) by r^T whereupon

r^{T} r - \frac{1}{2} \underbrace{r^{T} ϕ_{m}^{\land} r}_{0} = r^{T} {\tilde{r}}_{m} - \frac{1}{2} r^{T} {\tilde{r}}_{m}^{\land} ϕ_{m} + r^{T} ρ_{m} .

(32)

The key is that the cubic term (the second term on the left-hand side) vanishes because $ϕ_{m}^{\land}$ is skew-symmetric, leaving a new quadratic constraint that is not simply a trivial linear combination of the existing constraints (Yang and Carlone, 2023). This constraint is redundant because it does not restrict the original feasible set at all; in the lifted SDP space, however, it serves to restrict the feasible set and ultimately tighten the relaxation. Another useful redundant constraint can be formed by combining the last two of (31); the second last can be written as

\frac{1}{2} {(c_{i} + {\tilde{c}}_{m, i})}^{T} ϕ_{m}^{\land} = - {(c_{i} - {\tilde{c}}_{m, i})}^{T},

(33)

while the last can be premultiplied by ${(c_{i} + {\tilde{c}}_{m, i})}^{T}$ and written as

\begin{align} {(c_{i} + {\tilde{c}}_{m, i})}^{T} (r - {\tilde{r}}_{m}) = \underbrace{\frac{1}{2} {(c_{i} + {\tilde{c}}_{m, i})}^{T} ϕ_{m}^{\land}}_{- {(c_{i} - {\tilde{c}}_{m, i})}^{T}} (r + {\tilde{r}}_{m}) \\ + {(c_{i} + {\tilde{c}}_{m, i})}^{T} ρ_{m} . \end{align}

(34)

After performing the indicated substitution, this becomes

\frac{1}{2} {(c_{i} + {\tilde{c}}_{m, i})}^{T} ρ_{m} = c_{i}^{T} r - {\tilde{c}}_{m, i}^{T} {\tilde{r}}_{m},

(35)

which is once again a quadratic constraint.

Summarizing, the following QCQP offers a reasonably tight SDP relaxation in practice:

\begin{matrix} \min & \sum_{m = 1}^{M} ξ_{m}^{T} W_{m} ξ_{m} \\ \text{w.r.t.} & c_{i}, r, ρ_{m}, ϕ_{m} \; (\forall i, m) \\ \text{s.t.} & c_{i}^{T} c_{j} = δ_{i j} \; (\forall i, j) \\ & (I - \frac{1}{2} ϕ_{m}^{\land}) c_{i} = (I + \frac{1}{2} ϕ_{m}^{\land}) {\tilde{c}}_{m, i} \; (\forall i, m) \\ & (I - \frac{1}{2} ϕ_{m}^{\land}) r = (I + \frac{1}{2} ϕ_{m}^{\land}) {\tilde{r}}_{m} + ρ_{m} \; (\forall m) \\ \text{(red.)} & \frac{1}{2} {(c_{i} + {\tilde{c}}_{m, i})}^{T} ρ_{m} = c_{i}^{T} r - {\tilde{c}}_{m, i}^{T} {\tilde{r}}_{m} \; (\forall i, m) \\ & r^{T} r = r^{T} {\tilde{r}}_{m} - \frac{1}{2} r^{T} {\tilde{r}}_{m}^{\land} ϕ_{m} + r^{T} ρ_{m} \; (\forall m) . \end{matrix}

(36)

We leave it to the reader to manipulate this into the standard form of (21).

The appendix again provides a baseline local solver for this problem. For the global (SDP) solver we used cvx in Matlab with mosek (ApS, 2019). The solution costs of the global and local solvers agree to high precision if a good initial guess is given to the local solver. Figure 4 provides examples of the local solver becoming trapped in poor local minima while the global solver converges to the correct minimum near the groundtruth. Figure 5 provides a quantitative study of the tightness of the SDP solution with increasing measurement noise; we selected the measurement covariance as $W_{m}^{- 1} = σ^{2} I$ , with σ increasing. We again see there is a large range of noise levels over which the local solver can become trapped in a local minimum while the global solver remains tight. With M = 10 poses to be averaged, the local solver took on average 0.0064s while the SDP solver took on average 0.5944s.

Figure 4.

Pose Averaging: Four examples of noisy pose averaging where the randomly initialized local solver (dotted) becomes trapped in a poor local minimum while the global solver (dashed) finds the correct solution, which is closer to the groundtruth pose (solid). The noisy pose measurements being averaged are shown in grey.

Figure 5.

Pose Averaging: A quantitative evaluation of the tightness of the pose averaging problem with increasing measurement noise level, σ. At each noise, we conducted 1000 trials of averaging 10 noisy poses. (left) We see that the local solver (randomly initialized) finds the global minimum with decreasing frequency (green) as the measurement noise is increased, while the SDP solver (blue) successfully produces rank-1 solutions (we consider log SVR of at least 5 to be rank 1) to much higher noise levels. For completeness, we also show how frequently the local solver converges to any minimum (red). (right) Boxplots of the log SVR of the SDP solution show that the global solution remains highly rank 1 over a wide range of measurement noise values.
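The rank-1 test used in these plots can be sketched as follows. This is an illustrative example only: it assumes NumPy, assumes that "log SVR" denotes the base-10 logarithm of the ratio between the largest and second-largest singular values of the SDP solution matrix, and uses a hypothetical function name and synthetic matrices in place of a real solver output.

```python
import numpy as np

def log_svr(X):
    """log10 of the ratio of the largest to the second-largest singular value of X."""
    s = np.linalg.svd(X, compute_uv=False)  # returned in descending order
    return np.log10(s[0] / max(s[1], np.finfo(float).tiny))

x = np.array([1.0, 0.5, -2.0, 0.3])
X_tight = np.outer(x, x) + 1e-9 * np.eye(4)  # numerically rank 1
X_loose = np.outer(x, x) + 0.5 * np.eye(4)   # clearly higher rank

# Threshold of 5 as stated in the caption.
is_tight = log_svr(X_tight) >= 5.0
is_loose = log_svr(X_loose) >= 5.0
```

With this threshold, the first matrix is accepted as rank 1 and the second is rejected; in practice `X` would be the matrix returned by the SDP solver.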

To justify the need for the redundant constraints, we conducted an ablation study (see Figure 6) wherein we varied the number of redundant constraints. The study shows that with more redundant constraints, we can tolerate a higher level of measurement noise while keeping the SDP tight. We always included the last redundant constraint in (36), as this ensures that the search space for the SDP remains compact⁷ and that the problem is therefore well posed. In the interest of space, we forgo similar ablation studies for the subsequent problems (discrete-time and continuous-time trajectory estimation), which reuse the pose averaging redundant constraints and then build on top of them. The trend is similar in that the more redundant constraints we add, the larger the noise region over which we empirically obtain rank-1 SDP solutions.

Figure 6.

Pose Averaging Ablation Study: Here we show the effect on SDP tightness of varying the number of redundant constraints in the pose averaging problem. The rightmost column shows our full set of recommended redundant constraints with the light-grey box indicating the region of measurement noise for which our problem can be deemed tight. The same grey box is shown in the other columns for reference, indicating that including fewer redundant constraints results in a lower level of noise for which we can keep the solution tight.

4. Discrete-time trajectory estimation

Our next problem is to consider estimation of a trajectory of K poses, T_k, where we have noisy measurements of each pose, ${\tilde{T}}_{k}$ , as well as noisy relative measurements, ${\tilde{T}}_{k + 1, k}$ , from one pose to the next. The optimization problem that we want to solve is

\begin{matrix} \min & \sum_{k = 1}^{K} {cay}^{- 1} {(T_{k} {\tilde{T}}_{k}^{- 1})}^{\lor^{T}} W_{k} {cay}^{- 1} {(T_{k} {\tilde{T}}_{k}^{- 1})}^{\lor} \\ & + \sum_{k = 1}^{K - 1} {cay}^{- 1} {(T_{k + 1} T_{k}^{- 1} {\tilde{T}}_{k + 1, k}^{- 1})}^{\lor^{T}} \\ & \times W_{k + 1, k} {cay}^{- 1} {(T_{k + 1} T_{k}^{- 1} {\tilde{T}}_{k + 1, k}^{- 1})}^{\lor} \\ \text{w.r.t.} & T_{k} \; (\forall k) \\ \text{s.t.} & T_{k} \in SE(3) \; (\forall k), \end{matrix}

(37)

for some weight matrices, $W_{k}$ and $W_{k + 1, k}$. Figure 7 depicts the estimation problem as a factor graph. Similarly to the pose averaging problem, we introduce new optimization variables for the residual errors:

\begin{align} ξ_{k} & = {cay}^{- 1} {(T_{k} {\tilde{T}}_{k}^{- 1})}^{\lor}, \end{align}

(38a)

\begin{align} ξ_{k + 1, k} & = {cay}^{- 1} {(T_{k + 1} T_{k}^{- 1} {\tilde{T}}_{k + 1, k}^{- 1})}^{\lor}, \end{align}

(38b)

Figure 7.

Discrete-time Trajectory Estimation: Factor graph representation of the discrete-time estimation problem. Each black dot represents one of the error terms in the cost function of (37).

so that the optimization problem can be stated as a QCQP:

\begin{matrix} \min & \sum_{k = 1}^{K} ξ_{k}^{T} W_{k} ξ_{k} + \sum_{k = 1}^{K - 1} ξ_{k + 1, k}^{T} W_{k + 1, k} ξ_{k + 1, k} \\ \text{w.r.t.} & T_{k}, ξ_{k}, ξ_{k + 1, k} \; (\forall k) \\ \text{s.t.} & C_{k}^{T} C_{k} = I \; (\forall k) \\ & (I - \frac{1}{2} ξ_{k}^{\land}) T_{k} = (I + \frac{1}{2} ξ_{k}^{\land}) {\tilde{T}}_{k} \; (\forall k) \\ & (I - \frac{1}{2} ξ_{k + 1, k}^{\land}) T_{k + 1} = (I + \frac{1}{2} ξ_{k + 1, k}^{\land}) {\tilde{T}}_{k + 1, k} T_{k} \; (\forall k), \end{matrix}

(39)

where the det(C_k) = 1 constraints have been dropped. Decomposing the matrices according to

\begin{align} T_{k} = [\begin{matrix} C_{k} & r_{k} \\ 0^{T} & 1 \end{matrix}] = [\begin{matrix} c_{k, 1} & c_{k, 2} & c_{k, 3} & r_{k} \\ 0 & 0 & 0 & 1 \end{matrix}], \end{align}

(40a)

\begin{align} {\tilde{T}}_{k} = [\begin{matrix} {\tilde{C}}_{k} & {\tilde{r}}_{k} \\ 0^{T} & 1 \end{matrix}] = [\begin{matrix} {\tilde{c}}_{k, 1} & {\tilde{c}}_{k, 2} & {\tilde{c}}_{k, 3} & {\tilde{r}}_{k} \\ 0 & 0 & 0 & 1 \end{matrix}], \end{align}

(40b)

\begin{align} {\tilde{T}}_{k + 1, k} = [\begin{matrix} {\tilde{C}}_{k + 1, k} & {\tilde{r}}_{k + 1, k} \\ 0^{T} & 1 \end{matrix}] \end{align}

(40c)

\begin{align} = [\begin{matrix} {\tilde{c}}_{k + 1, k, 1} & {\tilde{c}}_{k + 1, k, 2} & {\tilde{c}}_{k + 1, k, 3} & {\tilde{r}}_{k + 1, k} \\ 0 & 0 & 0 & 1 \end{matrix}], \end{align}

(40d)

\begin{align} ξ_{k} = [\begin{matrix} ρ_{k} \\ ϕ_{k} \end{matrix}], ξ_{k + 1, k} = [\begin{matrix} ρ_{k + 1, k} \\ ϕ_{k + 1, k} \end{matrix}], \end{align}

(40e)

the QCQP optimization problem can be rewritten compactly as

\begin{matrix} \min & \sum_{k = 1}^{K} ξ_{k}^{T} W_{k} ξ_{k} + \sum_{k = 1}^{K - 1} ξ_{k + 1, k}^{T} W_{k + 1, k} ξ_{k + 1, k} \\ \text{w.r.t.} & c_{k, i}, r_{k}, ρ_{k}, ϕ_{k}, ρ_{k + 1, k}, ϕ_{k + 1, k} \; (\forall i, k) \\ \text{s.t.} & c_{k, i}^{T} c_{k, j} = δ_{i j} \; (\forall i, j, k) \\ & (I - \frac{1}{2} ϕ_{k}^{\land}) c_{k, i} = (I + \frac{1}{2} ϕ_{k}^{\land}) {\tilde{c}}_{k, i} \; (\forall i, k) \\ & (I - \frac{1}{2} ϕ_{k}^{\land}) r_{k} = (I + \frac{1}{2} ϕ_{k}^{\land}) {\tilde{r}}_{k} + ρ_{k} \; (\forall k) \\ & (I - \frac{1}{2} ϕ_{k + 1, k}^{\land}) c_{k + 1, i} = (I + \frac{1}{2} ϕ_{k + 1, k}^{\land}) {\tilde{C}}_{k + 1, k} c_{k, i} \; (\forall i, k) \\ & (I - \frac{1}{2} ϕ_{k + 1, k}^{\land}) r_{k + 1} = (I + \frac{1}{2} ϕ_{k + 1, k}^{\land}) ({\tilde{C}}_{k + 1, k} r_{k} + {\tilde{r}}_{k + 1, k}) + ρ_{k + 1, k} \; (\forall k) . \end{matrix}

(41)
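The step from the 4 × 4 pose constraint in (39) to the block constraints in (41) can be checked numerically. The sketch below is illustrative only: it assumes NumPy, uses hypothetical names (`hat3`, `hat6`), and assumes the usual convention that $ξ^{\land}$ places $ϕ^{\land}$ in the top-left block and $ρ$ in the top-right column, consistent with the ordering $ξ = [ρ; ϕ]$ in (40e). Under that convention the rotation block reads $(I - \frac{1}{2} ϕ_{k}^{\land}) C_{k} = (I + \frac{1}{2} ϕ_{k}^{\land}) {\tilde{C}}_{k}$.

```python
import numpy as np

def hat3(v):
    """3x3 skew-symmetric matrix of a 3-vector."""
    x, y, z = v
    return np.array([[0.0, -z, y], [z, 0.0, -x], [-y, x, 0.0]])

def hat6(xi):
    """4x4 matrix for xi = [rho; phi] (assumed convention, see lead-in)."""
    out = np.zeros((4, 4))
    out[:3, :3] = hat3(xi[3:])
    out[:3, 3] = xi[:3]
    return out

rng = np.random.default_rng(3)
xi = 0.5 * rng.standard_normal(6)
rho, phi = xi[:3], xi[3:]

# A measured pose T~ built from an arbitrary rotation and translation.
a = np.array([0.3, -0.2, 0.1])
C_meas = np.linalg.solve(np.eye(3) - hat3(a), np.eye(3) + hat3(a))
r_meas = np.array([1.0, 2.0, 3.0])
T_meas = np.eye(4)
T_meas[:3, :3] = C_meas
T_meas[:3, 3] = r_meas

# Solve the 4x4 constraint (I - xi^/2) T = (I + xi^/2) T~ for T.
I4 = np.eye(4)
T = np.linalg.solve(I4 - 0.5 * hat6(xi), (I4 + 0.5 * hat6(xi)) @ T_meas)
C, r = T[:3, :3], T[:3, 3]

# Residuals of the decomposed rotation and translation constraints.
A_minus = np.eye(3) - 0.5 * hat3(phi)
A_plus = np.eye(3) + 0.5 * hat3(phi)
rot_residual = A_minus @ C - A_plus @ C_meas
trans_residual = A_minus @ r - (A_plus @ r_meas + rho)
```

The recovered `T` also keeps the bottom row `[0, 0, 0, 1]` and an orthogonal `C` with unit determinant, so a point constructed this way is a valid pose.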

Similarly to the pose averaging problem, if we convert this QCQP to an SDP, it is not always tight, even at low noise levels. We need to include some redundant constraints to improve tightness. For each of the $ξ_{k}$ variables, we can create copies of the redundant constraints required in the pose averaging problem. However, this is still not enough; we require some additional constraints involving the $ξ_{k + 1, k}$ variables.

Such additional redundant constraints can be formed by combining the last two of (41); the second last can be written as

\begin{align} \frac{1}{2} {(c_{k + 1, i} + {\tilde{C}}_{k + 1, k} c_{k, i})}^{T} ϕ_{k + 1, k}^{\land} \\ = - {(c_{k + 1, i} - {\tilde{C}}_{k + 1, k} c_{k, i})}^{T}, \end{align}

(42)

while the last can be premultiplied by ${(c_{k + 1, i} + {\tilde{C}}_{k + 1, k} c_{k, i})}^{T}$ and written as

\begin{align} {(c_{k + 1, i} + {\tilde{C}}_{k + 1, k} c_{k, i})}^{T} (r_{k + 1} - {\tilde{C}}_{k + 1, k} r_{k} - {\tilde{r}}_{k + 1, k}) \\ = \underbrace{\frac{1}{2} {(c_{k + 1, i} + {\tilde{C}}_{k + 1, k} c_{k, i})}^{T} ϕ_{k + 1, k}^{\land}}_{- {(c_{k + 1, i} - {\tilde{C}}_{k + 1, k} c_{k, i})}^{T}} \\ \times (r_{k + 1} + {\tilde{C}}_{k + 1, k} r_{k} + {\tilde{r}}_{k + 1, k}) \\ + {(c_{k + 1, i} + {\tilde{C}}_{k + 1, k} c_{k, i})}^{T} ρ_{k + 1, k} . \end{align}

(43)

After performing the indicated substitution, this becomes

\begin{align} \frac{1}{2} {(c_{k + 1, i} + {\tilde{C}}_{k + 1, k} c_{k, i})}^{T} ρ_{k + 1, k} \\ = c_{k + 1, i}^{T} r_{k + 1} - c_{k, i}^{T} (r_{k} + {\tilde{C}}_{k + 1, k}^{T} {\tilde{r}}_{k + 1, k}), \end{align}

(44)

which is once again a quadratic constraint.
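The relative-pose redundant constraint (44) can be checked in the same spirit. This is a hedged sketch, assuming NumPy and hypothetical variable names; it builds a feasible pair of consecutive poses from the Cayley-form constraints, with the rotation constraint taken as $(I - \frac{1}{2} ϕ_{k + 1, k}^{\land}) c_{k + 1, i} = (I + \frac{1}{2} ϕ_{k + 1, k}^{\land}) {\tilde{C}}_{k + 1, k} c_{k, i}$ as implied by (42).

```python
import numpy as np

def hat(v):
    """Skew-symmetric matrix such that hat(v) @ w equals np.cross(v, w)."""
    x, y, z = v
    return np.array([[0.0, -z, y], [z, 0.0, -x], [-y, x, 0.0]])

def cayley(phi):
    """Rotation matrix (I - phi^/2)^{-1} (I + phi^/2)."""
    return np.linalg.solve(np.eye(3) - 0.5 * hat(phi), np.eye(3) + 0.5 * hat(phi))

rng = np.random.default_rng(4)
C_k = cayley(rng.standard_normal(3))     # pose k: rotation
r_k = rng.standard_normal(3)             # pose k: translation
C_rel = cayley(rng.standard_normal(3))   # relative measurement C~_{k+1,k}
r_rel = rng.standard_normal(3)           # relative measurement r~_{k+1,k}
phi = rng.standard_normal(3)             # residual phi_{k+1,k}
rho = rng.standard_normal(3)             # residual rho_{k+1,k}

A_minus = np.eye(3) - 0.5 * hat(phi)
A_plus = np.eye(3) + 0.5 * hat(phi)

# Pose k+1 that satisfies the two relative-pose constraints exactly.
C_k1 = np.linalg.solve(A_minus, A_plus @ C_rel @ C_k)
r_k1 = np.linalg.solve(A_minus, A_plus @ (C_rel @ r_k + r_rel) + rho)

# Residual of (44) for each column i.
residual_44 = np.array([
    0.5 * (C_k1[:, i] + C_rel @ C_k[:, i]) @ rho
    - (C_k1[:, i] @ r_k1 - C_k[:, i] @ (r_k + C_rel.T @ r_rel))
    for i in range(3)
])
```

The residual should be zero up to round-off for every column, independent of the random draws.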

Summarizing, the following QCQP offers a reasonably tight SDP relaxation in practice:

\begin{matrix} \min & \sum_{k = 1}^{K} ξ_{k}^{T} W_{k} ξ_{k} + \sum_{k = 1}^{K - 1} ξ_{k + 1, k}^{T} W_{k + 1, k} ξ_{k + 1, k} \\ \text{w.r.t.} & c_{k, i}, r_{k}, ρ_{k}, ϕ_{k}, ρ_{k + 1, k}, ϕ_{k + 1, k} \; (\forall i, k) \\ \text{s.t.} & c_{k, i}^{T} c_{k, j} = δ_{i j} \; (\forall i, j, k) \\ & (I - \frac{1}{2} ϕ_{k}^{\land}) c_{k, i} = (I + \frac{1}{2} ϕ_{k}^{\land}) {\tilde{c}}_{k, i} \; (\forall i, k) \\ & (I - \frac{1}{2} ϕ_{k}^{\land}) r_{k} = (I + \frac{1}{2} ϕ_{k}^{\land}) {\tilde{r}}_{k} + ρ_{k} \; (\forall k) \\ & (I - \frac{1}{2} ϕ_{k + 1, k}^{\land}) c_{k + 1, i} = (I + \frac{1}{2} ϕ_{k + 1, k}^{\land}) {\tilde{C}}_{k + 1, k} c_{k, i} \; (\forall i, k) \\ & (I - \frac{1}{2} ϕ_{k + 1, k}^{\land}) r_{k + 1} = (I + \frac{1}{2} ϕ_{k + 1, k}^{\land}) ({\tilde{C}}_{k + 1, k} r_{k} + {\tilde{r}}_{k + 1, k}) + ρ_{k + 1, k} \; (\forall k) \\ \text{(red.)} & \frac{1}{2} {(c_{k, i} + {\tilde{c}}_{k, i})}^{T} ρ_{k} = c_{k, i}^{T} r_{k} - {\tilde{c}}_{k, i}^{T} {\tilde{r}}_{k} \; (\forall i, k) \\ & r_{k}^{T} r_{k} = r_{k}^{T} {\tilde{r}}_{k} - \frac{1}{2} r_{k}^{T} {\tilde{r}}_{k}^{\land} ϕ_{k} + r_{k}^{T} ρ_{k} \; (\forall k) \\ & \frac{1}{2} {(c_{k + 1, i} + {\tilde{C}}_{k + 1, k} c_{k, i})}^{T} ρ_{k + 1, k} = c_{k + 1, i}^{T} r_{k + 1} - c_{k, i}^{T} (r_{k} + {\tilde{C}}_{k + 1, k}^{T} {\tilde{r}}_{k + 1, k}) \; (\forall i, k) . \end{matrix}

(45)

We again leave it to the reader to manipulate this into the standard form of (21).

The appendix provides a baseline local solver for this problem. For the global (SDP) solver we used cvx in Matlab with mosek (ApS, 2019). The solution costs of the global and local solvers agree to high precision if a good initial guess is given to the local solver. Figure 8 provides examples of the local solver becoming trapped in poor local minima while the global solver converges to the correct minimum near the groundtruth. Figure 9 provides a quantitative study of the tightness of the SDP solution with increasing measurement noise; we selected the measurement covariances as $W_{k}^{- 1} = W_{k + 1, k}^{- 1} = σ^{2} I$ , with σ increasing. We again see there is a large range of noise levels over which the local solver can become trapped in a local minimum while the global solver remains tight; in fact, even at very low noise levels it is quite easy for the local solver to become trapped. With K = 20 poses in the trajectory, the local solver took on average 0.1574s while the SDP solver took on average 14.32s.

Figure 8.

Discrete-time Trajectory Estimation: Two examples of discrete-time trajectory estimation where the randomly initialized local solver (red) becomes trapped in a poor local minimum while the global solver (green) finds the correct solution, which is closer to the groundtruth (blue). The noisy pose measurements are also shown (grey). It is interesting to note that the poor local solver solutions are twisted around the groundtruth.

Figure 9.

Discrete-time Trajectory Estimation: A quantitative evaluation of the tightness of the discrete-time trajectory estimation problem with increasing measurement noise, σ. At each noise level, we conducted 100 trials with the geometry of the trajectory as in the left example of Figure 8. (left) We see that the local solver (randomly initialized) finds the global minimum with decreasing frequency (green) as the measurement noise is increased, while the SDP solver (blue) successfully produces rank-1 solutions (we consider log SVR of at least 5 to be rank 1) to much higher noise levels. For completeness, we also show how frequently the local solver converges to any minimum (red). (right) Boxplots of the log SVR of the SDP solution show that the global solution remains highly rank 1 over a wide range of measurement noise values.

5. Continuous-time trajectory estimation

Finally, we consider so-called continuous-time trajectory estimation. Continuous-time methods come in parametric (Furgale et al., 2012) and nonparametric (Anderson and Barfoot, 2015; Barfoot et al., 2014) varieties; here we will discuss the latter. We consider a continuous-time Gaussian Process (GP) prior over the trajectory known as White Noise on Acceleration (WNOA); this serves to smooth the trajectory and is fused with pose measurements provided at discrete times. We will still ultimately have to discretize the trajectory for the purpose of estimation and so will have K states comprising both pose and generalized velocity (a.k.a., twist), $\{T_{k}, ϖ_{k}\}$ . Figure 10 depicts the situation as a factor graph. In practice, we may not actually have pose measurements at every time at which we introduce a state variable.

Figure 10.

Factor graph representation of the continuous-time estimation problem. Each black dot represents one of the error terms in the cost function of (46).

The optimization problem that we want to solve in this case is

\begin{matrix} \min & \sum_{k = 1}^{K} {cay}^{- 1} {(T_{k} {\tilde{T}}_{k}^{- 1})}^{\lor^{T}} W_{k} {cay}^{- 1} {(T_{k} {\tilde{T}}_{k}^{- 1})}^{\lor} \\ & + {({\overset{ˇ}{ϖ}}_{1} - ϖ_{1})}^{T} Q_{1}^{- 1} ({\overset{ˇ}{ϖ}}_{1} - ϖ_{1}) \\ & + \sum_{k = 1}^{K - 1} e_{k + 1, k}^{T} Q_{k + 1, k}^{- 1} e_{k + 1, k} \\ \text{w.r.t.} & T_{k}, ϖ_{k} \; (\forall k) \\ \text{s.t.} & T_{k} \in SE(3) \; (\forall k), \end{matrix}

(46)

for some weight matrices, W_k, and

\begin{align} e_{k + 1, k} = [\begin{matrix} (t_{k + 1} - t_{k}) ϖ_{k} - {cay}^{- 1} {(T_{k + 1} T_{k}^{- 1})}^{\lor} \\ ϖ_{k} - ϖ_{k + 1} \end{matrix}], \end{align}

(47a)

\begin{align} Q_{k + 1, k} = [\begin{matrix} \frac{1}{3} {(t_{k + 1} - t_{k})}^{3} Q_{c} & \frac{1}{2} {(t_{k + 1} - t_{k})}^{2} Q_{c} \\ \frac{1}{2} {(t_{k + 1} - t_{k})}^{2} Q_{c} & (t_{k + 1} - t_{k}) Q_{c} \end{matrix}] . \end{align}

(47b)

The $t_{k}$ are known timestamps of the states, $Q_{c}$ is a power-spectral density matrix affecting smoothness of the GP prior, and ${\overset{ˇ}{ϖ}}_{1}$ together with $Q_{1}$ represent a Gaussian prior on the initial generalized velocity. The GP prior defined by (47a) is similar in spirit to the one first defined by Anderson and Barfoot (2015), only now adapted to work with the Cayley map. Looking at $e_{k + 1, k}$, the first row encourages the change in pose from one time to the next to be similar to the generalized velocity multiplied by the change in time; the second row encourages the generalized velocity to remain constant over time (i.e., no acceleration). The process noise covariance, $Q_{k + 1, k}$, comes from integrating the WNOA prior over the same time interval (Barfoot, 2024; Barfoot et al., 2014).
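To make (47a) and (47b) concrete, the following sketch assembles the process noise covariance and the prior error for a single time step. It is an illustration under stated assumptions, not the authors' code: it assumes NumPy, the function names are hypothetical, the numerical values are arbitrary, and the vector `xi_k1_k` stands in for ${cay}^{- 1} {(T_{k + 1} T_{k}^{- 1})}^{\lor}$ rather than being computed from poses.

```python
import numpy as np

def wnoa_process_noise(dt, Qc):
    """Q_{k+1,k} of (47b) for time step dt and power-spectral density Qc (6x6)."""
    time_block = np.array([[dt**3 / 3.0, dt**2 / 2.0],
                           [dt**2 / 2.0, dt]])
    # The Kronecker product scales Qc by each entry of the 2x2 time block.
    return np.kron(time_block, Qc)

def wnoa_error(dt, varpi_k, varpi_k1, xi_k1_k):
    """e_{k+1,k} of (47a); xi_k1_k stands in for cay^{-1}(T_{k+1} T_k^{-1})^vee."""
    return np.concatenate([dt * varpi_k - xi_k1_k, varpi_k - varpi_k1])

dt = 0.1
Qc = np.diag([1.0, 1.0, 1.0, 0.1, 0.1, 0.1])  # illustrative values only
Q = wnoa_process_noise(dt, Qc)

# Constant generalized velocity with a matching pose change gives zero error.
varpi = np.array([1.0, 0.0, 0.0, 0.0, 0.0, 0.2])
e = wnoa_error(dt, varpi, varpi, dt * varpi)
cost = e @ np.linalg.solve(Q, e)  # one term of the last sum in (46)
```

Solving with `Q` instead of forming its inverse explicitly is a common numerical choice; the cost term is the same either way.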

Similarly to the discrete-time trajectory estimation case, we introduce the following substitution variables⁸:

\begin{align} ξ_{k} & = {cay}^{- 1} {(T_{k} {\tilde{T}}_{k}^{- 1})}^{\lor}, \end{align}

(48a)

\begin{align} ξ_{k + 1, k} & = {cay}^{- 1} {(T_{k + 1} T_{k}^{- 1})}^{\lor}, \end{align}

(48b)

so that the optimization problem can be stated as a QCQP:

\begin{matrix} \min & \sum_{k = 1}^{K} ξ_{k}^{T} W_{k} ξ_{k} + {({\overset{ˇ}{ϖ}}_{1} - ϖ_{1})}^{T} Q_{1}^{- 1} ({\overset{ˇ}{ϖ}}_{1} - ϖ_{1}) \\ & + \sum_{k = 1}^{K - 1} {[\begin{matrix} (t_{k + 1} - t_{k}) ϖ_{k} - ξ_{k + 1, k} \\ ϖ_{k} - ϖ_{k + 1} \end{matrix}]}^{T} \\ & \times Q_{k + 1, k}^{- 1} [\begin{matrix} (t_{k + 1} - t_{k}) ϖ_{k} - ξ_{k + 1, k} \\ ϖ_{k} - ϖ_{k + 1} \end{matrix}] \\ \text{w.r.t.} & T_{k}, ϖ_{k}, ξ_{k}, ξ_{k + 1, k} \; (\forall k) \\ \text{s.t.} & C_{k}^{T} C_{k} = I \; (\forall k) \\ & (I - \frac{1}{2} ξ_{k}^{\land}) T_{k} = (I + \frac{1}{2} ξ_{k}^{\land}) {\tilde{T}}_{k} \; (\forall k) \\ & (I - \frac{1}{2} ξ_{k + 1, k}^{\land}) T_{k + 1} = (I + \frac{1}{2} ξ_{k + 1, k}^{\land}) T_{k} \; (\forall k), \end{matrix}

(49)

where the det(C_k) = 1 constraints have been dropped. Decomposing the matrices according to

\begin{align} T_{k} = [\begin{matrix} C_{k} & r_{k} \\ 0^{T} & 1 \end{matrix}] = [\begin{matrix} c_{k, 1} & c_{k, 2} & c_{k, 3} & r_{k} \\ 0 & 0 & 0 & 1 \end{matrix}], \end{align}

(50a)

\begin{align} {\tilde{T}}_{k} = [\begin{matrix} {\tilde{C}}_{k} & {\tilde{r}}_{k} \\ 0^{T} & 1 \end{matrix}] = [\begin{matrix} {\tilde{c}}_{k, 1} & {\tilde{c}}_{k, 2} & {\tilde{c}}_{k, 3} & {\tilde{r}}_{k} \\ 0 & 0 & 0 & 1 \end{matrix}], \end{align}

(50b)

\begin{align} ξ_{k} = [\begin{matrix} ρ_{k} \\ ϕ_{k} \end{matrix}], ξ_{k + 1, k} = [\begin{matrix} ρ_{k + 1, k} \\ ϕ_{k + 1, k} \end{matrix}], \end{align}

(50c)

the QCQP optimization problem can be rewritten compactly as

\begin{matrix} \min & \sum_{k = 1}^{K} ξ_{k}^{T} W_{k} ξ_{k} + {({\overset{ˇ}{ϖ}}_{1} - ϖ_{1})}^{T} Q_{1}^{- 1} ({\overset{ˇ}{ϖ}}_{1} - ϖ_{1}) \\ & + \sum_{k = 1}^{K - 1} {[\begin{matrix} (t_{k + 1} - t_{k}) ϖ_{k} - ξ_{k + 1, k} \\ ϖ_{k} - ϖ_{k + 1} \end{matrix}]}^{T} \\ & \times Q_{k + 1, k}^{- 1} [\begin{matrix} (t_{k + 1} - t_{k}) ϖ_{k} - ξ_{k + 1, k} \\ ϖ_{k} - ϖ_{k + 1} \end{matrix}] \\ \text{w.r.t.} & c_{k, i}, r_{k}, ρ_{k}, ϕ_{k}, ρ_{k + 1, k}, ϕ_{k + 1, k}, ϖ_{k} \; (\forall i, k) \\ \text{s.t.} & c_{k, i}^{T} c_{k, j} = δ_{i j} \; (\forall i, j, k) \\ & (I - \frac{1}{2} ϕ_{k}^{\land}) c_{k, i} = (I + \frac{1}{2} ϕ_{k}^{\land}) {\tilde{c}}_{k, i} \; (\forall i, k) \\ & (I - \frac{1}{2} ϕ_{k}^{\land}) r_{k} = (I + \frac{1}{2} ϕ_{k}^{\land}) {\tilde{r}}_{k} + ρ_{k} \; (\forall k) \\ & (I - \frac{1}{2} ϕ_{k + 1, k}^{\land}) c_{k + 1, i} = (I + \frac{1}{2} ϕ_{k + 1, k}^{\land}) c_{k, i} \; (\forall i, k) \\ & (I - \frac{1}{2} ϕ_{k + 1, k}^{\land}) r_{k + 1} = (I + \frac{1}{2} ϕ_{k + 1, k}^{\land}) r_{k} + ρ_{k + 1, k} \; (\forall k) . \end{matrix}

(51)

Similarly to the problems above, if we convert this QCQP to an SDP, it is not tight, even at low noise levels. We need to include some redundant constraints to improve tightness. For each of the $ξ_{k}$ and $ξ_{k + 1, k}$ variables, we can create copies of the redundant constraints required in the discrete-time trajectory estimation problem. However, this is still not always enough to tighten the problem, particularly when we do not have a pose measurement at every state time (see the comment below regarding sparser measurement graphs).

We can generate some additional redundant constraints fairly easily. First, we can premultiply the second constraint of (51) by $ϕ_{k}^{T}$ so that

\begin{align} ϕ_{k}^{T} (I - \frac{1}{2} ϕ_{k}^{\land}) c_{k, i} = ϕ_{k}^{T} (I - \frac{1}{2} ϕ_{k}^{\land}) {\tilde{c}}_{k, i} \\ \Rightarrow ϕ_{k}^{T} c_{k, i} = ϕ_{k}^{T} {\tilde{c}}_{k, i}, \end{align}

(52)

where we have used that $ϕ_{k}^{T} ϕ_{k}^{\land} = 0^{T}$. Similarly, premultiplying the third constraint in (51) by $ϕ_{k}^{T}$ results in

ϕ_{k}^{T} r_{k} = ϕ_{k}^{T} {\tilde{r}}_{k} + ϕ_{k}^{T} ρ_{k} .

(53)

Premultiplying the fourth and fifth constraints in (51) by $ϕ_{k + 1, k}^{T}$ results in

\begin{align} ϕ_{k + 1, k}^{T} c_{k + 1, i} = ϕ_{k + 1, k}^{T} c_{k, i}, \end{align}

(54a)

\begin{align} ϕ_{k + 1, k}^{T} r_{k + 1} = ϕ_{k + 1, k}^{T} r_{k} + ϕ_{k + 1, k}^{T} ρ_{k + 1, k} . \end{align}

(54b)

Next, we can exploit the fact that columns of a rotation matrix satisfy $c_{ℓ}^{\land} c_{m} = c_{n}$ where $ℓ m n \in \{123,231,312\}$ . If we premultiply the second constraint of (51) by $c_{k, m}^{T}$ we have

\begin{align} c_{k, m}^{T} (I - \frac{1}{2} ϕ_{k}^{\land}) c_{k, ℓ} = c_{k, m}^{T} (I - \frac{1}{2} ϕ_{k}^{\land}) {\tilde{c}}_{k, ℓ} \\ \Rightarrow c_{k, m}^{T} c_{k, ℓ} - \frac{1}{2} ϕ_{k}^{T} c_{k, n} = c_{k, m}^{T} {\tilde{c}}_{k, ℓ} - \frac{1}{2} c_{k, m}^{T} {\tilde{c}}_{k, ℓ}^{\land} ϕ_{k}, \end{align}

(55)

which is still a quadratic constraint. Finally, if we premultiply the last constraint of (51) by ${(r_{k + 1} + r_{k})}^{T}$, this results in

r_{k + 1}^{T} r_{k + 1} = r_{k + 1}^{T} ρ_{k + 1, k} + r_{k}^{T} ρ_{k + 1, k} + r_{k}^{T} r_{k},

(56)

which is once again a quadratic constraint.
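The additional redundant constraints (52) to (56) can all be checked at a random feasible point. The sketch below is a hedged illustration, assuming NumPy and hypothetical variable names. It takes the rotation constraints in the form $(I - \frac{1}{2} ϕ^{\land}) c = (I + \frac{1}{2} ϕ^{\land}) \tilde{c}$, which is the form under which the right-hand side of (55) follows, and it uses zero-based column indices for the triples $ℓ m n$.

```python
import numpy as np

def hat(v):
    """Skew-symmetric matrix such that hat(v) @ w equals np.cross(v, w)."""
    x, y, z = v
    return np.array([[0.0, -z, y], [z, 0.0, -x], [-y, x, 0.0]])

def cayley(phi):
    """Rotation matrix (I - phi^/2)^{-1} (I + phi^/2)."""
    return np.linalg.solve(np.eye(3) - 0.5 * hat(phi), np.eye(3) + 0.5 * hat(phi))

rng = np.random.default_rng(1)
I3 = np.eye(3)
C_meas, r_meas = cayley(rng.standard_normal(3)), rng.standard_normal(3)
phi_k, rho_k = rng.standard_normal(3), rng.standard_normal(3)
phi_rel, rho_rel = rng.standard_normal(3), rng.standard_normal(3)

# State k from its pose measurement, then state k+1 from the relative constraints.
C_k = cayley(phi_k) @ C_meas
r_k = np.linalg.solve(I3 - 0.5 * hat(phi_k), (I3 + 0.5 * hat(phi_k)) @ r_meas + rho_k)
C_k1 = cayley(phi_rel) @ C_k
r_k1 = np.linalg.solve(I3 - 0.5 * hat(phi_rel), (I3 + 0.5 * hat(phi_rel)) @ r_k + rho_rel)

triples = [(0, 1, 2), (1, 2, 0), (2, 0, 1)]  # (l, m, n) with c_l x c_m = c_n
residuals = {
    "52": phi_k @ C_k - phi_k @ C_meas,
    "53": phi_k @ r_k - phi_k @ r_meas - phi_k @ rho_k,
    "54a": phi_rel @ C_k1 - phi_rel @ C_k,
    "54b": phi_rel @ r_k1 - phi_rel @ r_k - phi_rel @ rho_rel,
    "55": np.array([
        C_k[:, m] @ C_k[:, l] - 0.5 * phi_k @ C_k[:, n]
        - (C_k[:, m] @ C_meas[:, l] - 0.5 * C_k[:, m] @ hat(C_meas[:, l]) @ phi_k)
        for l, m, n in triples
    ]),
    "56": r_k1 @ r_k1 - (r_k1 @ rho_rel + r_k @ rho_rel + r_k @ r_k),
}
max_residual = max(float(np.max(np.abs(v))) for v in residuals.values())
```

All residuals should be at round-off level, confirming that each constraint is implied by the primary constraints of (51) and therefore does not shrink the original feasible set.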

Summarizing, the following QCQP offers a reasonably tight SDP relaxation in practice:

\begin{matrix} \min & \sum_{k = 1}^{K} ξ_{k}^{T} W_{k} ξ_{k} + {({\overset{ˇ}{ϖ}}_{1} - ϖ_{1})}^{T} Q_{1}^{- 1} ({\overset{ˇ}{ϖ}}_{1} - ϖ_{1}) \\ & + \sum_{k = 1}^{K - 1} {[\begin{matrix} (t_{k + 1} - t_{k}) ϖ_{k} - ξ_{k + 1, k} \\ ϖ_{k} - ϖ_{k + 1} \end{matrix}]}^{T} \\ & \times Q_{k + 1, k}^{- 1} [\begin{matrix} (t_{k + 1} - t_{k}) ϖ_{k} - ξ_{k + 1, k} \\ ϖ_{k} - ϖ_{k + 1} \end{matrix}] \\ \text{w.r.t.} & c_{k, i}, r_{k}, ρ_{k}, ϕ_{k}, ρ_{k + 1, k}, ϕ_{k + 1, k}, ϖ_{k} \; (\forall i, k) \\ \text{s.t.} & c_{k, i}^{T} c_{k, j} = δ_{i j} \; (\forall i, j, k) \\ & (I - \frac{1}{2} ϕ_{k}^{\land}) c_{k, i} = (I + \frac{1}{2} ϕ_{k}^{\land}) {\tilde{c}}_{k, i} \; (\forall i, k) \\ & (I - \frac{1}{2} ϕ_{k}^{\land}) r_{k} = (I + \frac{1}{2} ϕ_{k}^{\land}) {\tilde{r}}_{k} + ρ_{k} \; (\forall k) \\ & (I - \frac{1}{2} ϕ_{k + 1, k}^{\land}) c_{k + 1, i} = (I + \frac{1}{2} ϕ_{k + 1, k}^{\land}) c_{k, i} \; (\forall i, k) \\ & (I - \frac{1}{2} ϕ_{k + 1, k}^{\land}) r_{k + 1} = (I + \frac{1}{2} ϕ_{k + 1, k}^{\land}) r_{k} + ρ_{k + 1, k} \; (\forall k) \\ \text{(red.)} & \frac{1}{2} {(c_{k, i} + {\tilde{c}}_{k, i})}^{T} ρ_{k} = c_{k, i}^{T} r_{k} - {\tilde{c}}_{k, i}^{T} {\tilde{r}}_{k} \; (\forall i, k) \\ & r_{k}^{T} r_{k} = r_{k}^{T} {\tilde{r}}_{k} - \frac{1}{2} r_{k}^{T} {\tilde{r}}_{k}^{\land} ϕ_{k} + r_{k}^{T} ρ_{k} \; (\forall k) \\ & ϕ_{k}^{T} c_{k, i} = ϕ_{k}^{T} {\tilde{c}}_{k, i} \; (\forall i, k) \\ & ϕ_{k}^{T} r_{k} = ϕ_{k}^{T} {\tilde{r}}_{k} + ϕ_{k}^{T} ρ_{k} \; (\forall k) \\ & c_{k, m}^{T} c_{k, ℓ} - \frac{1}{2} ϕ_{k}^{T} c_{k, n} = c_{k, m}^{T} {\tilde{c}}_{k, ℓ} - \frac{1}{2} c_{k, m}^{T} {\tilde{c}}_{k, ℓ}^{\land} ϕ_{k} \; (\forall ℓ m n \in \{123,231,312\}, k) \\ & \frac{1}{2} {(c_{k + 1, i} + c_{k, i})}^{T} ρ_{k + 1, k} = c_{k + 1, i}^{T} r_{k + 1} - c_{k, i}^{T} r_{k} \; (\forall i, k) \\ & r_{k + 1}^{T} r_{k + 1} = r_{k + 1}^{T} ρ_{k + 1, k} + r_{k}^{T} ρ_{k + 1, k} + r_{k}^{T} r_{k} \; (\forall k) \\ & ϕ_{k + 1, k}^{T} c_{k + 1, i} = ϕ_{k + 1, k}^{T} c_{k, i} \; (\forall i, k) \\ & ϕ_{k + 1, k}^{T} r_{k + 1} = ϕ_{k + 1, k}^{T} r_{k} + ϕ_{k + 1, k}^{T} ρ_{k + 1, k} \; (\forall k) . \end{matrix}

(57)

We again leave it to the reader to manipulate this into the standard form of (21). We can also notice that the $ϖ_{k}$ variables are not involved in any of the constraints and thus remain unconstrained. Similar to Rosen et al. (2019) and Holmes and Barfoot (2023), at implementation time we use the Schur complement to marginalize these variables out of the problem, thereby keeping the size of the SDP as small as possible; we can easily recover them after the main solve.
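The Schur-complement marginalization of unconstrained variables can be illustrated on a generic quadratic cost. The sketch below is not the authors' implementation: it assumes NumPy, uses a random positive-definite matrix `M` in place of the actual cost matrix, and the function name and partitioning (kept variables first, eliminated variables last) are hypothetical.

```python
import numpy as np

def marginalize(M, n_keep):
    """Eliminate the trailing variables of the cost z^T M z by exact minimization.

    Returns the Schur complement acting on the first n_keep variables and a
    function that recovers the optimal eliminated variables afterwards.
    """
    A = M[:n_keep, :n_keep]
    B = M[:n_keep, n_keep:]
    D = M[n_keep:, n_keep:]          # must be positive definite
    D_inv_Bt = np.linalg.solve(D, B.T)
    M_reduced = A - B @ D_inv_Bt

    def recover(x):
        """Minimizer over the eliminated variables for a given x."""
        return -D_inv_Bt @ x

    return M_reduced, recover

rng = np.random.default_rng(2)
L = rng.standard_normal((7, 7))
M = L @ L.T + np.eye(7)              # symmetric positive definite
M_red, recover = marginalize(M, n_keep=4)

x = rng.standard_normal(4)           # kept variables (e.g. after the main solve)
v_opt = recover(x)                   # eliminated variables, computed afterwards
z = np.concatenate([x, v_opt])
full_cost = z @ M @ z
reduced_cost = x @ M_red @ x
```

The reduced cost matches the full cost evaluated at the recovered variables, which is why the smaller problem can be solved first and the eliminated variables computed afterwards.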

The appendix provides a baseline local solver for this problem. For the global (SDP) solver we used cvx in Matlab with mosek (ApS, 2019). The solution costs of the global and local solvers agree to high precision if a good initial guess is given to the local solver. Figure 11 provides examples of the local solver becoming trapped in poor local minima while the global solver converges to the correct minimum near the groundtruth. Figure 12 provides a quantitative study of the tightness of the SDP solution with increasing measurement noise; we selected the measurement covariances as $W_{k}^{- 1} = σ^{2} I$ , with σ increasing. We again see there is a large range for the noise over which the local solver can become trapped in a local minimum while the global solver remains tight; in fact, even at very low-noise levels it is quite easy to have the local solver become trapped. With K = 21 poses in the trajectory, the local solver took on average 0.1928s while the SDP solver took on average 13.42s.

Figure 11.

Continuous-time Trajectory Estimation: Four examples of continuous-time trajectory estimation where the randomly initialized local solver (red) becomes trapped in a poor local minimum while the global solver (green) finds the correct solution, which is closer to the groundtruth (blue). The noisy pose measurements (occurring only at the start, middle, and end of each trajectory) are also shown (grey). The local minimum in the leftmost example is very similar to one reported by Lilge et al. (2022).

Figure 12.

Continuous-time Trajectory Estimation: A quantitative evaluation of the tightness of the continuous-time trajectory estimation problem with increasing measurement noise, σ. At each noise level, we conducted 100 trials with the geometry of the trajectory as in the left example of Figure 11. (left) We see that the local solver (randomly initialized) finds the global minimum with decreasing frequency (green) as the measurement noise is increased, while the SDP solver (blue) successfully produces rank-1 solutions (we consider log SVR of at least 5 to be rank 1) to much higher noise levels. For completeness, we also show how frequently the local solver converges to any minimum (red). (right) Boxplots of the log SVR of the SDP solution show that the global solution remains reasonably rank 1 over a wide range of measurement noise values.

There are also a few noteworthy differences in the continuous-time experiment as compared to the discrete-time case. First, the log SVR values are quite a bit lower in Figure 12 as compared to Figure 9. This seems to be mainly due to the fact that we are now using a sparser set of measurements. Our continuous-time experiments had pose measurements only at the start, middle, and end, whereas the discrete-time case had pose measurements at every timestep. It is known that a sparser measurement graph can impact SDP tightness (Holmes and Barfoot, 2023). Second, the low-noise test cases experienced some numerical issues with getting the SDP solver to reliably converge. It seems this is related to matrix conditioning resulting from the fact that we are marginalizing out the $ϖ_{k}$ variables before solving the SDP. We found it was necessary to adjust the scaling of the Q matrix in (21) to get reliable solutions. Despite our best efforts, we see that there was one out of 100 test cases at the σ = 0.01 noise level where the SDP failed to solve. On the log SVR plot this shows as a red plus sign at 0 and results in the success rate of the SDP solver being 0.99 instead of 1. Still, we know that the solver failed and therefore not to trust the answer. Overall, we still have log SVR values almost always above 5 up to about σ = 1, which makes the solution practical; it is easy for the local solver to become stuck in local minima in this noise range.

6. Conclusion and future work

We have presented several new convex relaxations for pose and rotation estimation problems based on the Cayley map. Our results indicate that for small problem sizes, we can successfully achieve global optimality with realistic amounts of noise and even with measurement sparsity in the case of continuous-time trajectory estimation. In each of the experiments, we indicated that the covariance of the error associated with each pose measurement cost term is σ²I. In other words, σ is the standard deviation of the measurement noise. In the case of rotational degrees of freedom, σ = 0.5 is already shown in Figure 1 to represent quite a lot of rotational uncertainty, indeed more than typically occurs in practice. Since the standard deviation is in radians, this implies that uncertainty spreads out over a large part of a full circle with σ = 0.5. The fact that our convex relaxations empirically remain tight (ensuring global optimality) beyond the σ = 0.5 level (often beyond σ = 1) means our technique works in most practical situations. For translational degrees of freedom, σ will have units of distance. The trajectories in the examples have poses that are spaced one distance unit apart, so when the noise on the measurement of one of these poses is σ = 0.5 distance units, that is again quite a lot of noise in comparison to the spacing of the poses. The implication is again that our convex relaxations remain tight (ensuring global optimality) for most practical situations.

While our results are promising, we are still relying on off-the-shelf solvers once our problem has been converted to an SDP, which means that we will not be able to scale up to extremely large state sizes. To scale up, there are a few possibilities that we could explore. First, perhaps we might be satisfied with merely certifying our local solver solutions. Other works have focussed on this. The challenge is that in most of the problems of this paper, we require redundant constraints to tighten our SDPs. This means that we do not meet the technical condition of Linear Independence Constraint Qualification (LICQ) (Boyd and Vandenberghe, 2004). It turns out that this makes it more challenging to calculate an optimality certificate. Yang et al. (2021) is a practical example where a certificate has been constructed for this type of situation, but there are still scaling issues. Another possibility is to solve our problems globally using the approach of Burer and Monteiro (2005) (studied more recently by Boumal et al. (2016)). This was exploited with very impressive results by Rosen et al. (2019) and Dellaert et al. (2020); however, these problems enjoyed LICQ. To our knowledge, this approach has not been applied to problems in robotics with redundant constraints. Or, perhaps generic SDP solvers can be made to better exploit problem-specific structure (e.g., chordal sparsity). We plan to explore these and other ways of scaling global optimality for a larger range of state estimation problems.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Swiss National Science Foundation, Postdoc Mobility under Grant 206954 and in part by the Natural Sciences and Engineering Research Council of Canada (NSERC).

ORCID iD

Timothy D. Barfoot

Notes

Appendix

References

Alismail, Baker and Browning (2014) Continuous trajectory estimation for 3D SLAM from actuated lidar. In: Proceedings of the IEEE international conference on robotics and automation (ICRA), Hong Kong, China, 31 May 2014, 6096–6101.

Anderson and Barfoot (2015) Full steam ahead: exactly sparse Gaussian process regression for batch continuous-time trajectory estimation on SE(3). In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS), Hamburg, Germany, 28 September - 3 October 2015, 157–164. DOI: 10.1109/IROS.2015.7353368.

Anjos and Wolkowicz (2002) Strengthened semidefinite relaxations via a second lifting for the Max-Cut problem. Discrete Applied Mathematics 119: 79–106.

Anstreicher and Wolkowicz (2000) On Lagrangian relaxation of quadratic matrix constraints. SIAM Journal on Matrix Analysis and Applications 22(1): 41–55.

MOSEK ApS (2019) The MOSEK optimization toolbox for MATLAB manual. https://docs.mosek.com/9.3/toolbox/index.html.

Bandeira, Boumal and Singer (2017) Tightness of the maximum likelihood semidefinite relaxation for angular synchronization. Mathematical Programming 163(1-2): 145–167.

Barfoot (2024) State Estimation for Robotics. 2nd edition. Cambridge, UK: Cambridge University Press. DOI: 10.1017/9781009299909.

Barfoot and Furgale (2014) Associating uncertainty with three-dimensional poses for use in estimation problems. IEEE Transactions on Robotics 30(3): 679–693. DOI: 10.1109/TRO.2014.2298059.

Barfoot, Tong and Sarkka (2014) Batch continuous-time trajectory estimation as exactly sparse Gaussian process regression. In: Proceedings of robotics: science and systems (RSS), Berkeley, USA, 12–16 July 2014.

Barfoot, Forbes and D’Eleuterio (2022) Vectorial parameterizations of pose. Robotica 40(7): 2409–2427. DOI: 10.1017/S0263574721001715.

Bauchau (2011) Flexible Multibody Dynamics. Cham, Switzerland: Springer Science & Business Media.

Bauchau and Choi (2003) The vector parameterization of motion. Nonlinear Dynamics 33(2): 165–188.

Bauchau and Trainelli (2003) The vectorial parameterization of rotation. Nonlinear Dynamics 32(1): 71–92.

Bishop (2006) Pattern Recognition and Machine Learning. Cham, Switzerland: Springer.

Borri, Trainelli and Bottasso (2000) On representations and parameterizations of motion. Multibody System Dynamics 4(2): 129–193.

Boumal, Voroninski and Bandeira (2016) The non-convex Burer-Monteiro approach works on smooth semidefinite programs. Advances in Neural Information Processing Systems 29.

Boyd and Vandenberghe (2004) Convex Optimization. Cambridge, UK: Cambridge University Press.

Briales and Gonzalez-Jimenez (2017a) Cartan-sync: fast and global SE(d)-synchronization. IEEE Robotics and Automation Letters 2(4): 2127–2134.

Briales and Gonzalez-Jimenez (2017b) Convex global 3D registration with Lagrangian duality. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Honolulu, HI, USA, 21 July 2017, 4960–4969.

Briales, Kneip and Gonzalez-Jimenez (2018) A certifiably globally optimal solution to the non-minimal relative pose problem. In: 2018 IEEE/CVF conference on computer vision and pattern recognition, Salt Lake City, UT, 18 June 2018. IEEE, 145–154.

Burer and Monteiro (2005) Local minima and convergence in low-rank semidefinite programming. Mathematical Programming 103(3): 427–444.

Carlone and Calafiore (2018) Convex relaxations for pose graph optimization with outliers. IEEE Robotics and Automation Letters 3(2): 1160–1167.

Carlone, Calafiore, Tommolillo, et al. (2016) Planar pose graph optimization: duality, optimal solutions, and verification. IEEE Transactions on Robotics 32(3): 545–565.

Chaudhury, Khoo and Singer (2015) Global registration of multiple point clouds using semidefinite programming. SIAM Journal on Optimization 25(1): 468–501.

Cifuentes, Agarwal, Parrilo, et al. (2022) On the local stability of semidefinite relaxations. Mathematical Programming 193: 629–663.

Dellaert, Rosen, et al. (2020) Shonan rotation averaging: global optimality by surfing SO(p)^n. In: Computer Vision–ECCV 2020: 16th European conference, Glasgow, UK, 23–28 August 2020, Springer, 292–308, Proceedings, Part VI 16.

Duembgen, Holmes, Agro, et al. (2023) Toward globally optimal state estimation using automatically tightened semidefinite relaxations. IEEE Transactions on Robotics, 2024.

Dümbgen, Holmes and Barfoot (2023) Safe and smooth: certified continuous-time range-only localization. IEEE Robotics and Automation Letters 8(2): 1117–1124.

Eriksson, Olsson, Kahl, et al. (2018) Rotation averaging and strong duality. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Salt Lake City, UT, USA, 18 June 2018, 127–135.

Eriksson, Olsson, Kahl, et al. (2021) Rotation averaging with the chordal distance: global minimizers and strong duality. IEEE Transactions on Pattern Analysis and Machine Intelligence 43(1): 256–268.

Forbes and de Ruiter (2015) Linear-matrix-inequality-based solution to Wahba’s problem. Journal of Guidance, Control, and Dynamics 38(1): 147–151.

Furgale, Barfoot and Sibley (2012) Continuous-time batch estimation using temporal basis functions. In: Proceedings of the IEEE international conference on robotics and automation (ICRA), St. Paul, USA, 14-18 May 2012, 2088–2095. DOI: 10.1109/ICRA.2012.6225005.

Garcia-Salguero, Briales and Gonzalez-Jimenez (2022) A tighter relaxation for the relative pose problem between cameras. Journal of Mathematical Imaging and Vision 64: 493–505.

Giamou, Peretroukhin, et al. (2019) Certifiably globally optimal extrinsic calibration from per-sensor egomotion. IEEE Robotics and Automation Letters 4(2): 367–374.

Henrion, Korda and Lasserre (2021) The Moment-SOS hierarchy: lectures in probability, statistics, computational geometry, control and nonlinear PDEs, Series on optimization and its applications, volume 04. World Scientific (Europe), New Jersey.

Holmes and Barfoot (2023) An efficient global optimality certificate for landmark-based SLAM. IEEE Robotics and Automation Letters 8(3): 1539–1546.

Horowitz, Matni and Burdick (2014) Convex relaxations of SE(2) and SE(3) for visual pose estimation. In: 2014 IEEE international conference on robotics and automation (ICRA), Hong Kong, China, 31 May 2014. IEEE, 1148–1154.

Iglesias, Olsson and Kahl (2020) Global optimality for point set registration using semidefinite programming. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Seattle, WA, USA, 13–19 June 2020, 8287–8295.

Junkins, Majji, Macomber, et al. (2011) Small body proximity sensing with a novel HD 3D ladar system. Advances in the Astronautical Sciences 141: 341–354.

Kahl and Henrion (2005) Globally optimal estimates for geometric reconstruction problems. In: Tenth IEEE international conference on computer vision (ICCV’05) 1-2, 978–985.

Kezurer, Kovalsky, Basri, et al. (2015) Tight relaxation of quadratic matching. Computer Graphics Forum 34(5): 115–128.

Kobilarov (2014) Discrete optimal control on Lie groups and applications to robotic vehicles. In: Proceedings of the IEEE international conference on robotics and automation (ICRA), 31 May - 7 June 2014, 5523–5529.

Kobilarov and Marsden (2011) Discrete geometric optimal control on Lie groups. IEEE Transactions on Robotics 27(4): 641–655.

Krislock and Wolkowicz (2010) Explicit sensor network localization using semidefinite representations and facial reductions. SIAM Journal on Optimization 20(5): 2679–2708.

Lasserre (2001) Global optimization with polynomials and the problem of moments. SIAM Journal on Optimization 11(3): 796–817.

Lasserre (2010) Moments, Positive Polynomials and Their Applications. Number v. 1 in Imperial College Press Optimization Series. London; Singapore: Imperial College Press. Distributed by World Scientific Publishing Co., Hackensack, NJ.

Lilge, Barfoot and Burgner-Kahrs (2022) Continuum robot state estimation using Gaussian process regression on SE(3). The International Journal of Robotics Research 41(13-14): 1099–1120. DOI: 10.1177/02783649221128843.

Liu, Huang, Dissanayake, et al. (2012) A convex optimization based approach for pose SLAM problems. In: 2012 IEEE/RSJ international conference on intelligent robots and systems, Vilamoura, Algarve, 7-12 October 2012, 1898–1903.

Long, Wolfe, Mashner, et al. (2013) The banana distribution is Gaussian: a localization study with exponential coordinates. Robotics: Science and Systems VIII 265: 1.

Luo, et al. (2010) Semidefinite relaxation of quadratic optimization problems. IEEE Signal Processing Magazine 27(3): 20–34.

Majji, Davis, Doebbler, et al. (2011) Terrain mapping and landing operations using vision based navigation systems. In: AIAA guidance, navigation, and control conference, Portland, Oregon, 8-11 August 2011, 6581.

Mortari, Markley and Singla (2007) Optimal linear attitude estimator. Journal of Guidance, Control, and Dynamics 30(6): 1619–1627.

Müller (2021) Review of the exponential and Cayley map on SE(3) as relevant for Lie group integration of the generalized Poisson equation and flexible multibody systems. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 477(2253): 20210303.

Nakano (2015) Globally optimal DLS method for PnP problem with Cayley parameterization. In: Proceedings of the British machine vision conference, Swansea, UK, 7-10 September 2015 78: 1–78.

Nesterov and Wolkowicz (2000) Semidefinite programming relaxations of nonconvex quadratic optimization. In: Hillier, Wolkowicz, Saigal, et al. (eds) Semidefinite Programming Relaxations of Nonconvex Quadratic Optimization. Boston, MA: Springer US, Vol. 27, 361–419.

Nocedal and Wright (1999) Numerical Optimization. Cham, Switzerland: Springer.

Papalia, Morales, Doherty, et al. (2023) SCORE: a second-order conic initialization for range-aided SLAM. In: IEEE international conference on robotics and automation (ICRA).

Parrilo (2003) Semidefinite programming relaxations for semialgebraic problems. Mathematical Programming 96: 293–320.

Qian, Charland-Arcand and Forbes (2020) TWOLATE: total registration of point-clouds using a weighted optimal linear attitude and translation estimator. In: 2020 IEEE conference on control technology and applications (CCTA), Montreal, QC, 24 August 2020. IEEE, 43–48.

Rosen, Carlone, Bandeira, et al. (2019) SE-Sync: a certifiably correct algorithm for synchronization over the special Euclidean group. The International Journal of Robotics Research 38(2-3): 95–125.

Selig (2007) Cayley maps for SE(3). In: 12th international federation for the promotion of mechanism and machine science world congress, Tokyo, Japan, 5–10 November 2023, London South Bank University, 6.

Shor (1987) Quadratic optimization problems. Soviet Journal of Computer and Systems Sciences 25: 1–11.

Solo and Wang (2019) Numerical methods for stochastic differential equations in Stiefel manifolds via the Cayley transform. In: IEEE 58th conference on decision and control (CDC), Nice, France, 11-13 December 2019, 3303.

Tian, Khosoussi, Rosen, et al. (2021) Distributed certifiably correct pose-graph optimization. IEEE Transactions on Robotics 37(6): 2137–2156.

Wang, Chou, et al. (2022a) Solving the PnL problem using the hidden variable method: an accurate and efficient solution. The Visual Computer 38: 95–106.

Wang and Solo (2020) Lie group state estimation via optimal transport. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing (ICASSP), Barcelona, Spain, 4-8 May 2020, 5625–5629.

Wang, Wen, Yin, et al. (2022b) Certifiably optimal mutual localization with anonymous bearing measurements. IEEE Robotics and Automation Letters 7(4): 9374–9381.

Wise, Giamou, Khoubyarian, et al. (2020) Certifiably optimal monocular hand-eye calibration. In: 2020 IEEE international conference on multisensor fusion and integration for intelligent systems (MFI), Karlsruhe, Germany, 14-16 September 2020. IEEE, 271–278.

Wong and Majji (2016) A structured light system for relative navigation applications. IEEE Sensors Journal 16(17): 6662–6679.

Wong, Singla, Lee, et al. (2018) Optimal linear attitude estimator for alignment of point clouds. In: 2018 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW), Salt Lake City, UT, 18-22 June 2018. IEEE, 1577–15778.

Yang and Carlone (2023) Certifiably optimal outlier-robust geometric perception: semidefinite relaxations and scalable global optimization. IEEE Transactions on Pattern Analysis and Machine Intelligence 45(3): 2816–2834.

Yang, Shi and Carlone (2021) TEASER: fast and certifiable point cloud registration. IEEE Transactions on Robotics 37(2): 314–333.

Zhang and Koch (2014) Structure and motion from line correspondences: representation, projection, initialization and sparse bundle adjustment. Journal of Visual Communication and Image Representation 25(5): 904–915.

Certifiably optimal rotation and pose estimation based on the Cayley map
