Abstract
This work presents a novel approach for the optimization of dynamic systems on finite-dimensional Lie groups. We rephrase dynamic systems as so-called neural ordinary differential equations (neural ODEs), and formulate the optimization problem on Lie groups. A gradient descent optimization algorithm is presented to tackle the optimization numerically. Our algorithm is scalable, and applicable to any finite-dimensional Lie group, including matrix Lie groups. By representing the system at the Lie algebra level, we reduce the computational cost of the gradient computation. In an extensive example, optimal potential energy shaping for control of a rigid body is treated. The optimal control problem is phrased as an optimization of a neural ODE on the Lie group SE(3), and the controller is iteratively optimized. The final controller is validated on a state-regulation task.
1. Introduction
Many physical systems are naturally described by the action of Lie groups on their configuration manifolds. These range from finite-dimensional systems such as rigid bodies, whose poses are acted on by the special Euclidean group SE(3) (Murray et al., 1994), to infinite-dimensional systems such as flexible bodies or fluid dynamical systems, where the diffeomorphism group acts on the configuration of the continuum (Schmid, 2010).
Geometric control systems on Lie groups (Brockett, 1973; Jurdjevic, 1996) exploit the Lie group structure of the underlying physical systems to provide numerical advantages (Marsden and Ratiu, 1999). For example, PD controllers for rigid bodies were defined on SO(3) and SE(3) by Bullo and Murray (1995), and more recently geometric controllers were applied in the context of UAVs (Goodarzi et al., 2013; Lee et al., 2010; Rashad et al., 2019). Examples of efficient optimal control formulations on Lie groups include linear (Ayala et al., 2021) and nonlinear systems (Spindler, 1998), as well as efficient numerical optimization methods (Kobilarov and Marsden, 2011; Saccon et al., 2013; Teng et al., 2022).
In an orthogonal development over recent years, there has been a surge of machine learning applications in control (Dev et al., 2021) and robotics (Ibarz et al., 2021; Soori et al., 2023; Taylor et al., 2021). This surge is driven by the need for controllers that work in high-dimensional robotic systems and approximate complex decision policies that require the use of data. Implementing such controllers through classical control-theoretic approaches is prohibitive, which has led to a paradigm shift toward data-driven control (Taylor et al., 2021). Examples of machine learning in high-dimensional systems extend to soft robotics (Kim et al., 2021) and the control of fluid systems (Paris et al., 2021). The literature also aims to address common safety concerns (Brunke et al., 2022; Hewing et al., 2020), both during the training process and in the deployment of systems with machine learning in the loop.
The so-called Erlangen program of machine learning by Bronstein et al. (2021) stresses the importance of geometric machine learning methods: symmetries of data sets can restrict the complexity of functions that are to be learned on them, and thus increase the numerical efficiency of learning frameworks. This rationale also led to extensions of machine learning approaches to Lie groups (Fanzhang et al., 2019; Lu and Li, 2020), with recent applications by Huang et al. (2017); Chen et al. (2021); Forestano et al. (2023).
Indeed, the fundamental symmetry groups in robotics are naturally represented by Lie groups (Marsden and Ratiu, 1999). As such, Lie group-based learning methods are of interest to the robotics community. In an excellent example of a control application, Duong and Atanasov (2021b) extended neural ODEs to SE(3) and applied them to the adaptive control of a UAV in Duong and Atanasov (2021a). In their recent work, Duong et al. (2024) also highlight the practical use of neural ODEs on Lie groups.
However, a general approach for geometric machine learning in the context of dynamic systems on Lie groups is missing. We believe that such an approach would be of high interest, especially for control applications. In this paper, we address this issue by formalizing neural ODEs on Lie groups.
Our contributions are 1. the formulation of neural ODEs on any finite-dimensional Lie group, with a particular focus on matrix Lie groups; 2. computational simplifications with respect to manifold neural ODEs, through the use of a compact equation to compute gradients on Lie groups and a reduced dimension with respect to non-intrinsic approaches on Lie groups; 3. a class of potential energy shaping and damping injection controllers for rigid bodies on SE(3) that guarantees stability by design; and 4. the formulation of a minimal exponential atlas on the Lie group SE(3).
The article is divided into two parts (see also Figure 1): first, the formulation of neural ODEs on finite-dimensional matrix Lie groups, and second, an extensive example of optimal potential energy shaping on SE(3).

Figure 1. Overview of the main contribution and structure of the article. Given a parameterized dynamical system on a Lie group, the generalized adjoint method on Lie groups lets us compute the parameter gradient of a cost-functional over system trajectories by solving a set of differential equations. This parameter gradient can then be used to iteratively update parameters by gradient descent. In practice, we sample multiple initial conditions and approximate the parameter gradient of the expected cost.
Section 2 presents the main technical contribution of the article: the generalized adjoint method on Lie groups, which is at the heart of a gradient descent algorithm for dynamics optimization via neural ODEs on Lie groups.
A number of technical tools are required to apply this algorithm on a given matrix Lie group; these are introduced in Section 3. The exponential atlas makes it possible to implement a numerical procedure for exact integration on Lie groups, while a compact formula for the gradient of a function on a Lie group reduces the complexity of the gradient computation.
Section 4 presents the Lie groups SO(3) and SE(3) and gives concrete examples of the technical tools presented in the previous section. One aspect of this is the construction of a minimal exponential atlas on SE(3), which is used to formulate an integration procedure on SE(3). This treatment sets the stage for control optimization of a rigid body on SE(3).
Section 5 introduces the example of optimizing potential energy shaping and damping injection controllers for rigid bodies on SE(3). The class of controllers is defined, and it is shown to guarantee stability by design. Afterward, the optimization of a cost-functional over the defined class of controllers is derived from the general procedure in Section 2.
Finally, Section 6 provides two examples of optimizing controllers for a rigid body on SE(3). The first example concerns pose control, without gravity, and results are compared to a quadratic controller of the type presented by Rashad et al. (2019). In the second example, the controller’s performance is investigated in the presence of gravity.
The article ends with a discussion in Section 7 and a conclusion in Section 8.
1.1 Neural ODEs and relation to existing works
Neural ODEs were first introduced by Chen et al. (2018), who derived them as the continuous limit of recurrent neural nets, taking inputs on ℝ^n.
Massaroli et al. (2022) introduced a more general framework of neural ODEs, showing the power of state-augmentation and connections to optimal control, while also showing that the cost-functional can include integral cost terms. To this end, they presented the generalized adjoint method.
There are two highly relevant examples in the recent literature that extend neural ODEs to manifolds. The so-called extrinsic picture is presented by Falorsi and Forré (2020), who formulate neural ODEs on a manifold that is embedded in an ambient Euclidean space.
An intrinsic picture is presented by Lou et al. (2020), who formulate neural ODEs on a manifold directly in local coordinate charts.
An example of neural ODEs to control of robotic systems described on
Duong and Atanasov (2021a,b) apply neural ODEs to control optimization for a rigid body on SE(3). Their work focuses on the formulation of an IDA-PBC controller, uses it for dynamics learning and trajectory tracking, and uses neural ODEs as the tool for this optimization. While the integration procedure used is not geometrically exact and the Lie group constraints are violated, the approach is highly successful. However, Duong et al. do not connect their contribution to the geometric machine learning literature, such as neural ODEs on manifolds. In their recent work, Duong et al. (2024) take steps to extend the extrinsic approach to general matrix Lie groups, however without making full use of the geometric structure provided by Lie groups.
Compared with Duong et al. (2024), we present neural ODEs on arbitrary finite-dimensional Lie groups. By extending the intrinsic formulation to Lie groups, our example on SE(3) requires fewer dimensions (24 instead of 36), and the use of local charts allows geometrically exact integration.
1.2 Notation
While the main results are accessible with a background of linear algebra and vector calculus, the derivations heavily rely on differential geometry and Lie group theory, see, for example, Isham (1999) and Hall (2015) for a complete introduction, or Solà et al. (2021) for a brief introduction with examples in robotics.
Calligraphic letters
Upper case letters G, H denote Lie groups, while lower case letters g, h denote their elements. A lower case e denotes the group identity e ∈ G, and an upper case I denotes the identity matrix. The Lie algebra of G is denoted 𝔤.
Furthermore
For
When coordinate expressions are concerned, the Einstein summation convention is used; that is, a product of variables with lower and upper indices implies a sum: a_i b^i ≔ ∑_i a_i b^i.
Let
2. Main result
After a brief introduction to Lie groups in Section 2.1, the optimization problem is introduced on abstract Lie groups in Section 2.2. A gradient descent optimization algorithm is presented in Section 2.3. Our main technical result, the generalized adjoint method on Lie groups, lies at the core of the gradient computation. For the sake of exposition, we present it in the context of matrix Lie groups, and relegate the derivations and the formulation on abstract Lie groups to Appendix A.
2.1 Lie groups
A finite-dimensional Lie group G is an n-dimensional manifold together with a group structure, such that the group operation is a smooth map on G (Isham, 1999). G is a real matrix Lie group if it is a subgroup of the general linear group of invertible real matrices.
We denote the Lie algebra of G as 𝔤, the tangent space of G at the identity.
Define a basis
The dual of Λ is the map
For a matrix Lie group the Lie algebra is itself a space of matrices, and the Lie bracket is given by the matrix commutator [A, B] = AB − BA.
For
Using the operator Λ, a matrix representation of ad is obtained as
On matrix Lie groups and for functions
2.2 Optimization problem
We consider a variant of the optimal control problem on a Lie group (Jurdjevic, 1996) with a finite horizon T. Given parameters
Indicating a probability space
The chief reason for our interest in this optimization problem is that it includes, as a sub-class, the optimization of state-feedbacks
The dynamics f_θ(g, t) can also be parameterized with neural nets, in which case f_θ(g, t) is referred to as a neural ODE on a Lie group. Indeed, for the Lie group ℝ^n (with vector addition as the group operation), this construction recovers standard neural ODEs.
2.3 Optimization algorithm
We use a stochastic gradient descent optimization algorithm (Robbins and Monro, 1951) to approximate a solution to the optimization problem (11) on a matrix Lie group.
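To make the descent step concrete, the following is a minimal sketch of one stochastic gradient descent update: initial conditions are sampled, per-trajectory gradients are averaged, and a step is taken against the averaged gradient. The functions sample_initial_condition and cost_gradient are illustrative placeholders for the sampling of g(0) and for the per-trajectory gradient computed by the generalized adjoint method; they are not part of the accompanying code.

```python
# Sketch of one stochastic gradient descent update for the optimization problem:
# the expected-cost gradient is approximated by averaging per-trajectory gradients
# over a batch of sampled initial conditions. Function names are illustrative.
import numpy as np

def sgd_step(theta, sample_initial_condition, cost_gradient, batch_size=32, lr=1e-2):
    grads = []
    for _ in range(batch_size):
        g0 = sample_initial_condition()          # draw an initial condition g(0)
        _, grad = cost_gradient(g0, theta)       # per-trajectory gradient of the cost
        grads.append(grad)
    return theta - lr * np.mean(grads, axis=0)   # descent step on the averaged gradient
```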
Denote the total cost in (11) as
Additionally, denote by
In order to compute the gradient ∂C_i/∂θ of the cost for a single trajectory (10), we derived the generalized adjoint method on matrix Lie groups. It is the main technical result of this paper, and it is stated in the following:
Generalized Adjoint Method on Matrix Lie Groups. Given are the dynamics (8) and the cost (10). The gradient of the cost with respect to the parameters is obtained by solving the differential equations (15)–(17).

The generalized adjoint method on Lie groups extends the generalized adjoint method of Massaroli et al. (2022). Just as the generalized adjoint method on ℝ^n, it allows a memory-efficient computation of the gradient. Various technical tools are required to apply Theorem 2.1 in practice; these include the exponential atlas for exact integration of the dynamics (8), and they are introduced in Section 3.
Equations (16) and (17) are solved by integrating (16) forward in time, computing d_gF at g = g(T), and integrating (17) backward in time, reusing g(t) from the forward integration. Equation (15) is solved by integrating its differential alongside equation (17). See especially Figure 1. The memory efficiency of neural ODEs stems from the fact that the trajectories g(t) and λ_g(t) do not need to be stored, apart from a few way-points of g(t), and that the dependency of g(t) on the parameters is largely ignored in the forward pass; this avoids overheads that arise, for example, through automatic differentiation over an ODE solver.
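The following sketch illustrates this forward/backward structure for the standard (vector-space) generalized adjoint method, applied in a single chart with a toy linear parameterization; the Lie-group version of Theorem 2.1 additionally works at the Lie algebra level and handles chart switching. All function names and the finite-difference Jacobians are illustrative assumptions, and the function cost_gradient fills the role of the placeholder used in the descent loop above.

```python
# Sketch of the continuous adjoint method in chart coordinates x in R^n:
# forward solve of the state, backward solve of the adjoint together with the
# accumulated parameter gradient. Toy linear dynamics; not the paper's code.
import numpy as np
from scipy.integrate import solve_ivp

def f(x, t, theta):                  # chart-coordinate dynamics f_theta(x, t)
    return theta.reshape(len(x), len(x)) @ x

def running_cost(x, t, theta):       # running cost r(x, t)
    return 0.5 * float(x @ x)

def final_cost(x):                   # final cost F(x(T))
    return 0.5 * float(x @ x)

def num_grad(fun, z, eps=1e-6):      # central finite-difference gradient helper
    g = np.zeros_like(z)
    for i in range(len(z)):
        dz = np.zeros_like(z); dz[i] = eps
        g[i] = (fun(z + dz) - fun(z - dz)) / (2 * eps)
    return g

def cost_gradient(x0, theta, T=1.0):
    n, p = len(x0), len(theta)
    # forward pass: integrate the state and keep a dense interpolant (way-points)
    fwd = solve_ivp(lambda t, x: f(x, t, theta), (0.0, T), x0, dense_output=True)
    xT = fwd.y[:, -1]
    lam_T = num_grad(final_cost, xT)                     # terminal adjoint: dF/dx at x(T)
    def backward(t, aug):
        lam = aug[:n]
        x = fwd.sol(t)                                   # reuse the forward trajectory
        dfdx = np.array([num_grad(lambda xx: f(xx, t, theta)[i], x) for i in range(n)])
        dfdth = np.array([num_grad(lambda th: f(x, t, th)[i], theta) for i in range(n)])
        drdx = num_grad(lambda xx: running_cost(xx, t, theta), x)
        drdth = num_grad(lambda th: running_cost(x, t, th), theta)
        dlam = -dfdx.T @ lam - drdx                      # adjoint dynamics (vector-space analogue)
        dacc = -(dfdth.T @ lam + drdth)                  # accumulates dC/dtheta when run backward
        return np.concatenate([dlam, dacc])
    bwd = solve_ivp(backward, (T, 0.0), np.concatenate([lam_T, np.zeros(p)]))
    return final_cost(xT), bwd.y[n:, -1]                 # final-cost value and dC/dtheta

cost, grad = cost_gradient(np.array([1.0, 0.0]), 0.1 * np.random.randn(4))
print(cost, grad)
```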
The choice of group action for
3. Technical tools
A number of technical tools are presented in the context of matrix Lie groups. Given mild adaptations of the definitions these tools also apply to abstract finite-dimensional Lie groups (see Appendix A.4).
3.1 Atlas and minimal atlas on Lie groups
In this section the exponential and logarithmic maps are used to construct an atlas of exponential charts for finite-dimensional Lie groups, and the concept of a minimal exponential atlas is defined. Here, an atlas is defined as follows:
Atlas and Charts. An atlas of an n-dimensional manifold is a collection of charts (U, X), where the open chart regions U together cover the manifold and the chart maps X: U → ℝ^n are diffeomorphisms onto their images.

For finite-dimensional Lie groups the exponential map exp: 𝔤 → G is a local diffeomorphism between a neighborhood of 0 ∈ 𝔤 and a neighborhood of the identity e ∈ G. For a matrix Lie group, the exponential map is given by the infinite sum (Hall, 2015: Chapter 3.7)

exp(A) = ∑_{k=0}^∞ A^k / k!.   (18)

Conversely, the log map for matrix Lie groups is given by the matrix logarithm, when it is well-defined (Hall, 2015: Chapter 2.3):

log(g) = ∑_{k=1}^∞ (−1)^{k+1} (g − I)^k / k.   (19)

On a case-by-case basis the infinite sums in (18) and (19) can further be reduced to a finite sum by use of the Cayley-Hamilton theorem (Visser et al., 2006), which often allows one to find a closed-form expression of the exp and log maps. The logarithmic map (19) and Λ in equation (3) can then be used to construct a local exponential chart (U, X) for G, where U ⊂ G is a neighborhood of the identity on which the log map is well-defined and the chart map X: U → ℝ^n composes the log map with the vector space isomorphism of equation (3). To create a chart “centered” on any h ∈ G (i.e., the zero coordinates are assigned to h), both the region U and the chart map X can be left-translated by L_h to define the chart (U_h, X_h), with U_h = L_h(U) and X_h = X ∘ L_{h⁻¹}. The collection of all such charts forms an atlas of G, the exponential atlas. In order to use a finite number of charts, we are interested in constructing a minimal exponential atlas. A minimal atlas is defined as follows:
Minimal Atlas. An atlas of a manifold is called minimal if there exists no atlas of that manifold with fewer charts.
Given a manifold
Given a minimal exponential atlas (which can be seen to be always countable), we use integers j to number the relevant chart-centers as g_j, the corresponding charts as (U_j, X_j), and the chart-coordinates in the j-th chart accordingly.
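As an illustration of the construction above, the following sketch builds a left-translated exponential chart for the matrix Lie group SO(3), with scipy's expm and logm taking the place of the series (18) and (19). The hat and vee maps below play the role of the isomorphism Λ and its inverse and are assumptions of this sketch.

```python
# Sketch of a left-translated exponential chart (U_h, X_h) on SO(3): the chart map
# sends g to the coordinates of log(h^{-1} g), so the chart center h gets zero coordinates.
import numpy as np
from scipy.linalg import expm, logm

E = [np.array([[0, 0, 0], [0, 0, -1], [0, 1, 0]], float),   # basis of so(3)
     np.array([[0, 0, 1], [0, 0, 0], [-1, 0, 0]], float),
     np.array([[0, -1, 0], [1, 0, 0], [0, 0, 0]], float)]

def Lambda(x):                        # coordinates in R^3 -> Lie algebra (hat map)
    return sum(xi * Ei for xi, Ei in zip(x, E))

def Lambda_inv(A):                    # Lie algebra -> coordinates (vee map)
    return np.array([A[2, 1], A[0, 2], A[1, 0]])

def chart(h):
    """Exponential chart centered on h, valid where the matrix log is well-defined."""
    X = lambda g: Lambda_inv(np.real(logm(np.linalg.inv(h) @ g)))   # X_h(g)
    X_inv = lambda x: h @ expm(Lambda(x))                           # X_h^{-1}(x)
    return X, X_inv

# round-trip check near the chart center
X, X_inv = chart(expm(Lambda(np.array([0.1, -0.2, 0.3]))))
x = np.array([0.3, 0.1, -0.4])
print(np.allclose(X(X_inv(x)), x))
```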
3.2 Vectors on Lie groups
Given a curve

Figure: Commutative diagram of a generic Lie group G. Boxes represent sets, while arrows represent functions between sets. Relevant variables in a given set are indicated in red.
Recall that
The expression (24) is invariant under the choice of exponential chart, that is, for any two charts (U_j, X_j) and (U_k, X_k) the two chart expressions coincide.
3.3 Lie group integrators
We adapt the Runge-Kutta-Munthe-Kaas (RKMK) method (Munthe-Kaas, 1999) for exact integration of the dynamics (8). The RKMK method uses the Runge-Kutta scheme on Lie groups—we instead allow for arbitrary numerical integration schemes. For an overview of Lie group integrators, see, for example, Iserles et al. (2000); Celledoni and Owren (2003); Celledoni et al. (2014).
Using equation (24), the dynamics (8) can be represented in a local exponential chart as (Celledoni et al., 2014)
Note that the chart-dynamics (26) are not necessarily well-defined for all chart coordinates. To make sure the state remains within the chart region, we switch charts when needed by application of the chart transition maps. Given a minimal exponential atlas, we choose the charts so as to reduce the number of chart switches. To this end, we introduce indicator functions

The use of a minimal exponential atlas makes it possible to store way-points of a trajectory
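A minimal sketch of such a chart-based integrator is given below in its simplest form: a Lie-Euler step that re-centers the exponential chart at the current group element at every step, which keeps the numerical solution exactly on the group. It assumes dynamics in the body (left-trivialized) form dg/dt = g Λ(f(g, t)); the RKMK method replaces the Euler update in coordinates by a higher-order Runge-Kutta update with a dexp-inverse correction.

```python
# Sketch of exact-on-the-group integration by stepping through exponential charts.
# The simplest variant (Lie-Euler) is shown; the orthogonality check illustrates
# that the group constraint is preserved to machine precision.
import numpy as np
from scipy.linalg import expm

def hat(x):  # so(3) hat map, playing the role of Lambda for G = SO(3)
    return np.array([[0, -x[2], x[1]], [x[2], 0, -x[0]], [-x[1], x[0], 0.]])

def integrate(f, g0, t0, t1, steps=100):
    """Integrate dg/dt = g * hat(f(g, t)) by re-centering the chart each step."""
    g, dt = g0.copy(), (t1 - t0) / steps
    for k in range(steps):
        t = t0 + k * dt
        g = g @ expm(dt * hat(f(g, t)))   # step in the chart centered at the current g
    return g

# example: constant body angular velocity about the z-axis
f = lambda g, t: np.array([0.0, 0.0, 1.0])
R = integrate(f, np.eye(3), 0.0, np.pi / 2)
print(np.round(R, 3))                      # approximately a 90-degree rotation about z
print(np.allclose(R.T @ R, np.eye(3)))     # stays on SO(3)
```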
3.4 Gradients on Lie groups
The gradient of a function
Equivalently, d_gV can be found from the computation in a chart (U_j, X_j) as (indeed, dual to (24))
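A minimal numerical sketch of this gradient computation is given below: the i-th component of d_gV is taken as the directional derivative of V along the curve g exp(tE_i) at t = 0, approximated by central differences on SO(3). The left-trivialized convention and the basis E_i are assumptions of the sketch; automatic differentiation can replace the finite differences.

```python
# Sketch: gradient of a scalar function V on a matrix Lie group as an element of
# the dual Lie algebra, via directional derivatives along exponential curves.
import numpy as np
from scipy.linalg import expm

E = [np.array([[0, 0, 0], [0, 0, -1], [0, 1, 0]], float),
     np.array([[0, 0, 1], [0, 0, 0], [-1, 0, 0]], float),
     np.array([[0, -1, 0], [1, 0, 0], [0, 0, 0]], float)]

def lie_gradient(V, g, eps=1e-6):
    return np.array([(V(g @ expm(eps * Ei)) - V(g @ expm(-eps * Ei))) / (2 * eps)
                     for Ei in E])

# example: V(R) = trace(I - R) has a critical point at the identity
V = lambda R: np.trace(np.eye(3) - R)
print(lie_gradient(V, np.eye(3)))
```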
3.5 Composition of Lie groups
We briefly review the composition of Lie groups. Lie groups G and H can always be composed to form a product Lie group G × H. For matrix Lie groups
The composition of matrix Lie groups has a block-diagonal structure. This block-diagonal structure reappears in the construction of the corresponding Lie algebra
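A small sketch of this block-diagonal realization, assuming the usual convention that an element (g, h) of G × H is represented as blkdiag(g, h):

```python
# Sketch: product of matrix Lie groups realized as block-diagonal matrices.
# The same block structure carries over to the product Lie algebra, so
# Lie-algebra-level computations decompose per factor.
import numpy as np
from scipy.linalg import block_diag

g = np.array([[0., -1.], [1., 0.]])     # an element of SO(2)
h = np.diag([2.0, 0.5])                 # an element of the group of positive diagonal matrices
gh = block_diag(g, h)                   # the pair (g, h) as one block-diagonal matrix
print(gh)
print(np.allclose(block_diag(g, h) @ block_diag(g, h), block_diag(g @ g, h @ h)))
```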
4. The cases SO(3) and SE(3)
The theory presented above is now applied to the Lie groups SO(3) and SE(3).
4.1 The matrix Lie groups SO(3) and SE(3)
Here, the special orthogonal group SO(3) and the special Euclidean group SE(3) are directly defined as matrix Lie groups that collect transformations of the Euclidean 3-space ℝ³.
Define
Concerning notation for relative poses of rigid bodies: a relative pose is represented by a homogeneous transformation matrix H ∈ SE(3), consisting of a rotation R ∈ SO(3) and a translation p ∈ ℝ³. The Lie algebras of SO(3) and SE(3) are the vector spaces so(3) of skew-symmetric 3 × 3 matrices and se(3), respectively. The vector space isomorphism of equation (3) specializes to the familiar hat map, which identifies ℝ³ with so(3) and ℝ⁶ with se(3).

Concerning notation for the relative twists (velocities) of rigid bodies: consider a curve of poses in SE(3); its velocity is represented by a twist, with angular and linear components collected in a vector in ℝ⁶. The adjoint representations of so(3) and se(3) follow from the definition (6). The exponential maps for SO(3) and SE(3) are almost-global diffeomorphisms that relate the Lie algebras so(3) and se(3) to the groups SO(3) and SE(3). For θ < π their inverses are presented in equations (43) and (44), respectively; the log map for SO(3) is given in Murray et al. (1994, Appendix A, Section 2.3).
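For concreteness, the following sketch implements the closed-form exponential map on SO(3) (the Rodrigues formula) and a log map valid for rotation angles θ < π, in the spirit of the formulas cited from Murray et al. (1994); the SE(3) versions add the corresponding translational part, which this sketch omits.

```python
# Sketch: closed-form exp and log maps on SO(3) for rotation angles theta < pi.
import numpy as np

def hat(w):
    return np.array([[0, -w[2], w[1]], [w[2], 0, -w[0]], [-w[1], w[0], 0.]])

def exp_so3(w):
    th = np.linalg.norm(w)
    if th < 1e-12:
        return np.eye(3) + hat(w)                 # first-order fallback near zero
    K = hat(w / th)
    return np.eye(3) + np.sin(th) * K + (1 - np.cos(th)) * (K @ K)   # Rodrigues formula

def log_so3(R):
    th = np.arccos(np.clip((np.trace(R) - 1) / 2, -1.0, 1.0))
    if th < 1e-12:
        return np.zeros(3)
    w_hat = (R - R.T) * th / (2 * np.sin(th))
    return np.array([w_hat[2, 1], w_hat[0, 2], w_hat[1, 0]])

w = np.array([0.3, -0.2, 0.5])
print(np.allclose(log_so3(exp_so3(w)), w))        # round trip for theta < pi
```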
4.2 Minimal atlas
Here we construct minimal exponential atlases for SO(3) and SE(3) as special cases of the exponential atlas (21) – (23) for the respective Lie groups. Both atlases use four charts, which is the theoretical minimum size of an atlas for SO(3) and SE(3) (Grafarend and Kühnel, 2011).
For the atlas on SO(3) the four exponential charts are centered on the elements
Intuitively speaking, the open set U_j contains all orientations that are reachable from R_j by a rotation through an angle of less than π.
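The sketch below illustrates chart selection for a four-chart atlas of SO(3). The chart centers used here (the identity and the half-turns about the three coordinate axes) are an illustrative choice, not necessarily the centers used in the paper; a rotation R lies in the chart with center R_j whenever the rotation angle of R_jᵀR is less than π, that is, whenever trace(R_jᵀR) > −1.

```python
# Sketch: pick the chart of a four-chart SO(3) atlas in which a rotation R lies
# "deepest", i.e. the center R_j minimizing the angle of R_j^T R.
import numpy as np

def rot(axis, angle):
    K = np.array([[0, -axis[2], axis[1]], [axis[2], 0, -axis[0]], [-axis[1], axis[0], 0.]])
    return np.eye(3) + np.sin(angle) * K + (1 - np.cos(angle)) * (K @ K)

centers = [np.eye(3),
           rot(np.array([1., 0, 0]), np.pi),
           rot(np.array([0, 1., 0]), np.pi),
           rot(np.array([0, 0, 1.]), np.pi)]

def select_chart(R):
    scores = [np.trace(Rj.T @ R) for Rj in centers]   # larger trace means smaller angle
    return int(np.argmax(scores))

print(select_chart(rot(np.array([0, 0, 1.]), 0.99 * np.pi)))   # closest to the z half-turn
```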
A proof that
For the atlas on SE(3), define the centers of the exponential charts on SE(3) as
4.3 Expressing scalar functions
We briefly highlight how to represent scalar-valued functions
For example, on SE(3) one restricts a function defined on the ambient space of 4 × 4 matrices to the group.

Figure: Commutative diagram highlighting how the natural embedding of the Lie group into a space of matrices is used to express scalar functions.
The approach also holds for SO(3), which can be embedded in the space of 3 × 3 matrices.
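As a small illustration, the scalar function below is defined on the flattened 4 × 4 matrix entries and restricted to SE(3) by only ever evaluating it at valid poses; the tiny two-layer network is an illustrative stand-in for the NN-parameterized potentials used later.

```python
# Sketch: a scalar function on SE(3) obtained by restricting a function of the
# 4x4 matrix entries to the group, as done when feeding poses to a neural net.
import numpy as np

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(32, 16)), np.zeros(32)   # 16 inputs = flattened 4x4 pose
W2, b2 = rng.normal(size=(1, 32)), np.zeros(1)

def V(H):
    z = np.tanh(W1 @ H.reshape(16) + b1)
    return float((W2 @ z + b2)[0])

H = np.eye(4)                 # the identity pose
print(V(H))
```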
5. Optimizing a rigid body control
The optimization procedure in Section 2 is applied to potential energy and damping injection based control of a fully actuated rigid body.
The core idea of potential energy shaping and damping injection is to combine advantages of energy-balancing passivity based control (EB-PBC) (Ortega and Mareels, 2000) and of control by interconnection (Ortega et al., 2008), which provide stability guarantees when interfacing with physical systems. Our article presents a class of controllers that generalizes the architecture presented by Rashad et al. (2019). We address common safety concerns about machine learning in control loops by optimizing a class of controllers that guarantees stability and a bounded energy by design.
5.1 Control of a rigid body
The trajectory of a rigid body in Euclidean 3-space is fully described by the curve
The dynamics of a rigid body follow from the Hamiltonian equations on matrix Lie groups ((100) and (101) in Appendix A.2) by setting G = SE(3) and letting
In control by potential energy shaping and damping injection, this external wrench is constructed as a sum of a potential gradient term W_V and a damping term W_D:
Nonlinear, configuration-dependent viscous damping takes the form
In this context, the control architecture of Rashad et al. (2019) corresponds to the popular yet very particular choice of a constant B(H, P) for the damping injection, while their potential V(H) has a quadratic dependence on translations and a nearly quadratic dependence on rotations. Their controller may be interpreted as a linear PD controller on SE(3), whereas ours may be seen as a nonlinear PD controller on SE(3).
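The following sketch assembles such a wrench: the potential term collects directional derivatives of a scalar potential V(H) along the se(3) basis directions, and the damping term applies B(H, P) to the body twist. The sign convention, the pairing convention, and the placeholder choices of V and B are assumptions of this sketch rather than the paper's exact controller.

```python
# Sketch: potential shaping plus damping injection wrench on SE(3).
import numpy as np
from scipy.linalg import expm

def se3_basis():
    E = []
    for k in range(3):                       # rotational generators
        A = np.zeros((4, 4)); i, j = [(1, 2), (2, 0), (0, 1)][k]
        A[i, j], A[j, i] = -1.0, 1.0
        E.append(A)
    for k in range(3):                       # translational generators
        A = np.zeros((4, 4)); A[k, 3] = 1.0
        E.append(A)
    return E

E = se3_basis()

def control_wrench(V, B, H, twist, P=None, eps=1e-6):
    W_V = np.array([(V(H @ expm(eps * Ei)) - V(H @ expm(-eps * Ei))) / (2 * eps) for Ei in E])
    W_D = B(H, P) @ twist                    # configuration-dependent viscous damping
    return -(W_V + W_D)                      # assumed sign convention of this sketch

# illustrative potential pulling H toward the identity pose, and constant damping
V = lambda H: 0.5 * np.sum((H - np.eye(4)) ** 2)
B = lambda H, P: 2.0 * np.eye(6)
H, twist = np.eye(4), np.array([0.1, 0, 0, 0, 0, 0.2])
print(control_wrench(V, B, H, twist))
```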
5.2 Stability
We present here a general proof of stability for the class of controllers.
Stability. Given the system (56), (57) together with the controller (58) given as
5.3 Optimization by the adjoint method on SE(3)
In order to apply the adjoint method on Lie groups (Theorem 2.1), we re-define the dynamics (56) and (57) on the Lie group G = SE(3) × se*(3). To this end, we choose to equip G with the element-wise composition (H_1, P_1) ◦ (H_2, P_2) = (H_1 H_2, P_1 + P_2).
As a composition of matrix Lie groups SE(3) and
Given a cost
As in Section 2.3, approximate
The dynamics of λ_Γ follow from equation (17) as
The gradient
Further, split λ_Γ = (λ_H, λ_P) into components
Note that the second derivative term
Rather than constructing the direct product group SE(3) × se*(3), the semi-direct product group SE(3) ⋉ se*(3) could have been defined using the Coadjoint representation
6. Simulations
We numerically solve the optimization problem (11) for the dynamics (67). We investigate various choices of final and running costs, distributions
6.1 Quadratic vs. general potential shaping
A controller with quadratic potential and linear damping injection (Section 6.1.2) is compared to a controller with NN-parameterized potential and damping injection (Section 6.1.3).
6.1.1 Choice of cost C and distribution
We determine a final cost F and a running cost r to stabilize a static target state with H = H_F, p = 0 over a horizon of T = 3 seconds. The key properties of F and r are that both are differentiable and have their minimum in the target pose. Denote components of H and P
Given scalars
6.1.2 Quadratic potential and linear damping injection
The quadratic controller coincides with the controller presented by Rashad et al. (2019), in a setting of motion control. As such the quadratic potential
The control-law is then of the form
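For illustration, a quadratic potential with constant (linear) damping injection in the spirit of Rashad et al. (2019) can be sketched as below; the gains and the exact functional form used in the paper may differ, so the expressions are an illustrative stand-in rather than the paper's controller.

```python
# Sketch: a quadratic potential on SE(3) (trace-based rotational term, quadratic
# translational term) with a constant damping injection matrix.
import numpy as np

K_R = np.diag([2.0, 2.0, 2.0])      # rotational stiffness (illustrative)
K_p = np.diag([5.0, 5.0, 5.0])      # translational stiffness (illustrative)
B_const = 1.5 * np.eye(6)           # constant damping injection matrix

def V_quadratic(H, H_target=np.eye(4)):
    R_err = H_target[:3, :3].T @ H[:3, :3]          # relative rotation
    p_err = H[:3, 3] - H_target[:3, 3]              # relative translation
    return float(np.trace(K_R @ (np.eye(3) - R_err)) + 0.5 * p_err @ K_p @ p_err)

def B_linear(H, P):
    return B_const                                   # state-independent damping

print(V_quadratic(np.eye(4)))                        # zero at the target pose
```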
Table: Hyperparameters corresponding to optimizing the quadratic controller in Section 6.1.2.

Figure: Training progress of the quadratic controller characterized by V_{Q,θ} and B_{Q,θ}. All panels show data averaged over 2048 sample trajectories at the given epoch, with initial conditions sampled from the initial distribution.

Figure: Performance of the quadratic controller characterized by V_{Q,θ} and B_{Q,θ}, over 100 trajectories of rigid bodies with initial conditions sampled from the initial distribution.
6.1.3 Nonlinear potential and damping injection
Here we showcase the optimization of a nonlinear potential
The control-law is of the form
Table: Hyperparameters corresponding to optimizing the nonlinear controller in Section 6.1.3.
The training progress is summarized in Figure 6. It can be seen that the final loss of the nonlinear controller in Figure 6(a) is equivalent to that of the quadratic controller in Figure 4(a). In particular, the performance of the quadratic and the nonlinear controller is close in this scenario: the final angle and distance in Figures 6(b) and 6(c) are comparable to those of Figures 4(b) and 4(c), respectively. The resulting controller's performance is shown in Figure 7. Here the qualitative behavior shown in Figures 7(a), 7(b), and 7(c) resembles that of the quadratic case in Figures 5(a), 5(b), and 5(c), respectively.

Figure: Training progress of the nonlinear controller characterized by V_{N,θ} and B_{N,θ}. All panels show data averaged over 2048 sample trajectories at the given epoch, with initial conditions sampled from the initial distribution.

Figure: Performance of the nonlinear controller characterized by V_{N,θ} and B_{N,θ}, over 100 trajectories of rigid bodies with initial conditions sampled from the initial distribution.
6.2 General potential shaping with gravity
We optimize an NN-parameterized potential and damping injection in a system with gravity in Section 6.2.2, and show the effect of an adapted target configuration in Section 6.2.3.
6.2.1 Adapted running cost r
In the presence of a gravitational potential
To take gravity into account in the optimization, the only required adaptation is to use the adapted W_θ(H, P) in the running cost (76).

Minimizing the term ‖W_θ(H, P)‖ indirectly minimizes the required gravity compensation by reducing the total wrench exerted on the plant. Indeed, when ‖W_θ‖ = 0 the dynamics are
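A sketch of such an adapted running cost is given below: it adds a penalty on the exerted wrench W_θ(H, P) to illustrative state-error terms, so that the optimization trades regulation accuracy against (gravity-compensating) control effort. The error terms and all weights are assumptions of the sketch.

```python
# Sketch: running cost with a penalty on the total exerted wrench W_theta(H, P).
import numpy as np

def running_cost(H, P, W_theta, H_target=np.eye(4),
                 w_pose=1.0, w_momentum=0.1, w_effort=0.01):
    pose_error = np.sum((H - H_target) ** 2)    # illustrative pose error term
    momentum = np.sum(np.asarray(P) ** 2)       # penalize residual momentum
    effort = np.sum(np.asarray(W_theta) ** 2)   # wrench penalty of the adapted cost
    return w_pose * pose_error + w_momentum * momentum + w_effort * effort

print(running_cost(np.eye(4), np.zeros(6), np.zeros(6)))   # zero at rest at the target
```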
6.2.2 Nonlinear potential and damping injection
Table: Hyperparameters corresponding to optimizing a nonlinear controller including gravity in the cost, as in Section 6.2.

Figure: Training progress of the nonlinear controller characterized by V_{N,θ} and B_{N,θ} in the presence of gravity. All panels show data averaged over 2048 sample trajectories at the given epoch, with initial conditions sampled from the initial distribution.

Figure: Performance of the nonlinear controller characterized by V_{N,θ} and B_{N,θ} in the presence of gravity. The results show 100 trajectories of rigid bodies with initial conditions sampled from the initial distribution.
6.2.3 Asymmetric initial distribution
To better highlight the influence of the adapted running cost, and how the optimization in the presence of gravity differs from an optimization in the absence of gravity, an initial distribution asymmetric about the goal pose H_F (77) is introduced by choosing
The parameters of this training coincide with those of the symmetric scenario, and are likewise summarized in Appendix C.1, Table 3. The training progress is summarized in Figure 10, and the resulting controller's performance is shown in Figure 11.

Figure: Training progress of the nonlinear controller characterized by V_{N,θ} and B_{N,θ}, in the presence of gravity and with an initial distribution whose mean is above the target position. All panels show data averaged over 2048 sample trajectories at the given epoch, with initial conditions sampled from the initial distribution.

Figure: Performance of the nonlinear controller characterized by V_{N,θ} and B_{N,θ}, in the presence of gravity and with an initial distribution whose mean is above the target position. The results show 100 trajectories of rigid bodies with initial conditions sampled from the initial distribution.
7. Discussion
7.1 Neural ODEs on Lie groups
The proposed formulation of neural ODEs on Lie groups immediately applies to arbitrary matrix Lie groups, where parameterized maps can be learned with global validity. The optimization of neural ODEs on Lie groups by gradient descent via the generalized adjoint method is a scalable approach. The key aspects that contribute to this scalability are the following: first, the generalized adjoint method on Lie groups preserves the memory efficiency of the generalized adjoint method used for neural ODEs on ℝ^n.
The work can be generalized further: Theorem 2.1 assumes the cost to be of the form (10), while the derivation in Appendix A.4 in principle allows for a more general choice of cost that may be of interest in, for example, learning of periodic trajectories (Wotte et al., 2023). The accompanying code is currently written specifically for the Lie group
7.2 Optimal potential shaping
The optimization of an NN-parameterized potential and damping injection was successful, and the large number of parameters used in the optimization confirms that the approach scales to scenarios with many parameters. The optimization was also successful when including gravity in a nonlinear running cost. Stability was guaranteed by design, by implementing the requirements of Theorem 5.1 at the level of the architecture and activation functions. As a further advantage, the resulting controller is global on SE(3), as opposed to being applicable only in a limited chart region.
Regarding limitations of the approach, the numerical stability of the adjoint method on SE(3) was observed to strongly depend on the smoothness of the running cost, which suggests added value in considering different Lie group integrators that accommodate this lack of smoothness. Lastly, while the structure of the presented controller is highly interpretable and the various components of the energy are readily visualized, the space of possible initial conditions and trajectories remains large, and the high-dimensional state-space obscures low-level properties and a deep understanding of the eventual controller, beyond safety guarantees and numerical verification of stability.
Alternative choices for the final and running costs, as well as the weights in these costs, are worth investigating. The design space of possible controllers is also large, and other control architectures may be advantageous. In future work the controller will be applied to a real drone, and other cost-functions and control structures will be investigated.
8. Conclusion
Lie groups are ubiquitous in engineering, and so are dynamic systems on Lie groups. We proposed a method for dynamics optimization that works on arbitrary finite-dimensional Lie groups and for a large class of cost-functions. The resulting method is highly scalable and more compact than alternative manifold formulations. The key steps in the formulation use the canonical Lie group structure to create a compact gradient descent algorithm: we phrase the generalized adjoint method at the Lie algebra level, we use a compact expression for the gradient as an element of the dual of the Lie algebra, and we use a generic Lie group integrator for the dynamics. The method was successfully applied to optimize a controller for a rigid body that is globally valid on the Lie group SE(3). A key aspect of choosing the class of controllers was stability by design, which guided the architecture of the neural nets that parameterize the potential energy shaping and damping injection controller.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
This research was supported by the PortWings project funded by the European Research Council [Grant Agreement No. 787675].
Appendix
A The generalized adjoint method on Lie groups
In this Appendix the generalized adjoint method on matrix Lie groups (Theorem 2.1) is derived in four steps.
