Sage Journals: Discover world-class research

Abstract

Model-based approaches to navigation, control, and fault detection that utilize precise nonlinear models of vehicle plant dynamics will enable more accurate control and navigation, assured autonomy, and more complex missions for such vehicles. This paper reports novel theoretical and experimental results addressing the problem of parameter estimation of plant and actuator models for underactuated underwater vehicles operating in 6 degrees-of-freedom (DOF) whose dynamics are modeled by finite-dimensional Newton-Euler equations. This paper reports the first theoretical approach and experimental validation to identify simultaneously plant-model parameters (parameters such as mass, added mass, hydrodynamic drag, and buoyancy) and control-actuator parameters (control-surface models and thruster models) in 6-DOF. Most previously reported studies on parameter identification assume that the control-actuator parameters are known a priori. Moreover, this paper reports the first proof of convergence of the parameter estimates to the true set of parameters for this class of vehicles under a persistence of excitation condition. The reported adaptive identification (AID) algorithm does not require instrumentation of 6-DOF vehicle acceleration, which is required by conventional approaches to parameter estimation such as least squares. Additionally, the reported AID algorithm is applicable under any arbitrary open-loop or closed-loop control law. We report simulation and experimental results for identifying the plant-model and control-actuator parameters for an L3 OceanServer Iver3 autonomous underwater vehicle. We believe this general approach to AID could be extended to apply to other classes of machines and other classes of marine, land, aerial, and space vehicles.

Keywords

Calibration and identification adaptive systems parameter identification learning and adaptive systems mechanics design and control underactuated robots

1. Introduction

This paper addresses the problem of simultaneously estimating the plant-model and control-actuator parameters for an underactuated underwater vehicle (UV) operating in 6 degrees-of-freedom (DOF). In particular, we report a new adaptive identification (AID) algorithm, including a novel proof of asymptotic parameter convergence, and associated simulation and experimental results.

To the best of our knowledge, this paper reports the first theoretical approach and experimental validation to identify simultaneously UV dynamical plant-model parameters (for critical terms such as mass, added mass, hydrodynamic drag, and buoyancy) and control-actuator parameters (for control-surface models and thruster models) in 6-DOF. Most previously reported studies on parameter identification assume that the control-actuator parameters are known a priori. Moreover, this paper reports the first proof of convergence of the parameter estimates to the true set of parameters for this class of vehicles when a persistence of excitation condition is satisfied. Most previously reported AID approaches for this class of systems only show stability and boundedness of the parameter estimates.

The performance of a dynamic model in predicting a robotic vehicle’s state depends primarily on the accuracy of the model structure and parameters. Accurate estimates of these parameters are required for model-based navigation, control, fault detection, and simulation. Indeed, a broad class of nonlinear model-based trajectory-tracking controllers for robot arms developed since the 1980s require exact knowledge of the plant’s kinematic and dynamic parameters—examples include nonlinear tracking controllers for second-order rotating plants Koditschek (1988) and exactly linearizing model-based trajectory-tracking controllers for open kinematic chains Freund (1983); Luh et al. (1980).

While the general form of dynamical plant models for underwater vehicles has been understood for decades, Society of Naval Architects and Marine Engineers (U.S.). Technical and Research Committee. Hydrodynamics Subcommittee (1950), the dynamic-model parameters—that is, parameters for terms including mass, added mass, hydrodynamic drag, buoyancy, and control actuators—are impossible to determine analytically and are often not provided by manufacturers of the robotic vehicles. In contrast to plant kinematic parameters, which are often easily obtained from design documents and generally do not vary with time, plant dynamic parameters generally must be measured empirically and may change over time. Sources of such changes may include but are not limited to: substantial payload changes to the vehicle or robotic arm; physical changes to the vehicle, such as lengthening or shortening a UV to accommodate different payloads; variation in ballasting and trim conditions; variation in water density which affects vehicle buoyancy; and changes to on-board equipment and instrumentation (both external and internal). Model parameters must be re-estimated whenever the robot physical configuration is significantly altered.

Estimating plant-model parameters is generally more difficult for underactuated, torpedo-shaped UVs than for fully actuated UVs with thrusters because: (i) the reduced actuation available on underactuated UVs limits the plant excitation that can be induced from the control inputs, and (ii) torpedo-shaped UVs and many uncrewed aerial vehicles (UAVs) are often actuated with control surfaces (e.g., fins, wings, elevators, and rudders), which are difficult to characterize independently of the plant-model parameters.

For these reasons, we seek an approach to parameter estimation for underactuated UVs in 6-DOF that simultaneously estimates plant and actuator parameters and can be performed routinely in the field with minimal time and effort by the vehicle operator. We believe the general approach to AID reported here is not limited to underwater vehicles and could be extended to apply to other classes of machines and other classes of marine, land, aerial, and space vehicles.

1.1. Motivation for accurate model identification

Model-based approaches to navigation, control, and fault detection that utilize precise nonlinear models of vehicle dynamics will enable more accurate control and navigation, assured autonomy, and more complex missions for robotic vehicles. In particular:

(1) Accurately identified model parameters can be utilized in forward simulation to predict vehicle performance and tune gains for various controllers (open-loop and closed-loop), navigation systems, and motion-planning algorithms.

(2) Accurately identified model parameters can be utilized in model-based state-estimation (navigation) systems that utilize the full UV plant model in their solution Harris and Whitcomb (2018).

(3) Accurately identified model parameters can enable model-based control algorithms that offer improved performance over traditional controllers. For example, previously reported studies have shown that nonlinear adaptive model-based control (AMBC) can outperform proportional derivative control (PDC) in trajectory tracking for fully actuated UVs McFarland and Whitcomb (2014) and fully actuated robot arms Craig et al. (1987); Slotine and Li (1987); Sadegh and Horowitz (1990); Whitcomb et al. (1993).

(4) Online approaches for identifying plant parameters may lead to new methods for online fault detection and fault-tolerant control of UVs, for example Mao and Whitcomb (2021).

1.2. Common approaches to parameter identification

The three most common approaches to model-parameter estimation for vehicles with dynamics that cannot be fully characterized analytically, such as UVs, are:

(1) utilizing data obtained in captive-motion experiments (e.g., towing a vehicle in a hydrodynamic test tank or wind tunnel),

(2) utilizing data from computational fluid dynamics (CFD) simulations, or

(3) utilizing data obtained in full-scale experimental trials of a robotic vehicle in controlled free motion (e.g., under open-loop or closed-loop control).

Captive-motion experiments are the standard in naval architecture Van Manen and Van Ossanen (1988); however, they are time consuming, expensive, and difficult to perform properly. Further, the results are valid only for the specific vehicle configuration tested, and it is often impractical to repeat the experiments for every possible configuration of the robotic vehicle. Typically, the experimental setup involves either rotating-arm experiments Van Manen and Van Ossanen (1988) or planar motion mechanism (PMM) experiments, such as towing a vehicle on a carriage with a load cell at a fixed velocity in a hydrodynamic test facility, for example, Prestero (2001). Captive-motion experiments can be quite accurate in certain DOF, but these experiments often require decoupling the DOF, which can lead to model inaccuracies. Typically, separate tests are conducted with and without the control surfaces (fins) installed in order to isolate the fin drag from the body drag, and separate test facilities are often required to characterize the fin lift and drag as a function of the angle of attack.

Parameter estimation based on computational fluid dynamics uses numerical models of fluid flow around the vehicle to simulate virtual PMM tests. CFD-based approaches to model identification vary widely in accuracy, time, and cost. Additionally, CFD-based approaches require detailed computer-aided design (CAD) models of the UV, which may be difficult or impossible to obtain from commerical off-the-shelf (COTS) UV manufacturers. If a CAD model is created by the end user for a COTS UV, often the vehicle is modeled as a smooth hull—appendages such as antennas, strobe lights, sensors (e.g., CTD sensors and sonar transducers), and acoustic modem transducers are left unmodeled Liu et al. (2020). An advantage of CFD-based parameter estimation, however, is the ability to inform design choices during the preliminary vehicle-design process Phillips et al. (2010).

While these approaches have advantages, both captive-motion experiments and CFD-based approaches are often infeasible for many UV end users for reasons of cost, time, and practicality.

Parameter identification based on data collected in full-scale experimental trials of UVs in controlled free motion has several advantages over captive-motion experiments and CFD. First, the approach is accessible to any end user who can deploy an existing UV. Additionally, though beyond the scope of this paper, some approaches that utilize full-scale experimental data, such as the approach described herein, can be extended to run in real time during vehicle missions. Such algorithms can also enable online model-based fault detection and fault isolation Mao and Whitcomb (2021).

1.3. Parameter identification of underactuated robotic vehicles

Many robots, such as torpedo-shaped autonomous underwater vehicles (AUVs), aerial drones, and aerospace vehicles are not fully actuated. Instead, these robots are underactuated. We adopt the following definition for Underactuated Control Differential Equations from Tedrake (2023): A second-order control differential equation described by the equations $\ddot{q} = f (q, \dot{q}, u, t)$ is fully actuated in state $x = {[q^{T}, {\dot{q}}^{T}]}^{T}$ and time t if the resulting map f is surjective: for every $\ddot{q}$ there exists a control input u such that $\ddot{q} = f (q, \dot{q}, u, t)$ which produces the desired response. Otherwise it is underactuated (at state x at time t).

The robotic vehicle used to validate our theoretical results, an Iver3 (L3Harris OceanServer, Fall River, MA, USA) AUV, is clearly underactuated. The vehicle has only five control inputs (4 commanded fin angles and 1 commanded main propeller angular velocity), fewer than the UV’s 6-DOF. Thus, the Iver3 robotic vehicle is underactuated by definition, regardless of the particular method used to model the actuators. The Iver3 belongs to the common class of torpedo-shaped AUVs. This class of UV differs significantly from fully-thruster-actuated remotely operated vehicles (ROVs), where the thrusters are often capable of exerting arbitrary 6-DOF forces and moments on the vehicle. Torpedo-shaped UVs are controllable only when in forward motion, are generally incapable of hovering, and are physically unable to track fully general 6-DOF reference trajectories.

Parameter estimation of underactuated UVs presents real theoretical and practical challenges. A main concern is whether the UV parameters are observable. In other words: do the control actuators have sufficient control authority to excite the plant so that the plant model parameters are observable? The authors have previously defined necessary and sufficient conditions for mass and rotational-inertia parameters in rigid-body dynamical plants to be uniformly completely observable (UCO) Paine and Whitcomb (2021). Previously reported simulation studies confirmed that underactuated rigid-body plants can meet these conditions. We conjectured that a similar approach could be used to show UCO of other parameters such as drag coefficients and gravity terms. In this paper, via the equivalent concept of persistence of excitation (PE), we define sufficient conditions for convergence of plant and actuator parameter estimates to a true parameter set.

To the best of our knowledge, with the exception of Harris et al. (2018), Harris (2019), Paine (2018), upon which this work is based, there are no previously reported studies for parameter identification of underactuated robotic vehicles that simultaneously identify plant-model parameters and actuation parameters in 6-DOF.

With few exceptions, namely McFarland and Whitcomb (2021), many previously reported AID methods require model-based adaptive tracking controllers and are not applicable when the UV is operating under any control law other than a specific adaptive tracking controller. On commercially available robotic vehicles, including UVs, the user is often limited to using the proprietary control system provided by the manufacturer, and the user cannot easily replace the manufacturer’s proprietary controller with an adaptive trajectory-tracking controller (ATTC). The AID approach reported herein works for robotic plants operating under any control inputs (specifically, it not limited to the case of trajectory-tracking control), and is thus applicable in the common situation of a UV operating with a proprietary, manufacturer-provided controller or in open-loop control.

In summary, the advantages of this AID approach include:

• This AID method simultaneously estimates both plant and actuator parameters that are present in a physics-based model of robot dynamics, either fully actuated or underactuated. Most previously reported studies on parameter identification assume that the control-actuator parameters are known a priori.

• This AID method is the first reported AID approach with a proof of convergence of the parameter estimates to the true set of parameters for this class of vehicles, provided that a PE condition is satisfied. Most previously reported AID approaches for this class of systems only show stability and boundedness of the parameter estimates.

• This AID method, like other adaptive identifiers, does not require body-relative linear and angular acceleration signals, which is beneficial because body-relative linear acceleration terms are difficult to obtain. Although not discussed in detail here, we find this AID method is more robust to sensor noise when compared to the least-squares–based methods described in Harris et al. (2018).

• This AID method ensures stability and asymptotic convergence of the identification plant velocity error and convergence of the parameter estimates, in contrast to many machine learning (ML) approaches which are generally unable to provide such guarantees.

• The simulation and experimental studies reported here show that AID-estimated parameters can converge with a “single training set” containing on the order of 1000 seconds of free-motion AUV dive data, and the performance of the resulting model was verified in both self-validation and cross-validation.

• This AID method may be performed by a user in the field when the plant dynamics change, unlike captive-motion experiments or CFD.

• This AID method works with any open or closed-loop control law, provided that a PE condition is satisfied. Unlike many previously reported AID methods, this method does not require a trajectory-tracking controller, which is often unsuitable for underactuated vehicles without control authority in all DOF.

2. Literature review

Most previously reported model-based parameter identification methods employ one of two general approaches: (i) least-squares linear regression of experimental data or (ii) adaptive model-based trajectory-tracking control and adaptive identification. Khalil and Dombre provide a detailed overview of these approaches Khalil and Dombre (2002). We separately group machine learning (ML) and neural network (NN) techniques as a third type of method for system identification, because these methods do not explicitly use a dynamic model for all or part of the plant. Instead, these methods largely map plant input-to-output behavior. The following section describes recent work in these three areas and compares the different approaches.

2.1. Least squares and Kalman filtering

A variety of previously reported studies have employed least-squares, total least-squares, or extended Kalman filters to identify plant parameters entering linearly into the plant equations of motion for robot manipulators Khosla and Kanade (1985); Atkeson et al. (1986); An et al. (1988); Armstrong et al. (1986); Swevers et al. (2007), uncrewed underwater vehicles (UUVs) Caccia et al. (2000); Alessandri et al. (1998); Martin and Whitcomb (2014); Harris et al. (2018), and spacecraft Norman et al. (2011); Keim et al. (2006). Most methods require the time derivative of joint angles or time derivative of 6-DOF body-relative linear and angular velocity, which is difficult to measure directly, and often is numerically estimated by differentiating sensor measurements—a process that is sensitive to sensor noise. Some studies ameliorate the problem of sensor noise through use of signal filtering. However, unless performed using an acausal method, filtering introduces time delay. These issues are (sometimes) adequately handled during offline data processing, but pose a significant challenge for online parameter estimation.

2.2. Adaptive trajectory-tracking control

Adaptive model-based tracking controllers have been proposed for both fully actuated McFarland and Whitcomb (2014), Sahu and Subudhi (2014) and underactuated marine vehicles Jiang (2002), Aguiar and Hespanha (2007), spacecraft Ahmed et al. (1998), Wong et al. (2001), and mobile robots Fukao et al. (2000), Shojaei et al. (2011). However, precise definitions for persistence of excitation (PE) to ensure asymptotic convergence of parameter-estimation error are absent from many studies, such as Shojaei et al. (2011), Sahu and Subudhi (2014), Wong et al. (2001). PE-like conditions were reported in Aguiar and Hespanha (2007), with a focus on stability of the trajectory-tracking error and robustness to parametric uncertainty. Necessary and sufficient PE conditions for asymptotic identification were reported in Ahmed et al. (1998), but only for parameters in the inertia matrix for 3-DOF rotational plants.

In Craig et al. (1987) the authors report a stable adaptive trajectory-tracking algorithm for fully actuated robot arms that require instrumentation of joint acceleration. In Slotine and Li (1987); Sadegh and Horowitz (1990); Whitcomb et al. (1993) the authors report three different stable adaptive trajectory-tracking algorithms for fully actuated robot arms that do not require instrumentation of joint acceleration

2.2.1. Relationship of adaptive identification to adaptive trajectory tracking control

The present study addresses AID; it does not address the problem of simultaneously performing adaptive model identification and model-based trajectory tracking control, that is, ATTC. There are numerous previously reported results for ATTC of mechanical systems in which the plant state is controlled to follow a smooth trajectory known a priori. ATTC, however, will not work when either (i) the plant is underactuated or (ii) the plant is controlled by an existing controller that cannot be modified, as is often the case with commercially available robot systems, or (iii) the plant is under open-loop control, as is often the case for system identification experiments in the early stages of vehicle development.

In general, papers that report novel adaptive model-based tracking controllers prove convergence of the trajectory-tracking error, and prove stability and boundedness (but not convergence) of the parameter-estimation error.

2.3. Adaptive parameter estimation

Adaptive methods for parameter identification of linear time-invariant plants are well understood Narendra and Annaswamy (1989), Sastry and Bodson (1989), Astrom (1989), Tao (2003). In Hsu et al. (1987), the authors report an AID algorithm for robot arms that requires instrumentation of joint acceleration, as well as a low-pass filter approach for the parameter update law to avoid this requirement.

Relatively few studies have considered model-parameter identification of underactuated UVs, for example, Graver et al. (2003), or underactuated robot arms, for example, Ayusawa et al. (2014). To the best of our knowledge, only two previous studies, Paine and Whitcomb (2018), McFarland and Whitcomb (2021), have addressed AID of underactuated UVs.

Few UV parameter identification methods which do not require direct instrumentation of acceleration or trajectory-tracking controllers have been reported. The AID algorithm presented here may prove useful in applications with controlled or uncontrolled plants in which reference trajectory tracking is impractical or infeasible.

AIDs for fully actuated multi-DOF UV plant models were first reported by Smallwood and Whitcomb in Smallwood and Whitcomb (2003), but this AID was limited to fully diagonal plant models in which the dynamics of each DOF is fully decoupled and independent from the dynamics of other DOF. McFarland and Whitcomb in McFarland and Whitcomb (McFarland and Whitcomb, 2013, 2021) report an AID for fully coupled 6-DOF UV plant models; the AID algorithm applies to fully actuated or underactuated plants, and the experiments reported utilized 6-DOF actuation. That study is the foundation for the AID reported here. In Paine and Whitcomb (2018), Paine and Whitcomb reported an extension of the AID reported in McFarland and Whitcomb (2013) to 3-DOF underactuated UV plant models and includes a simulation study with Gaussian noise.

2.4. Machine learning

In their pioneering paper Narendra and Parthasarathy (1990), Narendra and Parthasarthy developed the theoretical foundations for the application of neural networks (NNs) to perform system identification of nonlinear dynamical plants. More recently, experimental results using ML and NN-based identification methods have been reported for underactuated UVs Wehbe et al. (2017), Wehbe and Krell (2017), Karras et al. (2013). Broad interest in using these methods for system identification has led to a variety of different approaches.

In general, ML approaches to model identification appear to be divided on the level to which they utilize physics-based models. Some authors caution against adopting techniques and algorithms from domains void of underlying physics (such as natural language processing) to the natural sciences; instead, these authors argue specifically for increased usage of physics-based models in ML Willcox et al. (2021). An interesting hybrid approach is reported in Woo et al. (2018), wherein the authors decompose the dynamics of a uncrewed surface vessel (USV) into linear and nonlinear terms; they estimate the parameters corresponding to the linear terms using the parameter-estimation technique from Sonnenburg and Woolsey (2013) and use ML for the nonlinear model. A similar hybrid approach for UVs was proposed in Van de Ven et al. (2007). In this case, the underlying structure of the nonlinear dynamic model is maintained via the careful design of separate neural networks to estimate each of the mass, damping, and buoyancy terms. However, other authors in ML, for example, Bagherzadeh (2018), argue against the use of a predetermined structure, saying that the use of predetermined model structure reduces the problem of model identification to one of parameter estimation, which may not fully capture the nonlinearities of the dynamics. Clearly, there is some disagreement in this field, and some researchers report satisfactory results obtained via their particular approach to system identification.

2.5. Approach comparison

We can offer some comparison of these approaches. Least-squares estimation is often performed offline with some signal filtering, whereas the adaptive techniques, including the AID reported herein, can be implemented online. Additionally, least-squares estimation typically requires signals of translational and rotational body-frame acceleration, while AIDs generally require only body-frame velocities. Although some argue that ML and NN methods may be able to perform system identification by capturing complex nonlinear dynamics that are difficult to model, these methods are not guaranteed to be stable or, in some cases, to generalize beyond the training data set. Additionally, in most cases, significant computational time and training data are needed to complete estimates using ML and NN methods. On the other hand, the adaptive techniques can be proven to be stable and to provide asymptotic error convergence, as we will show in Section 5.2 and 5.3.

3. Mathematical conventions

For a matrix $A \in R^{n \times n}$ , we denote eigenvalues ordered from minimum to maximum magnitude as λ_min(A) ≤ λ_n−1(A)… ≤ λ₂(A) ≤ λ_max(A). For a vector $x \in R^{n}$ , we define the following vector norm:

• Euclidean (L²) norm:

{‖ x ‖}_{2} = {(x^{T} x)}^{1 / 2}

(1)

We also define the following norms on the matrix A:

• Spectral norm:

‖ A ‖_{2} = \max_{{‖ x ‖}_{2} = 1} ‖ A x ‖_{2}

(2)

• Frobenius norm:

{‖ A ‖}_{F} = {(\sum_{i = 1}^{n} \sum_{j = 1}^{n} a_{i j}^{2})}^{\frac{1}{2}}

(3)

noting that ‖A‖₂ ≤ ‖A‖_F. When not specified, the ‖ ·‖₂ norm is implied for both vectors and matrices.

For a vector $x \in R^{n}$ , we also define the functions:

• The diagonal matrix operator $diag (\cdot) : R^{n} \to R^{n \times n}$ such that $\forall x \in R^{n}$ ,

diag (x) = [\begin{array}{c} x_{1} & 0 & \dots & 0 \\ 0 & ⋱ & ⋱ & ⋮ \\ ⋮ & ⋱ & ⋱ & 0 \\ 0 & \dots & 0 & x_{n} \end{array}]

(4)

• The skew-symmetric operator ${(\cdot)}^{\land} : R^{3} \to R^{3 \times 3}$ such that $\forall x = {[x_{1} x_{2} x_{3}]}^{T} \in R^{3}$ ,

x^{\land} = [\begin{array}{c} 0 & - x_{3} & x_{2} \\ x_{3} & 0 & - x_{1} \\ - x_{2} & x_{1} & 0 \end{array}]

(5)

• The se(3) adjoint operator $a d (\cdot) : R^{6} \to R^{6 \times 6}$ such that $\forall v = {[ν^{T} ω^{T}]}^{T} \in R^{6}$ ,

a d_{s e (3)} (v) = [\begin{array}{c} ω^{\land} & 0_{3 \times 3} \\ ν^{\land} & ω^{\land} \end{array}]

(6)

4. Finite-dimensional dynamical plant models for 6-DOF vehicles

We consider the class of plants which can be modeled by the Newton-Euler equations in body-relative coordinates. We assume the inertial terms include the mass of the body used in classical rigid-body dynamics, and also include the additional “added mass” or “hydrodynamic mass” arising from the accelerating/decelerating submerged body’s interaction with surrounding fluid. Furthermore, we consider that such plants can be subject to quadratic drag forces and net gravitational forces and moments.

4.1. State and control input representation

• $η (t) = {[x {(t)}^{T} φ {(t)}^{T}]}^{T} \in R^{6}$ is the position and orientation of the body frame with respect to an inertial frame of reference, which contains the position of the body-frame origin $x (t) \in R^{3}$ and its orientation in Euler angles $φ (t) \in R^{3}$ . We assume that roll and pitch are bounded to avoid the kinematic singularities arising in the Euler angle representation.

• $v (t) = {[ν {(t)}^{T} ω {(t)}^{T}]}^{T} = {[v_{1} v_{2} v_{3} v_{4} v_{5} v_{6}]}^{T} \in R^{6}$ is the body-frame velocity, which contains the linear body velocity, $ν (t) = {[v_{1} v_{2} v_{3}]}^{T} \in R^{3}$ , and angular body velocities, $ω (t) = {[v_{4} v_{5} v_{6}]}^{T} \in R^{3}$ .

• $\dot{v} (t) = {[{\dot{v}}_{1} {\dot{v}}_{2} {\dot{v}}_{3} {\dot{v}}_{4} {\dot{v}}_{5} {\dot{v}}_{6}]}^{T} \in R^{6}$ is the time derivative of body-frame velocity.

• $ξ (t) \in R^{k + i}$ is the control input vector, with k propellers and i hydrodynamic control surfaces (e.g., fins).

The control inputs ξ(t) for the specific vehicle studied in this paper are defined in Section 4.3 and Appendix A.

4.2. Commonly accepted plant model

This section describes the commonly accepted finite-dimensional nonlinear equations of motion for 6-DOF vehicles subject to quadratic drag and gravitational forces. We will provide a brief history of the development of this model for 6-DOF underwater vehicles. However, we argue that the nonlinear dynamics of many 6-DOF plants, including terrestrial, aerospace, and underwater vehicles, can be modeled by these same equations of motion, which are simply the Newton-Euler equations with additional terms to represent drag, buoyancy, and control forces.

4.2.1. History of the development of plant models for underwater vehicles

The dynamics of a rigid-body underwater vehicle includes the finite-dimensional dynamics of the rigid-body vehicle body itself and the infinite-dimensional dynamics of the fluid surrounding the vehicle. While the former can be described by a finite-dimensional ordinary differential equation (ODE), the latter is described by the incompressible Navier–Stokes equation. With the exception of a few special cases, the infinite-dimensional Navier–Stokes equation has no closed-form solution and, for the case of marine vehicles, generally cannot be solved in real time (Larsson et al. 1998).

The most commonly accepted finite-dimensional models for submarine vehicles trace their lineage to studies beginning in the late 1950s and early 1960s at the U.S. Navy’s David Taylor Model Basin, (Goodman, 1960, Imlay, 1961), which referred to mass of the system resulting from immersion in a fluid as “added mass” and “derivative mass.” In Gertler and Hagen (1967), Feldman (1979) the authors expanded upon these equations, which have since become widely known as the “Standard equations of motion for submarine simulation” (Gertler and Hagen, 1967) and the “DTNSDC revised standard submarine equations of motion” (Feldman, 1979). More recently, Fossen in Fossen (1994, 2002) built upon these earlier studies to express in matrix form the equations of motion of an immersed body subject to non-conservative hydrodynamic forces. The matrix model is now commonly used throughout the literature.

4.2.2 Plant model

The commonly accepted second-order finite-dimensional dynamic model of this class of dynamical plants, as described in a body-relative frame coincident with the center of mass, is given by

τ (v, φ, ξ) = M \dot{v} + C (v) v + D_{q} (v) v + G (φ)

(7)

where

• $τ (v, φ, ξ) \in R^{6}$ is the vector of control-actuator forces and moments acting on the rigid body,

• $M \in R^{6 \times 6}$ is the diagonal positive definite symmetric (PDS) inertia matrix,

• $C (v) \in R^{6 \times 6}$ is the Coriolis matrix,

• $D_{q} (v) \in R^{6 \times 6}$ is the quadratic hydrodynamic drag matrix, and

• $G (φ) \in R^{6}$ is the vector of buoyancy or gravitational forces and moments.

The body-frame angular velocity vector $v (t) \in R^{6}$ and the time derivative of the body-frame position and attitude vector $\dot{η} (t) \in R^{6}$ are related by the kinematic equation

\dot{η} (t) = J (η (t)) v (t)

(8)

where

J (η) \in R^{6 \times 6}

is the usual kinematic Jacobian relation between v(t) and

\dot{η} (t)

, which remains bounded for bounded pitch, as described in Section 2.2.1 of Fossen (1994).

Rearranging (7) to find the time derivative of the body velocity, $\dot{v} (t)$ , yields

\dot{v} = M^{- 1} (- C (v) v - D_{q} (v) v - G (φ) + τ (v, φ, ξ)) .

(9)

In the sequel, for convenience we will use the term “plant acceleration” to denote

\dot{v} (t)

, the time derivative of the body velocity defined in (9).

We assume the mass matrix is diagonal and can have different inertia quantities in each translational degree of freedom, that is, each diagonal entry of M can be unique. This is a common way to model hydrodynamic “added mass,” that is

M = diag ({[m_{11} m_{22} m_{33} m_{44} m_{55} m_{66}]}^{T})

(10)

= diag (θ_{m})

(11)

where

θ_{m} ≜ {[m_{11} m_{22} m_{33} m_{44} m_{55} m_{66}]}^{T} \in R^{6}

is the mass parameter vector. Because the mass parameters enter linearly into the vector-valued function

M \dot{v} \in R^{6}

, we can factor

M \dot{v}

such that

M \dot{v} = W_{m} (\dot{v}) θ_{m},

(12)

where

W_{m} (\dot{v}) \in R^{6 \times 6}

is the mass regressor matrix-valued function.

We parameterize the Coriolis matrix C(v) from M,

C (v) = [\begin{array}{c} 0 & - {(M_{11} ν)}^{\land} \\ - {(M_{11} ν)}^{\land} & - {(M_{22} ω)}^{\land} \end{array}]

(13)

where (·)^∧ is the skew-symmetric operator defined in (5),

M_{11} ≜ diag ({[m_{11} m_{22} m_{33}]}^{T})

and

M_{22} ≜ diag ({[m_{44} m_{55} m_{66}]}^{T})

. Equivalently, we can write

C (v) v = a d_{s e (3)} (v) M v

(14)

and because the mass parameters enter linearly into (14), we have the factorization

C (v) v = a d_{s e (3)} (v) W_{m} (v) θ_{m}

(15)

where

W_{m} (v) \in R^{6 \times 6}

is the same mass regressor matrix as in (12), with the velocity v instead of the plant acceleration

\dot{v}

as its argument, and θ_m is the same mass parameter vector.

We assume the quadratic drag matrix, D_q(v), is diagonal positive semidefinite such that

D_{q} (v) = diag (| v |) diag ({[d_{11} d_{22} d_{33} d_{44} d_{55} d_{66}]}^{T})

(16)

= diag (| v |) diag (θ_{d})

(17)

with

θ_{d} ≜ {[d_{11} d_{22} d_{33} d_{44} d_{55} d_{66}]}^{T} \in R^{6}

containing all of the drag parameters. We can again factor the linearly-entering parameters from (17), yielding

D_{q} (v) v = diag (| v |) diag (θ_{d}) v

(18)

= diag (| v |) diag (v) θ_{d}

(19)

= W_{d} (v) θ_{d}

(20)

where

W_{d} (v) \in R^{6 \times 6}

is the drag regressor matrix-valued function.

We define the function $G (φ) \in R^{6 \times 1}$ , which represents the gravitational forces and moments arising from net effective mass and righting moments,

G (φ) = [\begin{array}{c} R {(φ)}^{T} e_{3} & 0_{3 \times 3} \\ 0_{3 \times 1} & {(R^{T} (φ) e_{3})}^{\land} \end{array}] [\begin{array}{c} g \\ b \end{array}],

(21)

where

e_{3} = {[\begin{array}{c} 0 & 0 & 1 \end{array}]}^{T}

g = g_{c} (m - ρ \nabla) \in R

is the net effective buoyant force (in Newtons) with gravitational acceleration constant g_c, dry mass m, and displacement volume ∇, which displaces fluid with density ρ. The vector

b = g_{c} \bar{b} ρ \nabla \in R^{3}

represents the net righting torque (in Newton-meters) resulting from a center of buoyancy not co-incident with the center of mass, where

\bar{b} \in R^{3}

is the vector from the center of mass to the center of buoyancy. We note that this formulation assumes the origin of the body frame is at the center of mass of the UV. Factoring the linearly-entering parameters from (21), we have

G (φ) = W_{G} (φ) θ_{G},

(22)

where

W_{G} (φ) \in R^{6 \times 4}

is the gravitational regressor matrix-valued function and

θ_{G} = {[\begin{array}{c} g & b^{T} \end{array}]}^{T} \in R^{4}

is the gravitational parameter vector, which is composed of both the buoyancy and righting-moment parameters.

4.2.3 Actuator model

The vector $τ (v, φ, ξ) \in R^{6}$ represents the model of control-actuator forces and moments that are a result of body velocity v, attitude of the vehicle φ, and the control inputs $ξ \in R^{k + i}$ to the control actuators, such as fin angle and propeller angular velocity, with parameters $θ_{a} \in R^{a}$ . If the actuation parameters θ_a enter linearly into τ(v, φ, ξ), as is true for the UV actuator model described explicitly in Appendix A, then we may factor the control vector such that

τ (v, φ, ξ) = W_{a} (v, φ, ξ) θ_{a},

(23)

where

W_{a} (v, φ, ξ) \in R^{6 \times a}

is the (usually) nonlinear actuator regressor matrix and

θ_{a} \in R^{a}

is the parameter vector that contains the minimum number of parameters required to fully characterize the control actuators.

The AID in this study is applicable to actuator models of the general form (23), with linearly-entering parameters and the additional requirement that W_a(v, φ, ξ) be uniformly continuous in v, φ, ξ and bounded for bounded v, φ, ξ.

4.3. Combined parameter vector

4.3.1. Plant parameters

In summary, our goal is to identify the following plant parameters:

• Mass parameters:

θ_{m} = {[m_{11} m_{22} m_{33} m_{44} m_{55} m_{66}]}^{T} \in R^{6}

(24)

• Drag parameters:

θ_{d} = {[d_{11} d_{22} d_{33} d_{44} d_{55} d_{66}]}^{T} \in R^{6}

(25)

• Buoyant-force and righting moment parameters:

θ_{G} = {[g b_{1} b_{2} b_{3}]}^{T} \in R^{4}

(26)

For convenience, we define the combined plant parameter vector

θ_{p} ≜ {[\begin{array}{c} θ_{m}^{T} & θ_{d}^{T} & θ_{G}^{T} \end{array}]}^{T} \in R^{p} .

(27)

4.3.2. Control-actuator parameters

Our goal is to identify the following fin and thruster actuator parameters, described in greater detail in Appendix A. For convenience, we define the control-actuator parameter vector

θ_{a} ≜ {[\begin{array}{c} θ_{a_{1}} & \dots & θ_{a_{a}} \end{array}]}^{T} \in R^{a} .

(28)

4.3.3. Plant and actuator parameter vector

We define the full plant and actuator parameter vector as

θ ≜ {[\begin{array}{c} θ_{p}^{T} & θ_{a}^{T} \end{array}]}^{T} \in R^{(p + a)} .

(29)

For our specific parameterization of the Iver3 AUV used for experimental validation, the dimensionality of θ is $R^{(p + a)} = R^{(16 + 6)} = R^{22}$ . The details regarding the a = 6 actuator parameters for the Iver3 AUV are described in Appendix A. However, in the sequel we continue to use $θ \in R^{(p + a)}$ where possible for generality.

4.4. Regressor formulation of plant and actuator dynamics

The equations of motion (7) can be written in regressor form by utilizing (12), (15), (20), (22), and (23), which yields

\begin{array}{l} W_{a} (v, φ, ξ) θ_{a} = [W_{m} (\dot{v}) + a d_{s e (3)} (v) W_{m} (v)] θ_{m} \\ + W_{d} (v) θ_{d} + W_{G} (φ) θ_{G} \end{array}

(30)

\begin{array}{c} W_{a} (v, φ, ξ) θ_{a} = W_{p} (\dot{v}, v, φ) θ_{p} \end{array}

(31)

where the plant regressor

W_{p} (\dot{v}, v, φ) \in R^{6 \times p}

is defined as

W_{p} (\dot{v}, v, φ) ≜ {[\begin{array}{c} {(W_{m} (\dot{v}) + a d_{s e (3)} (v) W_{m} (v))}^{T} \\ W_{d} {(v)}^{T} \\ W_{G} {(φ)}^{T} \end{array}]}^{T} .

(32)

Rearranging (31) to combine the plant and control-actuator dynamics yields

\begin{array}{c} 0 = [W_{p} (\dot{v}, v, φ), - W_{a} (v, φ, ξ)] [\begin{array}{c} θ_{p} \\ θ_{a} \end{array}] \end{array}

(33)

= W (\dot{v}, v, φ, ξ) θ

(34)

where

W (\dot{v}, v, φ, ξ) \in R^{6 \times (p + a)}

is the (known) time-varying combined plant-and-actuator regressor matrix-valued function, defined as

\begin{array}{c} W (\dot{v}, v, φ, ξ) ≜ [W_{p} (\dot{v}, v, φ), - W_{a} (v, φ, ξ)] \end{array}

(35)

= {[\begin{array}{c} {(W_{m} (\dot{v}) + a d_{s e (3)} (v) W_{m} (v))}^{T} \\ W_{d} {(v)}^{T} \\ W_{G} {(φ)}^{T} \\ - W_{a} {(v, φ, ξ)}^{T} \end{array}]}^{T} .

(36)

4.5. Regressor nullspace properties

4.5.1. Non-uniqueness of the parameter vector

To the best of our knowledge, previous studies of UV model identification, with the exception of Harris et al. (2018) and Paine (2018), assume that the actuator parameters are known. When the actuator parameters are known and the plant model is minimally parameterized, the true plant parameters represent a single, unique point in the linear vector space of possible plant parameters.

In contrast, the AID addressed herein identifies plant and actuator parameters simultaneously. From (34), we note that $θ \in R^{(p + a)}$ is in the nullspace of the combined regressor $W (\dot{v}, v, φ, ξ)$ for an evolution of $\dot{v} (t), v (t), φ (t), ξ (t)$ that satisfies the plant equation of motion (7).

A consequence of this nullspace structure is that many different parameter vectors satisfy the same equations of motion, even when the model is minimally parameterized. As an example, any nonzero scalar multiple of θ equivalently satisfies (34) and thus also satisfies (7).

Given a “true” parameter vector $θ \in R^{p + a}$ and all possible bounded evolutions of $\dot{v}, v, φ, ξ$ that satisfy the plant equation of motion (7), and thus also satisfy the nullspace relationship (34) for this θ, there is then also a set $P (θ) \subset R^{(p + a)}$ of all nonzero parameter vectors θ* that equivalently satisfy (7) for the same evolutions of $\dot{v}, v, φ, ξ$ and (34) for the regressor matrix-valued function $W (\dot{v}, v, φ, ξ)$ of these arguments. This set P(θ) is the “persistent nullspace” of the regressor, defined as

\begin{array}{l} P (θ) ≜ {θ^{*} : θ^{*} \neq 0 and \forall t, \dot{v}, v, φ, ξ s . t . \\ W (\dot{v}, v, φ, ξ) θ = 0, W (\dot{v}, v, φ, ξ) θ^{*} = 0} \end{array}

(37)

Thus, the “true” parameter vector is not only a unique single vector in the parameter space

R^{p + a}

, but any element of the set P(θ). Note that the set P(θ) does not contain the origin, and that the set P(θ) ∪ 0 is a linear vector subspace of

R^{(p + a)}

In Section 4.5.2, we explore the identical input-output plant dynamics that arise from different parameter vectors in P(θ). These properties will be utilized in Section 5, where we report a novel adaptive identifier for θ and show that under a persistence of excitation (PE) condition, the estimate of θ converges to the true parameter set P(θ).

4.5.2. Acceleration input-output dynamics

We now prove our earlier claim that the input-output dynamics of (7), namely the resulting plant acceleration given control inputs and parameters, are the same for any parameter vector belonging to the true parameter set P(θ).

From the plant acceleration (9), after making the appropriate regressor substitutions (15), (20), and (22), as well as writing the true mass M as a function of the parameter vector θ ∈ P(θ) from which it arises, we obtain

\begin{array}{l} \dot{v} = M {(θ)}^{- 1} (- a d_{s e (3)} (v) W_{m} (v) θ_{m} - W_{d} (v) θ_{d} \\ - W_{G} (φ) θ_{G} + W_{a} (v, φ, ξ) θ_{a}) . \end{array}

(38)

Observing the structure of (38), we define $W_{\dot{v}} (v, φ, ξ) \in R^{6 \times (p + a)}$ as

W_{\dot{v}} (v, q, ξ) ≜ {[\begin{array}{c} - a d_{s e (3)} (v) W_{m} {(v)}^{T} \\ - W_{d} {(v)}^{T} \\ - W_{G} {(φ)}^{T} \\ W_{a} {(v, φ, ξ)}^{T} \end{array}]}^{T} \in R^{6 \times (p + a)}

(39)

= [W_{m} (\dot{v}) 0_{6 \times (p + a - 6)}] - W (\dot{v}, v, φ, ξ),

(40)

which allows us to write the plant acceleration

\dot{v}

(38) as a function of the underlying parameters θ:

\dot{v} = \dot{v} (θ) ≜ M {(θ)}^{- 1} W_{\dot{v}} (v, φ, ξ) θ .

(41)

Now we choose another vector θ* ∈ P(θ). By definition of P(θ) (37), we have the nullspace equation

W (\dot{v}, v, φ, ξ) θ^{*} = 0,

(42)

which, substituting (40), results in

\begin{array}{c} ([W_{m} (\dot{v}), 0_{n \times (p + a - 6)}] - W_{\dot{v}} (v, φ, ξ)) θ^{*} = 0 \end{array}

(43)

\begin{array}{c} M (θ^{*}) \dot{v} - W_{\dot{v}} (v, φ, ξ) θ^{*} = 0 \end{array}

(44)

With further rearranging, we have

\begin{array}{c} M (θ^{*}) \dot{v} = W_{\dot{v}} (v, φ, ξ) θ^{*} \end{array}

(45)

\begin{array}{c} \dot{v} = M {(θ^{*})}^{- 1} W_{\dot{v}} (v, φ, ξ) θ^{*} \end{array}

(46)

and thus obtain an equivalent expression to (9) for the plant acceleration

\dot{v}

, where (46) can also be written as a function of its parameter vector

\begin{array}{c} \dot{v} (θ^{*}) ≜ M {(θ^{*})}^{- 1} W_{\dot{v}} (v, φ, ξ) θ^{*} \end{array}

(47)

= \dot{v}

(48)

= \dot{v} (θ)

(49)

which verifies our claim in Section 4.5.1 that any parameter vector θ ∈ P(θ) yields the same input–output behavior.

It is important to note that a nonzero parameter vector $\tilde{θ} \notin P (θ)$ by definition does not satisfy $W (\dot{v} (θ), v, φ, ξ) \tilde{θ} = 0 \forall t$ and over all $\dot{v} (θ), v, φ, ξ$ that satisfy (7, 34) for the underlying true parameter vector θ. However, given $\tilde{θ} \notin P (θ)$ we can define a corresponding acceleration signal arising from these parameters

\dot{v} (\tilde{θ}) ≜ M {(\tilde{θ})}^{- 1} W_{\dot{v}} (v, φ, ξ) \tilde{θ} .

(50)

The term $\dot{v} (\tilde{θ})$ , (50), represents the time derivative of the body velocity given the same velocity v, attitude φ, and control inputs ξ as before, but for the different set of parameters $\tilde{θ}$ .

We can obtain an analogous nullspace relationship to (34) for $\dot{v} (\tilde{θ})$ by rearranging (50) and substituting the regressor form of the inertia term defined in (12), which yields

\begin{array}{c} M (\tilde{θ}) \dot{v} (\tilde{θ}) = W_{\dot{v}} (v, φ, ξ) \tilde{θ} \end{array}

(51)

\begin{array}{c} W_{m} (\dot{v} (\tilde{θ})) {\tilde{θ}}_{m} = W_{\dot{v}} (v, φ, ξ) \tilde{θ}, \end{array}

(52)

and applying (40), we obtain

\begin{array}{c} W_{m} (\dot{v} (\tilde{θ})) {\tilde{θ}}_{m} - W_{\dot{v}} (v, φ, ξ) \tilde{θ} = 0 \end{array}

(53)

\begin{array}{c} [[W_{m} (\dot{v} (\tilde{θ})) 0_{6 \times (p + a - 6)}] - W_{\dot{v}} (v, φ, ξ)] \tilde{θ} = 0 \end{array}

(54)

\begin{array}{c} W (\dot{v} (\tilde{θ}), v, φ, ξ) \tilde{θ} = 0 . \end{array}

(55)

In other words, when the plant acceleration in the regressor matrix (34) is replaced by an acceleration resulting from a different parameter class,

\dot{v} (\tilde{θ})

, we have an analogous nullspace relationship

\tilde{θ} \in null (W (\dot{v} (\tilde{θ}), v, φ, ξ))

, which holds ∀t and over all

\dot{v} (\tilde{θ}), v, φ, ξ

that satisfy (7,34) given a different parameter class

P (\tilde{θ})

. We utilize this fact in the design of the adaptive identifier reported in Section 5.

5. Adaptive identifier

This section reports the theoretical derivation and stability analysis of a new adaptive identification (AID) algorithm for plants of the form (7). The AID reported herein significantly expands upon and extends a previous approach for adaptive parameter estimation of UV plant models by McFarland and Whitcomb (McFarland and Whitcomb, 2013, 2021). The specific differences are:

(1) The AID reported herein performs simultaneous identification of plant and actuator model parameters. In contrast, the AID reported in McFarland and Whitcomb (McFarland and Whitcomb, 2013, 2021) identifies only plant parameters.

(2) This study provides a mathematical proof of stability and of parameter-estimate convergence. In contrast, McFarland and Whitcomb (McFarland and Whitcomb, 2013, 2021) provide only a stability proof.

Problem Statement: The goal of this adaptive identification algorithm is to estimate the parameters θ (29) for plants and actuators of the form (7) or, equivalently, of the form (34), using only the available signals of vehicle attitude φ(t), linear and angular body velocity v(t), and the control input ξ(t).

Error Coordinates: We define the velocity error $Δ v (t) \in R^{6}$ as the difference between the velocity $\hat{v} (t) \in R^{6}$ of an identification plant, whose dynamics are defined in (63), and the actual vehicle velocity, $v (t) \in R^{6}$ , whose dynamics are defined in (9) (and equivalently in (41)), as

Δ v (t) ≜ \hat{v} (t) - v (t) .

(56)

We define the vector of parameter estimates as

\hat{θ} (t) \in R^{p + a}

and choose a constant parameter vector

θ^{*} \in R^{p + a}

, θ* ∈ P(θ), from the set P(θ) of “true” parameters defined in (37), which results in the error coordinates

Δ θ (t) ≜ \hat{θ} (t) - θ^{*} .

(57)

Task: Our task is to design update laws $\dot{\hat{v}}$ , $\dot{\hat{θ}}$ for the identification plant and estimated parameters, respectively, such that all signals remain bounded and

\lim_{t \to \infty} Δ v (t) = 0,

(58)

\lim_{t \to \infty} \hat{θ} \in P (θ) .

(59)

Assumptions: We make the following assumptions.

• v(t), η(t), ξ(t) are bounded.

• $W (\dot{v} (θ^{*}), v, φ, ξ)$ (35) is locally uniformly continuous in $\dot{v} (θ^{*}), v, φ, ξ$ and a bounded function of these variables.

• As a result, $W_{\dot{v}} (v, φ, ξ)$ (39) is locally uniformly continuous in v, φ, ξ and a bounded function of these variables.

• The control inputs ξ are uniformly continuous in time and bounded.

• The true parameters θ* are bounded and constant.

• $\hat{v} (t_{0}) = v (t_{0})$ , that is Δv(t₀) = 0

• $\exists ϵ \in R^{+}$ such that $T {(t_{0})}^{1 / 2} + ϵ \leq λ_{\min} (M (θ^{*}))$ , where

T (t_{0}) ≜ λ_{\max} (Γ) Δ θ {(t_{0})}^{T} Γ^{- 1} Δ θ (t_{0})

(60)

and

Γ \in R^{(p + a) \times (p + a)}

is a parameter adaptation gain matrix which will be defined below.

In the remainder of this section, we present a novel AID algorithm and show that it achieves (58, 59) with all signals bounded. Section 5.1 presents the update laws. Section 5.2 reports a Lyapunov analysis to prove uniform stability with respect to the origin of the error system, which we denote as

S^{*} = {Δ v = 0, Δ θ = 0},

(61)

convergence of Δv, and boundedness of all signals. Note that

Δ θ = 0 \Rightarrow \hat{θ} = θ^{*} \in P (θ)

, which means S* is a subset of

S = {Δ v = 0, \hat{θ} \in P (θ)} .

(62)

In Section 5.3, we present a novel proof of convergence to the larger set S under a persistence of excitation (PE) condition.

5.1. Update laws and error dynamics

We choose the update laws

\begin{array}{c} \dot{\hat{v}} = - A Δ v + \dot{v} (\hat{θ}) \end{array}

(63)

\begin{array}{c} \dot{\hat{θ}} & = Γ W {(\dot{v} (\hat{θ}), v, φ, ξ)}^{T} Δ v \end{array}

(64)

where (63) defines the adaptive identification plant dynamics, (64) defines the adaptive parameter update law,

A \in R^{6 \times 6}

(65)

is a diagonal positive definite symmetric (PDS) gain matrix for the identification plant,

Γ \in R^{(p + a) \times (p + a)}

(66)

is a diagonal PDS gain matrix for the parameter estimates, and

\dot{v} (\hat{θ})

is the time derivative of the body velocity arising from the parameter estimate

\hat{θ}

instead of the true parameters θ*, with properties as described in Section 4.5, defined as

\dot{v} (\hat{θ}) ≜ M (\hat{θ}) W_{\dot{v}} (v, φ, ξ) \hat{θ} .

(67)

Similarly to our usage of the term “plant acceleration” to denote $\dot{v} (θ^{*})$ defined in (9), as noted in Section 4.2.2, in the sequel, for convenience we will use the term “identification plant acceleration” to denote $\dot{v} (\hat{θ})$ defined in (67).

The rate of change of the velocity error is

\begin{array}{c} Δ \dot{v} = \dot{\hat{v}} - \dot{v} \end{array}

(68)

\begin{array}{c} Δ \dot{v} = - A Δ v + \dot{v} (\hat{θ}) - \dot{v} (θ^{*}) \end{array}

(69)

\begin{array}{c} Δ \dot{v} = - A Δ v + M {(θ^{*})}^{- 1} M (θ^{*}) \dot{v} (\hat{θ}) - \dot{v} (θ^{*}) \end{array}

(70)

Because M is parameterized by θ*, we can rearrange and use the definitions (12, 35, 47) to find

\begin{array}{c} Δ \dot{v} & = - A Δ v \\ + M {(θ^{*})}^{- 1} [\begin{array}{c} W_{m} (\dot{v} (\hat{θ})) & 0_{6 \times (p + a - 6)} \end{array}] θ^{*} \end{array}

(71)

- M {(θ^{*})}^{- 1} W_{\dot{v}} (v, φ, ξ) θ^{*}

(72)

\begin{array}{c} Δ \dot{v} = - A Δ v + M {(θ^{*})}^{- 1} W (\dot{v} (\hat{θ}), v, φ, ξ) θ^{*} . \end{array}

(73)

Using the fact that $\hat{θ} \in null (W (\dot{v} (\hat{θ}), v, φ, ξ))$ from Section 4.5.2 and the error coordinate definition (57), we can rearrange the second term in (73) to find

\begin{array}{c} M {(θ^{*})}^{- 1} & W (\dot{v} (\hat{θ}), v, φ, ξ) θ^{*} \end{array}

(74)

= M {(θ^{*})}^{- 1} W (\dot{v} (\hat{θ}), v, φ, ξ) (θ^{*} - \hat{θ})

(75)

= - M {(θ^{*})}^{- 1} W (\dot{v} (\hat{θ}), v, φ, ξ) Δ θ .

(76)

Substituting (76) into (73) yields

Δ \dot{v} = - A Δ v - M {(θ^{*})}^{- 1} W (\dot{v} (\hat{θ}), v, φ, ξ) Δ θ .

(77)

The parameter error dynamics follow directly from the error coordinates (57), the parameter update law (64), and the assumption that the true parameters are constant, that is, ${\dot{θ}}^{*} = 0$ , thus the parameter error dynamics can be written

Δ \dot{θ} = Γ W^{T} (\dot{v} (\hat{θ}), v, φ, ξ) Δ v .

(78)

By defining the full error state vector for the AID as

z (t) = [\begin{array}{c} Δ v (t) \\ Δ θ (t) \end{array}],

(79)

we can construct a representation of this error system using the velocity error dynamics (77) and parameter error dynamics (78):

\begin{array}{c} \dot{z} (t) = [\begin{array}{c} M^{- 1} & 0_{6 \times (p + a)} \\ 0_{(p + a) \times 6} & Γ \end{array}] [\begin{array}{c} - M A & - W (t, \hat{θ}) \\ W {(t, \hat{θ})}^{T} & 0_{(p + a) \times (p + a)} \end{array}] z (t) \end{array}

(80)

\begin{array}{c} = (\underset{E}{\underset{⏟}{[\begin{array}{c} M^{- 1} & 0_{6 \times (p + a)} \\ 0_{(p + a) \times 6} & Γ \end{array}]}} \underset{F (t)}{\underset{⏟}{[\begin{array}{c} 0_{6 \times 6} & - W (t, \hat{θ}) \\ W {(t, \hat{θ})}^{T} & 0_{(p + a) \times (p + a)} \end{array}]}} \\ + \underset{G}{\underset{⏟}{[\begin{array}{c} - A & 0_{6 \times (p + a)} \\ 0_{(p + a) \times 6} & 0_{(p + a) \times (p + a)} \end{array}]}}) z (t) \end{array}

(81)

where we adopt the notation

\begin{array}{c} W (t, \hat{θ}) = W (\dot{v} (\hat{θ}), v, φ, ξ) \end{array}

(82)

\begin{array}{c} M = M (θ^{*}) \end{array}

(83)

for readability. Note that the matrix

E = E^{T} = [\begin{array}{c} M^{- 1} & 0_{6 \times (p + a)} \\ 0_{(p + a) \times 6} & Γ \end{array}]

(84)

is block diagonal PDS,

F (t) = - F {(t)}^{T} = [\begin{array}{c} 0_{6 \times 6} & - W (t, \hat{θ}) \\ W {(t, \hat{θ})}^{T} & 0_{(p + a) \times (p + a)} \end{array}]

(85)

is skew-symmetric, and

G = G^{T} = [\begin{array}{c} - A & 0_{6 \times (p + a)} \\ 0_{(p + a) \times 6} & 0_{(p + a) \times (p + a)} \end{array}]

(86)

is block diagonal and negative semidefinite.

5.2 Convergence of the velocity error

Theorem 1

The system (80)

\dot{z} (t) = [\begin{array}{c} M^{- 1} & 0 \\ 0 & Γ \end{array}] [\begin{array}{c} - M A & - W (t, \hat{θ}) \\ W {(t, \hat{θ})}^{T} & 0 \end{array}] z (t)

where z(t), M, Γ,

A

W (t, \hat{θ})

are defined in (79, 11, 66, 65, 82) is locally uniformly stable about the set S* (61), all signals remain bounded, and lim_t→∞Δv = 0.

Proof. Consider the following Lyapunov function candidate:

V (t) = \frac{1}{2} z {(t)}^{T} E^{- 1} z (t) .

(87)

Note that $V (t) \in C^{1}$ and is radially unbounded in z(t), positive definite since E⁻¹ is PDS, and equal to zero if and only if z(t) = 0. The time derivative of (87) is

\begin{array}{c} \dot{V} (t) & = \frac{1}{2} z {(t)}^{T} {(E F (t) + G)}^{T} E^{- 1} z (t) \\ + \frac{1}{2} z {(t)}^{T} E^{- 1} (E F (t) + G) z (t) . \end{array}

(88)

E = E^T and G = G^T, so

\dot{V} (t) = \frac{1}{2} z^{T} ({F (t)}^{T} + G E^{- 1}) z + \frac{1}{2} z^{T} (F (t) + E^{- 1} G) z,

(89)

and since F(t) = −F(t)^T and E⁻¹ and G commute, we have

\begin{array}{c} \dot{V} (t) = z^{T} E^{- 1} G z \end{array}

(90)

\begin{array}{c} \dot{V} (t) = z^{T} [\begin{array}{c} - M A & 0_{6 \times (p + a)} \\ 0_{(p + a) \times 6} & 0_{(p + a) \times (p + a)} \end{array}] z \leq 0 . \end{array}

(91)

This expression is negative definite in Δv and negative semidefinite in Δθ, satisfying the requirements on a Lyapunov function to show that the system (81) is uniformly stable about the set S* and that Δv and Δθ are bounded. We note that with Δθ bounded and θ* constant, it is implied that $\hat{θ}$ is bounded.

To show that $Δ \dot{v}$ (77) is bounded, it remains to be shown that $M {(\hat{θ})}^{- 1}$ is bounded. We adapt the proof in McFarland and Whitcomb (2013). Consider that ∀t > t₀, the following hold:

• Δθ(t)^TΓ⁻¹θ(t) ≥ 0 from PDS of Γ

• 0 ≤ V(t) ≤ V(t₀) from (91)

• Δv(t₀) = 0 by assumption

These facts can be used to show

\begin{array}{c} \frac{1}{λ_{\max} (Γ)} {‖ Δ θ_{m} (t) ‖}_{2}^{2} & \leq Δ θ {(t)}^{T} Γ^{- 1} Δ θ (t) \end{array}

(92)

\begin{array}{c} \leq 2 V (t) \end{array}

(93)

\begin{array}{c} \leq 2 V (t_{0}) \end{array}

(94)

\begin{array}{c} \leq Δ θ {(t_{0})}^{T} Γ^{- 1} Δ θ (t_{0}) . \end{array}

(95)

This implies

{‖ Δ θ_{m} (t) ‖}_{2}^{2} \leq λ_{\max} (Γ) Δ θ {(t_{0})}^{T} Γ^{- 1} Δ θ (t_{0})

(96)

and since, by assumption (60),

{[λ_{\max} (Γ) Δ θ {(t_{0})}^{T} Γ^{- 1} Δ θ (t_{0})]}^{\frac{1}{2}} \leq λ_{\min} (M (θ^{*})) - ϵ,

(97)

we can state

{‖ Δ θ_{m} (t) ‖}_{2}^{2} \leq λ_{\min} (M (θ^{*})) - ϵ .

(98)

Using the Rayleigh-Ritz theorem, we will show that the smallest eigenvalue of $M (\hat{θ})$ is bounded away from zero as follows:

λ_{\min} (M (\hat{θ})) = \min_{‖ x ‖ = 1} (x^{T} M (\hat{θ}) x)

(99)

= \min_{‖ x ‖ = 1} (x^{T} M (θ^{*}) x + x^{T} M (Δ θ) x)

(100)

\geq \min_{‖ x ‖ = 1} (x^{T} M (θ) x) - \max_{‖ x ‖ = 1} | x^{T} M (Δ θ) x | .

(101)

The second term in (101) is equal to the maximum magnitude of the eigenvalues of M(Δθ). Since M(Δθ) is symmetric, this is equivalent to its maximum singular value or ‖M(Δθ)‖₂. Furthermore, ‖M(Δθ)‖₂ ≤ ‖M(Δθ)‖_F, and for diagonal mass matrices, ‖M(Δθ)‖_F = ‖Δθ_m‖₂ by definition. Thus, we have

\begin{array}{c} λ_{\min} (M (\hat{θ})) & \geq λ_{\min} (M (θ^{*})) - {‖ M (Δ θ) ‖}_{2} \end{array}

(102)

\begin{array}{c} \geq λ_{\min} (M (θ^{*})) - {‖ M (Δ θ) ‖}_{F} \end{array}

(103)

\begin{array}{c} \geq λ_{\min} (M (θ^{*})) - {‖ Δ θ_{m} ‖}_{2} . \end{array}

(104)

Substituting (98), we have

λ_{\min} (M (\hat{θ})) \geq ϵ,

(105)

thus

M {(\hat{θ})}^{- 1}

exists and is bounded.

All signals in the right-hand side of (77) are thus shown to be bounded. This implies the time-derivative of the velocity error, $Δ \dot{v}$ , is bounded. We note that the plant acceleration $\dot{v}$ is also bounded, since all of its components are bounded (41). Since both $\dot{v}$ and $Δ \dot{v}$ are bounded, the velocity update law $\dot{\hat{v}}$ is also bounded (68), and from bounded $\dot{\hat{v}}$ , Δv we have bounded $\dot{v} (\hat{θ})$ (63).

From (79), (87), and (91) it can easily be shown that $Δ v \in L_{2} \cap L_{\infty}$ , which together with bounded $Δ \dot{v}$ implies that lim_t→∞Δv = 0. Thus we have shown that this AID achieves (58). Regarding the parameter estimates, however, we can only conclude that Δθ is bounded, that the update law $\dot{\hat{θ}}$ (64) is bounded, and that $\lim_{t \to \infty} Δ \dot{θ} = \lim_{t \to \infty} \dot{\hat{θ}} = 0$ (78). Additional arguments and conditions are needed to show mathematically the convergence of $\hat{θ}$ to the set P(θ).

5.3. Convergence of the parameter estimate

Section 5.2 showed that lim_t→∞Δv(t) = 0 and $\lim_{t \to \infty} \dot{\hat{θ}} (t) = 0$ . This argument by itself does not show that the parameter estimate converges to the true parameter set P(θ), defined in (37). Moreover, because the set of true parameters is not defined by a single point in parameter space but rather by the set P(θ), the goal is not to show convergence of $\hat{θ} (t)$ to a single point (i.e., the goal is not to show that lim_t→∞Δθ(t) = 0). This Section will state conditions for and show the convergence of $\hat{θ} (t)$ to P(θ) (59).

The remainder of this Section is organized as follows:

• Section 5.3.1 introduces a change of coordinates to characterize $\hat{θ} (t)$ as the sum of two components residing in orthogonal subspaces related to P(θ).

• Section 5.3.2, then defines an equivalent error system using these coordinates.

• Finally, Section 5.3.3 shows a sufficient condition for $\hat{θ} (t)$ to converge to the true parameter space P(θ), that is, $\lim_{t \to \infty} \hat{θ} (t) \in P (θ)$ .

We note that the problem of parameter convergence with respect to a unique set of true parameters is well understood for a class of linear time-varying systems, as originally reported in Morgan and Narendra (1977). To the best of our knowledge, however, we are unaware of any previously reported convergence proof applicable to the nonlinear system (80) with nullspace parameter structure.

5.3.1. Coordinate transformation

Given the set P(θ) (37), we can define the linear vector subspace P_s = P(θ) ∪ 0, with dim(P_s) = r, and we can define an orthonormal basis {p₁, …, p_r} for P_s, as well as the matrix

\bar{P} ≜ [p_{1}, \dots, p_{r}] \in R^{(p + a) \times r}

(106)

whose rank is r, with properties

\begin{array}{c} {\bar{P}}^{T} \bar{P} & = I_{r \times r} \end{array}

(107)

\begin{array}{c} W (\dot{v} (θ^{*}), v, φ, ξ) \bar{P} & = 0_{6 \times r} . \end{array}

(108)

Similarly, we can define the linear vector subspace P_⊥ as the orthogonal complement of P_s, with orthonormal basis {q₁, …, q_(p+a)−r}, as well as the matrix

{\bar{P}}_{⊥} ≜ [q_{1}, \dots, q_{(p + a) - r}] \in R^{(p + a) \times [(p + a) - r]}

(109)

whose rank is (p + a) − r, with properties

\begin{array}{c} {\bar{P}}_{⊥}^{T} {\bar{P}}_{⊥} & = I_{[(p + a) - r] \times [(p + a) - r]} \end{array}

(110)

\begin{array}{c} {\bar{P}}_{⊥}^{T} \bar{P} & = 0_{[(p + a) - r] \times r} . \end{array}

(111)

We note that $R^{(p + a)}$ is the direct sum of the orthogonal subspaces P_⊥ and P_s, that is, $R^{(p + a)} = P_{⊥} \oplus P_{s}$ .

We combine $\bar{P}$ and ${\bar{P}}_{⊥}$ into the orthogonal matrix

Q ≜ [\begin{array}{c} {\bar{P}}_{⊥} & \bar{P} \end{array}] \in R^{(p + a) \times (p + a)}

(112)

Using Q, we can perform a change of coordinates on the parameter estimate $\hat{θ}$ so that

\hat{θ} = Q \bar{θ}

(113)

\dot{\hat{θ}} = Q \dot{\bar{θ}},

(114)

where

\bar{θ} \in R^{(p + a)}

can be decomposed into

{\bar{θ}}_{P_{⊥}} \in R^{(p + a) - r}

and

{\bar{θ}}_{P} \in R^{r}

, the coordinates of

\hat{θ}

in P_⊥ and P, respectively:

\bar{θ} = [\begin{array}{c} {\bar{θ}}_{P_{⊥}} \\ {\bar{θ}}_{P} \end{array}] .

(115)

5.3.2. Equivalent error dynamics

We can obtain an expression equivalent to (73) for $Δ \dot{v}$ by factoring out $M (\hat{θ})$ instead of M(θ*) and substituting (67) for $\dot{v} (\hat{θ})$ in the derivation (68–73):

\begin{array}{c} Δ \dot{v} = \dot{\hat{v}} - \dot{v} \end{array}

(116)

\begin{array}{c} = - A Δ v + \dot{v} (\hat{θ}) - \dot{v} (θ^{*}) \end{array}

(117)

\begin{array}{c} = - A Δ v + \dot{v} (\hat{θ}) - M {(\hat{θ})}^{- 1} M (\hat{θ}) \dot{v} (θ^{*}) \end{array}

(118)

\begin{array}{c} = - A Δ v + M {(\hat{θ})}^{- 1} W_{\dot{v}} (v, φ, ξ) \hat{θ} \\ - M^{- 1} (\hat{θ}) [\begin{array}{c} W_{m} (\dot{v} (θ^{*})) & 0_{6 \times ((p + a) - 6)} \end{array}] \hat{θ} \end{array}

(119)

\begin{array}{c} = - A Δ v - M {(\hat{θ})}^{- 1} ([\begin{array}{c} W_{m} (\dot{v} (θ^{*})) & 0_{6 \times ((p + a) - 6)} \end{array}] \\ - W_{\dot{v}} (v, φ, ξ)) \hat{θ} \end{array}

(120)

\begin{array}{c} = - A Δ v - M {(\hat{θ})}^{- 1} W (\dot{v} (θ^{*}), v, φ, ξ) \hat{θ} . \end{array}

(121)

Using the coordinates ${\bar{θ}}_{P_{⊥}}, {\bar{θ}}_{P}$ (113,115) for $\hat{θ}$ and the property (108), we have

\begin{array}{c} Δ \dot{v} = - A Δ v - M {(\hat{θ})}^{- 1} W (\dot{v} (θ^{*}), v, φ, ξ) Q \bar{θ} \end{array}

(122)

\begin{array}{c} = - A Δ v \\ - M {(\hat{θ})}^{- 1} W (\dot{v} (θ^{*}), v, φ, ξ) ({\bar{P}}_{⊥} {\bar{θ}}_{P_{⊥}} + \bar{P} {\bar{θ}}_{P}) \end{array}

(123)

\begin{array}{c} = - A Δ v - M {(\hat{θ})}^{- 1} W (\dot{v} (θ^{*}), v, φ, ξ) {\bar{P}}_{⊥} {\bar{θ}}_{P_{⊥}}, \end{array}

(124)

and defining the term

W_{⊥} (t) ≜ M^{- 1} (\hat{θ} (t)) W (\dot{v} (θ^{*}, t), v (t), φ (t), ξ (t)) {\bar{P}}_{⊥},

(125)

we can rewrite (124) as

Δ \dot{v} = - A Δ v - W_{⊥} {\bar{θ}}_{P_{⊥}} .

(126)

We also have the derivative of $\bar{θ}$ , given by (114)

\begin{array}{c} \dot{\bar{θ}} & = Q^{T} \dot{\hat{θ}} \end{array}

(127)

\begin{array}{c} [\begin{array}{c} {\dot{\bar{θ}}}_{P_{⊥}} \\ {\dot{\bar{θ}}}_{P} \end{array}] & = [\begin{array}{c} {\bar{P}}_{⊥}^{T} \dot{\hat{θ}} \\ {\bar{P}}^{T} \dot{\hat{θ}} \end{array}] \end{array}

(128)

From (64, 126), this results in the full transformed system

[\begin{array}{c} Δ \dot{v} \\ {\dot{\bar{θ}}}_{P_{⊥}} \\ {\dot{\bar{θ}}}_{P} \end{array}] = [\begin{array}{c} - A & - W_{⊥} & 0 \\ {\bar{P}}_{⊥}^{T} Γ W {(t, \hat{θ})}^{T} & 0 & 0 \\ {\bar{P}}^{T} Γ W {(t, \hat{θ})}^{T} & 0 & 0 \end{array}] [\begin{array}{c} Δ v \\ {\bar{θ}}_{P_{⊥}} \\ {\bar{θ}}_{P} \end{array}]

(129)

in which we have partially isolated the velocity error dynamics

Δ \dot{v}

from the coordinates

{\bar{θ}}_{P}

\hat{θ}

that already belong to the true parameter set P(θ), although a nonlinear state dependence remains in W_⊥ through

\hat{θ}

(125).

5.3.3. Convergence of $Δ \dot{v} (t)$

Before showing a sufficient condition for $\lim_{t \to \infty} \hat{θ} \in P (θ)$ , we state and prove the following useful lemma.

Lemma 1

Given the system described in Theorem 1 and the transformed system (129), then

\lim_{t \to \infty} W_{⊥} (t) {\bar{θ}}_{P_{⊥}} (t) = 0 .

(130)

Proof. One conclusion of Theorem 1 is that lim_t→∞Δv(t) = 0. We apply this result to the transformed system (129) to find

\begin{array}{c} \lim_{t \to \infty} Δ \dot{v} (t) = \lim_{t \to \infty} (- A Δ v (t) - W_{⊥} (t) {\bar{θ}}_{P_{⊥}} (t)) \end{array}

(131)

\begin{array}{c} = \lim_{t \to \infty} - W_{⊥} (t) {\bar{θ}}_{P_{⊥}} (t) . \end{array}

(132)

By Barbalat’s Lemma (Appendix B.1), if lim_t→∞Δv(t) is finite and $Δ \dot{v} (t)$ is uniformly continuous (UC) with respect to time, then $\lim_{t \to \infty} Δ \dot{v} (t) = 0$ . We show the details regarding UC of $Δ \dot{v}$ in Appendix B.2. With the conditions of Barbalat’s Lemma satisfied for each element in the vector Δv(t), we conclude that

\lim_{t \to \infty} Δ \dot{v} (t) = \lim_{t \to \infty} - W_{⊥} (t) {\bar{θ}}_{P_{⊥}} (t) = 0,

(133)

and this completes the proof of Lemma 1.

Examining the structure of W_⊥ (125), we observe that the nullspace of $M {(\hat{θ})}^{- 1} W (\dot{v} (θ^{*}), v, φ, ξ)$ at any time t always contains P(θ) (37), but that ${\bar{P}}_{⊥}$ is by definition a matrix of basis vectors orthogonal to P(θ); thus P(θ) is removed from null(W_⊥(t)). Intuitively, if W_⊥(t) has no persistent non-trivial nullspace, the only way to satisfy (130) is for ${\bar{θ}}_{P_{⊥}} (t)$ to converge to zero. We will use this argument in the following theorem, which presents a sufficient condition for convergence of $\hat{θ}$ to P(θ).

5.3.4. Convergence of $\hat{θ} (t)$ to the true parameter set P(θ)

Theorem 2

For the system described by Theorem 1 and W_⊥(t) defined in (125), if there exist ϵ₀, δ₀ > 0 such that for any t ≥ t₀ and any unit vector $w \in R^{(p + a) - r}$ ,

‖ \int_{t}^{t + δ_{0}} W_{⊥} (σ) w d σ ‖ \geq ϵ_{0},

(134)

then

\lim_{t \to \infty} \hat{θ} (t) \in P (θ)

In other words, if W_⊥(t) is persistently exciting (PE) as defined in (134), then $\hat{θ} (t)$ converges to the true parameter set P(θ) (37), and the full state z(t) converges to the set S (62).

Proof. It suffices to show that $\lim_{t \to \infty} {\bar{θ}}_{P_{⊥}} (t) = 0$ , which implies $\lim_{t \to \infty} \hat{θ} (t) \in P (θ)$ . We prove this theorem by contradiction. First, we state assumptions and conditions of the theorem in Part 1. In Part 2, we suppose that $\lim_{t \to \infty} {\bar{θ}}_{P_{⊥}} (t) \neq 0$ , which we will later show to be a contradiction. In Part 3, we show that “when Δv is small, ${\bar{θ}}_{P_{⊥}}$ is flat,” using the approach described in Morgan and Narendra (1977). Finally, in Part 4 we use the flatness property and the PE condition (134) find a result that contradicts Lemma 1.

• Let all of the assumptions in Section 5 be satisfied such that Theorem 1 is true, including that lim_t→∞Δv(t) = 0.

• Let the PE condition in (134) be satisfied.

• Lemma 1 showed that $\lim_{t \to \infty} W_{⊥} (t) {\bar{θ}}_{P_{⊥}} (t) = 0$ .

• We also define the bounds b, b_⊥ > 0 such that

\begin{array}{c} \forall t \geq t_{0}, b_{⊥} \geq ‖ W_{⊥} (t) ‖ \end{array}

(135)

\begin{array}{c} \forall t \geq t_{0}, b \geq ‖ W (t, \hat{θ}) ‖ . \end{array}

(136)

These bounds exist since $W (\dot{v} (\hat{θ} (t)), v, φ, ξ)$ is a bounded function of $\dot{v} (\hat{θ}), v, φ, ξ$ , whose boundedness has been discussed in Section 5.2, and

‖ W_{⊥} (t) ‖ \leq ‖ M {(\hat{θ})}^{- 1} ‖ ‖ W (t, θ^{*}) ‖ ‖ {\bar{P}}_{⊥} ‖

(137)

where $M {(\hat{θ})}^{- 1}$ and $\dot{v} (θ^{*})$ were shown to exist and be bounded in Section 5.2, $W (t, θ^{*}) = W (\dot{v} (θ^{*}), v, φ, ξ)$ is thus a bounded function of bounded functions, and $‖ {\bar{P}}_{⊥} ‖ = 1$ .

Let us suppose that $\lim_{t \to \infty} {\bar{θ}}_{P_{⊥}} (t) \neq 0$ . Then there exists an ϵ > 0 and an unbounded sequence of times ${t_{i}}_{i = 1, \dots, \infty}$ such that $‖ {\bar{θ}}_{P_{⊥}} (t_{i}) ‖ \geq ϵ$ .

We will show that “when Δv is small, ${\bar{θ}}_{P_{⊥}}$ is flat” using an approach included in the proof of Sublemma 2 in Morgan and Narendra (1977). Given T′, ϕ > 0, ∃ϵ′ > 0 such that if ∥Δv(t)∥ ≤ ϵ′ ∀t ∈ [t_i, t_i + T′], then

‖ {\bar{θ}}_{P_{⊥}} (t) - {\bar{θ}}_{P_{⊥}} (t_{i}) ‖ \leq ϕ \forall t \in [t_{i}, t_{i} + T^{'}] .

(138)

To verify this, given a T′ and ϕ, we set

ϵ^{'} = \frac{ϕ}{T^{'} λ_{\max} (Γ) b}

(139)

and observe that, ∀t ∈ [t_i, t_i + T′], using the expression for ${\dot{\bar{θ}}}_{P_{⊥}}$ in (129), the fact that $‖ {\bar{P}}_{⊥} ‖ = 1$ , and the bound b (136),

\begin{array}{c} ‖ {\bar{θ}}_{P_{⊥}} (t) & - {\bar{θ}}_{P_{⊥}} (t_{i}) ‖ = ‖ \int_{t_{i}}^{t} {\dot{\bar{θ}}}_{P_{⊥}} (σ) d σ ‖ \end{array}

(140)

\begin{array}{c} \leq ‖ \int_{t_{i}}^{t} {\bar{P}}_{⊥} Γ W (σ, \hat{θ}) Δ v (σ) d σ ‖ \end{array}

(141)

\begin{array}{c} \leq \int_{t_{i}}^{t} ‖ {\bar{P}}_{⊥} ‖ ‖ Γ ‖ ‖ W (σ, \hat{θ}) ‖ ‖ Δ v (σ) ‖ d σ \end{array}

(142)

\begin{array}{c} \leq \int_{t_{i}}^{t} λ_{\max} (Γ) b ϵ^{'} d σ \end{array}

(143)

\begin{array}{c} \leq (t - t_{i}) λ_{\max} (Γ) b ϵ^{'} \end{array}

(144)

\begin{array}{c} \leq T^{'} λ_{\max} (Γ) b ϵ^{'} \end{array}

(145)

\begin{array}{c} \leq ϕ . \end{array}

(146)

Since lim_t→∞Δv(t) = 0, for any ϵ′ > 0 we can always find a t′ ≥ t₀ such that ∥Δv(t)∥ ≤ ϵ′ ∀t ≥ t′. Thus using ϵ₀ from the PE condition (134), ϵ from the assumption made in Part 2, and the bound b_⊥ (135), we can choose

\begin{array}{c} ϕ = \frac{ϵ_{0} ϵ}{2 δ_{0} b_{⊥}} \end{array}

(147)

\begin{array}{c} T^{'} = δ_{0} \end{array}

(148)

and find a t′ ≥ t₀ such that the flatness property (138) holds for each interval [t_i, t_i + δ₀] where t_i ≥ t′.

We will now show that for each t_i ≥ t′, there is a t_j ∈ [t_i, t_i + δ₀] where $‖ W_{⊥} (t_{j}) {\bar{θ}}_{P_{⊥}} (t_{j}) ‖ > 0$ . For each t_i ≥ t′, we use the reverse triangle inequality and the flatness property (138) to show:

\begin{array}{c} ‖ \int_{t_{i}}^{t_{i} + δ_{0}} W_{⊥} (σ) {\bar{θ}}_{P_{⊥}} (t_{i}) d σ ‖ \\ - ‖ \int_{t_{i}}^{t_{i} + δ_{0}} W_{⊥} (σ) {\bar{θ}}_{P_{⊥}} (σ) d σ ‖ \\ \leq ‖ \int_{t_{i}}^{t_{i} + δ_{0}} W_{⊥} (σ) [{\bar{θ}}_{P_{⊥}} (t_{i}) - {\bar{θ}}_{P_{⊥}} (σ)] d σ ‖ \end{array}

(149)

\begin{array}{c} \leq \int_{t_{i}}^{t_{i} + δ_{0}} ‖ W_{⊥} (σ) ‖ ‖ {\bar{θ}}_{P_{⊥}} (t_{i}) - {\bar{θ}}_{P_{⊥}} (σ) ‖ d σ \end{array}

(150)

\begin{array}{c} \leq \int_{t_{i}}^{t_{i} + δ_{0}} b_{⊥} ϕ d σ \end{array}

(151)

\begin{array}{c} \leq δ_{0} b_{⊥} ϕ, \end{array}

(152)

We rearrange (152) and choose the unit vector in the PE condition (134) to be

w ≜ \frac{{\bar{θ}}_{P_{⊥}} (t_{i})}{‖ {\bar{θ}}_{P_{⊥}} (t_{i}) ‖},

(153)

so that

\begin{array}{c} ‖ \int_{t_{i}}^{t_{i} + δ_{0}} W_{⊥} (σ) {\bar{θ}}_{P_{⊥}} (σ) d σ ‖ \\ \geq ‖ \int_{t_{i}}^{t_{i} + δ_{0}} W_{⊥} (σ) {\bar{θ}}_{P_{⊥}} (t_{i}) d σ ‖ - δ_{0} b_{⊥} ϕ \end{array}

(154)

\begin{array}{c} \geq ‖ {\bar{θ}}_{P_{⊥}} (t_{i}) ‖ ‖ \int_{t_{i}}^{t_{i} + δ_{0}} W_{⊥} (σ) w d σ ‖ - δ_{0} b_{⊥} ϕ . \end{array}

(155)

We now invoke the lower bound in the PE condition (134), substitute our choice of ϕ (147), and recall the assumption made in Part 2 that $‖ {\bar{θ}}_{P_{⊥}} (t_{i}) ‖ \geq ϵ$ . This results in:

\begin{array}{c} ‖ \int_{t_{i}}^{t_{i} + δ_{0}} W_{⊥} (σ) {\bar{θ}}_{P_{⊥}} (σ) d σ ‖ \\ \geq ‖ {\bar{θ}}_{P_{⊥}} (t_{i}) ‖ ϵ_{0} - δ_{0} b_{⊥} ϕ \end{array}

(156)

\begin{array}{c} \geq ‖ {\bar{θ}}_{P_{⊥}} (t_{i}) ‖ ϵ_{0} - \frac{ϵ_{0} ϵ}{2} \end{array}

(157)

\begin{array}{c} \geq \frac{ϵ_{0} ϵ}{2} \end{array}

(158)

\begin{array}{c} > 0 . \end{array}

(159)

Furthermore, we have

\begin{array}{c} \int_{t_{i}}^{t_{i} + δ_{0}} ‖ W_{⊥} (σ) {\bar{θ}}_{P_{⊥}} (σ) ‖ d σ \\ \geq ‖ \int_{t_{i}}^{t_{i} + δ_{0}} W_{⊥} (σ) {\bar{θ}}_{P_{⊥}} (σ) d σ ‖ \end{array}

(160)

\begin{array}{c} > 0 . \end{array}

(161)

By the mean value theorem for integrals, there must be a t_j ∈ [t_i, t_i + δ₀] where

\int_{t_{i}}^{t_{i} + δ_{0}} ‖ W_{⊥} (σ) {\bar{θ}}_{P_{⊥}} (σ) ‖ d σ = ‖ W_{⊥} (t_{j}) {\bar{θ}}_{P_{⊥}} (t_{j}) ‖ δ_{0},

(162)

and therefore from (161, 162)

‖ W_{⊥} (t_{j}) {\bar{θ}}_{P_{⊥}} (t_{j}) ‖ δ_{0} > 0

(163)

‖ W_{⊥} (t_{j}) {\bar{θ}}_{P_{⊥}} (t_{j}) ‖ > 0 .

(164)

Thus, we have found an unbounded sequence of times ${t_{j}}_{j = 1, \dots, \infty}$ where $‖ W_{⊥} (t_{j}) {\bar{θ}}_{P_{⊥}} (t_{j}) ‖ > 0$ , which contradicts Lemma 1 and implies that the assumption $\lim_{t \to \infty} {\bar{θ}}_{P_{⊥}} (t) \neq 0$ must be false. Therefore $\lim_{t \to \infty} {\bar{θ}}_{P_{⊥}} (t) = 0$ , which is equivalent to $\lim_{t \to \infty} \hat{θ} (t) \in P (θ)$ .

Together with the conclusion in the proof of Theorem one that lim_t→∞Δv(t) = 0, we have shown a sufficient condition for convergence of the full state z(t) to S, thereby achieving the goals (58,59) of this AID.

6. Simulation results

We report simulation results in which the “true” parameters of the simulated plant were chosen to match approximately those of the JHU Iver3 AUV. We include this simulation study to demonstrate that the AID achieves parameter estimate convergence to the true parameter set. Since the true parameter set of the Iver3 is unknown, we are unable to verify convergence of the parameter estimates to the “true” parameter set in the experimental results.

For this study, we simulated 5000 seconds of plant motion for the AID process. All available control inputs were excited in open loop to achieve the PE condition (134). A summary of the input signals is provided in Table 1. The set of adaptation gains used in the simulation study are listed in Table 2. We note that with different gain values, it may be possible to achieve more rapid parameter convergence, and that the selection of adaptation gains is an open problem in adaptive systems theory.

Table 1.

Simulation Study Control Inputs.

Actuator	Training data
Propeller	3.5 + 4 * Sin(0.7 t) N
R Fin	5°+10° * Sin(0.4 t)
T Fin	10° * Sin(0.2 t)
L Fin	5°+10° * Sin(0.4 t)
B Fin	10° * Sin(0.2 t)

Table 2.

Simulation Study AID Adaptation Gains.

$A$	$= I$
Γ_m	$= 1000 I$
Γ_d	$= 8000 I$
$Γ_{G}$	= diag([1,20,20,20]^T)
Γ_a	$= 500 I$

Gaussian noise was added to the attitude and velocity signals to mimic the expected measurement noise of the JHU Iver3 AUV sensor suite. All sensors were assumed to be operating at the same frequency, 10 Hz, which is the sampling frequency of the JHU Iver3’s actual MicroStrain 3DM-GX5-25 attitude and heading reference sensor (AHRS). The specific noise characteristics used in this simulation study are provided in Table 3. These values were determined from manufacturer-provided data sheets and bench-top testing.

Table 3.

Measurement Sources, Accuracies, and Usage in the Reported Results.

		Measurement	Simulation	Experimental	Experimental
State	Source	Standard Deviation	Results	Results (training)	Results (validation)
Attitude	MicroStrain MSF	0.25°	X	X
Angular velocity	MicroStrain MST	0.1°/s		X
Angular velocity	KVH 1775 FOG	0.01°/s	X		X
Trans velocity	RDI 600 kHz DVL	1.3 cm/s	X	X	X

Figure 1, shows that the estimated parameters converge to “true” parameters in simulation, provided that the PE condition (134) defined in Section 5.3 is satisfied. We note that in order to compute a parameter error between the estimated parameters and the non-unique set of “true” parameters, we employed a scaling factor with respect to a single known parameter.

Figure 1.

Simulation Study: Difference between true and estimated parameters (Y-Axis) versus time in seconds (X-axis). This Figure shows that the AID-estimated parameters converge to the true parameter set in simulation when a PE condition is satisfied.

7. Experimental results

This section reports the results of three at-sea field trials to evaluate experimentally the adaptive identification (AID) parameter estimator reported above. We utilized experimental data obtained with JHU’s Iver3 AUV (L3 OceanServer, Fall River, MA, USA), shown in Figure 2. We deployed the JHU Iver3 AUV in the Chesapeake Bay, MD, on November 11, 2019, and we report data from three dives (Dive 86, 87, and 88), which are described in Section 7.1.

Figure 2.

The Iver3 AUV is an underactuated AUV whose control authority is provided by the commanded rotational speed of its ducted propeller and commanded angles for the four red/yellow tail fins, all located at the stern of the vehicle. The 100 m depth-rated Iver3 AUV is one of several commercially available small AUVs designed for oceanographic survey operations including biological, physical-oceanographic, and bathymetric survey missions.

For these experiments, the Iver3 AUV was operating under “waypoint following” control, wherein the user entered a set of pre-programmed waypoints and the Iver3 proprietary controller determined the specific control inputs required to follow the waypoints.

To estimate the parameters for the Iver3 AUV using the AID, we used signals from a MicroStrain 3DM-GX5-25 (Parker LORD MicroStrain, Willston, VT, USA) which is a microelectromechanical systems (MEMS) AHRS, for the attitude and angular rate, and a Teledyne RDI Explorer (Teledyne RDInstruments, Poway, CA, USA) 600 kHz phased-array Doppler velocity log (DVL) for the translational velocity. The RDI DVL is part of the standard sensor suite on the Iver3 AUV installed by the manufacturer; the MicroStrain AHRS was integrated into the nose cone of the Iver3 AUV by the authors. An auxiliary inertial measurement unit (IMU) sensor, a KVH 1775 (KVH Industries, Middletown, RI, USA) fiber-optic gyroscope (FOG), which measures angular rate, was installed inside the Iver3 for these experimental trials. The KVH FOG, which is more accurate than the MicroStrain MEMS gyro, was used as an external source of “ground truth” for the angular rate DOFs during validation. The sensors used for AID and their noise characteristics are listed in Table 3.

7.1. Dive mission plans

Three experimental trials were conducted:

• Dive 86 was a waypoint-following undulate mission with relatively high variation in motion.

• Dive 87 was a waypoint-following mission with a lawnmower pattern at constant depth

• Dive 88 was the same mission as Dive 86.

An overview of the three dives is given in Figure 3. While it may have been ideal to utilize open-loop control signals for these experiments, as we did in simulation, it would have been infeasible in practice when operating in open water with an untethered UV. Instead, we utilized the manufacturer-provided mission planning tool to enter pre-programmed waypoints for the proprietary controller to follow.

Figure 3.

Overview of the three dives conducted in the Chesapeake Bay using the Iver3. The translational position during each dive as estimated by the front seat of the Iver3 is shown above in blue. The depth profile of each dive is shown below in red.

The waypoint-following undulate missions (Dives 86 and 88) were designed to excite the UV dynamics. The UV attempted to follow a series of zig-zagging waypoints while constantly undulating between 1 m and 5 m depths at a fixed 20° pitch angle. The constant-depth lawnmower mission (Dive 87), where the UV followed a series of waypoints in a rectangular grid, was designed to emulate typical seafloor mapping with a multi-beam sonar.

While Dives 86 and 88 employed the same mission, the control inputs determined by the manufacturer’s controller were not identical. External factors such as the UV’s initial position and the ambient water current resulted in different control inputs when following the same mission waypoints.

7.2. Experimental evaluation methodology

Our approach to evaluating the performance of the new AID was a “training and validation” procedure, in which we identified the parameters using the AID algorithm on experimental data from one dive, termed the “training” data, and then used those parameters in forward simulations of different dives, termed the “validation” data. We compared the resulting 6-DOF simulated model velocities to the actual experimentally-observed 6-DOF vehicle velocities. Since the control algorithms of the Iver3 feedback control system are not disclosed by the manufacturer, we used the experimentally-recorded control signals as inputs to the simulation of each dive. We also performed a “self-validation” procedure by computing a forward simulation of the training dive using its own estimated parameters.

The Iver3 AUV control inputs (fin angles and propeller angular velocity) and relevant components of the vehicle state (attitude, translational velocity, and angular velocity) were logged and time-stamped in real-time to the Iver3 AUV’s on-board CPU during the experimental trials. These data were used in post-processing to evaluate the performance of the proposed AID approach. No filtering of any sensor signal was used, the adaptation gains were held constant across all simulations, and the nonlinear differential equations which make up the AID were evaluated using simple Euler integration.

This AID method is well suited for online parameter estimation—assuming that the programming implementation is capable of handling asynchronous sensor readings, the sensor alignments are calibrated, and suitable adaptation gains are chosen. The appropriate adaptation gains can be found using simulation studies or data from previous experiments.

7.2.1. Adaptation gains

During the training procedure, we empirically selected adaptation gains. The adaptation gains were then held constant throughout the self-validation and validation procedure for all dives reported below. We note that the AID reported here is not particularly sensitive to the choice of adaptation gains; that is, small changes in the magnitude of a gain did not result in large changes in performance of the AID.

The adaptation gains for the AID were tuned to maximize performance on the training data set, Dive 86, with the AID active only when the UV was fully submerged. We found Dive 86 to contain over 24 uninterrupted minutes of maneuvering below the surface. The adaptation gains are given in Table 4.

Table 4.

Experimental Study AID Adaptation Gains.

$A$	= diag([6,7.5,7.5,6,0.75,0.75]^T)
Γ_m	= diag([34,2,2,2,34,34]^T)
Γ_d	= diag([80,23,23,23,80]^T)
$Γ_{G}$	= diag([7.5,6,6,60]^T)
Γ_a	$= diag ({[0.75,5,10,1.5,10,5 \times 10^{- 12}]}^{T})$

7.2.2. AID initialization

The parameter estimates for the AID were initialized to values given in Table 5.

Table 5.

Parameter Estimate Initial Values.

${\hat{θ}}_{m} (t_{0})$	= [45,70,70,20,110,110]^T
${\hat{θ}}_{d} (t_{0})$	= [2.73,245,245,0.05,70,70]^T
${\hat{θ}}_{G} (t_{0})$	= [6,1.2,1.0,−11.7353]^T
${\hat{θ}}_{a} (t_{0})$	$= {[0.1,1.0,2.0,0.1,1.5,5.8 \times 10^{- 6}]}^{T}$

7.3. AID performance

We report the performance of the AID as the mean absolute error (MAE), where the error is the difference between the simulated plant velocity resulting from the AID parameters and the velocity measured via the DVL and KVH gyro.

Table 6 reports the MAE when using the parameters estimated from the data of Dive 86 (the “training data”) on a forward simulation of the same dive and of Dives 87 and 88 (the “validation data”). A plot of the body velocity during the validation of Dive 88 is shown in Figure 4.

Table 6.

Mean Absolute Error of Simulated 6-DOF UV Plant Using Identified Parameters.

DOF	Training Dive 86	Validation Dive 87	Validation Dive 88
v ₁	0.0419 m/s	0.0394 m/s	0.0396 m/s
v ₂	0.0439 m/s	0.0619 m/s	0.0428 m/s
v ₃	0.0294 m/s	0.0117 m/s	0.0267 m/s
v ₄	0.0464 rad/s	0.0284 rad/s	0.0447 rad/s
v ₅	0.0288 rad/s	0.0085 rad/s	0.0289 rad/s
v ₆	0.0238 rad/s	0.0248 rad/s	0.0245 rad/s

Figure 4.

Validation results of Dive 88 using parameters identified during Dive 86. Gray traces are (unfiltered) translational and rotational velocities as measured by the DVL and KVH gyroscope. Blue traces are the plant velocities in forward simulation using the parameters as they were initialized in the AID. Red traces are plant velocities in forward simulation using the parameters identified via the new AID method.

The parameter set

\hat{θ}

that was found after training on Dive 86 is reported in Table 7. As explained in Section 4.3, the dynamics of the AID plant converges to that of the real plant, but the true parameter set is not unique. Moreover, this parameter set is only applicable to the particular parameterization of the plant and actuator model satisfying (34).

Table 7.

Estimated Parameters for a Plant and Actuator Model (34) of the Iver3 3026, With Components Described in Section 4.3.

Param	Value	Param	Value	Param	Value	Param	Value
m ₁₁	50.5 kg	d ₁₁	-15.5 kg/m	g	3.5 N	θ _a3	0.68
m ₂₂	70.5 kg	d ₂₂	243.7 kg/m	b ₁	-0.73 N⋅m	θ _a4	1.44
m ₃₃	69.5 kg	d ₃₃	242.8 kg/m	b ₂	0.80 N⋅m	θ _a5	0.56
m ₄₄	19.8 kg⋅m²	d ₄₄	-6.98 kg⋅m²	b ₃	-12.5 N⋅m	θ _a6	7.7 × 10⁻⁶ kg⋅m
m ₅₅	109.2 kg⋅m²	d ₅₅	-80.9 kg⋅m²	θ _a1	1.3
m ₆₆	109.9 kg⋅m²	d ₆₆	60.5 kg⋅m²	θ _a2	0.0048

7.4. Discussion

The results presented here in Table 6 and Figure 4 show that the ability of a forward simulation of the plant model to match experimentally observed plant performance depends highly on the model parameter values, a well-known fact that has been reported previously, for example in Harris et al. (2018); McFarland and Whitcomb (2021).

In Section 6, we showed that the adaptively identified parameters converge to the true parameters from an arbitrary initialization in a simulation study (in which the simulated “true” parameters are known) when the PE condition (134) is satisfied. However, we cannot compare the experimentally identified parameters to the true model parameters for two reasons:

(1) We do not know the true Iver3 AUV model parameters a priori.

(2) As observed in Section 4.5, the “true” parameters are not unique, but may be any element of the set P(θ) (37).

Thus, the experimental validity of this AID algorithm can be evaluated only by the ability of the estimated parameters to predict the UV’s velocity in forward simulation. As proven in Section 4.5.2, any parameter vector in the set P(θ) (37) will result in identical vehicle model input-output dynamics.

The results reported in Figure 4 show that the experimentally identified model (Table 7) performs far better than the model using the arbitrarily chosen initial parameter set (Table 5). Further, these results show that the identified parameter set performs approximately as well on the validation data as it does on the training data. We note that the angular-velocity data obtained by the more accurate KVH sensor were not used in the training of the AID and were used only to provide as close to a source of “ground truth” as is available for angular velocity.

We observed that the performance of the identified model is worst in sway and roll DOF. We believe this phenomenon is caused by the lack of precisely controlled excitation in these DOF.

8. Conclusion

This paper reports the theory and experimental evaluation of a novel adaptive identification (AID) algorithm for the simultaneous estimation of plant-model parameters and actuator parameters for underactuated vehicles in six degrees-of-freedom (DOF).

The simulation and experimental results reported herein indicate that it is feasible to use the AID algorithm for parameter estimation of underactuated underwater vehicles (UVs) in 6-DOF. Underactuated UVs are by far the most common class of UVs, but parameter estimation is more difficult for underactuated UVs than for fully actuated UVs because the reduced actuation necessarily reduces the plant excitation that can be induced from the control inputs. Additionally, underactuated UVs are often controlled with hydrodynamic control surfaces which are difficult to characterize.

The theoretical results reported herein provide the first reported AID algorithm with proof of convergence of the parameter estimates to the true set of parameters for this class of vehicles when a persistence of excitation condition is satisfied. Most previously reported AID approaches for this class of systems only show stability and boundedness of the parameter estimates.

The simulation and experimental results show that parameters estimated using this AID can converge with a “single training set” containing on the order of 1000 s of free-motion AUV dive data. The performance of the resulting model was verified in both self-validation and cross-validation. Additionally, this AID algorithm does not require access to body linear and angular acceleration signals, which can offer an advantage over other parameter-estimation methods.

Footnotes

Acknowledgments

We are grateful for discussions with and input from Dr Joseph Moore, of the Johns Hopkins University Applied Physics Laboratory (JHU APL) and the Department of Mechanical Engineering in the JHU Whiting School of Engineering. We also gratefully acknowledge the support of Mr Rick and Ms Valerie Smith, owners of Smiths Marina, Crownsville, MD, for their gracious support of the JHU Iver3 AUV field trials reported herein.

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Science Foundation under Awards 1319667 and 1909182, the National Defense Science and Engineering Graduate Fellowship (Harris), the In-House Laboratory Independent Research (ILIR) Program of the U.S. Navy Office of Naval Research (Paine), the DOD Science, Mathematics, And Research For Transformation (SMART) Defense Scholarship Program (Paine), and a Johns Hopkins University (JHU) Applied Physics Laboratory (APL) Graduate Fellowship (Mao).

ORCID iDs

Zachary J. Harris

Annie M. Mao

Tyler M. Paine

Louis L. Whitcomb

UV control inputs

This section describes a model of actuator forces and moments represented by the term τ(v, φ, ξ) defined in (23), for a class of torpedo-shaped UVs which includes the JHU Iver3. The JHU Iver3 is actuated with a combination of control surfaces and a propeller, as is common in many aerial and underwater vehicles.

We note that the AID described and utilized herein requires the value of the control input signals, ξ, but it is completely agnostic to the specific control law utilized by the UV.

We define the following coordinate frames for each fin:

• V – Vehicle coordinate frame, centered at the UV’s center of pressure (CP) with positive x forward, positive y starboard, and positive z down.

• F – Fin coordinate frame, centered at the fin CP when the fin is at commanded fin angle δ, with the x-axis along the chord line of the fin and the y-axis pointing away from the center line of the vehicle.

• F₀ – Fin coordinate frame at δ = 0.

• W – Flow coordinate frame, corresponding to the flow of water across the fin, where α is the angle of attack of the incident flow with respect to the chord of the fin.

We note that the commanded fin angle, δ, is not the fin angle of attack to incident flow, α, so that the F and W frames are generally not coincident. The position of the ith fin in the vehicle frame is specified by $^{V} p_{i} \in R^{3}$ , the vector from the vehicle’s center of gravity (CG) to the CP of the ith fin, and $^{V} ϕ_{i} \in R$ is the angular position of the ith fin in the vehicle frame.

We define the transformations between coordinate frames of each fin using rotation matrices R ∈ SO(3) as follows: (165)

\begin{array}{c} _{F}^{V} R_{i} =_{F_{0}}^{V} R_{i}_{F}^{F_{0}} R_{i} \in S O (3) \end{array}

(166)

\begin{array}{c} _{W}^{V} R_{i} =_{F}^{V} R_{i}_{W}^{F} R \in S O (3) \end{array}

where the notation

_{F}^{V} R_{i}

is the transformation from the fin frame to the vehicle frame and

_{W}^{V} R_{i}

is the transformation from the flow frame to the vehicle frame. The individual transformations are defined as (167)

\begin{array}{c} _{F_{0}}^{V} R_{i} = R_{x} (V_{ϕ_{i}})) \end{array}

(168)

\begin{array}{c} _{F}^{F_{0}} R_{i} = R_{y} (δ_{i}) \end{array}

(169)

\begin{array}{c} _{W}^{F} R_{i} = R_{y}^{T} (α_{i}), \end{array}

where R_x, R_y ∈ SO(3) are elemental rotations about the x-and y-axis, respectively. The form of (167) is assumed because, for the JHU Iver3 used to obtain the experimental results presented in Section 7 and many other UVs, the x-axis of the fin frame aligns with the x-axis of the vehicle frame.

The velocity of the ith fin through the water at the fin CP in vehicle coordinates is (170)

^{V} {\dot{p}}_{i} = ν + {(ω)}^{\land} \cdot^{V} p_{i},

and the velocity of the ith fin in the fin frame F is thus (171)

F_{{\dot{p}}_{i}} =_{F}^{V} R_{i}^{T}^{V} {\dot{p}}_{i .}

Assuming flow along the span of the fin foil does not affect the lift or drag, we use a projection matrix to find the flow along the x- and z-axes (172)

F_{{\dot{p}}_{x z_{i}}} = [\begin{array}{c} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 1 \end{array}]^{F} \dot{p_{i}},

from which we can find the angle of attack, α, as (173)

α_{i} = atan 2 (^{F} {\dot{p}}_{z_{i}},^{F} {\dot{p}}_{x_{i}}),

where atan2 is the four-quadrant arc-tangent function.

We then compute the hydrodynamic lift and drag force acting on each fin as (174)

L (α_{i}) = \frac{1}{2} ρ A C_{L} (α_{i}) ‖^{F} {\dot{p}}_{x z_{i}} ‖^{2}

(175)

D (α_{i}) = \frac{1}{2} ρ A C_{D} (α_{i}) ‖^{F} {\dot{p}}_{x z_{i}} ‖^{2},

where A is the surface area of the fin. The lift and drag coefficients of the fin, C_L(α_i) and C_D(α_i), respectively, are a function of the angle of attack. We parameterize these functions as polynomials. The lift coefficient of each fin is modeled as a two-parameter cubic polynomial: (176)

C_{L} (α_{i}) = θ_{a 1} α_{i} + θ_{a 2} α_{i}^{3} .

The drag coefficient of each fin is modeled as a three-parameter quadratic polynomial: (177)

C_{D} (α_{i}) = θ_{a 3} + θ_{a 4} α_{i}^{2} + θ_{a 5} α_{i}^{4} .

These polynomial lift and drag coefficient models approximate the lift and drag coefficients from a symmetric NACA foil shape.

The resultant force vector in the flow frame W is $^{W} f = - {[\begin{array}{c} D & 0 & L \end{array}]}^{T}$ . Thus, the force vector from the ith fin in the vehicle frame is (178)

V_{f_{i}} =_{W}^{V} R_{i}^{W} f_{i},

and the force and moment vector from the ith fin is (179)

τ_{i} = [\begin{array}{c} ^{V} f_{i} \\ {(V r_{i})}^{\land} \cdot^{V} f_{i} \end{array}] .

The total force and moment vector on a typical underactuated torpedo-like UV with a total of N control surfaces and one propeller is thus (180)

τ (v, φ, ξ) = \sum_{i}^{N} τ_{i} (v, φ, ξ) + {[\begin{array}{c} θ_{a 6} ω_{p}^{2} (t) & 0_{1 \times 5} \end{array}]}^{T},

where θ_a6 is the propeller thrust coefficient relating radial velocity ω_p to axial thrust T: (181)

T = θ_{a 6} ω_{p}^{2} .

This simplified thruster model is a reasonable assumption for class of UVs because the propeller is ducted with a high jet velocity compared to the advance velocity of the vehicle Newman (1977).

We have described a model of actuator forces and moments which act on the UV, and this model is expressed as τ(v, φ, ξ) with control inputs (182)

ξ = {[\begin{array}{c} δ_{1} & δ_{2} & δ_{3} & δ_{4} & ω_{p} \end{array}]}^{T} .

While τ(v, φ, ξ) is a nonlinear function of v, φ, and ξ, it is linear in the actuator parameters θ_a, defined in this case as (183)

θ_{a} = {[\begin{array}{c} θ_{a 1} & θ_{a 2} & θ_{a 3} & θ_{a 4} & θ_{a 5} & θ_{a 6} \end{array}]}^{T} \in R^{6} .

Parameter convergence

References

Aguiar

Hespanha

(2007) Trajectory-tracking and path-following of underactuated autonomous vehicles with parametric modeling uncertainty. IEEE Transactions on Automatic Control 52(8): 1362–1379. DOI: 10.1109/TAC.2007.902731.

Ahmed

Coppola

Bernstein

(1998) Adaptive asymptotic tracking of spacecraft attitude motion with inertia matrix identification. Journal of Guidance, Control, and Dynamics 21(5): 684–691. DOI: 10.2514/2.4310.

Alessandri

Caccia

Indiveri

, et al. (1998) Application of LS and EKF techniques to the identification of underwater vehicles. In: Proceedings of the IEEE International Conference on Control Applications. Trieste, Italy, pp. 1084–1088.

Atkeson

Hollerbach

(1988) Model-Based Control of a Robot Manipulator. Cambridge, MA, USA: MIT Press.

Armstrong

Khatib

Burdick

(1986) The explicit dynamic model and inertial parameters of the PUMA 560 arm. In: Proceedings of the IEEE International Conference on Robotics and Automation. San Francisco, California, USA: IEEE, pp. 510–518.

Astrom

(1989) Adaptive Control. Reading, MA, USA: Addison-Wesley.

Atkeson

Hollerbach

(1986) Estimation of inertial parameters of manipulator loads and links. The International Journal of Robotics Research 5(3): 101–119. DOI: 10.1177/027836498600500306.

Ayusawa

Venture

Nakamura

(2014) Identifiability and identification of inertial parameters using the underactuated base-link dynamics for legged multibody systems. The International Journal of Robotics Research 33(3): 446–468. DOI: 10.1177/0278364913495932.

Bagherzadeh

(2018) Nonlinear aircraft system identification using artificial neural networks enhanced by empirical mode decomposition. Aerospace Science and Technology 75: 155–171. DOI: 10.1016/j.ast.2018.01.004.

10.

Caccia

Indiveri

Veruggio

(2000) Modeling and identification of open-frame variable configuration underwater vehicles. IEEE Journal of Oceanic Engineering 25(2): 227–240.

11.

Craig

Hsu

Sastry

(1987) Adaptive control of mechanical manipulators. The International Journal of Robotics Research 6(2): 16–28.

12.

Feldman

(1979) DTNSRDC Revised Standard Submarine Equations of Motion. Technical report. Bethesda, MD, USA: US Department of Defense. Available at: https://apps.dtic.mil/sti/pdfs/ADA071804.pdf.

13.

Fossen

(1994) Guidance and Control of Ocean Vehicles. New York, NY: John Wiley and Sons.

14.

Fossen

(2002) Marine Control Systems: Guidance, Navigation, and Control of Ships, Rigs, and Underwater Vehicles. Trondheim, Norway: Maine Cybernetics.

15.

Freund

(1983) Fast nonlinear control with arbitrary pole placement for industrial robots and manipulators. The International Journal of Robotics Research 1(1): 65–78.

16.

Fukao

Nakagawa

Adachi

(2000) Adaptive tracking control of a nonholonomic mobile robot. IEEE Transactions on Robotics and Automation 16(5): 609–615. DOI: 10.1109/70.880812.

17.

Gertler

Hagen

(1967) Standard Equations of Motion for Submarine Simulation. Technical report, David W Taylor Naval Ship Research and Development Center, Bethesda MD. Available at: https://apps.dtic.mil/sti/citations/AD0653861.

18.

Goodman

(1960) Experimental techniques and methods of analysis used in submerged body reseach. In: Proc. Of the Third Symposium on Naval Hydrodynamics. Holland: Scheveningen, pp. 379–449.

19.

Graver

Bachmayer

Leonard

, et al. (2003) Underwater glider model parameter identification. In: Proc. 13th Int. Symp. on Unmanned Untethered Submersible Technology (UUST). Durham, NH, USA, 1. 12–13.

20.

Harris

(2019) Model-based cooperative acoustic navigation and parameter identification for underactuated underwater vehicles. PhD Thesis. Baltimore, MD USA: The Johns Hopkins University. Available at: http://jhir.library.jhu.edu/handle/1774.2/62228.

21.

Harris

Whitcomb

(2018) Preliminary evaluation of cooperative navigation of underwater vehicles without a DVL utilizing a dynamic process model. In: Proceedings of 2018 IEEE International Conference on Robotics and Automation. Brisbane, Australia: IEEE, pp. 4897–4904. DOI: 10.1109/ICRA.2018.8460970.

22.

Harris

Paine

Whitcomb

(2018) Preliminary evaluation of null-space dynamic process model identification with application to cooperative navigation of underwater vehicles. In: Proceedings of 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Madrid, Spain, pp. 3453–3459. DOI: 10.1109/IROS.2018.8594257.

23.

Hsu

Bodson

Sastry

, et al. (1987) Adaptive indentification and control for manipulators without using joint accelerations. In: Proceedings of the IEEE International Conference on Robotics and Automation. Raleigh, NC, USA, pp. 1210–1215.

24.

Imlay

(1961) The Complete Expressions for Added Mass of a Rigid Body Moving in an Ideal Fluid. Technical report: David Taylor Model Basin, Washington DC. Available at: https://archive.org/details/DTIC_AD0263966.

25.

Jiang

(2002) Global tracking control of underactuated ships by Lyapunov’s direct method. Automatica 38(2): 301–309. DOI: 10.1016/S0005-1098(01)00199-6.

26.

Karras

Bechlioulis

Leonetti

, et al. (2013) On-line identification of autonomous underwater vehicles through global derivative-free optimization. In: 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems. Tokyo, Japan, pp. 3859–3864. DOI: 10.1109/IROS.2013.6696908.

27.

Keim

Behcet Acikmese

Shields

(2006) Spacecraft inertia estimation via constrained least squares. In: Proceedings of the 2006 IEEE Aerospace Conference. MT, USA: Big Sky, p. 6. DOI: 10.1109/AERO.2006.1655995.

28.

Khalil

Dombre

(2002) Modeling, Identification & Control of Robots. New York, NY: Taylor & Francis.

29.

Khosla

Kanade

(1985) Parameter identification of robot dynamics. In: Proceedings of the 24th IEEE Conference on Decision and Control. Fort Lauderdale, FL, USA: IEEE, pp. 1754–1760. DOI: 10.1109/CDC.1985.268838.

30.

Koditschek

(1988) Application of a new Lyapunov function to global adaptive attitude tracking. In: Proceedings of the 27th IEEE Conference on Decision and Control. Austin, TX, USA: IEEE, 1, pp. 63–68. DOI: 10.1109/CDC.1988.194270

31.

Larsson

Regnstrom

Broberg

, et al. (1998) Failures, fantasies, and feats in the theoretical/numerical prediction of ship performance. In: Proceedings of the 22nd Symposium on Naval Hydrodynamics. Washington, DC, USA, pp. 11–32.

32.

Liu

Yang

Zhang

, et al. (2020) Computational fluid dynamics prediction of the dynamic behavior of autonomous underwater vehicles. IEEE Journal of Oceanic Engineering 45(3): 724–739. DOI: 10.1109/JOE.2019.2916119.

33.

Luh

Walker

Paul

(1980) Resolved-acceleration control of mechanical manipulators. IEEE Transactions on Automatic Control 25(3): 468–474. DOI: 10.1109/TAC.1980.1102367.

34.

Mao

Whitcomb

(2021) A novel quotient space approach to model-based fault detection and isolation: theory and preliminary simulation evaluation. In: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Prague, Czech Republic, pp. 7119–7126. DOI: 10.1109/IROS51168.2021.9636026.

35.

Martin

Whitcomb

(2014) Experimental identification of six-degree-of-freedom coupled dynamic plant models for underwater robot vehicles. IEEE Journal of Oceanic Engineering 39(4): 662–671. DOI: 10.1109/JOE.2013.2280492.

36.

McFarland

Whitcomb

(2013) Comparative experimental evaluation of a new adaptive identifier for underwater vehicles. In: 2013 IEEE International Conference on Robotics and Automation (ICRA). Karlsruhe, Germany, pp. 4614–4620. DOI: 10.1109/ICRA.2013.6631233.

37.

McFarland

Whitcomb

(2014) Experimental evaluation of adaptive model-based control for underwater vehicles in the presence of unmodeled actuator dynamics. In: 2014 IEEE International Conference on Robotics and Automation (ICRA). China, Hong Kong, pp. 2893–2900. DOI: 10.1109/ICRA.2014.6907275.

38.

McFarland

Whitcomb

(2021) Stable adaptive identification of fully-coupled second-order 6 degree-of-freedom nonlinear plant models for underwater vehicles: theory and experimental evaluation. International Journal of Adaptive Control and Signal Processing 35(5): 786–810. DOI: 10.1002/acs.3235.

39.

Morgan

Narendra

(1977) On the stability of nonautonomous differential equations

\dot{x}

= [A + B(t)]

x

, with skew symmetric matrix B(t). SIAM Journal on Control and Optimization 15(1): 163–176. doi:10.1137/0315013.

40.

Narendra

Annaswamy

(1989) Stable Adaptive Systems. Englewood Cliffs, NJ, USA: Prentice-Hall.

41.

Narendra

Parthasarathy

(1990) Identification and control of dynamical systems using neural networks. IEEE Transactions on Neural Networks 1(1): 4–27. DOI: 10.1109/72.80202.

42.

Newman

(1977) Marine Hydrodynamics. Cambridge, MA: MIT Press. Available at: https://mitpress.mit.edu/9780262534826/marine-hydrodynamics.

43.

Norman

Peck

O’Shaughnessy

(2011) In-orbit estimation of inertia and momentum-actuator alignment parameters. Journal of Guidance, Control, and Dynamics 34(6): 1798–1814.

44.

Paine

(2018) Robust model identification methods for nonlinear second-order plant models for underwater vehicles. Master’s Thesis. Baltimore, MD, USA: Johns Hopkins University. Available at: http://jhir.library.jhu.edu/handle/1774.2/59285.

45.

Paine

Whitcomb

(2018) Adaptive parameter identification of underactuated unmanned underwater vehicles: a preliminary simulation study. In: OCEANS 2018 MTS/IEEE Charleston. USA: Charleston, SC, pp. 1–6. DOI: 10.1109/OCEANS.2018.8604828.

46.

Paine

Whitcomb

(2021) Uniform complete observability of mass and inertial parameters in adaptive identification of rigid body plant dynamics. In: 2021 IEEE International Conference on Robotics and Automation (ICRA). Prague, Czech Republic: IEEE, pp. 52–58. DOI: 10.1109/ICRA48506.2021.9561892.

47.

Phillips

Turnock

Furlong

(2010) The use of computational fluid dynamics to aid cost-effective hydrodynamic design of autonomous underwater vehicles. Proceedings of the Institution of Mechanical Engineers - Part M: Journal of Engineering for the Maritime Environment 1: 1–16. DOI: 10.1243/14750902JEME199.

48.

Prestero

(2001) Verification of a Six-Degree of Freedom Simulation Model for the REMUS Autonomous Underwater Vehicle. Master’s Thesis, Cambridge, MA, USA: Massachusetts Institute of Technology/Woods Hole Oceanographic Institution Joint Program.

49.

Sadegh

Horowitz

(1990) Stability and robustness analysis of a class of adaptive controllers for robotic manipulators. The International Journal of Robotics Research 9(3): 74–92.

50.

Sahu

Subudhi

(2014) Adaptive tracking control of an autonomous underwater vehicle. International Journal of Automation and Computing 11(3): 299–307.

51.

Sastry

Bodson

(1989) Adaptive Control: Stability, Convergence, and Robustness, Englewood Cliffs, NJ, USA: Prentice-Hall. Available at: https://my.ece.utah.edu/∼bodson/acscr/.

52.

Shojaei

Shahri

Tarakameh

, et al. (2011) Adaptive trajectory tracking control of a differential drive wheeled mobile robot. Robotica 29(3): 391–402. DOI: 10.1017/S0263574710000202.

53.

Slotine

JJE

(1987) On the adaptive control of robot manipulators. The International Journal of Robotics Research 6(3): 49–59.

54.

Slotine

JJE

(1991) Applied Nonlinear Control. New Jersey: Prentice Hall.

55.

Smallwood

Whitcomb

(2003) Adaptive identification of dynamically positioned underwater robotic vehicles. IEEE Transactions on Control Systems Technology 11(4): 505–515. DOI: 10.1109/TCST.2003.813377.

56.

Society of Naval Architects and Marine Engineers (US) Technical and Research Committee Hydrodynamics Subcommittee (1950) Nomenclature for Treating the motion of a submerged body through a fluid: Report of the American towing tank conference. Technical and Research Bulletin, New York, NY, USA: Society of Naval Architects and Marine Engineers.

57.

Sonnenburg

Woolsey

(2013) Modeling, identification, and control of an unmanned surface vehicle. Journal of Field Robotics 30(3): 371–398. DOI: 10.1002/rob.21452.

58.

Swevers

Verdonck

De Schutter

(2007) Dynamic model identification for industrial robots. IEEE Control Systems Magazine 27(5): 58–71. DOI: 10.1109/MCS.2007.904659.

59.

Tao

(2003) Adaptive Control Design and Analysis. Hoboken, NJ: John Wiley and Sons.

60.

Tedrake

(2023) Underactuated Robotics: Algorithms For Walking, Running, Swimming, Flying, and Manipulation (course notes for MIT 6.8210: Underactuated Robotics), Cambridge, MA, USA: Russ Tedrake. Available at: http://underactuated.mit.edu/(Accessed 20 June 2023).

61.

van de Ven

Johansen

Sørensen

, et al. (2007) Neural network augmented identification of underwater vehicle models. Control Engineering Practice 15(6): 715–725. doi:10.1016/j.conengprac.2005.11.004. Special Section on Control Applications in Marine Systems.

62.

Van Manen

Van Ossanen

(1988). In: Lewis

(ed) Principles of Naval Architecture, Second Revision, Volume II: Resistance, Propulsion, and Vibration. Jersey City, New Jersey USA: Society of Naval Architects and Marine Engineers.

63.

Wehbe

Krell

(2017) Learning coupled dynamic models of underwater vehicles using support vector regression. OCEANS 2017 - Aberdeen. UK: Aberdeen, pp. 1–7.

64.

Wehbe

Hildebrandt

Kirchner

(2017) Experimental evaluation of various machine learning regression methods for model identification of autonomous underwater vehicles. In: 2017 IEEE International Conference on Robotics and Automation (ICRA). Singapore, pp. 4885–4890.

65.

Whitcomb

Rizzi

Koditschek

(1993) Comparative experiments with a new adaptive controller for robot arms. IEEE Transactions on Robotics and Automation 9(1): 59–70.

66.

Willcox

Ghattas

Heimbach

(2021) The imperative of physics-based modeling and inverse theory in computational science. Nature Computational Science 1(3): 166–168. DOI: 10.1038/s43588-021-00040-z.

67.

Wong

de Queiroz

Kapila

(2001) Adaptive tracking control using synthesized velocity from attitude measurements. Automatica 37(6): 947–953. DOI: 10.1016/S0005-1098(01)00038-3.

68.

Woo

Park

, et al. (2018) Dynamic model identification of unmanned surface vehicles using deep learning network. Applied Ocean Research 78: 123–133. DOI: 10.1016/j.apor.2018.06.011.

Stable nullspace adaptive parameter identification of 6 degree-of-freedom plant and actuator models for underactuated vehicles: Theory and experimental evaluation

Abstract

Keywords

1. Introduction

1.1. Motivation for accurate model identification

1.2. Common approaches to parameter identification

1.3. Parameter identification of underactuated robotic vehicles

2. Literature review

2.1. Least squares and Kalman filtering

2.2. Adaptive trajectory-tracking control

2.2.1. Relationship of adaptive identification to adaptive trajectory tracking control

2.3. Adaptive parameter estimation

2.4. Machine learning

2.5. Approach comparison

3. Mathematical conventions

4. Finite-dimensional dynamical plant models for 6-DOF vehicles

4.1. State and control input representation

4.2. Commonly accepted plant model

4.2.1. History of the development of plant models for underwater vehicles

4.2.2 Plant model

4.2.3 Actuator model

4.3. Combined parameter vector

4.3.1. Plant parameters

4.3.2. Control-actuator parameters

4.3.3. Plant and actuator parameter vector

4.4. Regressor formulation of plant and actuator dynamics

4.5. Regressor nullspace properties

4.5.1. Non-uniqueness of the parameter vector

4.5.2. Acceleration input-output dynamics

5. Adaptive identifier

5.1. Update laws and error dynamics

5.2 Convergence of the velocity error

5.3. Convergence of the parameter estimate

5.3.1. Coordinate transformation

5.3.2. Equivalent error dynamics

5.3.3. Convergence of Δ v ˙ ( t )

5.3.4. Convergence of θ ^ ( t ) to the true parameter set P(θ)

6. Simulation results

7. Experimental results

7.1. Dive mission plans

7.2. Experimental evaluation methodology

7.2.1. Adaptation gains

7.2.2. AID initialization

7.3. AID performance

7.4. Discussion

8. Conclusion

Footnotes

Acknowledgments

Declaration of conflicting interests

Funding

ORCID iDs

UV control inputs

Parameter convergence

References

5.3.3. Convergence of $Δ \dot{v} (t)$

5.3.4. Convergence of $\hat{θ} (t)$ to the true parameter set P(θ)