Abstract
Model-based approaches to navigation, control, and fault detection that utilize precise nonlinear models of vehicle plant dynamics will enable more accurate control and navigation, assured autonomy, and more complex missions for such vehicles. This paper reports novel theoretical and experimental results addressing the problem of parameter estimation of plant and actuator models for underactuated underwater vehicles operating in 6 degrees-of-freedom (DOF) whose dynamics are modeled by finite-dimensional Newton-Euler equations. This paper reports the first theoretical approach and experimental validation to identify simultaneously plant-model parameters (parameters such as mass, added mass, hydrodynamic drag, and buoyancy) and control-actuator parameters (control-surface models and thruster models) in 6-DOF. Most previously reported studies on parameter identification assume that the control-actuator parameters are known a priori. Moreover, this paper reports the first proof of convergence of the parameter estimates to the true set of parameters for this class of vehicles under a persistence of excitation condition. The reported adaptive identification (AID) algorithm does not require instrumentation of 6-DOF vehicle acceleration, which is required by conventional approaches to parameter estimation such as least squares. Additionally, the reported AID algorithm is applicable under any arbitrary open-loop or closed-loop control law. We report simulation and experimental results for identifying the plant-model and control-actuator parameters for an L3 OceanServer Iver3 autonomous underwater vehicle. We believe this general approach to AID could be extended to apply to other classes of machines and other classes of marine, land, aerial, and space vehicles.
Keywords
1. Introduction
This paper addresses the problem of simultaneously estimating the plant-model and control-actuator parameters for an underactuated underwater vehicle (UV) operating in 6 degrees-of-freedom (DOF). In particular, we report a new adaptive identification (AID) algorithm, including a novel proof of asymptotic parameter convergence, and associated simulation and experimental results.
To the best of our knowledge, this paper reports the first theoretical approach and experimental validation to identify simultaneously UV dynamical plant-model parameters (for critical terms such as mass, added mass, hydrodynamic drag, and buoyancy) and control-actuator parameters (for control-surface models and thruster models) in 6-DOF. Most previously reported studies on parameter identification assume that the control-actuator parameters are known a priori. Moreover, this paper reports the first proof of convergence of the parameter estimates to the true set of parameters for this class of vehicles when a persistence of excitation condition is satisfied. Most previously reported AID approaches for this class of systems only show stability and boundedness of the parameter estimates.
The performance of a dynamic model in predicting a robotic vehicle’s state depends primarily on the accuracy of the model structure and parameters. Accurate estimates of these parameters are required for model-based navigation, control, fault detection, and simulation. Indeed, a broad class of nonlinear model-based trajectory-tracking controllers for robot arms developed since the 1980s require exact knowledge of the plant’s kinematic and dynamic parameters—examples include nonlinear tracking controllers for second-order rotating plants Koditschek (1988) and exactly linearizing model-based trajectory-tracking controllers for open kinematic chains Freund (1983); Luh et al. (1980).
While the general form of dynamical plant models for underwater vehicles has been understood for decades, Society of Naval Architects and Marine Engineers (U.S.). Technical and Research Committee. Hydrodynamics Subcommittee (1950), the dynamic-model parameters—that is, parameters for terms including mass, added mass, hydrodynamic drag, buoyancy, and control actuators—are impossible to determine analytically and are often not provided by manufacturers of the robotic vehicles. In contrast to plant kinematic parameters, which are often easily obtained from design documents and generally do not vary with time, plant dynamic parameters generally must be measured empirically and may change over time. Sources of such changes may include but are not limited to: substantial payload changes to the vehicle or robotic arm; physical changes to the vehicle, such as lengthening or shortening a UV to accommodate different payloads; variation in ballasting and trim conditions; variation in water density which affects vehicle buoyancy; and changes to on-board equipment and instrumentation (both external and internal). Model parameters must be re-estimated whenever the robot physical configuration is significantly altered.
Estimating plant-model parameters is generally more difficult for underactuated, torpedo-shaped UVs than for fully actuated UVs with thrusters because: (i) the reduced actuation available on underactuated UVs limits the plant excitation that can be induced from the control inputs, and (ii) torpedo-shaped UVs and many uncrewed aerial vehicles (UAVs) are often actuated with control surfaces (e.g., fins, wings, elevators, and rudders), which are difficult to characterize independently of the plant-model parameters.
For these reasons, we seek an approach to parameter estimation for underactuated UVs in 6-DOF that simultaneously estimates plant and actuator parameters and can be performed routinely in the field with minimal time and effort by the vehicle operator. We believe the general approach to AID reported here is not limited to underwater vehicles and could be extended to apply to other classes of machines and other classes of marine, land, aerial, and space vehicles.
1.1. Motivation for accurate model identification
Model-based approaches to navigation, control, and fault detection that utilize precise nonlinear models of vehicle dynamics will enable more accurate control and navigation, assured autonomy, and more complex missions for robotic vehicles. In particular: (1) Accurately identified model parameters can be utilized in forward simulation to predict vehicle performance and tune gains for various controllers (open-loop and closed-loop), navigation systems, and motion-planning algorithms. (2) Accurately identified model parameters can be utilized in model-based state-estimation (navigation) systems that utilize the full UV plant model in their solution Harris and Whitcomb (2018). (3) Accurately identified model parameters can enable model-based control algorithms that offer improved performance over traditional controllers. For example, previously reported studies have shown that nonlinear adaptive model-based control (AMBC) can outperform proportional derivative control (PDC) in trajectory tracking for fully actuated UVs McFarland and Whitcomb (2014) and fully actuated robot arms Craig et al. (1987); Slotine and Li (1987); Sadegh and Horowitz (1990); Whitcomb et al. (1993). (4) Online approaches for identifying plant parameters may lead to new methods for online fault detection and fault-tolerant control of UVs, for example Mao and Whitcomb (2021).
1.2. Common approaches to parameter identification
The three most common approaches to model-parameter estimation for vehicles with dynamics that cannot be fully characterized analytically, such as UVs, are: (1) utilizing data obtained in captive-motion experiments (e.g., towing a vehicle in a hydrodynamic test tank or wind tunnel), (2) utilizing data from computational fluid dynamics (CFD) simulations, or (3) utilizing data obtained in full-scale experimental trials of a robotic vehicle in controlled free motion (e.g., under open-loop or closed-loop control).
Captive-motion experiments are the standard in naval architecture Van Manen and Van Ossanen (1988); however, they are time consuming, expensive, and difficult to perform properly. Further, the results are valid only for the specific vehicle configuration tested, and it is often impractical to repeat the experiments for every possible configuration of the robotic vehicle. Typically, the experimental setup involves either rotating-arm experiments Van Manen and Van Ossanen (1988) or planar motion mechanism (PMM) experiments, such as towing a vehicle on a carriage with a load cell at a fixed velocity in a hydrodynamic test facility, for example, Prestero (2001). Captive-motion experiments can be quite accurate in certain DOF, but these experiments often require decoupling the DOF, which can lead to model inaccuracies. Typically, separate tests are conducted with and without the control surfaces (fins) installed in order to isolate the fin drag from the body drag, and separate test facilities are often required to characterize the fin lift and drag as a function of the angle of attack.
Parameter estimation based on computational fluid dynamics uses numerical models of fluid flow around the vehicle to simulate virtual PMM tests. CFD-based approaches to model identification vary widely in accuracy, time, and cost. Additionally, CFD-based approaches require detailed computer-aided design (CAD) models of the UV, which may be difficult or impossible to obtain from commerical off-the-shelf (COTS) UV manufacturers. If a CAD model is created by the end user for a COTS UV, often the vehicle is modeled as a smooth hull—appendages such as antennas, strobe lights, sensors (e.g., CTD sensors and sonar transducers), and acoustic modem transducers are left unmodeled Liu et al. (2020). An advantage of CFD-based parameter estimation, however, is the ability to inform design choices during the preliminary vehicle-design process Phillips et al. (2010).
While these approaches have advantages, both captive-motion experiments and CFD-based approaches are often infeasible for many UV end users for reasons of cost, time, and practicality.
Parameter identification based on data collected in full-scale experimental trials of UVs in controlled free motion has several advantages over captive-motion experiments and CFD. First, the approach is accessible to any end user who can deploy an existing UV. Additionally, though beyond the scope of this paper, some approaches that utilize full-scale experimental data, such as the approach described herein, can be extended to run in real time during vehicle missions. Such algorithms can also enable online model-based fault detection and fault isolation Mao and Whitcomb (2021).
1.3. Parameter identification of underactuated robotic vehicles
Many robots, such as torpedo-shaped autonomous underwater vehicles (AUVs), aerial drones, and aerospace vehicles are not fully actuated. Instead, these robots are underactuated. We adopt the following definition for Underactuated Control Differential Equations from Tedrake (2023): A second-order control differential equation described by the equations
The robotic vehicle used to validate our theoretical results, an Iver3 (L3Harris OceanServer, Fall River, MA, USA) AUV, is clearly underactuated. The vehicle has only five control inputs (4 commanded fin angles and 1 commanded main propeller angular velocity), fewer than the UV’s 6-DOF. Thus, the Iver3 robotic vehicle is underactuated by definition, regardless of the particular method used to model the actuators. The Iver3 belongs to the common class of torpedo-shaped AUVs. This class of UV differs significantly from fully-thruster-actuated remotely operated vehicles (ROVs), where the thrusters are often capable of exerting arbitrary 6-DOF forces and moments on the vehicle. Torpedo-shaped UVs are controllable only when in forward motion, are generally incapable of hovering, and are physically unable to track fully general 6-DOF reference trajectories.
Parameter estimation of underactuated UVs presents real theoretical and practical challenges. A main concern is whether the UV parameters are observable. In other words: do the control actuators have sufficient control authority to excite the plant so that the plant model parameters are observable? The authors have previously defined necessary and sufficient conditions for mass and rotational-inertia parameters in rigid-body dynamical plants to be uniformly completely observable (UCO) Paine and Whitcomb (2021). Previously reported simulation studies confirmed that underactuated rigid-body plants can meet these conditions. We conjectured that a similar approach could be used to show UCO of other parameters such as drag coefficients and gravity terms. In this paper, via the equivalent concept of persistence of excitation (PE), we define sufficient conditions for convergence of plant and actuator parameter estimates to a true parameter set.
To the best of our knowledge, with the exception of Harris et al. (2018), Harris (2019), Paine (2018), upon which this work is based, there are no previously reported studies for parameter identification of underactuated robotic vehicles that simultaneously identify plant-model parameters and actuation parameters in 6-DOF.
With few exceptions, namely McFarland and Whitcomb (2021), many previously reported AID methods require model-based adaptive tracking controllers and are not applicable when the UV is operating under any control law other than a specific adaptive tracking controller. On commercially available robotic vehicles, including UVs, the user is often limited to using the proprietary control system provided by the manufacturer, and the user cannot easily replace the manufacturer’s proprietary controller with an adaptive trajectory-tracking controller (ATTC). The AID approach reported herein works for robotic plants operating under any control inputs (specifically, it not limited to the case of trajectory-tracking control), and is thus applicable in the common situation of a UV operating with a proprietary, manufacturer-provided controller or in open-loop control.
In summary, the advantages of this AID approach include: • This AID method simultaneously estimates both plant and actuator parameters that are present in a physics-based model of robot dynamics, either fully actuated or underactuated. Most previously reported studies on parameter identification assume that the control-actuator parameters are known a priori. • This AID method is the first reported AID approach with a proof of convergence of the parameter estimates to the true set of parameters for this class of vehicles, provided that a PE condition is satisfied. Most previously reported AID approaches for this class of systems only show stability and boundedness of the parameter estimates. • This AID method, like other adaptive identifiers, does not require body-relative linear and angular acceleration signals, which is beneficial because body-relative linear acceleration terms are difficult to obtain. Although not discussed in detail here, we find this AID method is more robust to sensor noise when compared to the least-squares–based methods described in Harris et al. (2018). • This AID method ensures stability and asymptotic convergence of the identification plant velocity error and convergence of the parameter estimates, in contrast to many machine learning (ML) approaches which are generally unable to provide such guarantees. • The simulation and experimental studies reported here show that AID-estimated parameters can converge with a “single training set” containing on the order of 1000 seconds of free-motion AUV dive data, and the performance of the resulting model was verified in both self-validation and cross-validation. • This AID method may be performed by a user in the field when the plant dynamics change, unlike captive-motion experiments or CFD. • This AID method works with any open or closed-loop control law, provided that a PE condition is satisfied. Unlike many previously reported AID methods, this method does not require a trajectory-tracking controller, which is often unsuitable for underactuated vehicles without control authority in all DOF.
2. Literature review
Most previously reported model-based parameter identification methods employ one of two general approaches: (i) least-squares linear regression of experimental data or (ii) adaptive model-based trajectory-tracking control and adaptive identification. Khalil and Dombre provide a detailed overview of these approaches Khalil and Dombre (2002). We separately group machine learning (ML) and neural network (NN) techniques as a third type of method for system identification, because these methods do not explicitly use a dynamic model for all or part of the plant. Instead, these methods largely map plant input-to-output behavior. The following section describes recent work in these three areas and compares the different approaches.
2.1. Least squares and Kalman filtering
A variety of previously reported studies have employed least-squares, total least-squares, or extended Kalman filters to identify plant parameters entering linearly into the plant equations of motion for robot manipulators Khosla and Kanade (1985); Atkeson et al. (1986); An et al. (1988); Armstrong et al. (1986); Swevers et al. (2007), uncrewed underwater vehicles (UUVs) Caccia et al. (2000); Alessandri et al. (1998); Martin and Whitcomb (2014); Harris et al. (2018), and spacecraft Norman et al. (2011); Keim et al. (2006). Most methods require the time derivative of joint angles or time derivative of 6-DOF body-relative linear and angular velocity, which is difficult to measure directly, and often is numerically estimated by differentiating sensor measurements—a process that is sensitive to sensor noise. Some studies ameliorate the problem of sensor noise through use of signal filtering. However, unless performed using an acausal method, filtering introduces time delay. These issues are (sometimes) adequately handled during offline data processing, but pose a significant challenge for online parameter estimation.
2.2. Adaptive trajectory-tracking control
Adaptive model-based tracking controllers have been proposed for both fully actuated McFarland and Whitcomb (2014), Sahu and Subudhi (2014) and underactuated marine vehicles Jiang (2002), Aguiar and Hespanha (2007), spacecraft Ahmed et al. (1998), Wong et al. (2001), and mobile robots Fukao et al. (2000), Shojaei et al. (2011). However, precise definitions for persistence of excitation (PE) to ensure asymptotic convergence of parameter-estimation error are absent from many studies, such as Shojaei et al. (2011), Sahu and Subudhi (2014), Wong et al. (2001). PE-like conditions were reported in Aguiar and Hespanha (2007), with a focus on stability of the trajectory-tracking error and robustness to parametric uncertainty. Necessary and sufficient PE conditions for asymptotic identification were reported in Ahmed et al. (1998), but only for parameters in the inertia matrix for 3-DOF rotational plants.
In Craig et al. (1987) the authors report a stable adaptive trajectory-tracking algorithm for fully actuated robot arms that require instrumentation of joint acceleration. In Slotine and Li (1987); Sadegh and Horowitz (1990); Whitcomb et al. (1993) the authors report three different stable adaptive trajectory-tracking algorithms for fully actuated robot arms that do not require instrumentation of joint acceleration
2.2.1. Relationship of adaptive identification to adaptive trajectory tracking control
The present study addresses AID; it does not address the problem of simultaneously performing adaptive model identification and model-based trajectory tracking control, that is, ATTC. There are numerous previously reported results for ATTC of mechanical systems in which the plant state is controlled to follow a smooth trajectory known a priori. ATTC, however, will not work when either (i) the plant is underactuated or (ii) the plant is controlled by an existing controller that cannot be modified, as is often the case with commercially available robot systems, or (iii) the plant is under open-loop control, as is often the case for system identification experiments in the early stages of vehicle development.
In general, papers that report novel adaptive model-based tracking controllers prove convergence of the trajectory-tracking error, and prove stability and boundedness (but not convergence) of the parameter-estimation error.
2.3. Adaptive parameter estimation
Adaptive methods for parameter identification of linear time-invariant plants are well understood Narendra and Annaswamy (1989), Sastry and Bodson (1989), Astrom (1989), Tao (2003). In Hsu et al. (1987), the authors report an AID algorithm for robot arms that requires instrumentation of joint acceleration, as well as a low-pass filter approach for the parameter update law to avoid this requirement.
Relatively few studies have considered model-parameter identification of underactuated UVs, for example, Graver et al. (2003), or underactuated robot arms, for example, Ayusawa et al. (2014). To the best of our knowledge, only two previous studies, Paine and Whitcomb (2018), McFarland and Whitcomb (2021), have addressed AID of underactuated UVs.
Few UV parameter identification methods which do not require direct instrumentation of acceleration or trajectory-tracking controllers have been reported. The AID algorithm presented here may prove useful in applications with controlled or uncontrolled plants in which reference trajectory tracking is impractical or infeasible.
AIDs for fully actuated multi-DOF UV plant models were first reported by Smallwood and Whitcomb in Smallwood and Whitcomb (2003), but this AID was limited to fully diagonal plant models in which the dynamics of each DOF is fully decoupled and independent from the dynamics of other DOF. McFarland and Whitcomb in McFarland and Whitcomb (McFarland and Whitcomb, 2013, 2021) report an AID for fully coupled 6-DOF UV plant models; the AID algorithm applies to fully actuated or underactuated plants, and the experiments reported utilized 6-DOF actuation. That study is the foundation for the AID reported here. In Paine and Whitcomb (2018), Paine and Whitcomb reported an extension of the AID reported in McFarland and Whitcomb (2013) to 3-DOF underactuated UV plant models and includes a simulation study with Gaussian noise.
2.4. Machine learning
In their pioneering paper Narendra and Parthasarathy (1990), Narendra and Parthasarthy developed the theoretical foundations for the application of neural networks (NNs) to perform system identification of nonlinear dynamical plants. More recently, experimental results using ML and NN-based identification methods have been reported for underactuated UVs Wehbe et al. (2017), Wehbe and Krell (2017), Karras et al. (2013). Broad interest in using these methods for system identification has led to a variety of different approaches.
In general, ML approaches to model identification appear to be divided on the level to which they utilize physics-based models. Some authors caution against adopting techniques and algorithms from domains void of underlying physics (such as natural language processing) to the natural sciences; instead, these authors argue specifically for increased usage of physics-based models in ML Willcox et al. (2021). An interesting hybrid approach is reported in Woo et al. (2018), wherein the authors decompose the dynamics of a uncrewed surface vessel (USV) into linear and nonlinear terms; they estimate the parameters corresponding to the linear terms using the parameter-estimation technique from Sonnenburg and Woolsey (2013) and use ML for the nonlinear model. A similar hybrid approach for UVs was proposed in Van de Ven et al. (2007). In this case, the underlying structure of the nonlinear dynamic model is maintained via the careful design of separate neural networks to estimate each of the mass, damping, and buoyancy terms. However, other authors in ML, for example, Bagherzadeh (2018), argue against the use of a predetermined structure, saying that the use of predetermined model structure reduces the problem of model identification to one of parameter estimation, which may not fully capture the nonlinearities of the dynamics. Clearly, there is some disagreement in this field, and some researchers report satisfactory results obtained via their particular approach to system identification.
2.5. Approach comparison
We can offer some comparison of these approaches. Least-squares estimation is often performed offline with some signal filtering, whereas the adaptive techniques, including the AID reported herein, can be implemented online. Additionally, least-squares estimation typically requires signals of translational and rotational body-frame acceleration, while AIDs generally require only body-frame velocities. Although some argue that ML and NN methods may be able to perform system identification by capturing complex nonlinear dynamics that are difficult to model, these methods are not guaranteed to be stable or, in some cases, to generalize beyond the training data set. Additionally, in most cases, significant computational time and training data are needed to complete estimates using ML and NN methods. On the other hand, the adaptive techniques can be proven to be stable and to provide asymptotic error convergence, as we will show in Section 5.2 and 5.3.
3. Mathematical conventions
For a matrix
• Euclidean (L2) norm:
We also define the following norms on the matrix A:
• Spectral norm:
• Frobenius norm:
For a vector
• The diagonal matrix operator • The skew-symmetric operator
• The se(3) adjoint operator
4. Finite-dimensional dynamical plant models for 6-DOF vehicles
We consider the class of plants which can be modeled by the Newton-Euler equations in body-relative coordinates. We assume the inertial terms include the mass of the body used in classical rigid-body dynamics, and also include the additional “added mass” or “hydrodynamic mass” arising from the accelerating/decelerating submerged body’s interaction with surrounding fluid. Furthermore, we consider that such plants can be subject to quadratic drag forces and net gravitational forces and moments.
4.1. State and control input representation
• • • •
The control inputs ξ(t) for the specific vehicle studied in this paper are defined in Section 4.3 and Appendix A.
4.2. Commonly accepted plant model
This section describes the commonly accepted finite-dimensional nonlinear equations of motion for 6-DOF vehicles subject to quadratic drag and gravitational forces. We will provide a brief history of the development of this model for 6-DOF underwater vehicles. However, we argue that the nonlinear dynamics of many 6-DOF plants, including terrestrial, aerospace, and underwater vehicles, can be modeled by these same equations of motion, which are simply the Newton-Euler equations with additional terms to represent drag, buoyancy, and control forces.
4.2.1. History of the development of plant models for underwater vehicles
The dynamics of a rigid-body underwater vehicle includes the finite-dimensional dynamics of the rigid-body vehicle body itself and the infinite-dimensional dynamics of the fluid surrounding the vehicle. While the former can be described by a finite-dimensional ordinary differential equation (ODE), the latter is described by the incompressible Navier–Stokes equation. With the exception of a few special cases, the infinite-dimensional Navier–Stokes equation has no closed-form solution and, for the case of marine vehicles, generally cannot be solved in real time (Larsson et al. 1998).
The most commonly accepted finite-dimensional models for submarine vehicles trace their lineage to studies beginning in the late 1950s and early 1960s at the U.S. Navy’s David Taylor Model Basin, (Goodman, 1960, Imlay, 1961), which referred to mass of the system resulting from immersion in a fluid as “added mass” and “derivative mass.” In Gertler and Hagen (1967), Feldman (1979) the authors expanded upon these equations, which have since become widely known as the “Standard equations of motion for submarine simulation” (Gertler and Hagen, 1967) and the “DTNSDC revised standard submarine equations of motion” (Feldman, 1979). More recently, Fossen in Fossen (1994, 2002) built upon these earlier studies to express in matrix form the equations of motion of an immersed body subject to non-conservative hydrodynamic forces. The matrix model is now commonly used throughout the literature.
4.2.2 Plant model
The commonly accepted second-order finite-dimensional dynamic model of this class of dynamical plants, as described in a body-relative frame coincident with the center of mass, is given by • • • • •
The body-frame angular velocity vector
Rearranging (7) to find the time derivative of the body velocity,
We assume the mass matrix is diagonal and can have different inertia quantities in each translational degree of freedom, that is, each diagonal entry of M can be unique. This is a common way to model hydrodynamic “added mass,” that is
We parameterize the Coriolis matrix C(v) from M,
We assume the quadratic drag matrix, D
q
(v), is diagonal positive semidefinite such that
We define the function
4.2.3 Actuator model
The vector
The AID in this study is applicable to actuator models of the general form (23), with linearly-entering parameters and the additional requirement that W a (v, φ, ξ) be uniformly continuous in v, φ, ξ and bounded for bounded v, φ, ξ.
4.3. Combined parameter vector
4.3.1. Plant parameters
In summary, our goal is to identify the following plant parameters: • Mass parameters: • Drag parameters: • Buoyant-force and righting moment parameters:
For convenience, we define the combined plant parameter vector
4.3.2. Control-actuator parameters
Our goal is to identify the following fin and thruster actuator parameters, described in greater detail in Appendix A. For convenience, we define the control-actuator parameter vector
4.3.3. Plant and actuator parameter vector
We define the full plant and actuator parameter vector as
For our specific parameterization of the Iver3 AUV used for experimental validation, the dimensionality of θ is
4.4. Regressor formulation of plant and actuator dynamics
The equations of motion (7) can be written in regressor form by utilizing (12), (15), (20), (22), and (23), which yields
Rearranging (31) to combine the plant and control-actuator dynamics yields
4.5. Regressor nullspace properties
4.5.1. Non-uniqueness of the parameter vector
To the best of our knowledge, previous studies of UV model identification, with the exception of Harris et al. (2018) and Paine (2018), assume that the actuator parameters are known. When the actuator parameters are known and the plant model is minimally parameterized, the true plant parameters represent a single, unique point in the linear vector space of possible plant parameters.
In contrast, the AID addressed herein identifies plant and actuator parameters simultaneously. From (34), we note that
A consequence of this nullspace structure is that many different parameter vectors satisfy the same equations of motion, even when the model is minimally parameterized. As an example, any nonzero scalar multiple of θ equivalently satisfies (34) and thus also satisfies (7).
Given a “true” parameter vector
In Section 4.5.2, we explore the identical input-output plant dynamics that arise from different parameter vectors in P(θ). These properties will be utilized in Section 5, where we report a novel adaptive identifier for θ and show that under a persistence of excitation (PE) condition, the estimate of θ converges to the true parameter set P(θ).
4.5.2. Acceleration input-output dynamics
We now prove our earlier claim that the input-output dynamics of (7), namely the resulting plant acceleration given control inputs and parameters, are the same for any parameter vector belonging to the true parameter set P(θ).
From the plant acceleration (9), after making the appropriate regressor substitutions (15), (20), and (22), as well as writing the true mass M as a function of the parameter vector θ ∈ P(θ) from which it arises, we obtain
Observing the structure of (38), we define
Now we choose another vector θ* ∈ P(θ). By definition of P(θ) (37), we have the nullspace equation
It is important to note that a nonzero parameter vector
The term
We can obtain an analogous nullspace relationship to (34) for
5. Adaptive identifier
This section reports the theoretical derivation and stability analysis of a new adaptive identification (AID) algorithm for plants of the form (7). The AID reported herein significantly expands upon and extends a previous approach for adaptive parameter estimation of UV plant models by McFarland and Whitcomb (McFarland and Whitcomb, 2013, 2021). The specific differences are: (1) The AID reported herein performs simultaneous identification of plant and actuator model parameters. In contrast, the AID reported in McFarland and Whitcomb (McFarland and Whitcomb, 2013, 2021) identifies only plant parameters. (2) This study provides a mathematical proof of stability and of parameter-estimate convergence. In contrast, McFarland and Whitcomb (McFarland and Whitcomb, 2013, 2021) provide only a stability proof.
• v(t), η(t), ξ(t) are bounded. • • As a result, • The control inputs ξ are uniformly continuous in time and bounded. • The true parameters θ* are bounded and constant. • • In the remainder of this section, we present a novel AID algorithm and show that it achieves (58, 59) with all signals bounded. Section 5.1 presents the update laws. Section 5.2 reports a Lyapunov analysis to prove uniform stability with respect to the origin of the error system, which we denote as
5.1. Update laws and error dynamics
We choose the update laws
Similarly to our usage of the term “plant acceleration” to denote
The rate of change of the velocity error is
Because M is parameterized by θ*, we can rearrange and use the definitions (12, 35, 47) to find
Using the fact that
Substituting (76) into (73) yields
The parameter error dynamics follow directly from the error coordinates (57), the parameter update law (64), and the assumption that the true parameters are constant, that is,
By defining the full error state vector for the AID as
5.2 Convergence of the velocity error
The system (80) Note that This expression is negative definite in Δv and negative semidefinite in Δθ, satisfying the requirements on a Lyapunov function to show that the system (81) is uniformly stable about the set S* and that Δv and Δθ are bounded. We note that with Δθ bounded and θ* constant, it is implied that To show that • Δθ(t)
T
Γ−1θ(t) ≥ 0 from PDS of Γ • 0 ≤ V(t) ≤ V(t0) from (91) • Δv(t0) = 0 by assumption These facts can be used to show This implies Using the Rayleigh-Ritz theorem, we will show that the smallest eigenvalue of The second term in (101) is equal to the maximum magnitude of the eigenvalues of M(Δθ). Since M(Δθ) is symmetric, this is equivalent to its maximum singular value or ‖M(Δθ)‖2. Furthermore, ‖M(Δθ)‖2 ≤ ‖M(Δθ)‖
F
, and for diagonal mass matrices, ‖M(Δθ)‖
F
= ‖Δθ
m
‖2 by definition. Thus, we have Substituting (98), we have All signals in the right-hand side of (77) are thus shown to be bounded. This implies the time-derivative of the velocity error, From (79), (87), and (91) it can easily be shown that
5.3. Convergence of the parameter estimate
Section 5.2 showed that limt→∞Δv(t) = 0 and
The remainder of this Section is organized as follows: • Section 5.3.1 introduces a change of coordinates to characterize • Section 5.3.2, then defines an equivalent error system using these coordinates. • Finally, Section 5.3.3 shows a sufficient condition for
We note that the problem of parameter convergence with respect to a unique set of true parameters is well understood for a class of linear time-varying systems, as originally reported in Morgan and Narendra (1977). To the best of our knowledge, however, we are unaware of any previously reported convergence proof applicable to the nonlinear system (80) with nullspace parameter structure.
5.3.1. Coordinate transformation
Given the set P(θ) (37), we can define the linear vector subspace P
s
= P(θ) ∪ 0, with dim(P
s
) = r, and we can define an orthonormal basis {p1, …, p
r
} for P
s
, as well as the matrix
Similarly, we can define the linear vector subspace P⊥ as the orthogonal complement of P
s
, with orthonormal basis {q1, …, q(p+a)−r}, as well as the matrix
We note that
We combine
Using Q, we can perform a change of coordinates on the parameter estimate
5.3.2. Equivalent error dynamics
We can obtain an expression equivalent to (73) for
Using the coordinates
We also have the derivative of
From (64, 126), this results in the full transformed system
5.3.3. Convergence of
Before showing a sufficient condition for
Given the system described in Theorem 1 and the transformed system (129), then By Barbalat’s Lemma (Appendix B.1), if limt→∞Δv(t) is finite and Examining the structure of W⊥ (125), we observe that the nullspace of
5.3.4. Convergence of
to the true parameter set P(θ)
For the system described by Theorem 1 and W⊥(t) defined in (125), if there exist ϵ0, δ0 > 0 such that for any t ≥ t0 and any unit vector In other words, if W⊥(t) is persistently exciting (PE) as defined in (134), then
• Let all of the assumptions in Section 5 be satisfied such that Theorem 1 is true, including that limt→∞Δv(t) = 0. • Let the PE condition in (134) be satisfied. • Lemma 1 showed that • We also define the bounds b, b⊥ > 0 such that These bounds exist since where Let us suppose that We will show that “when Δv is small, To verify this, given a T′ and ϕ, we set and observe that, ∀t ∈ [t
i
, t
i
+ T′], using the expression for Since limt→∞Δv(t) = 0, for any ϵ′ > 0 we can always find a t′ ≥ t0 such that ∥Δv(t)∥ ≤ ϵ′ ∀t ≥ t′. Thus using ϵ0 from the PE condition (134), ϵ from the assumption made in Part 2, and the bound b⊥ (135), we can choose and find a t′ ≥ t0 such that the flatness property (138) holds for each interval [t
i
, t
i
+ δ0] where t
i
≥ t′. We will now show that for each t
i
≥ t′, there is a t
j
∈ [t
i
, t
i
+ δ0] where We rearrange (152) and choose the unit vector in the PE condition (134) to be so that We now invoke the lower bound in the PE condition (134), substitute our choice of ϕ (147), and recall the assumption made in Part 2 that Furthermore, we have By the mean value theorem for integrals, there must be a t
j
∈ [t
i
, t
i
+ δ0] where Thus, we have found an unbounded sequence of times Together with the conclusion in the proof of Theorem one that limt→∞Δv(t) = 0, we have shown a sufficient condition for convergence of the full state z(t) to S, thereby achieving the goals (58,59) of this AID.
6. Simulation results
We report simulation results in which the “true” parameters of the simulated plant were chosen to match approximately those of the JHU Iver3 AUV. We include this simulation study to demonstrate that the AID achieves parameter estimate convergence to the true parameter set. Since the true parameter set of the Iver3 is unknown, we are unable to verify convergence of the parameter estimates to the “true” parameter set in the experimental results.
Simulation Study Control Inputs.
Simulation Study AID Adaptation Gains.
Measurement Sources, Accuracies, and Usage in the Reported Results.
Figure 1, shows that the estimated parameters converge to “true” parameters in simulation, provided that the PE condition (134) defined in Section 5.3 is satisfied. We note that in order to compute a parameter error between the estimated parameters and the non-unique set of “true” parameters, we employed a scaling factor with respect to a single known parameter. Simulation Study: Difference between true and estimated parameters (Y-Axis) versus time in seconds (X-axis). This Figure shows that the AID-estimated parameters converge to the true parameter set in simulation when a PE condition is satisfied.
7. Experimental results
This section reports the results of three at-sea field trials to evaluate experimentally the adaptive identification (AID) parameter estimator reported above. We utilized experimental data obtained with JHU’s Iver3 AUV (L3 OceanServer, Fall River, MA, USA), shown in Figure 2. We deployed the JHU Iver3 AUV in the Chesapeake Bay, MD, on November 11, 2019, and we report data from three dives (Dive 86, 87, and 88), which are described in Section 7.1. The Iver3 AUV is an underactuated AUV whose control authority is provided by the commanded rotational speed of its ducted propeller and commanded angles for the four red/yellow tail fins, all located at the stern of the vehicle. The 100 m depth-rated Iver3 AUV is one of several commercially available small AUVs designed for oceanographic survey operations including biological, physical-oceanographic, and bathymetric survey missions.
For these experiments, the Iver3 AUV was operating under “waypoint following” control, wherein the user entered a set of pre-programmed waypoints and the Iver3 proprietary controller determined the specific control inputs required to follow the waypoints.
To estimate the parameters for the Iver3 AUV using the AID, we used signals from a MicroStrain 3DM-GX5-25 (Parker LORD MicroStrain, Willston, VT, USA) which is a microelectromechanical systems (MEMS) AHRS, for the attitude and angular rate, and a Teledyne RDI Explorer (Teledyne RDInstruments, Poway, CA, USA) 600 kHz phased-array Doppler velocity log (DVL) for the translational velocity. The RDI DVL is part of the standard sensor suite on the Iver3 AUV installed by the manufacturer; the MicroStrain AHRS was integrated into the nose cone of the Iver3 AUV by the authors. An auxiliary inertial measurement unit (IMU) sensor, a KVH 1775 (KVH Industries, Middletown, RI, USA) fiber-optic gyroscope (FOG), which measures angular rate, was installed inside the Iver3 for these experimental trials. The KVH FOG, which is more accurate than the MicroStrain MEMS gyro, was used as an external source of “ground truth” for the angular rate DOFs during validation. The sensors used for AID and their noise characteristics are listed in Table 3.
7.1. Dive mission plans
Three experimental trials were conducted: • Dive 86 was a waypoint-following undulate mission with relatively high variation in motion. • Dive 87 was a waypoint-following mission with a lawnmower pattern at constant depth • Dive 88 was the same mission as Dive 86.
An overview of the three dives is given in Figure 3. While it may have been ideal to utilize open-loop control signals for these experiments, as we did in simulation, it would have been infeasible in practice when operating in open water with an untethered UV. Instead, we utilized the manufacturer-provided mission planning tool to enter pre-programmed waypoints for the proprietary controller to follow. Overview of the three dives conducted in the Chesapeake Bay using the Iver3. The translational position during each dive as estimated by the front seat of the Iver3 is shown above in blue. The depth profile of each dive is shown below in red.
The waypoint-following undulate missions (Dives 86 and 88) were designed to excite the UV dynamics. The UV attempted to follow a series of zig-zagging waypoints while constantly undulating between 1 m and 5 m depths at a fixed 20° pitch angle. The constant-depth lawnmower mission (Dive 87), where the UV followed a series of waypoints in a rectangular grid, was designed to emulate typical seafloor mapping with a multi-beam sonar.
While Dives 86 and 88 employed the same mission, the control inputs determined by the manufacturer’s controller were not identical. External factors such as the UV’s initial position and the ambient water current resulted in different control inputs when following the same mission waypoints.
7.2. Experimental evaluation methodology
Our approach to evaluating the performance of the new AID was a “training and validation” procedure, in which we identified the parameters using the AID algorithm on experimental data from one dive, termed the “training” data, and then used those parameters in forward simulations of different dives, termed the “validation” data. We compared the resulting 6-DOF simulated model velocities to the actual experimentally-observed 6-DOF vehicle velocities. Since the control algorithms of the Iver3 feedback control system are not disclosed by the manufacturer, we used the experimentally-recorded control signals as inputs to the simulation of each dive. We also performed a “self-validation” procedure by computing a forward simulation of the training dive using its own estimated parameters.
The Iver3 AUV control inputs (fin angles and propeller angular velocity) and relevant components of the vehicle state (attitude, translational velocity, and angular velocity) were logged and time-stamped in real-time to the Iver3 AUV’s on-board CPU during the experimental trials. These data were used in post-processing to evaluate the performance of the proposed AID approach. No filtering of any sensor signal was used, the adaptation gains were held constant across all simulations, and the nonlinear differential equations which make up the AID were evaluated using simple Euler integration.
This AID method is well suited for online parameter estimation—assuming that the programming implementation is capable of handling asynchronous sensor readings, the sensor alignments are calibrated, and suitable adaptation gains are chosen. The appropriate adaptation gains can be found using simulation studies or data from previous experiments.
7.2.1. Adaptation gains
During the training procedure, we empirically selected adaptation gains. The adaptation gains were then held constant throughout the self-validation and validation procedure for all dives reported below. We note that the AID reported here is not particularly sensitive to the choice of adaptation gains; that is, small changes in the magnitude of a gain did not result in large changes in performance of the AID.
Experimental Study AID Adaptation Gains.
7.2.2. AID initialization
Parameter Estimate Initial Values.
7.3. AID performance
We report the performance of the AID as the mean absolute error (MAE), where the error is the difference between the simulated plant velocity resulting from the AID parameters and the velocity measured via the DVL and KVH gyro.
Mean Absolute Error of Simulated 6-DOF UV Plant Using Identified Parameters.

Validation results of Dive 88 using parameters identified during Dive 86. Gray traces are (unfiltered) translational and rotational velocities as measured by the DVL and KVH gyroscope. Blue traces are the plant velocities in forward simulation using the parameters as they were initialized in the AID. Red traces are plant velocities in forward simulation using the parameters identified via the new AID method.
Estimated Parameters for a Plant and Actuator Model (34) of the Iver3 3026, With Components Described in Section 4.3.
7.4. Discussion
The results presented here in Table 6 and Figure 4 show that the ability of a forward simulation of the plant model to match experimentally observed plant performance depends highly on the model parameter values, a well-known fact that has been reported previously, for example in Harris et al. (2018); McFarland and Whitcomb (2021).
In Section 6, we showed that the adaptively identified parameters converge to the true parameters from an arbitrary initialization in a simulation study (in which the simulated “true” parameters are known) when the PE condition (134) is satisfied. However, we cannot compare the experimentally identified parameters to the true model parameters for two reasons: (1) We do not know the true Iver3 AUV model parameters a priori. (2) As observed in Section 4.5, the “true” parameters are not unique, but may be any element of the set P(θ) (37).
Thus, the experimental validity of this AID algorithm can be evaluated only by the ability of the estimated parameters to predict the UV’s velocity in forward simulation. As proven in Section 4.5.2, any parameter vector in the set P(θ) (37) will result in identical vehicle model input-output dynamics.
The results reported in Figure 4 show that the experimentally identified model (Table 7) performs far better than the model using the arbitrarily chosen initial parameter set (Table 5). Further, these results show that the identified parameter set performs approximately as well on the validation data as it does on the training data. We note that the angular-velocity data obtained by the more accurate KVH sensor were not used in the training of the AID and were used only to provide as close to a source of “ground truth” as is available for angular velocity.
We observed that the performance of the identified model is worst in sway and roll DOF. We believe this phenomenon is caused by the lack of precisely controlled excitation in these DOF.
8. Conclusion
This paper reports the theory and experimental evaluation of a novel adaptive identification (AID) algorithm for the simultaneous estimation of plant-model parameters and actuator parameters for underactuated vehicles in six degrees-of-freedom (DOF).
The simulation and experimental results reported herein indicate that it is feasible to use the AID algorithm for parameter estimation of underactuated underwater vehicles (UVs) in 6-DOF. Underactuated UVs are by far the most common class of UVs, but parameter estimation is more difficult for underactuated UVs than for fully actuated UVs because the reduced actuation necessarily reduces the plant excitation that can be induced from the control inputs. Additionally, underactuated UVs are often controlled with hydrodynamic control surfaces which are difficult to characterize.
The theoretical results reported herein provide the first reported AID algorithm with proof of convergence of the parameter estimates to the true set of parameters for this class of vehicles when a persistence of excitation condition is satisfied. Most previously reported AID approaches for this class of systems only show stability and boundedness of the parameter estimates.
The simulation and experimental results show that parameters estimated using this AID can converge with a “single training set” containing on the order of 1000 s of free-motion AUV dive data. The performance of the resulting model was verified in both self-validation and cross-validation. Additionally, this AID algorithm does not require access to body linear and angular acceleration signals, which can offer an advantage over other parameter-estimation methods.
Footnotes
Acknowledgments
We are grateful for discussions with and input from Dr Joseph Moore, of the Johns Hopkins University Applied Physics Laboratory (JHU APL) and the Department of Mechanical Engineering in the JHU Whiting School of Engineering. We also gratefully acknowledge the support of Mr Rick and Ms Valerie Smith, owners of Smiths Marina, Crownsville, MD, for their gracious support of the JHU Iver3 AUV field trials reported herein.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Science Foundation under Awards 1319667 and 1909182, the National Defense Science and Engineering Graduate Fellowship (Harris), the In-House Laboratory Independent Research (ILIR) Program of the U.S. Navy Office of Naval Research (Paine), the DOD Science, Mathematics, And Research For Transformation (SMART) Defense Scholarship Program (Paine), and a Johns Hopkins University (JHU) Applied Physics Laboratory (APL) Graduate Fellowship (Mao).
UV control inputs
This section describes a model of actuator forces and moments represented by the term τ(v, φ, ξ) defined in (23), for a class of torpedo-shaped UVs which includes the JHU Iver3. The JHU Iver3 is actuated with a combination of control surfaces and a propeller, as is common in many aerial and underwater vehicles.
We note that the AID described and utilized herein requires the value of the control input signals, ξ, but it is completely agnostic to the specific control law utilized by the UV.
We define the following coordinate frames for each fin: • V – Vehicle coordinate frame, centered at the UV’s center of pressure (CP) with positive x forward, positive y starboard, and positive z down. • F – Fin coordinate frame, centered at the fin CP when the fin is at commanded fin angle δ, with the x-axis along the chord line of the fin and the y-axis pointing away from the center line of the vehicle. • F0 – Fin coordinate frame at δ = 0. • W – Flow coordinate frame, corresponding to the flow of water across the fin, where α is the angle of attack of the incident flow with respect to the chord of the fin.
We note that the commanded fin angle, δ, is not the fin angle of attack to incident flow, α, so that the F and W frames are generally not coincident. The position of the ith fin in the vehicle frame is specified by
We define the transformations between coordinate frames of each fin using rotation matrices R ∈ SO(3) as follows:
The velocity of the ith fin through the water at the fin CP in vehicle coordinates is
Assuming flow along the span of the fin foil does not affect the lift or drag, we use a projection matrix to find the flow along the x- and z-axes
We then compute the hydrodynamic lift and drag force acting on each fin as
The drag coefficient of each fin is modeled as a three-parameter quadratic polynomial:
These polynomial lift and drag coefficient models approximate the lift and drag coefficients from a symmetric NACA foil shape.
The resultant force vector in the flow frame W is
The total force and moment vector on a typical underactuated torpedo-like UV with a total of N control surfaces and one propeller is thus
This simplified thruster model is a reasonable assumption for class of UVs because the propeller is ducted with a high jet velocity compared to the advance velocity of the vehicle Newman (1977).
We have described a model of actuator forces and moments which act on the UV, and this model is expressed as τ(v, φ, ξ) with control inputs
