Multiautonomous underwater vehicle consistent collaborative hunting method based on generative adversarial network

Abstract

Time-varying ocean currents and underwater acoustic communication delays make target tracking by a single autonomous underwater vehicle (AUV) uncertain and multi-AUV coordination inconsistent, which makes it difficult for multiple AUVs to form a hunting alliance. To solve these problems, this article proposes a multi-AUV consistent collaborative hunting method based on a generative adversarial network (GAN). First, a three-dimensional (3D) kinematic model of the AUV is established for the underwater 3D environment. Second, using the Laplacian matrix, the topology of the hunting alliance in the ideal environment is established, and the control rate of each AUV is calculated. Finally, in the GAN model, the control relationship after environmental interference is used as the input of the generative model, and the control rate in the ideal environment is used as the comparison object of the discriminative model. Iterative training of the GAN generates a control rate that adapts to the current interference environment; combined with the multi-AUV topological hunting model, this achieves successful hunting of noncooperative targets. The experimental results show that the algorithm reduces the average hunting time to 62.53 s and increases the hunting success rate to 84.69%, which is 1.17% higher than that of the particle swarm optimization-constant modulus algorithm (PSO-CMA).

Keywords

Multi-AUV system, generative adversarial network, cooperative hunting, time-varying ocean currents, communication delay

Introduction

An autonomous underwater vehicle (AUV) is an intelligent robot that can complete underwater work without an operator.¹ At present, AUVs have been studied by many scientists and applied to a variety of tasks. However, as tasks become increasingly complex, a single AUV can no longer complete them independently, and multiple AUVs need to work together.

In the field of multi-AUV hunting, scholars have proposed many new collaborative algorithms and hunting algorithms. Some of these algorithms use different optimization methods to reduce the amount of calculation in the hunting process, thereby reducing the hunting time. Others improve the hunting success rate by improving the control precision of the hunting AUVs. In the actual underwater environment, however, time-varying ocean currents and communication delays have a great impact on the hunting system. Achieving successful hunting of noncooperative targets in such a complex environment remains a major test for these algorithms.

At present, the generative adversarial network (GAN) has become a research hotspot and is mainly used in image fields, such as target recognition, image generation, and data enhancement. The combination of the generative model and the discriminative model can generate ideal target information from noise. In this article, GAN is applied to the multi-AUV hunting field. The influences of time-varying ocean currents and communication delays on the hunting system are treated as random noise. Using the combination of the generative model and the discriminative model, the algorithm can generate a control rate for the hunting AUVs that is adapted to the complex interference environment. This reduces the effects of time-varying ocean currents and communication delays on multi-AUV hunting and improves the success rate of the algorithm.

In summary, scholars have made effective improvements in multi-AUV hunting. However, in real-world underwater environments, time-varying ocean currents and hydroacoustic communication delays cause significant interference with the multi-AUV hunting system. In this article, the GAN model is introduced into the multi-AUV hunting field and used to train a control rate that adapts to the underwater interference environment and improves the hunting success rate of the multi-AUV hunting system, as shown in Figure 1.

Figure 1.

The multi-AUV consistent collaborative hunting method based on GAN. Time-varying ocean currents and communication delays affect the ideal topological hunting model, leading to changes in the hunting model. The GAN model is iteratively trained to produce a control rate adapted to the current interference, so that the multi-AUV hunting model can achieve successful hunting in the interference environment.

Related work

In terms of multi-AUV control, many scholars have conducted extensive research in two-dimensional^2,3 and three-dimensional (3D)^4–6 environments. Aiming at the path tracking control problem of AUV, a new Lyapunov-based model predictive control method is proposed by Shen et al.⁷ to improve the performance of noncooperative target tracking control and the robustness of tracking control. Some researchers^8,9 proposed a self-optimizing control method based on tracking differentiator and active disturbance rejection control (ADRC) theory to achieve target location and tracking. In terms of depth control of AUV, the literature¹⁰ uses an internal shift mechanism to change the center of gravity of the AUV and proposes an adaptive learning control method based on distribution and deterministic learning to control the depth of the AUV accurately and effectively.

In the aspect of multi-AUV hunting, Cai et al.^11,12 used multi-AUV collaborative methods to identify and hunt noncooperative targets, which improved the robustness of the hunting process. Ni et al.¹³ proposed a new method based on the spinal nerve system for the unknown 3D underwater environment, which can keep multiple AUVs stable in formation without obstacle collision. For cooperative hunting with a multi-AUV system, not only basic problems, such as path planning and collision avoidance, but also dynamic task assignment should be considered. Cao et al.¹⁴ proposed an integrated algorithm combining a self-organizing map neural network with the Glasius bio-inspired neural network method to improve the efficiency of multi-AUV collaborative hunting.

Multi-AUV systems are subject to communication delays in underwater environments. Some researchers^15–17 proposed a time compensation method or a method of enhancing the time delay controller for the communication delay problem, which reduces the influence of underwater acoustic communication delay on the hunting alliance. The literature^18,19 uses an optimized communication topology to coordinate the multi-AUV hunting alliance, which can carry out hunting tasks in the communication delay environment and improves the reliability of the algorithm. However, time-varying ocean currents also affect the hunting process. They make the control of multiple AUVs much more difficult and need to be considered during hunting. Some researchers^20,21 used the topology information of sensor networks to reduce the impact of errors and make the system more stable.

At present, the GAN algorithm is mainly applied to image recognition, such as video recognition^22,23 and image translation.^24,25 Some scholars have applied the GAN model to other fields. Ren and Xu²⁶ proposed a fully data-driven approach for phasor measurement unit (PMU)-based prefault dynamic security assessment with incomplete data measurements, which can reduce the impact of data loss on fault assessment. For the production planning and control problems of aircraft remanufacturing systems, Zheng et al.²⁷ proposed an adaptive replanning strategy with triggers and replanning procedures to solve the problem of different types of data imbalance in the reengineering system. Tang et al.²⁸ proposed a feature combination method based on prior knowledge, which is used to maximize a series of risk returns in the securities market. Zhang et al.²⁹ applied the GAN method to image recognition in transportation, where the proposed method shows better versatility and flexibility.

The rest of this article is organized as follows. The second section provides an overview of the relevant literature. The third section describes the 3D kinematics equation of AUV, the multi-AUV topology hunting model based on noncooperative target escape point, and the multi-AUV consistency cooperative hunting method based on GAN. The fourth section analyzes the simulation experiment. Finally, the fifth section summarizes the article.

Proposed approach

Autonomous underwater vehicle kinematics equation in three-dimensional space

To effectively control the hunting alliance to hunt noncooperative targets faster and more accurately, the AUV kinematics equation in 3D space is established. A fixed point on the sea level is the origin O of the inertial coordinate system. The $O X$ axis and the $O Y$ axis are perpendicular to each other in the horizontal plane, and the $O Z$ axis is perpendicular to the $X O Y$ plane and points toward the center of the Earth. The center of gravity E of the AUV is taken as the origin of the carrier coordinate system, where the $E X_{0}$ axis is the AUV forward direction, $E Y_{0}$ is the traverse direction, and $E Z_{0}$ is the diving (heave) direction, as shown in Figure 2.

Figure 2.

Carrier coordinate system and inertial coordinate system of AUV. AUV: autonomous underwater vehicle.

In Figure 2, ϕ, θ, and ψ are the roll (heel), pitch (trim), and yaw (heading) angles of the AUV in the inertial coordinate system, respectively (the counterclockwise direction is positive). u, v, and w are the three linear velocity components of the AUV in the carrier coordinate system, and p, q, and r are the three angular velocity components of the AUV in the carrier coordinate system. In the inertial coordinate system, the state of the AUV is represented by the six-degree-of-freedom vector $η = {[\begin{matrix} x & y & z & ϕ & ψ & θ \end{matrix}]}^{T}$ , where $x, y, z$ are the positions in the inertial coordinate system, and the motion state is $V = {[\begin{matrix} u & v & w & p & r & q \end{matrix}]}^{T}$ . Under normal conditions, an AUV cannot perform side shifting or roll. Letting $ϕ = 0$ and $p = v = 0$ , the attitude in the inertial coordinate system becomes $η = {[\begin{matrix} x & y & z & ψ & θ \end{matrix}]}^{T}$ , and the motion state becomes $V = {[\begin{matrix} u & v & w & r & q \end{matrix}]}^{T}$ . The 3D kinematics model of the AUV is

\dot{η} = \begin{bmatrix} \dot{x} \\ \dot{y} \\ \dot{z} \\ \dot{ψ} \\ \dot{θ} \end{bmatrix} = J (η) V = \begin{bmatrix} \cos ψ \cos θ & - \sin ψ & \cos ψ \sin θ & 0 & 0 \\ \sin ψ \cos θ & \cos ψ & \sin ψ \sin θ & 0 & 0 \\ - \sin θ & 0 & \cos θ & 0 & 0 \\ 0 & 0 & 0 & 1 / \cos θ & 0 \\ 0 & 0 & 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} u \\ v \\ w \\ r \\ q \end{bmatrix}
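The kinematics model can be checked numerically with a short sketch. The code below builds the 5×5 matrix $J (η)$ and advances the state by one step; the forward Euler integration and the function names `jacobian` and `euler_step` are assumptions made for illustration, and the numeric values (depth 10 m, u = 1.5 m/s, step 0.1 s) are borrowed from the simulation settings of this article.

```python
import numpy as np

def jacobian(psi: float, theta: float) -> np.ndarray:
    """5x5 matrix J(eta) mapping V = [u, v, w, r, q] to the rate of eta = [x, y, z, psi, theta]."""
    c_psi, s_psi = np.cos(psi), np.sin(psi)
    c_th, s_th = np.cos(theta), np.sin(theta)
    return np.array([
        [c_psi * c_th, -s_psi, c_psi * s_th, 0.0, 0.0],
        [s_psi * c_th, c_psi, s_psi * s_th, 0.0, 0.0],
        [-s_th, 0.0, c_th, 0.0, 0.0],
        [0.0, 0.0, 0.0, 1.0 / c_th, 0.0],  # singular at theta = +/- pi/2
        [0.0, 0.0, 0.0, 0.0, 1.0],
    ])

def euler_step(eta, V, dt: float) -> np.ndarray:
    """One forward Euler step of eta_dot = J(eta) V (the integrator is an assumption)."""
    eta = np.asarray(eta, dtype=float)
    V = np.asarray(V, dtype=float)
    psi, theta = eta[3], eta[4]
    return eta + dt * jacobian(psi, theta) @ V

# Level flight along the X axis at 10 m depth with u = 1.5 m/s.
eta0 = np.array([0.0, 0.0, 10.0, 0.0, 0.0])
V = np.array([1.5, 0.0, 0.0, 0.0, 0.0])
eta1 = euler_step(eta0, V, dt=0.1)
```

With zero heading and trim, $J (η)$ reduces to the identity, so the AUV advances 0.15 m along the $O X$ axis in one step while its depth stays at 10 m.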

Topology construction of the hunting alliance

Multiple AUVs hunt noncooperative targets in a 3D environment, as shown in Figure 3. Here, ${AUV}_{1}, {AUV}_{2}, \dots, {AUV}_{n}$ are the hunting points of the AUV hunting alliance, T is a noncooperative target, and the hunting points are evenly distributed around the target.

Figure 3.

Schematic diagram of multi-AUV hunting. AUV: autonomous underwater vehicle.

It is assumed that the noncooperative target has the same dynamic model as the hunting alliance, and the communication topology of the alliance is fully connected, as shown in Figure 4. In a hunting system consisting of n AUVs, the communication topology-dependent Laplacian matrix L can be expressed as

L = \begin{pmatrix} 4 & -1 & -1 & -1 & -1 & \dots & 0 \\ -1 & 4 & -1 & 0 & -1 & \dots & -1 \\ -1 & -1 & 4 & -1 & 0 & \dots & -1 \\ -1 & 0 & -1 & 4 & -1 & \dots & -1 \\ -1 & -1 & 0 & -1 & 4 & \dots & -1 \\ \vdots & \vdots & \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & -1 & -1 & -1 & -1 & \dots & 4 \end{pmatrix}
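As a minimal sketch, a Laplacian matrix of this form can be built from an adjacency matrix as the degree matrix minus the adjacency matrix. The six-AUV adjacency below is an assumption: it reads the matrix above for n = 6 with the ellipsis columns collapsed, so that each AUV has four neighbors. The helper names are hypothetical.

```python
import numpy as np

def laplacian_from_adjacency(A) -> np.ndarray:
    """Graph Laplacian L = D - A, where D holds the row sums of A on its diagonal."""
    A = np.asarray(A, dtype=float)
    return np.diag(A.sum(axis=1)) - A

def six_auv_adjacency() -> np.ndarray:
    """Assumed n = 6 reading of the matrix in the text: all links except three pairs."""
    n = 6
    A = np.ones((n, n)) - np.eye(n)
    # Pairs without a direct link (0-based indices), i.e. AUV 1-6, 2-4, 3-5.
    for i, j in [(0, 5), (1, 3), (2, 4)]:
        A[i, j] = A[j, i] = 0.0
    return A

L = laplacian_from_adjacency(six_auv_adjacency())
```

Every row of the resulting matrix sums to zero and every diagonal entry equals 4, which matches the structure of the matrix given in the text.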

The ${AUV}_{i}$ kinetic equation can be expressed as a nonlinear system as follows

\begin{array}{l} {\dot{η}}_{i} (t) = f (x_{i} (t)) + g (x_{i} (t)) u_{i} (t) \\ ν_{i} (t) = k (x_{i} (t)) \end{array}

where $x_{i} \in ℝ^{n}$ represents the system state, $u_{i}$ represents the system control rate, and $ν_{i}$ is the system output. $f (\cdot)$ , $g (\cdot)$ , and $k (\cdot)$ are system functions with corresponding dimensions. Combined with the above communication topology relationship, the control rate $u_{i} (t)$ of ${AUV}_{i}$ can be expressed as

u_{i} (t) = δ_{i} (X (t), L (t))

where $X (t) = {[\begin{matrix} x_{1}^{T} (t) & x_{2}^{T} (t) & \dots & x_{n}^{T} (t) \end{matrix}]}^{T}$ represents the set of states of all individuals in the multi-AUV system at time t, $L (t)$ is the Laplacian matrix at time t, and $δ_{i} (\cdot)$ denotes the controller. The specific process is shown in Figure 5.

Figure 4.

Multi-AUV communication topology. AUV: autonomous underwater vehicle.

Figure 5.

Hunting alliance topology optimization process.

Multiautonomous underwater vehicle consistent collaborative hunting control based on generative adversarial network

When the multi-AUV hunting alliance hunts a moving noncooperative target, the ideal hunting points change continuously as the target escapes. By detecting the noncooperative target, a hunting model is established to determine the ideal hunting position of each hunting AUV. As shown in Figure 6, when the noncooperative target escapes from T to $T^{'}$ , the ideal hunting points of the hunting AUVs change from $\{A 1, A 2, \dots, A 6\}$ to $\{A 1^{'}, A 2^{'}, \dots, A 6^{'}\}$ . The noncooperative target has the escaping attribute. When the escape direction points to the center of the gap between hunting AUVs, the escape probability is the largest, and this point is the escape point $x^{d}$ . The distance from the noncooperative target to the point $x^{d}$ is D, and the distance from ${AUV}_{i}$ to the point $x^{d}$ is d. Only when $D > d$ is maintained is the noncooperative target unable to escape the hunting alliance, as shown in Figure 7.

Figure 6.

Schematic diagram of multi-AUV hunting. AUV: autonomous underwater vehicle.

Figure 7.

Schematic diagram of noncooperative target escape points.

Let $h (t)$ be the state of the noncooperative target at time t. The multi-AUV hunting alliance topology optimization model is

\left\{ \begin{array}{l} \lim_{t \to \infty} \| h (t) - x^{d} (t) \| \simeq 0 \\ \lim_{t \to \infty} \| x_{i} (t) - x_{j} (t) \| \simeq 0 \\ \lim_{t \to \infty} \| x^{d} (t) - x_{i} (t) \| \leq \lim_{t \to \infty} \| x^{d} (t) - h (t) \| \end{array} \right.

where $x_{i} (t)$ represents the actual state of ${AUV}_{i}$ at time t and $x_{j} (t)$ represents the actual state of ${AUV}_{j}$ at time t.
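The third condition of the model compares the distance from a hunting AUV to the escape point with the distance from the target to the same point. The sketch below evaluates that comparison at a single time instant. Taking the escape point as the midpoint between two adjacent hunting AUVs is an assumption made for illustration, and the function names are hypothetical.

```python
import numpy as np

def gap_center(p_a, p_b) -> np.ndarray:
    """Assumed escape point: midpoint of the gap between two adjacent hunting AUVs."""
    return 0.5 * (np.asarray(p_a, dtype=float) + np.asarray(p_b, dtype=float))

def escape_point_is_covered(x_d, x_i, h) -> bool:
    """True when ||x_d - x_i|| <= ||x_d - h||, i.e. the AUV is no farther from x_d than the target."""
    x_d, x_i, h = (np.asarray(v, dtype=float) for v in (x_d, x_i, h))
    d = np.linalg.norm(x_d - x_i)   # distance from AUV_i to the escape point
    D = np.linalg.norm(x_d - h)     # distance from the target to the escape point
    return bool(d <= D)

auv_1 = [10.0, 0.0, 10.0]
auv_2 = [0.0, 10.0, 10.0]
x_d = gap_center(auv_1, auv_2)                                        # [5, 5, 10]
far_target = escape_point_is_covered(x_d, auv_1, [-2.0, -2.0, 10.0])  # target is far away
near_target = escape_point_is_covered(x_d, auv_1, [4.0, 4.0, 10.0])   # target is close to x_d
```

In the first call the target is farther from the escape point than the AUV, so the condition holds; in the second call the target is closer, so it fails.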

However, in the actual underwater environment, multi-AUV systems are subject to time-varying ocean currents and the communication delay between AUVs is unpredictable, which have a great impact on the cooperation of multi-AUV. This article trains and generates a cooperative hunting control strategy based on GAN model.

According to the hunting alliance topology, a multi-AUV collaborative hunting strategy in an ideal environment can be generated. In the actual environment, however, interference leads to unpredictable deviation of the hunting AUVs from this strategy. Training the GAN model on the above data can generate a more suitable control rate $U_{i} (t)$ , reduce the impact of the environment on the multi-AUV collaborative hunting strategy, and improve the robustness of the algorithm.

The generative adversarial network is mainly composed of the generator G and the discriminator D. The purpose of the discriminator D is to maximize the distinction between real data and counterfeit data generated by the generator. The formula can be expressed as

\arg \max_{D} E_{x \sim P_{data}} [\log D (x)] + E_{x \sim P_{G}} [\log (1 - D (x))]

In equation (6), x is the actual state of ${AUV}_{i}$ after the influence of time-varying ocean currents and communication delays. $P_{data}$ represents real data, which is represented in this article as the multi-AUV control rate in an ideal environment. $P_{G}$ represents the data generated by the generator. When the generator G is fixed, the generated data can be approximated to the true data to the greatest extent. The goal of the generator is to be able to generate a multi-AUV collaboration strategy that is closer to the ideal environment, so that the discriminator cannot identify the generated data as false. The specific formula can be expressed as follows

\min_{G} \max_{D} V (G, D) = E_{x \sim P_{data}} [\log D (x)] + E_{x \sim P_{G}} [\log (1 - D (x))]
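For a finite batch, the two expectations in the value function can be estimated by sample means. The sketch below is only an illustration of that estimate: the function name `gan_value` and the clipping constant `eps`, which avoids taking the logarithm of zero, are assumptions.

```python
import numpy as np

def gan_value(d_real, d_fake, eps: float = 1e-12) -> float:
    """Batch estimate of V(G, D) = E[log D(x)] + E[log(1 - D(x_generated))].

    d_real: discriminator outputs on real samples, each in (0, 1)
    d_fake: discriminator outputs on generated samples, each in (0, 1)
    """
    d_real = np.clip(np.asarray(d_real, dtype=float), eps, 1.0 - eps)
    d_fake = np.clip(np.asarray(d_fake, dtype=float), eps, 1.0 - eps)
    return float(np.mean(np.log(d_real)) + np.mean(np.log(1.0 - d_fake)))

# A discriminator that cannot tell the two sources apart outputs 0.5 everywhere.
v_confused = gan_value([0.5, 0.5], [0.5, 0.5])
# A discriminator that separates them well is close to 1 on real and 0 on generated data.
v_confident = gan_value([0.99, 0.99], [0.01, 0.01])
```

When the discriminator outputs 0.5 for every sample, the value is $-2 \log 2$, which is the level reached when generated and real data are indistinguishable; a discriminator that separates the two sources drives the value toward 0.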

By optimizing the control rate, a new control rate of a more suitable interference environment is generated, which reduces the influence of time-varying ocean currents and communication delays on multi-AUV cooperative hunting strategies, and improves the success rate of hunting.

The training of GAN is divided into two parts: generator training and discriminator training. During the discriminator training process, the label of the real data is set to 1 and the label of the data generated by the generator is set to 0. Then, the generated data are sent to the discriminator together with the real data, and the discriminator is trained to recognize which is the real data and which is the generated data. The update process for the discriminator is as follows

\begin{array}{l} L = \frac{1}{m} \sum_{i = 1}^{m} \log D (x_{i}) + \frac{1}{m} \sum_{i = 1}^{m} \log (1 - D (G (U_{i}))) \\ θ_{d} \leftarrow θ_{d} + γ \nabla L (θ_{d}) \end{array}

where L is the loss function, $x_{i}$ is a sample of the real data, $G (U_{i})$ is the corresponding sample produced by the generator, $θ_{d}$ is the discriminator parameter, $γ$ is the update step size, and m is the batch size.
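A single discriminator update can be sketched with a deliberately small model. The code below assumes a scalar logistic discriminator $D (x) = σ (w x + c)$ with hand-derived gradients and uses the two-term objective described earlier (log D on real samples plus log(1 − D) on generated samples). The model, data values, and function names are illustrative assumptions, not the network used in this article.

```python
import numpy as np

def sigmoid(s):
    return 1.0 / (1.0 + np.exp(-s))

def disc_objective(theta_d, x_real, x_fake) -> float:
    """Mean log D on real samples plus mean log(1 - D) on generated samples."""
    w, c = theta_d
    return float(np.mean(np.log(sigmoid(w * x_real + c)))
                 + np.mean(np.log(1.0 - sigmoid(w * x_fake + c))))

def disc_gradient(theta_d, x_real, x_fake) -> np.ndarray:
    """Gradient of the objective with respect to (w, c)."""
    w, c = theta_d
    p_real = sigmoid(w * x_real + c)
    p_fake = sigmoid(w * x_fake + c)
    # d/ds log(sigmoid(s)) = 1 - sigmoid(s); d/ds log(1 - sigmoid(s)) = -sigmoid(s)
    grad_w = np.mean((1.0 - p_real) * x_real) - np.mean(p_fake * x_fake)
    grad_c = np.mean(1.0 - p_real) - np.mean(p_fake)
    return np.array([grad_w, grad_c])

def disc_ascent_step(theta_d, x_real, x_fake, gamma: float) -> np.ndarray:
    """Gradient ascent: the discriminator maximizes its objective."""
    return theta_d + gamma * disc_gradient(theta_d, x_real, x_fake)

x_real = np.array([0.9, 1.0, 1.1, 1.2])   # stand-in for ideal control samples
x_fake = np.array([0.1, 0.4, 1.6, 0.3])   # stand-in for generated samples
theta_d = np.array([0.0, 0.0])
theta_next = disc_ascent_step(theta_d, x_real, x_fake, gamma=0.1)
```

Because the discriminator maximizes its objective, the step is taken along the gradient, and the objective evaluated at `theta_next` is higher than at the starting point.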

After determining the discriminator parameters, update the parameters of the generator. The multi-AUV control rate in the time-varying ocean current and communication delay interference environment is input to the generator, and the newly generated control rate label of the generator is set to 1. It is then sent to the discriminator for discrimination, and the error is fed back to the generator. The trained generator can generate a control rate $U_{i} (t)$ that resists time-varying ocean currents and communication delay interference. The update formula is

\begin{array}{l} L = \frac{1}{m} \sum_{i = 1}^{m} \log (1 - D (G (U_{i}))) \\ θ_{g} \leftarrow θ_{g} - γ \nabla L (θ_{g}) \end{array}

The discriminator and the generator are iteratively trained until the discriminator cannot distinguish between the generated data and the real data. Then, the control rate $U_{i} (t)$ can more accurately control the multi-AUV system in the complex underwater environment.
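The generator update can be sketched in the same spirit. The code below assumes an affine generator $G (u) = a u + b$ and a fixed logistic discriminator $D (x) = σ (w x + c)$; both models, the data, and the function names are hypothetical stand-ins. Since the generator minimizes $\log (1 - D (G (\cdot)))$ in the minimax objective, the sketch moves the generator parameters against the gradient of that loss.

```python
import numpy as np

def sigmoid(s):
    return 1.0 / (1.0 + np.exp(-s))

def gen_loss(theta_g, u, theta_d) -> float:
    """Mean of log(1 - D(G(u))) over the batch."""
    a, b = theta_g
    w, c = theta_d
    return float(np.mean(np.log(1.0 - sigmoid(w * (a * u + b) + c))))

def gen_gradient(theta_g, u, theta_d) -> np.ndarray:
    """Gradient of the generator loss with respect to (a, b)."""
    a, b = theta_g
    w, c = theta_d
    p = sigmoid(w * (a * u + b) + c)   # D(G(u)) for each sample
    # d/ds log(1 - sigmoid(s)) = -sigmoid(s), with s = w * (a * u + b) + c
    return np.array([np.mean(-p * w * u), np.mean(-p * w)])

def gen_descent_step(theta_g, u, theta_d, gamma: float) -> np.ndarray:
    """Gradient descent: the generator lowers log(1 - D(G(u)))."""
    return theta_g - gamma * gen_gradient(theta_g, u, theta_d)

u = np.array([0.2, 0.5, 0.8, 1.1])   # stand-in for disturbed control inputs
theta_d = np.array([1.0, -0.5])      # discriminator held fixed during this step
theta_g = np.array([1.0, 0.0])
theta_next = gen_descent_step(theta_g, u, theta_d, gamma=0.1)

score_before = float(np.mean(sigmoid(theta_d[0] * (theta_g[0] * u + theta_g[1]) + theta_d[1])))
score_after = float(np.mean(sigmoid(theta_d[0] * (theta_next[0] * u + theta_next[1]) + theta_d[1])))
```

After the step, the loss is lower and the average discriminator score on the generated samples is higher, which is the direction in which the generator is meant to move during iterative training.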

To achieve consistent coordination of the multi-AUV hunting system, a coordinated controller based on discrete (sampled) control is designed so that ${AUV}_{i}$ in the multi-AUV hunting alliance satisfies the following relationship

U_{i} (t) = K \sum_{j \in N_{i}} a_{i j} (t) (ξ_{j} (t_{k}) - ξ_{i} (t_{k})), t_{k} \leq t < t_{k + 1}

where K is the controller gain, $N_{i}$ is the set of neighbors of ${AUV}_{i}$ , $ξ_{j} (t_{k})$ and $ξ_{i} (t_{k})$ represent the system states of ${AUV}_{j}$ and ${AUV}_{i}$ at the sampling time $t_{k}$ , respectively, and $a_{i j}$ is the corresponding element of the adjacency matrix.
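The behavior of this sampled controller can be illustrated with a short simulation. The sketch below assumes scalar states, single-integrator dynamics $\dot{ξ}_{i} = U_{i}$ , a fully connected adjacency matrix with unit weights, and a control held constant between sampling instants; these modeling choices and the function names are assumptions for illustration only.

```python
import numpy as np

def consensus_control(xi, A, K: float) -> np.ndarray:
    """U_i = K * sum_j a_ij * (xi_j - xi_i), evaluated at a sampling instant t_k."""
    xi = np.asarray(xi, dtype=float)
    A = np.asarray(A, dtype=float)
    # sum_j a_ij * xi_j  minus  (sum_j a_ij) * xi_i
    return K * (A @ xi - A.sum(axis=1) * xi)

def simulate(xi0, A, K: float, dt: float, steps: int) -> np.ndarray:
    """Hold U constant on [t_k, t_{k+1}) and integrate the assumed dynamics xi_dot = U."""
    xi = np.asarray(xi0, dtype=float).copy()
    for _ in range(steps):
        U = consensus_control(xi, A, K)
        xi = xi + dt * U
    return xi

n = 6
A = np.ones((n, n)) - np.eye(n)   # fully connected topology, unit weights
xi0 = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])
xi_final = simulate(xi0, A, K=1.0, dt=0.1, steps=100)
```

With these settings, all six states converge to the average of the initial states, 2.5. For this setup the product of the gain, the sampling interval, and the largest Laplacian eigenvalue must stay below 2 for the sampled loop to remain stable, which is why the gain and step are kept small here.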

By introducing the GAN model, the multi-AUV collaborative hunting strategy under time-varying ocean currents and communication delay interference is generated. It can effectively improve the success rate of hunting noncooperative targets and make the algorithm more robust. The specific algorithm flow is provided in Table 1, and the schematic diagram is shown in Figure 8.

Table 1.

Multi-AUV consistent collaborative hunting method based on GAN.

Input: Target status information ${[\begin{matrix} u & v & w & p & r & q \end{matrix}]}^{T}$ , number of AUVs n.
Output: The hunting point of a noncooperative target and its hunting results.
1. Calculate the multi-AUV hunting strategy under ideal conditions based on the target state;
2. Calculate the control rate $u_{i} (t)$ of ${AUV}_{i}$ ;
3. The hunting AUV is disturbed, and the actual position deviates from the ideal position;
4. GAN input x is the actual location of the hunting AUV;
5. Generator G generates a new control rate $U_{i} (t)$ ;
6. If discriminator D determines that $U_{i} (t)$ is true
7. Go to step 10;
8. If discriminator D determines that $U_{i} (t)$ is false
9. Go to step 5;
10. Calculate multi-AUV consistent collaborative control;
11. Output the control rate $U_{i} (t)$ of ${AUV}_{i}$ ;

GAN: generative adversarial network; AUV: autonomous underwater vehicle.
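The accept-or-regenerate loop in steps 5 to 9 of Table 1 can be sketched as follows. The callables `generate` and `accept` are hypothetical placeholders for the trained generator and discriminator, and the iteration cap `max_iters` is an added safeguard that does not appear in the table.

```python
from typing import Callable, Optional, Tuple

def refine_control(
    x_actual: float,
    u_ideal: float,
    generate: Callable[[float, Optional[float]], float],
    accept: Callable[[float, float], bool],
    max_iters: int = 100,
) -> Tuple[float, int]:
    """Repeat step 5 until the discriminator accepts the candidate (steps 6 to 9)."""
    candidate: Optional[float] = None
    for iteration in range(1, max_iters + 1):
        candidate = generate(x_actual, candidate)   # step 5: new control rate
        if accept(candidate, u_ideal):              # steps 6-7: judged true, continue to step 10
            return candidate, iteration
        # steps 8-9: judged false, go back to step 5
    raise RuntimeError("no accepted control rate within max_iters")

def demo_generate(x_actual: float, previous: Optional[float]) -> float:
    """Stand-in generator: starts at the disturbed value, then halves its offset from 1.0."""
    if previous is None:
        return x_actual
    return previous + 0.5 * (1.0 - previous)

def demo_accept(candidate: float, u_ideal: float) -> bool:
    """Stand-in discriminator: accepts candidates within 0.04 of the ideal control rate."""
    return abs(candidate - u_ideal) < 0.04

U, iterations = refine_control(0.2, 1.0, demo_generate, demo_accept)
```

In this toy run, the candidates move from 0.2 toward the ideal value 1.0 and the sixth one is accepted. In the method of this article, the accepted $U_{i} (t)$ would then be passed on to the consistent collaborative control of steps 10 and 11.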

Figure 8.

Multi-AUV consistent collaborative hunting control based on GAN. GAN: generative adversarial network; AUV: autonomous underwater vehicle.

Simulation

The simulation runs on a small server with an E5-2630 v4 CPU (2.2 GHz main frequency) and 32 GB of memory. The algorithm in this article is simulated in MATLAB R2016a under Windows 10. The AUV is set to a depth of 10 m and a speed of u = 1.5 m/s. The effect of time-varying ocean currents and communication delays on the AUV obeys a normal distribution with a mean of 0 and a standard deviation of 0.5. The initial state of the AUV is set to $[\begin{matrix} 0 & 0 & 10 & 0 \end{matrix}]$ , and the sampling interval is 0.1 s.
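The disturbance setting can be reproduced in outline with a few lines of code. The sketch below generates a straight-line track at the stated depth and speed and adds zero-mean Gaussian noise with standard deviation 0.5. Treating the disturbance as additive position noise, as well as the function name `disturbed_track` and the fixed random seed, are assumptions made for illustration; the article does not state how the disturbance enters the model.

```python
import numpy as np

def disturbed_track(speed=1.5, depth=10.0, dt=0.1, steps=100, sigma=0.5, seed=0):
    """Return time stamps, an ideal straight track, and a disturbed copy of it.

    The disturbance is modeled (as an assumption) as additive N(0, sigma^2)
    noise on each position coordinate at every sampling instant.
    """
    rng = np.random.default_rng(seed)
    t = dt * np.arange(steps + 1)
    ideal = np.column_stack([speed * t, np.zeros_like(t), np.full_like(t, depth)])
    noise = rng.normal(loc=0.0, scale=sigma, size=ideal.shape)
    return t, ideal, ideal + noise

t, ideal, actual = disturbed_track()
max_dev = float(np.max(np.abs(actual - ideal)))    # largest per-axis deviation
mean_dev = float(np.mean(np.abs(actual - ideal)))  # average per-axis deviation
```

The two summary values correspond to the kind of maximum and average deviation figures reported for the control curves, here computed for the uncontrolled disturbance only.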

The GAN model is trained as shown in Figure 9. The red curve represents the real target information, and the blue curve represents the hunting AUV control information generated by the generator. In Figure 9(a), the blue lines are constantly approaching the red lines. Figure 9(b) shows the algorithm after 30 iterations of learning. Most of the blue lines are closer to the red line, indicating that the training results are very close to the true value. Figure 9(c) shows the optimal control strategy output by the system. The data error during generator training is shown in Figure 10. The final output error is controlled within 3%.

Figure 9.

GAN generator training process: (a) 5th iteration, (b) 30th iteration, and (c) output result. GAN: generative adversarial network.

Figure 10.

GAN generator training process error: (a) 5th iteration, (b) 30th iteration, and (c) output result. GAN: generative adversarial network.

It is difficult to stabilize the control of the hunting AUV due to the influence of time-varying ocean currents and communication delays. In the simulation, the algorithm is compared with the linear quadratic Gaussian controller with loop transfer recovery (LQG/LTR) algorithm, the fuzzy proportional-integral-derivative (PID) algorithm, and the fuzzy adaptive PID algorithm in the same environment. The control curves of the different algorithms for the hunting AUV in the interference environment are shown in Figure 11.

Figure 11.

Control curve of different algorithms for AUV under time-varying ocean current and communication delay interference: (a) plane position control and (b) depth control. AUV: autonomous underwater vehicle.

In Figure 11(a), with the LQG/LTR algorithm, the maximum deviation of the plane position is 0.99 m and the average deviation is 0.6 m. With the fuzzy PID and fuzzy adaptive PID algorithms, the maximum deviations are 0.8 and 0.7 m, and the average plane position errors are 0.57 and 0.46 m, respectively. With our algorithm, the maximum positional deviation is within 0.5 m, and the average positional deviation is within 0.42 m.

In Figure 11(b), the maximum depth deviation of the LQG/LTR algorithm is 0.97 m and the average deviation is 0.43 m, while the maximum deviations of the fuzzy PID and fuzzy adaptive PID algorithms are 0.92 and 0.6 m, respectively, and their average depth deviations are 0.46 and 0.33 m. When the AUV depth is controlled by the algorithm of this article, the maximum deviation is 0.42 m and the average deviation is 0.21 m. Compared with the above three algorithms, the algorithm proposed in this article is more stable in the interference environment.

The multi-AUV consistent cooperative hunting control method based on GAN can reduce the influence of time-varying ocean current and communication delay in the multi-AUV hunting process. The algorithm of this article is compared with other algorithms in the same interference environment in terms of multi-AUV system consistency. Each initial position is generated by adding environmental errors to the hunting point assigned to each AUV around the noncooperative target. The simulation results are shown in Figure 12.

Figure 12.

Multi-AUV consistent collaborative error: (a) 6 hunting AUVs, (b) 8 hunting AUVs, and (c) 12 hunting AUVs. AUV: autonomous underwater vehicle.

Figure 12 shows the consistent collaborative error curves for 6, 8, and 12 hunting AUVs, respectively. When the number of hunting AUVs is 6, our algorithm has the minimum consistent collaborative error, 0.21 m. As the number of hunting AUVs increases, the consistency error changes. When the number of hunting AUVs is 12, the maximum consistency error of our algorithm is only 0.51 m. Among the other algorithms, the evolutionary artificial neural networks (EANNS) algorithm has the minimum error, 0.37 m, when the number of hunting AUVs is 6, but its error increases with the number of hunting AUVs and reaches 0.94 m when the number is 12. From the above data analysis, our algorithm can effectively reduce the impact of time-varying ocean currents and communication delays on multi-AUV systems, but its convergence speed is slow, and the algorithm needs to be optimized in future work.

Table 2 provides the hunting time and success rate of the different algorithms in noncooperative target hunting simulations under the same interference environment. When the number of hunting AUVs is 6, the BB algorithm has the shortest hunting time, 53.34 s, but its hunting success rate is only 65.29%. When the number of hunting AUVs is 8, the PF algorithm has the shortest hunting time, 62.58 s, while the algorithm proposed in this article has the highest hunting success rate, 84.74%. When the number of hunting AUVs increases to 12, the algorithm proposed in this article has both the highest hunting success rate, 85.25%, and the shortest hunting time, 56.37 s.

Table 2.

Multi-AUV hunting experiments (comparison of different algorithm hunting time and success rate, bold indicates the best data).

Methods	AUV = 6		AUV = 8		AUV = 12		Average
Methods	Time (s)	Success rate (%)	Time (s)	Success rate (%)	Time (s)	Success rate (%)	Time (s)	Success rate (%)
BB	53.34	65.29	69.25	67.61	75.74	74.15	66.11	69.02
RB	56.58	69.32	64.92	72.15	71.52	76.57	64.34	72.68
PF	65.69	58.53	62.58	61.82	60.25	70.06	62.84	63.47
AP	68.36	70.36	70.35	74.16	76.51	80.72	71.74	75.08
LP-rule	69.48	76.29	73.29	79.58	84.86	83.56	75.88	79.81
SVF-model	64.26	82.37	71.47	82.94	76.52	83.25	70.84	82.53
MSAC	63.57	79.71	65.52	80.32	69.26	84.17	66.12	81.40
SRFH	69.32	74.91	64.21	76.77	60.17	79.83	64.57	77.17
PSO	79.17	77.40	80.21	78.16	83.07	82.57	80.82	79.38
PSO-CMA	70.34	82.46	68.26	83.74	62.71	84.36	67.10	83.52
EANNS	69.27	82.18	64.38	82.62	59.36	84.04	64.34	82.95
Ours	67.63	84.07	63.59	84.74	56.37	85.25	62.53	84.69

BB: behavior-based; RB: rule-based; AP: artificial physics; PF: potential functions; LP-rule: loose-preference rule; SVF: simplify virtual forces; MSAC: m-estimator sample consensus algorithm; SRFH: swarm robots for hunting.
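The averages in the last two columns of Table 2 can be recomputed from the per-scenario values. The short check below does this for the proposed method and for PSO-CMA; the dictionary layout and the helper name `mean2` are illustrative choices, while the numbers are copied from the table.

```python
# Hunting time (s) and success rate (%) for 6, 8, and 12 hunting AUVs, copied from Table 2.
rows = {
    "PSO-CMA": {"time": [70.34, 68.26, 62.71], "success": [82.46, 83.74, 84.36]},
    "Ours": {"time": [67.63, 63.59, 56.37], "success": [84.07, 84.74, 85.25]},
}

def mean2(values):
    """Arithmetic mean rounded to two decimals, as reported in the table."""
    return round(sum(values) / len(values), 2)

summary = {name: (mean2(r["time"]), mean2(r["success"])) for name, r in rows.items()}

# Difference in average success rate, in percentage points.
gain = round(summary["Ours"][1] - summary["PSO-CMA"][1], 2)
```

The recomputed averages agree with the table (62.53 s and 84.69% for the proposed method, 67.10 s and 83.52% for PSO-CMA), and the difference in average success rate is 1.17 percentage points.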

In summary, when the number of hunting AUVs is 6 or 8, the algorithm proposed in this article does not have the minimum hunting time, but its hunting success rate is the highest and shows no large fluctuation. Under the interference environment of time-varying ocean current and communication delay, the average hunting success rate of the algorithm in this article is 84.69%, and the average hunting time is 62.53 s. However, the convergence speed of the multi-AUV system is relatively slow. In the future, we will continue to optimize the algorithm to reduce the hunting time and improve the success rate of hunting.

Summary

Consistent coordinated control of multiple AUVs is the key to the noncooperative target hunting process, but how to achieve successful multi-AUV hunting of noncooperative targets under the influence of time-varying ocean currents and communication delays is unclear. In view of the above problems, this article proposed a multi-AUV consistent collaborative hunting method based on GAN. The GAN is introduced into the multi-AUV collaborative hunting field. The generator is used to generate the coordinated control rate suitable for the current complex environment, and the successful hunting of noncooperative targets under the time-varying ocean current and communication delay interference environment is realized. Experiments show that the algorithm proposed in this article achieves a good hunting effect, but it needs to be further improved in terms of hunting time. In future work, we will focus on the above issues to make the algorithm more efficient.

Footnotes

Data availability statement

The data set in this article is a self-built multi-AUV data set. It contains confidential information, such as the performance parameters and tactical technical indicators of the AUV. Therefore, the data set is confidential and cannot be released.

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article. This work was supported by the National Key Research and Development Project [2019YFB1311002], the National Natural Science Foundation of China [61703143], National Defense Science and Technology Innovation Special Zone, Science and Technology Project of Henan Province [192102310260], the young backbone teacher training project of Henan University [2017GGJS123], and Science and Technology Major Special Project of Xinxiang City [ZD18006].

ORCID iD

Lei Cai
