Abstract
The multi-autonomous underwater vehicles (Multi-AUVs) cluster is an important means to solve the marine tasks effectively. The heterogeneous Multi-AUVs are constrained by cooperative relationship, and a model of multi-autonomous underwater vehicles role-based collaborative task planning is proposed. The Multi-AUVs are set to different roles depending on the functional properties. To analyze the accountability of each role, and to ensure the reliability, the desired behavior and the estimate state of each role are described in the model. Task allocation needs to be implemented dynamically in path planning, for the existing of the cooperative relationships and the demand of tasks changes. Role-based task assessment and allocation methods are proposed to achieve dynamic adjustment of roles according to task requirements. Due to poor underwater communication conditions, the implicit coordination framework is implied to the coordinate information interaction to compensate the large delays in underwater communications and the reliance between Multi-AUVs. To adapt to the implicit collaborative framework and poor communication conditions, a variable communication radius (contract network) is proposed. The simulation result shows that the proposed method has well performance.
Keywords
Introduction
Autonomous underwater vehicle (AUV) is a highly integrated underwater intelligent vehicle which can perform complex marine tasks by carrying various sensors and special equipment. By programming, the AUV can autonomously perform complex marine tasks without humans. Due to the demands of special tasks, the size of AUV should not be too large, which requires us to design micro AUV. The problem with micro AUV, however, is that they have limited space, the sensors and other functional modules they carry. Therefore, heterogeneous Multi-AUVs are applied to the actual scenario. The efficient and reliable tasks execution can be realized through reasonable measurement and configuration of its strategy. 1,2 The collaborative task planning problem of Multi-AUVs is an non-deterministic polynomial-hard problem with time-sequence logical constraints. In this article, a role-based task planning method is proposed. According to tasks requirements, the Multi-AUVs will be given different roles, to make the task collection be decomposed into multiple subtasks of the same attributes. The method can reduce with the size of the temporal logic programming problem, and shorten the planning time.
The main marine tasks that the Multi-AUVs can perform are hydrological monitoring, military confrontation, and underwater operations. 3 The tasks set can be divided into different task subsets according to different sensors required, and these subsets are the basis for assigning roles to AUV. The functional modules loaded by heterogeneous AUV are different, so the types of tasks performed are subject to the functional modules. In this article, a functional hierarchy is proposed based on the types of tasks performed by AUV and the time–space constrained task execution strategy. To achieve high task execution efficiency, the parallel management method is adopted for AUV by referring to the research methods of multiple agents. Each same constructed AUV can be identified as a role with unique advantages for performing a given task. 4
The role-based AUV classification method is adopted to make AUV change from passive receiving instruction information to active analyzing of other collaborative AUV status in the process of task execution, to achieve the reliability of collaboration under poor communication conditions and unknown environment. It is also the guiding ideology of parallel management. Partially observable Markov decision-making process (POMDP) method applied in parallel role model is appropriate. 5
A contract network algorithm with variable communication radius (VCR) is proposed, which can effectively solve the problem of cooperative control in weak communication environment, reduce communication range and communication bandwidth; and at the same time, a Multi-AUVs tasks execution scheme that can meet the task demand is presented.
The rest of this article is as follows. Second section presents the collaborative task planning modeling and the implicit collaborative framework. In the third section, the Multi-AUVs role assignment and task assignment is provided. The simulation and experimental is shown in fourth section and we conclude this article in the last section.
Collaborative task planning modeling
Role description
The structure of parallel management is the same as that of AUV distributed structure. Due to communication restrictions, it is commonly used to find a distributed structure (master–slave structure) of AUV as the main AUV, but this will make the AUV limited. 6 The application of implicit cooperation can make the decentralized AUV structure have the same synergistic effect as the master–slave structure, which means reflected in the improvement of freedom and intelligence.
As shown in Figure 1, heterogeneous Multi-AUVs clusters are divided into different roles according to their respective functions. For a given task, the task can be performed by

AUV and role relational topology. AUV: autonomous underwater vehicle.
A belief function
where
To predict the behavior of a role, the AUV, which is both the role holder and the planner of the action, must know the policy functions of all roles.
8
A set of policy functions
When AUV performs the next action, it needs to combine the policy with the current belief. The belief states of other roles need to be updated by the prediction function. By observing the actual actions of the teammates’ roles after the execution of the actions, Bayes estimates that the updates can be made
where
The parameter z is the actual observation of the other roles, thence every possible observation
To get the only state of belief, k describes a set of critical actions that lead the process of tasks. The performer of the critical actions is the AUV that takes the role of path planner, denoted by
This function ensures the accuracy of the prediction while promoting the task. And from above, get the following equation
Implicit coordination modeling
When Multi-AUVs perform tasks, due to the influence of environmental uncertainty and limited communication interaction, the intelligence degree of collaboration between AUVs plays an important role in the completion of collaborative tasks. For specific tasks, the functional requirements for AUV are different. Therefore, it is necessary to divide AUV with specific functions to achieve the reliability of AUV collaboration under unknown environment and poor communication conditions. 9 –11
By using POMDP to analyze inaccurate observation information and uncertain execution effect, the role model of Multi-AUVs was established. Because of the role assignment, Multi-AUVs can solidify several behaviors in each role’s knowledge base, thereby reducing the amount of data generated by adopting a fixed communication format. 12,13
To realize AUV cooperation under poor communication conditions, it is assumed that AUV broadcasts a clue c ∈ Ca to other members after performing an action a. The non-empty Ca is all possible clue of a. The implicit coordination framework is shown in Figure 2.

The implicit coordination framework.
From the role model, we can get the policy
The prediction function is used to predict the further action from the prediction set of belief state. It can make decision by each belief state
Consider a simple model with two states
To further explain the model, specific objects are proposed. During the execution of a series of tasks, three roles may be used, such as path planning role, function execution role, and communication and computing role. Each role has several states of its own, such as movement and cessation. When the path planning role arrives at the designated location, it will stop moving, so that other roles will start to perform topographic scanning and detection when they find that the path planning role has stopped moving. 14 –16
Task description
Marine tasks are classified by corresponding Multi-AUVs sensor modules, and the tasks set and Multi-AUVs collection must describe these attributions. To fully describe the Multi-AUVs task assignment, the Multi-AUVs cooperative task allocation can be described with multi-tuple
In equation (10), the Multi-AUVs roles collection is
As shown in Figure 3, there are six groups of roles and different types of tasks in the environment. Each group of Multi-AUVs must perform one kind of task sequence. The objective functions of AUV execution are the shortest time, which is also the shortest path when there is no current information in the task environment

Task description.
Assuming that there are n groups of roles in the area to carry out the marine tasks, m targets need to be allocated, then the target distribution matrix
In equation (11),
In equation (12),
Multi-AUVs role assignment and task planning
Role assignment
As shown in second section, tasks and role groups are one-to-one correspondence, and role groups contain at least one heterogeneous AUV. Therefore, to simplify the problem, the role assignment problem is simplified to the matching problem of the role group and the same attribute task. According to the requirements of tasks for AUV function modules, specific AUV is assigned to different roles, and AUV resource constraints and task attribute constraints are considered in the allocation process. Meanwhile, equations (7) and (8) are adopted in the role group to allocate the role and obtain the best expected cumulative reward. Since this article mainly introduces the cooperative task assignment problem, the traditional Hungarian algorithm can be adopted for the role pre-assignment problem. 22
Task planning algorithm
Collaborative task planning is usually divided into task allocation and path planning problems. In this article, the methods of dealing with the task allocation problem is as follows: instead of assigning each AUV to the role group, the heterogeneous AUV with the cooperative relation can form the cooperative relation, and then solve each specific task. For the path planning problem, it can be seen from the modeling in the second section that an AUV is required to assume the role of the path planner. Therefore, collaborative path planning becomes a Multi-AUVs task traversal problem. The objective function of path planning in this article adopts the strategy of shortest path, and the objective function is also the shortest time if the underwater water flow environment is not considered.
For the task planning problem, the contract network is used to optimize the solution. The role group will bid value as the control variable for negotiation and competition and determine the distribution of tasks. Due to the limitation of underwater communication conditions, the VCR contract network algorithm is adopted for tenderers to send bidding signals to tenderers within a certain communication radius (actual space distance). If no suitable tenderers are found, the communication radius will be extended. The flowchart of VCR contract network algorithm is shown in Figure 4.
Task announcement stage: For a new target, if the current role group is unable to complete the task, the tenderer role group will inform the bidders to participate in the bid. The bidding stage is based on the communication mechanism. The tenderer sets the communication radius and issues the bidding information to the role group within the communication radius.
Bidding stage: When the role group within the communication radius of the tenderer receives the bidding information, it evaluates the change in the effectiveness of the target after the execution of the contract according to the requirements of the contract and its own status and sends the calculated results to the tenderer. When the bidder finds that the role group within the current communication radius cannot give a high-performance change value, the communication radius will be enlarged and the bidding information will be sent to the role group that has not made the bidding.
Awarding stage: The task is assigned to the winning bidder when the role group within the current communication radius of the tenderer can give a higher performance value.
Monitoring stage: The successful bidder role group and the tenderer role group have no opinion on the proposed contract. A commitment monitor relationship is formed and the implementation of the tasks enters the schedule. Bidding is completed, and the task will be executed successfully.

Block diagram of VCR contract network algorithm. VCR: variable communication radius.
When the task assignment is completed, the role group forms a formation to perform the corresponding task sequence. The path planning
To highlight the problems studied, the shortest path algorithm is adopted to realize the path planning of multi-role groups. A feasible path planning algorithm can be seen in the article. 23 Without considering the effect of obstacles and ocean current in the underwater environment, the shortest path will be the broken line connection of the task location in the task sequence.
Simulation and experimental
In this article, the simulation environment is a three-dimensional (3-D) underwater environment of 1000 × 1000 × 2 (km). To facilitate observation, the 3-D space is projected into the two-dimensional space, where role group 1, 3, 6 can perform task type 1, and role group 2, 4, 5 can perform task type 2. Task types are distinguished by triangular and square in Figure 5.

Comparison of task planning using two contract networks.
As shown in Figure 5, the task planning results of the traditional contract network and the VCR contract network are, respectively, used.
In the traditional contract network task planning results, the global execution efficiency of tasks is high, and each role group can perform corresponding tasks. However, it is obvious to pursue the global task load rate and role groups are more inclined to make tasks uniformly executed, resulting in serious bending of a single role group in path planning.
It can be seen from the VCR contract network simulation results: Tasks are all nearby principles, so that qualified role groups can perform corresponding tasks, which can not only meet the requirements of weak communication conditions but also realize shorter paths. From the VCR contract network simulation results, it can also be seen that role group 1 does not need to perform tasks, thus saving AUV equipment resources.
The Multi-AUV task allocation method based on role groups enables the AUV group to have strong task execution capability, which is more reliable than the single AUV task sequence. The task path length of each role group based on the VCR contract network and the traditional contract network is shown in Table 1.
The path length of two contract network algorithms (units: kilometer).
VCR: variable communication radius.
It can be seen from Table 1 that the path length of the traditional contract network algorithm increases because of global bidding, while the overall path length of the contract network algorithm with VCR decreases because of adopting the principle of proximity.
At the same time, the method of VCR is well adapted to the underwater poor communication environment. When using the VCR contract network algorithm, the total path of a certain role group may be too long, but the above path scheme meets the needs of energy or path length during the bidding process of the contract network. If the global load rate needs to be balanced, improvements can be made during the modeling process. For example, when a first-level contract network can meet the tender requirements, it needs to be compared with the role groups within the second-level communication radius. To some extent, this approach can balance the communication and load rate requirements.
Conclusions
In this article, the implicit collaboration of role mechanism is adopted to make Multi-AUVs work effectively under the condition of poor communication. At the same time, a role-based task planning method was designed, which greatly increased the reliability of Multi-AUVs task execution. Contract network algorithm based on VCR also has good application characteristics for poor communication environment. And the overall path length of the contract network algorithm with VCR decreases because of adopting the principle of proximity. This article focuses on solving the cooperative control problem of collaborative task planning under the condition of poor communication with the function-centered role mechanism. The future will be considered how to solve the key AUV fault task re-planning method within the role group.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported in part by the National Natural Science Foundation of China subsidization project (51579047), the Natural Science Foundation of Heilongjiang Province (QC2017048), the Natural Science Foundation of Harbin (2016RAQXJ077), and the Fundamental Research Funds for the Central Universities (HEUCF180407).
