Abstract
Chemical plume tracing based on autonomous underwater vehicle uses chemical as a guidance to navigate and search in the unknown environments. To solve the key issue of tracing and locating the source, this article proposes a path-planning strategy based on partially observable Markov decision process algorithm and artificial potential field algorithm. The partially observable Markov decision process algorithm is used to construct a source likelihood map and update it in real time with environmental information from the sensors on autonomous underwater vehicle in search area. The artificial potential field algorithm uses the source likelihood map for accurately planning tracing path and guiding the autonomous underwater vehicle to track along the path until the source is detected. This article carries out simulation experiments on the proposed algorithm. The experimental results show that the algorithms have good performance, which is suitable for chemical plume tracing via autonomous underwater vehicle. Compared with the bionic method, the simulation results show that the proposed method has higher success rate and better stability than the bionic method.
Introduction
Olfactory is a long-distance perceptual behavior, which is widely used by animals in search of food and finding mates, 1 –3 such as Pacific salmon returning to their habitat, 4 Antarctic seabirds foraging, 5 and some insects mating and feeding. 6 –8 In nature, olfactory plays a vital role in most animals. When searching for targets such as food and nests, animal olfactory cues are more effective than the visual and auditory sense cues. 9 Although the detection of smell is simpler than hearing and vision, how to use olfactory cues to search for a specific target is still a difficult task. 10 So far, few studies exist on this powerful and effective basic sensory system in robots.
With the tremendous development in the fields of robot technology, sensor technology, artificial intelligence, active olfactory technology has attracted attention of more and more scholars. And it will have a huge application prospect in civil, military, or anti-terrorism fields. Use of robots with olfactory technology can detect the leak of hazardous chemicals, lock down unexploded bombs and naval mine, search and rescue trapped people at the disaster site, detect fire sources in fire places, 11 and so on. Olfactory robot can even perform some complex and difficult tasks that require the cooperation of humans and animals (such as dogs) with highly developed olfactory.
Hayes et al. 12 described active olfactory as a task on how to use mobile robots to search odor in two-dimensional environment and divided it into three behaviors: plume finding, plume traversal, and odor source declaration. Later, the entire search task was divided into four behavior patterns: plume finding, tracing the plume, reacquiring the plume, and odor source declaration, as researched by Li et al. 13 Plume finding is a behavior pattern used at the beginning of search tasks, which can make the robot find the plume as soon as possible. The robot may lose plume target which makes it impossible to keep tracking because flow field in the environment is constantly changing. So it needs to use some known plume information for finding and tracking again. The whole process is called reacquiring the plume behavior. Odor source declaration is a behavior of the robot to determine whether it has reached the source of the smell and confirm the source location.
The current research studies mainly focus on active olfactory tracing method based on bionics principle. Human has developed olfactory tracing and locating method based on the theory of bionics by imitating the strategy of some living creatures using developed olfactory to search odors. Some organisms can trace chemicals by the concentration gradient of the chemical information obtained (such as slime mold), which is the most commonly used method of bionics called chemotaxis. Most animals have two odor sensors: nostrils or tentacles. They will compare the concentration of odors detected by the nostrils and then adjust to the direction of the strongest signal. Based on this phenomenon, it is first put forward that this method can be applied to robot olfactory research by Berg and Brown. The robot can move in a direction of higher concentration which is detected by two chemical sensors in different positions. 14 The movement behavior of lobster and moth were researched by Grasso and Belanger, and the result showed that all organisms had the similar movement behavior. According to the abovementioned behavior, an efficient simulation platform can construct a movement of moving along a z-type in the wind. The simulation platform can test, optimize, and verify the active detection method designed in the initial phase. This form of movement can keep robot itself remain inside the plume and gradually approach the plume source. 15 –19 By studying the process of tracking odors in moths, Farrell and Pang developed a task planner for searching plume used on the autonomous underwater vehicle (AUV). After several tracing chemical plume experiments, the results showed that the method carried out by Farrell and Pang is an effective method for tracking and locating plume. 20,21
At present, the successful detection algorithms are based on bionics, but few scholars have developed a tracking positioning method that is based on the theory of engineering which can be used in actual production. The computer system of the AUV is equipped with powerful storage capability and analysis ability, the nature that the organisms do not have. For example, the AUV’s computer system can record the flow speed/direction and detect the chemical information concentration, locate the AUV real-time position information, and store the functions about the concentration of the flow speed and chemical information in the memory. Organisms cannot accurately remember where they had been precisely before, where it had detected odor, and the recent historical data of wind speed and wind direction. In this article, a tracking and positioning method based on engineering theory will be adopted to efficiently utilize the storage and computational analysis abilities far beyond those of natural organisms for the AUV olfactory search and detection. Compared with the pure bionics algorithm, the tracking and location method proposed in this article can make full use of the storage capacity and compute and analyze resources of the AUV to achieve more stable performance.
This article presents a partially observable Markov decision process (POMDP) method used on the probability distribution map of the chemical source and is based on the artificial potential field (APF) method of the AUV olfaction search positioning path-planning method. Specific details about the APF olfactory path-planning method are available in the literature. 22 The main works of this article were as follows: Firstly, the POMDP method is used to estimate the probability distribution of the source of the chemical plume, and the source likelihood map is obtained. Secondly, according to the characteristics of the probability distribution map, the chemical plume is tracked by the APF method. Finally, the method is verified by experiments. Experimental results showed that the abovementioned methods could effectively and steadily solve the AUV’s sense of smell for tracking and locating problems.
Construction of chemical source likelihood map
Framework of partial observation Markov decision process theory
The traditional partial observation Markov decision process can be described as a six-tuple,
In this article, observation values are detected by the AUV’s chemical sensor, (
This binary observation avoids the fluctuation of chemical plume concentration in time-varying environment, with the fuzzy concentration value.
After making the decision
Using full probability formula for calculating probability
Combine equations (1) and (2) to be the update equation
Establishment of probability distribution map model of chemical source location
Tracing and locating chemical plume accords with the characteristics of POMDP method: Chemical source position is the hidden state which cannot be observed, but the AUV’s position is the explicit state which can be observed by localization sensors. When the AUV is tracing plume, the flow field is changing all the time and the AUV guides APF algorithm to make decision (path planning) only by learning the change states surrounding. The decision path may be right or wrong, a correct decision can make the AUV go further toward completing the task, but a wrong decision cannot cause the task to be failed. Performing a series of correct decisions can make the task to be successful, and a series of wrong decisions can make the task to be failed, which meets characteristics of decision-making process. POMDP method has a wide range of applications in the AUV planning field, we use POMDP method to build the chemical plume source likelihood map in this section.
The establishment of the state space
To make easier calculation of constructing source likelihood map, the plume search area is divided into several rectangular cells, and the setting of the coordinate system, and the division of the cells are shown in Figure 1. Cell vector

Cell partitioning sketch of the search area.
According to the known number of cell sequences number
Int(c1, d1) represents the largest integer less than or equal to c1/d1, and rem(c1, d1) represents the remainder of c1/d1. This proves that the two cell representation methods, Cf , g and Ci , are equivalent.
Cell vector
The AUV location information
The AUV’s state vector
Determination of decision
Decision is heading angle command for the AUV which perform searching and locating task, decision space is 1-D continuous space
State transfer function
The chemical source location in the search area is constant, state
Observation, observation model, and observation function
When the AUV detects the cells surrounding by the sensor carried on the AUV, gets all the detection information is observation result, we call this process observation. In this article, chemical sensor is used for searching chemical plume and locating the chemical plume based on the change of chemical concentration.
What can be determined is that the AUV can detect the presence of chemical plume in some cell by the chemical sensor. Before starting the search and localization task, we set a chemical concentration threshold. If the chemical concentration is detected by the sensor is higher than this set threshold, the plume is thought to have been detected,
Setting a threshold can avoid chemical concentration and gradient information failure in complex flow field, which results in achieving greater simplicity and higher reliability. The value of threshold in this article is obtained according to the observation results of flow field, normally taking 10% above the maximum concentration value in all observation results as the threshold value.
Observation model can be calculated by chemical plume motion model. Many scholars used two motions together for studying chemical plume instantaneous concentration distribution: molecular Brown motion and the fluid flow motion. Based on this theory, the location of chemical plume is
where
In the literature, 23 the plume diffusion distribution model is studied on the basis of equation (8). This section is only a brief introduction. Pang divided the plume diffusion model into single plume diffusion distribution model and continuous plume diffusion distribution model.
(1) Single releasing chemical plume model: Equation (8) can be transformed into
where
When a plume is released at time
Define μ is the probability that plume exists in the cell, it also is the reliability of the chemical sensor. In this article, setting the reliability is
Continuous releasing chemical plume model: The previous discussion considered situation of releasing a single plume. Normally, it should be continuous releasing chemical plume situation. The calculation formula of
Vector
The previous discussion considered that the situation of plume can be detected. If the AUV cannot detect the plume at time
where vector
It is hard to determine the initial moment of starting to release plume, because the source may have released plume for a long time, and plume was already moving outside the research area, so it is already not helpful for searching and locating. If the AUV detects plume, we can determine that plume was released from the source not long ago. The AUV cannot keep all the observation information, but can only keep information in the last l time steps. So
Based on the model of
For
Because
Based on the model of
The reason for using
Online mapping of chemical source likelihood map
In the previous content, update formula for the belief state is estimated as
and
Hidden state
Decision
Therefore, the belief state is updated, which means the chemical source likelihood map is updated for the model of
When
When
The model is established on the premise that there is only single chemical source in the entire search area.
The belief state updated equation based on POMDP method is proposed in this section, it is also the method of constructing chemical source likelihood map. Updating of belief state
Artificial potential field-based online planning
APF method was first proposed in 1986, used in movement planning of robotic arm by Khatib, and made a good exercise effect. 24 –26 APF method has the following characteristics of simple model, small calculation, and real-time performance, is widely used as method of path planning, and is a relatively stable and efficient method in current path planning methods.
In this article, APF method is used for tracing plume, and the movement of robot can be considered as causing the force field in a virtual space. The force field is APF, which includes gravitational field and repulsive field. Repulsive field is generated by the obstacle for the AUV, and gravitational field is generated by the plume information for the AUV. We assume that there is no obstacle in the plume search and localization area. Traditional APF method needs to be modified: Repulsive field is removed and gravitational field is retained. When the target is near the obstacle in the traditional APF method, the AUV cannot reach the target location, and the previous method we proposed can avoid this situation. In this case, the AUV is only subjected to the gravitational field in the APF environment and is moving toward the source. But the location of the plume is uncertain, and we need to calculate source likelihood map for generating gravitational field to attract the AUV. Source likelihood map represents the probability values of chemical source in each cell. The greater the probability value, the more likely the chemical source is in the cell. When the AUV is moving, it can get a series of plume as detected d and not detected
When the AUV is moving in the flow field, a

Method diagram of potential field.
The virtual force
where
The direction of joint force
The chemical source search method flow is shown in Figure 3. Firstly, the AUV will start a massive z-type search; when the detected chemical concentration is higher than the set threshold, the AUV will update the source probability map, and the APF algorithm will drive the AUV tracing plume toward the source position. In the tracing process, if the AUV did not detect the plume information in the last 30 s, it will change back to z-type search. If the AUV did detect plume information in the 30 s, and the distance of multiple detected plume is less than 5 m, the AUV can determine that the source position is confirmed, and the plume tracing task is considered to be success.

Behavior switching diagram of chemical source search.
Simulation experiment
The simulation experiment in this article is carried out in the underwater semi-physical simulation system. The structure of the simulation system is shown in Figure 4.

The structure of underwater semi-physical simulation systems.
The simulation system consists of three parts: an AUV motion monitoring system, a chemical plume environment simulation system, and a PC104 embedded computer. The PC104 is an embedded computer used on real underwater AUVs for chemical plume tracking path planning. The sensor information such as concentration and flow rate/flow direction detected by the AUV in the semi-physical simulation is calculated by the chemical plume environment simulation system. The motion monitoring system can display the information of the current motion state of the AUV, transmit the information to the chemical plume environment simulation system through the Ethernet, and use the obtained motion information to determine whether the AUV tracking process is accurate, as shown in Figure 5.

Underwater semi-physical simulation operation pictures.
This section represents the underwater simulation experiment for the method proposed in the article. The search area is set to 100 × 100 m2. Figure 6 shows the search area is divided into

Sketch of cell partitioning in simulation.
Figure 7(a)–(d) shows the source likelihood map obtained by the POMDP method at four different moments in the AUV tracking process.

The four different moments in the AUV tracking process. (a) AUV extensive tracing diagram, (b) AUV detected plume for the first time, (c) process of AUV tracing, and (d) process of AUV confirmed source.
In Figure 7(a), the AUV starts from the initial position (0 m, 25 m) and performs z-type motion. The color region is used to represent the source likelihood map. The higher the probability value, the darker the color. At the initial moment, the AUV obtains very little plume information, and the probability values of source existence are equal in most cells. The probability values decrease near the area that passes by itself.
In Figure 7(b), the AUV first detected the plumage information and immediately turned to the APF tracking behavior at t = 42 s. After detecting the plume information, the AUV updated the source likelihood map. The situation where the source may exist in most cells disappeared, and correspondingly, a dark area appeared in the upstream direction of the AUV. This area is the calculated source likelihood map at the current moment.
In Figure 7(c), with the increase of AUV acquisition information, the source likelihood map converges to a small area at t = 83 s. Figure 7(d) shows that the AUV confirms the source at t = 145 s. The position of the red point is the cell with the highest probability of the source existence. At this moment, the source likelihood map converges to a very small area, which makes it easy for the AUV to perform source localization.
The source localization method used in this article is to compare the distance information of the recently detected plume. When the distance of the detected plume is less than a certain threshold (5 m), it is determined that the location is the location of the source. The simulation results show that the proposed method can make the AUV accurately confirm the chemical plume source location.
To verify the stability and reliability of the proposed tracing method, it is compared with the mature bionics tracing method. The bionics tracing method has the advantages of simple calculation and high success rate. It is a widely used chemical plume tracing method.
Figure 8 shows bionics tracing process. It can be seen that the bionics method is mainly to imitate the process of bio-tracing food, detailed descriptions of the bionics tracing method can be found in the study by Farrell et al. 27

Process of plume tracing based on bionics.
This article compares the bionics tracing method with the tracing method proposed in this article from the following two aspects: (1) Tracing success rate refers to the probability of successfully confirming the source within the specified time. 250 s as the specified time is selected in this article. (2) Time spent in the tracking process refers to the time taken from the AUV’s first detection of chemical plume concentration information to the end of the source confirmed.
Through 60 simulation experiments, the success rate comparison diagrams of the two tracing methods are obtained. Figure 9(a) is the success rate of the tracing method proposed in this article and Figure 9(b) is the success rate of bionics tracing method.

Two tracing methods success rate comparison chart. (a) The success rate of POMDP and APF method and (b) the success rate of bionics tracking method. POMDP: partially observable Markov decision process; APF: artificial potential field.
In the 60 simulation experiments of the tracing method proposed in this article, 58 successful, 2 failures, the success rate is about 96%. In the two failed experiments, the reason for one failure was because the AUV did not detect the plume and was always searching over a wide range, for the other failure is because the AUV is always unable to confirm the source position.
In the 60 simulation experiments of the bionics-based tracing method, 55 successful, 5 failures, the success rate is about 92%. In the five failed experiments, the reason for two failures was because the AUV lost plume during tracing, and then it did not continue to trace the plume; for the other three failures was that the AUV has never detected plume information.
In 58 successful experiments based on the tracking method presented in this article, the tracking process took an average of 114 s. In 55 successful experiments based on bionics tracking method, the tracking process took an average of 139 s.
Comparing the performance indicators of two different tracking methods, we can get the following conclusions: The method proposed in this article has a good tracking effect. After the plume information is detected, the loss of the plume does not occur. The bionics method has two losses and no trace of the feather. The method presented in this article has a higher success rate (96%) than the bionics method, the bionics method also has a high success rate (92%). This method presented in this article has a faster tracking speed than the bionics method, and the lower average time consumption can save about 20% of the time.
Through comparison experiments, it can be concluded that the tracking method combining source likelihood map and APF planning method has better reliability and stability and is a more efficient tracking method.
Make full use of the chemical plume concentration and flow rate/flow direction information to construct a source likelihood map, guide the AUV to carry out the purposeful tracking movement, and greatly improve the tracking efficiency. This is an important reason why the tracking method proposed in this article has better performance. The bionics method has less computational complexity, and in the process of running the bionic method program, there is no jamming phenomenon caused by excessive calculation. So in the case of the AUV with a low-performance processor, it is also a good choice to use the bionics method for plume tracing.
Conclusion
In this article, a method based on POMDP to construct source likelihood map is proposed for AUV chemical plume tracing. The semi-physical simulation system is used to verify the proposed tracking method, and the experimental results are compared with the bionic tracking method. Experiments show that the POMDP source likelihood map method and the APF planning method can guide the AUV to find the source position and have a higher success rate and better stability than the bionic tracing method. The success of the algorithm can prove that the olfactory tracking method based on engineering theory is applicable in real environment. And because of the advantages of engineering-based method in using sensors and computing resources, it will achieve better results than bio-bionic tracing method.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work is supported by Harbin University of Commerce Funding (No.167274).
