Cooperative Localization of Multi-UAVs via Dynamic Nonparametric Belief Propagation under GPS Signal Loss Condition

Abstract

Self-localization is critical for many unmanned aerial vehicles (UAVs) tasks such as formation flight, path planning, and activity coordination. Traditionally, UAV can locate itself using GPS combined with some inertial sensors. However, due to the complex flight environment or failure of the GPS receiver, the UAV may lose its GPS signal and fail to locate itself, resulting in devastating consequence. In this paper, we will consider the problem of cooperative localization among multiple UAVs, in which the UAVs with failure of GPS receiver can help each other to locate themselves through mutual information exchanged based on the relative distance measurements. Specifically, we propose a dynamic Nonparametric Belief Propagation (dNBP) algorithm to calculate the posterior distribution of UAV's position conditioned on all observations made in the whole UAVs group. The dNBP is a natural combination of NBP with particle filtering, suitable for treating with the nonlinear model and highly non-Gaussian distributions arising in our application. Furthermore, dNBP provides the basis for distributed algorithm in which messages are exchanges between neighboring UAVs. Thus, the computational burden is distributed across UAVs. Simulations in Matlab environment show the effectiveness of our method.

1. Introduction

Unmanned aerial vehicles (UAVs) have attracted significant interest for a wide range of applications. Among these applications, active cooperation of several low-cost small-size UAVs has important advantages. The basic requirement of many flight tasks of UAVs group, such as the formation flight [1, 2], coordinated rendezvous [3], coordinated path planning [4], and task coordination [5], is the correct localization of each member in the group. Traditionally, the navigation or localization method of UAV is based on integration of inertial navigation system (INS), global positioning system, and so forth. In the case of flying in complicated environment such as buildings, foliage and hilly terrain, or errors of GPS receiver, UAV may lose its GPS signal, leading to catastrophic effects during the flight mission.

Here, we classify UAVs in the group into two types: fault UAVs, in which GPS fails to provide location information, and normal UAVs, which have available GPS measurements. We assume that each UAV in the group, fault or normal, has limited resource for computation and communication and synchronized internal clock that allows the UAVs to share a common notion of time. We also assume that each UAV has a noisy distance estimate to its neighbors (i.e., the ones within certain range) from signal metrics measurement based on direct communication. The relative range measurements can be obtained using either ultra wideband (UWB) radio technology or optical systems [6].

In a group of UAVs, the one with failure of GPS receiver can manage to localize itself using classical least square estimator [7] based on the relative distance measurement with respect to nearby normal ones. See Figure 1; for example, the three black points represent three normal UAVs (UAV0, UAV1, and UAV2) whose positions measurements are available. The red points stand for the fault UAVs (UAV3 and UAV4) whose positions are unknown. UAV3 can independently determine its own position in 2-dimensional space based on the relative distance estimates with respect to the other three normal UAVs (UAV0, UAV1, and UAV2).

Figure 1

Scheme of multi-UAVs cooperative localization.

However, due to the limitation of signal transmission, the distance measurement between UAV1 and UAV4 may be not reachable or be too noisy to be useful, as they are too far away from each other. So, UAV4 is unable to localize itself without ambiguity based on the knowledge of positions of UAV0 and UAV2. But if the distance measurement between UAV3 and UAV4 is available, UAV4 can manage to determine its position by taking into account the estimated position of UAV3. In this way, UAV4 can localize itself even when it has less than three normal neighbors. In fact, this is achieved by propagation of the position knowledge of normal UAVs in the group. In this paper, we will formulate this knowledge propagation procedure in a statistical inference frameworks and propose a dynamic Nonparametric Belief Propagation algorithm for UAV localization, which treats the uncertainties in UAVs' state and their propagations in a systematical way and integrates all available information, including INS measurements on each UAV, GPS measurements on normal UAVs, and the distance measurement between pairs of UAVs, in a distributed manner.

There are several related works about the cooperative localization of UAVs. Qu et al. [8] proposed a UAV localization method based on information synchronization for a ring communication topological structure. Shames et al. [9] considered the problems of cooperative self-localization of mobile agents and proposed a solution based on the rigid graph theory. However, in the above approach, it is assumed that both the distance and the angle measurement are available, which is not true in many cases. Qu and Zhang [10] uses relative range measurements and GPS signals of 3 nearby healthy UAVs to cooperatively locate the UAVs with failure of GPS receiver. But it is not clear how to treat the ambiguity in localization when the neighbors of fault UAV include less than 3 healthy ones. There are also some works about the cooperative localization in sensor networks via inference on graphical model [11–13]. However, in these applications, the anchor nodes are static and fixed. This is in contrast with our work where the state of normal UAVs is also uncertain and the normal member may vary with time.

In this paper, we propose a novel approach for multiple UAVs cooperative localization; the main features of our work include the following. (1)

We formulate the cooperative localization as a probabilistic inference problem and propose a graphical model representing the joint distribution of involved random variables in UAVs group. The model is more flexible than [11–13]. In our model it allows the movement of anchors (here refer to the normal UAVs), and the variation of anchors member over time.

(2)

We propose a dynamic Nonparametric Belief Propagation algorithm (dNBP) for inference on the graphical model. By Belief Propagation, the posterior marginal of the position of fault UAV is calculated conditioned on all information made in the whole UAVs group up to current time. This is in contrast with traditional works, such as the works in [7, 10], where the fault UAV is located using only neighboring normal ones.

(3)

The proposed dNBP is a natural combination of traditional NBP with particle filter, which applies to dynamic case. In dNBP, posterior distribution of unobserved variables is approximated by set of representative samples, which makes it suitable for ring-shaped distribution produced by relative distance measurement; see Figure 4.

(4)

In dNBP, the distributions are updated by messages passing among UAVs and upon convergence; the position estimate of UAV is given by the belief state of the corresponding variables. This provides the basis for distributed implementation of information fusion in UAVs group.

(5)

We utilize several efficient sampling techniques in dNBP to reduce the computational time. Implemented by Matlab on PC in our lab, a single filtering step on individual UAV can be finished within few seconds.

This paper is organized as follows: in Section 2, we formalize the problem and discuss the types of uncertainty which occur in the localization problem. In Section 3, we present the dNBP algorithm in detail. Section 4 shows how the dNBP works in our simulations. Finally, we present a brief conclusion and the future plan.

2. Problem Formulation

In this section, we formulate the multi-UAV cooperative localization problem in a probabilistic framework. The position of the UAV is three-dimensional. But concerning the convenience of description, we will narrow the scope of position to two-dimensional geographical coordinates. However, it can also easily be extended to three-dimension occasion. We restrict our attention to the simplified cases in which each UAV is equipped with a 2-axis accelerometer and GPS sensor and assume it can obtain noisy distance measurements with respect to its nearby UAVs. It is also straightforward to extent to the UAV with more complex sensor configurations. Suppose that we have N UAVs in our system. The state of the sth UAV at time τ consists of its position and velocity $X_{s, τ} = (x_{s, τ}, {\dot{x}}_{s, τ})$ .

Under the above assumptions, the state of each UAV evolves according to the following linear Markov model:

\begin{array}{l} x_{s, τ} = x_{s, τ - 1} + {\dot{x}}_{s, τ - 1}, \\ {\dot{x}}_{s, τ} = {\dot{x}}_{s, τ - 1} + u_{s, τ} + v_{τ}, \end{array}

(1)

where u is the output of accelerometer and v is a zero-mean Gaussian process modeling the imprecision of accelerometer measurement. For normal UAV, at each time τ, a GPS observation is available. The observation model is

\begin{matrix} z_{s, τ}^{n} = x_{s, τ}^{n} + w_{τ}, \end{matrix}

(2)

where w is also a Gaussian noise. Here, we use the superscript n to denote the normal UAV. Note that the set of normal UAVs may change with time due to the varying flight condition.

Moreover, we assume that with some probability $P_{0}$ , which is a function of the relative distance, each UAV has some noisy measurement from its neighbors $t \in Γ (s)$

\begin{matrix} \begin{matrix} d_{s t, τ} = ∥ x_{s, τ} - x_{t, τ} ∥ + v_{s t, τ} \end{matrix} v_{s t, τ} ~ N (0, σ_{v}^{2}), \end{matrix}

(3)

where

v_{s t, τ}

is the distance measurement noise. Concerning the limitation of transmission technology, it is reasonable to assume that the probability of detecting nearby UAVs falls off with distance. Following [14], we model the decline of the probability as follows:

\begin{matrix} \begin{matrix} P_{0} (x_{s, τ}, x_{t, τ}) = \exp {- \frac{1}{2} {∥ x_{s, τ} - x_{t, τ} ∥}^{2} / R^{2}} \end{matrix} . \end{matrix}

(4)

Here, R is a parameter specifying the range of the transmitter used for distance estimation.

We use y to denote the available observations, and $y_{τ} ≜ {z_{s, τ}^{n}, d_{s t, τ}}$ . Our goal is to recursively calculate the posterior marginal distribution of $x_{s, τ}$ conditioned on all observations up to current time, that is, $p (x_{s, τ} ∣ y_{1 : τ})$ . We will use dNBP algorithm to solve this problem, which we will discuss in the following sections.

3. Inference Algorithm

3.1. Graphical Model

The graphical model for inference is shown in Figure 2. Each node is associated with a random variable representing the state of a single UAV at certain time $X_{s, τ}$ . The node with superscript n corresponds to the normal UAV. Note that the set of normal nodes varies with time. The dashed arrows between time slice model the evolution of state across time $p (x_{s, τ} ∣ X_{s, τ - 1})$ which is defined by the dynamic model (1). The edges within each time slice model the relationship among UAVs. An edge presenting between two nodes means that the relative distance measurement is available and the corresponding UAVs can communicate with each other. Note that the edges within each time slice appear according to the detection probability $P_{0}$ and also vary across time.

Figure 2

Graphic model for inference in cooperative localization.

The problem of inference on the graphical model, that is, computation of the posterior distribution (belief) of state variable of each UAV, can be divided into two parts. First, is how to compute the posterior marginal distribution of the state on each node conditioned on observations from a single time slice. We will use NBP to solve this problem in Section 3.2. Second, is how to incorporate the dynamic information contained in dynamic model and perform tracking, that is, recursively calculate the belief state $p (x_{s, τ} ∣ y_{1 : τ})$ . We will discuss this in Section 3.3.

3.2. Intraslice Update

In this section, we omit the subscript τ in expression for clarity. During intraslice update, the only involved state variable is the position $x_{s}$ . Note that the positions of different UAVs within a single time slice are correlated through the relative distance measurements according to the edge pattern shown in Figure 2. The joint distribution of the position of all UAVs in the group at current time is

\begin{matrix} p (x) \propto \prod_{s \in V} ψ_{s} (x_{s}) \prod_{(s, t) \in E} ψ_{s, t} (x_{s}, x_{t}), \end{matrix}

(5)

where V and E are the set of nodes and edges, and the local potential function for each node is defined as

\begin{matrix} ψ_{s} (x_{s}) = {\begin{cases} p (z_{s} ∣ x_{s}), & s is normal, \\ 1, & s is fault, \end{cases} \end{matrix}

(6)

where

p (z_{s} ∣ x_{s})

is obtained from the GPS observation model (2). The pairwise potential function is defined as

\begin{matrix} ψ_{s, t} (x_{s}, x_{t}) = p (d_{s t} ∣ x_{s}, x_{t}) \end{matrix}

(7)

which is obtained from the distance measurement model (3).

We will use the NBP algorithm [13] to calculate the marginal distributions of each UAV's position from the joint distribution (5). Nonparametric Belief Propagation is an iterative inference algorithm for graphic model, which is very effective for marginal distribution computation with respect to the global joint distribution. During each iteration, NBP consists of two steps: message propagation and belief computation.

For message propagation, the tth UAV sends a message $m_{t s}^{n}$ to its neighbors $s \in Γ (t)$ , which is defined as (here, n is the index for iteration)

\begin{matrix} m_{t s}^{n} (x_{s}) \propto \int_{x_{t}}^{} ψ_{s, t} (x_{s}, x_{t}) ψ_{t} (x_{t}) \prod_{u \in Γ (t) ∖ s} m_{u t}^{n - 1} (x_{t}) d x_{t} . \end{matrix}

(8)

For belief computation, each node produces an approximation $p^{n} (x_{s} ∣ y)$ to the true marginal distribution $p (x_{s} ∣ y)$ by combining the incoming messages with the local potential

\begin{matrix} p^{n} (x_{s} ∣ y) \propto ψ_{s} (x_{s}) \prod_{t \in Γ (s)} m_{t s}^{n} (x_{s}) . \end{matrix}

(9)

After convergence, the position estimate of each UAV is given by the beliefs on the corresponding node. Exact evaluation of (8)-(9) involves computations which is not analytically tractable for most continuous variables. So, we represent $m_{t s}^{n}$ by a mixture of M Gaussian distributions

\begin{matrix} m_{t s} (x_{s}) = \sum_{i = 1}^{M} w_{s}^{i} N (x_{s}; x_{s}^{i}, Σ_{s}) . \end{matrix}

(10)

The covariance

Σ_{s}

is chosen as

Σ_{s} = σ_{v}^{2} ξ_{M} I

, where

ξ_{M}

is a constant calibrated offline to the number of samples M.

The message update equations (8) are performed using stochastic approximations in two stages: first, drawing samples $x_{s}^{(i)}$ from $p^{n - 1} (x_{s})$ , then using these samples to approximate each outgoing message. We summarize the procedure in Algorithm 1. As $m_{t s}^{n}$ is a ring-shaped distribution, in Algorithm 1, we sample the message in the polar coordinates, with uniformly distributed orientation and normal radius perturbed from the noisy interdistance reading. Note that the detection probability $P_{0}$ plays its role in sample weighing in line (3).

Algorithm 1: Computation of outgoing message.

Input: M weighted samples ${w_{s}^{(i)}, x_{s}^{(i)}}$ from $p^{n - 1} (x_{s})$ ; Output: an approximation to $m_{s t}^{n} (x_{t})$ for each neighbor $t \in Γ  (s)$

( $1$ ) Draw $θ_{s t}^{(i)} ~ U  (0,2 π)$ , $v_{s t}^{(i)} ~ N (0, σ_{v}^{2})$

( $2$ ) Means: $x_{s t}^{(i)} = x_{s}^{(i)} + (d_{s t} + v_{s t}^{(i)}) [\sin (θ_{s t}^{(i)}) ; \cos (θ_{s t}^{(i)})]$

( $3$ ) Weights: $w_{s t}^{(i)} = P_{0} (x_{s t}^{(i)}) / m_{s t}^{n - 1} (x_{s t}^{(i)})$

( $4$ ) Variance: $Σ_{s t} = σ_{v}^{2} ξ_{M} I$

Since each outgoing message is a Gaussian mixture with M components, estimation of the marginal distribution (belief) (9) is more difficult. Because it is the product of several Gaussian mixtures, computing (9) exactly is exponential in the number of incoming messages. Following [15], we use a technique called mixture importance sampling to compute the belief. As shown in Algorithm 2, we denote the set of neighbors of t having observed edges to t by $Γ_{t}^{0}$ . We create a collection of kM weighted samples ( $k > 1$ ) by drawing samples from each incoming message.

Algorithm 2: Belief computation.

Input: several Gaussian mixture messages ${w_{u t}^{(i)}, m_{u t}^{(i)}, Σ_{u t}}$ , $u \in Γ_{t}^{0}$ ; Output: an approximation to belief $p^{n} (x_{t})$

( $1$ ) For each observed neighbor $u \in Γ_{t}^{0}$

( $2$ ) Draw $k M / | Γ_{t}^{0} |$ samples ${x_{t}^{(i)}}$ from message $m_{u t}^{n}$

( $3$ ) Weight by $w^{(i)} = ψ_{t} (x_{t}^{(i)}) \prod_{u \in Γ_{t}} m_{u t}^{n} (x_{t}^{(i)}) /  \sum_{u \in Γ_{t}}^{} m_{u t}^{n} (x_{t}^{(i)})$

( $4$ ) From these $k M$ locations, re-sample by weight M times to produce M equal-weight samples.

3.3. Interslice Prediction and Tracking

The dynamic information is incorporated into inference by interslice prediction. For interslice prediction, each node sends a forward-time message to its corresponding node in the next time according to the dynamic mode

\begin{matrix} m_{s, τ, τ + 1} (X_{s, τ + 1}) = \int p (X_{s, τ + 1} ∣ X_{s, τ}) p (X_{s, τ} ∣ y_{1 : τ}) d X_{s, τ} . \end{matrix}

(11)

The time-forward message summarizes our previous knowledge about UAV's state conditioned on all history observations. Under the mixture representation of message, the calculation in (11) can be implemented using Algorithm 3. In Algorithm 3, the covariance

Σ_{s, τ + 1}

is given by the estimate

Σ_{s, τ + 1} = ROT ({X_{s, τ + 1}^{(i)}, w_{s, τ + 1}^{(i)},})

, which is proportional to the weighted covariance of the observed samples

\begin{array}{l} Σ_{s, τ + 1} = M^{(- 2 / (δ + 4))} \sum_{i = 1}^{M} w_{s, τ + 1}^{(i)} (X_{s, τ + 1}^{(i)} - {\bar{X}}_{s, τ + 1}^{}) \\ \times {(X_{s, τ + 1}^{(i)} - {\bar{X}}_{s, τ + 1}^{})}^{T}, \end{array}

(12)

where

{\bar{X}}_{s, τ + 1}^{} = \sum_{i} w_{s, τ + 1}^{(i)} X_{s, τ + 1}^{(i)}

is the weighted sample mean, and

δ = \dim (X)

Algorithm 3: Outgoing messages computation for tracking.

Input: M samples $X_{s, τ}^{(i)} = {x_{s, τ}^{(i)}, {\dot{x}}_{s, τ}^{(i)}}$ from marginal $p_{s, τ} (X_{s, τ})$ ; Output: forward message $m_{s, τ, τ + 1} (X_{s, τ + 1})$

( $1$ ) Means: $X_{s, τ + 1}^{(i)} ~ p (X_{s, τ + 1} ∣ X_{s, τ}^{(i)})$

( $2$ ) Weights: $w_{s, τ + 1}^{(i)} = 1 / M$

( $3$ ) Variance: $Σ_{s, τ + 1} = R O T ({X_{s, τ + 1}^{(i)}, w_{s, τ + 1}^{(i)}})$

( $4$ ) Forward message: $m_{s, τ, τ + 1} (X_{s, τ + 1}) = \sum_{i = 1}^{M} w_{s, τ + 1}^{(i)} N (X_{s, τ + 1}; X_{s, τ + 1}^{(i)}, Σ_{s, τ + 1})$ .

Taking the forward-time messages (11) as prior distribution of UAV's state, we can combine it with the NBP update for tracking. During tracking, at time τ, each UAV integrates the time-forward message from itself at previous time $τ - 1$ , the messages from its neighbors at current time τ, and its local potential to compute its belief

\begin{matrix} p (X_{s, τ} ∣ y_{1 : τ}) \propto m_{s, τ - 1, τ} (X_{s, τ}) ψ_{s} (x_{s, τ}) \prod_{t} m_{t s, τ}^{n} (x_{s, τ}), \end{matrix}

(13)

where

m_{t s, τ}^{n}

are obtained from NBP described in Section 3.2. Note that the time-forward message

m_{s, τ - 1, τ}

is a distribution over

X_{s, τ} = (x_{s, τ}, {\dot{x}}_{s, τ})

, while the local potential

ψ_{s}

and the intraslice message

m_{t s, τ}

are distributions over position

x_{s, τ}

. To calculate

p (X_{s, τ} ∣ y_{1 : τ})

, we first decompose the time-forward message into marginal position distribution and the conditional distribution of velocity given position

\begin{matrix} m_{s, τ - 1, τ} (X_{s, τ}) = m_{s, τ - 1} ({\dot{x}}_{s, τ} ∣ x_{s, τ}) m_{s, τ - 1} (x_{s, τ}) . \end{matrix}

(14)

Then we calculate the posterior distribution over

x_{s, τ}

in (13), which is a product of density, using the same technique in Algorithm 2. Finally, the resulting position estimates are transferred to velocity via the conditional distribution in (14). The remaining issue is how to perform the decomposition in (14). Note that the time-forward message is a Gaussian mixture. Because the conditional distribution of a subset of jointly Gaussian variable is Gaussian, conditional distributions of Gaussian mixture are mixture of lower dimensional Gaussian. Consider the ith component in the joint mixture with weight

w_{s, τ}^{(i)}

, the corresponding mean and covariance are

\begin{matrix} μ_{X}^{(i)} = [\begin{bmatrix} μ_{x}^{(i)} \\ μ_{\dot{x}}^{(i)} \end{bmatrix}], Σ_{X}^{(i)} = [\begin{bmatrix} Σ_{x, x}^{(i)} & Σ_{x, \dot{x}}^{(i)} \\ Σ_{\dot{x}, x}^{(i)} & Σ_{\dot{x}, \dot{x}}^{(i)} \end{bmatrix}] . \end{matrix}

(15)

The marginal

m_{s, τ - 1} (x_{s, τ})

has the weight

w_{s, τ}^{(i)}

, mean

μ_{x}^{(i)}

, and covariance

Σ_{x, x}^{(i)}

. From standard formulas [16], the conditional density

m_{s, τ - 1} ({\dot{x}}_{s, τ} ∣ x_{s, τ})

has the mean, variance, and weight as

\begin{matrix} μ_{\dot{x} | x}^{(i)} = μ_{\dot{x}}^{(i)} + Σ_{\dot{x}, x}^{(i)} {(Σ_{x, x}^{(i)})}^{- 1} (x^{(i)} - μ_{x}^{(i)}), \\ Σ_{\dot{x} | x}^{(i)} = Σ_{\dot{x}, \dot{x}}^{(i)} - Σ_{\dot{x}, x}^{(i)} {(Σ_{x, x}^{(i)})}^{- 1} Σ_{x, \dot{x}}^{(i)}, \\ w_{\dot{x} | x}^{(i)} \propto w_{s, τ}^{(i)} (\frac{| Σ_{\dot{x} | x}^{(i)} |}{| Σ_{X}^{(i)} |}) \exp {- \frac{1}{2} σ_{0}^{T} {(Σ_{X}^{(i)})}^{- 1} σ_{0}^{}}, \end{matrix}

(16)

where

σ_{0}^{}

is defined as

\begin{matrix} σ_{0}^{} = [\begin{bmatrix} x_{s, τ}^{(i)} - μ_{x}^{(i)} \\ μ_{\dot{x} | x}^{(i)} - μ_{\dot{x}}^{(i)} \end{bmatrix}] . \end{matrix}

(17)

The whole tracking algorithm is summarized in Algorithm 4. For normal UAVs, (13) is just a slightly modified particle filtering. By combining information provided by temporal dynamic and GPS observation, normal UAVs can give more reliable position information and act as anchor nodes during cooperative localization in UAVs group. The fault UAVs use (13) to combine temporal information and distance measurement, which can greatly improve their localization accuracy.

Algorithm 4: Marginal distribution computation for tracking.

Input: incoming messages $m_{t s, τ}^{}$ , $m_{s, τ - 1, τ}^{}$ ; Output: an approximation to marginal distribution $p (X_{s, τ})$

( $1$ ) Decompose the Gaussian mixture: $m_{s, τ - 1, τ} (X_{s, τ}) = m_{s, τ - 1} (x_{s, τ}) m_{s, τ - 1} ({\dot{x}}_{s, τ} ∣ x_{s, τ})$

( $2$ ) Draw $k M / 2 | Γ (s) |$ samples ${x_{s, τ}^{(i)}}$ from each incoming messages $m_{t s, τ}^{}$ ;

( $3$ ) Draw $k M / 2$ samples from ${x_{s, τ}^{(i)}}$ from each incoming messages $m_{s, τ - 1} (x_{s, τ})$

( $4$ ) Weight each of the kM samples by

$w_{s, τ}^{(i)} = ψ_{s} (x_{s, τ}^{(i)}) \frac{m_{s, τ - 1} (x_{s, τ}^{(i)}) \prod_{t \in Γ (s)} m_{s t} (x_{s, τ}^{(i)})}{m_{s, τ - 1} (x_{s, τ}^{(i)}) + \sum_{t \in Γ (s)}^{} m_{s t} (x_{s, τ}^{(i)})}$

( $5$ ) Resample from these $k M$ samples, producing $k M$ equally weighted samples ${x_{s, τ}^{(i)}}$

( $6$ ) Draw kM samples ${\dot{x}}_{s, τ}^{(i)} ~ m_{s, τ - 1} ({\dot{x}}_{s, τ} ∣ {\hat{x}}_{s, τ})$ , with ${\dot{w}}_{s, τ}^{(i)} = 1 / k M$ , producing kM equally weighted samles $X_{s, τ}^{(i)} = {x_{s, τ}^{(i)}, {\dot{x}}_{s, τ}^{(i)}}$

( $7$ ) Resample M samples from theses $k M$ samples $X_{s, τ}^{(i)}$

3.4. Discussion

In practice, we find that, in the intraslice update, if the traditional parallel message passing schedule is used, in which every node transmits message to its neighbors in parallel with each iteration, the convergence behavior of NBP is very unsatisfactory. This is mainly because the message passing from the fault UAV to normal one, which may contain significant bias from true distribution, may corrupt the belief of the normal UAV and slow down the convergence of the algorithm. Therefore, in our implementation, as in [12], we use a modified message passing schedule in which a normal node never receive messages from fault ones, and any fault node only transmits an outgoing message if it has receive at least three incoming messages. In situation where no fault UAV has three incoming messages, the threshold drops to two messages and then finally drops to one.

It should be noticed that, in the dNBP algorithm presented above, there are only two kinds of operations: the local information processing on individual UAV and the mutual message exchanges between neighboring UAVs. So, it is a fully distributed algorithm and can scale well for large UAV group. For a single UAV, the most computational demanding portions are the sampling from Gaussian mixture and evaluating Gaussian mixture at a particular location. Both require $O (k M | E |)$ operations per timestep, where $| E |$ is the number of observed interdistance measurements. And the parameters k and M correspond to the number of samples used to represent the distributions. Note that we can further reduce the computational burden by only updating belief using GPS and inter-distance measurement every N timesteps, as the INS navigation is accurate enough for short time.

4. Results

4.1. Case Study

In this section, we consider a typical scenario to illustrate how our algorithm works for the cooperative localization of multiple UAVs.

As shown in Figure 3, the multi-UAVs team consists of 3 normal UAVs and 3 fault UAVs. $X^{F}$ and $X^{N}$ stand for the state of fault UAV and normal UAV, respectively. The solid edges between nodes represent messages passing between UAVs, the arrows represent the passing direction, and associated number of each direct edge indicates message passing order according to the schedule described in Section 3.4. For example, a directed edge from node 1 to node 4 with number 0 means that a message $m_{14}^{0}$ is sent from UAV1 to UAV4 at the 0th iteration of NBP at current timestep. The dashed arrows represent passing of time-forward messages (11) of individual UAVs. For clearness, we only show the arrow for UAV6.

Figure 3

Illustration of the dNBP.

Figure 4

Cooperative localization via dNBP.

Let us first consider the intraslice update in time t. Suppose at time t that UAV1, UAV2, and UAV3 are normal, and UAV4, UAV5, and UAV6 have no knowledge about their location. Thus at the iter = 0 of NBP, UAV4 receives messages from UAV1, UAV2, and UAV3. UAV5 receives messages from UAV 1 and UAV 2. UAV 6 just receives message from UAV 3. For UAV4, it can locate itself using the messages coming from the other 3 UAVs shown in Figure 4(a), where three blue ring-shaped distributions represent the messages from its neighbors (8) calculated by Algorithm 1 and the red single-modal distribution stands for its belief (position estimation) (9) calculated by Algorithm 2 concerning all the three incoming messages. From Figure 4(a), we can see that after one iteration the position of UAV4 is located successfully without ambiguity, while due to the lack of the incoming messages, the estimations of UAV5 and UAV6 represent two-modal distribution (Figure 4(b)) and ring-shaped distribution (Figure 4(c)), respectively.

At the iter = 1, UAV4 will transmit outgoing messages to UAV5 and UAV6, as shown in Figure 3. With the incoming message sent by UAV4, UAV5 is able to locate itself by combining the messages from the other two normal UAVs (UAV1 and UAV2) as shown in Figure 4(d). For UAV6, using message coming from UAV4 enables it to improve the estimation accuracy from ring-shaped distribution to multimodal distribution shown in Figure 4(e). At iter = 2, both UAV4 and UAV5 are able to send outgoing messages according to the message update schedule. Combining incoming messages from UAV3, UAV4, and UAV5, the localization of UAV6 is accomplished, with distribution changing from multimodal to single modal, as shown in Figure 4(f). In this case, all fault UAV can localize itself without ambiguity after 3 iterations of NBP.

Now, let us turn to the dynamic case. At time t, we use the linear Markov model (1) to calculate the state predictive distribution (11) of all UAVs at time $t + 1$ using Algorithm 3. For normal UAVs, this distribution is combined with GPS observation in $t + 1$ to improve the position accuracy, which is in fact a particle filtering process. For fault UAVs, the predictive distribution is combined with incoming messages sent from its neighbors in $t + 1$ for its localization. In this way, dNBP reduces the ambiguity of fault UAV's position estimate by integrating the information provided by both temporal dynamics and distance measurements at time $t + 1$ . Take UAV6 as an example. Suppose at time $t + 1$ that UAV6 can only communicate with UAV3 and UAV1, as shown in Figure 3. Figure 4(h) shows the ambiguity, in which green two-modal distribution represents the position estimation of UAV6, just considering messages coming from its two neighbors. But UAV6 can be located correctly at time t, and the predictive distribution of UAV6 at $t + 1$ is shown in Figure 4(g). In dNBP, we combine the information in Figures 4(g) and 4(h) using Algorithm 4, resulting the final belief of UAV6's position shown in Figure 4(i). We can see that the belief in Figure 4(i) has reduced ambiguity (from two-modal to single-modal) compared with distributions in Figure 4(g) and decreased covariance (from 29.6 to 14.6) compared with predictive distribution in Figure 4(h).

4.2. Monte Carlo Simulation

In this section, we will demonstrate the performance of our method implemented in Matlab by Monte Carlo simulations and compare it with the traditional cooperative localization method in [7], where the fault UAVs are located based on nearby normal ones only.

We consider the case in which $N = 36$ UAVs are flying in 2-dimensional space during t = 1 : 100. The true trajectories of each UAV are generated independently according to the pre-specified acceleration sequence with random initial position sampled in 500 × 500 area. We model the GPS state of each UAV with a first-order Markov process, switching between being fault and normal according to the following transition matrix:

\begin{matrix} [\begin{bmatrix} 0.9 & 0.1 \\ 0.1 & 0.9 \end{bmatrix}] . \end{matrix}

(18)

For normal UAV, the GPS observations are generated according to (2) and standard derivation of Gaussian is set to 10 meters. For any pair of UAVs, we first determine whether there exists a communication link based on the detection probability (4), where the transmitter range R is typically set to 300 meters. Then the inter distance observation is produced by (3), and the standard derivation is set to 3 meters. The accelerometer noise is set to 0.5 meter/sec².

In each simulation, the true trajectories of UAVs and observations data are generated as above, and then our dNBP and traditional cooperative localization algorithm [7] are applied. The main performance criterion is the mean position estimate error, which is defined as

\begin{array}{l} M E (τ) = \frac{1}{N} \sum_{s} ∥ {\hat{x}}_{s, τ} - x_{s, τ} ∥, \\ M E = \frac{1}{N T} \sum_{τ} \sum_{s} ∥ {\hat{x}}_{s, τ} - x_{s, τ} ∥ . \end{array}

(19)

Firstly, we compare our dNBP and traditional localization method under above parameters setting. Figure 5 shows the mean position estimate error varying with time in a typical run. The upper panel depicts the mean estimate error of traditional localization algorithm and dNBP. The lower panel depicts the proportion of the fault UAVs in the total UAVs group at each time instance. It is clear in Figure 5 that our method can improve the estimate accuracy significantly, especially in the case where the number of fault UAVs is large. This can be attributed to the fact that, in our method, by using message passing, the posterior distribution of each fault UAV's position is calculated conditioned on all history observations (i.e., the GPS measurements of normal UAVs and relative distance measurements between neighboring UAVs) made in the whole UAVs group. In contrast, in traditional method, the position of fault UAV is estimated based on the states of neighboring normal ones. But when the number of normal UAVs is small, such as during a time from 40 to 60 in Figure 5, there are only 12 normal UAVs in the group of 36 UAVs, and many fault UAVs have less than 3 normal neighbors. In this case, traditional method gives poor estimate of the fault UAV's position. It can also be seen from Figure 5 that, at the initial section (

t < 5

), traditional method is superior to dNBP slightly. The reason is as follows. In our algorithm, the state of UAV includes its velocity, allowing incorporation of information made by accelerometer into the inference process. So, the mean error curve of dNBP is much smoother than that of traditional method, as shown in Figure 5. However, at the initial phase, the velocity is not estimated accurately as we have no direct measurement of velocity, and so does the predicted position given by the dynamic model. Consequently, the inaccurate position prediction deteriorates the posterior position estimate, resulting in a higher estimate error of dNBP than traditional method in this case. But as the accuracy of velocity increases, it is clear that, after

t > 5

, our method outperforms traditional ones in general.

Figure 5

Mean estimate error versus time.

The localization performance is influenced by many factors, including the number of normal UAVs in the group, the UAV communication range, and the noise in GPS and relative distance measurements. Next, we investigate the behavior of the localization algorithms under varying operating conditions by changing one factor and fixing others. The criterion statistics in all results below are summarized from 100 Monte Carlo simulations.

As mentioned before, the normal UAVs serve as anchors during cooperative localization. Thus, we are interested in what happens in dNBP and traditional algorithms when the number of normal UAVs changes. Figure 6(a) shows the results. The x-axis is defined as the ratio of normal UAV number to total number. In these simulations, we disable the switching of GPS state to keep the number of normal UAVs fixed. We can see from Figure 6(a) that the performance of traditional cooperative localization algorithm is heavily dependent on the number of normal UAVs. When the normal UAVs become sparse in the team, many fault UAVs cannot locate themselves due to the lack of normal neighbors, resulting in the rapid decline in localization accuracy. In contrast, in dNBP, the knowledge of normal UAVs can propagate throughout the whole networks by Belief Propagation and the fault UAV can locate itself successfully even when no normal UAV exists nearby. However, in order to keep all the position estimations of the fault UAVs correct, having at least three normal UAVs in the team is a must.

Figure 6

Performance comparison. (a) Mean estimate error versus proportion of normal UAVs. (b) Mean estimate error versus communication range. (c) Mean estimate error versus GPS noise. (d) Mean estimate error versus relative distance noise.

The transmitter range R is another important factor influencing the algorithm performance. The specific connection pattern of the UAVs group at any timestep is determined by the actual position of UAVs and their possible communication distance. We observe the behavior of dNBP and traditional algorithm when UAV's transmitter range is varying and we depict them in Figure 6(b). It is clear that, when R is small, both algorithms perform very poorly due to the lack of information for localization. On the other hand, when R is large enough, all UAVs in the group can connect to at least 3 normal ones. Thus, both algorithms show good performance. But it is noticeable that dNBP outperforms traditional algorithm significantly when R is varying between 200 and 350 meters. In fact, in dNBP, the position knowledge can be exchanged between two unconnected UAVs by a sequence of short range knowledge propagations. In Figure 6(b), when the communication range is large enough (>500 m), almost each pair of UAVs in the group can exchange information mutually. Thus, it can be ensured that there are at least three normal ones in the neighborhood of each fault UAV. In this case, the least square method can obtain the optimal estimate. So, when the communication range is larger than 500 m, as shown in Figure 6(b), traditional method performs better than dNBP.

We also compare the performance of dNBP and traditional method under varying GPS and relative distance measurement noise. The results are shown in Figures 6(c) and 6(d), where the x-axis stands for standard derivation of GPS and relative distance measurement noise, respectively. As expected, the estimate error increases with the observation noise, and dNBP is always better than traditional one.

As discussed before, in dNBP, the messages are represented by a set of samples. The more the samples used, the more the representation and hence the inference are accurate. The dNBP is a distributed algorithm in which the computational burden is distributed to individual UAVs in the group. For each UAV, the computing time is determined by the number of samples used for representing messages and the number of its neighbors. Table 1 shows the tradeoff between estimate accuracy and computational complexity. The time in the table refers to the average time consumed by individual fault UAV in a single filtering step, including an intraslice update and an interslice prediction. We can see that using a highly unoptimized Matlab code, satisfactory localization result (error of 3 meters) can be achieved within few seconds using 300 samples for message representation. By code optimization on embedded system, reduction in measurements update frequency, and constraint in communicating neighbors, the computing time can be further reduced. This provides the basis for large scale real-time application.

Table 1

Tradeoff between accuracy and complexity.

M	100	200	300	400	500	1000
Time (s)	1.9	4.5	7.8	15.9	17.6	50
$M E$ (m)	12.21	6.21	3.02	2.40	2.09	1.63

5. Conclusion

In this paper, we propose a novel approach for cooperative localization of multiple UAVs using dNBP. The proposed algorithm can locate UAV with fault GPS successfully when the communication range is short and GPS observations in the UAVs group are sparse. In practice, the motions of different UAVs in the team are always correlated. In the future, we plan to incorporate this correlation into the graphical model and develop new inference algorithm for improving the localization performance further.

Footnotes

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

This work is supported by the National Natural Science Foundation of China under Grant no. 61174020.

References

Merino

Wiklund

Caballero

Moe

Martínez-De Dios

J. R.

Forsén

P.-E.

Nordberg

Ollero

Vision-based multi-UAV position estimation

IEEE Robotics and Automation Magazine 2006 13 3 53 62

2-s2.0-34047205766

10.1109/MRA.2006.1678139

Stipanović

D. M.

Inalhan

Teo

Tomlin

C. J.

Decentralized overlapping control of a formation of unmanned aerial vehicles

Proceedings of the 41st IEEE Conference on Decision and Control

December 2002

2829 2835

2-s2.0-0036989818

Beard

R. W.

McLain

T. W.

Goodrich

M. A.

Anderson

E. P.

Coordinated target assignment and intercept for unmanned air vehicles

IEEE Transactions on Robotics and Automation 2002 18 6 911 922

2-s2.0-0036966881

10.1109/TRA.2002.805653

Brintaki

A. N.

Nikolos

L. K.

Coordinated UAV path planning using differential evolution

Proceedings of the 13th Mediterranean Conference on Control and Automation

June 2005

Limassol, Cyprus

549 556

Sujit

P. B.

Sinha

Ghose

Multi-UAV task allocation using team theory

Proceedings of the 44th IEEE Conference on Decision and Control, and the European Control Conference (CDC-ECC '05)

December 2005

1497 1502

2-s2.0-33847193564

10.1109/CDC.2005.1582370

Jourdan

D. B.

Dardari

Win

M. Z.

Position error bound for UWB localization in dense cluttered environments

IEEE Transactions on Aerospace and Electronic Systems 2008 44 2 613 628

2-s2.0-48749107397

10.1109/TAES.2008.4560210

Reichenbach

Born

Timmermann

Bill

Distributed linear least squares method for precise localization with low complexity in wireless sensor networks

Proceedings of the IEEE International Conference on Distributed computing in Sensors System

2006

Zhang

Zhou

Cooperative localization of UAV based on information synchronization

Proceedings of the IEEE International Conference on Mechatronics and Automation (ICMA '10)

August 2010

225 230

2-s2.0-78649290219

10.1109/ICMA.2010.5589081

Shames

Fidan

Anderson

B. D. O.

Hmam

Cooperative self-localization of mobile agents

IEEE Transactions on Aerospace and Electronic Systems 2011 47 3 1926 1947

2-s2.0-79960109909

10.1109/TAES.2011.5937274

10.

Zhang

Cooperative localization against GPS signal loss in multiple UAVs flight

Journal of Systems Engineering and Electronics 2011 22 1 103 112

2-s2.0-79952551476

10.3969/j.issn.1004-4132.2011.01.013

11.

Wymeersch

Lien

Win

M. Z.

Cooperative localization in wireless networks

Proceedings of the IEEE 2009 97 2 427 450

2-s2.0-62949121723

10.1109/JPROC.2008.2008853

12.

Schiff

Sudderth

E. B.

Goldberg

Nonparametric belief propagation for distributed tracking of robot networks with noisy inter-distance measurements

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '09)

October 2009

1369 1376

2-s2.0-76249099774

10.1109/IROS.2009.5354772

13.

Ihler

A. T.

Fisher

J. W.

III Moses

R. L.

Willsky

A. S.

Nonparametric belief propagation for self-localization of sensor networks

IEEE Journal on Selected Areas in Communications 2005 23 4 809 819

2-s2.0-17144417306

10.1109/JSAC.2005.843548

14.

Moses

O. L.

Patterson

Self-calibration of sensor networks

Unattended Ground Sensor Technologies and Applicatons IV

April 2002

Orlando, Fla, USA

Proceedings of SPIE

15.

Doucet

Godsill

Andrieu

On sequential Monte Carlo sampling methods for Bayesian filtering

Statistics and Computing 2000 10 3 197 208

2-s2.0-0001460136

16.

Hyvarinen

Karhunen

Ojia

Independent Component Analysis 2001

John Wiley and Sons