Recently proposed distributed adaptive estimation algorithms for wireless sensor networks (WSNs) do not consider errors due to noisy links, which occur during the transmission of local estimates between sensors. In this paper, we study the effect of noisy links on the performance of the distributed incremental least-mean-square (DILMS) algorithm for the case of Gaussian regressors. More specifically, we derive theoretical relations which explain how the steady-state performance of the DILMS algorithm (in terms of mean-square deviation (MSD), excess mean-square error (EMSE), and mean-square error (MSE)) is affected by noisy links. In our analysis, we use a spatial-temporal energy conservation argument to evaluate the steady-state performance of the individual nodes across the entire network. Our simulation results show a good match with the derived theoretical expressions. The important result, however, is that unlike the ideal-link case, the steady-state MSD, EMSE, and MSE are not monotonically increasing functions of the step-size parameter when links are noisy. In addition, for a special case, the optimal step size that minimizes the steady-state values of MSD, EMSE, and MSE in each node is found in closed form.
1. Introduction
In many wireless sensor network applications, multiple displaced sensors are used to estimate and track an unknown parameter, for example, average temperature, level of water contaminants, or a target position [1, 2]. In general, parameter estimation in a WSN can be solved by either a centralized or a decentralized approach [3]. In a centralized approach, the spatially distributed sensors send their locally processed data to a fusion center, which forms the final estimate [4–6]. As the number of nodes increases, centralized processing becomes prohibitive, since it requires communications over longer ranges, which reduces battery life. In decentralized estimation, on the other hand, spatially displaced sensors provide local estimates by collaborating with other nodes in the network [7–9].
In some WSN applications, the parameter estimation task must be performed even though the statistical model for the underlying processes of interest is unavailable or changes over time. This issue motivated the development of a special class of decentralized approaches known as distributed adaptive estimation schemes [10–14]. In these schemes, every node is equipped with local computing ability to derive a local estimate and share it with its predefined neighbors. Using cooperative processing in conjunction with adaptive filtering per node enables the entire network, and also each individual node, to track variations in both the environment and the topology of the network.
Depending on the manner in which the nodes communicate with each other, these schemes may be classified as incremental algorithms or diffusion algorithms. In the incremental mode, a cyclic path through the network is required, and nodes communicate only with neighbors within this path [10–12]. The algorithms given in [10–12] use different adaptive filters in their structure, such as LMS, recursive least-squares (RLS), and affine projection. When more communication and energy resources are available, a diffusion cooperative scheme can be applied, where nodes communicate with all of their neighbors and no cyclic path is required. Both LMS-based and RLS-based diffusion algorithms have been considered in the literature [13, 14]. In [15], we considered the effect of quantization on the steady-state performance of the DILMS algorithm.
1.1. Problem Description
Analysis of distributed estimation algorithms in the presence of noisy links is an important physical-layer issue which has been considered for different algorithms in the literature [16–21]. Nevertheless, this issue has not been addressed for distributed adaptive estimation algorithms. Hence, in this paper, we study the performance of the DILMS algorithm in a WSN with noisy links between sensors. The importance of such a study arises from the following facts:
A WSN with noisy links is a more realistic assumption than a WSN with ideal links.
The performance of distributed adaptive estimation algorithms (e.g., DILMS) can vary significantly when they operate over noisy links, which makes it vital to analyze their performance.
Our aim in this paper is to derive theoretical expressions that describe the steady-state performance of the DILMS algorithm with noisy links. It must be noted that analyzing the DILMS algorithm in a WSN with noisy links is a challenging task, since an adaptive network comprises a system of systems that processes data cooperatively in both time and space in the presence of noisy links. To be more specific, the contributions of this paper are listed below.
1.2. Contributions
We show that the performance of distributed adaptive estimation algorithms degrades drastically when links are noisy.
We show that unlike the ideal-link case [10], the steady-state MSD, EMSE, and MSE curves are not monotonically increasing functions of the step size when links are noisy. This result is very important for real-world DILMS implementations and highlights the importance of our work.
For a special case, the optimal step size that minimizes the steady-state values of MSD, EMSE, and MSE in each sensor is found in closed form.
1.3. Paper Organization
The remainder of this paper is organized as follows. In Section 2, we introduce the DILMS algorithm. In Section 3, we present our analysis, and in Section 4 we discuss the derived theoretical results. Simulation results are given in Section 5, and finally, conclusions are drawn in Section 6.
1.4. Notation
Throughout the paper, we adopt boldface letters for random quantities and normal font for nonrandom (deterministic) quantities. The “*” symbol is used for both complex conjugation for scalars and Hermitian transpose for matrices. The main symbols used in this paper are listed in Table 1.
Symbols and their descriptions.
(·)^T: transposition
|a|²: absolute squared value of a
Re(a): real part of a
E[a]: statistical expectation of a
col{a, b}: column vector with entries a and b
‖x‖²_Σ: weighted squared norm of a column vector x, i.e., x*Σx
Tr(A): trace of matrix A
μ: step-size parameter in the DILMS algorithm
I: identity matrix
2. The DILMS Algorithm
Suppose that a WSN is deployed to estimate an unknown vector from measurements collected at N nodes in a network. Each node k has access to time realizations of zero-mean spatial data , where each is a scalar measurement and each is a row regression vector. We collect the regression and measurement data into global matrices as
then the estimation problem is formulated as
In the appendix, we introduce a motivating application where an estimation problem (2) arises. The optimal solution is given by [10]
where and . Note that the cost function (2) can be decomposed as [10]
Using this property, in [10], a distributed incremental LMS strategy with a cyclic estimation structure is proposed, as follows:
where indicates the local estimate at the node k and time i and indicates the overall estimate at iteration i. For each time i, each node utilizes the local data and received from the node to obtain . At the end of this cycle, is employed as both the global estimate and the initial condition for the next time instant. Note that to implement the DILMS, the time realizations are used. The update equation for DILMS in noisy link condition changes to
where the vector is the channel noise term on the link between sensor k and its preceding node in the cycle, which is assumed to be additive with zero mean and covariance matrix . No distributional assumptions are required on the noise sequence. To show the effect of noisy links on the performance of the DILMS algorithm, we consider a network with nodes. The observation noise has a variance of , and we assume for the channel noise. The curves are obtained by averaging over 300 experiments with (see Figure 1). As is clear from Figure 1, the performance of the DILMS algorithm degrades drastically when links are noisy.
The effect of noisy links on the performance of DILMS algorithm.
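The ideal-link update (5) and its noisy-link counterpart (6) described above can be sketched in standard incremental LMS notation. This is a reconstruction following [10], not a verbatim reproduction of the paper's displayed equations; the symbols ψ, u, d, and q stand for the local estimate, regressor, measurement, and channel noise defined in the text:

```latex
% Ideal links: node k updates the estimate received from node k-1
\psi_k^{(i)} = \psi_{k-1}^{(i)}
  + \mu_k\, u_{k,i}^{*}\bigl(d_k(i) - u_{k,i}\,\psi_{k-1}^{(i)}\bigr).

% Noisy links: node k receives a perturbed estimate, where the channel
% noise q_{k-1}^{(i)} has zero mean and covariance Q_{k-1}
\psi_k^{(i)} = \bigl(\psi_{k-1}^{(i)} + q_{k-1}^{(i)}\bigr)
  + \mu_k\, u_{k,i}^{*}\Bigl(d_k(i)
  - u_{k,i}\bigl(\psi_{k-1}^{(i)} + q_{k-1}^{(i)}\bigr)\Bigr).
```

The only difference from the ideal-link recursion is that every quantity that node k would compute from the incoming estimate is computed from its noise-perturbed version instead.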
3. Performance Analysis
3.1. Data Model and Assumptions
In order to pursue the performance analysis, we rely on the energy conservation approach of [10, 22]. As is commonly done in the adaptive filtering literature, we first need to assume a model for the data. In the subsequent analysis, the following assumptions are made.
The desired unknown vector relates to the via
where is white noise term with variance and is independent of for all .
is independent of for .
is independent of for .
is independent of for all .
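Assumptions (A.1)–(A.4) refer to the standard linear regression data model; a hedged reconstruction of it in the notation of [10] is

```latex
d_k(i) = u_{k,i}\, w^{o} + v_k(i),
```

where the measurement noise v_k(i) is white, zero mean, and independent of the regression data across both space and time, as stated above.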
3.2. Weighted Energy Conservation Relation
In steady-state analysis, we are interested in evaluating the MSD, EMSE, and MSE for every node k which are defined as
where , , and . Note that due to incremental cooperation (cyclic path), for we use in (8). We further define the weighted a priori and a posteriori local errors for each node k as follows:
Multiplying the previous equation from the left by and using the definitions in (9), we have
By replacing the from (11) and equating the weighted norms of both sides of the resultant equation, we arrive at the following relation:
We find from (13) that the cross terms cancel out. Equation (13) is a space-time version of the weighted energy conservation relation used in [10] in the context of regular adaptive implementations.
3.3. Weighted Variance Relation
In this section, we use the energy conservation relation to evaluate the steady-state performance of the DILMS algorithm in every node when the links between nodes are noisy. To this aim, we need to have a recursive equation for . To obtain such a recursion we replace from (12) into (13) to get
We have dropped the time index i for compactness of notation. Now, we can relate the to via
If we replace (15) into (14), take expectation of both sides, and use assumptions (A.1)–(A.4), we obtain
By considering the error definitions and , we can obtain the following relations:
Using the property and (17), we can expand (16) as
Recursion (21) is a variance relation that can be used to infer the steady-state performance at every node k. Note that depends solely on the regressors and is, therefore, decoupled from the weight error vector. For simplicity, in this work, we adopt the following assumption.
The regressors arise from a source with circular Gaussian distribution with covariance matrix .
We introduce the eigendecomposition , where is a diagonal matrix with the eigenvalues of and is unitary, that is, . Define the transformed quantities
Using the above definitions, (21) and (22) can be rewritten in the equivalent forms
To proceed, we need to evaluate the moments in (24) as follows:
where and for circular complex data and for real data. Replacing these moments, (24) can be written as
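The moment evaluation in (25) relies on the standard fourth-order moment result for circular Gaussian regressors (see [22]); a sketch in the transformed coordinates, with Λ denoting the diagonal eigenvalue matrix, is

```latex
E\left[\,\bar{u}^{*}\bar{u}\,\bar{\Sigma}\,\bar{u}^{*}\bar{u}\,\right]
  = \Lambda\,\mathrm{Tr}\!\left(\bar{\Sigma}\Lambda\right)
  + \gamma\,\Lambda\bar{\Sigma}\Lambda,
\qquad
\gamma =
\begin{cases}
1, & \text{circular complex data},\\[2pt]
2, & \text{real data}.
\end{cases}
```

This is the reason the weighting matrices can be kept diagonal throughout the derivation: the result of applying the moment to a diagonal matrix is again diagonal.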
Note from (27) that choosing to be diagonal, will be diagonal as well, suggesting a more compact notation. Thus, we introduce the column vectors
where the notation will be used in two ways: is a column vector containing the main diagonal of , and is a diagonal matrix whose entries are those of the vector λ. Therefore, using the diagonal notation, we obtain the following linear relation between the corresponding vectors :
where is a matrix that includes statistics of local data and given by
Using (30), can be rewritten in a more compact form as
For the sake of clarity, we reintroduce the time index i but drop the notation from the subscripts in (31) for compactness. Expression (31) becomes
We replaced by in order to indicate that the weighting matrix can be node dependent.
3.4. Steady-State Behavior
By comparing (34) with the similar equation for DILMS algorithm with ideal links (i.e., equation (55) in [10]), we can conclude that the desired steady-state MSD, EMSE, and MSE for DILMS algorithm with noisy links at node k can be expressed as
where
Note that in (38), the subscripts are all . It is evident that the effect of channel noise is the addition of an extra term to the corresponding expression for the ideal-link case.
4. Discussion on Derived Theoretical Results
An important result is that unlike the ideal links case, in the presence of noisy links, the MSD, EMSE, and MSE curves are not monotonically increasing functions of step size. To show this result more clearly and to make (35)–(37) analytically more tractable, we assume that
, , .
We further assume that μ is small enough so that can be approximated as
So, is now a diagonal matrix and as a result, matrix will be diagonal as well. For small μ, we have , then Π can be approximated as
so that
Similarly, using the assumptions in (A.6), we have
so that becomes
Now, replacing (43) and (45) into (35) and using , we obtain
similarly, we can find the following approximations for EMSE and MSE as
We can easily conclude from (46)-(47) that the MSD, EMSE, and MSE curves are not monotonically increasing functions of the step-size parameter. In Figure 2, we show the MSD as a function of μ when , , , and for different values of . As is clear from Figure 2, for (i.e., noiseless links), the MSD curve is a monotonically increasing function of μ.
The steady-state MSD (in dB) curve as a function of μ and for different values of .
Remark 1.
To explain this behavior, consider again the update (6). For small μ, the channel noise term is the dominant term in the update equation, and the steady-state performance deteriorates. As μ increases, the effect of the channel noise term decreases; finally, as μ becomes larger still, the steady-state performance deteriorates again, as in any adaptive algorithm.
Remark 2.
The optimal step size for MSD is given by
It must be noted that (48) is also the optimal step size for EMSE and MSE curves.
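The existence of an interior optimum can be seen from the structure of the small-μ approximations: gradient noise contributes a term that grows linearly in μ, while the accumulated channel noise contributes a term that decays as 1/μ. With α and β standing for the positive constants implied by (46)-(47) (placeholders, not the paper's exact coefficients), the minimization can be sketched as

```latex
\eta(\mu) \approx \alpha\,\mu + \frac{\beta}{\mu}
\quad\Longrightarrow\quad
\frac{d\eta}{d\mu} = \alpha - \frac{\beta}{\mu^{2}} = 0
\quad\Longrightarrow\quad
\mu_{\mathrm{opt}} = \sqrt{\beta/\alpha}.
```

For ideal links, β = 0, the 1/μ term vanishes, and η is monotonically increasing in μ, consistent with [10].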
Remark 3.
Note that according to the results given in [10], as the step size becomes smaller, the steady-state values of MSD, EMSE, and MSE in each node should become smaller as well; however, this is not the case in the presence of noisy links. In fact, μ must be chosen more carefully in real-world implementations.
5. Simulation
In this section, we provide computer simulations to compare the theoretical expressions with simulation results. To conduct our simulation results, we consider the following steps:
consider a network with noisy links and generate the measurement and regression data,
select a parameter vector (known to us but unknown to the DILMS algorithm),
let the DILMS algorithm estimate it over the WSN with noisy links using the data generated in step 1,
obtain the MSD, EMSE, and MSE simulation results,
apply the data generated in step 1 to our derived theoretical expressions,
finally, compare the resulting simulation results with the derived theoretical results.
To this aim, we consider a distributed network with nodes, and choose and .
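The simulation steps above can be sketched in a short script. This is an illustrative implementation, not the authors' code: the network size, step size, and noise variances below are hypothetical placeholders, and white Gaussian regressors are used for simplicity.

```python
import numpy as np

def dilms_noisy(w_o, N=15, M=4, mu=0.05, n_iter=2000,
                sigma_v2=1e-3, sigma_q2=1e-4, seed=0):
    """Simulate incremental LMS (DILMS) over a cycle of N nodes with
    additive channel noise on each inter-node link, and return the
    network steady-state MSD (average of the last 200 cycles).
    All parameter values are illustrative, not those of the paper."""
    rng = np.random.default_rng(seed)
    w = np.zeros(M)                        # global estimate at cycle start
    msd_hist = []
    for i in range(n_iter):
        psi = w                            # estimate travelling around the cycle
        for k in range(N):
            u = rng.standard_normal(M)     # white Gaussian regressor
            d = u @ w_o + np.sqrt(sigma_v2) * rng.standard_normal()
            # link noise: node k receives a perturbed estimate
            psi = psi + np.sqrt(sigma_q2) * rng.standard_normal(M)
            psi = psi + mu * u * (d - u @ psi)   # local LMS update
        w = psi                            # end of cycle: new global estimate
        msd_hist.append(np.sum((w_o - w) ** 2))
    return np.mean(msd_hist[-200:])

if __name__ == "__main__":
    w_o = np.ones(4) / 2
    print(dilms_noisy(w_o))
```

Comparing the returned MSD for `sigma_q2 > 0` against the noiseless case `sigma_q2 = 0` reproduces the qualitative effect shown in Figure 1: the channel noise raises the steady-state error floor.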
5.1. Regressors with Shift Structure
Although the analysis relied on the independence assumptions, the simulations presented in this subsection were carried out using regressors with shift structure in order to reflect realistic scenarios. The regressors are generated at each node k according to the following recursion:
The expression above describes a first-order autoregressive (AR) process with a pole at ; is a white, zero-mean Gaussian random sequence with unit variance or a uniform random sequence with unit variance, and . In this way, the covariance matrix of the regressors is a Toeplitz matrix with entries , with . The MSD, EMSE, and MSE are obtained by averaging the last 200 samples. Each curve is obtained by averaging over 100 independent experiments. The steady-state curves are generated by running the network learning process for 2000 iterations. We consider real data ().
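The shift-structure regressor generation described above can be sketched as follows. This is a hedged reconstruction: `a` and `sigma_u2` stand for the per-node AR pole and regressor power, whose actual values in the experiments are not reproduced here.

```python
import numpy as np

def shift_ar1_regressors(n_samples, M=4, a=0.5, sigma_u2=1.0, seed=0):
    """Generate regression vectors with shift structure whose scalar
    entries follow a first-order AR process with pole `a`. The driving
    noise is scaled so Var(u(i)) = sigma_u2, giving a Toeplitz
    covariance with entries sigma_u2 * a**|j| (illustrative values)."""
    rng = np.random.default_rng(seed)
    b = np.sqrt(sigma_u2 * (1.0 - a ** 2))   # scales unit-variance driving noise
    u = np.zeros(n_samples + M - 1)
    for i in range(1, len(u)):
        u[i] = a * u[i - 1] + b * rng.standard_normal()
    # row i holds the delay line [u(i), u(i-1), ..., u(i-M+1)]
    idx = (M - 1 + np.arange(n_samples)[:, None]) - np.arange(M)[None, :]
    return u[idx]
```

The empirical covariance of the generated rows approaches the stated Toeplitz structure as the number of samples grows, which is what makes this a convenient stress test of the independence assumptions.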
We assume that for node k, the covariance matrix is diagonal with different values on its diagonal, so
In fact, at each node k, the noise term is generated so as to yield the required (assumed) covariance matrix . The statistical profiles of the mentioned parameters are illustrated in Figure 3.
Node profile and channel noise information.
In Figures 4–6, the steady-state MSD, EMSE, and MSE are plotted, respectively. It is clear from Figures 4–6 that there is a good match between simulation and theory. Note also that despite the diverse statistical profile, the MSD in Figure 4 is roughly uniform over the network. On the other hand, the EMSE and the MSE are more sensitive to local statistics, as depicted in Figures 5 and 6.
Steady-state MSD versus node, .
Steady-state EMSE versus node, .
Steady-state MSE versus node, .
In Figures 7 and 8, the MSD and EMSE for different values of μ are plotted for node . We note that unlike the ideal-link case [10], the steady-state MSD and EMSE (and also MSE) are not monotonically increasing functions of μ.
Steady-state MSD versus μ, node .
Steady-state EMSE versus μ, node .
5.2. Independent Regressors
In this case, we assume that the regression data arise from an independent Gaussian distribution with eigenvalue spread . We assume for the channel noise. The observation noise variance and are shown in Figure 9. In Figures 10 and 11, the steady-state MSD and EMSE are plotted. It is clear from Figures 10 and 11 that there is a good match between simulation and theory.
The Observation noise power profile, (a) and (b) .
Steady-state MSD versus node, .
Steady-state EMSE versus node, .
6. Conclusions and Future Work
In this paper, we derived theoretical relations to predict the performance of the distributed incremental least-mean-square (DILMS) algorithm when the links between nodes are noisy. Starting with the weighted energy conservation relation, we derived a variance relation for our problem. The important result is that, unlike the ideal-link case, the steady-state MSD, EMSE, and MSE are not monotonically increasing functions of the step size when links are noisy. The simulation results revealed a good match between the derived closed-form expressions for the MSD, EMSE, and MSE at every node in the network and the simulation results. Note that although in this paper we focused on the LMS adaptive filter, the analysis can be extended to other adaptive filters. In our future work, we will consider more sophisticated cooperation modes (rather than the incremental mode), such as the diffusion mode.
Appendix
Consider a network with N sensors deployed to observe a physical phenomenon, such as temperature, humidity, or any other event, in a specified environment. At time i, the kth node collects a measurement (a time-realization datum). By assuming an autoregressive (AR) model for these measurements, we have
where is additive zero-mean noise and the coefficients are parameters of the underlying model. If we define the parameter vector
and the regression vector
then (A.1) at each node k can be rewritten as an equivalent linear measurement model
The objective becomes to estimate the model parameter vector from the measurements and over the network; the problem thereby takes the form of a system identification problem.
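The chain from the AR measurement model (A.1) to the equivalent linear model (A.4) can be sketched as follows (a reconstruction consistent with the definitions above; M denotes the assumed AR model order):

```latex
d_k(i) = \sum_{m=1}^{M} \alpha_m\, d_k(i-m) + v_k(i)
       = u_{k,i}\, w^{o} + v_k(i),
\qquad
w^{o} = \operatorname{col}\{\alpha_1,\ldots,\alpha_M\},
\qquad
u_{k,i} = \bigl[\, d_k(i-1)\;\; d_k(i-2)\;\cdots\; d_k(i-M)\,\bigr].
```

That is, the regressor at node k is simply the delay line of its own past measurements, so estimating the AR coefficients is exactly the estimation problem (2).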
Acknowledgment
This research was supported in part by the Iran Telecommunications Research Center (ITRC); this support is gratefully acknowledged.
References
1. D. Estrin, L. Girod, G. Pottie, and M. Srivastava, "Instrumenting the world with wireless sensor networks," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Salt Lake City, Utah, USA, May 2001, pp. 2033–2036.
2. I. F. Akyildiz, W. Su, Y. Sankarasubramaniam, and E. Cayirci, "Wireless sensor networks: a survey," Computer Networks, vol. 38, no. 4, pp. 393–422, 2002.
3. J. J. Xiao, A. Ribeiro, Z. Q. Luo, and G. B. Giannakis, "Distributed compression-estimation using wireless sensor networks," IEEE Signal Processing Magazine, vol. 23, no. 4, pp. 27–41, 2006.
4. J. Li and G. AlRegib, "Distributed estimation in energy-constrained wireless sensor networks," IEEE Transactions on Signal Processing, vol. 57, no. 10, pp. 3746–3758, 2009.
5. A. Ribeiro and G. B. Giannakis, "Bandwidth-constrained distributed estimation for wireless sensor networks—part I: Gaussian case," IEEE Transactions on Signal Processing, vol. 54, no. 3, pp. 1131–1143, 2006.
6. A. Ribeiro and G. B. Giannakis, "Bandwidth-constrained distributed estimation for wireless sensor networks—part II: unknown probability density function," IEEE Transactions on Signal Processing, vol. 54, no. 7, pp. 2784–2796, 2006.
7. Z. Q. Luo, "Universal decentralized estimation in a bandwidth constrained sensor network," IEEE Transactions on Information Theory, vol. 51, no. 6, pp. 2210–2219, 2005.
8. J. J. Xiao and Z. Q. Luo, "Decentralized estimation in an inhomogeneous sensing environment," IEEE Transactions on Information Theory, vol. 51, no. 10, pp. 3564–3575, 2005.
9. L. Xiao, S. Boyd, and S. Lall, "A space-time diffusion scheme for peer-to-peer least-squares estimation," in Proceedings of the International Conference on Information Processing in Sensor Networks, Nashville, Tenn, USA, 2006, pp. 168–176.
10. C. G. Lopes and A. H. Sayed, "Incremental adaptive strategies over distributed networks," IEEE Transactions on Signal Processing, vol. 55, no. 8, pp. 4064–4077, 2007.
11. A. H. Sayed and C. G. Lopes, "Distributed recursive least-squares strategies over adaptive networks," in Proceedings of the 40th Asilomar Conference on Signals, Systems, and Computers (ACSSC '06), Pacific Grove, Calif, USA, October–November 2006, pp. 233–237.
12. L. Li, J. A. Chambers, C. G. Lopes, and A. H. Sayed, "Distributed estimation over an adaptive incremental network based on the affine projection algorithm," IEEE Transactions on Signal Processing, vol. 58, no. 1, pp. 151–164, 2010.
13. C. G. Lopes and A. H. Sayed, "Diffusion least-mean squares over adaptive networks: formulation and performance analysis," IEEE Transactions on Signal Processing, vol. 56, no. 7, pp. 3122–3136, 2008.
14. F. S. Cattivelli, C. G. Lopes, and A. H. Sayed, "Diffusion recursive least-squares for distributed estimation over adaptive networks," IEEE Transactions on Signal Processing, vol. 56, no. 5, pp. 1865–1877, 2008.
15. A. Rastegarnia, M. A. Tinati, and A. Khalili, "Performance analysis of quantized incremental LMS algorithm for distributed adaptive estimation," Signal Processing, vol. 90, no. 8, pp. 2621–2627, 2010.
16. I. D. Schizas, A. Ribeiro, and G. B. Giannakis, "Consensus in ad hoc WSNs with noisy links—part I: distributed estimation of deterministic signals," IEEE Transactions on Signal Processing, vol. 56, no. 1, pp. 350–364, 2008.
17. I. D. Schizas, G. B. Giannakis, S. I. Roumeliotis, and A. Ribeiro, "Consensus in ad hoc WSNs with noisy links—part II: distributed estimation and smoothing of random signals," IEEE Transactions on Signal Processing, vol. 56, no. 4, pp. 1650–1666, 2008.
18. T. C. Aysal and K. E. Barner, "Constrained decentralized estimation over noisy channels for sensor networks," IEEE Transactions on Signal Processing, vol. 56, no. 4, pp. 1398–1410, 2008.
19. T. C. Aysal and K. E. Barner, "Blind decentralized estimation for bandwidth constrained wireless sensor networks," IEEE Transactions on Wireless Communications, vol. 7, no. 5, pp. 1466–1471, 2008.
20. S. Kar and J. M. F. Moura, "Distributed consensus algorithms in sensor networks with imperfect communication: link failures and channel noise," IEEE Transactions on Signal Processing, vol. 57, no. 1, pp. 355–369, 2009.
21. J.-F. Chamberland and V. V. Veeravalli, "The impact of fading on decentralized detection in power constrained wireless sensor networks," in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '04), vol. 3, Montreal, Canada, May 2004, pp. 837–840.
22. A. H. Sayed, Fundamentals of Adaptive Filtering, Wiley, Hoboken, NJ, USA, 2003.