Abstract
This article presents a distributed estimation method called compressed-combine-reconstruct-adaptive to estimate an unknown sparse parameter of interest from noisy measurement over networks based on compressed sensing. It is useful in some distributed networks where the robustness and low consumption are desired features. The compressed sensing theory is introduced in the distributed estimation to further reduce the communication load as the unknown parameter of interest is sparse in many situations. With the proposed method, each node compresses its estimation in a compressed dimension form. The nodes only exchange their compressed estimations to reduce the communication load over the network. Next, each node combines the compressed estimations of neighbors with its own compressed estimation using combination coefficients depend on the topology of the network. Then, the compressed estimations are reconstructed in full dimension form with a reconstruction algorithm. At last, the nodes update their estimations with normalized least mean square algorithm. The stability analysis of the proposed compressed-combine-reconstruct-adaptive method is illustrated in this article. Our method is compared with standard diffusion methods and communication reduced methods in simulations. The results show that the compressed-combine-reconstruct-adaptive method achieves nearly the same performance as the standard diffusion methods while reducing the communication load significantly, and with a better performance (network mean square error), network mean square error, steady-state mean-square deviation and steady-state mean-square deviation) than other communication reduced methods.
Introduction
In a network like wireless sensor network, sensor nodes collect data in a distributed way in applications such as target localization and tracking, environment monitoring, spectrum sensing and so on. An unknown common parameter of interest is the distortion of the collected regression data by noise, which occurs when the local copy of the underlying system input signal at each node is corrupt by various sources of impairment such as measurement or quantization noise. 1 A critical issue is how to estimate the unknown parameter from the obtained data from each node over the network.
To solve the problem of parameter estimation in a network has two main strategies: one is centralized strategy and the other is distributed strategy. In a centralized strategy, all the nodes send the information to a central node to process and estimate the unknown parameter. Since the distributed networks are usually limited with energy and the communications between nodes are multi-hop, distributed strategies have attracted more and more attention. In a distributed strategy, each node estimates the parameter based on its own local computation and the estimation information received from its neighbors without the need for the central node. 2 One approach of the existing distributed strategies can be classified into incremental,3,4 diffusion1,5–8 and hierarchical methods.9,10 The diffusion least mean square (LMS) method is the most popular strategy, and we focus on it in this article. Each node performs an LMS update after exchanging the estimation with its neighbors in a diffusion method. 6 Compared with the centralized strategy, it can achieve the scalability, robustness and low communication load. There are many distributed diffusion methods proposed in articles. In work, 5 a simple adaptive diffusion LMS method is illustrated. Cattivelli and Sayed 6 analyzed the performance of combine-then-adapt (CTA) and adapt-then-combine (ATC) diffusion methods in a distributed network. Fernandezbes et al. 8 use the normalize step-size in the adaptive stage to adapt the input signal. As most networks contain a large number of nodes and the dimensions of the unknown parameter may be incredibly high, the communication load of estimation is still considerable in a distributed diffusion method. To further reduce the communication load, reduced-communication diffusion LMS (RC-DLMS) method allows each node to receive the intermediate estimations of a subset of its neighbors. 11 The set-membership normalized LMS (SM-NLMS) 12 reduces the communication load with selective cooperation. In the study by Wang et al., 13 they use a intermittent diffusion method called adapt-thenintermittently-combine (ATIC). However, these methods reduce the communication load at the cost of the performance of parameter estimation. The other approach of distributed strategies is distributed consensus method. Distributed consensus strategies develop into an elegant procedure to enforce agreement among cooperating nodes. The work by Sayed and Tu 14 confirms that diffusion networks are shown to converge faster and reach lower mean-square deviation than consensus networks.
Since the unknown parameter of interest is sparse, containing only a few non-zero elements among many negligible ones in various applications,15–17 we introduce the compressed sensing (CS) theory in this article. The CS theory has received much attention in the researcher community. CS has been shown to bring significant performance gain in wide-ranging application from astronomy, biology, communications, image and video processing, medicine, and radar. 18 By CS, it is possible to reconstruct signal of interest with a number of samples which is far smaller than the desired resolution of signal. 19 In literature,15,20 the researchers apply the CS to parameter estimation in a full dimension case by adding another function to consider the sparsity of unknown parameters. Nevertheless, the communication load between nodes is not considered. Xu et al.21,22 implement the CS in parameter estimation while the input signal in their method needs to be compressed by the same sensing matrix of the estimation, and it is difficult to operate.
Considering the communication load in a distributed network, we propose a new distributed estimation method which called compressed-combine-reconstruct-adaptive (CCRA) in this article. First, with the CCRA method, each node translates its estimation in full dimension form into a compressed dimension form so that the nodes only need to transmit the estimation in compressed form. Then, each node combines the compressed estimations from neighbors with its local compressed estimation. Next, they reconstruct the compressed estimations in the full dimension form accurately with a reconstruction algorithm called Stagewise Orthogonal Matching Pursuit (StOMP). At last, each node performs a LMS update of estimation with a normalized step-size. Moreover, by using the proposed CCRA method, the communication load in the whole network has significantly reduced without affecting the estimation performance.
This article is organized as follows. In section “Problem statement,” we state the distributed estimation problem and define the cost functions. Then, the derivation of the diffusion solution method is presented. In section “The proposed CCRA method,” we introduce the CS theory and describe our CCRA method. In section “Stability analysis,” the stability analyze of the CCRA method is illustrated. In section “Stability analysis,” we provide the detail simulation results of a distributed network with 20 nodes to illustrate the performance of our method compared with the existing standard and communication reduction diffusion methods. In section “Conclusion,” we have a conclusion to this article.
Problem statement
Network model
In this article, we consider a network with N nodes which is in a topology connected with each other illustrated in Figure 1. The nodes are denoted by neighbors as they can exchange their information directly without transferring. A usual linear regression model is shown as follows

Model of a simple distributed network.
The node i output, a scalar measurement
Cost function
To achieve an estimation vector
where E denotes the expectation operator. Assuming the process
where
Distributed diffusion strategy
By the centralized LMS algorithm, the whole network information should be collected and processed in a central node. To send and transmit the information to central node, the communication burden is greatly increased. It is impractical in a distributed network due to the limited resources of nodes. Moreover, if some links fail and change in the network, the centralized algorithm will not have a good performance.
On the contrary, we introduce the distributed diffusion method to overcome these drawbacks. In a distributed estimation method, each node only needs to exchange the information with its neighbors to achieve the estimation. It is assumed that two nodes are connected if they can communicate with each other directly. The neighbor denoted by
The distributed strategy is commonly performed in two stages: adaption and combination. Based on the topology of the network, the estimations are combined with combination coefficients
where
where
In this article, we seek to estimate the parameter of
where
To simplify the update function, we expand the last item
where
In the same way, the expansion of the last term around
where
Then, we substitute equations (7) and (8) into equation (6). Since the combination coefficients satisfy equation (4), we have the function in equation (9).
The term of the big large brackets is a function of the
where
Equation (10) is regarded as the combination stage and equation (11) is the adaptive stage. The distributed diffusion LMS is shown in Figure 2.

Distributed diffusion strategy.
The proposed CCRA method
CS
By the strategy in Section 2, the amount of information exchange is reduced. The communication burden is still considerable in an energy-limited network since the
We consider the
Since both
As equation (12) is an underdetermined equation, it is hard to acquire the original vector
The solution is still difficult to solve because it is an non-deterministic polynomial (NP)-hard problem. We can replace
It is shown that the optimization solution in equations (13) and (14) are the same if the vector is sufficiently sparse.
24
There are many reconstruction algorithms to pursue sparse solutions with, which are proposed by researchers such as basis pursuit (BP),
24
orthogonal matching pursuit (OMP),
25
subspace pursuit (SP),
26
precognition matching pursuit (PMP)
27
and so on. In this article, we use Stagewise Orthogonal Matching Pursuit (StOMP) to reconstruct the signal which can reconstruct the compressed vector without setting its sparsity.
28
The reconstruction phase is denoted as
CCRA method
In the proposed method in this article, we add the compressed stage and reconstruction stage in the standard diffusion method. Our method can be denoted as the CCRA. At the first stage in instant time t, each node acquires its local compressed estimation by own calculation. The compressed stage in function is shown in equation (16)
Then, the information exchanges between the nodes are the compressed estimations that are described in equation (17)
Similar to equation (11), the compressed
Since the performance of the LMS algorithm depends strongly on the step size parameter
The proposed distributed CCRA method is shown in Figure 3.

CCRA method.
Compared with the standard CTA diffusion algorithm, there are two more stages. With the help of the compressed stage, the dimension of the exchange information is reduced from M x 1 to D x 1. The reconstruction stage is to insure that the adaptive stage works. Through the proposed CCRA method, the communication burden is significantly reduced while the estimation is obtained. Here is the process running of CCRA in Table 1.
Process running of CCRA.
Stability analysis
To analyze the stability of the whole network, we introduce the global state–space model with representation to describe it. The quantities of the whole network are defined as the priori estimation vector of the whole network
The compressed priori estimation vector
We get the global compressed stage representation:
The simple global update function of the linear regression network model without cooperation can be represented as in equation (20)
The global update function of the network with the proposed distributed CCRA can be illustrated as in equation (21)
With the StOMP algorithm, we can reconstruct the compressed estimation fully by choosing appropriate sensing matrix and parameter D, the stability in the mean of the function (21) is similar to that of equation (22)
The true parameter matrix
Let us denote the estimation error matrix
Since
Assuming that the regressions are temporal and independent, we take the expectation to the both sides of the equation
where
To achieve the stability in the main of the global network model, the function in equation (27) should hold
Let the data matrix denotes
While the network operates with cooperation, we should consider
So that
In the other words, the spectral radius of
Simulation results
To illustrate the performance of the proposed method in this article, we compare it with that obtained by other LMS methods such as DLMS, NDLMS (normalized version), ATIC 8 and RC-DLMS. 9 The considered network topology is in Figure 4 with N = 20 nodes.

Network topology in the simulations.
The input regressors are M = 32 vector and generated as sample vectors

Trace of regression matrices.

Noise variance.
For performance metric, we used mean-square error (MSE) of the whole network defined as

Network MSE performance.

Network MSD performance.
To evaluate the performance in the steady state, we average the data of the last 500 time instant as steady state. We define the steady state MSE of node i as

Steady state MSE performance per node.

Steady state MSD performance per node.
Since the DLMS and NDLMS method use a standard diffusion method in communication, ATIC, RC-DLMS and the proposed CCRA are designed to reduce the communication load in different ways. Figure 11 shows the communication load under our simulation for each method. It can be seen that the standard diffusion method has 12,800 dimensions to send and receive in the whole network. The ATIC and RC-DLMS can reduce the communication load to 6400 on average, while they almost achieve the performance of the standard method. Furthermore, the proposed CCRA method in this article reduces the communication load to 4800 without affecting the performance of the network.

Communication load of the network.
Conclusion
We present a distributed estimation method over network called CCRA to reduce the communication load based on CS. The derivation of diffusion algorithm is illustrated. In this method, only a compressed estimation is distributed between nodes in the network as the unknown parameter is sparse. We describe the whole network in a global state–space model and analyze the global stability in the proposed CCRA method. Compared with other existing methods in the simulation, we achieve the same or better performance (including network MSD, network MSE, steady-state MSD and steady-state MSD) and faster convergence by using the proposed CCRA while reducing the communication load significantly.
Footnotes
Handling Editor: Antonio Lazaro
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
