Abstract
We study the problem of data gathering in wireless sensor networks and compare several approaches belonging to different research fields; in particular, signal processing, compressive sensing, information theory, and networking related data gathering techniques are investigated. Specifically, we derive a simple analytical model able to predict the energy efficiency and reliability of different data gathering techniques. Moreover, we carry out simulations to validate our model and to compare the effectiveness of the above schemes by systematically sampling the parameter space (i.e., number of nodes, transmission range, and sparsity). Our simulation and analytical results show that there is no single best data gathering technique for all possible applications and that the trade-off between energy consumption and reliability could drive the choice of the data gathering technique to be used. In this context, our model could be a useful tool.
1. Introduction
Wireless sensor networks (WSNs) are composed of many tiny, low-power, and cheap wireless sensors deployed in a geographic area to perform distributed tasks, for example, to monitor a physical phenomenon [1]. In 2004, MIT Technology Review ranked WSNs as the number one emerging technology [2], and today they are effectively employed in many applications, such as surveillance (e.g., real-time area audio or video surveillance), security (e.g., detection of biological agents or toxic chemicals), habitat monitoring (e.g., environmental measurement of temperature, pressure, or mechanical vibration), home automation, military systems, and, in general, scientific experiments.
In a typical WSN topology, we can distinguish between ordinary wireless sensor nodes and base stations named sinks. The sink is usually connected to a power supply and is capable of performing more complex operations than the ordinary nodes. Ordinary wireless sensor nodes, which transfer processed or raw sensed data to the sink, are instead, for economic reasons, usually powered by small batteries that in most application scenarios are difficult or even impossible to replace or recharge.
Thus, in contrast to many other wireless devices (e.g., cellular phones, PDAs, and laptops), the energy supply of a wireless sensor node is usually not expected to be renewed during the life of the WSN. For this reason, each sensor node is required to work under very low power consumption conditions.
In general, to design a highly energy-efficient WSN, it is extremely important to take into account capture, transmission, and routing issues, that is, the data gathering techniques that specify how ordinary sensors gather information and deliver it to the sink. As a consequence, data gathering is the main and most critical function provided by a WSN.
The main aim of this paper is to compare some of the state-of-the-art data gathering techniques considering their trade-off between reliability (i.e., packet loss and reconstruction error) and energy consumption (i.e., network lifetime) by taking into account both compression and networking aspects. To the best of our knowledge, this is the first paper that considers this type of comparison for data gathering techniques belonging to different research fields (i.e., signal processing, compressive sensing, information theory, and networking related techniques are discussed and compared in this paper). Specifically, we derive a simple analytical model able to predict the energy efficiency and reliability of several data gathering techniques. Moreover, we carry out simulations to validate our model and to compare the effectiveness of the above schemes by systematically sampling the parameter space (i.e., number of nodes, transmission range, and sparsity).
The rest of the paper is organized as follows. In Section 2, we present a summary of related work. In Section 3, further details about existing data gathering techniques are provided, highlighting their advantages and drawbacks. In Section 4, the simulation scenario used for comparisons is detailed and an analytical model able to predict the energy efficiency and reliability of different data gathering techniques is derived. In Section 5, the metrics used to compare data gathering techniques are introduced. In Section 6, simulation results are provided and the developed analytical model is validated. Finally, in Section 7, some concluding remarks and future works are drawn.
For the sake of clarity, symbols and notations used throughout the paper are reported in Notations section.
2. Related Work
In the past few years, several data gathering techniques have been proposed for WSNs with the main aim of reducing energy consumption by exploiting correlations among sensory data. We can divide them into two broad categories: compression-oriented and networking-oriented.
The first category, named compression-oriented, is focused on maximizing network lifetime by taking advantage of data compression techniques [3–10]. In particular, [3, 4] analyze different lossless compression schemes for WSNs exploiting the temporal correlation in the sampled signals; in [5, 6], the authors exploit spatial correlation by using distributed source coding techniques based on the Slepian-Wolf theorem; finally, [7–10] investigate the fundamental limits of data gathering techniques based on the new paradigm of compressive sensing [11, 12]. A comprehensive review of existing data compression approaches in WSNs is provided in [13].
Further details about compression-oriented data gathering techniques will be provided in Section 3. In particular, signal processing, compressive sensing, and information theory related techniques are discussed, respectively, in Sections 3.1, 3.2, and 3.3.
Since radio transmission is the primary source of power consumption in WSNs, a second category of data gathering techniques, named networking-oriented, has dealt with the problem of maximizing network lifetime by taking into account network protocols and, more specifically, forwarding/routing mechanisms [14–17].
In particular, in [14], the authors show how it is possible to maximize the lifetime of a WSN by exploiting routing algorithms. In [15], the authors study the problem of forest construction for maximizing the network lifetime and adopt a simple data aggregation model where an intermediate sensor can aggregate multiple incoming messages into a single outgoing message. A different approach is proposed in [16] where a smart splitting technique is used in order to achieve different trade-offs between reliability and energy saving. Finally, in [17], the authors address the problem of maximizing network lifetime by taking into account also latency and reliability.
However, with the exception of [15], the above papers do not perform any kind of data aggregation with the aim of not introducing extra delay.
As shown in [18–21], by combining data aggregation and routing mechanisms, efficient data gathering schemes can be obtained.
In particular, in [18], the problem of jointly optimized routing and data aggregation is investigated; in [19], the authors combine data compression and multipath routing techniques to obtain a reliable and low-latency data aggregation scheme; in [20], an energy-balanced data gathering and aggregating scheme is proposed, which integrates a clustering hierarchical structure with compressive sensing to optimize and balance the amount of data transmitted; finally, in [21], a data gathering technique based on the network coding paradigm is proposed.
Further details about the above works will be provided in Section 3.4.
Several other papers exist which compare signal processing techniques in WSNs from energy efficiency and network lifetime perspectives and several works highlight the effect of using different routing mechanisms for data gathering.
Nevertheless, techniques belonging to different research fields such as compressive sensing, information theory, and networking are seldom evaluated against one another and this is the main goal of this paper.
Only recently have such comparisons started to appear; for instance, in [22], lossy data aggregation techniques are evaluated and compared in terms of reconstruction errors and energy consumption. However, the authors consider neither the impact of network reliability (i.e., packet loss) nor networking-based data gathering techniques such as those based on the network coding paradigm.
The aim of this paper is to fill this gap by investigating the effectiveness of all the above data gathering techniques also in terms of reliability. In particular, an analytical model able to predict the energy efficiency and reliability of different data gathering techniques is derived.
3. Data Gathering Techniques
We can classify data gathering schemes on the basis of the research field from which the technique used to exploit correlation among sensor nodes is drawn, that is,
(i) signal processing;
(ii) compressive sensing;
(iii) information theory;
(iv) networking.
Techniques belonging to the above fields are discussed in the next subsections by highlighting their advantages and drawbacks.
3.1. Signal Processing Techniques
High spatial and/or temporal correlations among sensor readings frequently exist. In this case, it is inefficient to deliver the entire raw data to the destination [8, 9], and signal processing, in particular Transforms and Encoding Compression (TEC) techniques, can be exploited in order to reduce the amount of data to send.
In the case of local TEC techniques, each node collects measurements following the Shannon-Nyquist sampling theorem; these measurements are transformed and properly encoded, and the output of such transformation is stored in the payload of one or more packets and sent to the sink. In particular, either lossy or lossless techniques can be used depending on the particular application scenario.
With lossy techniques [3], the original data is compressed by discarding some of the original information; this allows achieving higher compression ratios, but at the receiver side one can only reconstruct the data with a certain accuracy.
However, in some types of monitoring, the accuracy of observations is critical for understanding the underlying physical processes. In other cases, it is not possible to have a priori knowledge about the magnitude of observational errors that are tolerable without affecting correct data gathering. Moreover, some application domains (e.g., body area networks (BANs), in which sensor nodes permanently monitor and log vital signs) demand sensors with high accuracy and cannot tolerate measurements corrupted by lossy compression processes.
In all these kinds of WSNs, lossless data gathering is essential and desirable. Examples of local lossless compression schemes have been proposed in [4, 23, 24].
Lossy compression techniques have been evaluated and compared in terms of reconstruction errors and energy consumption in [22]; therefore, in this paper we concentrate our attention on lossless techniques.
Due to space constraints, we do not consider distributed TEC techniques in this paper but refer the reader to [13]. Nevertheless, the major distributed approaches of signal processing applied to WSNs will be discussed in the next subsections.
3.2. Compressive Sensing
Compressive sensing (CS) is a new paradigm introduced by Candès and Tao [11] and Donoho [12] used to capture and to compress signals in WSNs, where compression and sampling are merged and carried out at the same time. Basically, CS compresses a signal while acquiring data at its information rate (without relying on the Shannon-Nyquist sampling theorem). CS theory states that if a signal is sparse or compressible in a certain basis, then it can be reconstructed from a small number of linear measurements by solving an ℓ1-minimization problem.
More precisely, let us define k-sparse signals: a signal x of length n is said to be k-sparse in a basis Ψ if x = Ψs and the coefficient vector s has at most k nonzero entries. CS acquires m linear measurements y = Φx, where Φ is an m × n sensing matrix.
Note that, considering that k is a small value in comparison to n, it follows that m can be much smaller than n and therefore high compression ratios can be achieved using CS (i.e., by transmitting CS measurements y instead of raw data x).
Reconstruction is achieved by solving a complex optimization problem of the following form: minimize ‖s‖₁ subject to y = ΦΨs.
Several algorithms exist which are able to solve the above optimization problem (Basis Pursuit [25], OMP [26], and CoSaMP [27], to name just a few) and several theoretical results describe when these algorithms recover sparse solutions. In particular, as proved in [28], a signal x can be recovered with high probability if Φ satisfies the Restricted Isometry Property (RIP). Formally, a matrix Φ satisfies the RIP if there exists a constant δₖ ∈ (0, 1) such that, for all k-sparse signals x, (1 − δₖ)‖x‖² ≤ ‖Φx‖² ≤ (1 + δₖ)‖x‖². Examples of matrices that satisfy the RIP condition are random Gaussian and Bernoulli matrices. As proved in [29], in the noise-free case, exact recovery with a Gaussian matrix can be obtained if m ≥ c·k·log(n/k) measurements are collected, for a suitable constant c.
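As a concrete illustration of the recovery step, the following sketch (assuming, for simplicity, that the signal is sparse in the canonical basis, i.e., Ψ = I, and that Φ is a random Gaussian matrix; the dimensions are illustrative) generates a k-sparse signal, takes m linear measurements, and recovers it with OMP [26]:

```python
import numpy as np

rng = np.random.default_rng(0)

n, k, m = 256, 8, 64          # signal length, sparsity, number of measurements

# k-sparse signal x (sparse in the canonical basis, i.e., Psi = I)
x = np.zeros(n)
support = rng.choice(n, size=k, replace=False)
x[support] = rng.standard_normal(k)

# Gaussian sensing matrix (satisfies the RIP with high probability)
Phi = rng.standard_normal((m, n)) / np.sqrt(m)
y = Phi @ x                    # the m linear CS measurements

def omp(Phi, y, k):
    """Orthogonal Matching Pursuit: greedily build the support of x."""
    residual, idx = y.copy(), []
    for _ in range(k):
        # pick the column most correlated with the current residual
        idx.append(int(np.argmax(np.abs(Phi.T @ residual))))
        # least-squares fit on the selected columns, then update residual
        coef, *_ = np.linalg.lstsq(Phi[:, idx], y, rcond=None)
        residual = y - Phi[:, idx] @ coef
    x_hat = np.zeros(Phi.shape[1])
    x_hat[idx] = coef
    return x_hat

x_hat = omp(Phi, y, k)
print("reconstruction error:", np.linalg.norm(x - x_hat))
```

With m on the order of k·log(n/k) measurements, recovery succeeds with high probability; reducing m well below this threshold typically makes the reconstruction fail.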
We will exploit the above results to derive a reliability model for CS.
CS can be applied in cluster-based WSNs considering that each sensor node in a cluster sends its reading to the cluster head, which computes the CS measurements y = Φx and forwards them to the sink.
On the basis of CS theory, under the sparsity condition, by collecting a sufficient number of CS measurements, the sink will be able to reconstruct the original sensor data x.
CS can also be applied in tree-based WSNs, considering that the source node includes in its packet sensing information which is the product of its acquired value and a random coefficient and then sends it to its next-hop node [30]. In such a way, CS compression can be performed with low complexity at source nodes [9] and data traffic over the network is reduced [8]. However, in the latter case, a high number of hops is needed to accumulate each CS measurement.
In [7], comparisons of CS-based and conventional signal processing techniques for WSNs have been carried out in terms of energy efficiency and network lifetime.
However, there are several challenges that must be addressed in order to use CS:
(i) Decoding Time. The decoding time for reconstruction can be high.
(ii) Reliability. CS-based techniques introduce nonnegligible losses (recovery errors), reducing reliability, and work best for large-scale networks (at least a thousand nodes).
(iii) Sparsity Assumptions. CS assumes that the sensed data has a known constant sparsity, ignoring that the sparsity of real signals varies in the temporal and spatial domains. In particular, the sparsifying basis Ψ is assumed to be given and fixed in time, but this is not the case in a realistic WSN scenario, where the signal of interest is unknown and its statistical characteristics can vary over time.
(iv) Quantization Effects. CS theory has mostly focused on real-valued measurements, but in practice measurements must be represented with a finite number of bits. As a consequence, a trade-off exists between the number of measurements m and the number of bits per measurement.
In this paper, we concentrate on the last two problems, by analyzing the effect of sparsity and quantization on energy saving and reliability. Further details on CS and how to exploit it for WSNs will be given in the next sections.
3.3. Information Theory Related Techniques
In order to exploit the correlation of data concurrently acquired by different sensors, DSC techniques, inspired by the Slepian-Wolf theorem, can be applied [2]. DSC techniques imply that each sensor node sends its compressed output to the sink for joint decoding. This means that the nodes need to cooperate in groups of two or three, so that one node provides the side information and another one can compress its information down to the Slepian-Wolf or the Wyner-Ziv limit. Furthermore, DSC approaches are also difficult to apply in such scenarios since they work under the assumption that the statistical characteristics of the underlying data distribution are known in advance [9, 31].
The most practical and well-known implementation of DSC is DISCUS [5, 32], where sensor nodes are divided into clusters. For each cluster, a node (the cluster head) sends uncompressed data (as side information) while all other nodes transmit encoded (i.e., compressed) data.
To encode data, a sensor node firstly divides all possible values into disjoint sets (named bins) so that values in the same bin have a minimum distance d. Each piece of sensory data is then compressed with a code that identifies the unique bin where the sampled value lies.
To better explain how DISCUS works let us consider a simple example.
Let us suppose that (quantized) measurements are integers in the range [0, 7], so that 3 bits are needed to represent each raw value. The eight possible values can be divided into four bins {0, 4}, {1, 5}, {2, 6}, and {3, 7}, each identified by a 2-bit index; within each bin, the minimum distance between values is d = 4.
In the above example, only the cluster head transmits 3 bits for each measurement while for all the other nodes 2 bits are enough; therefore, compression is achieved.
However, DSC relies on the assumption that the statistical characteristics (i.e., the correlation function) of the underlying data are known a priori, which is difficult to obtain in practical scenarios. For instance, the simple DISCUS scheme discussed above works only if the difference between the value sampled by the cluster head and the values sampled by all the other nodes in the same cluster is less than d/2 = 2.
A simple way to improve reliability is to retransmit cluster head packets multiple times, but this reduces compression efficiency. Therefore, a trade-off exists between energy consumption and reliability on the basis of the maximum allowed number of retransmissions. In this paper, we investigate such a trade-off.
3.4. Networking Techniques
3.4.1. Routing-Based Techniques
Since radio transmission is the primary source of power consumption at the nodes, the design of energy-efficient routing is another important topic in the design of data gathering techniques. The basic idea is to route packets through paths that minimize the overall energy consumption for delivering a packet from the source to the destination. The problem focuses on computing the flow and transmission power to maximize the lifetime of the network, which is the time at which the first node in the network runs out of energy [18]. Specifically, the energy consumption rate per unit of information transmission for each node depends on the choice of the next hop, that is, the routing decision. This choice can influence the energy required to reach the sink [14].
One of the most recent works which addresses the problem of maximizing network lifetime taking into account the routing mechanism is [17]. The authors try to achieve both low latency and high reliability. They construct a data gathering tree based on a reliability model, schedule data transmissions for the links on the tree, and assign transmitting power to each link accordingly. However, they do not perform any kind of data aggregation or data compression with the aim of not introducing extra delay.
Data aggregation can be performed on top of the routing algorithm. The aggregation function is usually performed by extracting some statistical values (e.g., maximum, minimum, and average) and then transmitting only these [15]. In this way, it is possible to reduce the amount of communicated data in dense sensor networks and reduce power consumption. However, this technique loses much of the structure of the originally acquired data.
In particular, the authors of [15] study the problem of forest construction for maximizing the network lifetime. They adopt a simple data aggregation model and assume that an intermediate sensor can aggregate multiple incoming B-bit messages, together with its own message, into a single outgoing message. Moreover, they provide a polynomial time algorithm to build the tree and demonstrate that it is close to optimal.
In [14, 33, 34], the authors consider the problem of maximizing the lifetime of WSNs through routing algorithms, recasting it as a linear programming problem solvable in polynomial time. The proposed algorithm is a shortest-cost-path routing whose link cost is a combination of the transmission and reception energy consumption and the residual energy levels at the two end nodes.
In [18], the authors try to jointly optimize routing and data aggregation so that the network lifetime can be extended considering two dimensions. In the first dimension, the traffic across the network is reduced by data aggregation, so that one can reduce the power consumption of the nodes close to the sink node. In the second dimension, the traffic is balanced to avoid overwhelming the bottleneck nodes. A smoothing function is used to approximate an original maximization function by exploiting the special structure of the network. The necessary and sufficient conditions for achieving the optimality of this smoothing function were derived and a distributed gradient algorithm was accordingly designed.
Yang et al. propose in [35–37] a joint design of energy replenishment and data gathering by exploiting mobility. The SenCar, a multifunctional mobile entity, periodically chooses a subset of sensors to visit based on their energy status. It utilizes wireless energy transmissions to deliver energy to the visited sensors and, meanwhile, it collects data from nearby sensors via short-range multihop communications and can convey this data to the sink.
3.4.2. Network Coding Based Techniques
A different approach exploits the network coding (NC) paradigm.
NC is an effective information transmission approach originally introduced by Ahlswede et al. [38] to improve network capacity of multicast networks.
Differently from the classical store-and-forward network paradigm, where nodes simply replicate and forward incoming packets, with NC intermediate nodes in the network have the ability to forward functions of received packets (e.g., linear combinations). In this manner, throughput gain, robustness, and energy saving can be achieved by exploiting the fact that each newly generated packet carries information contained in several original packets [39].
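To make the idea concrete, the following toy sketch combines packets by XOR using random coefficient vectors over the binary field GF(2) and decodes at the sink via Gaussian elimination (the packet sizes, redundancy level, and choice of GF(2) are illustrative assumptions, not a specific scheme from the literature):

```python
import numpy as np

rng = np.random.default_rng(1)

n = 4                                                  # original packets
packets = rng.integers(0, 256, size=(n, 8), dtype=np.uint8)  # 8-byte payloads

def encode(num_coded):
    # each coded packet carries its GF(2) coefficient vector in the header
    coeffs = rng.integers(0, 2, size=(num_coded, n), dtype=np.uint8)
    coded = np.zeros((num_coded, packets.shape[1]), dtype=np.uint8)
    for i, c in enumerate(coeffs):
        for j in np.flatnonzero(c):
            coded[i] ^= packets[j]                     # XOR combination
    return coeffs, coded

def decode(coeffs, coded):
    # Gaussian elimination over GF(2); fails unless rank == n
    # (this is exactly the "all-or-nothing" behavior of NC)
    A, B = coeffs.copy(), coded.copy()
    row = 0
    for col in range(n):
        pivots = [r for r in range(row, len(A)) if A[r, col]]
        if not pivots:
            return None                                # not yet decodable
        p = pivots[0]
        A[[row, p]], B[[row, p]] = A[[p, row]], B[[p, row]]
        for r in range(len(A)):
            if r != row and A[r, col]:
                A[r] ^= A[row]
                B[r] ^= B[row]
        row += 1
    return B[:n]

coeffs, coded = encode(n + 2)       # a little redundancy helps reach rank n
recovered = decode(coeffs, coded)
```

With fewer than n linearly independent coded packets, `decode` returns nothing at all; with enough of them, every original packet is recovered exactly.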
NC has received increasing attention also in WSNs as a promising tool to improve network lifetime and reliability by exploiting the broadcast nature of the wireless channel [40].
However, most of the proposed techniques developed so far [41–43], whilst being useful for data dissemination (e.g., traffic from the sink node to the sensor nodes), cannot be applied for data collection, which is the most important traffic in WSNs.
In fact, to apply NC for data gathering in WSNs, some issues have to be solved.
(i) Header Overhead. NC schemes are mostly based on random linear codes [44, 45], which allow implementing them in a distributed manner but introduce large overhead because the coefficients used for the linear combinations must be specified in the packet header. The header size is proportional to the number of aggregated packets, which, in the specific case of data collection in WSNs, could be equal to the number of nodes in the network.
(ii) All-or-Nothing Problem. When n packets are combined using NC, the sink has to receive at least n (linearly independent) coded packets in order to be able to recover the original information. Thus, even if the sink receives n − 1 of them, it cannot recover any of the original packets.
(iii) Delay. The delay introduced by NC might be prohibitive for large networks where a large number of packets should be combined and decoded. Instead, many sensor networks applications, for instance, WSNs developed for control/automation or real-time audio/video streaming, require small bounded delays.
(iv) Duty Cycling. Most of NC schemes are based on overhearing; that is, nodes should remain in active mode to participate in NC-based routing, which increases the energy consumption of the sensor nodes. So, it is difficult to couple NC paradigm and duty-cycling techniques commonly used in WSNs.
(v) Reliability. Full (or at least high) reliability is desirable in sensor networks and is mandatory in several scenarios, for instance, in new scientific experiments, where accuracy of observations is critical, or in the case of biomedical applications, where it is necessary to ensure that important details are not lost causing errors in medical diagnosis. When random codes are used, even in a reliable network, the original messages can be retrieved with “high probability” (though not “certainty”), and high probability is achieved through the use of large finite fields (i.e., large coefficients and therefore large headers).
(vi) Complexity. NC techniques should be simple to cope with low computational and memory resources of sensor nodes.
We refer the reader to [46] for further considerations on the applicability of NC to WSNs.
The above issues have been solved in [16] where the authors proposed a new forwarding technique for WSNs based on the Chinese Remainder Theorem (CRT) able to achieve different trade-offs between reliability and energy saving.
Basically, CRT can be seen as a splitting technique able to transform an integer number Z into a vector of smaller numbers named CRT components, obtained as the remainders of Z modulo a set of pairwise coprime (prime) numbers.
CRT states that every integer number Z can be exactly recovered from its CRT components provided that the product of the prime numbers is greater than Z.
The coefficients needed for reconstruction depend only on the chosen set of primes and can therefore be precomputed by the sink.
CRT can be applied in WSNs to split the packets produced by sensor nodes. Such smaller packets (i.e., CRT components) can be sent through different paths by exploiting the path diversity of WSNs. The fact that relayer nodes forward smaller packets allows reducing energy consumption [47].
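A minimal sketch of CRT splitting and reconstruction (the prime set below is purely illustrative):

```python
from math import prod

primes = [13, 17, 19, 23]            # pairwise coprime moduli (assumed set)

def crt_split(Z):
    # CRT components: the (small) remainders of Z modulo each prime
    return [Z % p for p in primes]

def crt_reconstruct(components):
    # classic CRT reconstruction; exact only if Z < prod(primes)
    P = prod(primes)
    Z = 0
    for r, p in zip(components, primes):
        Pi = P // p
        Z += r * Pi * pow(Pi, -1, p)  # pow(Pi, -1, p): modular inverse of Pi
    return Z % P

Z = 31415                             # must be < 13 * 17 * 19 * 23 = 96577
print(crt_split(Z), crt_reconstruct(crt_split(Z)))
```

Each component needs only enough bits to represent a value below its prime, so relayers forward packets far smaller than the original Z.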
Moreover, CRT has several advantages in comparison to other NC techniques:
The set of prime numbers, differently from the coefficients used for NC techniques, can be obtained directly by the sink (i.e., CRT avoids header explosion). Moreover, CRT can be efficiently combined with duty-cycling techniques [48] and distributed compression algorithms [19] to achieve an efficient data aggregation technique.
Considering the above advantages and the fact that this paper is focused on data gathering techniques for WSNs, we will consider CRT as representative of networking-based data gathering techniques.
4. Simulation Scenario
In this section, we will discuss the WSN model used for comparisons and simulations of data gathering techniques.
4.1. Network Model
We assume a WSN where the sink is located in the center of a square sensing area of size
The network is partitioned into nonoverlapped clusters using the procedure described in [16, 49]. The above-mentioned procedure is mainly based on the exchange of Initialization Messages (IMs) and allows organizing the network in clusters minimizing the number of hops needed by a sensor node to reach the sink. The sink is supposed to belong to cluster 1 (denoted as
We assume that the above initialization procedure is carried out only once, so we neglect the related energy consumption.
In the following, nodes along the path from a source to the sink are referred to as relayers and nodes located one hop away from the source along the path to the sink are specifically called one-hop relayers.
We assume that, independently of the specific data gathering technique, relayers transmit packets through a load-balancing shortest-path scheme; that is, a node in cluster
4.2. Data Gathering Model
Until now, we have classified data gathering techniques on the basis of the data aggregation technique used. However, data gathering techniques can be classified also considering the factors that drive data acquisition. In particular, four broad categories can be distinguished [50]: event-driven, time-driven, query-based, and hybrid.
In the event-driven category, data are generated when an event of interest occurs, while in the time-driven category data are periodically sent to the sink at constant intervals of time; in the query-based category, data are collected according to sink requests. Finally, the hybrid approach is a combination of one or more of the above.
For simulation purposes, with the aim of evaluating energy consumption and reliability, all the above categories can be unified through an abstraction of the concept of event, that is, by simply considering that data must be sent as a consequence of an event.
In particular, in query-based networks an event is triggered by the reception of the query message, while in time-driven networks the event can be associated with the rising clock edge of the sampling unit or, more practically, with the instant at which a sufficient number of measures has been collected and a packet is ready to be transmitted.
Considering the above abstraction, energy consumption and network reliability can be evaluated in terms of the number of events (i.e., packets sent), without taking into account what drives the event.
Therefore, in our simulation scenarios we will consider that
In event-driven networks, usually small packets are sent to specify that an event has been detected (a single w-bit word could be sufficient in most cases). Instead, in the case of time-driven or query-driven networks, packets represent M measures collected in the time interval between two events. Both cases can be taken into account considering that for each event raw information of
With the aim of reducing energy consumption (i.e., the overall number of bits sent), raw data are not directly transmitted; instead, data are processed according to the chosen data gathering technique.
More precisely, we have the following.
(i) TEC. Nodes using TEC techniques exploit temporal correlation to reduce the number of bits.
Here we do not consider a specific TEC technique but assume that the compression factor,
As already stated, we assume that packets are transmitted through a load-balancing shortest-path scheme; that is, a node in cluster
(ii) DSC. According to DISCUS, we assume that only one node for each cell (henceforward named the cell head) sends uncompressed measures (i.e., side information) into a packet of
So considering that
(iii) CS. We assume that the cell header collects the packets of the other nodes in the same cell and sends them by applying CS. More precisely, the collected measures can be represented by a matrix
For simulation purpose, we suppose that CS measurements are represented by
So the overall number of bits transmitted by the cell head for each event when CS is used is
Other choices are possible without altering the overall number of transmitted bits
(iv) CRT. CRT is exploited as shown in [16].
In particular, we suppose that for each event
As in the case of CS, the collected measures can be represented by a matrix
CRT relayers process the data of each column in two steps. In the first step, the received data are compressed with a compression algorithm, obtaining a binary sequence S. In the second step, CRT is applied to improve reliability by splitting the binary sequence S so that each CRT relayer forwards a CRT component.
It is worth noting that each CRT relayer independently compresses the received packets, obtaining the same sequence S: since all relayers receive the same data set and apply the same compression algorithm, the compressed sequence S is identical at every relayer.
Henceforward, we indicate by
CRT relayers split the binary sequence S they have constructed and forward it. Specifically, the sequence S is interpreted as an integer
Note that
From the theory of CRT, the sink will be able to reconstruct all raw measurements from the CRT components provided that the reconstruction condition is satisfied (i.e.,
Note that the reconstruction condition can be satisfied by multiple sets of prime numbers; however, to reduce the number of bits needed to represent values
For instance, if
However, when the set of primes is chosen as above, the message can be reconstructed if and only if all the CRT components are correctly received by the sink. So, to take into account the possible losses due to the wireless medium unreliability, we use the
4.3. Source Model
As shown in [13] and references therein, differences between two consecutive samples of several real-world data (temperature, humidity, solar radiation, etc.) fit well with Gaussian distributions. So, in this paper we consider that sensed data
This choice is motivated also by the fact that several analytical results are well known for Gaussian distribution and can be readily exploited to obtain the maximum lossless compression factor for correlated Gaussian sources.
As is well known, when compression of discrete sources is considered, Shannon's entropy H gives the lossless compression limit.
For Gaussian correlated data, under suitable assumptions and without loss of generality, it can be shown that, considering
Moreover, considering that for a broad range of values (i.e.,
Therefore, ideal (maximum) lossless compression factor for Gaussian variables considering blocks of N correlated values of w-bit each can be obtained as
Throughout the paper, we assume that
As regards CS, the actual compression factor is related to the sparsity level
Note that
So we consider two cases: an ideal sparsity level
Finally, in the case of CRT, we consider that the simple MinDiff algorithm proposed in [4] is used for compression.
Basically, MinDiff encodes a set of uncompressed data
The number of bits
Therefore, its compression factor considering blocks of N values of w-bit each can be obtained as
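A minimal MinDiff-style encoder can be sketched as follows, based on our reading of the algorithm's description: transmit the block minimum once at full width, then each value as an offset from the minimum using just enough bits to cover the largest offset. The exact bit layout in [4] may differ.

```python
import math

def mindiff_encode(values, w):
    """MinDiff-style encoding sketch: the block minimum (w bits) plus
    each value as an offset, using the smallest bit-width b that
    covers the largest offset."""
    m = min(values)
    diffs = [v - m for v in values]
    top = max(diffs)
    b = max(1, math.ceil(math.log2(top + 1))) if top > 0 else 1
    encoded_bits = w + len(values) * b   # minimum + all offsets
    return m, b, diffs, encoded_bits

def mindiff_compression_factor(values, w):
    """Compression factor: raw bits over encoded bits."""
    _, _, _, encoded_bits = mindiff_encode(values, w)
    return (len(values) * w) / encoded_bits

# Correlated 16-bit readings compress well because offsets are small
data = [1000, 1003, 1001, 1007, 1002]
cf = mindiff_compression_factor(data, w=16)
assert cf > 1.0
```

As in the text, the factor grows with the correlation of the block: the closer the readings, the fewer bits per offset.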
4.4. Energy Model
Similarly to other works (e.g., [16, 19]), we consider a simple energy model where for each bit to be transmitted a node spends an energy equal to
For instance, let us suppose that, for sensing and processing M measures of w bits, a node needs an energy equal to
Finally, if the energy needed for reception must also be included and it differs from the energy used for transmission, then, considering that for almost all nodes the number of bits received equals the number of bits transmitted, it is sufficient to use
Therefore, the main simplification introduced by our model is that we consider
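Under this per-bit model, the energy of a node can be sketched in a few lines; the per-bit energy values below are illustrative placeholders, not the constants used in [16, 19].

```python
def node_energy(bits_tx, bits_rx, e_tx, e_rx, e_proc=0.0):
    """Per-node energy under the simple per-bit model:
    energy grows linearly with transmitted and received bits,
    plus an optional sensing/processing term."""
    return bits_tx * e_tx + bits_rx * e_rx + e_proc

# A relayer forwarding 1 kbit (bits received == bits transmitted,
# as noted above); 50 nJ/bit is a placeholder value.
e = node_energy(bits_tx=1000, bits_rx=1000, e_tx=50e-9, e_rx=50e-9)
```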
5. Performance Metrics
In order to estimate the energy efficiency of the above techniques, let us introduce the energy reduction factor,
When an ideal lossless network is considered, the
For instance, in the case of TEC-based data gathering is
However, in the case of a lossy network, the numbers of transmitted and received bits differ, and the expected energy reduction factor has to be expressed taking into account the actual number of bits forwarded.
For comparison purpose, we decided to evaluate energies considering nodes belonging to cluster 2 (i.e.,
We restrict our analysis to the nodes of the second cluster for two reasons. Firstly, these nodes are the most critical as they are the sink's neighbors: if these nodes run out of energy, the sink remains isolated. Secondly, network lifetime is defined as the time until the first node in the network dies and, with high probability if not certainty, this node belongs to
Finally, considering that network lifetime is related to the maximum energy consumed by a node in this paper, we investigate also the energy reduction factor related to the maximum energies:
Concerning reliability, we consider that a node fails to forward a packet with probability
In fact if h is the number of hops needed to reach the sink and
Note that the reliability of TEC techniques is related only to network parameters
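The multi-hop loss model above can be made concrete with a short sketch; hop losses are assumed independent, each with probability p.

```python
def multihop_delivery_prob(p_loss, hops):
    """End-to-end delivery probability over `hops` independent hops,
    each failing with probability p_loss: (1 - p)^h."""
    return (1.0 - p_loss) ** hops

# e.g., 3 hops with 10% loss per hop
r = multihop_delivery_prob(0.1, 3)   # 0.9**3 = 0.729
```

This is the whole story for TEC: reliability is fixed by p and h and cannot be traded against energy.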
Differently from TEC techniques, for all the other data gathering techniques reliability can be improved with a proper setting of the design parameters. Nevertheless, a trade-off exists between reliability and energy saving, as briefly discussed below.
(i) CRT. In the case of CRT data gathering, prime numbers can be selected so that all raw measures can be reconstructed even if at most f CRT components are lost.
As shown in [16], when f is fixed the reliability can be estimated as
As a general rule, high reliability can be obtained by fixing f so that
This result can be justified by considering that
For instance, in Figure 1 we show the reliability

Higher reliability can be obtained by further increasing the value of the parameter f. However, increasing f also increases energy consumption, so in the next section we investigate the trade-off between reliability and energy consumption for different values of f.
(ii) DSC. Also in the case of DSC, the probability to lose a packet is
So, in order to improve the reliability, we considered that the cell head transmits
In this case, DSC reliability can be evaluated as
In Figure 2, we can compare reliability of TEC and DSC for two different values of the loss probability per hop (

As it is possible to observe by fixing
Obviously, reliability increases with the number of retransmissions
(iii) CS. By indicating with m the number of packets sent with CS, the probability to receive at least
Therefore, by increasing m it is possible to guarantee that, with the desired probability
However, differently from all previous techniques, CS reliability is not related only to the number of received packets so that
Nevertheless, if we assume that raw data
Therefore, in our simulations, the reconstruction error,
From simulation point of view, actual CS reliability can be evaluated as
We show in the next section that by choosing
Two reconstruction algorithms are considered for
6. Simulations Results
In this section, we compare data gathering techniques in terms of energy consumption and reliability.
The results have been obtained through a custom C++ simulator. For each set of parameters, mean results are reported considering 20 random topologies where nodes are uniformly distributed in a square area of size
If not otherwise stated,
6.1. ERF with Reliable Networks
To assess the simulator, we first analyzed an ideal (fully reliable) WSN and evaluated the
As shown in Figures 3 and 4, the results obtained through the analytical model (see (20)) and those reported by the simulator are very close to each other for all the values of σ and

ERF of data gathering techniques in the case of reliable network (

ERF of data gathering techniques in the case of reliable network (
The same results are obtained for different values of M, that is, by changing the number of raw measures per packet. This can be easily justified by the fact that
On the basis of the previous results, we can state that in the case of reliable networks TEC and DSC have a greater
Note that in our simulations
Finally, note that although CRT appears to be the worst in terms of
By comparing Figures 3 and 5, we can see that CRT and CS achieve different values of ERF when the sensing radius, r, changes. In particular,

ERF of data gathering techniques in the case of reliable network (
This can be justified considering that when r decreases, the sparsity level
It is worth noting that our analytical model is able to predict this result.
Also in the case of CRT, the ERF slightly decreases for lower values of r due to the fact that when
Similar considerations can be made about the node density ρ. In fact, as it is possible to observe by comparing Figures 3 and 6,

ERF of data gathering techniques in the case of reliable network (
It is worth noting that Figures 6 and 5 report quite similar ERF values despite different values of ρ and r being used for simulation. This result can be justified by considering that network density and sensing radius have been changed but without altering the overall number of source nodes (i.e.,
6.2. ERF with Unreliable Networks
In Figures 7 and 8, reliability and ERF of data gathering techniques for

Reliability of data gathering techniques for

ERF of data gathering techniques for
On the basis of the simulation results, we can state that even in the case of unreliable networks TEC and DSC have a greater ERF (see Figure 8). However, their reliability is fully determined by the packet loss probability and cannot be improved; instead, by using CRT and CS, higher reliability can be achieved by increasing f and m, respectively.
In particular, as shown in Figure 7, by fixing
It is important to note that the reliability plotted for CS is the actual reliability obtained after reconstruction with an ideal (oracle-based) reconstruction algorithm (i.e.,
By choosing m so that
Obviously, high reliability is achieved at the cost of a lower ERF but, by comparing Figures 8 and 3, we can state that the impact on the ERF is quite low (both
As a consequence when high reliability is needed even with unreliable networks, CRT and CS should be preferred.
6.3. CS Reliability
In all the previous simulations, we have fixed the number of bits
To convince the reader, in Figure 9 we report simulation results about the actual reliability

As it is possible to observe, when
Similar results have been obtained for different values of w.
However, in practice, actual reliability depends also on the reconstruction algorithm used. For the sake of completeness, we report in Figure 10 CS reliability when the CoSaMP [27] algorithm is used for reconstruction.

As it is possible to observe also in this case
6.4.
and Network Lifetime
The ERF metric is an expression of mean energy consumption; network lifetime is instead more closely related to the maximum energy consumption.
In Figure 11, we report

As it is possible to observe, TEC and DSC have higher
Finally, note that CRT and CS have similar performance if the actual sparsity degree of CS is 20% more than the minimum (ideal) value. These observations can be extended also to all the previous simulations.
7. Conclusions and Future Works
In this paper, we have compared several data gathering techniques used in WSNs by using both simulation results and analytical models. In particular, the effectiveness of the above techniques has been investigated in terms of reliability (packet loss and reconstruction errors) and energy efficiency (i.e., ERF and network lifetime) by systematically sampling the parameter space (i.e., number of nodes, transmission range, and sparsity). Basically, we can summarize our results as follows:
DSC and TEC techniques should be preferred for maximizing network lifetime. CS should be preferred when high reliability is needed. CRT should be preferred for its inherent low complexity.
As a consequence, we can state that there is no best solution for all possible applications and that only the trade-off between energy consumption, reliability, and complexity can drive the choice of the data gathering technique to be used for a specific application.
As future work, we plan to refine and improve the model to deal with actual correlated measurements (i.e., not only Gaussian data) and more realistic propagation channels (i.e., by taking into account actual distance between nodes).
Competing Interests
The authors declare that they have no competing interests.
