Abstract
Energy efficiency is one of the most crucial concerns for WSNs, and almost all researches assume that the process for data transmission consumes more energy than that of data collection. However, a few sophisticated collection processes of sensory data will consume much more energy than traditional transmission processes such as image and video acquisitions. Given this hypothesis, this article proposed an adaptive sampling algorithm based on temporal and spatial correlation of sensory data for clustered WSNs. First, according to spatial correlations between sensor nodes, a distributed clustering mechanism based on data gradient and residual energy level is proposed, and the whole network is divided into several independent clusters. Afterwards, each cluster head maintains an autoregressive prediction model for sensory data, which is derived from historical data in the temporal domain. With that, each cluster head has the ability of self-adjusting temporal sampling intervals within each cluster. Consequently, redundant data transmission is reduced by adjusting temporal sampling frequency while ensuring desired prediction accuracy. Finally, several distinct sampler collection sets are selected within each cluster following intra-cluster correlation matrix, and only one sampler collection needs to be activated at each round time. Sensory data of non-sampler can be substituted by those of sampler due to strong spatial correlation between them. Simulation results demonstrate the performance benefits of proposed algorithm.
Keywords
Introduction
Due to inherent advantages to monitor region of interest (ROI) in a fully distributed manner, wireless sensor networks (WSNs) have drawn considerable research interests and academic attentions in recent years.1,2 WSNs consist of a large number of self-organizing sensor nodes which are deployed in interesting or unattainable areas to acquire environmental information. With the advancements in system-on-chip (SOC) miniaturization and integration, it is possible to deploy abundant sensor nodes for practical applications3,4 Generally, each sensor node in WSNs collects raw sensory data periodically and then transmits to the sink node through wireless multi-hop communication for further analysis. However, this approach will result in excessive communication and energy consumption.
Due to energy-constrained characteristic of WSNs, in recent years, many energy management mechanisms have been proposed to reduce power consumption in the literature. Most researches suppose that energy consumption for data acquisition and processing is significantly lower than that for communication. Therefore, traditional researches focus on conserving energy as much as possible by reducing transmission capacity. Unfortunately, this assumption is not always held in a number of practical applications because acquisition time is typically longer than transmission time. Moreover, some acquisition processes are very sophisticated such as video and image encoding in wireless multimedia sensor networks. Consequently, adaptive sampling mechanism 5 is regarded as a promising method to improve energy efficiency in recent years, which focuses on reducing the amount of sensory data through self-adaptive sampling method. In addition, sensory data in WSNs usually have spatial correlation (due to geographical closeness of sensor nodes) and temporal correlation (due to smooth variations of signals in real world). Therefore, we investigated one hybrid adaptive sampling algorithm based on spatiotemporal correlation (ASbST in short) to conserve restricted energy.
The main idea of this article is to schedule sensor nodes by activating necessary and optimal sensor nodes with constraints of desired prediction accuracy. In this article, we activate more sensor nodes in data change non-smoothing regions and fewer sensor nodes in data change smoothing regions, thus guaranteeing data collection accuracy as well as reducing energy consumption. 6 In addition, sampling frequency in the temporal domain of each collection set is regulated by updated rule self-adaptively. With the proposed adaptive and optimal sampling algorithm ASbST, fewer sensor nodes are activated at the same time; meanwhile, the average sampling frequency is reduced. As a result, sensory data necessary for delivery are reduced so as to achieve significant energy conservation.
The rest of this article is organized as follows. Section “Related works” presents an overview in the area of adaptive data gathering. Section “Adaptive sampling for clustered WSNs” illustrates an analytical model and the overall principle of proposed adaptive sampling algorithm for clustered WSNs, and several crucial components of the proposed ASbST algorithm are described in detail from sections “Clusters construction” to “Autoregressive moving average (ARMA)-based predictive model at cluster head (CH).” Simulation and performance evaluation are presented in section “Simulations and analysis.” Finally, section “Conclusion” concludes this article.
Related works
Energy consumption is the most important research issue in WSNs. However, most researches focus on new methods for energy-efficient transmission, and only a few researchers are investigating energy effective adaptive sampling mechanisms. Energy-efficient transmission problem for WSNs has been investigated with mathematical optimization method in literature. 7 Generally, adaptive sampling mechanisms can be divided into two categories: 8 adaptive spatial sampling (ASS) algorithms and adaptive temporal sampling (ATS) algorithms. ASS algorithms ensure sampling accuracy using spatial region–based self-adjusting, wake-up/sleep states scheduling, and so on. ATS algorithms ensure sampling accuracy using sampling frequency self-adjusting, online signal tendency estimation, and so on. Figure 1 shows the categories of current adaptive sampling algorithms, which are concluded from a lot of existing literature.

Sampling strategy categories.
It is well known that WSNs prefer to sample more than requirement, thus resulting in significant spatial and temporal correlations. 9 Temporal correlation is used in an adaptive sampling algorithm named as adaptive sampling approach (ASAP) to minimize energy consumption of sensor nodes. 10 The main idea of ASAP is to use a dynamically changing subset of nodes as samplers so that sensory readings of sampler nodes are directly collected, whereas sensory data of non-sampler nodes are predicted through designed probabilistic model. Alippi et al. 11 propose an adaptive sampling algorithm that estimates online optimal sampling frequencies for sensor nodes and apply it into snow-monitoring applications. Santos et al. 12 present a technique that applies adaptive automata mechanism, and the possibility of sensor networks dynamically vary the interval (time) between a data logging and other, so as to reduce data sampling intervals. Zheng et al. 13 discuss how to perform energy-efficient sensory data sampling with compressive sensing technology. Our previous work 14 proposes a sample frequency–based adaptive sampling algorithm, where the sampling frequency of each cluster was determined by CH following the multiplicative increase additive decrease (MIAD) rule dynamically. Szczytowski et al. 15 identify the regions of oversampling or undersampling and propose a Voronoi-based ASS (ASample) solution. As a result, it removes unnecessary samples from regions of oversampling and generates additional new sampling locations in the undersampling regions to fulfill specified accuracy requirements. Deligiannakis et al. 16 describe an self-based regression (SBR) algorithm for compressing multi-valued feeds containing historical data from each sensor node. The key to their technique is the base signal—a series of values extracted from real measurements used to provide piece-wise approximation of the measurements. Moreover, they further explain how it can be adapted to exploit correlations among measurements operated on single node/multiple nodes. JB Li and JZ Li 17 propose a method to select some specific sensor nodes among the sensor nodes with stronger spatial correlations, thus reducing the data redundancy. However, this method does not consider the adjustment of the sampling frequency in time domain. Mukhopadhyay et al. 18 propose a data error correction method based on prediction model established in the time domain for WSNs to improve the reliability of sensory data. JZ Li and colleagues.19,20 focus on designing physical world aware data acquisition algorithms to support O(ε)-approximation to the physical world for any ε and then propose two physical world aware data acquisition algorithms to reconstruct the physical world. Our previous work 21 proposes a novel coverage ratio index in consideration of the spatial domain and temporal domain as well as a novel coverage sampling technology based on spatial–temporal joint union for WSNs. Most of the above researches are concentrating on adaptive sampling either in the time domain or in the spatial domain, and there are no relevant studies to take these factors into account.
This article aims at designing ATS algorithm combining self-adjusting frequency sampling and ASS and proposes a novel adaptive sampling algorithm based on temporal and spatial correlations of WSNs. To the best of our knowledge, this is the first spatiotemporal correlations–based adaptive sampling algorithm for clustered WSNs. By optimizing sampling frequency and sampling node sets of different clusters adaptively in a distributed manner, the average sampling frequency and number of sampling nodes will decrease so as to decrease energy consumption of whole WSNs. In addition, the proposed algorithm of this article can guarantee sensory data’s sampling accuracy at the same time.
Adaptive sampling for clustered WSNs
System model and overview
In this section, we describe the investigated clustered WSNs model and introduce basic concepts through an overview of adaptive sampling based on temporal and spatial correlation. In our clustered WSNs, a large number of energy-constrained sensor nodes assigned with unique IDs are employed randomly in a rectangle area. Therefore, the network topology of WSNs to be discussed can be represented by an undirected simple graph
Notations.
MRE: Mean Relative Error.
In addition, we assume that there is a higher node density in our clustered WSNs. Hence, the practical spatial data correlation model described in Lotfinezhad and Liang
23
can be introduced. Let
Due to the strong correlations of sensory data in WSNs, we propose a model-based adaptive sampling mechanism for clustered WSNs considering temporal and spatial correlations together. The proposed adaptive sampling algorithm ASbST comprises several components: (1) sensing data driven clustering algorithm. With this algorithm, sensor nodes, similar to each other in terms of their observed sensory data, should be clustered into one group. Furthermore, sensor nodes will join in its nearest CH to be cluster members, thus taking advantage of the spatial correlation measured with Euclidean distance for simplicity. (2) ATS algorithm. Each CH builds a prediction model with historical data within its cluster. Afterwards, it uses predicted value from estimated prediction model to substitute that of non-sampling nodes, thus decreasing sampling frequency, which is shown in Figure 2. The online intra-cluster prediction framework of each CH is described in Figure 3. It establishes and maintains a prediction model by feature extraction from historical data. (3) ASS algorithm. Cluster members are further divided into a few sub-clusters, and only one sensor node with most residual energy in each sub-cluster will be selected to be one member of sampling completed set (SCS). The CH uses sensory reading from its cluster members to abstract the spatial and temporal correlations of sensory data. Then, each cluster is partitioned into some sub-clusters, with the basis of principle that cluster members whose sensory data are highly correlated are divided into the same sub-cluster. As a result, only sensor nodes in SCS of each cluster serve as samplers, and the others belong to non-samplers in order to conserve nodes’ energy.

Temporal prediction model diagram.

Online intra-cluster prediction framework.
Moreover, in this article, we use network topology and sensory data derived from Intel’s laboratory in Berkeley University 24 with 54 sensor nodes deployed in the Lab.
Adaptive sampling in the temporal domain
Figure 4 shows the origin principle of proposed adaptive sampling algorithm in the temporal domain. There is a specific signal whose amplitude is shifting with time sequence. Obviously, oversampling denotes that the actual sampling frequency is much higher than necessary sampling rate, and undersampling shows that actual sampling frequency is much lower than necessary sampling rate. It is a promising research topic to minimize real-time sampling frequency as much as possible while satisfying desired prediction precision. However, it is very hard to acquire appropriate sampling rate because signal frequency is often unknown in advance.

Nyquist-based fixed sampling, oversampling, and undersampling: (a) fixed sampling rate, (b) oversampling rate, and (c) undersampling rate.
In the work, well-designed ATS strategy is operated in each cluster, and the intra-cluster sampling frequency is determined by each CH. When the deviation value between sampled data and predicted data exceeds a preset threshold, CH will increase intra-cluster sampling frequency; otherwise, CH will decrease intra-cluster sampling frequency. In other words, the intra-cluster sampling frequency can be increased to ensure the quality of sampled data with high dynamic nature of sensory data. Otherwise, the intra-cluster sampling frequency will decrease with low dynamic nature of sensory data, so as to reduce data redundancy. Finally, the balance of energy efficiency and data gathering quality can be achieved together through this method.
Adaptive sampling in the spatial domain
At first, we define two new concepts: oversampling region (the case with much more awaking sensor nodes than necessary) and undersampling region (the case with fewer awaking sensor nodes than necessary). Sensor nodes are partitioned into different clusters according to data gradient metric between sensor nodes in spatial domain. Therefore, the proposed ASS scheme will operate on each cluster in a fully distributed manner. The main principle of adaptive sampling algorithm in spatial domain is to reduce the number of awaking nodes with AR prediction model, which is maintained and updated by each CH. In addition, cluster members of each cluster are further divided into a few sub-clusters based on the historical model of sensory data. The proposed ASS approach is explained with an example (see Figure 5). Seven sensor nodes with ID 1 to 7 in the square ROIs can be divided into two independent sub-clusters named as SCS: {node 1, node 2, node 3, and node 4} and {node 5, node 6, and node7}. Either of them can cover the whole square ROI. In order to conserve energy, only these sensor nodes (named as samplers) in either SCS are awake; meanwhile, sensor nodes in another SCS enter sleeping status, which are named as non-samplers.

Oversampling spatial region.
Clusters’ construction
The goal of sensing-aware clustering algorithm proposed in this article aims at forming a hierarchical network framework to facilitate adaptive sampling and achieve the optimization objective of energy saving awareness and data gradient awareness. In the first phase, we propose a distributed clustering algorithm to establish one-hop clusters, where CHs are elected in a fully distributed fashion. Each sensor node is elected to be CH randomly with probability
where
Subsequently, CHs broadcast CH notification messages to their one-hop neighbor nodes after CHs election process. Those cluster members (CMs) that received CH notification messages will make a decision about which CH to join in. It is called as minimal data gradient principle. The details of minimal data gradient principle are illustrated as shown in Figure 6, and there are four CHs locating in the one-hop region of CMi. The minimal data gradient selection algorithm is described in equations (3) and (4). The CMi will select the CH with minimal gradient value as its CH, for example, if
where

Minimal data gradient principle.
After clusters construction phase, sensor nodes are divided into two categories: one particular sensor node is elected as CH responsible for data aggregation and the other sensor nodes as cluster members (CMs) responsible for data acquisition. The detailed sensing-aware clustering algorithm is illustrated in our previous work.
25
Figure 7 demonstrates the clustering results of different CH percent (

Clustering results of different cluster head percent: (a) CH percent (p = 0.05) and (b) CH percent (p = 0.15).
ATS
Following the most famous Nyquist law in signal processing field, if the highest frequency Fmax of signal is given, the minimum sampling frequency FN is more than twice the highest frequency, thus guaranteeing signal reconstruction. In particular, it is common to pick a sampling frequency three to five times higher than the highest frequency of signal, that is, FN = cFmax, where c = 3–5. 26 Unfortunately, highest signal frequency is not available in advance for most applications. Moreover, it will change over time in a non-stationary process. Consequently, the sampling frequency has to vary adaptively with signal variation tendency. For large-scale WSNs with many clusters, signal variation tendencies of different clusters are different, so signal sampling rate ought to adjust subsequently within different clusters.
This research holds the following assumptions: (1) the sensory data of WSNs in the temporal sequence are typically continuous and stable in many practical scenarios and (2) there are some data errors because of unreliable wireless communication, failure of sensor nodes, and so on. Based on the above two hypotheses, more collected sensory data do not mean more accurate results. Moreover, the actual sampling frequency is universally high in the time domain for many applications. For example, it is obvious that historical temperature values of Sensor No. 1 between 1 March and 6 March have strong similarity (as shown in Figure 8), defined as temporal correlations.

Temporal correlations of different days.
Unfortunately, it is impossible for accurate estimation of signal highest frequency in advance, so we propose an intra-cluster sampling frequency self-adjusting method. Each CH establishes one second-order AR prediction model based on historical data of its cluster members. After that, estimated value is used instead of sampling data which are calculated by AR prediction model. The self-adapting sampling frequency adjustment scheme is described as in the following: the intra-cluster sampling frequency will increase additively by a certain step value when the sampling error is less than the preset threshold; otherwise, the intra-cluster sampling frequency will be reduced to its half value. Figure 9 shows the detailed process of the proposed algorithm. In the each iteration process, each CH broadcasts a certain sampling frequency to its CMs, which is determined by current sampling accuracy. If current sampling error is larger than the default threshold

Pseudo-code for adjustment of intra-cluster sampling frequency.
The distributed characteristic of sampling frequency self-adjusting strategy is that different sampling frequency regulation processes are operated on different clusters at the same time. Therefore, the proposed algorithm is appropriate even for large-scale WSNs. The uniform initial sampling frequency of each cluster is determined by the sink node. Moreover, it is same for the minimal sampling frequency
where
ASS
The most prominent assumption of the work is that those sensor nodes that are closer to each other will have stronger correlation. For example, Figure 10 shows the data variation tendencies of Sensor Nos 1, 2, 16, and 17 in Intel’s laboratory. It is obvious that the variation tendency of Sensor Nos 1 and 2 is much more similar than that of Sensor Nos 1 and 16. Specially, it is obvious that the sensory data of Sensor No. 17 has many errors because of transmission or acquisition errors. There are two crucial components of the proposed ASS approach. The first one is to facilitate the classification of sensor nodes to serve as distinct sampler, and the other is named as optimized scheduling for sampler and non-sampler.

Spatial correlations of neighbor sensor nodes: (a) sensory data of Sensor No. 1, (b) sensory data of Sensor No. 2, (c) sensory data of Sensor No. 16, and (d) sensory data of Sensor No. 17.
The most important phase is to construct several distinct sampler collections within each cluster. The cluster formation described in section “Clusters construction” complies with two metrics: sensor nodes similar to each other in terms of sensory data should be clustered together and sensor nodes close to each other should be clustered together. As we know, higher correlations among sensor nodes within a cluster typically lead to higher sampling accuracy. However, this will lead to much lower efficiency of energy at the same time. The principle of proposed idea is to divide cluster members into several sub-clusters, and only one of sensor nodes in each sub-cluster is awake within its sub-cluster. It is supposed that the spatial region of WSNs is almost oversampling with ROI covered by at least two distinct collections. Therefore, our goal of further dividing sensor nodes into sub-clusters is to facilitate the selection of SCS. For example, in Figure 11, one cluster is divided into three sub-clusters (Sub-cluster A, Sub-cluster B, and Sub-cluster C), and Sensor Nos 1, 7, and 5 are selected from each sub-cluster to form an SCS. Once SCS is determined within one cluster, only SCS nodes need to wake up to be sampler and other sensor nodes transform to sleep mode so as to conserve energy. Moreover, sensory data of sleeping sensor nodes can be estimated at each CH using AR prediction model.

Example of sampling completed sets.
The above three sub-clusters are elected with such optimal metric. First of all, we create a correlation matrix for each twin collection i and collection j. For any two nodes u and v in the collection,
where

Optimization objective to select sampling collection sets.
The SCS granularity

Sub-clustering results with different SCS granularity
Once optimal SCS are determined in each round, sensor node with maximal residual energy within each sub-cluster is selected to be one member of SCS. As a result, sensor nodes are divided into sampler and non-sampler. Only the former needs to transmit the required data to the sink node instead of the latter. The values of non-sampler nodes will be predicted at CH by prediction model constructed with historical data.
ARMA-based predictive model at CH
The remaining problem is to calculate the previously defined sampling accuracy metric derived at each CH. The famous autoregressive and moving average (ARMA) model captures the autocorrelation in a data set by expressing the prediction as a linear combination of previous samples. This article adopts a smaller amount of calculation of linear AR model 28 to sample prediction model at each CH. Meanwhile, it uses the recursive least squares (LS) estimation methods for parameter estimation, which is described as follows
where
Considering such requirements as prediction accuracy and operation speed, two-order AR prediction model is introduced. Thus, the predictive value can be reconstructed by each CH with historical data
where
The autocorrelation function of sample data can be derived from equation (11)
Therefore, predictive error is calculated as the difference value between predictive value and actual value
The famous Yule-Walker method 29 is used to estimate coefficients of AR model. Although periodical update will introduce unnecessary cost into our algorithm, the calculation process of predictive model will operate only when the error value exceeds a certain threshold. Figure 14 shows the AR prediction error values with different window sizes of sensor node 1 on 1 March.

AR prediction error with different window size (1 March, Sensor No. 1).
Simulations and analysis
This section provides simulation setting and performance analysis for the proposed ASbST algorithm. Although some simulation performances of different individual parts are present at previous sections, only simulation results to verify the whole algorithm are analyzed and compared in this section.
Simulation setup
The sensory data used in the simulation are derived from Intel’s laboratory in Berkeley University. 24 The sensory data file contains temperature, humidity, brightness, and voltage from 54 different sensors about once every 30 s in 1 month. The deployment framework of sensor nodes in Intel’s laboratory is illustrated in Figure 15, where 54 sensor nodes labeled with IDs are distributed in a rectangle region.

Deployment of sensor nodes in Intel’s labs.
In this section, we describe the simulation settings and the metrics we have chosen to verify the performance of the proposed algorithm. MATLAB software 30 has been used for simulation tool. The evaluation metrics comprise correlation coefficient, predictive error, actual sampling frequency, and energy consumption rate. We compare performance with fixed sampling rate scheme, and the simulation process lasts 500 rounds to obtain average results. Table 2 demonstrates some important parameters for simulation.
Parameters for simulation.
It is assumed that each sensor node has the same initial energy 1 J, and energy consumption model is the same as Anastasi et al., 31 as such equations
Simulation results and performance analysis
First, the criteria evaluation is introduced to measure spatial correlations between sensor nodes. The correlation coefficient equation is defined as follows
where

Comparison of correlation coefficient with different window sizes: (a) window size, W = 10; (b) window size, W = 50; and (c) window size, W = 100.
Figure 17 compares the predictive error of Sensor No. 1 from 28 February to 29 February using AR model. We find out that predictive error is bounded by a certain small value, except that there is a major predictive error at boundary of 2 days due to some errors of original data set.

Predictive error from 28 February to 29 February of Sensor No. 1: (a) temperature variation and (b) predictive error.
As a representative of energy consumption rate in sensory sampling process, the mean sampling frequency value Mean
f
is defined as the ratio between the sum of actual sampling frequency with the number of sampling sequence as well as the unit of 1/60 Hz. It is obvious that the variation of sampling frequency drops from the original sampling frequency

Sampling frequency self-regulation with different MREs.
The energy residual ratio is defined as the ratio between residual energy and initial energy of all sensor nodes in our simulations. Figure 19 illustrates simulation results of energy consumption ratio with different SCS granularity values and CH percents. The proposed ASbST algorithm achieves energy efficiency of data sampling by collecting sensory data from only a subset of sampler. The sensory data of non-sampler will be predicted using AR model derived from recent sensory data. It is very obvious from Figure 19 that energy consumption will increase with much larger cluster number because there are much more awake sensor nodes. Moreover, energy consumption with smaller SCS granularity is much greater than that of larger SCS granularity with the same reason.

Energy consumption with different SCS granularity and cluster head percent.
Conclusion
In this article, we introduced an adaptive sampling algorithm based on temporal and spatial correlations for clustered WSNs. The proposed ASbST algorithm could be used to decrease data sampling frequency while guaranteeing data sampling quality efficiently. First, a novel clustering mechanism based on data gradient was proposed. Second, CHs maintained AR predictive model of sensory data in the time domain within each cluster, as well as self-adjusts and changes sampling frequency based on predictive model. Third, cluster members transmitted sensory data to its CH with self-adopting sampling frequency. Extensive simulation results verified that ASbST algorithm can achieve better performance with spatiotemporal correlations of historical sensory data. Our future work will focus on multi-dimensional sampling algorithm, which considers different spatial–temporal resolutions as well.
Footnotes
Acknowledgements
The authors would like to thank the anonymous reviewers whose insightful comments helped improve the presentation of this paper significantly.
Handling Editor: Amiya Nayak
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was partially supported by Natural Science Foundation of Zhejiang Province (No. LY18F030006 and No. LY16F030004) and National Natural Science Foundation of China (No. 61871163 and No. 61801431).
