Abstract
Deployment is a basic and key issue for all kinds of wireless sensor networks applications. Most of existing researches on deployment problem of wireless sensor network are on generic network level and not for specific application scenarios and also do not utilize any domain knowledge of practical applications. Focusing on the problem of limited number of sensing nodes in soil respiration sensor networks, using domain knowledge such as some lightweight parameters (temperature, humidity, etc.) influencing soil respiration and soil respiration having a day-periodic trend, we proposed a deployment method,
1. Introduction
Deployment strategy is very important for the application of wireless sensor network technology. On the one hand, it affects the connectivity and coverage performance of the entire network; on the other hand, it determines the number of nodes that the network needs. Good deployment strategy can maintain network performance and the monitoring accuracy using fewer nodes in the network, which can effectively reduce the cost of whole network when the nodes are expensive.
Soil respiration is one of the main ways of carbon exchange between the terrestrial ecosystem and atmosphere system and plays an important role in the global carbon cycle and carbon balance. CO2 discharged to the atmosphere through soil respiration is about 68 PgC/yr, accounting for about 10% of the total atmospheric CO2 [1]. Therefore, objectively evaluating the contribution of soil respiration carbon flux to the global carbon budget and analyzing the influence of soil respiration on global climate change are especially important and have become core issues of the global carbon cycle.
As to the measurement of soil respiration flux of carbon, accurate soil carbon flux data of a single position can be achieved using some current equipment, for example, LI-8100 series of soil respiration measuring systems from LI-COR company [2]. However, due to the influence of a variety of factors such as soil pore size and type, soil temperature, soil water content, wind speed, and CO2 concentration gradient, spatial and temporal heterogeneities of soil respiration carbon flux are very obvious [3], which makes existing methods hardly achieve accurate measurement of regional soil respiration carbon flux.
Wireless sensor networks have characteristics of wide monitoring range, sustainable monitoring, collaborative work, and transferring information in time and can meet the requirements of regional soil respiration monitoring such as area coverage, persistent monitoring, and synchronous sampling. In order to apply wireless sensor networks to soil respiration monitoring, we have designed an instrument for soil respiration carbon flux measurement with communication ability using wireless sensor networks protocol, as is shown in Figure 1, which can be used as soil respiration sensing nodes [4].

Wireless sensor network for soil respiration monitoring.
But we encountered a problem during the practical construction process of soil respiration monitoring sensor networks; that is, the number of soil respiration sensing nodes is limited due to the high cost, which makes a well-designed deployment strategy become an urgent need to use fewer nodes monitoring a target region.
Although there have been a lot of research works aimed at nodes deployment issues in wireless sensor networks, most of them are to study the coverage of monitoring area and network connectivity issues. Furthermore, most of these studies are generic methods to solve the deployment problem and not for a specific application scenario. However, there is usually some specific domain knowledge in the relevant fields of each practical application, and using such domain knowledge may lead to noneligible promoting effect on the solution to the deployment problem when wireless sensor network technology is applied in practical fields.
Because soil respiration in a position has an approximately day-periodic variation trend [5], the analyzing result of the former sensing data can play a guiding role for the later measurement. The diel variation trend of a position is affected by the environment parameters, so positions with similar parameters may have similar time domain features. It is very possible that some positions in a region with the same vegetation have similar parameters. As a result, if the initial measurement data of different positions have similar time domain features, the same similarity will be maintained in the process of subsequent measurement.
Therefore, we designed a deployment strategy of soil respiration sensor networks based on time domain feature similarity of lightweight parameters,
What should be emphasized is that although this method is put forward for a specific application of soil respiration monitoring, in fact the idea has universality and can be used for other applications with lightweight parameters, in which the monitoring data is periodic and sensing nodes costs are high. Therefore, the presented method can provide reference for solving problems during the application of wireless sensor network technology to other fields and promote wider application of wireless sensor networks.
The remaining parts of this paper are organized as follows. Section 2 introduces some previous work related to the subject this paper. Section 3 describes the nodes deployment problem in soil respiration monitoring sensor network to solve in this paper. Section 4 introduces in detail the proposed deployment strategy of soil respiration measurement nodes based on time domain similarity of lightweight parameters,
2. Related Work
2.1. Deployment of Wireless Sensor Networks
Wireless sensor network technology can be used for many applications such as industry automation, localization and track tracing, health monitoring, and environment monitoring. Deployment is an important and basic issue for each of such applications. The concerns of the deployment problem include coverage, connectivity, and energy-efficiency.
The coverage problem of a wireless sensor network for different applications includes field coverage and barrier coverage, and the field may be two-dimensional area or three-dimensional space or surface. The basic deployment method for 2D area coverage is cell-based, and the shape of each cell may be square, triangle, hexagon, and so on. Ammari and Das [6] studied the coverage and connectivity of wireless sensor networks in 3D space using integrated concentric sphere model and continuum percolation theory, so as to determine the critical density of sensor nodes for the deployment and topology control. Liu and Ma [7] analyzed the expected coverage ratios on both regular and irregular 3D surface using stochastic sensors deployment method. For barrier coverage, there are line-based model [8] and curve-based model [9] for the deployment of wireless sensor networks. Furthermore, the coverage requirement may be
The connectivity and energy-efficiency are also considered by many deployment methods besides coverage. If there are obstacles in the field, a serpentine movement policy can be used to handle obstacle [15]. The positions of relay nodes are important for the connectivity of a network. Lee and Younis [16] studied the selection and deployment problem of relay nodes, when some nodes failed and the network is separated into segments. The deployment of relay nodes can also be guided by the communication load of the network, so as to place more relay nodes in heavy load region [17]. Liu et al. [18] studied the deployment of sensor nodes using minimum cost when the required lifetime of network is given. Energy can also be saved through optimizing the distance of movement of sensor nodes in mobile sensor networks [19].
Compared with the previous work, the main novelty of
2.2. Wireless Sensor Networks for Environment Monitoring
There are many systems using wireless sensor network technology in the recent years. Barrenetxea et al. [20] established a wireless sensor network system for wild environment monitoring. GreenOrbs is a wireless sensor network system in forest for canopy closure sustainable estimation [21]. Besides, wireless sensor network technology is also used for volcano monitoring [22], wildlife monitoring [23], marine environment monitoring [24], soil property monitoring [25], and so on. In these systems, the focus is the application rather than deployment, so they do not have particular design for the optimization of the count and positions of sensor nodes.
Some environment monitoring applications use graph theory to optimize the positions of sensor nodes [26, 27]. However, these graph-based methods are generic and do not utilize any knowledge of applications. Du et al. [28] introduced a wireless sensor network system for wind distribution monitoring around an urban reservoir, in which sparse deployment is accomplished by adjusting distance of communication. However, it did not introduce how to determine the positions of sensor nodes.
Du et al. [29] segmented a year into different monsoon seasons according to the analysis of historical wind distribution data of the field and used computational fluid dynamics to learn the spatial correlation of wind in the presence of surrounding buildings. The most informative locations are the outputs of the deployment method. This method uses knowledge of application, which is similar to our method. However it cannot be used in soil respiration monitoring sensor network, because we cannot have historical soil respiration data of a monitored field and computational fluid dynamics cannot be used for soil respiration.
As to the previous works, the deployment methods of them are all different from the method based on similarities of lightweight parameters, which is to be studied in this paper.
3. Problem Descriptions
3.1. Soil Respiration Sensor Network
In order to monitor soil respiration carbon flux data in a region, traditional methods are to carry out measurements in one or a few sampling positions using devices such as LI-8100 and then estimate soil respiration carbon fluxes across the region. But the results of such methods using a small number of measuring positions are very inaccurate. On the one hand, for the spatial heterogeneity of soil respiration, using a small amount of measuring positions to represent other positions in the region as a whole is not appropriate [30]; on the other hand, soil respiration varies over time and the result measured at a certain moment cannot represent the measurements of other moments [31]. So the traditional measuring methods are not representative in both time and space aspects, and regional soil respiration carbon flux cannot be accurately estimated using traditional methods.
Therefore accurate measurement and estimation of regional soil respiration should meet the following three requirements. (1) Measurements in multiple positions: due to the spatial heterogeneity of soil respiration, only measurement results of multiposition can describe the entire regional soil respiration accurately. (2) Measurements at the same time in multiple positions: because of the time heterogeneity of soil respiration, measurements in the same position at different time may be different, so only measurements at the same time in multiple positions can describe the regional soil respiration at the measuring time. (3) Continuous measurements: for the temporal heterogeneity of soil respiration, few measurements cannot reflect the details of the soil respiration varying with time, and only continuous measurements can reflect the varying details of soil respiration along with time.
Wireless sensor network technology can well meet the three aspects of requirements of measurements: multiple positions, synchronization, and continuation. So we adopt this technology to carry out the measurement and estimation of regional soil respiration carbon flux. During the construction of a soil respiration sensor network, because the soil respiration measuring equipment is expensive which leads to limited quantity of the equipment, we add some routing nodes which bear only communication tasks and are not responsible for the measurement task to guarantee the connectivity of the network, as shown in Figure 1. In order to save energy of these nodes, soil respiration measuring nodes act as leaf nods in the data collecting tree, and measured data will be transferred to the sink node by routing nodes. In addition, we have studied segmental dynamic sampling strategy to reduce energy consumption of soil respiration measuring nodes and prolong the continuous working time of the whole system in our previous study [32].
3.2. Deployment of Soil Respiration Sensing Nodes
As mentioned above, a soil respiration sensor network contains some routing nodes to guarantee the connectivity of the network, so the deployment positions of soil respiration nodes mainly consider the requirements of application layer rather than network layer. At the application layer, in order to effectively measure regional soil respiration using a limited number of soil respiration sensing nodes, each node should be deployed in a more reasonable position.
Deploying a soil respiration sensor network in the region
The goal of this paper is to determine small
4. Our Solution
4.1. Main Idea
In order to solve the problem of reasonable layout of sampling positions, a relatively easy way is to arrange more sampling positions in subregions with more drastic changes of soil respiration so as to get more details of the changes and to choose fewer sampling positions in subregions with relatively flat changes of soil respiration. This approach is based on the spatial correlation of soil respiration, which means each measurement in a position represents the data in surrounding area, and the correlation can be exploited by spatial interpolation. As a result, the distribution state of sensing data in the entire region can be obtained by interpolation using a small number of measurement points.
The method above only pays attention to the local spatial correlation of soil respiration, the main consideration of which is the relationship of soil respiration of a position and that of positions nearby it. But the approach does not take into account the correlation between soil respiration of positions which are not adjacent. In addition to the correlation among adjacent positions which are naturally used in the spatial interpolation process, the correlation of soil respiration data measured in nonadjacent positions may play a significant role in promoting the rationality of the layout of soil respiration sensor network, which is the focus of this paper.
In this paper we designed a soil respiration nodes deployment method exploiting the correlation of soil respiration data in nonadjacent positions. According to the similarity of time domain features of soil respiration data in different positions of monitored area, these positions are divided into groups regardless of whether positions in each group are adjacent or not in physical space, and thus soil respiration data monitored in one position from each group can represent the remaining positions of the same group for the time domain similarity. During the monitoring stage, once soil respiration data are measured in the chosen sampling position of each group, soil respiration data of other positions in the same group can be obtained through certain transformation relations which are found during the grouping process. As a result, we can have soil respiration data of positions much more than sensing nodes actually deployed before final interpolation to simulate the real state of regional soil respiration, which makes the interpolation result closer to the real world.
However, obtaining the time domain similarities of soil respiration data of all positions in the monitored area is very difficult, since it needs fine-grained soil respiration data of each position in the monitored area for a period of time, which is the ultimate goal of soil respiration monitoring sensor network, and can only be realized through dense and regular deployment of soil respiration measuring nodes. In fact, it is not realistic to densely deploy soil respiration measuring nodes in the monitored area, because we cannot have enough soil respiration measuring nodes usually due to expensive cost, which is just the root cause of what to be studied in this paper.
To solve the problem above, we introduced the temporal similarity of lightweight parameters into the solution. According to the domain knowledge of soil respiration, soil respiration sensing nodes use open chamber method to measure the soil respiratory carbon flux, and the calculation is shown by [4]
From (1) we can see that soil respiration is affected by temperature, humidity, pressure, and other factors. Because price and energy consumption of sensors for these parameters are relatively low, we refer to these factors such as temperature, humidity, and pressure as “lightweight parameters” of soil respiration in this paper.
In order to solve the problem that the similarities of time domain features of fine-grained positional soil respiration data in the monitored area cannot be obtained directly, we converted the similarities into the similarities of time domain features of lightweight parameters in different positions. Because of the low cost and low energy consumption of lightweight parameter sensing nodes, lightweight parameters can be measured densely and regularly in the monitored area and the similarities of time domain features of lightweight parameters in different positions can be obtained.
Therefore, we prposed a strategy for the deployment of soil respiration monitoring sensor network based on similarity of time domain features of lightweight parameters, which is called
What should be emphasized is that because deploying a soil respiration measuring node requires some complicated special processes such as smoothing the ground surface and embedding the device bottom into the soil, once the layout scheme of sensor nodes is determined, it is usually assumed that soil respiration sensing nodes in the monitoring period will not be moved anymore after deployment. Therefore, these complex preparatory works including the measurement, analysis, and computation of lightweight parameters are very necessary for the optimal layout scheme of sensor nodes.
4.2. Temporal Similarity of Data in Different Positions
In order to group similar positions in the monitored area using the time domain features of the data, the similarity of time domain features of data between different positions need be defined.
Let
Because all the data in this application, either soil respiration data or lightweight parameters data, are all positive, the components of
4.3. Grouping Positions according to Temporal Similarity
After the temporal similarity of data between two positions is defined, we can divide multiple positions into different groups according to the similarities between each other, and the grouping result is that positions in each group have high similarities of time domain features of data while there are less similarities among positions from different groups. There are two cases about the grouping problem in practice, one of which is that the number of groups,
When the number of groups
When the lower bound of group similarity
After the grouping scheme of positions in the monitored area is determined and the grouping result is achieved, we choose the position where the data vector has the highest similarity with the center vector of all data vectors in the same group as the position to deploy a soil respiration sensing node. When the deploying positions in all groups are chosen, the positions of routing nodes
One position in each group is selected as a representative for the soil respiration sampling. Therefore, if the number of soil respiration measuring nodes to deploy is certain, we use the case of determined
4.4. Reconstruction of Data for Positions of a Group
A following problem is the reconstruction of data for the rest of positions using the data of the sampling position in a group. Data reconstruction should be done after the deployment and during the working stage of soil respiration sensor network system, and it seems that this issue is not a part of the deployment of a soil respiration sensor network. But in fact, as for the proposed deployment strategy based on time domain similarity of data, its main idea is taking advantage of the correlation of data from nonadjacent positions, so we should clarify that the method is able to rebuild the same correlation in data before putting the deployment strategy into practice. If the data correlation among nonadjacent positions used by the deployment strategy cannot be reconstructed during the network works, this deployment method will be not able to be applied into any applications.
The reconstruction problem of data of each position in a group is described as follows. Assume that a position group
To fulfill this task, we need compute and save the conversion relations between the data vectors of the rest of
During the analysis of premeasured data for grouping, record the ratios Ratio between data vector of every nodes
But in the approach we designed in this paper, the premeasured data is lightweight parameters data rather than soil respiration data, and the ratio vector Ratio is the relation of lightweight parameters data of each position within the group rather than that of soil respiration data. So this ratio vector Ratio can only be used to reconstruct lightweight parameters data and cannot be directly used for the reconstruction of soil respiration data.
Because the soil respiration carbon flux is calculated according to (1), the lightweight parameters such as temperature and humidity measured in position
4.5. TimSim Algorithm
Now we formally describe the deployment strategy for soil respiration monitoring sensor network based on time domain similarity of lightweight parameters,
Algorithm 1 requires the following:
( ( ( ( ( ( ( or ( ( ( ( ( ( (
As for the soil respiration sensing network, the length of time lightweight parameter data sequence
5. Experiments and Analysis
5.1. Experimental Data
In order to verify the proposed deployment strategy of soil respiration monitoring sensor network based on similarity of time domain features of lightweight parameters,
We adopt a substitutional method to simulate real soil respiration data of many dense positions in the monitored area using temporal and spatial shift. We deploy
Taking into account that there are several factors affecting soil respiration such as temperature, humidity, and air pressure, too many lightweight parameters may complicate the experiment. We carry out the experiment process in smooth lawn with uniform illumination, in which the temperature and pressure of different positions are almost the same; thus humidity is the only main factor affecting soil respiration in this area. So the lightweight parameter in our experiment only remains humidity data.
In the experiment, we used 20 soil respiration measuring equipments, deployed them in different positions of the monitored area, let them keep working for 5 days, and obtained 100 position-day serials of data. Then we changed the positions of such equipment and repeated the data collection process for 2 times. As a result, we obtained 3 data sets, each of which includes 100 position-day serials of data, and we distributed each data set regularly with
5.2. Experimental Scheme
Suppose that we have measured regional soil respiration real data
In order to evaluate the performance of
( ( ( ( ( ( ( according to ( ( ( ( ( ( (
5.3. Experimental Results and Analysis
5.3.1. Deployment Results and Performance Analysis of TimSim
In order to demonstrate the procedure and result of

Distribution of soil humidity and soil respiration carbon flux in the experimental area.
Then the similarities of time domain features of lightweight parameter data are calculated using the data set, according to which the 100 positions are divided into groups using

Position grouping results for different group numbers.
As we can see from Figure 3, there is not any obvious regular pattern in the spatial distribution of positions in each group. Positions in each group are not all adjacent in the space and their distribution is very disperse and irregular. When comparing the lightweight parameter data of positions in the same group, their values are not very close to each other at the same time. This is because the grouping method is based on the similarities of temporal pattern of data rather than the distribution of data at a certain moment.
As we can also see from Figure 3, the selected deployment positions of soil respiration sensing nodes are not in the geographical central positions of each group, because in our
After the deployment positions are determined, the distribution of error between

Error distribution of

Error comparison of three deployment strategies.
From Figure 4 we can see that errors vary within a wide range when the count of sampling positions is small and there are large errors in some positions. For example, the biggest error among all positions reaches 4.8 in Figure 4(a). The reason is that the group count is low, and there are weak similarities between data vectors of some positions and that of the deployment positions in the same group, which brings big errors during the reconstruction process of data in these positions with weak similarities according to centroid vector. For the same reason, the mean and variance of errors of different positions are also relatively big when the group count is low, as shown in Figure 5.
With the increase of the number of sampling positions, the mean and variance of the errors decrease, as shown in Figure 5. From Figure 4 we can see that errors of most positions decrease with increasing the number of sample points with also some exceptional positions, which is caused by the alternation of directions of centroid vectors with the change of count of groups. Grouping results alter with the growth of count of groups, which makes the data vectors of some positions farther from the centroid vector directions than the former case leading to bigger error in these positions. While with the growth of count of groups, quantity of positions within a group is reduced at large, the similarities in groups become better and there are no positions with very large error. As shown in Figure 4(d), the maximum error among all positions is only 0.9.
5.3.2. Comparison of TimSim and Other Methods
In order to show the advantage of the
As can be seen from Figure 5, when the number of sampling positions is low, the mean errors of
In addition, the error variance of
When the count of sampling positions is high,
With the increase of deployment positions, number of groups increases in the
6. Conclusions
Focusing on the problem of limited number of nodes for its expensive cost in soil respiration sensor networks, we proposed a new deployment strategy based on the time domain similarity of lightweight parameters,
In the proposed method, lightweight parameters data need to be measured densely and regularly in advance, which is infeasible when the area to be monitored is very large. In this case lightweight parameters can be relatively sparsely measured and then be interpolated to increase the density. Although this can reduce the accuracy of the final result, it is a feasible way anyway.
The method proposed in this paper is more convenient to be used for single factor than multiple factors of lightweight parameters, and the experiment in the paper also emphasized the guidance of a single lightweight parameter, humidity, to the deployment of soil respiration sensing nodes. When there are multiple factors, the problem is that the grouping results may be different using different lightweight parameters. A possible solution is to fuse multiple lightweight parameters as a complex lightweight parameter and analyze its temporal similarities among different positions using
Finally, although the proposed deployment method based on lightweight parameters in this paper is described as a solution to soil respiration sensor networks, this idea for deployment is applicable and can provide a reference and guidance to other applications of wireless sensor networks.
Footnotes
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
This study is supported by the NSF China under Grant nos. 61190114 and 61303236, the State Bureau of Forestry 948 Project under Grant no. 2013-4-71, and Zhejiang Provincial Science Technology Plan Projects Key Science Technology Specific Project under Grant no. 2012C13011-1.
