Sage Journals: Discover world-class research

Abstract

Data aggregation scheduling for variable aggregation rate model has wide application and should take network lifetime and energy efficiency into consideration. In this paper, the time-slot scheduling problem for the variable aggregation rate model is presented, and a time-slot scheduling integrating consideration of minimizing the energy consumption named Makeup Integer based Data Aggregation Scheduling (MIDAS) is proposed. The proposed MIDAS scheme integrates two core phases, namely, data aggregation set construction and aggregation set based scheduling algorithm. The key idea of MIDAS is to minimize the number of receiving and sending data packets in hotspot and to reduce the number of aggregated packets in network for better scheduling performance in network lifetime. Furthermore, it is also essential to increase energy utilization efficiency of the nodes in the middle layer by exploiting the remaining energy of peripheral nodes. A series of experiments are simulated to demonstrate that the proposed scheme has significantly increased the network lifetime and the energy utilization efficiency under the different aggregation rates and different network scales. Comparing with the SDAS, the lifetime can be increased by as much as 25%. The energy utilization efficiency can be improved by as much as 30%.

1. Introduction

Wireless sensor networks (WSNs) have captured considerable attention recently due to their enormous potential for environmental monitoring, surveillance operations, and industrial automation [1–5]. In wireless sensor networks, data aggregation is an important method of improving transmitting efficiency, and most existing researches have mainly focused on energy consumption [6–11] or transmission latency [12–19]. Some works have investigated the energy latency tradeoff [20–22].

In most past researches, data aggregation refers to the situation in which data packets meet at a node in the routing procedure and they are aggregated into one new data packet. Based on the assumption of transmitting one data packet in one time-slot, each node needs only one time-slot to transmit this one data packet. In real-life application, n nodes are aggregated into multidata packets. So multislots need to be assigned for one node, which may result in the allocation complexity of time-slot scheduling. And the following factors should be comprehensively considered.

(1) Interference. Each sensor is equipped only with a single radio transceiver, sending and receiving cannot be carried out simultaneously. The interference range r is the maximum distance within which nodes in receiving mode will be disturbed by an unrelated transmitter, thus suffering a loss. If a node hears more than one message at the same time, it can receive none of them correctly; therefore this causes a collision. Thus, two links are interfering if the receiver of at least one link is within the range of the transmitter of the other link.

(2) Time Constraints. Node scheduling should obey a sequential order. The time-slots are not necessarily continuously allocated for one node. A time-slot can be used for different purposes. It can be used to transmit its original data as well as the aggregated data. It can also be used to transmit other nodes' original data as well as the aggregated data. Thus, how to allocate multislots for one node will be a challenging issue.

(3) Energy Consumption. Because of the limitability of energy in WSNs, the energy consumption of sensor nodes should be considered in the process of data transmission [23–25]. Therefore, it is our major concern how to design an effective TDMA based aggregation scheduling algorithm, which can minimize the number of receiving and sending data packets in network to optimize the energy of WSNs.

Therefore, it is a challenge to study on data aggregation scheduling combining time-slot and variable aggregation rate. The main contributions of our work are as follows.

(1) This paper addressed the time-slot scheduling problem for the variable aggregation rate model. In the model, each node aggregates the original data of its child nodes and its own original data based on the given aggregation rate and then packs the aggregation result into m new data packets ( $m \in [1, n]$ ) and forwards them.

(2) This paper presented a Makeup Integer based Data Aggregation Scheduling (MIDAS) integrating to minimize the energy cost. Based on the aggregation model and the given aggregation rate, we select some nodes to construct multiple aggregation sets so that the size of each set is as close to its round-up integer as possible. And then the number of aggregated packets in the network is reduced for better scheduling performance. This method can decrease the energy consumption near the sink, increase the energy utilization efficiency, and improve the network lifetime. The key challenge is then to construct aggregation set. The aggregation set guarantees each node to aggregate once in the set. The nodes within the same aggregation set aggregate at the aggregation node in one sample cycle. And the nodes except aggregation node only forward the data from the upper stream sensors in the aggregation set. After the establishment of the aggregation set, the next step is to solve the problem of time-slot scheduling. In the schedule algorithm, the nodes in hotspots would hold on transmission and accumulate their data before sending them to sink at once.

Constructing the aggregation set with a round off thought can reduce the number of the aggregated data packets of the nodes in the first layer near the sink and decrease the energy consumption of the nodes in this area. For no aggregated nodes in the area far from the sink, the number of transmission data is not smaller but more than that near the sink. Thus, the remaining energy far away from the sink is fully utilized, which increases the energy utilization efficiency of the nodes in the middle layer and improve the energy utilization efficiency of the network.

(3) The results of simulation on the random generated tree topologies show that the time-slot scheduling proposed in this paper could realize the dual goals of improving the network lifetime and increasing energy efficiency. Comparing with the simple time-slot scheduling, the lifetime can be improved by as much as 25%. The energy utilization efficiency can be increased by as much as 30%.

The rest of this paper is organized as follows: In Section 2, the related works are reviewed. In Section 3, we discuss the research motive. The system model is described in Section 4. In Section 5, a data aggregation scheduling scheme for variable aggregation rate is presented. The simulation results and performance are analyzed in Section 6. Finally, we conclude in Section 7.

2. Related Work

The current data aggregation scheduling schemes are discussed in this section. Most previous works have mainly focused on energy-saving issue and it has been investigated in [6–11]. Krishnamachari et al. [6] illustrated the impact of data aggregation by comparing its performance with traditional end-to-end routing schemes. Wu et al. [8] used TDMA as the MAC layer protocol and scheduling the sensor nodes with consecutive time-slots at different radio states while reducing the number of state transitions. Wen et al. [9] proposed Heuristics for cluster construction and data aggregation routing such that total energy consumption is minimized. Mo et al. [10] presented a stochastic sensor selection algorithm that randomly selects a subset of sensors according to a certain probability distribution. Li et al. [11] proposed a connected dominating set based transmission scheduling algorithm. There are some papers that build a data aggregation tree to control the delay in [12–19]. The minimum-latency data aggregation problem (MDAT) in wireless sensor networks is well studied and proved to be NP hard [12]. Chen et al. had designed an approximation algorithm with the delay bound of $(Δ - 1) R$ for this problem, where Δ is the maximum node degree and R is the network radius. Huang et al. [13] proposed a centralized scheduling algorithm with the delay bound of $23 R + Δ + 18$ time-slots. Wan et al. [14] extended the result in Huang et al. and designed three types of aggregation scheduling for $ρ = 1$ with latency bound $15 R + Δ - 4$ , $2 R + O (\log R) + Δ$ , and $(1 + O (\log R / \sqrt[3]{R})) R + Δ$ , respectively, where ρ is an interference radius. Yu et al. [15] proposed a distributed aggregation transmission scheduling algorithm with the delay bound of $24 D + 6 Δ - 16$ time-slots, where D is the network diameter. Xu et al. [16, 17] theoretically proved that the delay of the aggregation scheduling generated by their algorithm is at most $16 R + Δ - 14$ time-slots. Wang et al. [18] designed a scheduling algorithm, Peony-tree based data aggregation (PDA), and proved the latency bound is $15 R + Δ - 15$ . Zhang et al. have proposed distributed network estimation and decentralized aggregation scheduling algorithm [19]. It could predict the upper stream sensors' estimates which cannot be aggregated to the sink before deadlines and it is able to aggregate as much estimate information as possible from the network to the sink within delay constraints. Some works have investigated the energy latency tradeoff [20–22]. Given a deadline, they minimized the amount of missed data [20], minimized overall energy dissipation of sensor nodes [21], or minimized the maximum energy consumption [22].

However, the abovementioned algorithms are all based on the assumption that data packets from n nodes can be aggregated into one data packet or n data packets and study the time-slot scheduling under these models [26, 27]. They are not suitable for the network with variable aggregation rate.

3. Research Motivation

The tree topology of a random network with 25 nodes rooted at the sink is illustrated in Figure 1. And the node ID is marked in the figure. The arrow line represents data transmission paths and the dashed line indicates the transmission interference.

Figure 1

A randomly generated aggregation tree with 25 nodes.

Having the minimum scheduling period of one hop node set $C = 4$ , the node scheduling time-slot is assigned with the algorithm proposed in [19]. Table 1 illustrates the scheduling results. Column represents the node's ID, ( $x, y$ ) represent time-slot scheduling, where x denote the assigned time-slot at which node ID sends its data to its parent for the first time. y denote node scheduling period.

Table 1

The time-slot assignment for all nodes.

ID	1	2	3	4	5	6	7	8	9
( $x, y$ )	×	(2, 4)	(4, 4)	(1, 4)	(3, 4)	(1, 4)	(3, 8)	(7, 8)	(4, 4)

ID	10	11	12	13	14	15	16	17	18

( $x, y$ )	(7, 8)	(5, 8)	(1, 8)	(3, 8)	(1, 4)	(2, 4)	(1, 8)	(2, 8)	(8, 8)

ID	20	21	22	23	24	25	×	×	×

( $x, y$ )	(6, 8)	(1, 4)	(3, 4)	(1, 8)	(3, 8)	(1, 8)	×	×	×

The data aggregation mechanism in [19] is based on the assumption that data packets of n nodes can be aggregated into one data packet. But if the aggregated data packets are more than one, and then there have some problems as follows.

(1) A node can not only be assigned one time-slot, but also may require multiple time-slots, which will make time-slot scheduling more complicated.

For instance, there exit two data packets that are produced by node 18 after the aggregation computation by the original data of node 18, node 23, and node 24. It must be assigned two time-slots to node 18 for transmission of the two data packets. And there are two data packets that are produced by node 10 after the aggregation computation by the original data of node 10, node 16, and node 17. It must be assigned two time-slots to node 10 for transmission of the two data packets. In addition, node 10 also needs to transmit the aggregation result of its child node 18. Thus we need to assign four time-slots totally for node 10.

Therefore, previous research methods assigning only one time-slot for one node cannot be adapted to the case where n nodes are aggregated into multidata packets. How to allocate multiple time-slots for one node to transmit its own original or aggregated data as well as its child nodes' original or aggregated data will be a challenging problem.

(2) How to design an effective aggregation scheduling algorithm which can balance the energy consumption is an important issue. Furthermore, it is also essential to increase energy utilization efficiency and prolong the network lifetime by exploiting the remaining energy of peripheral nodes.

We consider the subtree rooted at node 13 in Figure 1. Several typical aggregation scheduling approaches are selected for this explanation and the data aggregation rate is 0.25 for every node. The first is as shown in Figure 2(a), assuming that one data packet is produced by node 13 after the aggregation computation by the original data of nodes 13, 19, 20, and 25. It must be assigned one time-slot to node 13 for transmission of the one data packet. The second is as shown in Figure 2(b), assuming that one data packet is produced by node 13 after the aggregation computation by the original data of node 13 and node 19. And one data packet is produced by node 20 after the aggregation computation by the original data of node 20 and node 25. It must be assigned two time-slots to node 13, where one time-slot is used to transmit the aggregation result of itself, and another is used to transmit the aggregation result of its child node 20. The third is as shown in Figure 2(c); one slot is assigned for node 13 to send its own data, one slot is assigned to transmit the data of node 19, and one slot is assigned to transmit the aggregation result of its child node 20; that is, three slots are required for node 13. The forth is as shown in Figure 2(d); one slot is assigned for node 13 to send its own data, and three slots are assigned to transmit the data of nodes 19, 20, and 25, respectively; that is, four slots are required for node 13. The digital above the arrows represents the number of packets being sent in Figure 2.

Figure 2

Several typical aggregation forms of node 13.

Assume $ϖ_{t}^{i}$ denotes the number of transmitting data packets of node i. $ϖ_{r}^{i}$ denotes the number of receiving data packets of node i. $τ_{i}$ denotes the number of slots required for node i. And, in Figures 2(a), 2(b), 2(c), and 2(d), their corresponding values are shown in Figure 3.

Figure 3

The results of an instance under different cases.

As can be seen from Figure 3, the number of receiving and sending data packets of node 13 is different when exploiting different aggregation scheduling approaches, which leads to the different number of slots required for node 13. It is obviously that the more number of forwarding packets causes the more energy cost. We can see from Figure 3 that node 13 will receive three data packets and send one data packet by exploiting the method shown in Figure 2(a). Therefore, the number of sending data packets of node 13 is the least and the energy consumption also is the lowest. So, if we design an aggregation scheduling approach which can reduce the number of receiving and sending data packets near the sink and increases the number of the transmitting data packets far away from the sink, it will balance the energy consumption and thus prolong the network lifetime.

4. The System Model and Problem Statement

4.1. Network Model

We consider a wireless sensor network $G (V, E)$ with tree topology rooted at the sink, which is constructed by using the BFS method or by other methods proposed in [15], where V is the set of nodes and E is the set of communication links. We assume that all nodes once deployed are not movable and have the same communication range r and the same initial energy. In the protocol interference model [28] on which our work is based, each node has only one transmitting frequency and it cannot send and receive data simultaneously. For any node V, it has an interference range R. As shown in (1), a receiver V of a link VU is interfered by the signal from another node P. Consider the following:

\begin{matrix} ‖V - P‖ \leq R . \end{matrix}

(1)

In this paper, nodes are called one hop neighbors if there exits interferences between them. To ensure data transmission interference free, when any node is receiving data at any one timeslot, the other one hop neighbors cannot send data except to one of child nodes. And, when any node is sending, the other one hop neighbors cannot receive packets except from its parent node. We assume that the relationship between one hop neighbors except parent-child nodes is indicated by dashed lines in the aggregation tree (the same in the following).

4.2. Energy Consumption Model

The energy consumption model in this paper is the same as [29]. The energy cost when a node sends l bits data is computed by (2). The energy cost when a node receives l bits data is computed by (3). Consider the following:

\begin{matrix} E_{t} = l E_{e l e c} + l ε_{f s} d^{2}, i f d < d_{0}, \\ E_{t} = l E_{e l e c} + l ε_{a m p} d^{4}, i f d \geq d_{0}, \end{matrix}

(2)

\begin{matrix} E_{r} = E_{r} (l) = l E_{e l e c} . \end{matrix}

(3)

$E_{e l e c}$ denotes the transmitting circuit loss energy. Both the free space ( $d^{2}$ power loss) and the multipath fading ( $d^{4}$ power loss) channel models are used. If the transmission distance is less than the threshold $d_{0}$ , the power amplifier loss is based on free-space model. If the transmission distance is larger than or equal to the threshold $d_{0}$ , the multipath attenuation model is used. $ε_{f s}$ and $ε_{a m p}$ are the energy required by power amplification in the two models. l is the number of bits in a packet.

4.3. Data Aggregation Model

For each sample cycle, all sensor nodes do sampling once. At the beginning of each sample period, each sensor node generates a packet with the sensed information. The sample period is composed of multiple integer timeslots. And we assume the source data packet of each node is generated synchronously. Actually, one data packet needs to be transmitted in one time-slot. If the calculated data packet size is less than that of one packet after aggregation, we should transmit it as one data packet. And the transmission of one data packet must be finished in one time-slot. Therefore, 1.25 packets require two time-slots for transmission and 2.5 packets need three time-slots for transmission.

Definition 1 (the number of data packets ϖ).

$ξ (m)_{i}$ denotes the total amount of data produced at node i after the aggregation computation by node i and other $m - 1$ nodes. D denotes the amount of information contained in each data packet. The number of data packets $ϖ_{i}$ that node i is transmitting is computed by

\begin{matrix} ϖ_{i} = ⌈\frac{ξ {(m)}_{i}}{D}⌉ . \end{matrix}

(4)

Namely, data packet

ϖ_{i}

is the rounding-up ratio of the total amount of data

ξ (m)_{i}

and the amount of information contained in each data packet D.

Definition 2 (data aggregation rate γ).

γ denotes data aggregation rate which is a decimal between 0 and 1. For example, $γ = 0.5$ . Traditionally, n nodes are aggregated into one packet (as grey nodes shown in Figure 4, the boxes above the arrows represent the number of packets being sent). Given γ, n nodes can be aggregated into m data packets. m is an integer, $m \in [1, n]$ (as green nodes shown in Figure 4).

Figure 4

Data aggregation model.

Definition 3 (aggregation model).

For data aggregation, one adopts the lossless step-by-step multihop aggregation model. In such aggregation model, $σ_{i}$ denotes the original data packet of node i, $φ_{i, j}$ denotes the received data packet of node i from node j, and $ϕ_{i}$ denotes the final aggregation result of node i to all incoming data and its own data.

When node i receives data $φ_{i, j}$ from node j, if current data packet of node i is $σ_{i}$ , and data from j is $φ_{i, j} = σ_{j}$ , namely, the data to be aggregated is both origin data, then the aggregation formula follows

\begin{matrix} ϕ_{i} = σ_{i} \times γ + φ_{i, j} \times γ . \end{matrix}

(5)

If the current data packet of node i is

σ_{i}

, and data from j is

φ_{i, j} = ϕ_{j}

, namely, the received data packet of node i is not source data when being aggregated, the aggregation formula follows

\begin{matrix} ϕ_{i} = σ_{i} \times γ + φ_{i, j} . \end{matrix}

(6)

For example, as green nodes depicted in Figure 4, we assume that the packet size is $5 * 10^{5}$ bits, when $γ = 0.5$ ; according to (4), (5), and (6), we get

\begin{matrix} ϕ_{A} = σ_{A} \times 0.5 + σ_{B} \times 0.5 + φ_{A, C}, \\ φ_{A, C} = ϕ_{C} = σ_{C} \times 0.5 + σ_{D} \times 0.5 + σ_{E} \times 0.5 . \end{matrix}

(7)

The number of data packet needs to be transmitted by node C as in the following:

\begin{matrix} ϖ_{C} = ⌈\frac{(σ_{C} + σ_{D} + σ_{E}) \times γ}{D}⌉ = ⌈\frac{(5 * 10^{5} + 5 * 10^{5} + 5 * 10^{5}) \times 0.5}{5 * 10^{5}}⌉ = 2 . \end{matrix}

(8)

Thus, we need to assign two time-slots for node C. The number of data packet needs to be transmitted by node A as in the following:

\begin{matrix} ϖ_{A} = ⌈\frac{(σ_{A} + σ_{B}) \times γ}{D}⌉ + ϖ_{C} = ⌈\frac{(5 * 10^{5} + 5 * 10^{5}) \times 0.5}{5 * 10^{5}}⌉ + 2 = 3 . \end{matrix}

(9)

Therefore, node A needs to be assigned three time-slots for transmitting the three packets.

4.4. Problem Statements

The objective is to design a Data Aggregation Scheduling (MIDAS) scheme for variable aggregation rate WSNs. The scheduling time of the node i is denoted by $T s f t_{j}$ . When being first scheduled, the relay node transmits its own original data packet to its parent directly. The aggregation node aggregates its own data with data received from its child nodes, and then it transmits the aggregated data packets to its parent directly. In other scheduled time slots, all nodes forward the received data from their related child nodes to parents.

We use the energy cost model presented in Section 4.2 to calculate the network energy consumption. In this paper, network lifetime is defined as the time when the first node dies. After the first node dies in the network, it could seriously affect the connectivity and coverage of network, so that the network cannot fully play its due role. Considering all nodes in the network, the network lifetime is represented by (10), where $E_{i}$ represents the energy consumption of node i, and T denotes the network lifetime. Consider the following:

\begin{matrix} T = \underset{0 < i \leq n}{m i n} (E_{i}) . \end{matrix}

(10)

The energy utilization efficiency is defined as the ratio of the utilized energy to initial energy when the network dies. The energy utilization efficiency is represented by (11), where

E_{c o n s}^{i}

represents the energy consumption of node i.

$E_{i n i t}^{i}$ denotes the initial energy of node i and η denotes the energy utilization efficiency:

\begin{matrix} η = \frac{\sum_{i \in n}^{} E_{c o n s}^{i}}{\sum_{i \in n}^{} E_{i n i t}^{i}} . \end{matrix}

(11)

There is a complex relationship among transmitting data packet, energy consumption of nodes, and network lifetime. So we try to reduce data packets transmitted in the network, especially packets in the area near the sink. In the schedule algorithm, the nodes in hotspots would hold on transmission and accumulate their data before sending them to sink at once. This could realize the dual goals of increasing the amount of information aggregated to sink and decreasing the number of the transmitting data packets in network. Thus, according to (4) in Section 4.3, we select m packets to aggregate at the aggregation node i, the minimum value of m is represented by

\begin{matrix} \min (m) = \min (⌈\frac{ξ {(m)}_{i}}{D}⌉ - \frac{ξ {(m)}_{i}}{D}) . \end{matrix}

(12)

The work of schedule is to assign multiple time-slots to each node in the network to ensure interference free and the network lifetime maximization and the energy utilization efficiency maximization. In conclusion, the optimization goal of this paper is expressed in the following formula, where $E_{l e f t}^{i}$ represents the residual energy of node i:

\begin{matrix} \max (T) = m i n \max_{0 < i \leq n} (E_{i}), \\ \max (η) = \min (\frac{\sum_{i \in n}^{} E_{l e f t}^{i}}{\sum_{i \in n}^{} E_{i n i t}^{i}}), \\ \min (m) = \min (⌈\frac{ξ {(m)}_{i}}{D}⌉ - \frac{ξ {(m)}_{i}}{D}) . \end{matrix}

(13)

5. Scheme Design

5.1. Construct the Aggregation Set of Nodes

Definition 4 (aggregation set).

Given an integer m and a series set $S = {S_{1}, S_{2}, \dots, S_{k}}$ , $\forall S_{i} \subset S, L_{i} \leq m, ⋃_{i = 1}^{k} S_{i} = \{1,2, \dots, n\}$ , n is a sensor node in WSNs, and $L_{i}$ is the number of $S_{i}$ contained. We call an aggregation set if and only if $\forall S_{i}, S_{j} \subset S, S_{i} \cap S_{j} = ϕ$ . $S_{i}$ is called an aggregation set. For instance, when γ = 30%, $S = {S_{1}, S_{2}, \dots, S_{k}}$ is shown in Figure 5(c). The colored nodes mean that they are in different $S_{i}$ .

Figure 5

The aggregation set results of Figure 1.

Characteristic 1. If $L_{i} \neq 1, \exists x \in S_{i}, \forall y \in S_{i}, x \neq y$ , y is a child-node in subtree rooted at x. x is the aggregation node; it will aggregate its own original packets with received packets from its child nodes in $S_{i}$ into new data packets and then transfers it out. y is a relay node; it will only send its own data packet and forward received packets to its parent without aggregation.

Characteristic 2. According to the Characteristic 1, we know that, $\forall x \in S_{i}$ , if z is a child-node in subtree rooted at x and z is an aggregation node in $S_{j}, i \neq j$ , then x will only forward the aggregation result of z without aggregation repeat.

Our aggregation scheduling algorithm is based on an aggregation tree. Assume the maximum level number of the aggregation tree is H. $L_{i}$ denotes nodes in each layer i. For example, from Figure 1, we can see that the sink node is in the layer 0; nodes 6, 7, 8, and 9 are in layer 2. There are totally 5 layers in this example.

According to (12) in Section 4.4, we can get

\begin{matrix} \exists M_{0} \in [1,2, 3, \dots, n], \\ \underset{m \to M_{0}^{-}}{l i m} γ \times m = N, \\ N \in [1,2, 3, \dots, n], \end{matrix}

(14)

where

M_{0}, N, m

are integers and

N, m

are minimum values to satisfy (14). In the actual network, if the number of nodes is less than m and then we select

m^{'}

nodes to construct a set,

m^{'}

is close to m.

The parameters used in Algorithms 1–6 are as shown in Table 2. Consider $j \in [1, n]$ , $k \in [1, K]$ .

Table 2

The parameters used in Algorithms 1–6.

Symbol	Meaning	Symbol	Meaning
$N s e t [j]$	Aggregation set of node j belong to	$D_{j}$	The interference node set of node j

$S_{k}$	The kth aggregation set	$n o c_{j}$	The number of j's child nodes

$F_{j}$	Father node of j	$S_S O R T_{k}$	Resorted the kth aggregation set

$B_{j}$	Brother nodes of j	$T s f t_{j}$	Node j schedule time-slots

$S_B J_{k}$	The flag representing whether set k is accessed	$N S_{k}$	The number of nodes in the kth aggregation set

$T s f t_B J_{j}$	The flag representing whether nodej is assigned slot for sending original data	$T s f t_F B J_{j}$	The record that node j is assigned slot for sending aggregated data

Algorithm 1: Constructing the aggregation set (CAS).

$(1)$ Obtain m according to (14),

$(2)$ Set $K \leftarrow 1$ ; $S_{k} \leftarrow φ$ ; $N s e t [j] \leftarrow 0, j \in (1, n)$ ;

(3) FOR $i \in [H, 1]$ // i denote the ith layer

$(4)$ FOR each node $j \in L_{i}$

(5) IF node $j \notin \{S_{k} | k \in [1, K]\}$ $/ /$ if node j is not in any $S_{k}$

(6) $k \leftarrow K$ ; // k denote the kth aggregation set

(7) $S_{k} \leftarrow S_{k} \cup {j}$ ; // put node j in $S_{k}$

(8) $N s e t [j] \leftarrow k$ ; // j belong to the kth aggregation set

$(9)$ calculate the number of nodes L in set $S_{k}$ ;

( $10$ ) While $L < m$

( $11$ ) Call Search_fnode( $j; S_{k}; K; N s e t; m$ );

(12) Call Search_bnode( $j; S_{k}; K; N s e t; m$ );

(13) End while

(14) Output $S_{k}$ ;

(15) $K \leftarrow K + 1$ ;

(16) End If

(17) End For

(18) End For

(19) $K \leftarrow k$ ;

( $20$ ) Output the number K of aggregation set.

Algorithm 2: Search_fnode( $j; S_{k}; K; N s e t; m$ ).

(1) Find the father node of j is $F_{j}$ ;

(2) IF $F_{j}$ is not in any $S_{k}$ and it is not the sink node

(3) $S_{k} \leftarrow S_{k} \cup {F_{j}}$ ; // put the father node of j in $S_{k}$

(4) $N s e t [F_{j}] \leftarrow k$ ; // $F_{j}$ belong to the kth aggregation set

(5) calculate the number of nodes L in set $S_{k}$ ;

(6) IF $L = m$

(7) break;

(8) End If

(9) End If

(10) Send the message $S_{k}, N s e t$ to CAS.

Algorithm 3: Search_bnode( $j; S_{k}; K; N s e t; m$ ).

$(1$ ) Find the brother node of j is $B_{j}$ ;

(2) IF $B_{j}$ is not in any $S_{k}$

(3) $S_{k} \leftarrow S_{k} \cup {B_{j}}$ ; $/ /$ put the brother nodes of j in $S_{k}$

(4) $N s e t [B_{j}] \leftarrow k$ ; $/ /$ $B_{j}$ belong to the kth aggregation set

(5) calculate the number of nodes L in set $S_{k}$ ;

(6) IF $L = m$

(7) break;

(8) End If

(9) End If

(10) IF $N s e t [F_{j}] = N s e t [j]$ $/ /$ j and $F_{j}$ are in the same aggregation set

(11) $j \leftarrow F_{j}$ ; $/ /$ set $F_{j}$ as the current node.

(12) Else

(13) break;

(14) End If

(15) Send the message $S_{k}, N s e t, j$ to CAS.

Algorithm 4: Time-slot allocation for nodes (TSAN).

$(1$ ) resorted aggregation set $S_{k}$ and get $S_S O R T_{k}, k \in [1, K]$ ;

// The nodes are arranged layer by layer downward, and those in the same layer are arranged by node's ID

(2) $S_B J_{k} \leftarrow 0$ , $T s f t_B J_{j} \leftarrow 0$ , $T s f t_F B J_{j} \leftarrow 1$ , $T s f t_{j} \leftarrow 0$ ;

(3) FOR $i \in [1, H]$ // i denote the ith layer

(4) FOR each node $j \in L_{i}$

(5) IF $T s f t_B J_{j} = 0$ $/ /$ j is not assigned slot for sending original data

(6) Call Tsft_Set( $j, N s e t, S_S O R T_{k}, G, n, T s f t, T s f t_B J, D$ );

(7) End If

(8) End For

(9) End For

(10) FOR $i \in [1, H]$ // i denote the ith layer

(11) FOR each node $j \in L_{i}$

(12) IF $n o c_{j} \neq 0$ // j has child nodes

(13) Call Tsft_Forw( $j, N s e t, S e t_B J, T s f t, T s f t_F B J, D$ );

(14) End If

(15) End For

(16) End For

Algorithm 5: Tsft_Set( $j, N s e t, S_S O R T_{k}, G, n, T s f t, T s f t_B J, D$ ).

(1) $k \leftarrow N s e t [j]$ and get $S_S O R T_{k}$ ;

(2) Calculate the number of nodes $N S_{k}$ in the $S_S O R T_{k}$ ;

(3) IF $N S_{k}$ = 1

(4) assign a time-slot to j which meet C3;

(5) End If

(6) IF $N S_{k}$ = 2

(7) assign the minimum available time-slot to the child node of j in $S_S O R T_{k}$ ;

(8) assign a time-slot to j which meet C3;

(9) End If

( $10$ ) IF $N S_{k}$ >= 3

( $11$ ) FOR each node x except j in $S_S O R T_{k}$

( $12$ ) IF $N s e t [F_{x}] = k$ $/ /$ if x and its father are in the same aggregation set

( $13$ ) assign a time-slot to x which meet C1;

(14) $y \leftarrow x$ ;

(15) While $F (y) \neq j$

(16) assign a time-slot to $F_{y}$ which meet C2;

(17) $y \leftarrow F_{y}$ ;

(18) End While

(19) End If

( $20$ ) End for

( $21$ ) assign a time-slot to j which meet C3;

(22) End If

(23) Send the message $T s f t, T s f t_B J$ to TSAN.

Algorithm 6: $Tsft_Forw (j, N s e t, S e t_B J, T s f t, T s f t_F B J, D$ ).

(1) For each node e in the children set of j

(2) IF $N s e t_{j} \neq N s e t_{e}$ // node j and its child node e are not in the same aggregation set

(3) $k \leftarrow N s e t_{e}$ ; // e belong to the kth aggregation set

(4) IF $S_B J_{k} = 0$ // the kth aggregation set is not accessed

(5) $z \leftarrow T s f t_F B J_{e}$ ;

(6) $T s f t_F B J_{j} \leftarrow T s f t_F B J_{j} + z$ ;

(7) FOR $s = z$ downto 1

(8) assign z time-slots which meet C4, C5 to j for forwarding e's aggregated data;

(9) End For

(10) $S_B J_{k} \leftarrow 1$ ; // set aggregation set k is accessed

(11) Else

(12) $z \leftarrow T s f t_F B J_{e}$ ;

(13) $T s f t_F B J_{j} \leftarrow T s f t_F B J_{j} + (z - 1)$ ;

(14) FOR $s = z - 1$ downto 1

( $15$ ) assign $z - 1$ time-slots which meet C4, C5 to j for forwarding e's aggregated data;

(16) End For

( $17)$ End If

(18) Else

(19) $z \leftarrow T s f t_F B J_{e}$ ;

(20) IF $z \geq 2$

(21) $T s f t_F B J_{j} \leftarrow T s f t_F B J_{j} + (z - 1)$ ;

(22) FOR $s = z - 1$ downto 1

(23) assign $z - 1$ time-slots which meet C4, C5 to j for forwarding e's aggregated data;

(24) End For

(25) End If

(26) End If

(27) End For

(28) Send the message $S e t_B J, T s f t, T s f t_F B J$ to TSAN.

The main idea is to select m data packets to ensure the total amount of aggregated data close to an integer. The principle of selecting m packets for aggregation is as follows.

Given aggregation rate γ, for convenience, we assume $N = 1$ , and then we can get m according to (14). We search the tree from bottom to the top layer and put a node j into an aggregation set if it does not belong to any sets. Calculate the number of nodes L in this set. If L is less than m and the father node $F_{j}$ is not the sink node, then put $F_{j}$ into this set as well. Calculate the number of nodes L in this set again. If L is less than m, the brother nodes $B_{j}$ are put into this set. Calculate the number of nodes L in this set repeat. If L is still less than m, then set the father node $F_{j}$ as the node j. Repeat the above steps until L is equal to m or the father node is the sink or the father node is already in other aggregation sets.

The pseudo code of the data aggregation set construction (CAS) process is presented in Algorithm 1. It includes a Search_fnode subfunction and a Search_bnode subfunction, which are shown in Algorithms 2 and 3, respectively.

Additionally, CAS has achieved higher time efficiency as each node would be accessed at most twice. When first being accessed, a node will be put into an aggregation set. When being accessed again, a node will be inquired and it is no longer considered when it has already existed in an aggregation set. Accordingly, CAS offered optimized algorithm for the data aggregation set construction, and time complexity of which was polynomial $O (n)$ . n denotes the number of nodes in network.

For example, exploiting our proposed Algorithm 1, a series aggregation set $S = {S_{1}, S_{2}, \dots, S_{k}}$ of Figure 1 is as shown in Figure 5. The colored nodes represent that they are in different aggregation sets. γ represent aggregation rate, m represent the optimized number of packets for aggregation, and K represent the number of aggregation set.

5.2. The Design of MIDAS

In this part, we designed an aggregation set based scheduling algorithm (TSAN), which assigns time-slots for all the nodes in the network. The approach contains two steps.

The purpose of the first step (Tsft_Set) is assigning one or multislots to nodes except aggregation node in the set. In this case, leaf node finishes its original data transmission in the assigned time-slot. And other nodes transmit their original data in the earliest assigned time-slots and forward the received data from the upper stream nodes in the same set in other time-slots. In addition, it needs to allocate one slot for aggregation node to transmit the aggregated data of the aggregation set.

The purpose of the second step (Tsft_Forw) is assigning one slot or multislots to nodes in the journey of the aggregation node to sink for forwarding the aggregated data of the aggregation set.

We search the tree layer by layer downwards in the first step. If there exists a node j that has not assigned a time-slot, we get the aggregation set $S_S O R T_{k}$ including node j. And, then, we begin to assign slots for each node in this set. In the aggregation set, time-slot allocation of nodes needs to meet the following constraints: (C1)

The time-slot of child nodes for transmitting its original data is later than the time-slot of its father transmitting its original data. Namely, the first time-slot for transmission of the child node is later than the first time-slot for transmission of the father node.

(C2)

Node i has been assigned one time-slot $t s f t_{i}$ , if the father $F_{i}$ of node i is not an aggregation node, and then we need to assigned one time-slot $t s f t_{F_{i}}$ to $F_{i}$ , and $t s f t_{F_{i}}$ is later than $t s f t_{i}$ . And then set $F_{i}$ is as i, it needs to assign time-slot to $F_{i}$ until $F_{i}$ is an aggregation node.

(C3)

The time-slot assigned for aggregation node is later than that of its children in the same aggregation set and of its father node in the different aggregation sets.

In the above time-slot allocation, the relay nodes transmit more data as they do not aggregate while only forwarding the data received; this property makes full use of the remaining energy of peripheral nodes. And the aggregation nodes hold on and wait for data collection to a certain amount before transmitting to its father node at once. While because the nodes in the first layer near the sink are always aggregation nodes, so the data packets transmitted from the one hop neighbors to sink are greatly decreased because of aggregation. Furthermore, it also decreases the energy consumption in this area.

The first step finished, and then we start the second. We search the tree from bottom to the top layer in the second step. If there exists a node j and it has a child node $C_{j}$ which is an aggregation node, j and $C_{j}$ belong to different aggregation sets. Time-slot allocation needs to meet the following constraints: (C4)

One time-slot $t s f t_{j}$ is assigned to j for forwarding the aggregated data of $C_{j}$ , and $t s f t_{j}$ is later than slot of $C_{j}$ .

(C5)

One time-slot $t s f t_{F_{j}}$ is assigned to father node $F_{j}$ of j for forwarding the aggregated data of $C_{j}$ , and $t s f t_{F_{j}}$ is later than $t s f t_{j}$ . And then set $F_{j}$ is as j; it needs to assign time-slot to $F_{j}$ until $F_{i}$ is the sink.

In this way, all the nodes have been assigned time-slots. The pseudo code of the data aggregation scheduling is presented in Algorithm 4. It includes a Tsft_Set subfunction and a Tsft_Forw subfunction, which are shown in Algorithms 5 and 6, respectively.

We assume n denotes the number of nodes in the network. m denotes the number of nodes in an aggregation set. Firstly, we try to discuss time needed for a node to find an idle time-slot within an aggregation set. In a set, time required for sending all original data to the aggregation node is $O (n m)$ at most. The number of aggregation sets changes from $n / m$ to n and the highest level of the set is m. We assume the worst case is n and each node needs to take up m time-slots of transmitting its original data to the aggregation node. So time complexity of making the set traversing for a node to find available time-slot is polynomial $O (n m)$ . Secondly, when we find available time-slot for a node, the interference should be considered. In the worst-case scenario, each node may occupy m time-slots, and all assigned time-slots of n nodes must be compared. Thus the number of comparisons is $O (n m)$ . For a node to finish the above two steps, the time complexity of Tsft_Set is $O (n \times (n m \times n m)) = O (n^{3} m^{2}) = O (n^{3})$ . m is a constant. It can be seen that Tsft_Forw has a similar analysis to the Tsft_Set, and Tsft_Forw is of smaller scale than Tsft_Set. Thus the time complexity is $O (n^{3})$ too.

Based on the above analysis, the time complexity of TSAN is $O (n^{3} + n^{3}) = O (n)$ . Actually, the number of interference nodes is no more than n and the time-slots required for each node are no more than m too. So the time complexity is smaller than $O (n^{3})$ .

Figure 6 gives the node set T for each time-slot during which all the member nodes of T synchronously send packets to their corresponding parent nodes. For example, when $γ = 0.15$ , a cycle time is composed of 18 time-slots in Figure 6(a). Node 3 is assigned three slots of 13, 17, and 18. Packets from nodes 3, 8, 13, 19, 20, and 25 can be aggregated into one data packet and transmitted to the sink in the time-slot 13. The final aggregated data of node 10 can be transmitted to the sink in the time-slot 17, and the aggregated data of node 7 can be transmitted to the sink in the time-slot 18.

Figure 6

The node set T for each time-slot under different aggregate rates.

6. Performance Evaluation

In this section, compared with a non-round-up integer time-slot allocation approach named SDAS, the performance of the proposed MIDAS approach under different aggregation rates is evaluated. The aggregation rate of each sensor node is fixed to 0.15, 0.25, 0.3, and 0.5, respectively. The simulation is conducted on the platform of MATLAB 7.0 and the performance of the scheme is evaluated in a wireless sensor network with tree topology. And a child-parent relationship has been established. In each sample period, each node is sampled once and generates its original data packet synchronously at each sample initial time 0. The amount of information contained in an origin data packet is assumed to be 1 unit. And the number of aggregated data packets is calculated by (4). Table 3 shows the parameters and corresponding values in the network. Moreover, we assume the energy of sink is infinite and all the other nodes have the same initial energy 2 J. The energy cost only takes place in the case of receiving or transmitting data packet, which is calculated by (2) and (3). The energy utilization efficiency can be calculated by (11).

Table 3

Network parameters.

Symbol	Value	Symbol	Value
Initial energy $E_{init} /$ J	2	Packet size δ/bytes	5 ∗ 10⁵
Threshold distance $d_{0}$ /m	87	$e_{f s} /$ pJ⋅(b⋅m²)⁻¹	10 $pJ$
Sensing range $r_{s} /$ m	10	$e_{amp} /$ pJ⋅(b⋅m⁴)⁻¹	0.0013
$E_{fusion}$ /nJ⋅b⁻¹	5	$E_{elec} /$ nJ⋅b⁻¹	50

6.1. Node Scheduling Time-Slot Assignment

Figure 1 shows a randomly generated aggregation tree. Except the sink, distance from one node to its parent in the aggregation network is as shown in Table 4. Exploiting the proposed MIDAS, the result of node scheduling time-slot assignment of the aggregation tree shown in Figure 1 is presented in Table 5. The scheduling result is denoted by time-slots of each node to transmit. As can be seen in Table 5, each node has been assigned one slot or multislots. These time-slots of one node can be used for different purposes. For example, when the aggregation rate $γ = 15 %$ , the first time-slot to transmit for node 7 is the 14th time-slot; node 7 is sampled and aggregates with incoming data packets from nodes 11 and 12, and then it transmits the aggregated packet to its parent directly in the 14th time-slot. The second time-slot to transmit for node 7 is the 16th time-slot. And node 7 transmits the aggregated data packet of node 10 to its parent directly in the 16th time-slot. Exploiting algorithm SDAS, the result of node scheduling time-slot assignment of Figure 1 is presented in Table 6.

Table 4

Parent-child relationships and distances from child to parent node in the aggregation tree of Figure 1.

Node ID	1	2	3	4	5	6	7	8	9	10	11	12	13
Parent ID	×	1	1	1	1	2	3	3	5	7	7	7	8
Distance (m)	×	91	95	56	96	82	55	64	77	98	98	58	99

Node ID	14	15	16	17	18	19	20	21	22	23	24	25	×

Parent ID	9	9	10	10	10	13	13	15	15	18	18	20	×
Distance (m)	98	74	90	57	71	96	90	98	83	52	92	97	×

Table 5

Node scheduling of MIDAS of Figure 1 under different aggregation rates.

ID	1	2	3	4	5	6	7	8	9	10	11	12	13
γ = 15%	×	2	13, 17, 18	4	14	1	14, 16	1, 3, 6, 9, 12	1, 5, 8, 11, 13	15	1	3	2, 5, 8, 11
γ = 25%	×	2	8, 17, 18, 19, 20, 21	1	3, 10, 11	1	3, 5, 7, 14, 15, 16	9, 11	7, 9	9, 12, 13	4	6	10
γ = 30%	×	2	3, 12, 13, 14, 15, 16	4	5, 8	1	5, 9, 10	1, 8, 11	1, 3, 7	6, 8	1	4	5, 7
γ = 50%	×	2	3, 18, 19, 20, 21, 22, 23, 24, 25	4	1, 5, 9, 10	1	5, 9, 14, 15, 16, 17	1, 7, 10	3, 7, 8	6, 11, 12, 13	1	7	5, 8

ID	14	15	16	17	18	19	20	21	22	23	24	25	×

γ = 15%	3	6, 9, 12	2	3	4, 6, 8	4	7, 10	7	10	5	7	9	×
γ = 25%	8	1, 4, 6	10	11	1, 6, 8	1	2, 4	2	5	2	7	3	×
γ = 30%	2	6	2	3	7	6	2, 4	2	4	1	2	3	×
γ = 50%	2	4, 6	2	7	8, 10	2	6	1	5	1	9	1	×

Table 6

Node scheduling of SDAS of Figure 1 under different aggregation rates.

ID	1	2	3	4	5	6	7	8	9	10	11	12	13
γ = 15%	×	2	4, 13, 14, 15, 16, 17, 18, 19	1	3, 8, 9, 10	1	3, 6, 10, 11	7, 9, 12	4, 6, 7	7, 9	5	1	3, 8
γ = 25%	×	2	4, 13, 14, 15, 16, 17, 18, 19	1	3, 8, 9, 10	1	3, 6, 10, 11	7, 9, 12	4, 6, 7	7, 9	5	1	3, 8
γ = 30%	×	2	4, 13, 14, 15, 16, 17, 18, 19	1	3, 8, 9, 10	1	3, 6, 10, 11	7, 9, 12	4, 6, 7	7, 9	5	1	3, 8
γ = 50%	×	2	4, 17, 18, 19, 20, 21, 22, 23, 24, 25	1	3, 8, 9, 10	1	3, 6, 13, 14, 15, 16	7, 9, 12	4, 6, 7	7, 10, 11, 12	5	1	3, 8

ID	14	15	16	17	18	19	20	21	22	23	24	25	×

γ = 15%	1	2, 5	1	2	8	1	6	1	3	1	3	1	×
γ = 25%	1	2, 5	1	2	8	1	6	1	3	1	3	1	×
γ = 30%	1	2, 5	1	2	8	1	6	1	3	1	3	1	×
γ = 50%	1	2, 5	1	2	8, 9	1	6	1	3	1	3	1	×

6.2. Energy Utilization Efficiency

We compare the energy utilization efficiency property of the proposed MIDAS approach with that of SDAS.

First, based on the result of node scheduling time-slot assignment presented in Section 6.1, we could obtain the residual energy of each node in network of Figure 1 when the network dies, which is shown in Figures 7 and 8, respectively.

Figure 7

The residual energy of the MIDAS of Figure 1.

Figure 8

The residual energy of the SDAS of Figure 1.

Also, we consider the case when the number of nodes in the network is increased to 100 randomly. The network parameters and aggregation scheduling algorithm are the same as the ones above. Figures 9 and 10 show the residual energy of each node by running the two algorithms, respectively.

Figure 9

The residual energy of the MIDAS with 100 nodes.

Figure 10

The residual energy of the SDAS with 100 nodes.

We can see that there is an obvious fluctuation characteristic in Figures 7 and 9. The nodes that consume more energy are not always in hotspot when the network dies. Energy dissipation is larger for most sensors in the middle layer of network, during the entire execution of the data transmission. For example, node 10 and node 13 in Figure 7 consume more energy. But in Figures 8 and 10, it can be seen that when the network dies, the nodes which consume more energy are always in the area near the sink under no matter what aggregation rate. This means that the energy consumption is larger in the area near the sink but is smaller in the area far from the sink. From Figures 7–10, we can see that our MIDAS approach can make full use of the remaining energy of peripheral nodes than SDAS.

Second, we compare the energy utilization efficiency of our MIDAS approach with that of SDAS under the different aggregation rates and different network scales. Simulations are conducted on the network with nodes of 25, 50, 75, and 100, respectively. The results are shown in Figures 11, 12, 13, and 14, respectively.

Figure 11

The comparison of the energy utilization efficiency under 0.15 aggregation rate.

Figure 12

The comparison of the energy utilization efficiency under 0.25 aggregation rate.

Figure 13

The comparison of the energy utilization efficiency under 0.3 aggregation rate.

Figure 14

The comparison of the energy utilization efficiency under 0.5 aggregation rate.

Figures 11, 12, 13, and 14 compare the energy utilization efficiency of the network by using the two algorithms when the number of nodes varies. It can be seen from the figure that the energy utilization efficiency of the proposed MIDAS approach outperforms that of SDAS significantly. The energy utilization efficiency of MIDAS algorithm is mostly about 25%–45%, but that of SDAS algorithm is mostly about 15%–25%. The energy utilization efficiency of MIDAS algorithm is improved by 30% compared with the SDAS algorithm. This is because some nodes in the middle layer consume more energy for transmitting more data packets. Thus the residual energy of these nodes decreases when the transmission data packets increase. Therefore, the remaining energy far away from the sink is fully utilized, which increases the energy utilization efficiency of the network.

From Figures 7–14, we can see that our MIDAS approach can make full use of the remaining energy of peripheral nodes than SDAS so that performance of our method is better than that of SDAS in the energy utilization efficiency.

6.3. Network Lifetime

The network lifetime is an important metric to evaluate our approach. We compare the network lifetime of the proposed MIDAS approach with that of SDAS under the different aggregation rates and different network scales. Figures 15, 16, 17, and 18 show the comparison of network lifetime under the two scheduling approaches, respectively.

Figure 15

The comparison of the network lifetime with 25 nodes.

Figure 16

The comparison of the network lifetime with 50 nodes.

Figure 17

The comparison of the network lifetime with 75 nodes.

Figure 18

The comparison of the network lifetime with 100 nodes.

From Figures 15–18, we can see that the proposed MIDAS approach has a better performance in the aspect of network lifetime. The network lifetime of MIDAS is improved by 20% compared with that of SDAS under the same aggregation rate and the same network scales. This is because the proposed MIDAS approach has the ability to reduce the number of receiving and sending data packets near the sink for decreasing the energy consumption in this area by selecting reasonable nodes to aggregate. The aggregation nodes hold on transmission and accumulate their data before sending them to sink at once. Consequently, the energy consumption is reduced because the data needed to be scheduled in the network is reduced. This property is important since it prolongs the network lifetime by avoiding early energy depletion of sensors. Moreover, the improvement of network lifetime will be larger with the increase of the aggregation rate. This is because the number of data packets gathered by the sink decreases when the aggregation rate of nodes increases.

7. Conclusion

This paper focuses on the data aggregation scheduling combining time-slot and variable aggregation rate. Based on the aggregation model and the given aggregation rates, we proposed an efficient aggregation approach, which includes two coupled parts: aggregation set construction and an aggregation scheduling algorithm. The proposed approach can not only minimize the number of receiving and sending data packets in hotspot but also reduce the number of aggregated packets in network for better scheduling performance in network lifetime. Furthermore, it is also essential to increase energy utilization efficiency of the nodes in the middle layer by exploiting the remaining energy of peripheral nodes. Our simulation results verify the effectiveness of the MIDAS scheme. The dual goals of improving the network lifetime and increasing the energy utilization efficiency are simultaneously achieved.

In this paper, we focus on the fact that data aggregation rate is equal for every node. It would be interesting to study the algorithm extending to the case that nodes may have different data aggregation rates.

Footnotes

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China (61272150, 61379110, and 61472450), Ministry Education Foundation of China (20130162110079 and MCM20121031), the National High Technology Research and Development Program of China (863 Program) (2012AA010105), the National Basic Research Program of China (973 Program) (2014CB046305), the Hunan Province Education Science Project (XJK015CXX006), and Central South University of Forestry and Technology Youth Fund Project (QJ2011010B).

References

Kang

Y.-M.

Lim

Yoo

Kim

C.-K.

Design, analysis and implementation of energy-efficient broadcast MAC protocols for wireless sensor networks

KSII Transactions on Internet and Information Systems 2011 5 6 1113 1132

10.3837/tiis.2011.06.002

2-s2.0-84863031699

Cai

L. X.

Liu

Luan

T. H.

Shen

X. S.

Mark

J. W.

Vincent Poor

Sustainability analysis and resource management for wireless mesh networks with renewable energy supplies

IEEE Journal on Selected Areas in Communications 2014 32 2 345 355

10.1109/JSAC.2014.141214

2-s2.0-84893323100

Gui

J. S.

Zeng

Z. W.

Joint network lifetime and delay optimization for topology control in heterogeneous wireless multi-hop networks

Computer Communications 2015 59 24 36

10.1016/j.comcom.2015.01.005

Luan

T. H.

S. Y.

Asefi

Shen

X. M.

Quality of experience oriented video streaming in challenged wireless networks: analysis, protocol design and case study

IEEE ComSoc MMTC E-Letter 2012 7 3 9 12

Rajagopalan

Varshney

P. K.

Data-aggregation techniques in sensor networks: a survey

IEEE Communications Surveys & Tutorials 2006 8 4 48 63

10.1109/comst.2006.283821

2-s2.0-84874468531

Krishnamachari

Estrin

Wicker

The impact of data aggregation in wireless sensor networks

Proceedings of the 22nd International Conference on Distributed Computing Systems Workshops

2002

Vienna, Austria

575 578

10.1109/icdcsw.2002.1030829

Fahmy

Shroff

N. B.

On the construction of a maximum-lifetime data gathering tree in sensor networks: NP completeness and approximation algorithm

Proceedings of the 27th IEEE Communications Society Conference on Computer Communications (INFOCOM ′08)

April 2008

Phoenix, Ariz, USA

IEEE

1013 1021

10.1109/infocom.2007.80

2-s2.0-51349108846

Y. W.

X.-Y.

Liu

Y. H.

Lou

Energy-efficient wake-up scheduling for data collection and aggregation

IEEE Transactions on Parallel and Distributed Systems 2010 21 2 275 287

10.1109/tpds.2009.45

2-s2.0-75449119271

Wen

Y.-F.

Anderson

T. A. F.

Powers

D. M. W.

On energy-efficient aggregation routing and scheduling in IEEE 802.15.4-based wireless sensor networks

Wireless Communications and Mobile Computing 2014 14 2 232 253

10.1002/wcm.1249

2-s2.0-84892617415

10.

Garone

Casavola

Sinopoli

Stochastic sensor scheduling for energy constrained estimation in multi-hop wireless sensor networks

IEEE Transactions on Automatic Control 2011 56 10 2489 2495

10.1109/tac.2011.2164022

MR2884173

2-s2.0-80053639655

11.

X.-Y.

Wang

Complexity of data collection, aggregation, and selection for wireless sensor networks

IEEE Transactions on Computers 2011 60 3 386 399

10.1109/tc.2010.50

MR2767390

2-s2.0-79551553052

12.

Chen

Zhu

Jia

Minimum data aggregation time problem in wireless sensor networks

Mobile Ad-Hoc and Sensor Networks 2005 3794

Berlin, Germany

Springer

133 142 Lecture Notes in Computer Science

10.1007/11599463_14

13.

Huang

S. C.-H.

Wan

P.-J.

C. T.

Yao

Nearly constant approximation for data aggregation scheduling in wireless sensor networks

Proceedings of the 26th IEEE International Conference on Computer Communications (INFOCOM ′07)

May 2007

366 372

10.1109/infcom.2007.50

2-s2.0-34548324833

14.

Wan

P.-J.

Huang

S. C.-H.

Wang

Wan

Jia

Minimum-latency aggregation scheduling in multihop wireless networks

Proceedings of the 10th ACM International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc ′09)

May 2009

New Orleans, La, USA

ACM

185 193

10.1145/1530748.1530773

2-s2.0-70450177431

15.

Distributed data aggregation scheduling in wireless sensor networks

Proceedings of the 28th IEEE Conference on Computer Communications (INFOCOM ′09)

April 2009

Rio de Janeiro, Brazil

2159 2167

10.1109/infcom.2009.5062140

2-s2.0-70349658421

16.

X. Y.

Mao

Tang

Wang

A delay-efficient algorithm for data aggregation in multihop wireless sensor networks

IEEE Transactions on Parallel and Distributed Systems 2011 23 1 163 175

10.1109/tpds.2010.80

2-s2.0-78649821231

17.

Wang

Mao

Tang

An improved approximation algorithm for data aggregation in multi-hop wireless sensor networks

Proceedings of the 2nd ACM International Workshop on Foundations of Wireless Ad Hoc and Sensor Networking and Computing (FOWANC ′09)

May 2009

New Orleans, La, USA

47 56

10.1145/1540343.1540352

18.

Wang

Huang

Near optimal scheduling of data aggregation in wireless sensor networks

Ad Hoc Networks 2013 11 4 1287 1296

10.1016/j.adhoc.2011.01.003

2-s2.0-84877582704

19.

Zhang

H. T.

H. D.

X.-Y.

Tang

S. J.

In-network estimation with delay constraints in wireless sensor networks

IEEE Transactions on Parallel and Distributed Systems 2013 24 2 368 380

10.1109/tpds.2012.122

2-s2.0-84871763353

20.

Hariharan

Shroff

N. B.

Maximizing aggregated revenue in sensor networks under deadline constraints

Proceedings of the 48th IEEE Conference on Decision and Control and 28th Chinese Control Conference (CDC/CCC ′09)

December 2009

Shanghai, China

4846 4851

10.1109/CDC.2009.5399718

21.

Krishnamachari

Prasanna

V. K.

Energy-latency tradeoffs for data gathering in wireless sensor networks

Proceedings of the 23rd Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM ′04)

March 2004

244 255

10.1109/INFCOM.2004.1354498

22.

Becchetti

Korteweg

Marchetti-Spaccamela

Skutella

Stougie

Vitaletti

Latency constrained aggregation in sensor networks

Algorithms—ESA 2006 2006 4168

Springer

88 99 Lecture Notes in Computer Science

10.1007/11841036_11

MR2347134

23.

S. H.

Chen

J. M.

Shen

X. M.

Sun

Y. X.

Mobility and intruder prior information improving the barrier coverage of sparse sensor networks

IEEE Transactions on Mobile Computing 2014 13 6 1268 1282

10.1109/tmc.2013.129

2-s2.0-84902184986

24.

Jiang

L. S.

Liu

A. F.

Y. L.

Chen

Z. G.

Lifetime maximization through dynamic ring-based routing scheme for correlated data collecting in WSNs

Computers & Electrical Engineering 2015 41 1 191 215

10.1016/j.compeleceng.2014.04.001

2-s2.0-84899036051

25.

Liu

Y. X.

Liu

A. F.

Chen

Z. G.

Analysis and improvement of send-and-wait automatic repeat-request protocols for wireless sensor networks

Wireless Personal Communications 2015 81 3 923 959

10.1007/s11277-014-2164-6

2-s2.0-84910090807

26.

Kui

X. Y.

Wang

J. X.

Zhang

S. F.

Cao

J. N.

Energy balanced clustering data collection based on dominating set in wireless sensor networks

International Journal of Ad Hoc & Sensor Wireless Networks 2015 24 3-4 199 217

2-s2.0-84922967010

27.

S. B.

Chen

J. M.

Cheng

Sun

Y. X.

David

S. R.

EMD: energy-efficient P2P message dissemination in delay-tolerant wireless sensor and actor networks

IEEE Journal on Selected Area in Communications 2013 31 9 75 84

28.

Ekbatanifard

G. H.

Monsefi

Yaghmaee

M. H.

Hosseini

S. S. A.

Queen-MAC: a quorum-based energy-efficient medium access control protocol for wireless sensor networks

Computer Networks 2012 56 8 2221 2236

10.1016/j.comnet.2012.03.004

2-s2.0-84859827592

29.

Sinem

C. E.

Pravin

TDMA Scheduling Algorithms for Sensor Networks 2005

Berkeley, Calif, USA

Department of Electrical Engineering and Computer Sciences, University of California

MIDAS: A Data Aggregation Scheduling Scheme for Variable Aggregation Rate WSNs

Abstract

1. Introduction

2. Related Work

3. Research Motivation

4. The System Model and Problem Statement

4.1. Network Model

4.2. Energy Consumption Model

4.3. Data Aggregation Model

Definition 1 (the number of data packets ϖ).

Definition 2 (data aggregation rate γ).

Definition 3 (aggregation model).

4.4. Problem Statements

5. Scheme Design

5.1. Construct the Aggregation Set of Nodes

Definition 4 (aggregation set).

Algorithm 1: Constructing the aggregation set (CAS).

Algorithm 2: Search_fnode( j ; S k ; K ; N s e t ; m ).

Algorithm 3: Search_bnode( j ; S k ; K ; N s e t ; m ).

Algorithm 4: Time-slot allocation for nodes (TSAN).

Algorithm 5: Tsft_Set( j , N s e t , S _ S O R T k , G , n , T s f t , T s f t _ B J , D ).

Algorithm 6: Tsft_Forw ( j , N s e t , S e t _ B J , T s f t , T s f t _ F B J , D ).

5.2. The Design of MIDAS

6. Performance Evaluation

6.1. Node Scheduling Time-Slot Assignment

6.2. Energy Utilization Efficiency

6.3. Network Lifetime

7. Conclusion

Footnotes

Conflict of Interests

Acknowledgments

References

Algorithm 2: Search_fnode( $j; S_{k}; K; N s e t; m$ ).

Algorithm 3: Search_bnode( $j; S_{k}; K; N s e t; m$ ).

Algorithm 5: Tsft_Set( $j, N s e t, S_S O R T_{k}, G, n, T s f t, T s f t_B J, D$ ).

Algorithm 6: $Tsft_Forw (j, N s e t, S e t_B J, T s f t, T s f t_F B J, D$ ).