Abstract
Data aggregation algorithm aims to reduce the redundant information by gathering the sensed data, save energy, and prolong the lifetime of the network. However, the data aggregation technology will increase the network transmission delay of wireless sensor networks. Minimum-latency aggregation scheduling is designed to minimize the number of scheduled time slots to perform an aggregation. In this paper, we present an Adaptive Aggregation Scheduling Algorithm based on the Grid Partition (AASA-GP) in large-scale wireless sensor networks. By dividing the network into grids based on the geographical information, we allocate the channels according to the grid coordinates. Nodes with the same grid coordinates use the same channel and the adjacent grids use the different channels, so we can effectively avoid the wireless media transmission interference, increase the parallel transfer rate, and reduce the aggregation latency. Our extensive evaluation results demonstrate the superiority of the AASA-GP. For small-scale networks, the resultant latency is comparable with the best practice, and it is more suitable for large-scale wireless sensor networks.
1. Introduction
In multihop wireless sensor networks, a fundamental task is to gather data from all sensors to a distinguished sink node [1, 2]. It is already noted that adjacent sensor nodes monitoring an environmental feature typically register similar values [3]. This data redundancy of the spatial correlation among sensor observations inspires the research of in-network data aggregation. In general, each intermediate node aggregates its received data with its own record according to some aggregation functions (e.g., taking the maximum or minimum of them) into a single packet with fixed size. This type of application is called data aggregation, and its communication pattern is called convergecast [4]. The naive aggregation approaches which purely rely on medium-access-control layer mechanisms could result in latency that is too high to be practical due to the existence of mutual transmission interference [5, 6]. The goal of our study is to minimize the average data aggregation latency of the convergecast process, and a synchronized aggregation scheduling is necessary, where all transmissions proceed in synchronous time slots. Such an aggregation scheduling is designed under three conditions:
Each node transmits at most one packet with the fixed size in its allocated time slot. A node cannot transmit until all of its children complete the transmissions to itself. The assigned transmissions in the same time slot should be interference-free.
In this paper, the latency is measured by the number of time slots of the whole aggregation convergecast process, and our goal aims to minimize the latency.
2. Background
2.1. Transmission Interference Model
In wireless sensor networks, each node has a given transmission radius

Transmission interference model.
2.2. Time Scheduling on a Single Frequency Channel
An example network is shown in Figure 2(a), and the dash lines among nodes denote the communication neighborhood relationship, where node 0 is sink node. A

Illustrations of single frequency channel.
GGT [8] algorithm is designed to construct the spanning trees rooted at the sink, and the initial spanning tree contains only the sink node. In each round, all nonleaf nodes of the current spanning tree are the candidates of receivers, and all leaf nodes are the candidates of senders. As for the candidate senders, there are two rules to sort them in a selection sequence: (1) sort all nodes, based on the increasing order of the number of neighbors on the tree, and (2) sort nodes with the same order by the first rule, based on the increasing order of the number of neighbors out of the tree. The scheduling result is shown in Figure 2(c) and 7 time slots are required.
2.3. Time Scheduling on Multiple Frequency Channels
In the transmission interference model [9], there exist two constraints: (1) adjacency constraint is due to the half-duplex transceiver on each node which prevents it from simultaneous transmission and reception, as shown in Figure 1;
In Figure 3(a), there is a network with 6 sensor nodes and the solid lines represent the tree edges, and the dashed lines represent the interfering links. JFTSS [10] schedules a network starting from the link that has the largest number of packets (load) to be transmitted. When the load of the adjacent links is equal, such as in aggregated convergecast, the most constrained link is considered first, that is, the link for which the number of other links violating the interfering and adjacency constraints when scheduled simultaneously is the maximum. Figure 3(b) shows the aggregated tree, which is scheduled by JFTSS. In JFTSS, the link (2, sink) is firstly assigned with frequency

Illustrations of multiple frequency channels.
TMCP [11] partitions the network into multiple subtrees and minimizes the intratree interference by assigning different channels to the nodes residing on different branches starting from the top to the bottom of the tree. Figure 3(c) shows the same tree which is scheduled by TMCP to collect the aggregated data. Here, the nodes on the leftmost branch are assigned with frequency
At present, many tree-based topology control and routing algorithms are designed to aggregate and collect the sensing data; these are appropriate for the small-scale, short communication radius networks [12]. Multichannel communication is an efficient method to eliminate interference by enabling concurrent transmissions over different frequencies. But it is very difficult to assign channels to the tree network structure. Motivated by grid partition induction [13], we propose AASA-GP to schedule the aggregation process. In our algorithm, we firstly divide the network into grids based on the geography information and then allocate channels to the links based on grid coordinates. Nodes with the same grid coordinate using the same channel, adjacent grids using the other channels, which can effectively avoid the transmission interference thereby reduce the aggregate delay. To the best of our knowledge, it is the first time to use grid-based routing topology to solve aggregation latency.
The following lists our key findings and contributions:
Use the tree-based topology to route and solve aggregation latency. Allocate channel based on grid coordinates. Algorithm is appropriate for large-scale wireless sensor networks with the large communication range.
3. Protocol Description
3.1. Basic Idea
By dividing the network into grids and assigning different channels to adjacent grids, the wireless transmission medium interference constraint is avoided, and the data from other source nodes in the same grid can be collected and aggregated on the selected cluster head and then proceed to the sink.
3.2. Meshing
In our scheme, we randomly select N wireless sensor nodes to construct wireless sensor networks in

Network mesh.
Each node broadcasts its grid coordinates, and the nodes with the same grid coordinates will form a cluster, in which the highest-energy node serves as the cluster head and receives the data from other members in this grid and then aggregates the data into a fixed-size packet.
Due to the limitations of half-duplex mode, nodes with the same grid coordinates cannot communicate with the cluster head at the same time, but nodes with the different grid coordinates can communicate through multiple channels to avoid wireless media transmission interference and increase the parallel transmission.
3.3. Channel Assignment
We assign different channels to adjacent grids, and the scheme of the channel assignment of the network is shown in Figure 5, in which ch1 indicates channel 1. According to this allocation, we assign 9 different channels to the entire network so that nodes in the different grid can transmit data at the same time. For example, in Figure 5, red grid is allocated channel 9 and its channel number is different from the adjacent grids. In this way, when nodes in the red grid communicate with cluster head, it is interference-free with the adjacent 24 grids, in which red dashed line passes through. The total number of grids in network is

Illustrations of channel assignment.
At the same time, we can adjust the size of the grid (grid side length l) in order to guarantee nodes in red grid and in green grid to transmit data in parallel, so that nodes that belong to different grids can transmit data without interference. After the in-grid data collection, cluster head can forward the sensed data across the other grids to sink.
3.4. Routing between Grids
Routing across the grids mainly involves the communication between cluster heads, and our routing scheme can be analyzed by following two cases according to the location of the sink.
In Figure 6, the example network is divided into a

Illustrations of data forwarding between grids.
When sink is located in the center of the network, the route structure between grids is shown in Figure 6(b).
3.5. The Connectivity of the Network
Because of the limitation of communication capabilities of wireless sensor nodes, we assume that communication radius is

Cluster head selection area.
3.5.1. Connectivity within the Grid
As shown in Figure 7, we suppose node A is the cluster head; if any node in the grid could communicate with A, we should make the
3.5.2. Connectivity between the Grids
In order to guarantee the adjacent cluster heads can communicate with each other, the maximum distance between two cluster heads should be less than the node communication radius

Connectivity between the grids.
However, as shown in Figure 8(b), node G and node H use the same channel 1; if they want to transmit the data in parallel mode, the grid side length L should be satisfied as in the following inequality:
When L and R are required to satisfy (3) or (4), they must satisfy (2).
In summary, when sink lies at the center or edge of the network, if the network connectivity is to be ensured, R, L, and
When sink is in the corner of the network, R, L, and
In (6) and (7), node's communication radius
3.6. Network Topology of the Algorithm
According to the above algorithm description, we simulate a network in which the edge length is 200, the number of nodes is 800, the communication radius of node is 30, topology is shown in Figure 9, the red dots in each grid are cluster heads, the blue dot is sink, and the other dots are ordinary sensor nodes.

Network topology.
4. Simulation and Performance Analysis
4.1. Experiment Setup
We use C++ to simulate the following algorithms. Multichannel algorithms are JFTSS-channel: 2 (2 channels of JFTSS algorithm), JFTSS-channel: 16 (16 channels of JFTSS algorithm), TMCP-channel: 2 (2 channels of TMCP algorithm), TMCP-channel: 16 (16 channels of TMCP algorithm), our algorithm (9 channels). Single-channel algorithms are SDA and GGT. The routing architecture of our algorithm is based on grid, suitable for large-scale and large communication radius (
We randomly arrange N sensor nodes in a square area with the side length S; the average node density is
4.2. Comparison with Other Algorithms
In our simulation, we set the average node density as

Average degree of the network.
With the increase of node communication radius, the average degree of nodes also increases, so the network transmission interference also increases; this results in the increase of the aggregation delay.
Figure 11 shows the number of time slots needed when the number of nodes N varies from 50 to 1250 (i.e., S from 50 to 250), with the

Performance comparison with fixed node density (
In Figure 11(b), sink is located at the corner of the network; the grid side length of AASA-GP is
4.3. Simulation of Large-Scale Wireless Sensor Networks
We simulate large-scale wireless sensor networks. S varies from 100 to 1000, with

Performance for large-scale wireless sensor networks.
From Figure 12(b), we conclude that the variation trend of the aggregation network delay is similar to that shown in Figure 12(a), which indicates that AASA-GP can be applied to different network topologies.
5. Conclusions
This paper presents an adaptive aggregation scheduling algorithm based on the grid partition in large-scale wireless sensor networks (AASA-GP). By dividing the network into grids based on geographical information, when we assign the different channels to the adjacent grids, the wireless transmission interference can be avoided. By selecting the cluster head in each grid, the network load can be effectively balanced. Simulation results show that aggregation delay by AASA-GP is significantly less than that by the other algorithms. In wireless sensor networks, when the network scale and the node's communication radius are larger, the advantages of AASA-GP are more obvious.
Footnotes
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
This work is partially supported by the Project of National Natural Science Foundation of China under Grant nos. 71271165, 61373174 and 61572435, the Key Project of Natural Science Foundation of Shaanxi Province under Grant nos. 2015JZ002 and 2015JM6311, the Project of the Guangxi Key Laboratory of Trusted Software under Grant no. kx201416, the Project of the High Level Talents in Colleges of Guangdong Province (Guangdong Finance Education (2013) no. 246), and the Project of the Natural Science Foundation of Guangdong Province under Grant no. 2014A030307014.
