Abstract
Clustering techniques can reduce energy consumption of nodes and increase the scalability of the network. However, in uniformly deployed clustering wireless sensor networks, the uneven distribution of communication loads often causes the energy-hole problem, which means that the energy of the nodes in the hole region will be used up sooner than the nodes in other regions. In order to solve the problem, this paper theoretically analyzes the energy consumption of nodes in different network areas, gives the expression of the optimal energy distribution of heterogeneous nodes, and designs an energy-efficient clustering routing protocol. The goal of our work is to propose an energy-heterogeneous clustering scheme (EHCS) that allows the initial energy of the sensor nodes to be varied with the distance to sink. The nearer nodes from sink have more energy. Simulation results show that the method can balance the energy consumption among sensor nodes, achieve an obvious improvement on the network lifetime, and effectively avoid the energy-hole problem.
1. Introduction
Wireless sensor networks are composed of a large number of sensor nodes with limited energy resources. In many applications, the sensor nodes are usually inaccessible to the user after being deployed, and thus, replacement of the energy resource is not feasible. Hence, energy efficiency becomes a key design issue in order to improve the life span of the entire network. In multihop wireless sensor networks characterized by many-to-one traffic pattern, the cluster heads close to sink have to forward data for cluster heads away from sink and thus will tend to die earlier, resulting in network being partitioned and network lifetime being shortened. This is called the energy-hole problem. Simulations results [1] have showed that the energy expended by a sensor node drops significantly as they move away from the sink, and the power expended by a node in the 6th corona is less than 10% of the energy expended by a node in the first corona.
In order to prolong the network lifetime, this paper proposes an energy-heterogeneous clustering scheme where sensor nodes with larger data forwarding loads are given more initial energy. EHCS is only suitable for periodic data gathering applications where in a certain time, the energy consumption of nodes in different network areas is determined and can be calculated theoretically.
The remainder of this paper is organized as follows. Section 2 introduces the literature review. Section 3 defines the network model. Section 4 analyzes the optimal initial energy of nodes. Section 5 describes the clustering algorithm. Section 6 evaluates the performance of the protocol via simulation experiments. Finally, Section 7 reaches the conclusions.
2. Related Work
In order to improve the utilization of network energy, many efficient algorithms have been presented. The first type of algorithms is to adjust transmission ranges of sensor nodes. Tran-Quang et al. [2] have formulated the transmission range adjustment optimization problem as a 0-1 multiple choice knapsack problem and presented a dynamic programming method which allows sensor nodes to dynamically set their individual transmission levels according to their residual energy. Song et al. [3] have proposed an improved corona model with levels for analyzing sensors with adjustable transmission ranges in a WSN with a circular multihop deployment. They have proved that searching optimal transmission ranges of sensors among all coronas is a multiobjective optimization problem (MOP) and have designed a centralized algorithm and a distributed algorithm for assigning the transmission ranges of sensors in each corona for different node distributions. Liu et al. [4] have presented the particle swarm optimization algorithm (PSO) to solve the routing problem of avoiding the energy hole. The algorithm redefines the particle of the PSO, the operation of particle, and the “flying” rules. Liu and Guo [5] have proposed an analytical model to characterize the energy-hole problem and have described an iterative process to determine the optimum value of the model parameters. Zeng et al. [6] have analyzed the characteristics of data distribution in WSN and have obtained some results including the distribution of energy consumptions, the lifetime of each node in different localities, and the data transferring delay. They also proposed energy-hole avoidance for WSN based on adjust transmission power.
The second type of algorithms is to use nonuniform node distribution. Wu et al. [7] have investigated the theoretical aspects of the nonuniform node distribution strategy to avoid the energy hole around the sink and have proved that in a circular sensor network with a uniform node distribution and constant data reporting, the unbalanced energy depletion among the nodes in the whole network is unavoidable. If the ratio between the node densities of the adjacent
The third type of algorithms is to use mobile sinks or mobile relays to prolong the lifetime of wireless sensor networks. Marta and Cardei [10] have proposed solution to use mobile sinks that change their location when the nearby sensors' energy becomes low. In this way, the sensors located near sinks change over time. In deciding a new location, a sink searches for zones with richer sensor energy. Luo and Hubaux [11] have proved that the best mobility strategy consists in following the periphery of the network (assume that the sensors are deployed within a circle). Gao et al. [12] have proposed a data collection scheme called maximum amount shortest path (MASP) to optimize the mapping between members and subsinks. MASP has been formulated as an integer linear programming problem which is solved by a genetic algorithm. A communication protocol has been designed to implement MASP, which is also applicable in sensor networks with low density and multiple sinks. Cheng et al. [13] have proposed the architecture of multiple mobile sinks sparse wireless sensor network and an opportunistic transmission scheduling algorithm.
The above literature discuss solutions on the problem of the hot spots in flat networks. In the hierarchical network, clustering techniques can increase the scalability of wireless sensor networks and enable the efficient utilization of the limited energy resources. Many efficient energy-aware clustering routing protocols are proposed, for example, LEACH [14], LEACH_C [15], HEED [16], EADEEG [17], and PEGASIS [18].
LEACH (low-energy adaptive clustering hierarchy) utilizes randomized rotation of local cluster-heads to evenly distribute the energy load among the sensors in the network. LEACH uses localized coordination to enable scalability and robustness for dynamic networks and incorporates data fusion into the routing protocol to reduce the amount of information that must be transmitted to sink.
While LEACH has advantage to using distributed cluster formation algorithm, each node makes autonomous decisions. Then, this protocol offers no guarantee about the placement and number of cluster-head nodes. LEACH-C (LEACH-Centralized) is a protocol that uses a centralized clustering algorithm to form the clusters. During the set-up phase, each node sends information about its current location and energy level to sink. Sink runs an optimization algorithm to determine the clusters for that round. Sink may produce better clusters than those formed using the distributed algorithm by dispersing the cluster-head nodes throughout the network.
HEED (hybrid energy-efficient distributed clustering) is a novel distributed clustering approach, which does not make any assumptions about the presence of infrastructure or about node capabilities, other than the availability of multiple power levels in sensor nodes. It periodically selects cluster heads according to a hybrid of the node residual energy and a secondary parameter, such as node proximity to its neighbors or node degree. HEED terminates in
EADEEG (an energy-aware data gathering protocol for wireless sensor networks) adopts a new clustering parameter for cluster head election, which can better handle the heterogeneous energy capacities. Furthermore, it also adopts a simple but efficient approach, namely, intracluster coverage to cope with the fractional area coverage problem. EADEEG achieves a good performance in terms of lifetime by minimizing energy consumption for communications and balancing the energy load among all nodes.
In order to reduce the number of nodes communicating directly with sink, PEGASIS (power-efficient gathering in sensor information systems) organizes all sensor nodes into a near-optimal chain according to nodes' location by greedy algorithm. Each node transmits data to the closest possible neighbor. The data is passed along the chain from one node to another node and is fused. Finally, one designated cluster head transmits the combined data to sink. All network nodes take turns as the cluster head to balance the energy consumption of nodes. It is better than LEACH by about 100–300% in terms of network lifetime for different network topologies, but it has the greater delay and needs overall location information to construct the chain.
The above clustering protocols use randomized rotation of cluster heads to balance energy dissipation among sensor nodes. They can achieve local energy balance, but cannot avoid the energy-hole problem. From a global view, the nodes close to sink still deplete the more energy than other nodes. In order to solve the problem, UCS [19], EECS [20], and EEUC [21] use different methods.
UCS (unequal clustering size) assumes that the network is heterogeneous and cluster head nodes have more energy and predetermined locations. It organizes the network into heterogeneous clusters, where some more powerful nodes take on the cluster head role to control network operation, it is important to ensure that energy dissipation of these cluster head nodes is balanced.
EECS (energy-efficient clustering scheme) elects cluster heads with more residual energy through local radio communication while achieving well cluster head distribution; furthermore, it introduces a novel method to balance the load among the cluster heads. EECS is fully distributed and more energy efficient. Simulation results show that EECS outperforms LEACH significantly with prolonging the network lifetime over 35%.
EEUC introduces an unequal clustering mechanism to balance the energy consumption among cluster heads. Cluster heads closer to sink have smaller sizes than those heads father away from sink, thus cluster heads closer to sink can preserve some energy for the purpose of intercluster data forwarding.
3. Network Model
A sensor network consisting of N sensor nodes randomly deployed within a circle field to continuously monitor the surrounding environment. The ith sensor node is denoted by All sensor nodes and sink are stationary after deployment. There is a base station (i.e., sink) which is located in the center of the sensing field and has unlimited energy. Sensor nodes are heterogeneous, and nodes in different rings have different initial energy. All nodes are location-unaware, that is, not equipped with GPS-capable antennae. The energy of nodes cannot be renewed. All nodes are assigned a unique identifier and have the function of data aggregation. The intracluster sensed information is highly correlated; thus, the cluster head can aggregate N data packets from its members into a single length-fixed packet. But intercluster data is not correlated and cannot be fused. Nodes can figure out the distance to the sender according to RSSI (received signal strength indication) and can use power control to tune the amount of transmission power by the transmission distance to receiver. Such as the Berkeley Mote, it has 100 transmission power levels.
The network structure model is illustrated in Figure 1.

Network structure model.
The whole network region is divided into multilayer rings, and each ring has the same width w. Assuming that the number of rings is k,
4. Energy Analysis
4.1. Energy Consumption Model
This paper uses energy consumption model in [15]. The energy dissipation of transmission depends on the distance between the transmitter and receiver. When the distance is relatively far, the multiple-path fading channel model (
The radio dissipation
To receive this message, the radio dissipates energy is
The energy expended for data merging is
4.2. Theory Analysis
Theorem 1.
Assume that the area of the ring
Proof.
The wireless sensor network is a network of high-density, and clusters need to cover all the nodes in the entire network. Then, the overlap between clusters exists, and only one cluster head node exists in any cluster. When the overlap among clusters is most, the number of the cluster heads is maximal. The figure of maximal cluster heads is showed in Figure 2.
In Figure 2, the cluster head A is a regular hexagon with the edge length
On the contrary, when the overlap among clusters is least, the number of the cluster heads is minimal. The figure of minimal cluster heads is showed in Figure 3.
In Figure 3, the cluster head A is a regular hexagon. So in the ring
Therefore, in the ring

Maximal cluster heads.

Minimal cluster heads.
Theorem 2.
Assume that the area of the ring
Proof.
The expected value of the squared distance from the members to the cluster head (assumed to be at the center of mass of the cluster) is given by
The cluster radius r can be computed by the following formula:
Therefore, the expected value of the squared distance from the cluster members to the cluster head is
Theorem 3.
Assume that the area of the ring
Proof.
The expected value of the distance between two neighboring cluster heads is given by
Therefore,
Theorem 4.
Assume that the area of ring
In ring when when where
Proof.
In ring For receiving data from cluster members, the energy dissipation of a cluster head in ring For merging data from cluster members, the energy dissipation of a cluster head in ring For sending one frame data, the energy dissipation of a cluster head in ring For collecting one frame data, the total energy dissipation of a cluster head in ring The energy dissipation of a cluster member in ring For collecting one frame data, in ring In ring For forwarding data from the outer cluster heads, the energy dissipation of nodes in ring In ring When When
5. Clustering Algorithm
After the network deployment, sink broadcasts a Sink_ADV message to start all nodes to work, and each node computes out the approximate distanceto sink by the received signal strength of the Sink_ADV message.
5.1. Cluster Setup
In EHCS, in order to reduce the energy cost for the cluster head competition, a small number of nodes are chosen as candidate cluster heads to compete for final cluster heads by the predefined probability. Each candidate cluster head broadcasts Hello message in radius
5.2. Formation of Intercluster Forwarding Tree
The cluster head broadcasts a TREE_ADV message which includes the node number ID, the residual energy of the node, and the distance to sink. After collecting the TREE_ADV messages from the adjacent cluster heads, each cluster head establishes the neighbor cluster head information table which includes the number ID of the adjacent cluster head, the residual energy E of the adjacent cluster head, the distance from the adjacent cluster head to sink, and the distance from it to the adjacent cluster head.
In order to setup the intercluster forwarding tree, each cluster head chooses a suitable neighbor cluster head as its parent node. For the cluster head i, if
5.3. Data Collection
The cluster members collect the data and send it to the cluster head. Then, the cluster head fuses the intracluster data and transmits the compressed data to its parent node along the forwarding tree. The parent node forwards the received data to sink. After collecting the predetermined number of frames, the network enters into the next round and reclustering.
6. Simulations Results
In order to evaluate the performance of the protocol, EHCS is compared with multihop HEED and LEACH-C via simulations. Simulation parameters are listed in Table 1.
Simulation parameters.
Figure 4 shows the node distribution in the three protocols.

Distribution of nodes.
Figure 5 shows the clustering structure in EHCS. “*” denotes the cluster heads, and “∙” denotes the cluster members. Each cluster head chooses a neighbor cluster head closer to sink as its relay node.

Clustering structure in EHCS.
Table 2 shows the death time of nodes in the three protocols and the improvement rate of EHCS over LEACH_C and multihop HEED. If the death time of the first node is used as a comparison standard, EHCS is better than LEACH_C by 287.5% and multihop HEED by 20.8%. So, EHCS can better balance the energy consumption of nodes and prolong the lifetime of the network.
Performance comparison among three protocols.
7. Conclusions
Clustering is one of the key technologies of wireless sensor networks and has a significant impact on the network performance, but in clustering networks, many-to-one traffic pattern easily leads to the energy-hole problem because of the unbalanced energy dissipation among sensor nodes. We design an energy-heterogeneous clustering scheme where nodes closer to sink have more initial energy to provide the enough energy for forwarding data from outer cluster heads. Simulation results show that the proposed protocol is better than LEACH_C by 287.5% and multihop HEED by 20.8%, if the death time of the first node is used as a comparison standard.
Footnotes
Acknowledgment
This work is supported by the National Natural Science Foundation of China (nos. 61272236, 61174177, and 41172298).
