Abstract
We analyze performance of famous cluster based routing protocols and identify the factors affecting energy consumption in wireless sensor networks (WSNs). From theoretical and experimental analysis, it is observed that communication distance and cluster node density are the major sources in the formation of energy and coverage holes. To overcome these deficiencies, we propose a new hybrid approach of static clustering and dynamic selection of cluster heads. We also conduct a comprehensive energy consumption analysis of our technique with selected existing ones. Simulation results show that the proposed technique is relatively better in terms of energy holes minimization and network lifetime prolongation.
1. Introduction
Due to limited battery power of sensor nodes, their energy consumption during network operation is of high concern. Definition of network lifetime varies from application to application. In some cases, network lifetime is considered till death of the first node, while in others it is considered till the death of all nodes [1]. Whatever the definition of network lifetime is, the main purpose is to prolong network lifetime till the required objectives are met. Sensors' battery is the only source of power to keep the network alive. Therefore, network energy must be conserved to fulfil network lifetime requirement of the application. In order to accomplish the energy conservation goal, it is important to find how energy is consumed during network operations. The following are the factors that may affect energy consumption in clustered routing protocols:
Distance between cluster head (CH) and base station (BS) is the major factor of energy consumption. If the CH is at far distance from BS, then it has to apply amplification energy to transmit gathered data to BS, consuming a lot of energy. Large size of the cluster is another factor of high energy consumption. Cluster with high number of nodes becomes the reason to quickly drain CH's energy. Optimum number of CHs selection is very important towards balanced energy consumption, because a large number of CHs formed are equivalent to direct communication of nodes to BS and a small number of CHs formed lead to large sized cluster(s). Both cases cause surplus energy consumption. Multihop communication of CHs also leads to high energy consumption as transmissions and receptions are relatively high in number.
Any of the above factors may lead to the creation of energy holes. Energy hole is actually a phenomenon of overloading a node or CH for communication, which causes early death of the node or CH. Creation of energy hole not only reduces the network lifetime but also affects network throughput. Apart from routing layer energy consumption, MAC layer contentions also contribute to energy consumption. However, in this research work, we limit our scope to routing layer energy consumption issues only.
This work is extended form of our previous work in [2]. In this work, our main contributions are as follows. First we identify how unbalanced energy consumption in the traditional clustering protocols becomes the reason for creation of energy holes; to overcome this problem we propose a new technique in which we divide the network field into fixed logical regions. Static clustering and dynamic CH selection technique is used in this approach. A comprehensive mathematical formulation is provided in this work. The simulation results show improved stability period and reduced chance of energy hole creation.
The rest of the paper is organised as follows: Section 2 describes and criticises the work done in literature close to our work; Section 3 elaborates the problem statement in detail; Section 4 describes our proposed scheme; in Section 5, we present energy consumption model of our proposed work; performance evaluation of our proposed scheme with existing scheme is provided in Section 6; and finally conclusion is presented in Section 7.
2. Related Work
LEACH [3] is an initial clustering protocol, introduced by Heinzelman et al. In this protocol, CHs are selected on the basis of random number; therefore, there are chances that the number of CHs formed may or may not be optimum. This causes unbalanced load distribution among the nodes; as a result energy holes are created in the network.
Application-Specific Low Power Routing (ASLPR) protocol [4] is introduced by Shokouhifar and Jalali. Authors provided an application-specific approach for maximizing application required parameter of the protocol such as stability period, network lifetime, and 50% nodes death time. They used genetic algorithm for maximization or minimization of the required parameter. However, performance enhancement of certain parameter is achieved at the cost of other parameters.
Amjad et al., in [5], introduced a new clustering technique, Distributed Regional Energy Efficient Multihop Routing Protocol based on Maximum Energy (DREEM-ME), in WSNs. Authors succeeded in balancing energy consumption of nodes to some extent by dividing the circular field into subfields. Nodes within the subfield suffer long distance communication when these are located at one corner of the subfield and CH is located at the other corner. In such situations, nodes cannot associate with neighbouring subfield's CH, which is at shorter distance.
LEACH-Centralized (LEACH-C) is an extension of LEACH, which is proposed by Heinzelman [6]. In LEACH-C, BS gathers information about each node's location and energy level. Selection of CH is also probabilistic like LEACH. The plus point of this algorithm is that BS makes sure that the node with less energy does not become CH. However, in large scale networks, nodes which are far away from BS are unable to send their energy status to BS.
Israr and Awan [7] introduced Multiclustering protocol. In this protocol, CHs are selected randomly. Inter cluster communication is done through multihoping which reduces energy consumption due to reduced communication distance. Finally, data reaches BS through a number of CHs. Multihop communication could be a source of energy hole creation, if load is not balanced. The beauty of multihop-LEACH is its guaranteed network connectivity.
Kiani et al. introduced Efficient Intelligent Energy Routing Protocol in wireless sensor networks [8]. In this protocol, cluster shapes are square and a learning machine is introduced for selection of CH. Q-learning [9] technique and a cost function are used for selection of CH. Q-learning is a type of reinforcement learning approach which is based on genetic algorithm. Cost function is based on residual energy of a node and distance of node from BS. Node with maximum Q-value and minimum cost function value is selected as CH. LEACH-Selective Cluster (LEACH-SC) [10] is another cluster based routing protocol introduced by Wang et al. In this protocol, CH selection is probabilistic. A center point between node and sink is calculated. Node nearest to center point is selected as CH. Due to probabilistic selection of CH, the number of CHs in the entire network fluctuates around the optimum number of CHs.
In [11], authors proposed energy consumption model and tried to find energy hole in cluster based routing protocols. Authors identified different areas within the network in which more energy is consumed.
Localization techniques presented in [12, 13] divide the network area into subareas such that the chances of coverage hole creation are minimized.
Cluster based Multipath Routing Protocol (CMRP) is introduced by Sharma and Jena in [14]. This technique operates in four phases: neighbor discovery and topology construction, CH selection and cluster formation, data dissemination, and reclustering and rerouting. In neighbour discovery phase, all nodes send their neighbour information to BS. BS then forms an adjacency matrix on the basis of which it selects CHs. In data dissemination phase, CH and nodes send data in their respective time slots; otherwise these remain in sleep mode to conserve energy. BS monitors nodes' residual energy; as any node's residual energy falls to a certain threshold reclustering and rerouting phases are triggered.
Huang et al., in [15], studied lifetime of cluster in multihop communication. They find out the effects of transmission distance and size on lifetime of cluster.
3. Problem Statement
In this section, we elaborate the drawbacks of clustering protocols, LEACH and ASLPR. These drawbacks motivated us to propose a new protocol.
Probabilistic selection of CHs and transmission distance are two major factors that increase energy consumption of a node. Probabilistic selection of CH does not ensure optimum number of CHs during network operation. When the number of CHs is not optimum, some CHs become overloaded, which consume relatively high network energy [16]. Furthermore, if the overloaded CH is far away from BS then it has to expend more energy to send data to BS. In the following discussion, we elaborate how these two factors affect energy consumption of a node.
(1) Probabilistic CH Selection Does Not Guarantee Nonoccurrence of the Energy Hole Problem during Network Operation. In this method, protocol sets a threshold value for each node to elect itself as CH and then compares this value with a random number. If the random number is less than or equal to the threshold value, then the node becomes CH; otherwise it does not become CH. Due to probabilistic nature of CHs' selection, the total number of CHs fluctuates around the optimum value.
Let N sensor nodes be deployed in a network filed with node density ρ. The number of nodes n in a cluster of radius R can be calculated via the following:
Case 1 (if
).
In this case, R of the cluster decreases and
Case 2 (if
).
In this case, R of the cluster increases and
(2) Distant CHs Consume More Energy. Figure 1 shows energy consumption of CHs at different distances from BS. In this experimental setup, we place BS at zero point of network field. As expected, result shows that CHs near BS deplete less energy as compared to the far away ones. However, the reason of higher energy consumption of CHs which are at shorter distance from BS is large cluster size. These CHs have to forward large amount of data.

Energy consumption of CHs.
(3) Random Deployment of Nodes Does Not Guarantee Avoidance of the Coverage Hole Problem. Random deployment of nodes may cause overlapping problem; that is, two nodes located in the same area sense same location, whereas some area may not be sensed [17]. This may create coverage hole problem.
In ASLPR and LEACH, CH selection process is probabilistic and CHs directly communicate with BS. Therefore, in these protocols, chances of creation of energy and coverage hole are maximum. In the next section, we propose an idea in which we overcome these deficiencies.
4. Proposed Scheme
Localization problem has been commonly addressed. In this problem, network area is logically divided into subareas [12, 13] that helps in controlling the coverage hole problem. Therefore, we also use divide and rule technique to resolve this problem. In this section, we describe formation of regions and their mathematical formulations, CHs' selection procedure, protocol operation, and communication schematic of our proposed scheme.
4.1. Formation of Regions
In traditional cluster formation techniques, CHs are selected on the basis of probability. Nodes then associate with each CH on the basis of received signal strength, thus forming a cluster. In our protocol, BS divides the entire network area into small logical regions. Each region represents a static cluster, which reduces the communication distance between node and CH, and CH and BS. The following two steps describe the formation of regions in detail.
In step one, network area is divided into n equal distant concentric squares. Value of n is taken as
Coordinates of the top right corner of
Coordinates of the bottom right corner of
Coordinates of top left corner of
Coordinates of bottom left corner of

Formation of regions.
If we have n number of concentric squares, then the coordinates of nth square
d is multiple of
From (8) we can establish a relation to find d for nth square as
4.2. CH Selection
In our protocol, CH selection process is centralized. In the setup phase, BS decides which node will be the CH in each cycle. BS knows the location of all node; therefore, on the basis of central reference point of each region, BS decides CH in each cycle. BS prepares this schedule in each cycle and broadcasts it to all nodes. Each node on reception of CH schedule associates itself with BS's nominated CH. BS disqualifies those nodes to become CH whose residual energy status is below threshold. Nodes inform BS about their residual energy status by embedding current energy status in the data packet while sending data to CH or BS. DR protocol considers multihop inter cluster communication. As we assume
Secondary level CH are selected as follows: (i) CHs in
In this scheme, nodes associate with CH in two different methods. In first method, nodes of the same region are associated with their own region's CH only. Therefore, this method of association is known as confined association of nodes with respective region's CHs. Second method of association is known as nonconfined association of nodes. In this method, a node can be associated with neighbouring region's CH on the basis of minimum distance. For example, if a node of NCR1 is nearer to NCR2's CH, then it is associated with NCR2's CH instead of NCR1's CH. Both these methods have their own advantages and disadvantages; therefore, in performance evaluation section, we performed experiment with these methods separately.
4.3. Protocol Operation
In setup phase, BS divides the network field into small regions. In each region, one CH is selected per cycle. CHs of
In steady state phase, each node sends data to CH in its allocated time slot. Primary level CHs send aggregated data to their respective secondary level CHs that forward the aggregated data to BS.
4.4. DR Schematic
Our proposed scheme is summarized with the help of schematic diagram shown in Figure 3. The diagram shows three types of communications: type 1, type 2, and type 3. Nodes that belong to

Schematic of DR.
In our technique, CH selection is centralized. Each CH receives a message from BS containing CH ID, next hop and previous hop CH IDs. BS uses FDMA scheme to communicate with CHs. In this scheme, dedicated frequency is allocated to each CH. Inter cluster communication is performed through TDMA based scheduling. Each node sends data to its respective CH within its allocated time slot. TDMA schedule also contains a time slot for previous hop CH. The flowchart in Figure 4 shows TDMA slot allocation process of nodes and CHs.

TDMA slot allocation.
Direct Sequence Spread Spectrum (DSSS) technique is used to avoid node to CH and CH to CH collision. During TDMA scheduling, CHs inform member nodes to send data according to allocated DSSS code. This type of coding is known as transmitter based coding [17].
5. Energy Consumption Model
In this section, we develop a mathematical model to calculate energy consumption in different regions of the network field. In literature, different radio models have been introduced as selection of simulation radio model varies from application to application. For example, [18] presents simulation radio model for park and university areas. However, we adopt basic radio model from [19] due to its typical usage. The energy consumption equations (10) and (11) are adopted from [19] which show per bit energy cost of transmission (
5.1. Energy Consumption in
In this subsection, we first calculate node density and then energy consumption in
Nodes of
5.2. Energy Consumption in CRs
From Figure 2, d is the reference distance for the formation of NCR. Therefore, area of CR is given by
5.3. Energy Consumption in
In
There are four NCRs and four CRs. The area of each NCR in
Transmission Energy. Consider
Receive Energy. Equation (21) calculates receive energy of CH as follows:
5.4. Energy Consumption in
In DR protocol, area of NCR of
5.5. Experimental Analysis of Theoretical Model
Our main focus is to reduce the communication distance between node and CH or/and between CH and BS. In this section, we conduct experiments to verify our theoretical model. Energy consumption of nodes mainly depends on transmission distance and packet size that is why nodes in different regions of the network die at different instants. In addition to these two reasons, energy consumption of CHs depends on the number of associated nodes.
(1) Effect of Distance on Energy Consumption of CH. In the first experiment, we randomly deploy 11 nodes in each region of the network field. Nodes are initialized with homogeneous energy of 0.5 joules. After execution of cluster setup phase, distance information is gathered as shown in Table 1. The result of this experiment is shown in Figure 5.
Distance between CH to CH and BS.

Impact of distance on energy consumption of CH.
Finding 1. Figure 5 shows the difference of energy consumption between
(2) Effect of Node Density on Energy Consumption of CH. In the second experiment, we conduct tests for different node densities. A number of nodes associated with a CH are taken as 10, 25, 50, and 75. The rest of the experimental settings is the same as that of experiment 1. In this experiment, our objective is to analyze the effect of node density on energy consumption. Results of this experiment are shown in Figure 6.

Impact of density on energy consumption of CH.

Average energy consumption.
Finding 2. In experimental results of Figure 6, increase in energy consumption of
6. Performance Evaluation
We evaluate our proposed DR scheme by comparing it with ASLPR, DREEM-ME, and LEACH in terms of stability period, network lifetime, throughput, average energy consumption, propagation delay, and scalability. In this section, association of nodes with CH as confined and nonconfined is experimented separately. The results of confined method of association are named as CDR and those of nonconfined association are named as NCDR. We assume a network area of
6.1. Average Energy Consumption
Energy consumption of all compared protocols and our technique is shown in Figure 7. From this experimental comparison it can be seen that average energy consumption of CDR's nodes is minimum. Reduction in communication distance and load sharing of CHs through multihop communication are the major causes of low energy consumption. Energy consumption of NCDR is slightly higher than CDR. The reason is that in NCDR nodes are free to associate with any of the neighbouring region's CHs; therefore, load on neighbouring region's CH may increase which increases energy consumption.
6.2. Stability Period and Network Lifetime
Stability period is the time when death of the first node occurs. This time is also known as first node death time (FDT). As we can see from Figure 8 and Table 2, FDT of LEACH occurs at 2196th second, whereas FDT of ASLPR and DREEM-ME occurs at 2793th second and 3336th second, respectively. However, FDT of CDR is at 4139th second which shows that CDR is more stable. Figure 9 shows that difference between FDT and all nodes death time (ADT) of CDR is large, whereas this difference is small in other compared techniques. Therefore, we can say that CDR's ADT is
FDT and ADT comparison.

Comparison: dead nodes.

Comparison: alive nodes.
6.3. Throughput
Throughput is measured in terms of successfully received packets at BS. However, the total number of packets sent is not necessarily equal to the number of received packets at BS. For simulations, we use random uniform model [20] for packet drop count. According to this model, a packet is dropped if link status is bad. In this simulation setup, we suppose that the probability of bad link is
Figures 10 and 12 show that, in CDR and NCDR, a more number of packets are received as compared to LEACH, ASLPR, and DREEM-ME. There are three reasons behind higher throughput of CDR and NCDR: (1) enhanced network lifetime, (2) enhanced stability period, and (3) improved connectivity. However, due to longer network lifetime, number of packets transmitted are higher; therefore packets' drop rate is also higher as depicted in Figure 11. Improvement of CDR over DREEM-ME, ASLPR, and LEACH in terms of packets sent, packets received, and packets dropped is shown in Table 3.
Comparison: packets sent, packets received, and packets dropped.

Comparison: packets sent to BS.

Comparison: packets dropped.

Comparison: throughput.
6.4. Propagation Delay
Propagation delay is the time taken by a packet to reach BS (Figure 13). In LEACH and ASLPR, CHs directly send data to BS without involving intermediate hops. Therefore, in these protocols, propagation delay is directly proportional to the distance covered by a packet to reach BS. Meanwhile, in CDR, NCDR, and DREEM-ME, CHs are using intermediate hop to send data to BS. Therefore, the start of the network propagation delay of all four protocols is almost the same. Propagation delays of CDR and DREEM-ME are minimum as compared to LEACH and ASLPR because former protocols also use intermediate hop to send data to BS. However, as the network operation progresses, propagation delay reduces gradually. This is because farthest nodes in CDR die earlier and then the nearest nodes.

Comparison: propagation delay.
6.5. Scalability Analysis
A large number of nodes are involved in WSN; therefore it is essential to measure the performance of network in terms of scalability. Scalability is the measurement of the performance of network when communication load increases or network grows larger. A routing protocol is said to be scalable if it maintains or degrades, in a minor way, its performance with the expanding communication load. To measure the performance of proposed and existing protocols discussed in this research work, FDT, ADT, throughput, and packets drop are selected as the performance parameters. In the experimental setup, we randomly deployed 100, 300, 500, 800, and 1000 nodes in the network field. Figure 14 shows experimental result of FDT. In this experimental result, performance of LEACH remains stable with minor fluctuation in FDT; however, its performance is not improved. Performance of ASLPR, CDR, and NCDR is also inconsistent. In ASLPR, for 100, 300, and 500 nodes, performance of the protocol shows improved trend. The reason behind improved performance is that with the increase in number of nodes overall energy of the network is increased; therefore plot is showing increasing trend. However, for 800 and 1000 nodes, performance is slightly decreased; this is because more than 500 nodes in ASLPR are creating interference effect due to which nodes are wasting energy in overhearing. In CDR, nodes experience more interference effect because ADT is degraded. The reason behind degrading effect is that nodes in the regions are equally deployed; therefore increase in number of nodes is creating equal interference effect which is collectively degrading overall FDT performance of CDR. Performance of DREEM-ME and NCDR also shows continuous degrading trend. There are two reasons; nodes in the clusters of DREEM-ME and NCDR are equally distributed; however, these are free to associate with neighbouring cluster's CH, which increases load on neighbouring CH and the other reason is interference.

First node death time.
Figure 15 depicts performance analysis of protocols under discussion in terms of ADT. Figure 15 shows that behaviour of CDR, NCDR, and DREEM-ME is stable. It shows that, with the passage of time as nodes die, interference effect is reduced and nodes' ADT time becomes the same for varying number of nodes. ADT of ASLPR and LEACH is fluctuating. In ASLPR, it is increased for 300, 500, and 800 nodes; however, it is decreased for 1000 nodes.

All nodes death time.
Figures 16 and 17 represent behavior of these protocols in terms of packet drop and throughput. In both cases, increasing trend of CDR, NCDR, and DREEM-ME is logarithmic. The reason is straightforward; ADT of CDR, NCDR, and DREEM-ME protocols is also higher than LEACH and ASLPR. However, packet drop and throughput analysis of LEACH and ASLPR show that increasing trend is linear. The reason is that their ADT is less than CDR and DREEM-ME.

Packet drop for different number of nodes.

Throughput for different number of nodes.
From the above experimental analysis of scalability, it is clear that all four protocols are scalable because the performance of all protocols is almost stable in terms of ADT, FDT, throughput, and packet drop. However, minor degradation is a trade-off between increased work load and expansion of the network.
7. Conclusion and Future Work
Unbalanced load on nodes leads to the creation/formation of energy holes, which may also lead to coverage hole(s). Energy holes cause traffic blockage in multihop transmissions. In this regard, we have studied existing cluster organization based routing techniques and identified the causes of energy hole(s) creation/formation. Subject to our contribution, we have proposed static clustering and dynamic CH selection mechanism. As per our proposition, division of the network area into rectangular and square-shaped regions guarantees network coverage and cluster load balancing. Simulation results showed that, due to balanced energy consumption, network lifetime of our proposed scheme is prolonged, 6% more than ASLPR, 57% more than DREEM-ME, and 67% more than DREEM-ME. Moreover, our protocol achieves 92%, 90%, and 20% more throughput than LEACH, DREEM-ME, and ASLPR, respectively.
The drawback of our approach is that nodes of one region are bounded to associate with respective region's CH only; however, it is possible that the neighbouring CH may be at less distance than its own CH. Therefore, in future, we are interested in improving these deficiencies keeping in view a balanced network load. Moreover, we are also interested in theoretical evaluation of our proposed scheme with the existing research work.
Footnotes
Notations
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
