Abstract
The simple graph theory is commonly employed in wireless sensor networks topology control. An inherent problem of small-granularity algorithms is the high computing complexity and large solution space when managing large-scale WSNs. Computed transmission paths are of low fault tolerance because of unattended sensor nodes and frail wireless transmitting channels. This paper uses hyper-graph theory to solve these practical problems and proposes a spanning hyper-tree algorithm (SHTa) to compute the minimum transmitting power delivery paths set for WSNs convergecast. There are three main contributions of this paper: (1) we present a novel hyper-graph model to abstract large-scale and high connectivity WSNs into a robust hyper-tree infrastructure; (2) we present a precise mathematical derivation that solves the “hyper-tree existence” problem; (3) SHTa is proposed to compute the delivery paths set, which is the minimum power transmitting convergecast hyper-tree. Variable scale hyper-edges represented as computing units limit solution space and reduce computing complexity. Mutual backup delivery paths in one hyper-edge improve the capability of fault tolerance. With experiment results, SHTa computes short latency paths with low energy consumption, compared with previous algorithms. Furthermore, in dynamic experiments scenes, SHTa retains its robust transmitting quality and presents high fault tolerance.
1. Introduction
Self-organized wireless sensor networks can be used to cooperatively detect and perceive real objects. Sensors can communicate and exchange information among themselves without human intervention. This is achieved by integrating technologies, including sensors, embedded calculations, distributed information processing, and wireless communication. Wireless sensor networks have huge potential in civil and military applications, such as smart grid, smart home, healthcare monitoring, and intelligent transport.
Self-organized wireless sensor networks are made up of highly distributed systems of small-size, wireless unattended sensors. Each sensor is capable of detecting devices' current operating conditions, such as temperature, noise, vibration, or output signals. This data is preprocessed, transmitted, and exchanged in a machine-to-machine (M2M) network [1, 2]. There is a need for reliable, scalable, and smart protocols and algorithms for self-organized M2M networks or sensor networks.
In traditional communication networks, simple graph theory is always used [3, 4]. But a large-scale self-organized wireless sensor network consists of hundreds or thousands of nodes with a complex topology. Hence, a large number of the control massages are required to establish transmission paths. On the other hand, because of the low reliability of the sensor nodes and wireless communication links, many real-time control messages have to be used to maintain an established path. These tasks use significant amount of bandwidth and consume the extra energy.
To solve this problem, in this paper, we used the hyper-graph theory and proposed Spanning Hyper-Tree algorithm (SHTa) to create a concise and robust hyper-graph infrastructure for large-scale and high connectivity self-organized wireless networks‥ Based on the best of our knowledge, it is the first hyper-graph model for self-organized wireless networks architecture. Because a dynamic hyperedge is the minimum computing unit during routing in this type of hyper-graph architecture, fewer packets are used, which saves energy and prolongs the network's lifetime. More than one connected pairs in a hyperedge provides high bandwidth and low loss rate during transmission. This effectively improves network fault tolerance. Moreover, SHTa solves the “hyper-tree existence uncertainty problem,” which is a new problem that differs from the simple graph model. An axiom “any graph has its spanning tree” is invalidated in a hyper-graph, that is, not each hyper-graph exists spanning hyper-tree with loop-free. SHTa presents an effective spanning hyper-tree method and we proposed the strict mathematical proof to prove the certainty theorem.
The remainder of the paper is organized as follows. Section 2 introduces some background material on wireless communication network architecture and optimal routing problems. Wireless self-organized sensor networks' hyper-graph model is presented in Section 3. Section 4 describes the SHTa in detail, followed by validity proof. Section 5 proposes the computer simulation and evaluation; finally, Section 6 is the conclusion and outline of future research.
2. Convergecast with Data Aggregation in Wireless Sensor Networks
For peer-to-peer (P2P) communication model, Dijkstra and Bellman-Ford algorithms are often employed to build a shortest path tree (SPT), such as OSPF used in IP backbone networks. Each router with OSPF stores an SPT in which the root is itself. Packets are transmitted following SPT's branches to arrive the minimum cost.
Different from P2P, self-organized wireless sensor network collects data from each sensor nodes to “Sink,” called convergecast. During transmission, data aggregation is used to eliminate the redundancy in collection data. Many algorithms are presented to establish data aggregation tree, such as EADAT [5], E-Span [6], and HEED [7]. These algorithms set transmitting energy consumption as link weight and build SPT as an aggregation tree. Reference [8] presented the DCTC algorithm to detect and track a mobile target. DCTC used Dijkstra to establish collection tree, which is also an SPT.
Not every data packet will be transmitted from source to destination, due to data aggregation, in the intermediate nodes. A wireless sensor network is generalized as data center, and the optimum number of transmissions required per datum in the DC (Data Centre) is equal to the number of edges in the Minimum Steiner Tree (MST). Therefore, MST, not SPT, is the truly minimized sum cost tree in convergecast protocols with data aggregation.
Figure 1 shows an example to explain this optimization problem. Three nodes transmit information to Sink. SPT and MST are shown, respectively, in Figures 1(a) and 1(b). Table 1 presents the two routing configurations: the total cost of SPT method is 3.82, larger than MST. Therefore, MST is better than SPT. If the weight of edges is defined as energy consumption, MST is just the optimal energy consumption tree in the wireless sensor network.
Total sum cost with two transmitting methods.

(a) Routing based on SPT. (b) Routing based on MST.
3. Hypergraph Model for Wireless Self-Organized Sensor Networks
Whether for IP backbone network, cellar mobile network or Ad-hoc network, the simple graph theory is the main tool for research on architecture control [8–11] and counting routing protocols [3, 4, 12–14]. But in large-scale wireless self-organized sensor networks, the number of sensor nodes can be hundreds or thousands of times of that of backbone network or mobile network. Each node can connect with any one neighbour by omnidirectional antenna, which creates high node connectivity and complication in topology controlling. A simple graph algorithm with tiny granularity often has high computing complexity and uses a large amount of memory. On the other hand, in wireless sensor networks, a single transmitting path has a low fault tolerance level because of the low reliability of sensor nodes and wireless links. During data transmission, lots of control messages need to be transmitted frequently to maintain the connectivity of a delivery path, which may use lots of links' bandwidth and consume significant amount of energy.
To solve this problem, Hyper-graph theory is used as a novel mathematical tool to generalize high connectivity wireless self-organized networks into concise and robust hyper-graph infrastructure. As far as we know, it is the first hyper-graph architecture model in wireless sensor network. In the model, special nodes and connected edges among them are generalized as hyper-edges. With the growth of hyper-edges, as the minimum computing unit, fewer extra packets are used and the energy consumption is effectively reduced.
Proposition 1.
Let
We describe a wireless self-organized sensor network as a hyper-graph
In the hyper-graph model, we should also establish MST for optimal convegercast. But the binary relation of hyper-edge and vertices in hyper-graph is not the one-to-one mapping relation of vertices and edges as it is in a simple graph, which is more complex. Therefore, the axiom “any graph has its spanning tree” is invalid in hyper-graph, that is, not each hyper-graph exists spanning hyper-tree with loop-free. A hyper-graph example with no hyper-tree is shown in Figure 2. Two hyper-edges are split and Theorem 2 proposed hyper-tree does not exist, because of existing the loop (1-Hyperedge1-10-Hyperedge2-1) in the bipartite graph

Hyper-graph model.
Theorem 2.
Hyper-graph H is a hyper-tree, if and only if the bipartite graph
To ensure one hyper-graph certainly has hyper-tree, we proved two conditions must be satisfied:
if if condition one is satisfied and if
The precise mathematical proof is shown here. Firstly, if there is no hyper-cycle in
In Section 4, we presented a novel topology controlling algorithm to split hyper-edge, establish hyper-graph with satisfying the above two conditions, and span the minimum hyper-tree for minimum energy consumption convergecast.
4. Minimum Spanning Hyper-Tree Algorithm
In the implement of SHTa, a type of generalized synchronization mechanism with “synchronous round” was used. Firstly, we describe this synchronization mechanism.
Time synchronization is an important feature of distributed systems including wired and wireless communication systems. Many time synchronization schemes were designed including GPS [16] and Network Time Protocol (NTP) [17] used in IP networks applications. In M2M and sensor networks, time synchronization is also used frequently for various purposes including sensor data fusion, coordinated actuation, and power-efficient duty cycling: for example integrating a time series of proximity detections into a velocity estimate; measuring the time of flight of sound for localizing its source; distributing a beam forming array; suppressing redundant messages by recognizing that they describe duplicate detections of the same event by different sensors; or supporting energy efficient scheduling and power management. Now, many good time synchronization algorithms, such as Reference Broadcast (RBS [18]), TINY/MINI-SYNC [19], and Level Synchronization [20], are presented to provide time accuracy in wireless self-organized sensor networks.
Compared with accurate time slots synchronization, generalized synchronization mechanisms with “synchronous round” can save a large number of timescale check packets, which ensures the accurate time synchronisation, and reduce the complexity of designing communication protocols, therefore reducing the transmission energy consumption. In a generalized synchronous mechanism, each processor unit should complete two steps during one synchronous round: in the first step, the processor transmits event driven messages to its neighbor; in the second step, processor switches its current state with a state transition function, once it has received any valid messages.
When synchronous network is a deterministic system, a state transition function with the same valid input must achieve the same output in each time. Mapping the two steps onto a sensor node processor, the following two operations would be implemented.
Each main hyper-edge A minimal power chained hyper-edge
Two types of messages are employed in the above operations: (i) Op_HC: excite HC operation packet; (ii) R_mPCHe: request minimal power chain hyper-edge packet. The structures of the two types of packets are shown in Algorithm 1.
Packet Op_HC struct: typedef struct Op_HC_st // the struct of Op_HC packet { ULONG seq; // the serial number of the Op_HC packet UNIT new_mHe_ID; // new consolidation main hyper-edge ID UINT Og_mHe_ID_1; // original main hyper-edge ID UINT Og_mHe_ID_2; // original main hyper-edge ID UINT Og_cHe_ID; // chain hyper-edge ID between the above two main hyper-edges ULONG ttl; // the packet's lifetime }Op_HC; Packet R_mPCHe struct: typedef struct R_mPCHe_st // the struct of R_mPCHe packet { ULONG seq; // the serial number of the R_mPCHe packet ULONG Source_E_ID; // the R_mPCHe packet's source main hyper-edge ID UINT Source_ID; // the R_mPCHe packet's source node ID UINT Dest_ID; // the R_mPCHe packet's destination node ID ULONG Tran_Pw // transmission power in the chain Hyper-edge ULONG ttl; // the packet's lifetime }R_mPCHe;
Without loss of generality, we suppose SHTa implements at
(1) if ( (2) { if (R_mPCHe.ttl <= 0) (3) { ignore this R_mPCHe packet; (4) return UNSUCCESS; // can not recover the new delivery path (5) } (6) if (MPCHe.Source_E_ID == (7) { ignore this R_mPCHe; (8) R_mPCHe.ttl= R_mPCHe.ttl-1; (9) } (10) else// (11) if (12) R_mPCHe.Tran_Pw = Pw (13) if (Min_Pw(* (14) (15) }
As soon as two main hyper-edges
(1) if ( (2) if (3) (4) (5) return SUCCESS; // HC operation success and generate new main hyper-edge (6) }
Whenever SHTa cannot find any new chained hyper-edge
We rewrote the conclusion of the spanning hyper-tree from SHTa algorithm: In hyper-graph
Proof by Contradiction
Suppose that the conclusion is erroneous, that is, there is a hyper-tree T, which includes
Based on the definition,
5. Computer Simulation
This section evaluates the performance of the novel algorithm using simulation. Firstly, seven different sensor scenes are studied, in a
We first compute the maximum and average nodes' degree and the standard deviation of nodes' degree in seven different scenes to analyze the network density, shown in Table 2.
Maximum node degree, average nodes' degree, and the standard deviation of nodes' degree in different network size.
Figure 3 shows the average dissipated energy per packet as a function of networks size. DD and EADD have almost the same energy consumption and a half less than flooding. SHTa consumes less energy than DD and EADD. With the increase of the network size, SHTa can save 23.7% energy that of directed diffusion. It is because SHTa used hyper-edge as computing granularity, with consolidation operation, that hyper-edges become larger and less in the networks during the SHTa processing, which is completely different from the trivial nodes operation. Therefore it effectively reduces the number of overhead packets and reduces the size of the solution space, which results in the reduction of energy consumption and a prolonged network's lifetime.

Average dissipated energy in different network densities.
Figure 4 plots the average latency observed as a function of network size. Using the shortest path, DD and EADD algorithms have lower delays than flooding algorithm. Because more available energy can give nodes a faster response time, EADD just selects these vigorous nodes as relay stations and achieves lower delay than DD. Differing from the shortest edge path algorithm, SHTa uses hyper-edge, in which a set of identified nodes and edges composing multiple paths transmit information at the same time; therefore the lowest delays can be reached in four algorithms.

Average latency in different network densities.
Figure 5 presents another performance metric-loss packets rate. SHTa and flooding have lower value than DD and EADD. Compared with signal path in DD or EADD, multiple delivery paths in one hyper-edge in SHTa or duplication flooding in the Flooding algorithm improves the transmission reliability.

Packets loss rates in different densities of networks.
We also study the impact of dynamics in wireless sensor networks with 10%, 20%, and 30% random failure nodes. Figure 6 presented the average dissipated energy per packet as a function of network size. By increasing the failure percentage from 10% to 30%, both of DD and SHTa algorithms significantly consume more energy in transmitting per packet, the increase rate of DD is 33.40%, more than 28.48% of SHTa.

Average dissipated energy in dynamic experiments.
Figure 7 presents the average latency measurement. The results show that SHTa also provides the lower average delay for various fault percentages, that is, average 0.259 s when 30% of nodes fail. This is mainly because SHTa presents mutual backup delivery paths in one hyper-edge, which improves the capability of fault tolerance. In the final experiment, we evaluated the loss packets rate when the fault percentage of faulty nodes is increased. Figure 8 clearly shows that SHTa drops lower number of data packets compared with DD protocols, that is, 8.82% when 30% of nodes fail, and DD performs slightly worse, 17.12% for the same situation. All of results fairly present that SHTa is great robustness and can offers significant performance gain in networks with high fault percentage.

Average delay in dynamic experiments.

Packets loss rate in dynamic experiments.
6. Conclusion
To consistently provide reliable communication services for machine to machine applications, scalable and smart network architecture control algorithms are needed for wireless self-organized communication networks. This paper generalizes large-scale wireless self-organized sensor networks into concise and robust hyper-graph infrastructure and proposes an algorithm called SHTa to achieve minimum spanning hyper-tree. We proved algorithm's validity with mathematical deduction and computer simulation. Based on experimental results, the SHTa algorithm can save more energy and have lower latency and packets loss rates than previous algorithms, and the algorithm is more robust in the dynamic experiments. All of these results show that SHTa is an effective technique for wireless sensor networks and M2M applications.
Footnotes
Acknowledgments
This work was sponsored by the National Natural Science Foundation of China No.61172014 and no. 60702037; Natural Science Foundation of Tianjin no. 12JCZDJC21300 and 09JCYBJC00800; National Basic Research Program of China (973 Program) no. 2009CB219700. The authors thank their colleagues in the laboratory.
