Graph-Theoretic Based Connectivity Restoration Algorithms for Mobile Sensor Networks

Abstract

In mobile sensor networks (MSNs), actuated sensors collaborate with each other on predefined missions. This collaboration requires application-level coordination over a strongly connected underlying network, which is often organized in an infrastructure-free ad hoc manner. Such a topology brings flexibility but also vulnerability to the potential applications of MSNs; for example, connectivity is easily jeopardized if the failure of some critical sensors partitions the network into disjoint segments. In this paper, a critical sensor determination and substitution (CSDS) strategy is proposed to address the important problem of network partitions in MSNs caused by the failure of particular sensors. CSDS utilizes a graph-theoretic method to locally identify critical sensors with 2-hop neighboring information. Then, an efficient backup sensor selection algorithm assigns each critical sensor a backup that monitors it and, if necessary, substitutes for it in order to eliminate the partitions in the MSN. The main contribution of our proposed work is that CSDS requires the relocation of only one sensor in each partition elimination process, so that the impact on the primary missions of the MSN is minimized. Experimental simulations are conducted to evaluate the correctness and effectiveness of CSDS.

1. Introduction

The last decade has witnessed rapid development of mobile sensor networks (MSNs), driven by their enormous potential in a variety of scenarios such as terrain exploration, environmental monitoring, and remote measuring [1]. As an effective substitute for humans, an MSN often works in harsh environments where many potential risks, for example, mechanical malfunction or intentional sabotage, can be expected. The vulnerable topology of an ad hoc network based sensor system can easily be jeopardized by the failure of some sensors caused by these risks. In particular, if the failed node is a critical sensor, for example, a cut-vertex, the network will be partitioned into several disconnected segments. In this case, some sensors can no longer communicate with each other to deliver a timely response to critical missions, which may further cause fatal consequences to the applications [2]. Therefore, an efficient partition elimination method, also called connectivity restoration [3], is pivotal to guarantee the successful execution of certain missions for MSN.

To date, many connectivity restoration schemes have been proposed; please refer to [3] for a comprehensive review on this topic. When the MSN is deployed in a remote area where manually moving sensors is not feasible, a practical way to restore connectivity is to substitute the failed sensor by autonomously relocating healthy ones. Both centralized [4] and decentralized [5] methods have been designed to fulfill this objective. We argue that decentralized methods are the more natural choice due to the distributed and autonomous character of most MSNs. Typically, the restoration process involves a series of actions, including cut-vertex determination, backup sensor selection, and sensor relocation [6]. Contemporary schemes often focus on connectivity restoration with multiple sensors, while the sensors' current tasks are often neglected. When relocating sensors, most connectivity restoration methods simply adopt direct movement, that is, the backup sensor moves directly towards the failed sensor [7], while other methods use a potential function based controller to navigate the movements of sensors [8]. These methods, however, assume that all sensors in the network are free to be relocated for connectivity restoration.

Whether by cascaded movements or by potential force driven movements, it is often required to move several sensors just to recover from a single failure. We argue that a sensor resting at a spot may be there for a reason; for example, it may be occupied by a particular mission, and it would not be feasible to move it just to reconnect the network. Therefore, minimizing the number of sensors moved in connectivity restoration is important for task-oriented applications of MSN. To address this problem, a critical sensor determination and substitution (CSDS) strategy is proposed to eliminate partitions in MSNs caused by the failure of a critical sensor. First, a graph-theoretic method is designed to locally identify critical sensors with 2-hop neighboring information by computing the Laplacian matrix of a local spanning subgraph. Then, an efficient backup sensor selection algorithm assigns each critical sensor a backup that monitors it and, if necessary, substitutes for it in order to eliminate the partitions in the MSN. A preliminary version of this work was presented in [9].

The main contribution of our proposed work is to guarantee all partitions caused by the failure of a critical sensor are eliminated by moving only ONE backup sensor, as long as a noncritical sensor exists. Therefore, the negative impacts on the primary missions of the MSN are minimized. Theoretical analysis and experimental simulation studies are provided to verify the correctness and evaluate the performance of the proposed CSDS.

The outline of this paper is as follows. Section 2 presents the related work and motivations. The problem is then formulated in Section 3. In Section 4, we present the cut-vertices determination and backup selection algorithms in detail and provide the analytical results of CSDS. The results and discussions for CSDS simulation are provided in Section 5. Finally, Section 6 concludes the paper.

2. Related Work

2.1. Graph and Network Optimization

Connectivity restoration belongs to the basic problem of network optimization and resilience, which is rooted in graph theory and its application in networks [10]. The basic concept of connectivity restoration is to find the optimal backup node and/or optimal backup location in which the backup replaces the failed one in order to achieve the objective of restoring connectivity. Meanwhile, some secondary objective, for example, minimizing the impact on topology, or reducing nodes' degree, may be pursued. Such a problem is similar to the problem of service facility location, that is, locating a central facility in a network so as to minimize the sum of distances from the sources to it; the distance reflects the associated flow volume and/or cost of the paths [11]. Service facility location can be further classified as two main problems, that is, “p-median problem” and “p-center problem,” according to different objectives. In [12], a distributed algorithm is proposed to locally compute the near-optimal p-medians in linear time, which is proved to be suitable for online calculation in dynamic and large-scale networked systems. Other objectives may be also pursued in service facility location problem. In [13], the multicovering emergency service facility location problem is investigated, and a mathematical programming model is proposed to minimize the sum of expected disaster losses and the total costs of emergency service facilities. Deploying/choosing service facility within the network may naturally cluster the network into several subgraphs, with the facility being the cluster head to provide various services to the subnet. An adaptive clustering approach is proposed for autonomic systems in [14], and a unique feature of self-healing is achieved by recalculating near-optimal cluster head in case of component failures or congestion, for example, failure of links. 
Both service facility location and self-healing clustering problem can be applied to the problem of network resilience with certain variation. However, for this work, we focus on the fast restoration of the network connectivity rather than network optimization, and the failure of network is caused merely by the failure of node.

2.2. Direct Movement for Connectivity Restoration

The distributed connectivity restoration with respect to the failure of a single cut-vertex has been extensively investigated [4–8, 15–19]. Both direct movement and potential force based movement have been implemented. Direct movement can be further divided into block movement [2] and cascaded movement [15] methods according to their different patterns. In [2], several algorithms were proposed for achieving a 2-connectivity fault-tolerant configuration in multirobot networks by moving a subset (block) of mobile robots. Block movement can maintain the topology within the subset but requires a significantly large movement distance. To overcome this drawback and reduce the number of sensors moved, the cascaded movement method, which only moves the set of nodes necessary for restoration, is employed. In [7], Abbasi et al. presented a Distributed Actor Recovery Algorithm (DARA) to address 1- and 2-connectivity restoration in wireless sensor and actor networks (WSAN) with 2-hop neighboring information. DARA selects the backup candidate according to its node degree and distance, and then the selected nodes move to substitute the failed actor and their parental nodes in sequence. A more effective cascaded movement method was introduced in [15], which also added an effective mechanism to determine the cut-vertices of the network proactively by identifying a Connected Dominating Set (CDS) using Dai's approach [16] with 2-hop neighboring information. By selecting the dominatee and best available dominator node, the number of sensors moved in the restoration is further reduced. Considering the impact of topology change on network performance, the Least-Disruptive Topology Repair (LeDiR) algorithm was recently reported in [17]. Unlike other methods, LeDiR selects a candidate node for restoration based on a partial view of the network topology obtained via a routing table.
Recently, a localized hybrid timer based cut-vertex node failure recovery approach was proposed in [18] to ensure the timely restoration of MSN by allowing multiple failure handlers; cascaded movement is also adopted to relocate the associated sensors. In [19], with 1-hop neighboring information, the distributed partitioning Detection and Connectivity Restoration (DCR) was designed to identify the criticality of sensors and select the backup node based on the principle of “Guardian nomination.” The movements of sensors also follow the cascaded movement.

2.3. Potential Force Based Movement

Recently, potential force based methods [20–25] have been widely exploited and have proved to be very effective for redeploying mobile sensors and robots to achieve primary goals, for example, global connectivity, coverage, or fault-tolerance. These methods are attractive because they create a reactive behavior that provably avoids collisions with neighboring agents during reconfiguration. In [20], Zavlanos and Pappas proposed a hybrid control system that consists of a market based control strategy and a potential force based motion controller. The control system utilized potential force to achieve the main objective of maintaining global connectivity. The connectivity control system can reduce redundant communication links while preserving connectivity based on local estimation of a spanning subgraph. This problem is also addressed by a centralized potential force based controller in [21]. In [22], Mi et al. proposed a self-organization technique known as the Distributed Link Removal Algorithm (DLRA) that aims to maintain connectivity in mobile sensor networks while reducing redundant links, limiting sensor node actuation, and avoiding collisions. The redeployment of the mobile sensors is also driven by potential force based functions. However, the main drawback of these strategies is the well-known local minima problem [26]: interacting with the primary task of the mobile sensors (e.g., moving to a destination, creating a formation), potential forces can create undesired asymptotically stable configurations that prevent the sensors from reaching the desired configuration.

Another solution to navigate the mobile sensors is to use a distributed potential function based motion controller. A k-hop neighboring information based connectivity restoration framework, named HERO, was presented in [8], and the proposed method includes a potential function to drive the backup sensor to its destination while avoiding intersensor collisions. Due to the nature of the potential based controller, a relatively large number of sensors are employed in the connectivity restoration. The major problem of the contemporary schemes is the participation of multiple sensors in restoration process, which may not be possible in certain missions. This paper presents a novel solution to restore connectivity by moving only one backup sensor.

3. Preliminaries and Problem Formulation

3.1. System Models

For an initially connected sensor network with M actuated mobile sensors, we encode the intersensor network as an undirected graph $G = (V, E)$ , where $V = \{1, \dots, M\}$ denotes the set of vertices indexed by the mobile sensors $i \in \{1, \dots, M\}$ , and $E = \{e_{i j}, (i, j) \in V \times V\}$ denotes the set of unordered pairs specifying that a bidirectional communication link exists between the respective sensors. Let $d_{i j} = ‖ξ_{i} - ξ_{j}‖$ be the Euclidean distance between sensors i and j, where $ξ_{i} \in R^{2}$ represents the location of sensor i. Then, a link $e_{i j}$ exists, that is, $e_{i j} \in E$ , if and only if $d_{i j} \leq R$ , where R is the transmission range of the sensors. The network has a homogeneous configuration; that is, every sensor in the network is equipped with the same computing devices/functions and has the same communication range. The aforementioned network model gives rise to the following definitions.

Definition 1 (connectivity).

An undirected graph $G$ is connected if and only if there is at least one path between any two vertices.

Definition 2 (1-hop neighbors).

Define j as the 1-hop neighbor of i; that is, $j \in N_{i}$ , if and only if $e_{i j} \in E$ , where $N_{i}$ is the 1-hop neighboring set of i.

Definition 3 (2-hop neighbors).

Define k as the 2-hop neighbor of i; that is, $k \in N_{i}^{2}$ , where $N_{i}^{2}$ is the 2-hop neighboring set of i, if and only if there exists at least one route between i and k in $G$ with 2 hops.
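As an illustration of the system model and Definitions 2 and 3, the following minimal Python sketch builds the 1-hop sets $N_{i}$ from sensor positions and derives $N_{i}^{2}$ from them. The function names `build_links` and `two_hop` and the sample coordinates are hypothetical, and the sketch assumes that $N_{i}^{2}$ excludes sensor i itself and its 1-hop neighbors, which the definition above does not state explicitly.

```python
import math


def build_links(positions, R):
    """Return {i: set of 1-hop neighbors} for all sensors within range R."""
    ids = list(positions)
    neighbors = {i: set() for i in ids}
    for a in range(len(ids)):
        for b in range(a + 1, len(ids)):
            i, j = ids[a], ids[b]
            # A link e_ij exists if and only if d_ij <= R.
            if math.dist(positions[i], positions[j]) <= R:
                neighbors[i].add(j)
                neighbors[j].add(i)
    return neighbors


def two_hop(neighbors, i):
    """Sensors reachable from i over a 2-hop route (assumed to exclude i and N_i)."""
    reach = set()
    for j in neighbors[i]:
        reach |= neighbors[j]
    return reach - neighbors[i] - {i}


# Hypothetical layout: four sensors on a line, 10 m apart, with R = 10 m.
pos = {1: (0.0, 0.0), 2: (10.0, 0.0), 3: (20.0, 0.0), 4: (30.0, 0.0)}
nb = build_links(pos, 10.0)
print(nb[2], two_hop(nb, 2))
```

In this layout sensor 2 has the 1-hop neighbors 1 and 3, and sensor 4 is its only 2-hop neighbor.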

3.2. Algebraic Definitions of Connectivity

Denote $w_{i j}$ as the symmetric weight of the communication links in $G$ , and $w_{i j}$ satisfies

\begin{matrix} w_{i j} = \begin{cases} 1, & d_{i j} \leq R \\ 0, & \text{otherwise} . \end{cases} \end{matrix}

(1)

We further define the adjacency matrix $A (G) \in R^{M \times M}$ of the graph $G$ with entries as

\begin{matrix} {[A (G)]}_{i j} = {[A (G)]}_{j i} = w_{i j} . \end{matrix}

(2)

The powers of the adjacency matrix of a graph are closely related to network connectivity, which can be captured with the Laplacian matrix $L (G) \in R^{M \times M}$ of graph $G$ . The Laplacian matrix can be defined as follows:

\begin{matrix} {[L (G)]}_{i j} = \begin{cases} - w_{i j}, & \text{if } i \neq j \\ \sum_{s \neq i} w_{i s}, & \text{if } i = j . \end{cases} \end{matrix}

(3)

Furthermore, let $D (G) = \mathrm{diag} (\sum_{j = 1}^{M} w_{i j})$ represent the diagonal matrix of degrees of graph $G$ ; then $L (G)$ can be written as

\begin{matrix} L (G) = D (G) - A (G) . \end{matrix}

(4)

It is widely acknowledged that $L (G)$ with symmetric weights is always a symmetric positive-semidefinite matrix, which is closely related to the connectivity of the graph. The algebraic connectivity of the graph $G$ can be determined by the following theorem.

Theorem 4 (see [27]).

Let $0 \leq λ_{1} (G) \leq λ_{2} (G) \leq \dots \leq λ_{M} (G)$ be the ordered eigenvalues of the Laplacian matrix $L (G)$ . Then, $λ_{1} (G) = 0$ with corresponding eigenvector 1, that is, the $M \times 1$ vector with all entries equal to 1. Moreover, $λ_{2} (G) > 0$ if and only if the graph $G$ is connected.

The second smallest eigenvalue $λ_{2} (G)$ of the Laplacian matrix is also known as the algebraic connectivity or Fiedler value of the graph. Generally, a higher value of $λ_{2} (G)$ indicates stronger connectivity of the respective network.
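To make Theorem 4 concrete, the sketch below computes $L (G) = D (G) - A (G)$ and its second smallest eigenvalue with NumPy. It is only an illustration: the function names, the numerical tolerance `tol`, and the two three-node example graphs are assumptions introduced here, and the sketch assumes a graph with at least two vertices.

```python
import numpy as np


def laplacian(adj):
    """L(G) = D(G) - A(G) for a symmetric 0/1 adjacency matrix."""
    adj = np.asarray(adj, dtype=float)
    return np.diag(adj.sum(axis=1)) - adj


def fiedler_value(adj):
    """Second smallest eigenvalue of L(G), i.e. the algebraic connectivity."""
    eigenvalues = np.linalg.eigvalsh(laplacian(adj))  # returned in ascending order
    return float(eigenvalues[1])


def is_connected(adj, tol=1e-9):
    """Theorem 4: G is connected iff lambda_2(G) > 0 (up to rounding error)."""
    return fiedler_value(adj) > tol


# A 3-node path (connected) and a 3-node graph with one isolated node.
path = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
split = [[0, 1, 0], [1, 0, 0], [0, 0, 0]]
print(is_connected(path), is_connected(split))
```

The tolerance is needed because a zero eigenvalue is generally returned as a tiny positive or negative floating-point number.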

3.3. Main Objective

Based on the aforementioned background, the main objectives of this work can be described as follows.

Assume that the network has a nonuniform deployment topology with randomly deployed sensors, and that each sensor knows its own location and periodically exchanges status information with its 1-hop neighbors, thereby gathering information about its 1- and 2-hop neighbors.

Main Objectives. The main objectives are as follows: (1) locally determining the critical sensors of the network based on 2-hop neighboring information; (2) selecting the best available backup sensor for each critical sensor; and (3) upon the failure of a critical sensor, restoring the connectivity of the network by relocating ONLY the backup sensor, while avoiding further partitions of the network.

4. Restoring Connectivity with Backup Sensor

The principle of our design is to proactively assign noncritical backup sensors to the critical sensors. Compared to the reactive methods, proactive selection of backups is beneficial for the timely execution of the restoration process, because the backup sensor can start the restoration immediately after the failure of a critical sensor. In this section, a critical sensor determination and substitution (CSDS) algorithm is proposed to determine the criticality of the sensors and select the best available backups with the aid of only 1- and 2-hop local neighboring information.

4.1. Critical Sensor Determination

Cut-vertices in a graph can be accurately determined in a centralized manner if network-wide information is provided, but they cannot be accurately determined with only local information. Nevertheless, we can determine noncritical sensors with confidence and mark all others as critical; that is, if a sensor is marked as noncritical, it is definitely not a cut-vertex, whereas if a sensor is marked as critical, it may still be a global non-cut-vertex in $G$ .

Step 1 (initialization).

Sensors collect their neighbors' information by periodically exchanging heartbeat messages with their 1-hop neighbors. The heartbeat message must contain the sensor's ID, geographic position, and one-hop neighbor set $N_{i}$ . Each sensor can build complete $N_{i}$ and $N_{i}^{2}$ after successfully collecting this information.

Step 2 (determination).

Based on $N_{i}$ and $N_{i}^{2}$ , sensor i can determine whether it is a noncritical sensor in the graph. First, let $G_{i}$ be the spanning subgraph of $G$ that contains all of the sensors in $N_{i}$ and $N_{i}^{2}$ but excludes sensor i itself. The Laplacian matrix of $G_{i}$ is given by $L (G_{i}) = D (G_{i}) - A (G_{i})$ . Then, by computing the algebraic connectivity $λ_{2} (G_{i})$ , sensor i can determine whether its failure would disconnect the subgraph that consists of its 1- and 2-hop neighbors. If $λ_{2} (G_{i}) > 0$ , sensor i marks itself as a noncritical sensor; otherwise, it marks itself as a critical sensor. An example is shown in Figure 1. Figure 1(a) is the initially connected MSN with 13 mobile sensors, and the graph-theoretic method is used to determine the critical sensors. With the spanning subgraphs of $G$ , it is determined that (1) $G_{1}$ is connected and sensor 1 is accurately determined as a noncritical sensor, as in Figure 1(b); (2) $G_{2}$ is disconnected and sensor 2 is accurately determined as a critical sensor, as in Figure 1(c); and (3) $G_{3}$ is disconnected and sensor 3 is falsely determined as a critical sensor although it is not actually a global critical sensor, as can be seen in Figure 1(a). The false determination is caused by the unavailability of the entire network topology when a sensor is restricted to local information. It is also worth noting that a similar idea of determining the most critical node in a subgraph by locally computing $λ_{2} (G)$ within n hops has been proposed in [28]. Our method differs from it in that CSDS computes $λ_{2}$ of a subgraph that excludes the node itself. Therefore, the node can clearly identify whether its absence disconnects the subgraph, so that no critical node will be missed.

Figure 1

Example of the initial graph in (a), spanning subgraph in (b), spanning subgraph in (c), and spanning subgraph in (d).

Step 3 (notification).

If a sensor i determines itself as a critical sensor, it will inform all of its one-hop neighbors; that is, $j \in N_{i}$ , by sending a CriSen $(i)$ message to them. Any noncritical sensor j that receives this message and is not currently occupied by another task will send a backup application message $B A (j)$ back to notify i that it is available to be chosen as a backup. The value of $B A (j)$ is equal to the Euclidean distance between i and j; that is, $Φ [B A (j)] = d_{i j}$ . The pseudocode for critical sensor determination is shown in Pseudocode 1, and the sequence diagram of the algorithm is shown in Figure 2.

Pseudocode 1: Pseudocode of the critical sensor determination algorithm.

Step 1. Initialization

Receiving heartbeat messages from one-hop neighbor and collecting 2-hop neighbor information.

Constructing $N_{i}$ and $N_{i}^{2}$

Step 2. Determination

(1) For $i \in V$

(2) if $N_{i}$ and $N_{i}^{2}$ are completely collected

(3) SpanningGraph( $G$ ) = $G_{i}$

(4) Return $L (G_{i}) = D (G_{i}) - A (G_{i})$

(5) else go to (1)

(6) EigenValue( $G_{i}$ ) = $λ_{2} (G_{i})$

(7) if $λ_{2} (G_{i}) > 0$

(8) Critical(i) = false

(9) else Critical(i) = true

(10) end if

(11) end if

Step 3. Notification

(12) For $i \in V$

(13) if Critical(i) = true

(14) Send CriSen(i) to every $j \in N_{i}$

(15) end if

(16) for j received CriSen $(i)$

(17) if Critical(j) = false and Occupied(j) = false

(18) Send $B A (j)$ to i

(19) end if

Figure 2

Sequence diagram of the critical sensor determination algorithm.
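The determination step can be sketched in a few lines of Python. By Theorem 4, $λ_{2} (G_{i}) > 0$ holds exactly when $G_{i}$ is connected, so this sketch swaps the eigenvalue computation for a breadth-first search, which gives the same verdict without a linear-algebra dependency. The function names and both example topologies are hypothetical. The sketch further assumes that sensor i only knows the links reported in the heartbeats of its 1-hop neighbors, so a link between two 2-hop neighbors is treated as unknown.

```python
from collections import deque


def local_view(neighbors, i):
    """Vertices of G_i: all 1- and 2-hop neighbors of i, with i itself excluded."""
    nodes = set(neighbors[i])
    for j in neighbors[i]:
        nodes |= neighbors[j]
    nodes.discard(i)
    return nodes


def is_critical(neighbors, i):
    """Mark i as critical when G_i is disconnected, i.e. lambda_2(G_i) = 0."""
    one_hop = neighbors[i]
    nodes = local_view(neighbors, i)
    if len(nodes) <= 1:
        return False  # removing i cannot separate anything
    start = next(iter(nodes))
    seen, queue = {start}, deque([start])
    while queue:
        u = queue.popleft()
        for v in neighbors[u]:
            # Assumption: a link is known to i only if a 1-hop neighbor reported it.
            known = u in one_hop or v in one_hop
            if known and v in nodes and v not in seen:
                seen.add(v)
                queue.append(v)
    return seen != nodes


# Hypothetical 5-sensor network: a triangle 1-2-3 with a tail 3-4-5.
nb = {1: {2, 3}, 2: {1, 3}, 3: {1, 2, 4}, 4: {3, 5}, 5: {4}}
# Hypothetical 6-sensor ring: no global cut-vertex exists.
ring = {1: {6, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3, 5}, 5: {4, 6}, 6: {5, 1}}
print(sorted(i for i in nb if is_critical(nb, i)))
print(all(is_critical(ring, k) for k in ring))
```

In the first network only the true cut-vertices 3 and 4 are marked as critical. In the ring every sensor is marked as critical although none is a global cut-vertex, which illustrates the conservative false determinations discussed above.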

4.2. Backup Sensor Selection and Connectivity Restoration

Backup selection of best available sensors is based on the criticality, hop-count, and Euclidean distances of their neighbors. The following criteria should be met when a critical sensor chooses its backup.

Case 1.

If a critical sensor i receives at least one $B A ()$ , which indicates that there is at least one 1-hop neighbor available to be chosen as backup, the sensor will choose a backup among its 1-hop neighbors. Specifically, a 1-hop neighbor j is chosen as the backup of a critical sensor i if and only if the following conditions are satisfied:

\begin{matrix} λ_{2} (G_{j}) > 0, \end{matrix}

(5)

\begin{matrix} Φ [B A (j)] = \min \{Φ [B A ()]\}; \end{matrix}

(6)

that is, (1) j must be a noncritical sensor and (2) among all of the available 1-hop neighbors of i, j is geographically the closest one.

A very interesting yet crucial fact is that a noncritical node may become critical upon the failure of other nodes, and any movement of it may cause further partition of the network. This can be seen from Figure 3. As shown in Figure 3(a), upon the failure of the critical sensor 1, sensor 2 turns out to be a critical sensor while it was initially noncritical, and the movement of sensor 2 may cause further disconnection between sensor 3 and sensor 4. However, as shown in Figure 3(b), relocating sensor 2 to the exact position of the failed sensor 1 will not affect the global connectivity. The explanation of these interesting phenomena is straightforward: by substituting a failed critical node with a noncritical node, the final topology of the graph can be viewed as the initial topology without the noncritical node, that is, the failure of the noncritical node. Thus the global connectivity is maintained. The aforementioned fact is of great importance and entitles us to use just one sensor to restore the connectivity rather than use cascaded and potential based movements of a group of sensors.

Figure 3

An example in which the initially noncritical sensor 2 becomes a critical vertex when sensor 1 fails: (a) the initial topology and (b) the updated topology after sensor 2 has restored the connectivity.
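The substitution argument can be checked numerically. The sketch below is a minimal illustration with a hypothetical five-sensor layout (it does not reproduce the topology of Figure 3) and hypothetical function names: sensor 1 is a cut-vertex, sensor 2 is an initially noncritical 1-hop neighbor, and placing sensor 2 at the exact position of the failed sensor 1 yields the initial topology without sensor 2, which is connected.

```python
import math
from collections import deque


def connected(positions, R):
    """Breadth-first connectivity test on the graph induced by positions and range R."""
    ids = list(positions)
    if not ids:
        return True
    seen, queue = {ids[0]}, deque([ids[0]])
    while queue:
        u = queue.popleft()
        for v in ids:
            if v not in seen and math.dist(positions[u], positions[v]) <= R:
                seen.add(v)
                queue.append(v)
    return len(seen) == len(ids)


def substitute(positions, failed, backup):
    """Remove the failed sensor and place the backup at its former location."""
    after = {k: p for k, p in positions.items() if k != failed}
    after[backup] = positions[failed]
    return after


# Hypothetical layout with R = 10: sensor 5 is attached to the network only through sensor 1.
R = 10.0
pos = {1: (0.0, 0.0), 2: (0.0, 8.0), 3: (-7.0, 4.0), 4: (7.0, 4.0), 5: (0.0, -9.0)}
partitioned = {k: p for k, p in pos.items() if k != 1}  # sensor 1 has failed
restored = substitute(pos, failed=1, backup=2)          # sensor 2 takes its place
print(connected(pos, R), connected(partitioned, R), connected(restored, R))
# prints: True False True
```

Only one sensor moves, and sensors 3 and 4 stay connected through the relocated sensor 2 because it now occupies the position that linked them before the failure.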

Case 2.

If a critical sensor i receives no $B A ()$ from its 1-hop neighbors, no 1-hop neighbor is available to be chosen as the backup. Sensor i will then locate a noncritical sensor among neighbors 2 or more hops away. First, sensor i will compose a backup request message $B R (i)$ . Then, the $B R (i)$ message will be sent to its 2-hop neighbors by setting the hop-count to $T T L = 2$ . Any noncritical sensor receiving a $B R (i)$ will respond to it by sending a $B A ()$ back to sensor i. The value of $B A ()$ from a two-hop sensor j is again the Euclidean distance from it to the critical sensor i; that is, $Φ [B A (j)] = d_{i j}$ . Finally, after collecting the $B A ()$ messages, sensor i will select a backup sensor according to (6). Note that a gradual expansion technique [29] is used to locate a noncritical sensor from higher hop-count neighbors; that is, sensor i composes a new $B R (i)$ with $T T L = k$ if there is no backup available within $k - 1$ hops. Sensor i will stop sending $B R (i)$ as soon as a backup is located. However, there is a rare case where no noncritical node exists in the network.
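Cases 1 and 2 can be summarized as a search that widens one hop at a time. The following Python sketch is an illustration only: it replaces the message exchange by a centralized breadth-first hop count, the function names and the five-sensor line topology are hypothetical, and it assumes that occupied sensors decline at every hop distance, whereas the text states this explicitly only for 1-hop neighbors.

```python
import math
from collections import deque


def hop_counts(neighbors, source):
    """Breadth-first hop distance from source to every reachable sensor."""
    hops, queue = {source: 0}, deque([source])
    while queue:
        u = queue.popleft()
        for v in neighbors[u]:
            if v not in hops:
                hops[v] = hops[u] + 1
                queue.append(v)
    return hops


def select_backup(neighbors, positions, critical, occupied, i):
    """Widen the search hop by hop; return (backup, ttl) or None if no offer arrives."""
    hops = hop_counts(neighbors, i)
    for ttl in range(1, len(neighbors)):
        # Sensors within ttl hops that would answer with a BA() message.
        offers = [j for j, h in hops.items()
                  if 0 < h <= ttl and j not in critical and j not in occupied]
        if offers:
            # Equation (6): the offer with the smallest Euclidean distance wins.
            best = min(offers, key=lambda j: math.dist(positions[i], positions[j]))
            return best, ttl
    return None


# Hypothetical line of five sensors; the interior sensors 2, 3, and 4 are critical.
neighbors = {1: [2], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4]}
positions = {k: (10.0 * k, 0.0) for k in neighbors}
critical = {2, 3, 4}
print(select_backup(neighbors, positions, critical, set(), 2))  # Case 1: 1-hop backup
print(select_backup(neighbors, positions, critical, set(), 3))  # Case 2: TTL grows to 2
```

A return value of `None` corresponds to the situation in which no noncritical sensor exists in the network.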

Case 3.

If a critical sensor i does not receive any $B A ()$ message after searching the entire network, then all sensors in the network have been identified as critical, and the network may not be restorable upon the failure of some particular sensor. A heuristic solution to this situation under limited information is for each 1-hop neighbor j of the critical sensor i to rerun the critical sensor determination algorithm without sensor i and its other 1-hop neighbors. In this step, instead of computing the Fiedler value of $G_{j}$ , sensor j computes the Fiedler value of another subgraph $G_{j}^{'}$ , where $G_{j}^{'} = G_{j} ∖ (\{i\} \cup N_{i})$ , that is, a subgraph that excludes sensor i and all the 1-hop neighbors of i. If such a subgraph is connected without sensor j, that is, $λ_{2} (G_{j}^{'}) > 0$ , then sensor j will be able to replace sensor i without incurring any further disconnection of the network. For instance, as shown in Figure 4, all of the sensors in the network are initially critical sensors, and to choose a backup for sensor 1, each of its 1-hop neighbors will again compute its own criticality. If $λ_{2} (G_{2}^{'}) > 0$ , sensor 2 will compose a $B A (2)$ to participate in the backup selection. According to our method, sensors 3, 4, and 5 will also participate in the selection, and the selection will again obey (6). It is obvious that substituting sensor 1 with any of the 4 sensors can restore the global connectivity. The detailed pseudocode for the backup sensor selection is shown in Pseudocode 2, and the sequence diagram for this backup sensor selection algorithm can be found in Figure 5.

Pseudocode 2: Pseudocode of the backup sensor selection.

(1) For Critical(i) = true

(2) if $\{Φ [B A ()]\} \neq ⌀$ #Case 1

(3) if $Φ [B A (j)] = \min_{} \{Φ [B A ()]\}$

(4) Backup(i) = j, inform j

(5) end if

(6) else

(7) TTL = TTL + 1 #Case 2, initial TTL = 2

(8) if TTL < M

(9) waiting for $B A ()$ , go to (2)

(10) else

(11) for $j \in N_{i}$ #Case 3, for every 1-hop neighbor of sensor i

(12) Subgraph( $G_{j}$ ) = $G_{j}^{'}$

(13) if $λ_{2} (G_{j}^{'}) > 0$

(14) Send $B A (j)$ to i

(15) end if

(16) end

(17) go to (2)

(18) end if

(19) end

Figure 4

An example of a topology containing only critical sensors.

Figure 5

Sequence diagram of the backup sensor selection algorithm.

The designated backup sensor is responsible for monitoring the status of its parental critical sensor. If it is a 1-hop neighbor of the critical sensor, the failure of the critical sensor can be easily detected from missing heartbeat messages. Otherwise, if the backup sensor is 2 or more hops away from the critical sensor, the critical sensor should unicast the heartbeat message to it. If heartbeat messages are missing for longer than a tolerated time period, the restoration process will be initiated.

Since cascaded movement is not required in CSDS, a simple node relocation method will be sufficient to drive the backup sensor to the position of the failed critical sensor in order to restore the connectivity. A straightforward solution is to utilize a target potential force $F_{T}$ , which can be a simple quadratic function in the distance to the target with a minimum at the target, such as $F_{T} = - \nabla V_{T} (ξ)$ , and

\begin{matrix} V_{T} (ξ) = K_{r} {‖ξ_{b} - ξ_{f}‖}^{2}, \end{matrix}

(7)

where $K_{r}$ is a constant gain and $ξ_{b}$ and $ξ_{f}$ are the positions of the backup sensor and its corresponding failed critical sensor.
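As a minimal sketch of how the target potential in (7) can drive the relocation, the Python function below integrates $F_{T} = - \nabla V_{T}$ with a simple first-order update. The discretization, the default gain `k_r`, the time step `dt`, the stopping tolerance `tol`, and the function name `relocate` are all assumptions made for illustration; the source does not specify a motion model.

```python
import math


def relocate(xi_b, xi_f, k_r=0.5, dt=0.1, tol=1e-3, max_steps=10_000):
    """Follow F_T = -grad V_T with V_T = k_r * ||xi_b - xi_f||^2, as in (7)."""
    x, y = xi_b
    path_length = 0.0
    for _ in range(max_steps):
        dx, dy = x - xi_f[0], y - xi_f[1]
        if math.hypot(dx, dy) <= tol:
            break  # the backup has reached the failed sensor's position
        # The gradient of V_T is 2 * k_r * (xi_b - xi_f); the force is its negative.
        fx, fy = -2.0 * k_r * dx, -2.0 * k_r * dy
        # Assumed first-order motion model: velocity proportional to the force.
        x, y = x + dt * fx, y + dt * fy
        path_length += dt * math.hypot(fx, fy)
    return (x, y), path_length


final, travelled = relocate((100.0, 0.0), (0.0, 0.0))
print(final, travelled)
```

With these defaults the remaining distance shrinks by a factor of $1 - 2 K_{r} \, dt = 0.9$ per step, so the backup approaches the target along a straight line and the travelled length is essentially the initial distance. The update converges monotonically only if $2 K_{r} \, dt < 1$ .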

4.3. Algorithm Analysis

To verify the correctness and efficiency of CSDS, we introduce the following theorems.

Theorem 5.

In the worst case, the total movement distance (TMD) in CSDS is $0.5 (M - 1) R$ .

Proof.

The worst case topology for TMD is a line topology as in Figure 6, in which the distance between each pair of neighbors equals R. The failed sensor must be in the center of the line; otherwise, a closer backup (either sensor 1 or sensor M) will be located first, and its TMD is smaller than the TMD in the worst case. Therefore, (1) if M is even, then the worst case TMD occurs when the failed sensor ID is $0.5 M$ or $0.5 M + 1$ , and the TMD for both cases equals $0.5 M R - R$ ; (2) if M is odd, then the worst case TMD occurs when the failed sensor ID is $0.5 (M + 1)$ , and the worst case TMD in this case is $0.5 (M - 1) R$ . To conclude, the worst case TMD in CSDS is $0.5 (M - 1) R$ .

Figure 6

The worst case topology in terms of TMD.
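The bound of Theorem 5 can be checked by brute force on the line topology. The sketch below uses hypothetical function names and relies on the observation from the proof that only the two end sensors, 1 and M, are noncritical on a line, so the nearer of the two is selected as backup. It enumerates every interior failure position and confirms that the resulting TMD never exceeds $0.5 (M - 1) R$ .

```python
def tmd_on_line(M, failed, R):
    """Distance travelled by the nearer end sensor (1 or M) to reach `failed`."""
    return min(failed - 1, M - failed) * R


def worst_case_tmd(M, R):
    """Largest TMD over all interior (critical) sensors of a line with M >= 3 sensors."""
    return max(tmd_on_line(M, k, R) for k in range(2, M))


for M in (7, 8):
    print(M, worst_case_tmd(M, 1.0), 0.5 * (M - 1))
```

For an odd number of sensors the bound is attained exactly, whereas for an even number the worst case is $0.5 M R - R$ , which lies below the bound.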

CSDS outperforms some of the well-known methods in terms of TMD, as shown in Table 1. In addition, with cascaded movement, the maximum number of sensors moved in the worst case amounts to nearly all of the sensors in the entire network. CSDS, on the other hand, ensures that only one sensor is moved regardless of the network topology.

Table 1

Comparisons of different methods.

Methods	Metrics (in worst case)
Methods	Information	NSM^a	TMD	TNM
DARA [7]	2-hop	$M - 3$	$(M - 3) R$	$O (M)$
PADRA+ [15]	2-hop	$0.5 (M - 1)$	$0.5 (M - 1) R$	$O (M^{2})$
HERO [8]	2-hop	$M - 1$	$(M - 1) R$	$O (M)$
CSDS	2-hop	1	$0.5 (M - 1) R$	$O (M^{2})$

^aNSM denotes number of sensors moved.

Another important metric that is measured in partition elimination is the message complexity of the algorithm, which indicates its scalability in large-scale networks. The message complexity is often measured by the total number of messages (TNM) in each execution loop of the algorithm. For CSDS, we have the following theorem.

Theorem 6.

The message complexity of CSDS is $O (M^{2})$ .

Proof.

The message complexity refers to the maximum quantity of generated messages of all sensors in the worst case with respect to number of mobile nodes. First, since CSDS requires 2-hop neighboring information in critical sensor determination, there are a total of $2 M$ messages. Second, for the backup selection algorithm, the worst case structure for TNM is a pure ringlike topology as shown in Figure 7. In this topology, all sensors in the network are critical vertices, and each sensor needs to send a total of $M - 2$ messages just to search for the backup sensor, and another 2 messages are required to determine the backup sensor. This leads to a total of M messages for each sensor and $M^{2}$ messages for the entire MSN in the backup selection step. Including the $2 M$ messages in the critical sensor determination, the TNM for CSDS is $M^{2} + 2 M$ which is $O (M^{2})$ .

Figure 7

The worst case topology in terms of TNM.

In fact, CSDS may need to locate a remote sensor as its backup because it requires that only one sensor be moved for the partition elimination. A relatively higher message complexity is then inevitable, as can be seen from Table 1. However, the worst case topology is extremely unlikely in a typical uniformly randomly distributed MSN, and the critical sensors are expected to locate a backup within a few hops. This behavior can be clearly observed in the experimental simulations, which are discussed in the next section.

5. Experimental Simulations

5.1. Simulation Scenarios and Performance Metrics

In the simulation scenarios, an initially connected MSN is randomly distributed over a 1000 m × 1000 m area of interest. During the simulation, a randomly chosen cut-vertex fails at a given time point. Each simulation is run for 50 different network topologies, and the reported results are the averages over these runs. We choose the NSM, TNM, and TMD as the performance metrics to evaluate the overall efficiency of CSDS against other popular methods. We adopt DARA, HERO, and PADRA+ as the baseline approaches for comparison with the proposed CSDS, because they likewise use two-hop neighboring information in the restoration process.
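The scenario setup described above can be sketched as follows. This is an illustrative sketch only: the function names, the unit-disk link model, and the parameters `num_sensors` and `comm_range` are assumptions that this section does not specify. The cut-vertex test is a centralized brute-force check used only to set up the failure, not the local 2-hop determination method of CSDS.

```python
import math
import random


def build_graph(positions, comm_range):
    """Adjacency sets: two sensors are linked if within comm_range (assumed model)."""
    n = len(positions)
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(positions[i], positions[j]) <= comm_range:
                adj[i].add(j)
                adj[j].add(i)
    return adj


def components(adj, removed=frozenset()):
    """Connected components of the graph after deleting the `removed` nodes."""
    seen, parts = set(removed), []
    for start in adj:
        if start in seen:
            continue
        stack, part = [start], set()
        seen.add(start)
        while stack:
            u = stack.pop()
            part.add(u)
            for v in adj[u]:
                if v not in seen:
                    seen.add(v)
                    stack.append(v)
        parts.append(part)
    return parts


def cut_vertices(adj):
    """Brute force: nodes whose removal leaves more than one component.

    Assumes the input graph is connected.
    """
    return [v for v in adj if len(components(adj, {v})) > 1]


def random_scenario(num_sensors, comm_range, side=1000.0, rng=None, max_tries=1000):
    """Draw uniform deployments until one is connected and has a cut-vertex.

    Returns (positions, adjacency, failed_node), where failed_node is a
    randomly chosen cut-vertex.
    """
    rng = rng or random.Random()
    for _ in range(max_tries):
        positions = [(rng.uniform(0, side), rng.uniform(0, side))
                     for _ in range(num_sensors)]
        adj = build_graph(positions, comm_range)
        if len(components(adj)) != 1:
            continue  # the MSN must be initially connected
        cuts = cut_vertices(adj)
        if cuts:
            return positions, adj, rng.choice(cuts)
    raise RuntimeError("no suitable topology found")
```

Calling `components(adj, {failed_node})` on the returned scenario yields the disjoint segments that a restoration method then has to reconnect.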

5.2. Results and Discussions

Before presenting the performance evaluation of CSDS, we conduct a simulation study of how often 1-hop and 2-hop backups are available; that is, among the critical sensors in a randomly distributed network, how many have a backup in their 1-hop or 2-hop neighbor sets. The results in Figure 8 indicate that a critical sensor can locate a backup within 2 hops with high probability, which is important because it reduces the message cost by avoiding a search through the whole network. Meanwhile, as the network becomes denser, the number of critical sensors decreases rapidly; thus, the probability of locating a 1-hop backup sensor becomes even higher.

Figure 8

Ratios of 1- and 2-hop backup sensors with increased density.
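The ratios plotted in Figure 8 can be tallied from hop distances once each critical sensor has been assigned a backup. The sketch below assumes a hypothetical input `backup_of` that maps each critical sensor to its selected backup; how the backup is chosen is left to the backup selection algorithm and is not reproduced here.

```python
from collections import deque


def hop_distances(adj, source):
    """Breadth-first search: hop count from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def backup_hop_ratios(adj, backup_of):
    """Fraction of critical sensors whose backup is 1 hop, 2 hops, or farther away.

    adj: dict mapping each node to the set of its neighbors.
    backup_of: hypothetical dict mapping critical sensor -> chosen backup.
    """
    counts = {"1-hop": 0, "2-hop": 0, "farther": 0}
    for critical, backup in backup_of.items():
        d = hop_distances(adj, critical).get(backup)
        if d == 1:
            counts["1-hop"] += 1
        elif d == 2:
            counts["2-hop"] += 1
        else:
            counts["farther"] += 1
    total = len(backup_of)
    if total == 0:
        return {key: 0.0 for key in counts}
    return {key: value / total for key, value in counts.items()}
```

Averaging these ratios over many random topologies at each network size gives one point per density on a plot of this kind.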

The results of NSM are shown in Figure 9(a). Only 1 sensor is involved in a restoration process for CSDS. Meanwhile, the NSM for PADRA+ is around 2.5, due to its computation of the CDS and its cascaded movement scheme. The NSM for DARA is around 4, and an even higher NSM can be observed for HERO, because its attractive potential function causes smaller movements of a relatively large number of sensors. The NSM for HERO peaks at about 7 as the density of the network increases.

Figure 9

Comparative studies. (a) Number of sensors moved with increased density, (b) total number of messages with increased density, and (c) total movement distances with increased density.

Furthermore, the TNM of CSDS is shown in Figure 9(b). Since the message complexity of CSDS is $O (M^{2})$, a relatively higher message cost is expected in comparison with HERO and DARA, whose message complexity is $O (M)$. Moreover, since PADRA+ allows cascaded movements, its TNM is also smaller than that of CSDS, even though its message complexity is also $O (M^{2})$. It is worthwhile to note that the TNM of CSDS is much smaller than $M^{2}$, because the worst case topology is unlikely to occur and the probability of locating a 1-hop backup is high. Therefore, the relatively higher TNM should not be a major concern in applications.

As an important factor in evaluating the efficiency of the partition elimination, the results of TMD are shown in Figure 9(c). Compared with cascaded and potential function based movement patterns, direct relocation of only 1 sensor can effectively reduce the TMD regardless of the topology and density of the network. As validated in the simulation study, the TMD of CSDS is smaller than that of HERO, PADRA+, and DARA and is further reduced as the density of the network increases. The TMD of CSDS can be as small as nearly 50 meters when the number of sensors reaches 180, which indicates that the backups are almost always selected from the 1-hop neighbor sets.

6. Conclusions

To minimize the impacts on the network structure and the sensors' current tasks, a partition elimination strategy, CSDS, which moves only one sensor during a restoration process, is presented in this work. CSDS is fully distributed and requires only local information about 2-hop neighbors. Consisting of a graph-theoretic critical sensor determination algorithm and an effective backup sensor selection algorithm, CSDS can immediately restore the connectivity of the network after the failure of a cut-vertex. Experimental simulation studies verify that, with a slightly higher message cost, CSDS can improve the efficiency of connectivity restoration in terms of total movement distance and maintain the layout of the sensor system by moving only one mobile sensor.

Future work will consider network optimization prior to the disconnection of the network or during the connectivity restoration process, by implementing clustering algorithms and a dual descent approach [30]. More realistic system and environment models will also be considered so that the connectivity restoration methods become more suitable for real-world applications.

Footnotes

Conflict of Interests

The authors declare no conflict of interests.

Acknowledgments

This work was supported by the National Science Foundation of China (Grants nos. 61272432, 61370092, and 61472033), Hubei Provincial Department of Education Outstanding Youth Scientific Innovation Team Support Foundation (T201410), and Fundamental Research Funds for the Central Universities (TW201502).

References

1. Dargie, Poellabauer. Fundamentals of Wireless Sensor Networks: Theory and Practice. Chichester, UK: John Wiley & Sons; 2010. doi:10.1002/9780470666388.

2. Basu, Redi. Movement control algorithms for realization of fault-tolerant ad hoc robot networks. IEEE Network. 2004;18(4):36-44. doi:10.1109/MNET.2004.1316760. Scopus 2-s2.0-4444339822.

3. Younis, Senturk I. F., Akkaya, Lee, Senel. Topology management techniques for tolerating node failures in wireless sensor networks: a survey. Computer Networks. 2014;58(1):254-283. doi:10.1016/j.comnet.2013.08.021. Scopus 2-s2.0-84893734846.

4. Senturk I. F., Akkaya, Senel. An effective and scalable connectivity restoration heuristic for mobile sensor/actor networks. Proceedings of the IEEE Global Communications Conference (GLOBECOM ′12); December 2012; Anaheim, Calif, USA. pp. 518-523. doi:10.1109/glocom.2012.6503165. Scopus 2-s2.0-84877646783.

5. Yang, Liu. HERO: a hybrid connectivity restoration framework for mobile multi-agent networks. Proceedings of the IEEE International Conference on Robotics and Automation (ICRA ′11); May 2011; Shanghai, China. IEEE; pp. 1702-1707. doi:10.1109/icra.2011.5979682. Scopus 2-s2.0-84871689620.

6. Almasaeid H. M., Kamal A. E. On the minimum k-connectivity repair in wireless sensor networks. Proceedings of the IEEE International Conference on Communications (ICC ′09); June 2009; Dresden, Germany. pp. 1-5. doi:10.1109/icc.2009.5199257. Scopus 2-s2.0-70449513990.

7. Abbasi A. A., Younis, Akkaya. Movement-assisted connectivity restoration in wireless sensor and actor networks. IEEE Transactions on Parallel and Distributed Systems. 2009;20(9):1366-1379. doi:10.1109/TPDS.2008.246. Scopus 2-s2.0-68849104727.

8. Yang, Ding. Distributed connectivity restoration for mobile sensor systems with limited information. IEEE Sensors Journal. 2014;14(11):3838-3850. doi:10.1109/jsen.2014.2345565.

9. Yang, James. Graph-theoretic critical sensor determination and partition elimination in mobile sensor networks. Proceedings of the IEEE International Conferences on Communications; 2015; London, UK. pp. 1-5.

10. Daskin M. S. Network and Discrete Location. Wiley-Interscience Series in Discrete Mathematics and Optimization. New York, NY, USA: John Wiley & Sons; 1995. doi:10.1002/9781118032343. MR1326602.

11. Goldman A. J. Optimal center location in simple networks. Transportation Science. 1971;5:212-221. MR0359738.

12. Liotta, Ragusa, Pavlou. Near-optimal service facility location in dynamic communication networks. IEEE Communications Letters. 2005;9(9):862-864. doi:10.1109/LCOMM.2005.1506728. Scopus 2-s2.0-27844569865.

13. Zhou. The multi-covering emergency service facility location problem with considering disaster losses. Proceedings of the 11th International Symposium on Operations Research and Its Applications in Engineering, Technology and Management (ISORA ′13); August 2013. pp. 1-6. doi:10.1049/cp.2013.2249.

14. Ragusa, Liotta, Pavlou. An adaptive clustering approach for the management of dynamic systems. IEEE Journal on Selected Areas in Communications. 2005;23(12):2223-2235. doi:10.1109/JSAC.2005.857203. Scopus 2-s2.0-29144484196.

15. Akkaya, Senel, Thimmapuram, Uludag. Distributed recovery from network partitioning in movable sensor/actor networks via controlled mobility. IEEE Transactions on Computers. 2010;59(2):258-271. doi:10.1109/tc.2009.120. MR2750529. Scopus 2-s2.0-75149155111.

16. Dai. An extended localized algorithm for connected dominating set formation in ad hoc wireless networks. IEEE Transactions on Parallel and Distributed Systems. 2004;15(10):908-920. doi:10.1109/TPDS.2004.48. Scopus 2-s2.0-17244373061.

17. Abbasi A. A., Younis M. F., Baroudi U. A. Recovering from a node failure in wireless sensor-actor networks with minimal topology changes. IEEE Transactions on Vehicular Technology. 2013;62(1):256-271. doi:10.1109/tvt.2012.2212734. Scopus 2-s2.0-84883690948.

18. Ranga, Dave, Verma A. K. A hybrid timer based single node failure recovery approach for WSANs. Wireless Personal Communications. 2014;77(4):2155-2182. doi:10.1007/s11277-014-1631-4. Scopus 2-s2.0-84893196442.

19. Imran, Younis, Md Said, Hasbullah. Localized motion-based connectivity restoration algorithms for wireless sensor and actor networks. Journal of Network and Computer Applications. 2012;35(2):844-856. doi:10.1016/j.jnca.2011.12.002. Scopus 2-s2.0-84856211515.

20. Zavlanos M. M., Pappas G. J. Distributed connectivity control of mobile networks. IEEE Transactions on Robotics. 2008;24(6):1416-1428. doi:10.1109/TRO.2008.2006233. Scopus 2-s2.0-58249142713.

21. Zavlanos M. M., Pappas G. J. Potential fields for maintaining connectivity of mobile networks. IEEE Transactions on Robotics. 2007;23(4):812-816. doi:10.1109/tro.2007.900642. Scopus 2-s2.0-34548161470.

22. Yang, Ding. Self-organized connectivity control and optimization subjected to dispersion of mobile ad hoc sensor networks. International Journal of Distributed Sensor Networks. 2012;2012: Article ID 672436, 15 pages. doi:10.1155/2012/672436. Scopus 2-s2.0-84870188772.

23. Egerstedt. Distributed coordination control of multiagent systems while preserving connectedness. IEEE Transactions on Robotics. 2007;23(4):693-703. doi:10.1109/TRO.2007.900638. Scopus 2-s2.0-34548154326.

24. Yang, Freeman R. A., Gordon G. J., Lynch K. M., Srinivasa S. S., Sukthankar. Decentralized estimation and control of graph connectivity for mobile sensor networks. Automatica. 2010;46(2):390-396. doi:10.1016/j.automatica.2009.11.012. MR2877085. Scopus 2-s2.0-74149084751.

25. Ajorlou, Momeni, Aghdam A. G. Connectivity preservation in a network of single integrator agents. Proceedings of the 48th IEEE Conference on Decision and Control Held Jointly with 2009 28th Chinese Control Conference (CDC/CCC '09); December 2009; Shanghai, China. pp. 7061-7067. doi:10.1109/cdc.2009.5400614. Scopus 2-s2.0-77950837335.

26. Barnes, Fields, Valavanis. Unmanned ground vehicle swarm formation control using potential fields. Proceedings of the Mediterranean Conference on Control and Automation (MED ′07); July 2007. pp. 1-8. doi:10.1109/med.2007.4433724. Scopus 2-s2.0-50249118304.

27. Godsil, Royle. Algebraic Graph Theory. Graduate Texts in Mathematics, vol. 207. Berlin, Germany: Springer; 2001. doi:10.1007/978-1-4613-0163-9. MR1829620.

28. Wehmuth, Ziviani. Distributed location of the critical nodes to network robustness based on spectral analysis. Proceedings of the 7th Latin-American Network Operations and Management Symposium (LANOMS ′11); October 2011. pp. 1-8. doi:10.1109/lanoms.2011.6102259. Scopus 2-s2.0-84863307139.

29. Yang, Wang. Connectivity preserving task allocation in mobile robotic sensor network. Proceedings of the IEEE International Conferences on Communication (ICC ′14); June 2014; Sydney, Australia. pp. 1-6.

30. Zargham, Ribeiro, Ozdaglar, Jadbabaie. Accelerated dual descent for network flow optimization. IEEE Transactions on Automatic Control. 2014;59(4):905-920. doi:10.1109/tac.2013.2293221. MR3199342. Scopus 2-s2.0-84897429807.