Sage Journals: Discover world-class research

Abstract

Sensor nodes in wireless sensor networks are prone to malfunction because they are exposed to the nearby environment directly. Consequently, wrong sensor readings occurred from sensor nodes and these readings are called an outlier. Commonly, since an outlier deviates from normal sensor readings and it can bring about some problems, various techniques to detect the outliers have been proposed. In this paper, we propose an efficient outlier detection technique based on data clustering. In order to decide the width of the cluster that consists of the sensor readings, we applied the Pigeonhole Principle and then detected the outliers based on clusters. In experiments, we demonstrate the efficiency of our proposed technique compared to other outlier detection techniques.

1. Introduction

Recently, since development of the integrated circuit, the size of the sensor is gradually reduced and various sensors are built in the sensor node in a wireless sensor network (WSN). Sensor nodes detect the various and huge information (e.g., temperature, humidity, and light) around their environments and they communicate to the base station and others using radio transmission. Accordingly, WSNs are used in various applications such as environment and habitat monitoring [1, 2], combat field surveillance [3], security [4], or health care [5] applications.

Commonly, sensor nodes are severely constrained in terms of the computation power, communication bandwidth, and battery power. Among these limitations, the power is of utmost importance, since replacing the battery of sensor nodes is too either expensive or impossible [6, 7]. Thus, the energy preservation is a major research issue since it directly impacts the life time of the network. Recently, much research has shown that the radio communication is more expensive than the computation or the sensing. Thus, many techniques [8–13] have been proposed in order to reduce the communication overhead.

Particularly, since sensor nodes are placed outdoor for the applications such as the disaster monitoring [14] and habitat monitoring, the sensor nodes can be malfunctioned or the sensor readings may be incorrect due to external impact such as the severe external environments [15]. In addition, due to sudden changes in external environments, some sensor readings may deviate significantly from the normal sensor readings. These abnormal sensor readings are called the outliers. For example, assume that several sensor nodes are deployed in a mountain to monitor forest fires. When an outlier value is detected and sent to a forest guard, the forest guard can identify the actual forest fires or he/she can initialize the sensor node if the outlier is generated by the malfunctioned sensor. Thus, the outlier detection is a quite important task to detect an event or to maintain sensor networks harmoniously.

In this paper, we focus on an energy-efficient outlier detection technique in WSNs based on data clustering. To construct data clusters, we use the Pigeonhole Principle. By applying cluster width, called the permission range, obtained by the Pigeonhole Principle, we partition the data space into several clusters. However, if we partition the data space evenly, some sensor readings are identified as outliers although similar sensor readings of them are detected by sensor nodes. Thus, we partition the domain of data unevenly based on the location of sensor readings. Then, we identify the outliers according to the user-defined threshold θ which is related to the number of sensor readings in a cluster.

The remainder of the paper is organized as follows. Section 2 discusses related work. In Section 3, we present the background of our work. Section 4 introduces the proposed outlier detection technique in WSNs. Section 5 presents an empirical evaluation. Section 6 summarizes the paper.

2. Related Work

In WSNs, a lot of outlier detection techniques have been proposed. In [16], an outlier detection technique was proposed to collect the outliers with respect to the neighbor sensor nodes. Each sensor node calculates the median of the sensor readings received from the neighbors and its readings. Each sensor node computes the mean μ and the standard deviation σ of the differences between its sensor readings and the calculated median. If the standardized value $(= (v - μ) / σ)$ of a sensor reading v is greater than or equal to the user-defined threshold, it is regarded as an outlier.

Palpanas et al. proposed an outlier detection technique based on the the Epanechnikov kernel function [17]. Given a value v, each sensor node estimates the number of values around v using the kernel function. If the number of values around value v is less than a user-defined threshold p, a value v is regarded as an outlier. However, to make the kernel function, each sensor node transmits the required information to the base station along the routing path. Thus, it wastes a lot of energy.

In [18], an outlier detection technique based on data clustering, called DC, was proposed. Given a user-defined threshold ε, if the distance between any two sensor readings is less than ε, they become a cluster. When the distance between a pair of cluster centers is less than ε, they are merged. The intercluster distance (ICD) of a cluster to k-nearest clusters is computed to detect the outlier clusters. When ICD of a cluster is quite different from the mean of ICDs, it is regarded as an outlier cluster. Then, the information of outlier clusters is broadcasted into WSNs. Thus, it consumes much energy.

In [19], an outlier detection technique based on an ellipsoid was proposed. Each sensor node constructs an ellipsoid boundary of sensor readings using the mean and the covariance of sensor readings and transmits its ellipsoid boundary to the base station. The base station merges all received ellipsoids, computes the global ellipsoid boundary, and broadcasts the global ellipsoid to all sensor nodes. Then, with respect to the global ellipsoid boundary, each sensor node identifies the outliers among its sensor readings.

Recently, an outlier detection technique based on the distance between sensor readings and the estimation deviation was proposed [20]. Each sensor node computes the expected deviation and the average distance between pairs of sensor readings detected within recent time interval. If the average distance is greater than the expected deviation, the sensor reading at the current time becomes an outlier. However, when the expectation model is frequently updated, the communication overhead increases.

3. Preliminary

In this section, we present the basic model of sensor networks briefly.

3.1. Sensor Networks

We consider a sensor network consisting of n stationary sensor nodes ${S_{1}, S_{2}, . . ., S_{n}}$ deployed in a field of interest and the powered base station serving as an access point for users to pose ad hoc queries. We use a routing tree [9] which is frequently used as a primitive to collect sensor nodes. Two nodes capable of bidirectional wireless communication directly are referred to as the neighbors for each other. Each sensor node can broadcast a message to all of its neighbors (or from a parent to its child nodes) at a time. A simple sensor network using a tree routing is shown in Figure 1. In Figure 1, $S_{1}$ to $S_{3}$ are the intermediate sensor nodes that have the child sensor nodes, and $S_{4}$ to $S_{9}$ are the leaf sensor nodes that have no child node.

Figure 1

A simple sensor network.

Each sensor node generates its readings periodically. A sampling period is known as an epoch [21]. To agree on a global time base that allows sensor nodes to start and finish each epoch simultaneously, each sensor node executes the SMACS protocol [22] or a global time synchronization protocol [23]. Based on global time synchronized, nodes sleep for a certain period of time in each epoch to minimize energy consumption and each sensor node awakes to sample and receive results when its neighbors try to propagate a message.

4. Outlier Detection Based on Clustering

In this section, we present our proposed outlier detection technique which is based on data clustering in WSNs. In addition, we introduce an efficient data transmission scheme for our outlier detection technique.

4.1. Clustering Technique

Generally, in the outlier detection techniques based on data clustering, the width of clusters is the most important, since the number of sensor readings in each cluster is affected by the width of clusters. If the width of clusters is too large, all sensor readings may belong to a single cluster. Otherwise, each cluster may have only a single sensor reading. Thus, the outliers cannot be identified in these cases.

To solve the above problem, in this paper, we applied the Pigeonhole Principle to determine the width of clusters. We regard that pigeonholes and pigeons are the domain of sensor readings and the sensor readings, respectively. Thus, when we partition this domain into x number of subdomains, where x is less than the number of sensor readings, at least one subdomain contains more than two sensor readings.

Given a set of sensor readings $R = {r_{1}, r_{2}, \dots, r_{n}} (| R | = n)$ , we can acquire the domain of R as [ $MAX (R), MIN (R)$ ]. Then, we obtain the permission range ( $PR$ ) as follows:

\begin{matrix} PR = (\frac{MAX (R) - MIN (R)}{n - 1}) . \end{matrix}

(1)

We use PR as the width of clusters and identify the outliers. For instance, given a set of sensor readings R shown in Figure 2, PR of R is obtained as $1.7 = (MAX (R) - MIN (R)) / (n - 1) = (19 - 2) / (11 - 1)$ . If the number of sensor readings in a cluster is less than or equal to the user-defined threshold θ, the cluster is the outlier cluster and the sensor readings in the outlier clusters are considered as the outliers.

Figure 2

Equipartitioning based on permission range.

In Figure 2, the clusters represented by dotted lines are presented when we partition the domain of R evenly. Assume that the user-defined threshold θ is $1$ . In Figure 2, $C_{2}$ , $C_{4}$ , and $C_{8}$ are the outlier clusters (denoted by ellipses) and the sensor readings in each outlier cluster are regarded as the outliers. Although a sensor reading $r_{4}$ in the cluster $C_{4}$ is definitely an outlier, since a sensor reading which is similar to $r_{4}$ does not exist, the sensor readings $r_{3}$ and $r_{7}$ are not the outliers since there are $r_{2}$ and $r_{8}$ which are similar to $r_{3}$ and $r_{7}$ , respectively. In other words, although the difference between a pair of sensor readings is less than the differences of the others, these sensor readings may belong to separated clusters if we partition the domain of R evenly.

To solve this drawback, we propose the nonequipartitioning based on the permission range (PR). In the nonequipartitioning, a set of clusters is constructed with respect to Definition 1.

Definition 1.

If the difference between a pair of sensor readings $r_{i}$ and $r_{j}$ in R is less than or equal to the permission range PR obtained by (1), we say $r_{i}$ is close to $r_{j}$ .

In our proposed technique, if $r_{i}$ and $r_{j}$ are close to each other, a single cluster contains $r_{i}$ and $r_{j}$ . For instance, as shown in Figure 3, since the difference between sensor readings $r_{1}$ and $r_{2}$ is close, a cluster $C_{1}$ for them is constructed. And then, a sensor reading $r_{3}$ is inserted into $C_{1}$ , since the difference between $r_{2}$ and $r_{3}$ is less than PR. But $r_{4}$ is not inserted into $C_{1}$ . The result of nonequipartitioning is presented in Figure 3. Each cluster consists of $C_{1} = {r_{1}, r_{2}, r_{3}}$ , $C_{2} = {r_{4}}$ , $C_{3} = {r_{5}, r_{6}}$ , and $C_{4} = {r_{7}, r_{8}, r_{9}, r_{10}, r_{11}}$ , respectively. Consequently, when a user-defined threshold θ is 1, a cluster $C_{2}$ is an outlier cluster and a sensor reading $r_{4}$ in $C_{2}$ is an outlier.

Figure 3

Nonequipartitioning based on permission range.

4.2. Clustering Scheme for WSNs

If each sensor node transmits its readings to the base station blindly at each epoch and the base station computes the outliers, each sensor node consumes much energy. In this section, we present an efficient data transmission scheme for our outlier detection algorithm. We assume that all sensor nodes take sensor readings periodically and keep these readings into their local storage for a time window w. Note that when w is 1, each sensor node transmits data to the base station at each epoch.

At first, according to the Definition 1, each sensor node constructs clusters using its sensor readings detected within w. If the number of sensor readings in a cluster is greater than a user-defined threshold θ, all sensor readings in this cluster cannot be the outliers (i.e., nonoutlier cluster (NOC)). Otherwise, all sensor readings in a cluster may be the outliers, and then we call such clusters the outlier candidate clusters (OCCs).

Along the routing path to the base station, each sensor node transmits NOCs and OCCs. For NOCs, cluster ranges (CRs) are transmitted only where CR consists of minimum and maximum values of sensor readings in a NOC. In contrast, for OCCs, the sensor readings in OCC are transmitted to the parent node.

When a parent node p received CRs of NOCs and OCCs from its child nodes, p attempts to merge them with its clusters. To merge the clusters, we use the following definition.

Definition 2.

Given two cluster ranges ${CR}_{i} = [\min_{i}, \max_{i}]$ and ${CR}_{j} = [\min_{j}, \max_{j}]$ , where $(\min_{i} < \min_{j})$ , if $\min_{j} - \max_{i} \leq PR$ , we say ${CR}_{i}$ and ${CR}_{j}$ overlap within $PR$ .

If a pair of cluster ranges overlap within PR, there are at least two sensor readings which are close and contained in different clusters. Thus, if two cluster ranges ${CR}_{i}$ (= $[\min_{i}, \max_{i}]$ ) and $C R_{j}$ (= $[\min_{j}, \max_{j}]$ ) of clusters $C_{i}$ and $C_{j}$ overlap within the permission range $PR$ , two clusters $C_{i}$ and $C_{j}$ are merged into a new cluster whose $CR$ is $[MIN ({\min_{i}, \min_{j}}), MAX ({\max_{i}, \max_{j}})]$ .

Note that, when two clusters are merged into a new cluster where at least one of them is a NOC, the merged cluster cannot be an OCC, since the number of sensor readings in a NOC is already greater than θ. Thus, when a sensor node transmits a NOC, we do not need to transmit all sensor readings in the NOC and it only needs the cluster range CR of the NOC. Thus, each sensor node reduces the energy consumption when it transmits the NOCs since the volume of the transmitted data from each sensor decreases.

In contrast, when two OCCs are merged into a new cluster, we check whether the number of sensor reading in the new cluster is greater than θ or not. Since each sensor node sends all sensor readings in each OCC, we can easily count the number of sensor readings in the new cluster.

Along the routing path from each sensor node to the base station, CRs of NOCs and OCCs are merged and transmitted gradually. Finally, the base station can determine the outliers among the received OCCs. Recall that the cluster ranges (CRs) rather than all senor readings in NOCs are transmitted along the routing paths. Thus, we can reduce the energy consumption of each sensor node.

For example, given a WSN with 11 sensor nodes in Figure 4 where the user-defined threshold θ is 1 and the domain of a set of sensor readings is $[2,19]$ , each sensor node constructs the clusters using its sensor readings. Then, a cluster of each sensor node becomes the outlier candidate cluster (OCC), since threshold θ is $1$ . As shown in Figure 5, the leaf nodes (i.e., $S_{8} : S_{11}$ ) transmit their clusters to their parents, respectively. In Figure 5, $S_{4}$ merges its OCCs and the received OCCs coming from $S_{8}$ and $S_{9}$ based on Definition 2, and then $S_{4}$ obtains a ${NOC}_{1}$ : $[15,17]$ . Similarly, $S_{6}$ obtains the OCCs (i.e., ${OCC}_{1}$ : $[3]$ , ${OCC}_{2}$ : $[8]$ , and ${OCC}_{3}$ : $[18]$ ).

Figure 4

Acquisition sensor readings.

Figure 5

Clustering in $S_{4}$ and $S_{6}$ .

In Figure 6, $S_{2}$ receives the NOCs and OCCs from its child nodes ( $S_{4}$ , $S_{5}$ , and $S_{6}$ ), and then $S_{2}$ merges the received clusters. Since the cluster range of $OC C_{1}$ coming from $S_{6}$ overlaps the sensor readings of $S_{2}$ and $S_{5}$ within $PR$ , $S_{2}$ generates a new cluster $NO C_{2}$ . Similarly, $S_{2}$ merges ${NOC}_{1}$ and ${OCC}_{3}$ to $NO C_{3}$ . But $OC C_{2}$ is not merged since it does not overlap with other clusters. And another sensor node $S_{3}$ generates the clusters $OC C_{3}$ and $OC C_{4}$ .

Figure 6

Clustering in $S_{2}$ and $S_{3}$ .

A sensor node $S_{1}$ receives the clusters from its child nodes as shown in Figure 7. $S_{1}$ merges $NO C_{3}$ and $OC C_{4}$ into $NO C_{4}$ , and it merges its sensor reading and $OC C_{3}$ into $NO C_{5}$ . Finally, $S_{1}$ transmits the clusters to the base station, and then the base station detects the outlier cluster using the outlier candidate cluster $OC C_{8}$ .

Figure 7

Clustering in $S_{1}$ .

4.3. Multidimensional Clustering Scheme

Generally, since the sensor nodes in WSNs are equipped with several sensor devices, sensor readings are multidimensional. For instance, when a sensor node obtains the weather information, it detects temperature as well as other parameters such as humidity and light. Thus, we propose the clustering scheme for a set of multidimensional data.

Let a set of d-dimension sensor readings $R^{d}$ be ${r_{1}^{d}, r_{2}^{d}, . . ., r_{n}^{d}} (| R^{d} | = n)$ . A d-dimensional sensor reading $r_{i}^{d}$ is represented as $〈 r_{i} [1], \dots, r_{i} [d] 〉$ , where $r_{i}^{d} [k]$ denotes the kth dimensional value of $r_{i}^{d}$ . On each dimension k such that $1 \leq k \leq d$ , we obtain the permission range ${PR}_{k} = (MAX ({r_{1}^{d} [k], \dots, r_{n}^{d} [k]}) - MIN ({r_{1}^{d} [k], \dots, r_{n}^{d} [k]})) / (n - 1)$ derived from (1). To merge the clusters, we extend Definition 2 as follows:

Definition 3.

Let a d-dimensional cluster range of a cluster $C_{i}$ be ${CR}_{i}^{d}$ , where the kth dimensional ranges of ${CR}_{i}^{d}$ are denoted as ${CR}_{i}^{d} [k] = [\min_{i}^{d} [k], \max_{i}^{d} [k]]$ with $1 \leq k \leq d$ . Given two d-dimensional cluster ranges ${CR}_{i}^{d}$ and ${CR}_{j}^{d}$ , if $\min_{i}^{d} [k] < \min_{j}^{d} [k]$ and $\min_{j}^{d} [k] - \max_{i}^{d} [k] \leq {PR}_{k}$ on a dimension k, we say ${CR}_{i}^{d}$ and ${CR}_{j}^{d}$ overlap within ${PR}_{k}$ on the kth dimension.

Based on Definition 3, For every dimension k with $1 \leq k \leq d$ , ${CR}_{i}^{d}$ and ${CR}_{j}^{d}$ overlap within ${PR}_{k}$ ; we merge the clusters $C_{i}$ and $C_{j}$ into a new cluster.

For example, given three cluster ranges ${CR}_{1}^{d}$ , ${CR}_{2}^{d}$ , and ${CR}_{3}^{d}$ where $d = 2$ , $n = 7$ , $P R_{1} = 1$ , and $P R_{2} = 1.5$ in Table 1, we merge ${CR}_{1}^{d}$ and ${CR}_{2}^{d}$ based on Definition 3. On the first dimension, since $\min_{1}^{d} [1] < \min_{2}^{d} [1]$ and $\min_{2}^{d} [1] - \max_{1}^{d} [1] (=  5 - 4)$ are less than or equal to ${PR}_{1} (= 1)$ , ${CR}_{1}^{d}$ and ${CR}_{2}^{d}$ overlap within ${PR}_{1}$ . ${CR}_{1}^{d}$ and ${CR}_{2}^{d}$ also overlap within ${PR}_{2}$ on the second dimension. Thus, ${CR}_{1}^{d}$ and ${CR}_{2}^{d}$ are merged into a new cluster. However, ${CR}_{3}^{d}$ is not merged into other clusters since ${CR}_{3}^{d}$ is not overlapped with any clusters on the second dimension such as $\min_{3}^{d} [2] - \max_{1}^{d} [2] (= 18 - 13) > {PR}_{2} (= 1.5)$ .

Table 1

Example of CRs on 2 dimensions.

Cluster ranges	$k = 1$	$k = 2$
$C R_{1}^{d}$	$C R_{1}^{d} [k] = [3,4]$	$C R_{1}^{d} [k] = [12,13]$
$C R_{2}^{d}$	$C R_{2}^{d} [k] = [5,6]$	$C R_{2}^{d} [k] = [10,11]$
$C R_{3}^{d}$	$C R_{3}^{d} [k] = [7,9]$	$C R_{3}^{d} [k] = [18,19]$

5. Experiments

5.1. Experimental Environments

To evaluate the performance of our proposed algorithm compared with the state-of-the-art algorithm, we used a set of real-life data which is provided by Intel Berkeley Research Lab [24]. A sensor network consists of 54 sensor nodes, and each sensor node is deployed in $40.5 \times 31$ (m²) area as shown in Figure 8. The base station is located at the center of the area. We set the default communication distance 7 m. The maximum depth and the maximum width of a routing tree in sensor network are 5 and 3, respectively. Sensor readings consist of temperature (Celsius), humidity (%), and illumination (Lux).

Figure 8

Placement of sensor nodes [24].

As competitors, we implemented the Brute-Force ( $BF$ ) and an outlier technique based on data clustering ( $DC$ ) [18]. In $BF$ , each sensor node transmits its readings to the base station at each epoch.

In $DC$ , each sensor node in a network transmits a set of intercluster distances (ICDs) in respect of sensor readings along the routing paths to the base station. The intercluster distance is the distance of centers in any two data clusters in each sensor node. If the intercluster distance is less than the user-defined threshold ε, two data clusters are merged. When the base station receives ICDs from all sensor nodes, it computes the ICDs of the k-nearest clusters. And then, the base station broadcasts the means of ICDs in order to identify the outliers in each sensor node. We set ε and k for $DC$ are 0.26 and 4, respectively. We called our proposed algorithm $PC$ .

To compute the energy consumption, we use the free space channel model [25]. Under this model, to transmit an l-bits message and a distance c, a sensor expends

\begin{array}{l} E_{T} (l, c) = E_{T -elec} (l) + E_{T -amp} (l, c) \\ = l * E_{elec} + ξ_{amp} * l * c^{2} . \end{array}

(2)

And, to receive this message, a sensor expends

\begin{matrix} E_{R} (l) = E_{R -elec} (l) = l * E_{elec} . \end{matrix}

(3)

In this experiment, we set 50 nJ/bit to the electronic circuit constant ( $E_{elec}$ ) and 100 pJ/bit/m² to the transmit amplifier constant ( $ξ_{amp}$ ). We set the size of packet as 40 bytes. The parameters used in our experiment are summarized in Table 2.

Table 2

Parameters.

Parameter	Default value	Range
Packet size (p)	40	40
Window size (w)	8	2, 5, 8, 11, 14
Threshold (θ)	2	1, 2, 3, 4, 5
Dimension (d)	1	1, 2, 3

5.2. Experimental Result

To evaluate the energy consumption of each outlier detection algorithm, we run our own simulator for 1000 epoches and plot the total energy consumption.

Figure 9 shows the energy consumption varying the window size w. When the size of a window is small (i.e., $w = 2$ ), $BF$ shows the best performance since $BF$ transmits sensor readings, but $DC$ and $PC$ transmit the information of clusters.

Figure 9

Varying the window size (w).

However, as w increases, the performance of $PC$ and $DC$ is improved since the cluster information rather than sensor readings is transmitted. Furthermore, the performance gap between $PC$ and $DC$ increases with increasing w. As w increases, the number of nonoutlier clusters (NOCs) increases in $PC$ . Thus, the size of data to be transmitted decreases. In $DC$ , the energy consumption is reduced since $DC$ is also based on data clustering as $PC$ . However, our proposed $PC$ is better than $DC$ because $PC$ used the permission range to determine the cluster width. Additionally, in $DC$ , the mean of ICDs needs to be broadcasted to obtain the outliers in each sensor node. Our proposed algorithm $PC$ is better than the $DC$ about 58% on the average.

Figure 10 shows the energy consumption varying θ. As shown in Figure 10, the performances of all techniques are stable in spite of varying θ. The energy consumption of $DC$ is constant, since the intercluster information has a fixed size. But the energy consumption of $PC$ increases slightly, since the number of sensor readings in an outlier candidate cluster increases slightly. Nonetheless, our proposed technique $PC$ shows the best performance in terms of energy efficiency. In this experiment, our proposed technique is better than the $DC$ about 51%.

Figure 10

Varying the threshold size (θ).

Figure 11 shows the energy consumption varying the dimension d. The energy consumption of all techniques is increased according to varying d, because the packet of the sensor nodes on multidimension contains more information as compared with that on 1 dimension. The energy consumption of $PC$ is less than other techniques $BF$ and $DC$ . In other words, our proposed technique is better than the $DC$ about $48 %$ .

Figure 11

Varying the dimension (d).

6. Conclusion

In this paper, we present an efficient outlier detection technique in WSNs. To obtain the appropriate width of clusters, we applied the Pigeonhole Principle. In our proposed technique, each sensor node in WSNs constructs and merges the clusters based on the permission range PR. Then, our proposed technique uses two kinds of clusters (NOC and OCC) in order to detect the outliers and reduce the energy consumption of each sensor. In our experiments with a set of real-life data, we show that our proposed technique outperforms existing techniques significantly.

Footnotes

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (2012R1A1B3003060).

References

Szewczyk

Osterweil

Polastre

Hamilton

Mainwaring

Estrin

Habitat monitoring with sensor networks

Communications of the ACM 2004 47 6 34 40

2-s2.0-4243114087

10.1145/990680.990704

Zhou

Chen

Jing

Sun

A distributed cache based reliable service execution and recovery approach in manets

Journal of Convergence 2012 3 1 5 12

Merrill

W. M.

Newberg

Sohrabi

Kaiser

Pottie

Collaborative networking requirements for unattended ground sensor systems

Proceedings of the IEEE Aerospace Conference

2003

2153 2165

Obaidat

M. S.

Zarai

Novel algorithm for secured mobility and ip traceability for wlan networks

Journal of Convergence 2012 3 2 1 8

J. K. Y.

Ubiquitous healthcare: healthcare systems and applications enabled by mobile and wireless technologies

Journal of Convergence 2012 3 2 15 20

Gupta

Chatterjee

Performance study of genus 3 hyperelliptic curve cryptosystem: a survey of qos based routing protocols for wireless sensor networks

Journal of Information Processing Systems 2012 8 1 145 158

Ijtihadie

R. M.

Hidayanto

B. C.

Affandi

Chisaki

Usagawa

Dynamic content synchronization between learning management systems over limited bandwidth network

Human-Centric Computing and Information Sciences 2012 2 17 1 16

Abadi

D. J.

Madden

Lindner

REED: robust, efficient filtering and event detection in sensor networks

Proceedings of the 31st International Conference on Very Large Data Bases (VLDB ′05)

September 2005

769 780

2-s2.0-33745621357

Madden

Franklin

M. J.

Hong

J. M. H. W.

Tag: a tiny aggregation service for ad-hoc sensor networks

Proceedings of the symposium on Operating Systems design and implementation

2002

131 146

10.

Singh

Lobiyal

D. K.

A novel energy-aware cluster head selection based on particle swarm optimization for wireless sensor networks

Human-Centric Computing and Information Sciences 2012 2 13 1 18

11.

Sumathi

Srinivas

A survey of qos based routing protocols for wireless sensor networks

Journal of Information Processing Systems 2012 8 4 589 602

12.

Trigoni

Guitton

Skordylis

Poster abstract: Routing and processing multiple aggregate queries in sensor networks

Proceedings of the 4th International Conference on Embedded Networked Sensor Systems (SenSys '06)

November 2006

391 392

2-s2.0-34547407217

10.1145/1182807.1182871

13.

Yang

Lim

H. B.

Zsu

T. M.

Tan

K. L.

In-network execution of monitoring queries in sensor networks

Proceedings of the ACM SIGMOD International Conference on Management of Data

June 2007

521 532

2-s2.0-35548987532

10.1145/1247480.1247538

14.

Liu

Q. L.

D. H.

Performance evaluation of multihop communication based on a mobile multi-robot system in a subterranean laneway

Journal of Information Processing Systems 2012 8 3 471 482

15.

Dhurandher

S. K.

Obaidat

M. S.

Gupta

A novel energy-aware cluster head selection based on particle swarm optimization for wireless sensor networks

Human-Centric Computing and Information Sciences 2012 2 3 1 14

16.

Cheng

Ding

Xing

Liu

Deng

Localized outlying and boundary data detection in sensor networks

IEEE Transactions on Knowledge and Data Engineering 2007 19 8 1145 1156

2-s2.0-34347262454

10.1109/TKDE.2007.1062

17.

Palpanas

Papadopoulos

Kalogeraki

Gunopulos

Distributed deviation detection in sensor networks

SIGMOD Record 2003 32 4 77 82

2-s2.0-14344259796

10.1145/959060.959074

18.

Rajasegarar

Leckie

Palaniswami

Bezdek

J. C.

Distributed anomaly detection in wireless sensor networks

Proceedings of the 10th IEEE Singapore International Conference on Communications Systems (ICCS ′06)

November 2006

2-s2.0-46949090794

10.1109/ICCS.2006.301508

19.

Suthaharan

Leckie

Moshtaghi

Karunasekera

Rajasegarar

Sensor data boundary estimation for anomaly detection in wireless sensor networks

Proceedings of the 7th International Conference on Mobile Adhoc and Sensor Systems (MASS ′10)

November 2010

546 551

2-s2.0-78650988787

10.1109/MASS.2010.5663896

20.

Ghaddar

Razafindralambo

Simplot-Ryl

Tawbi

Hijazi

Algorithm for temporal anomaly detection in WSNs

Proceedings of the IEEE Wireless Communications and Networking Conference (WCNC ′11)

March 2011

743 748

2-s2.0-79959297698

10.1109/WCNC.2011.5779255

21.

Madden

S. R.

Franklin

M. J.

Hellerstein

J. M.

Hong

TinyDB: an acquisitional query processing system for sensor networks

ACM Transactions on Database Systems 2005 30 1 122 173

2-s2.0-23944487783

10.1145/1061318.1061322

22.

Sohrabi

Gao

Ailawadhi

Pottie

G. J.

Protocols for self-organization of a wireless sensor network

IEEE Personal Communications 2000 7 5 16 27

2-s2.0-0034291065

10.1109/98.878532

23.

Sundararaman

Buy

Kshemkalyani

A. D.

Clock synchronization for wireless sensor networks: a survey

Ad Hoc Networks 2005 3 3 281 323

2-s2.0-13944269967

10.1016/j.adhoc.2005.01.002

24.

I.B.R. lab

Intel berekely research lab data

2004, http://db.csail.mit.edu/labdata/labdata.html

25.

Heinzelman

W. R.

Chandrakasan

Balakrishnan

Energy-efficient communication protocol for wireless microsensor networks

Proceedings of the 33rd Annual Hawaii International Conference on System Siences (HICSS ′33)

January 2000

223

2-s2.0-0033877788

An Energy-Efficient Outlier Detection Based on Data Clustering in WSNs

Abstract

1. Introduction

2. Related Work

3. Preliminary

3.1. Sensor Networks

4. Outlier Detection Based on Clustering

4.1. Clustering Technique

Definition 1.

4.2. Clustering Scheme for WSNs

Definition 2.

4.3. Multidimensional Clustering Scheme

Definition 3.

5. Experiments

5.1. Experimental Environments

5.2. Experimental Result

6. Conclusion

Footnotes

Conflict of Interests

Acknowledgments

References