Distributed Continuous k Nearest Neighbors Search over Moving Objects on Wireless Sensor Networks

Abstract

Continuous k-nearest neighbor (CkNN) search is a variation of kNN search that the system persistently reports k nearest moving objects to a user. For example, system continuously returns 3 nearest moving sensors to the user. Many query processing approaches for CkNN search have been proposed in traditional environments. However, the existing client-server approaches for CkNN search are sensitive to the number of moving objects. When the moving objects quickly move, the processing load on the server will be heavy due to the overwhelming data. In this thesis, we propose a distributed CkNN search algorithm (DCkNN) on wireless sensor networks (WSNs) based on the Voronoi diagram. There are four features about DCkNN: (1) each moving object constructs a local Voronoi cell and keeps the local information; (2) in order to keep the reliability of system, the query message will be propagated to related objects; (3) using the idea of safe time, the number of updates is reduced; (4) an equation to estimate a more accurate safe time is provided. Last, we present our findings through intensive experiments and the results validate our proposed approach, DCkNN.

1. Introduction

The mobile sensors have been widely used in recent years. For example, the smartphone, Samsung Galaxy S4, includes a barometer, thermometer, and hygrometer (to measure humidity)—the first major smartphone to do so. By using these sensors, people can easily use their mobile devices to obtain environmental information. On the other hand, the popularization of global positioning system (GPS) and the miniaturization of mobile devices, equipped with a wireless network module, make location-based services (LBSs) no longer expensive. Thus, with the combination of the mobile sensors and the GPS, more and more interesting applications and issues gradually emerged.

In general, one can regard each mobile device (sensor) as a moving object in wireless mobile sensor networks. Each user (object) can use the GPS and sensors to provide the environmental information, such as location, temperature, and humidity, to the information systems which are built by the manufacturers. These systems can sink user's information and send the user's location through the mobile networks. Mobile users can immediately access the information related to the location and geographic information. Many manufacturers even provide different telecommunication services to the users, The multiple types of information services make life more convenient. For example, if a user issues a query and wants to know the temperature at a specific location, the neighboring mobile objects of this location will return the measured temperature information to the user. Another example is the temperature monitoring system where a mobile sink may move around to collect the temperature around it. In these cases, the query may move.

As the above examples, the mobile query often selects k nearest neighbors (kNN) [1] around itself to obtain the average of the measured temperatures. Moreover, the mobile query may monitor the environmental information for a period of time. Each moving object thus needs to continuously reply to the mobile query with measured environmental information and location. Such an application actually is the continuous k nearest neighbors (CkNN) query [2, 3], which is used to derive the k nearest neighbor sensors and then get the average of the received information continuously. In this paper, we consider the mobile environment where the data objects and queries can move.

Spatial queries, such as k nearest neighbors query and continuous k nearest neighbors query, have attracted many researchers' interest and also have been discussed on many different applications. Most of the existing methods about the kNN or CkNN search in the considered mobile environment still use the client-server architecture and rely on a central server for processing queries. Each moving object needs to continuously obtain the location information of itself by GPS and send the information back to the server for updating the database. Thus, if we use the existing methods under a specific scenario that the position status of objects changes frequently, the server must receive a huge amount of information stream for updating each object's location in a short time, thus causing the server system to be overloaded. Besides, the large amount of messages for updates occupies quite a lot of bandwidth. We hence propose an effective and noncentralized approach for supporting CkNN query in a mobile environment.

In this work, we consider the continuous k nearest neighbors query in WSNs or some other distributed and mobile environments like WSNs, which consist of a number of mobile devices (sensors) and propose a new distributed approach, DCkNN, for processing CkNN query efficiently. From the object's point of view, the distributed architecture using local computation can effectively reduce the updating cost since the update messages in the client-server architecture may need more hops to arrive to the server. Even worse, the long delay of the update messages can make the answer of the CkNN inaccurate. On the other hand, the distributed architecture can avoid the long response time caused by the overload of the server. In our considered environments, each mobile device, also called moving sensor or moving object, can obtain its location information by the GPS equipment. Besides, each moving object is equipped with enough storage and a cpu for storing the local information and processing simple 2D functions, respectively.

In our proposed approach, each moving object will send messages to its one-hop neighbors when necessary to avoid the wrong information for processing spatial queries. When finding the kNN, our method uses the incremental strategy on the number of hops based on the Voronoi diagram (VD) [4]. In order to estimate the movement of a moving object accurately, we consider the speed of each moving object and adapt the concept of safe region [5] to our proposed approach, distributed continuous k nearest neighbors (DCkNN).

In this paper, we address the problem of efficiently processing continuous k nearest neighbors over moving objects with updates and make the following contributions. (i)

We identify the continuous k nearest neighbors search problem on mobile ad hoc networks and categorize some detailed existing works about this issue.

(ii)

We provide a novel approach, using local Voronoi diagram, for efficiently and precisely collecting the location of each object in a distributed way.

(iii)

With the combination of local Voronoi diagram, we propose a model for accurately estimating the safe-time (safe-region) of continuous k nearest neighbors query, which can lead to a short response time, fewer number of messages, less update cost, and high accuracy.

(iv)

We perform a simulated experimental evaluation and the results show the superiority of our solution over the adaptation of state-of-the-art solutions [5] given to the similar issues.

The rest of this paper is organized as follows. In Section 2, we review the related work. The performance metrics and considered issues are introduced in Section 3. Section 4 presents the proposed solution using localized Voronoi diagram and how to derive the accurate safe time. Section 5 gives some analysis and comparisons between the proposed DCkNN method and different existing methods. The discussion on the experimental simulation is in Section 6. Finally, we give the conclusion remarks in Section 7.

2. Related Work

The existing works for CkNN search can be categorized into the following categories according to the ways for accessing or managing data: (1) pull-based approach, (2) push-based approach, and (3) distributed and mobile approach. The pull-based approach uses the traditional client-server architecture [6–8]. A given central server is responsible for processing the spatial queries from the user devices and all the data are stored in the server. Whenever a client wants some information, the client will send a query to the server. In many applications, the amount of spatial data needed to be processed could be very large and hence the spatial data is usually saved in the external storages, such as disks. Thus, the existing approaches for CkNN search using the pull-based approach focus on optimizing the number of I/O's. Of course, the computation time (CPU time) is still an important performance metric when the size of spatial data needed to be processed is not large. As a result, the above two measurements, the number of I/O's and CPU time, are usually used to validate the effectiveness of the proposed algorithms for processing kNN and CkNN queries using pull-based approach. However, the bandwidth of communication between the clients and servers is asymmetric. In other words, the uplink bandwidth is limited and much smaller than the bandwidth of downlink. This phenomenon in wireless communication will cause the bottleneck [9] when the number of queries increases if the pull-based approach is applied for accessing data.

In order to overcome the bottleneck problem caused by the pull-based approach in wireless environment, the push-based approach is proposed and also referred to as data broadcasting approach. Data broadcasting has attracted a lot of research attention in the past decades and many approaches or protocols for different types of spatial queries have been proposed [10–13]. In the data broadcasting environment, the data are broadcast in the air by the server and the clients execute the query process by listening to the broadcast channel. The load for processing data on the server is dramatically decreased and relieves the bottleneck problem. Instead, the computation load on the clients is increased. From the user's point of view, the fewer battery power each device consumes, the better experience each user has. Thus the existing works [10, 11, 13–15], in general, discuss the performance of data broadcasting protocol in terms of latency and tuning time. Latency is the time a client experiences from issuing query to receiving the complete answer. The tuning time is the amount of time actually spent on listening to the broadcast. The latency indicates the quality of service (QoS) provided by the system and the tuning time can represent the power consumption of mobile clients.

Many works [16–18] have discussed the kNN and CkNN queries on the distributed and mobile environments like mobile ad hoc NETworks (MANETs) and wireless sensors networks (WSN). MANETs and WSNs are self-configuring infrastructure-less networks consisting of mobile devices and sensors connected by wireless communication. Each device in such networks is free to move independently in any direction and will thus change its links to other devices frequently. Each device can forward the traffic unrelated to its own use and act as a router. In contrast to the client-server and data broadcasting environments, there is no central server that handles the spatial queries or broadcasts spatial data in MANETs or WSNs. Accordingly, the information system based on the above infrastructure-less networks has to process the queries in a distributed way. Each mobile node (or device) in such a system cooperates and exchanges spatial data with each other and then derives the answer for the spatial queries.

In spatiotemporal data applications, the datasets consist of data objects and the data as well as queries might move over time. In order to process a great volume of spatial data and queries efficiently, some decomposition methods, such as grid and Voronoi diagram, have been used in the existing works [7, 19, 20]. Xiong et al. [20] focused on multiple kNN queries and proposed an incremental search technique based on hashing objects into a regular grid and keeping CPU time in mind. The main objective of this work on disk-resident data is to minimize disk I/O operations. The CPU time is considered only as a secondary objective. Zhao et al. [21] proposed a Voronoi-based approach for CKNN query. Although this work provided a navigation service to mobile devices, the query processing is still operated on a central server.

Mouratidis et al. [22] proposed a threshold-based algorithm to help the server processing continuous nearest neighbor queries from geographically distributed clients. Within a given threshold, the k nearest neighbors of the query point will not change. However, this work still needs a central server for monitoring the kNN objects, maintaining the huge spatial data, and broadcasting each object's information. Chatzimilioudis et al. [16] proposed a proximity algorithm to answer all k-nearest neighbor queries continuously. In such an environment, many stations, called query processors, are placed to monitor all the objects in the radio region. Then, different query processors can exchange the monitored spatial information to calculate the k nearest neighbor answer to the user (query point).

In contrast to most of the above approaches, Kim et al. [5] proposed a decentralized method to support continuous k nearest neighbor query. Each query object predicts a safe time as a threshold for reducing the cost of information update. This method assumes that there is a maximum speed, MaxSpeed, of each object in the environment and use, DIKNN algorithm [23] to find the initial k nearest neighbors. By monitoring the kth and $(k + 1)$ th NN, the mobile query can know the influence region and use this region to filter moving objects, thus reducing a great amount of update cost. However, this work only considered the stationary query, not a moving query. We will discuss the accuracy of this method later in details. Before introducing the proposed algorithm, we first give some background and terminologies.

3. Preliminaries

In this section, we present some concepts and techniques that are generally used when solving the spatial data queries on moving objects. The metrics generally used to evaluate the query process will also be introduced.

3.1. Spatial Query

The k nearest neighbor search (kNN search) is one of the important types of spatial queries. Suppose that the distance between two points v and u is $dist (v, u)$ and assume that there is a given dataset D with a query point q. The k nearest neighbor search is to find the k nearest data objects in dataset D. A simple example in Figure 1 illustrates the 2NN of the given query point q and data objects a and b are the two data objects closest to q. In this case, we will say 2NN $(q) = {a, b}$ . The definition of the k nearest neighbor search problem thus can be defined as follow.

Figure 1

A 2NN search with query point q, where 2NN $(q) = {a, b}$ .

Definition 1 (kNN search problem).

Given a query point q and an integer k, the k NN search problem is to find a subset $k NN (q) \subseteq D$ with $| k NN (q) | = k$ , such that any data object $p \in k NN (q)$ and $dist (p, q) \leq \min_{o} dist (q, o)$ , where $o \in (D - k NN (q))$ .

The continuous k nearest neighbors search (CkNN Search) [3] is a variation of kNN query. CkNN search is extended from kNN search and the system will continuously return the kNN answer in a time interval. In a mobile environment, the mobile devices are regarded as moving objects. Thus, the result of a CkNN search may change over time due to the movement of the queries and data objects. We can model the moving pattern of the query as a path consisting of consecutive line segments and then consider the line segments for CkNN search. Let D be a dataset of data objects and $q = [s, e]$ be a query line segment. A continuous k nearest neighbors (CkNN) query retrieves k nearest neighbors of every point in the line segment $q = [s, e]$ . The result contains a set of $〈 R, T 〉$ tuples, where R is the resulting kNN within the time interval T on $[s, e]$ . Consider Figure 2 as an example, where $D = {a, b, c, d, e, f, g, h}$ and $k = 1$ . The output of the query is the set of tuples $〈 {a}, [s, s_{1}] 〉$ , $〈 {c}, [s_{1}, s_{2}] 〉$ , $〈 {f}, [s_{2}, s_{3}] 〉$ , and $〈 {h}, [s_{3}, e] 〉$ . Tuple $〈 {a}, [s, s_{1}] 〉$ indicates that ${a}$ is the kNN(q) $(k = 1)$ for interval $[s, s_{1}]$ and so on. We then can define the continuous k nearest neighbor search problem as below.

Figure 2

An example of CkNN, $k = 1$ , where the output is ${〈 {a}, [s, s_{1}] 〉$ , $〈 {c}, [s_{1}, s_{2}] 〉$ , $〈 {f}, [s_{2}, s_{3}] 〉$ , $〈 {h}, [s_{3}, e] 〉}$ for $q = [s, e]$ .

Definition 2 (CkNN search problem).

Given a line segment $[s, e]$ and an integer k, for any data object p, if $o_{k} \in C k NN ([s, e])$ is one of the k nearest neighbor, then for any object $o^{'} \in (D -C k NN ([s, e]))$ , $dist (p, o^{'}) \geq dist (p, o_{k})$ .

3.2. Voronoi Diagram

Using Voronoi diagram (VD) for CkNN search is proposed in the pull-based approach [7]. Consider a set P of n distinct data points, called sites, in the plane. The Voronoi diagram of P, denoted by $VD (P)$ , is defined as the subdivision of the plane into n cells, one for each site in P, and each cell corresponding to a site $p \in P$ is defined to be the area of all points in the plane closer to p than to any other point in P. Figure 3 shows the Voronoi diagram of 16 data points $p_{1}, p_{2}, \dots, p_{16}$ in the plane. Actually, the boundary between two points is the perpendicular bisector of those two points. The method proposed in [7] uses the characteristics of the perpendicular bisector (the distances from any point on this line to both neighboring points are the same) to determine the nearest neighbors of the current query point. Let us use Figure 3 as an example. Then, each solid line is the perpendicular bisector between two neighboring data objects. Suppose that the solid line with arrows denotes a path for the query point from position S to position $D_{1}$ and the dotted line with arrows is the path from S to $D_{2}$ . For the path from S to $D_{1}$ , it is very easy to determine the answer set of the continuous nearest neighbors using Voronoi diagram and ${p_{9}, p_{8}, p_{11}, p_{12}, p_{10}}$ is the result. On the other hand, for the dotted path from S to $D_{2}$ , since the query does not move to another Voronoi cell, the answer set of continuous nearest neighbor is always ${p_{9}}$ .

Figure 3

A continuous nearest neighbor search example using Voronoi diagram.

Using the Voronoi diagram, one can easily find the NN for a query point. To find the kNN for an arbitrary $k > 1$ , we can extend the concept of Voronoi diagram to have the order-k Voronoi diagram [24]. Using the order-k Voronoi diagram, one can find the kNN for a query point by locating the query point in some order-k Voronoi cell. Patroumpas et al. proposed a method [25], which approximates the order-k Voronoi cells by limited data objects to support continuous k nearest neighbor query. Figure 4 shows the process of constructing an approximate order-3 Voronoi cell. Before the construction, all the k nearest neighbors are known. Initially, the system assumes that approximate cell ${VC}^{'}$ containing q is equal to the entire universe of interest where any possible location is being recorded. The system regards query point q as the center point, divides the whole region into eight quadrants, and finds the 3NN $(q) = {n_{1}, n_{2}, n_{3}}$ , as shown in Figure 4(a). For each closest point $m_{i}$ to q in each quadrant, the system delineates all its k bisectors with the kNN points and then produces the approximate order-k Voronoi cell of the kNN point with respect to q. But, for reducing processing cost even further, the system chooses bisector $b_{i}$ that is closest to q and thus bounds the cell most tightly (Figure 4(b)). Accordingly, by taking the most restrictive bisector $b_{i}$ for each quadrant in counter-clockwise order, the system can gradually crop entire universe and obtain the approximate order-k Voronoi cell ${VC}^{'}$ as shown from Figures 4(b) to 4(i). The gray region in Figure 4(i) is the resulting approximate order-3 Voronoi cell. No matter the query moves in this region, 3NN $(q) = {n_{1}, n_{2}, n_{3}}$ .

Figure 4

The construction of an approximate order-3 Voronoi cell, where q is the query point.

3.3. Safe-Time

In the wireless sensor network, the idea of safe-time is proposed by Kim et al [5]. This method first assumes that there is a maximum speed, MaxSpeed, of each object in the environment and then uses DIkNN algorithm [23] to find the initial k nearest neighbors. As Figure 5 shows, the user located at S issues a query with q as the center point of this query. The system will first inquiry object $n_{p}$ , the nearest neighbor of the query point q. Then, the process searches the initial kNN counter-clockwise on the concentric circles incremental with an distance w. The query message is forwarded in hop by hop manner as shown in Figure 5(b). Finally, the system can obtain the initial kNN from the last object which finished the query processing and derive the distance, cross-distance (CD), between the kth and $(k + 1)$ th nearest neighbors. As Figure 6(a) shows, the system then can compute a safe time $= CD / (2 * MaxSpeed)$ by the cross-distance between the kth and $(k + 1)$ th nearest neighbors. This indicates that the system ensures that the order of kth and $(k + 1)$ th nearest neighbors will not exchange in the safe-time period. Thus, the resulting kNN does not need to update. In addition, the maximum moving distance of each object is limited in the safe-time, so we can predict the maximum influenced region after the movement of kth and $(k + 1)$ th nearest neighbors as shown in Figures 6(b) and 6(c) to obtain the safe region (the dark rings in the figure). The system will ignore the objects which locate outside the safe region, thus reducing the cost of updating each object's information.

Figure 5

The kNN search on WSNs: (a) Q-node is the first object receiving the query and D-node is the sink node which is responsible for collecting data and (b) presents the multihops routing path for the kNN process.

Figure 6

Update process using safe-time for 3NN search on WSNs: (a) shows the CD between the third and fourth closest data points; (b) presents the maximum influenced distance for update using broadcast; (c) indicates the area for broadcasting to update.

3.4. Performance Metrics

In order to evaluate the proposed DCkNN approach, we consider the following performance metrics, including update cost, number of messages, accuracy, and response time.

3.4.1. Update Cost

The update cost is to measure the number of updates during the query process. We sometimes use the frequency to denote it. Since the sensors (objects) and even the query will move, the status of the sensors should be maintained in order to have the accurate results. However, the cost for one update is high in the distributed environment. Hence, one of the objectives of the designed algorithm, DCkNN, is to minimize the update cost for the CkNN query.

3.4.2. Number of Messages

Except for the number of updates, there are some other kinds of messages for the query process in the distributed and mobile environment, including the messages for query and information passing. The total number of messages used during the query process can be used to indicate the throughput over the networks and the energy consumption. So the proposed DCkNN is towards minimizing the total number of messages for the query.

3.4.3. Accuracy

By using the pull-based approach, the time interval for routing update messages to the server depends on the distance between the data object and the server. It thus can not ensure that the location information of each object saved in the server is the latest. Hence, the results may be timely incorrect when deriving the answers for a query due to the mobility. This situation becomes worse in the distributed and mobile environments. In our design, the proposed approach can avoid redundant updates and keep the accuracy of the results.

3.4.4. Response Time

The complicated query process, the delay time for message communication, and the number of users will affect the response time of the system to process the query. Long response time is unbearable and may return obsoletely invalid query results. Even worse, the long response time may cause important information missing or financial loss. Thus the system response time is also one of the important issues that must be considered when designing the method for continuous queries.

4. Distributed Continuous k Nearest Neighbors Search

In the mobile and distributed environments, like WSNs or MANETS, the pull-based approach may not be a good match for CkNN search due to the problems discussed in Section 2. We thus propose a distributed algorithm, DCkNN, for CkNN search in the mobile and distributed environments. The distributed approach using a better safe-time can reduce the update cost, relieve the load for server, shorten the response time, and make the result more accurate. There are three phases in DCkNN: initialization phase, query processing phase, and information update phase. In the initialization phase, each moving object exchanges the location information and builds a local Voronoi cell (LVC) as well as calculates the safe time for the LVC. The query processing phase collects sufficient location and environmental information of neighboring neighbors for exploring the result of the given CkNN query. The last information update phase is to maintain the latest information for the moving objects and the result of the continuous k nearest neighbor query. The following will explain each phase in details.

4.1. Phase 1: Initialization $(k = 1)$

During system initialization, each mobile sensor (moving object) exchanges location information with its neighbors, derives the local Voronoi cell (LVC), calculates the safe time of LVC, and then stores the LVC and safe time in the memory.

When a user issues an NN query Q and broadcasts the query message $q (k, h o p c)$ (case $k = 1$ ), where k is the number of the nearest neighbors the query wants to find and $h o p c$ is the hop counter and sets to one initially. When the query point Q's neighboring objects receive the query message $q (k, h o p c)$ , they will determine whether Q is located in their Voronoi Cells (VCs). If a neighboring object replies true, it means that this neighboring object is the nearest neighbor of the query point Q and this neighboring object will transmit the Voronoi cell (VC) information of itself to the query point Q, including the neighboring object boundaries, location, direction, and speed of movement, and some environmental information. On the other hand, if a neighboring object returns false, this neighboring object will check whether query point Q moves towards it or not. If the query point Q is moving towards a neighboring object, then this neighboring object also will send its VC information to the Q; otherwise, the query message will be ignored. After the initialization phase, each moving object will maintain the LVC information and safe time. The update process will be executed for the latest LVC information only when the safe time decreases to zero. Assume that each object has d neighbors in average. Since the bisector between two objects costs $O (1)$ time, the LVC construction (Initialization phase) can be finished in $O (d)$ time.

4.2. Phase 2: CkNN Query Processing

The pseudocode, Query_DCkNN_Search(), in Algorithm 1 shows the whole query process on the query point. The operations from Line 6 to the end of Query_DCkNN_Search() do the query processing of CkNN $(k > 1)$ . When a query point Q wants to find the k nearest neighbors where $k > 1$ , Q first broadcasts messages $q (k, h o p c)$ (case $k > 1$ ) and gets the local Voronoi cell (LVC) information from the nearest neighboring (NN) object. The nearest neighboring object of Q is responsible for routing messages to its neighboring objects, collecting the local Voronoi cell (LVC) information of the neighboring objects, and sending information back to query point Q. Basically, if the received information is insufficient to derive the kNN answer, the query point Q will repeatedly sends the query message q with one more hop (i.e., $h o p c = h o p c + 1$ ) to the specific neighboring objects to obtain more neighboring VC information until the received information is enough to get kNN answer.

Algorithm 1: The DCkNN algorithm on the query (sink) node.

Query_DCkNN_Search()

(1) broadcast a query $q (k, h o p c)$ ; $/ /$ $h o p c$ is initial to 1

(2) get the Voronoi cells or object's information;

(3) if ( $k = 1$ ) then

(4) show LVC;

(5) find $s a f e t i m e$ of DC1NN (with LVC);

(6) elseif ( $k > 1$ ) then

(7) while the received information is insufficient do

(8) broadcast the query $q (k, h o p c + 1)$ to ask more hops of neighbor nodes;

(9) run normal kNNSearch to find kNN set;

(10) show LVC according to kNNset and $I n f o$ ;

(11) find $s a f e t i m e$ of DCkNN (with LVC)

(12) while ( $s a f e t i m e \neq 0$ )

(13) wait;

When the received information is enough to derive the results of kNN, the query point Q will use the approximate method we modified from [25] to calculate the approximate order-k LVC in which the query point Q locates. By using the modified method, the query point Q has to partition the plane into λ equal sectors and selects an additional nearest neighboring object in each sector, excluded from kNN objects. Thus, the query point Q needs to collect at least $k + λ$ neighboring objects' information to build the approximate order-k LVC. Note that the information of the additional λ objects can be learned from the information of the received neighboring objects. In comparison with the centralized approach for constructing the approximate order-k Voronoi cell, the modified approximate method only needs to consider the k nearest neighboring objects and λ additional nearest neighboring objects instead of considering all objects in the wireless environment. The computation time is thus much better than the traditional centralized approach and can be derived as $O (λ) + O (λ) \times O (k) = O (λ (k + 1))$ , where $O (λ)$ is the time for selecting the λ additional neighboring objects. In the end, $O (λ k)$ is the time for determining the boundary of the approximate order-k LVC. For more details about the analysis of the computation time, please refer to [25]. After the query point Q calculates the approximate order-k LVC, a safe time of the query q can be derived. The safe time of the query q is regarded as the time limit for starting the next location information updating process and updating the CkNN answer.

Figure 7 gives an example for the case of $k = 1$ . As shown in Figure 7(a), there are 16 moving sensors, $o_{1}, \dots, o_{16}$ , in the system and each sensor has the location information about itself. In the initial phase, each sensor broadcasts the message with its location information and receives the location information from its neighbors. The local Voronoi cell (LVC) of each sensor then can be calculated and stored. The overall calculated Voronoi diagram is depicted in Figure 7(b). Afterwards, if the locations of the sensors change as moving, each sensor will broadcast new location information to its neighbors and each sensor can derive a new LVC by the received updating messages. In Figure 7(c), a continuous NN query is issued by some query object q with its location. In order to know the Voronoi cell where q locates, q first broadcasts the query message to the neighboring sensors (in dashed circle). After receiving the query message, the neighboring sensor will check whether the query q is located in its associated cell. If q is not in the cell, the sensor will ignore the query message. In Figure 7(c), sensor $o_{1}$ will return the result to the query q, saying that object $o_{1}$ is the nearest neighbor of q for the time being. If q moves in the direction as indicated by the solid arrow in Figure 7(d) to object $o_{6}$ , q can know that it moves towards the boundary of $o_{1}$ and $o_{6}$ by the LVC stored in $o_{6}$ and then derive the safe time. When approaching the boundary (i.e., the safe time is almost passed), a query message to $o_{6}$ will be issued by q in advance. Sensor $o_{6}$ will reply to q with the cell information to maintain the accuracy of the query result. Figure 7(e) shows the result after q moves into the associated cell of sensor $o_{6}$ .

Figure 7

An example of DCNN where (a) each sensor exchanges information in the initial step; (b) each sensor builds its LVC; (c) a query at object q is issued and sensor $o_{1}$ returns the information of its LVC to q since q is located in the Voronoi cell associated with $o_{1}$ ; (d) object q knows which sensor will be reached next and can derive the safe time by the LVC information of $o_{1}$ ; and (e) the query result to q can be updated seamlessly when it moves to $o_{6}$ .

Figure 8 demonstrates an example of DCkNN for $k > 1$ . The initial phase is the same as the previous DCNN example. Each sensor derives the corresponding LVC with the information from the neighboring sensors. The query is issued by a moving object q with its location. Object q locates itself and knows that it is in the VC of $o_{1}$ after sensor $o_{1}$ replies to q with its LVC information. Suppose $k = 3$ . After the initialization, q is located in $o_{1}$ and has the LVC information of $o_{1}$ . Having such information, q can derive the NN but 3NN cannot be decided yet. So, q will broadcast the query message to all of its one-hop neighboring sensors, say $o_{1}$ , $o_{2}$ , $o_{3}$ , $o_{8}$ , and $o_{9}$ in Figure 8(a), to request information. The neighboring sensors will reply to q the information about their LVCs. Object q then can derive the resulting 3NN for the time being, say $o_{1}$ , $o_{8}$ , and $o_{9}$ in the example. With these replied information, q can build the approximate order-3 Voronoi cell, as shown in Figure 8(b). Then, q uses the approximate order-3 Voronoi cell to derive the safe time in order to maintain the continuous query result. Now, if $k = 10$ , the information received from the one-hop neighboring sensors is not enough to derive the result. Then, the query message will be broadcast to q's two-hop neighboring sensors as Figure 8(c) shows. If the information is not enough to derive the resulting kNN, the query message will be broadcast to one more hop neighboring sensor. Such process will be repeated with an increment on the hop count. After having enough information, q can calculate the resulting 10NN and the approximate order-10 Voronoi cell as shown in Figures 8(d) and 8(e), respectively. Using the approximate order-10 Voronoi cell, the safe time can be derived. In the following, we will discuss how to derive the safe time for maintaining the result of the continuous kNN query.

Figure 8

An example of DCkNN for $k > 1$ , where the initialization phase is the same as previous DCNN example and (a) $o_{1}$ , $o_{8}$ , and $o_{9}$ are the 3NN to the query object q, (b) the approximate order-3 Voronoi cell can be built with enough information received, (c) the information is not enough for 10NN query and two-hop neighboring sensors are requested, (d) the resulting 10NN can be derived, and (e) the approximate order-10 Voronoi cell can be calculated with information received.

4.3. Phase 3: Information Updates

In the distributed and mobile environments, it is challenging for a mobile node to maintain the location information of the other mobile nodes effectively. If a mobile node sends the update messages frequently, it will consume much network resource and power. Furthermore, the result of the moving query should be effectively maintained. For different roles, nodes, and query, we propose effective ways, respectively, in our DCkNN query process to update the location information and maintain the query result.

4.3.1. The Safe Time of Each Object

In the proposed approach, DCkNN, each object cooperates with each other and exchanges information to update the location information of each object. In order to monitor and obtain the continuous answer of a continuous k nearest neighbors query, we use a safe distance between each object and query. Then we can derive the safe time of each object for location updates by estimating the maximum speed of that object. Suppose that the sensing area is partitioned into a grid for location management. There are three possible methods for deriving the safe distance. The first one is the grid-based safe distance. Each object directly uses the minimum distance between itself and the boundary of the grid cell where the object locates. Simply using the grid-based safe distance may cause some problems about the imprecise updates. The second method, VD-based safe distance, uses VD to obtain the safe distance. In order to predict the safe distance more precisely, we can consider the previous two distances simultaneously and then select the smaller one. So the last method, proper safe distance, will return a better safe distance by comparisons. Having the safe distance, the safe time then can be derived by estimating the maximum speed of the considered object.

In Figure 9, the sensing area is divided into a grid and the Voronoi diagram formed by the sensors is also shown. For sensor $o_{1}$ in Figure 9(a), the grid-based safe distance and VD-based safe distance are presented, respectively, and the proper safe distance will be the VD-based safe distance after the comparison. On the contrary, for sensor $o_{3}$ in Figure 9(b), the proper safe distance will be the grid-based safe distance.

Figure 9

An illustration for different safe distances: (a) sensor $o_{1}$ uses the VD-based safe distance as the proper safe distance and (b) sensor $o_{3}$ uses the grid-based safe distance as the proper safe distance.

4.3.2. The Safe Time of Each Query

For the moving query, it is important to know when the result will change. To derive the safe time, during which the result will not change, we refer to the characteristics of Voronoi diagram. In the traditional environment with static objects, the system only uses the minimum distance from the query to the boundary of the Voronoi cell where the query locates as the safe distance. The the safe time can be obtained by the estimated maximum speed of that query. During the safe time, no update is necessary, thus saving the update cost.

Unfortunately, in the moving object environment, the above method cannot obtain the right safe time. As shown in Figure 10, the left part shows the static case, where L is the Voronoi cell boundary between objects A and B, and the safe distance of the query can be calculated as the figure shows. However, for the moving objects, the original safe distance between the L and query cannot be directly used to calculate the safe time because the object movement will cause the distance form query to L to be nonlinearly increasing or decreasing. As the right part of Figure 10 shows, after the movement of object A and object B, L becomes $L^{'}$ and the safe distance changes. Thus, if we want to use the boundaries of Voronoi cell to estimate more accurate safe time, we need to consider the relative movements between the objects and query.

Figure 10

An example showing the impact on the safe distance caused by object movement.

In order to derive the precise safe distance, we consider the movements of objects and query simultaneously and use Figure 11 to illustrate. Initially, $\bar{Q A}$ and $\bar{Q B}$ are the distances from the query to objects A and B, respectively. The line between A and B is the boundary of the Voronoi cells associated with A and B and the query is located in the cell of A as Figure 11(a) shows. When objects and the query move as in Figure 11(b), the query may touch the boundary. This indicates that the query will enter another cell and the result of the query should be updated. The time from the beginning to the moment of touching the boundary is the precise safe time for the query. According to the properties of Voronoi diagram, when the query is on the boundary, $\bar{Q A} = \bar{Q B}$ . We hence can use the condition of $\bar{Q A} = \bar{Q B}$ to decide whether the query is on the boundary. With this property, even though the objects and query move frequently, the query only needs the speeds, directions, and locations of the neighboring objects to predict the safe time easily. Within the safe time, the query will not touch the boundary. In Figure 11(c), the query meets the boundary at $t_{1}$ and the safe time is $t_{1}$ .

Figure 11

The precise safe-time for moving objects A and B and the moving query, where (a) shows the beginning state, (b) presents the moment when the moving query meets the boundary, and (c) gives the diagram for the change on the safe distance with time t.

In the following, we present the way to calculate the safe time when $\bar{Q A} = \bar{Q B}$ . Assume that the initial location of the moving query is $(x_{q}, y_{q})$ and the speed is $(V_{x_{q}}, V_{y_{q}})$ . Suppose the initial location of object $A (B)$ is $(x_{A}, y_{A}) ((x_{B}, y_{B}))$ with the speed being $(V_{x_{A}}, V_{y_{A}}) ((V_{x_{B}}, V_{y_{B}}))$ . The equation to find when the query meets the boundary can be derived according to the property of perpendicular bisector. Since $\bar{Q A} = \bar{Q B}$ , we can have

\begin{array}{l} ({[(x_{q} + V_{x_{q}} * t) - (x_{A} + V_{x_{A}} * t)]}^{2} \\ {+ {[(y_{q} + V_{y_{q}} * t) - (y_{A} + V_{y_{A}} * t)]}^{2})}^{1 / 2} \\ = ({[(x_{q} + V_{x_{q}} * t) - (x_{B} + V_{x_{B}} * t)]}^{2} \\ {+ {[(y_{q} + V_{y_{q}} * t) - (y_{B} + V_{y_{B}} * t)]}^{2})}^{1 / 2} . \end{array}

(1)

Then,

\begin{array}{l} 0 = [V_{x_{A}}^{2} + V_{y_{A}}^{2} - V_{x_{B}}^{2} - V_{y_{B}}^{2} - 2 * V_{x_{q}} V_{x_{A}} \\ - 2 * V_{y_{q}} V_{y_{A}} + 2 * V_{x_{q}} V_{x_{B}} + 2 * V_{y_{q}} V_{y_{B}}] * t^{2} \\ + 2 [(x_{q} - x_{A}) (V_{x_{q}} - V_{x_{A}}) + (y_{q} - y_{A}) (V_{y_{q}} - V_{y_{A}}) \\ - (x_{q} - x_{B}) (V_{x_{q}} - V_{x_{B}}) - (y_{q} - y_{B}) (V_{y_{q}} - V_{y_{B}})] * t \\ + [{(x_{q} - x_{A})}^{2} + {(y_{q} - y_{A})}^{2} - {(x_{q} - x_{B})}^{2} - {(y_{q} - y_{B})}^{2}] . \end{array}

(2)

Equation (2) is a quadratic equation. Since a general quadratic equation can be written in the following form: $a x^{2} + b x + c = 0$ , recall that the discriminant of a quadratic equation is $Δ = b^{2} - 4 a c$ . If $Δ > 0$ , there are two distinct real roots; if $Δ = 0$ , there is exactly one real root; if $Δ < 0$ , there are two distinct (non-real) complex roots. In (2),

\begin{array}{l} a = V_{x_{A}}^{2} + V_{y_{A}}^{2} - V_{x_{B}}^{2} - V_{y_{B}}^{2} - 2 * V_{x_{q}} V_{x_{A}} \\ - 2 * V_{y_{q}} V_{y_{A}} + 2 * V_{x_{q}} V_{x_{B}} + 2 * V_{y_{q}} V_{y_{B}}, \\ b = 2 [(x_{q} - x_{A}) (V_{x_{q}} - V_{x_{A}}) + (y_{q} - y_{A}) (V_{y_{q}} - V_{y_{A}}) \\ - (x_{q} - x_{B}) (V_{x_{q}} - V_{x_{B}}) - (y_{q} - y_{B}) (V_{y_{q}} - V_{y_{B}})], \\ c = {(x_{q} - x_{A})}^{2} + {(y_{q} - y_{A})}^{2} - {(x_{q} - x_{B})}^{2} - {(y_{q} - y_{B})}^{2}, \end{array}

(3)

and then we can derive the safe time t according to the discriminant:

\begin{matrix} t = \frac{[- b + {(b^{2} - 4 a c)}^{1 / 2}]}{2 a} or \frac{[- b - {(b^{2} - 4 a c)}^{1 / 2}]}{2 a} . \end{matrix}

(4)

Finally, we select the minimum value of $t > 0$ as the safe time.

Additionally, assuming that d is the distance between the object and the query and t is the time, $\bar{Q A} = \bar{Q B}$ can be converted into a function of t and d, as shown in Figure 11(c). Then we can obtain the safe time $t_{1}$ which is the time when $\bar{Q A} = \bar{Q B}$ . Furthermore, the object B will be closest to the query at the time $t_{3}$ .

5. Comparisons with Previous Approaches

In this section, we compare the proposed DCkNN approach, the Safe-time with stationary query approach [5], and the naïve way used pull-based approach, in terms of safe time, space, and accuracy.

5.1. Safe Time

The original safe-time with stationary query [5] (safe-time(S)) does not support moving queries. An example of C3NN is shown in Figure 12, where $q_{1}$ is the query point, triangles are the 3NN objects, the pentagons represent the other individual objects, and $o_{4}$ is 4th NN. Figure 12(a) shows the original result for safe-time(S). After obtaining the kth and $(k + 1)$ th objects, ( $o_{2}$ and $o_{4}$ in the figure), the cross-distance (CD) of kth and $(k + 1)$ th objects can be calculated. Recall that the cross distance of two data objects A and B is the difference on the distances of A and B to the center (or, the query point). For the next update (after a safe-time period), all the objects in this region should be checked.

Figure 12

The phenomenon when using the original safe-time methods to (a) stationary query and (b) moving query.

We adapt the safe-time(S) method for moving query. The adapted approach will result in a higher update cost. Figure 12(b) presents a case that the query moves toward object $o_{3}$ (kth NN) and away from object $o_{4}$ ( $(k + 1)$ th NN). When the query $q_{1}$ moves to the point $q_{1}^{'}$ , the most outside concentric circle increases and becomes as large as the most outside dotted concentric circle. On the contrary, the most inside concentric circle decreases and becomes as small as the most inside dotted concentric circle. Thus, the cross-distance increases and more number of objects will be examined.

In addition, according to the Figure 12(b), we can know that the original Safe-time method, safe-time(S), is not appropriate for moving queries since the calculation of Safe-time(S) method does not consider the speed of query. safe-time(S) directly uses twofold of the maximum speed of the kth and $(k + 1)$ th NN objects to divide the CD to derive the safe time. The cost for updates increases when the CD becomes larger since the average checked area becomes bigger. Besides, the obtained kNN answer cannot be guaranteed to be correct. On the contrary, the cross-distance when the query moves towards object $o_{4}$ ( $(k + 1)$ th NN) and away from object $o_{3}$ (kth NN), in this case, the order of the kth and $(k + 1)$ th NN object may be changed, but the update does not perform, thus resulting in incorrect results.

We modify the Safe-time to fully support the moving query. In order to support and maintain the accuracy of moving query, the maximum speed (Maxspeed) used in the safe time calculation is required to be modified and replaced by the maximum relative speed between the query and the objects. Because of using the relative speed, the adapted safe-time method, safe-time(M), needs twofold maximum relative speed ( $2 * M a x s p e e d$ ) to calculate the safe time for guaranteeing the accurate results. The safe time derived is less than the one derived in safe-time(S) due to the relative speed. Hence, the update frequency of safe-time(M) is higher than the update frequency of safe-time(S).

5.2. Space

In DCkNN, each object and query only need to maintain the information of LVC. The information to be stored includes the location of neighboring objects and the object information of order-k's local Voronoi diagram. The total amount of stored information is not significant on each object. Thus, the use of storage space does not need to be very substantial.

5.3. Accuracy

The safe-time (S) [5] considers the stationary query. When applying it to the moving query, some adaption should be made. According to our experimental results, when the value of k increases with a moving query, the accuracy of the result will be reduced. In our simulation, we modify the original Safe-time(S) as Safe time(M) for moving query and consider the relative distance in detail caused by the moving query and objects. Although the correctness of the answer can be increased, the update frequency increases, thus increasing the number of delivered messages.

DCkNN uses Voronoi and grid cell boundary to calculate the safe time and updates the information of related objects before the end of the safe time. In fact, the boundary of the Voronoi cell represents the boundary of moving kNN query. After the safe time period, the query will collect the information of related neighboring objects. Having such real-time information, the query can have the result more accurate in time. Similar to safe-time(M), DCkNN uses the kth and $(k + 1)$ th nearest moving objects to derive the safe time. So, it is able to maintain the correctness of the answer.

6. Simulation Experiments

In the simulation, we compare the following methods: Naïve, DCkNN, safe-time (S), and safe-time (M). The Naïve method is centralized and used as a benchmark for the comparison. All the mobile sensors will send the location information to the central server to update the information. The other methods, DCkNN, Safe-time(S), and Safe-time(M), are distributed and used in the considered mobile wireless sensor network, where each object knows its position, has the ability to perform some calculation, and stores the information of localized Voronoi cell (LVC). We assume that each object can transmit a message to its one-hop neighbor so that each object can communicate with at least one object. For Safe-time(S) and the safe-time(M), a maximum speed (MaxSpeed) is set in order to estimate relative speed and distance for safe-time calculation.

In the simulation, all the compared approaches are performed under the same environmental conditions for fairness. The initial location and speed of moving objects are uniformly randomly distributed. Since we focus on the performance of the query process of the proposed DCkNN approach, we further assume that the transmission quality is ideal and no congestion and communication delays occur. The metrics measured include the update frequency for a query, the total number of messages, response time, and accuracy. The update frequency for a query is the average number of update messages sent by the data objects related to the query per minute. The total number of messages is all the amount of messages sent from all objects during the query process. The accuracy of each method is measured by comparing the results to the real answers. Since the query is continuous and the update occurs subsequently, the measured response time is mainly the time required from sending a query to obtaining the first result.

All the experiments are simulated, so the results about the response time may not show the practical cases exactly but can reflect the trends on the performance among all the compared approaches relatively. In order to have the response time in our simulation, we thus use the total number of hops for deriving the result to multiply the average time for one hop in the simulation system to present the response time. Thus, the response time is proportional to the total number of hops in our simulation program. In addition, because the main difference between Safe-time(S) and the Safe-time(M) is the way for deriving the safe time, the measured response time of these two methods is the same. The simulation program is implemented with Java. The setting of our simulation is shown in Table 1 with a default value for each parameter. In all the simulation experiments, the final results are reported with the average of 100 queries. The duration of each query is 60 seconds.

Table 1

Simulation parameters.

Parameter	Range	Default value
Edge length of sensing area (m)	200, 400, 600, 800, 1000	600
Edge length of grid (m)	50, 100, 150	50
Number of objects	20, 30, 40, 50, 60	30
Transmission range (m)	125	125
k	1, 3, 5, 10, 20	3
Moving objects (%)	20, 40, 60, 80, 100	20
Speed (m/s)	2, 4, 6, 8, 10	4

6.1. Effects of Sensing Area

Sensing area affects the number of update and query messages since if the area becomes greater with the fixed number of objects, relatively, the density of the objects becomes sparse. For example, when the sensing areas are 200*200, 400*400, 600*600, 800*800, and 1000*1000, the densities are 1.88, 0.47, 0.21, 0.12, and 0.08, respectively. Figure 13(a) shows that DCkNN has the lowest number of updates with all the different areas and the number of updates for Safe-time(S) is the highest (worst) because DCkNN considers the moving direction and speed of the queries and objects and selects the proper safe-distance to derive the safe-time. Safe-time(S) the Safe-time(M) estimate the safe time according to the maximum speed of the objects, so more number of updates can be expected. Additionally, Safe-time(M) is updated more frequently than Safe-time(S). Recall that Safe-time(M) considers the query movement. The average safe-distance becomes shorter, thus leading to more updates. Besides, when the sensing area increases from 200*200 to 600*600, the update frequency drops dramatically. The curve becomes flatter when the area is from 600*600 to 1000*1000 since the difference on the densities of the areas from 600*600 to 1000*1000 is not much. The number of updates is less as the safe-time becomes larger. The total number of transmitted messages of naïve approach is fixed at about 1800 packets. All the other distributed methods use fewer number of messages than naïve.

Figure 13

The effects of different sensing areas on (a) update frequency, (b) number of messages, (c) accuracy, and (d) response time.

To monitor the results, DCkNN only needs to check the related objects in the local Voronoi cell at the proper safe-time, thus reducing a lot of redundant update messages. Safe-time(S) and Safe-time(M) use the maximum speed of the objects to calculate the safe time, so the total number of transmitted messages of both approaches is more than that of DCkNN. Figure 13(b) shows that more than 30% of the total number of transmitted messages can be saved by DCkNN in comparison with Safe-time(M).

For the accuracy, since Safe-time(S) does not consider the query movement, the safe-time may be inaccurate. Furthermore, due to the mobility, when the density is high, the estimated results become more inaccurate. So, Safe-time(S) performs worst as shown in Figure 13(c). Safe-time(M) is revised from Safe-time(S) by considering the query movement and the safe-distance is less in average for guaranteeing better results. Hence Safe-time(M) has a better accuracy and almost the same as DCkNN. The accuracy of Safe-time(M) and DCkNN is almost about 100% correct and the gap may be caused by arithmetical errors in the experiments.

When the area becomes larger, the density becomes lower. So, the number of objects in each object's transmission range is fewer. Under such a circumstance, DCkNN uses less computation and the response time is shorter as shown in Figure 13(d). In contrast, the Safe-time(S) and the Safe-time(M) need to collect information before performing each new query and cause longer response time. Since Safe-time(M) and Safe-time(S) use the same method and procedure to handle the first query, the response time of these two methods will be almost the same.

6.2. Effects of Grid Size

In this simulation experiment set, we want to observe the update efficiency of each object in the third phase of DCkNN. Recall that we consider the grid-based and VD-base safe distances to derive a proper safe-time. In this experiment, DCkNN(G) uses grid-based safe distance to determine the safe time for the next update and DCkNN(V) uses VD-base safe distance to derive the safe time. The sensing area is fixed to 600*600 m² and there are 30 data objects. We compare DCkNN(V) with DCkNN(G) by observing the impacts of different grid sizes. All the other settings have the similar trends in our experimental results. In Figure 14(a), the update frequency of the objects in DCkNN(G) is high when the grid size is 50*50 m² and becomes less when the grid size decreases. Since the sensing area is 600*600 m² and 30 objects are given, the average distance between an object and the boundary of a Voronoi cell is about 25 to 50 meters. With the setting of grid size 50*50 m², the average distance obtained in DCkNN(G) is about 25 meters. Thus, the safe time of DCkNN(V) is longer. In general, as Figures 14(a) and 14(b) show, the update frequency of the objects and the number of messages, sent by the objects, decrease as the grid size increases.

Figure 14

The comparisons of using grid-based and VD-base safe distances to derive the safe-time: (a) update frequency, (b) number of messages, (c) accuracy, and (d) response time.

When the grid size is large, the safe time becomes long and less number of updates are issued. Hence, the results become more inaccurate. As shown in Figure 14(c), when the grid sizes are 50*50 m², 75*75 m², 100*100 m², and 150*150 m², the corresponding accuracies are 95%, 92%, 85%, and 78%, respectively. While the grid size increases to 200*200 m², the accuracy is significantly reduced to 69%, and the right answer thus can not be guaranteed. On the other hand, Figure 14(d) shows that the response times of DCkNN(G) and DCkNN(V) are not much different, because the size of grid is not the major factor to influence the complexity of obtaining information for the first query.

6.3. Number of Objects

In this subsection, we discuss the results as the number of data objects varies. The area in the presented experiment set is 600*600 (m²) with grid size of 50*50 (m²). As shown in Figure 15(a), when the number of objects increases, the number of updates in DCkNN is still relatively less than the ones in Safe-time(S) and Safe-time(M). If the number of objects increases, the number of replied messages per second to the central server in naïve will grow dramatically. With 20 to 60 objects, the number of replied messages to the server is from 1200 to 3600 (packets/min). By observing the results in Figures 15(b) and 15(c), as more objects are in the area, the number of messages increases for the other three methods and the accuracy reduces in Safe-time(S). These match the trends for different areas we discussed earlier due to the density. Similarly, the response time can be expected as shown in Figure 15(d) where the response time of DCkNN is still the shortest.

Figure 15

The effects of the number of objects on (a) update frequency, (b) number of messages, (c) accuracy, and (d) response time.

6.4. Different Values of k

It is obvious that the value of k affects the performance of query processing significantly. When the value of k becomes larger, the number of related objects required to be checked also increases. In other words, we need to spend more cost to find out which k objects are currently in the result for the CkNN query. Therefore, the changes in the value of k mainly affect the amount of data required for the first query and the update cost for maintaining the kth and the $(k + 1)$ th objects. So, in Figure 16(a), when the value of k is large, more updates are necessary for all the three methods. As Figure 16(b) shows, the influence on the number of transmitted messages in Safe-time(M) is more serious than the one in Safe-time(S). Due to the mobility, the number of influenced objects in Safe-time(M) will increase as the value of k increases. In particular, the update frequency in Safe-time(M) becomes nearly twofold the update frequency in Safe-time(S).

Figure 16

The impacts of different values of k on (a) update frequency, (b) number of messages, (c) accuracy, and (d) response time.

Figure 16(c) presents the impact on the accuracy for different values of k. With a larger value of k, an object in Safe-time(S) will have a higher chance to derive the incorrect safe time, thus causing lower accuracy. When the value of k increases, the response time of all three methods will increase significantly, as shown in Figure 16(d). The reason is that all DCkNN, Safe-time(S), and Safe-time(M) need to search more objects and collect more information for answering the query when the value of k increases to a certain number. For DCkNN, DCkNN only needs to store local Voronoi diagram, where the information required is less than required in Safe-time(S) and Safe-time(M), so the response time of DCkNN is the shortest.

6.5. The Effect of Moving Objects

In this experiment set, we consider different percentages of moving objects in all the data objects. The default value is 20%, which means that if there is a total of 30 objects, then there are six moving objects. In Figures 17(a) and 17(b), as the percentage of moving objects increases, the overall number of updates required between the objects increases relatively and the change rate of answer to the DCkNN accelerates, thus increasing the update frequency of the query and the number of messages. However, DCkNN still keeps the best performance. Figure 17(c) shows the accuracy of the result for different percentage of moving objects. DCkNN and Safe-time(M) both have higher accuracy and Safe-time(S) is still the worst. The safe time of each object in Safe-time(S) is calculated using the maximum speed of the objects. Hence whether the object is moving or not does not directly result in the incorrectness of the answer collection. In fact, in Safe-time(S), one of the main reasons to affect the accuracy is the directions of moving objects. Last, as Figure 17(d) shows, the percentage of moving objects does not impact the response time too much.

Figure 17

The effects of the moving objects on (a) update frequency, (b) number of messages, (c) accuracy, and (d) response time.

6.6. Effects of Speed

If each object moves faster, the relative velocities between objects may also become faster. This will make the safe time shorter and the update frequency of each object and query increases. The safe time, used in Safe-time(S) and Safe-time(M), is directly affected by the maximum speed of objects and does not consider the direction of each object's movement and the speed of individual movement. No matter the objects move or not and move fast or slow, both of the above methods directly use the maximum speed of objects to calculate the safe time for updating. On the other hand, DCkNN considers the moving direction of objects and the proper safe-time for updates. In this experiment, the number of moving objects is 20% of all the objects. As Figure 18(a) shows, as the moving objects speed up, the update cost increases and DCkNN still performs best. As the speed increases, the safe-time in Safe-time(S) and the Safe-time(M) becomes much shorter due to a larger maximum speed among all the objects. The trend becomes even worse as the speed increases.

Figure 18

The impacts of speed on (a) update frequency, (b) number of messages, (c) accuracy, and (d) response time.

Figure 18(b) shows the trend about the number of transmitted messages when the speed is up. DCkNN still has the best performance on the number of messages. There are 427.81 messages sent in average, when the speed of object is 10 m/s. However, the Safe-time(M) needs 679.13 messages. The accuracy will not be effected as the speed changes for DCkNN and Safe-time(M) because both of them are designed or adapted for the moving objects and queries. Safe-time(S) will have more inaccurate results as the speed is high as shown in Figure 18(c). However, the response time does not have a significant impact on the response time for all the three methods. Figure 18(d) presents this trend.

7. Conclusions

In this paper, we propose a new approach, DCkNN, which uses the local Voronoi diagram and proper safe-time to effectively process the CkNN query in distributed and mobile environments, such as wireless sensor networks. The proper safe-time can be quickly derived by a simple formula. By comparing our proposed DCkNN with other existing approaches, Safe-time(S), DCkNN, can effectively improve search efficiency, reduce response time and the number of messages, and keep the accuracy of the result high. Additionally, DCkNN reduces more 30% of transmission messages than Safe-time(M) because the characteristics of the Voronoi diagram are used and each object's moving direction is considered instead of directly using the maximum speed of the objects as Safe-time method. The derived safe-time is more precise. As a result, DCkNN needs less update frequency, reduces a great number of messages transmitted, and yields more accurate results. In this work, simulation experiments are also performed. The experimental results validate the proposed DCkNN approach for the continuous k nearest neighbor query in distributed and mobile environments where all the data objects and query can move.

Footnotes

Acknowledgment

This work is partially supported by NSC under the Grant 98-2220-E-027-007, 100-2221-E-027-087-MY2, and 102-2221-E-027-088-.

References

Bohl

Manouchehri

Winand

Mobile information systems for the private everyday life

Mobile Information Systems 2007 3 3-4 135 152

Tao

Papadias

Time-parameterized queries in spatio-temporal databases

Proceedings of the ACM SIGMOD International Conference on Managment of Data

June 2002

334 345

2-s2.0-0036373494

Tao

Papadias

Shen

Continuous nearest neighbor search

Proceedings of the 28th International Conference on Very Large Data Bases (VLDB '02)

2002

287 298

Aurenhammer

Voronoi diagrams—a survey of a fundamental geometric data structure

ACM Computing Surveys 1991 23 3 345 405

Kim

Cai

Tavanapong

Safe-time: distributed real-time monitoring of cKNN in mobile peer-to-peer networks

Proceedings of the 9th IEEE International Conference on Mobile Data Management (MDM '08)

April 2008

124 131

2-s2.0-51349143263

10.1109/MDM.2008.18

Kolahdouzan

Shahabi

Voronoi-based k nearest neighbor search for spatial network databases

Proceedings of the 30th International Conference on Very Large Data Bases

2004

840 851

Kolahdouzan

M. R.

Shahabi

Alternative solutions for continuous K nearest neighbor queries in spatial network databases

GeoInformatica 2005 9 4 321 341

2-s2.0-28044458115

10.1007/s10707-005-4575-8

Papadias

Tao

Mouratidis

Hui

C. K.

Aggregate nearest neighbor queries in spatial databases

ACM Transactions on Database Systems 2005 30 2 529 576

2-s2.0-23944510654

10.1145/1071610.1071616

Imielinski

Viswanathan

Badrinath

B. R.

Data on air: organization and access

IEEE Transactions on Knowledge and Data Engineering 1997 9 3 353 372

2-s2.0-0031147288

10.1109/69.599926

10.

Hambrusch

Liu

C.-M.

Prabhakar

Broadcasting and querying multi-dimensional index trees in a multi-channel environment

Information Systems 2006 31 8 870 886

2-s2.0-33748700811

10.1016/j.is.2005.05.002

11.

Liu

C.-M.

Lin

K.-F.

Disseminating dependent data in wireless broadcast environments

Distributed and Parallel Databases 2007 22 1 1 25

2-s2.0-34547652997

10.1007/s10619-006-7006-4

12.

T.-C.

Liu

C.-M.

On-demand data broadcasting for data items with time constraints on multiple broadcast channels

Database Systems for Advanced Applications 2010 6193 458 469 Lecture Notes in Computer Science

2-s2.0-77956148546

10.1007/978-3-642-14589-6_46

13.

Zheng

Lee

W.-C.

Lee

D. L.

Grid-partition index: a hybrid method for nearest-neighbor queries in wireless location-based services

The VLDB Journal 2006 15 1 21 39

2-s2.0-33745306642

10.1007/s00778-004-0146-0

14.

Gedik

Singh

Liu

Energy efficient exact kNN search in wireless broadcast environments

Proceedings of the 12th ACM International Symposium on Advances in Geographic Information Systems (GIS '04)

November 2004

137 146

2-s2.0-19544373140

15.

Mouratidis

Bakiras

Papadias

Continuous monitoring of spatial queries in wireless broadcast environments

IEEE Transactions on Mobile Computing 2009 8 10 1297 1311

2-s2.0-70349280342

10.1109/TMC.2009.14

16.

Chatzimilioudis

Zeinalipour-Yazti

Lee

W.-C.

Dikaiakos

M. D.

Continuous all k-nearest-neighbor querying in smartphone networks

Proceedings of the 13th IEEE International Conference on Mobile Data Management (MDM '12)

2012

79 88

17.

T.-Y.

Peng

W.-C.

Lee

W.-C.

Parallelizing itinerary-based KNN query processing in wireless sensor networks

IEEE Transactions on Knowledge and Data Engineering 2010 22 5 711 729

2-s2.0-77949918407

10.1109/TKDE.2009.146

18.

Nghiem

T. P.

Waluyo

A. B.

Taniar

A pure peer-to-peer approach for kNN query processing in mobile ad hoc networks

Personal and Ubiquitous Computing 2013 17 5 973 985

2-s2.0-84860549169

10.1007/s00779-012-0545-y

19.

Chon

H. D.

Agrawal

El Abbadi

Range and kNN query processing for moving objects in Grid model

Mobile Networks and Applications 2003 8 4 401 412

2-s2.0-0042196177

10.1023/A:1024535730539

20.

Xiong

Mokbel

M. F.

Aref

W. G.

SEA-CNN: scalable processing of continuous K-nearest neighbor queries in spatio-temporal databases

Proceedings of the 21st International Conference on Data Engineering (ICDE '05)

April 2005

643 654

2-s2.0-28044449142

21.

Zhao

Xuan

Rahayu

Taniar

Safar

Gavrilova

M. L.

Srinivasan

Voronoi-based continuous κ nearest neighbor search in mobile navigation

IEEE Transactions on Industrial Electronics 2011 58 6 2247 2257

2-s2.0-79956263952

10.1109/TIE.2009.2026372

22.

Mouratidis

Papadias

Bakiras

Tao

A threshold-based algorithm for continuous monitoring of k nearest neighbors

IEEE Transactions on Knowledge and Data Engineering 2005 17 11 1451 1464

2-s2.0-28244469516

10.1109/TKDE.2005.172

23.

S.-H.

Chuang

K.-T.

Chen

C.-M.

Chen

M.-S.

DIKNN: an itinerary-based KNN query processing algorithm for mobile sensor networks

Proceedings of the 23rd International Conference on Data Engineering (ICDE '07)

April 2007

456 465

2-s2.0-34548808020

10.1109/ICDE.2007.367891

24.

Lee

D. T.

On k-nearest neighbor voronoi diagrams in the plane

IEEE Transactions on Computers 1982 31 6 478 487

2-s2.0-0020141081

25.

Patroumpas

Minogiannis

Sellis

Approximate order-k Voronoi cells over positional streams

Proceedings of the 15th ACM International Symposium on Advances in Geographic Information Systems (GIS '07)

November 2007

276 283

2-s2.0-79959636238

10.1145/1341012.1341059

Distributed Continuous k Nearest Neighbors Search over Moving Objects on Wireless Sensor Networks

Abstract

1. Introduction

2. Related Work

3. Preliminaries

3.1. Spatial Query

Definition 1 (kNN search problem).

Definition 2 (CkNN search problem).

3.2. Voronoi Diagram

3.3. Safe-Time

3.4. Performance Metrics

3.4.1. Update Cost

3.4.2. Number of Messages

3.4.3. Accuracy

3.4.4. Response Time

4. Distributed Continuous k Nearest Neighbors Search

4.1. Phase 1: Initialization ( k = 1 )

4.2. Phase 2: CkNN Query Processing

Algorithm 1: The DCkNN algorithm on the query (sink) node.

4.3. Phase 3: Information Updates

4.3.1. The Safe Time of Each Object

4.3.2. The Safe Time of Each Query

5. Comparisons with Previous Approaches

5.1. Safe Time

5.2. Space

5.3. Accuracy

6. Simulation Experiments

6.1. Effects of Sensing Area

6.2. Effects of Grid Size

6.3. Number of Objects

6.4. Different Values of k

6.5. The Effect of Moving Objects

6.6. Effects of Speed

7. Conclusions

Footnotes

Acknowledgment

References

4.1. Phase 1: Initialization $(k = 1)$