Abstract
To address the problem that traditional trilateration and multilateration localization methods depend heavily on the proportion of beacon nodes and on the measurement accuracy, this paper proposes an algorithm based on kernel sparsity preserving projection (KSPP). A Gaussian kernel function is used to evaluate the similarity between nodes, and the location of each unknown node is decided jointly by all nodes within its communication radius, through the self-adaptive neighbor selection of sparsity preserving projection and the preservation of the topological structure between adjacent nodes. The algorithm can therefore effectively handle the nonlinearity in the ranging data, and it is less affected by measurement error and by the number of beacon nodes.
1. Introduction
For most wireless sensor network (WSN) applications, it is essential to know the location of sensor nodes. Related research shows that more than 80% of the context information provided to users about a monitored deployment area is related to location [1].
Node self-localization algorithms can be divided into two major categories: range-based and range-free [2, 3]. Range-based approaches achieve relatively high accuracy, but the final location estimate depends on the measurement accuracy. The most common ranging technologies include Received Signal Strength Indicator (RSSI) [4], Time of Arrival (ToA) [5], Time Difference of Arrival (TDoA) [6], and Angle of Arrival (AoA) [7]. RSSI measurements can be obtained during ordinary data communication between nodes, occupy no additional bandwidth or energy, and require only simple and cheap hardware, which has made RSSI-based localization a popular research direction. In a complicated monitoring environment, however, the RSSI signal is affected by many factors [8–10]: internode communication generally uses a free public channel, which is inevitably interfered with by other devices in the monitoring area; the RSSI signal itself suffers from multipath effects; sensor node hardware is cheap and simple, with limited computational capability, and unstable production techniques lead to uneven product quality; and static or moving obstacles in the monitoring area may block the signal. All of these introduce a large number of uncertain factors into the collected signals and make them nonlinear.
Figure 1 shows the cause of nonlinearity in a complex environment. When the part between node

Distance measurement in complex environment.
The rest of the paper is structured as follows. Section 2 addresses related work on range-based localization schemes. Section 3 gives a brief review of related concepts, and Section 4 presents our localization method. Section 5 shows simulation results, and Section 6 concludes the paper.
2. Related Work
The literature [11] generally divides signal-strength-based localization algorithms into two types. The first converts the measured signal strength between nodes into distance through an empirical formula and then applies trilateration or multilateration to obtain the location of the unknown node. The second uses the signal strength information directly: a machine learning method derives the similarity between nodes, and the relations among nodes are then mined from this similarity together with the locations of the beacon nodes. The former depends on an empirical model; classic examples include the RADAR system [12] and the SpotON system [13]. They apply only to scenarios with a relatively unchanging environment, and obtaining high localization accuracy requires training and calibration for each environment, which can cost considerable manpower and material resources. If the fitted model is inaccurate or the deployment scenario changes, the original model no longer reflects the relation between Euclidean distances and RSSI values, the distance measurement accuracy drops, and the localization results deteriorate. The latter regards the nodes in the network as independently distributed devices, uses the RSSI measurements between adjacent nodes to train a prediction model for the deployment area with a machine learning algorithm, and then estimates the locations of unknown nodes in the area; examples include the LANDMARC localization system based on k nearest neighbors (k-nn) [14], the kernel principal component analysis (KPCA) localization algorithm [15], the kernel canonical correlation analysis (KCCA) location estimation method [16, 17], and the localization algorithm based on kernel locality preserving projection (KLPP) [18].
Range-based localization built on machine learning is not sensitive to RSSI measurement error, places low demands on the measurement technology, and adapts well to its environment; it has therefore attracted considerable attention.
The k-nn method [18] relies on the weighted "nearest neighbor distance criterion" to obtain the locations of unknown nodes. It is a linear algorithm, and the value of k is generally set by hand, which introduces considerable arbitrariness. Moreover, when the nodes are deployed in a complicated environment, the RSSI measurements are highly nonlinear, which leads to poor localization results. For nonlinear data, researchers have found that building the mapping model with the kernel method is a practical and effective solution. The kernel method [19, 20] maps the original data into an appropriate high-dimensional feature space through a suitable kernel function, turning a nonlinear problem that is difficult to solve in the original space into a linear problem in the feature space. Drawing on these characteristics, this paper uses the kernel method to handle the nonlinearity in the RSSI measurements.
A large body of research shows that once the kernel function and training set are given, the kernel matrix (or Gram matrix) can be built, and the true internal structure of the data can be revealed by the similarities it encodes. The KPCA and KCCA localization algorithms were proposed successively on the basis of this theory.
KPCA extends principal component analysis (PCA) [21] with kernel techniques: the nonlinear data are mapped into a high-dimensional feature space, and PCA is then applied there. Linear PCA in the feature space yields the projection that best represents the original data, achieving dimensionality reduction and denoising. KCCA is a dimensionality reduction method similar to KPCA; unlike KPCA, the KCCA-based localization method builds a mapping between the signal space and the physical space in the feature space, which allows the nodes to infer the relative topology of their physical space from RSSI values and then compute their locations. However, both KPCA and KCCA adopt a global nonlinear mapping, which is simple and efficient and can give good results in some situations, but ignores the distribution of the data and its local structure. In addition, the KCCA algorithm applies only to fingerprint-database-based indoor localization: it requires manually collecting training data and building a map of the relation between RSSI and Euclidean distance beforehand. During localization, the k nearest access points (APs) are found from the previously trained map and their centroid is used as the estimated location; if the AP density is not high enough, further iteration is required. The KCCA estimation algorithm is therefore unsuitable for randomly deployed networks or for scenarios that humans cannot reach.
On this basis, Wang et al. [18] proposed a KLPP-based localization algorithm. The KLPP localization method uses a kernel function to measure the similarity between nodes; after the RSSI values are mapped into the feature space, the LPP algorithm [22] constructs an adjacency graph between nodes, which formulates localization as a graph embedding problem and allows the topological structure of the network to be taken into account. The KLPP-based method uses the kernel method to handle the nonlinearity of the RSSI values; through the adjacency graph, the location of each unknown node is decided jointly by its adjacent nodes, so the measurement error introduced by remote beacons in traditional multilateration or trilateration is avoided, and the influence of the number of beacon nodes is small.
However, the KLPP algorithm depends heavily on the adjacency graph, whose construction mainly uses the k-nn and ε-ball approaches [23], the two most popular graph construction methods in the literature. The k-nn graph links each sample to its k nearest neighbors, while the ε-ball graph links each sample to those samples falling within its ε-ball neighborhood. In addition, the edge weights in the graph are generally obtained through methods such as binary weighting, the Gaussian kernel, or
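As a concrete illustration, the two graph construction rules just described can be sketched in a few lines (a minimal sketch with binary weights and Euclidean distances; the sample data and parameters are illustrative):

```python
import numpy as np

def knn_graph(X, k):
    """Adjacency of the k-nn graph: link each sample to its k nearest neighbors."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    n = len(X)
    W = np.zeros((n, n))
    for i in range(n):
        # skip self (distance 0), take the k closest other samples
        idx = np.argsort(D[i])[1:k + 1]
        W[i, idx] = 1.0
    return np.maximum(W, W.T)  # symmetrize

def eps_ball_graph(X, eps):
    """Adjacency of the eps-ball graph: link samples closer than eps."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    W = (D < eps).astype(float)
    np.fill_diagonal(W, 0.0)
    return W
```

Both constructions require a hand-set parameter (k or ε), which is precisely the sensitivity that the sparse-representation-based graph in this paper is designed to avoid.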
Figure 2(a) shows traditional multilateration or trilateration, in which measurements are made only between an unknown-location sensor and known-location sensors. When the beacon nodes are relatively far away, the measurement error inevitably grows; moreover, the algorithm ignores the influence of the other nodes surrounding the node of interest, so the location estimation accuracy is low. Figure 2(b) shows the LE-KLPP method; if its k-nn graph chooses only three pairs of sensors, the algorithm clearly ignores the influence of some other nodes in the neighborhood, so the estimation result is not ideal. Figure 2(c) shows the LE-KSPP method, in which each node automatically obtains informative neighbors through sparse representation (SP); the neighbors' information aids the location estimation and enhances the accuracy and robustness of the localization system.

Traditional multilateration localization methods, LE-KLPP localization method, and LE-KSPP localization method.
3. Related Concepts
3.1. Kernel Method
The kernel method, based on statistical learning theory and kernel techniques, is one of the current hot topics in artificial intelligence and machine learning. As early as 1964, during research on potential functions, Aizerman et al. [30] introduced the idea of the kernel function as the inner product of a feature space into the learning field. It was not until 1992, however, that this idea attracted the attention of Boser et al. [31], who combined it with the large-margin hyperplane to create the support vector machine (SVM); since then, the kernel method has become one of the mainstream directions in the machine learning literature. The kernel method embeds the data into a feature space through a feature map ϕ, so that the nonlinear data exhibit a linear structure in that space, as shown in Figure 3.

Principle of kernel method.
As Figure 3 shows, the basic idea of the kernel method is to map the vectors in the input space into a higher-dimensional feature space through a nonlinear function and then design linear learning algorithms in that space. A kernel function packages the nonlinear relation between the input and output spaces. Generally, any kernel method consists of two parts: a module and a learning algorithm. The module carries out the mapping into the embedding (feature) space, while the learning algorithm finds the linear patterns in that space. This modularity makes kernel methods reusable: the learning algorithm can be combined with any kernel function and thus applied to any data domain, while the kernel component is specific to the data but can be paired with different algorithms to solve the full range of tasks under consideration. Figure 4 shows the stages involved in applying the kernel method.

The stages involved in the application of kernel methods.
The kernel function is a potentially nonlinear, parametric function of the input variables. It relies on the input and output variables to control the parameters; for the localization algorithm, the input variable is the matrix of RSSI values and the output variable is the relative coordinates of the nodes. The key of machine learning is therefore to estimate the parameters from known input and output data. Suppose such a mapping exists, and assume a training sample set
Definition 1.
The map ϕ aims to transform the nonlinear relation into a linear one. Such a change of input space generally improves learnability, but besides enriching the expressive ability of the function class, a higher-dimensional feature space also increases the amount of computation and correspondingly reduces the generalization ability of the learning algorithm. We therefore need an implicit way to perform the data transformation; in the kernel method, this direct computation is provided by the kernel function.
Definition 2.
A kernel is a function κ such that, for all x, z in the input space X, κ(x, z) = ⟨ϕ(x), ϕ(z)⟩, where ϕ is a mapping from X to an inner product feature space H.
The kernel function computes the inner product as a direct function of the input space. Because it evaluates this inner product efficiently, it makes feature spaces of exponential or even infinite dimension feasible, without explicitly computing the mapping ϕ. In other words, the kernel method expresses the inner product of two sample vectors in the feature space through a predefined kernel function; it never needs to carry out the nonlinear mapping of the samples directly, so the specific form of the nonlinear mapping need not be known.
In practice, three kernel functions are common: the polynomial kernel, the sigmoid kernel, and the Gaussian kernel (also called the radial basis function kernel) [19]. Of these, the Gaussian kernel preserves the distance similarity of the inputs, so this paper chooses it to compute the similarity between nodes. It is defined as κ(x_i, x_j) = exp(−‖x_i − x_j‖² / (2σ²)), where σ > 0 is the kernel width parameter.
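As an illustration, the Gaussian kernel matrix over a set of feature vectors (e.g., RSSI vectors, one per node) can be computed as follows (a minimal sketch; the array shape and σ are illustrative choices):

```python
import numpy as np

def gaussian_kernel_matrix(X, sigma):
    """Pairwise Gaussian (RBF) kernel: K[i, j] = exp(-||x_i - x_j||^2 / (2*sigma^2))."""
    # squared Euclidean distances between all pairs of rows of X
    sq = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    return np.exp(-sq / (2.0 * sigma ** 2))
```

The resulting matrix is symmetric with ones on the diagonal, and entries decay toward zero as the distance between samples grows, which is exactly the similarity behavior the localization algorithm relies on.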
3.2. Kernel Sparsity Preserving Projections
KSPP first uses the kernel method to map the data into a higher-dimensional feature space, where they become linearly separable; it then uses the SPP method to construct the adjacency matrix of the sample data in that feature space; finally, the adjacency matrix is used for feature extraction through graph embedding.
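The SPP step reconstructs each sample sparsely from all the others by solving an ℓ1-regularized least-squares problem. The following sketch solves it (in the input space, for simplicity) with plain iterative soft thresholding (ISTA); the regularization weight and iteration count are illustrative, not the paper's settings:

```python
import numpy as np

def sparse_code_ista(A, b, lam=0.1, n_iter=500):
    """Solve min_s 0.5*||b - A s||^2 + lam*||s||_1 by iterative soft thresholding."""
    L = np.linalg.norm(A, 2) ** 2          # Lipschitz constant of the gradient
    s = np.zeros(A.shape[1])
    for _ in range(n_iter):
        g = A.T @ (A @ s - b)              # gradient of the smooth part
        z = s - g / L
        s = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)  # soft threshold
    return s

def spp_coefficients(X, lam=0.1):
    """Row i: sparse reconstruction of x_i from all the other samples."""
    n = X.shape[0]
    S = np.zeros((n, n))
    for i in range(n):
        A = np.delete(X, i, axis=0).T      # dictionary: the other samples as columns
        s = sparse_code_ista(A, X[i], lam)
        S[i, np.arange(n) != i] = s        # diagonal stays zero (no self-reconstruction)
    return S
```

Note how the nonzero pattern of each row selects that sample's "neighbors" automatically, with no hand-set k or ε; this adaptivity is the property KSPP exploits.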
Assume there is a set of training sample
At this moment, the formula can be solved to obtain the estimated vector of SP coefficient,
Through deduction similar to SPP, we obtain the optimization criteria for KSPP; that is,
Then, the criterion is transformed to a solution of the generalized characteristic equation:
Left-multiply
The generalized characteristic equation can be simplified to
Solve the eigenvectors of the first d corresponding maximum eigenvalues
K is the inner product of data in the feature space calculated by kernel function; that is,
After KSPP projection, the ith new eigenvector is
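Numerically, the projection step described above can be sketched as follows. We assume the usual SPP formulation for the modified adjacency, S_β = S + Sᵀ − SᵀS, and the generalized eigenproblem K S_β K α = λ K K α (the paper's own equations are omitted in this version of the text); the small ridge term is an implementation convenience, not part of the method:

```python
import numpy as np

def kspp_projection(K, S, d, reg=1e-6):
    """Top-d generalized eigenvectors of K @ Sb @ K a = lam * K @ K a,
    where Sb = S + S.T - S.T @ S (SPP-style modified adjacency)."""
    Sb = S + S.T - S.T @ S
    A = K @ Sb @ K
    B = K @ K + reg * np.eye(len(K))       # ridge keeps B invertible
    # reduce to a standard eigenproblem via B^{-1} A (a sketch, not the stablest route)
    vals, vecs = np.linalg.eig(np.linalg.solve(B, A))
    order = np.argsort(-vals.real)[:d]
    alpha = vecs[:, order].real            # coefficients of the projection vectors
    return K @ alpha                       # embedded coordinates, one row per sample
```

The output rows are the low-dimensional embeddings of the samples, which Section 4 then aligns with physical coordinates.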
4. Localization Algorithm with KSPP
4.1. The Connection between Kernel Function and Localization
The localization algorithm based on kernel method uses a kernel function to map the RSSI vector between nodes into corresponding feature space. In this space, after a linear algorithm is used to calculate the relation between nodes, it is projected into the coordinate space; that is, through the kernel function, the RSSI vector
Figure 5 shows that two nodes can communicate directly when their communication radii intersect. The weaker the RSSI vector formed when two nodes receive the RSSI signals of the other nodes in the network, the farther apart the two nodes actually are, that is, the less similar they are; conversely, the closer two nodes are, the stronger the RSSI vector between them formed by the other nodes in the area.

Correlation between the signal and physical location spaces.
In addition, Pan [16] proved that when the nodes are deployed in an ideal environment, the RSSI matrix is positive semidefinite, so the RSSI matrix can itself be regarded as a kernel matrix. We can therefore use the kernel method to measure the similarity between samples: the kernel function implicitly embeds the RSSI data into the feature space, and a linear algorithm in that space resolves the nonlinear relation of the RSSI in Euclidean space.
Assume in the monitoring area, n nodes generate n samples
Furthermore, the closer the nodes in the area are, the more similar the signal strength that they have received from other nodes is; therefore, we can believe the signal strength space and feature space H are related, and
The LE-KSPP algorithm yields the relative locations of the nodes, but most applications require absolute coordinates, so the relative coordinates must be transformed. If the system provides enough beacon nodes (at least three in 2-dimensional space, at least four in 3-dimensional space), the relative coordinates of the nodes can be transformed into absolute coordinates. Assume the estimated absolute coordinate
Through deduction, we obtain the estimated coordinate of the unknown node:
At this moment, the absolute coordinate of unknown nodes can be obtained through the following formula:
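Since the paper's formulas (22) and (26) are not reproduced in this version of the text, the transformation can be sketched as a plain least-squares affine fit on the beacon nodes (an assumed variant, not the paper's exact formula): fit the map from the beacons' relative coordinates to their known absolute coordinates, then apply it to the unknowns.

```python
import numpy as np

def fit_affine(rel, abs_):
    """Least-squares affine map T: rel -> abs_, via homogeneous coordinates."""
    P = np.hstack([rel, np.ones((len(rel), 1))])   # rows [x, y, 1]
    T, *_ = np.linalg.lstsq(P, abs_, rcond=None)   # (3, 2) transform matrix
    return T

def to_absolute(rel, T):
    """Apply the fitted transform to relative coordinates."""
    P = np.hstack([rel, np.ones((len(rel), 1))])
    return P @ T
```

With three non-collinear beacons in 2D the fit is exact, which matches the minimum beacon count stated above; with more beacons the least-squares fit averages out beacon noise.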
Consider there are n sensor nodes
The LE-KSPP localization algorithm is summarized in Algorithm 1. Steps (1) to (3) use the RSSI values between nodes for training and learning; in Step (4), each node uses its adjacency relations with other nodes to estimate its relative coordinates through the learned model; in Step (5), the relative coordinates obtained in the area are transformed into absolute coordinates by referring to the absolute locations of the beacon nodes.
Input: beacon node coordinates; RSSI vectors between nodes.
Output: estimated coordinates of unknown nodes.
(1) Using the collected RSSI vectors, compute the similarity between nodes through the Gaussian kernel function to form the kernel matrix K;
(2) Solve the constrained optimization problem (see (6)) to obtain the kernel sparse representation coefficients, and combine them into the kernel sparse reconstructed adjacency matrix;
(3) Solve the generalized characteristic equation for the optimal projection vectors and their corresponding eigenvectors;
(4) Obtain the relative coordinate matrix through (20);
(5) Using the beacon nodes in the monitoring area, transform the relative coordinates into absolute coordinates: if the beacon nodes are collinear or nearly collinear, use (26); otherwise, obtain the absolute coordinates of the unknown nodes through (22).
5. Simulation and Experiment
Consider a WSN which is comprised of n nodes
The literature [20] points out that the received signal strength s between nodes bears a definite proportional relation to their distance d. In an ideal environment, when node i is within the communication radius of node j, the signal strength and the distance satisfy the following relation:
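This proportional relation is commonly made concrete by a log-distance path-loss model of the kind used in such simulations; in the sketch below, the reference power p0, the path-loss exponent eta, and the reference distance d0 are hypothetical values, not the parameters fitted in [32]:

```python
import numpy as np

def rssi_from_distance(d, p0=-40.0, eta=3.0, d0=1.0):
    """Log-distance path-loss model: RSSI(d) = p0 - 10*eta*log10(d/d0), in dBm."""
    return p0 - 10.0 * eta * np.log10(d / d0)

def distance_from_rssi(rssi, p0=-40.0, eta=3.0, d0=1.0):
    """Inverse of the model above: recover distance from a (noise-free) RSSI value."""
    return d0 * 10.0 ** ((p0 - rssi) / (10.0 * eta))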
This section analyzes and assesses the performance of the LE-KSPP localization method through simulation and experiment. The nodes were deployed in a two-dimensional space, and the internode distances were obtained from a range-model-based simulation and from an actually measured data set, respectively. The parameters of the range model are fitted to the values collected by Patwari [32], and the measured data set is the RSSI data collected by Patwari's experimental team [33] in a
Because performance is difficult to evaluate with relative coordinates, we use absolute coordinates to express node locations in this experiment. We compare our LE-KSPP algorithm, which is a graph-embedding localization method, with algorithms of the same kind: MDS-MAP [34], Isomap [35], and LE-KLPP [18]. The localization accuracies of both our method and KLPP depend on the kernel function; we select the Gaussian kernel for both. We also found that the kernel parameter σ is related to the distance between training samples, so in the experiments σ in (4) is set to 50 times the average distance between sample nodes, and the cumulative variance contribution rate of PCA is set to 90%.
The paper uses the Average Localization Error (ALE) performance index to evaluate the algorithms. The formula is as follows:
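Assuming the usual definition of ALE, the mean Euclidean error normalized by the communication radius and expressed in percent (the formula itself is omitted in this version of the text), it can be computed as:

```python
import numpy as np

def average_localization_error(est, true, comm_radius):
    """Mean Euclidean error between estimated and true coordinates,
    normalized by the communication radius and expressed in percent."""
    err = np.linalg.norm(est - true, axis=1)   # per-node Euclidean error
    return 100.0 * err.mean() / comm_radius
```

Under this definition, the ALE values of about 14.9% to 24.6% reported below mean that the average estimation error is well under a quarter of the communication radius.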
5.1. Localization Results with the Range Model
To ensure a fair comparison, when the measurement information is based on a signal strength model, the signal model from [32, 33] is used in this section to simulate the signal strength between nodes:
Among these,
5.1.1. Regular Deployment
In this group of experiments, the nodes were regularly deployed in a
Before analyzing the performance of the LE-KSPP algorithm, we first examine the final localization results under different deployments. In Figure 6, circles denote unknown nodes and squares denote beacons; a line connects the actual and estimated coordinates of each unknown node, and the longer the line, the more the estimate deviates from the actual location. Figure 6 shows the localization results for each node under regular deployment with 10 beacons. The ALE of the LE-KSPP algorithm for this uniformly deployed network is about 14.9% (Figure 6(a)) and 19.9% (Figure 6(b)).

Localization result with 10 beacons under regular deployment.
Figure 7 describes the impact of the number of beacon nodes (from 5 to 15) on the ALE of the four localization algorithms in a regularly deployed network, under blocked and unblocked environments, respectively. Our algorithm always obtains the best results. Unlike MDS-MAP, Isomap, and LE-KLPP, the localization accuracy of our algorithm improves as the number of beacon nodes increases. This is because the LE-KSPP method reconstructs the relations between nodes through SP and, in contrast to the other methods, chooses the number and weights of the surrounding nodes more self-adaptively.

ALE with different number of beacons.
5.1.2. Random Deployment
In this group of experiments, 200 nodes were randomly deployed in a
Similarly, the two final localization results were analyzed first. As shown in Figure 8, in these two experiments the number of beacon nodes is still 10. In the unblocked case, the ALE of LE-KSPP is about 17.5%; in the blocked case, the corresponding ALE is 24.6%.

Localization result with 10 beacons under random deployment.
In Figure 9, for the randomly placed sensor network, we again compare our method with the other algorithms (MDS-MAP, Isomap, and LE-KLPP) under different numbers of beacons. Our algorithm always achieves the minimal average localization error. When the number of beacons is 5 or 7, the ALE values of MDS-MAP and LE-KLPP even exceed 40%.

ALE with different number of beacons.
5.2. Localization Results with the Actually Measured Data Sets
The literature and the experiments of Section 5.1 show that LE-KLPP outperforms the MDS-MAP and Isomap algorithms; therefore, only the LE-KSPP and LE-KLPP algorithms are compared in this group of experiments. The experimental data come from the SPAN lab, and the experimental scenario is shown in Figure 10.

Actual collection area.
Using the data set above, we compared the localization performance of the LE-KLPP and LE-KSPP algorithms; Table 1 lists the results. Under each communication radius (CR), the localization accuracy of LE-KSPP is higher than that of LE-KLPP, with an improvement of more than 10% in ALE.
The ALE of LE-KLPP and LE-KSPP with actual RSSI-based ranging.
Figure 11 shows the localization results under a communication radius of 7.5 m. For both algorithms, the closer a node is to a beacon, the smaller the estimation error; because a beacon's location is exact, it determines the locations of nearby unknown nodes more accurately. With the LE-KLPP algorithm, the parameter k is set by hand, so an optimal solution (short line) is obtained in some areas, even far from the beacons, while in other areas it does not apply (long line). With the LE-KSPP algorithm, SP self-adaptively determines the number of adjacent points, so the estimates are relatively stable (the line lengths do not vary much).

Location estimates with actual RSSI-based ranging.
6. Conclusion
This paper studies the localization of sensor network nodes based on signal strength and proposes the LE-KSPP algorithm. LE-KSPP uses the kernel method to map the signal strengths into a higher-dimensional feature space and then obtains the sparse representation (SP) coefficients of the signal set. Because the data have better linear separability in the higher-dimensional feature space, SP can self-adaptively capture the "local" structure of the data: different sample points are automatically given different numbers of neighbors, and manual parameter selection is avoided, which makes the algorithm applicable to a wider range of environments.
The experiments in this paper show that the proposed LE-KSPP algorithm obtains satisfactory localization results on both the range model and actually measured data; it is only slightly affected by the number of beacon nodes and by blockage, adapts to various network topologies, and exhibits high robustness.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
The paper is sponsored by the Natural Science Foundation of China (61272379), the Prospective and Innovative Project of Jiangsu Province (BY2012201), the Provincial University Natural Science Research Foundation of the Jiangsu Education Department (12KJD510006, 13KJD520004), and the Doctoral Scientific Research Startup Foundation of Jinling Institute of Technology (JIT-B-201411).
