Abstract
This paper firstly represents a survey of recent researches on localization of CPS and analyzes the advantages and limitations of the existing schemes. To overcome the limitations, a novel localization scheme is then proposed, which includes site survey, received signal comparison, and location correction phases. The main improvement of our proposed scheme is to filter the noises and interference added to the received RSSI signal; the detailed procedure is then provided with mathematical analysis. Finally, the experiment and simulation results are presented with analysis.
1. Introduction
In recent years, wireless sensor networks [1, 2] achieve rapid development, which constitutes the fundamental of cyber-physical systems (CPS). In many CPS, sensed data will only be useful with the location information, which is so called location based services (LBS). Therefore, the localization of sensor nodes is essentially important for implementation of WSNs.
Many localization schemes have been introduced, which can be divided into range-based [3] and range-free [4–6] categories. Due to the big difference between indoor and outdoor environment, schemes designed for indoor or outdoor localization are relatively different. Only indoor environment is considered in the scope of this paper.
GPS [7] localization is a mature method nowadays, which uses satellite signals for localization. However, it does not work as well indoors due to the blockage of signals by walls and other barriers. Furthermore, GPS components are relatively expensive and large in size if high precision is required, which is not suitable for sensor nodes. Therefore, other available radio signals indoors are required. Nowadays, with the implementation of huge amount of WiFi [8] access points (APs) all around the world, many shopping malls and other public places offer WiFi signals. People with smart phones and other equipment such as pads or laptops can sense and measure WiFi signals and use them for localization with RSSI signature.
In theory, radio RSSI value decreases with distance increases. Because of multipath fading [9] and other signal attenuations [10], WiFi RSSI signature indoors is not smoothly monotone decreasing. It presents fluctuation in some pattern. Therefore, accurate ranging is not practical without complicated equipment. Therefore, range-based schemes are not good choices in indoor environment. To overcome such big problems, range-free schemes, such as fingerprint-based indoor localization [11–13], are introduced. It actually alters the algorithm kernel from geometry issues into classified issues. Original fingerprint-based localization aims to locate a user to the most likely sampling site, which serves as a reference location in positioning model. This algorithm is widely used for the sake of easy understanding and realization. The impact from random human presence can be offset to a large extent due to the fact that it only selects single ref-location. But the localization accuracy depends on the sampling sites interval and density utterly. The cost of preliminary manual work such as site survey [14, 15] will be increased exponentially once the accuracy needs to be improved.
Through our experiments, it is found that high frequency radio signals, such as WiFi RSSI signatures, are remarkably susceptible to human presence and movement. Human crossing the link between a WiFi AP and an RSSI receiver (AP-Receiver link) causes distortion of RSSI signature values remarkably, rebound after leaving, which causes localization inaccuracy. Furthermore, when there is no human interference, WiFi RSSI signatures present an inherent fluctuation that is a Gaussian distribution [10], which induces localization inaccuracy as well. It gives rise to false capture if the D-value of an RSSI value and its prediction reaches the threshold to determine the location even though there is no human interference. To recover such kind of RSSI signature attenuated waveforms, filtering schemes have to be introduced to eliminate the impact of random human presence, such as exponential weighted moving average (EWMA) [16].
In this paper, to overcome these limitations, a novel scheme called Fingerprint Signature Reorganizing (FSR) is introduced, with Overlap-based Weighted k Nearest Neighbors (OWKNN) algorithm, which is based from WKNN [17, 18]. By using the same interval and density of sampling sites as original scheme, both accuracy and standard deviation of localization can be improved significantly. This scheme is no longer susceptible to human presence [19] and movement as a Linearly Weighted Moving Average (LWMA) [20] scheme is adopted. Cooperative localization based on joint distribution of received APs-
The remainder of this paper is organized as follows. Section 2 outlines the previous work carried out on fingerprint-based localization algorithms, KNN, R-KNN, WKNN algorithms and valuable ideas with respect to localization. The scheme model and algorithms are described in Section 3. Section 4 presents a description of the experiment scenario and gives some practical results. Large scale, high density network simulation is carried out in Section 5 followed by performance evaluation and discussion. Finally, in Section 6, the conclusion and future work are listed.
2. Related Work
In recent researches of indoor localization, WIFI RSSI-based scheme is widely used due to the fact that few extra hardware is required [14, 21]. Although time of signal arrival (ToA) and time difference of signal arrival (TDoA) [22] can perform well outdoors, they suffer great multipath interference and other signal attenuations indoors [9, 10]. As Heurtefeux and Valois [21] mentioned, these solutions require dedicated hardware at both emitters and receiving ends and perform low in energy consumption. They also pointed out the limitations of RSSI-based schemes, such as signal instability and susceptibility to interference. However, if these problems can be solved, or at least partly solved, WiFi RSSI-based schemes are supposed to be applicable with high performance.
To determine the relationship between signal strength and RSSI signature, Rappaport [10] proposed path loss model, which presents log-distance path loss model when signal propagates in open space. Tested signal received power is given by
Other researches [12, 13, 17, 23–27] also proposed path loss model or radio propagation model generally based on log-distance path loss model and their tested RSSI signatures also confirmed the Gaussian distributed random variable model. They adopted different parameters in order to apply to specific scenario or hardware. In [24], authors set an alternative formulation:
Although radio propagation model offers the formula about RSSI and Transmitter-Receiver distance, precise positioning based on it is not feasible particularly indoors. Tanglesome surroundings induce unexpected small-scale attenuation, so trilateration [27] by T-R distances presents deviation because localization accuracy of range-based schemes highly depends on the accuracy of T-R distance estimation. Therefore, fingerprint-based localization [11–13] is widely adopted due to its feature of range-free. Walls, furniture, showcases, and so forth will not change RSSI fingerprint signatures and naturally have little impact on this algorithm as long as they stay statically.
Fingerprint-based localization schemes require a fundamental step called site survey [14, 15] that offers the data sampled serving as a reference locations system. However, the cost of preliminary manual work on site survey is a conspicuous shortage for such schemes. Wu et al. [15] and Chintalapudi et al. [14] proposed their schemes that reach an uplifting outcome; meanwhile, preliminary work lessens radically from site survey. An alternative way is to improve localization accuracy remarkably compared with previous schemes by costing the same in site survey.
Many schemes have been proposed to classify WiFi APs by using correlation coefficients of listened RSSI values, such as centroid [24], WDF [25], KNN [28, 29], R-KNN [17], WKNN [18]. Among which, WKNN is most reasonable in certain cases by focusing on estimating reference locations and their weights. Accuracy is improved significantly compared to original fingerprint. In this paper, OWKNN algorithm is proposed and the overlap ratio of APs between sensed by users and related reference locations is taken into consideration on weights estimation.
It is known that WiFi uses short wave signal, which is susceptible to interference. All the algorithms above perform retrograde in noisy cases, especially deteriorated on OWKNN's weight estimating. Outemzabet and Nerguizian [30] adopted Kalman filtering and Kim and Noble [16] used EWMA to filter noisy values and both of them performed well. This paper opts for LWMA [20] instead for the sake of simplification on computation complexity and codes realization, meanwhile, obtaining almost the same efficiency with respect to RSSI signature values recovering.
3. Scheme and Algorithm Description
In this section, the proposed Fingerprint Signature Reorganizing localization scheme and corresponding algorithms and the mathematical models are described. The main assumptions and key points are listed as follows.
Fingerprint-based localization schemes work according to comparison between user's received RSSI signatures and traversal of reference parameters computed and stored in database sampled from site survey. The similarity level determines the estimating location ultimately. It actually alters the kernel of localization from geometry issues into classification issues. RSSI values received present a Gaussian-like distribution without human interference. So the average values of RSSI signatures is obtained to help get the stable parameters and serve the algorithm efficiently, which is supposed to be closer to the theoretical RSSI signatures. However, in real world, RSSI signatures cannot get rid of the impact of human presence, which gives rise to deviation of the average values and degenerates localization accuracy. LWMA filtering can help recover the scenario back to no interference situation.
3.1. Fingerprint Signature Recognizing Localization Scheme
Although original fingerprint algorithm can offset the impact of random human presence to a large extent due to its position estimating chooses only one reference location sampled from site survey, which is regarded as single NN algorithm as well. The localization accuracy depends utterly on the sampling sites interval and density.
To overcome the limitations and maintain the advantages, a Fingerprint Signature Recognizing (FSR) localization scheme is proposed which only alters slightly from works that existed. Figure 1(a) is the overview of the flowchart of the scheme. Compared with original fingerprint localization scheme (Figure 1(b)), LWMA filtering, cooperative localization, and OWKNN algorithm are introduced to enhance localization performance. The method of estimating reference locations’ weights is the crucial step in fingerprinting algorithm. Many fingerprinting algorithms take the D-value of RSSI signatures to average a user and a reference location received as the basic parameter to compute the weights in an inverse correlation way.

(a) Our scheme. (b) Original scheme.
In our proposed OWKNN algorithm, a simple, efficient, and widely used method is adopted, which is demonstrated in Algorithm 1, where ε is a constant value used to protect computation from division by zero. By obtaining the average
3.2. Capture and LWMA Filtering
Not every RSSI signature value received is supposed to be replaced in filtering; otherwise the stochastic deviation of several first prior parameters selected will be passed to entire following RSSI values. Human presence on the link between a WiFi AP and an RSSI receiver causes RSSI signature values’ attenuation remarkably, rebound once human leaves (Figure 2(a) received waveform). Such kind of RSSI signature attenuated waveform is supposed to be captured and filtered by LWMA to eliminate the impact of random human presence. The equation is given by

(a) RSSI signature waveform. (b) LWMA analysis.
The length of receiving RSSI signatures list is assigned to
A human moving path can cover a list of AP-Receiver links (Figure 3(a)) in real world. It induces a user's interfered

(a) Human interference. (b) Temporal-spatial correlation.
3.3. False Capture and Cooperative Localization
The fluctuation of RSSI received presents a zero-mean Gaussian distribution within a
It is observed that
312 groups of vectors,
Given the crowd density in a shopping mall, it has a big probability that two persons closed both have localization requirements at the same time. Single interference source is supposed to affect two closed receivers in a similar pattern due to temporal correlation and spatial correlation, which serve as a ground to cooperative localization. Human presence on AP-Receiver links when moving, as an interference source, which impacts RSSI signatures received from all the APs in real world (as shown in Figure 3(b)). There is an APs intersection between
Assume
σ, which presents the level of fluctuation RSSI signature received, has the dominated influence on the probability of false capture. It becomes bigger when increases.
Cooperative localization reaches an uplifting outcome. Theoretically,
3.4. Overlap-Based Weighted k Nearest Neighbors Algorithm
KNN-based algorithms evaluate user's location by the relationship among reference locations and their weights; the equation is
K nearest neighbour locations are picked out correctly. Their weights evaluated are reasonable with respect to user's location.
Algorithm performs well if weights evaluation is high efficiency and the localization accuracy is supposed to be better than range-based schemes due to the low performance in range evaluation indoors.
Many fingerprint-based algorithms take the D-values of RSSI signature average calculated from a user and a ref-location's list as the base number to compute weights in an inverse correlation. But such schemes can confront a big problem that gives rise to a severely incorrect weights evaluation. In Figure 4(a), the

(a) 4 nearest neighbors. (b) Crasher case.
OWKNN algorithm is introduced for both ruling out crasher reference locations and reevaluating weights of nearest reference locations more reasonable. The kernels of this algorithm are demonstrated as shown in Algorithm 2, where
weight
Ref· Ref·

Comparison of overlap APs.
In order to find out the method for weights reevaluation, the analysis of overlapped APs is based on the following assumptions.
Sensing area of a receiver is modeled as a disk of radius r. The quantity of overlapped WiFi APs sensed at a user's location and a ref-location is in proportion to the overlap area of their sensing ranges.
Therefore, the weights evaluating issue become a geometrical issue through studying the overlap area instead of overlap APs. Formula (12) is simple and supposed to have high efficiency in making OWKNN algorithm qualify to Standard (ii):
The overlap area (Figures 6(a) and 6(b)) of two sensing ranges computed by distance of a user location and a ref-location is

(a) Geometrical relationship. (b)
The curve of formula (13) is almost linear especially for the first half (Figure 7(a)), which is used in algorithm. So it can be simplified to a linear pattern for the sake of easier realization both in algorithm and coding:

(a) Curve of formula (13). (b) Weighted centroid.
The relationship between
In order to obtain the equation of Reference locations system is formed to grid pattern;
The function images with respect to formula (16) restricted by the conditions above are demonstrated in diverse groups of
The function images of relationship between

The curve surfaces of simplified formula (16).

The function curve surfaces of relationship between
It is observed that
This scheme obtains an evaluation in exponential pattern computed by
4. Scenario Study and Experiment Results
This section describes the characteristics of a general indoor scenario, such as a shopping mall, in which our scheme and algorithms work effectively. The RSSI signatures both within and without human interference through experiment are also demonstrated. The data collected and distributions calculated provide parameters of our algorithms and models.
4.1. WiFi APs Deployment and Its Impact on FSR Scheme
In most shopping malls, free WiFi access is usually offered to customers, which can provide extra services. Given the high popularizing rate of smart phone nowadays, it is feasible to assume that hundreds of smart phones with WiFi devices can help indoor localization.
OWKNN algorithm computes
WiFi APs are randomly deployed in office buildings due to unorganized relationship among office owners without considering the high coincidence for overlap of propagation range of WiFi APs. Nearly every office needs at least one WiFi AP on its own due to privacy and information security. The high density of APs can sharply decrease the proportion of stochastic deviation in a given region (Figure 10) due to the large denominator. Through experiments, a WiFi receiver can sense 40–60 APs in an office building in a university.

APs randomly deployed with relatively high density.
The density of WiFi APs in a shopping mall is lower as open access is the only consideration. A receiver can sense 15–30 APs, which can give rise to higher proportion of stochastic deviation in a shopping mall than that in an office building. However, the APs deployment in a shopping mall is not random utterly, but substantially based on grid or cell pattern (Figure 11), which are supposed to be the most economic coverage schemes. The stochastic deviation becomes stable and predictable and that contributes to renew OWKNN algorithm. The

APs deployed in grid pattern with relatively low density.
4.2. RSSI Signature Attenuated Waveform and Distribution
One of the RSSI signature attenuated waveforms caused by human presence on AP-Receiver links is demonstrated in Figure 2(a). The vast majority of human interference can be captured in waveforms clearly.
Fluctuated distributions of RSSI signatures tested from experiments are collected and calculated to be used as the distribution of real case served in false capture. Several lists of RSSI signatures within different length of AP-Receiver links are demonstrated in Figure 12.

RSSI signatures without human interference.
The WiFi AP hardware and laptop as the receiver are shown in Figure 13.

The experimental equipment.
5. Simulation Setup and Performance Evaluation
In order to examine the efficiency of our advanced fingerprint-based localization scheme, experiments and simulations are carried out based on the following configurations. The entire scenario is set up as follows (APs deployment pattern is similar to Figure 10, the site survey scheme is bases on Figure 14, and the user locations is tested bases on Figure 15).
Assume each WiFi AP has the same hardware characteristics (RSSI signatures fluctuation picks Ref case in Table 1) and a radio propagation range of The distance between samples in either vertical or horizontal direction is 5 meters in site survey. Therefore, a total of 441 sampling sites are formed from a grid of 21 meters × 21 meters, which covers the room serve as reference locations. 1681 users’ locations are formed from a grid of 41 meters × 41 meters, which covers the room to be tested for algorithm performance evaluation.
RSSI signatures fluctuated distributions in different cases.
Results with respect to different RSSI signatures distributions.

Reference locations in site survey.

Users locations tested.
Several key parameters and crucial conditions of algorithm are computed based on the setting characteristics of scenario. ɛ is assigned to
Theoretically,
Above all,

Localization accuracy.

Standard deviation.
There are comparisons among
This scheme improves accuracy by
It verifies that advanced fingerprint-based localization scheme achieves a good accuracy and succeeds in recovering in interfered conditions. Theoretically, efficiency of cooperative localization is supposed to be
6. Conclusion and Future Work
In this paper, an advanced fingerprint-based indoor localization scheme called Reorganizing the Fingerprint Signatures (RFS) for CPS has been introduced. The interference of human presence to localization accuracy has been studied and the corresponding scheme has been implemented. The scheme can also solve the problem of false capture and introduces cooperative localization for further increasing of localization accuracy. The experiment and simulation results show that this scheme can sufficiently improve the localization accuracy.
Footnotes
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
The work presented in this paper was partially financed by the National Natural Science Foundation of China under Grants 61190113 and 61272539 and Zhejiang Provincial Key Innovation Team on Sensor Networks under Grant no. 2009R50046.
