Abstract
A new 3D fingerprinting positioning scheme based on cellular networks is proposed by using implicit information of the location around geographic environments. Compared with original fingerprint positioning scheme, singular measurement is processed at the beginning of the proposed scheme. The main improvement of our proposed scheme is to employ cell matching degree and choose the best reasonable fingerprint according to the correlation of its surrounding points. Additionally, in order to improve searching efficiency, a new searching window is introduced. Finally, compared with the maximum likelihood method and the weighted K nearest neighbors method, the promising numerical results are also presented.
1. Introduction
The location-based service (LBS) in indoor environments has recently received unprecedented attention, since it is closely related to human lives. According to industry statistics, 70% of voice services and 80% of data services occur in indoor environments. Indoor location technologies based on wireless sensor network (WSN) increase enormously, for instance, wireless fidelity (Wi-Fi) and radio frequency identification (RFID) [1].
Much work has been done in the indoor location field. For example, Brunato and Battiti [2] proposed localization algorithms based on statistical learning theory. Mumtaz et al. [3] developed a downlink location based on adaptive beamforming algorithm, which profits from available positioning information. Alsehly et al. [4] presented a novel WIFI based indoor positioning method to save time and cost. Lee et al. [5] presented a localization method based on sound sources in underground parking lots and the least-squares method (LSM). T. Kim and E.-J. Kim [6] used SSID of virtual AP which contains the location information to allow users to detect their locations by scanning the WLAN signals. Liu and Yang [7] combined scene analysis and the feature of trilateration and gave a WIFI based positioning scheme. Navarro-Alvarez et al. [8] presented a new adaptive method to calculate the path loss exponent which could contribute to the precise estimation of the distance. Alsalih et al. [9] proposed two RFID localization schemes, which used received signal strength index (RSSI) and the transmission power control from the reader side.
Excluding accurate and mature for these technologies, compared with cellular location technology, they need to install extra infrastructures such as access points (APs) and bring extra costs. Additionally, a more realistic challenge is that users can not use the indoor location services, since their mobile terminals do not have corresponding modules. In order to solve these problems, radio frequency pattern matching (RFPM) method is proposed in 3GPP RAN4 due to its advantages [10, 11]. First of all, it does not bring any extra equipment to existing cellular networks. Secondly, it is not sensitive to the channel states and can be designed to obtain better results in the environment with significant multipath by using specific algorithms.
A typical localization method based on fingerprint matching is generally divided into two phases: an offline training phase (offline) and an online positioning phase (online). As shown in Figure 1, in the offline training phase, a pattern database can be constructed to store radio measurement vectors between network side and the center of every separated grid in advance [12, 13]. In that way, the radio condition mapping is created. In the online stage, when network side receives a positioning request coming from user equipment (UE), the position of UE can be acquired by matching available UE measurements with the corresponding data of every separated grid. According to the certain algorithm criteria, the most appropriate grid is selected as estimated position of target UE. Therefore, in the condition that the pattern database is created, it is necessary to find a good matching algorithm.

Fingerprinting positioning principle.
Recently, several fingerprint matching algorithms have been used for UE localization. In [14], a fingerprint positioning method is proposed, which puts the average of K nearest neighbors similar to MR data in the feature database as estimated position. In [15], a matching algorithm based on maximum likelihood is proposed to suit the situation where the measurement values are relatively independent, and measurement errors satisfy the Gaussian distribution. A new position estimation method is obtained from support vector machine (SVM) in [16].
In this paper, we propose a new fingerprinting scheme, which makes full use of the implicit measurement information of environment and space. Then, the promising test results are also presented, compared with the traditional RFPM localization method in indoor environments.
This paper is organized as follows. In next section, some traditional RFPM algorithms are given, including the maximum likelihood method and the K nearest neighbors method. Section 3 describes a new scheme called fingerprint correlation localization (FCL), with enhanced nearest neighbor localization (ENNL) algorithm. In Section 4, the test results based on the real cellular localization data are given. Finally, conclusions are drawn in the last section.
2. Traditional RFPM Localization Methods
Available fingerprint matching algorithms can be divided into two categories: probabilistic algorithms and deterministic algorithms. Probabilistic algorithms create a mathematic model and get an estimated position based on maximum likelihood estimation [15] or naive Bayesian statistical learning theory [1, 2]. Deterministic algorithms select the most similar position by calculating the similarity of measurement parameters between the target UE and the fingerprint. The most famous deterministic algorithm is K nearest neighbors method proposed by Microsoft Corporation [14].
2.1. ML Positioning Scheme
The probability density distribution of measurements can be obtained by measuring the identical point repeatedly. Then, the estimated position is the coordinate of fingerprint, which maximizes the likelihood function and achieves the maximum probability.
In [15], the basic model for a UE positioning can be expressed as
Assume that all the measurement variables are mutually independent and the measurement error satisfies the Gaussian distribution. Its probability density is defined by
The ML positioning method assumes that measurement variables are mutually independent, and the measurement error satisfies the Gaussian distribution. Therefore, it can achieve relatively high accuracy under this assumption.
2.2. K Nearest Neighbors Positioning Scheme
In the online stage, using the strength of signal, the K nearest neighbors positioning method calculates the Euclidean distance between the measurement vector and the fingerprint of each grid [14] as
The variable K has a significant impact on its positioning accuracy. If K equals 1, the algorithm is equivalent to the nearest neighbor method, which could be affected by measurement error easily. When K is a big number, points that are not close to the real location would be included. Then, the accuracy will decrease dramatically [17, 18]. Therefore, mostly, the best K is a small constant (from 2 to 4).
Better positioning accuracy can be achieved by averaging the coordinate of K nearest neighbor positions in the database. It also improves the positioning accuracy by using the weighted average technique. The more the fingerprint point is approximate to the needed positioning point in signal space, the higher weight value is assigned. Then, the K nearest neighbors' method is modified by
Through the analysis, compared with the algorithms that focus on complex environmental characteristics, the K nearest neighbors positioning scheme pays more attention to the similarity between the measurement signal features and the fingerprint in database.
3. Fingerprint Correlation Localization Scheme
UE can be located more rapidly and more accurately if the environmental and space information is used properly. In this section, the modified fingerprint correlation localizationscheme and the corresponding mathematical model are presented, including flowchart and specific implementations. In order to improve searching efficiency, a new searching window is introduced.
3.1. Fingerprint Correlation Localization Scheme
Compared with original fingerprint positioning scheme, the singular measurement process, the cell matching degree, and the correlation with surrounding points are introduced in the modified scheme. Figure 2(a) is the overview of the flowchart of our scheme, and Figure 2(b) is the flowchart of original scheme.

(a) Proposed algorithm flowchart. (b) Original scheme.
The construction of pattern database is as follows. For each layer of 3D buildings, it is divided into many areas whose shapes may be rectangular, which is denoted as
Pattern database
3.2. Processing Singular Measurements
Based on the pattern database, we handle singular measurement at first. UE sends a positioning request to the network side and reports the measurement report (MR). When the network interface obtains the MR, it judges the number of received signal code power (RSCP), which is given as if if
3.3. Enhanced Nearest Neighbor Localization Algorithm
For the reported MR, the measurement vector F is defined as
Then, we calculate the cell matching degree between the measurement vector F and each fingerprint
According to our numerical experiments, the indicator of cell matching degree
Then, the candidate fingerprint set, which has senior cell matching degree with measurement vector, is obtained and denoted by
Calculate the Euclidean distance between F and each grid of C as follows:
Finally, the best grid fingerprint is determined from these potential fingerprints.
At first, the smallest point
3.4. Searching Window
From the above analysis, the matching process costs much time when the fingerprint database is huge. In order to improve searching efficiency, we restrict searching data in a range of the coarse estimation point. The coarse estimation point can be obtained as follows. When the call-ID of the next MR is similar to an MR of the previous set, we regard the previous estimated point as the coarse estimation point of the current point and search the fingerprint points around that coarse estimation point.
4. Real Test Experiments
In order to test the effect of our modified method, we collect the real data of macrocell in a supermarket and compare it with the ML method and KNN method for those real data.
4.1. Test Scenario
In a real cellular network environment, since UE is distributed widely, the cost of acquiring large-scale drive test data is too high in a wide area. Then, it is not impracticable to construct a fingerprint database from drive test data. A practicable method of constructing a fingerprint database is generated by ray tracking techniques [19], and our pattern database is provided by Huawei Technologies Co., Ltd., in 2013.
The fingerprint database is constructed in an area of 2000 m * 2000 m, Chengdu, China. That test scenario is divided into four layers, and the height of each floor is 7 m. The size of vertical grid is 5 m * 5 m. The total number of grids is 272306. The grid fingerprint has ten cells.
They collect the real measurement data including RSCP and the position of UE in a supermarket. The supermarket is a building with four floors, and its area is 173.95 m * 53.82 m. The total number of measurement reports is 148482. And each measurement report receives the signals from seven cells.
4.2. Real Test Results
In the phase of processing singular measurements, since RSCP of the measurement report only comes from one cell-ID, the position of UE cannot be determined. To overcome this phenomenon, its historical tracking information is added to determine the position of UE. The positioning performance compared with original method is shown in Figure 3 and Table 1.
Error statistics (m).

Horizontal positioning error.
USA Federal Communication Commission (FCC) releases the regulatory requirements for typical region. Positioning errors of FCC emergency positioning are less than 50 m with 67% and 150 m with 95% in the terminal-dependent case and less than 100 m with 67% and 300 m with 95% in the network-dependent case [20]. Therefore, we obtain the cumulative distribution function for all the MR positioning errors and compare positioning accuracy at 67% and 90% probability. The positioning probability is defined by the ratio of the number of the positioning error being less than the distance and the total number of MR measurements.
From Figure 3 and Table 1, we find that the revised method with processing singular measurements improves 20 m horizontal positioning errors at 90% positioning probability, compared with the original localization method.
In the positioning phase, as discussed in Section 3, an appropriate threshold of cell matching degree needs to be chosen. The threshold cannot be too large or too small. In order to pick up an appropriate threshold, we test the algorithm when the threshold of cell matching degree is chosen from 60%, 70%, 80%, and 90% to 100%, and their localization errors and cost of execution time are presented by Figures 4 and 5 and Tables 2 and 3.
Error statistics (m).
Time statistics (minute).

Estimated error at 67% positioning probability.

Estimated error with 67% positioning probability.
From Figures 4 and 5, we find that the positioning accuracy seems better when the threshold is chosen from 70% to 80%, and the positioning time is decreasing monotonically in threshold variable. Therefore, we choose 75% as the threshold of cell matching degree.
Based on the above discussions, we compare the revised positioning method with the maximum likelihood positioning method (ML) and the K nearest neighbors positioning method (KNN), and their numerical results are presented in Figures 6 and 7 and Tables 4 and 5. Figure 6 is their numerical curves of horizontal positioning accuracy. Figure 7 is their numerical curves of vertical positioning accuracy. From Tables 4 and 5, compared with the ML positioning method and the KNN positioning method, we find that our revised positioning method (ENNL) improves the horizontal positioning accuracy from 70.8 m (ML) and 51 m (KNN) to 21 m (ENNL) at the 67% positioning probability and the vertical average positioning error from 7 m (ML) and 5.9 m (KNN) to 2 m (ENNL), respectively. The vertical error especially of our revised positioning scheme achieves 0 m at 67% positioning probability, which can distinguish the floor of the test building significantly compared with other methods, since the floor height of the test building is 7 m. It meets the FCC requirement and implements 3D indoor localization well [20].
Horizontal error statistics (m).
Vertical error statistics (m).

Horizontal positioning accuracy.

Vertical positioning accuracy.
5. Conclusions
In this paper, the principle of fingerprinting positioning and a novel positioning method in indoor environments with cellular networks have been presented. The detailed description of ML and KNN and the revised positioning scheme are given. Besides, we test and compare those positioning methods in a real indoor environment based on cellular network. From the test results, the horizontal accuracy is 21 m at 67% positioning probability of the revised positioning method without any extra equipment and satisfies the FCC requirement, that is, horizontal positioning accuracy 50 m at 67% positioning probability [20].
Footnotes
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
This work was supported by Grant no. YB2013050048 from the Innovation Research Program of Huawei Technologies Co., Ltd., and Grant nos. YB2013090066 and YB2013050048 from Huawei Technologies Co., Ltd., and Grant no. 2012ZX020013 from Institute of Sensing Technology and Business, Beijing University of Posts and Telecommunications.
