Abstract
With the continuous expansion of the market of device-free localization in smart cities, the requirements of device-free localization technology are becoming higher and higher. The large amount of high-dimensional data generated by the existing device-free localization technology will improve the positioning accuracy as well as increase the positioning time and complexity. The positions required from single target to multi-targets become a further increasing difficulty for device-free localization. In order to satisfy the practical localizing application in smart city, an efficient multi-target device-free localization method is proposed based on a sparse coding model. To accelerate the positioning as well as improve the localization accuracy, a sparse coding-based iterative shrinkage threshold algorithm (SC-IA) is proposed and a subspace sparse coding-based iterative shrinkage threshold algorithm (SSC-IA) is presented for different practical application requirements. Experiments with practical dataset are performed for single-target and multi-targets localization, respectively. Compared with three typical machine learning algorithms: deep learning based on auto encoder, K-nearest neighbor, and orthogonal matching pursuit, experimental results show that the proposed sparse coding-based iterative shrinkage threshold algorithm and subspace sparse coding-based iterative shrinkage threshold algorithm can achieve high localization accuracy and low time cost simultaneously, so as to be more practical and applicable for the development of smart city.
Keywords
Introduction
Smart cities intelligently respond to the needs of life, environmental protection, public safety, urban services, and industrial and commercial activities by using information and communication technologies to perceive, analyze, and integrate key information from core urban operating systems. 1 Its essence is to apply advanced information technology to realize the intelligent management and operation of the city and create a better life for the urban people. A smart city, trying to deal with the problems intelligently, may involve many application areas, as shown in Figure 1. Among all technologies and services in smart city, localization information is one of the most important elements. Thus, wireless localization technology, especially indoor localization technology, has become the practical basis for extensive applications. For example, in the healthcare of a home or hospital for detecting the location and activities of the elderly people or patients;2,3 in smart spaces such as airports, shopping centers, and tourist attractions for location-based services;4,5 in case of fire for location detection of firefighters and survivors to realize emergency rescue; 6 for indoor environmental monitoring and building energy saving control; 7 as well as for detecting and tracking the location of terrorists in safety protection. 8

Frame diagram of smart city.
Various wireless localization technologies such as GPS, 9 infrared, 10 ultrasonic, 11 and radio frequency (RF) identification 12 have been developed to support the needs of the aforementioned scenarios. The application of these technologies, however, requires the target to be equipped with wireless devices, such as smart phones, ultrasonic receiver nodes, and tags.13,14 Therefore, these device localization technologies may be limited in some scenarios, like in a fire or earthquake, in an emergency, and in detecting or tracking illegal people in the building, because the targets may not wear or carry trackable devices all the time.
Device-free localization (DFL) technology, using RF sensor network to detect, track, and locate targets without any additional devices, has attracted wide attention.15–17 The principle of DFL is to detect targets and estimate the location of targets by the small RF signal changes caused by targets. Most current DFL methods, however, can only detect a single target rather than multi-targets, which will limit the practical application in smart city. Moreover, high localization accuracy depends on the deployment of large number of nodes and the repeated measurements of received signal strength (RSS), which will affect the efficiency and timeliness severely so as not to support the applications like real-time navigation or rescue in emergency. Therefore, reducing the total localization duration without sacrificing accuracy becomes critical. However, Wilson’s RF-based model 18 shows that as the number of nodes in the network increases, the total number of links will increase quadratically. Therefore, the processing of high-dimensional data in the DFL system will be the fundamental focus for practical application.
In order to deal with high-dimensional data and achieve high localization accuracy in the process of multi-target locating, an efficient multi-target device-free localization (EMDL) method is proposed. First, an iterative shrinkage threshold algorithm (ISTA) for target localization is combined with a sparse coding method based on convex optimization. Then, a subspace-based sparse coding (SSC) scheme is further proposed to reduce the dimension in the direction of row dimension and column dimension so as to handle the high-dimensional data problem. Finally, experiments based on practical dataset are carried out on the sparsely distributed targets, and the results are compared with those by three typical machine learning algorithms, deep learning based on auto encoder (DL-AE), K-nearest neighbor (KNN), and orthogonal matching pursuit (OMP), to evaluate the performance of the proposed algorithm.
The rest of this article is organized as follows: section “DFL model and algorithm” describes the typical DFL algorithms and models. Section “EMDL” introduces the established architecture, the localization stage, and the proposed algorithms of EMDL method in detail. The performance evaluation results are shown in section “Simulation results and evaluations,” and section “Conclusion” summarizes the whole work.
DFL model and algorithm
As shown in Table 1, the existing DFL methods mainly include fingerprint recognition, geometric methods, and compressive sensing (CS), where fingerprint-based localization is also integrated into other localization methods19–22 to convert the localization problems into fingerprint matching problems. Collecting the fingerprint training database is always the first step, and then some machine learning classifier is adopted to match from the new fingerprint to determine the location of the target. Zhang et al. 23 proposed a typical geometric method, in which the tracking field was divided into different regions, and the adjacent regions took different communication channels to improve the matching efficiency. For each zone, three nodes were deployed on the ceiling to form a regular triangle to monitor the zone. In each triangle region, a Support Vector Regression (SVR) model could be adopted to locate the object. Wang et al. 24 converted the DFL problem into a sparse signal reconstruction problem and reconstructed the location information from the undersampling measurements using a recursive compressed sensing algorithm. The common problem of the aforementioned existing methods is that a large number of nodes are required to meet the requirements of high localization accuracy.
The list of DFL models.
DFL: device-free localization; DBGM: dynamic Bayesian Gaussian mixture; KC: K-means clustering; CS: compressive sensing; BGMP: Bayesian greedy matching pursuit; LP: linear programming; OMP: orthogonal matching pursuit.
Many models and algorithms are presented based on the characteristics of the target locations and the relationship between the RSS measurements and the target states. Wang et al. 25 used a dynamic Bayesian Gaussian mixture algorithm (DBGM) and a K-means clustering (KC) to improve the objects tracking in radio tomographic imaging (RTI) systems. Youssef et al. 26 proposed a dynamic statistical model based on CS theory and used Bayesian greedy matching pursuit (BGMP) algorithm to locate the target. Liu et al. 27 used sparse representation model for indoor localization and applied the traditional sparse coding algorithms LP and OMP to solve the l1-norm constraint minimization problem. These algorithms are inefficient in processing high-dimensional data. In addition, 28 a sparse representation model for DFL was proposed, but only focused on single-target localization rather than multi-target localization. In addition, 29 a novel affine scaling steepest descent (ASSD) algorithm was also proposed to find a satisfying sparse solution of lp-norm minimization. The optimal step size is calculated by the steepest descent direction, which not only overcomes the problem that the traditional lp-norm minimization sparse recovery algorithm usually obtains the suboptimal sparse solution when the initial point does not belong to the convergence domain of the global optimal sparse solution, and convergence is also accelerated at low signal-to-noise ratio (SNR). The sparse matrix construction and iterative ideas in this algorithm are very instructive for accelerating locating.
Based on the aforementioned algorithms, for the purpose of practical application in smart city, an efficient multi-target DFL model is proposed based on sparse representation. Algorithmically, unlike the traditional sparse coding algorithms like OMP, the advanced Iterative-shrinkage-threshold Algorithm (IA) is introduced for sparse coding to improve the efficiency of processing high-dimensional data in DFL.
EMDL
This section focuses on the architecture of the proposed EMDL, as well as the core algorithms presented based on the sparse model.
Architecture of EMDL
The architecture of EMDL proposed in this article can be described from both the theoretical models and practical applications.
Theoretical model
Sparse coding algorithm is an unsupervised learning method to represent the sample data efficiently by exploring a set of super-complete basis vectors. In view of the large amount of data and high dimensions in indoor localization, sparse coding algorithm will be introduced in the proposed EMDL to achieve high precision and efficiency of localization. Figure 2 shows the theoretical framework of EMDL. According to the theoretical model of EMDL, the feature data will be extracted from each sub-dataset of the original data and stored in a column, so as to achieve dimensionality reduction in the column. Then, similar method is adopted to further reduce the row dimension. The reduction from two dimensions can improve the effect of data shrinking and avoid the main original information lost.

Theoretical framework of EMDL.
Practice model
The monitoring area of the proposed EMDL system can be discretized into a grid as shown in Figure 3, where only the target location is non-zero, so as to weaken the energy of signal broadcasting. That is, only the target locations will affect the RSS measurements of each anchor point (AP) in the detection area of the EMDL system. The wireless sensor node is referred to as the AP. 30 All APs are transmitters–receivers, which are deployed around the monitoring area and can communicate with each other. These APs act as the transmitters in sequence according to the schedule, that is, only one AP transmits the detecting signal and the other APs receive the signal at some time. As shown in Figure 3, the appearance of a target may absorb and scatter the transmitted signals, which will cause the RSS measurements changing so as to achieve the target detection.

Practice model of EMDL.
Overall process of EMDL
Assume that there
where [.]
T
represents the transposition. There will be
Considering that there will be only one AP operating as the transmitter at one time,
where there will be
Dictionary constructing phase
The monitoring area is discretized into

The dictionary constructing phase.
During the offline testing process, assume a target lying at or passing some RP, each AP will transmit the detecting signal in turn according to the schedule, and the other APs will obtain the RSS measurements to form the RSS matrix. To cover all the possible positions, the aforementioned measuring steps will be repeated for all
where
where
Performing τ tests at
where the
Detecting and localizing phase
After the training in the offline stage, the target will be detected and localized, which can be called online stage. The practical detecting signal will be obtained and the detecting vector
Linear combination of the detecting signals
Assume that the target is lying at the pth RP, that is, the target belongs to the pth class. If there are enough experiments performed at the pth RP in the dictionary constructing phase to obtain enough samples for the pth RP, the new sample of the simulate target obtained at the pth RP can be approximated by the pth sample matrix
where
Sparse representation of the detecting signals
The samples in Figure 5 are taken as example with the dataset consisting of 81 points from 9 classes. The red triangle in Figure 5 represents the detecting signal, the blue circle represents the non-zero coefficient of the red triangle after classification by sparse representation, and the rest represents the training sample of the dictionary. It can be seen that if the detecting signal belongs to the fifth class, the selected samples for indicating the non-zero coefficients of the detecting signal sorted by the sparse representation are also mainly from the fifth class.

Description of the detecting signal sparse-representation classification.
Based on equation (7), the linear representation of the detecting vector
where
EMDL algorithms
Sparse model
Sparse coding is a process of calculating the coefficient vector based on the detecting signals set

Description of the sparse coding process for the target localization.
For
If
where l0-norm
Considering the noise in practical data, the dense noise model is introduced and equation (8) is modified to equation (11)
where
For sparse coding, a more general modification of equation (12) is to use the
where the first term with l2-norm square is the reconstructing error term, and the second term with l1-norm is the sparse penalty term related to the possible location range of the target.
Finally, equation (13) is optimized in EMDL using the ISTA 33 with the sparse coding steps as in equation (14)
where
where
Sparse coding-based iterative shrinkage threshold algorithm
Sparse coding-based iterative shrinkage threshold algorithm (SC-IA) is proposed based on the aforementioned established model to accelerate the positioning. According to equations (14) and (15),
Based on equation (16), let
If the solution only includes one non-zero component, the corresponding location of RP can be directly treated as the target location. If the solutions contain multiple non-zero components, it can be divided into the following two cases:
For a single target, the location of the target can be estimated at the
For
where
Pseudocode for SC-IA.
SC-IA: sparse coding efficient multi-target device-free local.
SSC-IA
Considering that the large amount of data in the space may affect the localizing efficiency, a SSC scheme is proposed to utilize subspace techniques. After the dimensionality reduction is operated in the row and column directions of the EMDL dataset, SC-IA can be performed in the low-dimensional subspace to improve the efficiency without sacrificing the localization accuracy.
Inspired by subspace techniques used in dimensionality reduction 34 and denoising, 35 a new dictionary will be constructed in the principal component subspace. Specifically, the entire process of the SSC scheme is divided into two steps.
Dimensionality reduction in column direction
It can be known from equation (5) that the original dictionary
where
where
Dimensionality reduction in row direction
Before dimensionality reduction in the row dimension, zero-mean normalization is performed on
Then, the eigenvalues and eigenvectors of
where
where
where RoC is the abbreviation for ratio of cumulative distribution.
Therefore, it is necessary to achieve a high RoC with the smallest possible
Therefore, a new approximate sample dictionary
where
In the detecting phase, when the target is in the monitoring area, a new approximate detecting vector
where
Therefore, similar to the process of SC-IA, targets can be located by addressing the following issues (equation (29))
where
If
For a single target, the location of the target will be estimated at the
For
where
Pseudocode for SSC-IA.
SSC-IA: subspace sparse coding-based iterative shrinkage threshold algorithm; SVD: singular value decomposition; EVD: eigenvalue decomposition.
Simulation results and evaluations
The performance of the proposed EMDL will be evaluated using the experimental dataset from the SPAN laboratory of the University of Utah.
Figure 7 shows the experimental scenario on the first floor of the physical laboratory building. A total of 24 APs are deployed on a 12 m × 12 m square simulating area with 2 m between APs. Each AP works in 2.4 GHz and communicates using the IEEE 802.15.4. There are 36 RPs in the monitoring area. Thirty tests are performed repeatedly with the same configuration for each RP. The RSS sample matrix at each RP is divided into two parts, where 25 tests are used to construct the dictionary and the average of the rest 5 tests is used as a detecting signal for locating a single target. For the condition with multiple targets, the dictionary is the same as the dictionary for locating a single target, but the detecting signal is from another 30 new RSS samples for each additional target.

Description of the experimental scenario.
Settings and indicators
SC-IA and SSC-IA are compared with three popular algorithms widely used in DFL such as DL-AE, KNN, and OMP. Considering OMP is an effective DFL algorithm based on sparse coding 23 essentially, SSC-OMP representing OMP based on the SSC mechanism proposed in this article is also adopted for performance comparison to estimate the subspace sparse coding algorithm. Some performance indicators used are described as follows:
Assuming that the actual locations of
For a single target location (N = 1), MLE becomes localization error (LE)
When data are converted to the subspace, both the row and column dimensions are reduced. For ease of presentation, the row dimension is abbreviated as Rds, and the column dimension is abbreviated as Cds in the following description.
In practical applications, the EMDL system is inevitably subject to the noise. Gaussian distribution is adopted to simulate the additive noise in the experiments. For any ith RP
Single target localization
The experiments for single target localization are performed during the dictionary constructing phase with the comparison in noisy environment and noise-free environment.
Localization performance of SC-IA
Figure 8 shows the comparison of localizing performance by SC-IA, DL-AE, KNN, and OMP. It can be seen from Figure 8(a) that when the SNR is greater than 10 dB, the accuracy of four algorithms is basically above 90%. However, when the SNR is between 5 and 10 dB, the accuracy of the SC-IA algorithm is significantly higher than that of the other three algorithms. When the SNR is less than 5 dB, it can be seen that the accuracy of all algorithms drops sharply, indicating that this range does not satisfy the localization conditions. Figure 8(b) depicts the case of LE for SC-IA with SNR in the range of −10 to 40 dB. It can be seen that in the case where the SNR of the detection signal is 10 dB or noiseless, the LE of all detected locations is close to zero. This means that SC-IA is robust to noise when the SNR of the detecting signal is greater than 10 dB.

Localization performance of SC-IA with noisy and noiseless detecting signals: (a) comparison of localizing accuracy with different SNRs and (b) comparison of LE for SC-IA at each RP.
Localization performance of SSC-IA
Figure 9 shows the comparison of the localization performance of SSC-IA and other three algorithms in processing low-dimensional data. It can be seen from Figure 9(a) that when the row dimension of the data is reduced to 10, the localizing accuracy of the SSC-IA can reach 97.5%, which is much higher than that of the other three algorithms. When the row dimension of the data is higher than 12, the accuracy of the algorithm is 100%, but the convergence speed of SSC-IA is better than the other three algorithms. When the row dimension of the data is less than 10, the accuracy of the four algorithms is less than 93%, indicating that the lower limit of the dimension of accurate localization is reached at this time. Figure 9(b) shows the comparison of the ALE of SC-IA with the processing data of 576-Rds-900-Cds and SSC-IA with processing data sizes of 10-Rds-35-Cds, 20-Rds-35-Cds, and 30-Rds-35-Cds, when the SNR of the detecting signal is between −10 to 40 dB. It can be seen that when the SNR is greater than 10 dB, the ALE of the SC-IA and the SSC-IA with a processing data dimension of 30-Rds-35-Cds is close to 0; at the same time, when SNR is greater than 15 dB, SSC-IA with the processing data dimension of 20-Rds-35-Cds also performs well. This shows that if the data dimension is reduced to 30-Rds-35-Cds, the localizing accuracy and the robustness are not affected. SSC-IA can be used to process reduced dimensionality data and is robust to noisy environments when the SNR is above 10 dB. This means that even if the dimension of EMDL data is reduced from 576-Rds-900-Cds to 20-Rds-35-Cds in noiseless condition, or the data dimension drops to 30-Rds-35-Cds when the SNR is higher than 10 dB, SSC-IA can maintain high accuracy and efficient processing performance, which will greatly reduce the computational burden.

Localization performance of SSC-IA and the compared methods with different row dimensional data: (a) comparison of localizing accuracy with other algorithms and (b) comparison of ALE with SNR from −10 to 40 dB.
In addition, the reason why low-dimensional data are used in EMDL is that almost all important features of each class are extracted into the eigenvector after EVD. Therefore, reducing the column dimension will not lose the main feature. As shown in Figure 10(a), the eigenvalues associated with the original data are sorted in descending order. The first principal component

Eigenvalue (a) and cumulative distribution ratios (RoC) (b) associated with the 1st to the 40th.
Therefore, it is understandable that the accuracy can reach 100% when the eigenvalue is higher than 12 in Figure 9(a). That is because the first 12 features of the reduced data can represent 97.3% of the characteristics of the original dataset, which is enough to ensure the accuracy of EMDL.
Localization performance of dictionary construction with noise
SSC-IA with data dimension of 30-Rds-35-Cds shows the same good localization performance as SC-IA with data dimension of 576-Rds-900-Cds when SNR is greater than 10 dB according to Figure 9(b). Therefore, in this section, the original data as well as the data with dimension reduced to 30-Rds-35-Cds are adopted to simulate the localization performance in dictionary construction with noise.
As shown in Figure 11, the noise is added to the dictionary ranging from 0 to 40 dB with an interval of 5 dB. Meanwhile, experiments were performed using the detecting signal datasets with SNR = 10 dB and SNR = 20 dB and a noiseless dataset, processing the original dataset of 576-Rds-900-Cds and the dimensionality reduced dataset of 30-Rds-35-Cds. Figure 11(a) shows that ALE will approach 0 when a noiseless detecting signal or a detecting signal with SNR = 20 dB is input. SC-IA can achieve high accuracy and is robust when the SNR of the dictionary is 15 dB. After the SNR of the dictionary is higher than 25 dB, SC-IA can achieve very low error even if the detecting signal with SNR = 10 dB is input.

Localization performance of SC-IA and SSC-IA with the noisy dictionary when the detection signal is noisy with SNR = 10 and 20 dB and noiseless: (a) average localization error of SC-IA and (b) average localization error of SSC-IA.
After the SNR of the dictionary reaches 10 dB, ALE will approach 0 when the noiseless detecting signal or the detecting signal with SNR = 20 dB is input as in Figure 11(b), which indicates that the proposed SSC-IA can maintain high accuracy and robustness. However, when the SNR of the detecting signal is 10 dB, SSC-IA may fail to maintain the localizing accuracy. In addition, it can be found that when the SNR of the dictionary is 10 dB, the localizing accuracy of SSC-IA is greater than that of SC-IA, mainly because EVD is actually a denoising process of feature extraction.
From Figure 12, the localization performance of SSC-IA and SC-IA is greater than the that of other three algorithms, and SSC-IA is superior to SC-IA with the noisy dictionary.

Performance comparison between proposed algorithm and other algorithms for processing noiseless detection signals.
Therefore, it can be considered that SC-IA and SSC-IA can maintain good performance in terms of accuracy and robustness even if the dataset is affected by a certain degree of noise during the dictionary constructing stage and the target detecting stage.
Multi-targets localization
The localizing performance of the proposed SC-IA and SSC-IA for multi-targets, which is the most practical requirement in smart city, will be evaluated in this section. The case of two targets is taken as an example. Table 4 provides six possible locations of the simulated two targets.
Six locations of the two simulated targets (unit: meter).
Figure 13 is a comparison of the localizing results of the two targets. Figure 13(a) shows that the MLE for the last five cases is almost zero for SC-IA based on the dictionary of 576-Rds-900-Cds so as to prove that the two target locations can be accurately estimated. However, for the first case, target 1 can be accurately located, while there is a small LE of 0.63 m at its adjacent location of (4,6) for target 2. When a 15-Rds-35-Cds low-dimensional dictionary is adopted by SSC-IA, there will be a small error in the fourth case and a large error in the first case. When using a dictionary of20-Rds-35-Cds, the LE of the first case can be reduced significantly. In addition, when using the dictionary of 25-Rds-35-Cds, there is no error except for the first case, and the same localization accuracy is achieved by SC-IA. Figure 13(b) further proves that SSC-IA is superior to the other three algorithms when the data dimension is reduced to 15-Rds-35-Cds, 20-Rds-35-Cds, and 25-Rds-35-Cds. Therefore, SSC-IA with 25-Rds-35-Cds data can achieve the same high accuracy with SC-IA for two target localization.

Comparison of localization performance between SC-IA and SSC-IA locating two targets: (a) mean localization error of SC-IA and SSC-IA and (b) comparison of the mean localization error between SSC-IA and other three algorithms for processing 15, 20, and 25 rows of data, respectively, where KNN cannot locate multiple targets.
In addition, the execution time costs of OMP, SSC-OMP, SC-IA, and SSC-IA in processing different dimensional data are listed and compared in Table 5 to show the time cost of sparse coding to process high-dimensional data, and the acceleration effect when the input data is reduced from 576-Rds-900-Cds to low dimension.
Time costs for processing different dimensional data.
OMP: orthogonal matching pursuit, SC-IA: sparse coding based iterative shrinkage threshold algorithm; SSC: subspace-based sparse coding; SSC-IA: subspace sparse coding-based iterative shrinkage threshold algorithm.
It can be seen from Table 5, when the original data are 576-Rds-900-Cds, the time costs of OMP and SC-IA are 0.35 s and 2.0 × 10−3 s, respectively, which indicates that the proposed SC-IA is more efficient than OMP in processing high-dimensional data of EMDL. For the low-dimensional data of 30-Rds-35-Cds, the relative time costs are further reduced to 3.6 × 10−4 and 1.9 × 10−4 s by SSC, which means that the subspace sparse coding algorithm can accelerate the convergence significantly to improve the localizing efficiency remarkably.
Conclusion
Indoor location sensing is one of the important elements in smart city. In this article, an EMDL system is proposed to provide effective and accurate DFL for multi-targets, so as to expand new practical application as well as accelerate the development of relative intellectual technologies. Considering the accuracy requirement on DFL for multi-targets, a sparse coding model and a sparse coding method based on iterative shrinkage threshold algorithm (SC-IA) are proposed in this article. The high localizing accuracy, however, causes the sample data to increase seriously and inevitably. Aiming at the curse of dimensionality in multi-targets localization, a subspace sparse coding mechanism is presented to optimize SC-IA as SSC-IA, for obtaining low-dimension data and high efficiency by maintaining the localizing accuracy. Simulating results show that, compared with three typical algorithms of FDL, the proposed EMDL based on SC-IA can achieve much higher localizing accuracy and that based on SSC-IA can reduce the data dimension more greatly, so as to perform better than the other typical algorithms, both a single target localization and multiple targets localization. Moreover, experiments in noisy environment prove that the proposed EMDL system possesses better robustness. Finally, the analyses on operating efficiency further suggest that EMDL has obvious advantages in localization time cost. The superiorities in accuracy and efficiency make the proposed EMDL have more practical application value in the development of smart cities. Future research direction will focus on the moving target and the indoor navigation based on SSC.
Footnotes
Handling Editor: Feng Ye
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China (61771186), University Nursing Program for Young Scholars with Creative Talents in Heilongjiang Province (UNPYSCT-2017125), Distinguished Young Scholars Fund of Heilongjiang University, and postdoctoral Research Foundation of Heilongjiang Province (LBH-Q15121).
