
Abstract

The moving least squares (MLS) and moving total least squares (MTLS) methods are two of the most popular methods for reconstructing measurement data, on account of their good local approximation accuracy. However, their reconstruction accuracy and robustness are greatly reduced when the measurement data contain outliers. This article proposes an improved MTLS method (IMTLS), which introduces an improved random sample consensus (RANSAC) algorithm and a correction parameter in the support domain to deal with outliers and random errors. Based on the nodes within the support domain, the improved RANSAC is first used to generate a model for establishing the group of pre-interpolation and calculating the residual of each node. Subsequently, the abnormal degree of the node with the largest residual is evaluated by the correction parameter, which is associated with the node residual and the random errors. A node with a sufficient abnormal degree is eliminated, and the remaining nodes are used to obtain the approximation coefficients. The correction parameter allows data reconstruction without insufficient or excessive elimination. The results of numerical simulation and a measurement experiment show that the reconstruction accuracy and robustness of the IMTLS method are superior to those of the MLS and MTLS methods.

Keywords

Measurement data, outliers, moving total least squares, random sample consensus, surface reconstruction

Introduction

After decades of development, reconstruction algorithms for discrete data play an essential role in many engineering and scientific fields, especially in error analysis and data processing. Conventional numerical methods, such as the finite element method (FEM), interpolate or approximate the nodes by defining a mesh based on known nodes.¹ However, the accuracy of reconstruction is greatly reduced, and the fitting may even fail, when a grid-based method is used to deal with large deformation and discontinuity problems.² In addition, the labor and time costs of generating meshes in complex-shaped domains are high. Meshless methods, in contrast, use node-based approximation without mesh discretization, which greatly improves the efficiency and accuracy of data processing.³ To meet the requirements of different fields, a multitude of meshless methods have been presented and applied, such as the element-free Galerkin (EFG) method,⁴ the diffuse element method (DEM),⁵ and moving least squares (MLS).

McLain⁶ proposed the weighted least squares method. On this basis, Lancaster and Salkauskas⁷ introduced the moving concept and proposed the MLS method in 1981. The MLS has since become an important method for constructing shape functions. Unlike the traditional least squares method using complete polynomials, the shape function of the MLS is composed of a coefficient vector and a basis function vector, which can achieve higher continuity with a low-order basis function.⁸ The introduction of a weight function with compact support makes the reconstructed curve or surface accurate and smooth, which has contributed to the wide application of the MLS in various fields. For example, the MLS has been applied to solve elasticity problems,⁹ the compressible Navier-Stokes,¹⁰ Kuramoto-Sivashinsky,¹¹ and Burgers equations,¹² and to estimate mathematical models based on discrete points.¹³ Owing to its good performance, the MLS method is often used in combination with other methods to construct shape functions. In finite element analysis, the introduction of the MLS method enhances the shape function of the active element.¹⁴ In smoothed particle hydrodynamics (SPH), the MLS method has been used to construct kernel functions with higher consistency.¹⁵

The MLS method obtains the local fitting coefficients through the weighted least squares (LS)¹⁶ method in the support domain¹⁷ based on the Gauss Markov error model, in which only the dependent variable contains errors.¹⁸ The total least squares (TLS)¹⁹ method is an estimation method based on the errors-in-variables (EIV) model. Unlike the Gauss Markov model, the errors of both independent variable and dependent variable are considered in the EIV model.²⁰ By replacing the LS estimation with the TLS estimation in the support domain, the MLS method is transformed into the moving total least squares (MTLS).²¹

Nevertheless, owing to factors such as disturbances in the measurement environment, measurement data often contain outliers that deviate seriously from the actual value.²²,²³ The LS and TLS estimators used in the support domains of the MLS and MTLS methods, respectively, are not robust.²⁴,²⁵ When there are outliers in the domain, the fitted values around the outliers deviate considerably. Many studies have been carried out to reduce the negative impact of outliers, and the proposed robust algorithms fall into two main types. The first selects a subsample from the discrete points to obtain the regression coefficients.²⁶,²⁷ If an outlier is not in the selected subsample, it is automatically eliminated. However, this approach may also eliminate some non-outliers, in which case the accuracy of approximation is significantly affected.²⁸ The second identifies outliers first and then weakens their influence by assigning weights to them. With this approach, it is difficult to determine appropriate weights when multiple outliers of different magnitudes exist in the discrete data.

To weaken the impact of outliers on reconstruction, we propose an improved MTLS (IMTLS) reconstruction method in this article. In the support domain, we deal with outliers by introducing an improved random sample consensus (RANSAC) algorithm and a correction parameter, and the local approximation coefficients are then obtained by TLS estimation. RANSAC is a robust model estimation algorithm, especially when the measurement data contain a high proportion of outliers. However, it has limitations in data fitting that affect its accuracy and stability.²⁹ Therefore, the RANSAC algorithm needs to be improved to estimate a relatively reliable initial model. The correction parameter, which is associated with the random error and the node residual, is introduced to detect and eliminate abnormal nodes.

The rest of this article is structured as follows: the second section is a brief introduction to the MLS, MTLS, and RANSAC algorithm, the third section explains the principle and procedure of the proposed algorithm in detail, and the fourth part verifies the performance of the IMTLS method through numerical simulation and experimental data.

Introduction to the basic algorithms

The MLS method

Consider that there are n discrete points x = [x₁, x₂, …, x_n] and corresponding y = [y₁, y₂, …, y_n] in a bounded area Ω. In the MLS method, the trial function f(x) for the fitting point x can be written as

f (x) = \sum_{j = 1}^{m} p_{j} (x) a_{j} (x) = p^{T} (x) a (x)

(1)

where p(x) = [p₁(x), p₂(x), …, p_m(x)]^T is a vector of monomial basis functions, m is the number of basis functions, and a(x) = [a₁(x), a₂(x), …, a_m(x)]^T is the coefficient vector to be solved. In this article, for 2D curve and 3D surface fitting, we choose p(x) = [1, x]^T and p(x) = [1, x, y]^T, respectively.

To solve for the optimal a(x), a function E(x) that is quadratic in a(x) is constructed, that is,

E (x) = \sum_{J = 1}^{n} θ (‖ x - x_{J} ‖ / r) {(\sum_{j = 1}^{m} p_{j} (x_{J}) a_{j} (x) - y_{J})}^{2}

(2)

where r controls the size of the support domain, and θ(‖x − x_J‖/r) is a non-negative, compactly supported weight function that assigns a weight to each node according to its position relative to x. The fitting property of the MLS algorithm is influenced by the weight function. For example, the fit becomes interpolating when θ(0) = ∞.³⁰ There are many kinds of weight functions with compact support, such as exponential, Gaussian, and cubic spline weight functions. This article uses the exponential weight function, that is,

θ (s) = \begin{cases} \dfrac{\exp (- α s^{2}) - \exp (- α)}{1 - \exp (- α)}, & s \leq 1 \\ 0, & \text{otherwise} \end{cases}

(3)

where α is a coefficient related to the convergence speed.
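The weight function of equation (3) can be sketched in a few lines of Python. This is only an illustration: the function name `exp_weight` and the value α = 4 are assumptions, since the article does not state the α it uses.

```python
import numpy as np

def exp_weight(s, alpha=4.0):
    """Exponential weight of equation (3); alpha=4.0 is an assumed value."""
    s = np.asarray(s, dtype=float)
    inside = (np.exp(-alpha * s**2) - np.exp(-alpha)) / (1.0 - np.exp(-alpha))
    # Compact support: the weight is zero for s > 1.
    return np.where(s <= 1.0, inside, 0.0)

# Weights for normalized distances s = ||x - x_J|| / r.
s = np.array([0.0, 0.5, 1.0, 1.5])
print(exp_weight(s))
```

The weight equals 1 at s = 0, decreases monotonically, and reaches 0 at the edge of the support domain (s = 1), so a larger α concentrates the weight closer to the fitting point.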

According to the principle of least squares, the coefficient vector a(x) can be solved by

\frac{\partial E (x)}{\partial a_{j} (x)} = 2 \sum_{i = 1}^{n} θ (‖ x - x_{i} ‖ / r) [\sum_{k = 1}^{m} p_{k} (x_{i}) a_{k} (x) - y_{i}] p_{j} (x_{i}) = 0

(4)

where j = 1, 2, …, m. Solving equation (4) gives

a (x) = A^{- 1} (x) C (x) u

(5)

where

\begin{aligned} u &= [y_{1}, y_{2}, \dots, y_{n}]^{T} \\ A (x) &= \sum_{i = 1}^{n} θ (‖ x - x_{i} ‖ / r) p (x_{i}) p^{T} (x_{i}) \\ C (x) &= [θ (‖ x - x_{1} ‖ / r) p (x_{1}), θ (‖ x - x_{2} ‖ / r) p (x_{2}), \dots, θ (‖ x - x_{n} ‖ / r) p (x_{n})] \end{aligned}

Substituting equation (5) into equation (1), the function f(x) can be written as

f (x) = p^{T} (x) A^{- 1} (x) C (x) u

(6)
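Equations (1) to (6) can be illustrated with a minimal NumPy sketch for the 2D curve case with p(x) = [1, x]^T. The names `exp_weight` and `mls_fit` and the value α = 4 are assumptions made for this example, not part of the original method description.

```python
import numpy as np

def exp_weight(s, alpha=4.0):
    """Exponential weight of equation (3); alpha is an assumed value."""
    s = np.asarray(s, dtype=float)
    inside = (np.exp(-alpha * s**2) - np.exp(-alpha)) / (1.0 - np.exp(-alpha))
    return np.where(s <= 1.0, inside, 0.0)

def mls_fit(x_eval, x_nodes, y_nodes, r, alpha=4.0):
    """Evaluate f(x) = p^T(x) A^{-1}(x) C(x) u at each fitting point."""
    x_nodes = np.asarray(x_nodes, dtype=float)
    y_nodes = np.asarray(y_nodes, dtype=float)
    x_eval = np.atleast_1d(np.asarray(x_eval, dtype=float))
    P = np.column_stack([np.ones_like(x_nodes), x_nodes])  # row i is p^T(x_i)
    fitted = np.empty_like(x_eval)
    for k, x in enumerate(x_eval):
        theta = exp_weight(np.abs(x - x_nodes) / r, alpha)
        A = P.T @ (theta[:, None] * P)       # A(x) = sum_i theta_i p(x_i) p^T(x_i)
        C = P.T * theta                      # column i is theta_i p(x_i)
        a = np.linalg.solve(A, C @ y_nodes)  # equation (5)
        fitted[k] = a[0] + a[1] * x          # equation (1) with p(x) = [1, x]^T
    return fitted

x_nodes = np.linspace(0.0, 5.0, 51)
y_nodes = 2.0 * x_nodes + 1.0
print(mls_fit([1.23, 2.5], x_nodes, y_nodes, r=0.5))
```

With a linear basis the MLS fit reproduces linear data exactly, which gives a quick sanity check. The sketch assumes each support domain contains at least two nodes with non-zero weight; otherwise A(x) is singular.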

The MTLS method

Unlike the MLS using LS estimation to obtain the local fitting coefficients, the MTLS method applies the TLS estimation for those coefficients, in which the errors of both dependent variable and independent variables can be considered. In TLS, the solution of linear equation is considered, that is,

(B + E_{B}) X = L + e

(7)

where B and L are the coefficient matrix and the observation vector, respectively, E_B and e are the random errors of B and L, respectively, and X is the coefficient vector to be solved. According to the principle of total least squares, the solution of equation (7) can be transformed into a constrained optimization problem, that is,

X = \arg \min (e_{B}^{T} e_{B} + e^{T} e) \quad \text{subject to} \quad (B + E_{B}) X = L + e

(8)

where e_B = vec(E_B).

Therefore, X can be solved by Lagrange multiplier method. The function is constructed as

J (e, e_{B}, φ, X) = e^{T} e + e_{B}^{T} e_{B} + 2 φ^{T} (L + e - BX - E_{B} X)

(9)

where φ is the vector of Lagrange multipliers. Setting the first partial derivatives of J with respect to e, e_B, φ, and X to zero gives

\begin{cases} \frac{1}{2} \frac{\partial J}{\partial e} = e + φ = 0 \\ \frac{1}{2} \frac{\partial J}{\partial e_{B}} = E_{B} - φ X^{T} = 0 \\ \frac{1}{2} \frac{\partial J}{\partial φ} = L + e - BX - E_{B} X = 0 \\ \frac{1}{2} \frac{\partial J}{\partial X} = - (B^{T} φ + E_{B}^{T} φ) = 0 \end{cases}

(10)

From equation (10), we obtain

\begin{cases} B^{T} BX - B^{T} L = X μ \\ L^{T} L - L^{T} BX = μ \end{cases}

(11)

where μ = e_B^T e_B + e^T e.

Equation (11) can be transformed into

[B \; L]^{T} [B \; L] \begin{bmatrix} X \\ - 1 \end{bmatrix} = \begin{bmatrix} B^{T} B & B^{T} L \\ L^{T} B & L^{T} L \end{bmatrix} \begin{bmatrix} X \\ - 1 \end{bmatrix} = μ \begin{bmatrix} X \\ - 1 \end{bmatrix}

(12)

It can be seen from equation (12) that the problem of solving total least squares is transformed into obtaining the eigenvalues and eigenvectors of matrix [B L]^T [B L]. When μ is taken as the smallest eigenvalue of the matrix [B L]^T [B L], X in the corresponding eigenvector is the solution of TLS. For the MTLS method, the construction of the fitting function is consistent with the MLS method, and the same weight function is used for the MTLS. The acquisition of local regression coefficients can be expressed as

D = [B_{W} \; L_{W}]^{T} [B_{W} \; L_{W}] = M Λ M^{- 1}

(13)

where

\begin{aligned} L_{W} &= [θ (‖ x - x_{1} ‖ / r) y_{1}, θ (‖ x - x_{2} ‖ / r) y_{2}, \dots, θ (‖ x - x_{n} ‖ / r) y_{n}]^{T} \\ B_{W} &= [θ (‖ x - x_{1} ‖ / r) p (x_{1}), θ (‖ x - x_{2} ‖ / r) p (x_{2}), \dots, θ (‖ x - x_{n} ‖ / r) p (x_{n})]^{T} \\ Λ &= diag (λ_{1}, λ_{2}, \dots, λ_{m + 1}), \quad λ_{1} \geq λ_{2} \geq \dots \geq λ_{m + 1} \\ M &= [v_{1}, v_{2}, \dots, v_{m + 1}], \quad D v_{i} = λ_{i} v_{i} \\ v_{i} &= [v_{i, 1}, v_{i, 2}, \dots, v_{i, m + 1}]^{T} \quad (i = 1, 2, \dots, m + 1) \end{aligned}

If λ_{m+1} ≠ λ_m, combining equations (12) and (13), the regression coefficient vector a(x) is

a (x) = - \frac{1}{v_{m + 1, m + 1}} {[v_{m + 1, 1}, v_{m + 1, 2}, \dots, v_{m + 1, m}]}^{T}

(14)

In the MLS method, the coefficient matrix B is assumed to be error-free, and only the observation vector L is contaminated. This assumption is often unrealistic in engineering, because errors such as sampling errors and instrument errors are also introduced into matrix B. Therefore, the MTLS method can give a more reasonable approximation than the MLS method when both the matrix B and the vector L contain errors.³¹
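As a hedged illustration of equations (13) and (14), the local MTLS coefficients at one fitting point can be computed from the eigenvector of D that belongs to its smallest eigenvalue. The sketch below assumes the 2D case with p(x) = [1, x]^T; the names `exp_weight` and `mtls_coeffs` and the value α = 4 are assumptions for this example.

```python
import numpy as np

def exp_weight(s, alpha=4.0):
    """Exponential weight of equation (3); alpha is an assumed value."""
    s = np.asarray(s, dtype=float)
    inside = (np.exp(-alpha * s**2) - np.exp(-alpha)) / (1.0 - np.exp(-alpha))
    return np.where(s <= 1.0, inside, 0.0)

def mtls_coeffs(x, x_nodes, y_nodes, r, alpha=4.0):
    """Local coefficients a(x) from the weighted matrix D of equation (13)."""
    x_nodes = np.asarray(x_nodes, dtype=float)
    y_nodes = np.asarray(y_nodes, dtype=float)
    theta = exp_weight(np.abs(x - x_nodes) / r, alpha)
    P = np.column_stack([np.ones_like(x_nodes), x_nodes])
    B_w = theta[:, None] * P          # rows theta_i p^T(x_i)
    L_w = theta * y_nodes             # entries theta_i y_i
    F_w = np.column_stack([B_w, L_w])
    D = F_w.T @ F_w
    eigvals, eigvecs = np.linalg.eigh(D)  # eigenvalues in ascending order
    v = eigvecs[:, 0]                     # eigenvector of the smallest eigenvalue
    if abs(v[-1]) < np.finfo(float).eps:
        raise ValueError("last eigenvector component is zero; no TLS solution")
    return -v[:-1] / v[-1]                # equation (14)

x_nodes = np.linspace(0.0, 5.0, 51)
y_nodes = 2.0 * x_nodes + 1.0
print(mtls_coeffs(2.5, x_nodes, y_nodes, r=0.5))
```

For error-free linear data the smallest eigenvalue is zero and the recovered coefficients match the intercept and slope of the line. `numpy.linalg.eigh` is used because D is symmetric.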

The RANSAC algorithm

The RANSAC algorithm, presented by Fischler and Bolles,³² can robustly estimate model parameters. It has been applied in many fields, especially computer vision,³³,³⁴ owing to its remarkably simple structure and its good performance on data with a very high proportion of outliers. The main idea of RANSAC is to calculate the parameters of a hypothesized model from a randomly sampled subset of the entire dataset,³⁵ and then to evaluate the model on the entire dataset. During this process, an empirically chosen distance threshold³⁶ is used to calculate the inlier rate of each model, and sampling stops when the number of iterations reaches a predefined value. The parameters corresponding to the highest inlier rate (the highest consensus) are selected as the model parameters of the total sample. The calculation steps of the RANSAC algorithm are briefly described as follows:

Algorithm 1. The RANSAC method.
Input: discrete data set C = {x_i, y_i}_{i=1}^{N}
Output: model G(x)
Set Q = 0
for (j = 1 to k) do
  Randomly select a minimum subset from C to calculate model g(x) and get the residuals
  Count the number of inliers q of the current model according to distance threshold
  if (q > Q) then
    G(x) = g(x) and Q = q
  end if
end for
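The steps of Algorithm 1 can be sketched in Python for a straight-line model (ζ = 2). This is an illustrative sketch only: the function name `ransac_line`, the use of vertical residuals as the distance, and the example data are assumptions.

```python
import numpy as np

def ransac_line(x, y, threshold, k, rng=None):
    """Standard RANSAC for y = intercept + slope * x, following Algorithm 1."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    rng = np.random.default_rng(rng)
    best_model, best_count = None, 0      # G(x) and Q
    for _ in range(k):
        i, j = rng.choice(len(x), size=2, replace=False)  # minimum subset
        if x[i] == x[j]:
            continue                      # a vertical pair cannot define the model
        slope = (y[j] - y[i]) / (x[j] - x[i])
        intercept = y[i] - slope * x[i]
        residuals = np.abs(y - (intercept + slope * x))   # assumed vertical distance
        count = int(np.sum(residuals <= threshold))       # number of inliers q
        if count > best_count:
            best_model, best_count = (slope, intercept), count
    return best_model, best_count

x = np.linspace(0.0, 1.0, 20)
y = 3.0 * x + 0.5
y[5] += 10.0    # two hypothetical outliers
y[12] -= 10.0
print(ransac_line(x, y, threshold=1e-6, k=200, rng=0))
```

Because the subsets are drawn at random, two runs with different seeds may evaluate different candidate models.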

In order to get a subsample that is all inliers with probability p, the sampling times k can be obtained by the following formula,³⁷ that is,

k = \frac{\log (1 - p)}{\log (1 - {(1 - w)}^{ζ})}

(15)

where w is the proportion of outliers in the entire dataset, and ζ is number of data nodes needed to calculate model parameters. For example, ζ = 2 is taken in the linear parameter estimation. Then a reliable model is obtained through multiple iterations, and the node with the residual outside the distance threshold is considered as outlier.
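As a small worked example of the sampling-times formula, the sketch below takes w as the proportion of outliers, as defined in the text, so that a single node is an inlier with probability 1 − w and a subsample of ζ nodes is all inliers with probability (1 − w)^ζ. The function name `ransac_iterations` and the rounding up to an integer are assumptions.

```python
import math

def ransac_iterations(p, w, zeta):
    """Sampling times k for success probability p, outlier ratio w, subset size zeta."""
    if not (0.0 < p < 1.0 and 0.0 < w < 1.0):
        raise ValueError("p and w must lie strictly between 0 and 1")
    all_inliers = (1.0 - w) ** zeta   # probability that one subsample is outlier-free
    return math.ceil(math.log(1.0 - p) / math.log(1.0 - all_inliers))

for w in (0.1, 0.3, 0.5):
    print(w, ransac_iterations(p=0.99, w=w, zeta=2))
```

The required number of samples grows quickly with the outlier proportion and with ζ, which is one reason the choice of k matters for the standard algorithm.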

The improved MTLS method

When there are outliers in the measurement data, the robustness and accuracy of both the MLS and MTLS method are not satisfactory due to their construction principles.³⁸ Therefore, this article proposes an IMTLS method to reduce the negative impact of outliers on reconstruction. In this method, an improved RANSAC and a correction parameter are introduced into the support domain of the MTLS method to detect and eliminate abnormal nodes, and then the local fitting coefficients based on the TLS estimation can be obtained. The improved RANSAC algorithm is different from the standard RANSAC algorithm in two aspects. Firstly, it considers every possible subsample to generate hypothesized models, which can enhance the stability of data reconstruction. Secondly, the threshold is automatically set to a value related to the random error of the data, which reduces the negative impact of the original method to set a threshold empirically.

Next, we introduce the principle and procedure of the reconstruction in the support domain, as shown in Figure 1. Assuming that there are N nodes in a single support domain, the reconstruction process of the IMTLS method can be divided into three main steps. Firstly, an estimation model is obtained by the improved RANSAC algorithm. Secondly, the abnormal node is eliminated by introducing a correction parameter δ. In this step, whether to eliminate the node with the largest residual is determined by comparing d_max/δ with l, where d_max is the maximum residual and l is set automatically according to the random error of the data. Lastly, the local approximation coefficients are determined by TLS estimation. In the support domain, the calculation process of the IMTLS method is as follows:

Figure 1.

The fitting procedure in the support domain of IMTLS method.

Algorithm 2. The IMTLS method.
Input: discrete data C = {x_i, y_i}_{i=1}^{N}, parameters δ and l
Output: local regression parameter a
Set Q = 0
Select all different combinations of ζ nodes from C, denoted as C_1, C_2, …, C_{C_N^ζ}
for (j = 1 to C_N^ζ) do
  Get model g(x) based on C_j and residuals d = {d_i}_{i=1}^{N}
  Count the number of inliers q of the model according to l
  if (q > Q) then
    Q = q and D = d
  end if
end for
Get maximum residual d_max = max(D)
if (d_max/δ ≥ l) then
  Eliminate the node with d_max
else
  Reserve the node with d_max
end if
Remaining data is processed based on TLS to get a
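The procedure of Algorithm 2 in a single support domain can be sketched as follows for a straight-line model (ζ = 2). This is a minimal sketch under stated assumptions, not the authors' code: the names `exp_weight` and `imtls_local`, the use of vertical residuals, the value α = 4, and the example values of l and δ are all hypothetical, and at most one node is eliminated, as in the algorithm listing.

```python
import numpy as np
from itertools import combinations

def exp_weight(s, alpha=4.0):
    """Exponential weight of equation (3); alpha is an assumed value."""
    s = np.asarray(s, dtype=float)
    inside = (np.exp(-alpha * s**2) - np.exp(-alpha)) / (1.0 - np.exp(-alpha))
    return np.where(s <= 1.0, inside, 0.0)

def imtls_local(x, x_nodes, y_nodes, r, l, delta, alpha=4.0):
    """Local IMTLS coefficients at fitting point x from the nodes of one support domain."""
    x_nodes = np.asarray(x_nodes, dtype=float)
    y_nodes = np.asarray(y_nodes, dtype=float)
    n = len(x_nodes)
    # Step 1: improved RANSAC, every pair of nodes generates a candidate model.
    best_q, best_d = 0, None
    for i, j in combinations(range(n), 2):
        if x_nodes[i] == x_nodes[j]:
            continue
        slope = (y_nodes[j] - y_nodes[i]) / (x_nodes[j] - x_nodes[i])
        intercept = y_nodes[i] - slope * x_nodes[i]
        d = np.abs(y_nodes - (intercept + slope * x_nodes))  # assumed vertical residuals
        q = int(np.sum(d <= l))
        if q > best_q:
            best_q, best_d = q, d
    if best_d is None:
        raise ValueError("no valid pair of nodes in the support domain")
    # Step 2: eliminate the node with the largest residual if d_max / delta >= l.
    keep = np.ones(n, dtype=bool)
    worst = int(np.argmax(best_d))
    if best_d[worst] / delta >= l:
        keep[worst] = False
    # Step 3: weighted TLS on the remaining nodes (needs at least three of them).
    theta = exp_weight(np.abs(x - x_nodes[keep]) / r, alpha)
    P = np.column_stack([np.ones(int(keep.sum())), x_nodes[keep]])
    F_w = np.column_stack([theta[:, None] * P, theta * y_nodes[keep]])
    _, _, Vt = np.linalg.svd(F_w, full_matrices=False)
    v = Vt[-1]                      # right singular vector of the smallest singular value
    return -v[:-1] / v[-1], keep

x_nodes = np.linspace(0.0, 1.0, 11)
y_nodes = 2.0 * x_nodes + 1.0
y_nodes[4] += 5.0                   # one hypothetical outlier
a, keep = imtls_local(0.5, x_nodes, y_nodes, r=0.6, l=0.01, delta=1.0)
print(a, np.flatnonzero(~keep))
```

A larger δ makes the test d_max/δ ≥ l harder to satisfy, so fewer nodes are eliminated. In the limit no node is removed and the step reduces to ordinary weighted TLS.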

After eliminating the abnormal nodes, assume that t nodes x̄ = [x̄₁, x̄₂, …, x̄_t] and ȳ = [ȳ₁, ȳ₂, …, ȳ_t] are retained in the support domain. The solution of the TLS can also be obtained by the singular value decomposition (SVD).³⁹ For the linear equation (7), perform the SVD on the augmented matrix

F = [B \; L] = U Σ V^{T} = [\underbrace{U_{1}}_{m + 1} \; \underbrace{U_{2}}_{t - m - 1}] \begin{bmatrix} Γ \\ 0 \end{bmatrix} V^{T} = U_{1} Γ V^{T}

(16)

in which U = [U₁ U₂] (of size t × t) and V (of size (m+1) × (m+1)) are orthogonal matrices composed of the eigenvectors of FF^T and F^TF, respectively, and Γ = diag[ξ₁, ξ₂, …, ξ_{m+1}], where ξ₁ ≥ ξ₂ ≥ … ≥ ξ_{m+1} are the singular values of F. The matrices Γ and V can be partitioned as

Γ = \begin{bmatrix} Γ_{1} & 0 \\ 0 & Γ_{2} \end{bmatrix}, \quad V = \begin{bmatrix} V_{11} & V_{12} \\ V_{21} & V_{22} \end{bmatrix}, \quad Γ_{1}, V_{11} \in \mathbb{R}^{m \times m}, \quad Γ_{2}, V_{22} \in \mathbb{R}^{1 \times 1}

When V₂₂ is non-singular and ξ_m ≠ ξ_{m+1}, the solution of TLS is unique, that is,

X = - V_{12} V_{22}^{- 1}

(17)
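Equations (16) and (17) can be checked numerically with a short NumPy sketch. The function name `tls_svd` and the test data are assumptions for illustration; the cross-check against the eigenvector route of equation (12) is included only to show that the two formulations agree.

```python
import numpy as np

def tls_svd(B, L):
    """TLS solution X = -V12 / V22 from the SVD of the augmented matrix F = [B L]."""
    B = np.asarray(B, dtype=float)
    L = np.asarray(L, dtype=float)
    m = B.shape[1]
    F = np.column_stack([B, L])
    _, _, Vt = np.linalg.svd(F, full_matrices=False)
    V = Vt.T
    V12 = V[:m, m]     # first m entries of the last right singular vector
    V22 = V[m, m]      # last entry, a scalar here
    if abs(V22) < np.finfo(float).eps:
        raise ValueError("V22 is zero; the TLS solution does not exist")
    return -V12 / V22

rng = np.random.default_rng(1)
x = np.linspace(0.0, 1.0, 30)
B = np.column_stack([np.ones_like(x), x])
L = B @ np.array([1.0, 2.0]) + 0.01 * rng.normal(size=x.size)
X_svd = tls_svd(B, L)

# Cross-check with the smallest-eigenvalue eigenvector of [B L]^T [B L].
F = np.column_stack([B, L])
_, eigvecs = np.linalg.eigh(F.T @ F)
X_eig = -eigvecs[:2, 0] / eigvecs[2, 0]
print(X_svd, X_eig)
```

The SVD route avoids forming F^T F explicitly, which squares the condition number, so it is usually the numerically safer of the two.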

For the IMTLS method, the matrix F is weighted, that is,

F_{W} = [B_{W} L_{W}] = U_{W} Σ_{W} V_{W}^{T}

(18)

where

\begin{aligned} L_{W} &= [θ (‖ x - {\bar{x}}_{1} ‖ / r) {\bar{y}}_{1}, θ (‖ x - {\bar{x}}_{2} ‖ / r) {\bar{y}}_{2}, \dots, θ (‖ x - {\bar{x}}_{t} ‖ / r) {\bar{y}}_{t}]^{T} \\ B_{W} &= [θ (‖ x - {\bar{x}}_{1} ‖ / r) p ({\bar{x}}_{1}), θ (‖ x - {\bar{x}}_{2} ‖ / r) p ({\bar{x}}_{2}), \dots, θ (‖ x - {\bar{x}}_{t} ‖ / r) p ({\bar{x}}_{t})]^{T} \end{aligned}

The solution expression of equation (17) is rewritten as

X_{W} = - V_{W 12} V_{W 22}^{- 1}

(19)

In the support domain of the IMTLS method, abnormal nodes are automatically detected and eliminated by the improved RANSAC method and the introduced correction parameter δ. Outliers fall among the abnormal nodes and are therefore processed effectively. The local fitting coefficients are then obtained by the TLS estimation. As the fitting point x moves, outliers and random errors are processed over the entire domain.

Case verification

In this section, both numerical and experimental cases are processed by the MLS, MTLS, and IMTLS methods to verify the performance of the proposed method.

Case 1

Take the curve function

y = \sin (0.3 π x) + 0.8 \sin (0.5 π x)

(20)

as a model to generate uniformly distributed discrete points (x_0i, y_0i). Outliers (0, Δy_i) and random errors of normal distribution with zero mean are added to (x_0i, y_0i) to get simulation data (x_1i, y_1i). The reconstruction points (x_2i, y_2i) are obtained after reconstruction. The sum of absolute differences can be presented as

s = \sum_{i = 1}^{n} | y_{0 i} - y_{2 i} |

(21)

which is considered as the index to evaluate three algorithms.

In this case, the parameter

l = \sqrt{σ_{x}^{2} + σ_{y}^{2}}

(22)

Let n = 201 and r = [max(x₀)−min(x₀)] × 4/100, where min(x₀) = 0 and max(x₀) = 5. Under the same random error condition (σ_x = 0.001 and σ_y = 0.001), we add four different outliers to (x_0i, y_0i), and the s values obtained by three algorithms are shown in Table 1. The fitting curves are shown in Figure 2.
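The setup of Case 1 can be sketched as follows. The outlier positions and magnitudes, the random seed, and the helper names `model`, `make_case1`, and `abs_error_sum` are hypothetical, since the article does not list the outlier values it used; the remaining parameters follow equations (20) to (22) and the values given in the text.

```python
import numpy as np

def model(x):
    """Curve function of equation (20)."""
    return np.sin(0.3 * np.pi * x) + 0.8 * np.sin(0.5 * np.pi * x)

def make_case1(n=201, sigma_x=1e-3, sigma_y=1e-3, outliers=None, seed=0):
    """Return ideal points (x0, y0) and simulated data (x1, y1)."""
    rng = np.random.default_rng(seed)
    x0 = np.linspace(0.0, 5.0, n)
    y0 = model(x0)
    x1 = x0 + rng.normal(0.0, sigma_x, n)   # zero-mean random error in x
    y1 = y0 + rng.normal(0.0, sigma_y, n)   # zero-mean random error in y
    for index, dy in (outliers or {}).items():
        y1[index] += dy                     # outlier (0, dy) at a chosen node
    return x0, y0, x1, y1

def abs_error_sum(y0, y2):
    """Evaluation index s of equation (21)."""
    return float(np.sum(np.abs(np.asarray(y0) - np.asarray(y2))))

sigma_x = sigma_y = 1e-3
l = np.hypot(sigma_x, sigma_y)                     # equation (22)
x0, y0, x1, y1 = make_case1(outliers={50: 0.5, 120: -0.4})  # hypothetical outliers
r = (x0.max() - x0.min()) * 4 / 100                # radius of the support domain
print(l, r, abs_error_sum(y0, y1))
```

Here `abs_error_sum(y0, y1)` measures the error of the unprocessed data; passing reconstructed values in place of `y1` gives the index s used to compare the three algorithms.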

Table 1.

The comparison of three methods in Case 1.

Error parameter	MLS	MTLS	IMTLS
s₁	1.974703	1.995617	0.177981
s₂	3.752185	3.852139	0.177980
s₃	3.854453	3.878887	0.179741
s₄	3.906223	4.180141	0.180511
s̄	3.371891	3.476696	0.179053
σ_s	0.808569	0.864746	0.001107

Figure 2.

The curves fitted by the three algorithms under different outlier conditions.

Table 1 and Figure 2 show that outliers can be effectively detected and eliminated by the IMTLS method, which reduces their negative impact on the reconstruction. Compared with the MLS and MTLS methods, the error values of the data reconstructed by the IMTLS method are much smaller.

To further verify the performance of the IMTLS method, we repeat this case with only random errors added to the simulation data. Table 2 shows that the error values obtained by the IMTLS fitting are still the smallest among the three reconstruction methods, which indicates that the IMTLS method also has good approximation performance in this situation. Figure 3 shows a bar graph of the error values under different random errors.

Table 2.

The comparison of processing random errors in Case 1.

σ_x	σ_y	s (MLS)	s (MTLS)	s (IMTLS)
0.000001	0.001	0.274568	0.152626	0.151917
0.00001	0.001	0.274228	0.152063	0.151351
0.0001	0.001	0.275371	0.153263	0.152555
0.001	0.001	0.285897	0.176635	0.176257
0.001	0.0001	0.282678	0.162684	0.161941
0.001	0.00001	0.281492	0.162066	0.161338
0.001	0.000001	0.280397	0.162179	0.161444

Figure 3.

The s values under different random errors.

Furthermore, the CPU time of each algorithm is measured in MATLAB on datasets of different sizes, n = 201 + i × 200 (i = 0, 1, …, 4). Two conditions are considered and presented in Tables 3 and 4, respectively (r₀ = [max(x₀)−min(x₀)] × 4/100, where max(x₀) = 5 and min(x₀) = 0). The first fixes the number of nodes in the support domain, and the second fixes the size of the support domain. In the first condition, the CPU time of the IMTLS method grows steadily with n. In the second condition, the CPU time of the IMTLS method increases rapidly with n, because more nodes are included in a single support domain. These results illustrate that the number of nodes in the support domain has a significant impact on the CPU time of the IMTLS method. Therefore, to ensure the efficiency of the IMTLS method in data reconstruction, the size of the support domain should be selected appropriately to control the number of nodes it contains.
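A rough count illustrates why the number of nodes per support domain drives the cost when every subsample is considered. The sketch assumes uniformly spaced nodes on [0, 5], a fitting point located on an interior node, a fixed radius r₀ equal to 4% of the range, and ζ = 2; the function name `nodes_in_support` is hypothetical.

```python
from math import comb

def nodes_in_support(n, percent=4):
    """Nodes within distance r0 of an interior node, for n uniform nodes."""
    # Spacing is range / (n - 1) and r0 is percent% of the range, so the
    # number of nodes on each side is floor((n - 1) * percent / 100).
    per_side = (n - 1) * percent // 100
    return 2 * per_side + 1

for n in (201, 401, 601, 801, 1001):
    N = nodes_in_support(n)
    print(n, N, comb(N, 2))
# 201 17 136
# 401 33 528
# 601 49 1176
# 801 65 2080
# 1001 81 3240
```

Each candidate model also requires N residual evaluations, so under these assumptions the work per support domain grows roughly with N · C(N, 2), in addition to the growth in the number of fitting points.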

Table 3.

The CPU times of three methods with a fixed number of nodes in support domain (s).

n	201	401	601	801	1001
r	r₀	(1/2)r₀	(1/3)r₀	(1/4)r₀	(1/5)r₀
MLS	0.0148	0.0375	0.0477	0.0602	0.0813
MTLS	0.0075	0.0164	0.0250	0.0422	0.0477
IMTLS	0.1473	0.3148	0.4531	0.5977	0.7516

Table 4.

The CPU times of three methods with a fixed radius of support domain (s).

n	201	401	601	801	1001
r	r₀	r₀	r₀	r₀	r₀
MLS	0.0148	0.0563	0.1227	0.2407	0.4063
MTLS	0.0075	0.0273	0.0547	0.6000	1.1250
IMTLS	0.1473	0.9633	2.9602	45.3609	90.3750

Case 2

In this case, the surface data of a standard ball are processed to verify the performance of the IMTLS method. The data are obtained by a commercial white light interferometer (WLI), the Taylor Hobson CCI 3000.

Figure 4 shows the schematic diagram of a WLI system. Firstly, a broadband illumination beam passes through an interferometric objective via a beam splitter. The beams reflected by the reference mirror and the measured surface are focused onto a camera. Interference fringes are generated when the optical path difference (OPD) between the reference and measurement arms is within the coherence length, and the visibility of the fringes increases as the OPD decreases. A series of interferograms can be obtained by scanning the objective. Surface data can be obtained by tracking all coherence peaks or by phase retrieval within the field of view of the objective.

Figure 4.

Schematic diagram of a WLI system.

The errors contained in the experimental data have a negative impact on obtaining the true profile of the measured ball, and this impact can be reduced by using an appropriate reconstruction algorithm. The three methods are each used to reconstruct the experimental data. The reconstructed data are then used for parameter regression based on a simulated annealing algorithm,⁴⁰ and the regressed radius is used as the evaluation index of the reconstruction method. The radius of the standard ball is calibrated as 14.402 mm. Since the random error of the measurement data cannot be calibrated, l is defined as the standard deviation of the residuals of the LS fit in the support domain. As shown in Figure 5, data selected at different locations are processed by the three algorithms. As shown in Table 5, the mean radius obtained with the IMTLS is the closest to the calibrated value among the three algorithms.
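The definition of l used in this case can be sketched as the standard deviation of the residuals of a plane fitted by ordinary LS to the nodes of one support domain. The sketch assumes an unweighted fit with the surface basis p = [1, x, y]^T and the population standard deviation; the article does not specify these details, and the name `ls_residual_std` and the synthetic data are hypothetical.

```python
import numpy as np

def ls_residual_std(x, y, z):
    """Standard deviation of the residuals of an LS plane z = a0 + a1*x + a2*y."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    z = np.asarray(z, dtype=float)
    P = np.column_stack([np.ones_like(x), x, y])
    coeffs, *_ = np.linalg.lstsq(P, z, rcond=None)
    return float(np.std(z - P @ coeffs))

# Synthetic 5 x 5 patch standing in for the nodes of one support domain.
gx, gy = np.meshgrid(np.arange(5.0), np.arange(5.0))
x, y = gx.ravel(), gy.ravel()
z_clean = 0.5 + 0.1 * x - 0.2 * y
z_outlier = z_clean.copy()
z_outlier[3] += 1.0                     # one hypothetical outlier

print(ls_residual_std(x, y, z_clean), ls_residual_std(x, y, z_outlier))
```

Because the LS fit is not robust, an outlier inflates the residuals and therefore l, which raises the elimination threshold in support domains that contain outliers.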

Figure 5.

The processing of experimental data.

Table 5.

The regression radii of three methods in Case 2 (mm).

Radius parameter	MLS	MTLS	IMTLS
R₁	14.340	14.435	14.405
R₂	14.441	14.393	14.408
R₃	14.723	14.632	14.613
R₄	14.615	14.617	14.420
R̄	14.530	14.519	14.462
σ_R	0.149	0.106	0.088

For the correction parameter δ introduced by the IMTLS method, we set different values to observe the change in the reconstruction result. As shown in Figure 6(a), when there are outliers in the data, the reconstruction result of the IMTLS method is clearly better than that of the other two algorithms. As δ increases, the result converges to that of the MTLS method, which shows that once δ exceeds a certain value, no node is eliminated in the support domain. As shown in Figure 6(b), when there are only random errors, the result of the IMTLS method is still the best among the three algorithms. Therefore, δ can be selected in this way for curve and surface reconstruction.

Figure 6.

Trendlines of three algorithms in Case 1: (a) the data of the fourth group and (b) the data only containing random errors.

Furthermore, we set different numbers of nodes that can be eliminated in the support domain and observe the number of points actually eliminated in each support domain. The data of the fourth group in Case 2 are taken as an example.

As shown in Figure 7, the l calculated from the residuals of the LS fit is relatively large in a support domain with outliers and relatively small in a support domain without outliers. Therefore, fewer points are eliminated in support domains with outliers than in those without.

Figure 7.

Number of points eliminated in each support domain under different conditions.

Figure 8 shows the processing results for different numbers of eliminated nodes. Some outliers are not effectively eliminated, because the permitted number of eliminated nodes is not sufficient to remove all outliers in some support domains. In addition, the processed data are regressed by simulated annealing to obtain the regression radii, as shown in Table 6. When the outliers are effectively processed, the corresponding regression radius is significantly closer to the calibrated value. This indicates that outliers have a strong negative influence on the parameter regression and can be processed effectively by setting an appropriate number of elimination nodes.

Figure 8.

The fitting surface under different conditions.

Table 6.

The regression radii under different conditions (mm).

R₄₄	R₄₃	R₄₂	R₄₁
14.420	14.394	14.464	14.484

In both the numerical simulation and the experimental data verification, the IMTLS method shows better performance. As seen from Case 1, compared with the MLS and MTLS methods, the IMTLS method can effectively deal with errors, regardless of whether there are outliers in the data. In addition, the IMTLS method inherits the good local approximation properties of the MTLS method. We further verify the algorithm by processing measurement data of a standard ball, with the regression radius as the evaluation index. As seen from Case 2, the result processed by the IMTLS algorithm is closer to the calibrated value.

Conclusion

The MLS and MTLS methods show good performance in the fitting of discrete data, such as effective approximation of local geometric features and high-order continuous approximation functions obtained with low-order basis functions. However, these two reconstruction methods are not robust, because outliers in the measurement data have a strongly negative impact on the fitting results. To reduce this impact, we proposed an improved MTLS method, in which an improved RANSAC and a correction parameter are introduced into the support domain of the MTLS method to process the abnormal nodes, and the local fitting coefficients are then obtained by the TLS estimation. In this way, the improved MTLS method not only retains the advantages of the MTLS method but can also deal effectively with outliers. We also verified the proposed algorithm on experimental data obtained by the CCI instrument. The results of the numerical and experimental cases show that the performance of the IMTLS method is significantly better than that of the other two algorithms.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China (Grant No. 51605094) and the Fundamental Research Funds for the Central Universities (Grant No. WK2090050042).

ORCID iDs

Dawei Tang

Shuwen Lin

Surface reconstruction method for measurement data with outlier detection by using improved RANSAC and correction parameter
