Sage Journals: Discover world-class research

Abstract

Tool wear prediction is paramount for guaranteeing the quality of the workpiece and improving lifetime of the cutter. However, the multicollinearity between the extracted features deteriorates the prediction accuracy. To overcome this, a partial least square regression-based method is proposed. The main characteristic of partial least square regression is that the regression analysis is realized in the principle component space so that multicollinearity between the input variables can be avoided. To testify the correctness of the proposed method, the milling experiment is preceded and the dynamic cutting force is collected to depict the variation of the tool wear. Moreover, Monte Carlo cross validation is adopted to improve the robustness of partial least square regression. The analysis and comparison between the partial least square regression model and the multiple linear regression model shows that the presented method can get more accurate results.

Keywords

Tool wear prediction milling process partial least square regression Monte Carlo cross validation multicollinearity

Introduction

With the development of modern manufacturing industries, more attention is focused on how to minimize cost and maximize productivity. Tool wear is one of major obstacles to realize large scale automation and minimize human intervention.¹ In comparison with tool status classification, tool wear prediction is preferred in some cases because the accurate tool wear value can be estimated so that the further process optimizing and control strategy can be taken in time. Previouly, many researchers have focused much energy on effective and accurate prediction of the tool wear value and multiple linear regression (MLR) is one of the commonly used methods to predict the tool wear value based on sensory signal. Jacob et al.² built a MLR model to predict the tool wear based on the average force and the average peak force. Bhattacharyya et al.³ proposed a two-stage model in which the MLR model was used in the first stage to relate the selected features to the tool wear value so that the tool wear value was predicted accurately. Li et al.⁴ realized the tool wear prediction with the combination of the MLR model and the wavelet-based features. These successful applications demonstrate the effectiveness of the MLR method. However, because all feature variables depend on the variation of tool wear status, the colinearity exists inevitably among them. Moreover, the feature vectors under the same tool wear value usually fluctuate within a certain scope because of the disturbance of the noisy signal and the complexity of tool wear topology. In this situation, the model coefficients of the MLR may change erratically in response to small changes in the model or the data, which will deteriorate the prediction accuracy correspondingly.

In this article, to overcome the shortcomings of MLR and improve the accuracy of tool wear prediction, a partial least square regression (PLSR) model is presented. PLSR combines the characteristics of principal component analysis and the MLR model. The main advantage is that the regression analysis is established in the principal component space in which the variables are independent from each other. Therefore, the colinearity between the selected variables can be avoided, which will improve the prediction accuracy greatly. The PLSR method has been adopted in many aspects, such as predicting infrared spectra,⁵ ripening time of Manchego cheese,⁶ couples mental health⁷ and time series modeling of process data.⁸ However, to the author’s knowledge, the evaluation of using PLSR to overcome the multicollinearity in the field of continuous tool wear prediction has not been reported. To testify the effectiveness of the proposed method, milling experiments of Titanium alloy are carried out and sixteen harmonics features are utilized to predict the tool wear value using the PLSR model. Moreover, the Monte Carlo cross validation (MCCV) method is also adopted to improve the robustness of the prediction model. The analysis and comparison between PLSR and MLR shows that the combination of PLSR with MCCV is more accurate to realize online tool wear prediction.

This article is organized as follows. The principle of PLSR and MCCV is given first. The experiments and harmonics based feature extraction method are then described. The analysis of the variation of extracted features shows that the multicollinearity exists between different harmonics. Based on MCCV, PLSR is utilized to build the relationship between the tool wear value and the harmonic features. To make a comparison, the MLR is also adopted to predict the tool wear value using the same data as the training and test samples. The analysis and comparison of different performance criteria show that the PLSR outperforms the MLR method. Some useful conclusions are given in the last section.

Principle of PLSR modeling and MCCV

Principle of PLSR modeling

To avoid the deterioration of the tool wear prediction accuracy because of the multicollinearity between the selected feature variables, the PLSR model is adopted that is built according to the following calibration model⁹

\begin{matrix} y = X β + e \\ E (e) = 0, Cov (e) = σ^{2} I \end{matrix}

(1)

where E(.) and Cov(.) denote the expectation and covariance, respectively. To extract the partial least square (PLS) components, the observation matrix X can be further decomposed as

X = t_{1} p_{1}^{t} + t_{2} p_{2}^{t} + \dots + t_{k} p_{k}^{t} + R = T P^{t} + R

(2)

where t _i and p _i are the PLS scores and loadings. If there are no measurement errors in the data matrix, equation (1) can be rewritten as

y = T P^{t} β + e = T α + e

(3)

where α is the simplification of P^t β , which denotes the regression coefficients in the PLS space. Because the number of PLS components are selected as k, the left q–k components can be considered as the representatives of the noise or the cause of the colinearity in the data set. Therefore, the corresponding elements in the matrix T and α are regarded as zeros, and only the former k components remain in the model, that is

y = T_{k} α_{k} + e

(4)

The number of components k is also called the dimension of the model. The least square solution of equation (4) is calculated by

{\hat{α}}_{k} = (T_{k}^{t} T_{k})^{- 1} T_{k}^{t} y

(5)

and the fitted value of y is

\hat{y} = T_{k} {\hat{α}}_{k} = X H_{k} (T_{k}^{t} T_{k})^{- 1} H_{k}^{t} X^{t} y

(6)

where H_k is the coefficients matrix that is used to build the relationship between the observation matrix X and the PLS scores matrix T_k, that is

T_{k} = X H_{k}

(7)

Therefore, the PLS estimator ${\hat{β}}_{k}$ of β with the former k components remaining in the model can be calculated by

{\hat{β}}_{k} = H_{k} (T_{k}^{t} T_{k})^{- 1} H_{k}^{t} X^{t} y

(8)

The detailed algorithm of PLSR modeling is described in Qingsong and Yizeng.⁹

Principle of MCCV

Although PLSR is an effective method to get rid of multicollinearity in the explanatory variables and realize the accurate modeling, it is difficult to determine the suitable number of latent variables so as to obtain the best predictive ability. Here, MCCV is presented to perform the cross validation several times iteratively based on the Monte Carlo algorithm. At each time, it splits the training samples into two parts S_c and S_v randomly and this process is repeated L times in which the MCCV criterion is defined as⁹

MCC V_{n_{V}} (k) = \frac{1}{L n_{V}} \sum_{i = 1}^{L} {‖ y_{{ks}_{v} (i)} - {\hat{y}}_{k s_{v} (i)} ‖}^{2}

(9)

where ${\hat{y}}_{k s_{v} (i)}$ is the predicted value using model built by S_c training dataset and $y_{k s_{v} (i)}$ is the measured value. This criterion is used to select the optimal k that gives the minimum MCCV value. Unlike leave-one-out cross validation, the MCCV is an asymptotically consistent method in determining the number of components in the calibration model. It can avoid an unnecessary large model, and therefore, decreases the risk of over-fitting. Therefore, the built model can be more robust and accurate. Here in the following section, MCCV is adopted for the PLSR modeling of tool wear value based on the feature vectors extracted from the force signal.

Experimental set-up and feature extraction

As shown in Figure 1, a milling experiment of Ti–6Al–4V titanium alloy was conducted in a Makino vertical machining center and the cutting forces generated during the machining process are measured by a three-axis piezoelectric dynamometer and collected by a data acquisition card (sampling at 10 kHz). The tool geometry and cutting parameters are described in Table 1. The tool wear status is measured by an optical microscope after every cutting pass. As shown in Figure 2, the cutter wear appeared around the tool nose zone. Therefore, the maximal vertical length of this area was measured to depict the tool wear status. Finally, 38 cutting passes were achieved and the force during each pass was collected continuously during the machining process. Because the fore signals are polluted by the noisy signal, it should be preprocessed first by getting rid of the trend item and the wild data. The filtered waveform of the cutting force in the feed direction and its amplitude spectrum are illustrated in Figure 3, from which we can see that the signal displays periodic characteristic and the dominant frequency components in the spectrum representation of cutting force are around the tooth passing frequency (TPF) and its integral multiple harmonics. Therefore, the amplitude corresponds to the TPF and its harmonics are selected as the feature vectors to depict the variation of the tool wear. The TPF can be calculated by

Figure 1.

Schematic diagram of the experimental set-up.

Table 1.

Tool geometry and cutting parameters.

Cutter diameter (mm)	Teeth number	Cutter geometry	Relief angle	Edge angle	Spindle speed (r/min)	Cut width (mm)	Cut depth (mm)	Feed rate (mm/z)
12	1	Parallelogram	11°	90°	1060	6	0.4	0.1

Figure 2.

Sketch of tool nose wear measurement.

Figure 3.

Waveform and spectrum of milling force signal.

TPF = \frac{SK}{60}

(10)

It has been proven that the amplitude of the cutting harmonics increase with the processing of the tool wear status.^10,11 Because the TPF and its harmonics are within the lower frequency band, the force signal is first decomposed into different scales with discrete wavelet decomposition in which the coefficients in the jth levels can be written as¹²

{\begin{matrix} c_{j} (i) = \sum_{m} h_{0} (m - 2 i) c_{j - 1} (m) \\ d_{j} (i) = \sum_{m} h_{1} (m - 2 i) c_{j - 1} (m) \end{matrix}

(11)

h₀ and h₁ are the low-pass and high-pass filters related to the wavelet function, and m and i are the index of the elements in the signal. Then, the amplitude spectrum of the low frequency signal c(i) at certain scale j is calculated by fast Fourier transform (FFT) analysis and the peak amplitude value around the TPF and its integral multiple harmonics can be obtained and organized as feature vectors. The whole flowchart of the feature extraction is demonstrated in Figure 4.

Figure 4.

Flowchart of harmonic-based feature extraction.

In this article, the scale of wavelet decomposition is selected as three, therefore the frequency band scope of the low frequency signal is within 0∼1.25 kHz. Because the amplitude of the harmonics in the spectrum graph decrease with the increase of the harmonic order on the whole and the amplitude corresponding to the first 16 harmonics are larger than the others, obviously, these amplitude values are selected and organized as the feature vector to depict the variation of the tool wear status. To show the generalization of the proposed model, the cutting force signal is first divided into 40 segments with the length of 8000 data points. So the total number of samples for each tool wear status is 40. The variation of the several selected features with the tool wear value is illustrated in Figure 5 by means of box plot, in which (a), (b) and (c) denotes the 1st-order harmonic feature, the 6th-order harmonic feature and the 11th-order harmonic feature, respectively. For each box, the central mark is the median value and the edges of the box are the 25th and 75th percentiles. The whiskers extend to the most extreme data points, which are not considered as outliers and the outliers are plotted individually, which are labeled as ‘+’. It can be seen that the median value of the harmonic features increase with the increasing of the tool wear value monotonically and these features share the common trend. To further analyze their relationship, the correlation coefficients of different harmonic features are calculated and some of the results are listed in Table 2.

Figure 5.

Feature variation under different orders.

Table 2.

Some of the correlation coefficients between different harmonic features.

Harmonic order	1st	3rd	7th	9th	11th	15th
2nd	0.98	0.99	0.92	0.5	0.79	0.71
4th	0.92	0.97	0.85	0.51	0.71	0.67
6th	0.89	0.90	0.97	0.62	0.74	0.68
8th	0.78	0.80	0.93	0.75	0.68	0.58
10th	0.82	0.81	0.81	0.70	0.95	0.65
14th	0.79	0.80	0.75	0.45	0.69	0.94

It can be seen that these feature vectors are highly correlated. When they are utilized as the predictor variables, multicollinearity is introduced between them unavoidably, which will deteriorate the prediction accuracy correspondingly. Therefore, to improve the prediction accuracy and robustness, the PLSR is adopted in the following section to predict the tool wear value and make comparison with MLR.

Tool wear prediction using PLSR and comparison with MLR

MCCV

In this section, PLSR is used to build the regression model and realize the prediction of tool wear based on MCCV. Here the repetition times of the MCCV are set to 500 and the maximum number of the latent variables is 16. The variation of the MCCV criterion with the increase of latent variables is illustrated in Figure 6. It can be seen that the value of the MCCV criterion goes down quickly as the number of component increases. However, after the number of the latent variables is larger than 10, it increases slightly. Therefore, the optimum number of the explanatory variables for PLSR is selected as 10.

Figure 6.

Variation of MCCV criterion with the number of latent variables.

Comparison of PLSR with MLR

After the optimum number of the latent variables is determined, the PLSR can be realized for online prediction of the tool wear value. Moreover, MLR is also adopted based on the same data here to make a comparison with the PLSR. To compare the performance of these two methods intuitively, four indicators are presented to depict the global prediction capability of the proposed method. The first is the average absolute error in predicting the tool wear value¹³

RMSEP = \sqrt{\frac{\sum_{i = 1}^{N} {(y_{i}^{'} - y_{i})}^{2}}{N}}

(12)

where ${y'}_{i}$ and y_i represent the model computed and measured values of the variable, respectively.

The second is the relative error of prediction of the dependent variable in percentage (REP), which is calculated as¹³

REP = \sqrt{\frac{\sum_{i = 1}^{N} {(y_{i}^{'} - y_{i})}^{2}}{\sum_{i = 1}^{N} y_{i}^{2}}} \times 100

(13)

The third is the accuracy factor (A_f), which indicates the spread of the results about the prediction¹³

A_{f} = 10^{(\sum_{i = 1}^{N} \frac{| \log (\frac{y_{i}^{'}}{y_{i}}) |}{N})}

(14)

The above indicators illustrate the accuracy of the model in different aspects. The smaller the indicator is, the higher the accuracy. The fourth is called the coefficient of determination (R²), which represents the percentage of variability that can be explained by the model¹³

R^{2} = {[\frac{N \sum_{i = 1}^{N} y_{i}^{'} y_{i} - \sum_{i = 1}^{N} y_{i}^{'} \sum_{i = 1}^{N} y_{i}}{\sqrt{N (\sum_{i = 1}^{N} y_{i}^{' 2}) - {(\sum_{i = 1}^{N} y_{i})}^{2}} \times (N (\sum_{i = 1}^{N} y_{i}^{' 2}) - {(\sum_{i = 1}^{N} y_{i})}^{2})}]}^{2}

(15)

The larger value means that the model has a stronger ability to reflect the relationship between the features and the tool wear status.

Based on these performance indicators, the numerical comparison of the PLSR and MLR under the same number of explanatory variables is realized and the results are listed in Table 3. It can be seen that the prediction accuracy of the PLSR is higher than MLR. To demonstration the difference of the prediction results between the PLSR and MLR more clearly, the comparison of the measured and predicted tool wear value under different cutting passes is illustrated in Figure 7. It can be seen that the accuracy of PLSR method is higher than MLR at both the beginning and final stage of the tool wear process. Therefore, it can be concluded that the PLSR has a strong ability to overcome the influence of multicollinearity so as to realize a more accurate online tool wear prediction.

Table 3.

Performance comparison of PLSR with MLR.

	RMSEP	REP	A _f	R ²
PLSR	0.0169	10.62	154.9	0.9397
MLR	0.0177	11.09	213.7	0.9342

PLSR: partial least square regression; MLR: multiple linear regression.

Figure 7.

Comparison of predicted tool wear with measured value.

Conclusions

In this article, to overcome the influence of multicollinearity and improve the accuracy, a PLSR model is presented to realize continuous tool wear prediction. The main characteristic of the PLSR is that the model can be built in the principal component space so as to avoid the multicollinearity among the input features. To testify the effectiveness, a milling test is carried out and the harmonic features extracted from the milling force signal are adopted to depict the variation of tool wear status. By comparing several performance indicators and analyzing the prediction curve, it can be concluded that the PLSR model outperforms the MLR for the accurate prediction of tool wear status. This method casts new light on the online accurate prediction of tool wear in the industrial environment.

Footnotes

Funding

This project is supported by National Natural Science Foundation of China [51175371].

References

Srinivasa

Nagabhushana

Ramakrishna Rao

. Flank wear estimation in face milling based on radial basis function neural networks. Int J Adv Manuf Technol 2002; 20: 241–247.

Jacob

Joseph

. A multiple-regression model for monitoring tool wear with a dynamometer in milling operations. J Technol Studies 2004; 30: 71–77

Bhattacharyya

Sengupt

Mukhopadhyay

. Cutting force-based real-time estimation of tool wear in face milling using a combination of signal processing techniques. Mech Sys Signal Process 2007; 21: 2665–2683.

Zeng

Zhou

. Multi-modal sensing and correlation modelling for condition-based monitoring in milling machine. SIMTech Technical Reports 2007; 8: 50–56.

Husheng

Griffiths

Tate

. Comparison of partial least squares regression and multi-layer neural networks for quantification of nonlinear systems and application to gas phase Fourier transform infrared spectra. Analytica Chimica Acta 2003; 489: 125–136.

Poveda

Garcıa

Martin-Alvarez

. Application of partial least squares (PLS) regression to predict the ripening time of Manchego cheese. Food Chemistry 2004; 84: 29–33.

Svante

Michael

Lennart

. PLS-regression: a basic tool of chemometrics. Chemometrics Intell Lab Sys 2001; 58: 109–130.

Hojjat

Abbas

Laleh Same

. A comparison of partial least squares (PLS) and ordinary least squares (OLS) regressions in predicting of couples mental health based on their communicational patterns. Procedia Social Behavioral Sci 2010; 5: 1459–1463.

Qingsong

Yizeng

. Monte Carlo cross validation. Chemometrics Intell Lab Sys 2001; 56: 1–11.

10.

Yan

Wong

Lee

. An investigation of indices based on milling force for tool wear in milling. J Mat Process Technol 1999; 89–90: 245–253.

11.

Kious

Ouahabi

Boudraa

. Detection process approach of tool wear in high speed milling. Measurement 2010; 43: 1439–1446.

12.

Ting

Guozheng

Banghua

. EEG feature extraction based on wavelet packet decomposition for brain computer interface. Measurement 2008; 41: 618–625.

13.

Singh

Ojha

Malik

. Partial least squares and artiﬁcial neural networks modeling for predicting chlorophenol removal from aqueous solution. Chemometrics Intell Lab Sys 2009; 99: 150–160.

Online tool wear prediction based on partial least square regression and Monte Carlo cross validation