Sage Journals: Discover world-class research

Abstract

As one of the most important factors affecting shop floor management, tool life is determined by the tool flank wear or break, which is related to the tool parameters, cutting conditions and workpiece parameters. It is found that the relationship between these factors and the tool life is too nonlinear to be analytically formulated. For this reason, back-propagation neural network model is used to predict the tool life for its strong ability of nonlinear fitness. To avoid the local optimum, slow convergence and low generalization capability of the back-propagation neural network, a tool life prediction model, which is based on improved particle swarm optimization–back-propagation neural networks, is proposed in this article. The particle swarm optimization is applied to optimize the weights and thresholds of the back-propagation algorithm for improving the ability of global search and generality. Existing sample data of tools are used to train the proposed model for predicting the life of the tools that are similar to the sample data in tool style. The face milling tools and workpiece of 45 steel are selected for experiment. Theoretical analysis and comparative experiments with back-propagation neural network indicate that the life predicted by the particle swarm optimization–back-propagation model is much better than that of the back-propagation model. It proves that particle swarm optimization–back-propagation model has better convergence, stronger robustness and higher generality. This model also provides a theoretical basis for the economization of tool demand analysis and production planning.

Keywords

Tool life particle swarm optimization back-propagation neural network

Introduction

The dull or damaged tools can not only put extra strain on the machine tool system but also cause quality loss of the workpiece. It is also a major cause of unscheduled stoppage in a machining environment, which would result in time lost and capital destroyed. Knowing the tool life enables technologist to replace tool before the tool is dull so as to improve the product quality and production efficiency. Moreover, knowledge of tool life is also beneficial to the tool demand analysis, tool cost control and production planning. Therefore, a method to predict the tool life is necessary.

Tool life is determined by the tool flank wear or break. The tool flank wear and break is relevant to the tool parameters (tool style, tool diameter, tool material, etc.), cutting materials and processing parameters (cutting speed, cutting width, feed rate, etc.) and so on. It is necessary to find out the relationship between these affecting factors and the tool life. However, the relationship between these factors and the tool life is highly nonlinear. It is difficult for the traditional Taylor formulas of tool life to get the real tool life in different processing environment. The job of finding out the mathematical formula between affecting factors and life expectancy is not easy. Therefore, a method to find out a relationship that involves no mathematical formula is needed. Back-propagation (BP) neural network can learn and store large amounts of input–output model mapping relationship, without revealing the mathematical equations. With its ability of approximating any nonlinear function of arbitrary precision and its requirement of limited number of samples, BP model has been widely used in the process of solving fuzzy, nonlinear problems. Considering the great nonlinearity between the affecting factors and the tool life, the tool life prediction can be regarded as a nonlinear problem. Hence, the BP neural network, which is simple, efficient and economic, can be applied to find out the relationship.

Ezugwu et al.¹ used BP neural network to predict tool lives and failure modes for experiments not used in training. The best results are 58.3% correct tool life prediction (within 20% of the actual tool life) and 87.5% correct failure mode prediction. Ojha and Dixit² used neural networks to predict the maximal, minimum and most proximate estimates of the tool life. The comparison in Ojha and Dixit’s article showed that there were a higher robustness and a better convergence in BP neural network than in multiple regressions. He also proposed a methodology for updating/obtaining the tool life estimates based on the shop floor feedback. Sanjay et al.³ used BP neural network and statistical methods for the prediction of tool wear in drilling. Drill size, feed, spindle speed, torque, machining time and thrust force were given as inputs to the artificial neural network and the flank wear was estimated. The results of both methods were compared with the experimental values. Neural networks were found to show better results compared to the statistical method.

But BP neural network has its own weaknesses. It would easily fall into a local optimum and is slow in convergence and weak in generalization ability. Laser- and video-based online artificial vision systems with neural network for direct online tool condition monitoring (TCM) can be more accurate. Some researchers have worked on this application of TCM.^4,5 But high cost and inconsistency due to variation in illumination have prevented this method from being implemented in the industry. A more economical proposition is to use an indirect method of monitoring tool wear from measured signals. Paul and Varadarajan⁶ built a multi-sensor fusion model based on artificial neural network for TCM. Srinivas and Kotaiah⁷ and Chen and Li⁸ established tool wear models in indirect measures of least-squares regression. Since tool wear must be measured after the tool cutting is interrupted, only few training data are available for learning the correlation between tool wear and indirect measures. Therefore, least-squares regression cannot guarantee the generalization performance of tool wear models. From D’Addona and Teti,⁴ Li et al.,⁵ Paul and Varadarajan,⁶ Srinivas and Kotaiah,⁷ Chen and Li⁸ and Sick,⁹ it is shown that the supervision of tool wear is the most difficult task in the context of TCM for metal-cutting processes.

To achieve the optimal convergence velocity and better generalization performance without supervision, several methods to optimize BP algorithm have been presented. These methods include adding learning rate and momentum factor, analog annealing algorithm,¹⁰ evolutionary algorithm (EA) and other random optimal algorithms.

Particle swarm optimization (PSO) algorithm is a kind of global optimization algorithm. It has the advantages of fewer parameters, easy implementation, fast convergence and strong robustness compared to genetic algorithm (GA) and other EAs.¹¹ The PSO is also able to converge to the global optimal solutions with a certain probability.¹² The combination of PSO and BP neural network not only guarantees the learning rate but also solves the weaknesses of BP neural network mentioned former. The potential of the PSO algorithm has been demonstrated by its successful application to optimization problems in artificial neural network design.^13–16

Therefore, a reliable and economic PSO-BP neural network model is provided for tool life prediction in this article. The basic BP neural network structure for tool life prediction is built on the basis of relative tool parameters and processing conditions as input and tool life as output. Proper numbers of hidden layers and nodes are set according to the input and output values. PSO algorithm is used to optimize the weights and thresholds of BP neural network before training BP neural network. Then an improved BP learning algorithm is further applied to find the optimal values.

The highly nonlinear relationship between affecting factors and tool life is obtained by training this model with sample data. Tool life can be predicted by inputting affecting factors to this model. A comparative experiment of tool life prediction in the BP model, improved BP model and PSO-BP model is conducted after the theoretical analysis. The results proved a better convergence, stronger robustness and higher generalization of PSO-BP model than BP model.

Theory of BP neural network

BP neural network was put forward by Rumelhart et al.¹⁷ in 1986. It has become the most important neural network model for the weaknesses of other available networks.¹⁸ It is a kind of error BP training algorithm for multilayer feedforward neural network. This network consists of the input layer, hidden layer and output layer. BP network can learn and store large amounts of input–output model mapping relationship without revealing the mathematical equations that describe the mapping relationship. The BP algorithm is used in BP neural network for learning: first, the output values are obtained by dealing with input values; second, the model calculates the errors of the network outputs compared to the expected values; third, errors are passed back to the network to modify the connecting weights and thresholds. The errors of the output would be decreased next time by this way. The weights and thresholds are adjusted constantly by the BP network until they meet the satisfactory output values or errors. The basic BP neural network model is shown in Figure 1.¹⁹

Figure 1.

Basic BP neural network.

Standard BP algorithm weight adjustment rule is as formula (1)

w (k + 1) = w (k) - η \frac{\partial E (k)}{\partial w (k)}

(1)

where k represents the run times, w is the weight of the model, $η$ is a constant called the learning rate and $E (k)$ is the mean square error of the output value of all samples in the kth run times. Threshold adjustment rule is similar to formula (1). The adjustment of weight mentioned later includes the adjustment of the threshold.

Self-adapting algorithm

The standard BP algorithm learning rate is a constant. In order to minimize the total error, $η$ should be small enough, but small $η$ will prolong the learning time. Although large $η$ can accelerate the learning rate, it easily leads to the shock of the learning process which would result in poor convergence.

Self-adapting algorithm¹¹ is mentioned to improve the efficiency of learning process by automatically adjusting the learning rate of weight adjustment formula. When the new error is a certain ratio higher than the old error, the learning rate will decrease; when the new error is lower than the old error, the learning rate will increase. This method guarantees training the network at the maximum acceptable learning rate anytime.

The adjustment function of learning rate $η$ is given as formula (2)

η (k) = {\begin{matrix} t_{1} η (k - 1) & E (k) > tE (k - 1) \\ t_{2} η (k - 1) & E (k) < E (k - 1) \\ η (k - 1) & otherwise \end{matrix}

(2)

where $t_{1}$ is the increment factor of learning rate and is used to increase the learning rate and $t_{2}$ is the reduction factor of learning rate and is used to reduce the learning rate; $t \geq 1$ , $t_{1} < 1$ and $t_{2} > 1$ .

Additional momentum method

The additional momentum method transfers the influence of the previous adjustment of weight to the current weight adjustment formula through a momentum factor, which can not only reduce the BP training time but also ensure the stability of the process.¹⁷ The momentum can adjust the weights toward the average bottom of error curved surface.

When the network weights enter the flat area at the bottom of error curved surface, the local gradient will become very small. The adjustment of weight is approximately the same at the iteration k and (k − 1), so as to reduce the network sensitivity to local details of error curved surface. In this way, the momentum can help to keep the network out of the error surface at a local minimum value.

Improved BP neural network

The self-adapting algorithm with additional momentum greatly shortens the training time and guarantees the stability of the training process. Improved BP neural network weight adjustment rule can be modified as formula (3)

w (k + 1) = w (k) - (1 - α) η (k) \frac{\partial E_{T}}{\partial ω (k)} + α Δ w (k)

(3)

where $α$ is the momentum factor and its value is usually 0.95; $Δ w (k) = w (k) - w (k - 1)$ ; the function of $η (k)$ is given as formula (2).

Theory of PSO algorithm

The PSO algorithm is originated from artificial life evolutionary computation theory.²⁰ In PSO algorithm, each particle is like a bird. The particle flies in the search space at a certain velocity and meanwhile dynamically adjusts its velocity and direction according to flying experience of its own and its companion. All particles have their fitness values that are determined by the objective function. Then, they can get their current positions and the best positions they have experienced (particle best, pbest) so far by calculating the fitness values. This experience is called the particle flying experience. In addition, each particle also knows the best position all the particles have experienced in the whole group so far (global best, gbest), which is called the companion flying experience. Each particle uses the following information to change its current position: 1, current location; 2, current velocity; 3, the distance between the current position and the particle best position; and 4, the distance between the current position and the global best position. Optimization search is carried out by such a group of randomly initialized particles in an iterative manner.

Algorithm description

Suppose a swarm in a d-dimensional target search space composed of m particles $x = {(x_{1}, x_{2}, \dots, x_{m})}^{T}$ . The position of i particle is $x_{i} = {(x_{i, 1}, x_{i, 2}, \dots, x_{i, d})}^{T}$ , and its velocity is $v_{i} = {(v_{i, 1}, v_{i, 2}, \dots, v_{i, d})}^{T}$ . The particle best is calculated by fitness function denoted by ${pbest}_{i} = {({pbest}_{i, 1}, {pbest}_{i, 2}, \dots, {pbest}_{i, m})}^{T}$ and the global best is denoted by $gbest = {({gbest}_{1}, {gbest}_{2}, \dots, {gbest}_{m})}^{T}$ . After finding out the pbest and gbest, the position and velocity of the particle itself can be updated by formulas (4) and (5)²¹

\begin{matrix} v_{i, d}^{k + 1} = & {wv}_{i, d}^{k} + c_{1} r_{1} ({pbest}_{i, d}^{k} - x_{i, d}^{k}) \\ + c_{2} r_{2} ({gbest}_{d}^{k} - x_{g, d}^{k}) \end{matrix}

(4)

x_{i, d}^{k + 1} = x_{i, d}^{k} + v_{i, d}^{k + 1}

(5)

where k is the iteration number; $v_{i, d}^{k + 1}$ represents the velocity of particle i in dth dimension and k + 1 iterations; w is the inertia weight; $c_{1}$ and $c_{2}$ are called learning factors or acceleration constants; $c_{1}$ controls the step size of the particle toward pbest and $c_{2}$ controls the step size of the particle toward gbest; $r_{1}$ and $r_{2}$ are uniformly distributed random numbers ranging from 0 to 1; and $v_{i, d}^{k}$ and $x_{i, d}^{k}$ represent the velocity and position of particle i in dth dimension and k iterations, respectively.

The ${pbest}_{i, d}^{k}$ represents the pbest of particle I in the dth dimension after k iterations; the ${gbest}_{d}^{k}$ represents the gbest of the swarm in the dth dimension after k iterations. If $x_{i, d} \in [- x_{\max}, x_{\max}]$ , $v_{i, d} \in [- v_{\max}, v_{\max}]$ , the maximum velocity can be set as $v_{\max} = {tx}_{\max}$ , where $0 \leq t \leq 1$ ; If $v_{i, d}^{k + 1} > v_{\max}$ or $x_{i, d}^{k + 1} > x_{\max}$ , $v_{i, d}^{k + 1} = v_{\max}$ or $x_{i, d}^{k + 1} = x_{\max}$ , respectively; if $v_{i, d}^{k + 1} < - v_{\max}$ or $x_{i, d}^{k + 1} < - x_{\max}$ , $v_{i, d}^{k + 1} = - v_{\max}$ or $x_{i, d}^{k + 1} = - x_{\max}$ , respectively. The rule for updating the particle position can be represented in Figure 2 according to formulas (5) and (6).

Figure 2.

The particle position update.

From Figure 2, it can be seen that the distance between particle x and gbest is becoming closer with the increase in the number of iterations. For the minimization problem, the smaller the objective function value, the better the fitness value.²² Given function f, the update rules of minimization problem are as formulas (6) and (7)

{pbest}_{i}^{k} = {\begin{matrix} {pbest}_{i}^{k - 1} & (f ({pbest}_{i}^{k - 1}) < f ({pbest}_{i}^{k})) \\ x_{i}^{k} & otherwise \end{matrix}

(6)

{gbest}^{k} = arg min {f ({pbest}_{i}^{k}) | i = 1, 2, \dots, n}

(7)

Improved PSO algorithm

The inertia weight w is provided to control the global search ability and local optimization ability of PSO. A higher value of w helps particle to jump out of local optimum for the global optimization; a lower value of w is beneficial to the local optimization and can accelerate the convergence of the algorithm. An improved PSO by decreasing inertia weight with the increase in iteration is presented. The parameters of this algorithm are set as follows: $c_{1} r_{1} = φ_{1}$ , $c_{2} r_{2} = φ_{1}$ , $φ_{1} \in [0, 2]$ and $φ_{2} \in [0, 2]$ . The inertia weight w is function of the iteration k and decreases from 0.9 to 0 in a straight line²¹

w (k) = 0.9 - 0.5 \frac{k}{K_{\max}}

(8)

where $K_{\max}$ is the maximum number of iterations.

This algorithm has a higher inertia weight in the early iterations to guarantee the global search ability of PSO; w becomes lower later, so that the local optimization ability is strong to get a better convergence performance.

The basic steps of PSO algorithm

Step 1. Initialization: initialize the particle swarm position x in $[x_{\min}, x_{\max}]$ and velocity v in $[v_{\min}, v_{\max}]$ randomly according to the principle of uniform distribution.

Step 2. Calculate the fitness of each particle.

Step 3. Update the particle best of each particle according to formula (6).

Step 4. Update the global best of the swarm according to formula (7).

Step 5. Update the velocity and position of the particle according to formulas (4), (5) and (8).

Step 6. Judge whether the iterations meet the termination condition. If so, enter the Step 7, or else, back to Step 2.

Step 7. End, save the result.

PSO-BP neural network model for tool life prediction

In the actual machining process, tool life usually refers to the tool durability. It refers to the tool cutting time from the new or newly sharpened tool first into use to the tool is dull or broken. Boring, milling, drilling machine tools and computer numerical control (CNC) machining centers are the major machines used in flexible manufacturing system (FMS) workshop. Tool durability is proportional to the tool use coefficient, tool material and diameter and so on and inversely proportionate to the workpiece material coefficient, cutting velocity, depth of cut, cutting width, number of teeth and so on.

The traditional calculation of tool life²³ is given as formula (9)

T = \frac{C_{r} \cdot D_{0}^{q}}{v_{c} \cdot a_{p}^{x} \cdot f^{y} \cdot a_{w}^{u} \cdot Z^{p}}

(9)

where $C_{r}$ is the use coefficient that is associated with the workpiece material, tool material and other cutting parameters; $D_{0}$ , $v_{c}$ , $a_{p}$ , $f$ , $a_{w}$ and Z represent tool diameter, cutting velocity, depth of cut, feed rate, cutting width and number of tool teeth, respectively. The numerical sizes of the coefficients (x, y, u, p and q) reflect the influence degree of each tool parameter on the tool life. These numerical sizes are usually determined by experience or experiment and vary with the change in machining conditions.

The relationship between tool life and influence parameters is highly nonlinear. This relationship can be more complicated with the influence of the machining environment. Formula (9) can hardly describe this relationship correctly.²⁴ BP neural network is simple, rapid and economical. Besides, it has the capability to approximate any nonlinear function with arbitrary precision. These characteristics are very suitable for tool life prediction. But BP neural network has its own weaknesses. It would easily fall into a local optimum and is slow in convergence and weak in generalization ability. By contrast, PSO algorithm also has the advantages of easy implementation, fast convergence and strong robustness compared to GA and ant colony algorithm. Considering the powerful global search ability and generality of PSO algorithm and the strong local search ability of BP algorithm, this article built the PSO-BP model by combining the PSO and the BP neural network to predict tool life. PSO is used to optimize the weights of BP neural network. It takes the influence parameters as input and the tool life as output of neural network. The weights are optimized by the improved PSO to narrow the search range, and then optimized by neural network for better convergence.

Steps of the PSO-BP algorithm

Figure 3 shows a flowchart of the PSO-BP algorithm.

Step 1: initialize the particle swarm.

The particles contain the information of the connecting weights and thresholds of the BP algorithm. Therefore, the topology of the neural network should be built according to the sample data and the function of the algorithm first. The number of the input nodes and output nodes are set by the “variable” and “dependent variable” number of the sample data; the hidden layers and its nodes are set through testing experiment or the former researchers’ experience. All the connecting weights and thresholds are encoded into a vector which is called the particle.

The particles have its fly rule and limited fly area. The parameters for the fly rule and fly would be set. These parameters include population size M, inertia weight w, learning factors c ₁ and c ₂, range of particle’s position and velocity, termination condition of PSO and BP algorithm. The initial particle swarm can be generated randomly based on the principle of uniform distribution. This principle guarantees the global search in the area.

Step 2: evaluate the fitness values of the initial particles.

The fitness function f should be established first. The function of f is to find the distance between the target and the current position. The distance is determined by the function f. A better function f can reduce the learning time and improve the precision of the result. The connecting weights and thresholds, which form the current BP model, are isolated from the gbest. For this PSO-BP model, the fitness function is used to compare the errors between the output values of the current BP model and the desired values.

Step 3: update the pbest and gbest.

For the initial particles, the pbests are set as the fitness of the initial particles; for the particles after k iterations, the pbests are updated according to formula (6). The gbests are selected from the pbests through formula (7).

Step 4, Step 5 and Step 6: update the particle velocity and position.

The update rules have been presented in formulas (4), (5) and (8). Both the adjustment of the velocity and the position are affected by the pbest and the gbest. Formula (8) is to keep the particles flying in the given area. The pbests and the gbests are updated again by calculating the fitness values of the new positions of the particles. This is a step-by-step process to find the optimal solution.

Step 7: judge whether the iterations meet the termination condition.

The termination conditions are usually set to the maximum iterations or a desired fitness value. If so, enter Step 8, otherwise, back to Step 3 to update the pbest and gbest again.

Step 8 and Step 9: train the BP neural network.

The global search of the PSO algorithm is over. The connecting weights and thresholds of the BP neural network are obtained. The improved BP neural network presented in section “Improved BP neural network” is trained to search the local minimum in the vicinity of the gbest. If the search result is better than gbest, update the gbest. The set method of termination condition is the same as Step 7.

Step 10: optimization over. Save the connecting weights and thresholds.

After these steps, a PSO-BP algorithm for the specific problem is presented.

Figure 3.

Flowchart of the PSO-BP algorithm.

Model building

1. Build the topology of neural network

The choice of the network topology decides the performance of the neural network.²⁵ This work includes the design of input layer, output layer and hidden layers.

Input and output layer design

The tool life is influenced by processing mode, tool materials, workpiece materials, machining precision, tool diameter, tool teeth, depth of cut, feed rate, milling width, cutting velocity and so on. But in the actual processing, when sample data are enough, good learning samples can be obtained through the advanced screening conditions.²⁴ So, the non-numeric factors, such as milling mode, tool materials and workpiece materials can be selected as query conditions. The samples are queried in this condition. Therefore, six numeric parameters, including tool diameter, tool teeth, depth of cut, feed rate, milling width and cutting speed, are selected as the input nodes of network and tool life is selected as output node.

Hidden layer design

It had been proved that a three-layer BP neural network can simulate any n-to-m mapping.²⁶ So, this model selects the structure with a single hidden layer. According to Kolmogorov principle,²⁷ the number of hidden layer nodes is set by formula (10)

N = 2 I + 1

(10)

where I and N represent the BP neural network nodes of input layer and hidden layer, respectively. Set I = 6 and N = 13.

Therefore, the topology architecture of the neural network is 6-13-1.

2. Preprocessing of the sample data

Difference in the size of the training sample data has great influence on the convergence speed of the network, so the input and output sample data should be preprocessed (normalization processing) before training or testing. Formula (11) is used to compress the input and output sample data to (0.1, 0.9)

X' (i, j) = 0.1 + 0.8 \frac{X (i, j) - X_{\min} (i, j)}{X_{\max} (j) - X_{\min} (j)}

(11)

The results should be processed through anti-normalization to get the real prediction values of tool life.

Figure 4 shows the BP neural network model for tool life.

3. Parameter setting of the PSO algorithm

Set $c_{1} = c_{2} = 2.0$ ; the search area of the particles is [−1, 1]; maximum velocity of particles $v_{\max} = 0.5$ ; swarm number N = 30. The function of w is formula (8). For formula (2), t = 1.04, $t_{1} = 0.75$ and $t_{2} = 1.05$ . The maximum runtime of PSO is 2000 and the maximum runtime of BP is 15,000.

4. Fitness function

The root-mean-square error (RMSE) is a frequently used measure of the differences between the predicted values and the observed values. The RMSE represents the sample standard deviation of the differences between predicted values and observed values. The function of RMSE is given as formula (12)

RMSE = \sqrt{\frac{\sum_{i = 1}^{n} {(T_{i} - T'_{i})}^{2}}{n}}

(12)

where $T_{i}$ is the real life and $T'$ is the predicted life.

Figure 4.

BP neural network model for tool life prediction.

The errors are squared before averaged, so the RMSE gives a relatively high weight to large errors. Therefore, RMSE is sensitive to the errors. This character guarantees both the accuracy and precision of the predicted value. The smaller the RMSE, the higher the accuracy and precision of predicted values. PSO is a way to find out the minimum. Hence, RMSE is chosen as a fitness function to guarantee both the accuracy and precision of the predicted value

fitness = RMSE

(13)

Experiment and analysis

In order to verify the performance of this model, three simulation experiments for tool life prediction were carried out. One was conducted by PSO-BP model as this article presented, another was conducted by basic BP neural network and the other one was conducted by improved BP neural network mentioned in section “Theory of BP neural network.”

Sample data collection

The most common cutters and workpieces were selected for this experiment. The most frequently used tools in FMS workshop are milling tools, among which face milling cutter and end milling cutters are the most commonly employed. Therefore, facing milling cutters were selected. The material of the cutters is YT15 cemented carbide; the material of workpieces is 45 steel; processing method is rough milling. The cutting conditions are shown in Table 1.

Table 1.

Tool cutting conditions.

Parameter	Range
Cutting velocity (m/min)	95–183
Tool diameter (mm)	80–315
Teeth number	4–16
Cutting width (mm)	40–210
Cutting depth (mm)	2–10
Feed rate (mm/tooth)	0.07–0.24

In all, 22 groups of sample data were included under this condition. Table 2 shows 17 groups of the sample data which are used for training the network. Five groups of the sample in Table 2 are used for testing the performance of the network (Table 3).

Table 2.

Training data set.

Data no.	D ₀ (mm)	Z	a_p (mm)	a_w (mm)	f (mm/tooth)	v_c (m/min)	T (min)
1	80	4	2	40	0.12	183	72
2	80	4	3	50	0.10	175	85
3	100	5	4	60	0.13	150	98
4	100	5	5	75	0.15	130	107
5	100	5	7	60	0.07	173	125
6	125	6	5	75	0.10	144	180
7	125	6	7	80	0.11	140	136
8	125	6	6	90	0.13	135	113
9	160	8	3	96	0.10	150	190
10	160	8	5	100	0.13	130	171
11	160	8	9	110	0.18	108	153
12	200	10	8	115	0.13	125	180
13	200	10	3	135	0.16	115	250
14	250	12	9	150	0.18	110	160
15	250	12	10	170	0.20	105	137
16	315	16	5	210	0.24	95	326
17	315	16	9	190	0.13	110	305

Table 3.

Testing data set.

Data no.	D ₀ (mm)	Z	a_p (mm)	a_w (mm)	f (mm/tooth)	v_c (m/min)	T (min)
1	100	5	5	55	0.1	165	100
2	125	6	3	75	0.13	135	190
3	160	8	9	96	0.18	110	160
4	200	10	6	110	0.15	125	165
5	250	12	7	140	0.13	120	240

Error measures

To assess the performance of the PSO-BP model, the following error measures were used:

Absolute error

E_{a} = | T_{i} - T'_{i} |

(14)

Fractional error

E_{f} = | 1 - \frac{T'_{i}}{T_{i}} |

(15)

RMSE

RMSE = \sqrt{\frac{\sum_{i = 1}^{n} {(T_{i} - T'_{i})}^{2}}{n}}

(16)

Mean square fractional error

{RMSE}_{f} = \sqrt{\frac{\sum_{i = 1}^{n} {(1 - {\frac{T'_{i}}{T}}_{i})}^{2}}{n}}

(17)

Simulation experiment and results

Environment of the simulation experiment

The simulation experiment about PSO-BP model, improved BP model and BP model was conducted 10 times in the computer. The computer environment used in the experiment was as follows: Intel $^{Ⓡ}$ Core™ i5-2310, 2.90 GHz of CPU, 2 G of memory, Win7 operating system and MATLAB 2012b simulation environment.

Parameter and process setting for the three models

The MATLAB programs of these three models were built by the parameters presented in Table 4 and section “PSO-BP neural network model for tool life prediction.” The values of the connecting weights and thresholds were trained by the training data set. The results were obtained by the trained models.

Results of the simulation

Table 5 shows the average predicted tool life of testing samples. Table 6 shows the detailed performance of the models. These performances were obtained by the statistic of all the outputs of the three models after 10 training. The average values of the errors, including RMSE, RMSE_f , E_a , E_f and correct rate, are selected for the performance. The correct rate in Table 6 refers to the proportion of the predicted values whose absolute value of fractional error is less than 10% in all values. Figures 5 –7 show the distribution of the predicted tool life.

Table 4.

The parameters of BP, improved BP and PSO-BP models.

Parameters	Basic BP	Improved BP	PSO-BP
Iterations of BP	30,000	15,000	15,000
Iterations of PSO	0	0	2000
Weight adjustment rule of BP	Formula (1)	Formula (3)	Formula (3)
Weight adjustment rule of PSO			Formulas (4) and (5)
Architecture	6-13-1	6-13-1	6-13-1

BP: back-propagation; PSO: particle swarm optimization.

Table 5.

The outputs of BP, improved BP and PSO-BP model.

No.	Desired output	BP			Improved BP			PSO-BP
No.	Desired output	Output	E_f (%)	Iterations	Output	E_f (%)	Iterations	Output	E_f (%)	Iterations
1	100	108	8	BP: 30,000	108	8	BP: 15,000	105	5	PSO: 2000; BP: 15,000
2	190	157	17.37		150	21.05		187	1.58
3	160	134	16.25		140	12.5		158	1.25
4	163	176	7.98		165	1.21		168	3.07
5	240	244	1.67		242	0.81		247	1.23

BP: back-propagation; PSO: particle swarm optimization.

Table 6.

Performance of BP, improved BP and PSO-BP models.

Performance measures	BP model		Improved BP model		PSO-BP model
Performance measures	Training data	Testing data	Training data	Testing data	Training data	Testing data
RMSE	8.27	16.54	0.32	16.77	2.48	3.99
RMSE_f (%)	6.48	10.08	0.27	9.65	2.25	2.38
E_a	5.8	26.07	0.23	26.38	1.76	6.3
E_f (%)	4.46	15.92	0.18	15.46	1.49	3.81
Correct rate (%)	87.65	34	100	34	100	94

BP: back-propagation; PSO: particle swarm optimization; RMSE: root-mean-square error.

Figure 5.

Life distribution of basic BP.

Figure 6.

Life distribution of improved BP.

Figure 7.

Life distribution of PSO-BP.

Analysis of experimental results

It can be seen from Table 5 that the life predicted by the PSO-BP model is better than basic BP model and improved BP model. The differences between the results of BP model and improved BP model are not significant. The number of iterations of PSO-BP model is smaller than that of BP model. The values of fractional errors of basic BP model range from 1.67 to 17.37. While the fractional error of all the testing data is less than 5.00% which is much smaller than the values of BP model. The less iterations and high accuracy show the fast convergence of the PSO-BP model.

The training data errors of basic BP model in Table 6 are relatively small which indicate the strong ability of nonlinear fitting. But the testing data errors are larger than the training data errors. These weaknesses reveal the low generalization ability of BP model. The training data errors of BP model are the smallest in the three models, but the testing data errors are not satisfied. The testing data errors are as poor as the error of the basic BP model. This proves the fast convergence but the low generalization ability of the improved BP model. All the errors of PSO-BP model in Table 6 are much smaller than those of BP model. The testing data correct rate reaches 94%. RMSE is an effective way to evaluate the accuracy and precision of the result. The 3.99 of RMSE shows the high generalization ability of PSO-BP model.

The average predicted life and worst predicted life shown in Figures 5 –7 further proved the higher convergence and greater generalization capability of PSO-BP model.

Through data analysis and comparison, the convergence, robustness and generalization of neural network are greatly improved by using the PSO-BP model. The performance of the PSO-BP model is much better than that of the BP neural network. PSO-BP model is very suitable for tool life prediction.

The satisfied results of the PSO-BP model demonstrated that this model is applicable for tools with different diameters and cutting teeth.

Summary and conclusion

A reliable BP neural network model based on PSO algorithm is established in this article to predict the cutting tool life. This model takes the advantages of the global search capability of PSO and the complex nonlinear mapping ability of BP neural network. In this model, PSO algorithm is used to optimize the weights and thresholds of BP neural network before the training BP neural network. BP learning algorithm is further used to find the optimal.

The highly nonlinear relationship between affecting factors and tool life can be obtained through the utility of the complex nonlinear mapping ability of BP neural network. An effective method for tool life prediction is achieved by training the existing experimental and production data. The calculating process is a black-box operation which reveals the strong adaptive ability of PSO-BP neural network.

Theoretical analysis and simulation show that PSO-BP algorithm can effectively reduce the risk of falling into a local minimum value of neural network, in which the convergence, robustness and generality of the BP neural network are greatly enhanced.

A neural network expert system is established when this model is adapted in FMS tool management system and the sample data are saved to the database. Then, more tool information which can provide a dynamic tool sample database for PSO-BP tool life prediction model will be obtained. Furthermore, a dynamic update of PSO-BP model is achieved with the support of this database to improve the preciousness of life prediction.

In this article, tools with different diameters and teeth numbers are selected as sample data and predicted successfully by this model. This indicates that more affecting factors, such as the hardness of the materials and tools, can be selected with the support of the neural network expert system.

This model also provides an effective way for the optimization of tool machining parameters. In the machining process, in order to realize the unity of tool change time and improve production efficiency, the desired tool life can be obtained by adjusting the parameters affecting tool life. For selected tool and material, users can adjust the input parameters of cutting speed, depth and feed rate until the output results reach the expected tool life. The workshop production efficiency will also be greatly improved by combining this model with FMS.

Footnotes

Acknowledgements

Thanks to Zheqi Zhu for improving the language (English) of this article.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The National Science and Technology Major Project (Grant No. 2012ZX04011-031), National Outstanding Youth Science Foundation (Grant No. 50925518), National Science and Technology Support Plan Subsidization Project (Grant No. 2012BAF12B09) and the Youth Science Foundation of National Natural Science Foundation (Grant No. 51005260) supported this research project.

References

Ezugwu

Arthur

Hines

. Tool-wear prediction using artificial neural networks. J Mater Process Tech 1995; 49(3): 255–264.

Ojha

Dixit

. An economic and reliable tool life estimation procedure for turning. Int J Adv Manuf Tech 2005; 26(7–8): 726–732.

Sanjay

Neema

Chin

. Modeling of tool wear in drilling by statistical analysis and artificial neural network. J Mater Process Tech 2005; 170(3): 494–500.

D’Addona

Teti

. Image data processing via neural networks for tool wear prediction. Proced CIRP 2013; 12: 252–257.

Wong

Nee

AYC

. Intelligent tool wear identification based on optical scattering image and hybrid artificial intelligence techniques. Proc IMechE, Part B: J Engineering Manufacture 1999; 213(2): 191–196.

Paul

Varadarajan

. A multi-sensor fusion model based on artificial neural network to predict tool wear during hard turning. Proc IMechE, Part B: J Engineering Manufacture 2012; 226(5): 853–860.

Srinivas

Kotaiah

. Tool wear monitoring with indirect methods. Manuf Technol Today India 2005; 4: 7–9.

Chen

. Development of a tool wear observer model for online tool condition monitoring and control in machining nickel-based alloys. Int J Adv Manuf Tech 2009; 45(7–8): 786–800.

Sick

. On-line and indirect tool wear monitoring in turning with artificial neural networks: a review of more than a decade of research. Mech Syst Signal Pr 2002; 16(4): 487–546.

10.

Srikant

Krishna

Rao

. Online tool wear prediction in wet machining using modified back propagation neural network. Proc IMechE, Part B: J Engineering Manufacture 2011; 225(7): 1009–1018.

11.

Wang

. Evolving artificial neural networks using an improved PSO and DPSO. Neurocomputing 2008; 71(4): 1054–1060.

12.

Tian

. The hybrid neural network technology. Beijing, China: Science Press, 2009.

13.

Wang

Huang

Zhang

. Mechanical property prediction of strip model based on PSO-BP neural network. J Iron Steel Res Int 2008; 15(3): 87–91.

14.

Ismail

Jeng

Zhang

. An optimised product-unit neural network with a novel PSO–BP hybrid training algorithm: applications to load–deformation analysis of axially loaded piles. Eng Appl Artif Intel 2013; 26(10): 2305–2314.

15.

Razfar

Asadnia

Haghshenas

. Optimum surface roughness prediction in face milling X20Cr13 using particle swarm optimization algorithm. Proc IMechE, Part B: J Engineering Manufacture 2010; 224(11): 1645–1653.

16.

Kiranyaz

Ince

Yildirim

. Evolutionary artificial neural networks by multi-dimensional particle swarm optimization. Neural Networks 2009; 22(10): 1448–1462.

17.

Rumelhart

Hinton

Williams

. Learning representations by back-propagation error. Nature 1986; 323(9): 533–536.

18.

Bose

Liang

. Neural network fundamentals with graphs, algorithms and applications. New York: McGraw-Hill, Inc., 1996.

19.

Kasabov

. Foundations of neural networks, fuzzy systems, and knowledge engineering. New York: Marcel Alencar, 1996.

20.

Kennedy

Eberhart

. Particle swarm optimization. IEEE IJCNN 1995; 4(2): 1942–1948.

21.

Shi

Eberhart

. Empirical study of particle swarm optimization. In: Proceedings of the 1999 congress on evolutionary computation, 1999 (CEC ‘99), Washington, DC, 6–9 July 1999, vol. 3. New York: IEEE.

22.

Jianchao

. Particle swarm optimization algorithm. Beijing, China: Science Press, 2004.

23.

Xiankui

. Mechanical processing handbook. Beijing, China: China Machine Press, 2008.

24.

Yang

Wang

. Prediction of cutting tool life based on evolutionary neural network. Comput Integr Manuf 2008; 14(1): 167.

25.

Hagan

Demuth

Beale

. Neural network design. Boston, MA: PWS Publishing Company, 1996.

26.

Ito

. Representation of functions by superpositions of a step or sigmoid function and their applications to neural network theory. Neural Networks 1991; 4(3): 385–394.

27.

Minping

Guanglu

. The theory of stochastic processes. Beijing, China: Peking University Press, 2004.

Tool life prediction based on particle swarm optimization–back-propagation neural network

Abstract

Keywords

Introduction

Theory of BP neural network

Self-adapting algorithm

Additional momentum method

Improved BP neural network

Theory of PSO algorithm

Algorithm description

Improved PSO algorithm

The basic steps of PSO algorithm

PSO-BP neural network model for tool life prediction

Steps of the PSO-BP algorithm

Model building

Experiment and analysis

Sample data collection

Error measures

Simulation experiment and results

Analysis of experimental results

Summary and conclusion

Footnotes

Acknowledgements

Declaration of Conflicting Interests

Funding

References