Abstract
This article reports the result of an experimental study to analyze the effect of parameters that induce residual stresses during electric discharge machining of two particulate-reinforced metal matrix composites. Several factors were varied to study their impact on the formation of residual stresses, and pulse-off time was identified as the most significant factor. Artificial neural network was implemented to predict the residual stresses. Metal matrix composites with low coefficient of thermal expansion and high reinforced particle exhibit lower residual stresses. Also, better conductive electrode materials used during machining cause lower residual stress. The artificial neural network model accurately predicts the residual stresses and is a reliable tool for predicting residual stresses. The micrographs show that the workpiece with a higher concentration of reinforced particulates results in lesser flow line in the matrix material hence less residual stresses. X-ray spectra also reveal the phase transformation on the machined surface.
Keywords
Introduction
Electric discharge machining (EDM) is a versatile machining process used for manufacturing variety of applications having stringent precision and accuracy requirements. In this process, the material is removed by a series of electrical sparks generated between two terminals (anode and cathode) in a dielectric medium. The process produces a mirror image of the tool (electrode) as it advances toward the workpiece. The process has the ability to machine hard-to-cut materials and can produce complex shapes. There are negligible cutting forces as there is no contact between the tool and work material resulting in a distortion-free and accurate machining with minimal requirement of manual skill. Because of this advantage, the EDM process is an attractive option to machine composite material such as particulate/fiber-reinforced metal matrix composites (MMCs) to achieve the required shape and size with superior dimensional accuracy and productivity. The temperature range of the spark developed is around 8000 °C–12,000 °C due to the excitation of a pulse generator, which produces rectangular shape pulses with adjusted on and off times, voltage level and discharge current. The sequence of sparks results in a substantial amount of phase transformation, thereby melting the surface of both poles. 1 The efficiency of the machining process depends upon various process parameters such as spark energy, flushing pressure, dielectric medium and the electrode. During EDM of MMCs, a localized high-temperature electric discharge occurs in the presence of dielectric, which may result in change in mechanical and physical properties and also develop residual stresses within the material. The residual stresses affect the fatigue life of the workpiece and often cause distortion or cracking and, in some cases, also premature failure of the part in service.2–5 For making this process commercially viable for MMCs, the prediction of induced residual stresses at various settings of input parameter is very important so that these could be controlled to improve the service life of the components. Developing a reliable mathematical model relating the machining parameters with residual stresses is very difficult due to complexities involved and also the nonlinearity of the EDM process. However, many researchers have reported the use of various neural network models to predict some responses accurately by effectively organizing the qualitative and quantitative factors. The work of McCulloch and Pitts 6 serves as the foundation for the growth of neural network architecture. The model suggested by them was further modified with the construction of electronic discrete variable automatic computer. 7 Currently, the most widely used artificial neural network (ANN) is multilayer perceptions based on back propagation (BP) techniques. The technique was employed for controlling and modeling various selected parameters. For example, many studies were feed forward BP to predict the surface roughness in machining processes.8–11 Similarly, BP neural network control was used to predict tool wear and the resultant failures in drilling and turning operation.12–14 Various neural network techniques have been used for prediction of responses during EDM.15–17 Feed forward neural network with BP learning algorithm has been used to estimate the height of the workpiece online and distinguish the machining condition in wire EDM. 18 A process control system consisting of a fuzzy gap width controller adapted by neural network was introduced. 19 Experiments were conducted to develop a model of EDM using BP neural network with current, pulse-on time and pulse-off time as an input parameter for predicting the metal removal rate (MRR) and tool wear rate (TWR). 20 A neural network was developed to study the steel transformation curve. 21 Feed forward neural network was used to classify the waveform generated from electrolyte-incorporated modified EDM. 22 Taguchi’s L18 orthogonal array was used to conduct experiments on explosive electric discharge grinding. The results were analyzed using genetic algorithm developed using neural network for optimization of the results. 23 In another study, Taguchi’s L27 orthogonal array was used to present feed forward neural network based on BP along with a regression model to predict the surface roughness in abrasive jet machining process. 24 The ANNs were applied successfully during machining and can accurately predict the responses during EDM.25–29
The literature review shows that ANN is a powerful tool to develop models particularly in manufacturing applications to predict, generalize and analyze nonlinear multivariate data. Also, no study has been reported that applies ANNs to predict the induced residual stresses in particulate reinforced MMCs during EDM. In the present study, two types of particulate-reinforced Al MMCs were machined using EDM, and a superior and reliable process model was developed using ANN to predict the residual stresses.
Experimental study
The experimental study was completed using two different types of MMCs commonly used for engineering applications. Material I (65 vol% SiC/A356.2 MMC) was procured in the form of a rectangular strip while material II was a hybrid (10 vol% SiC–5 vol% quartz in aluminum) prepared by stir casting method. The properties of both the materials are listed in Table 1. The experiments were conducted on a die sinking EDM machine (OSCARMAX SD550 ZNC, Taiwan) using conventional polarity. Three electrode (tool) materials namely (a) electrolytic copper, (b) fine grained graphite (particle size of 5.0 µm) and (c) copper–graphite composite with 50% Cu (Grade 673, resistivity: 2.03 µΩm and density: 2.95 g/cm3) procured from SGL Corporation, USA, were used to machine MMCs. A pilot study coupled with information available from the literature was used to identify the machine and process parameters to be varied during the study. Based on this study, current, pulse-on time, pulse-off time and dielectric fluid were identified for detailed study of their effect on residual stresses generated during machining. To draw valid conclusions from the experiments, each of these factors was varied at three levels to understand their impact on the development of residual stresses. Thus, a total of six factors namely material type (samples I and II), current (I), pulse-on time (ton), pulse-off time (toff), electrode material (I, II and III) and dielectric medium were chosen for study. The list of factors studied with their respective levels is shown in Table 2. The effect of these factors on normal stress (σφ) induced after machining of MMC samples was analyzed. Taguchi’s experimental design technique was used to design an orthogonal design from which valid conclusions could be drawn.30–32 Considering the degrees of freedom required from the array, L18 was selected as the most appropriate array. The assignment of factors was carried out using MINITAB 15.
Properties of MMCs.
Factors interested and their levels.
EDM: electric discharge machining.
Residual stresses
The component in service is subject to various types of stresses. The presence of residual stresses combined with external stresses may cause premature failures. The residual stresses are stresses, which are existing within a body/structure in the absence of external loading or thermal gradients. In other words, residual stresses in a body/structure are due to manufacturing methods, surface treatment, coatings, shot-peening process or structural change like phase transformation. X-ray diffraction (XRD) method used for measuring residual stresses in machined surface layer employs the characteristic radiation emitted from an X-ray. The advantage of this method is that repeated measurements are possible on the specimen. Figure 1 shows the schematic representation of concepts used in residual stress analysis.

Concept of X-ray diffraction stress analysis.
The general equation for measuring the residual stress is represented by equation (1). The lattice strain
where
S1 = −ν/E, 1/2S2 = (1 +ν)/E, S1 and 1/2S2 are the X-ray elastic constants (XECs).33–36 From equation (1), in order to measure the unidirectional normal residual stress, the parameter a+ is calculated using the lattice strain for positive and negative values of ψ for the particular rotational angle ϕ of the sample and is given by equation (3)
where
During this study, the residual stresses induced during machining were measured using XRD method on PANalytical’s X’Pert Pro MPD (The Netherlands) diffractometer. The evaluation was completed using XRD classic technique using only one plane {hkl} to calculate the magnitude of residual stress. The machined surface was cleaned to remove the resolidified globules of molten metal to reduce measurement errors. In this experiment, the highest peak (2θ≥ 130°) also known as Bragg’s peak diffracted from the matrix phase of composite was selected for precise measurements. The small change in lattice spacing gives considerable change in peak angle, and it is represented by equation (4)
A number of independent XRD measurements of the isolated diffracted peak from the matrix phase of composite material were made at a different angle of ψ tilts. The obtained dϕψ spacing at different angles ψ tilt (positive and negative) was used to calculate the lattice strain. The calculated a+ was plotted against sin2ψ, and the slope of the linear fit curve was compared with equation (3). The residual stress with accepted uncertainty value due to the goodness of curve fitting was obtained. Sikarskie 37 presented various factors contributing to the uncertainty in measurement of stress by XRD method. The inhomogeneous stress state generated during certain trials causes preferred orientation, which leads to lower peak intensities and less accurate peak location resulting in higher uncertainty. This effect of texture in measurement can be overcome by choosing appropriate tilts or translation of the sample to bring more grains in diffraction condition. The experimental layout and the measured residual stress values with uncertainty are shown in Table 3. The absolute residual stress (σφ) values with uncertainty were measured after each trial by above methodology and are given in the last column of Table 3.
Experimental layout and results.
Sample calibration and data analysis for trial 6
Residual stress was measured in matrix phase (Al phase) of the machined sample. The measurement is performed by selecting the isolated peak diffracted at the highest value of 2θ from {422} plane when using Cu Kα radiations (λ = 1.5406 Å). Table 4 for trial 6 (sample I) represents the change in lattice spacing measured from 19 ψ tilts. The calculated lattice strain for positive and negative ψ tilts is represented in the last column of Table 4. Residual stress was analyzed by comparing the linear fit regression equation obtained from the plot of a+ versus sin2ψ (Figure 2) with equation (3) as follows
Peak table and lattice strain for trial 6.
where 1/2S2 = 6.98 T−1Pa

Represents a+ versus sin2ψ plot with regression equation of trail 6
Hence, the normal residual stress for trial 6 is 110 MPa with uncertainty of ±11.4 MPa
Results and discussions
The analysis of variance (ANOVA) for residual stresses induced into the machined surface was completed, and the results are given in Table 5. The factors with larger value of F-test indicate that there is a significant effect in inducing the residual stresses. Similarly, the p-value indicates the significance level for each factor. The contribution percentage of each input factor was measured by analyzing the F-value and the p-value. From the results, it is clear that pulse-off time is the most significant factor followed by dielectric medium, current, workpiece and electrode material. On the other hand, the effect of pulse-on time is negligible. For compressive, qualitative and quantitative analyses of EDM process, ANN was used to establish the relationship between input and output parameters and also to understand and predict the process performance. The main effect of various input parameters is presented in Figure 3.
Analysis of means for the residual stresses.
Significant factors.

Main effect plots for residual stresses.
ANN
A feature borrowed from the physiology of the brain is the basis for the development of modeling technique known as ANN systems or simply neural networks. A neural network is defined by a collection of parallel processors connected together in the form of a directed graph, organized in such a way so that the network structure lends itself to the problem being considered. A network consists of processing element or unit in the network as a neuron, with connections between units indicated by the arc. The flow of information is indicated by the arrowhead on the connections. Figure 4 shows the general processing element or neuron model. Once the net input is calculated, it produces final output passing onto a nonlinear filter called activation function. There are several types of activation function used in neural network, but activation function mostly utilized in multilayer network is continuous in nature that varies gradually between the asymptotic values of 0 and 1 or −1 and +1 and is given by equation (5)

Schematic illustration of single neuron in a network. The input connections are modeled as an arrow from other nodes. Each input connection has associated with it a quantity wij, called weight.
The identity function (f(x) = x) is available for the output layer neuron. The neural network is broadly classified into three basic types namely (a) supervised, (b) unsupervised and (c) reinforced, as shown in Figure 5. The BP method has been proven to be the most useful in manufacturing-related applications. The network learns from the predefined set of input–output example pairs using a two-phase propagate–adapt cycle. After the input pattern is applied as a stimulus to the first layer of network units, it is propagated through each hidden layers until an output is generated. This output is then compared to desired output, and an error signal is computed for output nodes. The difference between the output generated by network model (Tor) and the actual output (Oor) is given by equation (6)

Classification of learning algorithm.
Based on error signal, a connection weight is updated using BP algorithm.
Estimation of residual stresses by ANN
The estimation of model in neural network is composed of two stages: (a) training and (b) testing of the network with the experimental machining data. The network consists of one input, two hidden and one output layer and was assumed to be 6–n1–n2–1, where six are the input factors with 17 neurons in the input layer, n1 and n2 neurons in the hidden layer and one neuron in the output layer. The number of neurons in the hidden layer is determined by trial and error as there is no general methodology available to estimate these. The criterion is to select minimum number of nodes, which would not affect the performance of network by minimizing the storage memory for synaptic weight. When the number of hidden nodes is equal to the number of training patterns, the learning could be fastest but on the other hand, it loses the generalization capabilities. The error criterion adopted in the present model is sum of square error.
The procedure is outlined in the following:
Divide the data set randomly in the training set and the testing set groups in appropriate percentage.
Apply different network architectures to train the network.
Evaluate the network architecture by comparing the normalized importance of each parameter obtained from the network with the ANOVA output results and percentage relative error.
Test the network for its generalized ability using group of data classified for testing.
The experimental data given in Table 3 consist of 18 groups with the levels of input parameters/factors and residual stress as output response. Out of these, 12 group of data (given by trials 1, 2, 3, 5, 7, 8, 11, 12, 13, 15, 16 and 18) were randomly selected (about 67%) of all data groups were associated with training of network, and the remaining six groups of data (represented by trials 4, 6, 9, 10, 14 and 17) or 33% of data were used to test the network. For training of network, a commercial window–based ANN software, IBM-SPSS, was used. The number of network was constructed and the best network was selected based on minimum absolute relative error. This was interestingly achieved with first and second hidden layers having nine and seven nodes, respectively. The learning rate and the momentum values were selected as 0.001 and 0.7, respectively. The summarized neural network architect for the prediction of residual stress is shown in Figure 6. The weights were updated, and the activation function in hidden layers and output was selected as hyperbolic tangent and identity function, respectively. It was observed that the outcome of network directly depends upon the weight distribution done randomly. Thus, the network was trained repeatedly to update the weight until the hierarchy of normalized importance of each input parameter in the network showed consistency with the results obtained during ANOVA analysis. The results of neural network model are shown in Table 6. The synaptic weight distribution of the designed networks is shown in Tables 7–9. Also, it can be observed from Figure 7 that pulse-off time is the most significant factor followed by type of dielectric medium used. The other factors have relatively smaller significance with pulse-on time having the least significance. The performance of ANN was measured with absolute % relative error between the output of network with the experimental value and was calculated by equation (7)
Comparison of ANN model with experimental results.
ANN: artificial neural network.
Synaptic weight distribution between input layer and hidden layer (n1).
Synaptic weight distribution between hidden layers n1 and n2.
Synaptic weight distribution between hidden layer (n2) and output.

Neural network architect for residual stress calculation.

Normalized importance of each input factors on the residual stresses.
Figure 8 shows that the predicted data of network are very close to the experimental data results in verifying the network generalization capabilities.

Comparison of ANN results and the experimental outputs.
Other statistical tools such as paired sample t-test and correlation were also completed to measure the amount of association between the experimental value and predicted value obtained from the network. The results are summarized in Tables 10 and 11. Table 10 shows that experimental value and the predicted value are positively correlated, r (N = 18) = 98.4%. Also, from Table 11, it can be concluded that the mean residual stress value increased from the experimental value to the predicted value by 0.227, t(17) = −0.119, p = 0.907 > 0.05. The experimental value and the results obtained from ANN model are not significantly different at 95% confidence range.
Correlation between the experimental value and ANN model value.
ANN: artificial neural network.
Paired sample t-test (experimental value and ANN model value).
ANN: artificial neural network.
Microstructure analysis
For understanding the topography of EDMed surfaces in relation to the residual stresses induced during machining, a scanning electron microscope (SEM) study was completed on select samples for both materials used in this study. Figures 9 and 10 show the comparative residual stresses induced in materials I and II, respectively. Figure 9(a) shows the minimum residual stress in material I (65 vol% SiC/A356.2) obtained for trial 4 (I = 8 A; ton = 45 µs; toff = 15 µs, e = Gr, d = EDM oil + Cu powder). This may be because of a lower residual stress due to the pull out particulate and micro shrinkages, which are lesser deep. Similarly, the maximum residual stress for material I was obtained in trial 9, as shown in Figure 9(b) (I = 8 A; ton = 10 µs; toff = 45 µs, e = Cu–Gr, d = EDM oil + Gr powder). In this figure, the expelled material is resolidified on the workpiece in the form of globules leading to formation of micro cracks due to high energy level resulting in larger residual stresses.

EDMed specimen of sample I: (a) residual stress = 36.3 MPa and (b) residual stress = 129 MPa

EDMed specimen of sample II: (a) residual stress = 41.8 MPa and (b) residual stress = 231.5 MPa.
Figure 10(a) and (b) shows the minimum and maximum residual stresses for material II (10 vol% SiC–5 vol% quartz/Al matrix composite). The minimum residual stress was obtained for trial 13 (I = 4 A; ton = 45 µs; toff = 15 µs, e = Gr, d = EDM oil + Gr powder), and maximum residual stress was obtained for trial 18 (I = 8 A; ton = 45 µs; toff = 45 µs, e = Cu–Gr, d = EDM oil). Both micrographs show the flow line of molten metal. However, these flow lines are more predominant in Figure 10(b) due to severe melting leading to high residual stress.
The SEM pictures of the machined surface represent the comparison of surface having highest and lowest residual stresses. This SEM analysis of the sample also reflects the surface properties of the machined surface such as surface roughness, porosity and crack. The machined surface in both the composites has a porous structure, which shows that the metal removal process was a result of melting, evaporation and spalling. 38 Furthermore, the surface topography reveals that the residual stresses may be caused due to metal flow, globules pockmarks, voids and crack. For material I, the flow lines of molten matrix are not as severe as observed in material II due to low coefficient of thermal expansion and density of reinforced particulate but there is a formation of pores and voids in the micrograph.
Material I (65 vol% SiC/A356.2) does not show large flow line phenomena on the machined surface as seen in material II (10 vol% SiC–5 vol% quartz/Al composite). This could be due to low heat expansion coefficient and high density of reinforced particles in material I and thus do not experience high thermal gradient and volume change during EDM. The typical XRD spectra of un-machined and EDMed workpiece were carried out for both materials using Cu Kα radiations (λ = 1.5406 Å) to examine the changes on the machined surface using a scan rate of 1°/min and range up to 140° with the generator setting of 40 mA and 45 kV. The resulting scans are shown in Figures 11 and 12. The lower most plots in both figures are for the un-machined surface, and the other two plots are for trials, which exhibited lowest and the highest residual stresses. The XRD of machined surface in Figures 11 and 12 indicates the formation of new phases. The peak intensity reduces as the residual stress magnitude increases due to dislocation of planes of atoms.

XRD spectra of sample I (a) before EDM and (b and c) EDMed surface with residual stress of 36.3 and 129MPa, respectively.

XRD spectra of sample II (a) before EDM and (b and c) EDMed surface with residual stress of 41.8 and 231.5 MPa, respectively.
Conclusion
This study reports the result of an experiment conducted on two different variants of MMCs for assessing the residual stresses induced while machining with an EDM process. Pulse-off time was identified as the most significant factor resulting in residual stresses in two materials. The analysis of results showed that MMCs with low coefficient of thermal expansion and a high density of reinforced particle have lower residual stresses. The electrode material which had high conductivity resulting in smooth formation of plasma channel between the electrodes induced least amount of residual stresses. The increase in pulse-off time causes a steep rise in residual stress due to extended solidification period while pulse-on time has no effect. The addition of power in the dielectric lowers the residual stresses. However, the conductivity of power particles has no effect on residual stresses. Subsequently, ANN model was developed to predict the residual stresses in such materials. The model accurately predicts the residual stresses and can be used as a reliable tool for the study of residual stresses in case of complex problems that involve qualitative and quantitative factors. The XRD patterns clearly indicate the formation of new phases on the machined surface. The peak intensity after machining is reduced due to dislocation of atomic layer resulting in residual strains.
Footnotes
Appendix 1
Declaration of conflicting interests
The authors declare that there is no conflict of interest.
Funding
This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors.
