Abstract
Titanium alloys are extensively employed in machining to produce high-value components, where surface roughness is crucial for their functionality. In this study, the roughness of Ti6Al4V and Ti6Al7Nb turned surfaces was analysed. The goal was to compare the alloy’s roughness behaviour when machined under the same conditions. Therefore, hypothesis testing was performed to assess the equality of the expected values and population median. It was found that, for some parameters, the mean (RzI, Rt) and median (Rq, RzD, Pt) roughness was statistically different when changing alloys. However, for others, it did not present a significant difference, in terms of mean (RmD, Rp) and median (Ra, Rpm). Consequently, a relevant insight from this study was that the phenomenon interpretation was affected by roughness parameter selection. On the other hand, as expected, the feed rate played a critical role in roughness control. It was observed that Ti6Al7Nb had a smoother topography (Ra was 0.15 ± 0.01 μm and 0.62 ± 0.04 μm) than Ti6Al4V (0.31 ± 0.02 μm and 0.67 ± 0.03 μm) at 0.05 and 0.15 mm/rev. Yet, at 0.1 mm/rev, this behaviour changed, as Ti6Al4V had a smoother surface (Ra was 0.40 ± 0.02 μm) than Ti6Al7Nb (Ra was 0.51 ± 0.03 μm). Finally, the multi-optimization indicated that a low feed rate (0.05 mm/rev) and medium (60 m/min) and high (90 m/min) cutting speed led to lower roughness and higher material removal rate for turning Ti6Al7Nb and Ti6Al7V, respectively. The insights from this study may have a practical application in the manufacturing and quality control of biomedical components.
Introduction
Although α + β Ti6Al4V alloy was the number one choice for manufacturing the first generation of titanium implants, 1 researchers found that long-term exposure to Vanadium was correlated with the development of Alzheimer’s disease. 2 Therefore, other Ti-alloys, such as Ti6Al7Nb, have been preferred for producing implants since they reduce unwanted interactions with the human body. This is achieved without compromising other thermo-mechanical properties that make Ti-alloys suitable biomaterials, including corrosion resistance, good strength-to-density ratio and low elasticity modulus. 3
If, on the one hand, the Ti6Al7Nb alloy is more biocompatible, on the other, the number of studies reporting its machinability is still scarce compared to the traditional Ti6Al4V alloy. Nonetheless, relevant contributions were found, including Sun et al., 4 that applied liquid nitrogen in Ti6Al7Nb turning and observed an improvement in the surface hardness and finish compared to dry and flooded cooling. Furthermore, Lauro et al. 5 conducted a very thorough study in Ti6Al7Nb dry micro turning and assessed several responses that characterize the machinability, including machining forces, cutting temperature, chip compression ratio, specific cutting energy and friction coefficient at the rake face. The same authors 6 performed similar tests with minimum quantity lubrication (MQL) and observed worse surface roughness for MQL conditions due to higher chip adherence (Ra <0.2 μm for dry, and Ra <0.3 μm for MQL). Nonetheless, MQL improved the burr formation by avoiding the growth of a build-up edge. The works described above are praiseworthy as they are among the few describing the effect of the lubrication in the surface state of Ti6Al7Nb.
Reports comparing the two α + β Ti-alloys were also found, including, Fellah et al., 7 which noticed, through tribological tests, that the wear resistance of Ti6Al7Nb was lower than Ti6Al4V. More recently, a similar trend was observed by Almeida et al. 8 To explain the findings, the authors evaluated the topography of worn Ti-alloys surfaces, with reconstruction by extended depth-of-focus, as well as a fractal dimension analysis. Variations in β phase distribution of the Ti-alloys were pointed-out as one of the reasons for the distinct topographical and wear behaviour. A concern that can be drawn from these reports is that Ti6Al7Nb surfaces potentially release more debris in the human body than Ti6Al4V. This, in turn, highlights the relevance of investigating effective machining strategies that promote Ti6Al7Nb smooth surfaces, thereby reducing its potential for debris release.
Furthermore, high-quality surface texture enhances the natural anti-corrosive properties of Ti-alloys, as reported by Avinash and Kumar. 9 Their findings implied that, in contrast to Ti6Al4V, Ti6Al7Nb surfaces presented a lower corrosion rate. This behaviour was attributed to the greater resistance of dioxide titanium film formed in the presence of Nb elements. However, before the corrosion tests, the roughness evaluation pointed to smoother Ti6Al7Nb surfaces (Ra: 0.15–1.73 μm), rather than Ti6Al4V (Ra: 0.36–1.87 μm). Undoubtedly, this aided in the improvement of the anticorrosion properties of Ti6Al7Nb.
While the previous work, 9 suggested a better surface finish in Ti6Al7Nb micro-milled surfaces, reports for turning10,11 showed the opposite trend. Nevertheless, Mello et al. 11 found that the turning conditions required to achieve the same (and optimal) surface roughness in both alloys were identical, indicating that they exhibit similar machinability. An analogous inference was found in Filho et al., 12 as the optimal machining conditions for minimum specific cutting energy for both alloys were the same.
A key point to consider is that while the surfaces may have similar roughness, they can have distinct textures and chemical element distributions, as shown in Carvalho et al. 13 Indeed, it was found, with an innovative correlative microscopy technique, that variations in the process conditions lead to the formation of clusters of vanadium in feed mark regions of Ti6Al4V surfaces. These heterogeneities were not noted in Ti6Al7Nb surfaces. These insights are particularly pertinent to avoid undesirable interactions between surfaces and the human body.
Thus, the concept of surface integrity is very broad, as it includes topographical, microstructural and mechanical aspects. Yet, the surface finish is undoubtedly one of the key aspects in the manufacturing of functional surfaces, as it accounts for the waviness and micro geometric deviations, namely, the surface roughness. In machining, the latter refers to the irregularities resulting from the interaction between the tool and the material and depends, among other factors, on the process conditions, including cutting speed, feed rate, stiffness of the machine tool, tool nose radius, used of metalworking fluids and others. 14 As irregularities (groves and valleys) induce stress concentrations in the surface, higher wear rate, reduced fatigue resistance and a higher tendency for crack propagation may lead to component failure if surface roughness is not accounted for. 15
Besides, for hard tissue replacements, including joints and dental implants, the surface roughness impacts the structural and functional bonding between the implant and the bone, known as osseointegration. 16 For dental implants, turning and threading operations typically provide a smooth surface with an arithmetic average deviation of the roughness profile (Ra) between 0.3 and 0.6 μm. 17 Nevertheless, many available solutions endured multiple surface modifications, such as abrasion, etching, blasting and coatings to increase the roughness.16,17 Indeed, a Ra ranging between 0.79 and 2.89 μm was observed by Nicolas-Silvente et al. 16 when assessing off-the-shelf dental implants. It seems that, compared to smoother implants (Ra <1 μm), a certain degree of roughness, preferably between 1 and 2 μm, improves the proliferation and attachment of bone cells, as well as the bone-to-implant contact. Still, as discussed in Jordana et al., 18 the risk of periimplantitis (inflammation around the implant) intensifies with increasing roughness (Ra >1.2 μm).
On the other hand, for articulating components, such as the femoral head, a Ra below 0.05 μm is required (ISO 7206-2: 2011/AMD 1:2016) to avoid particle generation due to excessive wear. 19 This finish cannot be achieved solely through machining but rather requires grinding and polishing operations. Nevertheless, the better the finish provided by machining, the less time is required for these operations.
To assess surfaces roughness 2D and 3D methods can be used. 2D techniques include stylus profilometry, while 3D technology consists of optical interferometry (3D profilometers), 20 confocal 21 and atomic force microscopy. 22 Besides, roughness parameters can be classified as 3D when determined for an area of the surface instead of a line (2D). 23 Due to their capacity to inspect a wider area, thus having higher statistical significance, 3D parameters gained recognition for evaluating heterogeneous rough textures, such as those found in additive manufacturing surfaces.24,25 Furthermore, as 3D inspection entails additional cost and technical requirements, 2D profilometry is frequently sufficient for analysing well-understood components and processes, such as homogenous surfaces produced by machining. Also, the many decades of data and experience with 2D roughness measurements make 2D profilometry a well-established technique known to academics and industrials. Indeed, contact profilometry has worldwide traceability, as many finish standards used by the industry were defined using it. 19 Other advantages of 2D profilometry include portability, flexibility in measuring small and large workpieces and the capacity to assess inside bores.
From all the 2D roughness parameters described in the state-of-the-art, the arithmetic average deviation of the roughness profile (Ra) is the most acknowledged by the industry and academia. 26 Nevertheless, several authors agreed that for some components, the definition of the surface state through Ra is simplistic, as it overlooks the peak and valley distribution throughout the profile.27,28 The same happens with the root mean square of the surface roughness (Rq). Indeed, surfaces with the same Ra or Rq can vary widely in their surface topology and, therefore, diverge widely in performance. 29 Still, Rq is more sensitive to peaks and valleys than Ra since the amplitudes are squared. 23
Another parameter commonly used to describe the machined surface finish is the peak-to-valley height average from five consecutive sampling lengths (RzD).28,30 Contrary to Ra and Rq, RzD is more sensitive to occasional high peaks or deep valleys, as instead of averages, the highest profile heights are examined. 23 Moreover, RzD brings advantages over parameters such as Rt (maximum peak-to-valley length over the evaluation length) and RmD (maximum individual peak-to-valley height from five sampling lengths) as it integrates more of the roughness profile. On the other hand, RzD is an average of the extreme events in the roughness profile, therefore, can be excessively affected by outliers.
Indeed, multi-parameter roughness analysis can detect irregularities that could compromise the functional performance of components, as shown Folwaczny et al. 31 These authors were concerned about how periodontal probes would damage smooth implants. It was found, with paired hypothesis testing, that for Rp (highest peak), Rt (maximum peak-to-valley height) and Rv (lowest valley), there was a significant increase in roughness after probing, which was not found for Ra and RzD. Therefore, there are parameters better at detecting certain aspects of the roughness profile than others.
Thus, profilometry analysis improves when confronting roughness parameters that describe different aspects of the topographical profile.16,31 Indeed, many other parameters, all with their merits and limitations, are described in roughness measurement standards. 23 The prevalence of some parameters to the detriment of others is more correlated to their historic availability on profilometers than to their accuracy in classifying component surface finish. In the absence of standards, it is necessary to apply a scientific criterion to decide which parameters describe the surface finish accurately.
The present work tackles this research gap, as statistical inference, via paired hypothesis testing was used not only to compare the Ti-alloys surface state but also to understand how much the selection of a particular roughness parameter will support the similarity or disparity between the surface finish of both Ti-alloys. Indeed, paired hypothesis testing made it possible to compare the two population means of correlated samples. In our case, Ti-alloys turned under the same conditions. This statistical tool has been used by others to compare the material removal rate of 2 bars of steel machined by electric discharge 32 ; the roughness attained with two values of cut-off (0.08 and 0.25 mm) 19 ; the cutting efforts generated under two machining environments (dry and nanofluid), 33 and two tool coatings (AlTiN and TiAlCrN). 34 Besides, paired hypothesis testing was effective for inferring if a predictive model differs significantly (or not) from experimental measurements 35 or other models. 36
Therefore, this work focuses on answering the following research questions, do the alloys behave similarly, in terms of surface finish, when machined under the same conditions? Will this depend on the roughness parameter chosen to make the assessment? Which cutting conditions provide a better surface finish for the Ti-alloys? Are they similar? The first two questions were answered with statistical inference, while the others with multi-objective optimization via the Grey relational analysis (GRA).
GRA has been used by several researchers to optimize the machining conditions, namely cutting tools, metalworking fluids and tool overhang, as a function of multiple responses, including machining forces, vibration, surface finish, tool wear, temperature and material removal rate.37,38 Although there are several multi-objective optimization algorithms, 39 GRA has been widely used as it works well with small sample sizes, does not require any specific data distribution and involves simple calculations. 40 Furthermore, the algorithm outcome is a ranking, so it is quite intuitive to interpret the preferential order of the trials, which simplifies the decision-making in machining process optimization.
To summarize, two biocompatible Ti-alloys were dry-turned. The state of the machined surfaces was evaluated (and quantified) with several amplitude roughness parameters attained by stylus profilometry. The main goal was to understand if the Ti-alloys presented a similar surface finish when machined under the same conditions. This is one of the many criteria for replacing the most selected and toxic Ti-alloy (Ti6Al4V) for Ti6Al7Nb in biomedical components manufacturing. Statistical inference was employed to compare the biomedical alloys’ surface finish but also to encourage a reflection on the choice of roughness evaluation methods and their effectiveness in classifying turned surfaces. Indeed, the innovative character of this work (novelty) resides in this combined analysis. Furthermore, a multi-criterion decision-making (GRA) method was applied to find which turning conditions resulted in better surface finishing and efficient chip removal for both alloys.
Methods
This section details the experimental procedures carried out in this work, which according to Figure 1, included: (a) Dry turning tests in Ti-alloys (Ti6Al7Nb and Ti6Al4V); (b) Evaluation of the machined surfaces with a stylus probe roughness tester; (c) Data treatment and analysis, namely statistical inference, and optimization.

(a) Experimental setup, the white arrows represent the machining process parameters, (b) roughness tester and conditions used to evaluate the machined surfaces and (c) methodology for data treatment and analysis.
Titanium alloys
Ti6Al7Nb and Ti6Al4V alloy billets with a 12 mm diameter were used in the turning experiments. According to TiFast S.r.l., they were produced by vacuum arc remelting, were hot rolled, annealed and cold finished. The as-received alloys had an average surface roughness (Ra) of 0.61 ± 0.09 μm (Ti6Al4V) and 0.52 ± 0.03 μm (TiAl7Nb).
The Ti6Al4V and Ti6Al7Nb alloys presented a microstructure composed of a fine dispersion of α a and β phases. A detailed microstructural report of the alloys is available in a previous publication by the authors. 13 The Ti-alloys’ chemical composition, as well as thermo-mechanical properties, can be found in Tables 1 and 2, respectively. These values were available in the quality certificate sheets delivered with the alloys.
Chemical composition in % atomic weight for Ti-alloys.
Thermo-mechanical properties for Ti-alloys.
Dry turning experiments
The oblique dry turning tests were conducted on a CNC turning centre Kingsbury MHP 50 with 18 kW power and a maximum revolution speed of 4500 rpm. PVD TiAlN + TiN coated tools with chip breaker, designation CNMG 120408-MF1 TS2000 (from Seco tools), were employed in a tool holder model C3-PCLNL-22040-12 (from Sandvik Coromant). The insert geometry was suitable for machining stainless steels, super alloys and titanium alloys in semi-finishing and finishing operations.
The cutting parameters were chosen following: (1) the manufacturer’s recommendations (ap: 0.25–3.0 mm, f: 0.08–0.3 mm/rev; vc: 50–70 m/min); (2) the type of operation (finishing), which is why a low feed rate (0.05, 0.1 and 0.15 mm/rev) and depth-of-cut (0.5 mm) were applied; (3) the research goals, which were, minimize the surface roughness, without compromising the productivity, by controlling the feed rate and cutting speed; (4) the state-of-the-art, including a review on cutting parameters for machining titanium alloys by Festas et al. 41 and the optimization work done by Mazid et al. 42 These authors carried out turning tests in a Ti6Al4V alloy, and although they applied cutting fluid, they observed that for a cutting configuration similar to ours, the conditions that minimized the surface roughness (Ra: 0.8 μm) were an ap of 0.5 mm, an f of 0.1 mm/rev and a vc of 50 m/min.
As displayed in Table 3, a factorial design (Taguchi L9 orthogonal array) with three levels of cutting speed (vc) and feed rate (f) was implemented for the turning experiments. A total of nine machining conditions were investigated. The tests were repeated twice to stabilize the sample mean and standard deviation and increase the statistical inference reliability. 43
Turning conditions for the experiments.
The CNMG inserts selected for the machining tests had eight cutting edges. Three cutting edges were used per insert, as each one completed the tests and repetitions done under the same feed rate (f). The maximum working time per cutting edge was below 1.5 min, corresponding to the tests carried out when f was 0.05 mm/rev. For this cutting time, as expected, there were no modifications to the tools. Indeed, the cutting edges were observed periodically, with an optical microscope to guarantee that. Besides, the cutting length was short to prevent tool wear from influencing the individual responses of each roughness parameter since the alloys could present different wear behaviours.
Surface roughness measurements
The surface finish was evaluated with a roughness metre Model T1000 with LV15 diamond probe, both from the Hommelwerke™ company, as shown in Figure 1(b). The mechanical stylus probe acquired the surface roughness profile along 1.5 mm, the defined tracing length (lt), as presented in Figure 2 After that, several roughness parameters that describe the surface finish were estimated.

Primary profile for the as-received Ti6Al4V alloy and roughness parameters.
The parameters selected for this work, and their description, can be found in Figure 3. For every roughness parameter, four measurements were taken, two per specimen, spaced approximately at 180°, making a total of 36 observations collected for each alloy.

Description of the roughness parameters analysed in the scope of this work. (Hommelwerk cortesy: Hommel Tester T1000 manual).
Data treatment
As shown in Figure 1(c), after the roughness measurements, the dataset was uploaded to R ×64 4.0.3 and IBM SPSS Statistics 27 for further analysis. As mentioned before, one of the goals of this work was to determine if there was a significant difference between the surface finish of two titanium alloys machined under the same cutting conditions. Therefore, paired hypothesis t-test was used to compare the corresponding expected mean values of each surface roughness parameter of two related groups (alloys).
Hypothesis testing consisted of (1) formulating the null (H0) and alternative hypothesis (Ha), as well as the significance level (α). (2) Select the appropriate test and state the relevant test statistic. (3) Determine the value of the test statistic and the p-value. (4) Compare the p-value with the significance level. Then, if p-value≤α, H0 is rejected. Otherwise, if p-value >α, H0 cannot be rejected. 44
For the paired t-test, the null hypothesis (H0) was that the two means (alloys) were equal, which was the same as saying that the population mean difference was zero. As demonstrated in equation (1), to determine the test statistic (T), three fundamental data values were required, namely the average of the differences between the paired data (xd), the standard deviation for the paired difference (sd) and the number of paired differences (n). Afterwards, the estimated test statistic was compared with the tabulated, taken from the known probability distribution (t-distribution). Moreover, the test statistic was utilized for calculating the p-value. In practice, all these steps were performed by statistical software.
As shown in Figure 1(c), the paired t-test was used when the paired differences followed a normal distribution. Due to the sample size (36 observations), the paired difference normality was checked with a Shapiro-Wilk test. 45 This test assumes a null hypothesis (H0) where the sample, in our case paired difference, comes from a normally distributed population.
When the normality was rejected, the non-parametric Wilcoxon signed-rank test was used to determine if the two paired groups (two alloys machined under the same conditions) were different from one another in a statistically significant manner. The test statistic was determined by (1) calculating the absolute difference between groups. (2) Determining the rank of the absolute difference between the groups. If the absolute difference is zero, exclude the pair. (3) Computing de positive and negative rankings. The Wilcoxon test statistic is the sum of the positive rankings. Once again, all these steps were performed by statistical software.
The statistical inference was followed by multi-objective optimization with Grey relational analysis (GRA). Ten responses, including nine surface roughness parameters (Ra, Rq, RmD, RzD, RzI, Rt, Rpm, Rp) and the maximum material removal rate (MMR), were included in the GRA. MMR was included to consider the process productivity, that is the volume of material removed per minute and was estimated by the multiplication of the cutting speed (vc), feed rate (f) and depth-of-cut (ap) as indicated in Sadeghifar et al. 46 The goal was to find optimal machining conditions for both alloys.
GRA implementation began with dataset normalization. At this step, the original raw responses (xi (k)), which had their specific scales, were converted into comparable data, ranging from 0 to 1 (yi (k)). Normalization was done using two equations, one for the ‘smaller-the-better’ responses (equation (2)), that is the nine roughness parameters and the other (equation (3)) for the ‘greater-the-better’ attribute, in this case, the MMR.
After that, the deduction of the deviation sequence (Δ oi ), as well as the estimation of the Grey relational coefficient (ϑi (k)) was carried out using equations (4) and (5). Δ min and Δ max correspond to the maximum and minimum from the deviation sequence (Δ oi ). In equation (5), the term ζ is called distinguish factor and was 0.5 suggested by Mia et al. 47
Afterwards, as presented in equation (6), the multi-responses were converted into a single value called Grey relational grade (ξ), which was calculated by combining the Grey relational coefficient (ϑk) with a weight value (wk). In the current study, all the responses had the same weight, as suggested in Mia et al. 47
Finally, the Grey relational order corresponds to the descending order of the Grey relational grades. The optimal run was the one with the highest Grey relational grade, corresponding to rank 1. All these steps were implemented in Microsoft Excel.
Results
This section details the main findings regarding the surface texture attained when turning two biomedical Ti-alloys, starting with the boxplot visualization containing the distribution for the surface roughness parameters for both titanium alloys, as shown in Figure 4. The measurements from surfaces machined with the same feed rate (f) present the same colour, that is yellow (0.05 mm/rev), blue (0.10 mm/rev) and red (0.15 mm/rev), whereas the ones attained at the same level of cutting speed (vc) present the same shape, namely cross (30 m/min), circle (60 m/min) or triangle (90 m/min).

Ti-alloys roughness distribution for Ra (a), Rq (b), RmD (c), RzD (d), RzI (e), Rt (f), Rpm (g), Rp (h) and Pt (i). The black lines represent the distribution median, while the red dot is the mean.
Figure 4 indicates that for several roughness parameters, including Ra (a), Rq (b), RmD (c), RzD (d), RzI (e), Rt (f), Rpm (g) and Rp (h) the measurements are clustered by colour, meaning feed rate. These clusters were separated, as is the case for the Ra (a), Rq (b) and RzD (d) in Ti6Al4V surfaces, while sometimes they were overlaid. The overlaps were slight, as in the Ra (a), Rq (b) and RzD (d) for Ti6Al7Nb surfaces and Rpm (g) for both alloys, or more evident as observed for RmD (c), RzI (e), Rt (f), Rp (h) for both alloys. As expected, the parameters that emphasize the deviations of the roughness profile, namely RmD (c), RzI (e) and Rt (f) for the peak-to-valley height, and Rp (h) for the peak height, culminated in distributions with overlaps, due to higher data variability. The corresponding parameters, namely RzD (d) and Rpm (g), that divide the profile in sampling lengths and look at its average, reduced the data variability, which increased the separation between clusters. The same applies to Ra (a) and Rq (b).
The feed rate effect on the surface roughness of machined surfaces is well reported in the state-of-the-art.26,48 It is a well-established fact that increasing the feed rate in machining operations leads to rougher surfaces, which is reflected in roughness measurements in Figure 4(a) to (h). The results from Figure 4(a) to (h) also indicate that lower and higher levels of feed rate (0.05 and 0.15 mm/rev) promoted smoother surfaces for Ti6Al7Nb alloy (Ra was 0.15 ± 0.01 and 0.62 ± 0.04 μm) than for Ti6Al4V alloy (Ra was 0.31 ± 0.02 and 0.67 ± 0.03 μm). On the other hand, when the feed rate was 0.1 mm/rev, the Ti6Al4V alloy surfaces were smoother (Ra was 0.40 ± 0.02 μm) compared to Ti6Al7Nb alloy (Ra was 0.51 ± 0.03 μm).
Additionally, the roughness measurements from Figure 4(a) to (h) suggest that regardless of the cutting speed, for Ti6Al7Nb alloy, it is possible to increase the feed rate from medium (0.1 mm/rev) to high (0.15 mm/rev) without compromising (severely) the surface roughness (Ra from 0.51 ± 0.03 to 0.62 ± 0.04 μm). An analogous conclusion was established for Ti6Al4V alloy, as an increase from lower (0.05 mm/rev) to medium (0.1 mm/rev) feed rate does not impact (drastically) the surface roughness (Ra from 0.31 ± 0.02 to 0.40 ± 0.02 μm). This outcome arose from the asymmetries presented by the distributions for the alloys. As shown in Figure 4(a) to (h), for Ti6Al7Nb alloy, the roughness distributions were left-skewed (mean < median), while for Ti6Al4V alloy, the distributions were right-skewed (mean > median), as indicated in Figure 4(a) to (i). The roughness values attained at 0.05 mm/rev for Ti6Al7Nb alloy surfaces (Ra: 0.15 ± 0.01 μm) decreased the distribution mean and enhanced the left skewing.
Furthermore, Figure 4 suggests that the roughness data does not show any specific trend in terms of cutting speed (vc). A similar indication was given by the data in Table 4, which relates the measurements for each roughness parameter with the cutting conditions. The roughness values from Table 4 were obtained from the average of the four measurements. Table 4 indicates that the average roughness (Ra) for Ti6Al7Nb surfaces varied between 0.14 ± 0.01 and 0.64 ± 0.02 μm and from 0.30 ± 0.01 to 0.70 ± 0.01 μm for Ti6Al4V alloy surfaces, corresponding to a roughness grade number of ranging from N4 to N6.
Roughness response (average and standard deviation) and material removal rate as a function of the cutting conditions.
In Table 4, it was observed that, when assessing the distance between peaks and valleys, Rt > RmD > RzI > RzD for all the tested conditions. As expected, the parameters that resulted from the evaluation of the tracing length (Rt and RmD) were higher than the parameters that counted for the average of the sample sizes (RzI and RzD). Rt was the highest value since it detects the higher peak and the lowest valley in the tracing length, followed by RmD, which detects the same thing, but within the sample sizes. A comparable conclusion was established for the average parameters RzI (tracing length) and RzD (sample sizes).
Among the roughness parameters, the maximum profile depth (Pt), shown in Figure 4(i), presents a random distribution, perhaps because this parameter was calculated from the unfiltered roughness profile and is the most sensitive to profile changes.
Statistical inference
Normality test
The Shapiro-Wilk test was used to check the normality of the paired differences associated with each dependent variable (Ra, Rq, RmD, RzD, RzI, Rt, Rpm, Rp, Pt). In the scope of the conducted work, the null hypothesis ‘H0: The dependent variable paired differences follow a normal distribution in the population of Ti-alloys surfaces’ cannot be rejected if the p-value >0.05, while if p-value ≤0.05 the null hypothesis is rejected in favour of the alternative hypothesis which is ‘Ha: The dependent variable paired differences does not follow a normal distribution in the population of Ti-alloys surfaces’.
Table 5 shows that the p-value (Ra, Rq, RzD, Rpm, Pt) ≤0.05 which rejects the null hypothesis H0 in favour of Ha (at a 5% significance level) and, thus, one cannot assume that Ra, Rq, RzD, Rpm and Pt followed a normal distribution. For RmD, RzI, Rt and Rp, since the p-value >0.05, H0 cannot be rejected (at a 5% level), so the normality of differences can be assumed (highlighted in bold in Table 5).
The p-value for the Shapiro-Wilk test in two paired samples.
The histogram of differences, and the Q-Q plots, were visualized to substantiate these findings. Figure 5 shows these graphs for one case where the normality of differences can be assumed (Rp) and for another where it is not (Ra). Figure 5(a) indicates that when the differences follow a normal distribution, the histogram is relatively symmetric about the mean (0.097) and fits a bell shape that has only one peak (mode: 0.36). Figure 5(b) shows a bimodal distribution (mode: −0.13, 016), which usually indicates that the distribution in the population is not normal. Besides, in normal distributions, the points in the Q-Q plot define a line, as demonstrated in Figure 5(c).

Histogram of differences with descriptive statistics for Rp (a) and Ra (b). Q-Q plot for Rp (c) and Ra (d).
Paired t-test
Since one assumption to performing paired t-test is to assure that the two samples are strongly related to being paired, the Pearson correlation coefficient was calculated. Table 6 shows that there was statistical evidence to support that the two samples can be paired as the null hypothesis, ‘H0: The Pearson correlation coefficient is equal to zero, therefore a linear relationship between the paired items does not exist’ was rejected (p-value (RmD, RzI, Rt, Rp)≤0.05) in favour of the alternative hypothesis, ‘Ha: The Pearson correlation coefficient is different from zero, therefore a linear relationship between the paired items exists’ at a 5% significance level. The high Pearson correlation coefficient (Table 6) also implies a strong relationship between the paired samples.
Pearson correlation coefficient and p-value.
For the variables that presented a normal distribution of the differences (highlighted in bold in Table 5) as well as a strong Pearson correlation coefficient (Table 6), t-testing was performed to check if ‘there was any significant difference in the surface roughness of Ti-alloys surfaces machined under the same cutting conditions (H0)?’. The Table 7 results indicate that the null hypothesis H0 was rejected for RzI and Rt, so it was possible to conclude that for these parameters the mean surface roughness of machined Ti6Al4V and Ti6Al7Nb alloy surfaces were statistically different. While for RmD and Rp the null hypothesis H0 cannot be rejected and consequently there are no significant differences between the mean surface roughness of Ti6Al4V and Ti6Al7Nb alloy surfaces (highlighted in bold in Table 7).
p-Value for the two-tailed t-test.
Wilcoxon test
For the variables that do not follow a normal distribution of the differences (Table 5), the Wilcoxon test was performed to check if there was any significant difference in the surface roughness of Ti-alloys machined under the same conditions (H0)? Accordingly, to Table 8 this hypothesis was rejected for Rq, RzD and Pt as the p-value ≤0.05 at a 5% significance level, which means that the median roughness when changing the material to be machined from Ti6Al4V to Ti6Al7Nb alloy was statistically different. For Ra and Rpm, the null hypothesis H0 cannot be rejected as the p-value >0.05 at a 5% significance level (highlighted in bold in Table 8), meaning that the median roughness when changing the Ti-alloy did not present significant differences.
p-Value for the paired Wilcoxon test.
Multi-objective optimization
Table 9 presents the Grey relational order achieved for both alloys. In Appendix A, the results from the intermediate steps, namely normalization (Table A1), deviation sequence (Table A2) and Grey relational grade (Table A3), which led to this ranking can be found.
Grey relational grade and rank.
The optimal run was the one with a higher Grey relational grade, which was attained at a low feed rate (0.05 mm/rev) for both alloys. However, at a high cutting speed (90 m/min) for Ti6Al4V and medium cutting speed (60 m/min) for Ti6Al7Nb alloy. Nevertheless, for the Ti6Al7Nb alloy, the rank second position was at 90 m/min. Therefore, it seems reasonable to machine both alloys at the same cutting speed (90 m/min), if needed, to improve the process productivity without neglecting the surface texture. The worst conditions for machining Ti6Al4V alloy (rank 7, 8 and 9) were at a high feed rate (0.15 mm/rev) for all the cutting speeds, while for Ti6Al7Nb alloy, they were at a low cutting speed (30 m/min) combined with medium (0.10 mm/rev) and high (0.15 mm/rev) feed rate (rank 8 and 9).
Furthermore, Table 9 indicates that for Ti6Al4V alloy, the ranking was clustered by feed rate, meaning that the positions 1–3, 4–6 and 6–9 correspond to the conditions with low (0.05 mm/rev), medium (0.1 mm/rev) and high (0.15 mm/rev) feed rate. This was not observed for the medium (0.1 mm/rev) and high (0.15 mm/rev) feed rates for the Ti6Al7Nb alloy. Besides, these results agree with the boxplots from Figure 4, where it was observed that for Ti6Al4V alloy, the transitions between clusters with the same feed rate were well separated, while for Ti6Al7Nb alloy the clusters corresponding to medium (0.1 mm/rev) and high (0.15 mm/rev) levels of feed rate were overlaid for many roughness parameters.
Discussion
As point-out previously, the main goal of this research was to answer the following questions: Do the alloys behave similarly, in terms of surface finish, when machined under the same conditions? Will this depend on the roughness parameter chosen to make the assessment? Which cutting conditions provide a better surface finish for the Ti-alloys? Are they similar?
Statistical inference showed that for some parameters, the mean (RzI, Rt) and median (Rq, RzD, Pt) surface roughness was statistically different when changing the alloy, while for others, there was no significant difference in terms of mean (RmD, Rp) and median (Ra, Rpm). Therefore, the alloys behave similarly for some roughness parameters (Ra, RmD, Rpm, Rp) and differently for others (Rq, RzD, RzI, Rt, Pt). The Ra result was consistent with the findings from Mello et al., 11 as they concluded, through paired t-test, that Ti6Al7Nb and Ti6Al4V alloys presented a similar roughness.
Furthermore, in a different but comparable scope, Folwaczny et al. 31 reported a significant (for Rp and Rt) and not significant (for Ra, RzD) increase in the surface roughness of implants after probing. These findings highlight how relevant it is to use several roughness parameters that present different sensitives to the features of the roughness profile (peaks, valleys, height) for defining the surface state, especially when developing standards or quality control procedures. Moreover, it emphasizes the need for observing the data instead of making empirical decisions.
The multi-objective optimization with GRA showed that the optimal conditions that minimize the surface roughness and maximize the material removal rate were attained at 0.05 mm/rev for both alloys. However, at 90 m/min for Ti6Al4V and 60 m/min for Ti6Al7Nb alloy. Moreover, for Ti6Al7Nb alloy, the rank second position was at 0.05 mm/rev and 90 m/min. Therefore, the recommended cutting conditions can be the same. To some extent, these outcomes agree with the findings from other authors, as the optimal conditions for improving the surface finish 11 and the specific cutting energy 12 when machining Ti6Al4V and Ti6AL7Nb were the same.
Nonetheless, the Ti6Al4V alloy ranking is clustered by feed rate, while for Ti6Al7Nb alloy is not. These results were consistent with the boxplot visualization from Figure 4. As for Ti6Al4V distribution, the transitions between clusters were defined, while for Ti6Al7Nb, they were overlaid. Therefore, while the optimal conditions were comparable, the overall behaviour was not. Consequently, deducting that the machinability of the alloy is similarly based only on the optimal machining conditions can lead to a precipitous conclusion.
Conclusion
In this work, the surface roughness of two α + β biomedical titanium alloys turned under dry conditions was compared. The effects of the cutting speed (vc) and feed rate (f), on the surface roughness, were inspected. Several amplitude roughness parameters, including Ra, Rq, RmD, RzD, RzI, Rt, Rpm, Rp and Pt were measured with a stylus probe. Hypothesis testing was used to evaluate the equality of the expected values and population median. Multi-objective optimization was conducted for defining suitable cutting regimes. Some meaningful conclusions can be taken from this work:
The boxplot charts were quite helpful to understand the impact of the cutting conditions on the surface roughness parameters. It seems that the roughness data does not present any specific trend in terms of cutting speed, however the same cannot be said about the feed rate, since as it increases rougher surfaces were attained.
The boxplot charts indicate that Ti6Al7Nb alloy machined surfaces presented a smoother texture for a lower (0.05 mm/rev) and higher (0.15 mm/rev) level of feed rate when compared with Ti6Al4V alloy surfaces, yet at the intermediate level (0.10 mm/rev), this behaviour changes as Ti6Al4V alloy surfaces were smoother.
The results from the hypothesis testing highlight the importance of using several roughness parameters to understand the Ti-alloy surface state because different conclusions, present a comparable finish (Ra, RmD, Rpm, Rp) or not (Rq, RzD, RzI, Rt, Pt), can be drawn depending on the analysed roughness descriptor.
The multi-objective optimization showed that the conditions that improve the surface integrity, without neglecting the process productivity were a low feed rate (0.05 mm/rev) combined with a high cutting speed (90 m/min) for Ti6Al4V alloy and a low feed rate (0.05 mm/rev) combined with a medium cutting speed (60 m/min) for Ti6Al7Nb alloy.
Footnotes
Appendix A
This section contains the results from the intermediate steps, namely data normalization (Table A1), deduction of the deviation sequence (Table A2) and determination of the grey relational grade (Table A3) behind the application of the multi-objective optimization algorithm, that is grey relational analysis. These steps were carried out considering the original data from Table 4.
Acknowledgements
The authors also acknowledge TiFast S.R.L., from Italy, for providing the Ti-alloy.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors acknowledge the Portuguese Foundation for Science and Technology (FCT) for the PhD grant ref. SFRH/BD/07040/2021.
