Abstract
The aim of this work is to identify the best strategy for cluster analysis of acoustic emission (AE) events for cross-ply PE/PE graded composites under tensile loading. Six cluster algorithms are made comparison. Peak amplitude and frequency centroid represent adequately AE events clustering to five groups for PE/PE graded composites. A link between five representative damage modes with corresponding five groups of AE event is established. The results can be used to distinguish damage mechanisms and to investigate damage evaluation of PE/PE graded composites, and damage initiation and propagation of a PE/PE graded sample under tensile loading are discussed. This study provides guides for the establishment of AE interpretation and identification of damage modes for ultra-high molecular polyethylene–reinforced composites in future studies.
Introduction
Fiber-reinforced thermoplastic (FRT) composites are potentially and widely used in many demanding applications, including astronautic industries and civil constructions, because of good toughness, corrosion resistance, fatigue resistance, simple molding process, and high material utilization. Ultra-high molecular polyethylene (UHMWPE) fibers possess outstanding mechanical property and low density, promising in above applications. Due to unsatisfactory molecular polarity of the PE family, it is not easy for them to compatible with other matrix, thereby UHMWPE fibers are usually fabricated to self-reinforced (SR) composites, with reinforcement and resin belonging to the same polymer family, to obtain better interfaces. 1,2 During the processes of use, damages and even rupture can occur, such as fiber breakage, matrix cracking, and delamination, and thus it is important and essential to investigate damage mechanisms of FRT composites for their security use in practical processes.
Acoustic emission (AE) has been established as an accurate non-destructive technique for composite materials to monitor incipient failure mechanisms and discriminate different damage modes and the damage sources. 3 –7 AE signals, originating from sudden release of strain energy where damage happens, contain useful information on damages, with each AE event as an acoustic signature of certain damage mechanism. 8 Determining connections between AE signal parameters and corresponding damage mode is an important issue. AE registration has been applied to identify damage mechanisms for FRT composites in several reports (summarized in Table 1). 9 –15 For glass/polypropylene and glass/polyester composites, some researchers gave peak amplitude (PA) 9,10 and peak frequency (PF) ranges 11 for different damage modes. For UHMWPE/HDPE composites, our group concluded corresponding PA range as follows: 30–45 dB relates to interfacial debonding, 30–60 dB is matrix cracking, 60–80 dB corresponds to plastic deformation and matrix cracking, 60–80 dB is fiber pullout, 80–97 dB correlates to fiber breakage, and 60–85 dB is delamination. 12 Wang and coauthors utilized frequency spectrum analysis and assumed that PF of 40 kHz is matrix cracking, 40 kHz and 240 kHz relate to debonding, and 200 kHz is fiber rupture. 13 However, one AE parameter only describe one AE characteristic, while multi-descriptors provide rich information about AE event, which will better represent acoustic signature for each damage mode.
Summary of cluster results in the literature.
AE: acoustic emission; G: glass fiber; PP: polypropylene; PET: polyester; UHMWPE: ultra-high molecular polyethylene; HDPE: high-density polyethylene; LDPE: low-density polyethylene; PA: peak amplitude; PF: peak frequency; D: duration; MF: multiple features.
aSimple model specimen: LDPE resin, (90°) laminate, single fiber composite, fiber bundle composite, and (±45°) laminate.
Cluster analysis is one of the generally accepted ways to discriminate multi-parametrical AE signals, which is a synonym for unsupervised pattern recognition (UPR) technique.
14
In cluster analysis, selection of cluster algorithm is crucial, besides feature selection and cluster evaluation.
15
Most frequently used cluster algorithms in literatures are
Herein, we investigated damage mechanisms of cross-ply PE/PE graded thermoplastic composites by cluster analysis of AE data under tensile loading. Six cluster algorithms were made comparison, including
Materials and methods
Materials and sample preparation
The materials used in this study are UHMWPE fibers and LDPE films, which served as reinforcements and matrix, respectively. The mechanical properties of UHMWPE yarn and LDPE resin are the same materials as used in literature. 22 Cross-ply SR graded composites were prepared by the method of film stacking and hot compaction method. Three lamina types with different fiber volume fractions 19%, 25%, and 31% were labeled as lamina “1#,” “2#,” and “3#,” respectively. The preparation parameters are pressure of 1.0 MPa, temperature of 120°C, and 5 min. The prepared specimens are cross-ply graded composite specimen “FC” with stacking sequence [03#/903#/02#/902#/01#/901#]2 s.
Tensile tests
Tensile tests were performed on the cross-ply graded PE/PE samples according to ASTM D3039 standard. The length of tensile samples was 180 mm with gage length (

Schematic diagram of cross-ply graded PE/PE tensile test specimen.
Parameters of the acoustic emission acquisition.
Three tests for the studied specimen were chosen for cluster analysis of AE events in this study, which referred to as “FC0-n,” where “FC0” indicates the type of preform, and
Cluster analysis methods
In this study, nine originally recorded AE signal features were used for cluster analysis: (1) PA, (2) duration (D), (3) rise time, (4) PF, (5) counts, (6) energy (E), (7) frequency centroid (FCoG, frequency where the areas of the frequency spectrum below and above FCoG are the same), (8) RA (rise time divided by peak amplitude value); value (rise time divided by PA), and (9) weighted frequency. 4 Cluster analysis procedure consists of feature selection, cluster algorithm, and cluster evaluation as illustrated in Figure 2, which is performed on Matlab platform. Four-ordered Butterworth high-pass filter was used to eliminate noises from AE signals, and all AE features were normalized ranging from 0 to 1 before data analysis. Through feature selection by Laplacian score 23,24 and correlation coefficient analysis, four AE features which have most cluster ability could be selected, and this procedure can eliminate irrelevant and redundant features so as to improve cluster efficiency and quality. 25 Then, the selected AE features were employed as feature set for principal component analysis and cluster algorithm for cluster analysis. Silhouette coefficient (SC) and Davies–Bouldin (DB) index were employed to evaluate cluster validity. SC combines both cohesion and separation between clusters and measures how distinct or well separated for a cluster apart from other clusters. DB is a ratio of within-cluster distance and between-cluster distance. A higher SC and lower DB indicate better cluster quality. Six algorithms were investigated and compared for cluster analysis of AE signals of composites and were introduced in the following.

Flow chart of the cluster analysis procedure in this study.
k -means
Firstly, choose
k -means++
Fuzzy c-means
The objective function of FCM algorithm is defined in equation (2):
The key difference of FCM and
Self-organized map
SOM is one of the most frequently used artificial neural network in unsupervised learning.
28,29
SOM can be regarded as a competitive neural network and it presents the characteristics of cooperative learning.
15
It can be used to process the multidimensional and nonlinear data, and it is one of the most realistic models of cerebral neurons. In SOM, through the competition among the neighboring neurons, the data distribution pattern can be analyzed. The neurons form into a map in the low dimensional grids, as shown in Figure 3. The procedure of training
determine the number of neurons according to input vectors, and the size of map
assign
choose one input vector
update the weighted vectors of the winning neuron and its neighbor neurons according to Kohonen learning rule, as in equation (3):

Self-organizing map architecture.
where
repeat steps c and d for all input vectors.
In this study, SOM combined with
Adaptive affinity propagation
AAP algorithm
30,31
is an improved affinity propagation (AP) algorithm reported in
Results
AE registration
Figure 4 illustrates the relationship between nine AE features and applied strain of a cross-ply PE/PE specimen. It can be seen that PA and energy have similar distribution. For time-domain AE features, AE events distributed evenly in the amplitude range of 30–100 dB. At low strain, there are mostly AE events with low-medium, short duration and short rise time, fewer counts, and low RA value. When the strain is up to failure strain, a small number of AE events with high PA, long duration and long rise time, few counts, and low RA value generate. For frequency-domain features, low frequency AE events generated at low strain 1–2%, higher frequency AE events come into being at medium strain from 2% to 3.5%, and AE events with highest frequency mainly start to happen at high strain more than 3.5%.

Nine AE features: PA, duration, rise time, energy, counts, RA value, PF, FCoG of gravity, and WF versus strain for a cross-ply graded PE/PE specimen. AE: acoustic emission; PA: peak amplitude; PF: peak frequency; FCoG: frequency centroid; WF: weighted frequency.
Repeatability of AE parameters
Figure 5 shows AE energy as a function of applied strain for three individual tests of cross-ply graded specimens: FC-01, FC-02, and FC-03. For all three cross-ply graded specimens, the energy of most AE events is less than 106 aJ, when strain increases to 4%, AE events with higher energy (>106 aJ) appear, and AE events with moderate energy (103–105 aJ) more likely happen at low and high strain. Figure 6 represents PA and PF histogram for all three specimens. It can be seen that PA and PF distributions follow similar pattern. There are more AE events in the amplitude range between 40 dB and 70 dB, fewer AE events in amplitude range less than 40 dB and over 80 dB, and it is obvious that there are fewest AE events with PA in the range of 70–80 dB, with the number of AE events less than 10. For frequency histogram, it is clear that most AE events concentrate in frequency band of 100–150 kHz, about 41.7–65.1% of all events. In frequency band of 50–100 kHz, the number of AE events is >30 (approximately 16.8–38.4%), whereas for frequency over 150 kHz, there are fewest AE events, less than 20 (approximately 17.7–27.4%). The repeatability of AE events for three specimens can also be obtained from Figures 5 and 6. Given the statistical scatter, AE energy, amplitude and frequency pattern are qualitatively stable for cross-ply graded PE/PE specimens in fiber direction of 0-ply.

AE registration data of energy versus strain for three cross-ply graded PE/PE specimens: FC-01, FC-02, and FC-03. AE: acoustic emission.

(a) PA distribution and (b) PF distribution of three cross-ply graded PE/PE specimens. PA: peak amplitude; PF: peak frequency.
Cluster results
Optimal cluster number
Figure 7 presents cluster validity indexes of different cluster number for all three cross-ply graded composite (FC); specimens. Optimal cluster number is chosen between 2 and 11 when SC is maximized and DB is minimized. It can be seen that for FC-02 and FC-03 specimens, when cluster number is 5, SC is maximum and DB is minimum, which ensures the best cluster quality; for FC-01, the optimum is weak; however, choosing 5 as cluster number for this case also brings low DB value and high SC value. Table 3 summarizes the cluster validity estimations of AE events grouped to 5 clusters for all three FC samples, which reveals that SC is good and acceptable, 0.6 < SC < 0.7 for FC-02 and FC-03, and low SC < 0.6 for FC-01. For all tests, DB index is less than 1, indicating intra-cluster distance is less than inter-cluster distance for clusters, and proves good cluster quality for all tests.

Select optimal cluster numbers by SC and DB index for FC specimens. When cluster number is 5, SC is maximum, and DB index is minimum. SC: Silhouette coefficient; DB: Davies-Bouldin.
Cluster validity index and percentage of AE events for all three cross-ply graded PE/PE specimens.
AE: acoustic emission; SC: Silhouette coefficient; DB: Davies–Bouldin.
Choice of cluster algorithm
The compared six cluster algorithms are all unsupervised, and the aim is to estimate cluster quality of each cluster algorithm for AE data of cross-ply graded PE/PE specimens. Figure 8 shows SC and DB index for a cross-ply PE/PE graded specimen by six cluster algorithms when cluster number is 5. In Figure 9, the cluster validity of AE events in each cluster by SC was investigated for six algorithms, respectively, for DB is relevant to the center of cluster. Larger SC means that the intra-cluster distance is larger than the inter-cluster distance for one AE point attributing to corresponding cluster, and this means this data point is allocated to a correct group. It is obvious that for SOM +

Comparison of six cluster algorithms by SC and DB index. SC: Silhouette coefficient; DB: Davies–Bouldin.

Cluster quality of SC of each cluster to compare six cluster algorithms. SC: Silhouette coefficient.
In the literature by Gutkin,
15
cluster quality of SOM +
Cluster results
Figure 10 presents the percent variance and cumulative variance of each principal component by principal component analysis. It is obvious that the cumulative variances of the first two principal components (Pd1 and Pd2) explain roughly two-thirds of the total variability for all data information, so it is reasonable to limit the presentation of AE data to these two components to better visualize the AE data.

Variance and cumulative variance of the important principal components.
Figure 11 shows the projection of five clusters to 2-D plot by the first two principal components for cross-ply graded specimens FC-01, FC-02, and FC-03, respectively. And the five clusters were designated as CL1, CL2, CL3, CL4, and CL5 for brevity. It can be seen that AE signals are well separated by two components Pd1 and Pd2, which indicate that the inner cluster compactness and intra-cluster separation is satisfied. Moreover, it is necessary to further narrow the representative set of AE parameters for better study. Table 4 lists the eigenvalues of each AE feature for principal components Pd1 and Pd2, and it can be seen that the eigenvalue of PA and FCoG is higher, which demonstrate that PA and frequency contribute most for Pd1 and Pd2, respectively, according to the definition of principal component. And thus it can be concluded that PA and PF are the most important AE parameters in the chosen cluster number of 5, and this can be evidenced by good separation of five clusters in the space of these two parameters in Figure 12.

Cluster results in the first two principal components coordinates for all three FC specimens.
Eigenvalues of four selected AE features for the first two principal components.;
AE: acoustic emission; PA: peak amplitude; PF: peak frequency; FCoG: frequency centroid.
aThe significance of PA for Pd1 is 0.09.
bThe significance of PA for Pd1 is 0.12.

Cluster results in PA and FCoG coordinates for cross-ply graded PE/PE specimens. PA: peak amplitude; FCoG: frequency centroid.
Figure 12 shows AE events in five clusters distributed in PA and FCoG coordinates for all three tests, and it can be seen that the shapes of five clusters for all tests are similar. CL1 has lower PA and lower frequency, CL2 has medium PA and lower frequency like CL1, CL3 has similar PA as CL1 and slightly higher frequency, CL4 has higher PA and frequency, and CL5 has the highest PA and frequency. From Figure 12, it can be seen that CL2 and CL4 are slightly overlapped for FC-01 specimen and CL1, CL3, and CL5 are better separated with each other and the other two clusters. The percentage of AE events number in each cluster and cluster validity index are summarized in Table 3.
Cluster boundaries
The cluster bounds for three specimens are summarized in Table 5. PA range of CL1 is less than 60 dB, and FCoG is less than 500 kHz. CL2 has higher PA than CL1 from 60 dB to 80 dB, but similar FCoG range. PA and frequency band of CL3 are about <70 dB and 500–1000 kHz. AE events in CL4 have similar amplitude range with CL2 and higher frequency of >500 kHz for most AE events. PA of AE events in CL5 is highest, >80 dB, and FCoG of most AE signals is less than 1000 kHz.
Cluster boundaries and corresponding damage modes for cross-ply graded PE/PE composites.
PA: peak amplitude; FCoG: frequency centroid.
For further analysis, five clusters are denoted by AlFl, AmFl, AlFh, AmFh, and AhFb in the following sections, where “l,” “m,” “h,” and “b” are the abbreviations of “low,” “medium,” “high,” and “broad.” “A” refers to PA and “F” is FCoG. Therefore, AlFl represents low amplitude and low frequency cluster, AmFl is medium amplitude and low frequency cluster, AlFh represents cluster with low amplitude and high frequency, AmFh refers to medium amplitude and high frequency cluster, and AhFb denotes cluster with high amplitude and broad frequency.
Discussions
The connections between clusters and damage modes
The cross-ply PE/PE graded FC specimen was loaded in fiber direction of 0-ply. UHMWPE fibers mainly stand load, so LDPE matrix does not present plastic deformation during tensile process. Five damage mechanisms can be concluded and observed for PE/PE graded laminates under tensile loading: matrix cracking, fiber-matrix debonding, delamination, fiber pullout, and fiber breakage, 9 –11,32 shown in scanning electron microscopy (SEM) images (Figure 13). In order to investigate the connections between resulted clusters and different damage modes, five clusters were established according to the results for PE/PE composites in literatures (Table 1). 9,11 In cluster AlFl and AlFh, PA of AE events is less than 60 dB, which should be assumed to relate with matrix cracking or interface debonding. PA of AE events in AmFl and AmFh is 60–80 dB, which can be regarded as correlation with fiber pullout or delamination. PF is not the most representative feature for the clusters, whereas FCoG has good classification ability. It can be seen from the typical frequency spectrum of fiber pullout AE signal in the literature by Wang 10 that FCoG of fiber pullout is more than 500 kHz, so it can be assumed that AmFh corresponds to fiber pullout. AE events in AhFb have highest PA and broad frequency range, possibly generated by delamination. Though frequency range for fiber breakage varies in different studies corresponding, 16,33,34 from approximately 100 kHz to 400 kHz, high-frequency AE events were usually regarded as being generated by fiber breakages. Therefore, the correlations between resulted clusters and different damage modes can be preliminarily concluded and summarized in Table 5 as follows: AlFl—matrix cracking, AmFl—delamination, AlFh—interface debonding, AmFh—fiber pullout, and AhFb—fiber breakage.

SEM micrographs of cross-ply graded composite specimen “FC” after tensile loading. There are five damage modes, including matrix cracking, debonding, delamination, fiber breakage, and fiber pullout. SEM: scanning electron microscope.
Damage initiation and propagation process
Figure 14 shows cumulative AE events and location plots for FC-03 sample. XLoc refers to relative distance between generated AE events in tensile specimen to one attached sensor, with the position near the upper sensor as 0 cm along x direction (Figure 14(c)). It can be seen that AE events in AlFl and AlFh initiate at strain of about 1.3%, then AE events in AlFl increase slowly and concentrated in 2–4 cm. When strain is up to 2.5%, AE events multiply and distribute evenly all over the specimen. AE events in AmFl start to appear at strain 2.5%, and then increase and multiply rapidly, most AE events generated in the position of 2–6 cm. AE events in AmFh and AhFb happen at higher strain, with AmFh a little earlier, and events in AmFh increase slowly at first and accumulate rapidly when close to ultimate failure, and mainly concentrate in the position of 4–6 cm.

(a) AE events in each cluster increasing with strain; (b) AE events x position versus strain for specimen FC-03; (c) the distance between two sensors is about 8 cm, and the position near the upper sensor is assumed as zero along x direction. AE: acoustic emission.
The sharp slope of cumulative AE energy curve was defined as damage threshold in literatures.
35
–37
At the beginning of tensile, there are no acoustic events registered. In further, starting from certain strain emin (AE initiation strain), acoustic events start to occur, with a few AE events. Then, the occurrence rate of AE events with higher energy starts increasing sharply, and soon the energy content reaches higher levels. We assumed that

AE registration energy and cumulative energy with events in each cluster and stress–strain curve for FC-03 specimen. AE: acoustic emission.
Above all, the damage process of the specimen can be predicted and investigated by combining cluster results and damage thresholds. Before lowest damage threshold
Conclusions
In this study, AE registration and corresponding damage modes of cross-ply PE/PE graded composites during tension loading were investigated, and the AE events were studied by unsupervised cluster analysis. Six cluster algorithms were made comparison in order to find the appropriate algorithm for AE data pattern for PE/PE graded composites. PA and FCoG were found to be the most relevant features for the obtained five clusters. The correlations with five clusters and five damage mechanisms were established, and damage initiation and propagation process of a PE/PE graded specimen were discussed. The conclusions can be made as follows:
AE events can be discriminated in five clusters based on nine AE features by feature selection and clustering. PA and FCoG are two crucial parameters in this discrimination: AlFl corresponds to low amplitude and low-frequency events; AmFl relates to moderate amplitude and low-frequency events; AlFh correlates with low amplitude and high-frequency events; AmFh refers to moderate amplitude and high-frequency events; and events in AhFb are high amplitude and broad frequency;
The correlations between clusters and damage modes can be established as follows: AlFl—matrix cracking, AmFl—delamination, AlFh—interface debonding, AmFh—fiber breakage, and AhFb—fiber pullout; the sequence of AE events initiation in the clusters is AlFl = AlFh > AmFl > AmFh > AhFb, and the order of the events number during the whole test is AmFl > AlFl > AlFh > AhFb > AmFh.
Footnotes
Acknowledgements
The authors are grateful to Prof. Stepan Lomov for all the help. The help of Vallen AE system of Johan Vanhulst and SEM equipment of Rudy De Vos in KU Leuven is acknowledged with gratitude.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
