Abstract
BACKGROUND:
Feature selection is a technology that improves the performance result by eliminating overlapping or unrelated features.
OBJECTIVE:
To improve the performance result, this study proposes a new feature selection that uses the distance between the centers.
METHODS:
This study uses the distance between the centers of gravity (DBCG) of the bounded sum of the weighted fuzzy memberships (BSWFMs) supported by a neural network with weighted fuzzy membership (NEWFM).
RESULTS:
Using distance-based feature selection, 22 minimum features with a high performance result are selected, with the shortest DBCG of BSWFMs removed individually from the initial 24 features. The NEWFM used 22 minimum features as inputs to obtain a sensitivity, accuracy, and specificity of 99.3%, 99.5%, and 99.7%, respectively.
CONCLUSIONS:
In this study, only the mean DBCG is used to select the features; in the future, however, it will be necessary to incorporate statistical methods such as the standard deviation, maximum, and normal distribution.
Introduction
Epilepsy, affecting 0.6%–1.0% of the total population, is a condition in which a portion of the nerve cells that make up the human brain are damaged and seizures repeatedly occur owing to excessive electrical discharge from the cerebral cortex [1, 2]. Partial epilepsy is called epilepsy when excessive seizures occur only in one part of the brain, and generalized epilepsy occurs throughout the brain [3]. A condition is diagnosed as epilepsy when seizures in the brain occur recursively for a long period without any special cause [4].
Nonlinear techniques such as correlation dimensions and the Lyapunov exponents have been used to extract the complexity from EEG signal [5, 6, 7]. EEG signal was decomposed into timefrequency domains by a discreet wavelet transforms (DWTs) [8, 9]. In addition, the coefficients produced by the wavelet transformation (WT) are used as inputs for the adaptive neuro-fuzzy inference system (ANFIS) [10]. Some models of epileptic seizure classification have been developed that combine wavelet coefficients with an artificial neural network [11] and a support vector machine [12]. Subasi extracted features based on the wavelet coefficients and for the extracted features used a classification model called a mixture of experts [13]. In addition, Polat classified epileptic seizures using Fourier transforms and a principal component analysis [14]. Convolutional neural networks have also been used for detecting epileptic seizures based on functional near-infrared spectroscopy signal [15], a recurrence quantification analysis [16], and a multi-channel time-series [17]. In addition, deep neural networks have been used to detect epileptic seizures with higher order statistics [18] and different feature scaling techniques [19]. However, as disadvantages of these two models, the initial features are used without a feature selection and cannot provide interpretable general rules, such as fuzzy rules for epileptic seizure classification.
Feature selection has become an important subject in machine learning and pattern recognition [20, 21, 22]. A good feature selection allows benefits such as reduced computational costs or an improved performance result by eliminating unnecessary or noisy features. The rapid increase in data used for learning has also recently led to problems in the learning process [23, 24]. As one of the major problems, large amounts of data are less relevant to each other, which often leads to a misclassification and increases in classification errors as a result of learning using poor data [25, 26].
This study proposes a new distance-based feature selection using an existing neural network with weighted fuzzy membership (NEWFM) [27, 28] to improve the performance result of both normal and epileptic seizure EEG signal. An NEWFM has the bounded sum of weighted fuzzy memberships (BSWFMs) that can indicate the differences in the graphical characteristics between normal and epileptic seizure EEG signal. This study is largely composed of two parts, the first part of which processes the EEG signal, and the second part classifies normal and epileptic seizure EEG signal by selecting the minimum features. In the signal processing section, the WT produces a wavelet coefficient that eliminates noise from the EEG signal. A statistical technique has been used to extract features from wavelet coefficients to be used as inputs.
Feature selection uses the DBCG of the BSWFMs generated by the learning process of the NEWFM to obtain the minimum features that represent the best performance result.
The remainders of this study are organized as follows. Section 2 describes the experimental data, wavelet transformation, and statistical techniques. It also describes the NEWFM. Section 3 then details the NEWFM-based feature selection techniques proposed in this study. Section 4 describes the performance result obtained using feature selection and compares the performance result before and after feature selection. Finally, Section 5 provides some concluding remarks regarding this study.
Overview of the epileptic seizure classification model
As can be seen from the epileptic seizure classification model shown in Fig. 1, the frequency distribution and frequency variation are features extracted from the wavelet coefficients generated from an EEG signal. From the extracted features, the minimum features representing the best performance result were selected using a distance-based feature selection technique.
Normal and epileptic seizure classification model.
In this study, the EEG signal tested by Andrzejak [29] was applied to classify both normal and epileptic seizure EEG signal. The experimental data are divided into five groups of experiments (A, B, C, D, and E) [29]. All experimental groups included 100 single channels EEG signal sections. This study used two groups of experiments A and E applied by Subasi [13]. Experiment A is made up of normal EEG signal from a healthy subject, and Experiment E is made up of epileptic seizure EEG signal from a subject with epilepsy symptoms. The experimental datasets used in this study are listed in Table 1. Each episode in the experimental group consisted of 512 points. Figure 2 shows example normal and epileptic seizure EEG signal.
Number of training and test set
Number of training and test set
Examples of epileptic seizure (left) and normal (right) EEG signal.
WTs are transformation technologies that represent the time and frequency components of a signal, and frequency components vary over time [8, 10, 11]. Typically, Fourier transforms represent frequency components assuming that the signal does not change over time. In contrast, WT is a technique applied to signal processing regions to represent frequency and time components of signals whose frequency components change over time. Using experimental data tested by Andrzejak and Subasi, in this study, Daubechies 4 wavelet transforms (WTs), at a 5-point scale, as described in Fig. 3, were applied to extract the wavelet coefficients, the detail coefficient (g[n]), and the approximation coefficient (a[n]) [4]. Figure 4 shows episodes of wavelet transformed normal and epileptic seizure EEG signal.
Description of features
Description of features
Decomposition of wavelet transform.
Examples of wavelet transformed epileptic seizure (left) and normal (right) EEG signal.
The wavelet coefficients were extracted from the initial 24 features to be used as input by the statistical techniques in Table 2 [30]. The initial 24 features are listed in Table 3. (1), (2), and (5) in Table 2 indicate the EEG signal frequency distribution [13]. In addition, (3) and (4) in Table 2 refer to frequency variations for the EEG signal [13].
An NEWFM is a type of fuzzy neural networks that use the BSWFMs [27, 28]. The NEWFM is composed of three layers: class, hyperbox, and input in Fig. 5. An
Structure of NEWFM.
Example of before and after an adjust (
Example of three BSWFMs.
An adjust (
Both BSWFMs can graphically show the difference between normal and epileptic seizure EEG signal for each feature. In Fig. 8, the fuzzy values of fuzzy membership functions
Detailed descriptions of features based on wavelet coefficients
The numbers in parentheses mean the level of wavelet coefficients. As examples, dc(2) denotes the detailed coefficients, and ac(2) denotes the approximation coefficients, at level 2.
BSWFMs-based on inference system.
Performance results without feature selection
To overcome misclassification and classification errors as a result of learning using poor data, and to improve the performance result, NEWFM uses the DBCG of the BSWFMs to select features that eliminate less relevant data from a large amount of data. This study was carried out using an NEWFM from the initial 24 features. Using the DBCG of the BSWFMs generated through this learning, the feature selection was carried out in four steps.
The values of Using the normalized BSWFM, the centers of gravity for the BSWFM of the normal and epileptic seizure EEG signal are derived. The DBCG of the BSWFMs is derived in Fig. 9, and the DBCGs for the BSWFMs for 24 features are as shown in Table 3. The performance result is compared by individually removing features with the lowest mean DBCG of the BSWFMs.
In Step 4, the performance result is obtained by removing the features with the lowest mean individually based on the results of Table 3. As the results indicate, the performance result was the highest when 22 among the initial 24 features were removed with coefficients dc(2) and dc(5) remaining.
Performance results with feature selection
Comparison of performance results
DBCG of BSWFMs.
The sensitivity, accuracy, and specificity, defined in (2), were used to evaluate the performance result in this study. Here, a true positive (TP) denotes that an epileptic seizure EEG signal is classified properly as an epileptic seizure EEG signal, a false negative (FN) denotes that it is classified as a normal EEG signal, a false positive (FP) denotes that it is falsely classified as an epileptic seizure EEG signal, and a true negative (TN) denotes that it is classified as a normal EEG signal. In this study, the performance result obtained from the 22 minimum features selected through the feature selection based on the DBCG of the BSWFMs, and the performance result obtained from the 24 initial features, are compared in Tables 4–6. Although 0.3% were lower in specificity, 3.3% and 1.5% were higher in sensitivity and accuracy, respectively.
In this study, using feature selection based on the DBCG of the BSWFMs through this learning of the NEWFM, the least important features were removed individually. Eliminating each of these least important features individually while selecting the minimum features for epileptic seizure classification was proposed to obtain the best performance result. Using this feature selection, 22 minimum features obtained from 24 initial features were used as input to the NEWFM to achieve the best performance result. Feature selection based on the DBCG of the BSWFMs minimizes the number of features and enables the highest performance result by eliminating those features that are unnecessary or adversely affect the classification results. NEWFM has an advantage that it takes long time (more than 1 week) to learn data for learning process, however, it takes very short time (less than 1 second) to make a result for classification process.
The sensitivity, accuracy, and specificity when using feature selection based on the DBCG of the BSWFMs were 99.3%, 99.5%, and 99.7%, respectively, which are 3.3% and 1.5% higher and 0.3% lower than those obtained without using feature selection (at 96%, 98%, and 100%), respectively.
In this study, only the mean DBCG was used to select the features; in the future, however, it will be necessary to incorporate statistical methods such as the standard deviation, maximum, and normal distribution using the maximum based on the results listed in Table 3.
Footnotes
Acknowledgments
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) (No. NRF-2019R1F1A1055423).
Conflict of interest
None to report.
