Abstract
Abnormal breast can be diagnosed using the digital mammography. Traditional manual interpretation method cannot yield high accuracy. In this study, we proposed a novel computer-aided diagnosis system for detecting abnormal breasts. Our dataset contains 200 mammogram images with size of 1024 × 1024. First, we segmented the region of interest from mammogram images. Second, the fractional Fourier transform was employed to obtain the unified time–frequency spectrum. Third, spectrum coefficients were reduced by principal component analysis. Finally, both support vector machine and k-nearest neighbors were used and compared. The proposed “weighted-type fractional Fourier transform+principal component analysis+support vector machine” achieved sensitivity of 92.22% ± 4.16%, specificity of 92.10% ± 2.75%, and accuracy of 92.16% ± 3.60%. It is better than both the proposed “weighted-type fractional Fourier transform+principal component analysis+k-nearest neighbors” and other five state-of-the-art approaches in terms of sensitivity, specificity, and accuracy. The proposed computer-aided diagnosis system is effective in detecting abnormal breasts.
Keywords
Background
Breast cancer is a common disease among women. 1 Digital mammogram (DM) is one of the most effective approaches for its early and accurate detection. 2 However, the manual interpretation procedure is hard for physicians. 3 The reasons stem from (1) the complicated anatomic structure of the breast and (2) the similarity of the breast tissues (such as the dense breast) with cancer masses.
This motivated the necessity of developing computer-aided diagnosis (CAD) system.4,5 Roughly, three different aims exist. First is to detect abnormal breasts from mammography. Second is to detect masses. Third is to distinguish malignant masses from benign masses. In this study, we focus on the former one.
In the last decade, scholars have proposed various approaches. Milosevic et al. 6 used in total 20 gray-level co-occurrence matrix (GLCM) features from both mammograms and thermograms. They employed k-nearest neighbors (k-NN), support vector machine (SVM), and naive Bayes classifier (NBC) for classification. Azar and El-Said 7 presented a performance analysis for the diagnosis of the Wisconsin breast cancer, based on six variants of SVMs. Iseri and Oz 8 proposed a new feature extraction method: multi-window-based statistical analysis (MWBSA) for detection of micro-calcification clusters which are early signs of breast cancer. The artificial neural network (ANN) was used as a classifier. Han et al. 9 combined the simplified pulse-coupled neural network and an improved vector active contour without edge, with the aim of mass segmentation in breast cancer. Singh et al. 10 used SVM with texture, shape features, and hierarchical centroid method to classify benign masses and malignant masses. Perez et al. 11 addressed the theoretical description and experimental evaluation of a new feature selection method “uFilter.” They used each dataset for training SVM, linear discriminant analysis, feed-forward backpropagation neural network, and naive Bayes machine learning algorithms to produce scores for statistical comparisons. The results showed uFilter significantly outperformed the U-test method for almost all classification schemes (p < 0.05). Yang et al. 12 identified breast cancer using the integrated information from magnetic resonance imaging (MRI) and mammography. Maximum intensity projection (MIP) and thin-plate spline (TPS) methods were applied to perform information integration. Yakoumakis et al. 13 used Monte Carlo code to assess the radiation dose of contrast-enhanced digital mammography (CEDM). Zheng et al. 14 designed a method for parenchymal texture analysis in DM. Their texture features were calculated within the breast from the intersection points of grid lines, through a local region with center point at each lattice. Anitha and Peter 15 proposed an automatic segmentation method to identify and segment suspicious mass regions, based on maximal cell strength update in cellular automata. Peng et al. 16 used fluorodeoxyglucose (FDG)-positron emission tomography (PET)/computed tomography (CT) for very early breast cancer detection, in women with breast micro-calcification lesions. Molloi et al. 17 employed spectral mammography to evaluate the breast density. Nithya and Santhi 18 offered a survey for CAD of breast cancer. They summarized the latest approaches used for DM, and applied in preprocessing, feature extraction, classification, and segmentation, for breast cancer detection. Gorgel et al. 19 suggested spherical wavelet transform (SWT) and applied to use SVM as classifiers.
All the existing works obtained promising and encouraging results. Nevertheless, the problem remains on how to improve the accuracy (e.g. accurate detection of breast masses). Different from existing approaches, this article offers a novel system to solve this problem with higher accuracy than existing methods. We used fractional Fourier transform (FRFT) to extract distinguishing features. In two-dimensional (2D) situation, FRFT transforms from image domain to the “unified time–frequency domain,” which is demonstrated to have better classification performance than either wavelet domain or frequency domain.20–22 Afterward, we employed principal component analysis (PCA) to reduce fractional Fourier spectrums. Finally, we used two classification methods: SVM and k-NN.
The structure of this article is organized in the following way. Section “Materials” contains the dataset. Section “Methodology” covers the methodology used in this article. Section “Results and discussion” contains the results and discussion. Finally, section “Conclusion and future research” concludes this article and offers future work.
Materials
The open-access mini-Mammographic Image Analysis Society (MIAS) database was used, which can be downloaded from http://peipa.essex.ac.uk/info/mias.html. It contains 322 mammogram images and their sizes are of 1024 × 1024. Their background tissues consist of three types: fatty, dense glandular, and fatty glandular. These different background tissues (especially the latter two ones) enhance the difficulty of this task, since both glands and cancer appear white on a mammogram image, so tumors may be hidden by the dense tissues (see Figure 1).

Illustration of three different background tissues of normal breasts: (a) fatty, (b) fatty glandular, and (c) dense glandular.
We picked up 200 samples from the dataset, 100 normal and 100 abnormal. The latter contains six types including calcification, architectural distortion, spiculated masses, circumscribed masses, ill-defined masses, and asymmetry (see Figure 2). Note that we treated all six types as abnormal, so our task is simplified as to detect abnormal breasts from normal ones.

Illustration of six abnormal types: (a) circumscribed mass, (b) asymmetry, (c) architectural distortion, (d) calcification, (e) ill-defined masses, and (f) spiculated masses.
Methodology
Region-of-interest segmentation
In this step, we extracted the region of interest (ROI)23–25 to avoid useless processing resources and data storage. The preprocessing is a five-step procedure as shown in Figure 3.

Diagram of preprocessing.
In the first step, median filter (MF) is a nonlinear additive noise-reduction approach. 26 It can augment the performance of following processing. MF runs through the image pixel-by-pixel and replaces each pixel with the median of pixels within its approximated window. Usually, we force the window to contain an odd number of pixels, so the median operation is easy to implement. In this study, a 3 × 3 window was selected.
Next, homomorphic filtering (HF) was used to remove multiplicative noises. The diagram of HF is depicted in Figure 4.

Diagram of HF.
Here, f and g represent the original and denoised image, respectively. DFT and IDFT means discrete Fourier transform and its inverse transform. The filter function H is defined as
where D0 represents a specified distance; D(u, v) represents the distance from the point (u, v) to the origin; c is a constant to control the slope sharpness of the filter function; and aL and aH are defined as 0.5 and 2, respectively. Both “ln” and “exp” are logarithmic and exponential functions. Illumination and reflectance are combined in a multiplicative way, 27 the components are turned to be addictive by the logarithmic operation, 28 so they are separable in frequency domain.
Third, image enhancement is performed to give a better subjective view. 29 The image enhancement in spatial domain is the simplest method. Currently, there are three types of transform functions as linear, logarithmic, and power law. In this work, the logarithmic enhancement was used to expand the narrow range of low-level grayscale regions and shrinks the wide range of high-level grayscale regions.
Fourth, the background removal procedure removes the unwanted background contents, on the basis of the region-growing method, which relies on the hypothesis that neighboring pixels within a particular region entail similar gray values. 30 Region-growing method selects initial seed points and compares the seed points with their neighboring pixels. When a similarity criterion is met, then the neighboring pixels are assigned to the cluster that contains the seed points. 31 The above process is iterated until all pixels in the image are segmented into clusters.
Finally, we need to remove the pectoral muscle area, which is usually denser than the normal breast tissue. This is done by applying a thresholding method, whose values were obtained by experience. After this step, the pectoral muscle was isolated and merely the ROI object remains within the mammogram image.
FRFT
In 2D situation, Fourier transform (FT) decomposes an image into frequency domain, and FT coefficients played an important role in many academic domains. 32 A novel variant of FT, namely, FRFT was proven to have better performance than FT. 33
Suppose function x(t) is in the time domain (can be easily extended to 2D situation) and its α-angle FRFT spectrum Xα(f) is in the time–frequency domain, we have
where S denotes the transform kernel with the form of
where i is the imaginary unit. As α approaches a multiple of π, the operators csc and cot diverge. Taking the limit leads to the following clearer expression
where m is arbitrary integer and δ the Dirac delta function. Angular frequency ω can be used to replace time–frequency f so that
here
For mammogram images, 2D-FRFT was implemented with two angles (αx and αy) involved. As αx = αy = 1, the FRFT turns to a standard FT. As αx = αy = 0, the FRFT turns to an identity operator. The 2D-FRFT can be performed by applying one one-dimensional (1D)-FRFT along rows and followed by applying another 1D-FRFT along columns or vice versa. 34
Weighted-type fractional Fourier transform and feature reduction
Weighted-type fractional Fourier transform (WFRFT) is the simplest calculation method of FRFT. It sticks to the rotation property of standard FRFT and meanwhile can be performed by fast algorithms. Mathematically, WFRFT replaced the continuous variables t with the discrete version n and replaced f with k. It defined by following form35–37
From formula (7) we see that WFRFT is the linear combination of identity matrix, time inverse matrix, DFT matrix, and IDFT matrix. Nevertheless, the results of WFRFT are not strictly equal to those of FRFT, 38 that is, there will be slight changes for WFRFT results.
The WFRFT transforms the image to the fractional Fourier domain, and the obtained fractional Fourier spectrums contain too many features, which complicate the classification and increase computational burden. The PCA is able to reduce the data dimensionality in an efficient way. It uses a preset orthogonal transformation, so as to transform time–frequency coefficients Xα into new variables linearly uncorrelated. Those variables are named as principal components (PC).39,40
Classification and cross validation
The reduced features were submitted to (1) SVM and (2) k-NN methods. SVM is the most popular classifier for small-size data. It is based on the principle of minimizing structural risk. The advantage of SVM is that it performs well on the test data, that is, its generalization ability is excellent.41–43
k-NN is a simple yet robust nonlinear classifier. 44 It finds the k closest neighbors to the query sample and predicts it as the majority vote of classes of the k neighbors. The k-NN belongs to the instance-based learning, in which the functions are approximated locally and the training is deferred till classification. 45
The 10-fold stratified cross validation (SCV) was utilized to obtain out-of-sample error. The whole dataset was divided at random into 10 folds, with nearly the same distributions. Each time, nine folds are used for training and the rest for test. This procedure was repeated 10 times in the way that each fold was used as the test set. The classification performance was obtained by combination of all 10 results on the test set.
Above 10-fold SCV ran 50 times in order to eliminate the influence of randomness. The final performances based on a 50 × 10-fold SCV were yielded by calculating the average and standard deviation of the results of 50 runs. Three common measures were used as sensitivity, specificity, and accuracy.
Implementation
The implementation of this CAD system contains two phases. Phase I (offline learning) is to train the classifier and Phase II (online prediction) is to predict the result of query mammograms. Table 1 gives its pseudocode. Note that PCA in online prediction used the coefficient matrix generated by the PCA in offline learning.46–48
Pseudocode of the proposed system.
WFRFT: weighted-type fractional Fourier transform; PCA: principal component analysis; PC: principal component; SVM: support vector machine; k-NN: k-nearest neighbors; SCV: stratified cross validation.
Results and discussion
The algorithm was developed by ourselves based on 64-bit MATLAB 2015a (The Mathworks©). Statistics toolbox was used to accelerate the program developing. The simulation experiments were implemented under Windows 7 operating system, on the platform of P4 IBM with 3.2-GHz processor and 8-GB RAM.
Preprocessing result
Figure 5 shows the result of each procedure during the preprocessing. Figure 5(a) shows an original image of a normal breast with dense glandular background. Figure 5(b) and (c) offers the noise-reduction results of additive noises and multiplicative noises. Here, we can observe the noises are nearly removed. Figure 5(d) provides the logarithmic contrast enhancement, by which the tissues in the mammogram are easily perceivable. Figure 5(e) and (f) takes away the background and pectoral muscle, respectively. In all, the ROI area was perfectly segmented.

Each step of preprocessing: (a) original image, (b) additive noise reduction, (c) multiplicative noise reduction, (d) image enhancement, (e) background removal, and (f) pectoral muscle removal.
WFRFT of 1D simulation data
Figure 6 shows the time–frequency spectrum of a triangular signal tri(t). To show clearly how α affects the spectrum, α is assigned with 11 values from 0 to 1 with increase of 0.1. Remember that for a standard FT, the frequency spectrum turns the triangular signal to a 2-power of sinc function as sinc2(f).

Time–frequency spectrums of tri(t): (a) 0, (b) 0.1, (c) 0.2, (d) 0.3, (e) 0.4, (f) 0.5, (g) 0.6, (h) 0.7, (i) 0.8, (j) 0.9, and (k) 1.0. Here, α is in the range of [0, 1]. The real components were plotted in black, while the imaginary components were plotted in blue.
The gradual change in Figure 6 shows the connection between time–frequency spectrum and the angle α. As α increases to 1, the time–frequency spectrum becomes standard frequency spectrum. As α decreases to 0, the time–frequency spectrum turns to original signal. Therefore, for the mammogram, we set both αx and αy vary from 0.7 to 1 with equal increase of 0.1. We did not consider the range in [0, 0.6] since FRFT with angles close to 0 will yield to original signal.
WFRFT of mammogram image
Figure 7 shows the different WFRFT spectrum images of a normal breast. Following the facts found in section “WFRFT of 1D simulation data,” we set the two angles αx and αy vary from 0.7 to 1 with equal increase of 0.1, respectively. Compared to standard FT that only transforms the mammogram image to one frequency spectrum, our defined WFRFT provides 16 spectrum images. Therefore, the WFRFT can provide more information than FT.

Time–frequency spectrums of a normal breast. All images were log-enhanced and pseudocolor was added for clear vision.
Classifier comparison
The 16 spectrum images obtained by WFRFT were reduced by PCA and then the reduced features were fed into both SVM and k-NN. We compared “WFRFT+PCA+SVM” with “WFRFT+PCA+k-NN.” Here, SVM did not use any kernel. The optimal k in k-NN was obtained by grid searching, namely, from the range of [1, 9] with increment of 2. Finally, we found the k-NN achieved the largest accuracy when k was equal to 3. The results are listed in Table 2.
Classifier comparison.
WFRFT: weighted-type fractional Fourier transform; PCA: principal component analysis; SVM: support vector machine; k-NN: k-nearest neighbors. (Bold means the best).
The comparison results in Table 3 show that the proposed “WFRFT+PCA+SVM” achieved sensitivity of 92.22% ± 4.16%, specificity of 92.10% ± 2.75%, and accuracy of 92.16% ± 3.60%. The proposed “WFRFT+PCA+k-NN” achieved sensitivity of 91.18% ± 3.41%, specificity of 91.56% ± 3.28%, and accuracy of 91.37% ± 3.29%. Hence, it is clear that using SVM obtained larger sensitivity, specificity, and accuracy than using k-NN. We can conclude that SVM is superior to k-NN in the diagnosis of abnormal breasts.
Classification comparison.
GLCM: gray-level co-occurrence matrix; NBC: naive Bayes classifier; MIP: maximum intensity projection; TPS: thin-plate spline; SWT: spherical wavelet transform; WFRFT: weighted-type fractional Fourier transform; PCA: principal component analysis; SVM: support vector machine; k-NN: k-nearest neighbors.
Comparison with state-of-the-art methods
We compared the better one of the two proposed methods, that is, the “WFRFT+PCA+SVM” with five state-of-the-art methods. The comparison results are shown in Table 3.
Data in Table 3 show that the “GLCM+SVM” approach obtained accuracy of 62.0%, “GLCM+k-NN” of 60.7%, “GLCM+NBC” of 55.3%, “MIP+TPS” of 84.8% ± 3.1%, and “SWT+SVM” of 90.1%. Our method obtains the largest accuracy of 92.16% ± 3.60%. Hence, this shows our method performs better than five state-of-the-art approaches. Another finding is WFRFT+PCA can extract more distinguishing features than GLCM, since the accuracy of “WFRFT+PCA+SVM” is larger than that of “GLCM+SVM.” This validates the superiority of WFRFT to GLCM.
Time analysis
Finally, we gave the time analysis of our WFRFT+PCA+SVM method over two stages: the offline learning (Phase I) and online prediction (Phase II). The results are shown in Table 4.
Time analysis (unit: second).
WFRFT: weighted-type fractional Fourier transform; PCA: principal component analysis; SVM: support vector machine; PC: principal component.
The time analysis in Table 4 shows that Phase I costs about 7914.33+0.48+0.69 = 7915.5 s and Phase II costs about 38.15+0.02+0.01 = 38.18 s. Note that Phase I was performed over 200 mammogram images and Phase II was over one query image. A shortcoming of our approach is that the WFRFT process costs too much time (about 38.15 s per image). The reason is because the mammogram image is too large (1024 × 1024).
In the future, we will try to down-sample the mammogram images and test whether the classification performance will decrease. Besides, fast FRFT implementation methods will be included.49,50
Conclusion and future research
We proposed a new approach for CAD of abnormal breast in mammogram images using the combination of three components: WFRFT, PCA, and SVM. The results show that our method yields better performance than five existing methods. Our contributions are as follows: (1) we apply WFRFT in abnormal breast detection and demonstrated WFRFT is effective and (2) the “WFRFT+PCA+SVM” is superior to state-of-the-art methods with regard to accuracy, sensitivity, and specificity.
In the future, thermograms will be considered. Besides, advanced classification methods (shared-weight neural network 51 and deep learning 52 ) shall be included. The combination between WFRFT and entropy will also be considered as the fractional Fourier entropy. Another important thing is to apply our method to other diseases such as Alzheimer’s disease.
Footnotes
Academic Editor: António Mendes Lopes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This paper was supported by NSFC (51407095 and 61503188), Natural Science Foundation of Jiangsu Province (BK20150983 and BK20150982), Jiangsu Key Laboratory of 3D Printing Equipment and Manufacturing (BM2013006), Key Supporting Science and Technology Program (Industry) of Jiangsu Province (BE2012201, BE2013012-2, and BE2014009-3), Program of Natural Science Research of Jiangsu Higher Education Institutions (15KJB470010, 13KJB460011, 14KJB480004, and 14KJB520021), Special Funds for Scientific and Technological Achievement Transformation Project in Jiangsu Province (BA2013058), Nanjing Normal University Research Foundation for Talented Scholars (2013119XGQ0061 and 2014119XGQ0080), and Open Fund of Guangxi Key Laboratory of Manufacturing System & Advanced Manufacturing Technology (15-140-30-008K).
