Abstract
Objective
To explore the advantages of using artificial neural networks (ANNs) to recognize patterns in colposcopy to classify images in colposcopy. PURPOSE: Transversal, descriptive, and analytical study of a quantitative approach with an emphasis on diagnosis. The training test e validation set was composed of images collected from patients who underwent colposcopy. These images were provided by a gynecology clinic located in the city of Criciúma (Brazil). The image database (
Results
After 126 cycles, the validation was performed. The best results reached an accuracy of 72.15%, a sensibility of 69.78%, and a specificity of 68%.
Conclusion
Although the preliminary results still exhibit an average efficiency, the present approach is an innovative and promising technique that should be deeply explored in the context of the present study.
Keywords
Introduction
A colposcopy is a test that allows the cervix to be viewed with up to 10 times magnification,1,2 which is essential for the evaluation of abnormalities or lesions (which are highly suggestive of cervical neoplasia)3,4 during cytopathological examinations and to assist in biopsies (which are essential for definitive diagnoses).5,6
Infection with human papillomavirus (HPV) is a sexually transmitted disease and is considered a preneoplastic lesion. The incidence of HPV infection has increased in recent decades.7–10 The most common manifestation of HPV infection is subclinical, and the estimated prevalence of HPV infection is greater than 50% in sexually active women.11–14
Therefore, seeking a diagnostic imaging technique, such as colposcopy, medical informatics is proving increasingly helpful for certain clinical tasks, such as data storage, information retrieval, indication of warning signs, critical therapy, and image recognition interpretation.15–18
The artificial neural networks (ANNs) used in medical informatics seek to provide methods for the classification of images from mathematical algorithms. ANNs are systems that are able to acquire, store, and use experiential knowledge to mimic the abilities of the biological nervous system. 19 Among the applications of ANNs, we highlight pattern recognition, which can greatly aid in the interpretation of imaging tests such as colposcopy and radiographs. Currently, ANNs have been applied in several areas of medicine and have obtained approximately 80–90% accuracy in areas such as oncology and mastology. 20
In this context, the objective of this study is to use pattern recognition via ANNs for the automated classification of colposcopic images on dot pattern.
Methods
A cross-sectional, descriptive, analytical, quantitative approach, with a diagnostic emphasis, approved by the Ethics and Human Research Board, was conducted. The research was performed under protocol 176/2009. This research complied with the principles of the Declaration of Helsinki.
A hybrid neural networkbased on Kohonen self-organizing maps and multilayer perceptron (MLP) network was used, and this model was chosen because it is a reasonably small net and as a result, it learns faster and reaches good generalization ability with a reasonably small sized training dataset. 21
The training set consisted of images of patients submitted to colposcopy from January 2009 to May 2010 from a gynecology service in the city of Criciúma (Santa Catarina, Brazil). A convenience sample was estimated by combining 179 images taken directly in the digital JPEG format (thus negating a need to scan the images) and authorized by informed consent for the original images. The database of images (
The neural network training was performed after the data collection and was divided into two parts. First, the images were processed without a filter, thereby resulting in 66 trials, with an accuracy rate of 65%. In the second step, each image was processed by the user with contrast enhancement, and then the Euclidean distance of each image was found.
The step after processing is divided into training, testing, and validation. At this stage, we used the application developed by the Research Group on Applied Computational Intelligence and by the Research Group on Information Technology and Communication in Health of the Universidade do Extremo Sul Catarinense (Santa Catarina, Brazil), whose interface is shown in Figure 1. This software offers a hybrid neural network based on Kohonen self-organizing maps and MLP networks. 23

Training of the ANN.
In the Kohonen network training, 48 images were used. Each of these images contained a sub-region of 64 × 64 pixels selected from the original image (Fig. 1) by the software (this was the size that best characterized the dotted pattern, despite tests with 32 × 32 and 128 × 128 images, which resulted in lower accuracy values). The colposcopic image is read into the application, and then the user selects a region of interest with or without the dotted pattern, which will be handled by the library Java Advanced Imaging (JAI) and used in the neural network. The pixels are displayed and stored by the application that used the library JAI, which makes the imaging. When selecting a region on the image to be used, the application stores the 64 × 64 pixels selected by the library JAI area, and for the classification of a new image, the JAI library is used again and each pixel is read to create a neuron network for each image pixel.
The dot pattern was present in 20 of the images and absent in 28 of the images. Such a ratio of images was also used for testing and validation (Fig. 2).

Selecting subregions of size 64 × 64 pixels.
The architecture of the Kohonen network used for training was 64 neurons in the input layer and a two-dimensional lattice of 8 × 8 neurons in the Kohonen layer. The output of this training was then used as the input for the MLP network trained with a backpropagation algorithm that classified the resulting feature map as belonging to either the group of images that show the dot pattern (class 1) or the group of images without this feature (class 2).
The MLP network architecture used for training and classifying these patterns was defined with 3 layers, 64 neurons in the input layer, 45 neurons in the hidden layer, and 1 neuron in the output layer, and started with 10 iterations and ended in 400 iterations; the learning rate started at 0.5 and ended at 0.6, a momentum of 0.4 with 500 iterations.
The Kohonen network structure was defined with two layers accomplished with an initial learning rate of 0.5 ending at 0.30 (although the final value was set at 0.01), a neighborhood size of 5 and an interval decrement in the neighborhood of 10, and the number of iterations starting at 10 and ending at 300.
In the statistical analysis, we calculated the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio (LR+), negative likelihood ratio (LR-), receiver operating characteristic (ROC) curve, area under curve (AUC) and accuracy of the validation of the ANN, using the Microsoft Excel® 2010 and IBM Statistical Package for the Social Sciences (SPSS) 21.0 software.
Results
The database of 170 images was used in training, testing, and validation. To reach the final state of ANN, 126 cycles were performed in the training. The accuracy values obtained in the training ranged from 41.78% to 71.5%, and in the test, from 52.5% to 72.15%. The validation was performed with the highest accuracy of 72.15% for the neural network whose parameters are mentioned in the “Methods” section. Figure 3 illustrates the variation of accuracy in the top 30 tests.

Variation of accuracy.
The sensitivity of the ANN was 69.78%, and the specificity was 68.0%; the PPV was 69.47%, and the NPV was 68.29%; and the calculations revealed an LR+ of 1.58 and an LR- of 1.92. Figure 4 illustrates the ROC curve, and the AUC was 0.73.

ROC curve.
Discussion
Colposcopy is an important imaging method for the diagnosis and classification of cervical lesions. 2 In our study, the accuracy demonstrated by the ANN was 72.15%, which is below the average found in the literature of approximately 85%;24–29 as an example, the study conducted in Recife (Pernambuco, Brazil), 24 which aimed to develop an intelligent system composed of ANNs and applied to the diagnosis of breast cancer in 2009, exhibited an accuracy of 95%.
Claude and colleagues
30
developed a study for colposcopic image classification based on contour parameters used in a comparison study of different ANNs and the
Researchers from Alicante (Spain) evaluated the accuracy of using three ANNs in the diagnosis of urological dysfunctions 25 with the objective of assisting the diagnosis of varied and complex pathologies and helping to identify sick patients while reducing the cost and wear caused by medical treatment. The accuracy obtained was approximately 83%, thereby demonstrating the method's effectiveness in helping the clinician make more accurate diagnoses. 25 Such an accuracy was superior to that of our study, possibly because three ANNs were used instead of a single network.
The standard colposcopic dot type consists of a focal point in which the capillaries form a dot image.1,2 Speckles that are finer and smoother in appearance indicate a greater likeliness of classification of low-grade injury or metaplasia, especially if the intercapillary distance is small. The probability of high-grade injury increases when the dithering is coarser and less regular.1,2
Researchers at the Albert Einstein College of Medicine in the United States evaluated the reduction in the rates of false-negative cervical smears with the help of ANN, and 487 images of smears of 228 women with documented biopsies of high-grade precursor lesions or invasive carcinoma, which were obtained from 10 different laboratories, were analyzed. 26 The instrument was used to detect cytological abnormalities such as atypical squamous or glandular cells of undetermined significance, low- or high-grade squamous intraepithelial lesions, and carcinoma. The results indicate a significant reduction in the rate of false negatives, thereby facilitating early detection of premalignant lesions of the cervix and contributing to prevention of invasive carcinoma of the cervix. 26 Similar to our work, this study contributed to explaining new methods to aid in the early identification of changes in the uterine cervix.
In Brazil, a survey by the Federal University of Rio de Janeiro employed ANNs for the classification of breast lesions through the analysis of samples based on digital images of a small section of examinations via fine-needle aspiration. 27 For the classification of lesions, backpropagation using nine inputs to generate a logic output was used with the trained neural network algorithm, thereby indicating benignity or malignancy. The accuracy was 97%, 27 which is superior to the accuracy observed in our study.
Another Brazilian study performed in São Paulo in 2007 used ANN to classify postural patterns in children with mouth-breathing, 28 and included 84 children, of whom 52 were mouth-breathing and 32 were nasal-breathing. The analysis contained anthropometric variables and measured diaphragm excursion and body posture. In the classification of posture, a sensitivity of 0.98 and a specificity of 0.97 were obtained. These results are also superior to the results obtained in this study. 28
ANNs have also been used for the prediction of infectious diseases, such as hepatitis A, as demonstrated in a study conducted in 2005 in Rio de Janeiro, which achieved a specificity of 99%, a sensitivity of 70%, and an error rate of 12%. 29 A study in Santa Catarina, Brazil, that sought to aid in the diagnostic imaging of primary osteoarthritis of the lumbar spine obtained an accuracy of 62.85% in the use of an ANN for the recognition of image patterns of osteophytes of the lumbar spine in the lateral view. 23
As demonstrated above, the number of applications of ANNs is increasing, and they are increasingly used in various areas of medicine.19,20 In gynecology, ANNs were explored in an Italian study that sought to help classify ovarian masses as malignant or non-malignant using images obtained by transvaginal ultrasound Doppler flowmetry. The tests revealed a sensitivity of 96% and a specificity of 97.7%, which are greater than those in our study, perhaps because a different imaging method was used. 31
Several other studies have been conducted to evaluate the accuracy of neural networks in the classification of mammography images, and all obtained results demonstrate that the method would be effective in aiding in the early detection of suspicious lesions in the breast. 20 Our study also aimed to classify images, but the results were inferior.
Our research considered the use of a hybrid model based on Kohonen self-organizing maps and MLP networks, which was successfully used in previous studies.21,32 Thus, our study presented the first use of hybrid ANNs for pattern classification of dot images obtained via colposcopy. The sensitivity and specificity of the method were approximately 69%. We recommend that future studies conduct tests with more images of patients that present or do not present dot patterns in the colposcopic examination, improve the methodology used in image processing to seek greater system accuracy, and we also recomends to refine the hybrid model used, the use of methods such as random forest and boosting and using other architectures of neural networks. Although the neural network has exhibited moderate results because of the innovative nature of this study, its values demonstrate the potential of ANNs for helping the healthcare professional.
The use of computers to aid in the analysis, interpretation, and classification of images has proven effective in the diagnoses of many medical specialties. The initial goal was not to provide high accuracy but to obtain a performance that is close to that of the specialist, thereby serving as a support tool and not as a substitute method and assisting in the learning process.
Author Contributions
Conceived and designed the experiments: PWS, NBI, RSC, RV, CDV, GPM, ELR, MLS, SAM. Analyzed the data: PWS, NBI. Contributed to the writing of the manuscript: CC, LBC, EC, PJM, RAC. Agree with manuscript results and conclusions: PWS, NBI, RSC, RV, CDV, GPM, ELR, CC, LBC, EC, PJM, RAC, MLS, SAM. Jointly developed the structure and arguments for the paper: PWS, NBI, RSC, RV, MLS, SAM. Made critical revisions and approved final version: PWS, NBI, RSC, RV, CDV, GPM, ELR, CC, LBC, EC, PJM, RAC, MLS, SAM. All authors reviewed and approved of the final manuscript.
