Abstract
A good understanding of the population dynamics of algal communities is crucial in many ecological and pollution studies of freshwater and oceanic systems. This paper reviews approaches to the automatic identification of algal communities from microscope images using image processing techniques. The techniques of image preprocessing, segmentation, feature extraction, and recognition are considered one by one, and their parameters are summarized. Automatic identification and classification of algal communities is very difficult because of factors such as changes in size and shape with climatic conditions, differing growth periods, and the presence of other microbes. Therefore, the significance, uniqueness, and various approaches are discussed, and the image processing methods are evaluated. Algal identification and the associated problems with water organisms are presented as challenges for image processing applications. Image processing approaches based on texture, shape, and object boundary, as well as segmentation methods such as edge detection and color segmentation, are highlighted. Finally, artificial neural networks and other machine learning algorithms used to classify and identify the algae are described. The benefits and drawbacks of these schemes are also examined.
Introduction
Algae are a very large and diverse group of simple, normally autotrophic organisms, ranging from unicellular to multicellular forms. They affect water properties such as color, odor, taste, and chemical composition, which may pose hazards to human and animal health. 1 They are highly sensitive to changes in their environment. 2 Shifts in algal species and populations can be used to identify environmental changes and nutrient status. 3 Algae are very good biological indicators for water pollution assessment; therefore, they have long been used to assess water quality in lakes, ponds, reservoirs, rivers, and so on. However, identifying algae at the taxonomic level and applying the results in environmental assessment is a difficult process. Several studies have reported the conventional identification of algae from microscopy images, which is a time-consuming process. This has led many researchers to develop systems that automate the analysis and classification of algal images.2,3 An automated computer-based recognition and classification system for the rapid identification of algae would reduce the burden of routine identification on taxonomists.4–6 Such a system would also allow people without specialist knowledge of algae to identify them.
Image processing is an effective technology for analyzing digital images in a wide range of applications. It is used on medical images, spatial images, underwater images, and other biological images. Several studies have been carried out on the biodiversity of algae in India.7–13 However, very little research has been reported on automatic algal identification using image processing techniques.
Most research applied image processing to detect, count, identify, and classify algal groups; some of these approaches were efficient, with 92% accuracy. 14 Some of the tools developed are used for online monitoring, some for measuring the density of microorganisms in water, and others to assist the recognition process through image enhancement, noise elimination, and edge-based segmentation.15–17 A combination of image processing techniques and Artificial Neural Network (ANN) algorithms is used to automate detection and recognition. 18 Other techniques combined image processing with genetic algorithms or ANNs for recognition.15,19–22 MATLAB-based image processing tools were used for the complete enhancement and analytical operations. Automated object recognition segments the algal images and locates candidate objects accurately by their boundary and texture without human interaction. 23 Automatic identification and classification of diatoms with a circular shape was achieved by using contour and texture analysis. 24
Image Processing Methodology
Identification of the algal community from images consists of several steps, namely preprocessing, segmentation, morphological operations, feature extraction, classification, and identification. Figure 1 gives the architectural layout of the image processing method used in the identification and classification of algae. In the following sections, we discuss the function of each processing technique.

Figure 1. Proposed methodology of automatic algal identification.
Image Preprocessing
Correct object detection depends upon many factors, such as the type of illumination, the presence of shadows, the level of noise, the state of focus, the overlapping of objects, and the similarity of the objects to the background.25,26 The digital grayscale images captured from a microscope are preprocessed to reduce the effects of nonuniform illumination and other noise. Median filters (of size 3×3 and 5×5) have been used to reduce image noise.15,27 In the present study, the neighborhood averaging technique was used to enhance the image, and morphological operations were applied to eliminate noise and keep the cyanobacterial structure clear (Fig. 2).

Figure 2. Preprocessed images produced by various filters.
Nonuniform illumination was corrected using the top-hat filter. Neighborhood averaging, Wiener filtering, and median filtering were used to reduce image noise and preserve edges. The performance of the three methods was analyzed statistically, and the results are shown in Table 1. Based on the mean squared error (MSE) and peak signal-to-noise ratio (PSNR) values, the median filter gave better results than the other two methods.
Table 1. Comparison of noise removal filters using MSE and PSNR metrics.
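The filter comparison described above can be illustrated with a small sketch. It is not the implementation used in the study: it is a pure-Python median filter with MSE and PSNR computed against a known clean image. The function names, the edge-replication border handling, the 8-bit peak value of 255, and the tiny test image are all assumptions made for illustration.

```python
import math
from statistics import median

def median_filter(img, size=3):
    """Apply a size x size median filter; borders use edge replication (an assumption)."""
    h, w = len(img), len(img[0])
    r = size // 2
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [
                img[min(max(y + dy, 0), h - 1)][min(max(x + dx, 0), w - 1)]
                for dy in range(-r, r + 1)
                for dx in range(-r, r + 1)
            ]
            out[y][x] = median(window)
    return out

def mse(a, b):
    """Mean squared error between two images of equal size."""
    h, w = len(a), len(a[0])
    return sum((a[y][x] - b[y][x]) ** 2 for y in range(h) for x in range(w)) / (h * w)

def psnr(a, b, peak=255):
    """Peak signal-to-noise ratio in dB; infinite when the images are identical."""
    err = mse(a, b)
    return math.inf if err == 0 else 10 * math.log10(peak ** 2 / err)

# Hypothetical 5x5 uniform patch with a single salt-noise pixel in the centre
clean = [[100] * 5 for _ in range(5)]
noisy = [row[:] for row in clean]
noisy[2][2] = 255
restored = median_filter(noisy, size=3)

print("noisy:   ", mse(clean, noisy), psnr(clean, noisy))
print("restored:", mse(clean, restored), psnr(clean, restored))
```

A lower MSE and a higher PSNR indicate that the filtered image is closer to the reference, which is the criterion used to rank the filters in Table 1.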
Image Segmentation
Objects within each image are separated from the background via a process called segmentation. Segmentation is a key step in image processing.25,26 Algal images showed various shapes for the same species, so the edges and contours of the objects are more meaningful. Much of the research on the automatic identification of algae has used edge detection, typically with the Sobel edge detector. 28 The Canny edge detector is another powerful algorithm for image segmentation.15,24,29
In this study, both the Canny and Sobel edge detection methods were adopted for image segmentation. 25 After the Sobel edge detector was applied, the resulting images had many discontinuities. The Laplacian of Gaussian operator was applied to the Sobel image to smooth it. 28 The Canny edge detector detected the edges of the algae with minimal discontinuities. To remove the remaining discontinuities, the same method was applied several times to the detected edges. The mean squared error of the Canny edge detection method is slightly greater than that of the Sobel method, and its peak signal-to-noise ratio is slightly lower. Overall, the Sobel method gave a better result than the Canny method, as shown in Figure 3 and Table 2.
Table 2. Comparison of the edge detection methods using MSE and PSNR metrics.

Figure 3. Edge detection methods.
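As a rough illustration of Sobel-based segmentation, the sketch below convolves a grayscale image with the two standard 3×3 Sobel kernels and thresholds the gradient magnitude. It is a minimal pure-Python version written for this discussion, not the code used in the study; the threshold value, the decision to leave border pixels at zero, and the synthetic test image are assumptions.

```python
import math

# Standard Sobel kernels for horizontal (x) and vertical (y) gradients
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]

def sobel_magnitude(img):
    """Return the gradient magnitude for interior pixels; border pixels stay 0."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = sum(SOBEL_X[j][i] * img[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            gy = sum(SOBEL_Y[j][i] * img[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            out[y][x] = math.hypot(gx, gy)
    return out

def edge_map(img, threshold):
    """Binary edge map: 1 where the gradient magnitude exceeds the threshold."""
    return [[1 if g > threshold else 0 for g in row] for row in sobel_magnitude(img)]

# Hypothetical image: dark left half (0) and bright right half (255)
img = [[0, 0, 0, 255, 255, 255] for _ in range(5)]
edges = edge_map(img, threshold=200)
for row in edges:
    print(row)
```

The Canny detector adds Gaussian smoothing, non-maximum suppression, and hysteresis thresholding on top of a gradient step like this one, which is why it tends to produce thinner and more continuous edges.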
Feature Extraction
Feature extraction is used to transform a binary or color image from the preprocessing stage into a set of parameters that describe the algal features. 15 Once a feature of interest has been detected, its representation is compared with all the features known to the processor.
There are two main methods for object identification that use boundary information. 26 The first is the Fourier descriptor method, and the second is the moment invariant method. In the Fourier descriptor method, the boundary is divided into N = 2^n parts to produce N equidistant boundary points. The coordinates of these points are then processed using the fast Fourier transform, which produces a frequency-domain representation of the boundary. In the moment invariant method, seven moment invariants can be derived, all of which are invariant to object position and orientation and to changes in magnification. 23
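The Fourier descriptor method can be sketched as follows. The boundary points are treated as complex numbers and transformed with a radix-2 FFT, which is why the number of points must be a power of two. The normalization shown (dropping the first coefficient, dividing by the magnitude of the second, and keeping magnitudes only) is one common convention and is an assumption here, as are the function names and the synthetic circular boundary. The sketch also assumes the boundary is traversed counter-clockwise.

```python
import cmath
import math

def fft(z):
    """Recursive radix-2 FFT; len(z) must be a power of two."""
    n = len(z)
    if n == 1:
        return list(z)
    even = fft(z[0::2])
    odd = fft(z[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out

def fourier_descriptors(points):
    """points: N = 2**n equidistant boundary points given as (x, y) tuples."""
    n = len(points)
    if n < 4 or n & (n - 1):
        raise ValueError("number of boundary points must be a power of two (>= 4)")
    spectrum = fft([complex(x, y) for x, y in points])
    scale = abs(spectrum[1])
    if scale == 0:
        raise ValueError("degenerate boundary (or clockwise traversal)")
    # Skip coefficient 0 (position), divide by |Z1| (size),
    # and keep magnitudes only (rotation and starting point)
    return [abs(c) / scale for c in spectrum[1:]]

# Hypothetical boundary: 16 points on a circle of radius 5 centred at (10, 20)
circle = [(10 + 5 * math.cos(2 * math.pi * k / 16),
           20 + 5 * math.sin(2 * math.pi * k / 16)) for k in range(16)]
fd = fourier_descriptors(circle)
print([round(v, 6) for v in fd])
```

For a perfect circle only the first descriptor is non-zero; more complex outlines spread energy into the higher-order descriptors, which is what makes them usable as shape features.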
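The two-dimensional moments of a digitally sampled M × M image f (x, y), (x, y = 0 … M – 1), are given as

$$m_{pq} = \sum_{x=0}^{M-1} \sum_{y=0}^{M-1} x^{p}\, y^{q}\, f(x, y) \tag{1}$$

where p, q = 0, 1, 2, 3.
The moments of f (x, y) translated by an amount (a, b) are defined as

$$\mu_{pq} = \sum_{x=0}^{M-1} \sum_{y=0}^{M-1} (x + a)^{p}\, (y + b)^{q}\, f(x, y) \tag{2}$$

Thus, the central moments m′pq or μpq can be computed from (2) on substituting $a = -\bar{x}$ and $b = -\bar{y}$, where $\bar{x} = m_{10}/m_{00}$ and $\bar{y} = m_{01}/m_{00}$:

$$\mu_{pq} = \sum_{x=0}^{M-1} \sum_{y=0}^{M-1} (x - \bar{x})^{p}\, (y - \bar{y})^{q}\, f(x, y) \tag{3}$$

When scaling normalization is applied, the central moments change as

$$\eta_{pq} = \frac{\mu_{pq}}{\mu_{00}^{\gamma}}, \qquad \gamma = \frac{p + q}{2} + 1 \tag{4}$$

In particular, Hu defines seven values, computed from the normalized central moments through order three, that are invariant to object scale, position, and orientation. 30 In terms of the normalized central moments, the seven moments are given as

$$
\begin{aligned}
\phi_1 &= \eta_{20} + \eta_{02} \\
\phi_2 &= (\eta_{20} - \eta_{02})^2 + 4\eta_{11}^2 \\
\phi_3 &= (\eta_{30} - 3\eta_{12})^2 + (3\eta_{21} - \eta_{03})^2 \\
\phi_4 &= (\eta_{30} + \eta_{12})^2 + (\eta_{21} + \eta_{03})^2 \\
\phi_5 &= (\eta_{30} - 3\eta_{12})(\eta_{30} + \eta_{12})\left[(\eta_{30} + \eta_{12})^2 - 3(\eta_{21} + \eta_{03})^2\right] \\
       &\quad + (3\eta_{21} - \eta_{03})(\eta_{21} + \eta_{03})\left[3(\eta_{30} + \eta_{12})^2 - (\eta_{21} + \eta_{03})^2\right] \\
\phi_6 &= (\eta_{20} - \eta_{02})\left[(\eta_{30} + \eta_{12})^2 - (\eta_{21} + \eta_{03})^2\right] + 4\eta_{11}(\eta_{30} + \eta_{12})(\eta_{21} + \eta_{03}) \\
\phi_7 &= (3\eta_{21} - \eta_{03})(\eta_{30} + \eta_{12})\left[(\eta_{30} + \eta_{12})^2 - 3(\eta_{21} + \eta_{03})^2\right] \\
       &\quad - (\eta_{30} - 3\eta_{12})(\eta_{21} + \eta_{03})\left[3(\eta_{30} + \eta_{12})^2 - (\eta_{21} + \eta_{03})^2\right]
\end{aligned}
\tag{5}
$$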
The moment invariant features are given in Table 3.
Table 3. Moment invariants for the algae.
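To make the moment invariant method concrete, the following sketch computes Hu's seven invariants for a small image and checks that they are unchanged when the object is shifted or rotated by 90 degrees. It uses the standard textbook definitions of raw, central, and normalized central moments. The function names and the L-shaped binary test object are hypothetical, and the image is indexed as img[y][x].

```python
def raw_moment(img, p, q):
    """Raw moment m_pq of an image stored as img[y][x]."""
    return sum((x ** p) * (y ** q) * v
               for y, row in enumerate(img) for x, v in enumerate(row))

def hu_moments(img):
    """Return Hu's seven moment invariants for a non-empty image."""
    m00 = raw_moment(img, 0, 0)
    xb = raw_moment(img, 1, 0) / m00   # centroid x
    yb = raw_moment(img, 0, 1) / m00   # centroid y

    def eta(p, q):
        # Central moment mu_pq, normalized by mu_00 ** ((p + q) / 2 + 1)
        mu = sum(((x - xb) ** p) * ((y - yb) ** q) * v
                 for y, row in enumerate(img) for x, v in enumerate(row))
        return mu / m00 ** ((p + q) / 2 + 1)

    n20, n02, n11 = eta(2, 0), eta(0, 2), eta(1, 1)
    n30, n03, n21, n12 = eta(3, 0), eta(0, 3), eta(2, 1), eta(1, 2)
    a, b = n30 + n12, n21 + n03
    c, d = n30 - 3 * n12, 3 * n21 - n03
    return [
        n20 + n02,
        (n20 - n02) ** 2 + 4 * n11 ** 2,
        c ** 2 + d ** 2,
        a ** 2 + b ** 2,
        c * a * (a ** 2 - 3 * b ** 2) + d * b * (3 * a ** 2 - b ** 2),
        (n20 - n02) * (a ** 2 - b ** 2) + 4 * n11 * a * b,
        d * a * (a ** 2 - 3 * b ** 2) - c * b * (3 * a ** 2 - b ** 2),
    ]

# Hypothetical L-shaped binary object, a shifted copy, and a 90-degree rotation
shape = [[1, 1, 0],
         [1, 0, 0],
         [1, 0, 0]]
shifted = [[0] * 5 for _ in range(2)] + [[0, 0] + row for row in shape]
rotated = [list(r) for r in zip(*shape[::-1])]

for name, im in (("shape", shape), ("shifted", shifted), ("rotated", rotated)):
    print(name, [round(v, 6) for v in hu_moments(im)])
```

The three printed rows should agree to within floating-point error, which is the property that makes these values suitable as features for objects that appear at arbitrary positions and orientations under the microscope.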
Walker et al 26 used new features for classification. To classify an object into one of a number of classes (eg, Microcystis, Anabaena), it is essential to quantitatively measure characteristics of the object that may indicate its class membership. For example, the feature “area” is an excellent discriminator when classifying cyanobacteria such as Microcystis and Anabaena, as these two genera differ substantially in size. The features of each object used for identification include morphometric properties (area, circularity, and perimeter length), object boundary, shape features, frequency domain features, and spatial statistics such as gray level co-occurrence matrix measures.
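The morphometric properties mentioned above can be measured directly from a segmented binary mask. The sketch below is a simple illustration rather than the method of any cited study. It assumes that area is the pixel count, that perimeter is the number of exposed pixel edges under 4-connectivity, and that circularity is 4πA/P². The function name and the square test object are hypothetical.

```python
import math

def shape_features(mask):
    """Return (area, perimeter, circularity) of a non-empty binary mask."""
    h, w = len(mask), len(mask[0])
    area = sum(sum(row) for row in mask)
    perimeter = 0
    for y in range(h):
        for x in range(w):
            if mask[y][x]:
                # Count each pixel side that faces background or the image border
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if not (0 <= ny < h and 0 <= nx < w) or not mask[ny][nx]:
                        perimeter += 1
    circularity = 4 * math.pi * area / perimeter ** 2
    return area, perimeter, circularity

# Hypothetical object: a 4x4 square inside a 6x6 image
mask = [[1 if 1 <= y <= 4 and 1 <= x <= 4 else 0 for x in range(6)] for y in range(6)]
area, perimeter, circularity = shape_features(mask)
print(area, perimeter, round(circularity, 4))
```

With this definition a compact, round object gives a higher circularity than an elongated or filamentous one, so the measure helps to separate roughly spherical colonies from chain-forming genera.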
The principal component analysis (PCA) method is widely used in image processing applications to reduce the number of features by a normalization process. 1 PCA involves a mathematical procedure that transforms a number of (possibly) correlated variables into a (smaller) number of uncorrelated variables called principal components. The first principal component accounts for as much of the variability in the data as possible, and each succeeding component accounts for as much of the remaining variability as possible. The Fourier spectrum is ideally suited for describing the directionality of periodic or almost periodic two-dimensional patterns in a round image.24,30
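The idea of the first principal component can be shown with a short sketch. It estimates the leading eigenvector of the feature covariance matrix by power iteration, which is a deliberately simple stand-in for the full eigendecomposition that a PCA routine would normally perform. The function name, the iteration count, the all-ones starting vector, and the two-feature sample data are assumptions for illustration.

```python
def first_principal_component(rows, iterations=200):
    """Return (direction, explained variance ratio, scores) for the first component."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix of the centered features
    cov = [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
           for i in range(d)]
    # Power iteration; assumes the start vector is not orthogonal to the answer
    v = [1.0] * d
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    eigenvalue = sum(v[i] * sum(cov[i][j] * v[j] for j in range(d)) for i in range(d))
    total_variance = sum(cov[i][i] for i in range(d))
    scores = [sum(c[j] * v[j] for j in range(d)) for c in centered]
    return v, eigenvalue / total_variance, scores

# Hypothetical, perfectly correlated feature pairs (second feature = 2 x first)
features = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]
direction, explained, scores = first_principal_component(features)
print(direction, explained)
```

Because the two example features are perfectly correlated, a single component captures all of the variance, so the two original features could be replaced by one score per object without losing information.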
Identification
The classification method uses a set of features or parameters to differentiate each object, and these features should be related to the task at hand. A human expert has to determine the classes into which an object may be categorized and provide a set of sample objects with known classes. This set of identified objects is called the training set and is used to train the classification program.
Discriminant analysis has been implemented for classification in the automated recognition of blue-green algae. It is a statistical method that provides a discriminator function for each species. Discriminant analysis may be used for two objectives: to assess the adequacy of classification, given the group memberships of the objects under study; or to assign objects to one of a number of (known) groups of objects.
Gao et al 24 proposed a neural network classifier. To test the performance, networks were designed with 15, 30, 40, 60, or 80 nodes in a single hidden layer and six nodes for each class in the output layer.
Mansoor et al 1 presented a multilayer perceptron feed forward ANN to perform the identification process for selected cyanobacteria. The ANN architecture consists of six inputs, three outputs, and three neurons in a hidden layer, with a learning rate of 0.78 and a momentum of 0.5. The classifier is used to index the database content during the training mode for categorizing purposes.
Walker et al 26 implemented a general Bayes decision function for assumed Gaussian feature distributions with unequal variance–covariance matrices. The resulting decision surface is of hyperquadric form. The targets were only the Anabaena and Microcystis genera, so the microalgae in water samples were classified to the genus level.
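A Bayes decision function of this kind can be sketched for two features as follows. Each class is modelled by a Gaussian with its own mean and covariance matrix, and an object is assigned to the class with the largest log-discriminant. This is a generic quadratic discriminant written for illustration, not the implementation of Walker et al. The feature choice (area and circularity), the sample values, and the equal priors are all hypothetical.

```python
import math

def fit_gaussian(samples):
    """Estimate the mean and the 2x2 covariance (sxx, sxy, syy) from 2-D samples."""
    n = len(samples)
    mx = sum(s[0] for s in samples) / n
    my = sum(s[1] for s in samples) / n
    sxx = sum((s[0] - mx) ** 2 for s in samples) / (n - 1)
    syy = sum((s[1] - my) ** 2 for s in samples) / (n - 1)
    sxy = sum((s[0] - mx) * (s[1] - my) for s in samples) / (n - 1)
    return (mx, my), (sxx, sxy, syy)

def discriminant(x, mean, cov, prior):
    """Log-discriminant of a Gaussian class with its own covariance matrix."""
    sxx, sxy, syy = cov
    det = sxx * syy - sxy * sxy
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    # Squared Mahalanobis distance using the closed-form 2x2 inverse
    maha = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
    return -0.5 * math.log(det) - 0.5 * maha + math.log(prior)

def classify(x, models):
    """Return the class name with the largest discriminant value."""
    return max(models, key=lambda name: discriminant(x, *models[name]))

# Hypothetical training data: (area, circularity) for two genera
microcystis = [(20, 0.90), (22, 0.85), (19, 0.88), (21, 0.92)]
anabaena = [(60, 0.40), (65, 0.35), (58, 0.45), (63, 0.42)]
models = {
    "Microcystis": (*fit_gaussian(microcystis), 0.5),
    "Anabaena": (*fit_gaussian(anabaena), 0.5),
}
print(classify((21, 0.90), models))
print(classify((62, 0.41), models))
```

Because each class keeps its own covariance matrix, the boundary between classes is a quadratic curve rather than a straight line, which corresponds to the hyperquadric decision surface described in the text.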
Fang et al 19 used a perceptron and a feed forward back propagation neural network. The perceptron has six neurons and achieved 100% sensitivity and 39.8% specificity. For the feed forward back propagation network, the result is 97.8% sensitivity and 72.4% specificity for this application.
Anggraini et al 27 implemented a Bayes classifier in each node. The performance of this classification model was evaluated using 20 microphotographs obtained from different blood smears, in which infested erythrocytes were identified with a sensitivity of 92.59% and a specificity of 99.65%.
In this study, a back propagation neural network was used to classify the images; it achieved 100% classification accuracy on the training images and 80% classification accuracy on the test images. The results are shown in Table 4.
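Sensitivity and specificity, the two measures quoted for these classifiers, are computed from the counts of true and false positives and negatives. The short sketch below shows the calculation; the function name and the example labels are hypothetical and do not reproduce any of the reported results.

```python
def sensitivity_specificity(y_true, y_pred, positive=1):
    """Return (sensitivity, specificity) for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    sensitivity = tp / (tp + fn)   # fraction of true positives that are detected
    specificity = tn / (tn + fp)   # fraction of true negatives that are rejected
    return sensitivity, specificity

# Hypothetical labels: 1 = target organism present, 0 = absent
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
print(sens, spec)
```

A classifier that labels almost everything as positive can reach very high sensitivity with poor specificity, so the two values need to be read together when comparing methods.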
Table 4. Observation and analysis of existing systems.
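For readers unfamiliar with back propagation, the following sketch trains a very small feed forward network with one hidden layer by stochastic gradient descent. It is a generic illustration and not the network used in this study: the layer sizes, learning rate, number of epochs, random seed, sigmoid activation, and the four two-feature training samples are all assumptions, and momentum is omitted for brevity.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_mlp(samples, targets, hidden=3, lr=0.5, epochs=2000, seed=0):
    """Train a one-hidden-layer network; returns (predict function, loss history)."""
    rng = random.Random(seed)
    n_in = len(samples[0])
    # Each weight row ends with a bias term
    w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)] for _ in range(hidden)]
    w2 = [rng.uniform(-0.5, 0.5) for _ in range(hidden + 1)]

    def forward(x):
        h = [sigmoid(sum(w * v for w, v in zip(row, x)) + row[-1]) for row in w1]
        y = sigmoid(sum(w * v for w, v in zip(w2, h)) + w2[-1])
        return h, y

    losses = []
    for _ in range(epochs):
        total = 0.0
        for x, t in zip(samples, targets):
            h, y = forward(x)
            total += (y - t) ** 2
            # Error terms for the output and hidden layers (squared-error loss)
            delta_out = (y - t) * y * (1 - y)
            delta_hid = [delta_out * w2[j] * h[j] * (1 - h[j]) for j in range(hidden)]
            for j in range(hidden):
                w2[j] -= lr * delta_out * h[j]
                for i in range(n_in):
                    w1[j][i] -= lr * delta_hid[j] * x[i]
                w1[j][-1] -= lr * delta_hid[j]
            w2[-1] -= lr * delta_out
        losses.append(total / len(samples))
    return (lambda x: forward(x)[1]), losses

# Hypothetical normalized feature vectors for two classes (0 and 1)
samples = [(0.1, 0.9), (0.2, 0.8), (0.8, 0.2), (0.9, 0.1)]
targets = [0, 0, 1, 1]
predict, losses = train_mlp(samples, targets)
print("first and last epoch loss:", losses[0], losses[-1])
print([round(predict(x)) for x in samples])
```

A gap between training and test accuracy such as the one reported above is typical when the training set is small, because the network fits the training images more closely than it generalizes to unseen ones.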
Conclusion
This paper reviewed various techniques of preprocessing, segmentation, feature extraction, and classification in image processing. A detection rate of more than 98% was achieved by combining all the features. In particular, an identification rate of 86.5% was achieved using a neural network. An accuracy of 95% was achieved in the identification and classification of four genera of cyanobacteria using back propagation and shape boundary features, and a classification accuracy of 97% was achieved with feature extraction based on object size, shape, and texture. For automatic algal identification, accuracy was increased by combining features such as shape, size, object boundary, and texture with morphological operators. The automatic identification rate can be increased by using different segmentation methods and developing new features for microscopic algae images.
Author Contributions
Conceived and designed the experiments: NS, CP, PS, SK. Analyzed the data: Wrote the first draft of the manuscript: NS, CP, PS. Contributed to the writing of the manuscript: NS, CP, PS. Agree with manuscript results and conclusions: NS, CP, PS. Jointly developed the structure and arguments for the paper: NS, CP, PS, SK. Made critical revisions and approved final version: NS, CP, PS, SK. All authors reviewed and approved of the final manuscript.
Funding
The authors would like to thank the University Grants Commission, Government of India, for funding this project.
Competing Interests
Author(s) disclose no potential conflicts of interest.
Disclosures and Ethics
As a requirement of publication the authors have provided signed confirmation of their compliance with ethical and legal obligations including but not limited to compliance with ICMJE authorship and competing interests guidelines, that the article is neither under consideration for publication nor published elsewhere, of their compliance with legal and ethical guidelines concerning human and animal research participants (if applicable), and that permission has been obtained for reproduction of any copyrighted material. This article was subject to blind, independent, expert peer review. The reviewers reported no competing interests.
