Abstract
In order to improve the harvesting efficiency of apple harvesting robot, this article presents an apple recognition method based on pulse coupled neural network and genetic Elman neural network (GA-Elman). Firstly, we use pulse coupled neural network to segment the captured 150 images, respectively, and extract six color features of R, G, B, H, S, and I and 10 shape features of circular variance, density, the ratio of perimeter square to area, and Hu invariant moments of segmented images, and these 16 features are considered as the inputs of Elman neural network. In order to overcome some defects of Elman neural network, such as, trapping local minimum easily and determining the number of hidden neurons difficultly; in this article, genetic algorithm is introduced to optimize it, and new optimization way is designed, that is, the connection weights and number of hidden neurons separate encoding and evolving simultaneously, in the process of structural evolution at the same time the learning of connection weights is completed, and then the operating efficiency and recognition precision of Elman model are improved. In order to get more precision neural network, and avoid the influence of fruit recognition caused by branches or leaves shadow, apple along with branches and leaves is allowed to train. The results of experiments show that compared with the traditional back-propagation, Elman neural network, and other two recognition algorithms of obscured fruit. the genetic Elman neural network algorithm is the optimal method which successful training rate can reach to 100%, recognition rate of overlapping fruit and obscured fruit can reach to 88.67% and 93.64%, respectively, and the total recognition rate reaches to 94.88%.
Introduction
With the advancement of science and technology, robotics technology has gradually matured, and apple harvesting robot also makes rapid growth. 1,2 Apple harvesting robot is relatively complex, and the working environment of the visual system as the “eyes” of harvesting robot is a key technology of harvesting robot intelligence. Accompanied by either monocular vision system 3 or binocular vision system, 4 whether identify static fruit, 5 or dynamic fruit, 6 regardless of single fruit, 7 overlapped fruit, 8 and in the near-scene apple, 9 to recognize apple in the night, 10 these have made great progress. Kelman et al. proposed convexity of fruit tree images detection to determine the apple edges and 3-D modeling by the least square constraint mechanism, respectively, fixing the mature individual apple, thereby to reach an accuracy of 94%. 11 Wachs et al. used the maximum interactive information of two forms of images, infrared and color images, to extract high-level and low-level visual features, and in fruit canopy detection, the two feature recognition rates of “green” apples can reach 54% and 74%, respectively. 12 Zhang et al. utilizes support vector machine and super green operator to propose the recognition of nearly color background green apple to achieve the recognition rate of 89.3% in the night 9 ; Li et al. uses binocular stereo vision technology for apple’s three-dimensional position and reduces the search scope and other measures to improve around two image matching precision, with the range error about 5%. 13 The rapid and efficient recognition of apple fruit affects the operating efficiency of harvesting robots directly. However, the image segmentation and object recognition are the bottlenecks restricting its development, which affect the reliability and real-time performance of harvesting robots.
Image segmentation is the premise of feature extraction and target recognition, and the results have a direct impact on the progress of follow-up work. 14,15 For apple image segmentation, in the past, the segmentation algorithm is based on threshold or improved threshold, 16,17 such as K-means clustering, 18 Otsu, 19 and so on, and these methods have achieved good results. However, they mainly depend on image gray level information but ignored the spatial information of image. In this study, the pulse coupled neural network (PCNN) 20 is introduced into the apple image segmentation; the advantage is that both the spatial information and the gray information of the image are considered to obtain a more ideal segmentation. Accurately and rapidly recognition of apple target is still difficult and the key point of apple harvesting robots vision system. In order to further improve the recognition accuracy and speed of apple harvesting robots and meet real-time requirements of apple harvesting operations, it needs to further optimize recognition algorithms. Elman neural network is introduced into apple image recognition, and this model is a feedback neural network with the ability of adapting to time-varying dynamic characteristic and strong global stability. Genetic algorithm (GA) is widely used in optimization. 21,22 In order to overcome the inherent defects and improve the performance of Elman neural network, GA is introduced, and a new optimization method is adopted, that is, the connection weights and hidden neurons are separate encoding and evolving simultaneously, and thus GA-Elman neural network is obtained.
In order to further improve the working efficiency of apple harvesting robot, in this article, a new target fruit recognition method is proposed. First, PCNN algorithm is employed to segment the apple images, which can take into account the color features and spatial information of the images, and the segmentation effect is ideal. Then, the segmented images are processed by corresponding morphological processing, the 6 color features and 10 shape features were extracted, and these 16 features are considered as the typical features of target apples. Finally, a new GA-Elman algorithm is established and used to recognize the target apples, which can improve the operating efficiency and recognition precision.
The rest of the article is organized as follows: In the second section, firstly, the brief introduction of the apple image acquisition is given, the PCNN segmentation algorithm is described, target fruit segmented image is obtained, and then some morphological processing is applied, in order to extract the color and shape features. In the third section, the new thought of GA-Elman algorithm is described, and then to construct a new simultaneous optimization way of connection weight and network structure, the basic steps of new classification algorithm are given. In the fourth section, the experiment is arranged, and the performance of new method is evaluated from multiple perspectives, such as, recognition rate, training times, recognition times, and recognition rate of fruit with different postures. The fifth section provides some conclusion and discussion of the new method.
Apple images segmented by PCNN
In order to better obtain the characteristic information of the apple image and provide the basis for the identification of the target fruit, there is a need to do a series of processing on the image, such as image denoising, image segmentation, feature extraction, and so on. The image segmentation and feature extraction are directly related to the recognition accuracy of the object.
Image acquisition
Image acquisition location: Images were acquired from apple demonstration bases in Dashahe Town, Fengxian county, Jiangsu Province, Jiangsu University apple harvesting robot experiment bases.
Image acquisition information: all images are collected under natural light, including sunny and cloudy conditions, and smooth and backlighting, all the images are collected between 8:00 and 17:00 pm.
Image acquisition device: the camera is connected to the computer, AFT-0814MP (Vision Electronics Co., Ltd), with focal length 8 mm, field of view angle 54°, and closest object distance 0.15 m, and it is made sure that the image acquisition distance is between 50 and 200 cm.
The variety of the apples is Fuji, and a total of 150 apple images were collected in the experiment which include 97 sunny and 53 cloudy, and the number of the fruits is 229. The collected color image of the apple is shown in Figure 1.

Original apple images. (a) Single unobscured fruit; (b) single unobscured fruit; (c) foliage cover the fruit.
In the natural environment, the natural growth posture of apple fruit changes, and the position factors of the image acquisition system of the harvesting robot make the collected apple images to vary, such as single unobscured fruit, adjacent fruit, overlapping fruit, and foliage cover the fruit. The light environment of the collected image and the growth posture of the fruit will affect the recognition efficiency and accuracy of the visual system, thus affect the harvesting efficiency.
PCNN segmentation algorithm
As a third-generation neural network, PCNN is widely used in image processing. PCNN is a self-organizing network that does not require training and the network was constructed by simulating the activities of the mammal’s visual cortex neurons. Through the interaction of neighboring pixels, the neuron can simultaneously pulse with similar input data, which can compensate for the spatial incoherence, time series feature, and small changes in amplitude of input data, thereby completely retaining the area information of the image. 23 –26
When using PCNN for image segmentation, the two-dimensional image matrix M
Therefore, PCNN can group image pixels based on spatial proximity and brightness similarity and apply it to image segmentation, which can achieve better segmentation results. Currently, PCNN image segmentation has achieved good results in many fields. This study is applied to apple image segmentation.
The simplified model of PCNN neurons is shown in Figure 2.

Simplified PCNN neurons model. PCNN: pulse coupled neural network.
The mathematical description of the simplified PCNN neurons model is as follows:
In the input area
Fij is the neuron input, and Iij is the external nerve input excitation, that is, pixel (i, j) corresponds to characteristic value
Lij is the connection entry, Yij is the pulse output, and Wij is the connection coefficient of coupling connection.
In the connection input area
Uij is the internal active item, and β is the connection coefficient for internal active items.
In the pulse generator stage
Eij is the dynamic threshold, and the expression of Eij is as follows
In formula, γ is the dynamic threshold attenuation step and ve is the dynamic threshold amplification factor.
Segmentation effect of apple images
Figure 1 shows an example and the segmented results are shown in Figure 3.

Apple image PCNN segmentation results. (a) Segmented apple; (b) Split leaves; (c) Split tree branches. I, II, III correspondence to Figure 1(a), 1(b), and 1(c). PCNN: pulse coupled neural network.
At present, there are no standards for judging the effect of image segmentation, and most of them depend on intuitive visual. From the results of image segmentation, for background where the space distances are closer and the color differences are similar, it is segmented into the same class. Segmented images of target apple, leaves, and branches are clear, with good visual effects. In this study, PCNN algorithm is introduced, which takes into account the spatial location of the pixels and color difference information. The segmentation result is more ideal.
Good segmentation results are beneficial to the feature extraction of each segmented image, and the extracted features are more efficient. If the segmentation effect is poor, it may cause the extracted features not to express the characteristics of the target, or the extracted features are incomplete, and it is even difficult to extract the effective features of the target, which ultimately influence the recognition effect.
Feature extraction of target fruit
Feature extraction 27 –31 starts from an initial set of measured data and builds derived values (features) intended to be informative and nonredundant, facilitating the subsequent learning and generalization steps and, in some cases, leading to better human interpretations. The training process of the recognition model is based on the training samples consisting of a single fruit and focuses on the completeness of the extracted apple fruit characteristics to fully characterize the apple features. Therefore, we try to choose a single unobscured fruit as a training sample in the selection of training samples. In the process of identification, due to the diversity of apple fruits, the characteristics of the samples extracted from the fruits are not complete, such as blocking and overlapping of fruits and trees, leading to information loss and affecting the identification of fruits. In addition, the lighting environment will also interfere with the recognition. As the shielding area and occluding edge are different, the extracted features are also very different and will affect the fruit recognition. In order to avoid the influence of the above factors, and obtain a more accurate identification neural network model, we train the model with apple fruits, branches and leaves together.
Therefore, feature extraction of training samples must consider not only the apple fruit but also the feature extraction of branches and leaves. We need to intercept a relatively complete leaf and a branch of a branch in each image. When identifying sample image processing, we only extract the characteristics of the target fruit. Depending on the characteristics of apples, leaves, and branches, this study considers the extraction of its color and shape characteristics to better describe the object.
Color feature
The color feature of an image is a global feature used to describe the surface properties of the scene corresponding to the image or image region. It is based on the characteristics of the pixels, and all pixels belonging to the image or image region have their own contributions. The color difference of apple fruit, leaves, and branches is relatively obvious, so the color characteristics of the extracted objects are practical and feasible.
In red-green-blue (RGB) color space, the color can be directly compared, and the characteristic information of the color image object of R, G, and B can be well controlled, and the characteristic of these three components is very effective in the color feature extraction. Because the apple harvesting robot belongs to outdoor operations, we need to consider the influence of light and further extract color features in the hue-saturation-intensity (HSI) color space. HSI color space is similar to the mechanism of human color perception and can adapt to all lighting conditions. The three component features of the image object, such as H, S, and I, are necessary in color feature extraction. In this study, six component features in the RGB and HSI color spaces were extracted to describe the global features of the target.
Shape feature
Because the color is insensitive to changes in the direction and size of the image, color features cannot express the local features of the image object well, so it is necessary to further extract the shape features. Apple fruit, leaves, and branches have their own specific shapes, and the differences are more obvious. Therefore, the shape characteristics of the extracted objects are practical and feasible.
According to the characteristics of the segmented image, we select image geometry characteristics of the object and the Hu invariant moment, in which the geometric feature mainly selects the circle variability, density, and perimeter square area ratio of three eigenvectors, and Hu invariant moment contains seven feature vectors. In this study, a total of 10 shape features were extracted and their biggest feature is that they do not change because of the position, size, and angle of the target in the image. They satisfy the RST (rotation, proportion, and translation) invariance. Hu invariant moments are put into use for extracting the shape features of each connected area in target image, and the seven invariant moments are as follows
Where
GA-Elman recognition algorithm
The recognition of apple fruit can be regarded as a classification problem in essence, while the classification problem is an input/output transformation. The neural network model has unique advantages in dealing with such problems. 32,33 This study uses the Elman neural network design classifier to use the 16 features extracted in the third section as input to the neural network for training modeling.
Thought of GA-Elman algorithm
Elman neural network has been widely used in many fields. 34,35 However, Elman neural network carries forward the advantages of the back-propagation (BP) algorithm and, at the same time, it also inevitably inherited the some inherent disadvantages of the BP network, such as easy to trap in local minimum, resulting in training failure; the learning rate is fixed, which limits the network’s convergence rate; the number of hidden neurons is difficult to determine, thus artificial attempts to waste a lot of time. These deficiencies limit the neural network transport efficiency and recognition accuracy. So, the genetic algorithm is introduced into the Elman neural network, and its optimization is performed in two aspects 36 : first, to optimize the connection weights and thresholds of the network, to improve the network learning efficiency, and to solve the fixed learning rate and the shortcomings of the local minimum; second, to optimize the hidden layer neurons numbers, which can solve the problem in determining the difficult hidden layer structure, and to reduce the time people try to construct the network.
In the previous studies with evolutionary Elman neural network of genetic algorithms, most of them optimize the connection weight or network structure (the number of hidden neurons) individually. It has good results in enhancing network performance, but the effect is still slightly unsatisfied. In the optimization of weights of space, although it obtains the optimal connection weights and thresholds, this needs to be based on the specific network structure; simply optimizing the network structure can save people from trying to determine the number of hidden neurons, but still a weight learning can’t get rid of the shortcomings of the inherited BP algorithm. Therefore, the study proposes a novel optimization method. Elman neural network connection weights and the hidden neurons are coded and simultaneously evolved, constructing a new classification algorithm based on genetic algorithm optimization (GA-Elman) to improve the classification efficiency and accuracy of the model.
Elman neural network
Elman neural network is based on back-BP neural network optimization, adding an undertaking layer in the hidden layer of BP network, and as there is a time delay operator to achieve the purpose of memory, the system has a time-varying dynamic characteristics ability and strong global stability. Elman neural network is a feedback neural network, and its topology structure is shown in Figure 4.

Topology structure of Elman neural network.
The topology structure of Elman neural network is generally made up of four layers: input layer, hidden layer, undertaking layer, and output layer. The undertaking layer is used to memorize the output value of the hidden layer unit at the previous moment and can be considered as one-step delay operator. Based on the BP network structure, the output of the hidden layer is delayed and stored by the receiving layer and self-linked to the input of the hidden layer. This self-association-mode makes it sensitive to historical state data and the addition of an internal feedback network increases the ability of the network itself to process dynamic information.
Setting n as input and m as output, the hidden layer and the following layer have R neurons, the weight of the input layer to the hidden layer is w
1, the weight of the following layer to the hidden layer is w
2, the weight of the hidden layer to the output layer is w
3;
In formula
f is the transfer function of hidden layer and commonly used S type function
g is the transfer function of output layer, and it is often a linear function
Elman neural network uses the BP algorithm for weight correction, and the network error is
tk is the target output vector.
Genetic algorithm
Genetic algorithm is a highly parallel, random, and adaptive search algorithm developed by referring to natural selection and evolution mechanism. It uses a group search technique to represent a set of problems using a population. Through a series of genetic operations such as selection, crossover, mutation, and so on, it can produce a new generation of population and gradually make the population evolve into the state of the approximate optimal solution. GA is the beginning of a population from the potential solution set, and the population is made up of several gene-encoded individuals which are entities with a characteristic chromosome.
Genetic algorithm is just an iterative process, and it has retained a set of candidate solutions in each iteration, sorting according to the advantages and disadvantages of the solution and according to index to choose some solution. The genetic operator is used to operate to produce a new generation of a set of candidate solutions, and this process is repeated until they meet some convergence index. The basic idea of genetic algorithm is shown in Figure 5.

Flow chart of genetic algorithm.
The basic steps of genetic algorithm as follows: Steps 1. Establish initial population composed of character string randomly; Step 2. Calculate the fitness of every individual separately; Step 3. According to genetic probability, exploit duplication, crossover, mutation to generate new populations; Step 4. Repeat execute steps 2 and 3, until termination condition, and select optimal individual as the result of the genetic algorithm.
Construction of GA-Elman algorithm
When the genetic algorithm optimizes the connection weight of the neural network, some penalty items are added to the error function, and we have not to consider whether it is differentiable, which can improve the generality of the network and reduce the complexity of the network, so the optimization of the connection weights has great potential in this area. In optimizing the network structure, it can optimize different topologies in the face of diverse tasks to improve the insufficiency of the number of hidden neurons in the Elman network.
This study provides a method to optimize the connection weights and structure of neural networks. The main contents of the GA-Elman algorithm are chromosome encoding, definition of fitness function, and genetic operator construction, and the algorithm flow is shown in Figure 6.

Flow chart of combined genetic algorithm with neural network.
Coding scheme
In the GA-Elman design process, the biggest problem encountered is the way to design a reasonable program to encode the network connection weights, output thresholds, and the number of hidden neurons. In this article, we adopt the methods of separate encoding and simultaneous evolution.
The number of hidden neurons is coded with real number and the upper limit p is set. In addition, a binary code to the hidden layer is added as controlling gene which can be generated by a random function. When the value of the control gene is 0, corresponding to the hidden layer node has no effect on the output layer; when the value of the control gene is 1, it is equivalent to this hidden layer node, which influences the output layer. For the input n and output m of Elman neural network, its coding length is as follows
In formula, the first p is the length of the hidden neurons, n × p indicates the length of the weight of the input layer to the hidden layer, p × m indicates the length of the weight of the hidden layer to the output layer, and the second p is the length of the coding length of the controlling code. Because the undertake layer of the Elman neural network corresponds to the hidden layer, the coding scheme is the same as the hidden layer.
The connection weights are mainly encoded by real numbers, that is, each weight is directly represented by a real number. The expression is intuitive and overcomes the drawbacks of the original binary coding. This coding method is based on the code length of the network structure that has been determined, and its coding length is as follows
In formula, n × p indicates the length of the weight of the input layer to the hidden layer, p × m indicates the length of the weight of the hidden layer to the output layer, p is the threshold coding length of hidden neurons, and m is the threshold encoding length of neurons in the output layer. Consider the threshold in the weight code. Thus, this study is a piece of evolution together with a threshold.
Fitness calculation
We use the training error of the network to determine the fitness of the chromosome corresponding to the network and set fitness function as
C is a constant and e is the error. It is generally accepted that a network with a large error has a low degree of fitness.
Genetic operator construction
The selection operator uses “roulette” selection, namely, the probability of everyone entering the next generation is equal to the radio of that adaptive value and total individual adaptation value of whole population. The higher the fitness value, the greater the probability of entering the next generation.
Crossover operator, using the “single point crossover”, partly exchanges or recombines the parental genes according to a certain probability.
Mutation operator binary coding uses the “inverted” mutations, and real coding is Gaussian.
Using the elite reservation strategy, individuals that searched currently have the highest degree of fitness, which will reserve to the next generation directly and prevent the loss or destruction of the crossover and mutation operations, thereby to ensure the global convergence of the genetic algorithm.
GA-Elman recognition algorithm
In this study, the optimization of both connection weights and hidden layer structures is performed simultaneously to maximize the optimization of Elman neural networks. According to the structure of the “Obscured fruit recognition” section, a new optimization method is used to code the simultaneous evolution of the two methods. That is, while learning the structure of the hidden layer neurons, the connection weights can be learned. The new idea not only improves the network’s operating speed and recognition accuracy but also improves the generalization ability of the network to obtain a more accurate network model.
In this article, the basic steps of the optimization algorithm are as follows: Step 1. Unifying the weights and thresholds of Elman network and, at the same time, coding the real number, the number of hidden layer neurons is coded with real number and the upper limit is set, adding the binary coded control gene to the hidden layer neurons; Step 2. Decoding the one code, we can get different networks; Step 3. The network from step 2 is trained by a given training to determine whether to meet the accuracy requirements, if satisfied, the training stops and turns to step 7, otherwise it transfers to step 4; Step 4. According to the objective function and the training results to determine individual fitness, we use the elite retention strategy and select a number of individuals with the largest fitness value, directly inherited to the next generation; Step 5. Using crossover and mutation operators to deal with the current generation of groups to generate the next generation of groups; Step 6. Determining whether the set of genetic algebra is reached, then return to step 3, otherwise return to step 2; Step 7. The optimization of Elman neural network model is obtained.
The feature of the extracted segmented image is modeled according to the above algorithm steps to realize the recognition of apple fruit, increase its recognition efficiency and precision, and further improve the harvesting efficiency of the apple harvesting robot.
Experiments
Experimental design and environment
In Fengxian, the China’s township of apple, 400 images are captured from apple orchard, and they are divided into training set and testing data according to 80/20, that is, 320 images as training samples and 80 images as training samples. In order to avoid the influence of the branches and leaves, the training samples also consider with the branches and leaves.
In order to better verify the superiority of the optimized algorithm, two groups experiments are arranged. One is compared with BP, Elman neural network, to show the performance of new algorithm optimized by GA. One is compared with some obscured fruit recognition algorithms, because single unobscured fruit recognition is no longer a challenge.
Test operation platform: CPU Intel Core2 Duo E7300 2.66 GHz, RAM 1.99 GB, Graphics card Intel® G33/G31 ECF; operating environment: 32 bit Windows 7, MatlabR2012b.
Recognition effect
Parameter setting
In training process, there are three indicators to measure the learning ability of each recognition model, such as training success rate, convergence, and run time. In the testing process, the cognitive performance of the trained model is evaluated from three aspects: recognition rate, recognition time, and error. According to different postures of target apples, the recognition rate of non-mask fruits, overlapped fruits, and fruits blocked by leaves and branches was calculated, respectively, and the total recognition rate was determined, to verify the generalization capabilities of three network models.
Genetic algorithm parameter setting: population size is 30, GA maximum iteration number is 300 times, the crossover rate is 0.9, the mutation rate is 0.01, the elite retention strategy selects four optimal individuals, the target error is MSE 1e-5, Elman is 2000 times, and traingdx is used as learning algorithm (a gradient descent learning algorithm with adaptive learning rate). Using Gao’s empirical formula, the number of neurons in hidden layer BP neural network is set 37
where s is the number of hidden neurons, m is the input, and n is the output.
Neural network training
In the 320 training samples, apple fruits, integrity leaves, and branches were extracted from the segmented images. There are 483 apples in 320 training images, and 50 leaves and 50 branches are selected to participate in training, simultaneously, thus 583 samples were considered as the training set. The extracted 16 features of each sample are used as the inputs of the neural network, and the outputs is three classes. So the neural network structure with sixteen input and three output is designed. Three kinds of neural networks for 100 times training, the statistical training success rate, three steps of convergences, training time, and the average values were listed in Table 1.
Comparison of the training performance of each algorithm.
GA: genetic algorithm; BP: back-propagation.
From Table 1 shows that the traditional Elman neural network is slightly higher than the BP network in training success rate; after optimizing the genetic algorithm, GA-Elman model’s training success rate greatly increased, and it can reach 100%. For the steps of convergence, the Elman model is far lower than the BP network, indicating that the Elman model has better convergence performance. GA-Elman converged quickly and the optimization of genetic algorithm accelerated the convergence speed. From the training time, the shortest is Elman neural network. While GA-Elman algorithm takes the longest, because genetic optimization iteration time cost more, that is, by the time, cost is exchanged for training success ratio.
Recognition and verify
GA-Elman model was used as a classifier of the apple fruit to separate three types of fruit images from the background in terms of recognition effect. Then, a series of subsequent operations, such as binarization, small area denoising, contour extraction, and the boundary fitting, were carried out to obtain the final recognition effect, as shown in Figure 7. If the fruit was overlapped or obscured by the branches and leaves, we wouldn’t get the complete feature information. Then, the information will be lost, which will influence classification effect. In serious cases, misunderstandings or omissions may occur, as shown in Figure 8.

Sample figure of correctly recognize fruit. (a) original image, (b) preliminary identification, (c) binarization, (d) small area denoising, (e) contour extraction, (f) fitting. I an unobscured fruit, II a slight shade, III a partial shade, IV an overlapping fruit.

Sample figure of missing recognize fruit. (a) original image, (b) preliminary identification, (c) binarization, (d) small area denoising, (e) contour extraction, and (f) fitting.
For operating efficiency and recognition rate of three models, the three models are evaluated from the following two aspects: one is recognition rate of fruits with different growth postures, the other is counting the total recognition rate, running time, and error. The recognition rate is
Running time is the cost of each recognition operation, and its average value is recorded. The squared sum error between the output value of the model and the true value of the sample is calculated according to formula 11. The test was repeated 100 times and the results are shown in Table 2.
Comparison of the recognition performance of each algorithm.
GA: genetic algorithm; BP: back-propagation.
There are 139 apples in 80 testing images, 50 fruits were unobscured, 36 were overlapped, and 53 were obscured by branches or leaves, with recognition rate of images under different growth postures.
From Table 2, we can see that the recognition effect of the three identification models on unobscured fruits is more than 99%, so it is feasible to use the neural network algorithm in apple fruits recognition. Due to the presence of interference factors, we can’t recognize the lapped fruits well and the correct recognition rate slightly decreased. GA-Elman model of cognition’s effect is the best and its average recognition rate of lapped fruits and covering fruits can reach 88.67% and 93.64%, which were higher than the not-optimized traditional Elman identification model.
By evaluating the total recognition rate, model BP was found to be the lowest and model GA-Elman the highest, reaching 94.88%. From the sum of squared errors, the new model still performs optimally. GA-Elman NN model has the highest recognition and the least error, indicating that the model has the best identification accuracy. From the recognition time evaluation, the three recognition models are relatively close. Relatively speaking, the GA-Elman model is the shortest, which only needs 1.89 s to identity 179 samples of apple.
From the above experimental results, we came to know that three kinds of network algorithms can be trained for the training samples of apple image from the network model training process. From the recognition effect, for the single unobscured fruit, the recognition rates all of this three models are more than 99%, thus the neural network recognition algorithm is suitable for apple fruit recognition. The overall performance of the Elman neural network is better than BP neural network model either from the training performance of the network or from the recognition performance, because the Elman network has one more undertaking layer with respect to the BP network in structure and plays a memory role.
From the perspective of the training process of three kinds of algorithms, the training success rate of GA-Elman neural network is the highest, but the training takes long time. This demonstrates that genetic optimization is an iterative process at the expense of time, it can accelerate the convergence of the neural network, and save the time that artificially attempt network architecture. From the recognition effect, the recognition rate of the fruits with overlap and branches and leaves is improved greatly, the total recognition rate of the three types of fruits is also significantly increased, with smaller error. Therefore, the GA-Elman neural network model has the highest recognition accuracy.
Obscured fruit recognition
In the current target fruit recognition algorithm of apple image, it is no longer a difficult problem to recognize a single unobscured target fruit. However, it is still very difficult to recognize obscured by apples, leaves, or branches, and the recognition rate is not high enough. In this experiment, two recognition algorithms of obscured apples 33,34 are compared. The experimental data sets are 36 overlapped apples and 53 obscured apples, and the test was repeated 100 times and the results are presented in Table 3.
Performance comparison of fruit recognition algorithms for obscured targets.
It is worth noting that the obscured degree and the overlapped degree of fruit affect the identification of apple fruit to a large extent. The statistics of 118 overlapping and obscured fruits were carried out. When the area covered by the fruit was less than 30%, the recognition rate has almost no effect; and the covered area was between 30% and 60%, the recognition effect was affected, and it was easy to cause errors. When the area covered is greater than 60%, or even larger, identifying the difficulties caused by the appearance of a greater chance of misunderstanding or leakage identification could not be recognized.
Results analysis
According to the results of the first group experimental, it can be known from the network model training process that three network algorithms can be trained for training samples of apple images. From the recognition effect point of view, for the single unobscured fruit, the recognition rate of the three models is greater than 99%, so the neural network recognition algorithm is suitable for apple fruit recognition. Comparing the training processes of these three algorithms, GA-Elman neural network has the highest training success rate, but the training time is longer. This shows that genetic optimization is an iterative process at the cost of time, which can accelerate the convergence of neural networks and save the time of manually attempting the network structure. Judging from the recognition effect, the recognition rate of fruits obscured by fruits or branches and leaves has been greatly improved, and the total recognition rate of the three types of fruits has also been significantly improved with less error. Therefore, GA-Elman neural network model has the highest recognition precision.
Recognition of obscured and overlapped apples is still a difficult problem; from second group experimental results, our method is better than the other two methods. However, when the area is too large to be covered by branches and leaves, the extraction of apple fruit is not complete, and the loss of information is relatively large, which leads the sample to be the outlier samples. Outlier samples are still difficulty to machine learn, and so how to optimize the outlier samples will be the focus of the next research, thereby improving the recognition efficiency.
Conclusions and further works
PCNN is applied in apple image segmentation. Apple fruit, leaves, and branches got that based on the image space approach and the gray level similarity, the segmentation effect is good and reaches the expectation. As for split image, six color features were extracted from R, G, B, H, S and I channels respectively, Hu invariant moment these 10-shape feature vector. The extracted 16 feature vectors can describe the global and local features of the object well, and the extracted features are effective. After the Elman neural network is optimized by the genetic algorithm, it accelerates the network convergence, improves the generalization ability of the network, and greatly improves the training success rate, which are reach 100% in this study. Only genetic optimization pays the price of time, and network learning time is slightly longer. Neural network models have high recognition efficiency for unobscured apples, which is close to 100%, and the neural network for recognition of apple fruit is feasible. For the recognition of overlapped apples, the correct recognition rate of the GA-Elman optimization model is significantly higher than the other models. It shows that the new GA-Elman model improves the efficiency of network operation and the highest recognition accuracy.
In conclusion, GA-Elman algorithm will consume more time; however, in the training process, the convergence speed will be improved, not only training success rate is increasing, but also generalization ability is enhancing. It can reduce the time of artificial attempts. The recent literatures about real-life applications of contemporary automation techniques in different fields give us new inspiration, using other intelligent computation 39 –42 to optimize neural network, in order to get more superior model; at the same time, we try to use deep learning algorithm 43 –46 to recognize the apple target fruit. We expect that the new algorithm can be applied in more fields, such as target recognition, 47 –50 sliding control, 51 –54 and so on.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work is supported by Natural Science Foundation of Shandong Province in China (no.: ZR2017BC013); China Postdoctoral Science Foundation (no.: 2018M630797); National Nature Science Foundation of China (no.: 31571571, 61572300, 61379101); and Taishan Scholar Program of Shandong Province of China (no.: TSHW201502038).
