Abstract
To obtain the deeper features of the target image in the complex environment and to improve the robustness and recognition accuracy of the algorithm, first, taking images in Corel5K database as the research object, based on Faster Region-based Convolutional Neural Network (R-CNN) structure, ReLu and Softplus activation functions are used to improve the activation function in the model. The model is used to extract features from images, the region proposal network is used to generate the region proposal of images, and the proposal filtering algorithm based on hierarchical clustering is used to select the target in the region proposal. Then, the spatial pyramid pooling in the original R-CNN structure is replaced with the region of interest (ROI) pooling layer to improve the model, and the classification and identification of targets in the region proposal images are carried out. Based on the gradient descent algorithm, the weights in the model are optimized to improve the recognition accuracy of the model. Finally, the object in the image is detected. Softmax classifier is used to classify the proposal feature images, and category of the object is obtained. After optimization of non-maximum suppression, the regressor is used to update the parameters in the model, and the graphic recognition and retrieval system is constructed. The recognition accuracy of the improved activation function in this study is up to 99.88%, and when the size of the pooling layer is 12*12, the recognition accuracy is up to 98.11%. The significance of this study lies in the successful enhancement of the Faster R-CNN model, resulting in a notable reduction in running time from its previous iteration by 0.37 hours, with the improved model now completing tasks in 0.88 hours. Furthermore, the development of the image retrieval module enables semantic-based image retrieval, allowing for more precise and efficient retrieval of images. Ultimately, these advancements underscore the model’s efficacy in target recognition and its ability to facilitate the recognition and retrieval of graphs, showcasing its potential for various applications in computer vision and image processing tasks.
Get full access to this article
View all access options for this article.
