Abstract
To address challenges in orchard environments, including insufficient real-time performance, the trade-off between detection accuracy and model size, and interference from complex lighting and occlusion, this study proposes a lightweight and efficient apple detection model, YOLO-EAD (efficient apple detection). The model enhances YOLOv8 by replacing its backbone with the EfficientViT-T network to reduce computational complexity, introducing a self-attention-based detection head (SA-detect) to streamline the detection branches, and integrating a coordinate attention (CA) mechanism into the neck layer to improve feature focus. Additionally, the SIoU loss function is adopted for more precise bounding box regression. Together, these changes reduce the model size to 4.1 MB and the computational cost to 9.3 GFLOPs while achieving a high mAP@0.5 of 96.7%. Compared with the original YOLOv8s, YOLO-EAD reduces computational complexity by 67.5% while improving detection accuracy and robustness under varied lighting and occlusion conditions, making it well suited to real-time deployment on edge devices in agricultural robotics.
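Of the components the abstract names, the coordinate attention (CA) mechanism is the most self-contained to illustrate: it factorizes spatial attention into two 1-D encodings along the height and width axes, as introduced by Hou et al. (CVPR 2021). The following is a minimal PyTorch sketch of such a block under that paper's design, not the authors' exact implementation; the class name, reduction ratio, and use of average pooling are assumptions.

```python
import torch
import torch.nn as nn

class CoordinateAttention(nn.Module):
    """Minimal coordinate attention block (after Hou et al., CVPR 2021).

    Encodes channel relationships along height and width separately, then
    reweights the input feature map with the two directional attention maps.
    """

    def __init__(self, channels: int, reduction: int = 32):
        super().__init__()
        mid = max(8, channels // reduction)  # assumed bottleneck width
        self.conv1 = nn.Conv2d(channels, mid, kernel_size=1)
        self.bn = nn.BatchNorm2d(mid)
        self.act = nn.Hardswish()
        self.conv_h = nn.Conv2d(mid, channels, kernel_size=1)
        self.conv_w = nn.Conv2d(mid, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        # Direction-aware pooling: average over width and over height.
        x_h = x.mean(dim=3, keepdim=True)                      # (b, c, h, 1)
        x_w = x.mean(dim=2, keepdim=True).permute(0, 1, 3, 2)  # (b, c, w, 1)
        # Shared 1x1 transform over the concatenated directional encodings.
        y = self.act(self.bn(self.conv1(torch.cat([x_h, x_w], dim=2))))
        y_h, y_w = torch.split(y, [h, w], dim=2)
        a_h = torch.sigmoid(self.conv_h(y_h))                       # (b, c, h, 1)
        a_w = torch.sigmoid(self.conv_w(y_w.permute(0, 1, 3, 2)))  # (b, c, 1, w)
        # Broadcast-multiply the two attention maps onto the input.
        return x * a_h * a_w

# Usage sketch: reweight a neck feature map without changing its shape.
feat = torch.randn(1, 256, 40, 40)
out = CoordinateAttention(256)(feat)
assert out.shape == feat.shape
```

Because the block preserves the feature map's shape, it can be dropped between existing neck layers; its cost is dominated by the 1x1 convolutions, which is consistent with the abstract's emphasis on keeping the model lightweight.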
