Sage Journals: Discover world-class research

Abstract

The first major contribution of the paper is the proposal of using an improved DEtection Transformer network (named R2N-DETR) and Kinect-V2 camera for detecting multiple-size peaches under orchards with varied illumination and fruit occlusion. R2N-DETR model first employed Res2Net-50 to extract a fused low-high level feature map containing fine spatial features and precise semantic information of multi-size peaches from Red-Green-Blue-Depth (RGB-D) images. Second, the encoder-decoder was performed on the feature map to obtain the global context. Finally, all detected objects were detected according to each object’s global context. For the detection of 1101 RGB-D images (imaged from two orchards over three years), the R2N-DETR model achieves an average precision of 0.944 and an average detecting time of 53 ms for each image. The developed system could provide precise visual guidance for robotic picking and contribute to improving yield prediction by providing accurate fruit counting.

Keywords

Deep learning peach detection RGB-D image R2N-DETR open orchard

Get full access to this article

View all access options for this article.

References

and Wang

, Genetic resources, breeding programs in China, and gene mining of peach: A review, Horticultural Plant Journal 6 (2020), 205–215. doi: 10.1016/j.hpj.2020.06.001.

Saedi

S.I.

and Khosravi

, A deep neural network approach towards real-time on-branch fruit recognition for precision horticulture, Expert Systems with Applications 159 (2020), 113594. doi: 10.1016/j.eswa.2020.113594.

Wang

and Yang

, Water quality monitoring and evaluation using remote sensing techniques in China: A systematic review, Ecosystem Health and Sustainability 5 (2019), 47–56. doi: 10.1080/20964129.2019.1571443.

Kapach

Barnea

Mairon

Edan

and Ben-Shahar

, Computer vision for fruit harvesting robots–state of the art and challenges ahead, International Journal of Computational Vision and Robotics 3 (2012), 4–34. doi: 10.1504/IJCVR..

Wang

Chen

Zhang

and Zhang

, Applications of machine vision in agricultural robot navigation: A review, Computers and Electronics in Agriculture 198 (2022), 107085. doi: 10.1016/j.compag.2022.107085.

Wang

Liu

Chen

Huang

and Han

, A band selection approach based on a modified gray wolf optimizer and weight updating of bands for hyperspectral image, Applied Soft Computing 112 (2021), 107805. doi: 10.1016/j.asoc.2021.107805.

Majeed

Zhang

Karkee

and Zhang

, Faster R–CNN–based apple detection in dense-foliage fruiting-wall trees using RGB and depth features for robotic harvesting, Biosystems Engineering 197 (2020), 245–256. doi: 10.1016/j.biosystemseng.2020.07.007.

Barnea

Mairon

and Ben-Shahar

, Colour-agnostic shape-based 3D fruit detection for crop harvesting robots, Biosystems Engineering 146 (2016), 57–70. doi: 10.1016/j.biosystemseng.2016.01.013.

Zou

Shi

Guo

and Ye

, Object detection in 20 years: A survey, arXiv preprint arXiv:1905.05055, 2019. doi: 10.48550/arXiv.1905.05055.

10.

Zhao

Z.-Q.

Zheng

S.-T.

and Wu

, Object detection with deep learning: A review, IEEE Transactions on Neural Networks and Learning Systems 30 (2019), 3212–3232. doi: 10.1109/TNNLS.2018.2876865.

11.

Diwan

Anirudh

and Tembhurne

J.V.

, Object detection using YOLO: challenges, architectural successors, datasets and applications, Multimedia Tools and Applications (2022), 1–33. doi: 10.1007/s11042-022-13644-y.

12.

X.-T.

and Jo

K.-H.

, A review on anchor assignment and sampling heuristics in deep learning-based object detection, Neurocomputing 506 (2022), 96–116. doi: 10.1016/j.neucom.2022.07.003.

13.

Xue

Zheng

Wan

and Mao

, Detection of passion fruits and maturity classification using Red-Green-Blue Depth images, Biosystems Engineering 175 (2018), 156–167. doi: 10.1016/j.biosystemseng.2018.09.004.

14.

Zhu

Huang

and Guo

, Using color and 3D geometry features to segment fruit point cloud and improve fruit recognition accuracy, Computers and Electronics in Agriculture 174 (2020), 105475. doi: 10.1016/j.compag.2020.105475.

15.

Song

Liu

and Cui

, Kiwifruit detection in field images using Faster R-CNN with VGG16, IFAC-PapersOnLine 52 (2019), 76–81. doi: 10.1016/j.ifacol.2019.12.500.

16.

Chu

Lammers

and Liu

, Deep learning-based apple detection using a suppression mask R-CNN, Pattern Recognition Letters 147 (2021), 206–211. doi: 10.1016/j.patrec.2021.04.022.

17.

Tian

Yang

Wang

and Liang

, Apple detection during different growth stages in orchards using the improved YOLO-V3 model, Computers and Electronics in Agriculture 157 (2019), 417–426. doi: 10.1016/j.compag.2019.01.012.

18.

Mirhaji

Soleymani

Asakereh

and Mehdizadeh

S.A.

, Fruit detection and load estimation of an orange orchard using the YOLO models through simple approaches in different imaging and illumination conditions, Computers and Electronics in Agriculture 191 (2021), 106533. doi: 10.1016/j.compag.2021.106533.

19.

Zheng

Gao

Zhang

Wang

and Dong

, End-to-end object detection with adaptive clustering transformer, arXiv preprint arXiv:2011.09315, 2020. doi: 10.48550/arXiv.2011.09315.

20.

Heo

Yun

Han

Chun

Choe

and Oh

S.J.

, Rethinking spatial dimensions of vision transformers, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, 11936–11945.

21.

Carion

Massa

Synnaeve

Usunier

Kirillov

and Zagoruyko

, End-to-end object detection with transformers, in: European Conference on Computer Vision, Springer, 2020, 213–229.

22.

Lin

T.-Y.

Maire

Belongie

Hays

Perona

Ramanan

Dollár

and Zitnick

C.L.

, Microsoft coco: Common objects in context, in: European Conference on Computer Vision, Springer, 2014, 740–755.

23.

Zhu

Wang

and Dai

, Deformable detr: Deformable transformers for end-to-end object detection, arXiv preprint arXiv:2010.04159, 2020. doi: 10.48550/arXiv.2010.04159.

24.

Feng

Liu

Gao

Majeed

Al-Mallahi

Zhang

and Cui

, Fast and accurate detection of kiwifruit in orchard using improved YOLOv3-tiny model, Precision Agriculture 22 (2021), 754–776. doi: 10.1007/.

25.

Gao

S.-H.

Cheng

M.-M.

Zhao

Zhang

X.-Y.

Yang

M.-H.

and Torr

, Res2net: A new multi-scale backbone architecture, IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (2019), 652–662. doi: 10.1109/TPAMI.2019.2938758.

26.

Vaswani

Shazeer

Parmar

Uszkoreit

Jones

Gomez

A.N.

Kaiser

Ł.

and Polosukhin

, Attention is all you need, Advances in Neural Information Processing Systems 30 (2017), 5998–6008.

27.

Bello

Zoph

Vaswani

Shlens

and Le

Q.V.

, Attention augmented convolutional networks, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 3286–3295.

28.

Loshchilov

and Hutter

, Decoupled weight decay regularization, arXiv preprint arXiv:1711.05101, 2017. doi: 10.48550/arXiv.1711.05101.

29.

Roy

A.M.

Bose

and Bhaduri

, A fast accurate fine-grain object detection model based on YOLOv4 deep neural network, Neural Computing and Applications 34 (2022), 3895–3921. doi: 10.1007/s00521-021-06651-x.

30.

Qiu

Liu

and Sun

, Borderdet: Border feature for dense object detection, in: European Conference on Computer Vision, Springer, 2020, pp. 549–564.

31.

Song

Liu

and Wang

, Revisiting the sibling head in object detector, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 11563–11572.

32.

Yang

Zhang

Dai

Xiao

Yuan

and Gao

, Focal attention for long-range interactions in vision transformers, Advances in Neural Information Processing Systems 34 (2021), 30008–30022.

33.

Zhang

Liu

Chen

and Ding

, Application of deep learning algorithms in geotechnical engineering: a short critical review, Artificial Intelligence Review 54 (2021), 5633–5673. doi: 10.1007/s10462-021-09967-1.

34.

Zheng

Chen

Pang

Yang

Chen

and Xue

, A mango picking vision algorithm on instance segmentation and key point detection from RGB images in an open orchard, Biosystems Engineering 206 (2021), 32–54. doi: 10.1016/j.biosystemseng.2021.03.012.

35.

Bochkovskiy

Wang

C.-Y.

and Liao

H.-Y.M.

, Yolov4: Optimal speed and accuracy of object detection, arXiv preprint arXiv:2004.10934, 2020. doi: 10.48550/arXiv.2004.10934.

36.

Mai

Zhang

Jia

and Meng

M.Q.-H.

, Faster R-CNN with classifier fusion for automatic detection of small fruits, IEEE Transactions on Automation Science and Engineering 17 (2020), 1555–1569. doi: 10.1109/TASE.2020.2964289.

37.

Wan

and Goudos

, Faster R-CNN for multi-class fruit detection using a robotic vision system, Computer Networks 168 (2020), 107036. doi: 10.1016/j.comnet.2019.107036.

38.

Gong

Wang

Chandra

and Liu

, Keepaugment: A simple information-preserving data augmentation approach, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 1055–1064.

39.

Yang

Wang

Zhang

Wei

Lin

and Yuille

, Lite vision transformer with enhanced self-attention, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 11998–12008.

40.

Dai

Chen

Yang

Zhang

Yuan

and Zhang

, Dynamic detr: End-to-end object detection with dynamic attention, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 2988–2997.

Detection of multi-size peach in orchard using RGB-D camera combined with an improved DEtection Transformer model

Abstract

Keywords

Get full access to this article

References