Research on automatic detection and segmentation of prostate zones based on YOLO-D

Abstract

Background

Accurate identification and localization of prostate zones in magnetic resonance (MR) images are essential for clinical diagnosis and treatment planning. However, convolutional object detection models like YOLO often struggle to capture the complex geometric features of the prostate.

Objective

To improve the detection and segmentation of prostate zones in MR images by addressing two limitations of conventional YOLO models: limited spatial feature extraction and a static focusing mechanism.

Methods

We propose YOLO-D, an enhanced YOLOv8-based model integrating a Deformable Convolution (DConv) module to better capture fine-grained image details and improve geometric adaptability. Additionally, the Wise-IoU loss function is employed to introduce a dynamic and non-monotonic focusing mechanism, effectively reducing inter-class interference and enhancing localization accuracy.
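
To make the "dynamic and non-monotonic focusing mechanism" concrete, the following is a minimal pure-Python sketch of the Wise-IoU formulation as described in the Wise-IoU preprint (arXiv:2301.10051), not the YOLO-D implementation itself. The function names, the (x1, y1, x2, y2) box layout, and the hyperparameters alpha = 1.9 and delta = 3.0 are assumptions made for illustration; in training, the mean IoU loss would be a running average and the enclosing-box term would be excluded from the gradient.

```python
import math

def iou_and_enclosing(pred, gt):
    """Boxes are (x1, y1, x2, y2). Returns IoU and the enclosing box width/height."""
    ix1, iy1 = max(pred[0], gt[0]), max(pred[1], gt[1])
    ix2, iy2 = min(pred[2], gt[2]), min(pred[3], gt[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_p = (pred[2] - pred[0]) * (pred[3] - pred[1])
    area_g = (gt[2] - gt[0]) * (gt[3] - gt[1])
    union = area_p + area_g - inter
    wg = max(pred[2], gt[2]) - min(pred[0], gt[0])
    hg = max(pred[3], gt[3]) - min(pred[1], gt[1])
    return inter / union, wg, hg

def wiou_v1(pred, gt):
    """Distance-attention term times the plain IoU loss (1 - IoU)."""
    iou, wg, hg = iou_and_enclosing(pred, gt)
    cx_p, cy_p = (pred[0] + pred[2]) / 2, (pred[1] + pred[3]) / 2
    cx_g, cy_g = (gt[0] + gt[2]) / 2, (gt[1] + gt[3]) / 2
    dist2 = (cx_p - cx_g) ** 2 + (cy_p - cy_g) ** 2
    # Centre distance normalised by the enclosing box size
    r_wiou = math.exp(dist2 / (wg ** 2 + hg ** 2))
    return r_wiou * (1.0 - iou)

def focusing_coefficient(l_iou, mean_l_iou, alpha=1.9, delta=3.0):
    """Non-monotonic gain: small for very easy and very poor (outlier) boxes."""
    beta = l_iou / mean_l_iou  # outlier degree relative to the average loss
    return beta / (delta * alpha ** (beta - delta))

def wiou_v3(pred, gt, mean_l_iou, alpha=1.9, delta=3.0):
    """Wise-IoU v1 loss scaled by the dynamic focusing coefficient."""
    iou, _, _ = iou_and_enclosing(pred, gt)
    gain = focusing_coefficient(1.0 - iou, mean_l_iou, alpha, delta)
    return gain * wiou_v1(pred, gt)
```

The gain peaks for boxes of ordinary quality and falls off on both sides, so neither already well-fitted boxes nor low-quality outliers dominate the gradient. The sketch does not guard against degenerate boxes with zero union.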

Results

YOLO-D was evaluated on the publicly available PROSTATEx dataset using precision, recall, average precision (AP), and F1 score as evaluation metrics. For detection, it achieved 93.4% precision, 91.2% recall, 94.7% AP, and an F1 score of 0.922. For segmentation, it achieved 90.7% precision, 88.6% recall, 91.1% AP, and an F1 score of 0.897. On both tasks, YOLO-D consistently outperformed the baseline YOLOv8.
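
As a quick consistency check on the reported figures, the F1 score can be recomputed as the harmonic mean of precision and recall. The short sketch below assumes the standard definition F1 = 2PR / (P + R); the abstract does not state whether F1 was computed from the aggregate precision and recall or averaged per class, so small differences are expected.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)

# Precision and recall as reported in the abstract
detection_f1 = f1_score(0.934, 0.912)
segmentation_f1 = f1_score(0.907, 0.886)

print(f"detection F1:    {detection_f1:.3f}")
print(f"segmentation F1: {segmentation_f1:.3f}")
```

Both recomputed values agree with the reported F1 scores (0.922 and 0.897) to within about 0.001, which is within what rounding of the published precision and recall can explain.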

Conclusions

By incorporating DConv and Wise-IoU, YOLO-D offers a robust and efficient solution for automatic prostate zone analysis, with promising potential in real-time clinical imaging applications.

Keywords

prostate zones, YOLOv8, object detection, instance segmentation, Wise-IoU

