Abstract

Steel surface defect detection is a critical component of quality control in intelligent manufacturing, as its effectiveness directly influences product qualification rates and production costs. Conventional spatial attention mechanisms focus solely on spatial features and do not address the limitations imposed by the parameter-sharing nature of convolutional kernels. Additionally, traditional feature pyramid networks lack effective multi-scale contextual modeling, while existing attention mechanisms are often restricted to a single domain, making it difficult to achieve robust object representation and background suppression under complex conditions. To address these limitations, a precise defect detection model, LDSE-YOLO, is proposed in this study. First, a Local Dynamic Convolution module (LDConv) is introduced. Unlike static convolutions with fixed patterns, LDConv employs a dynamic weight allocation mechanism to enhance the representation of fine-grained defects. Next, a Spatial-Context Attention Module (SCAM) is proposed, which integrates dilated convolution and adaptive spatial attention to construct a feature pyramid with improved multi-scale perception. This design combines large-receptive-field feature extraction with dual spatial-channel attention to decouple defect features from background noise in texture-rich environments. Furthermore, an Enhanced Occlusion Attention Module (EOAM) is incorporated to strengthen the representation of occluded areas, suppress background interference, and reinforce spatial-channel attention, thereby improving the detection of small and partially occluded defects. Experimental results demonstrate that the proposed LDSE-YOLO model achieves superior overall detection performance on the NEU-DET and GC10-DET benchmark datasets, with mAP@0.5 improvements of 4.3% and 2.1%, respectively, compared to mainstream baseline models.
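
The abstract does not describe the internals of LDConv, so the following is only a minimal sketch of the general idea behind dynamic convolution: instead of applying one fixed kernel, several candidate kernels are mixed with input-dependent weights before filtering. It is written in one dimension with the Python standard library, and every name in it (`dynamic_conv1d`, `kernel_attention`, the kernels, and the gate parameters) is a hypothetical choice for illustration, not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def kernel_attention(signal, gate):
    """Input-dependent mixing weights.

    The signal is reduced to its mean (global average pooling), passed
    through one (weight, bias) pair per candidate kernel, and normalized
    with softmax so the weights sum to 1.
    """
    pooled = sum(signal) / len(signal)
    return softmax([w * pooled + b for w, b in gate])


def dynamic_conv1d(signal, kernels, gate):
    """Mix the candidate kernels for this input, then filter.

    Returns the 'valid' sliding-window output and the attention weights.
    All kernels are assumed to have the same length.
    """
    attn = kernel_attention(signal, gate)
    size = len(kernels[0])
    # Aggregate the candidate kernels into a single input-specific kernel.
    mixed = [sum(a * k[i] for a, k in zip(attn, kernels)) for i in range(size)]
    out = [
        sum(mixed[j] * signal[i + j] for j in range(size))
        for i in range(len(signal) - size + 1)
    ]
    return out, attn


if __name__ == "__main__":
    # Hypothetical candidates: a smoothing kernel and an edge kernel.
    kernels = [[1 / 3, 1 / 3, 1 / 3], [-1.0, 0.0, 1.0]]
    # Hypothetical gate: a positive mean favors the first kernel,
    # a negative mean favors the second.
    gate = [(1.0, 0.0), (-1.0, 0.0)]
    for signal in ([2.0, 2.0, 3.0, 2.0, 2.0], [-2.0, -2.0, -3.0, -2.0, -2.0]):
        out, attn = dynamic_conv1d(signal, kernels, gate)
        print(attn, out)
```

A static convolution corresponds to the special case in which the attention weights do not depend on the input. In this sketch the two example signals yield different mixing weights, so the same layer filters them with different effective kernels.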

Keywords

SCAM, LDConv, EOAM, steel surface defect detection, ultra-small object detection

Dynamic convolution and attention-based steel surface defect detection via LDSE-YOLO
