Enhancing Pavement Crack Detection Using a Hybrid Convolutional Neural Network-Transformer Architecture

Abstract

Pavement cracks are the most common type of distress in transportation infrastructure. Convolutional neural network (CNN)-based networks perform robustly, but their ability to capture fine features, which is important for comprehensive crack detection, is significantly limited. Accurately capturing long-range contextual information is also crucial for the automatic assessment of road conditions. To address these limitations, this study introduces an architecture that combines CNN and Transformer modules in parallel branches, improving semantic segmentation through better feature extraction and stronger capture of long-range dependencies. The CNN branch captures pixel-level contextual representations and incorporates an additional head that generates boundary heatmaps, which facilitates enhanced regional interaction. Concurrently, the Transformer branch employs multi-head self-attention mechanisms and multilayer perceptron (MLP) modules to capture long-range contextual representations. A contextual attention module integrates boundary features with the normalized feature set, accentuating boundary regions and directing the model to accurately delineate overlapping objects. Comprehensive experiments demonstrate that the proposed network outperforms state-of-the-art methods on public data sets, achieving an F1 score of 76.36%. The proposed model significantly reduces false detections in scenarios involving long and thin cracks while preserving its overall crack detection capability.
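
For readers unfamiliar with the reported metric, the following is a minimal sketch of how a pixel-level F1 score can be computed for a binary crack mask. It uses the standard definition (harmonic mean of precision and recall). The abstract does not state the evaluation protocol, so details such as a boundary tolerance margin or per-image versus data-set-level averaging are not covered here. The function name `pixel_f1` and the 4x4 masks are hypothetical and chosen only for illustration.

```python
from typing import Dict, Sequence


def pixel_f1(pred: Sequence[Sequence[int]],
             truth: Sequence[Sequence[int]]) -> Dict[str, float]:
    """Precision, recall, and F1 for binary masks (1 = crack, 0 = background).

    Assumption: exact pixel matching with no tolerance margin; the paper's
    own protocol may differ.
    """
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p == 1 and t == 1:
                tp += 1  # crack pixel correctly detected
            elif p == 1 and t == 0:
                fp += 1  # false detection
            elif p == 0 and t == 1:
                fn += 1  # missed crack pixel
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical ground truth: a thin vertical crack in column 1.
truth = [
    [0, 1, 0, 0],
    [0, 1, 0, 0],
    [0, 1, 0, 0],
    [0, 1, 0, 0],
]
# Hypothetical prediction: one false detection and one missed pixel.
pred = [
    [0, 1, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
]

scores = pixel_f1(pred, truth)
print(scores)
```

With thin cracks, crack pixels are a small fraction of the image, so a handful of false detections lowers precision sharply. This is why F1 is more informative than plain pixel accuracy for this task.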

Keywords

data and data science, infrastructure, pavements, pavement condition evaluation, detection, computer vision

