Abstract
Remote sensing change detection (CD) has advanced significantly with the adoption of Convolutional Neural Networks (CNNs) and Transformers. CNNs offer robust feature extraction but are limited by their receptive field size, while Transformers incur quadratic computational complexity on long sequences, which limits scalability. The Mamba architecture offers a compelling alternative with linear complexity and high parallelism; however, its intrinsically 1D processing structure discards spatial information in 2D vision tasks. This paper proposes an efficient framework built on a Vision Mamba variant that strengthens the capture of 2D spatial information while preserving Mamba's hallmark linear complexity. The framework employs a 2DMamba encoder to learn global spatial contextual information from multi-temporal images. For feature fusion, we introduce a 2D-scan-based, channel-parallel scanning strategy coupled with a spatio-temporal feature fusion method; this captures both local and global change information and addresses spatial discontinuity during fusion. In the decoding phase, we present a feature change flow-based decoding method that improves the mapping of change information from low-resolution to high-resolution feature maps, mitigating feature shift and misalignment. Extensive experiments on the LEVIR-CD+ and WHU-CD benchmarks demonstrate competitive performance against state-of-the-art methods, highlighting the significant potential of Vision Mamba for efficient and accurate remote sensing change detection.
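The linear-complexity claim rests on the state-space recurrence underlying Mamba: a length-L sequence is processed by a single forward scan in O(L) time, in contrast to the O(L²) pairwise interactions of self-attention. A minimal, illustrative 1D scan is sketched below in plain NumPy; the function name and the per-channel parameters A, B, C are our own simplification for exposition, not the paper's implementation (which additionally makes the parameters input-dependent and operates over 2D spatial scans).

```python
import numpy as np

def selective_scan_1d(x, A, B, C):
    """Illustrative 1D state-space scan (simplified; not the paper's code).

    x: (L, D) input sequence of length L with D channels.
    A, B, C: (D,) per-channel transition, input, and readout parameters.
    Runs in O(L) time via a single linear recurrence.
    """
    L, D = x.shape
    h = np.zeros(D)           # hidden state, one scalar per channel
    y = np.empty((L, D))
    for t in range(L):
        h = A * h + B * x[t]  # linear recurrence: carry state forward
        y[t] = C * h          # per-step readout
    return y
```

With A = 1, B = 1, C = 1 the recurrence reduces to a running cumulative sum, which makes the O(L) single-pass nature of the scan easy to see.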