Sage Journals: Discover world-class research

Abstract

Deep learning-based image semantic segmentation approaches heavily rely on large-scale training datasets with dense annotations and often suffer from scarce semantic labels for unseen categories. This limitation has spurred a research trend in Few-shot image Semantic Segmentation (FSS), which makes it possible to segment objects of new categories using only a few labeled samples. Although more and more FSS methods are emerging and gradually integrated into practical applications, a deep understanding of its achievements and issues is still missing. In this survey, we focus on the recent developments of FSS, specifically on FSS methods based on meta-learning. According to different network architectures, we summarize the related research into three classes, that are Convolutional Neural Network-based (CNN-based) models, Graph Neural Network-based (GNN-based) models, and Transformer-based models. Then, we explore the specific implementations of these models, including parameter-based methods, metric-based methods, attention-based methods, and optimization-based methods. Furthermore, we illustrate datasets and analyze the experimental results of various kinds of methods. Toward the end of the paper, we discuss the limitations of FSS and present its applications and challenges to provide further research directions.

Keywords

Deep learning few-shot learning image semantic segmentation meta-learning

Get full access to this article

View all access options for this article.

References

Shelhamer

, Long

and Darrell

, Fully Convolutional Networks for semantic segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence 39(4) (2017), 640–651.

, Wu

, Yang

, et al., Review the state-of-the-art technologies of semantic segmentation based on deep learning, Neurocomputing 493 (2022), 626–646.

, Yang

, Tan

, et al., Methods and datasets on semantic segmentation: A review, Neurocomputing 304 (2018), 82–103.

Zhang

, Zhou

, Zhao

, et al., A survey of semi- and weakly supervised semantic segmentation of images, Artificial Intelligence Review 53(6) (2020), 4259–4288.

Papandreou

, Chen

L.C.

, Murphy

K.P.

, et al., Weakly- and Semi-Supervised Learning of a Deep Convolutional Network for Semantic Image Segmentation, in IEEE International Conference on Computer Vision, 2015, 1742–1750.

Zhang

Q.C.

, Yang

L.T.

, Chen

Z.K.

, et al., A survey on deep learning for big data, Information Fusion 42 (2018), 146–157.

Lake

B.M.

, Ullman

T.D.

, Tenenbaum

J.B.

, et al., Building machines that learn and think like people, Behavioral and Brain Sciences 40(1) (2017), 1–101.

T.A.M.I., Computing Machinery and Intelligence, Mind 59(236) (1950), 433–460.

Fefei

, Fergus

and Pietro Perona, One-shot learning of object categories, IEEE Transactions on Pattern Analysis and Machine Intelligence 28(4) (2006), 594–611.

10.

Koch

, Siamese neural networks for one-shot image recognition, University of Toronto, PhD thesis, 2015.

11.

Vinyals

, Blundell

, Lillicrap

, et al., Matching Networks for One Shot Learning, in Neural Information Processing Systems (NIPS), 2016, 3630–3638.

12.

Wang

, Yao

, Kwok

J.T.

, et al., Generalizing from a few examples: A survey on few-shot learning, Acm Computing Surveys 53(3) (2020), 1–34.

13.

Caelles

, Maninis

K.K.

, Pont-Tuset

, et al., One-Shot Video Object Segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2017, 5320–5329.

14.

Hochreiter

, Younger

A.S.

and Conwell

P.R.

, Learning to Learn Using Gradient Descent, in Artificial Neural Networks — ICANN 2001, 2001, 87–94.

15.

Chen

, Yang

, Huang

, et al., A survey on few-shot image semantic segmentation, Frontiers of Data & Computing 3(6) (2021), 17–34.

16.

Wei

, Li

and Liu

, A review of image semantic segmentation under few-shot dilemma, Computer Engineering and Applications 59(02) (2023), 1–11.

17.

Ren

, Tang

, Sun

, et al., Visual semantic segmentation based on few/zero-shot learning: An overview, IEEE/CAA Journal of Automatica Sinica, (2023), 1–21.

18.

Shaban

, Bansal

and Z

, One-Shot Learning for Semantic Segmentation, in British Machine Vision Conference, 2017, 1–14.

19.

Zhang

, Lin

G.S.

, Liu

F.Y.

, et al., Pyramid Graph Networks with Connection Attentions for Region-Based One-Shot Semantic Segmentation, in IEEE/CVF International Conference on Computer Vision (ICCV), 2019, 9586–9594.

20.

, He

, Zhu

, et al., Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer, in IEEE/CVF International Conference on Computer Vision (ICCV), 2021, 8721–8730.

21.

Zhuge

and Shen

, Deep reasoning network for few-shot semantic segmentation, in Proceedings of the 29th ACM International Conference on Multimedia, 2021, 5344–5352.

22.

Liu

, Cao

, Liu

, et al., Dynamic extension nets for few-shot semantic segmentation, in Proceedings of the 28th ACM international conference on multimedia, 2020, 1441–1449.

23.

Zhang

, Wei

, Yang

, et al., SG-One: Similarity guidance network for one-shot semantic segmentation, IEEE Transactions on Cybernetics 50(9) (2020), 3855–3865.

24.

Dong

and Xing

E.P.

, Few-shot semantic segmentation with prototype learning, in 29th British Machine Vision Conference, 2018, 1–13.

25.

Wang

, Liew

J.H.

, Zou

, et al., PANet: Few-shot image semantic segmentation with prototype alignment, in Proceedings of the IEEE International Conference on Computer Vision, 2019, 9196–9205.

26.

Siam

, Oreshkin

B.N.

, Jagersand

, et al., AMP: Adaptive Masked Proxies for Few-Shot Segmentation, in IEEE/CVF International Conference on Computer Vision (ICCV), 2019, 5248–5257.

27.

Khoi

and Todorovic

, Feature Weighting and Boosting for Few-Shot Segmentation, in IEEE/CVF International Conference on Computer Vision (ICCV), 2019, 622–631.

28.

Liu

, Zhang

, et al., Part-Aware Prototype Network for Few-Shot Semantic Segmentation, in European Conference on Computer Vision, 2020, 142–158.

29.

Achanta

, Shaji

, Smith

, et al., SLIC superpixels compared to State-of-the-Art superpixel methods, IEEE Transactions on Pattern Analysis and Machine Intelligence 34(11) (2012), 2274–2281.

30.

Yang

, Liu

, Li

, et al., Prototype Mixture Models for Few-Shot Semantic Segmentation, in European Conference on Computer Vision (ECCV), 2020, 763–778.

31.

Yang

, Zhuo

, Qi

, et al., Mining Latent Classes for Few-shot Segmentation, in 18th IEEE/CVF International Conference on Computer Vision (ICCV), 2021, 8701–8710.

32.

Fan

, Pei

, Tai

Y.-W.

, et al., Self-support Few-Shot Semantic Segmentation, in 17th European Conference on Computer Vision, 2022, 701–719.

33.

Sung

, Yang

, Zhang

, et al., Learning to compare: Relation network for few-shot learning, in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, 1199–1208.

34.

, Wei

, Chen

Y.P.

, et al., FSS-: A -class dataset for few-shot segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, 2866–2875.

35.

Zhang

, Lin

, Liu

, et al., CANet: Class-Agnostic Segmentation Networks With Iterative Refinement and Attentive Few-Shot Learning, in IEEE/CVF Conference on Computer Vision and Pattern Recognition 2019, 5212–5221.

36.

Tian

, Zhao

, Shu

, et al., Prior guided feature enrichment network for few-shot segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence 44(2) (2022), 1050–1065.

37.

Zhang

B.F.

, Xiao

J.M.

, Qin

, et al., Self-Guided and Cross-Guided Learning for Few-Shot Segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, 8308–8317.

38.

, Jampani

, Sevilla-Lara

, et al., Adaptive Prototype Learning and Allocation for Few-Shot Segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, 8330–8339.

39.

Liu

, Bao

, Xie

G.-S.

, et al., Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, 11543–11552.

40.

Xie

G.-S.

, Xiong

, Liu

, et al., Few-shot semantic segmentation with cyclic memory network, in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, 7293–7302.

41.

Cheng

, Lang

and Han

, Holistic prototype activation for few-shot segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, (2022), 1–17.

42.

, Shi

, Lin

, et al., Learning meta-class memory for few-shot semantic segmentation, in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, 517–526.

43.

Lang

, Cheng

, Tu

, et al., Learning What Not to Segment: A New Perspective on Few-Shot Segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, 8047–8057.

44.

Lang

, Cheng

, Tu

, et al., Few-shot segmentation via divide-and-conquer proxies, International Journal of Computer Vision, (2023), 1–23.

45.

Niu

, Zhong

and Yu

H.J.N.

, A review on the attention mechanism of deep learning, Neurocomputing 452 (2021), 48–62.

46.

Guo

M.-H.

, Xu

T.-X.

, Liu

J.-J.

, et al., Attention mechanisms in computer vision: A survey, Computational Visual Media 8(3) (2022), 331–368.

47.

Wang

, Jiang

, Qian

, et al., Residual attention network for image classification, in 30th IEEE Conference on Computer Vision and Pattern Recognition, 2017, 6450–6458.

48.

Woo

, Park

, Lee

J.-Y.

, et al., CBAM: Convolutional Block Attention Module, in European Conference on Computer Vision (ECCV), 2018, 3–19.

49.

, Shen

and Sun

, Squeeze-and-Excitation Networks, in IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018, 7132–7141.

50.

Vaswani

, Shazeer

, Parmar

, et al., Attention is all you need, in Advances in Neural Information Processing Systems, 2017, 5999–6009.

51.

Hou

R.B.

, Chang

, Ma

B.P.

, et al., Cross Attention Network for Few-shot Classification, in Advances in Neural Information Processing Systems (NeurIPS), 2019.

52.

, Yang

, Zhang

, et al., Attention-Based Multi-Context Guiding for Few-Shot Semantic Segmentation, in AAAI Conference on Artificial Intelligence 2019, 8441–8448.

53.

, Liu

, Zhu

, et al., Arnet: attention-based refinement network for few-shot semantic segmentation, in IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2020, 2238–2242.

54.

Liu

, Zhang

, Lin

, et al., CRNet: Cross-Reference Networks for Few-Shot Segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, 4164–4172.

55.

Liu

, Zhang

, Lin

, et al., CRCNet: Few-shot segmentation with cross-reference and regionglobal conditional networks, International Journal of Computer Vision 130(12) (2022), 3140–3157.

56.

Yang

, Meng

, Li

, et al., A New Local Transformation Module for Few-Shot Segmentation, in International Conference on MultiMedia Modeling, 2020, 76–87.

57.

Liu

, Guo

, Zhu

, et al., Mining semantic information from intra-image and cross-image for few-shot segmentation, Multimedia Tools and Applications 81(13) (2022), 18305–18326.

58.

Gairola

, Hemani

, Chopra

, et al., SimPropNet: Improved Similarity Propagation for Few-shot Image Segmentation, in 29th International Joint Conference on Artificial Intelligence, 2021, 573–579.

59.

Liu

, Peng

, Chen

, et al., FECANet: Boosting few-shot semantic segmentation with feature-enhanced context-aware network, IEEE Transactions on Multimedia (2023), 1–13.

60.

Tian

, Wu

, Qi

, et al., Differentiable meta-learning model for few-shot semantic segmentation, in AAAI Conference on Artificial Intelligence, 2020, 12087–12094.

61.

Cao

, Zhang

, Diao

, et al., Meta-Seg: A generalized meta-learning framework for multi-class few-shot semantic segmentation, IEEE Access 7 (2019), 166109–166121.

62.

Zhu

, Zhai

and Cao

, Self-supervised tuning for few-shot segmentation, in International Joint Conference on Artificial Intelligence (IJCAI), 2020, 1019–1025.

63.

Veličković

, Cucurull

and C. A, Graph Attention Networks, in, International Conference on Learning Representations (ICLR), 2018, 1–12.

64.

Wang

, Zhang

, Hu

, et al., Few-Shot Semantic Segmentation with Democratic Attention Networks, in European Conference on Computer Vision (ECCV) 2020, 730–746.

65.

Xie

G.S.

, Liu

, Xiong

, et al., Scale-Aware Graph Neural Network for Few-Shot Semantic Segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, 5471–5480.

66.

Gao

, Xiao

, Yin

, et al., A Mutually Supervised Graph Attention Network for Few-Shot Segmentation: The Perspective of Fully Utilizing Limited Samples, IEEE Transactions on Neural Networks and Learning Systems, (2022), 1–13.

67.

Han

, Wang

, Chen

, et al., A survey on vision transformer, IEEE Transactions on Pattern Analysis Machine Intelligence 45(1) (2022), 87–110.

68.

Zhang

, Kang

, Yang

, et al., Few-shot segmentation via cycle-consistent transformer, Advances in Neural Information Processing Systems 34 (2021), 21984–21996.

69.

Zhang

, Wu

, et al., Catrans: context and affinity transformer for few-shot segmentation, arXiv preprint arXiv:.12817, (2022).

70.

Peng

, Tian

, Wu

, et al., Hierarchical Dense Correlation Distillation for Few-Shot Segmentation, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, 23641–23651.

71.

Wang

, Sun

and Zhang

, Rethinking the Correlation in Few-Shot Segmentation: A Buoys View, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, 7183–7192.

72.

Shi

, Wei

, Zhang

, et al., Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation, in European Conference on Computer Vision (ECCV), 2022, 151–168.

73.

, Sun

and Yang

, Suppressing the heterogeneity: A strong feature extractor for few-shot segmentation, in The Eleventh International Conference on Learning Representations, 2023.

74.

Everingham

, Van Gool

, Williams

C.K.I.

, et al., The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision 88(2) (2010), 303–338.

75.

Hariharan

, Arbelaez

, Bourdev

, et al., Semantic Contours from Inverse Detectors, in IEEE International Conference on Computer Vision (ICCV), 2011, 991–998.

76.

Lin

T.Y.

, Maire

, Belongie

, et al., Microsoft COCO: Common Objects in Context, in European Conference on Computer Vision (ECCV), 2014, 740–755.

77.

Simonyan

and Zisserman

, Very deep convolutional networks for large-scale image recognition, arXiv preprint arXiv:1409.1556, (2014).

78.

Krizhevsky

, Sutskever

and Hinton

G.E.

, ImageNet classification with deep convolutional neural networks, Communications of the ACM 60(6) (2017), 84–90.

79.

Liu

, Lin

, Cao

, et al., Swin Transformer: Hierarchical Vision Transformer using Shifted Windows, in IEEE/CVF International Conference on Computer Vision (ICCV), 2021, 9992–10002.

80.

Touvron

, Cord

, Douze

, et al., Training data-efficient image transformers & distillation through attention, in International conference on machine learning, 2021, 10347–10357.

81.

Roy

A.G.

, Siddiqui

, Polsterl

, et al., ‘Squeeze & excite’ guided few-shot segmentation of volumetric images, Medical Image Analysis 59 (2020), 1–12.

82.

Q.J.

, Dang

, Tajbakhsh

, et al., A location-sensitive local prototype network for few-shot medical image segmentation, in 18th IEEE International Symposium on Biomedical Imaging (ISBI), 2021, 262–266.

83.

Huang

, Xu

, Shen

, et al., Rethinking Few-Shot Medical Segmentation: A Vector Quantization View, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, 3072–3081.

84.

Guha Roy

, Siddiqui

, Polsterl

, et al., Squeeze & excite guided few-shot segmentation of volumetric images, Medical Image Analysis 59 (2020), 101587.

85.

Khadka

, Jha

, Hicks

, et al., Meta-learning with implicit gradients in a few-shot setting for medical image segmentation, Computers in Biology and Medicine 143 (2022), 105227.

86.

Achmamad

, Ghazouani

and Ruan

, Few-shot learning for brain tumor segmentation from MRI images, in IEEE International Conference on Signal Processing (2022), 489–494.

87.

Wang

, Cao

, Wei

, et al., Alternative Baselines for Low-Shot 3D Medical Image Segmentation-An Atlas Perspective, in AAAI Conference on Artificial Intelligence (2021), 634–642.

88.

Feng

, Zheng

, Gao

, et al., Interactive few-shot learning: Limited supervision, better medical image segmentation, IEEE Transactions on Medical Imaging 40(10) (2021), 2575–2588.

89.

Zhao

, Balakrishnan

, Durand

, et al., Data augmentation using learned transformations for one-shot medical image segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, 8535–8545.

90.

Ding

, Sun

, Tang

, et al., Few-shot medical image segmentation with cycle-resemblance attention, in Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023, 2488–2497.

91.

Wang

X.Y.

, Yuan

Y.W.

, Guo

D.Y.

, et al., SSA-Net: Spatial self-attention network for COVID-19 pneumonia infection segmentation with semi-supervised few-shot learning, Medical Image Analysis 79 (2022), 102459.

92.

Chen

, Yao

, Zhou

, et al., Momentum contrastive learning for few-shot COVID-19 diagnosis from chest CT images, Pattern Recognition 113 (2021), 107826.

93.

Jadon

, COVID-19 detection from scarce chest x-ray image data using few-shot deep learning approach, in Medical Imaging 2021: Imaging Informatics for Healthcare, Research and Applications, 2021, The Society of Photo-Optical Instrumentation Engineers (SPIE).

94.

Abdel-Basset

, Chang

, Hawash

, et al., FSS–nCov: A deep learning architecture for semi-supervised few-shot segmentation of COVID-19 infection, Knowledge-Based Systems 212 (2021), 106647.

95.

Zhao

, Chua

T.-S.

and Lee

G.H.

, Few-shot 3D Point Cloud Semantic Segmentation, in, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, 8869–8878.

96.

Mao

, Guo

, Lu

, et al., Bidirectional Feature Globalization for Few-shot Semantic Segmentation of 3D Point Cloud Scenes, in International Conference on 3D Vision, 2022, 505–514.

97.

Sharma

, Dash

, Roy Chowdhury

, et al., PriFit: Learning to fit primitives improves few shot point cloud segmentation, Computer Graphics Forum 41(5) (2022), 39–50.

98.

Zhu

, Cao

, Zhai

, et al., One-shot texture retrieval using global grouping metric, IEEE Transactions on Multimedia 23 (2021), 3726–3737.

99.

Bhunia

A.K.

, Bhunia

A.K.

, Ghose

, et al., A deep one-shot network for query-based logo retrieval, Pattern Recognition 96 (2019), 106965.

100.

Bao

Y.Q.

, Song

K.C.

, Liu

, et al., Triplet-graph reasoning network for few-shot metal generic surface defect segmentation, IEEE Transactions on Instrumentation and Measurement 70 (2021), 3083561.

101.

Y.J.

, Zhang

P.F.

, Xu

, et al., Few-shot prototype alignment regularization network for document image layout segementation, Pattern Recognition 115 (2021), 107882.

102.

Tian

, Lai

, Jiang

, et al., Generalized Few-shot Semantic Segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, 11553–11562.

103.

Liu

S.-A.

, Zhang

, Qiu

, et al., Learning Orthogonal Prototypes for Generalized Few-Shot Semantic Segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, 11319–11328.

104.

Hajimiri

, Boudiaf

, Ben Ayed

, et al., A Strong Baseline for Generalized Few-Shot Semantic Segmentation, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, 11269–11278.

105.

Azad

, Fayjie

A.R.

, Kauffmann

, et al., On the Texture Bias for Few-Shot CNN Segmentation, in IEEE Winter Conference on Applications of Computer Vision (WACV), 2021, 2673–2682.

106.

Raza

, Ravanbakhsh

, Klein

, et al., Weakly Supervised One Shot Segmentation, in IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), 2019, 1401–1406.

107.

Han

and Oh

T.H.

, Learning Few-shot Segmentation from Bounding Box Annotations, in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023, 3739–3748.

108.

Kang

, Koniusz

, Cho

, et al., Distilling Self-Supervised Vision Transformers for Weakly-Supervised Few-Shot Classification & Segmentation, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, 19627–19638.

109.

Pambala

A.K.

, Dutta

and Biswas

, SML: Semantic meta-learning for few-shot semantic segmentation * *, Pattern Recognition Letters 147 (2021), 93–99.

110.

Yang

, Chen

, Feng

, et al., MIANet: Aggregating Unbiased Instance and General Information for Few-Shot Semantic Segmentation, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, 7131–7140.

111.

Amac

M.S.

, Sencan

, Baran

O.B.

, et al., MaskSplit: Self-supervised Meta-learning for Few-shot Semantic Segmentation, in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, 428–438.

112.

Kalluri

and Chandraker

, Cluster-to-adapt: Few Shot Domain Adaptation for Semantic Segmentation across Disjoint Labels, in IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2022, 4120–4130.

113.

Wang

, Duan

, Wang

, et al., Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer, in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, 7055–7064.

114.

Ahmed

, Lin

J.C.-W.

and Srivastava

, Ensemble-based deep meta learning for medical image segmentation, Journal of Intelligent Fuzzy Systems 42(5) (2022), 4307–4313.

115.

Lei

, Zhang

X.C.

, He

J.F.

, et al., Cross-Domain Few-Shot Semantic Segmentation, in European Conference on Computer Vision (ECCV), 2022, 73–90.

Few-shot image semantic segmentation based on meta-learning: A review

Abstract

Keywords

Get full access to this article

References