Abstract
Navigating complex real-world environments requires understanding the semantic context and making effective decisions. Existing solutions leave room for improvement: traditional reactive approaches that do not maintain a map often struggle in complex environments, map-dependent methods demand significant mapping effort, and learning-based methods rely on large training datasets and generalize poorly. To address these challenges, we propose a novel visual semantic navigation framework that combines data-driven semantic understanding, Pareto-optimal decision-making, and image-space planning. Our approach uses a local environmental representation called the navigability image, which allows the robot to assess immediate traversability without relying on a priori mapping or navigation data. Building on this, we introduce Pareto-Optimal Visual Navigation (POVNav), a decision-making framework in the image space that identifies appropriate sub-goals, constructs collision-free paths, and generates control commands using visual servoing. The framework also supports selective navigation behaviors, such as avoiding traversable yet slippery grassland to prevent getting stuck, by dynamically adjusting the navigability criteria within the local representation. POVNav is lightweight, operating solely with a monocular camera and without requiring map storage or training-data collection, making it highly versatile across robotic platforms and environments. Extensive year-round real-world experiments validated its efficacy in both structured indoor environments and unstructured outdoor settings, including dense forest trails and snow-covered roads. Field experiments using various image segmentation techniques demonstrated its robustness and adaptability across a wide range of conditions. Additionally, we demonstrate that POVNav successfully guides a robot through narrow pipes in a culvert inspection task.
Overall, we showcase the utility of POVNav in real-world scenarios, highlighting its flexibility and computational efficiency for autonomous robots in complex environments.
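To make the navigability-image idea concrete, the following is a minimal illustrative sketch, not the authors' implementation: it assumes a per-pixel semantic segmentation is available and that a set of class IDs (hypothetical names `ROAD`, `GRASS`, `OBSTACLE`) is designated traversable. Selective behavior, such as treating slippery grass as non-navigable, is expressed simply by editing the traversable set; the sub-goal picker is a simplified stand-in for POVNav's Pareto-optimal selection.

```python
import numpy as np

# Hypothetical class IDs; the real labels depend on the segmentation model.
ROAD, GRASS, OBSTACLE = 0, 1, 2

def navigability_image(seg, traversable=frozenset({ROAD})):
    """Binary mask: 1 where the pixel's semantic class is traversable."""
    return np.isin(seg, list(traversable)).astype(np.uint8)

def subgoal_column(nav):
    """Pick the image column with the deepest free corridor measured from
    the bottom row upward -- a toy proxy for sub-goal selection."""
    h, _ = nav.shape
    flipped = np.flipud(nav)
    # Row index of the first non-navigable pixel in each column (0 if none).
    blocked = np.argmax(flipped == 0, axis=0)
    depth = np.where((flipped == 1).all(axis=0), h, blocked)
    return int(np.argmax(depth))
```

Passing `traversable=frozenset({ROAD, GRASS})` restores grass as navigable, illustrating how the same scene yields different behavior under different navigability criteria without any remapping.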
