Navigation in Contour-Drawn Scenes Using Augmented Reality

Abstract

The visual system can recover 3D information from many different types of visual information, e.g., contour-drawings. How well can people navigate in a real dynamic environment with contour-drawings? This question was addressed by developing an AR-device that shows a contour-drawing of a real scene in an immersive manner and by conducting an observational field study in which the two authors navigated in real environments while wearing this device. Navigation with contour-drawings was difficult in natural scenes but easy in urban scenes. This suggests that the visual information available in natural and urban environments is quite different and that our visual system can accommodate this difference.

Keywords

navigation/wayfinding, augmented reality, contour-drawing, line-drawing, 3D perception, head-mounted display (HMD), scene perception

Almost all studies in vision science are conducted in well-controlled laboratory environments that allow us to test specific factors while other, artifactual factors are eliminated or minimized. But when this is done, it may become difficult to judge whether, or how much, the factor tested is critical in a real environment. A factor may affect human performance of some task in a laboratory experiment without being useful for doing that task in a real environment.

Some field studies have used head-mounted prism goggles to test the performance of the visual system in a real environment (see Yoshimura, 1996 for a review). Prisms and mirrors in these goggles optically transform the retinal image of a real scene.

Note that image-filters from computer vision can also be used to transform retinal images by using Augmented-Reality (AR) technology (Anstis, 1992; Bao & Engel, 2019; Grush et al., 2015; Juan & Calatrava, 2011; Krösl et al., 2020; Velázquez et al., 2015). We developed an AR-headset that can show, in an immersive manner, either a contour-drawing generated by extracting luminance-edges in an image using a Sobel filter or a grayscale-image of a real scene (Farshchi et al., 2021). We showed that observers can perform some run-of-the-mill tasks almost equally well with both contour-drawings and grayscale-images (see Cole et al., 2009; Elder, 2018; Hertzmann, 2021; Pizlo et al., 2014; Sayim & Cavanagh, 2011). These tasks were performed by hand on a desktop while the observer was sitting still in a chair.
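To make the contour-drawing filter concrete, the following is a minimal sketch of how luminance-edges can be extracted with a Sobel filter. It is an illustration only and not the implementation used in the AR-headset: the function names, the toy image, and the binarization threshold are assumptions introduced here, and the headset's actual processing is described in Farshchi et al. (2021).

```python
import math

# Standard 3x3 Sobel kernels for the horizontal and vertical luminance gradients.
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]


def sobel_magnitude(image):
    """Return the gradient magnitude at each pixel of a grayscale image.

    `image` is a list of rows of luminance values. Border pixels, where the
    3x3 window does not fit, are left at 0.
    """
    height, width = len(image), len(image[0])
    out = [[0.0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = gy = 0.0
            for dy in range(3):
                for dx in range(3):
                    value = image[y + dy - 1][x + dx - 1]
                    gx += SOBEL_X[dy][dx] * value
                    gy += SOBEL_Y[dy][dx] * value
            out[y][x] = math.hypot(gx, gy)
    return out


def contour_drawing(image, threshold=100.0):
    """Binarize the edge magnitude: bright contours (255) on a dark ground (0).

    The threshold is a hypothetical value chosen for this toy example.
    """
    magnitude = sobel_magnitude(image)
    return [[255 if m >= threshold else 0 for m in row] for row in magnitude]


# Toy scene: a dark region (luminance 20) next to a bright region (luminance 200).
toy = [[20] * 3 + [200] * 3 for _ in range(5)]
drawing = contour_drawing(toy)
print(drawing[2])  # [0, 0, 255, 255, 0, 0]
```

Only the boundary between the two regions survives in the output. The absolute luminance of each region and the sign of the step across the boundary are discarded.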

The authors (TS, AM) conducted this observational field study in which they wore the AR-headset and dynamically navigated in three real environments, namely, a forest, a park, and inside a building. The AR-headset showed a contour-drawing, a grayscale-image, or a color-image of a real scene to an observer in an immersive manner (Figure 1, see Farshchi et al., 2021 for details). The observers wore the headset intermittently to make it possible to change the image filters while they navigated.

Figure 1.

(A) A smartphone composing the AR-headset and (B-E) contour-drawings (left), grayscale-images (center), and color-images (right) of real scenes. Images are owned by the author TS. See https://osf.io/3tf9d/ for other images from this study.

Navigating in the real environment was always easier with the color- and grayscale-image filters than with the contour-drawing filter. Navigation with the contour-drawing filter was especially difficult in the forest. It was very difficult to see the bumps, dents, steps, and slopes of the ground in the forest with the contour-drawing filter (Figure 1(B)). The information lost in a contour-drawing, for example, the luminance-gradients and luminance-polarity of shading and cast-shadows, can be important in natural scenes.

Cast-shadows were also disruptive with the contour-drawing filter. The boundaries of these cast-shadows were represented as bright contours in the contour-drawings (Figures 1(B-C); Metzger, 1936/2006; Sayim & Cavanagh, 2011). The cast-shadows were often misperceived as markings on objects, as objects, or as crevasses in the ground.
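One way to see why a cast-shadow can be confused with a surface marking is that an edge-magnitude filter discards luminance-polarity. The sketch below is a simplified 1D illustration under stated assumptions: the luminance profiles are invented for this example, and the absolute central difference stands in for the horizontal component of an edge filter.

```python
def edge_magnitude(profile):
    """Absolute central difference along a 1D luminance profile.

    The two end samples, which lack a neighbor on one side, are set to 0.
    """
    inner = [abs(profile[i + 1] - profile[i - 1]) for i in range(1, len(profile) - 1)]
    return [0] + inner + [0]


# Hypothetical profiles: a dark shadow band on bright ground, and a bright
# stripe painted on dark ground. Their luminance-polarities are opposite.
shadow = [180, 180, 180, 60, 60, 60, 180, 180, 180]
marking = [60, 60, 60, 180, 180, 180, 60, 60, 60]

print(edge_magnitude(shadow))   # [0, 0, 120, 120, 0, 120, 120, 0, 0]
print(edge_magnitude(marking))  # [0, 0, 120, 120, 0, 120, 120, 0, 0]
```

Both profiles produce the same pair of contours, so the contours alone do not indicate which side of a boundary is darker.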

Navigation was easy on the sidewalks and pavement in the park and in the hallways of the building, even with the contour-drawing filter (Figures 1(C-D)). Subjectively, the curbs of the sidewalk, the regular pattern of the pavement, and the edges of the floor, walls, and ceiling of the hallways were helpful during navigation. Stairs, however, were very difficult to navigate with all of the image filters, especially with the contour-drawing filter (Figure 1(E)). With the contour-drawing filter, the edges of the stairs were visible, but it was hard to make out which faces of the stairs were horizontal and which were vertical.

Navigation was easy with the contour-drawings in the urban scenes. The urban scenes were composed of man-made objects and they were regular. These regularities of the scenes introduced configurations of contours that are model-based invariants of the regularities and that can be used to recover 3D information in the scenes by making use of the regularities as a priori constraints (Qian et al., 2018; Sawada et al., 2015; Walther & Shen, 2014).

Our study suggests that the visual information available in natural and urban environments is quite different and that our visual system can accommodate this difference.

Footnotes

Acknowledgement

This article was prepared within the framework of the Academic Fund Program at the National Research University Higher School of Economics (HSE University) in 2019 (grant № 19-04-006, awarded to TS) and by the Russian Academic Excellence Project «5-100».

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Research University Higher School of Economics (grant number 19-04-006).

ORCID iD

Tadamasa Sawada

How to cite this article

Sawada, T., Mendoza Arvizu, A., Farshchi, M., & Alexandra, K. (2022). Navigation in contour-drawn scenes using augmented reality. i-Perception, 13(1), 1–4.

References

Anstis (1992). Visual adaptation to a negative, brightness-reversed world: Some preliminary observations. In Carpenter, G. A., & Grossberg (Eds.), Neural networks for vision and image processing (pp. 1–14). MIT Press.

Bao, & Engel, S. A. (2019). Augmented reality as a tool for studying visual plasticity: 2009 to 2018. Current Directions in Psychological Science, 28(6), 574–580. https://doi.org/10.1177/0963721419862290

Cole, Sanik, DeCarlo, Finkelstein, Funkhouser, Rusinkiewicz, & Singh (2009). How well do line drawings depict shape? ACM Transactions on Graphics, 28(3), 28. https://doi.org/10.1145/1531326.1531334

Elder, J. H. (2018). Shape from contour: Computation and representation. Annual Review of Vision Science, 4, 423–450. https://doi.org/10.1146/annurev-vision-091517-034110

Farshchi, Kiba, & Sawada (2021). Seeing our 3D world while only viewing contour-drawings. PLoS ONE, 16(1), e0242581. https://doi.org/10.1371/journal.pone.0242581

Grush, Jaswal, Knoepfler, & Brovold (2015). Visual adaptation to a remapped spectrum. In Metzinger, & Windt, J. M. (Eds.), Open MIND (pp. 1–16). MIND Group.

Hertzmann (2021). The role of edges in line drawing perception. Perception, 50(3), 266–275. https://doi.org/10.1177/0301006621994407

Juan, M. C., & Calatrava (2011). An augmented reality system for the treatment of phobia to small animals viewed via an optical see-through HMD: Comparison with a similar system viewed via a video see-through HMD. International Journal of Human-Computer Interaction, 27(5), 436–449.

Krösl, Elvezio, Hürbe, Karst, Feiner, & Wimmer (2020, March). XREye: Simulating visual impairments in eye-tracked XR. In 2020 IEEE conference on virtual reality and 3D user interfaces abstracts and workshops (VRW) (pp. 831–832). IEEE.

Metzger (2006). Laws of seeing (L. Spillmann, Trans.). MIT Press. (Original work published 1936).

Pizlo, Sawada, & Steinman, R. M. (2014). Making a machine that sees like us. Oxford University Press.

Qian, Ramalingam, & Elder, J. H. (2018, December). LS3D: Single-view gestalt 3D surface reconstruction from Manhattan line segments. In Asian conference on computer vision (pp. 399–416). Springer.

Sawada, & Pizlo (2015). Shape perception. In Busemeyer, Townsend, Wang, Z. J., & Eidels (Eds.), Oxford handbook of computational and mathematical psychology (pp. 255–276). Oxford University Press.

Sayim, & Cavanagh (2011). What line drawings reveal about the visual brain. Frontiers in Human Neuroscience, 5, 118. https://doi.org/10.3389/fnhum.2011.00118

Velázquez, Varona, Rodrigo, Haro, & Acevedo (2015). Design and evaluation of an eye disease simulator. IEEE Latin America Transactions, 13(8), 2734–2741. https://doi.org/10.1109/TLA.2015.7332157

Walther, D. B., & Shen (2014). Nonaccidental properties underlie human categorization of complex natural scenes. Psychological Science, 25(4), 851–860. https://doi.org/10.1177/0956797613512662

Yoshimura (1996). A historical review of long-term visual-transposition research in Japan. Psychological Research, 59(1), 16–32. https://doi.org/10.1007/BF00419831