Sage Journals: Discover world-class research

Abstract

We implement Adelson and Bergen's spatiotemporal energy model with extension to three-dimensional (x–y–t) in an interactive tool. It helps gain an easy understanding of early (first-order) visual motion perception. We demonstrate its usefulness in explaining an assortment of phenomena, including some that are typically not associated with the spatiotemporal energy model.

Keywords

spatiotemporal filtering motion energy interactive visual illusions science education eye movement

We present a tool¹ that reveals key characteristics of early visual motion perception through a simple inspection. The setup involves pointing an off-the-shelf camera at visual images or videos such as in Figure 1. This tool implements the spatiotemporal energy model² (Adelson & Bergen, 1985; Watson & Ahumada, 1985)—the standard model of first-order motion perception (Nishida et al., 2018)—with extension to three-dimensional (3D, x–y–t) in real-time, and visualizes perceived motion direction by mapping it on to a color wheel (Figure 2). This makes gaining insights about early visual motion perception an easy and interactive experience.³

Figure 1.

A laptop and a webcam explain motion visual illusions. Here, the illusory rotation of the rings in Pinna–Brelstaff illusion (Pinna & Brelstaff, 2000) is immediately revealed through the colors encoding motion energy (Adelson & Bergen, 1985). Refer to Figure 2 for more details.

Figure 2.

The motion perception tool explains an assortment of visual illusions: stepping feet (Anstis, 2001; Bach, 2004; Kitaoka & Anstis, 2021), Kinegram (Bach, 2014), structure from motion (Bach, 2002; Rogers & Graham, 1979), Pinna–Brelstaff (Bach, 2003; Pinna & Brelstaff, 2000), Translational Moirè Patterns (Bach, 2013; Spillmann, 1993), Spine drift (Bach, 2011; Kitaoka, 2010), grid masking (Bach, 2019), and global motion influenced by arrows (@jagarikin, 2022).

In fact, by simply playing around with the tool, we discovered that spatiotemporal energy models (Adelson & Bergen, 1985) directly explain many more phenomena than previously understood⁴ . We applied our tool to illusions available on YouTube and Twitter, as well as curated lists (Bach, 1997; Shapiro & Todorovic, 2017). For some of them, we also moved the camera to mimic head/eye movements. In Figure 2, we show outputs on an assorted list of phenomena we have found in this process.

This is in contrast to traditional methods of testing a model on visual stimuli. Instead of saving image sequences to disk, processing them offline, and generating visualizations of energy for different motion directions over time (as a post-processing step), our tool allows doing all this live.

Take, for example, the stepping feet illusion (Anstis, 2001). In this illusion (Bach, 2004: shows a demo), the yellow and blue “feet” are set against a grating of black-and-white lines. The “feet” are vertically aligned, and move smoothly and at the same speed. But they produce an illusion of stepping alternatively as if the yellow foot pauses when the blue foot moves and vice versa. When we point a webcam at this illusion and run our motion perception tool, it directly and immediately reflects our perception (row 1 of Figure 2). The “feet” appear to move in distinct steps, at consecutive times⁵ $t$ and $t + 1$ . You may also reduce the contrast of grating (Bach, 2004: un-tick the “Hi contrast” button), and notice that it weakens the strength of the illusion. With this, it is easy to understand the role of contrast in motion perception (Anstis, 2004).

Another example is the Pinna–Brelstaff illusion (Pinna & Brelstaff, 2000). First, let us look at the animated online version of this illusion (Bach, 2003) that simulates the head moving toward/away (looming) with respect to the static pattern. Here, even though the rings are simply expanding or contracting, it elicits additional illusory rotations for each ring. We can verify this same percept with our tool (row 4 of Figure 2).

To understand the visualization, the notion of a “phase” for rings is helpful. On an expanding ring without rotation, every point will move away from its center in the same direction of the radial line⁶ . On the other hand, a rotating ring produces motion along the tangent at every point. Thus, the visual motion “phase” of a rotating ring relates to an expanding ring in the following way: taking the expanding ring as a reference ( $0^{\circ}$ ), a clockwise rotating ring has a phase of $+ 90^{\circ}$ . In other words, a clockwise rotating ring is $+ 90^{\circ}$ rotation of an expanding ring. Likewise, a counter-clockwise rotating ring has a phase of $- 90^{\circ}$ . Any intermediate phase indicates simultaneous rotation and expansion as the resultant phase is a vector addition of movement due to expansion and rotation.

For the animated version of the Pinna–Brelstaff illusion (Bach, 2003), the output of our motion perception tool is a combination of radial and tangential motion. When the rings are expanding, the inner ring has a phase of $+ 45^{\circ}$ (illusory clockwise rotation), and the outer ring has a phase of $- 45^{\circ}$ (illusory counter-clockwise rotation). You may further change the angle of Gabor elements, using the slider on the right, and immediately notice its effect on the strength and direction of illusory rotations.

Static patterns of the Pinna–Brelstaff illusion (Pinna & Brelstaff, 2000) also elicit an illusory percept for translation and rotations of the head. This too can be reproduced with our tool by simply moving the camera, roughly recreating the required head motion. The ability to interact with the visual illusion by moving the camera is powerful as it allows us to understand action–perception coupling (Rolfs & Schweitzer, 2022) for the case of self-movement and visual motion perception.

The spine drift illusion (Bach, 2011; Kitaoka, 2010: shows a demo) demonstrates such a relationship between eye movements and motion perception (Menshikova & Krivykh, 2016). When we view this illusion, the central square appears to float with respect to the background. We can reproduce this by slightly moving the camera, which simulates fixational eye movements (Rolfs, 2009). This produces perceived motion along two distinct directions (row 6 of Figure 2), indicating that the center square excites motion receptors strongly in a direction different from that of the periphery.

In this paper, we explored a limited range of phenomena—some with imprecise head/eye movements. However, our tool easily extends to study other scenarios. You may use it to study motion perception during smooth pursuit (Battaje & Brock, 2022; Morvan & Wexler, 2009; Terao et al., 2015), or with slow erratic drift and miniature saccades (Rolfs, 2009), which are known to contribute to many motion illusions (Beer et al., 2008; Menshikova & Krivykh, 2016; Murakami, 2006; Troncoso et al., 2008). Or alternatively, to study illusions based on eye blinks (Faubert & Herbert, 1999; Otero-Millan et al., 2012).

The key to the usefulness of our tool is its ability to run in real-time. For this, we use PyTorch (Paszke et al., 2019), a library targeted toward deep learning. It makes low-level accelerated computing routines⁷ accessible through a high-level programming language. With a few lines of code, it is easy to apply linear filtering (convolutions) on a sequence of images—a spatiotemporal volume—in real-time. For example, our tool is a template for a component of an active vision robotic application (Battaje & Brock, 2022) that uses fixation and resultant motion cues for 3D perception.

Similarly, we believe the interactive real-time nature of this tool could be extended to other domains. From color perception to the perception of causality and animacy, when there are computational models that may be expressed as linear filters (for which computation is fast), it would be easy to implement tools similar to the one described here, and immediately “see” the results of a given model.

In conclusion, we present an interactive tool that helps explain early visual motion perception. The setup is simple: a laptop and an external webcam. Using this tool, we can easily explain old, as well as new, visual phenomena. This also works for phenomena that involve physical eye movements. The code is openly available and uses accelerated computing libraries that make it easy to adapt to other, more complex visual perception models. With this, the process of learning and discovery becomes as simple as playing with toys. We hope the vision science community can take advantage of such a method of interactive discovery.

Supplemental Material

sj-pdf-1-ipe-10.1177_20416695231159182 - Supplemental material for An interactive motion perception tool for kindergarteners (and vision scientists)

Supplemental material, sj-pdf-1-ipe-10.1177_20416695231159182 for An interactive motion perception tool for kindergarteners (and vision scientists) by Aravind Battaje, Oliver Brock and Martin Rolfs in i-Perception

Footnotes

Acknowledgements

We thank Annick Langlois for her feedback on the manuscript. We would also like to express our gratitude to Michael Bach for creating and maintaining a resource as playful and useful as .

Author Contribution(s)

Aravind Battaje: Conceptualization; Methodology; Software; Visualization; Writing – original draft.

Oliver Brock: Formal analysis; Supervision; Writing – review & editing.

Martin Rolfs: Formal analysis; Supervision; Writing – review & editing.

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy—EXC 2002/1 “Science of Intelligence”—project number 390523135.

ORCID iDs

Aravind Battaje

Oliver Brock

Martin Rolfs

Supplemental Material

Supplemental material for this article is available online.

Notes

How to cite this article

Battaje, A., Brock, O., & Rolfs, M. (2023). An interactive motion perception tool for kindergarteners (and vision scientists). i-Perception, 14(2), 1–6.

References

Adelson

E. H.

Bergen

J. R.

(1985). Spatiotemporal energy models for the perception of motion. Journal of the Optical Society of America. A, Optics and image science, 2, 284–299.

Anstis

(2001). Footsteps and inchworms: Illusions show that contrast affects apparent speed. Perception, 30, 785–794. .

Anstis

(2004). Factors affecting footsteps: Contrast can change the apparent speed, amplitude and direction of motion. Vision Research, 44, 2171–2178.

Bach

(1997). 148 optical illusions & visual phenomena + explanations. https://michaelbach.de/ot

Bach

(2002). Structure from motion at equiluminance. https://michaelbach.de/ot/col-equilu/

Bach

(2003). Pinna–Brelstaff illusion. https://michaelbach.de/ot/mot-PinnaBrelstaff/

Bach

(2004). “Stepping feet” motion illusion. https://michaelbach.de/ot/mot-feetLin/

Bach

(2011). “Spine drift” illusion. https://michaelbach.de/ot/mot-spineDrift/

Bach

(2013). Moirè patterns. https://michaelbach.de/ot/lum-moire1/

10.

Bach

(2014). Kinegram (“Scanimation”). https://michaelbach.de/ot/mot-scanimation/

11.

Bach

(2019). Grid masking. https://michaelbach.de/ot/lum-gridMasking/

12.

Battaje

Brock

(2022). One object at a time: accurate and robust structure from motion for robots. In 2022 IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 3598–3603). IEEE. https://doi.org/10.1109/IROS47612.2022.9981953

13.

Beer

A. L.

Heckel

A. H.

Greenlee

M. W.

(2008). A motion illusion reveals mechanisms of perceptual stabilization. PLoS one, 3, e2741. https://doi.org/10.1371/journal.pone.0002741.

14.

Challinor

K. L.

Mather

(2010). A motion-energy model predicts the direction discrimination and MAE duration of two-stroke apparent motion at high and low retinal illuminance. Vision Research, 50, 1109–1116. https://doi.org/10.1016/j.visres.2010.04.002.

15.

Faubert

Herbert

A. M.

(1999). The peripheral drift illusion: A motion illusion in the visual periphery. Perception, 28, 617–621.

16.

@jagarikin. (2022). Background movement is always constant (translated). Twitter. https://twitter.com/jagarikin/status/1531115731223339008

17.

Kitaoka

(2010). Tilt illusions and anomalous motion illusions . http://www.psy.ritsumei.ac.jp/\let\prOteCt\relax\Protect\csnameacp:c\endcsname{3}akitaoka/riken2010.html.

18.

Kitaoka

Anstis

(2021). A review of the footsteps illusion. Journal of Illusion, 2, 5612. https://doi.org/10.47691/joi.v2.5612.

19.

Mather

Challinor

K. L.

(2009). Psychophysical properties of two-stroke apparent motion. Journal of Vision, 9, 28. https://doi.org/10.1167/9.1.28.

20.

Menshikova

G. Y.

Krivykh

P. O.

(2016). The role of eye movements in the ‘Spine Drift’ illusion perception. International Journal of Psychophysiology, 108, 163. https://doi.org/10.1016/j.ijpsycho.2016.07.468.

21.

Morvan

Wexler

(2009). The nonlinear structure of motion perception during smooth eye movements. Journal of Vision, 9, 1.

22.

Murakami

(2006). Fixational eye movements and motion perception. In S. Martinez-Conde, S. L. Macknik, L. M. Martinez, J. M. Alonso, & P. U. Tse (Eds.), Progress in brain research (Vol. 154, pp. 193–209). Elsevier.

23.

Nishida

Kawabe

Sawayama

Fukiage

(2018). Motion perception: From detection to interpretation. Annual Review of Vision Science, 4, 501–523.

24.

Otero-Millan

Macknik

S. L.

Martinez-Conde

(2012). Microsaccades and blinks trigger illusory rotation in the “rotating snakes” illusion. Journal of Neuroscience, 32, 6043–6051. https://doi.org/10.1523/JNEUROSCI.5823-11.2012.

25.

Paszke

Gross

Massa

Lerer

Bradbury

Chanan

, … Chintala

(2019). Pytorch: An imperative style, high-performance deep learning library. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d’Alché-Buc, E. Fox & R. Garnett (Eds.), Advances in neural information processing systems 32 (pp. 8024–8035). Curran Associates, Inc..

26.

Pinna

Brelstaff

G. J.

(2000). A new visual illusion of relative motion. Vision Research, 40, 2091–2096. https://doi.org/10.1016/S0042-6989(00)00072-9.

27.

Rogers

Graham

(1979). Motion parallax as an independent cue for depth perception. Perception, 8, 125–134.

28.

Rolfs

(2009). Microsaccades: Small steps on a long way. Vision Research, 49, 2415–2441. https://doi.org/10.1016/j.visres.2009.08.010.

29.

Rolfs

Schweitzer

(2022). Coupling perception to action through incidental sensory consequences of motor behaviour. Nature Reviews Psychology, 1, 112–123. https://doi.org/10.1038/s44159-021-00015-x.

30.

Shapiro

A. G.

Todorovic

(2017). The Oxford Compendium of Visual Illusions. Oxford University Press. https://doi.org/10.1093/acprof:oso/9780199794607.001.0001.

31.

Spillmann

(1993). The perception of movement and depth in Moirè patterns. Perception, 22, 287–308.

32.

Terao

Murakami

Nishida

(2015). Enhancement of motion perception in the direction opposite to smooth pursuit eye movement. Journal of Vision, 15, 2.

33.

Troncoso

X. G.

Macknik

S. L.

Otero-Millan

Martinez-Conde

(2008). Microsaccades drive illusory motion in the Enigma illusion. Proceedings of the National Academy of Sciences of the United States of America, 105, 16033–16038. https://doi.org/10.1073/pnas.0709389105.

34.

Watson

A. B.

Ahumada

A. J.

(1985). Model of human visual-motion sensing. Journal of the Optical Society of America A, 2, 322–342. https://doi.org/10.1364/JOSAA.2.000322.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.56 MB