New Approaches to 3D Vision

Paul Linton

7 min readDec 14, 2022

New volume from the The Royal Society

On AI, animal navigation, and human vision

Philosophical Transactions of the Royal Society B: Biological Sciences: Vol 378, No 1869

Close Drawer Menu Open Drawer Menu

royalsocietypublishing.org

Overview of articles below 👇🧵

Cover of the Royal Society volume “New Approaches to 3D Vision”

The Introduction asks:

Does 3D vision (in computer vision, animal navigation, and human vision) rely on building an accurate 3D model of the environment? Or can it simply rely on a partial or distorted 3D model, or no model at all?

New Approaches to 3D Vision | Philosophical Transactions of the Royal Society B: Biological…

In November 2021 we held a Royal Society scientific meeting on 'New approaches to 3D vision' with the following mission…

royalsocietypublishing.org

Gif of LIDAR scanning of the environment

2. Sergey Levine and Dhruv Shah argue:

Robot navigation should be based on ‘traversability’ rather than an explicit 3D map of the environment. And they explain how a robot can learn this directly through trial and error (‘experiential learning’)

Learning robotic navigation from experience: principles, methods and recent results | Philosophical…

Navigation is one of the most heavily studied problems in robotics and is conventionally approached as a geometric…

royalsocietypublishing.org

Autonomous robot delivering packages to people’s houses

3. Ida Momennejad provides:

A rubric for NeuroAI that distinguishes between:

1. Human-like behaviour

2. Neural plausibility, and

3. Engineering goals

And uses 3D navigation by AI as a key example

A rubric for human-like agents and NeuroAI | Philosophical Transactions of the Royal Society B…

Researchers across cognitive, neuro- and computer sciences increasingly reference 'human-like' artificial intelligence…

royalsocietypublishing.org

4. Jenny Read develops:

A new model of stereo vision for the praying mantis, explaining how an insect with much fewer neurons than humans is able to extract 3D information from stereo

Stereopsis without correspondence | Philosophical Transactions of the Royal Society B: Biological…

Gaining information about the 3D structure of the world from two-dimensional retinal images is a major challenge for…

royalsocietypublishing.org

5. Kate Jeffery explores:

The rotational and translational symmetry in navigation cells (place cells, head-direction cells, and grid cells), when this symmetry breaks down, and what this means for how space is processed in the brain

Symmetries and asymmetries in the neural encoding of 3D space | Philosophical Transactions of the…

The neural code for space centres on three canonical cell types: place cells, head direction cells and grid cells…

royalsocietypublishing.org

6. Edward Horrocks, Isabelle Mareschal, and Aman Saleem ask:

How are vision and motion integrated? Combining insights from humans and monkeys (self-motion affects the perception of optic flow) with their work on locomotion signals in mouse visual areas

Walking humans and running mice: perception and neural encoding of optic flow during self-motion |…

Locomotion produces full-field optic flow that often dominates the visual motion inputs to an observer ( figure 1) [ 1…

royalsocietypublishing.org

Mouse running down a tunnel in virtual reality

7. Andrew Glennerster argues that:

Human 3D vision doesn’t rely on building a 3D map of the environment, but instead if better thought as a collection of sensorimotor contingencies that map how our actions lead to changes in the 2D retinal image

Understanding 3D vision as a policy network | Philosophical Transactions of the Royal Society B…

Open your eyes and look around. It is natural to assume that your perception must depend on a 3D representation of the…

royalsocietypublishing.org

Person moving back and forth in virtual reality

8. Paul Linton argues for:

A new approach to stereo vision, and therefore visual scale and visual shape, arguing that we’ve wasted the last 200 years trying to shoe-horn stereo vision into Kepler and Descartes triangulation-based account

Minimal theory of 3D vision: new approach to visual scale and visual shape | Philosophical…

Since Kepler and Descartes in the early-1600s, vision science has been committed to a triangulation model of stereo…

royalsocietypublishing.org

Person pointing as vergence (angular rotation of eyes) is manipulated

9. Fulvio Domini argues for:

A rejection of probabilistic approaches to 3D vision, arguing that rather than trying to estimate the true 3D layout, human 3D vision instead prioritises maximising whatever 3D vision signal exists

The case against probabilistic inference: a new deterministic theory of 3D visual processing |…

The spatial organization of the visual world in three dimensions is a fundamental aspect of our conscious perception…

royalsocietypublishing.org

Person reaching for illuminated object in darkness

10. Dhanraj Vishwanath argues that:

3D vision relies on 3 distinct encodings:

1. 3D shape

2. egocentric distance, and

3. object-to-object distance

And explores how these 3 different encodings are prioritised in different viewing conditions

From pictures to reality: modelling the phenomenology and psychophysics of 3D perception |…

Textbook descriptions of the psychology of 3D visual perception most often claim that the visual system acts as a sort…

royalsocietypublishing.org

Wiggle gif of a man in a colonnade giving impression of 3D

11. Sarah Creem-Regehr, Jeanine Stefanucci, and Bobby Bodenheimer explore why:

Distances are underestimated in VR, and the ways in which distance perception in VR can be improved, from visuomotor experience in VR, to the properties of the headset

Perceiving distance in virtual reality: theoretical insights from contemporary technologies |…

Decades of research have shown that absolute egocentric distance is underestimated in virtual environments (VEs) when…

royalsocietypublishing.org

Woman putting on VR headset and entering virtual reality

12. Anna Rzepka, Kieran Hussey, Margaret Maltz, Karsten Babin, Laurie Wilcox, and Jody Culham:

Observers rely more on the familiar size of objects in VR than they do in the real world, questioning whether we can equate studies in VR with the real world

Familiar size affects perception differently in virtual reality and the real world | Philosophical…

The promise of virtual reality (VR) as a tool for perceptual and cognitive research rests on the assumption that…

royalsocietypublishing.org

Rubik’s cube and dice shrinking and growing in size

13. Robert Whitwell, Mehul Garach, Mel Goodale, and Irene Sperandio:

Defend the dissociation between perception and action in the Ebbinghaus illusion, where the illusion affects visual perception but not action

(vs. the suggestion that the dissociation is merely due to differences in eye movements in the two tasks)

Looking at the Ebbinghaus illusion: differences in neurocomputational requirements, not…

In a now classic grasping paradigm introduced in 1995, Aglioti et al. [ 1] showed that the scaling of grasp aperture…

royalsocietypublishing.org

14. Antonella Maselli, Eyal Ofek, Brian Cohn, Ken Hinckley, and Mar Gonzalez-Franco:

Displace the virtual location of the hand in VR, and find participants are more effective at making corrections towards the body midline than away from it

Enhanced efficiency in visually guided online motor control for actions redirected towards the body…

Reaching objects in a dynamic environment requires fast online corrections that compensate for sudden object shifts or…

royalsocietypublishing.org

Person reaching in virtual reality as virtual hand misplaced relative to real hand

15. Michael Morgan reviews:

The complexity of the stereo computations required to see an object coming fast towards you, like a snowball!

Stereopsis for rapidly moving targets | Philosophical Transactions of the Royal Society B…

Stereoscopic depth perception is possible with luminance-defined target velocities at least as high as 600° s−1, up to…

royalsocietypublishing.org

16. Ewa Niechwiej-Szwedo, Linda Colpa, and Agnes Wong review:

The importance of stereo vision for reaching and grasping in children, with the aim of better treating stereo vision deficits that affect half a billion people worldwide

The role of binocular vision in the control and development of visually guided upper limb movements…

Vision provides a key sensory input for the performance of fine motor skills, which are fundamentally important to…

royalsocietypublishing.org

Child trying to reach for spoon in mouth and missing

17. Ione Fine and Woon Ju Park ask:

How are people born blind able to cross a busy road using sound? Remarkably, their spatial processing of sounds recruits a brain region (hMT+) typically associated with the visual perception of motion

Do you hear what I see? How do early blind individuals experience object motion? | Philosophical…

One of the most important tasks for 3D vision is tracking the movement of objects in space. The ability of early blind…

royalsocietypublishing.org

Man crossing busy street with cars and motorbikes

Royal Society Meeting

This volume builds on Royal Society meeting ‘New Approaches to 3D Vision’ in November 2021

Abstracts and recordings of talks are available on the meeting website:

New approaches to 3D vision | Royal Society

You currently have JavaScript disabled in your web browser, please enable JavaScript to view our website as intended…

royalsociety.org

As well as YouTube playlist:

YouTube videos from Royal Society meeting

Finally, a big thank you to my co-editors:

Michael Morgan FRS, Jenny Read, Dhanraj Vishwanath, Sarah Creem-Regehr, and Fulvio Domini, as well as The Royal Society

New Approaches to 3D Vision

Philosophical Transactions of the Royal Society B: Biological Sciences: Vol 378, No 1869

Close Drawer Menu Open Drawer Menu

New Approaches to 3D Vision | Philosophical Transactions of the Royal Society B: Biological…

In November 2021 we held a Royal Society scientific meeting on 'New approaches to 3D vision' with the following mission…

Learning robotic navigation from experience: principles, methods and recent results | Philosophical…

Navigation is one of the most heavily studied problems in robotics and is conventionally approached as a geometric…

A rubric for human-like agents and NeuroAI | Philosophical Transactions of the Royal Society B…

Researchers across cognitive, neuro- and computer sciences increasingly reference 'human-like' artificial intelligence…

Stereopsis without correspondence | Philosophical Transactions of the Royal Society B: Biological…

Gaining information about the 3D structure of the world from two-dimensional retinal images is a major challenge for…

Symmetries and asymmetries in the neural encoding of 3D space | Philosophical Transactions of the…

The neural code for space centres on three canonical cell types: place cells, head direction cells and grid cells…

Walking humans and running mice: perception and neural encoding of optic flow during self-motion |…

Locomotion produces full-field optic flow that often dominates the visual motion inputs to an observer ( figure 1) [ 1…

Understanding 3D vision as a policy network | Philosophical Transactions of the Royal Society B…

Open your eyes and look around. It is natural to assume that your perception must depend on a 3D representation of the…

Minimal theory of 3D vision: new approach to visual scale and visual shape | Philosophical…

Since Kepler and Descartes in the early-1600s, vision science has been committed to a triangulation model of stereo…

The case against probabilistic inference: a new deterministic theory of 3D visual processing |…

The spatial organization of the visual world in three dimensions is a fundamental aspect of our conscious perception…

From pictures to reality: modelling the phenomenology and psychophysics of 3D perception |…

Textbook descriptions of the psychology of 3D visual perception most often claim that the visual system acts as a sort…

Perceiving distance in virtual reality: theoretical insights from contemporary technologies |…

Decades of research have shown that absolute egocentric distance is underestimated in virtual environments (VEs) when…

Familiar size affects perception differently in virtual reality and the real world | Philosophical…

The promise of virtual reality (VR) as a tool for perceptual and cognitive research rests on the assumption that…

Looking at the Ebbinghaus illusion: differences in neurocomputational requirements, not…

In a now classic grasping paradigm introduced in 1995, Aglioti et al. [ 1] showed that the scaling of grasp aperture…

Enhanced efficiency in visually guided online motor control for actions redirected towards the body…

Reaching objects in a dynamic environment requires fast online corrections that compensate for sudden object shifts or…

Stereopsis for rapidly moving targets | Philosophical Transactions of the Royal Society B…

Stereoscopic depth perception is possible with luminance-defined target velocities at least as high as 600° s−1, up to…

The role of binocular vision in the control and development of visually guided upper limb movements…

Vision provides a key sensory input for the performance of fine motor skills, which are fundamentally important to…

Do you hear what I see? How do early blind individuals experience object motion? | Philosophical…

One of the most important tasks for 3D vision is tracking the movement of objects in space. The ability of early blind…

New approaches to 3D vision | Royal Society

You currently have JavaScript disabled in your web browser, please enable JavaScript to view our website as intended…

Written by Paul Linton