Perception, Part 2 Gleitman et al. (2011), Chapter 5

Similar documents
Basic distinctions. Definitions. Epstein (1965) familiar size experiment. Distance, depth, and 3D shape cues. Distance, depth, and 3D shape cues

Binocular cues to depth PSY 310 Greg Francis. Lecture 21. Depth perception

PERCEIVING DEPTH AND SIZE

Important concepts in binocular depth vision: Corresponding and non-corresponding points. Depth Perception 1. Depth Perception Part II

Stereovision. Binocular disparity

Prof. Feng Liu. Spring /27/2014

lecture 10 - depth from blur, binocular stereo

Realtime 3D Computer Graphics Virtual Reality

Robert Collins CSE486, Penn State Lecture 08: Introduction to Stereo

Stereo. Shadows: Occlusions: 3D (Depth) from 2D. Depth Cues. Viewing Stereo Stereograms Autostereograms Depth from Stereo

Mahdi Amiri. May Sharif University of Technology

Stereo CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

Virtual Reality ll. Visual Imaging in the Electronic Age. Donald P. Greenberg November 16, 2017 Lecture #22

Think-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth?

Depth. Common Classification Tasks. Example: AlexNet. Another Example: Inception. Another Example: Inception. Depth

Computer and Machine Vision

DEPTH PERCEPTION. Learning Objectives: 7/31/2018. Intro & Overview of DEPTH PERCEPTION** Speaker: Michael Patrick Coleman, COT, ABOC, & former CPOT

Project 4 Results. Representation. Data. Learning. Zachary, Hung-I, Paul, Emanuel. SIFT and HoG are popular and successful.

Processing Framework Proposed by Marr. Image

Multi-View Geometry (Ch7 New book. Ch 10/11 old book)

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

CSc I6716 Spring D Computer Vision. Introduction. Instructor: Zhigang Zhu City College of New York

CEng Computational Vision

3D Computer Vision. Introduction. Introduction. CSc I6716 Fall Instructor: Zhigang Zhu City College of New York

USE R/G GLASSES. Binocular Combina0on of Images. What can happen when images are combined by the two eyes?

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1

Stereoscopic media. 3D is hot today. 3D has a long history. 3D has a long history. Digital Visual Effects Yung-Yu Chuang

Optimizing Monocular Cues for Depth Estimation from Indoor Images

Recap: Features and filters. Recap: Grouping & fitting. Now: Multiple views 10/29/2008. Epipolar geometry & stereo vision. Why multiple views?

COS Lecture 10 Autonomous Robot Navigation

Perceptual Optics for Prisms and Lenses (for vision therapists)

COMP 558 lecture 22 Dec. 1, 2010

Local qualitative shape from stereo. without detailed correspondence. Extended Abstract. Shimon Edelman. Internet:

Feature Tracking and Optical Flow

Shadows in the graphics pipeline

Visual Perception. Basics

General Principles of 3D Image Analysis

Visual areas in the brain. Image removed for copyright reasons.

Lecture 19. Depth Perception Reading Assignments: TMB2 7.1

Computational Foundations of Cognitive Science

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Multidimensional image retargeting

CS 534: Computer Vision Segmentation and Perceptual Grouping

Stereo: Disparity and Matching

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Robert Collins CSE486, Penn State. Robert Collins CSE486, Penn State. Image Point Y. O.Camps, PSU. Robert Collins CSE486, Penn State.

2/26/2016. Chapter 23 Ray Optics. Chapter 23 Preview. Chapter 23 Preview

Comparison between Motion Analysis and Stereo

Practice Exam Sample Solutions

High-Fidelity Augmented Reality Interactions Hrvoje Benko Researcher, MSR Redmond

Lecture 14: Computer Vision

Natural Viewing 3D Display

A Simple Vision System

Miniature faking. In close-up photo, the depth of field is limited.

CS 563 Advanced Topics in Computer Graphics Stereoscopy. by Sam Song

Chapter 5. Projections and Rendering

IDE-3D: Predicting Indoor Depth Utilizing Geometric and Monocular Cues

Presentation #: CL05

Conflict? What conflict?

Binocular Stereo Vision

PHYSICS. Chapter 34 Lecture FOR SCIENTISTS AND ENGINEERS A STRATEGIC APPROACH 4/E RANDALL D. KNIGHT

Depth cue integration: stereopsis and image blur

Visual motion. Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys

Final Exam Study Guide

Correspondence and Stereopsis. Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]

Depth. Chapter Stereo Imaging

Complex Sensors: Cameras, Visual Sensing. The Robotics Primer (Ch. 9) ECE 497: Introduction to Mobile Robotics -Visual Sensors

Perspective and vanishing points

Perceived visual direction near an occluder

EE795: Computer Vision and Intelligent Systems

PERCEPTION. 2D1380 Artificial Intelligence. 2D1380 Artificial Intelligence. 2D1380 Artificial Intelligence. 2D1380 Artificial Intelligence

Finally: Motion and tracking. Motion 4/20/2011. CS 376 Lecture 24 Motion 1. Video. Uses of motion. Motion parallax. Motion field

Stereospatial masking and aftereffect with normal and transformed random-dot patterns*

Chapter Three: Perceiving and Representing Shape and Depth

Human Perception of Objects

Multiple View Geometry

Stereoscopic transparency: a test for binocular vision s disambiguating power 1

3D-TV Content Creation: Automatic 2D-to-3D Video Conversion Liang Zhang, Senior Member, IEEE, Carlos Vázquez, Member, IEEE, and Sebastian Knorr

Inline Computational Imaging: Single Sensor Technology for Simultaneous 2D/3D High Definition Inline Inspection

Chapter 26 Geometrical Optics

Lecture 19: Depth Cameras. Visual Computing Systems CMU , Fall 2013

The Uniqueness Constraint Revisited:

CHAPTER 7. The perception of surface and of interreflections

Towards Measuring of Depth Perception from Monocular Shadow Technique with Application in a Classical Painting

Stereo imaging ideal geometry

VS 117 LABORATORY III: FIXATION DISPARITY INTRODUCTION

Integrating multiple cues to depth order at object boundaries

CHAPTER 6 PERCEPTUAL ORGANIZATION BASED ON TEMPORAL DYNAMICS

Poor visibility of motion in depth is due to early motion averaging

Visual Pathways to the Brain

3D Scanning. Qixing Huang Feb. 9 th Slide Credit: Yasutaka Furukawa

Optimization of Proximity Judgment

Announcements. Stereo

Scaling of Rendered Stereoscopic Scenes

Greatly enhanced visual detail and vividity. Accuracy based on mathematical derivation Disparity can function in isolation (RDS)

CAP 5415 Computer Vision. Fall 2011

Introduction to visual computation and the primate visual system

A Qualitative Analysis of 3D Display Technology

Hysteresis in human binocular fusion: temporalward and nasalward ranges

The Absence of Depth Constancy in Contour Stereograms

Transcription:

Perception, Part 2 Gleitman et al. (2011), Chapter 5 Mike D Zmura Department of Cognitive Sciences, UCI Psych 9A / Psy Beh 11A February 27, 2014 T. M. D'Zmura 1

Visual Reconstruction of a Three-Dimensional Scene Retinal images are two-dimensional The depth or distance from the viewer must be recovered: the third dimension Depth Cues Pictorial or static cues (found in a single monocular image) Physiological cues (weak) Stereo disparity (used by many but not all people when viewing a scene with two eyes) Motion cues (available from a sequence of monocular images) Optic flow (caused by motion of the viewer) Independent object motion T. M. D'Zmura 2

T. M. D'Zmura 3

How do we know that the white square lies in front of the gray disk? Perhaps the gray disk is a pacman eating the white square. Perceptual grouping (closure and convexity) leads us to the standard interpretation: the white square occludes the gray disk. T. M. D'Zmura 4

Any occlusion in this picture? T. M. D'Zmura 5

T-Junctions X-Junctions While T-junctions are commonly thought to be an important cue to occlusion, detection and interpretation of T-junctions in visual imagery is very tough perceptual grouping principles provide the correct answer without their use T. M. D'Zmura 6

Amodal Completion of the Object Lying Behind the Occluder try http://www.michaelbach.de/ot/mot_breathingsquare/index.html T. M. D'Zmura 7

Shading and Shadows cast shadow T. M. D'Zmura 8

Shading and Shadows attached shadow T. M. D'Zmura 9

Shading and Shadows shadow: cast and attached T. M. D'Zmura 10

Shading and Shadows Any shadows in this picture? shadow: cast and attached T. M. D'Zmura 11

Shading and Shadows a shaded crater T. M. D'Zmura 12

Shading and Shadows a shaded crater, photo upside-down T. M. D'Zmura 13

Shading and Shadows visual system assumes light from above (in a retinal coordinate system view with your head upside-down!) T. M. D'Zmura 14

Shading and Shadows Any shading in this picture? visual system assumes light from above (in a retinal coordinate system view with your head upside-down!) T. M. D'Zmura 15

Shading and Shadows from http://gandalf.psych.umn.edu/~kersten/kersten-lab/demos/shadows.html by Kersten, Mamassian and Knill T. M. D'Zmura 16

Shading and Shadows Aerial Perspective (aka atmospheric perspective) scattering of light by molecules and particles along the path from a distant surface to the eye causes a spatial blurring and loss of contrast T. M. D'Zmura 17

Shading and Shadows Aerial Perspective short-wavelength light is scattered more than light at longer wavelengths; things off in the distance look more blue T. M. D'Zmura 18

Shading and Shadows Aerial Perspective Retinal and Familiar Size (Relative Size) If we know the rough size of an object (say, a person), then we can infer from its retinal image size how far away it is. T. M. D'Zmura 19

Shading and Shadows Aerial Perspective Retinal and Familiar Size Linear Perspective from J.J. Gibson (1957) lines parallel in 3D space converge at a vanishing point T. M. D'Zmura 20

Shading and Shadows Aerial Perspective Retinal and Familiar Size Linear Perspective things close to the vanishing point: distant things far from the vanishing point: near lines parallel in 3D space converge at a vanishing point T. M. D'Zmura 21

Shading and Shadows Aerial Perspective Retinal and Familiar Size Linear Perspective Perugino s Christ Delivering Keys to St. Peter, 1485 T. M. D'Zmura 22

Shading and Shadows Aerial Perspective Retinal and Familiar Size Linear Perspective Any perspective cues in this picture? T. M. D'Zmura 23

Shading and Shadows Aerial Perspective Retinal and Familiar Size Linear Perspective Texture Gradients T. M. D'Zmura 24

Shading and Shadows Aerial Perspective Retinal and Familiar Size Linear Perspective Texture Gradients T. M. D'Zmura 25

Shading and Shadows Aerial Perspective Retinal and Familiar Size Linear Perspective Texture Gradients T. M. D'Zmura Caillebotte, 1877 26

Shading and Shadows Aerial Perspective Retinal and Familiar Size Linear Perspective Texture Gradients Victor Vasarely, Vega-Nor, 1969 T. M. D'Zmura 27

Shading and Shadows Aerial Perspective Retinal and Familiar Size Linear Perspective Texture Gradients T. M. D'Zmura 28

Shading and Shadows Aerial Perspective Retinal and Familiar Size (Relative Size) Linear Perspective Texture Gradients Height in the Plane (Relative Height) things toward the bottom of an image tend to be nearer than things toward the top T. M. D'Zmura 29

Shading and Shadows Aerial Perspective Retinal and Familiar Size (Relative Size) Linear Perspective Texture Gradients Height in the Plane (Relative Height) T. M. D'Zmura 30

Physiological Cues Accommodation The crystalline lens of the eye changes shape to help keep objects in focus near accommodation lens more curved far accommodation lens more flat likely a weak cue to depth T. M. D'Zmura 31

Physiological Cues Accommodation Convergence & Divergence vergence movements of the two eyes, in opposite directions, to fixate -likely a weak cue to depth T. M. D'Zmura 32

Binocular Depth Perception: Stereopsis Disparity the difference between the retinal images of the left and right eyes Suppose that your eyes are converged on a point of intermediate distance. A point lying closer ( Near ) will give rise to retinal images displaced towards the ears (temporally) crossed disparity or near disparity A point lying farther ( Far ) will give rise to retinal images displaced towards the nose (nasally) uncrossed disparity or far disparity T. M. D'Zmura 33

Binocular Depth Perception: Stereopsis Fusion the merging of disparate retinal images into a single, unified percept Diplopia double vision arising from failure of fusion T. M. D'Zmura 34

Fusion is possible within Panum s area. Points which lie outside this area appear diplopic (double image). Panum s Fusional Area Horopter Diplopia Diplopia Diplopia F F T. M. D'Zmura 35

Stereopsis ability to recover depth information from binocular disparity people possess this ability to greater or lesser degrees; perhaps as many as 10% do not have usable stereopsis -such people are stereoblind T. M. D'Zmura 36

Among the easiest stereo displays to view are stereo pairs, for which one tries to cross one s eyes so that the image at left and the image at right come to lie atop one another in the center. T. M. D'Zmura 37

Here s another... T. M. D'Zmura 38

Yet another... T. M. D'Zmura 39

Here is an Escheresque stereo pair... T. M. D'Zmura 40

Vision scientists have, since the work of Bela Julesz, used random dot stereograms in experiments on stereopsis. These provide little cue to 3D content other than disparity. The stereo pair below works in the same way as the previous stereo pairs but might take a little more effort. fine book: Bela Julesz (1971) Foundations of Cyclopean Perception. (Chicago: Univ. Chicago Press). T. M. D'Zmura 41

Such displays demonstrate the notion of global stereopsis, the perception of depth in the absence of monocular shape or form. The task for the visual system is to find dots in one eye s image that correspond to dots in the other eye s image: the correspondence problem T. M. D'Zmura 42

Kinect Structured Illumination rapidly shine infrared laser (left) in a large number of known directions use an infrared-sensitive camera (right) to measure where in the image the infrared spots fall infer the depth at the position of the infrared spot by comparing the known direction the known camera image position Result: Depth map across the visual image received by a second camera (standard RGB color camera, center) T. M. D'Zmura 43

Motion Cues to Depth Optic Flow (caused by motion of the viewer) Independent Object Motion How do we perceive motion in the first place?? T. M. D'Zmura 44

Motion Detectors (Werner Reichardt) DELAY x x DELAY -1 + +1 yikes! T. M. D'Zmura 45

Stimulation Time 1 DELAY x Let s try to detect a light moving to the left... T. M. D'Zmura 46

Stimulation Time 1 DELAY x Positive response by the left receptor to the light, held in the delay mechanism for one time unit... T. M. D'Zmura 47

Stimulation the light has moved to the right Time 2 DELAY x multiplication Response to rightward motion! Positive response by the right receptor to the light, multiplied by the delayed positive response by the left receptor T. M. D'Zmura 48

A problem... Stimulation Shine a light steadily on both receptors DELAY x Responds to steady light (oops!) We need a way to eliminate the response of such a system to a steady light covering both receptors. T. M. D'Zmura 49

Solution, Part 1 Add a leftwards-motion detector DELAY x x DELAY -1 + +1 T. M. D'Zmura 50

Solution, Part 2 Compare the responses of the leftwardsand rightwards-motion detectors DELAY x x DELAY -1 + +1 T. M. D'Zmura 51

Reichardt Detectors: Motion Opponency DELAY x x DELAY -1 Inhibition: Leftwards Motion + +1 Excitation: Rightwards Motion T. M. D'Zmura 52

Reichardt Detectors: Motion Opponency Leftward and rightward motions are compared by the same opponent neurons. Upward and downward motions are compared by the same opponent neurons. Causing such neurons to adapt to a motion stimulus can lead to interesting motion aftereffects. http://www.michaelbach.de/ot/mot_adapt/index.html T. M. D'Zmura 53

Motion Cues to Depth: Structure From Motion Independent Object Motion provides dues as to object shape and depth Try http://www.michaelbach.de/ot/sze_necker/index.html Biological interpretation of apparent motion (Johansson) http://www.michaelbach.de/ot/mot_biomot/index.html try also http://www.journalofvisi on.org/content/suppl/20 11/01/13/2.5.2.DCSupple mentaries/genderclass.s wf and http://www.journalofvisi on.org/content/suppl/20 11/01/13/3.4.1.DCSupple mentaries/dog.swf T. M. D'Zmura 54

Motion Cues to Depth: Motion Parallax as you move along, nearby objects move more rapidly through your visual field distant objects move more slowly through your visual field T. M. D'Zmura 55