To Conclude: Vision in terms of

Similar documents
2D Scene Representation and Description

Computational Foundations of Cognitive Science

All human beings desire to know. [...] sight, more than any other senses, gives us knowledge of things and clarifies many differences among them.

Representing 3D Objects: An Introduction to Object Centered and Viewer Centered Models

CSE 4392/5369. Dr. Gian Luca Mariottini, Ph.D.

Neural Nets. CSCI 5582, Fall 2007

Backpropagation in Neural Nets, and an Introduction to Vision. CSCI 5582, Fall 2007

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Introduction to visual computation and the primate visual system

A Hierarchial Model for Visual Perception

Measurement of Pedestrian Groups Using Subtraction Stereo

Computer Vision. Introduction

Introduction to Computer Graphics with WebGL

What is Computer Vision?

What Happened to the Representations of Perception? Cornelia Fermüller Computer Vision Laboratory University of Maryland

Visualizing Flow Fields by Perceptual Motion

Dynamic visual attention: competitive versus motion priority scheme

DEEP LEARNING AND DIGITS DEEP LEARNING GPU TRAINING SYSTEM

Image Formation. Ed Angel Professor of Computer Science, Electrical and Computer Engineering, and Media Arts University of New Mexico

The Hilbert Problems of Computer Vision. Jitendra Malik UC Berkeley & Google, Inc.

Vision: From Eye to Brain (Chap 3)

Fast Border Ownership Assignment with Bio-Inspired Features

12/3/2009. What is Computer Vision? Applications. Application: Assisted driving Pedestrian and car detection. Application: Improving online search

S3 (3.1) N5 Volume.notebook April 30, 2018

3D Object Recognition: A Model of View-Tuned Neurons

Sketchable Histograms of Oriented Gradients for Object Detection

Computer Vision, CS766. Staff. Instructor: Li Zhang TA: Jake Rosin

Shape from shading. Surface brightness and Surface Orientation --> Reflectance map READING: Nalwa Chapter 5. BKP Horn, Chapter 10.

Three-Dimensional Computer Vision

Does the Brain do Inverse Graphics?

Small is the New Big: Data Analytics on the Edge

Range Image Registration with Edge Detection in Spherical Coordinates

Interactive Inverted Perspective Rendering for Architectural Visualization

Computer Graphics Global Illumination

Why equivariance is better than premature invariance

CSE4030 Introduction to Computer Graphics

Neuromorphic Hardware. Adrita Arefin & Abdulaziz Alorifi

Computer Vision. I-Chen Lin, Assistant Professor Dept. of CS, National Chiao Tung University

Flow fields PSY 310 Greg Francis. Lecture 25. Perception

Computer Graphics. Bing-Yu Chen National Taiwan University The University of Tokyo

Face Cyclographs for Recognition

Does the Brain do Inverse Graphics?

Image Processing, Analysis and Machine Vision

By Suren Manvelyan,

S3 (3.1) Volume.notebook March 02, 2016

CSE 527: Intro. to Computer

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Illusory Contour Detection Using MRF Models

Computer Graphics Global Illumination

A Computational Approach To Understanding The Response Properties Of Cells In The Visual System

Mobile Robots Summery. Autonomous Mobile Robots

Computer Vision: Making machines see

CONCENTRATIONS: HIGH-PERFORMANCE COMPUTING & BIOINFORMATICS CYBER-SECURITY & NETWORKING

Deep Learning in Image Processing

Attributes and More Crowdsourcing

Visual Perception. Basics

PART-LEVEL OBJECT RECOGNITION

7. The Geometry of Multi Views. Computer Engineering, i Sejong University. Dongil Han

Game Programming. Bing-Yu Chen National Taiwan University

arxiv:q-bio/ v1 [q-bio.nc] 4 Oct 2004

Local Features and Bag of Words Models

Texture. COS 429 Princeton University

Efficient Visual Coding: From Retina To V2

Object Recognition. The Chair Room

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Vision: Form Perception

DIGITS DEEP LEARNING GPU TRAINING SYSTEM

Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting

Robotics Programming Laboratory

NICTA From imagination to impact

Emerging Vision Technologies: Enabling a New Era of Intelligent Devices

Some properties of our visual system. Designing visualisations. Gestalt principles

Course Administration

Evolution of Impossible Objects

The representation of visual depth perception based on the plenoptic function in the retina and its neural computation in visual cortex V1

Global Illumination. Frank Dellaert Some slides by Jim Rehg, Philip Dutre

Which is better? Sentential. Diagrammatic Indexed by location in a plane

Contextual Co-occurrence Information for Object Representation and Categorization

Animation. Identifying Types of Animation. Copyright Texas Education Agency, All rights reserved.

Multi-view Stereo. Ivo Boyadzhiev CS7670: September 13, 2011

Scene-Centered Description from Spatial Envelope Properties

On Board 6D Visual Sensors for Intersection Driving Assistance Systems

Grouping and Segmentation

Texture April 17 th, 2018

Boundaries and Sketches

Collaborative Mapping with Streetlevel Images in the Wild. Yubin Kuang Co-founder and Computer Vision Lead

Practice Exam Sample Solutions

Anatomical and Functional Modeling in Neuroscience using Blender

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem

Computational Aesthetics for Rendering Virtual Scenes on 3D Stereoscopic Displays

Lecture 12 Recognition

Other Reconstruction Techniques

A SYNOPTIC ACCOUNT FOR TEXTURE SEGMENTATION: FROM EDGE- TO REGION-BASED MECHANISMS

CPSC 532E Week 6: Lecture. Surface Perception; Completion

Visual Perception for Robots

Searching Image Databases Containing Trademarks

The elements of early vision or, what vision (and this course) is all about. NYU/CNS Center for Neural Science

W4. Perception & Situation Awareness & Decision making

Joint design of data analysis algorithms and user interface for video applications

Lecture 12 Recognition. Davide Scaramuzza

Transcription:

To Conclude: Vision in terms of Neurophysiology Receptive fields Left/right hemisphere Visual pathway, packing problem, columns, complementary features Cognitive psychology Perceptual grouping Bottom-up vs. top-down processes Optical illusions Hemispheres, motion perception, Information processing The Marr Paradigm 1

David Marr [1945-1980] Vision [1982] VISION A Computational Investigation into the Human Representation and Processing of Visual Information Freeman Co., 1982 NEUROBIOLOGY 2

David Marr Vision Institute of Electrical Measurement and Measurement Signal Processing 3

David Marr Vision Institute of Electrical Measurement and Measurement Signal Processing 4

David Marr Vision What does it mean, to see? The plain man s answer (and Aristotle s, too) would be, to know what is where by looking. In other words, vision is the process of discovering from images what is present in the world, and where it is. (p.3, 1 st paragraph of General Introduction) Vision Image Understanding: To know what is where. D. Marr 3D 2D, Reconstruction vs. recognition Video Understanding: What is where and when? (borrowed from D. Marr) in space and time 4D 3D 5

Marr Vision: Emphasis on Reconstruction 6

Marr Vision: Emphasis on Reconstruction 7

The Marr Paradigm Computational Framework Stone (Vision and Brain, 2012): computational framework suggests: Vision works like a computer. Better: informational framework because Marr was keen to emphasize the nature of information being processed without necessarily referring to the particular machinery (e.g., neurons or chips) 8

The Marr Paradigm Analogy with Flying Marr (p.27): Importance of Computational Theory an algorithm is likely to be understood more readily by understanding the nature of the problem being solved than by examining the mechanism (and the hardware) in which it is embodied. trying to understand perception by studying only neurons is like trying to understand bird flight by studying only feathers: It just cannot be done. First understand aerodynamics, then think about structures of feathers, shape of wings etc. Wright brothers 1902 (from [Stone, 2012]) 9

The Marr Paradigm Computational Framework 3D surface shape by finding surface normals from shading information SfS 0 90 180 Greylevels Surface normals Neurons A single CPU Multicore CPUs GPUs frogs passing cupcakes. [Stone] 10

David Marr Vision Representational Framework Primal sketch 2-1/2D sketch 3D model 11

David Marr Vision Representational Framework Primal sketch 2-1/2D sketch viewer centered 3D model object centered 12

Marr Primal Sketch saliency! Compare today s interest point, line, edge detection, etc. Raw primal sketch full primal sketch (includes grouping) 13

Marr 2-1/2D Sketch Surface patches (surface normals), depth discontinuitites 14

Marr 3D Model Representation Generalized cylinder, generalized cone 3D hierarchical models 15

David Marr Vision Representational Framework Primal sketch 2-1/2D sketch 3D model 16

Defining the Terms Image Understanding + Video Understanding Image Understanding Processing Image Computer Graphics Scene description 17

Please describe this scene: Scene Description Many possible (+correct!) descriptions Correct/best description may depend on the particular goal(s) purposive, qualitative, active vision [Aloimonos, 1992] 18

My Model of Image Understanding [Pinz, 1994] Repräsentationen Prozesse Datenfluss Kontrollfluss 19

My Model of Image Understanding 20

Up to WS 2014/15: - Mostly 2D - Image understanding Institute of Electrical Measurement and Measurement Signal Processing My Model of Image Understanding This course: - Can this be extended towards video understanding? KU: 2D image and scene description 21

2D Scene Description houses [Matsuyama 90] face [Brunelli 92] pedestrians [Suzuki 90] 2D image objects tokens 22

2D (+time!) Video Description Fast object segmentation in unconstrained video [Papazoglou&Ferrari, ICCV 13] http://groups.inf.ed.ac.uk/calvin/fastvideosegmentation/ 23

2D (+time!) Video Description Fast object segmentation in unconstrained video [Papazoglou&Ferrari, ICCV 13] 24

3D Scene Description Scenecoordinate system S Object 1 Object 2 25

3D (+time!) Video Description 26

More Definition: Visual Recognition [Perona 09] The holy grail of Computer Vision Five tasks of visual recognition : Verification (is a car in the image?) Detection and localization (what is there? where?) Classification (n beach images, m city images) Naming (name and locate all objects in an image) Description: objects, actions, relations, etc. (example kissing scene understanding ) Increasing complexity from top bottom Complexity Image and Video Understanding: mostly 2D (+time) recognition Image-based Measurement: 3D (+time) reconstruction 27

2D Scene Representation and Description You can get very far in 2D! image segmentation image description 2D image object token tokenset 2D grouping 2D scene description 28