Modeling Attention to Salient Proto-objects

Similar documents
Selective visual attention enables learning and recognition of multiple objects in cluttered scenes

Is bottom-up attention useful for object recognition?

Selective visual attention enables learning and recognition of multiple objects in cluttered scenes

Saliency Detection: A Spectral Residual Approach

The Use of Attention and Spatial Information for Rapid Facial Recognition in Video

3D Object Recognition: A Model of View-Tuned Neurons

Robust Frequency-tuned Salient Region Detection

Spatio-temporal Saliency Detection Using Phase Spectrum of Quaternion Fourier Transform

Dynamic visual attention: competitive versus motion priority scheme

A Model of Salient Region Detection

A Novel Approach for Saliency Detection based on Multiscale Phase Spectrum

Evaluation of Selective Attention under Similarity Transforms

PERFORMANCE ANALYSIS OF COMPUTING TECHNIQUES FOR IMAGE DISPARITY IN STEREO IMAGE

A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes

A Modified Approach to Biologically Motivated Saliency Mapping

Using visual attention to extract regions of interest in the context of image retrieval

Salient Region Detection and Segmentation in Images using Dynamic Mode Decomposition

Enhanced object recognition in cortex-like machine vision

Neurally Inspired Mechanisms for the Dynamic Visual Attention Map Generation Task

Toward Object Recognition with Proto-Objects and Proto-Scenes

A New Framework for Multiscale Saliency Detection Based on Image Patches

Quasi-thematic Features Detection & Tracking. Future Rover Long-Distance Autonomous Navigation

Experimenting a Visual Attention Model in the Context of CBIR systems

Fixation-Based Visual Image Segmentation using the concept of Border-ownership

Salient Region Detection using Weighted Feature Maps based on the Human Visual Attention Model

CMPUT 616 Implementation of a Visual Attention Model

An Object-based Visual Attention Model for Robots

A Model of Dynamic Visual Attention for Object Tracking in Natural Image Sequences

An Improved Image Resizing Approach with Protection of Main Objects

International Conference on Information Sciences, Machinery, Materials and Energy (ICISMME 2015)

Learning to Detect A Salient Object

Suitability Analysis Based on Multi-Feature Fusion Visual Saliency Model in Vision Navigation

Object-Based Saliency Maps Harry Marr Computing BSc 2009/2010

AUTONOMOUS IMAGE EXTRACTION AND SEGMENTATION OF IMAGE USING UAV S

Evaluation of regions-of-interest based attention algorithms using a probabilistic measure

A Nonparametric Approach to Bottom-Up Visual Saliency

Categorization by Learning and Combining Object Parts

Small Object Segmentation Based on Visual Saliency in Natural Images

Saliency Detection Using Quaternion Sparse Reconstruction

A Multi-modal Attention System for Smart Environments

Shape Matching. Brandon Smith and Shengnan Wang Computer Vision CS766 Fall 2007

A Novel Approach to Image Segmentation for Traffic Sign Recognition Jon Jay Hack and Sidd Jagadish

Determining Patch Saliency Using Low-Level Context

A Model of Visual Attention for Natural Image Retrieval

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution

Improving the Selection and Detection of Visual Landmarks Through Object Tracking

THE HUMAN visual system interprets the world in a

Object Detection System

Local Features and Kernels for Classifcation of Texture and Object Categories: A Comprehensive Study

The Vehicle Logo Location System based on saliency model

Integrating Inhomogeneous Processing and Proto-object Formation in a Computational Model of Visual Attention

Multi Camera Visual Saliency Using Image Stitching

Effect of Salient Features in Object Recognition

TRABAJO FIN DE GRADO

Saliency Maps of High Dynamic Range Images

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Embedding High-Level Information into Low Level Vision: Efficient Object Search in Clutter

Vision and Image Processing Lab., CRV Tutorial day- May 30, 2010 Ottawa, Canada

Saliency Estimation Using a Non-Parametric Low-Level Vision Model

Analysis and Modeling of Fixation Point Selection for Visual Search in Cluttered Backgrounds

Video Deinterlacing by Graphic based Visual Saliency

Confidence Based updation of Motion Conspicuity in Dynamic Scenes

Supplementary Material for submission 2147: Traditional Saliency Reloaded: A Good Old Model in New Shape

Main Subject Detection via Adaptive Feature Selection

Asymmetry as a Measure of Visual Saliency

Representing 3D Objects: An Introduction to Object Centered and Viewer Centered Models

Saliency Detection in Aerial Imagery

ATTENTION BIASED SPEEDED UP ROBUST FEATURES (AB-SURF): A NEURALLY-INSPIRED OBJECT RECOGNITION ALGORITHM FOR A WEARABLE AID FOR THE VISUALLY-IMPAIRED

Random Walks on Graphs to Model Saliency in Images

Learning a Sparse Representation for Object Detection

Applying Preattentive Visual Guidance in Document Image Analysis

Saliency Extraction for Gaze-Contingent Displays

Predicting Visual Saliency of Building using Top down Approach

All Rights Reserved 2014 IJARECE

Introduction to visual computation and the primate visual system

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

People Recognition and Pose Estimation in Image Sequences

Sparse Models in Image Understanding And Computer Vision

An Oscillatory Correlation Model of Object-based Attention

Research Article A Computing Model of Selective Attention for Service Robot Based on Spatial Data Fusion

TOWARDS THE ESTIMATION OF CONSPICUITY WITH VISUAL PRIORS

The importance of intermediate representations for the modeling of 2D shape detection: Endstopping and curvature tuned computations

SAR image target detection in complex environments based on improved visual attention algorithm

Active Multi-View Object Search on a Humanoid Head

Research Article International Journal of Emerging Research in Management &Technology ISSN: (Volume-4, Issue-5) Abstract

Learning a Sparse Representation for Object Detection

BiTS A Biologically-Inspired Target Screener for Detecting Manmade Objects in Natural Clutter Backgrounds

Saliency Detection using Region-Based Incremental Center-Surround Distance

Image Resizing Based on Gradient Vector Flow Analysis

Super-Resolution (SR) image re-construction is the process of combining the information from multiple

Modeling Visual Cortex V4 in Naturalistic Conditions with Invari. Representations

ELL 788 Computational Perception & Cognition July November 2015

VISUAL ATTENTION MODEL WITH ADAPTIVE WEIGHTING OF CONSPICUITY MAPS FOR BUILDING DETECTION IN SATELLITE IMAGES

SUPPLEMENTARY MATERIAL FOR. Do computer vision models differ systematically from human object perception? RT Pramod 1,2 & SP Arun 1

The SIFT (Scale Invariant Feature

A Survey on Detecting Image Visual Saliency

A Hierarchical Compositional System for Rapid Object Detection

Design & Implementation of Saliency Detection Model in H.264 Standard

Probabilistic Models of Attention based on Iconic Representations and Predictive Coding

Object recognition in images by human vision and computer vision Chen, Q.; Dijkstra, J.; de Vries, B.

Transcription:

Modeling Attention to Salient Proto-objects Dirk Bernhardt-Walther Collaborators: Laurent Itti Christof Koch Pietro Perona Tommy Poggio Maximilian Riesenhuber Ueli Rutishauser Beckman Institute, University of Illinois at Urbana-Champaign

SUnS, Feb 2nd, 2007 2

SUnS, Feb 2nd, 2007 3

Attention and Object Recognition Parsing a scene into objects: Selective attention used to serialize perception How can objects be attended before they are perceived? Our solution: selection of salient proto-object regions likely to contain an object, based on bottom-up visual information. SUnS, Feb 2nd, 2007 4

Outline 1. A model for attending to salient protoobject regions. 2. Application to biologically plausible object recognition. 3. Application to computer vision. SUnS, Feb 2nd, 2007 5

Proto-objects in Coherence Theory Rensink, Visual Cognition 2000 SUnS, Feb 2nd, 2007 6

Attending to proto-objects Rensink, Vision Res. 2000 SUnS, Feb 2nd, 2007 7

Related Ideas Coherence Theory (Rensink 2000 a,b) Object files (Kahneman and Treisman 1984, 1992) Attention spreading over objects (Egly et al. 1994) Spreading of inhibition of return over objects (Jordan and Tipper 1999) SUnS, Feb 2nd, 2007 8

Bottom-up attention to locations Koch and Ullman 1985 Itti, Koch, and Niebur 1998 Itti and Koch 2001 SUnS, Feb 2nd, 2007 9

SUnS, Feb 2nd, 2007 10

Saliency-based region selection Matlab code: http://www.saliencytoolbox.net Smoothed mask Colors Binary mask Saliency Map Intensities Segmented FM Inhibition Of Return Orientations Feature Maps Original Image SUnS, Feb 2nd, 2007 11

1 2 3 4 SUnS, Feb 2nd, 2007 12

Outline 1. A model for attending to salient protoobject regions. 2. Application to biologically plausible object recognition. 3. Application to computer vision. SUnS, Feb 2nd, 2007 13

Proto-objects in Coherence Theory Rensink, Visual Cognition 2000 SUnS, Feb 2nd, 2007 14

Attending to proto-objects Rensink, Vision Res. 2000 SUnS, Feb 2nd, 2007 15

Attentional modulation in HMAX (with Maximilian Riesenhuber and Tomaso Poggio, MIT)? Modulation mask µ IT equivalent recognition layer Salient region selection Inhibition of return Spatial modulation V4 equivalent HMAX S2 layer V1 / V2 equivalent S1/C1 layers Walther et al. BMCV 2002 Walther & Koch, Neural Netw. 2006 (Riesenhuber & Poggio 1999) SUnS, Feb 2nd, 2007 16

Attentional modulation of V4 neurons Delayed match-to-sample task Firing rate of two neurons A B Attended Unattended McAdams & Maunsell 2000 21 % 38 % SUnS, Feb 2nd, 2007 17

Deployment of attention in area V4 Modulation of V4 neurons in macaques due to spatial attention µ (*) spatial and feature-based attention SUnS, Feb 2nd, 2007 18

Modeling Results paperclips Walther & Koch, NN 2006 error bars: s.e.m. over 21 paperclips (Logothetis et al. 1994) SUnS, Feb 2nd, 2007 19

Modeling Results MPI Tübingen faces error bars: s.e.m. over 21 faces (Vetter & Blanz 1994) SUnS, Feb 2nd, 2007 20

Deployment of attention in area V4 Modulation of V4 neurons in macaques due to spatial attention m Walther and Koch, Neural Netw. 2006 µ (*) spatial and feature-based attention SUnS, Feb 2nd, 2007 21

Summary biologically motivated vision Can use spatial attention to enable detection of multiple objects in a biologically realistic model of object recognition in cortex. 20-40 % modulation of neural activity is sufficient in our experiments compatible with neurophysiology SUnS, Feb 2nd, 2007 22

Attention Deployment in Machine Vision (with Ueli Rutishauser and Pietro Perona, Caltech) Spatial grouping and preferential processing of a likely object regions for: Learning and recognition of several independent objects from a single image, Pruning away clutter. All-or-none modulation preferable to save resources, i.e., µ = 1. Apply to recognition system by Lowe (2004): 1. Detect scale invariant features (SIFT), 2. Store and match constellations of SIFT features. SUnS, Feb 2nd, 2007 23

Learning and recognizing multiple objects Walther, Rutishauser, Koch, and Perona, CVIU 2005 SUnS, Feb 2nd, 2007 24

Grouping of keypoints SUnS, Feb 2nd, 2007 25

Learning and recognizing multiple objects 1 training image, 101 test images (a) (b) 2 objects learned from training image Book present in 24, box in 23 images, in four of these both objects are present (c) (d) Recognition results: Rutishauser, Walther, Koch, and Perona, CVPR 2004; Walther, Rutishauser, Koch, and Perona, CVIU 2005 SUnS, Feb 2nd, 2007 26

Objects in cluttered scenes Using the relative object (ROS) size as an inverse measure of the amount of clutter. ROS = # # pixels _ object pixels _ image SUnS, Feb 2nd, 2007 27

Recognition performance in clutter (false positive rate < 0.07 % in all cases) SUnS, Feb 2nd, 2007 28

Conclusions Suggest a simple bottom-up way to attend to proto-object regions, Demonstrate applications in biological modeling and in computer vision. Not covered: Non-spatial attention (e.g. feature or objectbased attention to overlapping stimuli), Top-down biases (see, e.g., Navalpakkam and Itti 2005). SUnS, Feb 2nd, 2007 29

Acknowledgements Collaborators: Caltech: Christof Koch, Ueli Rutishauser, Pietro Perona USC: Laurent Itti MIT: Maximilian Riesenhuber (now Georgetown), Tomaso Poggio Funding: Beckman Foundation, NSF, NIMH, Sloan-Swartz Foundation, Keck Foundation Matlab code available at: http://www.saliencytoolbox.net SUnS, Feb 2nd, 2007 30