From natural scene statistics to models of neural coding and representation (part 1)

Size: px
Start display at page:

Download "From natural scene statistics to models of neural coding and representation (part 1)"

Transcription

1 From natural scene statistics to models of neural coding and representation (part 1) Bruno A. Olshausen Helen Wills Neuroscience Institute, School of Optometry and Redwood Center for Theoretical Neuroscience UC Berkeley

2 Review article: What Natural Scene Statistics Can Tell Us about Cortical Representation Bruno A. Olshausen & Michael S. Lewicki To appear in: The New Visual Neurosciences Chalupa and Werner, Eds. MIT Press

3 Today s talk Why natural scene statistics? - (a bit about biology) Theory of Redundancy Reduction Sparse Coding

4 What are the principles of computation and representation governing this system?

5 Natural images are full of ambiguity

6 Natural images are full of ambiguity

7 Vision as inference lens World Image Model

8 Visual cortical areas - macaque monkey

9 Visual cortical areas 46 TF TH STPa 7a FEF STPp AITd CITv AITv CITd Faces/objects VIP LIP MSTd MSTI FST PITd PITv DP VOT Intermediate-level MDP MIP PO MT V4t V4 vision PIP V3A V2 V1 (courtesy of Jeff Hawkins)

10

11

12

13 1 mm 2 of cortex analyzes ca. 14 x 14 array of retinal sample nodes and contains 100,000 neurons (Anderson & Van Essen, 1995)

14

15

16

17 Anatomy of a synapse

18 The evolution of eyes Land & Fernald (1992)

19

20 Efficient Coding represent the most relevant visual information with the fewest physical and metabolic resources

21 Theory of Redundancy Reduction Attneave (1954) Some Informational Aspects of Visual Perception Barlow (1961) Possible Principles Underlying the Transformations of Sensory Messages - nervous system should reduce redundancy - makes more efficient use of neural resources - enables storing information about prior probabilities since P (x) =Π i P (x i ) suspicious coincidences

22 From theory to models Laughlin (1981) - histogram equalization Laughlin, Srinivasan and Dubs (1982) - Predictive Coding: A Fresh View of Inhibition in the Retina Field (1987) - natural images have 1/f 2 power spectra Atick & Redlich (1992); van Hateren (1992; 1993) - whitening Dan, Atick, and Reid (1996) - LGN whitens natural movies

23 Natural scene statistics and visual coding f A B C D E F Aa:.:i (Field 1987)

24 Whitening (or decorrelation) theory (Atick & Redlich, 1992) 1/f image whitening filter decorrelated image amplitude amplitude x = amplitude frequency frequency frequency

25

26

27 Robust Coding Doi & Lewicki (2006) - A Theory of Retinal Population Coding sensory noise channel noise ν δ optical blur encoder decoder s H x W r A ŝ image observation representation reconstruction (a) 20dB 10dB 0dB -10dB (b) (d) Magnitude 20dB -10dB (c) Spatial freq.

28 Robust Coding b Karklin & Simoncelli (2011) - Efficient coding of natural images with a population of noisy Linear-Nonlinear neurons a c 15 0 b 16 1 Figure 1: a. Schematic of the model (see text for description). The go transfer between images x and the neural response r, subjecttomet Information ON center about the stimulus isoff center conveyed both by the arrangement of the neural nonlinearities. Top: two neurons encode two stimulus c 16 an image, x 1 and x 2 )withlinearfilters(blacklines)whoseoutputisp functions (thick color lines; thin color lines show isoresponse con levels). The steepness of the nonlinearities specifies the precision represented: regions of steep slope correspond to finer partitioning o uncertaintyabout the input. Bottom: joint encoding leads to binning o the isoresponse lines above. Grayscale shading indicates the level of u of the input (lighter shades correspond to higher uncertainty). Efficien subject to input distribution, noise levels, and metabolic costs on the Parameter λ j specifies the trade-off between information gained by fi of generating them. It is difficult to obtain a biologically valid est ultimately, the value of sensory information gained depends ontheb [26]. Alternatively, we can use λ j as a Lagrange multiplier to enfor

29 Beyond efficient coding RR is appropriate when there is a bottleneck. But V1 expands dimensionality - many more neurons than inputs The real goal of sensory representation is to model the redundancy in images, not necessarily to reduce it (Barlow 2001) What we desire is a meaningful representation. RR provides a valid probabilistic model only when the world can be described in terms of statistically independent components. To understand cortical representation we must appeal to a different principle.

30 V1 is highly overcomplete LGN afferents IVb layer 4 cortex Barlow (1981) 0 1mm C I b and IV

31 Sparse, distributed representation a i I(x,y) Provides a way to group things together so that the world can be described in terms of a small number of events at any given moment. Converts higher-order redundancy in images into a simple form of redundancy.

32 Sparse vs. dense vs. grandmother cell codes Dense codes (ascii) Sparse, distributed codes Local codes (grandmother cells) High combinatorial capacity (2 N ) - Difficult to read out + Decent combinatorial capacity (~N K ) + Still easy to read out - Low combinatorial capacity (N) + Easy to read out

33 Gabor-filter response histogram

34 ... Sparse coding External world Internal model image model I(x,y) φ i (x,y) ai image neural features other activities stuff (sparse)

35 Learned dictionary Φ (critically sampled) 180 orientation bandwidth (degrees) radial bandwidth (octaves)

36 Effect of overcompleteness and hard sparsity (Rehn and Sommer 2006)

37 Learned dictionary 10x overcomplete (joint work with David Warland, UC Davis)

38 Sparsity 10% 5% 1% 27db 20db 8.2db 19db 13db 5.3db 13db 8.5db 5x 2.5x 180 orientation bandwidth (degrees) Over-completeness 10x 160 SNR: 3.2db radial bandwidth (octaves) 2.5

39 10x, 1%

40 Blob Ridge-like Grating

41 Sparsity 10% 5% 27db 20db 19db 13db 13db 8.5db blob 1% 8.2db grating ridge-like 5.3db 5x 2.5x 180 orientation bandwidth (degrees) Over-completeness 10x 160 SNR: 3.2db radial bandwidth (octaves) 2.5

42 Solutions are stable

43 Tiling properties: Blobs (highest spatial-frequency band only) y position x position

44 Tiling properties: Ridge-like orientation spatial position

45 Sparse coding of time-varying images " # #!$!&! % $ # '$(%(!)!&* t % $ +#$%%%!$!"!"! Speed vs. direction Speed vs. spatial frequency I(x, t) = i a i (t) φ i (x, t)+(x, t)! Speed (pixels/frame) Spatial frequency (cy/pixel)

46 Learned basis space-time basis functions (200 bfs, 12 x12 x 7)

47 Sparse coding and reconstruction 2 sparsified 0 amplitude convolution time (sec)

48 Extensions to color, disparity Wachtler, Lee and Sejnowski (2001), Hoyer & Hyvarinen (2000)

49 Nonlinear encoding Solutions may be computed by a network of leaky integrators and threshold units (Rozell et al. 2008) Explaining away Feedforward response ("#) Sparsified response ($#) g g g g g!!!!!!!!!!

50 Nonlinear encoding explaining away explains ncrf effects Lee et al. (2007), Zhu & Rozell (2010) end-stopping contrast-invariant tuning c =0.1 c =0.2 c =0.3 c =0.4 c =0.5 response 3 2 Response (a) target competing (a) Bar length (pixels) (b) (a) Orientation (deg) (b) surround suppression Iso surround Orth surround Uniform (a) Center Orth surround Iso surround Response Iso surround Orth surround Uniform Surround contrast (b) Center Orth surround Iso surround Response (a) (c) Center contrast (d)

51 Energy-based models Osindero, Welling and Hinton (2005) A 1.0x B ICA P (x) = 1 Z e E(x) E(x) = M i=1 α i log 1+ (J i x) Beta=5 Beta=2 Beta=1/ C 1.7x D 2.4x

Perception as an Inference Problem

Perception as an Inference Problem Perception as an Inference Problem Bruno A. Olshausen Helen Wills Neuroscience Institute, School of Optometry and Redwood Center for Theoretical Neuroscience UC Berkeley What are the principles governing

More information

Computational Perception. Visual Coding 3

Computational Perception. Visual Coding 3 Computational Perception 15-485/785 February 21, 2008 Visual Coding 3 A gap in the theory? - - + - - from Hubel, 1995 2 Eye anatomy from Hubel, 1995 Photoreceptors: rods (night vision) and cones (day vision)

More information

C. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun

C. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun Efficient Learning of Sparse Overcomplete Representations with an Energy-Based Model Marc'Aurelio Ranzato C. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun CIAR Summer School Toronto 2006 Why Extracting

More information

The Sparse Manifold Transform

The Sparse Manifold Transform The Sparse Manifold Transform Bruno Olshausen Redwood Center for Theoretical Neuroscience, Helen Wills Neuroscience Institute, and School of Optometry UC Berkeley with Yubei Chen (EECS) and Dylan Paiton

More information

Efficient Visual Coding: From Retina To V2

Efficient Visual Coding: From Retina To V2 Efficient Visual Coding: From Retina To V Honghao Shan Garrison Cottrell Computer Science and Engineering UCSD La Jolla, CA 9093-0404 shanhonghao@gmail.com, gary@ucsd.edu Abstract The human visual system

More information

SPARSE CODES FOR NATURAL IMAGES. Davide Scaramuzza Autonomous Systems Lab (EPFL)

SPARSE CODES FOR NATURAL IMAGES. Davide Scaramuzza Autonomous Systems Lab (EPFL) SPARSE CODES FOR NATURAL IMAGES Davide Scaramuzza (davide.scaramuzza@epfl.ch) Autonomous Systems Lab (EPFL) Final rapport of Wavelet Course MINI-PROJECT (Prof. Martin Vetterli) ABSTRACT The human visual

More information

Recursive ICA. Abstract

Recursive ICA. Abstract Recursive ICA Honghao Shan, Lingyun Zhang, Garrison W. Cottrell Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 92093-0404 {hshan,lingyun,gary}@cs.ucsd.edu

More information

A Hierarchial Model for Visual Perception

A Hierarchial Model for Visual Perception A Hierarchial Model for Visual Perception Bolei Zhou 1 and Liqing Zhang 2 1 MOE-Microsoft Laboratory for Intelligent Computing and Intelligent Systems, and Department of Biomedical Engineering, Shanghai

More information

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science V1-inspired orientation selective filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this

More information

ICA mixture models for image processing

ICA mixture models for image processing I999 6th Joint Sy~nposiurn orz Neural Computation Proceedings ICA mixture models for image processing Te-Won Lee Michael S. Lewicki The Salk Institute, CNL Carnegie Mellon University, CS & CNBC 10010 N.

More information

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution Detecting Salient Contours Using Orientation Energy Distribution The Problem: How Does the Visual System Detect Salient Contours? CPSC 636 Slide12, Spring 212 Yoonsuck Choe Co-work with S. Sarma and H.-C.

More information

Wavelets, vision and the statistics of natural scenes

Wavelets, vision and the statistics of natural scenes Wavelets, vision and the statistics of natural scenes By D. J. Field Uris Hall, Cornell University, Ithaca, NY 14853, USA The processing of spatial information by the visual system shows a number of similarities

More information

Modern Signal Processing and Sparse Coding

Modern Signal Processing and Sparse Coding Modern Signal Processing and Sparse Coding School of Electrical and Computer Engineering Georgia Institute of Technology March 22 2011 Reason D etre Modern signal processing Signals without cosines? Sparse

More information

Natural image statistics and efficient coding

Natural image statistics and efficient coding Network: Computation in Neural Systems 7 (1996) 333 339. Printed in the UK WORKSHOP PAPER Natural image statistics and efficient coding B A Olshausen and D J Field Department of Psychology, Uris Hall,

More information

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2D Gabor functions and filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this presentation

More information

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2D Gabor functions and filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this presentation

More information

Learning Transformational Invariants from Natural Movies

Learning Transformational Invariants from Natural Movies Learning Transformational Invariants from Natural Movies Charles F. Cadieu & Bruno A. Olshausen Helen Wills Neuroscience Institute University of California, Berkeley Berkeley, CA 94720 {cadieu, baolshausen}@berkeley.edu

More information

Introduction to visual computation and the primate visual system

Introduction to visual computation and the primate visual system Introduction to visual computation and the primate visual system Problems in vision Basic facts about the visual system Mathematical models for early vision Marr s computational philosophy and proposal

More information

Modeling Visual Cortex V4 in Naturalistic Conditions with Invari. Representations

Modeling Visual Cortex V4 in Naturalistic Conditions with Invari. Representations Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant and Sparse Image Representations Bin Yu Departments of Statistics and EECS University of California at Berkeley Rutgers University, May

More information

Bilinear Models of Natural Images

Bilinear Models of Natural Images In: SPIE Proceedings vol. 6492: Human Vision and Electronic Imaging XII, (B.E. Rogowitz, T.N. Pappas, S.J. Daly, Eds.), Jan 28-Feb 1, 2007, San Jose, California Bilinear Models of Natural Images Bruno

More information

Computational Models of V1 cells. Gabor and CORF

Computational Models of V1 cells. Gabor and CORF 1 Computational Models of V1 cells Gabor and CORF George Azzopardi Nicolai Petkov 2 Primary visual cortex (striate cortex or V1) 3 Types of V1 Cells Hubel and Wiesel, Nobel Prize winners Three main types

More information

A Computational Approach To Understanding The Response Properties Of Cells In The Visual System

A Computational Approach To Understanding The Response Properties Of Cells In The Visual System A Computational Approach To Understanding The Response Properties Of Cells In The Visual System David C. Noelle Assistant Professor of Computer Science and Psychology Vanderbilt University March 3, 2004

More information

Conjectures regarding the nonlinear geometry of visual neurons

Conjectures regarding the nonlinear geometry of visual neurons Conjectures regarding the nonlinear geometry of visual neurons James R. Golden a, Kedarnath P. Vilankar a, Michael C.K. Wu b,c, David J. Field a a Department of Psychology, Cornell University, Ithaca,

More information

Independent component lters of natural images compared with simple cells in primary visual cortex

Independent component lters of natural images compared with simple cells in primary visual cortex Independent component lters of natural images compared with simple cells in primary visual cortex J. H. van Hateren * and A. van der Schaaf Department of Biophysics, University of Groningen, Nijenborgh

More information

The "tree-dependent components" of natural scenes are edge filters

The tree-dependent components of natural scenes are edge filters The "tree-dependent components" of natural scenes are edge filters Daniel Zoran Interdisciplinary Center for Neural Computation Hebrew University of Jerusalem daniez@cs.huji.ac.il Yair Weiss School of

More information

Estimating Markov Random Field Potentials for Natural Images

Estimating Markov Random Field Potentials for Natural Images Estimating Markov Random Field Potentials for Natural Images Urs Köster 1,2, Jussi T. Lindgren 1,2 and Aapo Hyvärinen 1,2,3 1 Department of Computer Science, University of Helsinki, Finland 2 Helsinki

More information

Multiview Feature Learning

Multiview Feature Learning Multiview Feature Learning Roland Memisevic Frankfurt, Montreal Tutorial at IPAM 2012 Roland Memisevic (Frankfurt, Montreal) Multiview Feature Learning Tutorial at IPAM 2012 1 / 163 Outline 1 Introduction

More information

Efficient Hardware Architecture for Sparse Coding

Efficient Hardware Architecture for Sparse Coding IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 62, NO. 16, AUGUST 15, 2014 4173 Efficient Hardware Architecture for Sparse Coding Jung Kuk Kim, Student Member, IEEE, Phil Knag, Student Member, IEEE, Thomas

More information

Learning Feature Hierarchies for Object Recognition

Learning Feature Hierarchies for Object Recognition Learning Feature Hierarchies for Object Recognition Koray Kavukcuoglu Computer Science Department Courant Institute of Mathematical Sciences New York University Marc Aurelio Ranzato, Kevin Jarrett, Pierre

More information

Nonparametric sparse hierarchical models describe V1 fmri responses to natural images

Nonparametric sparse hierarchical models describe V1 fmri responses to natural images Nonparametric sparse hierarchical models describe V1 fmri responses to natural images Pradeep Ravikumar, Vincent Q. Vu and Bin Yu Department of Statistics University of California, Berkeley Berkeley, CA

More information

Bio-inspired Binocular Disparity with Position-Shift Receptive Field

Bio-inspired Binocular Disparity with Position-Shift Receptive Field Bio-inspired Binocular Disparity with Position-Shift Receptive Field Fernanda da C. e C. Faria, Jorge Batista and Helder Araújo Institute of Systems and Robotics, Department of Electrical Engineering and

More information

What Are the Invariant Occlusive Components of Image Patches? A Probabilistic Generative Approach

What Are the Invariant Occlusive Components of Image Patches? A Probabilistic Generative Approach What Are the Invariant Occlusive Components of Image Patches? A Probabilistic Generative Approach Zhenwen Dai University of Sheffield, UK, and FIAS, Goethe-University Frankfurt, Germany z.dai@sheffield.ac.uk

More information

Structural Similarity Sparse Coding

Structural Similarity Sparse Coding Appl. Math. Inf. Sci. 8, No. 1L, 363-369 (2014) 363 Applied Mathematics & Information Sciences An International Journal http://dx.doi.org/10.12785/amis/081l46 Structural Similarity Sparse Coding Zhiqing

More information

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS Cognitive Robotics Original: David G. Lowe, 004 Summary: Coen van Leeuwen, s1460919 Abstract: This article presents a method to extract

More information

Human Vision Based Object Recognition Sye-Min Christina Chan

Human Vision Based Object Recognition Sye-Min Christina Chan Human Vision Based Object Recognition Sye-Min Christina Chan Abstract Serre, Wolf, and Poggio introduced an object recognition algorithm that simulates image processing in visual cortex and claimed to

More information

Independent component analysis of natural image sequences yields spatio-temporal lters similar to simple cells in primary visual cortex

Independent component analysis of natural image sequences yields spatio-temporal lters similar to simple cells in primary visual cortex Independent component analysis of natural image sequences yields spatio-temporal lters similar to simple cells in primary visual cortex J. H. van Hateren 1* and D. L. Ruderman 2 1 Department of Neurobiophysics,

More information

Improvement of optic flow estimation from an event-based sensor with recurrently connected processing layers

Improvement of optic flow estimation from an event-based sensor with recurrently connected processing layers Improvement of optic flow estimation from an event-based sensor with recurrently connected processing layers Vilim Štih 31st March 2014 1 Introduction The Dynamic Vision Sensor (DVS) is a visual sensor

More information

Learning Spatio-temporally Invariant Representations from Video

Learning Spatio-temporally Invariant Representations from Video Learning Spatio-temporally Invariant Representations from Video Jae Hyun Lim, Hansol Choi, Jun-Cheol Park, Jae Young Jun and Dae-shik Kim Department of Electrical Engineering Korea Advanced Institute of

More information

Learning Intermediate-Level Representations of Form and Motion from Natural Movies

Learning Intermediate-Level Representations of Form and Motion from Natural Movies 1 Learning Intermediate-Level Representations of Form and Motion from Natural Movies Charles F. Cadieu and Bruno A. Olshausen Redwood Center for Theoretical Neuroscience, Helen Wills Neuroscience Institute,

More information

Simple Method for High-Performance Digit Recognition Based on Sparse Coding

Simple Method for High-Performance Digit Recognition Based on Sparse Coding 1 Simple Method for High-Performance Digit Recognition Based on Sparse Coding Kai Labusch Institute for Neuro- and Bioinformatics University of Lübeck D-23538 Lübeck labusch@inb.uni-luebeck.de Erhardt

More information

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1 Last update: May 4, 200 Vision CMSC 42: Chapter 24 CMSC 42: Chapter 24 Outline Perception generally Image formation Early vision 2D D Object recognition CMSC 42: Chapter 24 2 Perception generally Stimulus

More information

Exercise: Simulating spatial processing in the visual pathway with convolution

Exercise: Simulating spatial processing in the visual pathway with convolution Exercise: Simulating spatial processing in the visual pathway with convolution This problem uses convolution to simulate the spatial filtering performed by neurons in the early stages of the visual pathway,

More information

Department of Psychology, Uris Hall, Cornell University, Ithaca, New York

Department of Psychology, Uris Hall, Cornell University, Ithaca, New York Natural image statistics and ecient coding B A Olshausen and D J Field Department of Psychology, Uris Hall, Cornell University, Ithaca, New York 14853. Email: bao1@cornell.edu, djf3@cornell.edu To appear

More information

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction Preprocessing The goal of pre-processing is to try to reduce unwanted variation in image due to lighting,

More information

OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES

OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES Image Processing & Communication, vol. 18,no. 1, pp.31-36 DOI: 10.2478/v10248-012-0088-x 31 OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES RAFAŁ KOZIK ADAM MARCHEWKA Institute of Telecommunications, University

More information

Self-Organizing Sparse Codes

Self-Organizing Sparse Codes Self-Organizing Sparse Codes Yangqing Jia Sergey Karayev 2010-12-16 Abstract Sparse coding as applied to natural image patches learns Gabor-like components that resemble those found in the lower areas

More information

MultiVariate Bayesian (MVB) decoding of brain images

MultiVariate Bayesian (MVB) decoding of brain images MultiVariate Bayesian (MVB) decoding of brain images Alexa Morcom Edinburgh SPM course 2015 With thanks to J. Daunizeau, K. Brodersen for slides stimulus behaviour encoding of sensorial or cognitive state?

More information

Emergence of Topography and Complex Cell Properties from Natural Images using Extensions of ICA

Emergence of Topography and Complex Cell Properties from Natural Images using Extensions of ICA Emergence of Topography and Complex Cell Properties from Natural Images using Extensions of ICA Aapo Hyviirinen and Patrik Hoyer Neural Networks Research Center Helsinki University of Technology P.O. Box

More information

Edge Detection (with a sidelight introduction to linear, associative operators). Images

Edge Detection (with a sidelight introduction to linear, associative operators). Images Images (we will, eventually, come back to imaging geometry. But, now that we know how images come from the world, we will examine operations on images). Edge Detection (with a sidelight introduction to

More information

Independent component analysis applied to feature extraction from colour and stereo images

Independent component analysis applied to feature extraction from colour and stereo images Network: Comput. Neural Syst. 11 (2000) 191 210. Printed in the UK PII: S0954-898X(00)14814-6 Independent component analysis applied to feature extraction from colour and stereo images Patrik O Hoyer and

More information

Vision: From Eye to Brain (Chap 3)

Vision: From Eye to Brain (Chap 3) Vision: From Eye to Brain (Chap 3) Lecture 7 Jonathan Pillow Sensation & Perception (PSY 345 / NEU 325) Princeton University, Fall 2017 early visual pathway eye eye optic nerve optic chiasm optic tract

More information

Laser sensors. Transmitter. Receiver. Basilio Bona ROBOTICA 03CFIOR

Laser sensors. Transmitter. Receiver. Basilio Bona ROBOTICA 03CFIOR Mobile & Service Robotics Sensors for Robotics 3 Laser sensors Rays are transmitted and received coaxially The target is illuminated by collimated rays The receiver measures the time of flight (back and

More information

A Developing Sensory Mapping for Robots

A Developing Sensory Mapping for Robots A Developing Sensory Mapping for Robots Nan Zhang and John Weng Zhengyou Zhang Dept. of Computer Sci. and Engr. Microsoft Research Michigan State University Microsoft Corp. East Lansing, MI 48823 Redmond,

More information

Learning-based Methods in Vision

Learning-based Methods in Vision Learning-based Methods in Vision 16-824 Sparsity and Deep Learning Motivation Multitude of hand-designed features currently in use in vision - SIFT, HoG, LBP, MSER, etc. Even the best approaches, just

More information

Reliable disparity estimation through selective integration

Reliable disparity estimation through selective integration Visual Neuroscience (1998), 15, 511 528. Printed in the USA. Copyright 1998 Cambridge University Press 0952-5238098 $12.50 Reliable disparity estimation through selective integration MICHAEL S. GRAY, 1,2

More information

2 Feature Extraction and Signal Reconstruction. of Air and Bone Conduction Voices by Independent Component Analysis

2 Feature Extraction and Signal Reconstruction. of Air and Bone Conduction Voices by Independent Component Analysis IMECS 28, 9-2 March, 28, Hong Kong Feature Extraction and Signal Reconstruction of Air and Bone Conduction Voices by Independent Component Analysis Tadahiro Azetsu, Eiji Uchino, Ryosuke Kubota, and Noriaki

More information

3D Object Recognition: A Model of View-Tuned Neurons

3D Object Recognition: A Model of View-Tuned Neurons 3D Object Recognition: A Model of View-Tuned Neurons Emanuela Bricolo Tomaso Poggio Department of Brain and Cognitive Sciences Massachusetts Institute of Technology Cambridge, MA 02139 {emanuela,tp}gai.mit.edu

More information

CAP 5415 Computer Vision Fall 2012

CAP 5415 Computer Vision Fall 2012 CAP 5415 Computer Vision Fall 01 Dr. Mubarak Shah Univ. of Central Florida Office 47-F HEC Lecture-5 SIFT: David Lowe, UBC SIFT - Key Point Extraction Stands for scale invariant feature transform Patented

More information

An algorithm for image representation as independent levels of resolution

An algorithm for image representation as independent levels of resolution An algorithm for image representation as independent levels of resolution Antonio Turiel 1, Jean-Pierre Nadal 2, and Néstor Parga 3 1 Air Project - INRIA. Domaine de Voluceau BP105 78153 Le Chesnay CEDEX,

More information

Digital Image Processing COSC 6380/4393

Digital Image Processing COSC 6380/4393 Digital Image Processing COSC 6380/4393 Lecture 21 Nov 16 th, 2017 Pranav Mantini Ack: Shah. M Image Processing Geometric Transformation Point Operations Filtering (spatial, Frequency) Input Restoration/

More information

The SIFT (Scale Invariant Feature

The SIFT (Scale Invariant Feature The SIFT (Scale Invariant Feature Transform) Detector and Descriptor developed by David Lowe University of British Columbia Initial paper ICCV 1999 Newer journal paper IJCV 2004 Review: Matt Brown s Canonical

More information

What Visual Discrimination of Fractal Textures Can Tell Us About Discrimination of Camouflaged Targets

What Visual Discrimination of Fractal Textures Can Tell Us About Discrimination of Camouflaged Targets What Visual Discrimination of Fractal Textures Can Tell Us About Discrimination of Camouflaged Targets Vincent A. Billock Douglas W. Cunningham Brian H. Tsou Combat ID Discrimination Tasks Discriminate

More information

Robust contour extraction and junction detection by a neural model utilizing recurrent long-range interactions

Robust contour extraction and junction detection by a neural model utilizing recurrent long-range interactions Robust contour extraction and junction detection by a neural model utilizing recurrent long-range interactions Thorsten Hansen 1, 2 & Heiko Neumann 2 1 Abteilung Allgemeine Psychologie, Justus-Liebig-Universität

More information

EMERGENCE OF SIMPLE-CELL RECEPTIVE PROPERTIES BY LEARNING A SPARSE CODE FOR NATURAL IMAGES

EMERGENCE OF SIMPLE-CELL RECEPTIVE PROPERTIES BY LEARNING A SPARSE CODE FOR NATURAL IMAGES EMERGENCE OF SIMPLE-CELL RECEPTIVE PROPERTIES BY LEARNING A SPARSE CODE FOR NATURAL IMAGES Bruno A. Olshausen & David J. Field Presenter: Ozgur Yigit Balkan Outline Linear data representations Sparse Vector

More information

Modeling image patches with a directed hierarchy of Markov random fields

Modeling image patches with a directed hierarchy of Markov random fields Modeling image patches with a directed hierarchy of Markov random fields Simon Osindero and Geoffrey Hinton Department of Computer Science, University of Toronto 6, King s College Road, M5S 3G4, Canada

More information

A Model for Encoding Multiple Object Motions and Self-Motion in Area MST of Primate Visual Cortex

A Model for Encoding Multiple Object Motions and Self-Motion in Area MST of Primate Visual Cortex The Journal of Neuroscience, January 1, 1998, 18(1):531 547 A Model for Encoding Multiple Object Motions and Self-Motion in Area MST of Primate Visual Cortex Richard S. Zemel 1 and Terrence J. Sejnowski

More information

Sparse Models in Image Understanding And Computer Vision

Sparse Models in Image Understanding And Computer Vision Sparse Models in Image Understanding And Computer Vision Jayaraman J. Thiagarajan Arizona State University Collaborators Prof. Andreas Spanias Karthikeyan Natesan Ramamurthy Sparsity Sparsity of a vector

More information

Neurophysical Model by Barten and Its Development

Neurophysical Model by Barten and Its Development Chapter 14 Neurophysical Model by Barten and Its Development According to the Barten model, the perceived foveal image is corrupted by internal noise caused by statistical fluctuations, both in the number

More information

EECS 556 Image Processing W 09. Image enhancement. Smoothing and noise removal Sharpening filters

EECS 556 Image Processing W 09. Image enhancement. Smoothing and noise removal Sharpening filters EECS 556 Image Processing W 09 Image enhancement Smoothing and noise removal Sharpening filters What is image processing? Image processing is the application of 2D signal processing methods to images Image

More information

Why equivariance is better than premature invariance

Why equivariance is better than premature invariance 1 Why equivariance is better than premature invariance Geoffrey Hinton Canadian Institute for Advanced Research & Department of Computer Science University of Toronto with contributions from Sida Wang

More information

Modeling the top neuron layers of the human retina. Laurens Koppenol ( ) Universiteit Utrecht

Modeling the top neuron layers of the human retina. Laurens Koppenol ( ) Universiteit Utrecht Modeling the top neuron layers of the human retina Laurens Koppenol (3245578) 23-06-2013 Universiteit Utrecht 1 Contents Introduction... 3 Background... 4 Photoreceptors... 4 Scotopic vision... 5 Photopic

More information

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT Oct. 15, 2013 Prof. Ronald Fearing Electrical Engineering and Computer Sciences University of California, Berkeley (slides courtesy of Prof. John Wawrzynek)

More information

Slide07 Haykin Chapter 9: Self-Organizing Maps

Slide07 Haykin Chapter 9: Self-Organizing Maps Slide07 Haykin Chapter 9: Self-Organizing Maps CPSC 636-600 Instructor: Yoonsuck Choe Spring 2012 Introduction Self-organizing maps (SOM) is based on competitive learning, where output neurons compete

More information

doi: /j.neucom

doi: /j.neucom doi: 10.1016/j.neucom.2014.08.019 1 2 3 4 5 6 7 Neural Construction of 3D Medial Axis from the Binocular Fusion of 2D MAs Wei Qiu, Yasuhiro Hatori, and Ko Sakai * Department of Computer Science, University

More information

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,

More information

Templates, Image Pyramids, and Filter Banks

Templates, Image Pyramids, and Filter Banks Templates, Image Pyramids, and Filter Banks Computer Vision James Hays, Brown Slides: Hoiem and others Reminder Project due Friday Fourier Bases Teases away fast vs. slow changes in the image. This change

More information

Computational Vision U. Minn. Psy 5036 Daniel Kersten Contrast normalization, 2nd order histogram analysis of filter outputs

Computational Vision U. Minn. Psy 5036 Daniel Kersten Contrast normalization, 2nd order histogram analysis of filter outputs Computational Vision U. Minn. Psy 5036 Daniel Kersten Contrast normalization, nd order histogram analysis of filter outputs Initialize Spell check off In[537]:= SetOptions@ArrayPlot, ColorFunction Ø "GrayTones",

More information

Topographic Mapping with fmri

Topographic Mapping with fmri Topographic Mapping with fmri Retinotopy in visual cortex Tonotopy in auditory cortex signal processing + neuroimaging = beauty! Topographic Mapping with fmri Retinotopy in visual cortex Tonotopy in auditory

More information

Image pyramids and their applications Bill Freeman and Fredo Durand Feb. 28, 2006

Image pyramids and their applications Bill Freeman and Fredo Durand Feb. 28, 2006 Image pyramids and their applications 6.882 Bill Freeman and Fredo Durand Feb. 28, 2006 Image pyramids Gaussian Laplacian Wavelet/QMF Steerable pyramid http://www-bcs.mit.edu/people/adelson/pub_pdfs/pyramid83.pdf

More information

CS 4495 Computer Vision. Linear Filtering 2: Templates, Edges. Aaron Bobick. School of Interactive Computing. Templates/Edges

CS 4495 Computer Vision. Linear Filtering 2: Templates, Edges. Aaron Bobick. School of Interactive Computing. Templates/Edges CS 4495 Computer Vision Linear Filtering 2: Templates, Edges Aaron Bobick School of Interactive Computing Last time: Convolution Convolution: Flip the filter in both dimensions (right to left, bottom to

More information

Perception. Autonomous Mobile Robots. Sensors Vision Uncertainties, Line extraction from laser scans. Autonomous Systems Lab. Zürich.

Perception. Autonomous Mobile Robots. Sensors Vision Uncertainties, Line extraction from laser scans. Autonomous Systems Lab. Zürich. Autonomous Mobile Robots Localization "Position" Global Map Cognition Environment Model Local Map Path Perception Real World Environment Motion Control Perception Sensors Vision Uncertainties, Line extraction

More information

CS4733 Class Notes, Computer Vision

CS4733 Class Notes, Computer Vision CS4733 Class Notes, Computer Vision Sources for online computer vision tutorials and demos - http://www.dai.ed.ac.uk/hipr and Computer Vision resources online - http://www.dai.ed.ac.uk/cvonline Vision

More information

Edge and Texture. CS 554 Computer Vision Pinar Duygulu Bilkent University

Edge and Texture. CS 554 Computer Vision Pinar Duygulu Bilkent University Edge and Texture CS 554 Computer Vision Pinar Duygulu Bilkent University Filters for features Previously, thinking of filtering as a way to remove or reduce noise Now, consider how filters will allow us

More information

Basic distinctions. Definitions. Epstein (1965) familiar size experiment. Distance, depth, and 3D shape cues. Distance, depth, and 3D shape cues

Basic distinctions. Definitions. Epstein (1965) familiar size experiment. Distance, depth, and 3D shape cues. Distance, depth, and 3D shape cues Distance, depth, and 3D shape cues Pictorial depth cues: familiar size, relative size, brightness, occlusion, shading and shadows, aerial/ atmospheric perspective, linear perspective, height within image,

More information

Coordinate Transformations in Parietal Cortex. Computational Models of Neural Systems Lecture 7.1. David S. Touretzky November, 2013

Coordinate Transformations in Parietal Cortex. Computational Models of Neural Systems Lecture 7.1. David S. Touretzky November, 2013 Coordinate Transformations in Parietal Cortex Lecture 7.1 David S. Touretzky November, 2013 Outline Anderson: parietal cells represent locations of visual stimuli. Zipser and Anderson: a backprop network

More information

Optimal disparity estimation in natural stereo- images (Supplement) Johannes Burge & Wilson S. Geisler

Optimal disparity estimation in natural stereo- images (Supplement) Johannes Burge & Wilson S. Geisler Optimal disparity estimation in natural stereo- images (Supplement) Johannes Burge & Wilson S. Geisler Computing the log likelihood with populations of simple and complex cells Simple cells in visual cortex

More information

Comparison of Feature Detection and Matching Approaches: SIFT and SURF

Comparison of Feature Detection and Matching Approaches: SIFT and SURF GRD Journals- Global Research and Development Journal for Engineering Volume 2 Issue 4 March 2017 ISSN: 2455-5703 Comparison of Detection and Matching Approaches: SIFT and SURF Darshana Mistry PhD student

More information

Measurements and Bits: Compressed Sensing meets Information Theory. Dror Baron ECE Department Rice University dsp.rice.edu/cs

Measurements and Bits: Compressed Sensing meets Information Theory. Dror Baron ECE Department Rice University dsp.rice.edu/cs Measurements and Bits: Compressed Sensing meets Information Theory Dror Baron ECE Department Rice University dsp.rice.edu/cs Sensing by Sampling Sample data at Nyquist rate Compress data using model (e.g.,

More information

Features that draw visual attention: an information theoretic perspective

Features that draw visual attention: an information theoretic perspective Neurocomputing 65 66 (2005) 125 133 www.elsevier.com/locate/neucom Features that draw visual attention: an information theoretic perspective Neil D.B. Bruce a,b, a Department of Computer Science, York

More information

Does the Brain do Inverse Graphics?

Does the Brain do Inverse Graphics? Does the Brain do Inverse Graphics? Geoffrey Hinton, Alex Krizhevsky, Navdeep Jaitly, Tijmen Tieleman & Yichuan Tang Department of Computer Science University of Toronto How to learn many layers of features

More information

Project proposal: Statistical models of visual neurons

Project proposal: Statistical models of visual neurons Project proposal: Statistical models of visual neurons Anna Sotnikova asotniko@math.umd.edu Project Advisor: Prof. Daniel A. Butts dab@umd.edu Department of Biology Abstract Studying visual neurons may

More information

Unsupervised Discovery of Non-Linear Structure using Contrastive Backpropagation

Unsupervised Discovery of Non-Linear Structure using Contrastive Backpropagation Unsupervised Discovery of Non-Linear Structure using Contrastive Backpropagation G. E. Hinton, S. Osindero, M. Welling and Y. W. Teh Department of Computer Science University of Toronto Toronto, Canada

More information

Probabilistic Tracking and Reconstruction of 3D Human Motion in Monocular Video Sequences

Probabilistic Tracking and Reconstruction of 3D Human Motion in Monocular Video Sequences Probabilistic Tracking and Reconstruction of 3D Human Motion in Monocular Video Sequences Presentation of the thesis work of: Hedvig Sidenbladh, KTH Thesis opponent: Prof. Bill Freeman, MIT Thesis supervisors

More information

Supplementary Figure 1. Decoding results broken down for different ROIs

Supplementary Figure 1. Decoding results broken down for different ROIs Supplementary Figure 1 Decoding results broken down for different ROIs Decoding results for areas V1, V2, V3, and V1 V3 combined. (a) Decoded and presented orientations are strongly correlated in areas

More information

Coupling of Evolution and Learning to Optimize a Hierarchical Object Recognition Model

Coupling of Evolution and Learning to Optimize a Hierarchical Object Recognition Model Coupling of Evolution and Learning to Optimize a Hierarchical Object Recognition Model Georg Schneider, Heiko Wersing, Bernhard Sendhoff, and Edgar Körner Honda Research Institute Europe GmbH Carl-Legien-Strasse

More information

Efficient computation and cue integration with noisy population codes

Efficient computation and cue integration with noisy population codes articles 21 Nature Publishing Group http://neurosci.nature.com Efficient computation and cue integration with noisy population codes 21 Nature Publishing Group http://neurosci.nature.com S. Deneve 1, P.

More information

CSC 411 Lecture 18: Matrix Factorizations

CSC 411 Lecture 18: Matrix Factorizations CSC 411 Lecture 18: Matrix Factorizations Roger Grosse, Amir-massoud Farahmand, and Juan Carrasquilla University of Toronto UofT CSC 411: 18-Matrix Factorizations 1 / 27 Overview Recall PCA: project data

More information

Filtering Images. Contents

Filtering Images. Contents Image Processing and Data Visualization with MATLAB Filtering Images Hansrudi Noser June 8-9, 010 UZH, Multimedia and Robotics Summer School Noise Smoothing Filters Sigmoid Filters Gradient Filters Contents

More information

Filter Select ion Model for Generating Visual Mot ion Signals

Filter Select ion Model for Generating Visual Mot ion Signals In: C. L. Giles, S. J. Hanson, and J. D. Cowan (Eds.) Advances in Neural Infirmation Processing Systems 5, Morgan Kaufman Publishers, San Mateo, CA, (1993). Filter Select ion Model for Generating Visual

More information

Sequential Maximum Entropy Coding as Efficient Indexing for Rapid Navigation through Large Image Repositories

Sequential Maximum Entropy Coding as Efficient Indexing for Rapid Navigation through Large Image Repositories Sequential Maximum Entropy Coding as Efficient Indexing for Rapid Navigation through Large Image Repositories Guoping Qiu, Jeremy Morris and Xunli Fan School of Computer Science, The University of Nottingham

More information