Perception as an Inference Problem

Size: px
Start display at page:

Download "Perception as an Inference Problem"

Transcription

1 Perception as an Inference Problem Bruno A. Olshausen Helen Wills Neuroscience Institute, School of Optometry and Redwood Center for Theoretical Neuroscience UC Berkeley

2 What are the principles governing information processing in this system?

3 Wallisch & Movshon (2008) Gabor filters..?.. objects. faces

4 Two views of visual system function Deduction - feature extraction, classification - (Hubel & Wiesel; Fukushima; deep learning ) Inference - generative models, recurrent computation - (Helmholtz; Nakayama; Kersten & Yuille; Geman; Lee & Mumford)

5 Hubel & Wiesel (1962, 1965) Hypercomplex Complex 20. The hypothetical illustrated has a complex Simple

6 output (y) y = f(x; w) input data (x)

7 Is this the goal of vision?

8 Visual Navigation in Box Jellyfish 799 jumping spider sand wasp Figure 1. Rhopali of the Upper Lens box jellyfish (A and B) In freely s lia maintain a cons the medusa chan heavy crystal (sta rhopalium causes such that the oriented. Thus, th straight upward body orientation. ated on the far sid eyes directed to t (C) Modeling the peripheral photore angular sensitivity ceptors are supe cording to the co

9 .... state (s) sensory data (x) actuator movement (a) ṡ + s = g(s, x, a; w) a = f(s)

10 Vision as inference lens World Image Model

11

12 Separation of shape and reflectance reflectance shading (Adelson, 2000)

13 Possible neural circuits for inferential computation in V1 1. Sparse coding 2. Separating form and motion from time-varying images

14 ... Sparse coding External world Internal model image model (Olshausen & Field, 1996; Chen, Donoho & Saunders 1995) I(x,y) φ i (x,y) ai MX I( x) = a i i ( x)+ ( x) i=1 image neural features other activities stuff (sparse)

15 Energy function preserve information be sparse

16 Energy function -log P(I a) P(a) preserve information be sparse

17 Coefficients a i may be computed via thresholding and lateral inhibition ( LCA - Rozell, Johnson, Baraniuk & Olshausen, 2008) g g g g g b i = X x i(x) I(x) G ij = X x i(x) j (x)

18 1.25x 2.5x 5x 10x

19 Two examples 1. Sparse coding 2. Separating form and motion from time-varying images

20 Visual perception requires separation of form and motion from time-varying retinal images (eye movement data from Austin Roorda, UC Berkeley)

21 Simple averaging is not sufficient

22 The problem I(~x, t) =S(~x ~x(t)) + (~x, t) ˆ~x(t) = arg min ~x(t) I(~x, t) S(~x ~x(t)) 2 Z Ŝ(~x) = I(~x + ~x(t)) dt

23 Traditional models compute motion and form independently motion energy and pooling optic flow time-varying image feature extraction and pooling invariant pattern recognition

24 Traditional models compute motion and form independently motion energy and pooling optic flow time-varying image feature extraction and pooling invariant pattern recognition

25 Motion and form must be estimated simultaneously ) time-varying image estimate motion optic flow regularization (smoothness) ) time-varying image estimate motion estimate pattern form motion pattern form natural scene statistics prior natural scene statistics prior

26 Graphical model for separating form and motion (Alex Anderson, Ph.D. thesis) X 0 X 1 X 2 Eye position R 0 R 1 R 2 Spikes (from LGN afferents) S Pattern Ŝ = arg max S log P (R S)

27 Given current estimate of position (X), update S Retina Internal Position Estimate (X) Internal Form Estimate (S)

28 Given current estimate of form (S), update X P(X t R 0:t ) R t+1 S = S t P(X t+1 R 0:t ) P(R t+1 X t+1,s = S t ) P(X t+1 R 0:t+1 )

29 Joint estimation of form (S) and position (X)

30 Including a prior over form (S) X 0 X 1 X 2 Eye position R 0 R 1 R 2 Spikes (from LGN afferents) S A D Pattern Dictionary Sparse representation  = arg max A log P (R A) + log P (A) sparse

31 Learned dictionary D

32 Prior over form (S) improves inference

33 Form prior improves inference

34 Main points Perception seems better described as an inference problem that attempts to disentangle underlying causes from image data. Inference involves bidirectional information flow both within and between levels of representation. This moves us away from thinking of receptive fields and instead toward how populations of neurons interact to perform collective computations.

35 Papers Olshausen BA (2014) Perception as an Inference Problem. In: The Cognitive Neurosciences V. M. Gazzaniga, R. Mangun, Eds. MIT Press. Rozell CJ, Johnson DH, Baraniuk RG, Olshausen BA (2008). Sparse Coding via Thresholding and Local Competition in Neural Circuits. Neural Computation, 20, Olshausen BA (2013) Highly overcomplete sparse coding. In: SPIE Proceedings vol. 8651: Human Vision and Electronic Imaging XVIII, (B.E. Rogowitz, T.N. Pappas, H. de Ridder, Eds.), Feb. 4-7, 2013, San Francisco, California.

The Sparse Manifold Transform

The Sparse Manifold Transform The Sparse Manifold Transform Bruno Olshausen Redwood Center for Theoretical Neuroscience, Helen Wills Neuroscience Institute, and School of Optometry UC Berkeley with Yubei Chen (EECS) and Dylan Paiton

More information

From natural scene statistics to models of neural coding and representation (part 1)

From natural scene statistics to models of neural coding and representation (part 1) From natural scene statistics to models of neural coding and representation (part 1) Bruno A. Olshausen Helen Wills Neuroscience Institute, School of Optometry and Redwood Center for Theoretical Neuroscience

More information

Learning representations for active vision

Learning representations for active vision Learning representations for active vision Bruno Olshausen Redwood Center for Theoretical Neuroscience, Helen Wills Neuroscience Institute, and School of Optometry UC Berkeley Brian Cheung Alex Anderson

More information

Introduction to visual computation and the primate visual system

Introduction to visual computation and the primate visual system Introduction to visual computation and the primate visual system Problems in vision Basic facts about the visual system Mathematical models for early vision Marr s computational philosophy and proposal

More information

Modern Signal Processing and Sparse Coding

Modern Signal Processing and Sparse Coding Modern Signal Processing and Sparse Coding School of Electrical and Computer Engineering Georgia Institute of Technology March 22 2011 Reason D etre Modern signal processing Signals without cosines? Sparse

More information

C. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun

C. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun Efficient Learning of Sparse Overcomplete Representations with an Energy-Based Model Marc'Aurelio Ranzato C. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun CIAR Summer School Toronto 2006 Why Extracting

More information

Bilinear Models of Natural Images

Bilinear Models of Natural Images In: SPIE Proceedings vol. 6492: Human Vision and Electronic Imaging XII, (B.E. Rogowitz, T.N. Pappas, S.J. Daly, Eds.), Jan 28-Feb 1, 2007, San Jose, California Bilinear Models of Natural Images Bruno

More information

Natural image statistics and efficient coding

Natural image statistics and efficient coding Network: Computation in Neural Systems 7 (1996) 333 339. Printed in the UK WORKSHOP PAPER Natural image statistics and efficient coding B A Olshausen and D J Field Department of Psychology, Uris Hall,

More information

Bio-inspired Binocular Disparity with Position-Shift Receptive Field

Bio-inspired Binocular Disparity with Position-Shift Receptive Field Bio-inspired Binocular Disparity with Position-Shift Receptive Field Fernanda da C. e C. Faria, Jorge Batista and Helder Araújo Institute of Systems and Robotics, Department of Electrical Engineering and

More information

Computational Perception. Visual Coding 3

Computational Perception. Visual Coding 3 Computational Perception 15-485/785 February 21, 2008 Visual Coding 3 A gap in the theory? - - + - - from Hubel, 1995 2 Eye anatomy from Hubel, 1995 Photoreceptors: rods (night vision) and cones (day vision)

More information

Modeling Visual Cortex V4 in Naturalistic Conditions with Invari. Representations

Modeling Visual Cortex V4 in Naturalistic Conditions with Invari. Representations Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant and Sparse Image Representations Bin Yu Departments of Statistics and EECS University of California at Berkeley Rutgers University, May

More information

A Hierarchial Model for Visual Perception

A Hierarchial Model for Visual Perception A Hierarchial Model for Visual Perception Bolei Zhou 1 and Liqing Zhang 2 1 MOE-Microsoft Laboratory for Intelligent Computing and Intelligent Systems, and Department of Biomedical Engineering, Shanghai

More information

ICA mixture models for image processing

ICA mixture models for image processing I999 6th Joint Sy~nposiurn orz Neural Computation Proceedings ICA mixture models for image processing Te-Won Lee Michael S. Lewicki The Salk Institute, CNL Carnegie Mellon University, CS & CNBC 10010 N.

More information

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science V1-inspired orientation selective filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this

More information

The elements of early vision or, what vision (and this course) is all about. NYU/CNS Center for Neural Science

The elements of early vision or, what vision (and this course) is all about. NYU/CNS Center for Neural Science The elements of early vision or, what vision (and this course) is all about NYU/CNS Center for Neural Science The Electric Monk was a labor saving device, like a dishwasher or a video recorder. Dishwashers

More information

Project proposal: Statistical models of visual neurons

Project proposal: Statistical models of visual neurons Project proposal: Statistical models of visual neurons Anna Sotnikova asotniko@math.umd.edu Project Advisor: Prof. Daniel A. Butts dab@umd.edu Department of Biology Abstract Studying visual neurons may

More information

The most cited papers in Computer Vision

The most cited papers in Computer Vision COMPUTER VISION, PUBLICATION The most cited papers in Computer Vision In Computer Vision, Paper Talk on February 10, 2012 at 11:10 pm by gooly (Li Yang Ku) Although it s not always the case that a paper

More information

CS 523: Multimedia Systems

CS 523: Multimedia Systems CS 523: Multimedia Systems Angus Forbes creativecoding.evl.uic.edu/courses/cs523 Today - Convolutional Neural Networks - Work on Project 1 http://playground.tensorflow.org/ Convolutional Neural Networks

More information

Learning Transformational Invariants from Natural Movies

Learning Transformational Invariants from Natural Movies Learning Transformational Invariants from Natural Movies Charles F. Cadieu & Bruno A. Olshausen Helen Wills Neuroscience Institute University of California, Berkeley Berkeley, CA 94720 {cadieu, baolshausen}@berkeley.edu

More information

Neural Nets. CSCI 5582, Fall 2007

Neural Nets. CSCI 5582, Fall 2007 Neural Nets CSCI 5582, Fall 2007 Assignments For this week: Chapter 20, section 5 Problem Set 3 is due a week from today Neural Networks: Some First Concepts Each neural element is loosely based on the

More information

SPARSE COMPONENT ANALYSIS FOR BLIND SOURCE SEPARATION WITH LESS SENSORS THAN SOURCES. Yuanqing Li, Andrzej Cichocki and Shun-ichi Amari

SPARSE COMPONENT ANALYSIS FOR BLIND SOURCE SEPARATION WITH LESS SENSORS THAN SOURCES. Yuanqing Li, Andrzej Cichocki and Shun-ichi Amari SPARSE COMPONENT ANALYSIS FOR BLIND SOURCE SEPARATION WITH LESS SENSORS THAN SOURCES Yuanqing Li, Andrzej Cichocki and Shun-ichi Amari Laboratory for Advanced Brain Signal Processing Laboratory for Mathematical

More information

3D Human Motion Analysis and Manifolds

3D Human Motion Analysis and Manifolds D E P A R T M E N T O F C O M P U T E R S C I E N C E U N I V E R S I T Y O F C O P E N H A G E N 3D Human Motion Analysis and Manifolds Kim Steenstrup Pedersen DIKU Image group and E-Science center Motivation

More information

Pattern recognition (3)

Pattern recognition (3) Pattern recognition (3) 1 Things we have discussed until now Statistical pattern recognition Building simple classifiers Supervised classification Minimum distance classifier Bayesian classifier Building

More information

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution Detecting Salient Contours Using Orientation Energy Distribution The Problem: How Does the Visual System Detect Salient Contours? CPSC 636 Slide12, Spring 212 Yoonsuck Choe Co-work with S. Sarma and H.-C.

More information

Human Vision Based Object Recognition Sye-Min Christina Chan

Human Vision Based Object Recognition Sye-Min Christina Chan Human Vision Based Object Recognition Sye-Min Christina Chan Abstract Serre, Wolf, and Poggio introduced an object recognition algorithm that simulates image processing in visual cortex and claimed to

More information

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2D Gabor functions and filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this presentation

More information

Department of Psychology, Uris Hall, Cornell University, Ithaca, New York

Department of Psychology, Uris Hall, Cornell University, Ithaca, New York Natural image statistics and ecient coding B A Olshausen and D J Field Department of Psychology, Uris Hall, Cornell University, Ithaca, New York 14853. Email: bao1@cornell.edu, djf3@cornell.edu To appear

More information

Deep Learning in Image Processing

Deep Learning in Image Processing Deep Learning in Image Processing Roland Memisevic University of Montreal & TwentyBN ICISP 2016 Roland Memisevic Deep Learning in Image Processing ICISP 2016 f 2? cathedral high-rise f 1 It s the features,

More information

OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES

OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES Image Processing & Communication, vol. 18,no. 1, pp.31-36 DOI: 10.2478/v10248-012-0088-x 31 OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES RAFAŁ KOZIK ADAM MARCHEWKA Institute of Telecommunications, University

More information

Advanced Techniques for Mobile Robotics Bag-of-Words Models & Appearance-Based Mapping

Advanced Techniques for Mobile Robotics Bag-of-Words Models & Appearance-Based Mapping Advanced Techniques for Mobile Robotics Bag-of-Words Models & Appearance-Based Mapping Wolfram Burgard, Cyrill Stachniss, Kai Arras, Maren Bennewitz Motivation: Analogy to Documents O f a l l t h e s e

More information

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2D Gabor functions and filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this presentation

More information

Efficient Visual Coding: From Retina To V2

Efficient Visual Coding: From Retina To V2 Efficient Visual Coding: From Retina To V Honghao Shan Garrison Cottrell Computer Science and Engineering UCSD La Jolla, CA 9093-0404 shanhonghao@gmail.com, gary@ucsd.edu Abstract The human visual system

More information

Bilevel Sparse Coding

Bilevel Sparse Coding Adobe Research 345 Park Ave, San Jose, CA Mar 15, 2013 Outline 1 2 The learning model The learning algorithm 3 4 Sparse Modeling Many types of sensory data, e.g., images and audio, are in high-dimensional

More information

Fitting Models to Distributed Representations of Vision

Fitting Models to Distributed Representations of Vision Fitting Models to Distributed Representations of Vision Sourabh A. Niyogi Department of Electrical Engineering and Computer Science MIT Media Laboratory, 20 Ames Street, Cambridge, MA 02139 Abstract Many

More information

Computational Models of V1 cells. Gabor and CORF

Computational Models of V1 cells. Gabor and CORF 1 Computational Models of V1 cells Gabor and CORF George Azzopardi Nicolai Petkov 2 Primary visual cortex (striate cortex or V1) 3 Types of V1 Cells Hubel and Wiesel, Nobel Prize winners Three main types

More information

Backpropagation in Neural Nets, and an Introduction to Vision. CSCI 5582, Fall 2007

Backpropagation in Neural Nets, and an Introduction to Vision. CSCI 5582, Fall 2007 Backpropagation in Neural Nets, and an Introduction to Vision CSCI 5582, Fall 2007 Assignments Problem Set 3 is due a week from today The Update Rule for a Weighted Edge of a Perceptron To update the weight

More information

Does the Brain do Inverse Graphics?

Does the Brain do Inverse Graphics? Does the Brain do Inverse Graphics? Geoffrey Hinton, Alex Krizhevsky, Navdeep Jaitly, Tijmen Tieleman & Yichuan Tang Department of Computer Science University of Toronto How to learn many layers of features

More information

Saliency Extraction for Gaze-Contingent Displays

Saliency Extraction for Gaze-Contingent Displays In: Workshop on Organic Computing, P. Dadam, M. Reichert (eds.), Proceedings of the 34th GI-Jahrestagung, Vol. 2, 646 650, Ulm, September 2004. Saliency Extraction for Gaze-Contingent Displays Martin Böhme,

More information

Learning-based Methods in Vision

Learning-based Methods in Vision Learning-based Methods in Vision 16-824 Sparsity and Deep Learning Motivation Multitude of hand-designed features currently in use in vision - SIFT, HoG, LBP, MSER, etc. Even the best approaches, just

More information

Independent Component Analysis (ICA) in Real and Complex Fourier Space: An Application to Videos and Natural Scenes

Independent Component Analysis (ICA) in Real and Complex Fourier Space: An Application to Videos and Natural Scenes Independent Component Analysis (ICA) in Real and Complex Fourier Space: An Application to Videos and Natural Scenes By Nimit Kumar* and Shantanu Sharma** {nimitk@iitk.ac.in, shsharma@iitk.ac.in} A Project

More information

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1 Last update: May 4, 200 Vision CMSC 42: Chapter 24 CMSC 42: Chapter 24 Outline Perception generally Image formation Early vision 2D D Object recognition CMSC 42: Chapter 24 2 Perception generally Stimulus

More information

Multiview Feature Learning

Multiview Feature Learning Multiview Feature Learning Roland Memisevic Frankfurt, Montreal Tutorial at IPAM 2012 Roland Memisevic (Frankfurt, Montreal) Multiview Feature Learning Tutorial at IPAM 2012 1 / 163 Outline 1 Introduction

More information

SPARSE CODES FOR NATURAL IMAGES. Davide Scaramuzza Autonomous Systems Lab (EPFL)

SPARSE CODES FOR NATURAL IMAGES. Davide Scaramuzza Autonomous Systems Lab (EPFL) SPARSE CODES FOR NATURAL IMAGES Davide Scaramuzza (davide.scaramuzza@epfl.ch) Autonomous Systems Lab (EPFL) Final rapport of Wavelet Course MINI-PROJECT (Prof. Martin Vetterli) ABSTRACT The human visual

More information

/

/ ... 3 1.... 7 1.1.,.... 7 1.1.... 8 1.2.... 13 2.... 16 2.1.... 16 2.2.... 17 2.3.... 21 2.4.... 27 3.... 32 4.... 39 4.1..... 39 4.2..... 39 4.3. /.... 42 4.4. /.... 44 4.5. /.... 47 4.6. - /.... 49 4.7.....

More information

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science. Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Object Recognition in Large Databases Some material for these slides comes from www.cs.utexas.edu/~grauman/courses/spring2011/slides/lecture18_index.pptx

More information

Dictionary learning Based on Laplacian Score in Sparse Coding

Dictionary learning Based on Laplacian Score in Sparse Coding Dictionary learning Based on Laplacian Score in Sparse Coding Jin Xu and Hong Man Department of Electrical and Computer Engineering, Stevens institute of Technology, Hoboken, NJ 73 USA Abstract. Sparse

More information

EMERGENCE OF SIMPLE-CELL RECEPTIVE PROPERTIES BY LEARNING A SPARSE CODE FOR NATURAL IMAGES

EMERGENCE OF SIMPLE-CELL RECEPTIVE PROPERTIES BY LEARNING A SPARSE CODE FOR NATURAL IMAGES EMERGENCE OF SIMPLE-CELL RECEPTIVE PROPERTIES BY LEARNING A SPARSE CODE FOR NATURAL IMAGES Bruno A. Olshausen & David J. Field Presenter: Ozgur Yigit Balkan Outline Linear data representations Sparse Vector

More information

Self-Organizing Sparse Codes

Self-Organizing Sparse Codes Self-Organizing Sparse Codes Yangqing Jia Sergey Karayev 2010-12-16 Abstract Sparse coding as applied to natural image patches learns Gabor-like components that resemble those found in the lower areas

More information

Simple Method for High-Performance Digit Recognition Based on Sparse Coding

Simple Method for High-Performance Digit Recognition Based on Sparse Coding 1 Simple Method for High-Performance Digit Recognition Based on Sparse Coding Kai Labusch Institute for Neuro- and Bioinformatics University of Lübeck D-23538 Lübeck labusch@inb.uni-luebeck.de Erhardt

More information

A Simple Vision System

A Simple Vision System Chapter 1 A Simple Vision System 1.1 Introduction In 1966, Seymour Papert wrote a proposal for building a vision system as a summer project [4]. The abstract of the proposal starts stating a simple goal:

More information

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech Convolutional Neural Networks Computer Vision Jia-Bin Huang, Virginia Tech Today s class Overview Convolutional Neural Network (CNN) Training CNN Understanding and Visualizing CNN Image Categorization:

More information

Joint design of data analysis algorithms and user interface for video applications

Joint design of data analysis algorithms and user interface for video applications Joint design of data analysis algorithms and user interface for video applications Nebojsa Jojic Microsoft Research Sumit Basu Microsoft Research Nemanja Petrovic University of Illinois Brendan Frey University

More information

What is Computer Vision?

What is Computer Vision? Perceptual Grouping in Computer Vision Gérard Medioni University of Southern California What is Computer Vision? Computer Vision Attempt to emulate Human Visual System Perceive visual stimuli with cameras

More information

Filter Select ion Model for Generating Visual Mot ion Signals

Filter Select ion Model for Generating Visual Mot ion Signals In: C. L. Giles, S. J. Hanson, and J. D. Cowan (Eds.) Advances in Neural Infirmation Processing Systems 5, Morgan Kaufman Publishers, San Mateo, CA, (1993). Filter Select ion Model for Generating Visual

More information

a labelled visual world, images together with scenes, in order to infer scenes from images. The image data might be single or multiple frames; the sce

a labelled visual world, images together with scenes, in order to infer scenes from images. The image data might be single or multiple frames; the sce To appear in: Adv. Neural Information Processing Systems 11, M. S. Kearns, S. A. Solla and D. A. Cohn, eds. MIT Press, 1999. Learning to estimate scenes from images William T. Freeman and Egon C. Pasztor

More information

Recursive ICA. Abstract

Recursive ICA. Abstract Recursive ICA Honghao Shan, Lingyun Zhang, Garrison W. Cottrell Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 92093-0404 {hshan,lingyun,gary}@cs.ucsd.edu

More information

The cerebral cortex model that self-organizes conditional probability tables and executes belief propagation IJCNN 2007 #1065

The cerebral cortex model that self-organizes conditional probability tables and executes belief propagation IJCNN 2007 #1065 The cerebral corte model that self-organizes conditional probability tables and eecutes belief propagation IJCNN 007 #065 National Institute of Advanced Industrial Science and TechnologyAIST, Japan uuji

More information

Statistical models of visual neurons

Statistical models of visual neurons Statistical models of visual neurons Final Presentation Anna Sotnikova Applied Mathematics and Statistics, and Scientific Computation program Advisor: Dr. Daniel A. Butts Department of Biology 1 The power

More information

A Developing Sensory Mapping for Robots

A Developing Sensory Mapping for Robots A Developing Sensory Mapping for Robots Nan Zhang and John Weng Zhengyou Zhang Dept. of Computer Sci. and Engr. Microsoft Research Michigan State University Microsoft Corp. East Lansing, MI 48823 Redmond,

More information

Overview. Related Work Tensor Voting in 2-D Tensor Voting in 3-D Tensor Voting in N-D Application to Vision Problems Stereo Visual Motion

Overview. Related Work Tensor Voting in 2-D Tensor Voting in 3-D Tensor Voting in N-D Application to Vision Problems Stereo Visual Motion Overview Related Work Tensor Voting in 2-D Tensor Voting in 3-D Tensor Voting in N-D Application to Vision Problems Stereo Visual Motion Binary-Space-Partitioned Images 3-D Surface Extraction from Medical

More information

Presentation Outline. Semantic Segmentation. Overview. Presentation Outline CNN. Learning Deconvolution Network for Semantic Segmentation 6/6/16

Presentation Outline. Semantic Segmentation. Overview. Presentation Outline CNN. Learning Deconvolution Network for Semantic Segmentation 6/6/16 6/6/16 Learning Deconvolution Network for Semantic Segmentation Hyeonwoo Noh, Seunghoon Hong,Bohyung Han Department of Computer Science and Engineering, POSTECH, Korea Shai Rozenberg 6/6/2016 1 2 Semantic

More information

An Event-based Optical Flow Algorithm for Dynamic Vision Sensors

An Event-based Optical Flow Algorithm for Dynamic Vision Sensors An Event-based Optical Flow Algorithm for Dynamic Vision Sensors Iffatur Ridwan and Howard Cheng Department of Mathematics and Computer Science University of Lethbridge, Canada iffatur.ridwan@uleth.ca,howard.cheng@uleth.ca

More information

Persistent Homology for Characterizing Stimuli Response in the Primary Visual Cortex

Persistent Homology for Characterizing Stimuli Response in the Primary Visual Cortex Persistent Homology for Characterizing Stimuli Response in the Primary Visual Cortex Avani Wildani Tatyana O. Sharpee (PI) ICML Topology 6/25/2014 The big questions - How do we turn sensory stimuli into

More information

Autoencoders, denoising autoencoders, and learning deep networks

Autoencoders, denoising autoencoders, and learning deep networks 4 th CiFAR Summer School on Learning and Vision in Biology and Engineering Toronto, August 5-9 2008 Autoencoders, denoising autoencoders, and learning deep networks Part II joint work with Hugo Larochelle,

More information

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT Oct. 15, 2013 Prof. Ronald Fearing Electrical Engineering and Computer Sciences University of California, Berkeley (slides courtesy of Prof. John Wawrzynek)

More information

Sparse Models in Image Understanding And Computer Vision

Sparse Models in Image Understanding And Computer Vision Sparse Models in Image Understanding And Computer Vision Jayaraman J. Thiagarajan Arizona State University Collaborators Prof. Andreas Spanias Karthikeyan Natesan Ramamurthy Sparsity Sparsity of a vector

More information

Processing Framework Proposed by Marr. Image

Processing Framework Proposed by Marr. Image Processing Framework Proposed by Marr Recognition 3D structure; motion characteristics; surface properties Shape From stereo Motion flow Shape From motion Color estimation Shape From contour Shape From

More information

Representing the World

Representing the World Table of Contents Representing the World...1 Sensory Transducers...1 The Lateral Geniculate Nucleus (LGN)... 2 Areas V1 to V5 the Visual Cortex... 2 Computer Vision... 3 Intensity Images... 3 Image Focusing...

More information

m Environment Output Activation 0.8 Output Activation Input Value

m Environment Output Activation 0.8 Output Activation Input Value Learning Sensory-Motor Cortical Mappings Without Training Mike Spratling Gillian Hayes Department of Articial Intelligence University of Edinburgh mikes@dai.ed.ac.uk gmh@dai.ed.ac.uk Abstract. This paper

More information

Recognition with Bag-ofWords. (Borrowing heavily from Tutorial Slides by Li Fei-fei)

Recognition with Bag-ofWords. (Borrowing heavily from Tutorial Slides by Li Fei-fei) Recognition with Bag-ofWords (Borrowing heavily from Tutorial Slides by Li Fei-fei) Recognition So far, we ve worked on recognizing edges Now, we ll work on recognizing objects We will use a bag-of-words

More information

Chapter 8: Convolutional Networks

Chapter 8: Convolutional Networks Chapter 8: Convolutional Networks Dietrich Klakow Spoken Language Systems Saarland University, Germany Dietrich.Klakow@LSV.Uni-Saarland.De Neural Networks Implementation and Application Introduction Source:

More information

CAN THE SUN S DIRECTION BE ESTIMATED PRIOR TO THE DETERMINATION OF SHAPE?

CAN THE SUN S DIRECTION BE ESTIMATED PRIOR TO THE DETERMINATION OF SHAPE? CAN THE SUN S DIRECTION BE ESTIMATED PRIOR TO THE DETERMINATION OF SHAPE? WOJCIECH CHOJNACKI MICHAEL J. BROOKS Department of Computer Science, University of Adelaide Adelaide, SA 5005, Australia and DANIEL

More information

Geometry of Multiple views

Geometry of Multiple views 1 Geometry of Multiple views CS 554 Computer Vision Pinar Duygulu Bilkent University 2 Multiple views Despite the wealth of information contained in a a photograph, the depth of a scene point along the

More information

Non-Differentiable Image Manifolds

Non-Differentiable Image Manifolds The Multiscale Structure of Non-Differentiable Image Manifolds Michael Wakin Electrical l Engineering i Colorado School of Mines Joint work with Richard Baraniuk, Hyeokho Choi, David Donoho Models for

More information

Features that draw visual attention: an information theoretic perspective

Features that draw visual attention: an information theoretic perspective Neurocomputing 65 66 (2005) 125 133 www.elsevier.com/locate/neucom Features that draw visual attention: an information theoretic perspective Neil D.B. Bruce a,b, a Department of Computer Science, York

More information

Image Representation by Active Curves

Image Representation by Active Curves Image Representation by Active Curves Wenze Hu Ying Nian Wu Song-Chun Zhu Department of Statistics, UCLA {wzhu,ywu,sczhu}@stat.ucla.edu Abstract This paper proposes a sparse image representation using

More information

Think-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth?

Think-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth? Think-Pair-Share What visual or physiological cues help us to perceive 3D shape and depth? [Figure from Prados & Faugeras 2006] Shading Focus/defocus Images from same point of view, different camera parameters

More information

Object-Based Saliency Maps Harry Marr Computing BSc 2009/2010

Object-Based Saliency Maps Harry Marr Computing BSc 2009/2010 Object-Based Saliency Maps Harry Marr Computing BSc 2009/2010 The candidate confirms that the work submitted is their own and the appropriate credit has been given where reference has been made to the

More information

Dynamic visual attention: competitive versus motion priority scheme

Dynamic visual attention: competitive versus motion priority scheme Dynamic visual attention: competitive versus motion priority scheme Bur A. 1, Wurtz P. 2, Müri R.M. 2 and Hügli H. 1 1 Institute of Microtechnology, University of Neuchâtel, Neuchâtel, Switzerland 2 Perception

More information

Sparsity and image processing

Sparsity and image processing Sparsity and image processing Aurélie Boisbunon INRIA-SAM, AYIN March 6, Why sparsity? Main advantages Dimensionality reduction Fast computation Better interpretability Image processing pattern recognition

More information

Neural competitive structures for segmentation based on motion features

Neural competitive structures for segmentation based on motion features Neural competitive structures for segmentation based on motion features Javier Díaz 1, Sonia Mota 1, Eduardo Ros 1 and Guillermo Botella 1 1 Departamento de Arquitectura y Tecnología de Computadores, E.T.S.I.

More information

Learning to Perceive Transparency from the Statistics of Natural Scenes

Learning to Perceive Transparency from the Statistics of Natural Scenes Learning to Perceive Transparency from the Statistics of Natural Scenes Anat Levin Assaf Zomet Yair Weiss School of Computer Science and Engineering The Hebrew University of Jerusalem 9194 Jerusalem, Israel

More information

Facial Expression Classification with Random Filters Feature Extraction

Facial Expression Classification with Random Filters Feature Extraction Facial Expression Classification with Random Filters Feature Extraction Mengye Ren Facial Monkey mren@cs.toronto.edu Zhi Hao Luo It s Me lzh@cs.toronto.edu I. ABSTRACT In our work, we attempted to tackle

More information

Sparse PCA Extracting Multi-scale Structure from Data

Sparse PCA Extracting Multi-scale Structure from Data Proceedings International Conference on Computer Vision, pages 641-647, Vancouver, Canada, 21 c IEEE 1 Sparse PCA Extracting Multi-scale Structure from Data Chakra Chennubhotla & Allan Jepson Department

More information

Online Learning for Object Recognition with a Hierarchical Visual Cortex Model

Online Learning for Object Recognition with a Hierarchical Visual Cortex Model Online Learning for Object Recognition with a Hierarchical Visual Cortex Model Stephan Kirstein, Heiko Wersing, and Edgar Körner Honda Research Institute Europe GmbH Carl Legien Str. 30 63073 Offenbach

More information

Bayesian Model of Dynamic Image Stabilization in the Visual System

Bayesian Model of Dynamic Image Stabilization in the Visual System Bayesian Model of Dynamic Image Stabilization in the Visual System The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Burak,

More information

A Modified Approach to Biologically Motivated Saliency Mapping

A Modified Approach to Biologically Motivated Saliency Mapping A Modified Approach to Biologically Motivated Saliency Mapping Shane Grant Department of Computer Science University of California, San Diego La Jolla, CA 9093 wgrant@ucsd.edu Kevin A Heins Department

More information

Nonparametric sparse hierarchical models describe V1 fmri responses to natural images

Nonparametric sparse hierarchical models describe V1 fmri responses to natural images Nonparametric sparse hierarchical models describe V1 fmri responses to natural images Pradeep Ravikumar, Vincent Q. Vu and Bin Yu Department of Statistics University of California, Berkeley Berkeley, CA

More information

Wavelets, vision and the statistics of natural scenes

Wavelets, vision and the statistics of natural scenes Wavelets, vision and the statistics of natural scenes By D. J. Field Uris Hall, Cornell University, Ithaca, NY 14853, USA The processing of spatial information by the visual system shows a number of similarities

More information

Does the Brain do Inverse Graphics?

Does the Brain do Inverse Graphics? Does the Brain do Inverse Graphics? Geoffrey Hinton, Alex Krizhevsky, Navdeep Jaitly, Tijmen Tieleman & Yichuan Tang Department of Computer Science University of Toronto The representation used by the

More information

Learning transport operators for image manifolds

Learning transport operators for image manifolds Learning transport operators for image manifolds Benjamin J. Culpepper Department of EECS Computer Science Division University of California, Berkeley Berkeley, CA 9472 bjc@cs.berkeley.edu Bruno A. Olshausen

More information

Content Based Image Retrieval Using Curvelet Transform

Content Based Image Retrieval Using Curvelet Transform Content Based Image Retrieval Using Curvelet Transform Ishrat Jahan Sumana, Md. Monirul Islam, Dengsheng Zhang and Guojun Lu Gippsland School of Information Technology, Monash University Churchill, Victoria

More information

Hierarchical Discriminative Sparse Coding via Bidirectional Connections

Hierarchical Discriminative Sparse Coding via Bidirectional Connections Hierarchical Discriminative Sparse Coding via Bidirectional Connections Zhengping Ji, Wentao Huang, Garrett Kenyon and Luis M. A. Bettencourt Abstract Conventional sparse coding learns optimal dictionaries

More information

Perception, Part 2 Gleitman et al. (2011), Chapter 5

Perception, Part 2 Gleitman et al. (2011), Chapter 5 Perception, Part 2 Gleitman et al. (2011), Chapter 5 Mike D Zmura Department of Cognitive Sciences, UCI Psych 9A / Psy Beh 11A February 27, 2014 T. M. D'Zmura 1 Visual Reconstruction of a Three-Dimensional

More information

A RECOGNITION SYSTEM THAT USES SACCADES TO DETECT CARS FROM REAL-TIME VIDEO STREAMS. Predrag Neskouic, Leon N Cooper* David Schuster t

A RECOGNITION SYSTEM THAT USES SACCADES TO DETECT CARS FROM REAL-TIME VIDEO STREAMS. Predrag Neskouic, Leon N Cooper* David Schuster t Proceedings of the 9th International Conference on Neural Information Processing (ICONIP'OZ), Vol. 5 Lip0 Wang, Jagath C. Rajapakse, Kunihiko Fukushima, Soo-Young Lee, and Xin Yao (Editors) A RECOGNITION

More information

Neurally Inspired Mechanisms for the Dynamic Visual Attention Map Generation Task

Neurally Inspired Mechanisms for the Dynamic Visual Attention Map Generation Task Neurally Inspired Mechanisms for the Dynamic Visual Attention Map Generation Task Maria T. López 1, Miguel A. Fernández 1, Antonio Fernández-Caballero 1, and Ana E. Delgado 2 1 Departamento de Informática

More information

Arbib: Slides for TMB2 Section 7.2 1

Arbib: Slides for TMB2 Section 7.2 1 Arbib: Slides for TMB2 Section 7.2 1 Lecture 20: Optic Flow Reading assignment: TMB2 7.2 If, as we walk forward, we recognize that a tree appears to be getting bigger, we can infer that the tree is in

More information

Illumination and Reflectance

Illumination and Reflectance COMP 546 Lecture 12 Illumination and Reflectance Tues. Feb. 20, 2018 1 Illumination and Reflectance Shading Brightness versus Lightness Color constancy Shading on a sunny day N(x) L N L Lambert s (cosine)

More information

High-level Vision as Statistical Inference. Daniel Kersten. SHORT TITLE: Vision as Statistical Inference.

High-level Vision as Statistical Inference. Daniel Kersten. SHORT TITLE: Vision as Statistical Inference. 1 High-level Vision as Statistical Inference Daniel Kersten SHORT TITLE: Vision as Statistical Inference kersten@tc.umn.edu Department of Psychology, University of Minnesota, 75 East River Road,Minneapolis,

More information

Lecture 2: Spatial And-Or graph

Lecture 2: Spatial And-Or graph Lecture 2: Spatial And-Or graph for representing the scene-object-part-primitive hierarchy Song-Chun Zhu Center for Vision, Cognition, Learning and Arts University of California, Los Angeles At CVPR, Providence,

More information