Perception as an Inference Problem
|
|
- Esmond Pope
- 5 years ago
- Views:
Transcription
1 Perception as an Inference Problem Bruno A. Olshausen Helen Wills Neuroscience Institute, School of Optometry and Redwood Center for Theoretical Neuroscience UC Berkeley
2 What are the principles governing information processing in this system?
3 Wallisch & Movshon (2008) Gabor filters..?.. objects. faces
4 Two views of visual system function Deduction - feature extraction, classification - (Hubel & Wiesel; Fukushima; deep learning ) Inference - generative models, recurrent computation - (Helmholtz; Nakayama; Kersten & Yuille; Geman; Lee & Mumford)
5 Hubel & Wiesel (1962, 1965) Hypercomplex Complex 20. The hypothetical illustrated has a complex Simple
6 output (y) y = f(x; w) input data (x)
7 Is this the goal of vision?
8 Visual Navigation in Box Jellyfish 799 jumping spider sand wasp Figure 1. Rhopali of the Upper Lens box jellyfish (A and B) In freely s lia maintain a cons the medusa chan heavy crystal (sta rhopalium causes such that the oriented. Thus, th straight upward body orientation. ated on the far sid eyes directed to t (C) Modeling the peripheral photore angular sensitivity ceptors are supe cording to the co
9 .... state (s) sensory data (x) actuator movement (a) ṡ + s = g(s, x, a; w) a = f(s)
10 Vision as inference lens World Image Model
11
12 Separation of shape and reflectance reflectance shading (Adelson, 2000)
13 Possible neural circuits for inferential computation in V1 1. Sparse coding 2. Separating form and motion from time-varying images
14 ... Sparse coding External world Internal model image model (Olshausen & Field, 1996; Chen, Donoho & Saunders 1995) I(x,y) φ i (x,y) ai MX I( x) = a i i ( x)+ ( x) i=1 image neural features other activities stuff (sparse)
15 Energy function preserve information be sparse
16 Energy function -log P(I a) P(a) preserve information be sparse
17 Coefficients a i may be computed via thresholding and lateral inhibition ( LCA - Rozell, Johnson, Baraniuk & Olshausen, 2008) g g g g g b i = X x i(x) I(x) G ij = X x i(x) j (x)
18 1.25x 2.5x 5x 10x
19 Two examples 1. Sparse coding 2. Separating form and motion from time-varying images
20 Visual perception requires separation of form and motion from time-varying retinal images (eye movement data from Austin Roorda, UC Berkeley)
21 Simple averaging is not sufficient
22 The problem I(~x, t) =S(~x ~x(t)) + (~x, t) ˆ~x(t) = arg min ~x(t) I(~x, t) S(~x ~x(t)) 2 Z Ŝ(~x) = I(~x + ~x(t)) dt
23 Traditional models compute motion and form independently motion energy and pooling optic flow time-varying image feature extraction and pooling invariant pattern recognition
24 Traditional models compute motion and form independently motion energy and pooling optic flow time-varying image feature extraction and pooling invariant pattern recognition
25 Motion and form must be estimated simultaneously ) time-varying image estimate motion optic flow regularization (smoothness) ) time-varying image estimate motion estimate pattern form motion pattern form natural scene statistics prior natural scene statistics prior
26 Graphical model for separating form and motion (Alex Anderson, Ph.D. thesis) X 0 X 1 X 2 Eye position R 0 R 1 R 2 Spikes (from LGN afferents) S Pattern Ŝ = arg max S log P (R S)
27 Given current estimate of position (X), update S Retina Internal Position Estimate (X) Internal Form Estimate (S)
28 Given current estimate of form (S), update X P(X t R 0:t ) R t+1 S = S t P(X t+1 R 0:t ) P(R t+1 X t+1,s = S t ) P(X t+1 R 0:t+1 )
29 Joint estimation of form (S) and position (X)
30 Including a prior over form (S) X 0 X 1 X 2 Eye position R 0 R 1 R 2 Spikes (from LGN afferents) S A D Pattern Dictionary Sparse representation  = arg max A log P (R A) + log P (A) sparse
31 Learned dictionary D
32 Prior over form (S) improves inference
33 Form prior improves inference
34 Main points Perception seems better described as an inference problem that attempts to disentangle underlying causes from image data. Inference involves bidirectional information flow both within and between levels of representation. This moves us away from thinking of receptive fields and instead toward how populations of neurons interact to perform collective computations.
35 Papers Olshausen BA (2014) Perception as an Inference Problem. In: The Cognitive Neurosciences V. M. Gazzaniga, R. Mangun, Eds. MIT Press. Rozell CJ, Johnson DH, Baraniuk RG, Olshausen BA (2008). Sparse Coding via Thresholding and Local Competition in Neural Circuits. Neural Computation, 20, Olshausen BA (2013) Highly overcomplete sparse coding. In: SPIE Proceedings vol. 8651: Human Vision and Electronic Imaging XVIII, (B.E. Rogowitz, T.N. Pappas, H. de Ridder, Eds.), Feb. 4-7, 2013, San Francisco, California.
The Sparse Manifold Transform
The Sparse Manifold Transform Bruno Olshausen Redwood Center for Theoretical Neuroscience, Helen Wills Neuroscience Institute, and School of Optometry UC Berkeley with Yubei Chen (EECS) and Dylan Paiton
More informationFrom natural scene statistics to models of neural coding and representation (part 1)
From natural scene statistics to models of neural coding and representation (part 1) Bruno A. Olshausen Helen Wills Neuroscience Institute, School of Optometry and Redwood Center for Theoretical Neuroscience
More informationLearning representations for active vision
Learning representations for active vision Bruno Olshausen Redwood Center for Theoretical Neuroscience, Helen Wills Neuroscience Institute, and School of Optometry UC Berkeley Brian Cheung Alex Anderson
More informationIntroduction to visual computation and the primate visual system
Introduction to visual computation and the primate visual system Problems in vision Basic facts about the visual system Mathematical models for early vision Marr s computational philosophy and proposal
More informationModern Signal Processing and Sparse Coding
Modern Signal Processing and Sparse Coding School of Electrical and Computer Engineering Georgia Institute of Technology March 22 2011 Reason D etre Modern signal processing Signals without cosines? Sparse
More informationC. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun
Efficient Learning of Sparse Overcomplete Representations with an Energy-Based Model Marc'Aurelio Ranzato C. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun CIAR Summer School Toronto 2006 Why Extracting
More informationBilinear Models of Natural Images
In: SPIE Proceedings vol. 6492: Human Vision and Electronic Imaging XII, (B.E. Rogowitz, T.N. Pappas, S.J. Daly, Eds.), Jan 28-Feb 1, 2007, San Jose, California Bilinear Models of Natural Images Bruno
More informationNatural image statistics and efficient coding
Network: Computation in Neural Systems 7 (1996) 333 339. Printed in the UK WORKSHOP PAPER Natural image statistics and efficient coding B A Olshausen and D J Field Department of Psychology, Uris Hall,
More informationBio-inspired Binocular Disparity with Position-Shift Receptive Field
Bio-inspired Binocular Disparity with Position-Shift Receptive Field Fernanda da C. e C. Faria, Jorge Batista and Helder Araújo Institute of Systems and Robotics, Department of Electrical Engineering and
More informationComputational Perception. Visual Coding 3
Computational Perception 15-485/785 February 21, 2008 Visual Coding 3 A gap in the theory? - - + - - from Hubel, 1995 2 Eye anatomy from Hubel, 1995 Photoreceptors: rods (night vision) and cones (day vision)
More informationModeling Visual Cortex V4 in Naturalistic Conditions with Invari. Representations
Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant and Sparse Image Representations Bin Yu Departments of Statistics and EECS University of California at Berkeley Rutgers University, May
More informationA Hierarchial Model for Visual Perception
A Hierarchial Model for Visual Perception Bolei Zhou 1 and Liqing Zhang 2 1 MOE-Microsoft Laboratory for Intelligent Computing and Intelligent Systems, and Department of Biomedical Engineering, Shanghai
More informationICA mixture models for image processing
I999 6th Joint Sy~nposiurn orz Neural Computation Proceedings ICA mixture models for image processing Te-Won Lee Michael S. Lewicki The Salk Institute, CNL Carnegie Mellon University, CS & CNBC 10010 N.
More informationNicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science
V1-inspired orientation selective filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this
More informationThe elements of early vision or, what vision (and this course) is all about. NYU/CNS Center for Neural Science
The elements of early vision or, what vision (and this course) is all about NYU/CNS Center for Neural Science The Electric Monk was a labor saving device, like a dishwasher or a video recorder. Dishwashers
More informationProject proposal: Statistical models of visual neurons
Project proposal: Statistical models of visual neurons Anna Sotnikova asotniko@math.umd.edu Project Advisor: Prof. Daniel A. Butts dab@umd.edu Department of Biology Abstract Studying visual neurons may
More informationThe most cited papers in Computer Vision
COMPUTER VISION, PUBLICATION The most cited papers in Computer Vision In Computer Vision, Paper Talk on February 10, 2012 at 11:10 pm by gooly (Li Yang Ku) Although it s not always the case that a paper
More informationCS 523: Multimedia Systems
CS 523: Multimedia Systems Angus Forbes creativecoding.evl.uic.edu/courses/cs523 Today - Convolutional Neural Networks - Work on Project 1 http://playground.tensorflow.org/ Convolutional Neural Networks
More informationLearning Transformational Invariants from Natural Movies
Learning Transformational Invariants from Natural Movies Charles F. Cadieu & Bruno A. Olshausen Helen Wills Neuroscience Institute University of California, Berkeley Berkeley, CA 94720 {cadieu, baolshausen}@berkeley.edu
More informationNeural Nets. CSCI 5582, Fall 2007
Neural Nets CSCI 5582, Fall 2007 Assignments For this week: Chapter 20, section 5 Problem Set 3 is due a week from today Neural Networks: Some First Concepts Each neural element is loosely based on the
More informationSPARSE COMPONENT ANALYSIS FOR BLIND SOURCE SEPARATION WITH LESS SENSORS THAN SOURCES. Yuanqing Li, Andrzej Cichocki and Shun-ichi Amari
SPARSE COMPONENT ANALYSIS FOR BLIND SOURCE SEPARATION WITH LESS SENSORS THAN SOURCES Yuanqing Li, Andrzej Cichocki and Shun-ichi Amari Laboratory for Advanced Brain Signal Processing Laboratory for Mathematical
More information3D Human Motion Analysis and Manifolds
D E P A R T M E N T O F C O M P U T E R S C I E N C E U N I V E R S I T Y O F C O P E N H A G E N 3D Human Motion Analysis and Manifolds Kim Steenstrup Pedersen DIKU Image group and E-Science center Motivation
More informationPattern recognition (3)
Pattern recognition (3) 1 Things we have discussed until now Statistical pattern recognition Building simple classifiers Supervised classification Minimum distance classifier Bayesian classifier Building
More informationDetecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution
Detecting Salient Contours Using Orientation Energy Distribution The Problem: How Does the Visual System Detect Salient Contours? CPSC 636 Slide12, Spring 212 Yoonsuck Choe Co-work with S. Sarma and H.-C.
More informationHuman Vision Based Object Recognition Sye-Min Christina Chan
Human Vision Based Object Recognition Sye-Min Christina Chan Abstract Serre, Wolf, and Poggio introduced an object recognition algorithm that simulates image processing in visual cortex and claimed to
More informationNicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science
2D Gabor functions and filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this presentation
More informationDepartment of Psychology, Uris Hall, Cornell University, Ithaca, New York
Natural image statistics and ecient coding B A Olshausen and D J Field Department of Psychology, Uris Hall, Cornell University, Ithaca, New York 14853. Email: bao1@cornell.edu, djf3@cornell.edu To appear
More informationDeep Learning in Image Processing
Deep Learning in Image Processing Roland Memisevic University of Montreal & TwentyBN ICISP 2016 Roland Memisevic Deep Learning in Image Processing ICISP 2016 f 2? cathedral high-rise f 1 It s the features,
More informationOBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES
Image Processing & Communication, vol. 18,no. 1, pp.31-36 DOI: 10.2478/v10248-012-0088-x 31 OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES RAFAŁ KOZIK ADAM MARCHEWKA Institute of Telecommunications, University
More informationAdvanced Techniques for Mobile Robotics Bag-of-Words Models & Appearance-Based Mapping
Advanced Techniques for Mobile Robotics Bag-of-Words Models & Appearance-Based Mapping Wolfram Burgard, Cyrill Stachniss, Kai Arras, Maren Bennewitz Motivation: Analogy to Documents O f a l l t h e s e
More informationNicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science
2D Gabor functions and filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this presentation
More informationEfficient Visual Coding: From Retina To V2
Efficient Visual Coding: From Retina To V Honghao Shan Garrison Cottrell Computer Science and Engineering UCSD La Jolla, CA 9093-0404 shanhonghao@gmail.com, gary@ucsd.edu Abstract The human visual system
More informationBilevel Sparse Coding
Adobe Research 345 Park Ave, San Jose, CA Mar 15, 2013 Outline 1 2 The learning model The learning algorithm 3 4 Sparse Modeling Many types of sensory data, e.g., images and audio, are in high-dimensional
More informationFitting Models to Distributed Representations of Vision
Fitting Models to Distributed Representations of Vision Sourabh A. Niyogi Department of Electrical Engineering and Computer Science MIT Media Laboratory, 20 Ames Street, Cambridge, MA 02139 Abstract Many
More informationComputational Models of V1 cells. Gabor and CORF
1 Computational Models of V1 cells Gabor and CORF George Azzopardi Nicolai Petkov 2 Primary visual cortex (striate cortex or V1) 3 Types of V1 Cells Hubel and Wiesel, Nobel Prize winners Three main types
More informationBackpropagation in Neural Nets, and an Introduction to Vision. CSCI 5582, Fall 2007
Backpropagation in Neural Nets, and an Introduction to Vision CSCI 5582, Fall 2007 Assignments Problem Set 3 is due a week from today The Update Rule for a Weighted Edge of a Perceptron To update the weight
More informationDoes the Brain do Inverse Graphics?
Does the Brain do Inverse Graphics? Geoffrey Hinton, Alex Krizhevsky, Navdeep Jaitly, Tijmen Tieleman & Yichuan Tang Department of Computer Science University of Toronto How to learn many layers of features
More informationSaliency Extraction for Gaze-Contingent Displays
In: Workshop on Organic Computing, P. Dadam, M. Reichert (eds.), Proceedings of the 34th GI-Jahrestagung, Vol. 2, 646 650, Ulm, September 2004. Saliency Extraction for Gaze-Contingent Displays Martin Böhme,
More informationLearning-based Methods in Vision
Learning-based Methods in Vision 16-824 Sparsity and Deep Learning Motivation Multitude of hand-designed features currently in use in vision - SIFT, HoG, LBP, MSER, etc. Even the best approaches, just
More informationIndependent Component Analysis (ICA) in Real and Complex Fourier Space: An Application to Videos and Natural Scenes
Independent Component Analysis (ICA) in Real and Complex Fourier Space: An Application to Videos and Natural Scenes By Nimit Kumar* and Shantanu Sharma** {nimitk@iitk.ac.in, shsharma@iitk.ac.in} A Project
More informationLast update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1
Last update: May 4, 200 Vision CMSC 42: Chapter 24 CMSC 42: Chapter 24 Outline Perception generally Image formation Early vision 2D D Object recognition CMSC 42: Chapter 24 2 Perception generally Stimulus
More informationMultiview Feature Learning
Multiview Feature Learning Roland Memisevic Frankfurt, Montreal Tutorial at IPAM 2012 Roland Memisevic (Frankfurt, Montreal) Multiview Feature Learning Tutorial at IPAM 2012 1 / 163 Outline 1 Introduction
More informationSPARSE CODES FOR NATURAL IMAGES. Davide Scaramuzza Autonomous Systems Lab (EPFL)
SPARSE CODES FOR NATURAL IMAGES Davide Scaramuzza (davide.scaramuzza@epfl.ch) Autonomous Systems Lab (EPFL) Final rapport of Wavelet Course MINI-PROJECT (Prof. Martin Vetterli) ABSTRACT The human visual
More information/
... 3 1.... 7 1.1.,.... 7 1.1.... 8 1.2.... 13 2.... 16 2.1.... 16 2.2.... 17 2.3.... 21 2.4.... 27 3.... 32 4.... 39 4.1..... 39 4.2..... 39 4.3. /.... 42 4.4. /.... 44 4.5. /.... 47 4.6. - /.... 49 4.7.....
More informationColorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.
Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Object Recognition in Large Databases Some material for these slides comes from www.cs.utexas.edu/~grauman/courses/spring2011/slides/lecture18_index.pptx
More informationDictionary learning Based on Laplacian Score in Sparse Coding
Dictionary learning Based on Laplacian Score in Sparse Coding Jin Xu and Hong Man Department of Electrical and Computer Engineering, Stevens institute of Technology, Hoboken, NJ 73 USA Abstract. Sparse
More informationEMERGENCE OF SIMPLE-CELL RECEPTIVE PROPERTIES BY LEARNING A SPARSE CODE FOR NATURAL IMAGES
EMERGENCE OF SIMPLE-CELL RECEPTIVE PROPERTIES BY LEARNING A SPARSE CODE FOR NATURAL IMAGES Bruno A. Olshausen & David J. Field Presenter: Ozgur Yigit Balkan Outline Linear data representations Sparse Vector
More informationSelf-Organizing Sparse Codes
Self-Organizing Sparse Codes Yangqing Jia Sergey Karayev 2010-12-16 Abstract Sparse coding as applied to natural image patches learns Gabor-like components that resemble those found in the lower areas
More informationSimple Method for High-Performance Digit Recognition Based on Sparse Coding
1 Simple Method for High-Performance Digit Recognition Based on Sparse Coding Kai Labusch Institute for Neuro- and Bioinformatics University of Lübeck D-23538 Lübeck labusch@inb.uni-luebeck.de Erhardt
More informationA Simple Vision System
Chapter 1 A Simple Vision System 1.1 Introduction In 1966, Seymour Papert wrote a proposal for building a vision system as a summer project [4]. The abstract of the proposal starts stating a simple goal:
More informationConvolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech
Convolutional Neural Networks Computer Vision Jia-Bin Huang, Virginia Tech Today s class Overview Convolutional Neural Network (CNN) Training CNN Understanding and Visualizing CNN Image Categorization:
More informationJoint design of data analysis algorithms and user interface for video applications
Joint design of data analysis algorithms and user interface for video applications Nebojsa Jojic Microsoft Research Sumit Basu Microsoft Research Nemanja Petrovic University of Illinois Brendan Frey University
More informationWhat is Computer Vision?
Perceptual Grouping in Computer Vision Gérard Medioni University of Southern California What is Computer Vision? Computer Vision Attempt to emulate Human Visual System Perceive visual stimuli with cameras
More informationFilter Select ion Model for Generating Visual Mot ion Signals
In: C. L. Giles, S. J. Hanson, and J. D. Cowan (Eds.) Advances in Neural Infirmation Processing Systems 5, Morgan Kaufman Publishers, San Mateo, CA, (1993). Filter Select ion Model for Generating Visual
More informationa labelled visual world, images together with scenes, in order to infer scenes from images. The image data might be single or multiple frames; the sce
To appear in: Adv. Neural Information Processing Systems 11, M. S. Kearns, S. A. Solla and D. A. Cohn, eds. MIT Press, 1999. Learning to estimate scenes from images William T. Freeman and Egon C. Pasztor
More informationRecursive ICA. Abstract
Recursive ICA Honghao Shan, Lingyun Zhang, Garrison W. Cottrell Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 92093-0404 {hshan,lingyun,gary}@cs.ucsd.edu
More informationThe cerebral cortex model that self-organizes conditional probability tables and executes belief propagation IJCNN 2007 #1065
The cerebral corte model that self-organizes conditional probability tables and eecutes belief propagation IJCNN 007 #065 National Institute of Advanced Industrial Science and TechnologyAIST, Japan uuji
More informationStatistical models of visual neurons
Statistical models of visual neurons Final Presentation Anna Sotnikova Applied Mathematics and Statistics, and Scientific Computation program Advisor: Dr. Daniel A. Butts Department of Biology 1 The power
More informationA Developing Sensory Mapping for Robots
A Developing Sensory Mapping for Robots Nan Zhang and John Weng Zhengyou Zhang Dept. of Computer Sci. and Engr. Microsoft Research Michigan State University Microsoft Corp. East Lansing, MI 48823 Redmond,
More informationOverview. Related Work Tensor Voting in 2-D Tensor Voting in 3-D Tensor Voting in N-D Application to Vision Problems Stereo Visual Motion
Overview Related Work Tensor Voting in 2-D Tensor Voting in 3-D Tensor Voting in N-D Application to Vision Problems Stereo Visual Motion Binary-Space-Partitioned Images 3-D Surface Extraction from Medical
More informationPresentation Outline. Semantic Segmentation. Overview. Presentation Outline CNN. Learning Deconvolution Network for Semantic Segmentation 6/6/16
6/6/16 Learning Deconvolution Network for Semantic Segmentation Hyeonwoo Noh, Seunghoon Hong,Bohyung Han Department of Computer Science and Engineering, POSTECH, Korea Shai Rozenberg 6/6/2016 1 2 Semantic
More informationAn Event-based Optical Flow Algorithm for Dynamic Vision Sensors
An Event-based Optical Flow Algorithm for Dynamic Vision Sensors Iffatur Ridwan and Howard Cheng Department of Mathematics and Computer Science University of Lethbridge, Canada iffatur.ridwan@uleth.ca,howard.cheng@uleth.ca
More informationPersistent Homology for Characterizing Stimuli Response in the Primary Visual Cortex
Persistent Homology for Characterizing Stimuli Response in the Primary Visual Cortex Avani Wildani Tatyana O. Sharpee (PI) ICML Topology 6/25/2014 The big questions - How do we turn sensory stimuli into
More informationAutoencoders, denoising autoencoders, and learning deep networks
4 th CiFAR Summer School on Learning and Vision in Biology and Engineering Toronto, August 5-9 2008 Autoencoders, denoising autoencoders, and learning deep networks Part II joint work with Hugo Larochelle,
More informationEECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline
EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT Oct. 15, 2013 Prof. Ronald Fearing Electrical Engineering and Computer Sciences University of California, Berkeley (slides courtesy of Prof. John Wawrzynek)
More informationSparse Models in Image Understanding And Computer Vision
Sparse Models in Image Understanding And Computer Vision Jayaraman J. Thiagarajan Arizona State University Collaborators Prof. Andreas Spanias Karthikeyan Natesan Ramamurthy Sparsity Sparsity of a vector
More informationProcessing Framework Proposed by Marr. Image
Processing Framework Proposed by Marr Recognition 3D structure; motion characteristics; surface properties Shape From stereo Motion flow Shape From motion Color estimation Shape From contour Shape From
More informationRepresenting the World
Table of Contents Representing the World...1 Sensory Transducers...1 The Lateral Geniculate Nucleus (LGN)... 2 Areas V1 to V5 the Visual Cortex... 2 Computer Vision... 3 Intensity Images... 3 Image Focusing...
More informationm Environment Output Activation 0.8 Output Activation Input Value
Learning Sensory-Motor Cortical Mappings Without Training Mike Spratling Gillian Hayes Department of Articial Intelligence University of Edinburgh mikes@dai.ed.ac.uk gmh@dai.ed.ac.uk Abstract. This paper
More informationRecognition with Bag-ofWords. (Borrowing heavily from Tutorial Slides by Li Fei-fei)
Recognition with Bag-ofWords (Borrowing heavily from Tutorial Slides by Li Fei-fei) Recognition So far, we ve worked on recognizing edges Now, we ll work on recognizing objects We will use a bag-of-words
More informationChapter 8: Convolutional Networks
Chapter 8: Convolutional Networks Dietrich Klakow Spoken Language Systems Saarland University, Germany Dietrich.Klakow@LSV.Uni-Saarland.De Neural Networks Implementation and Application Introduction Source:
More informationCAN THE SUN S DIRECTION BE ESTIMATED PRIOR TO THE DETERMINATION OF SHAPE?
CAN THE SUN S DIRECTION BE ESTIMATED PRIOR TO THE DETERMINATION OF SHAPE? WOJCIECH CHOJNACKI MICHAEL J. BROOKS Department of Computer Science, University of Adelaide Adelaide, SA 5005, Australia and DANIEL
More informationGeometry of Multiple views
1 Geometry of Multiple views CS 554 Computer Vision Pinar Duygulu Bilkent University 2 Multiple views Despite the wealth of information contained in a a photograph, the depth of a scene point along the
More informationNon-Differentiable Image Manifolds
The Multiscale Structure of Non-Differentiable Image Manifolds Michael Wakin Electrical l Engineering i Colorado School of Mines Joint work with Richard Baraniuk, Hyeokho Choi, David Donoho Models for
More informationFeatures that draw visual attention: an information theoretic perspective
Neurocomputing 65 66 (2005) 125 133 www.elsevier.com/locate/neucom Features that draw visual attention: an information theoretic perspective Neil D.B. Bruce a,b, a Department of Computer Science, York
More informationImage Representation by Active Curves
Image Representation by Active Curves Wenze Hu Ying Nian Wu Song-Chun Zhu Department of Statistics, UCLA {wzhu,ywu,sczhu}@stat.ucla.edu Abstract This paper proposes a sparse image representation using
More informationThink-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth?
Think-Pair-Share What visual or physiological cues help us to perceive 3D shape and depth? [Figure from Prados & Faugeras 2006] Shading Focus/defocus Images from same point of view, different camera parameters
More informationObject-Based Saliency Maps Harry Marr Computing BSc 2009/2010
Object-Based Saliency Maps Harry Marr Computing BSc 2009/2010 The candidate confirms that the work submitted is their own and the appropriate credit has been given where reference has been made to the
More informationDynamic visual attention: competitive versus motion priority scheme
Dynamic visual attention: competitive versus motion priority scheme Bur A. 1, Wurtz P. 2, Müri R.M. 2 and Hügli H. 1 1 Institute of Microtechnology, University of Neuchâtel, Neuchâtel, Switzerland 2 Perception
More informationSparsity and image processing
Sparsity and image processing Aurélie Boisbunon INRIA-SAM, AYIN March 6, Why sparsity? Main advantages Dimensionality reduction Fast computation Better interpretability Image processing pattern recognition
More informationNeural competitive structures for segmentation based on motion features
Neural competitive structures for segmentation based on motion features Javier Díaz 1, Sonia Mota 1, Eduardo Ros 1 and Guillermo Botella 1 1 Departamento de Arquitectura y Tecnología de Computadores, E.T.S.I.
More informationLearning to Perceive Transparency from the Statistics of Natural Scenes
Learning to Perceive Transparency from the Statistics of Natural Scenes Anat Levin Assaf Zomet Yair Weiss School of Computer Science and Engineering The Hebrew University of Jerusalem 9194 Jerusalem, Israel
More informationFacial Expression Classification with Random Filters Feature Extraction
Facial Expression Classification with Random Filters Feature Extraction Mengye Ren Facial Monkey mren@cs.toronto.edu Zhi Hao Luo It s Me lzh@cs.toronto.edu I. ABSTRACT In our work, we attempted to tackle
More informationSparse PCA Extracting Multi-scale Structure from Data
Proceedings International Conference on Computer Vision, pages 641-647, Vancouver, Canada, 21 c IEEE 1 Sparse PCA Extracting Multi-scale Structure from Data Chakra Chennubhotla & Allan Jepson Department
More informationOnline Learning for Object Recognition with a Hierarchical Visual Cortex Model
Online Learning for Object Recognition with a Hierarchical Visual Cortex Model Stephan Kirstein, Heiko Wersing, and Edgar Körner Honda Research Institute Europe GmbH Carl Legien Str. 30 63073 Offenbach
More informationBayesian Model of Dynamic Image Stabilization in the Visual System
Bayesian Model of Dynamic Image Stabilization in the Visual System The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Burak,
More informationA Modified Approach to Biologically Motivated Saliency Mapping
A Modified Approach to Biologically Motivated Saliency Mapping Shane Grant Department of Computer Science University of California, San Diego La Jolla, CA 9093 wgrant@ucsd.edu Kevin A Heins Department
More informationNonparametric sparse hierarchical models describe V1 fmri responses to natural images
Nonparametric sparse hierarchical models describe V1 fmri responses to natural images Pradeep Ravikumar, Vincent Q. Vu and Bin Yu Department of Statistics University of California, Berkeley Berkeley, CA
More informationWavelets, vision and the statistics of natural scenes
Wavelets, vision and the statistics of natural scenes By D. J. Field Uris Hall, Cornell University, Ithaca, NY 14853, USA The processing of spatial information by the visual system shows a number of similarities
More informationDoes the Brain do Inverse Graphics?
Does the Brain do Inverse Graphics? Geoffrey Hinton, Alex Krizhevsky, Navdeep Jaitly, Tijmen Tieleman & Yichuan Tang Department of Computer Science University of Toronto The representation used by the
More informationLearning transport operators for image manifolds
Learning transport operators for image manifolds Benjamin J. Culpepper Department of EECS Computer Science Division University of California, Berkeley Berkeley, CA 9472 bjc@cs.berkeley.edu Bruno A. Olshausen
More informationContent Based Image Retrieval Using Curvelet Transform
Content Based Image Retrieval Using Curvelet Transform Ishrat Jahan Sumana, Md. Monirul Islam, Dengsheng Zhang and Guojun Lu Gippsland School of Information Technology, Monash University Churchill, Victoria
More informationHierarchical Discriminative Sparse Coding via Bidirectional Connections
Hierarchical Discriminative Sparse Coding via Bidirectional Connections Zhengping Ji, Wentao Huang, Garrett Kenyon and Luis M. A. Bettencourt Abstract Conventional sparse coding learns optimal dictionaries
More informationPerception, Part 2 Gleitman et al. (2011), Chapter 5
Perception, Part 2 Gleitman et al. (2011), Chapter 5 Mike D Zmura Department of Cognitive Sciences, UCI Psych 9A / Psy Beh 11A February 27, 2014 T. M. D'Zmura 1 Visual Reconstruction of a Three-Dimensional
More informationA RECOGNITION SYSTEM THAT USES SACCADES TO DETECT CARS FROM REAL-TIME VIDEO STREAMS. Predrag Neskouic, Leon N Cooper* David Schuster t
Proceedings of the 9th International Conference on Neural Information Processing (ICONIP'OZ), Vol. 5 Lip0 Wang, Jagath C. Rajapakse, Kunihiko Fukushima, Soo-Young Lee, and Xin Yao (Editors) A RECOGNITION
More informationNeurally Inspired Mechanisms for the Dynamic Visual Attention Map Generation Task
Neurally Inspired Mechanisms for the Dynamic Visual Attention Map Generation Task Maria T. López 1, Miguel A. Fernández 1, Antonio Fernández-Caballero 1, and Ana E. Delgado 2 1 Departamento de Informática
More informationArbib: Slides for TMB2 Section 7.2 1
Arbib: Slides for TMB2 Section 7.2 1 Lecture 20: Optic Flow Reading assignment: TMB2 7.2 If, as we walk forward, we recognize that a tree appears to be getting bigger, we can infer that the tree is in
More informationIllumination and Reflectance
COMP 546 Lecture 12 Illumination and Reflectance Tues. Feb. 20, 2018 1 Illumination and Reflectance Shading Brightness versus Lightness Color constancy Shading on a sunny day N(x) L N L Lambert s (cosine)
More informationHigh-level Vision as Statistical Inference. Daniel Kersten. SHORT TITLE: Vision as Statistical Inference.
1 High-level Vision as Statistical Inference Daniel Kersten SHORT TITLE: Vision as Statistical Inference kersten@tc.umn.edu Department of Psychology, University of Minnesota, 75 East River Road,Minneapolis,
More informationLecture 2: Spatial And-Or graph
Lecture 2: Spatial And-Or graph for representing the scene-object-part-primitive hierarchy Song-Chun Zhu Center for Vision, Cognition, Learning and Arts University of California, Los Angeles At CVPR, Providence,
More information