Computational Perception. Visual Coding 3

Similar documents
Neurophysical Model by Barten and Its Development

From natural scene statistics to models of neural coding and representation (part 1)

Image Formation. Camera trial #1. Pinhole camera. What is an Image? Light and the EM spectrum The H.V.S. and Color Perception

Modeling the top neuron layers of the human retina. Laurens Koppenol ( ) Universiteit Utrecht

Introduction to color science

Visual Acuity. Visual acuity

Visual Acuity. Excerpts from a lecture by: Harold E. Bedell. Physiological Optics & Vision Science

(0, 1, 1) (0, 1, 1) (0, 1, 0) What is light? What is color? Terminology

CS4670: Computer Vision

CSE 167: Lecture #6: Color. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2012

Visual Acuity. Adler s Physiology of the Eye 11th Ed. Chapter 33 - by Dennis Levi.

CS5670: Computer Vision

Color Vision. Spectral Distributions Various Light Sources

Lecture 1 Image Formation.

POVS Basic Vision Science Core. Visual Acuity

CSE 167: Lecture #6: Color. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2011

Vision: From Eye to Brain (Chap 3)

Reading. 2. Color. Emission spectra. The radiant energy spectrum. Watt, Chapter 15.

Optics of Vision. MATERIAL TO READ Web: 1.

Light. Properties of light. What is light? Today What is light? How do we measure it? How does light propagate? How does light interact with matter?

The Human Visual System!

Lenses: Focus and Defocus

Human Perception of Objects

Topographic Mapping with fmri

Representing the World

2012 Imaging Science Ph.D. Comprehensive Examination June 15, :00AM to 1:00PM IMPORTANT INSTRUCTIONS

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

Image Formation. CS418 Computer Graphics Eric Shaffer.

Computer Vision I - Image Matching and Image Formation

Announcements. Light. Properties of light. Light. Project status reports on Wednesday. Readings. Today. Readings Szeliski, 2.2, 2.3.

Color and Shading. Color. Shapiro and Stockman, Chapter 6. Color and Machine Vision. Color and Perception

Computer Graphics. The Human Visual System (HVS) Philipp Slusallek

EXAM SOLUTIONS. Computer Vision Course 2D1420 Thursday, 11 th of march 2003,

Light. Computer Vision. James Hays

Exercise: Simulating spatial processing in the visual pathway with convolution

SPARSE CODES FOR NATURAL IMAGES. Davide Scaramuzza Autonomous Systems Lab (EPFL)

Introduction to Neural Networks Overview of the visual system

CS6670: Computer Vision

Spectral Color and Radiometry

Visual areas in the brain. Image removed for copyright reasons.

Perceptual Effects in Real-time Tone Mapping

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

CS6670: Computer Vision

Reading. 4. Color. Outline. The radiant energy spectrum. Suggested: w Watt (2 nd ed.), Chapter 14. Further reading:

Color. Computational Photography MIT Feb. 14, 2006 Bill Freeman and Fredo Durand

CS4670: Computer Vision

Efficient Visual Coding: From Retina To V2

Detector systems for light microscopy

Measuring Light: Radiometry and Photometry

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

this is processed giving us: perceived color that we actually experience and base judgments upon.

Color. Phillip Otto Runge ( )

Robust Face Recognition via Sparse Representation

Chapter 5 Extraction of color and texture Comunicação Visual Interactiva. image labeled by cluster index

Active Fixation Control to Predict Saccade Sequences Supplementary Material

C. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun

The Elements of Colour

A New Time-Dependent Tone Mapping Model

Improving visual function diagnostic metrics. Charles Campbell

Improvement of optic flow estimation from an event-based sensor with recurrently connected processing layers

Phase. E = A sin(2p f t+f) (wave in time) or E = A sin(2p x/l +f) (wave in space)

Lecture 4 Recap of PHYS110-1 lecture Physical Optics - 4 lectures EM spectrum and colour Light sources Interference and diffraction Polarization

Visual Pathways to the Brain

The Display pipeline. The fast forward version. The Display Pipeline The order may vary somewhat. The Graphics Pipeline. To draw images.

CSE 167: Introduction to Computer Graphics Lecture #6: Colors. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2013

One of the important tasks of the visual system is to interpret

Measuring Light: Radiometry and Photometry

Colour appearance and the interaction between texture and colour

Corner Detection. GV12/3072 Image Processing.

(A) Electromagnetic. B) Mechanical. (C) Longitudinal. (D) None of these.

Digital Image Fundamentals

Lecture 11. Color. UW CSE vision faculty

Spectral Images and the Retinex Model

25-1 Interference from Two Sources

Modern Signal Processing and Sparse Coding

Motivation. Intensity Levels

A Hierarchial Model for Visual Perception

Perception as an Inference Problem

Local invariant features

( ) = First Bessel function, x = π Dθ

What is Color and How is It Measured?

Computer Vision Course Lecture 02. Image Formation Light and Color. Ceyhun Burak Akgül, PhD cba-research.com. Spring 2015 Last updated 04/03/2015

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

Topic 5 Image Compression

Visualizing Flow Fields by Perceptual Motion

IMPORTANT INSTRUCTIONS

Colour Reading: Chapter 6. Black body radiators

MODELING LED LIGHTING COLOR EFFECTS IN MODERN OPTICAL ANALYSIS SOFTWARE LED Professional Magazine Webinar 10/27/2015

Image Analysis and Formation (Formation et Analyse d'images)

Capturing the sampling effects: a TOD sensor performance model

Feature selection and dimensionality reduction of neural data to model adaptation in the retina

An introduction to 3D image reconstruction and understanding concepts and ideas

Color Appearance in Image Displays. O Canada!

Local Linear Models for Improved von Kries Adaptation

Face Recognition by Independent Component Analysis

SUPPLEMENTARY FILE S1: 3D AIRWAY TUBE RECONSTRUCTION AND CELL-BASED MECHANICAL MODEL. RELATED TO FIGURE 1, FIGURE 7, AND STAR METHODS.

COSC579: Scene Geometry. Jeremy Bolton, PhD Assistant Teaching Professor

A Video Watermarking Algorithm Based on the Human Visual System Properties

Physics Midterm I

Optimal disparity estimation in natural stereo- images (Supplement) Johannes Burge & Wilson S. Geisler

Transcription:

Computational Perception 15-485/785 February 21, 2008 Visual Coding 3

A gap in the theory? - - + - - from Hubel, 1995 2

Eye anatomy from Hubel, 1995 Photoreceptors: rods (night vision) and cones (day vision) Other layers: processing to enhance SNR and maximize information transmission All visual information conveyed to brain through ~1 million optic nerve fibers CP08:Visual Coding 3 3

Retinal cross section: the real thing from Wandell, 1995 4

Distribution of rods and cones from Rodeick, 1998 Maximum accuity is in the fovea: - about 50,000 cones, each about 30 seconds of visual angle - no sharp border, but about 250 cones across - about 2 of visual angle CP08:Visual Coding 3 5

Photoreceptor mosaic in macaque monkey Light circles S cones. Dark circles M&L cones. Small circles rods. from Wandell, 1995 CP08:Visual Coding 3 6

Light intensity spans an enormous range luminance catch rate (cd/m 2 ) (photons/s) environment significance 10 6 1/5100 s overcast night sky rod threshold 10 3 1/5 s starlight 3/s cone threshold 10 2 full moon 10/s colors visible 10 1 100/s sunrise 10 2 indoor lighting 10 3 best accuity 10 4 clear sky at noon 10 5 300,000/s patch of snow at noon 1,000,000/s maximal cone rate 7

Scotopic Vision: rods and night vision from Rodeick, 1998 Photons on retina: sky-tree border in overcast night sky 8

Rods sensitivity Rods respond reliably to single photons. - Do we really need this degree of sensitivity? Suppose instead of 1 photon, a rod needed 2 photons (within 0.1 sec). - How much more light would we need to get same output? Relevant facts: - photon catch rate is 1/5100 secs or once every 85 hours - but, there are 10 8 rods - photons arrive at random times Poisson distribution What is the rate of seeing two photon within 0.1 sec? - Once every 16 years. Need 100,000 times intensity to achieve same discriminability. - In short: two photon sensitivity no night vision. 9

Dynamic range of rods from Rodeick, 1998 Retinal circuitry enhances sensitivity by summing the output across many rods Rods are also insensitive to the direction of light What about cones? CP08:Visual Coding 3 10

Dynamic range of rods and cones from Rodeick, 1998 Cones do not saturate CP08:Visual Coding 3 11

Cones are adapted for daylight vision Cones are sensitive to the direction of photon arrival from Rodeick, 1998 Cones are aligned to optimize focus 12

Color vision: Cone spectral sensitivity curves from Wandell, 1995 Three types of spectral sensitivity curves: long, medium, and short wavelength. Response of an individual cone is the integral of the product of the curve and the spectrum at that point. CP08:Visual Coding 3 13

Sampling of color space from Wandell, 1995 These two spectra are indistinguishable. The perception of color starts from here, but gets far more complicated. CP08:Visual Coding 3 14

There are far more M&L cones than S cones: Why? Light circles S cones. Dark circles M&L cones. Small circles rods. from Wandell, 1995 CP08:Visual Coding 3 15

Chromatic aberration from Wandell, 1995 16

Focus is matched to M & L cones from Rodeick, 1998 17

Modulation transfer function 3 mm pupil diameter. Eye in focus at 580 nm and diffraction limited. No significant contrast beyond 4 cpd at short wavelengths. CP08:Visual Coding 3 18

Cone spatial mosaic measured in living human retina Roorda and Williams (1999). False color: blue, green, red represent S, M, and L cones. Subject JW temporal (a) and nasal (b) retina, Subject AN nasal retina (c) Subject % L cones % M cones % S cones L:M ratio JW 75.8 20.0 4.2 3.8 : 1 AN 50.6 44.2 5.2 1.2 : 1 CP08:Visual Coding 3 19

Back to computation

Redundancy reduction for noisy channels (Atick, 1992) s input channel y = x + ν x recoding A Ax output channel y = Ax + δ y Mutual information I(x, s) = s,x [ ] P (x, s) P (x, s) log 2 P (s)p (x) I(x, s) = 0 iff P (x, s) = P (x)p (s), i.e. x and s are independent. 21

Profiles of optimal filters - - + - - high SNR - reduce redundancy - center-surround - - + - - + low SNR - average - low-pass filter matches behavior of retinal ganglion cells CP08:Visual Coding 3 22

An observation: Contrast sensitivity of ganglion cells Luminance level decreases one log unit each time we go to lower curve. 23

Natural images have a 1/f amplitude spectrum Field, 1987 CP08:Visual Coding 3 24

Components of predicted filters whitening filter low-pass filter optimal filter from Atick, 1992 25

Predicted contrast sensitivity functions match neural data from Atick, 1992 26

Robust coding of natural images Theory refined: - image is noisy and blurred - neural population size changes - neurons are noisy from Hubel, 1995 (a) Undistorted image (b) Fovea retinal image (c) 40 degrees eccentricity Intensity -4 0 4 Visual angle [arc min] -4 0 4 from Doi and Lewicki, 2006 CP08:Visual Coding 3 27

Problem 1: Real neurons are noisy Estimates of neural information capacity system (area) stimulus bits / sec bits / spike fly visual (H1) motion 64 ~1 monkey visual (MT) motion 5.5-12 0.6-1.5 frog auditory (auditory nerve) noise & call 46 & 133 1.4 & 7.8 Salamander visual (ganglinon cells) rand. spots 3.2 1.6 cricket cercal (sensory afferent) mech. motion 294 3.2 cricket cercal (sensory afferent) wind noise 75-220 0.6-3.1 cricket cercal (10-2 and 10-3) wind noise 8-80 avg. = 1 Electric fish (P-afferent) amp. modulation 0-200 0-1.2 Limited capacity neural codes need to be robust. After Borst and Theunissen, 1999 28

Traditional codes are not robust encoding neurons Original sensory input 29

Traditional codes are not robust encoding neurons sensory input Add noise equivalent to 1 bit precision Original 1x efficient coding reconstruction (34% error) 1 bit precision 30

Redundancy plays an important role in neural coding Response of salamander retinal ganglion cells to natural movies fractional redundancy cell pair distance (μm) 31 From Puchalla et al, 2005

Robust coding (Doi and Lewicki, 2005,2006) x W u A x^ Data Encoder Noiseless Representation Decoder Reconstruction Model limited precision using additive channel noise: x W u r A x^ Data Encoder Noiseless Representation Channel Noise n Noisy Representation Decoder Reconstruction r = Wx + n = u + n. ˆx = Ar = AWx + An. Caveat: linear, but can evaluate for different noise levels. 32

Generalizing the model: sensory noise and optical blur (a) Undistorted image (b) Fovea retinal image (c) 40 degrees eccentricity Intensity -4 0 4 Visual angle [arc min] -4 0 4 s optical blur H sensory noise channel noise ν δ encoder x W r only implicit decoder A ŝ image observation representation reconstruction Can also add sparseness and resource constraints 33

How do we learn robust codes? sensory noise channel noise ν δ optical blur encoder decoder s H x W r A ŝ image observation representation reconstruction Objective: Find W and A that minimize reconstruction error. Channel capacity of the i th neuron: C i = 1 2 ln(snr i + 1) To limit capacity, fix the coefficient signal to noise ratio: SNR i = u2 i σ 2 n Now robust coding is formulated as a constrained optimization problem. 34

Robust coding of natural images encoding neurons sensory input Add noise equivalent to 1 bit precision Original 1x efficient coding reconstruction (34% error) 1 bit precision 35

Robust coding of natural images encoding neurons sensory input Weights adapted for optimal robustness Original 1x robust coding reconstruction (3.8% error) 1 bit precision 36

Reconstruction improves by adding neurons encoding neurons sensory input Weights adapted for optimal robustness Original reconstruction error: 0.6% 8x robust coding 1 bit precision 37

Adding precision to 1x robust coding original images 1 bit: 12.5% error 2 bit: 3.1% error 38

Can derive minimum theoretical average error bound E = 1 1 N SNR + 1 N M λ i tha - ith eigenvalue of the data covariance N - input dimensionality [ N ] 2 λi i=1 M - # of coding units (neurons) if SNR SNR c Algorithm achieves theoretical lower bound Results Bound 0.5x 19.9% 20.3% 1x 12.4% 12.5% 8x 2.0% 2.0% Balcan, Doi, and Lewicki, 2007; Balcan and Lewicki, 2007 39

Sparseness localizes the vectors and increases coding efficiency encoding vectors (W) decoding vectors (A) 100 80 normalized histogram of coeff. kurtosis Kurtosis of RC avg. = 4.9 robust coding % frequency % 60 40 20 robust sparse coding average error is unchanged 12.5% 12.5% 40 0 100 80 % 60 40 20 0 4 6 8 10 kurtosis kurtosis Kurtosis of RSC 4 6 8 10 kurtosis avg. = 7.3

Non-zero resource constraints localize weights fovea (a) 20dB 10dB 0dB -10dB (b) (c) 40 degrees 20dB average error is also unchanged -10dB 41 from Doi and Lewicki, 2007