Efficient Large-Scale Stereo Matching

Similar documents
Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

StereoScan: Dense 3D Reconstruction in Real-time

Multiple View Geometry

Stereo Vision II: Dense Stereo Matching

A Patch Prior for Dense 3D Reconstruction in Man-Made Environments

Static Scene Reconstruction

Geometric Reconstruction Dense reconstruction of scene geometry

Global Stereo Matching Leveraged by Sparse Ground Control Points

Improved depth map estimation in Stereo Vision

Real-time Global Stereo Matching Using Hierarchical Belief Propagation

CS 4495/7495 Computer Vision Frank Dellaert, Fall 07. Dense Stereo Some Slides by Forsyth & Ponce, Jim Rehg, Sing Bing Kang

EUSIPCO

In ICIP 2017, Beijing, China MONDRIAN STEREO. Middlebury College Middlebury, VT, USA

CONTENTS. High-Accuracy Stereo Depth Maps Using Structured Light. Yeojin Yoon

Part 3: Dense Stereo Correspondence

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

Stereo Matching.

PatchMatch Stereo - Stereo Matching with Slanted Support Windows

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5)

SPM-BP: Sped-up PatchMatch Belief Propagation for Continuous MRFs. Yu Li, Dongbo Min, Michael S. Brown, Minh N. Do, Jiangbo Lu

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Priors for Stereo Vision under Adverse Weather Conditions

Stereo and Epipolar geometry

Segmentation Based Stereo. Michael Bleyer LVA Stereo Vision

Today. Stereo (two view) reconstruction. Multiview geometry. Today. Multiview geometry. Computational Photography

Reconstructing Reflective and Transparent Surfaces from Epipolar Plane Images

Stereo Matching with Reliable Disparity Propagation

Stereo Vision Based Traversable Region Detection for Mobile Robots Using U-V-Disparity

Stereo. Many slides adapted from Steve Seitz

Epipolar Geometry and Stereo Vision

Human Body Recognition and Tracking: How the Kinect Works. Kinect RGB-D Camera. What the Kinect Does. How Kinect Works: Overview

Dense 3D Reconstruction. Christiano Gava

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from?

Lecture 10: Multi view geometry

A novel 3D torso image reconstruction procedure using a pair of digital stereo back images

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Stereo vision. Many slides adapted from Steve Seitz

Graph Cut based Continuous Stereo Matching using Locally Shared Labels

In Defense of 3D-Label Stereo

Stereo Vision. MAN-522 Computer Vision

Stereo Vision in Structured Environments by Consistent Semi-Global Matching

Recap from Previous Lecture

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

segments. The geometrical relationship of adjacent planes such as parallelism and intersection is employed for determination of whether two planes sha

Camera Drones Lecture 3 3D data generation

From Orientation to Functional Modeling for Terrestrial and UAV Images

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Input. Output. Problem Definition. Rectified stereo image pair All correspondences lie in same scan lines

Bilateral and Trilateral Adaptive Support Weights in Stereo Vision

Introduction à la vision artificielle X

BIL Computer Vision Apr 16, 2014

Multi-view Stereo. Ivo Boyadzhiev CS7670: September 13, 2011

Epipolar Geometry and Stereo Vision

Geometry based Repetition Detection for Urban Scene

Using temporal seeding to constrain the disparity search range in stereo matching

Dense 3D Reconstruction. Christiano Gava

EECS 442 Computer vision. Stereo systems. Stereo vision Rectification Correspondence problem Active stereo vision systems

Multi-Flash Stereopsis: Depth Edge Preserving Stereo with Small Baseline Illumination

Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation

Integrating LIDAR into Stereo for Fast and Improved Disparity Computation

Surface Normal Aided Dense Reconstruction from Images

Multiray Photogrammetry and Dense Image. Photogrammetric Week Matching. Dense Image Matching - Application of SGM

Epipolar Geometry CSE P576. Dr. Matthew Brown

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

EE795: Computer Vision and Intelligent Systems

Real-Time Disparity Map Computation Based On Disparity Space Image

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

A Comparative Study of Stereovision Algorithms

Subpixel accurate refinement of disparity maps using stereo correspondences

Data Term. Michael Bleyer LVA Stereo Vision

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

Evaluation of Different Methods for Using Colour Information in Global Stereo Matching Approaches

Automatic Disparity Search Range Estimation for Stereo Pairs of Unknown Scenes

Embedded real-time stereo estimation via Semi-Global Matching on the GPU

Temporally Consistence Depth Estimation from Stereo Video Sequences

Fast and Robust 3D Terrain Surface Reconstruction of Construction Site Using Stereo Camera

Elevation-Based MRF Stereo Implemented in Real-Time on a GPU

Towards Real-time Stereo using Non-uniform Image Sampling and Sparse Dynamic Programming

Segment-based Stereo Matching Using Graph Cuts

Final project bits and pieces

3D Photography: Stereo Matching

Stereo Correspondence with Occlusions using Graph Cuts

Project Updates Short lecture Volumetric Modeling +2 papers

Flow Estimation. Min Bai. February 8, University of Toronto. Min Bai (UofT) Flow Estimation February 8, / 47

arxiv: v1 [cs.cv] 6 Apr 2012

3D Sensing and Reconstruction Readings: Ch 12: , Ch 13: ,

CS5670: Computer Vision

Lecture 19: Depth Cameras. Visual Computing Systems CMU , Fall 2013

POST PROCESSING VOTING TECHNIQUES FOR LOCAL STEREO MATCHING

Probabilistic Correspondence Matching using Random Walk with Restart

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Evaluation of Stereo Algorithms for 3D Object Recognition

CHAPTER 3 DISPARITY AND DEPTH MAP COMPUTATION

MACHINE VISION APPLICATIONS. Faculty of Engineering Technology, Technology Campus, Universiti Teknikal Malaysia Durian Tunggal, Melaka, Malaysia

3D RECONSTRUCTION FROM STEREO/ RANGE IMAGES

Lecture 10: Multi-view geometry

Massively Parallel Multiview Stereopsis by Surface Normal Diffusion

Kinect Device. How the Kinect Works. Kinect Device. What the Kinect does 4/27/16. Subhransu Maji Slides credit: Derek Hoiem, University of Illinois

Depth from Stereo. Dominic Cheng February 7, 2018

Transcription:

Efficient Large-Scale Stereo Matching Andreas Geiger*, Martin Roser* and Raquel Urtasun** *KARLSRUHE INSTITUTE OF TECHNOLOGY **TOYOTA TECHNOLOGICAL INSTITUTE AT CHICAGO KIT University of the State of Baden-Wuerttemberg and National Laboratory of the Helmholtz Association www.kit.edu

Contents 1 Motivation and Related Work 2 Efficient Large-Scale Stereo Matching 3 Experimental Evaluation 4 Summary and Future Work Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 2/24

Contents 1 Motivation and Related Work 2 Efficient Large-Scale Stereo Matching 3 Experimental Evaluation 4 Summary and Future Work Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 3/24

Motivation Camera Laserscanner Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 4/24

Why is 3D from Stereo hard? Ambiguities Textureless regions Sensor saturation Non-Lambertian surfaces z grows quadratically Computational burden Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 5/24

Why is 3D from Stereo hard? Ambiguities Textureless regions Sensor saturation Non-Lambertian surfaces z grows quadratically Computational burden Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 5/24

Why is 3D from Stereo hard? Ambiguities Textureless regions Sensor saturation Non-Lambertian surfaces z grows quadratically Computational burden distance error focal length disparity error baseline Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 5/24

Related Work: Local Methods Local Methods Winner-takes-All Examples Block matching (Scharstein 02) Adaptive windows (Kanade 94, Yoon 06) Plane-sweep (Collins 96, Gallup 07) Problems Small matching ratios Border bleeding Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 6/24

Related Work: Global Methods Global Methods Minimize 1D/2D energy E(d) = E data (d) + λe smooth (d) Examples Graph cuts, Belief propagation (Kolmogorov 02, Felzenszwalb 06) Variational methods (Pock 07, Zach 09) Fusion moves (Woodford 08, Bleyer 10) Problems Computational and memory requirements Pairwise potentials can not model planarity Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 7/24

Related Work: Seed-and-Grow Seed-and-Grow Methods Grow disparity components from random seeds Examples (Cech 07) (Sara 03) Problems Slanted/textureless surfaces No dense disparity maps Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 8/24

Contents 1 Motivation and Related Work 2 Efficient Large-Scale Stereo Matching 3 Experimental Evaluation 4 Summary and Future Work Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 9/24

Idea Assumption: rectified images Image pairs contain easy and hard correspondences Robustly match easy correspondences on regular grid Build prior on dense search space dense matching Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 10/24

Idea Easy Hard Assumption: rectified images Image pairs contain easy and hard correspondences Robustly match easy correspondences on regular grid Build prior on dense search space dense matching Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 10/24

Idea Robust Match Assumption: rectified images Image pairs contain easy and hard correspondences Robustly match easy correspondences on regular grid Build prior on dense search space dense matching Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 10/24

Idea Assumption: rectified images Image pairs contain easy and hard correspondences Robustly match easy correspondences on regular grid Build prior on dense search space dense matching Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 10/24

Efficient Large-Scale Stereo Notation Robust support points S = {s 1,..., s M } with s m = (u m v m d m ) T Disparity d n N Observations o n = (u n v n f n ) T Local image features f n Algorithm Split image domain into support points S and dense pixels Assume factorization of distribution over disparity, observations and support points into... Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 11/24

Efficient Large-Scale Stereo Notation Robust support points S = {s 1,..., s M } with s m = (u m v m d m ) T Disparity d n N Observations o n = (u n v n f n ) T Local image features f n Algorithm Split image domain into support points S and dense pixels Assume factorization of distribution over disparity, observations and support points into... Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 11/24

Efficient Large-Scale Stereo Notation Robust support points S = {s 1,..., s M } with s m = (u m v m d m ) T Disparity d n N Observations o n = (u n v n f n ) T Local image features f n Algorithm Split image domain into support points S and dense pixels Assume factorization of distribution over disparity, observations and support points into... Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 11/24

Model Left Image Right Image Graphical Model Support Points Support Points Observation in left image Observations p(d n, o (l) n, o (r) n, S) p(d n S, o (l) }{{} Prior n ) p(o (r) n o (l) n, d n ) }{{} Likelihood Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 12/24

Model Left Image Right Image Graphical Model Support Points Support Points Observation in left image Observations p(d n, o (l) n, o (r) n, S) p(d n S, o (l) }{{} Prior n ) p(o (r) n o (l) n, d n ) }{{} Likelihood Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 12/24

Prior and Likelihood Prior p(d n S, o (l) n ) Support pt. triangulation Piecew. linear manifold Local extrapolation Likelihood p(o (r) n o (l) n, d n ) Laplace distribution 5 5 block window 3 3 Sobel filter Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 13/24

Prior and Likelihood Prior p(d n S, o (l) n ) Support pt. triangulation Piecew. linear manifold Local extrapolation Likelihood p(o (r) n o (l) n, d n ) Laplace distribution 5 5 block window 3 3 Sobel filter Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 13/24

Sampling from the model Left image Sample mean Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 14/24

Sampling from the model Left image Sample mean Right image Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 14/24

Contents 1 Motivation and Related Work 2 Efficient Large-Scale Stereo Matching 3 Experimental Evaluation 4 Summary and Future Work Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 15/24

Middlebury Benchmark 900 x 750 pixels, ground truth Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 16/24

Middlebury Benchmark 900 x 750 pixels, 0.4 seconds Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 16/24

Middlebury Benchmark 1300 x 1100 pixels, ground truth Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 16/24

Middlebury Benchmark 1300 x 1100 pixels, 1 second Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 16/24

Accuracy (on cones image pair) Error >2px [%] 20 15 10 uniform prior Cech 07 Felzenszwalb 06 Kolmogorov 01 Kostkova 03 our method 5 0 0 0.5 1 1.5 2 Image resolution [Megapixel] Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 17/24

Running times (on cones image pair) 2000 1500 Kolmogorov 01 Kostkova 03 our method 50 40 Felzenszwalb 06 Cech 07 uniform prior our method Running time [s] 1000 Running time [s] 30 20 500 10 0 0 0.5 1 1.5 2 Image resolution [Megapixel] 0 0 0.5 1 1.5 2 Image resolution [Megapixel] [For more details see: Geiger et al., ACCV 2010] Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 18/24

3D Reconstruction: Brussels 2 seconds [http://cvlab.epfl.ch/data/strechamvs/] Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 19/24

3D Face Reconstruction [http://www.fujifilm.com/products/3d] Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 20/24

Urban Scene Reconstruction Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 21/24

Contents 1 Motivation and Related Work 2 Efficient Large-Scale Stereo Matching 3 Experimental Evaluation 4 Summary and Future Work Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 22/24

Summary Simple prior based on sparse feature matches Reduced ambiguities and run-time Takes into account slanted surfaces Real-time 3D reconstruction of static scenes on CPU C++ / MATLAB code available at http://cvlibs.net Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 23/24

Summary Simple prior based on sparse feature matches Reduced ambiguities and run-time Takes into account slanted surfaces Real-time 3D reconstruction of static scenes on CPU C++ / MATLAB code available at http://cvlibs.net Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 23/24

Summary Simple prior based on sparse feature matches Reduced ambiguities and run-time Takes into account slanted surfaces Real-time 3D reconstruction of static scenes on CPU C++ / MATLAB code available at http://cvlibs.net Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 23/24

Summary Simple prior based on sparse feature matches Reduced ambiguities and run-time Takes into account slanted surfaces Real-time 3D reconstruction of static scenes on CPU C++ / MATLAB code available at http://cvlibs.net Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 23/24

Summary Simple prior based on sparse feature matches Reduced ambiguities and run-time Takes into account slanted surfaces Real-time 3D reconstruction of static scenes on CPU C++ / MATLAB code available at http://cvlibs.net Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 23/24

Future Work Develop better priors Incorporate segmentation / global reasoning on lines GPU implementation (goal: 20 fps at 1-2 megapixels) Employ as unitary potentials on global methods smaller label sets Thank you! Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 24/24

Future Work Develop better priors Incorporate segmentation / global reasoning on lines GPU implementation (goal: 20 fps at 1-2 megapixels) Employ as unitary potentials on global methods smaller label sets Thank you! Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 24/24

Future Work Develop better priors Incorporate segmentation / global reasoning on lines GPU implementation (goal: 20 fps at 1-2 megapixels) Employ as unitary potentials on global methods smaller label sets Thank you! Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 24/24

Future Work Develop better priors Incorporate segmentation / global reasoning on lines GPU implementation (goal: 20 fps at 1-2 megapixels) Employ as unitary potentials on global methods smaller label sets Thank you! Efficient Large-Scale Stereo Matching Code: http://cvlibs.net 24/24