Real-Time Disparity Map Computation Based On Disparity Space Image

Similar documents
A FAST SEGMENTATION-DRIVEN ALGORITHM FOR ACCURATE STEREO CORRESPONDENCE. Stefano Mattoccia and Leonardo De-Maeztu

segments. The geometrical relationship of adjacent planes such as parallelism and intersection is employed for determination of whether two planes sha

Real-time Global Stereo Matching Using Hierarchical Belief Propagation

Improved depth map estimation in Stereo Vision

A locally global approach to stereo correspondence

Stereo Matching with Reliable Disparity Propagation

Stereo Matching.

Segment-based Stereo Matching Using Graph Cuts

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Improving the accuracy of fast dense stereo correspondence algorithms by enforcing local consistency of disparity fields

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

MACHINE VISION APPLICATIONS. Faculty of Engineering Technology, Technology Campus, Universiti Teknikal Malaysia Durian Tunggal, Melaka, Malaysia

A novel heterogeneous framework for stereo matching

On Building an Accurate Stereo Matching System on Graphics Hardware

Evaluation of Stereo Algorithms for 3D Object Recognition

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Classification and evaluation of cost aggregation methods for stereo correspondence

Hierarchical Belief Propagation To Reduce Search Space Using CUDA for Stereo and Motion Estimation

A Performance Study on Different Cost Aggregation Approaches Used in Real-Time Stereo Matching

A novel 3D torso image reconstruction procedure using a pair of digital stereo back images

Stereo Correspondence with Occlusions using Graph Cuts

Probabilistic Correspondence Matching using Random Walk with Restart

Near real-time Fast Bilateral Stereo on the GPU

A Local Iterative Refinement Method for Adaptive Support-Weight Stereo Matching

Stereo Video Processing for Depth Map

A Near Real-Time Color Stereo Matching Method for GPU

Fast correlation-based stereo matching with the reduction of systematic errors

Temporally Consistence Depth Estimation from Stereo Video Sequences

Perceptual Grouping from Motion Cues Using Tensor Voting

Chaplin, Modern Times, 1936

CS5670: Computer Vision

Stereo Matching: An Outlier Confidence Approach

Fast Stereo Matching using Adaptive Window based Disparity Refinement

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Asymmetric 2 1 pass stereo matching algorithm for real images

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

CONTENTS. High-Accuracy Stereo Depth Maps Using Structured Light. Yeojin Yoon

Stereo Vision II: Dense Stereo Matching

STEREO matching has been one of the most active research

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters

Segmentation-based Disparity Plane Fitting using PSO

On-line and Off-line 3D Reconstruction for Crisis Management Applications

Final project bits and pieces

A virtual tour of free viewpoint rendering

3D RECONSTRUCTION FROM STEREO/ RANGE IMAGES

Data Term. Michael Bleyer LVA Stereo Vision

Filter Flow: Supplemental Material

Announcements. Stereo Vision Wrapup & Intro Recognition

What have we leaned so far?

Optimizing Monocular Cues for Depth Estimation from Indoor Images

Near Real-Time Stereo Matching Using Geodesic Diffusion

Recap from Previous Lecture

Linear stereo matching

Stereo Vision Based Image Maching on 3D Using Multi Curve Fitting Algorithm

Generation of a Disparity Map Using Piecewise Linear Transformation

Adaptive Support-Weight Approach for Correspondence Search

Stereo: Disparity and Matching

Introduction à la vision artificielle X

Global Stereo Matching Leveraged by Sparse Ground Control Points

Robot localization method based on visual features and their geometric relationship

Stereo Vision in Structured Environments by Consistent Semi-Global Matching

Image Based Reconstruction II

EE795: Computer Vision and Intelligent Systems

SIMPLE BUT EFFECTIVE TREE STRUCTURES FOR DYNAMIC PROGRAMMING-BASED STEREO MATCHING

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

A 3D Reconstruction System Based on Improved Spacetime Stereo

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5)

hsgm: Hierarchical Pyramid Based Stereo Matching Algorithm

DISTANCE MEASUREMENT USING STEREO VISION

Integrating LIDAR into Stereo for Fast and Improved Disparity Computation

Bilateral and Trilateral Adaptive Support Weights in Stereo Vision

Symmetric Sub-Pixel Stereo Matching

A New Approach for Stereo Matching Algorithm with Dynamic Programming

Lecture 10: Multi view geometry

Subpixel accurate refinement of disparity maps using stereo correspondences

Image Guided Cost Aggregation for Hierarchical Depth Map Fusion

Graph Cut based Continuous Stereo Matching using Locally Shared Labels

Towards a Simulation Driven Stereo Vision System

A New Parallel Implementation of DSI Based Disparity Computation Using CUDA

Joint Histogram Based Cost Aggregation for Stereo Matching

Fast Stereo Matching of Feature Links

Lecture 14: Computer Vision

Stereo Matching. Stereo Matching. Face modeling. Z-keying: mix live and synthetic

BIL Computer Vision Apr 16, 2014

REVIEW OF STEREO VISION ALGORITHMS: FROM SOFTWARE TO HARDWARE

Fusing Color and Texture Features for Stereo Matching

Detecting motion by means of 2D and 3D information

PatchMatch Stereo - Stereo Matching with Slanted Support Windows

Stereo and Epipolar geometry

Real-Time Stereo Vision on a Reconfigurable System

Computer Vision I. Announcements. Random Dot Stereograms. Stereo III. CSE252A Lecture 16

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

The Naked Truth about Cost Functions for Stereo Matching

Reliability Based Cross Trilateral Filtering for Depth Map Refinement

Bio-inspired Binocular Disparity with Position-Shift Receptive Field

Epipolar Geometry and Stereo Vision

STRUCTURAL EDGE LEARNING FOR 3-D RECONSTRUCTION FROM A SINGLE STILL IMAGE. Nan Hu. Stanford University Electrical Engineering

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

Transcription:

Real-Time Disparity Map Computation Based On Disparity Space Image Nadia Baha and Slimane Larabi Computer Science Department, University of Science and Technology USTHB, Algiers, Algeria nbahatouzene@usthb.dz, slarabi@usthb.dz Abstract:The current paper proposes a real-time dense stereo matching algorithm based on a DSI (Disparity Space Image ) data structure that can produce accurate disparity map. Our approach divides the disparity map computing into two main steps. The first step deals with the computing of the initial disparity map using DSI method. The second step presents a simple and a fast method to refine the initial disparity map. New strategies and improvements are introduced so that an accurate and fast result can be reached. Experimental results on real data sets were conducted for evaluating the solutions proposed. Also, a comparative evaluation of our method with others methods is presented. Keywords: DSI (Disparity Space Image), disparity map, Occlusion, Stereovision, Median filter. 1. Introduction Stereo matching is a problem to find correspondences between two or more input images. It is one of fundamental computer vision problems with a wide range of applications, and hence it has been extensively studied in the computer vision field for decades. Stereo matching consists to find for each point in the left image, its corresponding in the right one. The difference between horizontal distances of these points is the disparity. A disparity map consists of all the possible disparity values in an image. Such a map is basically a representation of the depth of the perceived scene. Therefore, the disparity maps have been used to address efficiently problems such as 3D reconstruction, mobile robot navigation and many other domains. Despite the simplification brought by the epipolar geometry, the problem of matching remains difficult to solve due to occlusion, luminosity changes between viewpoints and non textured areas. To overcome these difficulties, several methods have been proposed. A state of the art of different existing methods is presented by [1]. In general, stereo algorithms can be categorized into major classes: local methods and global methods. Local algorithms, which are based on correlation, can have very efficient implementations that are suitable for real-time application [2-4].The central problem of local windowbased algorithms is how to determine the size and shape of the aggregation window [5]. That is, a window must be large enough to cover sufficient intensity variation while small enough to avoid crossing depth discontinuities for reliable estimation. This inherent ambiguity causes problems such as noisy disparities in textureless region and blurred object boundaries. Global approaches minimize an overall cost function that involves all the pixels of the image. Field is led to minimize the objective function of energy. Several optimization methods have been proposed such as dynamic programming [6], graph cuts [7], directed anisotropic diffusion [8], belief propagation [9,1] and neural network based approaches [11,12]. A survey for the different approaches can be found in [1,13]. In this paper, we propose a new approach for computing a dense disparity map based on the DSI data structure. As such, stereo images are matched used a DSI composed of gray values representing the cost of all possible matching in stereo images. Our approach divides the matching process into two steps: initial disparity map and refinement of disparity map. This paper is organized as follows: section 2 presents the stages followed to compute the disparity map. In section 3, experimental results obtained on real images are presented and discussed. Finally, section 4 concludes the paper with some remarks. 2. Initial Disparity Map Estimation 2.1 DSI Computation Disparity Space Image (DSI) is an explicit representation of the matching space introduced by A. Bobik and S. 486

Intille [14]. It plays an essential role in the development of the overall matching algorithm which uses the occlusion constraints. Assuming that the pairs of images are rectified, thus the disparity computation concern two matched points which have the same abscise. For each pixel p l (x l,y l ) in the left image (reference image), the disparity computation will concern all pixels of a window W l centred on p l. At each pixel p i (x i,y i ) of the W l, the matched pixel p j (x j, y j ) will appertains to the window W d r of the right image centred on p r (x r, y r ) (see figure 1). The position of W d r depends on the disparity d of the pair (p l, p r ) which varies from zero to d max, where d max represents the highest disparity value of the stereoscopic images. The relations which bind two matched points p i, p j of W l and W d r are: x j = x i +s * d, y j = y i, where s = {+1,-1} is a sign chosen so that disparities are always positive. To determine the disparity of a given pixel p l (x l,y l ), we calculate for an assumed disparity d the cost DSI d (p i ) of all p i of the windows W l. The size of the window was experimentally chosen to be 7x7. As the implementation of this method for DSI computation is time consuming, we propose in the next section the introduction of computation Strategy in order to parallelize the calculation of various costs. 2.2 Computation Strategy For each pixel p l (x l,y l ) in the left image, the disparity computation will concern all pixels of a window W l centered on p l. In our case, we used a shiftable window of 7 x7 size (Line x column). In order to increase the computation time of the DSI method, when we compute the disparity of p (i, j +1), we perform the operations shown in Figure 2 (a) which avoids recalculating at each time the sum AC for each pixel in the window W l. Thus, we compute the disparity for one column at a time rather than the entire window, because the pixels in the neighborhood of p(i, j +1) are the same as those in the neighborhoods of p(i,j). The same principle is used to calculate the disparity of the pixel p(i+1,j). We do the computation for only one line at a time rather than the entire window (see figure 2(b)). By using these operations, the redundancy in computation is completely removed and the computation time has been considerably reduced. 49 Vectors DSI - + = P ( i, j ) P ( i, j+1 ) Figure 1. The windows used in DSI Computation The DSI d (p i ) cost is then computed for a given disparity d. It is defined as follow: DSI d I (p i ) = I l (x i, y i ) I r (x i d,y i )) (1) Aggregation is performed by summing the calculated matching costs over a variable window with constant disparity d. The aggregation cost AC d pi is defined as: 49 Vectors DSI (a) - + = (b) P ( i, j ) P ( i+1, j ) = ( ) (2) Figure 2. Illustration of shifting method used in DSI method Where W l is a set of neighboring pixels on which p lies. Considering all the aggregation values obtained varying the disparity in the range [.. d max ], the initial disparity of the pixel p l is chosen as the disparity d* with the minimal cost AC d* among the various costs of neighboring pixels to p l of the variable support window W l and will be noted d * (p l ). 3. Disparity Map Refinement The resulting disparity map described above is not the optimal one because it still some noise and errors. We propose in the following our disparity map refinement method. 487

3.1. Refinement Method We assume that initial disparities of all pixels of the left image are computed. For each pixel p l of the left image, we first verify if the disparity is dominant in the variable support window W l. If it is the case, this disparity will be considered as the final disparity and does not necessitate any refinement. Otherwise, we do a refinement which consists to select in variable support window W l the disparity associated to the best score for each pixel. In this step, we apply a vote in order to choose the dominant disparity in the associated W l using the central pixel p l and its neighboring pixels. The disparity that will obtain the highest number of confirmations (dominant) will be considered as the new disparity of the central pixel p l. Finally, in order to keep the good trade-off between accuracy and processing time, a simple median filter is applied for smoothing the disparity map. 4. Experimental results In this section, we describe the experiments conducted to evaluate the performance of the proposed method. The two criteria used for the evaluation are accuracy and computation cost. Many applications not only require accurate disparities, but fast runtime as well. In our case, this work is intended to be used for obstacle detection system of autonomous mobile robot navigation. 4.1 Evaluation in terms of Accuracy For what concerns accuracy we rely on an evaluation methodology analogous to that adopted on the Middlebury stereo evaluation site [19]. We use four references stereo pairs (Tsukuba, Venus, Teddy and Cones). Figure 3 illustrates the resultss of the initial disparity maps and final disparity maps obtained for four selected images. In order to compare the results of different algorithms, we adopt the method similar to that of [1]. Parameters ALL and NOCC are defined according to the Middlebury web site [19]. ALL is the error computed on the whole image and NOCC is the error computed on the whole image excluding the occluded region. Among the quality measures proposed by [1] in their paper we adopted the percentage of bad matching pixels between the computed disparity map d C (x,y) and the ground truth disparity map d T (x,y): PBP= (1/N ( d C (x, y) d T (x, y) > δ d )) (3) Image Ground Truth Initial disparity Final disparity Figure 3. Disparity map obtained with our method Table 1 shows the results in term of accuracy obtained by our method on some images available in the Middlebury website[19]. We studied also the influence of window size on the accuracy of the proposed method. Figure 4 illustrates the influence of the window size on the accuracy of our method for each image. 4.2 Evaluation in terms of computation time Similarly to the evaluation of accuracy, Table 2 shows the processing time obtained by our method when we applied the computation strategy (section 2.2) on the four images pairs with different window size. The timing tests were performed on a PC Intel Core Duo 2 GHz, 2GB RAM. 4.3 Discussion and comparison This section presents a comparison between the proposed method and other state-of-artt methods. The results obtained by our method are better than some methods reported in the literature [13,16]. Table 3 shows a comparison of stereo vision implementation reported in the literature in terms of computation time. A ranking of each method according to the computation time is shown in the second column. Where δ d is the error disparity deviating from the ground truth more than 1 pixel. 488

5. Conclusion The stereo correspondence problem remains an active area for research. In this paper, we have presented a realtime disparity map estimation algorithm based on the DSI data structure. The disparity map computing process is divided into two main steps. The first one deals with computing the initial disparity map using the DSI structure. The second one presents a simple and fast method to refine the initial disparity map so that an accurate result can be achieved. Using this method, we approach the results of global methods without sacrificing the simplicity, flexibility and speed of local aggregation methods. As seen in the experiments results, we have performed an experimental investigation in order to asses the impact of the window size and the time optimization techniques. According to the results, we can say that the computation time mainly depends on the image size and the size window. References [1] Scharstein D.and Szeliski,R.. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms Int. Journal on Computer Vision, pp.47,7 42,22. [2] Tombari F., Mattoccia S., and Di Stefano L. Near real-time based on effective cost aggregation ICPR 28-1 [3] Mattocia S., A correspondence ICCV, 12 th Intern. Conf. On Computer Vision Workshops, pp. 1763-177, 29. [4] Gerrits M. and Bekaert P. Local stereo matching with segmentation-based outlier rejection In proceeding Conference on Computer and robot Vision, 26. [5] Kanade, T. and Okutomi, M. A stereo matching algorithm with an adaptive window: theory and experiment IEEE Trans. Pattern Anal. Machine Intell., 16, 92-932, 1994. [6] Kim J., Lee K., Coi B. and Lee S., A dense stereo matching using two-pass dynamic programming with generalized ground control points In Proc. Conf. Computer Vision and Pattern Recognition, pp.175-182, 25. [7] Kolmogorov V. and Zabih R.,. What energy functions can be minimized via graph cuts? ECCV(3),Lecture notes in computer science, pp:65-81, 22. [8] Banno A., IkeuchiK., Disparity map refinement and 3D Surface smoothing via directed anisotropic diffusion 12 th ICCV Workshops, 187-187, 297. [9] Tappen M. and Freeman W., Comparison of Graph cuts with Belief Propagation for Stereo, using identical MRF Parameters Inter. Conf. on Computer Vision, ICCV 23. [1] Yang Q., Wang L., Yang R., Stewenius H. Nister D., Stereo Matching with Color-Weighted Correlation, Hierarchical Belief Propagation and Occlusion Handling. PAMI 28. [11] Vanetti M., GalloI.,BinaghiE.,29. Dense Two- Frame Stereo Corresponence by Self-Organizing Neural Network Image Analysis and Processing, LNCs, Springer, pp 135-142, 29. [12] Venkatesh Y.V, RajaS.K., Raspanti M. Neural disparity computation for dense two-frame stereo correspondence IEEE Transaction On Image Processing 16( 11) pp: 2822-2829., 27. [13] Nalpantidis L., Sirakoulis G.and Gasteratos, A., Review of stereo matching algorithms: from software to hardware International journal of optomechtonics 2: pp.435-462, 28. [14] Bobik,A. and Intille,S., Large occlusion stereo International Journal on computer vision 33, pp.181-2, 1999. [15] Tombari F.,Mattoccia S., and Di Stefano L., Segmentation-based adaptive support for accurate stereo correspondence In Proc. Pacific-Rin Symposium on Image and Video Technology, 27. [16] Tombari F.,Mattoccia,S., and Di Stefano L. Clasification and evaluation of cost aggregation methods for stereo correspondence In Proc. CVPR. 28 [17] YoonK. and KweonI., Adaptive Support Weight Approach for Correspondence Search IEEE Tran. On Pattern Analysis and Machine Intelligence, vol. 28. 26. [18] Veksler O. Extracting dense features for visual correspondence with graph cuts Proc. Of the IEEE Computer Society Conference on Computer Vision and pattern Recognition, 23. [19] http://www.middlebury-edu/stereo. 489

Bad Percentage (%) 5 4 3 2 1 Venus Bad Percentage (%) 4 3 2 1 Tsukuba Bad Percentage (%) 6 5 4 3 2 1 Cones Bad Percentage (%) 5 4 3 2 1 Teddy Figure 4. PBP obtained by our method on four image pairs for different window sizes. Table 1: Accuracy according to the methodology defined by the Middlebury web site. Aver. bad Cones Teddy Tsukuba Venus ALL NOCC ALL NOCC ALL NOCC ALL NOCC Percent. Accuracy 28,9 2,9 32 25,1 15,5 13,7 7,9 6,66 18,83 81,2 Table 2. Processing Time(s): (a) DSI, (b) DSI with Shifting et (c) DSI with Shifting + threads Tsukuba Venus Teddy Cones (a) (b) (c) (a) (b) (c) (a) (b) (c) (a) (b) (c).531.297.18.75.516.34 1.922 1.344.79 1.922 1.344.81 5.984.377.23 1.438.734.43 4 1.922 1.1 3.968 1.96 1.12 7*7 1.641.484.28 2.453.875.53 7.47 2.438 1.42 7.23 2.438 1.51 9*9 2.25.57.34 3.9 1.9.62 12.3 2.75 1.71 11.87 2.86 1.7 11*11 3.67.4 5.82 1.17.71 17.64 3.23 2 17.31 3.45 2.12 13*13 4.1.75.43 7.62 1.32.81 23.65 3.75 2.4 23.74 3.71 2.4 49

Table 3. Comparison of stereo vision implementations Author Time(s) Algorithm Image size Machine Our Method,28 s 2,33 s DSI DSI + Refinement Tsukuba 2.GHz Intel Core Duo Tombari (28)[2].2 s 1 Aggr. Stra. Based on color segm. Tsukuba Gerrits (26)[4] 2 s 3 Segment. based Teddy 2.4 GHz Intel core Duo 2.4 GHz Intel core Duo Kim (25)[6] 4.4 s 4 Dyn. Prog. Tsukuba 2.4 GHz Pentium IV Veksler (23) 6 s 6 Graph-cut Tsukuba.6 GHz, Pentium III Tappen (23)[9] 183 s 11 Accelerated Belief Prop. Map 2.4 GHz Pentium IV Mattocia (29)[3] 13 s 7 LC Locally Consist. Tsukuba 2.5 GHz Intel Core Duo Yoon (26)[17] 6 s 8 -based Tsukuba AMD 27 Vanetti (29)[11] 1 s 9 Self org. Map All images 1.8 GHz AMD Processor Venkatesh (27)[12] 12 s 1 Self org. Map 256 x 256 1.4 GHz Pentium IV 2.4 GHz Intel core Duo Tombari (27)[15] 33mn34s 13 Segment Support Teddy 491