Towards Real-time Stereo using Non-uniform Image Sampling and Sparse Dynamic Programming

Size: px
Start display at page:

Download "Towards Real-time Stereo using Non-uniform Image Sampling and Sparse Dynamic Programming"

Transcription

1 Towards Real-time Stereo using Non-uniform Image Sampling and Sparse Dynamic Programming Michel Sarkis and Klaus Diepold Institute for Data Processing, Technische Universität München Arcisstr. 21, Munich, Germany [michel, Abstract Constructing the 3D mesh of a scene from stereo images is a major task in computer vision. It usually involves several steps including stereo matching and meshing. Unfortunately, the time required to generate the 3D mesh is time demanding due to the large amount of pixels to be processed. In this work, we propose a framework to accelerate the overall process. The key issue is to first reduce the number of pixels by approximating an image with a content adaptive mesh. The nodes of the mesh are sparse and they represent the non-uniform samples of the image. To benefit from the reduced set of pixels, we formulate a dynamic programming based stereo matching algorithm which computes the depth only at the sparse samples. We then show by setting up some tests using some real images that the non-uniform samples are sufficient to recover the original dense depth map of the scene by interpolating them using the mesh. The results obtained also show that the employment of the proposed strategy reduces the overall processing time of stereo matching to more than 50% of the original time. We are now able to construct scenes in real-time using less computational resources. 1. Introduction Stereo-based scene reconstruction is a well established research topic in computer vision. It plays an essential role in many applications like telepresence and virtual reality. It usually consists of several steps. The stereo images are rectified. A stereo matching algorithm follows to determine the depth map of the scene upon which the 3D points are computed. Then, a mesh is constructed to create a 3D model of the scene which can be rendered using a graphical processing unit (GPU) on a virtual display. This research is sponsored by the German Research Foundation (DFG) as a part of the SFB 453 project, High-Fidelity Telepresence and Teleaction. Constructing the 3D mesh model of a scene is computationally very demanding mainly due to the application of stereo matching and surface reconstruction algorithms. The time required to obtain the 3D mesh depends on various factors: the complexity of the algorithms used, the size of the stereo images, i.e. the number of points to be matched and meshed, the number of disparity levels. To enhance the speed of stereo matching, most of the recent works formulate the problem using GPUs to parallelize the computations and attain real-time speed [4, 5, 24, 26, 27], while others increase the hardware capacity by using a large PC cluster [11]. In order to boil down the size the 3D mesh and accelerate its rendering on a GPU, the developed algorithms reduce the number of points and build a simplified mesh using the reduced set [2, 13, 17, 21]. Stereo matching and surface simplification are two different problems which are usually treated independently. Their common overhead is the large amount of pixels that have to be processed. In applications like telepresence and robot navigation, however, these two components occur together. Treating them independently leads to a decrease in the overall performance of the application. The justification of the last claim is simple. A simplified mesh is usually built using a reduced set of points which are chosen depending on certain criteria, see [13,17,21] for example. It is thus necessary to compute the depth values only at these points to obtain the complete mesh model of the scene. Doing so will ameliorate the speed of stereo matching and minimize the required computational resources since the number of pixels to be processed is reduced. Three points are mainly discussed in this work. We will first derive an algorithm that estimates the significant features in an image by detecting its non-uniform samples. The obtained samples are sufficient to construct the mesh approximation of the scene while preserving its content. Then, we will propose a sparse stereo matching strategy based on dynamic programming to compute the disparity values of the non-uniform samples. We will then argue that the estimated depth values of those samples are enough to recover

2 the dense depth map of the scene by interpolating them using the mesh. To justify this point, we will setup a test using the Middlebury data set [19, 20] which evaluates the quality of the recovered dense disparity maps. In addition, the results we obtained show that the overall time needed for scene reconstruction reduces significantly. The rest of this work is divided as follows. Section 2 describes some related work. In Section 3, the proposed strategy is derived. Section 4 evaluates the performance of the scheme. Finally, conclusions are drawn in Section Related Work 2.1. Stereo Matching Three different groups of techniques exist to compute a disparity map. First, there are local methods which are based on the block matching technique. Second, there are scanline based methods which minimize a cost function over each scanline. Finally, there are global methods which operate on the entire image to minimize a cost function. The local approaches are quite fast and can achieve a real-time speed. Their outcome, however, suffer from inconsistencies and they usually result in low-quality depth maps even when using the latest developments to remedy their deficiencies as the one described in [8]. The scanline based methods are also fast and produce depth maps with better quality when compared to the local methods [19]. Moreover, there has been a lot of research lately to accelerate these techniques while preserving the quality of the depth maps. In [1], low complexity cost functions were defined and used in addition to the fact that DP was modified to prune the bad search regions. In [10], quadtree subregioning was employed to partition the image into several sub-images while the GPU is employed to parallelize the computations in [4, 5, 24, 27]. The global methods as the ones described in [3, 7] are usually too slow to use in realtime applications unless they are formulated using graphical hardware [26]. These methods, however, result in depth maps which exhibit the best quality [19] Meshing The Stereo Images Approximating 3D points with a triangular mesh has been dealt with thoroughly in the literature. The main difficulty in this area is the long time required to construct the mesh due to the large number of points that have to be processed, e.g. for a image, points have to be meshed. To overcome this burden, the number of points is reduced before building the mesh. Some algorithms are designed to operate on the 3D data and can be applied after stereo reconstruction [6, 13, 14, 17, 21]. While others reduce the number of points in the image and then construct the mesh using the reduced point set. The reduced set of pixels is usually referred to as the non-uniform samples of the image since the obtained sparse image will be irregularly sampled depending on the variation of the pixels intensities. In [25], quadtree subregioning was used in order to represent the image with a mesh, where the nodes of the mesh are the desired non-uniform samples. In [28], the gradient image is computed to determine a feature image. Then, using the gradient image, the non-uniform samples are determined by optimally placing the points using the Floyd-Steinberg error diffusion algorithm. In [16], nonuniform samples are chosen by computing the skewness of the pixels and then choosing the pixels that are above a predefined threshold. The advantage of these methods over the straightforward 3D meshing techniques is that the nonuniform samples can be selected very efficiently. 3. The Proposed Approach The aim of our work is to accelerate the overall speed of scene acquisition starting from the fact that an image can be faithfully represented by a subset of its pixels, the nonuniform samples, to capture the scene it represents. Nonuniform image sampling is the process of sampling the image at an irregular sampling grid. The grid has a high sampling density, i.e. the image is sampled with high rate where there intensity variation is high, otherwise it is low. An example is illustrated in Figures 1a and 1c. In Section 3.1, we will derive the method to build the 2D mesh representation of the left image where the vertices of the triangles form the desired non-uniform samples. Then, we will discuss how to match the samples to the right image of the stereo pair in Section Meshing The Stereo Images With Tritree Tritree (triangle tree) subdivisions consists of partitioning an area into a triangular grid [23]. A triangle is first generated which includes all the area to be triangulated. Then, it is subdivided in a balanced manner into smaller triangles until all the area of interest is meshed. In this work, a different strategy is used that allows the representation of an image with an adaptive mesh. An image is first divided into two triangles along one of the longest diagonals. After that, each of the two triangles is split recursively until a predefined criterion is satisfied. The difference between the methods is illustrated in Figure 2. Tritree subdivisions is similar in spirit to the quadtree subdivisioning method used in [25] to represent an image with a mesh. There, an image is partitioned into rectangles which can be subsequently subdivided into two or more triangles depending on an error criterion. Therefore, it appears more natural to start directly by building a tree of triangles instead of building a tree of rectangles and transforming it into triangles at a later stage. Let V i (x i, y i, I i ) with i = 1, 2, 3 be the three vertices of a random triangle T under consideration in an image. If the

3 From (4), the equation of the plane is directly obtained as ax + by + ci + k = 0. (5) (a) (c) Figure 1. Sample result using the proposed algorithm. a: The sample Lab image. b: The 2D mesh obtained by the proposed tritree algorithm. c: The sparse depth map obtained using the proposed sparse DP approach. d: Top view of the 2.5D mesh obtained by combining the sparse depth map with the 2D mesh of the image. division criterion is met, see (7), T should be divided from its longest edge which has to maximize the expression (b) (d) max V i (x i, y i ) V j (x j, y j ), (1) with i, j = 1, 2, 3 and i j. The new vertex, or the nonuniform sample, is the middle point of the longest edge and is obtained by V = 1 2 (V i(x i, y i ) + V j (x j, y j )). (2) The points of an image describe a 3D space represented by the 2D-coordinates of the pixel in the image and the corresponding intensity 1. When building the mesh, a triangle T of the mesh should represent the pixels that it covers as accurately as possible. Each triangle is formed by three points which are parts of the nodes or the vertices of the mesh. The plane Π described by the vertices of T, i.e. V i, is defined using the normal equation as n p(x, y, I) + k = 0, (3) where p(x, y, I) denotes a pixel lying on the plane, k is a real constant and n is the normal vector to the plane. The normal vector can be computed with the three vertices V i (x i, y i, I i ) as the cross product of any two edges of the triangle as n = [V2 V 1 ] [V 3 V 1 ] = a b. (4) c 1 Note that in the case of color images, the same derivation applies but have to be conducted for all the color planes. Thus, it is now possible to recover the intensity value Î of a pixel lying inside the triangle T by rewriting (5) Î = ax + by + k. (6) c Consequently, one can check now how each triangle represents the pixels which lie within by comparing the reconstructed intensities of the pixels with the original ones. A well known quality metric is the Peak Signal to Noise Ratio (PSNR) [15] which is defined as ( ) PSNR = 10 log, (7) MSE where MSE stands for the Mean Squared Error defined across the area A of the triangle T by MSE = 1 A A 1 n=0 ( I (p n ) Î (p n)) 2. (8) The symbol denotes the floor operator. The intensities of the pixels are assumed to vary between 0 and 255. If the PSNR of the reconstructed intensities of T using (6) is lower than a predefined threshold ɛ, then T is decomposed into two smaller triangles. This step is repeated for each triangle until ɛ is satisfied across the whole image. The area of a triangle cannot be less then unity since it will not contain any inlying pixel. Thus, if the division of T leads to a child triangle with area less then unity, the division should be deleted from the triangle tree and T and should not be divided anymore. Note that the area of a triangle cannot be very big in order to avoid fitting a single plane in very large image regions. In this work, the maximum area of a triangle was set to 100 pixels. (a) (b) Figure 2. Sketch illustrating the tritree subdivisions applied to an image. a: The original algorithm of [23]. b: The proposed tritree partitioning scheme Sparse Stereo Matching Sparsity of the data is a property that has been researched extensively in various fields including computer vision. By

4 exploring the sparsity of the data, it is possible to design fast algorithms because only a small system of equations has to be solved. Since the obtained left image from Section 3.1 is sparse, it makes sense taking advantage of this fact. Sparse stereo is usually performed using a local approach. The main disadvantage of such methods is that the smoothness constraint cannot be enforced. In addition, their output have a lot of tendencies for errors at the object boundaries [19]. Another possibility is to apply semi-dense stereo matching techniques as in [9, 18] which determine the depth at some pixels of the image considered to be reliable. Applying the latter type of techniques does not meet the goal of this work since the found pixels do not have to coincide with the locations of the non-uniform samples. And for each non computed sample, the triangles of the mesh where it belongs will be missing from the reconstructed scene and thus deteriorating its quality. Apart from that, we choose DP to accomplish this task since it is one of the fastest optimization methods and leads to a good quality. DP was first introduced in stereo vision in the context of edge based methods [12]. Then, the interest in it grew since it is one of the fastest optimization techniques and leads to good quality depth maps. DP is derived in this section to find the depth of the sparse non-uniform samples and, hence, the naming Sparse DP. The inputs are a sparse left image and a full right image. Let p (x, y) be a pixel in the left image lying on the y th scanline and q (x, y ) be the corresponding pixel in the right image. In this work, the stereo images are assumed to be rectified such that the conjugate epipolar lines become horizontal and share the same y coordinate. Consequently, the disparity value d can be expressed as a function of the pixel difference in the scanline such that d = x x Computation of Cost Function Various pixel-based cost functions exist in the literature. The ones, which are used the most are the absolute difference and the squared difference due to their simplicity. The cost function C used in the present work is the absolute difference which is defined at a disparity value d to be C (x, y, d) = p (x, y) q (x + d, y). (9) In this case, the disparity value of p corresponds to the location where C (x, y, d) reaches a minimum. In general, any pixel-based cost function could be used. The important issue here is to evaluate only the costs over the support region of each non-uniform sample. The support region of a pixel p (x, y) is defined to be the region along which the cost is aggregated [19]. Suppose that the support region S of p is a box filter of size w h. If p (x, y) is a non-uniform sample, C is evaluated at all the pixels of S. The reason is that the aggregated cost of p must be computed over all of S, see (10) Aggregation of Cost The sought disparity surface is assumed to be smooth since the neighboring pixels of p are likely to have the same disparity value. Therefore, the initial costs are usually aggregated. The aggregated cost C a of p defined over its support region S is C a (x, y, d) = x,ỹ w S ( x, ỹ, d) C ( x, ỹ, d), (10) where w S is a weighting function defined over S. The main advantage of sparse DP is that the aggregated costs have only to be evaluated at the non-uniform samples and hence the process is sped up. By looking at Figure 1c for example, the number of non-uniform samples needed to obtain the 3D mesh of the scene amounts to 42% of the total number of pixels. Hence, the effort of the evaluation of the aggregated costs reduces to 42% of the total effort as well Disparity Optimization Using Sparse DP In DP, the stereo correspondence problem is formulated as finding the optimal path through the disparities that minimizes an energy function defined over a scanline. The energy function is usually minimized by conducting a 2D search in a plane defined by C a and it is written as E (d) = E data (d) + λ E smooth (d), (11) where E data (d) is the data term defined using the aggregated costs reflected by (10), E smooth (d) is the smoothness term and λ is a constant to penalize depth discontinuities. In sparse DP, Equation (11) is minimized along a sparse scanline. Therefore, the smoothness term should take the sparsity of the data into account. It can be defined as Ẽ smooth (d) = x τ x d (x) d (x prev ), (12) where x prev represents the neighboring non-uniform sample and d ( ) is the corresponding disparity value. The scalar τ x is related to the distance between the samples. It should be high if the neighboring samples are close and it should be monotonically decreasing as the distance between the samples increases. This is related to the fact that the closer the samples are, the more probable there is a discontinuity and vice versa. In this work, this constant is chosen to be τ x = (x x prev ) 2 since it insures that (12) is continuous and it allows for interpolation between the neighboring samples. It is important to mention that when filling the cost volume, the scalar τ x should reflect the distance between the samples only when penalizing the costs related with the previous pixel, e.g. C (x prev, d 1). To penalize the cost of a pixel going to a different disparity level, e.g. C (x, d + 1), the distance between the samples should be

5 ignored since this cost is not related to the previous sample. In such a case, the scalar τ x must be set to unity. Using the derived equations, the DP optimization is performed at each scanline y to extract the best disparity path. The obtained result is a sparse depth map as the one shown in Figure 1c. By combining the depth map obtained from sparse DP and the mesh of the previous section, the 2.5D mesh of the scene can be constructed as shown in Figure 1d. In the proposed scheme, we did not deal with the problems of occlusion detection or inter-scanline consistency that usually result from the application of DP. However, our approach can be integrated in any DP based technique that takes these points into account. In the presented results, we will be using the DP algorithms proposed in [19, 24]. 4. Experimental Results We have performed tests, which consist of evaluating the performance of our proposed method using the Middlebury data set [19, 20] and some images as the ones shown in Figure 1 and in the first row of Figure 7. These images will be referred to as the Lab image and the Hall image respectively. All the tests were performed on an AMD Athlon XP 64 bit processor (2.2 Ghz, 2 GB RAM) using a Linux operating system and C++ programming language. The performance of the proposed tritree method is assessed by measuring both the length of the mesh and the time required to build the mesh at various PSNR levels, see Equation 7. Figure 3 shows the mesh length of the Tsukuba, Teddy, Lab and Hall images in comparison to the meshing method of Yang [28] and that of Ramponi [16]. It can be directly observed that tritree clearly outperforms the other methods. Looking at Table 1, tritree requires more time to obtain the result since the whole image is used to determine the samples whereas in [16, 28] gradient images are used. Although tritree is slower than the methods of Yang and Ramponi, it has the advantage of possessing a parallel structure. Each new triangle can be independently subdivided from the other ones since they are not related, see Figure 2b. Consequently, it will be possible to subdivide the computations among a cluster of CPUs working in parallel. Figure 4 shows the average time required by tritree applied to a sequence of images at 40 db PSNR. The measurements were performed using a cluster of four AMD processors similar to the ones used in the previous experiments. Each CPU was assigned an equal amount of the image to be processed and the complete mesh of the frame is then gathered in one data set by padding each part. To force such parallelism in the C++ programming language, it is necessary to use the multi-threading library, i.e. pthread, and initiate each part of the image as a single thread. Using a single CPU, the average frame rate is low as was remarked in Table 1 for the image. But as the number increases to 4 CPUs, i.e. (4 parallel threads), the rate increases to 4 VGA frames per second (fps). Tritree can be easily divided among a cluster of CPUs (or a multi-core processor) while using other methods, the job is more complicated. The main reason lies in the fact that if the image is split to be processed with several CPUs, there will be a difficulty in connecting the partial meshes into a single one. (a) (c) (d) Figure 3. Comparison of the variation of the mesh length versus the PSNR threshold of the proposed tritree with the methods of [16, 28]. a: the Tsukuba image. b: the Teddy image. c: the Hall image. d: the Lab image. (b) Algorithm Tsukuba Teddy Lab Tritree Yang [28] Ramponi [16] Table 1. Time Evaluation in seconds of tritree with several other meshing approaches at 40 db PSNR. Figure 4. Average time needed by tritree to generate a mesh versus the number of processors used in parallel. The PSNR threshold was set to 40 db. The images used were of size The proposed sparse DP method is also tested in terms of speed and quality by comparing it to the DP implementations of [19,24]. The quality is tested using the Middlebury data set. The visual output is shown in Figure 5, while the metrics are tabulated in Table 2. The metrics were obtained by taking the ground truth depth maps found in [19, 20], extracting the disparity values of the non-uniform samples and then computing the percentage of the error between the sparse disparity maps obtained using each version of sparse

6 DP and the ground truth values. The results show that the error rate has increased less than 4% on average with respect to the total number of pixels when compared to the original DP methods. To get a feeling on what does the little increase in the error rate means in the quality of the depth maps produced by DP, we show in Figure 6 the depth maps produced by the original algorithm of [24] along with the reconstructed dense depth maps of its proposed sparse version. The latter was obtained by interpolating the sparse depth maps shown in the second row of Figure 5 using Equations 4 to 6. As we can see, the visual quality is hardly effected. This is also confirmed in Table 3 obtained from the Middlebury evaluation web-tool for 0.5 pixels error threshold. On the contrary, the table shows us that the sparse version of [24] is better than other existing dense DP based algorithms. Nevertheless, the performance of sparse DP deteriorates near depth discontinuities. This can be justified for the interpolation of the sparse disparity map extends the edges and leads to the higher values in Table 3. To further elaborate on our results, we show in Figure 7 the combined outcome of tritree and sparse DP to construct a 2.5D mesh approximation of two real stereo images. DP of [19] DP of [24] Image Sparse Original Sparse Original Tsukuba Venus Teddy Cones Average Table 2. Percentage error of the DP methods of [19, 24] with their sparse versions using the Middlebury data set. The non-uniform samples were computed at 40 db PSNR. The time comparison between the proposed sparse DP versions and the corresponding original versions is illustrated in Table 4. Sparse DP reduces the time of the disparity computation tremendously for it exploits the sparsity of the non-uniformly sampled images. As the size of the image increases, the improvement in time also increases since non-uniform sampling eliminates the non-significant pixels in the image and hence unnecessary computations. At an image resolution of pixels and at 150 disparity levels, the time needed to compute the depth map reduces on average to 50% of the original time. The speed of stereo matching can be also improved by distributing the computations over a cluster of CPUs as was done with tritree. The strategy we will follow is to assign an equal amount of the scanlines of the image which must be processed to a CPU; then, all the partial results are combined by simple padding. Figure 8 shows the average rate (fps) required by the DP technique of [24] to generate a disparity map along with its proposed sparse version. The measurements were performed on the same CPU cluster described before. We applied each of these algorithms to the same VGA video sequence used in tritree by setting the maximal disparity range 100. Sparse DP is able to achieve an average rate of 1.5 fps using 4 CPUs. Compared to the original approach, the computation of the dense depth maps would have required at least twice the number of CPUs to reach this frame rate. The proposed approach was integrated into a stereo camera placed on a pan-tilt unit to construct the 3D model of a room. The motion of the camera is known along with the intrinsic parameters. The images have a VGA resolution. The maximum disparity value was set to 150. The threshold of tritree was set to 40 db. The experiment is shown in the accompanying video. 5. Conclusion A new scheme was presented in this work to accelerate stereo-based scene reconstruction. The left image of a stereo pair is first meshed using the proposed tritree subregioning method, which approximates the left image with a sparse image containing the non-uniform samples. Then, the a sparse disparity map is computed by estimating the depth values of the sample using a proposed sparse DP approach. Tests show that tritree is able to construct the mesh approximation of the image while significantly reducing its size. Moreover, using sparse DP, depth maps are now obtained up to 50% faster than with using dense DP without a significant loss in their quality. References [1] S. Birchfield and C. Tomasi. Depth discontinuities by pixelto-pixel stereo. Int. J. Computer Vision, 35(3): , Dec [2] T. Burkert, J. Leupold, and G. Passig. A photorealistic predictive display. Presence: Teleoperators and Virtual Environments, 13(1):22 43, Feb [3] P. F. Felzenszwalb and D. P. Huttenlocher. Efficient belief propagation for early vision. Int. J. Computer Vision, 70(1):41 54, Oct [4] M. Gong and R. Yang. Image-gradient-guided real-time stereo on graphics hardware. In Int. Conf. 3-D Digital Imaging and Modeling, pages , Jun [5] M. Gong and Y.-H. Yang. Near real-time reliable stereo matching using programmable graphics hardware. In IEEE Conf. Computer Vision and Pattern Recognition, pages , Jun [6] M. H. Gross, R. Gatti, and O. G. Staadt. Fast multiresolution surface meshing. In IEEE Conf. Visualization, pages , Oct [7] H. Hirschmüller. Stereo processing by semi-global matching and mutual information. IEEE Trans. Pattern Analysis and Machine Intelligence, [8] H. Hirschmüller, P. R. Innocent, and J. Garibaldi. Real-time correlation-based stereo vision with reduced border errors. Int. J. Computer Vision, 47(1): , Apr

7 Figure 5. Visual outcome of the proposed sparse DP algorithm applied to several DP based stereo matching methods. The non-uniform samples were obtained at 40 db PSNR threshold. The first row is the sparse version of [19] while the second row is that of [24]. Avg. Tsukuba Venus Teddy Cones Algorithm Rank nonocc all disc nonocc all disc nonocc all disc nonocc all disc DP of [24] DP of [5] Sparse DP DP of [22] DP of [19] Table 3. Evaluation of the proposed sparse DP of [24] with its original version and several DP algorithms using the Middlebury web-tool. The results shown are measured at 0.5 pixels threshold. Figure 6. The resulting dense depth maps from the DP of [24] in the first row along with the interpolated depth maps of its proposed sparse version shown in Figure 5 in the second row. [9] J. Kostkova and R. Sara. Stratified dense matching for stereopsis in complex scenes. In British Machine Vision Conf., pages , Sep [10] C. Leung, B. Appleton, and C. Sun. Fast stereo matching by iterated dynamic programming and quadtree subregioning. In Britich Machine Vision Conf., pages , Sep [11] J. Mulligan, X.. Zabulis, N. Kelshikar, and K. Daniilidis. Stereo-based environment scanning for immersive telepresence. IEEE Trans. Circuits and Systems for Video Technology, 14(3): , Mar [12] Y. Ohta and T. Kanade. Stereo by intra- and inter-scanline search using dynamic programming. IEEE Trans. Pattern Analysis and Machine Intelligence, 7(2): , [13] Y. Ohtake, A. Belyaev, and H. P. Seidel. An integrating approach to meshing scattered point data. In ACM Symp. Solid and Physical Modeling, pages 61 69, Jun [14] R. Pajarola. Overview of quadtree-based terrain triangulation and visualization. Technical Report UCI-ICS-02-01, Information and Computer Science, University of California Irvine, 2002.

8 Image Image Disparity DP of [19] DP of [24] Name Size Range Sparse Original Sparse Original Tsukuba Venus Teddy Cones Lab Hall Table 4. Time Evaluation in seconds of the DP methods of [19, 24] with their proposed sparse versions using various images. [15] M. Rabbani and P. W. Jones. Digital Image Compression Techniques. SPIE Press, 1st edition, [16] G. Ramponi and S. Carrato. An adaptive sampling algorithm and its application to image coding. Image and Vision Computing, 19(7): , May [17] A. D. Sappa and M. A. Garcia. Coarse-to-fine approximation of range images with bounded error adaptive triangular meshes. SPIE J. Electronic Imaging, 16(2), Apr [18] R. Sara. Finding the largest unambiguous component of stereo matching. In European Conf. Computer Vision, pages , May [19] D. Scharstein and R. Szeliski. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Computer Vision, 47(1):7 42, Apr [20] D. Scharstein and R. Szeliski. High-accuracy stereo depth maps using structured light. In IEEE Conf. Computer Vision and Pattern Recognition, pages , Jun [21] C. Shen, J. F. O Brien, and J. R. Shewchuk. Interpolating and approximating implicit surfaces from polygon soup. In ACM SIGGRAPH (ACM Trans. Graphics), pages , Aug [22] O. Veksler. Stereo correspondence by dynamic programming on a tree. In IEEE Conf. Computer Vision and Pattern Recognition, pages , Jun [23] S. Ø. Ville. A structured tri-tree search method for generation of optimal unstructured finite element grids in two and three dimenstions. Int. J. Numerical Methods in Fluids, 14(7): , [24] L. Wang, M. Liao, M. Gong, R. Yang, and Nistér. High quality real-time stereo using adaptive cost aggregation and dynamic programming. In Int. Symp. 3D Data Processing, Visualization and Transmission, pages , Jun [25] Y. Wang and O. Lee. Use of two-dimensional deformable mesh structures for video coding, part II the analysis problem and a region-based coder employing an active mesh representation. IEEE Trans. Circuits and Systems for Video Technology, 6(6): , Dec [26] Q. Yang, L. Wang, R. Yang, S. Wang, M. Liao, and D. Nistér. Real-time global stereo matching using hierarchical belief propagation. In British Machine Vision Conf., pages , Sep [27] R. Yang, M. Pollefeys, and S. Li. Improved real-time stereo on commodity graphics hardware. In IEEE Conf. Computer Vision and Pattern Recognition Workshops, Jun [28] Y. Yang, M. N. Wernick, and J. G. Brankov. A fast approach for accurate content-adaptive mesh generation. IEEE Trans. Image Processing, 12(8): , Aug Figure 7. Outcome of the proposed reconstruction method applied to the Tsukuba image of [19] and the sample hall image. The first row is the original image. The second row shows the constructed 2D mesh with tritree at 40 db PSNR. The third row reflects the reconstructed 2.5D mesh after incorporating the depth values of sparse DP. Figure 8. Performance of the original DP matching algorithm of [24] along with its proposed sparse version depending on the number of processors used. In the test, a VGA sequence with maximum disparity level of 100 was used. The PSNR threshold of tritree was set to 40 db.

Real-time Global Stereo Matching Using Hierarchical Belief Propagation

Real-time Global Stereo Matching Using Hierarchical Belief Propagation 1 Real-time Global Stereo Matching Using Hierarchical Belief Propagation Qingxiong Yang 1 Liang Wang 1 Ruigang Yang 1 Shengnan Wang 2 Miao Liao 1 David Nistér 1 1 Center for Visualization and Virtual Environments,

More information

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision What Happened Last Time? Human 3D perception (3D cinema) Computational stereo Intuitive explanation of what is meant by disparity Stereo matching

More information

Real-Time Disparity Map Computation Based On Disparity Space Image

Real-Time Disparity Map Computation Based On Disparity Space Image Real-Time Disparity Map Computation Based On Disparity Space Image Nadia Baha and Slimane Larabi Computer Science Department, University of Science and Technology USTHB, Algiers, Algeria nbahatouzene@usthb.dz,

More information

STEREO BY TWO-LEVEL DYNAMIC PROGRAMMING

STEREO BY TWO-LEVEL DYNAMIC PROGRAMMING STEREO BY TWO-LEVEL DYNAMIC PROGRAMMING Yuichi Ohta Institute of Information Sciences and Electronics University of Tsukuba IBARAKI, 305, JAPAN Takeo Kanade Computer Science Department Carnegie-Mellon

More information

Stereo Video Processing for Depth Map

Stereo Video Processing for Depth Map Stereo Video Processing for Depth Map Harlan Hile and Colin Zheng University of Washington Abstract This paper describes the implementation of a stereo depth measurement algorithm in hardware on Field-Programmable

More information

Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation

Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation ÖGAI Journal 24/1 11 Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation Michael Bleyer, Margrit Gelautz, Christoph Rhemann Vienna University of Technology

More information

Stereo Matching: An Outlier Confidence Approach

Stereo Matching: An Outlier Confidence Approach Stereo Matching: An Outlier Confidence Approach Li Xu and Jiaya Jia Department of Computer Science and Engineering The Chinese University of Hong Kong {xuli,leojia}@cse.cuhk.edu.hk Abstract. One of the

More information

Using temporal seeding to constrain the disparity search range in stereo matching

Using temporal seeding to constrain the disparity search range in stereo matching Using temporal seeding to constrain the disparity search range in stereo matching Thulani Ndhlovu Mobile Intelligent Autonomous Systems CSIR South Africa Email: tndhlovu@csir.co.za Fred Nicolls Department

More information

Data Term. Michael Bleyer LVA Stereo Vision

Data Term. Michael Bleyer LVA Stereo Vision Data Term Michael Bleyer LVA Stereo Vision What happened last time? We have looked at our energy function: E ( D) = m( p, dp) + p I < p, q > N s( p, q) We have learned about an optimization algorithm that

More information

Data Partitioning. Figure 1-31: Communication Topologies. Regular Partitions

Data Partitioning. Figure 1-31: Communication Topologies. Regular Partitions Data In single-program multiple-data (SPMD) parallel programs, global data is partitioned, with a portion of the data assigned to each processing node. Issues relevant to choosing a partitioning strategy

More information

Asymmetric 2 1 pass stereo matching algorithm for real images

Asymmetric 2 1 pass stereo matching algorithm for real images 455, 057004 May 2006 Asymmetric 21 pass stereo matching algorithm for real images Chi Chu National Chiao Tung University Department of Computer Science Hsinchu, Taiwan 300 Chin-Chen Chang National United

More information

Efficient Large-Scale Stereo Matching

Efficient Large-Scale Stereo Matching Efficient Large-Scale Stereo Matching Andreas Geiger*, Martin Roser* and Raquel Urtasun** *KARLSRUHE INSTITUTE OF TECHNOLOGY **TOYOTA TECHNOLOGICAL INSTITUTE AT CHICAGO KIT University of the State of Baden-Wuerttemberg

More information

Hierarchical Belief Propagation To Reduce Search Space Using CUDA for Stereo and Motion Estimation

Hierarchical Belief Propagation To Reduce Search Space Using CUDA for Stereo and Motion Estimation Hierarchical Belief Propagation To Reduce Search Space Using CUDA for Stereo and Motion Estimation Scott Grauer-Gray and Chandra Kambhamettu University of Delaware Newark, DE 19716 {grauerg, chandra}@cis.udel.edu

More information

Processing 3D Surface Data

Processing 3D Surface Data Processing 3D Surface Data Computer Animation and Visualisation Lecture 12 Institute for Perception, Action & Behaviour School of Informatics 3D Surfaces 1 3D surface data... where from? Iso-surfacing

More information

A novel heterogeneous framework for stereo matching

A novel heterogeneous framework for stereo matching A novel heterogeneous framework for stereo matching Leonardo De-Maeztu 1, Stefano Mattoccia 2, Arantxa Villanueva 1 and Rafael Cabeza 1 1 Department of Electrical and Electronic Engineering, Public University

More information

A Developer s Survey of Polygonal Simplification algorithms. CS 563 Advanced Topics in Computer Graphics Fan Wu Mar. 31, 2005

A Developer s Survey of Polygonal Simplification algorithms. CS 563 Advanced Topics in Computer Graphics Fan Wu Mar. 31, 2005 A Developer s Survey of Polygonal Simplification algorithms CS 563 Advanced Topics in Computer Graphics Fan Wu Mar. 31, 2005 Some questions to ask Why simplification? What are my models like? What matters

More information

Processing 3D Surface Data

Processing 3D Surface Data Processing 3D Surface Data Computer Animation and Visualisation Lecture 15 Institute for Perception, Action & Behaviour School of Informatics 3D Surfaces 1 3D surface data... where from? Iso-surfacing

More information

Lecture 14: Computer Vision

Lecture 14: Computer Vision CS/b: Artificial Intelligence II Prof. Olga Veksler Lecture : Computer Vision D shape from Images Stereo Reconstruction Many Slides are from Steve Seitz (UW), S. Narasimhan Outline Cues for D shape perception

More information

MACHINE VISION APPLICATIONS. Faculty of Engineering Technology, Technology Campus, Universiti Teknikal Malaysia Durian Tunggal, Melaka, Malaysia

MACHINE VISION APPLICATIONS. Faculty of Engineering Technology, Technology Campus, Universiti Teknikal Malaysia Durian Tunggal, Melaka, Malaysia Journal of Fundamental and Applied Sciences ISSN 1112-9867 Research Article Special Issue Available online at http://www.jfas.info DISPARITY REFINEMENT PROCESS BASED ON RANSAC PLANE FITTING FOR MACHINE

More information

3D RECONSTRUCTION FROM STEREO/ RANGE IMAGES

3D RECONSTRUCTION FROM STEREO/ RANGE IMAGES University of Kentucky UKnowledge University of Kentucky Master's Theses Graduate School 2007 3D RECONSTRUCTION FROM STEREO/ RANGE IMAGES Qingxiong Yang University of Kentucky, qyang2@uky.edu Click here

More information

Stereo Vision. MAN-522 Computer Vision

Stereo Vision. MAN-522 Computer Vision Stereo Vision MAN-522 Computer Vision What is the goal of stereo vision? The recovery of the 3D structure of a scene using two or more images of the 3D scene, each acquired from a different viewpoint in

More information

On-line and Off-line 3D Reconstruction for Crisis Management Applications

On-line and Off-line 3D Reconstruction for Crisis Management Applications On-line and Off-line 3D Reconstruction for Crisis Management Applications Geert De Cubber Royal Military Academy, Department of Mechanical Engineering (MSTA) Av. de la Renaissance 30, 1000 Brussels geert.de.cubber@rma.ac.be

More information

Depth Estimation for View Synthesis in Multiview Video Coding

Depth Estimation for View Synthesis in Multiview Video Coding MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Depth Estimation for View Synthesis in Multiview Video Coding Serdar Ince, Emin Martinian, Sehoon Yea, Anthony Vetro TR2007-025 June 2007 Abstract

More information

BIL Computer Vision Apr 16, 2014

BIL Computer Vision Apr 16, 2014 BIL 719 - Computer Vision Apr 16, 2014 Binocular Stereo (cont d.), Structure from Motion Aykut Erdem Dept. of Computer Engineering Hacettepe University Slide credit: S. Lazebnik Basic stereo matching algorithm

More information

Outdoor Scene Reconstruction from Multiple Image Sequences Captured by a Hand-held Video Camera

Outdoor Scene Reconstruction from Multiple Image Sequences Captured by a Hand-held Video Camera Outdoor Scene Reconstruction from Multiple Image Sequences Captured by a Hand-held Video Camera Tomokazu Sato, Masayuki Kanbara and Naokazu Yokoya Graduate School of Information Science, Nara Institute

More information

CONTENT ADAPTIVE SCREEN IMAGE SCALING

CONTENT ADAPTIVE SCREEN IMAGE SCALING CONTENT ADAPTIVE SCREEN IMAGE SCALING Yao Zhai (*), Qifei Wang, Yan Lu, Shipeng Li University of Science and Technology of China, Hefei, Anhui, 37, China Microsoft Research, Beijing, 8, China ABSTRACT

More information

A FAST SEGMENTATION-DRIVEN ALGORITHM FOR ACCURATE STEREO CORRESPONDENCE. Stefano Mattoccia and Leonardo De-Maeztu

A FAST SEGMENTATION-DRIVEN ALGORITHM FOR ACCURATE STEREO CORRESPONDENCE. Stefano Mattoccia and Leonardo De-Maeztu A FAST SEGMENTATION-DRIVEN ALGORITHM FOR ACCURATE STEREO CORRESPONDENCE Stefano Mattoccia and Leonardo De-Maeztu University of Bologna, Public University of Navarre ABSTRACT Recent cost aggregation strategies

More information

segments. The geometrical relationship of adjacent planes such as parallelism and intersection is employed for determination of whether two planes sha

segments. The geometrical relationship of adjacent planes such as parallelism and intersection is employed for determination of whether two planes sha A New Segment-based Stereo Matching using Graph Cuts Daolei Wang National University of Singapore EA #04-06, Department of Mechanical Engineering Control and Mechatronics Laboratory, 10 Kent Ridge Crescent

More information

Fast Stereo Matching using Adaptive Window based Disparity Refinement

Fast Stereo Matching using Adaptive Window based Disparity Refinement Avestia Publishing Journal of Multimedia Theory and Applications (JMTA) Volume 2, Year 2016 Journal ISSN: 2368-5956 DOI: 10.11159/jmta.2016.001 Fast Stereo Matching using Adaptive Window based Disparity

More information

Accurate and Dense Wide-Baseline Stereo Matching Using SW-POC

Accurate and Dense Wide-Baseline Stereo Matching Using SW-POC Accurate and Dense Wide-Baseline Stereo Matching Using SW-POC Shuji Sakai, Koichi Ito, Takafumi Aoki Graduate School of Information Sciences, Tohoku University, Sendai, 980 8579, Japan Email: sakai@aoki.ecei.tohoku.ac.jp

More information

Factorization with Missing and Noisy Data

Factorization with Missing and Noisy Data Factorization with Missing and Noisy Data Carme Julià, Angel Sappa, Felipe Lumbreras, Joan Serrat, and Antonio López Computer Vision Center and Computer Science Department, Universitat Autònoma de Barcelona,

More information

Processing 3D Surface Data

Processing 3D Surface Data Processing 3D Surface Data Computer Animation and Visualisation Lecture 17 Institute for Perception, Action & Behaviour School of Informatics 3D Surfaces 1 3D surface data... where from? Iso-surfacing

More information

Probabilistic Correspondence Matching using Random Walk with Restart

Probabilistic Correspondence Matching using Random Walk with Restart C. OH, B. HAM, K. SOHN: PROBABILISTIC CORRESPONDENCE MATCHING 1 Probabilistic Correspondence Matching using Random Walk with Restart Changjae Oh ocj1211@yonsei.ac.kr Bumsub Ham mimo@yonsei.ac.kr Kwanghoon

More information

EE795: Computer Vision and Intelligent Systems

EE795: Computer Vision and Intelligent Systems EE795: Computer Vision and Intelligent Systems Spring 2012 TTh 17:30-18:45 FDH 204 Lecture 14 130307 http://www.ee.unlv.edu/~b1morris/ecg795/ 2 Outline Review Stereo Dense Motion Estimation Translational

More information

A Performance Study on Different Cost Aggregation Approaches Used in Real-Time Stereo Matching

A Performance Study on Different Cost Aggregation Approaches Used in Real-Time Stereo Matching International Journal of Computer Vision 75(2), 283 296, 2007 c 2007 Springer Science + Business Media, LLC. Manufactured in the United States. DOI: 10.1007/s11263-006-0032-x A Performance Study on Different

More information

STRUCTURAL EDGE LEARNING FOR 3-D RECONSTRUCTION FROM A SINGLE STILL IMAGE. Nan Hu. Stanford University Electrical Engineering

STRUCTURAL EDGE LEARNING FOR 3-D RECONSTRUCTION FROM A SINGLE STILL IMAGE. Nan Hu. Stanford University Electrical Engineering STRUCTURAL EDGE LEARNING FOR 3-D RECONSTRUCTION FROM A SINGLE STILL IMAGE Nan Hu Stanford University Electrical Engineering nanhu@stanford.edu ABSTRACT Learning 3-D scene structure from a single still

More information

Egemen Tanin, Tahsin M. Kurc, Cevdet Aykanat, Bulent Ozguc. Abstract. Direct Volume Rendering (DVR) is a powerful technique for

Egemen Tanin, Tahsin M. Kurc, Cevdet Aykanat, Bulent Ozguc. Abstract. Direct Volume Rendering (DVR) is a powerful technique for Comparison of Two Image-Space Subdivision Algorithms for Direct Volume Rendering on Distributed-Memory Multicomputers Egemen Tanin, Tahsin M. Kurc, Cevdet Aykanat, Bulent Ozguc Dept. of Computer Eng. and

More information

Project Updates Short lecture Volumetric Modeling +2 papers

Project Updates Short lecture Volumetric Modeling +2 papers Volumetric Modeling Schedule (tentative) Feb 20 Feb 27 Mar 5 Introduction Lecture: Geometry, Camera Model, Calibration Lecture: Features, Tracking/Matching Mar 12 Mar 19 Mar 26 Apr 2 Apr 9 Apr 16 Apr 23

More information

Chaplin, Modern Times, 1936

Chaplin, Modern Times, 1936 Chaplin, Modern Times, 1936 [A Bucket of Water and a Glass Matte: Special Effects in Modern Times; bonus feature on The Criterion Collection set] Multi-view geometry problems Structure: Given projections

More information

Linear stereo matching

Linear stereo matching Linear stereo matching Leonardo De-Maeztu 1 Stefano Mattoccia 2 Arantxa Villanueva 1 Rafael Cabeza 1 1 Public University of Navarre Pamplona, Spain 2 University of Bologna Bologna, Italy {leonardo.demaeztu,avilla,rcabeza}@unavarra.es

More information

Stereo Matching.

Stereo Matching. Stereo Matching Stereo Vision [1] Reduction of Searching by Epipolar Constraint [1] Photometric Constraint [1] Same world point has same intensity in both images. True for Lambertian surfaces A Lambertian

More information

SIMPLE BUT EFFECTIVE TREE STRUCTURES FOR DYNAMIC PROGRAMMING-BASED STEREO MATCHING

SIMPLE BUT EFFECTIVE TREE STRUCTURES FOR DYNAMIC PROGRAMMING-BASED STEREO MATCHING SIMPLE BUT EFFECTIVE TREE STRUCTURES FOR DYNAMIC PROGRAMMING-BASED STEREO MATCHING Michael Bleyer and Margrit Gelautz Institute for Software Technology and Interactive Systems, ViennaUniversityofTechnology

More information

Announcements. Stereo Vision Wrapup & Intro Recognition

Announcements. Stereo Vision Wrapup & Intro Recognition Announcements Stereo Vision Wrapup & Intro Introduction to Computer Vision CSE 152 Lecture 17 HW3 due date postpone to Thursday HW4 to posted by Thursday, due next Friday. Order of material we ll first

More information

Stereo Vision Based Image Maching on 3D Using Multi Curve Fitting Algorithm

Stereo Vision Based Image Maching on 3D Using Multi Curve Fitting Algorithm Stereo Vision Based Image Maching on 3D Using Multi Curve Fitting Algorithm 1 Dr. Balakrishnan and 2 Mrs. V. Kavitha 1 Guide, Director, Indira Ganesan College of Engineering, Trichy. India. 2 Research

More information

Lecture 10: Multi view geometry

Lecture 10: Multi view geometry Lecture 10: Multi view geometry Professor Fei Fei Li Stanford Vision Lab 1 What we will learn today? Stereo vision Correspondence problem (Problem Set 2 (Q3)) Active stereo vision systems Structure from

More information

Improving the accuracy of fast dense stereo correspondence algorithms by enforcing local consistency of disparity fields

Improving the accuracy of fast dense stereo correspondence algorithms by enforcing local consistency of disparity fields Improving the accuracy of fast dense stereo correspondence algorithms by enforcing local consistency of disparity fields Stefano Mattoccia University of Bologna Dipartimento di Elettronica, Informatica

More information

Stereo Matching with Reliable Disparity Propagation

Stereo Matching with Reliable Disparity Propagation Stereo Matching with Reliable Disparity Propagation Xun Sun, Xing Mei, Shaohui Jiao, Mingcai Zhou, Haitao Wang Samsung Advanced Institute of Technology, China Lab Beijing, China xunshine.sun,xing.mei,sh.jiao,mingcai.zhou,ht.wang@samsung.com

More information

Optimizing Monocular Cues for Depth Estimation from Indoor Images

Optimizing Monocular Cues for Depth Estimation from Indoor Images Optimizing Monocular Cues for Depth Estimation from Indoor Images Aditya Venkatraman 1, Sheetal Mahadik 2 1, 2 Department of Electronics and Telecommunication, ST Francis Institute of Technology, Mumbai,

More information

CHAPTER 3 DISPARITY AND DEPTH MAP COMPUTATION

CHAPTER 3 DISPARITY AND DEPTH MAP COMPUTATION CHAPTER 3 DISPARITY AND DEPTH MAP COMPUTATION In this chapter we will discuss the process of disparity computation. It plays an important role in our caricature system because all 3D coordinates of nodes

More information

CS5670: Computer Vision

CS5670: Computer Vision CS5670: Computer Vision Noah Snavely, Zhengqi Li Stereo Single image stereogram, by Niklas Een Mark Twain at Pool Table", no date, UCR Museum of Photography Stereo Given two images from different viewpoints

More information

Stereo imaging ideal geometry

Stereo imaging ideal geometry Stereo imaging ideal geometry (X,Y,Z) Z f (x L,y L ) f (x R,y R ) Optical axes are parallel Optical axes separated by baseline, b. Line connecting lens centers is perpendicular to the optical axis, and

More information

Detecting motion by means of 2D and 3D information

Detecting motion by means of 2D and 3D information Detecting motion by means of 2D and 3D information Federico Tombari Stefano Mattoccia Luigi Di Stefano Fabio Tonelli Department of Electronics Computer Science and Systems (DEIS) Viale Risorgimento 2,

More information

Final project bits and pieces

Final project bits and pieces Final project bits and pieces The project is expected to take four weeks of time for up to four people. At 12 hours per week per person that comes out to: ~192 hours of work for a four person team. Capstone:

More information

Irradiance Gradients. Media & Occlusions

Irradiance Gradients. Media & Occlusions Irradiance Gradients in the Presence of Media & Occlusions Wojciech Jarosz in collaboration with Matthias Zwicker and Henrik Wann Jensen University of California, San Diego June 23, 2008 Wojciech Jarosz

More information

Segmentation Based Stereo. Michael Bleyer LVA Stereo Vision

Segmentation Based Stereo. Michael Bleyer LVA Stereo Vision Segmentation Based Stereo Michael Bleyer LVA Stereo Vision What happened last time? Once again, we have looked at our energy function: E ( D) = m( p, dp) + p I < p, q > We have investigated the matching

More information

What have we leaned so far?

What have we leaned so far? What have we leaned so far? Camera structure Eye structure Project 1: High Dynamic Range Imaging What have we learned so far? Image Filtering Image Warping Camera Projection Model Project 2: Panoramic

More information

A New Parallel Implementation of DSI Based Disparity Computation Using CUDA

A New Parallel Implementation of DSI Based Disparity Computation Using CUDA International Journal of Computer and Communication Engineering, Vol. 3, No. 1, January 2014 A New Parallel Implementation of DSI Based Disparity Computation Using CUDA Aamer Mehmood, Youngsung Soh, and

More information

A Survey of Light Source Detection Methods

A Survey of Light Source Detection Methods A Survey of Light Source Detection Methods Nathan Funk University of Alberta Mini-Project for CMPUT 603 November 30, 2003 Abstract This paper provides an overview of the most prominent techniques for light

More information

A virtual tour of free viewpoint rendering

A virtual tour of free viewpoint rendering A virtual tour of free viewpoint rendering Cédric Verleysen ICTEAM institute, Université catholique de Louvain, Belgium cedric.verleysen@uclouvain.be Organization of the presentation Context Acquisition

More information

Stereo Vision in Structured Environments by Consistent Semi-Global Matching

Stereo Vision in Structured Environments by Consistent Semi-Global Matching Stereo Vision in Structured Environments by Consistent Semi-Global Matching Heiko Hirschmüller Institute of Robotics and Mechatronics Oberpfaffenhofen German Aerospace Center (DLR), 82230 Wessling, Germany.

More information

Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation

Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation Obviously, this is a very slow process and not suitable for dynamic scenes. To speed things up, we can use a laser that projects a vertical line of light onto the scene. This laser rotates around its vertical

More information

Occlusion Detection of Real Objects using Contour Based Stereo Matching

Occlusion Detection of Real Objects using Contour Based Stereo Matching Occlusion Detection of Real Objects using Contour Based Stereo Matching Kenichi Hayashi, Hirokazu Kato, Shogo Nishida Graduate School of Engineering Science, Osaka University,1-3 Machikaneyama-cho, Toyonaka,

More information

Subdivision Of Triangular Terrain Mesh Breckon, Chenney, Hobbs, Hoppe, Watts

Subdivision Of Triangular Terrain Mesh Breckon, Chenney, Hobbs, Hoppe, Watts Subdivision Of Triangular Terrain Mesh Breckon, Chenney, Hobbs, Hoppe, Watts MSc Computer Games and Entertainment Maths & Graphics II 2013 Lecturer(s): FFL (with Gareth Edwards) Fractal Terrain Based on

More information

Improving Border Localization of Multi-Baseline Stereo Using Border-Cut

Improving Border Localization of Multi-Baseline Stereo Using Border-Cut Improving Border Localization of Multi-Baseline Stereo Using Border-Cut Marc-Antoine Drouin Martin trudeau DIRO, Université de Montréal, Canada {drouim,trudeaum,roys}@iro.umontreal.ca Sébastien Roy Abstract

More information

Local Image Registration: An Adaptive Filtering Framework

Local Image Registration: An Adaptive Filtering Framework Local Image Registration: An Adaptive Filtering Framework Gulcin Caner a,a.murattekalp a,b, Gaurav Sharma a and Wendi Heinzelman a a Electrical and Computer Engineering Dept.,University of Rochester, Rochester,

More information

Step-by-Step Model Buidling

Step-by-Step Model Buidling Step-by-Step Model Buidling Review Feature selection Feature selection Feature correspondence Camera Calibration Euclidean Reconstruction Landing Augmented Reality Vision Based Control Sparse Structure

More information

Adaptive-Mesh-Refinement Pattern

Adaptive-Mesh-Refinement Pattern Adaptive-Mesh-Refinement Pattern I. Problem Data-parallelism is exposed on a geometric mesh structure (either irregular or regular), where each point iteratively communicates with nearby neighboring points

More information

Stereo vision. Many slides adapted from Steve Seitz

Stereo vision. Many slides adapted from Steve Seitz Stereo vision Many slides adapted from Steve Seitz What is stereo vision? Generic problem formulation: given several images of the same object or scene, compute a representation of its 3D shape What is

More information

Scene Segmentation by Color and Depth Information and its Applications

Scene Segmentation by Color and Depth Information and its Applications Scene Segmentation by Color and Depth Information and its Applications Carlo Dal Mutto Pietro Zanuttigh Guido M. Cortelazzo Department of Information Engineering University of Padova Via Gradenigo 6/B,

More information

CS 4495/7495 Computer Vision Frank Dellaert, Fall 07. Dense Stereo Some Slides by Forsyth & Ponce, Jim Rehg, Sing Bing Kang

CS 4495/7495 Computer Vision Frank Dellaert, Fall 07. Dense Stereo Some Slides by Forsyth & Ponce, Jim Rehg, Sing Bing Kang CS 4495/7495 Computer Vision Frank Dellaert, Fall 07 Dense Stereo Some Slides by Forsyth & Ponce, Jim Rehg, Sing Bing Kang Etymology Stereo comes from the Greek word for solid (στερεο), and the term can

More information

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching Stereo Matching Fundamental matrix Let p be a point in left image, p in right image l l Epipolar relation p maps to epipolar line l p maps to epipolar line l p p Epipolar mapping described by a 3x3 matrix

More information

DiFi: Distance Fields - Fast Computation Using Graphics Hardware

DiFi: Distance Fields - Fast Computation Using Graphics Hardware DiFi: Distance Fields - Fast Computation Using Graphics Hardware Avneesh Sud Dinesh Manocha UNC-Chapel Hill http://gamma.cs.unc.edu/difi Distance Fields Distance Function For a site a scalar function f:r

More information

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction Yongying Gao and Hayder Radha Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48823 email:

More information

Stereo: the graph cut method

Stereo: the graph cut method Stereo: the graph cut method Last lecture we looked at a simple version of the Marr-Poggio algorithm for solving the binocular correspondence problem along epipolar lines in rectified images. The main

More information

Fast Natural Feature Tracking for Mobile Augmented Reality Applications

Fast Natural Feature Tracking for Mobile Augmented Reality Applications Fast Natural Feature Tracking for Mobile Augmented Reality Applications Jong-Seung Park 1, Byeong-Jo Bae 2, and Ramesh Jain 3 1 Dept. of Computer Science & Eng., University of Incheon, Korea 2 Hyundai

More information

Nonrigid Surface Modelling. and Fast Recovery. Department of Computer Science and Engineering. Committee: Prof. Leo J. Jia and Prof. K. H.

Nonrigid Surface Modelling. and Fast Recovery. Department of Computer Science and Engineering. Committee: Prof. Leo J. Jia and Prof. K. H. Nonrigid Surface Modelling and Fast Recovery Zhu Jianke Supervisor: Prof. Michael R. Lyu Committee: Prof. Leo J. Jia and Prof. K. H. Wong Department of Computer Science and Engineering May 11, 2007 1 2

More information

Fast Stereo Matching of Feature Links

Fast Stereo Matching of Feature Links Fast Stereo Matching of Feature Links 011.05.19 Chang-il, Kim Introduction Stereo matching? interesting topics of computer vision researches To determine a disparity between stereo images A fundamental

More information

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching Stereo Matching Fundamental matrix Let p be a point in left image, p in right image l l Epipolar relation p maps to epipolar line l p maps to epipolar line l p p Epipolar mapping described by a 3x3 matrix

More information

Accurate 3D Face and Body Modeling from a Single Fixed Kinect

Accurate 3D Face and Body Modeling from a Single Fixed Kinect Accurate 3D Face and Body Modeling from a Single Fixed Kinect Ruizhe Wang*, Matthias Hernandez*, Jongmoo Choi, Gérard Medioni Computer Vision Lab, IRIS University of Southern California Abstract In this

More information

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science. Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Stereo Vision 2 Inferring 3D from 2D Model based pose estimation single (calibrated) camera > Can

More information

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923 Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923 Teesta suspension bridge-darjeeling, India Mark Twain at Pool Table", no date, UCR Museum of Photography Woman getting eye exam during

More information

Texture Sensitive Image Inpainting after Object Morphing

Texture Sensitive Image Inpainting after Object Morphing Texture Sensitive Image Inpainting after Object Morphing Yin Chieh Liu and Yi-Leh Wu Department of Computer Science and Information Engineering National Taiwan University of Science and Technology, Taiwan

More information

Motion Estimation. There are three main types (or applications) of motion estimation:

Motion Estimation. There are three main types (or applications) of motion estimation: Members: D91922016 朱威達 R93922010 林聖凱 R93922044 謝俊瑋 Motion Estimation There are three main types (or applications) of motion estimation: Parametric motion (image alignment) The main idea of parametric motion

More information

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence CS4495/6495 Introduction to Computer Vision 3B-L3 Stereo correspondence For now assume parallel image planes Assume parallel (co-planar) image planes Assume same focal lengths Assume epipolar lines are

More information

Bilateral and Trilateral Adaptive Support Weights in Stereo Vision

Bilateral and Trilateral Adaptive Support Weights in Stereo Vision Cost -based In GPU and Support Weights in Vision Student, Colorado School of Mines rbeethe@mines.edu April 7, 2016 1 / 36 Overview Cost -based In GPU 1 Cost 2 3 -based 4 In GPU 2 / 36 Cost -based In GPU

More information

One category of visual tracking. Computer Science SURJ. Michael Fischer

One category of visual tracking. Computer Science SURJ. Michael Fischer Computer Science Visual tracking is used in a wide range of applications such as robotics, industrial auto-control systems, traffic monitoring, and manufacturing. This paper describes a new algorithm for

More information

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from?

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from? Binocular Stereo Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the depth information come from? Binocular stereo Given a calibrated binocular stereo

More information

A Comparative Study of Stereovision Algorithms

A Comparative Study of Stereovision Algorithms A Comparative Study of Stereovision Algorithms Elena Bebeşelea-Sterp NTT DATA ROMANIA Sibiu, Romania Raluca Brad Faculty of Engineering Lucian Blaga University of Sibiu Sibiu, Romania Remus Brad Faculty

More information

Stereo Correspondence by Dynamic Programming on a Tree

Stereo Correspondence by Dynamic Programming on a Tree Stereo Correspondence by Dynamic Programming on a Tree Olga Veksler University of Western Ontario Computer Science Department, Middlesex College 31 London O A B7 Canada olga@csduwoca Abstract Dynamic programming

More information

2D rendering takes a photo of the 2D scene with a virtual camera that selects an axis aligned rectangle from the scene. The photograph is placed into

2D rendering takes a photo of the 2D scene with a virtual camera that selects an axis aligned rectangle from the scene. The photograph is placed into 2D rendering takes a photo of the 2D scene with a virtual camera that selects an axis aligned rectangle from the scene. The photograph is placed into the viewport of the current application window. A pixel

More information

A MULTI-RESOLUTION APPROACH TO DEPTH FIELD ESTIMATION IN DENSE IMAGE ARRAYS F. Battisti, M. Brizzi, M. Carli, A. Neri

A MULTI-RESOLUTION APPROACH TO DEPTH FIELD ESTIMATION IN DENSE IMAGE ARRAYS F. Battisti, M. Brizzi, M. Carli, A. Neri A MULTI-RESOLUTION APPROACH TO DEPTH FIELD ESTIMATION IN DENSE IMAGE ARRAYS F. Battisti, M. Brizzi, M. Carli, A. Neri Università degli Studi Roma TRE, Roma, Italy 2 nd Workshop on Light Fields for Computer

More information

Performance of Stereo Methods in Cluttered Scenes

Performance of Stereo Methods in Cluttered Scenes Performance of Stereo Methods in Cluttered Scenes Fahim Mannan and Michael S. Langer School of Computer Science McGill University Montreal, Quebec H3A 2A7, Canada { fmannan, langer}@cim.mcgill.ca Abstract

More information

Multigrid Pattern. I. Problem. II. Driving Forces. III. Solution

Multigrid Pattern. I. Problem. II. Driving Forces. III. Solution Multigrid Pattern I. Problem Problem domain is decomposed into a set of geometric grids, where each element participates in a local computation followed by data exchanges with adjacent neighbors. The grids

More information

Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters

Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters Marshall F. Tappen William T. Freeman Computer Science and Artificial Intelligence Laboratory Massachusetts Institute

More information

On Building an Accurate Stereo Matching System on Graphics Hardware

On Building an Accurate Stereo Matching System on Graphics Hardware On Building an Accurate Stereo Matching System on Graphics Hardware Xing Mei 1,2, Xun Sun 1, Mingcai Zhou 1, Shaohui Jiao 1, Haitao Wang 1, Xiaopeng Zhang 2 1 Samsung Advanced Institute of Technology,

More information

3D Editing System for Captured Real Scenes

3D Editing System for Captured Real Scenes 3D Editing System for Captured Real Scenes Inwoo Ha, Yong Beom Lee and James D.K. Kim Samsung Advanced Institute of Technology, Youngin, South Korea E-mail: {iw.ha, leey, jamesdk.kim}@samsung.com Tel:

More information

DETECTION AND ROBUST ESTIMATION OF CYLINDER FEATURES IN POINT CLOUDS INTRODUCTION

DETECTION AND ROBUST ESTIMATION OF CYLINDER FEATURES IN POINT CLOUDS INTRODUCTION DETECTION AND ROBUST ESTIMATION OF CYLINDER FEATURES IN POINT CLOUDS Yun-Ting Su James Bethel Geomatics Engineering School of Civil Engineering Purdue University 550 Stadium Mall Drive, West Lafayette,

More information

A Local Iterative Refinement Method for Adaptive Support-Weight Stereo Matching

A Local Iterative Refinement Method for Adaptive Support-Weight Stereo Matching A Local Iterative Refinement Method for Adaptive Support-Weight Stereo Matching Eric T. Psota, Jędrzej Kowalczuk, Jay Carlson, and Lance C. Pérez Department of Electrical Engineering, University of Nebraska,

More information

Motion Estimation using Block Overlap Minimization

Motion Estimation using Block Overlap Minimization Motion Estimation using Block Overlap Minimization Michael Santoro, Ghassan AlRegib, Yucel Altunbasak School of Electrical and Computer Engineering, Georgia Institute of Technology Atlanta, GA 30332 USA

More information

Real-time stereo reconstruction through hierarchical DP and LULU filtering

Real-time stereo reconstruction through hierarchical DP and LULU filtering Real-time stereo reconstruction through hierarchical DP and LULU filtering François Singels, Willie Brink Department of Mathematical Sciences University of Stellenbosch, South Africa fsingels@gmail.com,

More information