There exist several goals for three-dimensional (3-D) digitization

Size: px
Start display at page:

Download "There exist several goals for three-dimensional (3-D) digitization"

Transcription

1 [ Fabio Remondino, Sabry F. El-Hakim, Armin Gruen, and Li Zhang ] Turning Images into 3-D Models [Developments and performance analysis of image matching for detailed surface reconstruction of heritage objects] There exist several goals for three-dimensional (3-D) digitization and modeling of natural and cultural heritage objects, such as for accurate detailed documentation, digital preservation and restoration, physical replicas, virtual tourism, and research or education. We focus on the detailed documentation aspect. Capturing the 3-D data requires a technique that is highly accurate; portable due to accessibility problems; low cost due to limited budgets; fast due to the usually short allowed time on the site so as not to disturb works or visitors; and flexible and scalable due to the wide variety and sizes of sites and heritage objects. It is also important that the technique captures dense 3-D data on the necessary surface elements to guarantee a realistic experience close up or to monitor surface condition over time. Furthermore, the fine geometric details are needed because even with rich texture maps, models without these details will exhibit too smooth and flat-looking surfaces or polygonized silhouettes that are easily detected by the human eye. Although image-based modeling techniques can produce accurate and realisticlooking models, practical systems are still highly interactive since current fully automated methods are still unproven in real applications and may not guarantee accurate results. As an imagebased measurement technique, photogrammetry has for many years been dealing with accurate 3-D reconstruction of objects. Even though it was often perceived as time 1995 MASTER SERIES Digital Object Identifier /MSP /08/$ IEEE IEEE SIGNAL PROCESSING MAGAZINE [55] JULY 2008

2 consuming and complicated, the heritage community is starting to consider it for precise digital documentation as a promising, less costly, and practical alternative to range sensors. Thus, efforts to increase the level of automation are continuing. Dense matching of image elements over two or more images is necessary to capture the geometric details in 3-D, but it is a challenging and often ill-posed problem. However, there are several proven strategies for example, limiting the search to the epipolar line, working with hierarchies of image pyramids, matching features and edges first to provide better approximations for successive area-based matching (ABM), using a region-growing algorithm from initial seed points, computing intermediate depth maps, and using more than two images for higher reliability. Subpixel precision is an important factor as only a pixel precision creates a noticeable systematic stepping (quantization error) that gives undesirable visual effects. But in spite of all the advances, some problems still remain and user interactions are still needed. A fully automated, precise, and reliable method adaptable to different image sets and scene contents is not available, in particular for convergent and wide baseline images. However, it has been proven that a successful image matcher should do the following: 1) use accurately calibrated cameras and images with strong geometric configuration, 2) use local and global image information to extract all the possible matching candidates and get global consistency among the matching candidates, 3) use constraints to restrict the search space, 4) consider an estimated shape of the object as a priori information, and 5) employ strategies to monitor the matching results. With this in mind, in this article we propose a multistage image-based modeling approach that requires only a limited amount of human interactivity and is capable of capturing the fine geometric details with similar accuracy as close-range active range sensors. It can also cope with wide baselines using several advancements over standard stereo matching techniques. Our approach is sequential, starting from a sparse basic segmented model created with a small number of interactively measured points. This model, specifically the equation of each surface, is then used as a guide to automatically add the fine details. The following three techniques are used, each where best suited, to retrieve the details: 1) For regularly shaped patches such as planes, cylinders, or quadrics, we apply a fast relative stereo matching technique. 2) For more complex or irregular segments with unknown shape, we use a global multi-image geometrically constrained technique. 3) For segments unsuited for stereo matching, we employ depth from shading (DFS). The goal is not the development of a fully automated procedure for 3-D object reconstruction from image data (e.g., structure from motion) or a sparse stereo approach, but we aim at the digital reconstruction of detailed and accurate surfaces from calibrated and oriented images for practical daily documentation and digital conservation of wide variety of heritage objects (Figure 1). AN OVERVIEW OF IMAGE MATCHING TECHNIQUES Since a large body of work on image-based 3-D modeling and image matching exists and the techniques are definitely scene and application dependent, we focus here on those approaches that were practically applied to cultural heritage objects and sites. We also report only on methods for creating detailed geometric models. Thus image-based rendering (IBR), which skips geometric modeling, surveying, computer-aided design (CAD) techniques, or visual hull, factorization methods and other approaches that yield mainly a sparse set of 3-D points, will not be covered. (a) (b) [FIG1] (a) Typical cultural heritage objects requiring (b) detailed and accurate 3-D models for documentation, conservation, analysis, restoration, or replica purposes. IEEE SIGNAL PROCESSING MAGAZINE [56] JULY 2008

3 The fully automated structure from motion 3-D modeling procedures widely reported in the vision community [1], [2] require very short intervals between consecutive images to guarantee constant illumination and scale between successive images. This large number of closely spaced images can be a problem in an archaeological area, large heritage sites, or in areas with limited accessibility. The short baseline also results in large depth errors unless the points are tracked over a large image sequence. Large errors up to 5% (one part in 20) were reported, limiting the use of these methods to simple visualization applications. On the other hand, wide baselines between images give rise to image-scale variations and occlusion problems. Different strategies have been proposed for stereo matching with wide baselines [3] [5], but a complete accuracy analysis is still lacking. Image matching is generally defined as the establishment of correspondences between two or more images to reconstruct surfaces in 3-D. In order to extract these correspondences, the primitives to be matched must be defined. Afterwards, a similarity measure is computed and evaluated between primitive pairs, and then a disparity map or 3-D point cloud can be generated. Image matching remains an active area of research even after four decades of activity. The main reason is the difficulty in finding a unique match (multiple possible correct matches) or no match at all (partly occluded in one image or looks very different due to light and geometric variation), which creates a primarily ill-posed problem. Illposed problems can be converted into well-posed problems by introducing constraints. In general, there are three forms of constraints unary, binary, and N-ary constraints. A unary constraint (like the epipolar or collinearity constraints [6] or the similarity constraint [7]) represents the similarity or likelihood of individual matches. The surface smoothness and uniqueness constraint can be seen as a binary constraint. N- ary constraints are usually represented with mutually topological relations among matches. Recent overviews on stereo matching can be found in [8] and [9], while [10] compared multi-image matching techniques. References [8] and [10] classified the different stereo and multiview matching algorithms according to six fundamental properties: the scene representation, photoconsistency measure, visibility model, shape prior, reconstruction algorithm, and initialization requirements. On the other hand, we consider the two main classes of matching primitives, i.e., image intensity patterns (windows composed of grey values around a point of interest) and features (edges and regions), which are then transformed into 3-D information through a mathematical model (e.g., collinearity model or camera projection matrix). According to these primitives, the resulting matching algorithms are generally classified as ABM or feature-based matching (FBM). ABM ABM, also called signal-based matching, is the more traditional approach. It is justified by the continuity assumption, which asserts that at a certain level of resolution where image matching is performed, most of the image window depicts a portion of a continuous and planar surface element. Therefore, adjacent pixels in the image window will generally represent contiguous points in object space. In ABM, each point to be matched is the center of a small window of pixels (patch) in a reference image (template) which is statistically compared with equally sized windows of pixels in another (target) image. The measure of match is either a difference metric that is minimized, such as RMS difference, or more commonly a similarity measure that is maximized. ABM is usually based on local squared or rectangular windows. In its oldest form, area-based image matching was performed with cross-correlation and the correlation coefficient as a similarity measure. Cross-correlation works fast and well if the patches contain enough signal without too much high-frequency content (noise) and if geometrical and radiometric distortions are minimal. To overcome these problems, image reshaping parameters and radiometric corrections were considered, leading to the well-known nonlinear least squares matching (LSM) estimation procedure [6]. The location and shape of the matched window is estimated with respect to some initial values and computed until the grey-level differences between the deformed patch and the template one reach a minimum. That is, the goal function to be minimized is the L2- norm of the residuals of least squares estimation, although several investigations have shown that L1-norm or absolute deviation (LAD) can be used to improve the accuracy of the estimation problem in the presence of outlier pixels in the data. References [6], [11], and [12] introduced multiphoto geometrically constrained (MPGCs) matching and the use of additional constraints into the image matching and the surface reconstruction process. Recently, the MPGC-based framework was used in a reformulated version in [4]. ABM was also generalized from image to object space, introducing the concept of groundel or surfel [13]. ABM, especially the LSM method with its subpixel capability, has a very high accuracy potential (up to 1/50 pixel) if welltextured image patches are used. Disadvantages of ABM are the need for small searching range for successful matching, the large data volume which must be handled and, in the case of LSM, the requirement of good initial values for the unknown parameters, although this is not the case for other techniques such as graph-cut [8]. Problems occur in areas with occlusions, areas with a lack of or repetitive texture, or if the surface does not correspond to the assumed model (e.g., planarity of the matched local surface patch). FBM FBM determines the image correspondence using image features. FBM comprises two stages: 1) the detection of interesting features and their attributes in all images and 2) the determination of the corresponding features using particular similarity measures. The two stages are related to each other in the sense that the feature extraction and attributes computation must be such that the correspondence determination is easy, precise, and not sensitive to scale and IEEE SIGNAL PROCESSING MAGAZINE [57] JULY 2008

4 orientation. A good feature for image matching should be distinct with respect to its neighborhood, invariant with respect to geometric and radiometric influences, stable with respect to noise, and unique with respect to other features. There are three types of features: 1) Interest Points [14], [15]: Interest point detectors are generally divided into contour-based methods, which search for maximal curvature or inflexion points along the contour chains; signal-based methods, which analyze the image signal and derive a measure which indicates the presence of an interest point; and methods based on template fitting which try to fit the image signal to a parametric model of a specific type of interest point (e.g., a corner). The main properties of a point detector are: 1) accuracy, or the ability to detect a pattern at its correct pixel location; 2) stability, or the ability to detect the same feature after the image undergoes some geometrical transformation (e.g., rotation or scale) or illumination changes; 3) sensitivity, or the ability to detect feature points in low contrast conditions; and 4) controllability and speed, or the number of parameters controlling the operator and the time required to identify features. 2) Edges [16]: The key for edge extraction is the intensity change, which is shown via the gradient of the image. Edge detectors usually follow the same steps: smoothing, applying edge enhancement filters, applying a threshold, and edge tracing. Then the most widely used methods involve edgel linking and segmentation. The Canny detector [17] is probably the most widely used edge detector and very suitable due to its performance and low sensitivity to parameter variation. Lines (edgel) provide more geometric Camera Calibration Edit and Fill Holes Ordering and Preprocessing Tie Point Extraction Image Registration Seed Points Segmentation Shape Properties Initial Basic Model Fine Details with Dense Matching and DFS Final Textured Model Rendering/Visualization [FIG2] The modeling pipeline for object reconstruction from images. information than single points and are also useful in the surface reconstruction (e.g., as breaklines) to avoid smoothing effects on the object edges. 3) Regions [18]: Regions are homogeneous areas of the images with intensity variations below a certain threshold. Image regions should be invariant under certain transformations. Under a generic camera movement (e.g., translation), the most common transformation is an affinity, but also scale-invariant detectors have been proposed. Generally, an interest point detector is used to localize the points, and afterwards an invariant region is extracted around each point. Features are first extracted and afterwards associated with attributes ( descriptors ) to characterize and match them [15]. A typical strategy to match characterized features is the computation of the Euclidean or Mahalanobis distance between the descriptor elements. Larger (or global) features are called structures and are usually composed of different local features. Matching with global features is also referred to as relational or structural matching [19]. It establishes a correspondence from the primitives of one structural description to the primitives of a second structural description. Besides the attributes of the local features, relations between these features are introduced to characterize global features and establish the correspondence for example, 1) geometric relations (e.g., the angle between two adjacent polygon sides or the minimum distance between two edges), 2) radiometric relations (e.g., the differences in average grey value or grey value variance between two adjacent regions), and 3) topologic relations (e.g., the notion that one feature is contained in another). Since relational matching techniques use not only image features but also geometrical or topological relations among the features to determine the correspondence, the image matching tasks can be fully automated without any initial estimates or very coarse ones. Relational matching can be approached in many ways, relying on graph searching techniques, energy minimization, or relaxation labeling techniques [20]. FBM is often used as an alternative method or combined with ABM. Compared to ABM, FBM techniques are more flexible with respect to surface discontinuities, less sensitive to image noise, and require less approximate values. The accuracy of FBM is limited by the accuracy of the feature extraction process. Also, because of the sparse and irregularly distributed nature of the Interactive extracted features, the matching results Fully Automatic in general are sparse point clouds and postprocessing procedures like interpolation need to be performed. Widely Separated Images Constraints 3-D Reconstruction IEEE SIGNAL PROCESSING MAGAZINE [58] JULY 2008

5 DETAILED SURFACE RECONSTRUCTION METHODOLOGY Our approach (Figure 2) is a step-wise method which has been under constant development and improvement for many years. The advances came mainly due to the fact that it has been used on several heritage objects and sites (Figure 1) with a high degree of complexity, each providing new challenges that required solutions. We assume that the cameras are precalibrated, the images are captured with the same camera settings as for the calibration, and the images are oriented with subpixel accuracy. The user must then segment the scene to remove unwanted regions, such as background, and divide the object or site into regions to improve the matching and modeling process. Indeed, scene segmentation [21] reduces processing time and helps in the modeling step, regardless of the object size and complexity. CREATING THE INITIAL MODEL Modeling architectural structures or archaeological finds and sites requires a sparse model to be created first and then the fine geometric details to be added in a second step. In the literature, the two-step approach is quite common. The first step, interactive or automated, produces a basic model using assumptions on the object s shape or camera interior parameters, while the second step uses dense stereo matching to add details. For architectural structures, we create basic models of surface elements such as planar walls, quadrics, and cylindrical shapes like columns, arches, doors, or windows using an approach initially developed in [22]. It is based on photogrammetric bundle adjustment and uses knowledge about surface shapes such as being planes, cylinders, parallel, symmetric, etc. For archaeological objects, we define the raw geometry of the object using some seed points located in the main discontinuity areas. 3-D MODELING OF THE FINE GEOMETRIC DETAILS Using the initial sparse model (namely, surface equations) as a guide and knowing the camera calibration and orientation parameters, we developed an automated procedure to extract the fine details with high-resolution meshes and achieve accurate documentation with photo-realistic visualization. The following three techniques are used, each where best suited: 1) a relative stereo matching technique for patches with regular shape, fitting an implicit function (e.g., plane, cylinder, or quadric) 2) a multi-image matching technique for irregular patches with unknown approximate function and using seed points 3) DFS for patches unsuited for stereo or multi-image matching (e.g., untextured patches). STEREO MATCHING Stereo matching works best when sufficient texture variations or localized features are present on the surface. Therefore, we first analyze the intensity level of the template window to select the areas where stereo matching will be applied. This includes mean, standard deviation, and second derivative of the grey levels of the pixels in the window. If those are higher than preset thresholds, the stereo matching will proceed; otherwise, we consider the region to be too uniform for stereo matching and switch to DFS, which works best on smoothly shaded surfaces. The relative stereo matching approach reduces the problems by using the basic model to narrow the search for matching. The procedure is as follows for each segment with known fitting function: A high-resolution approximate mesh of triangulated 3-D points, which can be as dense as one vertex per pixel, is placed automatically on each segment according to its fitting function. The coordinates of the approximate mesh from the basic model are replaced with the final coordinates from the stereo matching. In fact, the technique computes only the correction to the points. The stereo matching minimizes the normalized squares of the difference between the template and the search window. The search is done along the epipolar line, limiting the search to a disparity range computed from the basic model. The window in the search image is re-sampled to take into account the difference in orientation between the two images and surface orientation of the basic model. This accounts for the geometric variations between these two images and gives accurate and reliable results. MULTIPHOTO GEOMETRICALLY CONSTRAINED (MPGC) MATCHING The stereo matching approach, although fast and effective for relatively flat surfaces, requires an approximate surface shape. However, for irregular surfaces like archaeological finds and sculptures, the approximate shape is unknown. Therefore, an extended, albeit slower, more global approach that does not require knowledge of an approximate surface has been developed. It is based on nonlinear least-squares estimation and simultaneously uses more than two images to increase its precision and reliability by matching the point in all the images it appears in. The multi-image matching approach was originally developed for the processing of very high-resolution linear array images [23] and afterwards modified to accommodate any linear array sensor [24], as for instance available with satellite images. Now it has been extended to process other image data such as the traditional aerial photos or convergent close-range images. The multi-image approach, based on the MPGC framework of [6], [11], and [12], uses a coarse-to-fine hierarchical solution with an effective combination of several image matching algorithms and automatic quality control. The approach performs three mutually connected steps: 1) Image Preprocessing: The set of available images is processed with an adaptive smoothing filter [25] to reduce the noise level and possible radiometric problems such as strong bright and dark regions and to optimize the images for subsequent feature extraction and image matching. An enhancement filter [26] is also applied to strongly enhance IEEE SIGNAL PROCESSING MAGAZINE [59] JULY 2008

6 [FIG3] Examples of image preprocessing with adaptive smoothing and enhancement filter in dark areas and untextured regions. and sharpen the already existing texture patterns. The filter adjusts brightness values in local areas so that the local mean and standard deviation match user-specified target values (Figure 3). Finally, image pyramids are generated to work with several versions of the image having progressively changing spatial resolutions. 2) Multiple Primitive Multi-Image (MPM) Matching: Multiple matching primitives (interest points, edges, and grid points) are used. Interest points are suitable to generate accurate surface models, but they suffer from noise, occlusions, and discontinuities. Edges generate coarser but more stable models as they have higher semantic information and (a) [FIG4] Multiphoto matching example, with the template image, the resampled image patches at the end of the least squares estimation, and the search images with the epipolar line. (c) (b) are more tolerant to noise. A regular image grid is used in addition to overcome problems in low-texture areas. The MPM matching performs three operations at each pyramid level: 1) point and edge extraction and matching, 2) integration of matching primitives, and 3) initial mesh generation. Feature points are interest points extracted with the Lue operator [27] and the dominant points of the edges (extracted with Canny operator [17]), computed through a polygon approximation algorithm. Within the pyramid levels, the feature matching is performed with an extension of the standard cross-correlation [geometrically constrained crosscorrelation (GC 3 )] technique. The multi-image matching is guided from object space, knowing an approximate surface model, and allows reconstruction of 3-D object coordinates from all available images simultaneously. Having a set of images, the central image is chosen as reference and others serve as the search images. The normalized cross-correlation (NCC) coefficient is used as the similarity measure between the image windows in the reference and one of the search images. Compared to the traditional crosscorrelation method, NCC in the GC 3 algorithm is computed with respect to the height value Z in object space rather than the disparity in image space, so that the NCC functions of all individual stereo image pairs can be integrated in a single framework. Then, following [28], instead of computing the correct match of a point P by evaluating the individual NCC functions between the reference and search images, we define the sum of NCC (SNCC) for a point P, with respect to Z, finding the Z value which maximizes the SNCC function. For the edge matching, a preliminary list of IEEE SIGNAL PROCESSING MAGAZINE [60] JULY 2008

7 candidates is built up based on the similarity measures and Self-tuning matching parameters: These parameters some constraints (epipolar geometry as well as the approxiare automatically determined by analyzing the results of mated digital surface model derived from the higher level of the higher-level image pyramid matching and using them image pyramid) to restrict the number of possible matches. at the current pyramid level. These parameters include Then the GC3 algorithm is applied, with further similarity measures computed from the geometric and region attributes of the edges and consistency checking to remove wrong correspondences. The consistency checking is based on the figural continuity constraint and it is performed in a local neighborhood along the edges and solved by a probability relaxation method. 3) Refined Matching: At the original image resolution level, an MPCG LSM (Figure 4) and least squares B-spline snakes (a) (b) [29] can be used as an option to achieve potentially subpixel accuracy matches and identify some inaccurate and possibly false matches. With the LSB snakes, the edges in object space are represented with linear B-spline functions whose parameters are directly estimated, together with the matching parameters in the image spaces of multiple images. The surface derived from the previous MPM step provides good (c) (d) enough approximations for the two matching methods and increases the convergence rate. [FIG5] (a) Original image (out of three) of a bronze low-relief, (b) The main characteristics of the multi-image-based matching extracted 3-D edges, and derived surface model, shown in (c) shaded and (d) color-code mode. procedure are: Truly multiple image matching: A point is matched simultaneously in all the images where it is visible, and exploiting the collinearity constraint, the 3-D coordinates are directly computed together with their accuracy values. The multiple solution and occlusion problems are reduced with the multiimage approach, removing ambiguities with the multiple epipolar line intersections. Matching with multiple primitives: The method takes advantage of both area-based matching and feature-based matching techniques and uses both local and global image information. In particular, it combines an edge matching method with a point matching method through a probability-relaxation-based relational matching process (a) (b) (c) (Figure 5). Edges generate coarser but more stable models as they have higher semantic information and they are [FIG6] Model of a relief at a Dresden site dense matching reconstruction on five images, shown as (a) textured, (b) colormore tolerant to image noise. Feature points are instead coded, and (c) shaded 3-D model. suitable to generate dense and accurate surface models, but they suffer from problems caused by image noise, low texture, occlusions, and discontinuities. Therefore, the combination of feature points and grid points is necessary since the grid points can be used to fill gaps in areas of poor or no textures. Moreover, their combination draws a compromise between the optimal requirement for precise and reliable matching and the optimal requirement for [FIG7] The 3-D documentation of marble bas reliefs using widely separated images, visualized as color-coded, shaded, and textured model. point distribution. IEEE SIGNAL PROCESSING MAGAZINE [61] JULY 2008

8 the size of the correlation window, the search distance, and the threshold values. The adaptive determination of the matching parameters results in higher success rate and less mismatches. High matching redundancy: By exploiting the multiimage concept, highly redundant matching results are obtained. This also allows automatic blunder detection, and mismatches can be detected and deleted through the analysis and consistency checking within a small neighborhood. [FIG8] The 3-D modeling of a church apse from three convergent images, shown as shaded, color-coded, and textured model. [FIG9] DFS results on stone itching/carving. DFS DFS is applied where grey-level variations are not adequate for stereo/ multiphoto matching or for areas appearing only in a single image. Standard shape from shading techniques, which compute surface normals, lacked success in actual applications due to the ill-posed formulation that requires unrealistic assumptions, such as that the camera looks orthogonally at a Lambertian surface, and there is only one single light source located at infinity [30]. Our approach computes the depth directly, rather than the surface normal. It is applied to a work image: a grey-level version of the original with some preprocessing such as noise removal filtering and editing of unwanted shades. Using known depth and grey level at 8 10 points determined interactively, we form a curve describing the relation between grey levels and depth variation from the basic model. The curve intersects the grey-level axis at the average intensity value of points actually falling on the basic model. By adjusting the curve, the results can be instantly reviewed. We adjust the coordinates of the grid points on the surface of the basic model segment according to shading using this curve. EXAMPLES AND PERFORMANCE EVALUATION We have performed different tests retrieving 3-D models from several cultural heritage data sets. We show how our approach and methodology IEEE SIGNAL PROCESSING MAGAZINE [62] JULY 2008

9 can accurately document heritages of different size, shape complexity, and types of detail (Figures 6 9). The different datasets contain both short baseline and wide baseline images, and textured and untextured surfaces. We also performed a quantitative evaluation of the accuracy of our matching approach using several test objects, one of which is shown in Figure 10(a), in a controlled lab environment. The lab allowed us to compare the results with ground truth under the same measurement conditions. For ground truth, the objects were scanned with two highly accurate closerange laser scanners: the Surphaser HS25X (0.48 mm accuracy, phase-shift measurement principle) and the ShapeGrabber 502 (0.42 mm accuracy, triangulation based). The same objects were then modeled with both matching techniques described previously. To compare these models with ground truth data, we used PolyWorks Inspector software. Color-coded results of the comparison are illustrated in Figure 10(b). The standard deviation of the differences between the scanned model and the image-based model was, on average, 0.54 mm (Surphaser) and 0.52 mm (ShapeGrabber) for all data sets. The error distribution shows that the largest errors are consistently near boundaries and at sharp surface gradients. All the different methodologies, including the laser scanners used for ground truth, also have problems in those areas. Another accuracy and performance evaluation test was performed with a small statue, about 15 cm high. The imagematched model, generated using 25 images, has been compared with range data acquired with a Breuckmann stripe projection system (Opto-Top SE, feature accuracy of 50 μm). The color-coded comparison of the two results gives a standard deviation of 0.17 mm (Figure 11). (a) size or shape and accurately retrieve all the fine geometric details from calibrated and oriented images. The fast stereo pair approach constrains the search of correspondences along the epipolar line, while the 3-D coordinates of points and matched edges are computed in a second phase using rejection criteria for the forward ray intersection. On the other hand, the multiimage approach is more reliable and precise but requires fairly accurate image orientation parameters to exploit the collinearity constraint within the LSM estimation. It uses points and edges to retrieve all the surface details, and it can be applied to short or wide baseline images. It can cope with scale and other geometry changes, different illumination conditions or repeated patterns, and occlusion thanks to improved reliability by the multi-image approach. The accuracy evaluations (relative accuracy 1:1,000) was based on i) the standard deviations provided by the registration of the ground truth data with the photogrammetrically reconstructed surface and ii) the graphical display (color-coded) of the differences. A standard methodology for the performance evaluation of the results should be developed like those available for the traditional surveying or [FIG10] (a) Lab test object with the derived image-based model and (b) color-coded difference between scanned model and image matched model. (b) CONCLUSIONS Three-dimensional imagebased modeling of heritages is a very interesting topic with many possible applications. In this article, we reported our methodology for detailed surface measurement and reconstruction. We presented a two-step procedure able to model complex objects of any [FIG11] A small statue (approximately 15 cm high and 9 cm wide) modeled with 25 images (12 megapixel each with a 28-mm objective). A closer view of the reconstructed upper details is also presented, together with the 3-D comparison results (std = 0.17 mm). IEEE SIGNAL PROCESSING MAGAZINE [63] JULY 2008

10 CMM. Apart from standards, comparative data and best practices are also needed to show not only advantages, but also limitations of systems and algorithms. A good example is provided by the German VDI/VDE 2634 guideline for close-range optical 3-D vision systems (particularly for full-frame range cameras and single scan) while the American ASTM/E57 committee is trying to develop standards for 3-D imaging systems. Nevertheless, our accuracy evaluations demonstrated the potential of the photogrammetric approach, and in particular of the proposed matching strategy, for the documentation of cultural heritage. Indeed, photogrammetry has all the potential to retrieve the same results (details) as active range sensors, but in a cheaper, faster, more portable, and simpler manner. Fully automated approaches which retrieve nice-looking 3-D data without the fine details are not of practical use in the accurate daily documentation and digital conservation of heritage objects. We believe that site managers, archaeologists, restorators, conservators, and the entire heritage community need simple and cost-effective methods to be able to accurately record and document objects and sites. AUTHORS Fabio Remondino (fabior@ethz.ch) is a scientific researcher at the Institute of Geodesy and Photogrammetry (chair of Photogrammetry and Remote Sensing) of ETH Zurich, Switzerland, and the Centre for Scientific and Technological Research of the B. Kessler Foundation in Trento, Italy. He received his master s degree in environmental engineering at Polytechnic of Milan, Italy, and a Ph.D. in image-based modeling from ETH Zurich, Switzerland. He is the cochair of the International Society of Photogrammetry and Remote Sensing (ISPRS) working group on scene modeling and virtual reality. Sabry El-Hakim (sabry.el-hakim@nrc.cnrc.gc.ca) is a principal research officer at the Visual Information Technology Group in the Institute of Information Technology at the National Research Council (NRC) of Canada. He received his M.Sc. and Ph.D. degrees in photogrammetry from the University of New Brunswick, Canada, after which he joined the NRC. He is chair of the International Society of Photogrammetry and Remote Sensing (ISPRS) working group on scene modeling and virtual reality and a Fellow of SPIE. His research interests include imagebased modeling, multisensor data integration, and virtual reality. Armin Gruen (agruen@geod.baug.ethc.ch) is professor and head of the Chair of Photogrammetry and Remote Sensing at the Institute of Geodesy and Photogrammetry of ETH Zurich, Switzerland. He graduated in geodetic sciences and obtained his doctorate degree in 1974 in photogrammetry from TU Munich, Germany. From , he was an associate professor with the Department of Geodetic Science and Surveying, Ohio State University, Columbus. He has published more than 375 articles and papers and edited and coedited 21 books and conference proceedings. Li Zhang (zhangl@casm.ac.cn) received his master s degree in photogrammetry at the Wuhan University, P.R. China, and a Ph.D. in from ETH Zurich, Institute of Geodesy and Photogrammetry. He is now working at the Chinese Academy of Surveying and Mapping, Beijing, China. REFERENCES [1] R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision, 2nd ed. London: Cambridge Univ. Press, [2] M. Pollefeys, L. Van Gool, M. Vergauwen, F. Verbiest, K. Cornelis, J. Tops, and R. Koch, Visual modeling with a hand-held camera, Int. J. Comp. Vis., vol. 59, no. 3, pp , [3] C. Strecha, T. Tuytelaars, and L. Van Gool, Dense matching of multiple widebaseline views, in Proc. IEEE ICCV 03, 2003, vol. 2, pp [4] M. Goesele, N. Snavely, B. Curless, H. Hoppe, and S.M. Seitz, Multi-view stereo for community photo collections, in Proc. ICCV 2007, Rio de Janeiro, Brazil, [5] Y. Furukawa and J. Ponce, Accurate, dense and robust multi-view stereopsis, in Proc. IEEE CVPR07, Minneapolis, MN, [6] A. Gruen, Adaptive least square correlation: A powerful image matching technique, South African J. Photogrammetry, Remote Sens. Cart., vol. 14, no. 3, pp , [7] D.N. Bhat and S.K. Nayar, Ordinal measures for image correspondence, IEEE Trans. Pattern Anal. Mach. Intell. (PAMI), vol. 20, no. 4, pp , [8] D. Scharstein and R. Szeliski, A taxonomy and evaluation of dense two-frame stereo correspondence algorithms, Int. J. Comp. Vis., vol. 47, no. 1 3, pp. 7 42, [9] M.Z. Brown, D. Burschka, and G.D. Hager, Advance in computational stereo, IEEE Trans. Pattern Anal. Machine Intell. (PAMI), vol. 25, no. 8, pp , [10] S.M. Seitz, B. Curless, J. Diebel, D. Scharstein, and R. Szeliski, A comparison and evaluation of multi-view stereo reconstruction algorithm, in Proc. IEEE Int. Conf. Computer Vision Pattern Recognition (CVPR), pp , [11] A. Gruen and M. Baltsavias, Geometrically constrained multiphoto matching, Photogrammetric Eng. Remote Sens., vo. 54, no. 5, pp , [12] E.P. Baltsavias, Multi-photo geometrically constrained matching, Ph.D. thesis, Institute of Geodesy and Photogrammetry, ETH Zurich, Switzerland, [13] U.V. Helava, Object-space least-squares correlation, Photogrammetric Eng. Remote Sens., vol. 54, no. 6, pp , [14] C. Schmid, R. Mohr, and C. Bauckhage, Evaluation of interest point detectors, Int. J. Comp. Vis., vol. 37, no. 2, pp , [15] K. Mikolajczyk and C. Schmid, A performance evaluation of local descriptors, IEEE Trans. Pattern Anal. Machine Intell. (PAMI), vol. 27, no. 10, pp , [16] C. Schmid and A. Zisserman, The geometry and matching of lines and curves over multiple views, Int. J. Comp. Vis., vol. 40, no. 3, pp , [17] J.F. Canny, A computational approach to edge detection, IEEE Trans. Pattern Anal. Machine Intell. (PAMI), vol. 8, no. 6, pp , [18] K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas, F. Schaffalitzky, T. Kadir, and L. Van Gool, A comparison of affine region detectors, Int. J. Comp. Vis., vol. 65, no. 1 2, pp , [19] Y. Wang, Principles and applications of structural image matching, J. Photogrammetry Remote Sens., vol. 53, no. 3, pp , [20] Z. Zhang, J. Zhang, L. Mingsheng, and L. Zhang, Automatic registration of multisource imagery based on global image matching, Photogrammetric Eng. Remote Sens., vol. 66, no. 5, pp , [21] G. Zeng, S. Paris, L. Quan, and F. Sillion, Accurate and scalable surface representation and reconstruction from images, IEEE Trans. Pattern Anal. Machine Intell. (PAMI), vol. 29, no. 1, pp , [22] S.F. El-Hakim, Semi-automatic 3-D reconstruction of occluded and unmarked surfaces from widely separated views, Int. Arch. Photogrammetry Remote Sens., vol. 34, no. 5, pp , [23] L. Zhang and A. Gruen, Automatic DSM generation from linear array imagery data, Int. Arch. Photogrammetry Remote Sens., vol. 35, no. 3, pp , [24] L. Zhang, Automatic digital surface model (DSM) generation from linear array images, Ph.D. dissertation Nr. 90, Institute of Geodesy and Photogrammetry, ETH Zurich, Switzerland, [25] P. Saint-Marc, J.-S. Chen, and G. Medioni, Adaptive smoothing: A general tool for early vision, IEEE Trans. Pattern Anal. Machine Intell. (PAMI), vol. 13, no. 6, pp , [26] R. Wallis, An approach to the space variant restoration and enhancement of images, in Proc. Symp. Current Mathematical Problems in Image Science, Naval Postgraduate School, Monterey, CA, Nov. 1976, pp [27] Y. Lue, Interest operator and fast implementation, Int. Arch. Photogrammetry Remote Sens., vol. 27, no. 3, pp , [28] M. Okutomi and T. Kanade, A multiple-baseline stereo, IEEE Trans. Pattern Anal. Machine Intell. (PAMI), vol. 15, no. 4, pp , [29] H. Li, Semi-automatic road extraction from satellite and aerial images, Ph.D. thesis, Institute of Geodesy and Photogrammetry, ETH Zurich, Switzerland, [30] R. Zhang, P.-S. Tsai, J.E. Cryer, and M. Shah, Shape from shading: A survey, IEEE Trans. Pattern Anal. Machine Intell. (PAMI), vol. 21, no. 8, pp , [SP] IEEE SIGNAL PROCESSING MAGAZINE [64] JULY 2008

Detailed image-based 3D geometric reconstruction of heritage objects

Detailed image-based 3D geometric reconstruction of heritage objects Detailed image-based 3D geometric reconstruction of heritage objects FABIO REMONDINO 1 Abstract: The attention in digital documentation and preservation of heritages is always increasing and fast but reliable,

More information

Effective High Resolution 3D Geometric Reconstruction of Heritage and Archaeological Sites from Images

Effective High Resolution 3D Geometric Reconstruction of Heritage and Archaeological Sites from Images Effective High Resolution 3D Geometric Reconstruction of Heritage and Archaeological Sites from Images S.F. El-Hakim 1, Fabio Remondino 2, Lorenzo Gonzo 3, Francesca Voltolini 3 1 National Research Council

More information

Step-by-Step Model Buidling

Step-by-Step Model Buidling Step-by-Step Model Buidling Review Feature selection Feature selection Feature correspondence Camera Calibration Euclidean Reconstruction Landing Augmented Reality Vision Based Control Sparse Structure

More information

IMAGE-BASED 3D ACQUISITION TOOL FOR ARCHITECTURAL CONSERVATION

IMAGE-BASED 3D ACQUISITION TOOL FOR ARCHITECTURAL CONSERVATION IMAGE-BASED 3D ACQUISITION TOOL FOR ARCHITECTURAL CONSERVATION Joris Schouteden, Marc Pollefeys, Maarten Vergauwen, Luc Van Gool Center for Processing of Speech and Images, K.U.Leuven, Kasteelpark Arenberg

More information

Dense 3D Reconstruction. Christiano Gava

Dense 3D Reconstruction. Christiano Gava Dense 3D Reconstruction Christiano Gava christiano.gava@dfki.de Outline Previous lecture: structure and motion II Structure and motion loop Triangulation Today: dense 3D reconstruction The matching problem

More information

EVALUATION OF WORLDVIEW-1 STEREO SCENES AND RELATED 3D PRODUCTS

EVALUATION OF WORLDVIEW-1 STEREO SCENES AND RELATED 3D PRODUCTS EVALUATION OF WORLDVIEW-1 STEREO SCENES AND RELATED 3D PRODUCTS Daniela POLI, Kirsten WOLFF, Armin GRUEN Swiss Federal Institute of Technology Institute of Geodesy and Photogrammetry Wolfgang-Pauli-Strasse

More information

Dense 3D Reconstruction. Christiano Gava

Dense 3D Reconstruction. Christiano Gava Dense 3D Reconstruction Christiano Gava christiano.gava@dfki.de Outline Previous lecture: structure and motion II Structure and motion loop Triangulation Wide baseline matching (SIFT) Today: dense 3D reconstruction

More information

Multi-view stereo. Many slides adapted from S. Seitz

Multi-view stereo. Many slides adapted from S. Seitz Multi-view stereo Many slides adapted from S. Seitz Beyond two-view stereo The third eye can be used for verification Multiple-baseline stereo Pick a reference image, and slide the corresponding window

More information

SIMPLE ROOM SHAPE MODELING WITH SPARSE 3D POINT INFORMATION USING PHOTOGRAMMETRY AND APPLICATION SOFTWARE

SIMPLE ROOM SHAPE MODELING WITH SPARSE 3D POINT INFORMATION USING PHOTOGRAMMETRY AND APPLICATION SOFTWARE SIMPLE ROOM SHAPE MODELING WITH SPARSE 3D POINT INFORMATION USING PHOTOGRAMMETRY AND APPLICATION SOFTWARE S. Hirose R&D Center, TOPCON CORPORATION, 75-1, Hasunuma-cho, Itabashi-ku, Tokyo, Japan Commission

More information

Effective High Resolution 3D Geometric Reconstruction of Heritage and Archaeological Sites from Images

Effective High Resolution 3D Geometric Reconstruction of Heritage and Archaeological Sites from Images 3D Data Acquisition and Processing 43 Sabry F. El-Hakim Fabio Remondino Lorenzo Gonzo Francesca Voltolini Effective High Resolution 3D Geometric Reconstruction of Heritage and Archaeological Sites from

More information

SURFACE RECONSTRUCTION ALGORITHMS FOR DETAILED CLOSE-RANGE OBJECT MODELING

SURFACE RECONSTRUCTION ALGORITHMS FOR DETAILED CLOSE-RANGE OBJECT MODELING SURFACE RECONSTRUCTION ALGORITHMS FOR DETAILED CLOSE-RANGE OBJECT MODELING Fabio Remondino, Li Zhang Institute of Geodesy and Photogrammetry, ETH Zurich, Switzerland E-mail: fabio@geod.baug.ethz.ch Web:

More information

Segmentation and Tracking of Partial Planar Templates

Segmentation and Tracking of Partial Planar Templates Segmentation and Tracking of Partial Planar Templates Abdelsalam Masoud William Hoff Colorado School of Mines Colorado School of Mines Golden, CO 800 Golden, CO 800 amasoud@mines.edu whoff@mines.edu Abstract

More information

The raycloud A Vision Beyond the Point Cloud

The raycloud A Vision Beyond the Point Cloud The raycloud A Vision Beyond the Point Cloud Christoph STRECHA, Switzerland Key words: Photogrammetry, Aerial triangulation, Multi-view stereo, 3D vectorisation, Bundle Block Adjustment SUMMARY Measuring

More information

Accurate and Dense Wide-Baseline Stereo Matching Using SW-POC

Accurate and Dense Wide-Baseline Stereo Matching Using SW-POC Accurate and Dense Wide-Baseline Stereo Matching Using SW-POC Shuji Sakai, Koichi Ito, Takafumi Aoki Graduate School of Information Sciences, Tohoku University, Sendai, 980 8579, Japan Email: sakai@aoki.ecei.tohoku.ac.jp

More information

Stereo Vision. MAN-522 Computer Vision

Stereo Vision. MAN-522 Computer Vision Stereo Vision MAN-522 Computer Vision What is the goal of stereo vision? The recovery of the 3D structure of a scene using two or more images of the 3D scene, each acquired from a different viewpoint in

More information

EE795: Computer Vision and Intelligent Systems

EE795: Computer Vision and Intelligent Systems EE795: Computer Vision and Intelligent Systems Spring 2012 TTh 17:30-18:45 FDH 204 Lecture 14 130307 http://www.ee.unlv.edu/~b1morris/ecg795/ 2 Outline Review Stereo Dense Motion Estimation Translational

More information

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov Structured Light II Johannes Köhler Johannes.koehler@dfki.de Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov Introduction Previous lecture: Structured Light I Active Scanning Camera/emitter

More information

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University. 3D Computer Vision Structured Light II Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Introduction

More information

Multiple View Geometry

Multiple View Geometry Multiple View Geometry CS 6320, Spring 2013 Guest Lecture Marcel Prastawa adapted from Pollefeys, Shah, and Zisserman Single view computer vision Projective actions of cameras Camera callibration Photometric

More information

AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK

AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK Xiangyun HU, Zuxun ZHANG, Jianqing ZHANG Wuhan Technique University of Surveying and Mapping,

More information

DSM GENERATION FROM EARLY ALOS/PRISM DATA USING SAT-PP

DSM GENERATION FROM EARLY ALOS/PRISM DATA USING SAT-PP DSM GENERATION FROM EARLY ALOS/PRISM DATA USING SAT-PP K. Wolff, A. Gruen Institute of Geodesy and Photogrammetry, ETH-Zurich, CH-8093 Zurich, Switzerland @geod.baug.ethz.ch KEY WORDS: PRISM

More information

Accurate 3D Face and Body Modeling from a Single Fixed Kinect

Accurate 3D Face and Body Modeling from a Single Fixed Kinect Accurate 3D Face and Body Modeling from a Single Fixed Kinect Ruizhe Wang*, Matthias Hernandez*, Jongmoo Choi, Gérard Medioni Computer Vision Lab, IRIS University of Southern California Abstract In this

More information

UP TO DATE DSM GENERATION USING HIGH RESOLUTION SATELLITE IMAGE DATA

UP TO DATE DSM GENERATION USING HIGH RESOLUTION SATELLITE IMAGE DATA UP TO DATE DSM GENERATION USING HIGH RESOLUTION SATELLITE IMAGE DATA K. Wolff*, A. Gruen Institute of Geodesy and Photogrammetry, ETH Zurich Wolfgang-Pauli-Str. 15, CH-8093 Zurich, Switzerland (wolff,

More information

Lecture 10: Multi view geometry

Lecture 10: Multi view geometry Lecture 10: Multi view geometry Professor Fei Fei Li Stanford Vision Lab 1 What we will learn today? Stereo vision Correspondence problem (Problem Set 2 (Q3)) Active stereo vision systems Structure from

More information

Using temporal seeding to constrain the disparity search range in stereo matching

Using temporal seeding to constrain the disparity search range in stereo matching Using temporal seeding to constrain the disparity search range in stereo matching Thulani Ndhlovu Mobile Intelligent Autonomous Systems CSIR South Africa Email: tndhlovu@csir.co.za Fred Nicolls Department

More information

Experiments with Edge Detection using One-dimensional Surface Fitting

Experiments with Edge Detection using One-dimensional Surface Fitting Experiments with Edge Detection using One-dimensional Surface Fitting Gabor Terei, Jorge Luis Nunes e Silva Brito The Ohio State University, Department of Geodetic Science and Surveying 1958 Neil Avenue,

More information

Computer Vision Lecture 17

Computer Vision Lecture 17 Computer Vision Lecture 17 Epipolar Geometry & Stereo Basics 13.01.2015 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Announcements Seminar in the summer semester

More information

Computer Vision Lecture 17

Computer Vision Lecture 17 Announcements Computer Vision Lecture 17 Epipolar Geometry & Stereo Basics Seminar in the summer semester Current Topics in Computer Vision and Machine Learning Block seminar, presentations in 1 st week

More information

Outdoor Scene Reconstruction from Multiple Image Sequences Captured by a Hand-held Video Camera

Outdoor Scene Reconstruction from Multiple Image Sequences Captured by a Hand-held Video Camera Outdoor Scene Reconstruction from Multiple Image Sequences Captured by a Hand-held Video Camera Tomokazu Sato, Masayuki Kanbara and Naokazu Yokoya Graduate School of Information Science, Nara Institute

More information

SOME stereo image-matching methods require a user-selected

SOME stereo image-matching methods require a user-selected IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, VOL. 3, NO. 2, APRIL 2006 207 Seed Point Selection Method for Triangle Constrained Image Matching Propagation Qing Zhu, Bo Wu, and Zhi-Xiang Xu Abstract In order

More information

STATE-OF-THE-ART in DENSE IMAGE MATCHING

STATE-OF-THE-ART in DENSE IMAGE MATCHING STATE-OF-THE-ART in DENSE IMAGE MATCHING Fabio REMONDINO 3D Optical Metrology (3DOM) Bruno Kessler Foundation (FBK) Trento, Italy Email: remondino@fbk.eu http://3dom.fbk.eu Bruno Kessler Foundation (FBK)

More information

3D MODELING OF CLOSE-RANGE OBJECTS: PHOTOGRAMMETRY OR LASER SCANNING?

3D MODELING OF CLOSE-RANGE OBJECTS: PHOTOGRAMMETRY OR LASER SCANNING? 3D MODELING OF CLOSE-RANGE OBJECTS: PHOTOGRAMMETRY OR LASER SCANNING? F. Remondino 1 A. Guarnieri 2 A. Vettore 2 1 Institute of Geodesy and Photogrammetry ETH Hönggerberg - Zurich, Switzerland e-mail:

More information

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman Stereo 11/02/2012 CS129, Brown James Hays Slides by Kristen Grauman Multiple views Multi-view geometry, matching, invariant features, stereo vision Lowe Hartley and Zisserman Why multiple views? Structure

More information

DETECTORS AND DESCRIPTORS FOR PHOTOGRAMMETRIC APPLICATIONS

DETECTORS AND DESCRIPTORS FOR PHOTOGRAMMETRIC APPLICATIONS DETECTORS AND DESCRIPTORS FOR PHOTOGRAMMETRIC APPLICATIONS KEY WORDS: Features Detection, Orientation, Precision Fabio Remondino Institute for Geodesy and Photogrammetry, ETH Zurich, Switzerland E-mail:

More information

A Statistical Consistency Check for the Space Carving Algorithm.

A Statistical Consistency Check for the Space Carving Algorithm. A Statistical Consistency Check for the Space Carving Algorithm. A. Broadhurst and R. Cipolla Dept. of Engineering, Univ. of Cambridge, Cambridge, CB2 1PZ aeb29 cipolla @eng.cam.ac.uk Abstract This paper

More information

Centre for Digital Image Measurement and Analysis, School of Engineering, City University, Northampton Square, London, ECIV OHB

Centre for Digital Image Measurement and Analysis, School of Engineering, City University, Northampton Square, London, ECIV OHB HIGH ACCURACY 3-D MEASUREMENT USING MULTIPLE CAMERA VIEWS T.A. Clarke, T.J. Ellis, & S. Robson. High accuracy measurement of industrially produced objects is becoming increasingly important. The techniques

More information

Chapter 3 Image Registration. Chapter 3 Image Registration

Chapter 3 Image Registration. Chapter 3 Image Registration Chapter 3 Image Registration Distributed Algorithms for Introduction (1) Definition: Image Registration Input: 2 images of the same scene but taken from different perspectives Goal: Identify transformation

More information

Stereo and Epipolar geometry

Stereo and Epipolar geometry Previously Image Primitives (feature points, lines, contours) Today: Stereo and Epipolar geometry How to match primitives between two (multiple) views) Goals: 3D reconstruction, recognition Jana Kosecka

More information

Viewpoint Invariant Features from Single Images Using 3D Geometry

Viewpoint Invariant Features from Single Images Using 3D Geometry Viewpoint Invariant Features from Single Images Using 3D Geometry Yanpeng Cao and John McDonald Department of Computer Science National University of Ireland, Maynooth, Ireland {y.cao,johnmcd}@cs.nuim.ie

More information

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching Stereo Matching Fundamental matrix Let p be a point in left image, p in right image l l Epipolar relation p maps to epipolar line l p maps to epipolar line l p p Epipolar mapping described by a 3x3 matrix

More information

EECS 442 Computer vision. Stereo systems. Stereo vision Rectification Correspondence problem Active stereo vision systems

EECS 442 Computer vision. Stereo systems. Stereo vision Rectification Correspondence problem Active stereo vision systems EECS 442 Computer vision Stereo systems Stereo vision Rectification Correspondence problem Active stereo vision systems Reading: [HZ] Chapter: 11 [FP] Chapter: 11 Stereo vision P p p O 1 O 2 Goal: estimate

More information

Announcements. Stereo Vision Wrapup & Intro Recognition

Announcements. Stereo Vision Wrapup & Intro Recognition Announcements Stereo Vision Wrapup & Intro Introduction to Computer Vision CSE 152 Lecture 17 HW3 due date postpone to Thursday HW4 to posted by Thursday, due next Friday. Order of material we ll first

More information

Structured light 3D reconstruction

Structured light 3D reconstruction Structured light 3D reconstruction Reconstruction pipeline and industrial applications rodola@dsi.unive.it 11/05/2010 3D Reconstruction 3D reconstruction is the process of capturing the shape and appearance

More information

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching Stereo Matching Fundamental matrix Let p be a point in left image, p in right image l l Epipolar relation p maps to epipolar line l p maps to epipolar line l p p Epipolar mapping described by a 3x3 matrix

More information

AN AUTOMATIC 3D RECONSTRUCTION METHOD BASED ON MULTI-VIEW STEREO VISION FOR THE MOGAO GROTTOES

AN AUTOMATIC 3D RECONSTRUCTION METHOD BASED ON MULTI-VIEW STEREO VISION FOR THE MOGAO GROTTOES The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XL-4/W5, 05 Indoor-Outdoor Seamless Modelling, Mapping and avigation, May 05, Tokyo, Japan A AUTOMATIC

More information

Motion Estimation and Optical Flow Tracking

Motion Estimation and Optical Flow Tracking Image Matching Image Retrieval Object Recognition Motion Estimation and Optical Flow Tracking Example: Mosiacing (Panorama) M. Brown and D. G. Lowe. Recognising Panoramas. ICCV 2003 Example 3D Reconstruction

More information

Lecture 10: Multi-view geometry

Lecture 10: Multi-view geometry Lecture 10: Multi-view geometry Professor Stanford Vision Lab 1 What we will learn today? Review for stereo vision Correspondence problem (Problem Set 2 (Q3)) Active stereo vision systems Structure from

More information

Model-based segmentation and recognition from range data

Model-based segmentation and recognition from range data Model-based segmentation and recognition from range data Jan Boehm Institute for Photogrammetry Universität Stuttgart Germany Keywords: range image, segmentation, object recognition, CAD ABSTRACT This

More information

A Desktop 3D Scanner Exploiting Rotation and Visual Rectification of Laser Profiles

A Desktop 3D Scanner Exploiting Rotation and Visual Rectification of Laser Profiles A Desktop 3D Scanner Exploiting Rotation and Visual Rectification of Laser Profiles Carlo Colombo, Dario Comanducci, and Alberto Del Bimbo Dipartimento di Sistemi ed Informatica Via S. Marta 3, I-5139

More information

CSE 252B: Computer Vision II

CSE 252B: Computer Vision II CSE 252B: Computer Vision II Lecturer: Serge Belongie Scribes: Jeremy Pollock and Neil Alldrin LECTURE 14 Robust Feature Matching 14.1. Introduction Last lecture we learned how to find interest points

More information

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision What Happened Last Time? Human 3D perception (3D cinema) Computational stereo Intuitive explanation of what is meant by disparity Stereo matching

More information

BIL Computer Vision Apr 16, 2014

BIL Computer Vision Apr 16, 2014 BIL 719 - Computer Vision Apr 16, 2014 Binocular Stereo (cont d.), Structure from Motion Aykut Erdem Dept. of Computer Engineering Hacettepe University Slide credit: S. Lazebnik Basic stereo matching algorithm

More information

EVALUATION OF SEQUENTIAL IMAGES FOR PHOTOGRAMMETRICALLY POINT DETERMINATION

EVALUATION OF SEQUENTIAL IMAGES FOR PHOTOGRAMMETRICALLY POINT DETERMINATION Archives of Photogrammetry, Cartography and Remote Sensing, Vol. 22, 2011, pp. 285-296 ISSN 2083-2214 EVALUATION OF SEQUENTIAL IMAGES FOR PHOTOGRAMMETRICALLY POINT DETERMINATION Michał Kowalczyk 1 1 Department

More information

3D Computer Vision. Dense 3D Reconstruction II. Prof. Didier Stricker. Christiano Gava

3D Computer Vision. Dense 3D Reconstruction II. Prof. Didier Stricker. Christiano Gava 3D Computer Vision Dense 3D Reconstruction II Prof. Didier Stricker Christiano Gava Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de

More information

An Evaluation of Volumetric Interest Points

An Evaluation of Volumetric Interest Points An Evaluation of Volumetric Interest Points Tsz-Ho YU Oliver WOODFORD Roberto CIPOLLA Machine Intelligence Lab Department of Engineering, University of Cambridge About this project We conducted the first

More information

Chaplin, Modern Times, 1936

Chaplin, Modern Times, 1936 Chaplin, Modern Times, 1936 [A Bucket of Water and a Glass Matte: Special Effects in Modern Times; bonus feature on The Criterion Collection set] Multi-view geometry problems Structure: Given projections

More information

On-line and Off-line 3D Reconstruction for Crisis Management Applications

On-line and Off-line 3D Reconstruction for Crisis Management Applications On-line and Off-line 3D Reconstruction for Crisis Management Applications Geert De Cubber Royal Military Academy, Department of Mechanical Engineering (MSTA) Av. de la Renaissance 30, 1000 Brussels geert.de.cubber@rma.ac.be

More information

BUILDING MODEL RECONSTRUCTION FROM DATA INTEGRATION INTRODUCTION

BUILDING MODEL RECONSTRUCTION FROM DATA INTEGRATION INTRODUCTION BUILDING MODEL RECONSTRUCTION FROM DATA INTEGRATION Ruijin Ma Department Of Civil Engineering Technology SUNY-Alfred Alfred, NY 14802 mar@alfredstate.edu ABSTRACT Building model reconstruction has been

More information

Research on an Adaptive Terrain Reconstruction of Sequence Images in Deep Space Exploration

Research on an Adaptive Terrain Reconstruction of Sequence Images in Deep Space Exploration , pp.33-41 http://dx.doi.org/10.14257/astl.2014.52.07 Research on an Adaptive Terrain Reconstruction of Sequence Images in Deep Space Exploration Wang Wei, Zhao Wenbin, Zhao Zhengxu School of Information

More information

Image matching, point transfer, DSM generation

Image matching, point transfer, DSM generation Image matching, point transfer, DSM generation Dr. Maria Pateraki Department of Rural and Surveying Engineering Aristotle University of Thessaloniki tel:30 2310 996407, email: mariapat@topo.auth.gr, URL:

More information

INTEREST OPERATORS IN CLOSE-RANGE OBJECT RECONSTRUCTION

INTEREST OPERATORS IN CLOSE-RANGE OBJECT RECONSTRUCTION INTEREST OPERATORS IN CLOSE-RANGE OBJECT RECONSTRUCTION I. Jazayeri, C.S. Fraser Department of Geomatics, The University of Melbourne, Melbourne, VIC 31 Australia i.jazayeri@pgrad.unimelb.edu.au &- c.fraser@unimelb.edu.au

More information

A Survey of Light Source Detection Methods

A Survey of Light Source Detection Methods A Survey of Light Source Detection Methods Nathan Funk University of Alberta Mini-Project for CMPUT 603 November 30, 2003 Abstract This paper provides an overview of the most prominent techniques for light

More information

Building Roof Contours Extraction from Aerial Imagery Based On Snakes and Dynamic Programming

Building Roof Contours Extraction from Aerial Imagery Based On Snakes and Dynamic Programming Building Roof Contours Extraction from Aerial Imagery Based On Snakes and Dynamic Programming Antonio Juliano FAZAN and Aluir Porfírio Dal POZ, Brazil Keywords: Snakes, Dynamic Programming, Building Extraction,

More information

Recap from Previous Lecture

Recap from Previous Lecture Recap from Previous Lecture Tone Mapping Preserve local contrast or detail at the expense of large scale contrast. Changing the brightness within objects or surfaces unequally leads to halos. We are now

More information

AUTOMATIC DSM GENERATION FROM LINEAR ARRAY IMAGERY DATA

AUTOMATIC DSM GENERATION FROM LINEAR ARRAY IMAGERY DATA AUTOMATIC DSM GENERATION FROM LINEAR ARRAY IMAGERY DATA Zhang Li, Armin Gruen Institute of Geodesy and Photogrammetry, Swiss Federal Institute of Technology Zurich ETH Hoenggerberg; CH-8093 Zurich, Switzerland

More information

Multi-View Stereo for Static and Dynamic Scenes

Multi-View Stereo for Static and Dynamic Scenes Multi-View Stereo for Static and Dynamic Scenes Wolfgang Burgard Jan 6, 2010 Main references Yasutaka Furukawa and Jean Ponce, Accurate, Dense and Robust Multi-View Stereopsis, 2007 C.L. Zitnick, S.B.

More information

Light source estimation using feature points from specular highlights and cast shadows

Light source estimation using feature points from specular highlights and cast shadows Vol. 11(13), pp. 168-177, 16 July, 2016 DOI: 10.5897/IJPS2015.4274 Article Number: F492B6D59616 ISSN 1992-1950 Copyright 2016 Author(s) retain the copyright of this article http://www.academicjournals.org/ijps

More information

What have we leaned so far?

What have we leaned so far? What have we leaned so far? Camera structure Eye structure Project 1: High Dynamic Range Imaging What have we learned so far? Image Filtering Image Warping Camera Projection Model Project 2: Panoramic

More information

3D Modeling of Objects Using Laser Scanning

3D Modeling of Objects Using Laser Scanning 1 3D Modeling of Objects Using Laser Scanning D. Jaya Deepu, LPU University, Punjab, India Email: Jaideepudadi@gmail.com Abstract: In the last few decades, constructing accurate three-dimensional models

More information

EVOLUTION OF POINT CLOUD

EVOLUTION OF POINT CLOUD Figure 1: Left and right images of a stereo pair and the disparity map (right) showing the differences of each pixel in the right and left image. (source: https://stackoverflow.com/questions/17607312/difference-between-disparity-map-and-disparity-image-in-stereo-matching)

More information

Feature Based Registration - Image Alignment

Feature Based Registration - Image Alignment Feature Based Registration - Image Alignment Image Registration Image registration is the process of estimating an optimal transformation between two or more images. Many slides from Alexei Efros http://graphics.cs.cmu.edu/courses/15-463/2007_fall/463.html

More information

A COMPREHENSIVE TOOL FOR RECOVERING 3D MODELS FROM 2D PHOTOS WITH WIDE BASELINES

A COMPREHENSIVE TOOL FOR RECOVERING 3D MODELS FROM 2D PHOTOS WITH WIDE BASELINES A COMPREHENSIVE TOOL FOR RECOVERING 3D MODELS FROM 2D PHOTOS WITH WIDE BASELINES Yuzhu Lu Shana Smith Virtual Reality Applications Center, Human Computer Interaction Program, Iowa State University, Ames,

More information

Image processing and features

Image processing and features Image processing and features Gabriele Bleser gabriele.bleser@dfki.de Thanks to Harald Wuest, Folker Wientapper and Marc Pollefeys Introduction Previous lectures: geometry Pose estimation Epipolar geometry

More information

Passive 3D Photography

Passive 3D Photography SIGGRAPH 2000 Course on 3D Photography Passive 3D Photography Steve Seitz Carnegie Mellon University University of Washington http://www.cs cs.cmu.edu/~ /~seitz Visual Cues Shading Merle Norman Cosmetics,

More information

IMAGE MATCHING TOWARDS MATURITY

IMAGE MATCHING TOWARDS MATURITY IMAGE MATCHING TOWARDS MATURITY Dimitris P. SKARLATOS Department of Surveying, National Technical University, GR-1578 Athens, Greece dskarlat@survey.ntua.gr Working Group III/2 KEY WORDS: Image Matching,

More information

3D Photography: Active Ranging, Structured Light, ICP

3D Photography: Active Ranging, Structured Light, ICP 3D Photography: Active Ranging, Structured Light, ICP Kalin Kolev, Marc Pollefeys Spring 2013 http://cvg.ethz.ch/teaching/2013spring/3dphoto/ Schedule (tentative) Feb 18 Feb 25 Mar 4 Mar 11 Mar 18 Mar

More information

Robust extraction of image correspondences exploiting the image scene geometry and approximate camera orientation

Robust extraction of image correspondences exploiting the image scene geometry and approximate camera orientation Robust extraction of image correspondences exploiting the image scene geometry and approximate camera orientation Bashar Alsadik a,c, Fabio Remondino b, Fabio Menna b, Markus Gerke a, George Vosselman

More information

arxiv: v1 [cs.cv] 28 Sep 2018

arxiv: v1 [cs.cv] 28 Sep 2018 Camera Pose Estimation from Sequence of Calibrated Images arxiv:1809.11066v1 [cs.cv] 28 Sep 2018 Jacek Komorowski 1 and Przemyslaw Rokita 2 1 Maria Curie-Sklodowska University, Institute of Computer Science,

More information

COMPARATIVE STUDY OF DIFFERENT APPROACHES FOR EFFICIENT RECTIFICATION UNDER GENERAL MOTION

COMPARATIVE STUDY OF DIFFERENT APPROACHES FOR EFFICIENT RECTIFICATION UNDER GENERAL MOTION COMPARATIVE STUDY OF DIFFERENT APPROACHES FOR EFFICIENT RECTIFICATION UNDER GENERAL MOTION Mr.V.SRINIVASA RAO 1 Prof.A.SATYA KALYAN 2 DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING PRASAD V POTLURI SIDDHARTHA

More information

Fast Image Registration via Joint Gradient Maximization: Application to Multi-Modal Data

Fast Image Registration via Joint Gradient Maximization: Application to Multi-Modal Data MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Fast Image Registration via Joint Gradient Maximization: Application to Multi-Modal Data Xue Mei, Fatih Porikli TR-19 September Abstract We

More information

Flexible Calibration of a Portable Structured Light System through Surface Plane

Flexible Calibration of a Portable Structured Light System through Surface Plane Vol. 34, No. 11 ACTA AUTOMATICA SINICA November, 2008 Flexible Calibration of a Portable Structured Light System through Surface Plane GAO Wei 1 WANG Liang 1 HU Zhan-Yi 1 Abstract For a portable structured

More information

FAST REGISTRATION OF TERRESTRIAL LIDAR POINT CLOUD AND SEQUENCE IMAGES

FAST REGISTRATION OF TERRESTRIAL LIDAR POINT CLOUD AND SEQUENCE IMAGES FAST REGISTRATION OF TERRESTRIAL LIDAR POINT CLOUD AND SEQUENCE IMAGES Jie Shao a, Wuming Zhang a, Yaqiao Zhu b, Aojie Shen a a State Key Laboratory of Remote Sensing Science, Institute of Remote Sensing

More information

SEMI-AUTOMATIC CITY MODEL EXTRACTION FROM TRI-STEREOSCOPIC VHR SATELLITE IMAGERY

SEMI-AUTOMATIC CITY MODEL EXTRACTION FROM TRI-STEREOSCOPIC VHR SATELLITE IMAGERY SEMI-AUTOMATIC CITY MODEL EXTRACTION FROM TRI-STEREOSCOPIC VHR SATELLITE IMAGERY F. Tack a,, R. Goossens a, G. Buyuksalih b a Dept. of Geography, Ghent University, Krijgslaan 281, 9000 Ghent, Belgium (f.tack,

More information

RELIABLE IMAGE MATCHING WITH RECURSIVE TILING

RELIABLE IMAGE MATCHING WITH RECURSIVE TILING RELIABLE IMAGE MATCHING WITH RECURSIVE TILING D. Novák, E. Baltsavias, K. Schindler Institute of Geodesy and Photogrammetry, ETH Zürich, 8093 Zürich, Switzerland (david.novak, manos, konrad.schindler)@geod.baug.ethz.ch

More information

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10 Structure from Motion CSE 152 Lecture 10 Announcements Homework 3 is due May 9, 11:59 PM Reading: Chapter 8: Structure from Motion Optional: Multiple View Geometry in Computer Vision, 2nd edition, Hartley

More information

ENGN2911I: 3D Photography and Geometry Processing Assignment 1: 3D Photography using Planar Shadows

ENGN2911I: 3D Photography and Geometry Processing Assignment 1: 3D Photography using Planar Shadows ENGN2911I: 3D Photography and Geometry Processing Assignment 1: 3D Photography using Planar Shadows Instructor: Gabriel Taubin Assignment written by: Douglas Lanman 29 January 2009 Figure 1: 3D Photography

More information

Tecnologie per la ricostruzione di modelli 3D da immagini. Marco Callieri ISTI-CNR, Pisa, Italy

Tecnologie per la ricostruzione di modelli 3D da immagini. Marco Callieri ISTI-CNR, Pisa, Italy Tecnologie per la ricostruzione di modelli 3D da immagini Marco Callieri ISTI-CNR, Pisa, Italy Who am I? Marco Callieri PhD in computer science Always had the like for 3D graphics... Researcher at the

More information

CS 4758: Automated Semantic Mapping of Environment

CS 4758: Automated Semantic Mapping of Environment CS 4758: Automated Semantic Mapping of Environment Dongsu Lee, ECE, M.Eng., dl624@cornell.edu Aperahama Parangi, CS, 2013, alp75@cornell.edu Abstract The purpose of this project is to program an Erratic

More information

BUILDING POINT GROUPING USING VIEW-GEOMETRY RELATIONS INTRODUCTION

BUILDING POINT GROUPING USING VIEW-GEOMETRY RELATIONS INTRODUCTION BUILDING POINT GROUPING USING VIEW-GEOMETRY RELATIONS I-Chieh Lee 1, Shaojun He 1, Po-Lun Lai 2, Alper Yilmaz 2 1 Mapping and GIS Laboratory 2 Photogrammetric Computer Vision Laboratory Dept. of Civil

More information

Stereo Matching.

Stereo Matching. Stereo Matching Stereo Vision [1] Reduction of Searching by Epipolar Constraint [1] Photometric Constraint [1] Same world point has same intensity in both images. True for Lambertian surfaces A Lambertian

More information

International Journal for Research in Applied Science & Engineering Technology (IJRASET) A Review: 3D Image Reconstruction From Multiple Images

International Journal for Research in Applied Science & Engineering Technology (IJRASET) A Review: 3D Image Reconstruction From Multiple Images A Review: 3D Image Reconstruction From Multiple Images Rahul Dangwal 1, Dr. Sukhwinder Singh 2 1 (ME Student) Department of E.C.E PEC University of TechnologyChandigarh, India-160012 2 (Supervisor)Department

More information

AR Cultural Heritage Reconstruction Based on Feature Landmark Database Constructed by Using Omnidirectional Range Sensor

AR Cultural Heritage Reconstruction Based on Feature Landmark Database Constructed by Using Omnidirectional Range Sensor AR Cultural Heritage Reconstruction Based on Feature Landmark Database Constructed by Using Omnidirectional Range Sensor Takafumi Taketomi, Tomokazu Sato, and Naokazu Yokoya Graduate School of Information

More information

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by  ) Readings Szeliski, Chapter 10 (through 10.5) Announcements Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by email) One-page writeup (from project web page), specifying:» Your team members» Project goals. Be specific.

More information

Stereo II CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

Stereo II CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz Stereo II CSE 576 Ali Farhadi Several slides from Larry Zitnick and Steve Seitz Camera parameters A camera is described by several parameters Translation T of the optical center from the origin of world

More information

Combining Appearance and Topology for Wide

Combining Appearance and Topology for Wide Combining Appearance and Topology for Wide Baseline Matching Dennis Tell and Stefan Carlsson Presented by: Josh Wills Image Point Correspondences Critical foundation for many vision applications 3-D reconstruction,

More information

MULTIVIEW REPRESENTATION OF 3D OBJECTS OF A SCENE USING VIDEO SEQUENCES

MULTIVIEW REPRESENTATION OF 3D OBJECTS OF A SCENE USING VIDEO SEQUENCES MULTIVIEW REPRESENTATION OF 3D OBJECTS OF A SCENE USING VIDEO SEQUENCES Mehran Yazdi and André Zaccarin CVSL, Dept. of Electrical and Computer Engineering, Laval University Ste-Foy, Québec GK 7P4, Canada

More information

3D Photography: Stereo

3D Photography: Stereo 3D Photography: Stereo Marc Pollefeys, Torsten Sattler Spring 2016 http://www.cvg.ethz.ch/teaching/3dvision/ 3D Modeling with Depth Sensors Today s class Obtaining depth maps / range images unstructured

More information

Multi-stable Perception. Necker Cube

Multi-stable Perception. Necker Cube Multi-stable Perception Necker Cube Spinning dancer illusion, Nobuyuki Kayahara Multiple view geometry Stereo vision Epipolar geometry Lowe Hartley and Zisserman Depth map extraction Essential matrix

More information

III. VERVIEW OF THE METHODS

III. VERVIEW OF THE METHODS An Analytical Study of SIFT and SURF in Image Registration Vivek Kumar Gupta, Kanchan Cecil Department of Electronics & Telecommunication, Jabalpur engineering college, Jabalpur, India comparing the distance

More information

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision Stereo Thurs Mar 30 Kristen Grauman UT Austin Outline Last time: Human stereopsis Epipolar geometry and the epipolar constraint Case example with parallel optical axes General case with calibrated cameras

More information