When 3D Reconstruction Meets Ubiquitous RGB-D Images

Size: px
Start display at page:

Download "When 3D Reconstruction Meets Ubiquitous RGB-D Images"

Transcription

1 When 3D Reconruction Meets Ubiquitous RGB-D Images Quanshi Zhang, Xuan Song, Xiaowei Shao, Huijing Zhao, and Ryosuke Shibasaki University of Tokyo, Peking University Abract 3D reconruction from a single image is a classical roblem in comuter vision. However, it ill oses great challenges for the reconruction of daily-use objects with irregular 1 shaes. In this aer, we roose to learn 3D reconruction knowledge from informally catured 2 RGB-D images, which will robably be ubiquitously used in daily life. The learning of 3D reconruction is defined as a category modeling roblem, in which a model for each category is trained to encode category-secific knowledge for 3D reconruction. The category model eimates the ixellevel 3D ructure of an object from its 2D aearance, by taking into account considerable variations in rotation, 3D ructure, and texture. Learning 3D reconruction from u- biquitous RGB-D images creates a new set of challenges. Exerimental results have demonrated the effectiveness of the roosed aroach. 1. Introduction 3D reconruction is a classical area in the field of comuter vision, but 3D reconruction from a single image ill has great challenges. In this aer, we re-consider the roblem of single-view 3D reconruction in terms of two CV areas: category modeling and knowledge mining from big visual data (see Fig. 1). Category modeling & task outut: For the bottleneck in single-view 3D reconruction, i.e. the reconruction of objects with irregular 1 ructures, we have to return to the concet of category modeling. Therefore, the objective is to train a model to detect objects in the target category in large images, while simultaneously rojecting their 2D shaes into the 3D sace at the ixel level. The category model encodes the knowledge of how intra-category ructure deformation and object rotations affect 2D object shaes. Mining from big visual data & task inut: Another bottleneck lies in efficiently learning the category-secific 1 The word irregular is used to indicate that our aroach focuses on general object categories without setting rong assumtions for object shaes. In contra, Cheeger-set-based methods focus on ball-like surfaces, and ersective-based methods require vanishing oints. Informally catured RGB-D images Category detector A RGB image Collected RGB-D object samles Category detector Category model 2D shae 3D ructure Figure 1. Flowchart for training (cyan arrows) and teing (green arrows) rocesses. Single-view 3D reconruction is hamered by objects with irregular 1 shaes. Therefore, for objects in each category, we aim to learn a secific category model for 3D reconruction. We roose to directly train the category model from informally catured RGB-D images (where target objects are NOT aligned) to save human labeling, thereby ensuring high efficiency in model learning. We fir mine the category detector from informally catured RGB-D images to collect RGB-D objects, and then use these object samles to train the category model. For teing, the category detector and category model are used in sequence to localize the target object and eimate its 3D ructure. knowledge of 3D reconruction for a huge number of categories. Ideally, we would need to train a model for each object category in daily use, so as to conruct a knowledge base to rovide a 3D reconruction service for arbitrary RGB images. Therefore, we hoe to learn from big visual data 2 to avoid the labor of manually rearing training samles (e.g. well built 3D models), and thus ensure a high learning efficiency. In this aer, we train category models directly from informally catured and unaligned RGB-D images. We use the hrase informally catured to describe loose requirements for ubiquitous 2 images. They are tyical of what can be directly collected by search engines (Figures 1 and 2). The informally catured images are not manually aligned, and they consi of small objects that are randomly ositioned. In articular, these daily-use objects are 2 Actually, learning from big visual data is ill in the early ages of develoment, and its scoe is quite extensive. It usually involves two asects. This fir is learning from a large amount of web data, such as the widely used dee learning [17]. The second is learning under challenging conditions of informally catured images that contain small objects and are ubiquitous in everyday life, as is the case in our method. [26] also rovides detailed descrition of the second asect. 1

2 Duan Srayer Inut images for training Detailed view of srayer samles Detailed view of duan samles Figure 2. Overview of challenges. Automatic category modeling from cluttered scenes Robuness to intracategory variations Eimate deformation & remove object-rotation effects in 3D reconruct Learn with deth measurement noises usually designed with texture variations, various rotations, and some ructure deformation for commercial uroses. Technically seaking, these images can be loosely regarded as a kind of big visual data 2. Challenge analysis: Ju like the training, the RGB images for teing 3D reconruction are also such kind of informally catured images in this research. The use of informally catured images raises a number of challenges, as shown in Fig. 2. For alication uroses, the category model should have the ability to detect small target objects with various textures, rotations, and scales in large RGB images before 3D reconruction. Similarly, the learning rocess should also be able to handle these challenges to collect RGB-D objects in cluttered RGB-D images. Moreover, this detection has to be accurate to the art level, since 3D reconruction uses the arts 2D ositions to eimate the ose and 3D ructure of an object. Then, we analyze the challenges of learning 3D reconruction knowledge from the collected RGB-D objects 3. We need to simultaneously consider the following three factors that affect the corresondence between the 2D and 3D ructure of the object. 1) Rotation changes. This is the rimary factor. 2) Deformation of 3D object ructures. 3) Deth errors near object edges in training samles (measured by the Kinect) Proosed method: As shown in Fig. 1, the training of category detectors is the fir e. The category detectors collect 3 RGB-D/RGB object samles from the informally catured RGB-D/RGB images, which serves as the basis for further training/teing of 3D reconruction. We use unsuervised category modeling based on grah matching [28] to train the category detector (secified in [26]). This algorithm automatically discovers a set of relatively diinguishing arts (namely key arts) of objects for each category. 3 In our udy, these objects are collected from RGB images (or RGB dimensions of the RGB-D images) using the trained category detectors. The 3D object ructure is rojected into the 2D shae afterwards. Thus, the category detector achieves a art-level detection. We then train the category model to use the arts 2D ositions to determine the object s ose and 3D ructure. Unlike conventional methods that use well built 3D models to rovide multi-view object aearances and rior regularity of 3D ructure deformation, our aroach needs to learn this knowledge in an unsuervised manner. In other words, we need to train the category model to simultaneously remove the effects of viewoint changes, and eimate the 3D ructure deformation using the 2D shae of an object. The category model consis of a number of eimators, each corresonding to a secific osition inside the 2D shae. These eimators use the satial relationshi between key object arts to eimate the corresonding 3D coordinates of their ositions. We connect neighboring eimators to form a continuous Markov random field (MRF) and thus train these eimators. The contributions of this aer can be summarized as follows. We regard 3D reconruction from a single image as a category modeling roblem to overcome the difficulties in the reconruction of irregular 1 shaes. To ensure a high learning efficiency and a wide alication, this is the fir attemt to learn category models from informally catured RGB-D images, and then aly the category model back to the informally catured RGB images for 3D reconruction. We exlore a new set of challenges raised by our choice of training and teing data, and rovide a solution. 2. Related work 3D reconruction is a large area in the field of comuter vision. However, in this aer, we limit our discussion to 3D reconruction from a single image. Many methods for single-view 3D reconruction have rong assumtions for the image environment. For examle, shaes from shading methods [23, 4, 3, 14] had secial requirements for lighting conditions and textures. A large number of udies use the ersective rincile for 3D reconruction. They were tyically based on the vanishing oints in images, and therefore assumed that these images contained enough cues to extract vanishing oints [24, 15]. Such methods were mainly alied to large-scale indoor and urban environments. In addition, some udies used the ground/vertical assumtion to assi in vanishing oint extraction [13, 2, 7, 8]. Saxena et al. [19] roosed to learn an MRF for 3D reconruction in large-scale environments. Fouhey et al. [9] roosed to learn 3D rimitives from RGB-D images for single-view reconruction of indoor environment. A number of methods relied on the assumtion that the object ructure in queion had smooth surfaces. They usually extracted object-level ructures from locally catured images that contained no ersective cues. For examle, the Cheeger Set aroach [18] assumed that the target ob-

3 ject had ball-like surfaces. Given the object contour and the object volume, it comuted the 3D ructure with the minimum surface area. However, such assumtions for object ructure are usually not valid for the reconruction of irregular 1 shaes. Therefore, many methods [25, 22, 21, 14] combined human interactions with these assumtions to guide the 3D reconruction. By and large, the lack of cues for the eimation of irregular 1 object ructures is the main challenge for single-view 3D reconruction. From this ersective, our research is related to the examle-based methods [11, 10, 20]. They required a comrehensive dataset of 3D object samles that were catured in all oses from different viewoints. The 3D ructure knowledge of a target object was extracted via a large number of comarisons between the object and the samles in the dataset. Chen et al. [6, 5] learned 3D reconruction knowledge from well built 3D object models. These 3D object models rovided knowledge on ructure deformation and the multi-view aearance. In this way, they learned to match the 2D shae of a given object to the 3D object models, and thus eimate the object s viewoint, ose, and 3D ructure. In contra, we aly much less suervision to idealize the concet of automatic category modeling in terms of training. Learning from informally catured RGB-D images without manual alignment saves great labor for rearation for 3D object models or a large 3D-examle dataset. The challenges of object detection, rotation changes, and the eimation of 3D ructure deformation are all involved in the training rocess. The trained category model is then alied back to the informally catured RGB images for 3D reconruction, without examle comarison. 3. Training of category detectors Category detectors are not only trained from, but also alied to informally catured and unaligned images. The category models for 3D reconruction are designed on the basis of category detectors. In our udy, we use the grahical model defined in [26] as the category detector, considering the need for robuness to shae deformations and rotations in the informally catured images. We then aly unsuervised learning for grah matching [28] to mine the category detectors. In reality, we simly use the RGB channels of RGB-D images for training. The detector is trained to encode the attern for the common objects in all the images. A set of key object arts are discovered to form the attern for a category (see Fig. 3(c)). Category detectors: As shown in Fig. 3(a), the image is reresented as a comlete attributed relational grah (ARG). In Fig. 3(b), continuous object edges (magenta) are discretized into line segments (black), and these line segments form the grah nodes. Thus the objects within it are reresented as sub-grahs. as in (b) SAP ( s includes i texture features) Old node/edge s features i j New node/edge s features i j Node Node t Nodet i j s i Nodes Figure 3. Object detectors. (a) Grah-based image reresentation. Object edge segments (cyan) form the grah nodes. (b) Notation for the grahical model. The initial grahical model consis of five nodes (colored lines) and edges (dotted lines) between them. (c) Learning the grahical model (category detector). The algorithm modifies (i) the initial grahical model into (ii) the SAP (G). The node set and node/edge features are modified so that 1) the node number (size) of G is maximized and 2) all the node/edge features reresent common atterns in all the grahs. The category detector is a grahical model in [26], which will be automatically trained to be the soft attributed attern (SAP) among the target sub-grahs (or objects) in different images. Let grah G with node set V denote the model. G is described by a set of node and edge features. Let {F s i } and {F j } (i = 1, 2,..., N P, j = 1, 2,..., N Q ) denote the i-th node feature for node s and the j-th edge feature for edge (s, t), resectively. Here, N P and N Q indicate the feature numbers for each node and edge, resectively. F V = {Fi s} {Fj }, s, t V. Note that F i s includes the texture features on the terminals of lines segments. The category detector thus also contains the textural knowledge. (Please see [26] for details of the feature settings.) Object detection: Object detection is achieved via grah matching. Given a target RGB image reresented by grah G with a node set V, grah matching is erformed to comute the matching assignments between G and G. Let x denote the label set, and let the label x s x denote the matched node in G for node s in G. Grah matching is formulated as the minimization of the following energy function w.r.t x. E(x F V,F V )= P s (x s F V,F V )+ Q (x s,x t F V,F V ) (1) s V s,t V,s t This is a tyical formula for grah matching, where the functions P s (, ) and Q (,, ) measure matching enalties (feature dissimilarity) between the two matched nodes and edges. These are generally defined using squared differences. E.g. P s (x s F V,F V ) = w T [ F1 s F xk s 1 2,..., FN s P F xk s N P 2 ]. (Please see [28] for details.) Learning grah matching with SAPs: As shown in Fig. 3(c), [28] roosed an algorithm for mining SAPs. The Node

4 SAP is a fuzzy attributed attern, which describes the common sub-grahs in a set of ARGs, in which node and edge features have considerable variations. In other words, the category detector (i.e. the SAP) reresents a set of key object arts (nodes) that frequently aear in training images of the entire category. The category detector is trained taking into account the robuness to texture variations and ructure deformations. To art the training rocess, this method requires eole only to label the sub-grah of a single object for initializing the grahical model G. [28] is designed to modify G from the secific shae of the labeled object into the SAP among all grahs. Given a set of N training images, each image is denoted by {G k } (k = 1, 2,..., N) with node set V k. F V k are the attribute sets of G k, and xk s x k X indicates the node in V k matched by s V. The frequency of node s in G is defined as the average enalty for matching s to all the {G k }, as follows. E s( ˆX,ˆF V )= 1 N N [P s(ˆx k s ˆF V,F V )+ ] Q k (ˆx k s, ˆx k t ˆF V,F V k ) k=1 t V,t s If E s ( ˆX, ˆF V ) is less than a given threshold τ, node s is regarded as a frequently aearing art; otherwise not. Hence, the goal of learning grah matching is to train the node set V and attributes F V of model G, in such a manner that 1) all nodes of G frequently aear in grahs {G k } and 2) the size of G is maximized. max V V s.t. (ˆF V, ˆX) = argmin F V,X={x k } N s V, E s ( ˆX, ˆF V ) τ. k=1 E(xk F V, F V k ); [28] rooses an EM framework to solve (2). 4. Learning 3D reconruction We introduce the rearation of training samles, model learning, and the alication of 3D reconruction in the following three subsections Object samle collection & ose normalization The rearation of training samles involves two es, i.e. object samle collection and 3D ose normalization. In the fir e, we use the trained category detector to collect RGB-D samles from informally catured RGB-D images for training. Each training samle contains its 2D shae and the 3D coordinates of each 2D ixel within it. The 3D coordinates are measured in the global coordinate syem of each RGB-D image, rather than a coordinate syem w.r.t the object. Therefore, in the second e, we comute the 3D ose of the object, and thereby normalize the 3D coordinates into the object coordinate syem. (2) Object samle collection: We use the category detector to match a set of key object arts (grah nodes) in each image by alying (1). However, as mentioned above, the key arts consi of line segments on the object, and do not comrise the entire object body. Therefore, we fir recover the entire body area of the object from the detected line segments, before the further learning. Actually, a number of methods for interactive 4 object segmentation are oriented to this alication. Nevertheless, in order to make a reliable evaluation of the roosed 3D reconruction method, we need to simlify the whole syem and thereby avoid bringing in uncertainties related to object segmentation. Consequently, given the key line segments of the object, we roughly eimate its body area as the convex hull of the line terminals (see Fig. 4(c)). 3D ose normalization: We define centers and 3D oses for the object samles in the RGB-D images, thus conructing a relative coordinate syem for each samle, as shown in Fig. 4(f,bottom). We then roject 3D oint clouds of the objects onto these coordinate syems, as the ground truth of their 3D ructures. Fig. 4(e) shows the notation. The center of an object is defined as the mean of the 3D coordinates of its key arts c = 1 s m s/m, where m is the art (node) number and s denotes the center coordinates of art s. We define the orientations of the three orthogonal coordinate axes < v 1,v 2,v 3 > as follows. v 1 =u(v 1 + v 2), v 2 =u(v 1 v 2), v 3 =v 1 v 2 (3) where function u( ) normalizes a vector to a unit vector, v 1 = u( 1 s<t m u( t s )), v 2 = u( 1 s<t m (o t o s )), and o s denotes the 3D orientation of line segment s Model learning We train a category model from RGB-D object samles to eimate the ixel-level 3D reconruction. Essentially, even if we are given a rior 3D ructure for a category, 3D reconruction from a single image ill has great challenges. There are two general hyotheses, i.e. object rotations and ructure deformations, to interreting a secific 2D object shae in 3D reconruction. Each hyothesis can indeendently eimate the 3D object ructure from the 2D shae (e.g. comuting the rotation or 3D deformation of the rior 3D ructure that be fits the contour of the 2D shae). Obviously, the 3D reconruction needs to combine the both hyotheses. Unlike [5, 6] using well built 3D object models to rovide or train rior regularity of ructure deformation, we need to train the category model to simultaneously identify the effects of object rotations and ructure deformations from 2D shaes, without rior knowledge. This greatly increases the challenge. 4 The key object arts can be labeled as the foreground.

5 a b c d e t o t f Local 2D coordinate t syem o s s Centerline s Almo erendicular to the an Almo arallel to the an M q M s,t Lean again a box s,(i) node t q,(i),(j) q,(j) s Lean again a box Figure 4. Some es in learning 3D reconruction. (a,b) Visualization of the detected key object arts in the RGB and deth image. (c) Recovery of the object body area. (d) Notation for the local 2D coordinate syem. (e) Notation for determining the local 3D coordinate syem. (f) Three axes of the local 3D coordinate syem. (bottom) Local 3D coordinate syems of three objects. Fig. 5 shows the basic design of the category model. The category model handles both the effects and directly rojects each 2D ixel into a 3D sace. Considering the model s robuness to shae deformation, we use a set of local 2D coordinate syems to simultaneously localize each ixel inside the 2D shae. Each local 2D coordinate syem is conructed using each air of key object arts. For each oint in every 2D coordinate syem, we train a local regressor to eimate its 3D coordinates from its oint features. The oint features are designed to reresent the satial relationshi between the oint and key objects arts, as well as the 2D shae of these key object arts. Thus, the oint features contain sufficient cues for object rotations and ructure deformation to guide the eimation of the 3D coordinates of each 2D oint. For oint in the local 2D coordinate syem w.r.t key arts s and t of a given object samle, its oint features are defined as θ = [ϑ 1, ϑ 2, ϑ 3, ϑ 4, ϑ 5, 1] T. ϑ 1 and ϑ 2 denote the lengths of s and t, resectively, and ϑ 3 denotes the angle between s and t. ϑ 4 and ϑ 5 measure the diance between and the line segments s and t, resectively. ϑ 1, ϑ 2, ϑ 4, and ϑ 5 are normalized by the centerline length between s and t (Fig. 4(d)). Then, the local regressor F ( ) is modeled as a linear regression of the 3D coordinates of using θ : F (θ ) = (θ ) T M (4) node t node s node u node v i Six local 2D coordinate syems uv sv Figure 5. Learning 3D reconruction. The category model uses different local 2D coordinate syems (bottom) to localize the 2D oints, and eimates the 3D ructure in these coordinate syems. Given a secific 2D coordinate syem w.r.t each air of key arts s and t, we generate a MRF to train the arameter M for the local regressor F( ) in each different oint. F( ) uses the oint feature θ,(i) in object samle i to eimate 3D coordinates of, thus overcoming i s secific object rotations, ructure deformation, and deth errors. tv su where matrix M is the arameter for the function F ( ). The local regressor F ( ) is the basic element of the category model. The category model is formulated as a linear combination of these local regressors, as follows. Y(x) = 1 Z 1 s<t m j tu w F (θ ) =(x) (5) where Z = 1 s<t m w=(x). x is a ixel inside the 2D object shae, and Y(x) eimates its 3D coordinates. s and t indicate two key arts of the object; m denotes the key art number. Given the local coordinate syem conructed using s and t, function (x) determines the 2D osition of x in this coordinate syem. Thus, function F( ) is the local regressor of 3D coordinates for oint in the coordinate syem of s and t, as mentioned above. w measures the reliability of the local regressor F( ) and is regarded as its weight. Local 2D coordinate syem: As shown in Fig. 4(d), given the line segments of each air of key arts s and t (1 s < t m), we generate a local 2D coordinate syem. The line segment connecting the center of s to that of t is t

6 called the centerline between s and t. The origin of the local coordinate syem w.r.t s and t is defined as the center of the centerline. We take the centerline s orientation as the orientation of the fir coordinate axis, and determine the orientation of the second axis using the right-hand rule. The unit vector of the fir axis is normalized using the centerline length, and that of the second axis is normalized using the average segment length of s and t. Model learning based on a continuous MRF: In the local 2D coordinate syem determined by the key arts s and t, the local regressors of the neighboring ositions are connected to form an MRF. We generate the following objective function to learn the arameters of the regressors. argmax M φ 1 (M ) + V (,q) E φ 2 (M, M q ) (6) where M = {M V}; V and E denote node and edge sets of the MRF, resectively. In the MRF, the unary term φ 1 ( ) measures the similarities between the ground truth and the 3D coordinates eimated by the regressor for each oint. φ 1 (M ) = 1 ex( η F S (θ,(i) ) y i ) (7) i S where S is the set of training samles. For each object samle i, θ,(i) and y i denote the oint feature and true 3D coordinates of, resectively. η is a scaling arameter. The airwise term φ 2 (, ) is a smoothing term between neighboring oints. φ 2 (M, M q ) = λ(m M q ) 2 ex[ α(u + U q ) 2 ] U = 1 max S (, ) E y i y i (8) i S where U use true 3D ructures of the training samles to measure the discontinuity around oint. Object areas with high discontinuity, such as object edges, have low weights for smoothing. λ and α are scaling arameters. We use the MCMC method to solve the continuous MRF. Finally, when the regressor F ( ) has been trained, the weight for F ( ) in (5) is comuted as w = 1 S i S ex[ η(f (θ,(i) ) y i )2 ] (9) D reconruction based on category models As shown in Fig. 1, we use a trained category model to directly erform 3D reconruction on teing RGB images. We fir use the trained category detector to collect target objects in the RGB images, and simultaneously determine the body area of the objects (see Section 4.1). We then use the category model in (5) to eimate the 3D coordinates for each ixel inside the object. 5. Exeriments 5.1. Data We use the category dataset of Kinect RGB-D images [1, 26], which is ublished as a andard RGB-D object dataset 5 for the learning of grah-matching-based models, such as [26, 27]. These RGB-D images deict cluttered scenes containing objects with different textures and rotations. Four categories notebook PC, drink box, srayer, and duan in this dataset contain a sufficient number of RGB-D objects for training and are thus chosen in the exeriments Imlementation details Training of category detectors: We train the SAP (category detector) from a set of large grahs (informally catured images). Parameter τ controls the grah size of the SAP, or in other word, the number of key object arts contained by the category detector. When we set a higher value for τ, we can extract more key arts for a category, but the key arts are less reliable in art detection. To simlify the learning rocess, we require the SAPs (detectors) to have four nodes (key arts) for all the four categories. We try different values of τ in the training rocess, until the category detector satisfies this requirement. Hence, we can detect four key arts for each object, and the rotation and deformation of the object are determined by the comlex satial relationshi between the four key arts. Learning 3D reconruction: As shown in Fig. 5, in each local 2D coordinate syem, we divide the entire object area comrising of grids to identify different 2D oints. A local regressor with arameter M is generated for each grid, and we thus use the local regressors conruct the MRF. For the alication of 3D reconruction, many ixels in the 2D shae are not accurately localized in the grid centers. To achieve accurate ixel-level 3D reconruction, we interolate the regressor arameter value for each 2D ixel from the trained arameters M of the nearby grids. We set arameter α as 0.1 for all the categories in model learning, and use different values of λ and η to te the syem Quantitative comarison Cometing method: We comare our aroach with conventional methods for 3D reconruction from a single image. As discussed in Section 2, mo related techniques have their own secific assumtions for the target image and are thus not suitable for 3D reconruction of irregular 1 shaes. From this viewoint, examle-based 3D reconruction is close to our method, but conventional techniques are 5 Comared to other RGB-D datasets i.e. [16, 12], this is one of the large RGB-D object datasets, and reflects challenges of grah matching.

7 Figure 6. 3D reconruction comarison. The roosed method exhibits smaller reconruction errors and surface roughness than the examle-based method. The learning of category models is not sensitive to different settings of arameters λ and η. λ and η are not involved in the examle-based method hamered by the use of informally catured RGB-D images for training. Nevertheless, considering the challenges in the training dataset, we design a cometing method that achieves a rough idea of the examle-based 3D reconruction, as follows. Fir, given a RGB image, we use the category detector trained in Section 3 to detect the target object in it, as well as its key object arts. We then randomly select an RGB-D object from the training samle set as the examle for 3D reconruction. The 3D coordinate eimation is based on (5), ju like the roosed method. However, without training, the weight for local regressors (w ) is set to 1. The local regressors F is redefined as a conant, i.e. the 3D coordinate for oint on the examle, rather than a function in (4) w.r.t. oint features θ. Pixel-level evaluation: We evaluate the reconruction errors and surface roughness (non-smoothness) of the roosed aroach at the ixel level. Reconruction errors are widely used to evaluate 3D reconruction. We manually reare the ground the truth of the 3D ructure for each object using the Kinect measurement. We measure the diance between the eimated 3D coordinates of each ixel and its true coordinates, as the ixel reconruction error. We define the reconruction error of an object as the average reconruction error of its conituent ixels. The reconruction error of a category is defined as the average reconruction error of all its objects. The other evaluation metric is the surface roughness. The roughness of ixel is measured as r = Y 1 N() q N() Yq, where ixel N() is the set of 4-connectivity neighboring ixels of, Y denotes the eimated 3D coordinates of ixel on the object. Ju like the reconruction error, the object roughness is the average of the ixel roughness, and the surface roughness of a category is defined as the average of object-level roughness. The evaluation is achieved via cross validation. Ju as in [26, 27], we randomly select 2/3 and 1/3 of the RGB- D images in the entire dataset as a air of samle ools for training and teing, resectively. Each air of samle ools can train a category model, and rovide values of the reconruction error and surface roughness for the category. We reare different airs of training and teing ools and comute the average erformance by cross validation. Fig. 6 and Fig. 7 show the comarison of 3D reconruction erformance. Without sufficient learning, the cometing method cannot correctly eimate ructure deformation for objects, and suffers from the unavoidable errors in Kinect s deth measurement. 6. Conclusions In this aer, we roosed to learn a category model for single-view 3D reconruction, which encoded knowledge for ructure deformation, texture variations, and rotation changes. The category model was both trained from and alied to the informally catured images. Exeriments demonrated the effectiveness of the roosed method. For the training of category detectors, we used various node and edge features. These included texture features on local image atches. Thus, the texture information contributes to the extraction of key object arts. However, textures contribute much less in further 3D reconruction, as many daily-use objects (e.g. the drink box) are designed with different textures that are unrelated to object ructures for commercial uroses. The satial relationshi between key object arts was, therefore, taken as more reliable cues for 3D reconruction and used in model learning. We did not aly object segmentation and thus avoided the uncertainties related to it, so as to simlify the syem and enable a reliable evaluation. The category model did not erformed so well on object edges as in other object area due to the time-of-the-flight errors and calibration errors between 3D oints and RGB images. References [1] Category Dataset of Kinect RGBD Images, htt://sites.google.com/site/quanshizhang. 6 [2] O. Barinova, V. Konushin, A. Yakubenko, K. Lee, H. Lim, and A. Konushin. Fa automatic single-view 3-d reconruction of urban scenes. In ECCV,

8 a1 a2 b1 b2 a3 b3 a4 Learn texture knowledge for object detection a5 b4 a6 a7 b5 c2 c3 a9 b7 c9 b9 b6 b8 Eimate ructure deformation from 2D shaes c1 a8 c4 c5 c6 c7 c8 Figure 7. 3D reconruction erformance. (a1 9) Object detection (cyan lines) in RGB images. (b1 9) Detailed view of the target objects. (c1 9) Reconruction results of the roosed method. [3] J. T. Barron and J. Malik. Color conancy, intrinsic images, and shae eimation. In ECCV, [4] J. T. Barron and J. Malik. Shae, albedo, and illumination from a single image of an unknown object. In CVPR, [5] Y. Chen and R. Ciolla. Single and sarse view 3d reconruction by learning shae riors. In CVIU, 115(5): , , 4 [6] Y. Chen, T.-K. Kim, and R. Ciolla. Inferring 3d shaes and deformations from single views. In ECCV, , 4 [7] E. Delage, H. Lee, and A. Ng. Automatic single-image 3d reconructions of indoor manhattan world scenes. In Robotics Research, [8] E. Delage, H. Lee, and A. Ng. A dynamic bayesian network model for autonomous 3d reconruction from a single indoor image. In CVPR, [9] D. Fouhey, A. Guta, and M. Hebert. Data-driven 3d rimitives for single image underanding. In ICCV, [10] T. Hassner and R. Basri. Examle based 3d reconruction from single 2d images. In CVPR worksho, [11] T. Hassner and R. Basri. Single view deth eimation from examles. In CoRR, abs/ , [12] E. Herb, X. Ren, and D. Fox. Rgb-d object discovery via multi-scene analysis. In IROS, [13] D. Hoiem, A. Efros, and M. Hebert. Geometric context from a single image. In ICCV, [14] K. Karsch, Z. Liao, J. Rock, J. T. Barron, and D. Hoiem. Boundary cues for 3d object shae recovery. In CVPR, , 3 [15] K. Ko ser, C. Zach, and M. Pollefeys. Dense 3d reconruction of symmetric scenes from a single image. In DAGM, [16] K. Lai, L. Bo, X. Ren, and D. Fox. A large-scale hierarchical multi-view rgb-d object dataset. In Proc. of IEEE International Conference on Robotics and Automation (ICRA), ages , [17] Q. Le, M. Ranzato, R. Monga, M. Devin, K. Chen, G. Corrado, J. Dean, and A. Ng. Building high-level features using large scale unsuervised learning. In ICML, [18] M. R. Oswald, E. To e, and D. Cremers. Fa and globally otimal single view reconruction of curved objects. In CVPR, [19] A. Saxena, M. Sun, and A. Y. Ng. Make3d: Learning 3d scene ructure from a single ill image. In PAMI, 31(5): , [20] A. Thomas, V. Ferrari, B. Leibe, T. Tuytelaars, and L. Gool. Shae-from recognition: Recognition enables meta-data transfer. In CVIU, [21] E. Toe, C. Nieuwenhuis, and D. Cremers. Relative volume conraints for single view 3d reconruction. In CVPR, [22] E. To e, M. R. Oswald, D. Cremers, and C. Rother. Imagebased 3d modeling via cheeger sets. In ACCV, [23] A. Varol, A. Shaji, M. Salzmann, and P. Fua. Monocular 3d reconruction of locally textured surfaces. In PAMI, 34(6): , [24] T. Xue, J. Liu, and X. Tang. Symmetric iecewise lanar object reconruction from a single image. In CVPR, [25] L. Zhang, G. Dugas-Phocion, J.-S. Samson, and S. M. Seitz. Single view modeling of free-form scenes. In CVPR, [26] Q. Zhang, X. Song, X. Shao, R. Shibasaki, and H. Zhao. Category modeling from ju a single labeling: Use deth information to guide the learning of 2d models. In IEEE Conference on Comuter Vision and Pattern Recognition (CVPR), ages , , 2, 3, 6, 7 [27] Q. Zhang, X. Song, X. Shao, H. Zhao, and R. Shibasaki. Learning grah matching for category modeling from cluttered scenes. In ICCV, , 7 [28] Q. Zhang, X. Song, X. Shao, H. Zhao, and R. Shibasaki. Attributed grah mining and matching: An attemt to define and extract soft attributed atterns. In CVPR, , 3, 4

Learning Motion Patterns in Crowded Scenes Using Motion Flow Field

Learning Motion Patterns in Crowded Scenes Using Motion Flow Field Learning Motion Patterns in Crowded Scenes Using Motion Flow Field Min Hu, Saad Ali and Mubarak Shah Comuter Vision Lab, University of Central Florida {mhu,sali,shah}@eecs.ucf.edu Abstract Learning tyical

More information

CS 229 Final Project: Single Image Depth Estimation From Predicted Semantic Labels

CS 229 Final Project: Single Image Depth Estimation From Predicted Semantic Labels CS 229 Final Project: Single Image Deth Estimation From Predicted Semantic Labels Beyang Liu beyangl@cs.stanford.edu Stehen Gould sgould@stanford.edu Prof. Dahne Koller koller@cs.stanford.edu December

More information

Interactive Image Segmentation

Interactive Image Segmentation Interactive Image Segmentation Fahim Mannan (260 266 294) Abstract This reort resents the roject work done based on Boykov and Jolly s interactive grah cuts based N-D image segmentation algorithm([1]).

More information

Perception of Shape from Shading

Perception of Shape from Shading 1 Percetion of Shae from Shading Continuous image brightness variation due to shae variations is called shading Our ercetion of shae deends on shading Circular region on left is erceived as a flat disk

More information

Last time: Disparity. Lecture 11: Stereo II. Last time: Triangulation. Last time: Multi-view geometry. Last time: Epipolar geometry

Last time: Disparity. Lecture 11: Stereo II. Last time: Triangulation. Last time: Multi-view geometry. Last time: Epipolar geometry Last time: Disarity Lecture 11: Stereo II Thursday, Oct 4 CS 378/395T Prof. Kristen Grauman Disarity: difference in retinal osition of same item Case of stereo rig for arallel image lanes and calibrated

More information

Face Recognition Using Legendre Moments

Face Recognition Using Legendre Moments Face Recognition Using Legendre Moments Dr.S.Annadurai 1 A.Saradha Professor & Head of CSE & IT Research scholar in CSE Government College of Technology, Government College of Technology, Coimbatore, Tamilnadu,

More information

Stereo Disparity Estimation in Moment Space

Stereo Disparity Estimation in Moment Space Stereo Disarity Estimation in oment Sace Angeline Pang Faculty of Information Technology, ultimedia University, 63 Cyberjaya, alaysia. angeline.ang@mmu.edu.my R. ukundan Deartment of Comuter Science, University

More information

CMSC 425: Lecture 16 Motion Planning: Basic Concepts

CMSC 425: Lecture 16 Motion Planning: Basic Concepts : Lecture 16 Motion lanning: Basic Concets eading: Today s material comes from various sources, including AI Game rogramming Wisdom 2 by S. abin and lanning Algorithms by S. M. LaValle (Chats. 4 and 5).

More information

A Novel Iris Segmentation Method for Hand-Held Capture Device

A Novel Iris Segmentation Method for Hand-Held Capture Device A Novel Iris Segmentation Method for Hand-Held Cature Device XiaoFu He and PengFei Shi Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200030, China {xfhe,

More information

Multi-robot SLAM with Unknown Initial Correspondence: The Robot Rendezvous Case

Multi-robot SLAM with Unknown Initial Correspondence: The Robot Rendezvous Case Multi-robot SLAM with Unknown Initial Corresondence: The Robot Rendezvous Case Xun S. Zhou and Stergios I. Roumeliotis Deartment of Comuter Science & Engineering, University of Minnesota, Minneaolis, MN

More information

The Research on Curling Track Empty Value Fill Algorithm Based on Similar Forecast

The Research on Curling Track Empty Value Fill Algorithm Based on Similar Forecast Research Journal of Alied Sciences, Engineering and Technology 6(8): 1472-1478, 2013 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scientific Organization, 2013 Submitted: October 31, 2012 Acceted: January

More information

PREDICTING LINKS IN LARGE COAUTHORSHIP NETWORKS

PREDICTING LINKS IN LARGE COAUTHORSHIP NETWORKS PREDICTING LINKS IN LARGE COAUTHORSHIP NETWORKS Kevin Miller, Vivian Lin, and Rui Zhang Grou ID: 5 1. INTRODUCTION The roblem we are trying to solve is redicting future links or recovering missing links

More information

Bayesian Oil Spill Segmentation of SAR Images via Graph Cuts 1

Bayesian Oil Spill Segmentation of SAR Images via Graph Cuts 1 Bayesian Oil Sill Segmentation of SAR Images via Grah Cuts 1 Sónia Pelizzari and José M. Bioucas-Dias Instituto de Telecomunicações, I.S.T., TULisbon,Lisboa, Portugal sonia@lx.it.t, bioucas@lx.it.t Abstract.

More information

SPARSE SIGNAL REPRESENTATION FOR COMPLEX-VALUED IMAGING Sadegh Samadi 1, M üjdat Çetin 2, Mohammad Ali Masnadi-Shirazi 1

SPARSE SIGNAL REPRESENTATION FOR COMPLEX-VALUED IMAGING Sadegh Samadi 1, M üjdat Çetin 2, Mohammad Ali Masnadi-Shirazi 1 SPARSE SIGNAL REPRESENTATION FOR COMPLEX-VALUED IMAGING Sadegh Samadi 1, M üjdat Çetin, Mohammad Ali Masnadi-Shirazi 1 1. Shiraz University, Shiraz, Iran,. Sabanci University, Istanbul, Turkey ssamadi@shirazu.ac.ir,

More information

Graph Cut Matching In Computer Vision

Graph Cut Matching In Computer Vision Grah Cut Matching In Comuter Vision Toby Collins (s0455374@sms.ed.ac.uk) February 2004 Introduction Many of the roblems that arise in early vision can be naturally exressed in terms of energy minimization.

More information

Improved Image Super-Resolution by Support Vector Regression

Improved Image Super-Resolution by Support Vector Regression Proceedings of International Joint Conference on Neural Networks, San Jose, California, USA, July 3 August 5, 0 Imroved Image Suer-Resolution by Suort Vector Regression Le An and Bir Bhanu Abstract Suort

More information

Wavelet Based Statistical Adapted Local Binary Patterns for Recognizing Avatar Faces

Wavelet Based Statistical Adapted Local Binary Patterns for Recognizing Avatar Faces Wavelet Based Statistical Adated Local Binary atterns for Recognizing Avatar Faces Abdallah A. Mohamed 1, 2 and Roman V. Yamolskiy 1 1 Comuter Engineering and Comuter Science, University of Louisville,

More information

AUTOMATIC 3D SURFACE RECONSTRUCTION BY COMBINING STEREOVISION WITH THE SLIT-SCANNER APPROACH

AUTOMATIC 3D SURFACE RECONSTRUCTION BY COMBINING STEREOVISION WITH THE SLIT-SCANNER APPROACH AUTOMATIC 3D SURFACE RECONSTRUCTION BY COMBINING STEREOVISION WITH THE SLIT-SCANNER APPROACH A. Prokos 1, G. Karras 1, E. Petsa 2 1 Deartment of Surveying, National Technical University of Athens (NTUA),

More information

An Efficient Coding Method for Coding Region-of-Interest Locations in AVS2

An Efficient Coding Method for Coding Region-of-Interest Locations in AVS2 An Efficient Coding Method for Coding Region-of-Interest Locations in AVS2 Mingliang Chen 1, Weiyao Lin 1*, Xiaozhen Zheng 2 1 Deartment of Electronic Engineering, Shanghai Jiao Tong University, China

More information

Depth Estimation for 2D to 3D Football Video Conversion

Depth Estimation for 2D to 3D Football Video Conversion International Research Journal of Alied and Basic Sciences 2013 Available online at www.irjabs.com ISSN 2251-838X / Vol, 6 (4): 475-480 Science Exlorer ublications Deth Estimation for 2D to 3D Football

More information

Learning Robust Locality Preserving Projection via p-order Minimization

Learning Robust Locality Preserving Projection via p-order Minimization Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence Learning Robust Locality Preserving Projection via -Order Minimization Hua Wang, Feiing Nie, Heng Huang Deartment of Electrical

More information

Chapter 8: Adaptive Networks

Chapter 8: Adaptive Networks Chater : Adative Networks Introduction (.1) Architecture (.2) Backroagation for Feedforward Networks (.3) Jyh-Shing Roger Jang et al., Neuro-Fuzzy and Soft Comuting: A Comutational Aroach to Learning and

More information

Optimal Multiple Sprite Generation based on Physical Camera Parameter Estimation

Optimal Multiple Sprite Generation based on Physical Camera Parameter Estimation Otimal Multile Srite Generation based on Physical Camera Parameter Estimation Matthias Kunter* a, Andreas Krutz a, Mrinal Mandal b, Thomas Sikora a a Commun. Systems Grou, Technische Universität Berlin,

More information

CS 428: Fall Introduction to. Geometric Transformations. Andrew Nealen, Rutgers, /15/2010 1

CS 428: Fall Introduction to. Geometric Transformations. Andrew Nealen, Rutgers, /15/2010 1 CS 428: Fall 21 Introduction to Comuter Grahics Geometric Transformations Andrew Nealen, Rutgers, 21 9/15/21 1 Toic overview Image formation and OenGL (last week) Modeling the image formation rocess OenGL

More information

A Morphological LiDAR Points Cloud Filtering Method based on GPGPU

A Morphological LiDAR Points Cloud Filtering Method based on GPGPU A Morhological LiDAR Points Cloud Filtering Method based on GPGPU Shuo Li 1, Hui Wang 1, Qiuhe Ma 1 and Xuan Zha 2 1 Zhengzhou Institute of Surveying & Maing, No.66, Longhai Middle Road, Zhengzhou, China

More information

Grouping of Patches in Progressive Radiosity

Grouping of Patches in Progressive Radiosity Grouing of Patches in Progressive Radiosity Arjan J.F. Kok * Abstract The radiosity method can be imroved by (adatively) grouing small neighboring atches into grous. Comutations normally done for searate

More information

Matlab Virtual Reality Simulations for optimizations and rapid prototyping of flexible lines systems

Matlab Virtual Reality Simulations for optimizations and rapid prototyping of flexible lines systems Matlab Virtual Reality Simulations for otimizations and raid rototying of flexible lines systems VAMVU PETRE, BARBU CAMELIA, POP MARIA Deartment of Automation, Comuters, Electrical Engineering and Energetics

More information

Detection of Occluded Face Image using Mean Based Weight Matrix and Support Vector Machine

Detection of Occluded Face Image using Mean Based Weight Matrix and Support Vector Machine Journal of Comuter Science 8 (7): 1184-1190, 2012 ISSN 1549-3636 2012 Science Publications Detection of Occluded Face Image using Mean Based Weight Matrix and Suort Vector Machine 1 G. Nirmala Priya and

More information

Weight Co-occurrence based Integrated Color and Intensity Matrix for CBIR

Weight Co-occurrence based Integrated Color and Intensity Matrix for CBIR Weight Co-occurrence based Integrated Color and Intensity Matrix for CBIR Megha Agarwal, 2R.P. Maheshwari Indian Institute of Technology Roorkee 247667, Uttarakhand, India meghagarwal29@gmail.com, 2rmaheshwari@gmail.com

More information

Adaptive Reciprocal Cell based Sparse Representation for Satellite Image Restoration

Adaptive Reciprocal Cell based Sparse Representation for Satellite Image Restoration Vol.49 (SoftTech 04),.8-85 htt://dx.doi.org/0.457/astl.04.49.50 Adative Recirocal Cell based Sarse Reresentation for Satellite Image Restoration Yanfei He and Shunfeng Wang Deartment of Math and Statistic,

More information

Wide-baseline Hair Capture using Strand-based Refinement

Wide-baseline Hair Capture using Strand-based Refinement 2013 IEEE Conference on Comuter Vision and Pattern Recognition Wide-baseline Hair Cature using Strand-based Refinement Linjie Luo Princeton University Cha Zhang Microsoft Research Zhengyou Zhang Microsoft

More information

Sensitivity Analysis for an Optimal Routing Policy in an Ad Hoc Wireless Network

Sensitivity Analysis for an Optimal Routing Policy in an Ad Hoc Wireless Network 1 Sensitivity Analysis for an Otimal Routing Policy in an Ad Hoc Wireless Network Tara Javidi and Demosthenis Teneketzis Deartment of Electrical Engineering and Comuter Science University of Michigan Ann

More information

Figure 8.1: Home age taken from the examle health education site (htt:// Setember 14, 2001). 201

Figure 8.1: Home age taken from the examle health education site (htt://  Setember 14, 2001). 201 200 Chater 8 Alying the Web Interface Profiles: Examle Web Site Assessment 8.1 Introduction This chater describes the use of the rofiles develoed in Chater 6 to assess and imrove the quality of an examle

More information

EE678 Application Presentation Content Based Image Retrieval Using Wavelets

EE678 Application Presentation Content Based Image Retrieval Using Wavelets EE678 Alication Presentation Content Based Image Retrieval Using Wavelets Grou Members: Megha Pandey megha@ee. iitb.ac.in 02d07006 Gaurav Boob gb@ee.iitb.ac.in 02d07008 Abstract: We focus here on an effective

More information

A label distance maximum-based classifier for multi-label learning

A label distance maximum-based classifier for multi-label learning Bio-Medical Materials and Engineering 26 (2015) S1969 S1976 DOI 10.3233/BME-151500 IOS ress S1969 A label distance maximum-based classifier for multi-label learning Xiaoli Liu a,b, Hang Bao a, Dazhe Zhao

More information

Texture Mapping with Vector Graphics: A Nested Mipmapping Solution

Texture Mapping with Vector Graphics: A Nested Mipmapping Solution Texture Maing with Vector Grahics: A Nested Mimaing Solution Wei Zhang Yonggao Yang Song Xing Det. of Comuter Science Det. of Comuter Science Det. of Information Systems Prairie View A&M University Prairie

More information

Holistic 3D Scene Understanding from a Single Geo-tagged Image

Holistic 3D Scene Understanding from a Single Geo-tagged Image Holistic 3D Scene Understanding from a Single Geo-tagged Image Shenlong Wang, Sanja Fidler and Raquel Urtasun Deartment of Comuter Science University of Toronto {slwang, fidler, urtasun}@cs.toronto.edu

More information

Lecture 3: Geometric Algorithms(Convex sets, Divide & Conquer Algo.)

Lecture 3: Geometric Algorithms(Convex sets, Divide & Conquer Algo.) Advanced Algorithms Fall 2015 Lecture 3: Geometric Algorithms(Convex sets, Divide & Conuer Algo.) Faculty: K.R. Chowdhary : Professor of CS Disclaimer: These notes have not been subjected to the usual

More information

arxiv: v1 [cs.mm] 18 Jan 2016

arxiv: v1 [cs.mm] 18 Jan 2016 Lossless Intra Coding in with 3-ta Filters Saeed R. Alvar a, Fatih Kamisli a a Deartment of Electrical and Electronics Engineering, Middle East Technical University, Turkey arxiv:1601.04473v1 [cs.mm] 18

More information

DEPT: Depth Estimation by Parameter Transfer for Single Still Images

DEPT: Depth Estimation by Parameter Transfer for Single Still Images DEPT: Depth Estimation by Parameter Transfer for Single Still Images Xiu Li 1, 2, Hongwei Qin 1,2, Yangang Wang 3, Yongbing Zhang 1,2, and Qionghai Dai 1 1. Dept. of Automation, Tsinghua University 2.

More information

MATHEMATICAL MODELING OF COMPLEX MULTI-COMPONENT MOVEMENTS AND OPTICAL METHOD OF MEASUREMENT

MATHEMATICAL MODELING OF COMPLEX MULTI-COMPONENT MOVEMENTS AND OPTICAL METHOD OF MEASUREMENT MATHEMATICAL MODELING OF COMPLE MULTI-COMPONENT MOVEMENTS AND OPTICAL METHOD OF MEASUREMENT V.N. Nesterov JSC Samara Electromechanical Plant, Samara, Russia Abstract. The rovisions of the concet of a multi-comonent

More information

Building Polygonal Maps from Laser Range Data

Building Polygonal Maps from Laser Range Data ECAI Int. Cognitive Robotics Worksho, Valencia, Sain, August 2004 Building Polygonal Mas from Laser Range Data Longin Jan Latecki and Rolf Lakaemer and Xinyu Sun and Diedrich Wolter Abstract. This aer

More information

Extracting Optimal Paths from Roadmaps for Motion Planning

Extracting Optimal Paths from Roadmaps for Motion Planning Extracting Otimal Paths from Roadmas for Motion Planning Jinsuck Kim Roger A. Pearce Nancy M. Amato Deartment of Comuter Science Texas A&M University College Station, TX 843 jinsuckk,ra231,amato @cs.tamu.edu

More information

Detection of Abandoned Objects in Crowded Environments

Detection of Abandoned Objects in Crowded Environments Detection of Abandoned Objects in Crowded Environments Medha Bhargava, Chia-Chih Chen, M. S. Ryoo, and J. K. Aggarwal Comuter and Vision Research Center Deartment of Electrical and Comuter Engineering

More information

DEPT: Depth Estimation by Parameter Transfer for Single Still Images

DEPT: Depth Estimation by Parameter Transfer for Single Still Images DEPT: Depth Estimation by Parameter Transfer for Single Still Images Xiu Li 1,2, Hongwei Qin 1,2(B), Yangang Wang 3, Yongbing Zhang 1,2, and Qionghai Dai 1 1 Department of Automation, Tsinghua University,

More information

Feature Selection and Pose Estimation From Known Planar Objects Using Monocular Vision

Feature Selection and Pose Estimation From Known Planar Objects Using Monocular Vision Feature Selection and Pose Estimation From Known Planar Objects Using Monocular ision Shengdong Xu and Ming Liu 2 ETH Zurich, Switzerland, Email: Samuel.xu988@gmail.com 2 The Hong Kong University of Science

More information

521493S Computer Graphics Exercise 3 (Chapters 6-8)

521493S Computer Graphics Exercise 3 (Chapters 6-8) 521493S Comuter Grahics Exercise 3 (Chaters 6-8) 1 Most grahics systems and APIs use the simle lighting and reflection models that we introduced for olygon rendering Describe the ways in which each of

More information

Efficient Parallel Hierarchical Clustering

Efficient Parallel Hierarchical Clustering Efficient Parallel Hierarchical Clustering Manoranjan Dash 1,SimonaPetrutiu, and Peter Scheuermann 1 Deartment of Information Systems, School of Comuter Engineering, Nanyang Technological University, Singaore

More information

CENTRAL AND PARALLEL PROJECTIONS OF REGULAR SURFACES: GEOMETRIC CONSTRUCTIONS USING 3D MODELING SOFTWARE

CENTRAL AND PARALLEL PROJECTIONS OF REGULAR SURFACES: GEOMETRIC CONSTRUCTIONS USING 3D MODELING SOFTWARE CENTRAL AND PARALLEL PROJECTIONS OF REGULAR SURFACES: GEOMETRIC CONSTRUCTIONS USING 3D MODELING SOFTWARE Petra Surynková Charles University in Prague, Faculty of Mathematics and Physics, Sokolovská 83,

More information

A 2D Random Walk Mobility Model for Location Management Studies in Wireless Networks Abstract: I. Introduction

A 2D Random Walk Mobility Model for Location Management Studies in Wireless Networks Abstract: I. Introduction A D Random Walk Mobility Model for Location Management Studies in Wireless Networks Kuo Hsing Chiang, RMIT University, Melbourne, Australia Nirmala Shenoy, Information Technology Deartment, RIT, Rochester,

More information

Visualization, Estimation and User-Modeling for Interactive Browsing of Image Libraries

Visualization, Estimation and User-Modeling for Interactive Browsing of Image Libraries Visualization, Estimation and User-Modeling for Interactive Browsing of Image Libraries Qi Tian, Baback Moghaddam 2 and Thomas S. Huang Beckman Institute, University of Illinois, Urbana-Chamaign, IL 680,

More information

Pattern Recognition Letters

Pattern Recognition Letters Pattern Recognition Letters 30 (2009) 114 122 Contents lists available at ScienceDirect Pattern Recognition Letters journal homeage: www.elsevier.com/locate/atrec A stroke filter and its alication to text

More information

Equality-Based Translation Validator for LLVM

Equality-Based Translation Validator for LLVM Equality-Based Translation Validator for LLVM Michael Ste, Ross Tate, and Sorin Lerner University of California, San Diego {mste,rtate,lerner@cs.ucsd.edu Abstract. We udated our Peggy tool, reviously resented

More information

GRAPH CUT OPTIMIZATION FOR THE MUMFORD-SHAH MODEL

GRAPH CUT OPTIMIZATION FOR THE MUMFORD-SHAH MODEL GRAPH CUT OPTIMIZATION FOR THE MUMFORD-SHAH MODEL Noha El Zehiry 1,2, Steve Xu 2, Prasanna Sahoo 2 and Adel Elmaghraby 1 1 Deartment of Comuter Engineering and Comuter Science, University of Louisville,

More information

P Z. parametric surface Q Z. 2nd Image T Z

P Z. parametric surface Q Z. 2nd Image T Z Direct recovery of shae from multile views: a arallax based aroach Rakesh Kumar. Anandan Keith Hanna Abstract Given two arbitrary views of a scene under central rojection, if the motion of oints on a arametric

More information

Earthenware Reconstruction Based on the Shape Similarity among Potsherds

Earthenware Reconstruction Based on the Shape Similarity among Potsherds Original Paer Forma, 16, 77 90, 2001 Earthenware Reconstruction Based on the Shae Similarity among Potsherds Masayoshi KANOH 1, Shohei KATO 2 and Hidenori ITOH 1 1 Nagoya Institute of Technology, Gokiso-cho,

More information

Efficient Sequence Generator Mining and its Application in Classification

Efficient Sequence Generator Mining and its Application in Classification Efficient Sequence Generator Mining and its Alication in Classification Chuancong Gao, Jianyong Wang 2, Yukai He 3 and Lizhu Zhou 4 Tsinghua University, Beijing 0084, China {gaocc07, heyk05 3 }@mails.tsinghua.edu.cn,

More information

Shape from Second-bounce of Light Transport

Shape from Second-bounce of Light Transport Shae from Second-bounce of Light Transort Siying Liu 1, Tian-Tsong Ng 1, and Yasuyuki Matsushita 2 1 Institute for Infocomm Research Singaore 2 Microsoft Research Asia Abstract. This aer describes a method

More information

AUTOMATIC GENERATION OF HIGH THROUGHPUT ENERGY EFFICIENT STREAMING ARCHITECTURES FOR ARBITRARY FIXED PERMUTATIONS. Ren Chen and Viktor K.

AUTOMATIC GENERATION OF HIGH THROUGHPUT ENERGY EFFICIENT STREAMING ARCHITECTURES FOR ARBITRARY FIXED PERMUTATIONS. Ren Chen and Viktor K. inuts er clock cycle Streaming ermutation oututs er clock cycle AUTOMATIC GENERATION OF HIGH THROUGHPUT ENERGY EFFICIENT STREAMING ARCHITECTURES FOR ARBITRARY FIXED PERMUTATIONS Ren Chen and Viktor K.

More information

An accurate and fast point-to-plane registration technique

An accurate and fast point-to-plane registration technique Pattern Recognition Letters 24 (23) 2967 2976 www.elsevier.com/locate/atrec An accurate and fast oint-to-lane registration technique Soon-Yong Park *, Murali Subbarao Deartment of Electrical and Comuter

More information

Probabilistic Region Matching in Narrow-Band Endoscopy for Targeted Optical Biopsy

Probabilistic Region Matching in Narrow-Band Endoscopy for Targeted Optical Biopsy Probabilistic Region Matching in Narrow-Band Endoscoy for Targeted Otical Biosy Selen Atasoy 1,2, Ben Glocker 1, Stamatia Giannarou 2, Diana Mateus 1, Alexander Meining(MD) 3, Guang-Zhong Yang 2 and Nassir

More information

AUTOMATIC ROAD VECTOR EXTRACTION FOR MOBILE MAPPING SYSTEMS

AUTOMATIC ROAD VECTOR EXTRACTION FOR MOBILE MAPPING SYSTEMS AUTOMATIC ROAD VECTOR EXTRACTION FOR MOBILE MAPPING SYSTEMS WANG Cheng a, T. Hassan b, N. El-Sheimy b, M. Lavigne b a School of Electronic Science and Engineering, National University of Defence Technology,

More information

GEOMETRIC CONSTRAINT SOLVING IN < 2 AND < 3. Department of Computer Sciences, Purdue University. and PAMELA J. VERMEER

GEOMETRIC CONSTRAINT SOLVING IN < 2 AND < 3. Department of Computer Sciences, Purdue University. and PAMELA J. VERMEER GEOMETRIC CONSTRAINT SOLVING IN < AND < 3 CHRISTOPH M. HOFFMANN Deartment of Comuter Sciences, Purdue University West Lafayette, Indiana 47907-1398, USA and PAMELA J. VERMEER Deartment of Comuter Sciences,

More information

Citation for the original published paper (version of record):

Citation for the original published paper (version of record): htt://www.diva-ortal.org Prerint This is the submitted version of a aer ublished in Robotics and Autonomous Systems. Citation for the original ublished aer (version of record): Andreasson, H., Lilienthal,

More information

IEEE Coyright Notice Personal use of this material is ermitted. However, ermission to rerint/reublish this material for advertising or romotional uroses or for creating new collective works for resale

More information

Learning to Combine Mid-level Cues for Object Proposal Generation

Learning to Combine Mid-level Cues for Object Proposal Generation Learning to Combine Mid-level Cues for Object Proosal Generation Tom Lee Sanja Fidler Sven Dickinson University of Toronto {tshlee,fidler,sven}@cs.toronto.edu Abstract In recent years, region roosals have

More information

AUTOMATIC EXTRACTION OF BUILDING OUTLINE FROM HIGH RESOLUTION AERIAL IMAGERY

AUTOMATIC EXTRACTION OF BUILDING OUTLINE FROM HIGH RESOLUTION AERIAL IMAGERY AUTOMATIC EXTRACTION OF BUILDING OUTLINE FROM HIGH RESOLUTION AERIAL IMAGERY Yandong Wang EagleView Technology Cor. 5 Methodist Hill Dr., Rochester, NY 1463, the United States yandong.wang@ictometry.com

More information

Face Recognition Based on Wavelet Transform and Adaptive Local Binary Pattern

Face Recognition Based on Wavelet Transform and Adaptive Local Binary Pattern Face Recognition Based on Wavelet Transform and Adative Local Binary Pattern Abdallah Mohamed 1,2, and Roman Yamolskiy 1 1 Comuter Engineering and Comuter Science, University of Louisville, Louisville,

More information

Application of 3D Human Motion in the Sports based on Computer Aided Analysis

Application of 3D Human Motion in the Sports based on Computer Aided Analysis Send Orders for Rerints to rerints@benthamscience.ae 1728 he Oen Cybernetics & Systemics Journal, 2015, 9, 1728-1732 Oen Access Alication of 3D Human Motion in the Sorts based on Comuter Aided Analysis

More information

Using Permuted States and Validated Simulation to Analyze Conflict Rates in Optimistic Replication

Using Permuted States and Validated Simulation to Analyze Conflict Rates in Optimistic Replication Using Permuted States and Validated Simulation to Analyze Conflict Rates in Otimistic Relication An-I A. Wang Comuter Science Deartment Florida State University Geoff H. Kuenning Comuter Science Deartment

More information

The Spatial Skyline Queries

The Spatial Skyline Queries The Satial Skyline Queries Mehdi Sharifzadeh Comuter Science Deartment University of Southern California Los Angeles, CA 90089-078 sharifza@usc.edu Cyrus Shahabi Comuter Science Deartment University of

More information

Selecting Discriminative Binary Patterns for a Local Feature

Selecting Discriminative Binary Patterns for a Local Feature BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 15, No 3 Sofia 015 rint ISSN: 1311-970; Online ISSN: 1314-4081 DOI: 10.1515/cait-015-0044 Selecting Discriminative Binary atterns

More information

How to fit a surface to a point cloud?

How to fit a surface to a point cloud? TRICS seminar Comuter Science Deartment How to fit a surface to a oint cloud? or otimization of surface functionals in comuter vision Yuri Boykov Otimization of surface functionals in comuter vision The

More information

Space-efficient Region Filling in Raster Graphics

Space-efficient Region Filling in Raster Graphics "The Visual Comuter: An International Journal of Comuter Grahics" (submitted July 13, 1992; revised December 7, 1992; acceted in Aril 16, 1993) Sace-efficient Region Filling in Raster Grahics Dominik Henrich

More information

Model-Based Annotation of Online Handwritten Datasets

Model-Based Annotation of Online Handwritten Datasets Model-Based Annotation of Online Handwritten Datasets Anand Kumar, A. Balasubramanian, Anoo Namboodiri and C.V. Jawahar Center for Visual Information Technology, International Institute of Information

More information

Robust Real Time Pattern Matching using Bayesian Sequential Hypothesis Testing

Robust Real Time Pattern Matching using Bayesian Sequential Hypothesis Testing Robust Real Time Pattern Matching using Bayesian Sequential Hyothesis Testing Ofir Pele and Michael Werman Abstract This aer describes a method for robust real time attern matching. We first introduce

More information

MULTI-CAMERA SURVEILLANCE WITH VISUAL TAGGING AND GENERIC CAMERA PLACEMENT. Jian Zhao and Sen-ching S. Cheung

MULTI-CAMERA SURVEILLANCE WITH VISUAL TAGGING AND GENERIC CAMERA PLACEMENT. Jian Zhao and Sen-ching S. Cheung MULTI-CAMERA SURVEILLANCE WITH VISUAL TAGGING AND GENERIC CAMERA PLACEMENT Jian Zhao and Sen-ching S. Cheung University of Kentucky Center for Visualization and Virtual Environment 1 Quality Street, Suite

More information

3D Surface Simplification Based on Extended Shape Operator

3D Surface Simplification Based on Extended Shape Operator 3D Surface Simlification Based on Extended Shae Oerator JUI-LIG TSEG, YU-HSUA LI Deartment of Comuter Science and Information Engineering, Deartment and Institute of Electrical Engineering Minghsin University

More information

Privacy Preserving Moving KNN Queries

Privacy Preserving Moving KNN Queries Privacy Preserving Moving KNN Queries arxiv:4.76v [cs.db] 4 Ar Tanzima Hashem Lars Kulik Rui Zhang National ICT Australia, Deartment of Comuter Science and Software Engineering University of Melbourne,

More information

10. Parallel Methods for Data Sorting

10. Parallel Methods for Data Sorting 10. Parallel Methods for Data Sorting 10. Parallel Methods for Data Sorting... 1 10.1. Parallelizing Princiles... 10.. Scaling Parallel Comutations... 10.3. Bubble Sort...3 10.3.1. Sequential Algorithm...3

More information

arxiv: v1 [cs.cv] 20 Sep 2016

arxiv: v1 [cs.cv] 20 Sep 2016 Transfer Learning for Material Classification using Convolutional Networks Patrick Wieschollek Hendrik P.A. Lensch Tübingen University arxiv:69.688v [cs.cv] 2 Se 26 Abstract Material classification in

More information

THE COMPARISON OF DRAINAGE NETWORK EXTRACTION BETWEEN SQUARE AND HEXAGONAL GRID-BASED DEM

THE COMPARISON OF DRAINAGE NETWORK EXTRACTION BETWEEN SQUARE AND HEXAGONAL GRID-BASED DEM The International Archives of the Photogrammetry, Remote Sensing and Satial Information Sciences, Volume XLII-4, 208 ISPRS TC IV Mid-term Symosium 3D Satial Information Science The Engine of Change, 5

More information

Short Papers. Symmetry Detection by Generalized Complex (GC) Moments: A Close-Form Solution 1 INTRODUCTION

Short Papers. Symmetry Detection by Generalized Complex (GC) Moments: A Close-Form Solution 1 INTRODUCTION 466 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE VOL 2 NO 5 MAY 999 Short Paers Symmetry Detection by Generalized Comlex (GC) Moments: A Close-Form Solution Dinggang Shen Horace HS I

More information

Mining Association rules with Dynamic and Collective Support Thresholds

Mining Association rules with Dynamic and Collective Support Thresholds Mining Association rules with Dynamic and Collective Suort Thresholds C S Kanimozhi Selvi and A Tamilarasi Abstract Mining association rules is an imortant task in data mining. It discovers the hidden,

More information

Support Vector Machines for Face Authentication

Support Vector Machines for Face Authentication Suort Vector Machines for Face Authentication K Jonsson 1 2, J Kittler 1,YPLi 1 and J Matas 1 2 1 CVSSP, University of Surrey Guildford, Surrey GU2 5XH, United Kingdom 2 CMP, Czech Technical University

More information

Vehicle Logo Recognition Using Modest AdaBoost and Radial Tchebichef Moments

Vehicle Logo Recognition Using Modest AdaBoost and Radial Tchebichef Moments Proceedings of 0 4th International Conference on Machine Learning and Comuting IPCSIT vol. 5 (0) (0) IACSIT Press, Singaore Vehicle Logo Recognition Using Modest AdaBoost and Radial Tchebichef Moments

More information

A Model-Adaptable MOSFET Parameter Extraction System

A Model-Adaptable MOSFET Parameter Extraction System A Model-Adatable MOSFET Parameter Extraction System Masaki Kondo Hidetoshi Onodera Keikichi Tamaru Deartment of Electronics Faculty of Engineering, Kyoto University Kyoto 66-1, JAPAN Tel: +81-7-73-313

More information

Visual Correspondence Using Energy Minimization and Mutual Information

Visual Correspondence Using Energy Minimization and Mutual Information Visual Corresondence Using Energy Minimization and Mutual Information Junhwan Kim Vladimir Kolmogorov Ramin Zabih Comuter Science Deartment Cornell University Ithaca, NY 14853 Abstract We address visual

More information

Signature File Hierarchies and Signature Graphs: a New Index Method for Object-Oriented Databases

Signature File Hierarchies and Signature Graphs: a New Index Method for Object-Oriented Databases Signature File Hierarchies and Signature Grahs: a New Index Method for Object-Oriented Databases Yangjun Chen* and Yibin Chen Det. of Business Comuting University of Winnieg, Manitoba, Canada R3B 2E9 ABSTRACT

More information

SKIN CANCER LESION CLASSIFICATION USING LBP BASED HYBRID CLASSIFIER

SKIN CANCER LESION CLASSIFICATION USING LBP BASED HYBRID CLASSIFIER DOI: htt://dx.doi.org/10.26483/ijarcs.v8i7.4469 Volume 8, No. 7, July August 2017 International Journal of Advanced Research in Comuter Science RESEARCH PAPER Available Online at www.ijarcs.info ISSN No.

More information

A New and Efficient Algorithm-Based Fault Tolerance Scheme for A Million Way Parallelism

A New and Efficient Algorithm-Based Fault Tolerance Scheme for A Million Way Parallelism A New and Efficient Algorithm-Based Fault Tolerance Scheme for A Million Way Parallelism Erlin Yao, Mingyu Chen, Rui Wang, Wenli Zhang, Guangming Tan Key Laboratory of Comuter System and Architecture Institute

More information

3D FACIAL GEOMETRIC FEATURES FOR CONSTRAINED LOCAL MODEL. Shiyang Cheng Stefanos Zafeiriou Akshay Asthana Maja Pantic

3D FACIAL GEOMETRIC FEATURES FOR CONSTRAINED LOCAL MODEL. Shiyang Cheng Stefanos Zafeiriou Akshay Asthana Maja Pantic 3D FACIAL GEOMETRIC FEATURES FOR CONSTRAINED LOCAL MODEL Shiyang Cheng Stefanos Zafeiriou Akshay Asthana Maja Pantic Imerial College London, Deartment of Comuting, United Kingdom University of Twente,

More information

Directed File Transfer Scheduling

Directed File Transfer Scheduling Directed File Transfer Scheduling Weizhen Mao Deartment of Comuter Science The College of William and Mary Williamsburg, Virginia 387-8795 wm@cs.wm.edu Abstract The file transfer scheduling roblem was

More information

Augmenting Shape with Appearance in Vehicle Category Recognition

Augmenting Shape with Appearance in Vehicle Category Recognition Augmenting Shae with Aearance in Vehicle ategory Recognition Ozge. Ozcanli* Amir Tamrakar Benjamin B. Kimia Joseh L. Mundy Division of Engineering, Brown University, Providence, RI 02912 Abstract Shae

More information

Theoretical Analysis of Graphcut Textures

Theoretical Analysis of Graphcut Textures Theoretical Analysis o Grahcut Textures Xuejie Qin Yee-Hong Yang {xu yang}@cs.ualberta.ca Deartment o omuting Science University o Alberta Abstract Since the aer was ublished in SIGGRAPH 2003 the grahcut

More information

Improve Precategorized Collection Retrieval by Using Supervised Term Weighting Schemes Λ

Improve Precategorized Collection Retrieval by Using Supervised Term Weighting Schemes Λ Imrove Precategorized Collection Retrieval by Using Suervised Term Weighting Schemes Λ Ying Zhao and George Karyis University of Minnesota, Deartment of Comuter Science Minneaolis, MN 55455 Abstract The

More information

High-speed Depth Stream Generation from a Hybrid Camera

High-speed Depth Stream Generation from a Hybrid Camera High-seed Deth Stream Generation from a Hybrid Camera Xinxin Zuo, Sen Wang, Jiangbin Zheng Ruigang Yang Northwestern Polytechnical University, Xi an, P.R.China University of Kentucky, Lexington, KY, USA

More information

Unifying Holistic and Parts-Based Deformable Model Fitting

Unifying Holistic and Parts-Based Deformable Model Fitting Unifying Holistic and Parts-Based Deformable Model Fitting Joan Alabort-i-Medina Stefanos Zafeiriou Deartment of Comuting, Imerial College London, United Kingdom {ja30, s.zafeiriou}@imerial.ac.uk Abstract

More information

Cross products. p 2 p. p p1 p2. p 1. Line segments The convex combination of two distinct points p1 ( x1, such that for some real number with 0 1,

Cross products. p 2 p. p p1 p2. p 1. Line segments The convex combination of two distinct points p1 ( x1, such that for some real number with 0 1, CHAPTER 33 Comutational Geometry Is the branch of comuter science that studies algorithms for solving geometric roblems. Has alications in many fields, including comuter grahics robotics, VLSI design comuter

More information

Experiments on Patent Retrieval at NTCIR-4 Workshop

Experiments on Patent Retrieval at NTCIR-4 Workshop Working Notes of NTCIR-4, Tokyo, 2-4 June 2004 Exeriments on Patent Retrieval at NTCIR-4 Worksho Hironori Takeuchi Λ Naohiko Uramoto Λy Koichi Takeda Λ Λ Tokyo Research Laboratory, IBM Research y National

More information