ROBUST ROAD DETECTION FROM A SINGLE IMAGE USING ROAD SHAPE PRIOR. Zhen He, Tao Wu, Zhipeng Xiao, Hangen He

Size: px
Start display at page:

Download "ROBUST ROAD DETECTION FROM A SINGLE IMAGE USING ROAD SHAPE PRIOR. Zhen He, Tao Wu, Zhipeng Xiao, Hangen He"

Transcription

1 ROBUST ROAD DETECTION FROM A SINGLE IMAGE USING ROAD SHAPE PRIOR Zhen He, Tao Wu, Zhipeng Xiao, Hangen He College of Mechatronics and Automation National University of Defense Technology Changsha, Hunan, P. R. China ABSTRACT Many road detection algorithms require pre-learned information, which may be unreliable as the road scene is usually unexpectable. Single image based (i.e., without any pre-learned information) road detection techniques can be adopted to overcome this problem, while their robustness needs improving. To achieve robust road detection from a single image, this paper proposes a general road shape prior to enforce the detected region to be road-shaped by encoding the prior into a graph-cut segmentation framework, where the training data is automatically generated from a predicted road region of the current image. By iteratively performing the graph-cut segmentation, an accurate road region will be obtained. Quantitative and qualitative experiments on the challenging SUN Database validate the robustness and efficiency of our method. We believe that the road shape prior can also be used to yield improvements for many other road detection algorithms. Index Terms Road detection, shape prior, graph cuts. INTRODUCTION Vision-based road detection is of high relevance for autonomous driving and pedestrian detection. Detecting the road from an onboard road image is very challenging due to the diversity of road shapes (e.g., straight, left/right curve, and winding), road materials (e.g., concrete, soil, and asphalt), and backgrounds (e.g., street, forest, and mountain), and the noise induced by varying illumination, different weather, object occlusion, etc., as shown in Fig.. With the information learned off-line from some specific road scenes, many proposed segmentation algorithms [, 2, 3, 4] perform well for the same road scenes but poorly for the others. However, the off-line learning needs to label training data manually which is time consuming and the road scene is usually unknown, especially on moving vehicles. Hence, road detection from a single image (i.e. without any pre-learned information) is needed to overcome these problems and thus will be more challenging. There has been some works on single image based road detection. In [5], Kong et al. use a vanishing-point-constrained edge detection technique to acquire straight road boundaries. This method may be inapplicable when there is no legible vanishing point (e.g., on the ascent), or the road boundaries are curved. To deal with complex road shapes, Alvarez et al. [6] use a likelihood-based region growing method to label pixels. It is factually a local method and not robust to noise, which may cause overgrowth or undergrowth. More recently, in [7], general information off-line learned from other road scenes is combined with the information on-line learned from current image to enhance the detection robustness. However, in its on-line learning procedure, the training data is taken from a fixed region (i.e. the central-bottom of the image), thus the road patterns may not be well-learned /3/$ IEEE 2757 (c) (d) (e) (f) Fig.. Examples of different roads with: Different shapes, Different materials, (c) Different backgrounds, (d) Different illumination, (e) Different weather, and (f) Object occlusion. Aiming at utilizing the prior knowledge to improve the robustness of single image based road detection, we propose a general road shape prior to enforce the detected road region to be road-shaped by encoding this prior into a graph-cut segmentation framework, where the labels are automatically generated from the current image to model the road. The final detected road region is acquired by iteratively performing the graph cuts, where the last detected road region is iterated to a current predicted one that is used for prior encoding and label generating and initialized as a semicircle at the central-bottom of the image. The road shape prior is based on the observation that in on-board road images, the varied road regions should have some shape constraints making them look like road, and thus the detected road regions should also satisfy these constraints. Unlike previous methods using limited templates that are hard to cover varied road shapes and need matching, our approach benefits from a general road shape prior that is more natural and can be encoded in a unified graph-cut framework; unlike previous online learning methods generating labels from the current image in one shot, it progressively generates a better batch of labels automatically by iterative graph cuts and thus builds a better model. An outline of the paper follows. In Sec. 2 we demonstrate how the basic graph-cut segmentation can be used for road detection, in Sec. 3 we explain how to incorporate the road shape prior in the graph-cut segmentation framework, and in Sec. 4 we detail the iterative graph cuts. Experimental results and discussion are presented in Sec. 5, while conclusions are given in Sec ROAD DETECTION USING BASIC GRAPH-CUT SEGMENTATION Road detection can be formulated as a binary labeling problem trying to assign a label (road or background) to each pixel. A popular way to such problems is to use the efficient graph-cut segmentation [8]. Let yi 0 (background), (road)} be the label of pixel i of the image, P be the set of all pixels, and N be the set of all neighboring pixel pairs. Then y = yi i P} is the collection of all ICIP 203

2 label assignments and defines a segmentation. The basic graph-cut optimizes y by minimizing the energy function: E(y) = D i(y i) + λ V i,j(y i, y j) () i P i,j N where D i is the data term penalizing the assignments that do not fit for pixel i, and V i,j is the smoothness term penalizing the different assignments for neighboring pixels i and j. Let I i be the feature vector of pixel i. Then D i is typically defined as the negative loglikelihood of a label y i being assigned to pixel i and takes the form: D i() = ln Pr(I i R) D i(0) = ln Pr(I i B) (2) where Pr(I i R) is the road model, and Pr(I i B) is the background model. The smoothness term V i,j follows the conventional form: V i,j(y i, y j) = [y i y j] exp( Ii Ij 2 2β ) dist(i, j) where [ ] is if y i y j and 0 otherwise, β denotes the mean squaredifference between feature vectors over all neighboring pixels, and dist(i, j) is the Euclidean distance between pixel i and j. Parameter λ > 0 in Eq. () weights the importance between D i and V i,j. Compared with other local methods, the basic graph-cut mainly benefits from V i,j that encodes the neighboring consistency, and its efficient global optimization. However, it is still a bottom-up approach and unable to capture the rich properties of the road region. Thus, we propose a general road shape prior and incorporate it with the basic graph-cut segmentation framework, as shown in Sec INCORPORATING THE ROAD SHAPE PRIOR As there are a wide variety of road shapes in the on-board road images (see Fig. ), it is hard to describe them with limited templates. However, as shown in Fig. 2, for a segmented image, we can easily tell whether the foreground (white region) looks like a road. Hence, there may be some shape constraints making a region like a road, which we call road shape prior. (c) (d) (e) (f) (g) (h) Fig. 2. Examples of some segmented images. Obviously, the white regions in, (d), (f), and (h) look more like roads than those in, (c), (e), and (g). Here we use the shrinking constraint and consistency constraint to describe such a prior. The former is based on the observation that along the road axis (i.e. the centerline of the road region), the road region is perspectively shrinking from the near to the distant, while the latter suggests that the region between both sides of the road should consistently belong to the road, and vice versa. Consider Fig. 3, let l be the centerline of the white region R, and pixel i be an arbitrary point. Horizontally move l to l, and draw a horizon line h, both of which pass through the center of i. p is an arbitrary pixel on l and below i, and q is an arbitrary pixel on h and between l and l (or on l). The shrinking constraint implies that i, i R p R, while the consistency constraint implies that i, (i R) (i / l) q R. Thus, in Fig. 3, the shrinking constraint is satisfied in and, the consistency constraint is satisfied in and (c), whereas (d) satisfies neither of the constraints. Our criterion is that a region is road-shaped if and only if it satisfies both of the constraints. This criterion is consistent with our (3) (c) (d) Fig. 3. Illustration of the shrinking constraint and consistency constraint. In and, i, i R p R, so the shrinking constraint is satisfied, while in and (c), i, (i R) (i / l) q R, therefore the consistency constraint is satisfied. In (d), i, (i R) (p / R), and i, (i R) (i / l) (q / R), so neither of the constraints is satisfied. Table. Comparison between our criterion and the intuition. The road-shaped regions in Fig. 2 are also road-like, and vice versa. Images in Fig. 2 (c) (d) (e) (f) (g) (h) Shrinking const. Consistency const. Road-shaped Road-like intuition (e.g., see Table ). To improve the detection robustness, we enforce the detected road region to be road-shaped by encoding both constraints into a graph-cut framework. Inspired by [9], we define a shrinking constraint term S i,p and a consistency constraint term C i,q as follows: S i,p(y i, y p) =, if y i = and y p = 0 0, otherwise C i,q(y i, y q) =, if y i = and y q = 0 0, otherwise Note that S i,p is defined on pixel pairs (i, p), while C i,q is defined on pixel pairs (i, q) (see Fig. 3). Both terms penalize the assignments that violate the corresponding constraint by taking infinite cost. In fact, it is enough to put S i,p and C i,q only between neighboring pixels. For S i,p, if every pixel pair (i, p) satisfies the shrinking constraint, then, of course, every neighboring (i, p) also satisfies it, and if there exists some pixel pair not satisfying the shrinking constraint, then there should be at least a neighboring (i, p) not satisfying it. It is similar for C i,q. We use a standard 8-neighborhood system (see Fig. 4) for generality. For every pixel i that not on l, it is easy to find a neighboring pixel q, the center of which is exactly passed through by h. While for every pixel i, l may not exactly passes through the center of any neighboring pixels of i. In this case, a neighboring pixel p will be chosen if it lies the closest to l (see Fig. 4). Fig. 4. Illustration choosing a pixel p neighbored to i from a standard 8-neighborhood system. A standard 8-neighborhood system, where 8 neighbors are connected to each pixel i. A neighboring pixel of i (i.e., p 4) is chosen as it lies the closest to l among all possible neighbors (i.e., p p 5, which are not above i). (4) 2758

3 With constraint terms Si,p and Ci,q, E(y) becomes: E(y) = Di (yi ) + λ Vi,j (yi, yj ) i P + i,j N Si,p (yi, yp ) + i,p N (5) Ci,q (yi, yq ) i,q N As the road region is less changeable than the background and the road feature is more self-similar, we only build the road model Pr(Ii R), and redefine the data term Di in Eq. (5) as: yi, if Pr(Ii R) γ Di (yi ) = (6) yi, otherwise where the threshold γ = γ0 max Pr(Ii R), and γ0 [0, ] is a fixed value. Di penalizes the assignments not fitting for the data. In Eq. (5), Vi,j (yi, yj ) takes the same form of Eq. (3), and we fix the weighting parameter λ to. According to [0], E is submodular since every term in Eq. (5) is submodular. As E is also a quadratic pseudo boolean function, it can be exactly minimized via graph cuts. Note that both the shrinking and consistency constraints depend on the road axis, which is determined by the road region, and the road labels should also be generated from the road region. However, the road region is unknown before detection. Thus, as detailed in Sec. 4, we first initialize a predicted road region, and then iteratively use the graph cuts to update the predicted region until it converges. Fig. 5. Illustration of our iterative graph cuts for road detection. 4. ROAD DETECTION BY ITERATIVE GRAPH CUTS Our iterative graph cuts algorithm is illustrated in Fig. 5. For an arbitrary test road image, we compute the illuminant invariant ranging from 0 to 255 as its feature,as described in [6]. As the real road region is unknown, we first initialize a road region Rp (see (d), a semicircle at the central-bottom of the image). To aquire a reliable labeling region L (see (c)), we leave it with a margin of 2 ( SRp SRp /2) (SR denotes the area of R) to the background region Bp. Then, a road model (f) is built with the training data uniformly labeled on L, and the likelihood image (see (e), which is normalized) of can be acquired using (f). Next, we obtain the data terms (see (i)) according to Eq. (6). The smaller the energy, the more likely the road class. While the smoothness terms (see (h)) can be directly computed from using Eq. (3). On the other hand, we get the road axis from (d) and extend it to the image border (see (g)). Then the road shape prior, i.e. the shrinking constraint (j) and consistency constraint (k) can be added. Finally, we minimize E(y) in Eq. (5) via graph cuts. If the segmented road region Rd (see (n)) converges, i.e., Rd satisfies ε(rd ) < ε0, where ε( ) is the convergence error and ε0 is a constant, a final detected result (m) will be obtained. Otherwise, Rd will be iterated to Rp of the next cycle. Here we set ε0 to 0 3, and define ε(rd ) = (SRd Rp SRd Rp )/(SRp Bp ). An example of the iterative graph cuts is shown in Fig. 6. By iteration, a better batch of road labels are progressively generated, thus a better road model can be built. Moreover, it favors our road shape prior, which benefits the robust road detection. 5. EXPERIMENTAL RESULTS AND DISCUSSION To validate our method, three different experiments are conducted on the SUN Database [], where 500 of the 868 still on-board road images are randomly selected as test set, and also manually Fig. 6. An example of the iterative graph cuts when γ0 (see Eq. (6)) is set to 0.. A test image with its illuminant invariant and smoothness energy, both of which are computed in one shot, and the ground truth. An illustration of the iteration. A better road model (column 2) is gradually built using the labeled data taken from L (column ). As the predicted region is approaching to the real road region, the road axis also becomes accurate (column 4), and a better road shape prior can be enforced on the likelihood image (column 3) to get a better result (column 5, 6). The detected road region Rd converges in the 4th iteration where ε(rd ) = < 0 3. labeled as ground truth (no training set is needed in our algorithm). All the images are resized to for testing. The parameter γ0 of Eq. (6) is fixed to 0. by default. We use gco-v3.0 software2 [2, 3, 0] to implement the graph-cut optimization. Evaluating the performance of iteration. First, we fix the number of iterations to 4 for efficiency, as 95.9% of the segmented results converge after 4 iterations (see Table 2). Then, the iteration performance is evaluated by 4 types of pixel-wise measures (ie., the recall (T P/(T P + F N )), precision (T P/(T P + F P )), F-measure (2/(/recall + /precision)), and quality (T P/(T P + F P + F N )), which are denoted by RC, P C, F, and Q, respectively) and the running time on a 2 GHz computer with 4.0 GB RAM. As shown in Table 3, each of the four measures reaches the highest after 4 iterations,

4 Table 2. The ratio of converged segmented results (η) with the increasing of iterations. η reaches 95.9% after the 4th iteration. Iter η(%) Table 3. Quantitative results of the iteration performance. Iter. RC (%) P C (%) F (%) Q (%) Time ms ms ms ms ms which take only 98 ms running time and thus can achieve real-time detection. Some qualitative results can be seen in Fig. 7. Comparing with the basic graph-cut segmentation. To show that our road shape prior can help improve the performance of basic graph-cut segmentation, we evaluate our method with BGCv (Basic graph cuts with only data terms) and BGCv2 (basic graph cuts with both data and smoothness terms). Quantitative results are provided using ROC curves that represent the trade-off between true positive rate (T P/(T P + F N )) and false positive rate (F P/(F P + T N )). The data terms for basic graph cuts are got by Eq. (6) using the training data taken from a fixed region (the same as our initialized labeling region). By changing γ0 from 0 to, we obtain the ROC curves and corresponding AU C (area under the curve) (see Fig. 8). Obviously, our method outperforms BGCv and BGCv2 with the highest AU C (95.4%). Qualitative results in Fig. 9 show that the detected road regions are enforced to be road-shaped with our method. Comparing with state-of-the-art algorithms. We compare our method with two state-of-the-art single image based road detection algorithms: the vanishing point (VP) based method [5] and the likelihood-based region growing (LRG) method [6]. Quantitative results (see Table 4) show that our method outperforms both VP and LRG with the measures of RC, P C, F, and Q. Qualitative results in Fig. 9 show that VP fails when there is no legible vanishing point (row 5, 6) or the road boundaries are curved (row 2), and the LRG fails due to overgrowth (row, 3, 5) or undergrowth (row 2, 4). Yet, our method performs the best on the first five instances, benefiting from the global optimization under a general road shape constraint. Discussion. While our method enforces such a shape prior on the road segmentation task, it can also result in a degradation in performance, e.g., a fork road may not satisfy our road shape prior (see, e.g., row 6 of Fig. 9). Besides, our method fails when the illuminantinvariant feature lacks discriminability in some cases (e.g., in the last instance of Fig. 9, the illuminant-invariant feature of the road and its surroundings is similar). Thus, engineering better features and exploring more useful road priors (e.g., a more general shape prior that can handle more complex road shapes such as forks or crossings) are our interesting directions for future work. Table 4. Quantitative results of our method compared with state-ofthe-art single image road detection algorithms VP [5] and LRG [6]. RC (%) P C (%) F (%) Q (%) VP 75.9 ± ± ± ± 22.9 LRG 89. ± ± ± ± 6.6 Ours 9.2 ± 8.6 ± 9.3 ± ± 6.9 Fig. 7. Qualitative results of the iteration performance. Fig. 8. ROC curves and the corresponding AU C of our method compared with basic graph-cut methods BGCv and BGCv2. 6. CONCLUSIONS In this paper, we proposed a general road shape prior to enforce the detected region to be road-shaped by encoding it into a graph-cut framework, where the training data is automatically generated from a predicted road region of the current image. Accurate road region is obtained by iteratively performing the graph cuts. Experimental results validate the robustness and efficiency of our method. Acknowledgment. The presented research work is supported by the National Natural Science Foundation of China (Grant No ) Fig. 9. Comparisons of our method over the basic graph-cut methods (BGCv, BGCv2) and two state-of-the-art methods (VP, LRG). Each row shows a different instance. White and black regions in column 2 7 denote roads and backgrounds, respectively. Column : Test images. Column 2 6: Detection results obtained by: VP [5], BGCv, BGCv2, LRG [6], and our method, respectively. Column 7: Manually labeled ground truth. Note that the same feature (i.e., the illuminant invariant) are used in the last four methods.

5 7. REFERENCES [] J.M. Alvarez, T. Gevers, and A.M. López, Vision-based road detection using road models, in Image Processing (ICIP), th IEEE International Conference on. IEEE, 2009, pp [2] Q. Huang, M. Han, B. Wu, and S. Ioffe, A hierarchical conditional random field model for labeling and segmenting images of street scenes, in Computer Vision and Pattern Recognition (CVPR), 20 IEEE Conference on. IEEE, 20, pp [3] P. Sturgess, L. Ladicky, N. Crook, and P.H.S. Torr, Scalable cascade inference for semantic image segmentation, in BMVC, 202. [4] G. Floros and B. Leibe, Joint 2d-3d temporally consistent semantic segmentation of street scenes, in Computer Vision and Pattern Recognition (CVPR), 202 IEEE Conference on. IEEE, 202, pp [5] H. Kong, J.Y. Audibert, and J. Ponce, Vanishing point detection for road detection, in Computer Vision and Pattern Recognition (CVPR), 2009 IEEE Conference on. IEEE, 2009, pp [6] J.M. Alvarez and A.M. Lopez, Road detection based on illuminant invariance, Intelligent Transportation Systems, IEEE Transactions on, vol. 2, no., pp , 20. [7] J.M. Alvarez, T. Gevers, Y. LeCun, and A.M. Lopez, Road scene segmentation from a single image, in Proceedings of the 2th European Conference on Computer Vision. Springer, 202. [8] Y. Boykov and M.P. Jolly, Interactive graph cuts for optimal boundary & region segmentation of objects in nd images, in Computer Vision, 200. ICCV 200. Proceedings. Eighth IEEE International Conference on. IEEE, 200, vol., pp [9] O. Veksler, Star shape prior for graph-cut image segmentation, in Proceedings of the 0th European Conference on Computer Vision: Part III. Springer, 2008, pp [0] V. Kolmogorov and R. Zabin, What energy functions can be minimized via graph cuts?, Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 26, no. 2, pp , [] J. Xiao, J. Hays, K.A. Ehinger, A. Oliva, and A. Torralba, Sun database: Large-scale scene recognition from abbey to zoo, in Computer vision and pattern recognition (CVPR), 200 IEEE conference on. IEEE, 200, pp [2] Y. Boykov, O. Veksler, and R. Zabih, Fast approximate energy minimization via graph cuts, Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 23, no., pp , 200. [3] Y. Boykov and V. Kolmogorov, An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision, Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 26, no. 9, pp ,

Supervised texture detection in images

Supervised texture detection in images Supervised texture detection in images Branislav Mičušík and Allan Hanbury Pattern Recognition and Image Processing Group, Institute of Computer Aided Automation, Vienna University of Technology Favoritenstraße

More information

Combining PGMs and Discriminative Models for Upper Body Pose Detection

Combining PGMs and Discriminative Models for Upper Body Pose Detection Combining PGMs and Discriminative Models for Upper Body Pose Detection Gedas Bertasius May 30, 2014 1 Introduction In this project, I utilized probabilistic graphical models together with discriminative

More information

Monocular Vision Based Autonomous Navigation for Arbitrarily Shaped Urban Roads

Monocular Vision Based Autonomous Navigation for Arbitrarily Shaped Urban Roads Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August 14-15, 2014 Paper No. 127 Monocular Vision Based Autonomous Navigation for Arbitrarily

More information

Data-driven Depth Inference from a Single Still Image

Data-driven Depth Inference from a Single Still Image Data-driven Depth Inference from a Single Still Image Kyunghee Kim Computer Science Department Stanford University kyunghee.kim@stanford.edu Abstract Given an indoor image, how to recover its depth information

More information

MULTI-REGION SEGMENTATION

MULTI-REGION SEGMENTATION MULTI-REGION SEGMENTATION USING GRAPH-CUTS Johannes Ulén Abstract This project deals with multi-region segmenation using graph-cuts and is mainly based on a paper by Delong and Boykov [1]. The difference

More information

Region-based Segmentation and Object Detection

Region-based Segmentation and Object Detection Region-based Segmentation and Object Detection Stephen Gould Tianshi Gao Daphne Koller Presented at NIPS 2009 Discussion and Slides by Eric Wang April 23, 2010 Outline Introduction Model Overview Model

More information

Object detection using non-redundant local Binary Patterns

Object detection using non-redundant local Binary Patterns University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Object detection using non-redundant local Binary Patterns Duc Thanh

More information

Segmentation. Bottom up Segmentation Semantic Segmentation

Segmentation. Bottom up Segmentation Semantic Segmentation Segmentation Bottom up Segmentation Semantic Segmentation Semantic Labeling of Street Scenes Ground Truth Labels 11 classes, almost all occur simultaneously, large changes in viewpoint, scale sky, road,

More information

Supplementary Material: Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies

Supplementary Material: Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies Supplementary Material: Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies Lena Gorelick Yuri Boykov Olga Veksler Computer Science Department University of Western Ontario Abstract Below

More information

Color Image Segmentation

Color Image Segmentation Color Image Segmentation Yining Deng, B. S. Manjunath and Hyundoo Shin* Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 93106-9560 *Samsung Electronics Inc.

More information

Automatic Shadow Removal by Illuminance in HSV Color Space

Automatic Shadow Removal by Illuminance in HSV Color Space Computer Science and Information Technology 3(3): 70-75, 2015 DOI: 10.13189/csit.2015.030303 http://www.hrpub.org Automatic Shadow Removal by Illuminance in HSV Color Space Wenbo Huang 1, KyoungYeon Kim

More information

Graph Based Image Segmentation

Graph Based Image Segmentation AUTOMATYKA 2011 Tom 15 Zeszyt 3 Anna Fabijañska* Graph Based Image Segmentation 1. Introduction Image segmentation is one of the fundamental problems in machine vision. In general it aims at extracting

More information

Detection of a Single Hand Shape in the Foreground of Still Images

Detection of a Single Hand Shape in the Foreground of Still Images CS229 Project Final Report Detection of a Single Hand Shape in the Foreground of Still Images Toan Tran (dtoan@stanford.edu) 1. Introduction This paper is about an image detection system that can detect

More information

Learning the Three Factors of a Non-overlapping Multi-camera Network Topology

Learning the Three Factors of a Non-overlapping Multi-camera Network Topology Learning the Three Factors of a Non-overlapping Multi-camera Network Topology Xiaotang Chen, Kaiqi Huang, and Tieniu Tan National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy

More information

Scene Segmentation in Adverse Vision Conditions

Scene Segmentation in Adverse Vision Conditions Scene Segmentation in Adverse Vision Conditions Evgeny Levinkov Max Planck Institute for Informatics, Saarbrücken, Germany levinkov@mpi-inf.mpg.de Abstract. Semantic road labeling is a key component of

More information

A New Feature Local Binary Patterns (FLBP) Method

A New Feature Local Binary Patterns (FLBP) Method A New Feature Local Binary Patterns (FLBP) Method Jiayu Gu and Chengjun Liu The Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA Abstract - This paper presents

More information

Self Lane Assignment Using Smart Mobile Camera For Intelligent GPS Navigation and Traffic Interpretation

Self Lane Assignment Using Smart Mobile Camera For Intelligent GPS Navigation and Traffic Interpretation For Intelligent GPS Navigation and Traffic Interpretation Tianshi Gao Stanford University tianshig@stanford.edu 1. Introduction Imagine that you are driving on the highway at 70 mph and trying to figure

More information

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION 6.1 INTRODUCTION Fuzzy logic based computational techniques are becoming increasingly important in the medical image analysis arena. The significant

More information

Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling

Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling [DOI: 10.2197/ipsjtcva.7.99] Express Paper Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling Takayoshi Yamashita 1,a) Takaya Nakamura 1 Hiroshi Fukui 1,b) Yuji

More information

Segmentation with non-linear constraints on appearance, complexity, and geometry

Segmentation with non-linear constraints on appearance, complexity, and geometry IPAM February 2013 Western Univesity Segmentation with non-linear constraints on appearance, complexity, and geometry Yuri Boykov Andrew Delong Lena Gorelick Hossam Isack Anton Osokin Frank Schmidt Olga

More information

Markov Random Fields and Gibbs Sampling for Image Denoising

Markov Random Fields and Gibbs Sampling for Image Denoising Markov Random Fields and Gibbs Sampling for Image Denoising Chang Yue Electrical Engineering Stanford University changyue@stanfoed.edu Abstract This project applies Gibbs Sampling based on different Markov

More information

Markov Networks in Computer Vision. Sargur Srihari

Markov Networks in Computer Vision. Sargur Srihari Markov Networks in Computer Vision Sargur srihari@cedar.buffalo.edu 1 Markov Networks for Computer Vision Important application area for MNs 1. Image segmentation 2. Removal of blur/noise 3. Stereo reconstruction

More information

ADAPTIVE GRAPH CUTS WITH TISSUE PRIORS FOR BRAIN MRI SEGMENTATION

ADAPTIVE GRAPH CUTS WITH TISSUE PRIORS FOR BRAIN MRI SEGMENTATION ADAPTIVE GRAPH CUTS WITH TISSUE PRIORS FOR BRAIN MRI SEGMENTATION Abstract: MIP Project Report Spring 2013 Gaurav Mittal 201232644 This is a detailed report about the course project, which was to implement

More information

A novel template matching method for human detection

A novel template matching method for human detection University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 A novel template matching method for human detection Duc Thanh Nguyen

More information

Accurate 3D Face and Body Modeling from a Single Fixed Kinect

Accurate 3D Face and Body Modeling from a Single Fixed Kinect Accurate 3D Face and Body Modeling from a Single Fixed Kinect Ruizhe Wang*, Matthias Hernandez*, Jongmoo Choi, Gérard Medioni Computer Vision Lab, IRIS University of Southern California Abstract In this

More information

Separating Objects and Clutter in Indoor Scenes

Separating Objects and Clutter in Indoor Scenes Separating Objects and Clutter in Indoor Scenes Salman H. Khan School of Computer Science & Software Engineering, The University of Western Australia Co-authors: Xuming He, Mohammed Bennamoun, Ferdous

More information

IMPROVED FINE STRUCTURE MODELING VIA GUIDED STOCHASTIC CLIQUE FORMATION IN FULLY CONNECTED CONDITIONAL RANDOM FIELDS

IMPROVED FINE STRUCTURE MODELING VIA GUIDED STOCHASTIC CLIQUE FORMATION IN FULLY CONNECTED CONDITIONAL RANDOM FIELDS IMPROVED FINE STRUCTURE MODELING VIA GUIDED STOCHASTIC CLIQUE FORMATION IN FULLY CONNECTED CONDITIONAL RANDOM FIELDS M. J. Shafiee, A. G. Chung, A. Wong, and P. Fieguth Vision & Image Processing Lab, Systems

More information

Pixel-Pair Features Selection for Vehicle Tracking

Pixel-Pair Features Selection for Vehicle Tracking 2013 Second IAPR Asian Conference on Pattern Recognition Pixel-Pair Features Selection for Vehicle Tracking Zhibin Zhang, Xuezhen Li, Takio Kurita Graduate School of Engineering Hiroshima University Higashihiroshima,

More information

A Feature Point Matching Based Approach for Video Objects Segmentation

A Feature Point Matching Based Approach for Video Objects Segmentation A Feature Point Matching Based Approach for Video Objects Segmentation Yan Zhang, Zhong Zhou, Wei Wu State Key Laboratory of Virtual Reality Technology and Systems, Beijing, P.R. China School of Computer

More information

Markov Networks in Computer Vision

Markov Networks in Computer Vision Markov Networks in Computer Vision Sargur Srihari srihari@cedar.buffalo.edu 1 Markov Networks for Computer Vision Some applications: 1. Image segmentation 2. Removal of blur/noise 3. Stereo reconstruction

More information

Detecting motion by means of 2D and 3D information

Detecting motion by means of 2D and 3D information Detecting motion by means of 2D and 3D information Federico Tombari Stefano Mattoccia Luigi Di Stefano Fabio Tonelli Department of Electronics Computer Science and Systems (DEIS) Viale Risorgimento 2,

More information

Image Resizing Based on Gradient Vector Flow Analysis

Image Resizing Based on Gradient Vector Flow Analysis Image Resizing Based on Gradient Vector Flow Analysis Sebastiano Battiato battiato@dmi.unict.it Giovanni Puglisi puglisi@dmi.unict.it Giovanni Maria Farinella gfarinellao@dmi.unict.it Daniele Ravì rav@dmi.unict.it

More information

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation IJCSNS International Journal of Computer Science and Network Security, VOL.13 No.11, November 2013 1 Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial

More information

A Robust Two Feature Points Based Depth Estimation Method 1)

A Robust Two Feature Points Based Depth Estimation Method 1) Vol.31, No.5 ACTA AUTOMATICA SINICA September, 2005 A Robust Two Feature Points Based Depth Estimation Method 1) ZHONG Zhi-Guang YI Jian-Qiang ZHAO Dong-Bin (Laboratory of Complex Systems and Intelligence

More information

People Tracking and Segmentation Using Efficient Shape Sequences Matching

People Tracking and Segmentation Using Efficient Shape Sequences Matching People Tracking and Segmentation Using Efficient Shape Sequences Matching Junqiu Wang, Yasushi Yagi, and Yasushi Makihara The Institute of Scientific and Industrial Research, Osaka University 8-1 Mihogaoka,

More information

Computer Vision : Exercise 4 Labelling Problems

Computer Vision : Exercise 4 Labelling Problems Computer Vision Exercise 4 Labelling Problems 13/01/2014 Computer Vision : Exercise 4 Labelling Problems Outline 1. Energy Minimization (example segmentation) 2. Iterated Conditional Modes 3. Dynamic Programming

More information

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing Tomoyuki Nagahashi 1, Hironobu Fujiyoshi 1, and Takeo Kanade 2 1 Dept. of Computer Science, Chubu University. Matsumoto 1200,

More information

Intensity-Depth Face Alignment Using Cascade Shape Regression

Intensity-Depth Face Alignment Using Cascade Shape Regression Intensity-Depth Face Alignment Using Cascade Shape Regression Yang Cao 1 and Bao-Liang Lu 1,2 1 Center for Brain-like Computing and Machine Intelligence Department of Computer Science and Engineering Shanghai

More information

CS 5540 Spring 2013 Assignment 3, v1.0 Due: Apr. 24th 11:59PM

CS 5540 Spring 2013 Assignment 3, v1.0 Due: Apr. 24th 11:59PM 1 Introduction In this programming project, we are going to do a simple image segmentation task. Given a grayscale image with a bright object against a dark background and we are going to do a binary decision

More information

Fitting: The Hough transform

Fitting: The Hough transform Fitting: The Hough transform Voting schemes Let each feature vote for all the models that are compatible with it Hopefully the noise features will not vote consistently for any single model Missing data

More information

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601 Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network Nathan Sun CIS601 Introduction Face ID is complicated by alterations to an individual s appearance Beard,

More information

Use of Shape Deformation to Seamlessly Stitch Historical Document Images

Use of Shape Deformation to Seamlessly Stitch Historical Document Images Use of Shape Deformation to Seamlessly Stitch Historical Document Images Wei Liu Wei Fan Li Chen Jun Sun Satoshi Naoi In China, efforts are being made to preserve historical documents in the form of digital

More information

Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction

Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction Marc Pollefeys Joined work with Nikolay Savinov, Christian Haene, Lubor Ladicky 2 Comparison to Volumetric Fusion Higher-order ray

More information

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009 Learning and Inferring Depth from Monocular Images Jiyan Pan April 1, 2009 Traditional ways of inferring depth Binocular disparity Structure from motion Defocus Given a single monocular image, how to infer

More information

A New Bayesian Framework for Object Recognition

A New Bayesian Framework for Object Recognition Proceedings of IEEE conference on Computer Vision and Pattern Recognition (CVPR), 1999 vol. II, p.517 A New Bayesian Framework for Object Recognition Yuri Boykov Daniel P. Huttenlocher Computer Science

More information

Segmentation and Tracking of Partial Planar Templates

Segmentation and Tracking of Partial Planar Templates Segmentation and Tracking of Partial Planar Templates Abdelsalam Masoud William Hoff Colorado School of Mines Colorado School of Mines Golden, CO 800 Golden, CO 800 amasoud@mines.edu whoff@mines.edu Abstract

More information

Iterated Graph Cuts for Image Segmentation

Iterated Graph Cuts for Image Segmentation Iterated Graph Cuts for Image Segmentation Bo Peng 1, Lei Zhang 1, and Jian Yang 2 1 Department of Computing, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China. 2 School of Computer Science

More information

Learning image representations equivariant to ego-motion (Supplementary material)

Learning image representations equivariant to ego-motion (Supplementary material) Learning image representations equivariant to ego-motion (Supplementary material) Dinesh Jayaraman UT Austin dineshj@cs.utexas.edu Kristen Grauman UT Austin grauman@cs.utexas.edu max-pool (3x3, stride2)

More information

Correcting User Guided Image Segmentation

Correcting User Guided Image Segmentation Correcting User Guided Image Segmentation Garrett Bernstein (gsb29) Karen Ho (ksh33) Advanced Machine Learning: CS 6780 Abstract We tackle the problem of segmenting an image into planes given user input.

More information

Object Recognition. Lecture 11, April 21 st, Lexing Xie. EE4830 Digital Image Processing

Object Recognition. Lecture 11, April 21 st, Lexing Xie. EE4830 Digital Image Processing Object Recognition Lecture 11, April 21 st, 2008 Lexing Xie EE4830 Digital Image Processing http://www.ee.columbia.edu/~xlx/ee4830/ 1 Announcements 2 HW#5 due today HW#6 last HW of the semester Due May

More information

Face Alignment Under Various Poses and Expressions

Face Alignment Under Various Poses and Expressions Face Alignment Under Various Poses and Expressions Shengjun Xin and Haizhou Ai Computer Science and Technology Department, Tsinghua University, Beijing 100084, China ahz@mail.tsinghua.edu.cn Abstract.

More information

Beyond Bags of Features

Beyond Bags of Features : for Recognizing Natural Scene Categories Matching and Modeling Seminar Instructed by Prof. Haim J. Wolfson School of Computer Science Tel Aviv University December 9 th, 2015

More information

Dynamic Shape Tracking via Region Matching

Dynamic Shape Tracking via Region Matching Dynamic Shape Tracking via Region Matching Ganesh Sundaramoorthi Asst. Professor of EE and AMCS KAUST (Joint work with Yanchao Yang) The Problem: Shape Tracking Given: exact object segmentation in frame1

More information

Introduction à la vision artificielle X

Introduction à la vision artificielle X Introduction à la vision artificielle X Jean Ponce Email: ponce@di.ens.fr Web: http://www.di.ens.fr/~ponce Planches après les cours sur : http://www.di.ens.fr/~ponce/introvis/lect10.pptx http://www.di.ens.fr/~ponce/introvis/lect10.pdf

More information

A Partial Curve Matching Method for Automatic Reassembly of 2D Fragments

A Partial Curve Matching Method for Automatic Reassembly of 2D Fragments A Partial Curve Matching Method for Automatic Reassembly of 2D Fragments Liangjia Zhu 1, Zongtan Zhou 1, Jingwei Zhang 2,andDewenHu 1 1 Department of Automatic Control, College of Mechatronics and Automation,

More information

Translation Symmetry Detection: A Repetitive Pattern Analysis Approach

Translation Symmetry Detection: A Repetitive Pattern Analysis Approach 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops Translation Symmetry Detection: A Repetitive Pattern Analysis Approach Yunliang Cai and George Baciu GAMA Lab, Department of Computing

More information

Markov Random Fields and Segmentation with Graph Cuts

Markov Random Fields and Segmentation with Graph Cuts Markov Random Fields and Segmentation with Graph Cuts Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem Administrative stuffs Final project Proposal due Oct 27 (Thursday) HW 4 is out

More information

A Benchmark for Interactive Image Segmentation Algorithms

A Benchmark for Interactive Image Segmentation Algorithms A Benchmark for Interactive Image Segmentation Algorithms Yibiao Zhao 1,3, Xiaohan Nie 2,3, Yanbiao Duan 2,3, Yaping Huang 1, Siwei Luo 1 1 Beijing Jiaotong University, 2 Beijing Institute of Technology,

More information

BIL Computer Vision Apr 16, 2014

BIL Computer Vision Apr 16, 2014 BIL 719 - Computer Vision Apr 16, 2014 Binocular Stereo (cont d.), Structure from Motion Aykut Erdem Dept. of Computer Engineering Hacettepe University Slide credit: S. Lazebnik Basic stereo matching algorithm

More information

A Background Modeling Approach Based on Visual Background Extractor Taotao Liu1, a, Lin Qi2, b and Guichi Liu2, c

A Background Modeling Approach Based on Visual Background Extractor Taotao Liu1, a, Lin Qi2, b and Guichi Liu2, c 4th International Conference on Mechatronics, Materials, Chemistry and Computer Engineering (ICMMCCE 2015) A Background Modeling Approach Based on Visual Background Extractor Taotao Liu1, a, Lin Qi2, b

More information

Tri-modal Human Body Segmentation

Tri-modal Human Body Segmentation Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4

More information

Image Classification Using Wavelet Coefficients in Low-pass Bands

Image Classification Using Wavelet Coefficients in Low-pass Bands Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August -7, 007 Image Classification Using Wavelet Coefficients in Low-pass Bands Weibao Zou, Member, IEEE, and Yan

More information

Transactions on Information and Communications Technologies vol 16, 1996 WIT Press, ISSN

Transactions on Information and Communications Technologies vol 16, 1996 WIT Press,   ISSN ransactions on Information and Communications echnologies vol 6, 996 WI Press, www.witpress.com, ISSN 743-357 Obstacle detection using stereo without correspondence L. X. Zhou & W. K. Gu Institute of Information

More information

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009 Analysis: TextonBoost and Semantic Texton Forests Daniel Munoz 16-721 Februrary 9, 2009 Papers [shotton-eccv-06] J. Shotton, J. Winn, C. Rother, A. Criminisi, TextonBoost: Joint Appearance, Shape and Context

More information

A Road Marking Extraction Method Using GPGPU

A Road Marking Extraction Method Using GPGPU , pp.46-54 http://dx.doi.org/10.14257/astl.2014.50.08 A Road Marking Extraction Method Using GPGPU Dajun Ding 1, Jongsu Yoo 1, Jekyo Jung 1, Kwon Soon 1 1 Daegu Gyeongbuk Institute of Science and Technology,

More information

Generic Face Alignment Using an Improved Active Shape Model

Generic Face Alignment Using an Improved Active Shape Model Generic Face Alignment Using an Improved Active Shape Model Liting Wang, Xiaoqing Ding, Chi Fang Electronic Engineering Department, Tsinghua University, Beijing, China {wanglt, dxq, fangchi} @ocrserv.ee.tsinghua.edu.cn

More information

Texture Sensitive Image Inpainting after Object Morphing

Texture Sensitive Image Inpainting after Object Morphing Texture Sensitive Image Inpainting after Object Morphing Yin Chieh Liu and Yi-Leh Wu Department of Computer Science and Information Engineering National Taiwan University of Science and Technology, Taiwan

More information

Markov/Conditional Random Fields, Graph Cut, and applications in Computer Vision

Markov/Conditional Random Fields, Graph Cut, and applications in Computer Vision Markov/Conditional Random Fields, Graph Cut, and applications in Computer Vision Fuxin Li Slides and materials from Le Song, Tucker Hermans, Pawan Kumar, Carsten Rother, Peter Orchard, and others Recap:

More information

A robust stereo prior for human segmentation

A robust stereo prior for human segmentation A robust stereo prior for human segmentation Glenn Sheasby, Julien Valentin, Nigel Crook, Philip Torr Oxford Brookes University Abstract. The emergence of affordable depth cameras has enabled significant

More information

IRIS SEGMENTATION OF NON-IDEAL IMAGES

IRIS SEGMENTATION OF NON-IDEAL IMAGES IRIS SEGMENTATION OF NON-IDEAL IMAGES William S. Weld St. Lawrence University Computer Science Department Canton, NY 13617 Xiaojun Qi, Ph.D Utah State University Computer Science Department Logan, UT 84322

More information

Learning Two-View Stereo Matching

Learning Two-View Stereo Matching Learning Two-View Stereo Matching Jianxiong Xiao Jingni Chen Dit-Yan Yeung Long Quan Department of Computer Science and Engineering The Hong Kong University of Science and Technology The 10th European

More information

arxiv: v1 [cs.cv] 20 Dec 2016

arxiv: v1 [cs.cv] 20 Dec 2016 End-to-End Pedestrian Collision Warning System based on a Convolutional Neural Network with Semantic Segmentation arxiv:1612.06558v1 [cs.cv] 20 Dec 2016 Heechul Jung heechul@dgist.ac.kr Min-Kook Choi mkchoi@dgist.ac.kr

More information

Accelerating Pattern Matching or HowMuchCanYouSlide?

Accelerating Pattern Matching or HowMuchCanYouSlide? Accelerating Pattern Matching or HowMuchCanYouSlide? Ofir Pele and Michael Werman School of Computer Science and Engineering The Hebrew University of Jerusalem {ofirpele,werman}@cs.huji.ac.il Abstract.

More information

Bus Detection and recognition for visually impaired people

Bus Detection and recognition for visually impaired people Bus Detection and recognition for visually impaired people Hangrong Pan, Chucai Yi, and Yingli Tian The City College of New York The Graduate Center The City University of New York MAP4VIP Outline Motivation

More information

Spatial Latent Dirichlet Allocation

Spatial Latent Dirichlet Allocation Spatial Latent Dirichlet Allocation Xiaogang Wang and Eric Grimson Computer Science and Computer Science and Artificial Intelligence Lab Massachusetts Tnstitute of Technology, Cambridge, MA, 02139, USA

More information

UNSUPERVISED CO-SEGMENTATION BASED ON A NEW GLOBAL GMM CONSTRAINT IN MRF. Hongkai Yu, Min Xian, and Xiaojun Qi

UNSUPERVISED CO-SEGMENTATION BASED ON A NEW GLOBAL GMM CONSTRAINT IN MRF. Hongkai Yu, Min Xian, and Xiaojun Qi UNSUPERVISED CO-SEGMENTATION BASED ON A NEW GLOBAL GMM CONSTRAINT IN MRF Hongkai Yu, Min Xian, and Xiaojun Qi Department of Computer Science, Utah State University, Logan, UT 84322-4205 hongkai.yu@aggiemail.usu.edu,

More information

segments. The geometrical relationship of adjacent planes such as parallelism and intersection is employed for determination of whether two planes sha

segments. The geometrical relationship of adjacent planes such as parallelism and intersection is employed for determination of whether two planes sha A New Segment-based Stereo Matching using Graph Cuts Daolei Wang National University of Singapore EA #04-06, Department of Mechanical Engineering Control and Mechatronics Laboratory, 10 Kent Ridge Crescent

More information

Motion Estimation. There are three main types (or applications) of motion estimation:

Motion Estimation. There are three main types (or applications) of motion estimation: Members: D91922016 朱威達 R93922010 林聖凱 R93922044 謝俊瑋 Motion Estimation There are three main types (or applications) of motion estimation: Parametric motion (image alignment) The main idea of parametric motion

More information

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Si Chen The George Washington University sichen@gwmail.gwu.edu Meera Hahn Emory University mhahn7@emory.edu Mentor: Afshin

More information

Skin and Face Detection

Skin and Face Detection Skin and Face Detection Linda Shapiro EE/CSE 576 1 What s Coming 1. Review of Bakic flesh detector 2. Fleck and Forsyth flesh detector 3. Details of Rowley face detector 4. Review of the basic AdaBoost

More information

Selection of Scale-Invariant Parts for Object Class Recognition

Selection of Scale-Invariant Parts for Object Class Recognition Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract

More information

IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES

IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES Pin-Syuan Huang, Jing-Yi Tsai, Yu-Fang Wang, and Chun-Yi Tsai Department of Computer Science and Information Engineering, National Taitung University,

More information

COMP 558 lecture 22 Dec. 1, 2010

COMP 558 lecture 22 Dec. 1, 2010 Binocular correspondence problem Last class we discussed how to remap the pixels of two images so that corresponding points are in the same row. This is done by computing the fundamental matrix, defining

More information

Superpixels via Pseudo-Boolean Optimization

Superpixels via Pseudo-Boolean Optimization Superpixels via Pseudo-Boolean Optimization Yuhang Zhang, Richard Hartley The Australian National University {yuhang.zhang, richard.hartley}@anu.edu.au John Mashford, Stewart Burn CSIRO {john.mashford,

More information

Improving Image Segmentation Quality Via Graph Theory

Improving Image Segmentation Quality Via Graph Theory International Symposium on Computers & Informatics (ISCI 05) Improving Image Segmentation Quality Via Graph Theory Xiangxiang Li, Songhao Zhu School of Automatic, Nanjing University of Post and Telecommunications,

More information

Multi-Label Moves for Multi-Label Energies

Multi-Label Moves for Multi-Label Energies Multi-Label Moves for Multi-Label Energies Olga Veksler University of Western Ontario some work is joint with Olivier Juan, Xiaoqing Liu, Yu Liu Outline Review multi-label optimization with graph cuts

More information

Robot localization method based on visual features and their geometric relationship

Robot localization method based on visual features and their geometric relationship , pp.46-50 http://dx.doi.org/10.14257/astl.2015.85.11 Robot localization method based on visual features and their geometric relationship Sangyun Lee 1, Changkyung Eem 2, and Hyunki Hong 3 1 Department

More information

Subject-Oriented Image Classification based on Face Detection and Recognition

Subject-Oriented Image Classification based on Face Detection and Recognition 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Human Head-Shoulder Segmentation

Human Head-Shoulder Segmentation Human Head-Shoulder Segmentation Hai Xin, Haizhou Ai Computer Science and Technology Tsinghua University Beijing, China ahz@mail.tsinghua.edu.cn Hui Chao, Daniel Tretter Hewlett-Packard Labs 1501 Page

More information

Multi-sensor Fusion Using Dempster s Theory of Evidence for Video Segmentation

Multi-sensor Fusion Using Dempster s Theory of Evidence for Video Segmentation Multi-sensor Fusion Using Dempster s Theory of Evidence for Video Segmentation Björn Scheuermann, Sotirios Gkoutelitsas, and Bodo Rosenhahn Institut für Informationsverarbeitung (TNT) Leibniz Universität

More information

Neighbourhood-consensus message passing and its potentials in image processing applications

Neighbourhood-consensus message passing and its potentials in image processing applications Neighbourhood-consensus message passing and its potentials in image processing applications Tijana Ružić a, Aleksandra Pižurica a and Wilfried Philips a a Ghent University, TELIN-IPI-IBBT, Sint-Pietersnieuwstraat

More information

ELL 788 Computational Perception & Cognition July November 2015

ELL 788 Computational Perception & Cognition July November 2015 ELL 788 Computational Perception & Cognition July November 2015 Module 6 Role of context in object detection Objects and cognition Ambiguous objects Unfavorable viewing condition Context helps in object

More information

Pairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement

Pairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement Pairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement Daegeon Kim Sung Chun Lee Institute for Robotics and Intelligent Systems University of Southern

More information

Robust and accurate change detection under sudden illumination variations

Robust and accurate change detection under sudden illumination variations Robust and accurate change detection under sudden illumination variations Luigi Di Stefano Federico Tombari Stefano Mattoccia Errico De Lisi Department of Electronics Computer Science and Systems (DEIS)

More information

Supplementary Material for Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains

Supplementary Material for Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains Supplementary Material for Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains Jiahao Pang 1 Wenxiu Sun 1 Chengxi Yang 1 Jimmy Ren 1 Ruichao Xiao 1 Jin Zeng 1 Liang Lin 1,2 1 SenseTime Research

More information

intro, applications MRF, labeling... how it can be computed at all? Applications in segmentation: GraphCut, GrabCut, demos

intro, applications MRF, labeling... how it can be computed at all? Applications in segmentation: GraphCut, GrabCut, demos Image as Markov Random Field and Applications 1 Tomáš Svoboda, svoboda@cmp.felk.cvut.cz Czech Technical University in Prague, Center for Machine Perception http://cmp.felk.cvut.cz Talk Outline Last update:

More information

Understanding Tracking and StroMotion of Soccer Ball

Understanding Tracking and StroMotion of Soccer Ball Understanding Tracking and StroMotion of Soccer Ball Nhat H. Nguyen Master Student 205 Witherspoon Hall Charlotte, NC 28223 704 656 2021 rich.uncc@gmail.com ABSTRACT Soccer requires rapid ball movements.

More information

Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters

Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters Marshall F. Tappen William T. Freeman Computer Science and Artificial Intelligence Laboratory Massachusetts Institute

More information

Interactive Image Segmentation with GrabCut

Interactive Image Segmentation with GrabCut Interactive Image Segmentation with GrabCut Bryan Anenberg Stanford University anenberg@stanford.edu Michela Meister Stanford University mmeister@stanford.edu Abstract We implement GrabCut and experiment

More information

Supplementary material: Strengthening the Effectiveness of Pedestrian Detection with Spatially Pooled Features

Supplementary material: Strengthening the Effectiveness of Pedestrian Detection with Spatially Pooled Features Supplementary material: Strengthening the Effectiveness of Pedestrian Detection with Spatially Pooled Features Sakrapee Paisitkriangkrai, Chunhua Shen, Anton van den Hengel The University of Adelaide,

More information