ROBUST ROAD DETECTION FROM A SINGLE IMAGE USING ROAD SHAPE PRIOR. Zhen He, Tao Wu, Zhipeng Xiao, Hangen He
|
|
- Dylan Beverly Greene
- 6 years ago
- Views:
Transcription
1 ROBUST ROAD DETECTION FROM A SINGLE IMAGE USING ROAD SHAPE PRIOR Zhen He, Tao Wu, Zhipeng Xiao, Hangen He College of Mechatronics and Automation National University of Defense Technology Changsha, Hunan, P. R. China ABSTRACT Many road detection algorithms require pre-learned information, which may be unreliable as the road scene is usually unexpectable. Single image based (i.e., without any pre-learned information) road detection techniques can be adopted to overcome this problem, while their robustness needs improving. To achieve robust road detection from a single image, this paper proposes a general road shape prior to enforce the detected region to be road-shaped by encoding the prior into a graph-cut segmentation framework, where the training data is automatically generated from a predicted road region of the current image. By iteratively performing the graph-cut segmentation, an accurate road region will be obtained. Quantitative and qualitative experiments on the challenging SUN Database validate the robustness and efficiency of our method. We believe that the road shape prior can also be used to yield improvements for many other road detection algorithms. Index Terms Road detection, shape prior, graph cuts. INTRODUCTION Vision-based road detection is of high relevance for autonomous driving and pedestrian detection. Detecting the road from an onboard road image is very challenging due to the diversity of road shapes (e.g., straight, left/right curve, and winding), road materials (e.g., concrete, soil, and asphalt), and backgrounds (e.g., street, forest, and mountain), and the noise induced by varying illumination, different weather, object occlusion, etc., as shown in Fig.. With the information learned off-line from some specific road scenes, many proposed segmentation algorithms [, 2, 3, 4] perform well for the same road scenes but poorly for the others. However, the off-line learning needs to label training data manually which is time consuming and the road scene is usually unknown, especially on moving vehicles. Hence, road detection from a single image (i.e. without any pre-learned information) is needed to overcome these problems and thus will be more challenging. There has been some works on single image based road detection. In [5], Kong et al. use a vanishing-point-constrained edge detection technique to acquire straight road boundaries. This method may be inapplicable when there is no legible vanishing point (e.g., on the ascent), or the road boundaries are curved. To deal with complex road shapes, Alvarez et al. [6] use a likelihood-based region growing method to label pixels. It is factually a local method and not robust to noise, which may cause overgrowth or undergrowth. More recently, in [7], general information off-line learned from other road scenes is combined with the information on-line learned from current image to enhance the detection robustness. However, in its on-line learning procedure, the training data is taken from a fixed region (i.e. the central-bottom of the image), thus the road patterns may not be well-learned /3/$ IEEE 2757 (c) (d) (e) (f) Fig.. Examples of different roads with: Different shapes, Different materials, (c) Different backgrounds, (d) Different illumination, (e) Different weather, and (f) Object occlusion. Aiming at utilizing the prior knowledge to improve the robustness of single image based road detection, we propose a general road shape prior to enforce the detected road region to be road-shaped by encoding this prior into a graph-cut segmentation framework, where the labels are automatically generated from the current image to model the road. The final detected road region is acquired by iteratively performing the graph cuts, where the last detected road region is iterated to a current predicted one that is used for prior encoding and label generating and initialized as a semicircle at the central-bottom of the image. The road shape prior is based on the observation that in on-board road images, the varied road regions should have some shape constraints making them look like road, and thus the detected road regions should also satisfy these constraints. Unlike previous methods using limited templates that are hard to cover varied road shapes and need matching, our approach benefits from a general road shape prior that is more natural and can be encoded in a unified graph-cut framework; unlike previous online learning methods generating labels from the current image in one shot, it progressively generates a better batch of labels automatically by iterative graph cuts and thus builds a better model. An outline of the paper follows. In Sec. 2 we demonstrate how the basic graph-cut segmentation can be used for road detection, in Sec. 3 we explain how to incorporate the road shape prior in the graph-cut segmentation framework, and in Sec. 4 we detail the iterative graph cuts. Experimental results and discussion are presented in Sec. 5, while conclusions are given in Sec ROAD DETECTION USING BASIC GRAPH-CUT SEGMENTATION Road detection can be formulated as a binary labeling problem trying to assign a label (road or background) to each pixel. A popular way to such problems is to use the efficient graph-cut segmentation [8]. Let yi 0 (background), (road)} be the label of pixel i of the image, P be the set of all pixels, and N be the set of all neighboring pixel pairs. Then y = yi i P} is the collection of all ICIP 203
2 label assignments and defines a segmentation. The basic graph-cut optimizes y by minimizing the energy function: E(y) = D i(y i) + λ V i,j(y i, y j) () i P i,j N where D i is the data term penalizing the assignments that do not fit for pixel i, and V i,j is the smoothness term penalizing the different assignments for neighboring pixels i and j. Let I i be the feature vector of pixel i. Then D i is typically defined as the negative loglikelihood of a label y i being assigned to pixel i and takes the form: D i() = ln Pr(I i R) D i(0) = ln Pr(I i B) (2) where Pr(I i R) is the road model, and Pr(I i B) is the background model. The smoothness term V i,j follows the conventional form: V i,j(y i, y j) = [y i y j] exp( Ii Ij 2 2β ) dist(i, j) where [ ] is if y i y j and 0 otherwise, β denotes the mean squaredifference between feature vectors over all neighboring pixels, and dist(i, j) is the Euclidean distance between pixel i and j. Parameter λ > 0 in Eq. () weights the importance between D i and V i,j. Compared with other local methods, the basic graph-cut mainly benefits from V i,j that encodes the neighboring consistency, and its efficient global optimization. However, it is still a bottom-up approach and unable to capture the rich properties of the road region. Thus, we propose a general road shape prior and incorporate it with the basic graph-cut segmentation framework, as shown in Sec INCORPORATING THE ROAD SHAPE PRIOR As there are a wide variety of road shapes in the on-board road images (see Fig. ), it is hard to describe them with limited templates. However, as shown in Fig. 2, for a segmented image, we can easily tell whether the foreground (white region) looks like a road. Hence, there may be some shape constraints making a region like a road, which we call road shape prior. (c) (d) (e) (f) (g) (h) Fig. 2. Examples of some segmented images. Obviously, the white regions in, (d), (f), and (h) look more like roads than those in, (c), (e), and (g). Here we use the shrinking constraint and consistency constraint to describe such a prior. The former is based on the observation that along the road axis (i.e. the centerline of the road region), the road region is perspectively shrinking from the near to the distant, while the latter suggests that the region between both sides of the road should consistently belong to the road, and vice versa. Consider Fig. 3, let l be the centerline of the white region R, and pixel i be an arbitrary point. Horizontally move l to l, and draw a horizon line h, both of which pass through the center of i. p is an arbitrary pixel on l and below i, and q is an arbitrary pixel on h and between l and l (or on l). The shrinking constraint implies that i, i R p R, while the consistency constraint implies that i, (i R) (i / l) q R. Thus, in Fig. 3, the shrinking constraint is satisfied in and, the consistency constraint is satisfied in and (c), whereas (d) satisfies neither of the constraints. Our criterion is that a region is road-shaped if and only if it satisfies both of the constraints. This criterion is consistent with our (3) (c) (d) Fig. 3. Illustration of the shrinking constraint and consistency constraint. In and, i, i R p R, so the shrinking constraint is satisfied, while in and (c), i, (i R) (i / l) q R, therefore the consistency constraint is satisfied. In (d), i, (i R) (p / R), and i, (i R) (i / l) (q / R), so neither of the constraints is satisfied. Table. Comparison between our criterion and the intuition. The road-shaped regions in Fig. 2 are also road-like, and vice versa. Images in Fig. 2 (c) (d) (e) (f) (g) (h) Shrinking const. Consistency const. Road-shaped Road-like intuition (e.g., see Table ). To improve the detection robustness, we enforce the detected road region to be road-shaped by encoding both constraints into a graph-cut framework. Inspired by [9], we define a shrinking constraint term S i,p and a consistency constraint term C i,q as follows: S i,p(y i, y p) =, if y i = and y p = 0 0, otherwise C i,q(y i, y q) =, if y i = and y q = 0 0, otherwise Note that S i,p is defined on pixel pairs (i, p), while C i,q is defined on pixel pairs (i, q) (see Fig. 3). Both terms penalize the assignments that violate the corresponding constraint by taking infinite cost. In fact, it is enough to put S i,p and C i,q only between neighboring pixels. For S i,p, if every pixel pair (i, p) satisfies the shrinking constraint, then, of course, every neighboring (i, p) also satisfies it, and if there exists some pixel pair not satisfying the shrinking constraint, then there should be at least a neighboring (i, p) not satisfying it. It is similar for C i,q. We use a standard 8-neighborhood system (see Fig. 4) for generality. For every pixel i that not on l, it is easy to find a neighboring pixel q, the center of which is exactly passed through by h. While for every pixel i, l may not exactly passes through the center of any neighboring pixels of i. In this case, a neighboring pixel p will be chosen if it lies the closest to l (see Fig. 4). Fig. 4. Illustration choosing a pixel p neighbored to i from a standard 8-neighborhood system. A standard 8-neighborhood system, where 8 neighbors are connected to each pixel i. A neighboring pixel of i (i.e., p 4) is chosen as it lies the closest to l among all possible neighbors (i.e., p p 5, which are not above i). (4) 2758
3 With constraint terms Si,p and Ci,q, E(y) becomes: E(y) = Di (yi ) + λ Vi,j (yi, yj ) i P + i,j N Si,p (yi, yp ) + i,p N (5) Ci,q (yi, yq ) i,q N As the road region is less changeable than the background and the road feature is more self-similar, we only build the road model Pr(Ii R), and redefine the data term Di in Eq. (5) as: yi, if Pr(Ii R) γ Di (yi ) = (6) yi, otherwise where the threshold γ = γ0 max Pr(Ii R), and γ0 [0, ] is a fixed value. Di penalizes the assignments not fitting for the data. In Eq. (5), Vi,j (yi, yj ) takes the same form of Eq. (3), and we fix the weighting parameter λ to. According to [0], E is submodular since every term in Eq. (5) is submodular. As E is also a quadratic pseudo boolean function, it can be exactly minimized via graph cuts. Note that both the shrinking and consistency constraints depend on the road axis, which is determined by the road region, and the road labels should also be generated from the road region. However, the road region is unknown before detection. Thus, as detailed in Sec. 4, we first initialize a predicted road region, and then iteratively use the graph cuts to update the predicted region until it converges. Fig. 5. Illustration of our iterative graph cuts for road detection. 4. ROAD DETECTION BY ITERATIVE GRAPH CUTS Our iterative graph cuts algorithm is illustrated in Fig. 5. For an arbitrary test road image, we compute the illuminant invariant ranging from 0 to 255 as its feature,as described in [6]. As the real road region is unknown, we first initialize a road region Rp (see (d), a semicircle at the central-bottom of the image). To aquire a reliable labeling region L (see (c)), we leave it with a margin of 2 ( SRp SRp /2) (SR denotes the area of R) to the background region Bp. Then, a road model (f) is built with the training data uniformly labeled on L, and the likelihood image (see (e), which is normalized) of can be acquired using (f). Next, we obtain the data terms (see (i)) according to Eq. (6). The smaller the energy, the more likely the road class. While the smoothness terms (see (h)) can be directly computed from using Eq. (3). On the other hand, we get the road axis from (d) and extend it to the image border (see (g)). Then the road shape prior, i.e. the shrinking constraint (j) and consistency constraint (k) can be added. Finally, we minimize E(y) in Eq. (5) via graph cuts. If the segmented road region Rd (see (n)) converges, i.e., Rd satisfies ε(rd ) < ε0, where ε( ) is the convergence error and ε0 is a constant, a final detected result (m) will be obtained. Otherwise, Rd will be iterated to Rp of the next cycle. Here we set ε0 to 0 3, and define ε(rd ) = (SRd Rp SRd Rp )/(SRp Bp ). An example of the iterative graph cuts is shown in Fig. 6. By iteration, a better batch of road labels are progressively generated, thus a better road model can be built. Moreover, it favors our road shape prior, which benefits the robust road detection. 5. EXPERIMENTAL RESULTS AND DISCUSSION To validate our method, three different experiments are conducted on the SUN Database [], where 500 of the 868 still on-board road images are randomly selected as test set, and also manually Fig. 6. An example of the iterative graph cuts when γ0 (see Eq. (6)) is set to 0.. A test image with its illuminant invariant and smoothness energy, both of which are computed in one shot, and the ground truth. An illustration of the iteration. A better road model (column 2) is gradually built using the labeled data taken from L (column ). As the predicted region is approaching to the real road region, the road axis also becomes accurate (column 4), and a better road shape prior can be enforced on the likelihood image (column 3) to get a better result (column 5, 6). The detected road region Rd converges in the 4th iteration where ε(rd ) = < 0 3. labeled as ground truth (no training set is needed in our algorithm). All the images are resized to for testing. The parameter γ0 of Eq. (6) is fixed to 0. by default. We use gco-v3.0 software2 [2, 3, 0] to implement the graph-cut optimization. Evaluating the performance of iteration. First, we fix the number of iterations to 4 for efficiency, as 95.9% of the segmented results converge after 4 iterations (see Table 2). Then, the iteration performance is evaluated by 4 types of pixel-wise measures (ie., the recall (T P/(T P + F N )), precision (T P/(T P + F P )), F-measure (2/(/recall + /precision)), and quality (T P/(T P + F P + F N )), which are denoted by RC, P C, F, and Q, respectively) and the running time on a 2 GHz computer with 4.0 GB RAM. As shown in Table 3, each of the four measures reaches the highest after 4 iterations,
4 Table 2. The ratio of converged segmented results (η) with the increasing of iterations. η reaches 95.9% after the 4th iteration. Iter η(%) Table 3. Quantitative results of the iteration performance. Iter. RC (%) P C (%) F (%) Q (%) Time ms ms ms ms ms which take only 98 ms running time and thus can achieve real-time detection. Some qualitative results can be seen in Fig. 7. Comparing with the basic graph-cut segmentation. To show that our road shape prior can help improve the performance of basic graph-cut segmentation, we evaluate our method with BGCv (Basic graph cuts with only data terms) and BGCv2 (basic graph cuts with both data and smoothness terms). Quantitative results are provided using ROC curves that represent the trade-off between true positive rate (T P/(T P + F N )) and false positive rate (F P/(F P + T N )). The data terms for basic graph cuts are got by Eq. (6) using the training data taken from a fixed region (the same as our initialized labeling region). By changing γ0 from 0 to, we obtain the ROC curves and corresponding AU C (area under the curve) (see Fig. 8). Obviously, our method outperforms BGCv and BGCv2 with the highest AU C (95.4%). Qualitative results in Fig. 9 show that the detected road regions are enforced to be road-shaped with our method. Comparing with state-of-the-art algorithms. We compare our method with two state-of-the-art single image based road detection algorithms: the vanishing point (VP) based method [5] and the likelihood-based region growing (LRG) method [6]. Quantitative results (see Table 4) show that our method outperforms both VP and LRG with the measures of RC, P C, F, and Q. Qualitative results in Fig. 9 show that VP fails when there is no legible vanishing point (row 5, 6) or the road boundaries are curved (row 2), and the LRG fails due to overgrowth (row, 3, 5) or undergrowth (row 2, 4). Yet, our method performs the best on the first five instances, benefiting from the global optimization under a general road shape constraint. Discussion. While our method enforces such a shape prior on the road segmentation task, it can also result in a degradation in performance, e.g., a fork road may not satisfy our road shape prior (see, e.g., row 6 of Fig. 9). Besides, our method fails when the illuminantinvariant feature lacks discriminability in some cases (e.g., in the last instance of Fig. 9, the illuminant-invariant feature of the road and its surroundings is similar). Thus, engineering better features and exploring more useful road priors (e.g., a more general shape prior that can handle more complex road shapes such as forks or crossings) are our interesting directions for future work. Table 4. Quantitative results of our method compared with state-ofthe-art single image road detection algorithms VP [5] and LRG [6]. RC (%) P C (%) F (%) Q (%) VP 75.9 ± ± ± ± 22.9 LRG 89. ± ± ± ± 6.6 Ours 9.2 ± 8.6 ± 9.3 ± ± 6.9 Fig. 7. Qualitative results of the iteration performance. Fig. 8. ROC curves and the corresponding AU C of our method compared with basic graph-cut methods BGCv and BGCv2. 6. CONCLUSIONS In this paper, we proposed a general road shape prior to enforce the detected region to be road-shaped by encoding it into a graph-cut framework, where the training data is automatically generated from a predicted road region of the current image. Accurate road region is obtained by iteratively performing the graph cuts. Experimental results validate the robustness and efficiency of our method. Acknowledgment. The presented research work is supported by the National Natural Science Foundation of China (Grant No ) Fig. 9. Comparisons of our method over the basic graph-cut methods (BGCv, BGCv2) and two state-of-the-art methods (VP, LRG). Each row shows a different instance. White and black regions in column 2 7 denote roads and backgrounds, respectively. Column : Test images. Column 2 6: Detection results obtained by: VP [5], BGCv, BGCv2, LRG [6], and our method, respectively. Column 7: Manually labeled ground truth. Note that the same feature (i.e., the illuminant invariant) are used in the last four methods.
5 7. REFERENCES [] J.M. Alvarez, T. Gevers, and A.M. López, Vision-based road detection using road models, in Image Processing (ICIP), th IEEE International Conference on. IEEE, 2009, pp [2] Q. Huang, M. Han, B. Wu, and S. Ioffe, A hierarchical conditional random field model for labeling and segmenting images of street scenes, in Computer Vision and Pattern Recognition (CVPR), 20 IEEE Conference on. IEEE, 20, pp [3] P. Sturgess, L. Ladicky, N. Crook, and P.H.S. Torr, Scalable cascade inference for semantic image segmentation, in BMVC, 202. [4] G. Floros and B. Leibe, Joint 2d-3d temporally consistent semantic segmentation of street scenes, in Computer Vision and Pattern Recognition (CVPR), 202 IEEE Conference on. IEEE, 202, pp [5] H. Kong, J.Y. Audibert, and J. Ponce, Vanishing point detection for road detection, in Computer Vision and Pattern Recognition (CVPR), 2009 IEEE Conference on. IEEE, 2009, pp [6] J.M. Alvarez and A.M. Lopez, Road detection based on illuminant invariance, Intelligent Transportation Systems, IEEE Transactions on, vol. 2, no., pp , 20. [7] J.M. Alvarez, T. Gevers, Y. LeCun, and A.M. Lopez, Road scene segmentation from a single image, in Proceedings of the 2th European Conference on Computer Vision. Springer, 202. [8] Y. Boykov and M.P. Jolly, Interactive graph cuts for optimal boundary & region segmentation of objects in nd images, in Computer Vision, 200. ICCV 200. Proceedings. Eighth IEEE International Conference on. IEEE, 200, vol., pp [9] O. Veksler, Star shape prior for graph-cut image segmentation, in Proceedings of the 0th European Conference on Computer Vision: Part III. Springer, 2008, pp [0] V. Kolmogorov and R. Zabin, What energy functions can be minimized via graph cuts?, Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 26, no. 2, pp , [] J. Xiao, J. Hays, K.A. Ehinger, A. Oliva, and A. Torralba, Sun database: Large-scale scene recognition from abbey to zoo, in Computer vision and pattern recognition (CVPR), 200 IEEE conference on. IEEE, 200, pp [2] Y. Boykov, O. Veksler, and R. Zabih, Fast approximate energy minimization via graph cuts, Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 23, no., pp , 200. [3] Y. Boykov and V. Kolmogorov, An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision, Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 26, no. 9, pp ,
Supervised texture detection in images
Supervised texture detection in images Branislav Mičušík and Allan Hanbury Pattern Recognition and Image Processing Group, Institute of Computer Aided Automation, Vienna University of Technology Favoritenstraße
More informationCombining PGMs and Discriminative Models for Upper Body Pose Detection
Combining PGMs and Discriminative Models for Upper Body Pose Detection Gedas Bertasius May 30, 2014 1 Introduction In this project, I utilized probabilistic graphical models together with discriminative
More informationMonocular Vision Based Autonomous Navigation for Arbitrarily Shaped Urban Roads
Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August 14-15, 2014 Paper No. 127 Monocular Vision Based Autonomous Navigation for Arbitrarily
More informationData-driven Depth Inference from a Single Still Image
Data-driven Depth Inference from a Single Still Image Kyunghee Kim Computer Science Department Stanford University kyunghee.kim@stanford.edu Abstract Given an indoor image, how to recover its depth information
More informationMULTI-REGION SEGMENTATION
MULTI-REGION SEGMENTATION USING GRAPH-CUTS Johannes Ulén Abstract This project deals with multi-region segmenation using graph-cuts and is mainly based on a paper by Delong and Boykov [1]. The difference
More informationRegion-based Segmentation and Object Detection
Region-based Segmentation and Object Detection Stephen Gould Tianshi Gao Daphne Koller Presented at NIPS 2009 Discussion and Slides by Eric Wang April 23, 2010 Outline Introduction Model Overview Model
More informationObject detection using non-redundant local Binary Patterns
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Object detection using non-redundant local Binary Patterns Duc Thanh
More informationSegmentation. Bottom up Segmentation Semantic Segmentation
Segmentation Bottom up Segmentation Semantic Segmentation Semantic Labeling of Street Scenes Ground Truth Labels 11 classes, almost all occur simultaneously, large changes in viewpoint, scale sky, road,
More informationSupplementary Material: Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies
Supplementary Material: Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies Lena Gorelick Yuri Boykov Olga Veksler Computer Science Department University of Western Ontario Abstract Below
More informationColor Image Segmentation
Color Image Segmentation Yining Deng, B. S. Manjunath and Hyundoo Shin* Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 93106-9560 *Samsung Electronics Inc.
More informationAutomatic Shadow Removal by Illuminance in HSV Color Space
Computer Science and Information Technology 3(3): 70-75, 2015 DOI: 10.13189/csit.2015.030303 http://www.hrpub.org Automatic Shadow Removal by Illuminance in HSV Color Space Wenbo Huang 1, KyoungYeon Kim
More informationGraph Based Image Segmentation
AUTOMATYKA 2011 Tom 15 Zeszyt 3 Anna Fabijañska* Graph Based Image Segmentation 1. Introduction Image segmentation is one of the fundamental problems in machine vision. In general it aims at extracting
More informationDetection of a Single Hand Shape in the Foreground of Still Images
CS229 Project Final Report Detection of a Single Hand Shape in the Foreground of Still Images Toan Tran (dtoan@stanford.edu) 1. Introduction This paper is about an image detection system that can detect
More informationLearning the Three Factors of a Non-overlapping Multi-camera Network Topology
Learning the Three Factors of a Non-overlapping Multi-camera Network Topology Xiaotang Chen, Kaiqi Huang, and Tieniu Tan National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy
More informationScene Segmentation in Adverse Vision Conditions
Scene Segmentation in Adverse Vision Conditions Evgeny Levinkov Max Planck Institute for Informatics, Saarbrücken, Germany levinkov@mpi-inf.mpg.de Abstract. Semantic road labeling is a key component of
More informationA New Feature Local Binary Patterns (FLBP) Method
A New Feature Local Binary Patterns (FLBP) Method Jiayu Gu and Chengjun Liu The Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA Abstract - This paper presents
More informationSelf Lane Assignment Using Smart Mobile Camera For Intelligent GPS Navigation and Traffic Interpretation
For Intelligent GPS Navigation and Traffic Interpretation Tianshi Gao Stanford University tianshig@stanford.edu 1. Introduction Imagine that you are driving on the highway at 70 mph and trying to figure
More informationCHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION
CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION 6.1 INTRODUCTION Fuzzy logic based computational techniques are becoming increasingly important in the medical image analysis arena. The significant
More informationCost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling
[DOI: 10.2197/ipsjtcva.7.99] Express Paper Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling Takayoshi Yamashita 1,a) Takaya Nakamura 1 Hiroshi Fukui 1,b) Yuji
More informationSegmentation with non-linear constraints on appearance, complexity, and geometry
IPAM February 2013 Western Univesity Segmentation with non-linear constraints on appearance, complexity, and geometry Yuri Boykov Andrew Delong Lena Gorelick Hossam Isack Anton Osokin Frank Schmidt Olga
More informationMarkov Random Fields and Gibbs Sampling for Image Denoising
Markov Random Fields and Gibbs Sampling for Image Denoising Chang Yue Electrical Engineering Stanford University changyue@stanfoed.edu Abstract This project applies Gibbs Sampling based on different Markov
More informationMarkov Networks in Computer Vision. Sargur Srihari
Markov Networks in Computer Vision Sargur srihari@cedar.buffalo.edu 1 Markov Networks for Computer Vision Important application area for MNs 1. Image segmentation 2. Removal of blur/noise 3. Stereo reconstruction
More informationADAPTIVE GRAPH CUTS WITH TISSUE PRIORS FOR BRAIN MRI SEGMENTATION
ADAPTIVE GRAPH CUTS WITH TISSUE PRIORS FOR BRAIN MRI SEGMENTATION Abstract: MIP Project Report Spring 2013 Gaurav Mittal 201232644 This is a detailed report about the course project, which was to implement
More informationA novel template matching method for human detection
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 A novel template matching method for human detection Duc Thanh Nguyen
More informationAccurate 3D Face and Body Modeling from a Single Fixed Kinect
Accurate 3D Face and Body Modeling from a Single Fixed Kinect Ruizhe Wang*, Matthias Hernandez*, Jongmoo Choi, Gérard Medioni Computer Vision Lab, IRIS University of Southern California Abstract In this
More informationSeparating Objects and Clutter in Indoor Scenes
Separating Objects and Clutter in Indoor Scenes Salman H. Khan School of Computer Science & Software Engineering, The University of Western Australia Co-authors: Xuming He, Mohammed Bennamoun, Ferdous
More informationIMPROVED FINE STRUCTURE MODELING VIA GUIDED STOCHASTIC CLIQUE FORMATION IN FULLY CONNECTED CONDITIONAL RANDOM FIELDS
IMPROVED FINE STRUCTURE MODELING VIA GUIDED STOCHASTIC CLIQUE FORMATION IN FULLY CONNECTED CONDITIONAL RANDOM FIELDS M. J. Shafiee, A. G. Chung, A. Wong, and P. Fieguth Vision & Image Processing Lab, Systems
More informationPixel-Pair Features Selection for Vehicle Tracking
2013 Second IAPR Asian Conference on Pattern Recognition Pixel-Pair Features Selection for Vehicle Tracking Zhibin Zhang, Xuezhen Li, Takio Kurita Graduate School of Engineering Hiroshima University Higashihiroshima,
More informationA Feature Point Matching Based Approach for Video Objects Segmentation
A Feature Point Matching Based Approach for Video Objects Segmentation Yan Zhang, Zhong Zhou, Wei Wu State Key Laboratory of Virtual Reality Technology and Systems, Beijing, P.R. China School of Computer
More informationMarkov Networks in Computer Vision
Markov Networks in Computer Vision Sargur Srihari srihari@cedar.buffalo.edu 1 Markov Networks for Computer Vision Some applications: 1. Image segmentation 2. Removal of blur/noise 3. Stereo reconstruction
More informationDetecting motion by means of 2D and 3D information
Detecting motion by means of 2D and 3D information Federico Tombari Stefano Mattoccia Luigi Di Stefano Fabio Tonelli Department of Electronics Computer Science and Systems (DEIS) Viale Risorgimento 2,
More informationImage Resizing Based on Gradient Vector Flow Analysis
Image Resizing Based on Gradient Vector Flow Analysis Sebastiano Battiato battiato@dmi.unict.it Giovanni Puglisi puglisi@dmi.unict.it Giovanni Maria Farinella gfarinellao@dmi.unict.it Daniele Ravì rav@dmi.unict.it
More informationMoving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation
IJCSNS International Journal of Computer Science and Network Security, VOL.13 No.11, November 2013 1 Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial
More informationA Robust Two Feature Points Based Depth Estimation Method 1)
Vol.31, No.5 ACTA AUTOMATICA SINICA September, 2005 A Robust Two Feature Points Based Depth Estimation Method 1) ZHONG Zhi-Guang YI Jian-Qiang ZHAO Dong-Bin (Laboratory of Complex Systems and Intelligence
More informationPeople Tracking and Segmentation Using Efficient Shape Sequences Matching
People Tracking and Segmentation Using Efficient Shape Sequences Matching Junqiu Wang, Yasushi Yagi, and Yasushi Makihara The Institute of Scientific and Industrial Research, Osaka University 8-1 Mihogaoka,
More informationComputer Vision : Exercise 4 Labelling Problems
Computer Vision Exercise 4 Labelling Problems 13/01/2014 Computer Vision : Exercise 4 Labelling Problems Outline 1. Energy Minimization (example segmentation) 2. Iterated Conditional Modes 3. Dynamic Programming
More informationImage Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing
Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing Tomoyuki Nagahashi 1, Hironobu Fujiyoshi 1, and Takeo Kanade 2 1 Dept. of Computer Science, Chubu University. Matsumoto 1200,
More informationIntensity-Depth Face Alignment Using Cascade Shape Regression
Intensity-Depth Face Alignment Using Cascade Shape Regression Yang Cao 1 and Bao-Liang Lu 1,2 1 Center for Brain-like Computing and Machine Intelligence Department of Computer Science and Engineering Shanghai
More informationCS 5540 Spring 2013 Assignment 3, v1.0 Due: Apr. 24th 11:59PM
1 Introduction In this programming project, we are going to do a simple image segmentation task. Given a grayscale image with a bright object against a dark background and we are going to do a binary decision
More informationFitting: The Hough transform
Fitting: The Hough transform Voting schemes Let each feature vote for all the models that are compatible with it Hopefully the noise features will not vote consistently for any single model Missing data
More informationDisguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601
Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network Nathan Sun CIS601 Introduction Face ID is complicated by alterations to an individual s appearance Beard,
More informationUse of Shape Deformation to Seamlessly Stitch Historical Document Images
Use of Shape Deformation to Seamlessly Stitch Historical Document Images Wei Liu Wei Fan Li Chen Jun Sun Satoshi Naoi In China, efforts are being made to preserve historical documents in the form of digital
More informationDiscrete Optimization of Ray Potentials for Semantic 3D Reconstruction
Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction Marc Pollefeys Joined work with Nikolay Savinov, Christian Haene, Lubor Ladicky 2 Comparison to Volumetric Fusion Higher-order ray
More informationLearning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009
Learning and Inferring Depth from Monocular Images Jiyan Pan April 1, 2009 Traditional ways of inferring depth Binocular disparity Structure from motion Defocus Given a single monocular image, how to infer
More informationA New Bayesian Framework for Object Recognition
Proceedings of IEEE conference on Computer Vision and Pattern Recognition (CVPR), 1999 vol. II, p.517 A New Bayesian Framework for Object Recognition Yuri Boykov Daniel P. Huttenlocher Computer Science
More informationSegmentation and Tracking of Partial Planar Templates
Segmentation and Tracking of Partial Planar Templates Abdelsalam Masoud William Hoff Colorado School of Mines Colorado School of Mines Golden, CO 800 Golden, CO 800 amasoud@mines.edu whoff@mines.edu Abstract
More informationIterated Graph Cuts for Image Segmentation
Iterated Graph Cuts for Image Segmentation Bo Peng 1, Lei Zhang 1, and Jian Yang 2 1 Department of Computing, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China. 2 School of Computer Science
More informationLearning image representations equivariant to ego-motion (Supplementary material)
Learning image representations equivariant to ego-motion (Supplementary material) Dinesh Jayaraman UT Austin dineshj@cs.utexas.edu Kristen Grauman UT Austin grauman@cs.utexas.edu max-pool (3x3, stride2)
More informationCorrecting User Guided Image Segmentation
Correcting User Guided Image Segmentation Garrett Bernstein (gsb29) Karen Ho (ksh33) Advanced Machine Learning: CS 6780 Abstract We tackle the problem of segmenting an image into planes given user input.
More informationObject Recognition. Lecture 11, April 21 st, Lexing Xie. EE4830 Digital Image Processing
Object Recognition Lecture 11, April 21 st, 2008 Lexing Xie EE4830 Digital Image Processing http://www.ee.columbia.edu/~xlx/ee4830/ 1 Announcements 2 HW#5 due today HW#6 last HW of the semester Due May
More informationFace Alignment Under Various Poses and Expressions
Face Alignment Under Various Poses and Expressions Shengjun Xin and Haizhou Ai Computer Science and Technology Department, Tsinghua University, Beijing 100084, China ahz@mail.tsinghua.edu.cn Abstract.
More informationBeyond Bags of Features
: for Recognizing Natural Scene Categories Matching and Modeling Seminar Instructed by Prof. Haim J. Wolfson School of Computer Science Tel Aviv University December 9 th, 2015
More informationDynamic Shape Tracking via Region Matching
Dynamic Shape Tracking via Region Matching Ganesh Sundaramoorthi Asst. Professor of EE and AMCS KAUST (Joint work with Yanchao Yang) The Problem: Shape Tracking Given: exact object segmentation in frame1
More informationIntroduction à la vision artificielle X
Introduction à la vision artificielle X Jean Ponce Email: ponce@di.ens.fr Web: http://www.di.ens.fr/~ponce Planches après les cours sur : http://www.di.ens.fr/~ponce/introvis/lect10.pptx http://www.di.ens.fr/~ponce/introvis/lect10.pdf
More informationA Partial Curve Matching Method for Automatic Reassembly of 2D Fragments
A Partial Curve Matching Method for Automatic Reassembly of 2D Fragments Liangjia Zhu 1, Zongtan Zhou 1, Jingwei Zhang 2,andDewenHu 1 1 Department of Automatic Control, College of Mechatronics and Automation,
More informationTranslation Symmetry Detection: A Repetitive Pattern Analysis Approach
2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops Translation Symmetry Detection: A Repetitive Pattern Analysis Approach Yunliang Cai and George Baciu GAMA Lab, Department of Computing
More informationMarkov Random Fields and Segmentation with Graph Cuts
Markov Random Fields and Segmentation with Graph Cuts Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem Administrative stuffs Final project Proposal due Oct 27 (Thursday) HW 4 is out
More informationA Benchmark for Interactive Image Segmentation Algorithms
A Benchmark for Interactive Image Segmentation Algorithms Yibiao Zhao 1,3, Xiaohan Nie 2,3, Yanbiao Duan 2,3, Yaping Huang 1, Siwei Luo 1 1 Beijing Jiaotong University, 2 Beijing Institute of Technology,
More informationBIL Computer Vision Apr 16, 2014
BIL 719 - Computer Vision Apr 16, 2014 Binocular Stereo (cont d.), Structure from Motion Aykut Erdem Dept. of Computer Engineering Hacettepe University Slide credit: S. Lazebnik Basic stereo matching algorithm
More informationA Background Modeling Approach Based on Visual Background Extractor Taotao Liu1, a, Lin Qi2, b and Guichi Liu2, c
4th International Conference on Mechatronics, Materials, Chemistry and Computer Engineering (ICMMCCE 2015) A Background Modeling Approach Based on Visual Background Extractor Taotao Liu1, a, Lin Qi2, b
More informationTri-modal Human Body Segmentation
Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4
More informationImage Classification Using Wavelet Coefficients in Low-pass Bands
Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August -7, 007 Image Classification Using Wavelet Coefficients in Low-pass Bands Weibao Zou, Member, IEEE, and Yan
More informationTransactions on Information and Communications Technologies vol 16, 1996 WIT Press, ISSN
ransactions on Information and Communications echnologies vol 6, 996 WI Press, www.witpress.com, ISSN 743-357 Obstacle detection using stereo without correspondence L. X. Zhou & W. K. Gu Institute of Information
More informationAnalysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009
Analysis: TextonBoost and Semantic Texton Forests Daniel Munoz 16-721 Februrary 9, 2009 Papers [shotton-eccv-06] J. Shotton, J. Winn, C. Rother, A. Criminisi, TextonBoost: Joint Appearance, Shape and Context
More informationA Road Marking Extraction Method Using GPGPU
, pp.46-54 http://dx.doi.org/10.14257/astl.2014.50.08 A Road Marking Extraction Method Using GPGPU Dajun Ding 1, Jongsu Yoo 1, Jekyo Jung 1, Kwon Soon 1 1 Daegu Gyeongbuk Institute of Science and Technology,
More informationGeneric Face Alignment Using an Improved Active Shape Model
Generic Face Alignment Using an Improved Active Shape Model Liting Wang, Xiaoqing Ding, Chi Fang Electronic Engineering Department, Tsinghua University, Beijing, China {wanglt, dxq, fangchi} @ocrserv.ee.tsinghua.edu.cn
More informationTexture Sensitive Image Inpainting after Object Morphing
Texture Sensitive Image Inpainting after Object Morphing Yin Chieh Liu and Yi-Leh Wu Department of Computer Science and Information Engineering National Taiwan University of Science and Technology, Taiwan
More informationMarkov/Conditional Random Fields, Graph Cut, and applications in Computer Vision
Markov/Conditional Random Fields, Graph Cut, and applications in Computer Vision Fuxin Li Slides and materials from Le Song, Tucker Hermans, Pawan Kumar, Carsten Rother, Peter Orchard, and others Recap:
More informationA robust stereo prior for human segmentation
A robust stereo prior for human segmentation Glenn Sheasby, Julien Valentin, Nigel Crook, Philip Torr Oxford Brookes University Abstract. The emergence of affordable depth cameras has enabled significant
More informationIRIS SEGMENTATION OF NON-IDEAL IMAGES
IRIS SEGMENTATION OF NON-IDEAL IMAGES William S. Weld St. Lawrence University Computer Science Department Canton, NY 13617 Xiaojun Qi, Ph.D Utah State University Computer Science Department Logan, UT 84322
More informationLearning Two-View Stereo Matching
Learning Two-View Stereo Matching Jianxiong Xiao Jingni Chen Dit-Yan Yeung Long Quan Department of Computer Science and Engineering The Hong Kong University of Science and Technology The 10th European
More informationarxiv: v1 [cs.cv] 20 Dec 2016
End-to-End Pedestrian Collision Warning System based on a Convolutional Neural Network with Semantic Segmentation arxiv:1612.06558v1 [cs.cv] 20 Dec 2016 Heechul Jung heechul@dgist.ac.kr Min-Kook Choi mkchoi@dgist.ac.kr
More informationAccelerating Pattern Matching or HowMuchCanYouSlide?
Accelerating Pattern Matching or HowMuchCanYouSlide? Ofir Pele and Michael Werman School of Computer Science and Engineering The Hebrew University of Jerusalem {ofirpele,werman}@cs.huji.ac.il Abstract.
More informationBus Detection and recognition for visually impaired people
Bus Detection and recognition for visually impaired people Hangrong Pan, Chucai Yi, and Yingli Tian The City College of New York The Graduate Center The City University of New York MAP4VIP Outline Motivation
More informationSpatial Latent Dirichlet Allocation
Spatial Latent Dirichlet Allocation Xiaogang Wang and Eric Grimson Computer Science and Computer Science and Artificial Intelligence Lab Massachusetts Tnstitute of Technology, Cambridge, MA, 02139, USA
More informationUNSUPERVISED CO-SEGMENTATION BASED ON A NEW GLOBAL GMM CONSTRAINT IN MRF. Hongkai Yu, Min Xian, and Xiaojun Qi
UNSUPERVISED CO-SEGMENTATION BASED ON A NEW GLOBAL GMM CONSTRAINT IN MRF Hongkai Yu, Min Xian, and Xiaojun Qi Department of Computer Science, Utah State University, Logan, UT 84322-4205 hongkai.yu@aggiemail.usu.edu,
More informationsegments. The geometrical relationship of adjacent planes such as parallelism and intersection is employed for determination of whether two planes sha
A New Segment-based Stereo Matching using Graph Cuts Daolei Wang National University of Singapore EA #04-06, Department of Mechanical Engineering Control and Mechatronics Laboratory, 10 Kent Ridge Crescent
More informationMotion Estimation. There are three main types (or applications) of motion estimation:
Members: D91922016 朱威達 R93922010 林聖凱 R93922044 謝俊瑋 Motion Estimation There are three main types (or applications) of motion estimation: Parametric motion (image alignment) The main idea of parametric motion
More informationDeep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks
Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Si Chen The George Washington University sichen@gwmail.gwu.edu Meera Hahn Emory University mhahn7@emory.edu Mentor: Afshin
More informationSkin and Face Detection
Skin and Face Detection Linda Shapiro EE/CSE 576 1 What s Coming 1. Review of Bakic flesh detector 2. Fleck and Forsyth flesh detector 3. Details of Rowley face detector 4. Review of the basic AdaBoost
More informationSelection of Scale-Invariant Parts for Object Class Recognition
Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract
More informationIMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES
IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES Pin-Syuan Huang, Jing-Yi Tsai, Yu-Fang Wang, and Chun-Yi Tsai Department of Computer Science and Information Engineering, National Taitung University,
More informationCOMP 558 lecture 22 Dec. 1, 2010
Binocular correspondence problem Last class we discussed how to remap the pixels of two images so that corresponding points are in the same row. This is done by computing the fundamental matrix, defining
More informationSuperpixels via Pseudo-Boolean Optimization
Superpixels via Pseudo-Boolean Optimization Yuhang Zhang, Richard Hartley The Australian National University {yuhang.zhang, richard.hartley}@anu.edu.au John Mashford, Stewart Burn CSIRO {john.mashford,
More informationImproving Image Segmentation Quality Via Graph Theory
International Symposium on Computers & Informatics (ISCI 05) Improving Image Segmentation Quality Via Graph Theory Xiangxiang Li, Songhao Zhu School of Automatic, Nanjing University of Post and Telecommunications,
More informationMulti-Label Moves for Multi-Label Energies
Multi-Label Moves for Multi-Label Energies Olga Veksler University of Western Ontario some work is joint with Olivier Juan, Xiaoqing Liu, Yu Liu Outline Review multi-label optimization with graph cuts
More informationRobot localization method based on visual features and their geometric relationship
, pp.46-50 http://dx.doi.org/10.14257/astl.2015.85.11 Robot localization method based on visual features and their geometric relationship Sangyun Lee 1, Changkyung Eem 2, and Hyunki Hong 3 1 Department
More informationSubject-Oriented Image Classification based on Face Detection and Recognition
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationHuman Head-Shoulder Segmentation
Human Head-Shoulder Segmentation Hai Xin, Haizhou Ai Computer Science and Technology Tsinghua University Beijing, China ahz@mail.tsinghua.edu.cn Hui Chao, Daniel Tretter Hewlett-Packard Labs 1501 Page
More informationMulti-sensor Fusion Using Dempster s Theory of Evidence for Video Segmentation
Multi-sensor Fusion Using Dempster s Theory of Evidence for Video Segmentation Björn Scheuermann, Sotirios Gkoutelitsas, and Bodo Rosenhahn Institut für Informationsverarbeitung (TNT) Leibniz Universität
More informationNeighbourhood-consensus message passing and its potentials in image processing applications
Neighbourhood-consensus message passing and its potentials in image processing applications Tijana Ružić a, Aleksandra Pižurica a and Wilfried Philips a a Ghent University, TELIN-IPI-IBBT, Sint-Pietersnieuwstraat
More informationELL 788 Computational Perception & Cognition July November 2015
ELL 788 Computational Perception & Cognition July November 2015 Module 6 Role of context in object detection Objects and cognition Ambiguous objects Unfavorable viewing condition Context helps in object
More informationPairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement
Pairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement Daegeon Kim Sung Chun Lee Institute for Robotics and Intelligent Systems University of Southern
More informationRobust and accurate change detection under sudden illumination variations
Robust and accurate change detection under sudden illumination variations Luigi Di Stefano Federico Tombari Stefano Mattoccia Errico De Lisi Department of Electronics Computer Science and Systems (DEIS)
More informationSupplementary Material for Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains
Supplementary Material for Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains Jiahao Pang 1 Wenxiu Sun 1 Chengxi Yang 1 Jimmy Ren 1 Ruichao Xiao 1 Jin Zeng 1 Liang Lin 1,2 1 SenseTime Research
More informationintro, applications MRF, labeling... how it can be computed at all? Applications in segmentation: GraphCut, GrabCut, demos
Image as Markov Random Field and Applications 1 Tomáš Svoboda, svoboda@cmp.felk.cvut.cz Czech Technical University in Prague, Center for Machine Perception http://cmp.felk.cvut.cz Talk Outline Last update:
More informationUnderstanding Tracking and StroMotion of Soccer Ball
Understanding Tracking and StroMotion of Soccer Ball Nhat H. Nguyen Master Student 205 Witherspoon Hall Charlotte, NC 28223 704 656 2021 rich.uncc@gmail.com ABSTRACT Soccer requires rapid ball movements.
More informationComparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters
Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters Marshall F. Tappen William T. Freeman Computer Science and Artificial Intelligence Laboratory Massachusetts Institute
More informationInteractive Image Segmentation with GrabCut
Interactive Image Segmentation with GrabCut Bryan Anenberg Stanford University anenberg@stanford.edu Michela Meister Stanford University mmeister@stanford.edu Abstract We implement GrabCut and experiment
More informationSupplementary material: Strengthening the Effectiveness of Pedestrian Detection with Spatially Pooled Features
Supplementary material: Strengthening the Effectiveness of Pedestrian Detection with Spatially Pooled Features Sakrapee Paisitkriangkrai, Chunhua Shen, Anton van den Hengel The University of Adelaide,
More information