Rapid Face and Object Detection in ios
|
|
- Cynthia Lamb
- 6 years ago
- Views:
Transcription
1 Rapid Face and Object Detection in ios Instructor - Simon Lucey Designing Computer Vision Apps
2 Today Background on Rapid Face Detection Facial Feature Detection in ios. Dlib for Object Detection.
3
4 Face Detection
5 What can OpenCV do? Functionality overview Image Processing Filters Transformations Video, Stereo, 3D Calibration Pose estimation Taken from OpenCV 3.0 latest news and the roadmap. Edges, contours Robust features Segmentation Optical Flow Detection and recognition Depth
6 OpenCV 3.0 In terms of detectors all the standards one are still there:- Viola & Jones style face detector. Dalal & Triggs style pedestrian detector. (d) Support in 3.0 now for deformable parts based models. Dalal, Navneet, and Bill Triggs. "Histograms of oriented gradients for human detection." 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05). Vol. 1. IEEE, Viola, Paul, and Michael Jones. "Rapid object detection using a boosted cascade of simple features." Computer Vision and Pattern Recognition, CVPR Proceedings of the 2001 IEEE Computer Society Conference on. Vol. 1. IEEE, 2001.
7 Viola & Jones Computationally Expensive Cheap High Low Capacity Viola, Paul, and Michael Jones. "Rapid object detection using a boosted cascade of simple features." Computer Vision and Pattern Recognition, CVPR Proceedings of the 2001 IEEE Computer Society Conference on. Vol. 1. IEEE, 2001.
8 Viola & Jones Instead of searching all regions of an image with the same complexity classifier, we can use a cascade. All Sub!windows T T F F F T Further Processing Reject Sub!window
9 Viola & Jones Vast majority of regions in an natural image can be rejected using small capacity classifiers. Figure 4: From left to right: input image, followed by portions of the image which contain un-reject patches after the sequential evaluation of 1 (13.3% patches remaining), 10 (2.6%), 20 (0.01%) and 30 (0.002%) support vectors. Note that in these images, a pixel is displayed if it is part of any remaining un-rejected patch at any scale, orientation or position This explains the apparent discrepancy between the above percentages and the visual impression.
10 Improving Speed Further Viola & Jones (2001) suggested using box filters. (Sub)image I(x, y) Two adjacent regions R 1, R 2 Feature value: f(i) = (x,y) R 2 I(x, y) (x,y) R 1 I(x, y) 10 (Black)
11 Box Filters (Sub)image I(x, y) Two adjacent regions R 1, R 2 R 1 Feature value: f(i) = (x,y) R 2 I(x, y) (x,y) R 1 I(x, y) 11 (Black)
12 Box Filters (Sub)image I(x, y) R 2 Two adjacent regions R 1, R 2 R 1 Feature value: f(i) = (x,y) R 2 I(x, y) (x,y) R 1 I(x, y) 12 (Black)
13 Box Filters Three types of box filters Vary size, aspect ration, location, orientation Defined over window 160,000 distinct features Classifier using a single feature: 13 (Black)
14 Why do this? We need to compute the box filter values many, many times and we must do it very fast! I(x,y ) II(x, y) = x x, y y I(x,y ) (x, y) 14 (Black)
15 Computing Integral Images Computing sum of pixels in a rectangular area: f(a) = 15 (Black)
16 Computing Integral Images Computing sum of pixels in a rectangular area: f(a) = II(A) A 16 (Black)
17 Computing Integral Images Computing sum of pixels in a rectangular area: f(a) = II(A) II(B) B A 17 (Black)
18 Computing Integral Images Computing sum of pixels in a rectangular area: f(a) = II(A) II(B) II(C) B C A 18 (Black)
19 Computing Integral Images Computing sum of pixels in a rectangular area: f(a) = II(A) II(B) II(C) + II(D) D B C A A 3 box filter array takes only 8 lookups. 19 (Black)
20 Computational Comparison To evaluate a 24x24 region of an image using pixels with a linear classifier takes 576 lookups and flops. Conversely, to evaluate a 24x24 region with a 3 box filter classifier takes 8 lookups and flops. Detector False detections Viola-Jones 78.3% 85.2% 88.8% 89.8% 90.1% 90.8% 91.1% 91.8% 93.7% Rowley-Baluja-Kanade 83.2% 86.0% % % 89.9% Evaluation reported on the MIT-CMU Face Database Viola, Paul, and Michael Jones. "Rapid object detection using a boosted cascade of simple features." Computer Vision and Pattern Recognition, CVPR Proceedings of the 2001 IEEE Computer Society Conference on. Vol. 1. IEEE, 2001.
21 Viola & Jones Boosting is ideally suited to be used with box filters. Techniques like AdaBoost, LogitBoost or GentleBoost can naturally learn a complex classifier from a cascade of weak classifiers. H(x) = sign M m=1 mh m (x) Viola, Paul, and Michael Jones. "Rapid object detection using a boosted cascade of simple features." Computer Vision and Pattern Recognition, CVPR Proceedings of the 2001 IEEE Computer Society Conference on. Vol. 1. IEEE, 2001.
22 Schneiderman & Kanade Schneiderman & Kanade reported improved results using a Semi-Naive Bayesian classifier. Technique used a cascade of box filters, but used a mutual information (MI) to group features into independent sets.!"#$%&'()*+),-%'!"#$%&'()*+),-%'!"#$%&'()*+),-%'
23 Schneiderman & Kanade
24 The Future - Deep Detection? Farfade, Sachin Sudhakar, Mohammad J. Saberian, and Li-Jia Li. "Multi-view face detection using deep convolutional neural networks." Proceedings of the 5th ACM on International Conference on Multimedia Retrieval. ACM, Li, Haoxiang, et al. "A convolutional neural network cascade for face detection." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
25 The Future - Deep Detection 24-net 12-net Input Image Convolutional layer Max-pooling Fully-connected layer layer 3 channels 16 3x3 filters 12x12 stride 1 3x3 kernel stride 2 16 outputs Labels 2 classes face / non-face Input Image Convolutional layer 3 channels 24x x5 filters stride 1 resize Max-pooling Fully-connected layer layer Labels 2 classes face / non-face 3x3 kernel stride outputs 12-net Fully-connected layer 48-net Input Image 3 channels 48x48 Convolutional layer 64 5x5 filters stride 1 resize Normalization Max-pooling Fully-connected Max-pooling Normalization Convolutional layer layer layer layer layer layer 3x3 kernel stride 2 9x9 region 64 5x5 filters stride 1 9x9 region 24-net 3x3 kernel stride 2 Labels 2 classes 256 outputs face / non-face Fully-connected layer Still uses a cascade strategy for speed. 12x12 network disposes of 90% of image, where then the subsequent denser networks are used. Li, Haoxiang, et al. "A convolutional neural network cascade for face detection." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
26 The Future - Deep Detection Li, Haoxiang, et al. "A convolutional neural network cascade for face detection." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
27 Today Background on Rapid Face Detection Facial Feature Detection in ios. Dlib for Object Detection.
28 Face Detection in ios In ios face detection comes built in, and can be performed much more efficiently than standard OpenCV. Utilizes the QuartzCore and CoreImage frameworks within the project.
29 Facial Feature Detection
30 CIFaceFeature
31 CIFaceFeature
32 CIFaceFeature Example We are now going to demonstrate a simple example of face detection in ios. On your browser please go to the address, Or better yet, if you have git installed you can type from the command line. $ git clone
33 CIFaceFeature Example
34 CIFaceFeature Example
35 Smerk and GPUImage Recently, an extension to GPUImage was proposed to allow for the utilization of ios face detection within ios. Called Smerk - GitHub project page can be found at:-
36 Smerk Example We are now going to demonstrate how we can perform realtime face tracking through GPUImage. On your browser please go to the address, Or better yet, if you have git installed you can type from the command line. $ git clone
37 Smerk Example
38 Smerk Example
39 Smerk Example
40 Smerk Example
41 Today Background on Rapid Face Detection Facial Feature Detection in ios. Dlib for Object Detection.
42 Dlib C++ for Computer Vision Dlib is a general purpose cross-platform C++ library designed using contract programming and modern C++ techniques. It is open source software and licensed under the Boost Software License. Code is platform independent (Windows, Linux, MAC OS X). Check out more details at the link - Very useful set of vision and learning tools.
43 Why Dlib is useful? What makes Dlib very cool, is its ability to train your own object detectors quickly and easily. This is hard to do in OpenCV as it relies on something called Hard Negative Mining. Requires setting tricky parameters, and can often takes hours/days to train a model.
44 Dlib - Make your own detector! Dlib - uses the well known HOG - SVM pipeline for object detection - Dalal & Triggs Does not rely on HNM, instead employs Structural Support Vector Machine (SVM). No need for negative training set, no messy parameters. Using this tutorial authors were able to learn a face detector in just a few minutes using Dlib. Visualization of HOG Detector Dalal, Navneet, and Bill Triggs. "Histograms of oriented gradients for human detection." 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05). Vol. 1. IEEE, 2005.
45 Dlib versus OpenCV Red Box - Dlib Blue Circles - OpenCV Taken from
46 Dlib versus OpenCV Another example - 8 images of stop signs downloaded and labeled. Dlib was then used to create a HOG detector. Visualization of HOG Detector Taken from
47 More to read A Convolutional Neural Network Cascade for Face Detection Haoxiang Li, Zhe Lin, Xiaohui Shen, Jonathan Brandt, Gang Hua Stevens Institute of Technology Hoboken, NJ {hli18, ghua}@stevens.edu Abstract In real-world face detection, large visual variations, such as those due to pose, expression, and lighting, demand an advanced discriminative model to accurately differentiate faces from the backgrounds. Consequently, effective models for the problem tend to be computationally prohibitive. To address these two conflicting challenges, we propose a cascade architecture built on convolutional neural networks (CNNs) with very powerful discriminative capability, while maintaining high performance. The proposed CNN cascade operates at multiple resolutions, quickly rejects the background regions in the fast low resolution stages, and carefully evaluates a small number of challenging candidates in the last high resolution stage. To improve localization effectiveness, and reduce the number of candidates at later stages, we introduce a CNN-based calibration stage after each of the detection stages in the cascade. The output of each calibration stage is used to adjust the detection window position for input to the subsequent stage. The proposed method runs at 14 FPS on a single CPU core for VGA-resolution images and 100 FPS using a GPU, and achieves state-of-the-art detection performance on two public face detection benchmarks. 1. Introduction Face detection is a well studied problem in computer vision. Modern face detectors can easily detect near frontal faces. Recent research in this area focuses more on the uncontrolled face detection problem, where a number of factors such as pose changes, exaggerated expressions and extreme illuminations can lead to large visual variations in face appearance, and can severely degrade the robustness of the face detector. The difficulties in face detection mainly come from two aspects: 1) the large visual variations of human faces in the cluttered backgrounds; 2) the large search space of possible face positions and face sizes. The former one requires 1 Adobe Research San Jose, CA {zlin, xshen, jbrandt}@adobe.com the face detector to accurately address a binary classification problem while the latter one further imposes a time efficiency requirement. Ever since the seminal work of Viola et al. [27], the boosted cascade with simple features becomes the most popular and effective design for practical face detection. The simple nature of the features enable fast evaluation and quick early rejection of false positive detections. Meanwhile, the boosted cascade constructs an ensemble of the simple features to achieve accurate face vs. non-face classification. The original Viola-Jones face detector uses the Haar feature which is fast to evaluate yet discriminative enough for frontal faces. However, due to the simple nature of the Haar feature, it is relatively weak in the uncontrolled environment where faces are in varied poses, expressions under unexpected lighting. A number of improvements to the Viola-Jones face detector have been proposed in the past decade [30]. Most of them follow the boosted cascade framework with more advanced features. The advanced feature helps construct a more accurate binary classifier at the expense of extra computation. However, the number of cascade stages required to achieve the similar detection accuracy can be reduced. Hence the overall computation may remain the same or even reduced because of fewer cascade stages. This observation suggests that it is possible to apply more advanced features in a practical face detection solution as long as the false positive detections can be rejected quickly in the early stages. In this work, we propose to apply the Convolutional Neural Network (CNN) [13] to face detection. Compared with the previous hand-crafted features, CNN can automatically learn features to capture complex visual variations by leveraging a large amount of training data and its testing phase can be easily parallelized on GPU cores for acceleration. Considering the relatively high computational expense of the CNNs, exhaustively scanning the full image in multiple scales with a deep CNN is not a practical solution. To achieve fast face detection, we present a CNN cascade, MITSUBISHI ELECTRIC RESEARCH LABORATORIES Rapid Object Detection Using a Boosted Cascade of Simple Features Viola, P.; Jones, M. TR May 2004 H. Schneiderman & T. Kanade. Probabilistic Modeling of Local Appearance and Spatial Relationships for Object Recognition", CVPR P. Viola and M. Jones. Rapid Object Detection Using a Boosted Cascade of Simple Features, CVPR N. Dalal & B. Triggs, Histograms of Oriented Gradients for Human Detection, CVPR H. Li et al. A Convolutional Neural Network Cascade for Face Detection, CVPR 2015 Abstract This paper describes a machine learning approach for visual object detection which is capable of processing images extremely rapidly and achieving high detection rates. This work is distinguished by three key contributions. The first is the introduction of a new image representation called the Integral Image which allows the features used by our detector to be computed very quickly. The second is a learning algorithm, based on AdaBoost, which selects a small number of critical visual features from a larger set and yields extremely efficient classifiers[6]. The third contribution is a method for combining increasingly more complex classifiers in a cascade which allows background regions of the image to be quickly discarded while spending more computation on promising object-like regions. The cascade can be viewed as an object specific focus-of-attention mechanism which unlike previous approaches provides statistical guarantees that discarded regions are unlikely to contain the object of interest. In the domain of face detection the system yields detection rates comparable to the best previous systems. Used in real-time applications, the detector runs at 15 frames per second without resorting to image differencing or skin color detection. IEEE Computer Society Conference on Computer Vision and Pattern Recognition This work may not be copied or reproduced in whole or in part for any commercial purpose. Permission to copy in whole or in part without payment of fee is granted for nonprofit educational and research purposes provided that all such whole or partial copies include the following: a notice that such copying is by permission of Mitsubishi Electric Research Laboratories, Inc.; an acknowledgment of the authors and individual contributions to the work; and all applicable portions of the copyright notice. Copying, reproduction, or republishing for any other purpose shall require a license with payment of fee to Mitsubishi Electric Research Laboratories, Inc. All rights reserved. Copyright c Mitsubishi Electric Research Laboratories, Inc., Broadway, Cambridge, Massachusetts Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs INRIA Rhône-Alps, 655 avenue de l Europe, Montbonnot 38334, France {Navneet.Dalal,Bill.Triggs}@inrialpes.fr, Abstract We briefly discuss previous work on human detection in We study the question of feature sets for robust visual object recognition, adopting linear SVM based human detec- 2, give an overview of our method 3, describe our data sets in 4 and give a detailed description and experimental tion as a test case. After reviewing existing edge and gradient based descriptors, we show experimentally that grids evaluation of each stage of the process in 5 6. The main conclusions are summarized in 7. of Histograms of Oriented Gradient (HOG) descriptors significantly outperform existing feature sets for human detec- 2 Previous Work tion. We study the influence of each stage of the computation There is an extensive literature on object detection, but on performance, concluding that fine-scale gradients, fine here we mention just a few relevant papers on human detection [18,17,22,16,20]. See [6] for a survey. Papageorgiou et orientation binning, relatively coarse spatial binning, and high-quality local contrast normalization in overlapping descriptor blocks are all important for good results. The new SVM using rectified Haar wavelets as input descriptors, with al [18] describe a pedestrian detector based on a polynomial approach gives near-perfect separation on the original MIT a parts (subwindow) based variant in [17]. Depoortere et al pedestrian database, so we introduce a more challenging give an optimized version of this [2]. Gavrila & Philomen dataset containing over 1800 annotated human images with [8] take a more direct approach, extracting edge images and a large range of pose variations and backgrounds. matching them to a set of learned exemplars using chamfer distance. This has been used in a practical real-time pedestrian detection system [7]. Viola et al [22] build an efficient 1 Introduction Detecting humans in images is a challenging task owing moving person detector, using AdaBoost to train a chain of to their variable appearance and the wide range of poses that progressively more complex region rejection rules based on they can adopt. The first need is a robust feature set that Haar-like wavelets and space-time differences. Ronfard et allows the human form to be discriminated cleanly, even in al [19] build an articulated body detector by incorporating cluttered backgrounds under difficult illumination. We study SVM based limb classifiers over 1 st and 2 nd order Gaussian the issue of feature sets for human detection, showing that locally normalized Histogram of Oriented Gradient (HOG) de- of Felzenszwalb & Huttenlocher [3] and Ioffe & Forsyth filters in a dynamic programming framework similar to those scriptors provide excellent performance relative to other existing feature sets including wavelets [17,22]. The proposed position histograms with binary-thresholded gradient magni- [9]. Mikolajczyk et al [16] use combinations of orientation- descriptors are reminiscent of edge orientation histograms tudes to build a parts based method containing detectors for [4,5], SIFT descriptors [12] and shape contexts [1], but they faces, heads, and front and side profiles of upper and lower are computed on a dense grid of uniformly spaced cells and body parts. In contrast, our detector uses a simpler architecture with a single detection window, but appears to give they use overlapping local contrast normalizations for improved performance. We make a detailed study of the effects significantly higher performance on pedestrian images. of various implementation choices on detector performance, 3 Overview of the Method taking pedestrian detection (the detection of mostly visible people in more or less upright poses) as a test case. For simplicity and speed, we use linear SVM as a baseline classifier chain,which is summarized in fig. 1. Implementation details This section gives an overview of our feature extraction throughout the study. The new detectors give essentially perfect results on the MIT pedestrian test set [18,17], so we have well-normalized local histograms of image gradient orienta- are postponed until 6. The method is based on evaluating created a more challenging set containing over 1800 pedestrian images with a large range of poses and backgrounds. use over the past decade [4,5,12,15]. The basic idea is that tions in a dense grid. Similar features have seen increasing Ongoing work suggests that our feature set performs equally local object appearance and shape can often be characterized well for other shape-based object classes. rather well by the distribution of local intensity gradients or 1 Probabilistic Modeling of Local Appearance and Spatial Relationships for Object Recognition Abstract In this paper, we describe an algorithm for object recognition that explicitly models and estimates the posterior probability function, P( object image). We have chosen a functional form of the posterior probability function that captures the joint statistics of local appearance and position on the object as well as the statistics of local appearance in the visual world at large. We use a discrete representation of local appearance consisting of approximately 10 6 patterns. We compute an estimate of P( object image) in closed form by counting the frequency of occurrence of these patterns over various sets of training images. We have used this method for detecting human faces from frontal and profile views. The algorithm for frontal views has shown a detection rate of 93.0% with 88 false alarms on a set of 125 images containing 483 faces combining the MIT test set of Sung and Poggio with the CMU test sets of Rowley, Baluja, and Kanade. The algorithm for detection of profile views has also demonstrated promising results. 1. Introduction In this paper we derive a probabilistic model for object recognition based primarily on local appearance. Local appearance is a strong constraint for object recognition when the object contains areas of distinctive detailing. For example, the human face consists of distinctive local regions such as the eyes, nose, and mouth. However, local appearance alone is usually not sufficient to recognize an object. For example, a human face becomes unintelligible to a human observer when the various features are not in the proper spatial arrangement. Therefore the joint probability of local appearance and position on the object must be modeled. Nevertheless, representation of only the appearance of the object is still not sufficient for object recognition. Some local patterns on the object may be more unique than others. For example, the intensity patterns around the eyes of a human face are much more unique than the intensity patterns found on the cheeks. In order to represent the uniqueness of local appearance, the statistics of local appearance in the world at large must also be modeled. The underlying representation we have chosen for local Henry Schneiderman and Takeo Kanade Robotics Institute Carnegie Mellon University Pittsburgh, PA appearance is discrete. We have partitioned the space of local appearance into a finite number of patterns. The discrete nature of this representation allows us to estimate the overall statistical model, P( object image), in closed form by counting the frequency of occurrence of these patterns over various sets of training images. In this paper we derive a functional form for the posterior probability function P( object image) that combines these representational elements. We then describe how we have applied this model to the detection of faces in frontal view and profile. We begin in section 2 with a review of Bayes decision rule. We then describe our strategy for deriving the functional form of the posterior probability function in section 3 and perform the actual derivation in section 4. In section 5, we describe how use training images to estimate a specific probability function within the framework of this functional form. In section 6 and 7 we give our results for frontal face detection and profile detection, respectively. In section 8 we compare our representation with other appearance-based recognition methods. 2. Review of Bayes decision rule The posterior probability function gives the probability that the object is present given an input image. Knowledge of this function is all that is necessary to perform object recognition. For a given input image region, x = image, we decide whether the object is present or absent based on which probability is P( object x) P( object x) = 1 P object x ( ) larger, or, respectively. This choice is known as the maximum a posteriori (MAP) rule or the Bayes decision rule. Using this decision rule, we achieve optimal performance, in the sense of minimum rate of classification errors, if the posterior probability function is accurate. 3. Model derivation strategy Presented at CVPR98 Unfortunately, it is not practically feasible to fully represent P( object image) and achieve optimal performance; it is too large and complex a function to represent. The best we can do is choose a simplified form of P( object image) that can be reliably estimated using the available training data. Although a fully general form of P( object image) is intractable, it provides a useful starting point for derivation of a sim-
Histograms of Oriented Gradients for Human Detection p. 1/1
Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs INRIA Rhône-Alpes Grenoble, France Funding: acemedia, LAVA,
More informationObject Category Detection: Sliding Windows
04/10/12 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical
More informationHistogram of Oriented Gradients for Human Detection
Histogram of Oriented Gradients for Human Detection Article by Navneet Dalal and Bill Triggs All images in presentation is taken from article Presentation by Inge Edward Halsaunet Introduction What: Detect
More informationStudy of Viola-Jones Real Time Face Detector
Study of Viola-Jones Real Time Face Detector Kaiqi Cen cenkaiqi@gmail.com Abstract Face detection has been one of the most studied topics in computer vision literature. Given an arbitrary image the goal
More informationObject Category Detection: Sliding Windows
03/18/10 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Goal: Detect all instances of objects Influential Works in Detection Sung-Poggio
More informationFast Human Detection Using a Cascade of Histograms of Oriented Gradients
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Fast Human Detection Using a Cascade of Histograms of Oriented Gradients Qiang Zhu, Shai Avidan, Mei-Chen Yeh, Kwang-Ting Cheng TR26-68 June
More informationRecap Image Classification with Bags of Local Features
Recap Image Classification with Bags of Local Features Bag of Feature models were the state of the art for image classification for a decade BoF may still be the state of the art for instance retrieval
More informationWindow based detectors
Window based detectors CS 554 Computer Vision Pinar Duygulu Bilkent University (Source: James Hays, Brown) Today Window-based generic object detection basic pipeline boosting classifiers face detection
More informationObject Detection Design challenges
Object Detection Design challenges How to efficiently search for likely objects Even simple models require searching hundreds of thousands of positions and scales Feature design and scoring How should
More informationA Hybrid Face Detection System using combination of Appearance-based and Feature-based methods
IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.5, May 2009 181 A Hybrid Face Detection System using combination of Appearance-based and Feature-based methods Zahra Sadri
More informationhttps://en.wikipedia.org/wiki/the_dress Recap: Viola-Jones sliding window detector Fast detection through two mechanisms Quickly eliminate unlikely windows Use features that are fast to compute Viola
More informationA Survey of Various Face Detection Methods
A Survey of Various Face Detection Methods 1 Deepali G. Ganakwar, 2 Dr.Vipulsangram K. Kadam 1 Research Student, 2 Professor 1 Department of Engineering and technology 1 Dr. Babasaheb Ambedkar Marathwada
More informationPreviously. Window-based models for generic object detection 4/11/2011
Previously for generic object detection Monday, April 11 UT-Austin Instance recognition Local features: detection and description Local feature matching, scalable indexing Spatial verification Intro to
More informationGeneric Object-Face detection
Generic Object-Face detection Jana Kosecka Many slides adapted from P. Viola, K. Grauman, S. Lazebnik and many others Today Window-based generic object detection basic pipeline boosting classifiers face
More informationRGBD Face Detection with Kinect Sensor. ZhongJie Bi
RGBD Face Detection with Kinect Sensor ZhongJie Bi Outline The Existing State-of-the-art Face Detector Problems with this Face Detector Proposed solution to the problems Result and ongoing tasks The Existing
More informationFace Detection and Alignment. Prof. Xin Yang HUST
Face Detection and Alignment Prof. Xin Yang HUST Many slides adapted from P. Viola Face detection Face detection Basic idea: slide a window across image and evaluate a face model at every location Challenges
More informationPerson Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab
Person Detection in Images using HoG + Gentleboost Rahul Rajan June 1st July 15th CMU Q Robotics Lab 1 Introduction One of the goals of computer vision Object class detection car, animal, humans Human
More informationLarge-Scale Traffic Sign Recognition based on Local Features and Color Segmentation
Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,
More informationHuman Motion Detection and Tracking for Video Surveillance
Human Motion Detection and Tracking for Video Surveillance Prithviraj Banerjee and Somnath Sengupta Department of Electronics and Electrical Communication Engineering Indian Institute of Technology, Kharagpur,
More informationFace Detection using Hierarchical SVM
Face Detection using Hierarchical SVM ECE 795 Pattern Recognition Christos Kyrkou Fall Semester 2010 1. Introduction Face detection in video is the process of detecting and classifying small images extracted
More informationSkin and Face Detection
Skin and Face Detection Linda Shapiro EE/CSE 576 1 What s Coming 1. Review of Bakic flesh detector 2. Fleck and Forsyth flesh detector 3. Details of Rowley face detector 4. Review of the basic AdaBoost
More informationTowards Practical Evaluation of Pedestrian Detectors
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Towards Practical Evaluation of Pedestrian Detectors Mohamed Hussein, Fatih Porikli, Larry Davis TR2008-088 April 2009 Abstract Despite recent
More informationMobile Human Detection Systems based on Sliding Windows Approach-A Review
Mobile Human Detection Systems based on Sliding Windows Approach-A Review Seminar: Mobile Human detection systems Njieutcheu Tassi cedrique Rovile Department of Computer Engineering University of Heidelberg
More informationDiscriminative classifiers for image recognition
Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study
More informationFACE DETECTION AND RECOGNITION OF DRAWN CHARACTERS HERMAN CHAU
FACE DETECTION AND RECOGNITION OF DRAWN CHARACTERS HERMAN CHAU 1. Introduction Face detection of human beings has garnered a lot of interest and research in recent years. There are quite a few relatively
More informationRapid Object Detection Using a Boosted Cascade of Simple Features
MERL A MITSUBISHI ELECTRIC RESEARCH LABORATORY http://www.merl.com Rapid Object Detection Using a Boosted Cascade of Simple Features Paul Viola and Michael Jones TR-2004-043 May 2004 Abstract This paper
More informationObject recognition (part 1)
Recognition Object recognition (part 1) CSE P 576 Larry Zitnick (larryz@microsoft.com) The Margaret Thatcher Illusion, by Peter Thompson Readings Szeliski Chapter 14 Recognition What do we mean by object
More informationCategorization by Learning and Combining Object Parts
Categorization by Learning and Combining Object Parts Bernd Heisele yz Thomas Serre y Massimiliano Pontil x Thomas Vetter Λ Tomaso Poggio y y Center for Biological and Computational Learning, M.I.T., Cambridge,
More informationDetecting Pedestrians Using Patterns of Motion and Appearance
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Detecting Pedestrians Using Patterns of Motion and Appearance Viola, P.; Jones, M.; Snow, D. TR2003-90 August 2003 Abstract This paper describes
More informationClassifier Case Study: Viola-Jones Face Detector
Classifier Case Study: Viola-Jones Face Detector P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features. CVPR 2001. P. Viola and M. Jones. Robust real-time face detection.
More informationObject detection as supervised classification
Object detection as supervised classification Tues Nov 10 Kristen Grauman UT Austin Today Supervised classification Window-based generic object detection basic pipeline boosting classifiers face detection
More informationHigh Level Computer Vision. Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG)
High Level Computer Vision Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG) Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de http://www.d2.mpi-inf.mpg.de/cv
More informationDetection of a Single Hand Shape in the Foreground of Still Images
CS229 Project Final Report Detection of a Single Hand Shape in the Foreground of Still Images Toan Tran (dtoan@stanford.edu) 1. Introduction This paper is about an image detection system that can detect
More informationOut-of-Plane Rotated Object Detection using Patch Feature based Classifier
Available online at www.sciencedirect.com Procedia Engineering 41 (2012 ) 170 174 International Symposium on Robotics and Intelligent Sensors 2012 (IRIS 2012) Out-of-Plane Rotated Object Detection using
More informationHuman detection solution for a retail store environment
FACULDADE DE ENGENHARIA DA UNIVERSIDADE DO PORTO Human detection solution for a retail store environment Vítor Araújo PREPARATION OF THE MSC DISSERTATION Mestrado Integrado em Engenharia Eletrotécnica
More informationHuman Detection Based on a Probabilistic Assembly of Robust Part Detectors
Human Detection Based on a Probabilistic Assembly of Robust Part Detectors K. Mikolajczyk 1, C. Schmid 2, and A. Zisserman 1 1 Dept. of Engineering Science Oxford, OX1 3PJ, United Kingdom {km,az}@robots.ox.ac.uk
More informationEfficient Interest Point Detectors & Features
Efficient Interest Point Detectors & Features Instructor - Simon Lucey 16-423 - Designing Computer Vision Apps Today Review. Efficient Interest Point Detectors. Efficient Descriptors. Review In classical
More informationFace Detection on OpenCV using Raspberry Pi
Face Detection on OpenCV using Raspberry Pi Narayan V. Naik Aadhrasa Venunadan Kumara K R Department of ECE Department of ECE Department of ECE GSIT, Karwar, Karnataka GSIT, Karwar, Karnataka GSIT, Karwar,
More informationDetecting Pedestrians by Learning Shapelet Features
Detecting Pedestrians by Learning Shapelet Features Payam Sabzmeydani and Greg Mori School of Computing Science Simon Fraser University Burnaby, BC, Canada {psabzmey,mori}@cs.sfu.ca Abstract In this paper,
More informationComputer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia
Application Object Detection Using Histogram of Oriented Gradient For Artificial Intelegence System Module of Nao Robot (Control System Laboratory (LSKK) Bandung Institute of Technology) A K Saputra 1.,
More informationParallel Tracking. Henry Spang Ethan Peters
Parallel Tracking Henry Spang Ethan Peters Contents Introduction HAAR Cascades Viola Jones Descriptors FREAK Descriptor Parallel Tracking GPU Detection Conclusions Questions Introduction Tracking is a
More informationHistogram of Oriented Gradients (HOG) for Object Detection
Histogram of Oriented Gradients (HOG) for Object Detection Navneet DALAL Joint work with Bill TRIGGS and Cordelia SCHMID Goal & Challenges Goal: Detect and localise people in images and videos n Wide variety
More informationFeature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking
Feature descriptors Alain Pagani Prof. Didier Stricker Computer Vision: Object and People Tracking 1 Overview Previous lectures: Feature extraction Today: Gradiant/edge Points (Kanade-Tomasi + Harris)
More informationFace Detection Using Look-Up Table Based Gentle AdaBoost
Face Detection Using Look-Up Table Based Gentle AdaBoost Cem Demirkır and Bülent Sankur Boğaziçi University, Electrical-Electronic Engineering Department, 885 Bebek, İstanbul {cemd,sankur}@boun.edu.tr
More informationPeople detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features
People detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features M. Siala 1, N. Khlifa 1, F. Bremond 2, K. Hamrouni 1 1. Research Unit in Signal Processing, Image Processing
More informationDetecting People in Images: An Edge Density Approach
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 27 Detecting People in Images: An Edge Density Approach Son Lam Phung
More informationFace and Nose Detection in Digital Images using Local Binary Patterns
Face and Nose Detection in Digital Images using Local Binary Patterns Stanko Kružić Post-graduate student University of Split, Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture
More informationFace detection. Bill Freeman, MIT April 5, 2005
Face detection Bill Freeman, MIT 6.869 April 5, 2005 Today (April 5, 2005) Face detection Subspace-based Distribution-based Neural-network based Boosting based Some slides courtesy of: Baback Moghaddam,
More informationLast week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints
Last week Multi-Frame Structure from Motion: Multi-View Stereo Unknown camera viewpoints Last week PCA Today Recognition Today Recognition Recognition problems What is it? Object detection Who is it? Recognizing
More informationImage Analysis. Window-based face detection: The Viola-Jones algorithm. iphoto decides that this is a face. It can be trained to recognize pets!
Image Analysis 2 Face detection and recognition Window-based face detection: The Viola-Jones algorithm Christophoros Nikou cnikou@cs.uoi.gr Images taken from: D. Forsyth and J. Ponce. Computer Vision:
More informationDetecting Pedestrians Using Patterns of Motion and Appearance
International Journal of Computer Vision 63(2), 153 161, 2005 c 2005 Springer Science + Business Media, Inc. Manufactured in The Netherlands. Detecting Pedestrians Using Patterns of Motion and Appearance
More informationSelection of Scale-Invariant Parts for Object Class Recognition
Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract
More informationTriangle Method for Fast Face Detection on the Wild
Journal of Multimedia Information System VOL. 5, NO. 1, March 2018 (pp. 15-20): ISSN 2383-7632(Online) http://dx.doi.org/10.9717/jmis.2018.5.1.15 Triangle Method for Fast Face Detection on the Wild Karimov
More informationDetecting Pedestrians Using Patterns of Motion and Appearance
Detecting Pedestrians Using Patterns of Motion and Appearance Paul Viola Michael J. Jones Daniel Snow Microsoft Research Mitsubishi Electric Research Labs Mitsubishi Electric Research Labs viola@microsoft.com
More informationBayes Risk. Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Discriminative vs Generative Models. Loss functions in classifiers
Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Examine each window of an image Classify object class within each window based on a training set images Example: A Classification Problem Categorize
More informationFROM VIDEO STREAMS IN THE WILD
SEMANTIC FACE SEGMENTATION FROM VIDEO STREAMS IN THE WILD Student: Deividas Skiparis Supervisors: Pascal Landry (Imersivo) and Sergio Escalera (UB) In Collaboration With Imersivo SL 7/5/2017 2 Problem
More informationVisuelle Perzeption für Mensch- Maschine Schnittstellen
Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de
More informationHuman detections using Beagle board-xm
Human detections using Beagle board-xm CHANDAN KUMAR 1 V. AJAY KUMAR 2 R. MURALI 3 1 (M. TECH STUDENT, EMBEDDED SYSTEMS, DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING, VIJAYA KRISHNA INSTITUTE
More informationHand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction
Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction Chieh-Chih Wang and Ko-Chih Wang Department of Computer Science and Information Engineering Graduate Institute of Networking
More informationClassifiers for Recognition Reading: Chapter 22 (skip 22.3)
Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Examine each window of an image Classify object class within each window based on a training set images Slide credits for this chapter: Frank
More informationComponent-based Face Recognition with 3D Morphable Models
Component-based Face Recognition with 3D Morphable Models B. Weyrauch J. Huang benjamin.weyrauch@vitronic.com jenniferhuang@alum.mit.edu Center for Biological and Center for Biological and Computational
More informationTraffic Sign Localization and Classification Methods: An Overview
Traffic Sign Localization and Classification Methods: An Overview Ivan Filković University of Zagreb Faculty of Electrical Engineering and Computing Department of Electronics, Microelectronics, Computer
More informationVideo Google faces. Josef Sivic, Mark Everingham, Andrew Zisserman. Visual Geometry Group University of Oxford
Video Google faces Josef Sivic, Mark Everingham, Andrew Zisserman Visual Geometry Group University of Oxford The objective Retrieve all shots in a video, e.g. a feature length film, containing a particular
More informationRecent Researches in Automatic Control, Systems Science and Communications
Real time human detection in video streams FATMA SAYADI*, YAHIA SAID, MOHAMED ATRI AND RACHED TOURKI Electronics and Microelectronics Laboratory Faculty of Sciences Monastir, 5000 Tunisia Address (12pt
More informationRobust PDF Table Locator
Robust PDF Table Locator December 17, 2016 1 Introduction Data scientists rely on an abundance of tabular data stored in easy-to-machine-read formats like.csv files. Unfortunately, most government records
More informationPrincipal Component Analysis and Neural Network Based Face Recognition
Principal Component Analysis and Neural Network Based Face Recognition Qing Jiang Mailbox Abstract People in computer vision and pattern recognition have been working on automatic recognition of human
More informationFace detection and recognition. Detection Recognition Sally
Face detection and recognition Detection Recognition Sally Face detection & recognition Viola & Jones detector Available in open CV Face recognition Eigenfaces for face recognition Metric learning identification
More informationFACE DETECTION BY HAAR CASCADE CLASSIFIER WITH SIMPLE AND COMPLEX BACKGROUNDS IMAGES USING OPENCV IMPLEMENTATION
FACE DETECTION BY HAAR CASCADE CLASSIFIER WITH SIMPLE AND COMPLEX BACKGROUNDS IMAGES USING OPENCV IMPLEMENTATION Vandna Singh 1, Dr. Vinod Shokeen 2, Bhupendra Singh 3 1 PG Student, Amity School of Engineering
More informationPedestrian Detection and Tracking in Images and Videos
Pedestrian Detection and Tracking in Images and Videos Azar Fazel Stanford University azarf@stanford.edu Viet Vo Stanford University vtvo@stanford.edu Abstract The increase in population density and accessibility
More informationCategory-level localization
Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object
More informationEfficient Interest Point Detectors & Features
Efficient Interest Point Detectors & Features Instructor - Simon Lucey 16-423 - Designing Computer Vision Apps Today Review. Efficient Interest Point Detectors. Efficient Descriptors. Review In classical
More informationTraffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers
Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers A. Salhi, B. Minaoui, M. Fakir, H. Chakib, H. Grimech Faculty of science and Technology Sultan Moulay Slimane
More informationHuman-Robot Interaction
Human-Robot Interaction Elective in Artificial Intelligence Lecture 6 Visual Perception Luca Iocchi DIAG, Sapienza University of Rome, Italy With contributions from D. D. Bloisi and A. Youssef Visual Perception
More informationInternational Journal Of Global Innovations -Vol.4, Issue.I Paper Id: SP-V4-I1-P17 ISSN Online:
IMPLEMENTATION OF EMBEDDED HUMAN TRACKING SYSTEM USING DM3730 DUALCORE PROCESSOR #1 DASARI ALEKHYA M.TECH Student, #2 Dr. SYED ABUDHAGIR.U Associate Professor, Dept of ECE B.V.RAJU INSTITUTE OF TECHNOLOGY,
More informationHaar Wavelets and Edge Orientation Histograms for On Board Pedestrian Detection
Haar Wavelets and Edge Orientation Histograms for On Board Pedestrian Detection David Gerónimo, Antonio López, Daniel Ponsa, and Angel D. Sappa Computer Vision Center, Universitat Autònoma de Barcelona
More informationComponent-based Face Recognition with 3D Morphable Models
Component-based Face Recognition with 3D Morphable Models Jennifer Huang 1, Bernd Heisele 1,2, and Volker Blanz 3 1 Center for Biological and Computational Learning, M.I.T., Cambridge, MA, USA 2 Honda
More informationDeformable Part Models
CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones
More information2D Image Processing Feature Descriptors
2D Image Processing Feature Descriptors Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Overview
More informationHuman detection using local shape and nonredundant
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Human detection using local shape and nonredundant binary patterns
More informationObject detection using non-redundant local Binary Patterns
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Object detection using non-redundant local Binary Patterns Duc Thanh
More informationProbabilistic Modeling of Local Appearance and Spatial Relationships for Object Recognition
Presented at CVPR98 Probabilistic Modeling of Local Appearance and Spatial Relationships for Object Recognition Henry Schneiderman and Takeo Kanade Robotics Institute Carnegie Mellon University Pittsburgh,
More informationCS4495/6495 Introduction to Computer Vision. 8C-L1 Classification: Discriminative models
CS4495/6495 Introduction to Computer Vision 8C-L1 Classification: Discriminative models Remember: Supervised classification Given a collection of labeled examples, come up with a function that will predict
More informationFast and Robust Classification using Asymmetric AdaBoost and a Detector Cascade
Fast and Robust Classification using Asymmetric AdaBoost and a Detector Cascade Paul Viola and Michael Jones Mistubishi Electric Research Lab Cambridge, MA viola@merl.com and mjones@merl.com Abstract This
More informationA New Strategy of Pedestrian Detection Based on Pseudo- Wavelet Transform and SVM
A New Strategy of Pedestrian Detection Based on Pseudo- Wavelet Transform and SVM M.Ranjbarikoohi, M.Menhaj and M.Sarikhani Abstract: Pedestrian detection has great importance in automotive vision systems
More informationGeneric Object Detection Using Improved Gentleboost Classifier
Available online at www.sciencedirect.com Physics Procedia 25 (2012 ) 1528 1535 2012 International Conference on Solid State Devices and Materials Science Generic Object Detection Using Improved Gentleboost
More informationHuman Detection. A state-of-the-art survey. Mohammad Dorgham. University of Hamburg
Human Detection A state-of-the-art survey Mohammad Dorgham University of Hamburg Presentation outline Motivation Applications Overview of approaches (categorized) Approaches details References Motivation
More informationAutomatic Initialization of the TLD Object Tracker: Milestone Update
Automatic Initialization of the TLD Object Tracker: Milestone Update Louis Buck May 08, 2012 1 Background TLD is a long-term, real-time tracker designed to be robust to partial and complete occlusions
More informationObject Category Detection. Slides mostly from Derek Hoiem
Object Category Detection Slides mostly from Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical template matching with sliding window Part-based Models
More informationCategory vs. instance recognition
Category vs. instance recognition Category: Find all the people Find all the buildings Often within a single image Often sliding window Instance: Is this face James? Find this specific famous building
More informationColor Model Based Real-Time Face Detection with AdaBoost in Color Image
Color Model Based Real-Time Face Detection with AdaBoost in Color Image Yuxin Peng, Yuxin Jin,Kezhong He,Fuchun Sun, Huaping Liu,LinmiTao Department of Computer Science and Technology, Tsinghua University,
More informationResearch on Robust Local Feature Extraction Method for Human Detection
Waseda University Doctoral Dissertation Research on Robust Local Feature Extraction Method for Human Detection TANG, Shaopeng Graduate School of Information, Production and Systems Waseda University Feb.
More informationDetecting and Reading Text in Natural Scenes
October 19, 2004 X. Chen, A. L. Yuille Outline Outline Goals Example Main Ideas Results Goals Outline Goals Example Main Ideas Results Given an image of an outdoor scene, goals are to: Identify regions
More informationFast Human Detection With Cascaded Ensembles On The GPU
Fast Human Detection With Cascaded Ensembles On The GPU The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation As Published Publisher
More informationFace detection, validation and tracking. Océane Esposito, Grazina Laurinaviciute, Alexandre Majetniak
Face detection, validation and tracking Océane Esposito, Grazina Laurinaviciute, Alexandre Majetniak Agenda Motivation and examples Face detection Face validation Face tracking Conclusion Motivation Goal:
More informationDisparity Search Range Estimation: Enforcing Temporal Consistency
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Disparity Search Range Estimation: Enforcing Temporal Consistency Dongbo Min, Sehoon Yea, Zafer Arican, Anthony Vetro TR1-13 April 1 Abstract
More informationFace/Flesh Detection and Face Recognition
Face/Flesh Detection and Face Recognition Linda Shapiro EE/CSE 576 1 What s Coming 1. Review of Bakic flesh detector 2. Fleck and Forsyth flesh detector 3. Details of Rowley face detector 4. The Viola
More informationGeneric Face Alignment Using an Improved Active Shape Model
Generic Face Alignment Using an Improved Active Shape Model Liting Wang, Xiaoqing Ding, Chi Fang Electronic Engineering Department, Tsinghua University, Beijing, China {wanglt, dxq, fangchi} @ocrserv.ee.tsinghua.edu.cn
More informationImage Processing Pipeline for Facial Expression Recognition under Variable Lighting
Image Processing Pipeline for Facial Expression Recognition under Variable Lighting Ralph Ma, Amr Mohamed ralphma@stanford.edu, amr1@stanford.edu Abstract Much research has been done in the field of automated
More informationObject detection. Asbjørn Berge. INF 5300 Advanced Topic: Video Content Analysis. Technology for a better society
INF 5300 Advanced Topic: Video Content Analysis Object detection Asbjørn Berge Small part of the Cydonia region, taken by the Viking 1orbiter and released by NASA/JPL on July 25, 1976 Salvador Dali (1974)
More informationSelective Search for Object Recognition
Selective Search for Object Recognition Uijlings et al. Schuyler Smith Overview Introduction Object Recognition Selective Search Similarity Metrics Results Object Recognition Kitten Goal: Problem: Where
More information