Human detection solution for a retail store environment
|
|
- Baldwin Mills
- 5 years ago
- Views:
Transcription
1 FACULDADE DE ENGENHARIA DA UNIVERSIDADE DO PORTO Human detection solution for a retail store environment Vítor Araújo PREPARATION OF THE MSC DISSERTATION Mestrado Integrado em Engenharia Eletrotécnica e de Computadores Supervisor: Jaime Cardoso, PhD (FEUP) Co-Supervisor: Pedro Carvalho, PhD (INESC Porto) July 5, 2013
2 c Vítor Araújo, 2013
3 Abstract This document provides an overview of the current state of the art on the field of human detection by automated systems, as part of the "Preparation of the MSc Dissertation" course unit. This will serve as a starting point for the Dissertation work, which will take place on the 1st semester of the 2013/2014 academic year. For this effect, a work plan and proposed work methodology are also included. i
4 ii
5 Contents Abstract Symbols and Abbreviations i ix 1 Introduction Motivation Objectives Document Structure State of the Art HOG Detection Results Conclusion CENTRIST Detection Results Conclusion Viola-Jones Detection Results Conclusion C Detection Results Conclusion HOG-LBP Detection Results Conclusion Planning Work plan Challenges Expected results Tools Conclusion 11 References 13 iii
6 iv CONTENTS
7 List of Figures 2.1 Detection performance of the HOG descriptor (from [1]) Comparison between C 4 and HOG on the INRIA data set (from [2]) Work plan estimate v
8 vi LIST OF FIGURES
9 List of Tables 2.1 Detection rates for certain numbers of false positives vii
10 viii LIST OF TABLES
11 Symbols and Abbreviations Acronyms 1D 1 Dimension, p. 3 CCTV Closed-circuit television, p. 1 CENTRIST CENsus TRansform histogram, pp. iii, 4 6 CMU Carnegie Mellon University, p. 5 CPU Central Processing Unit, p. 6 FBI Federal Bureau of Investigation, p. 1 FPPW False positive per window, pp. 3, 4, 7 FPS Frames per second, pp. 1, 6 GPU Graphics Processing Unit, p. 6 HOG Histogram of Oriented Gradients, pp. iii, v, 3, 6, 7 IDE Integrated Development Environment, p. 10 LBP Local Binary Pattern, pp. iii, 7 MIT Massachusetts Institute of Technology, pp. 3 5 SVM Support Vector Machine, p. 7 ix
12
13 Chapter 1 Introduction This chapter contextualizes the current importance of automatic human detection systems, their major advantages and weaknesses, while also describing the main objective of this project and the structure of the rest of the document. 1.1 Motivation In recent years, the adoption of security camera systems has spread drastically, not only in the enterprise environment but also for home and small store surveillance. These systems are usually installed in a closed environment (CCTV), transmitting and/or recording only to a local endpoint. The quality of the cameras deployed has also improved considerably along the years, going from black and white video with a low resolution and frame-rate to colored high-definition video with 60 FPS or even more. Fortunately for consumers, these improvements in video quality were also accompanied by an increasing affordability of high capacity data storage media, which has allowed for the recording and archival of surveillance video for longer periods of time, with little cost. Unfortunately, these technological advances come with a price. The amount of data captured by surveillance systems has made it difficult to find the exact information needed, be it the moment an intruder is recorded by the camera, or the unpredictable but potentially helpful recording of a missing person. This is where automated recognition systems come into play. Image recognition software has been the subject of many improvements over the years, but still has a long way to go before it achieves the same quality of detection that a human is capable of. Currently, the biggest advantage of automated systems is their speed. With the right off-the-shelf hardware to support it, some of these systems can scan through hours of high definition video in minutes. Nevertheless, even the most powerful hardware clusters are useless if they are not backed up by a good detection algorithm. For instance, in the 2013 Boston Marathon bombings in the United States of America, an unprecedented amount of footage was recorded. This footage was then process by the authorities but the results came up empty, despite the fact that both suspects were already in the FBI database [3]. The suspects ended up being detected by a human that was 1
14 2 Introduction looking through the video. The lack of effectiveness of the system was attributed to several causes, like the low resolution of the cameras and the long range of the recordings, some of which were badly focused and caught from angles that fell within the software s weaknesses. 1.2 Objectives The primary object of this project is to improve and adapt an existing algorithm to work in an object dense area. In this area, which is based on a small store environment, there will be objects blocking the people in the image. The majority of these objects will be static, but some moving objects can also be present and should be taken into account. 1.3 Document Structure Apart from this introductory chapter, this document has 3 more chapters. In chapter 2 a state of the art analysis and related work is presented. In chapter 3 the development planning is described. And in chapter 4 a brief conclusion reflects on future developments.
15 Chapter 2 State of the Art The field of computer vision technologies is currently the subject of many academic research projects, yet improvements still come in small amounts, as most research activity builds up on one of the previously available frameworks. Therefore, this chapter will cover the frameworks that are currently regarded as the best in the field, and then highlight some of the recent improvements that have been made. 2.1 HOG This descriptor was first purposed in 2005 in the paper Histograms of Oriented Gradients for Human Detection [1]. It is based on the concept that the distribution of intensity gradients or edge directions can define an object within an image. The practical implementation is done by dividing the image into spatial regions, called cells, which will contain a local 1D histogram of gradient directions or edge orientations over the cell pixels. The combination of histograms from all cells forms the image descriptor. In order to diminish the effect of illumination and shadowing variance, the local results should be contrast-normalize. This can be achieved by measuring the intensity across a larger area, called block, and using this value to normalize all cells within the block Detection Results The paper provides results from two different data sets: the MIT pedestrian database, which contains only front or back views, with a limited range of poses; the INRIA database, which was developed by the paper authors and provides a bigger challenge for the descriptor, by containing images of people standing but in any orientation, with a wide range of backgrounds, including crowds. For the MIT data set, the descriptor performed near-perfectly, with a miss rate of less than 1% at 10 4 FPPW. It was due to these results that the INRIA data set was developed. Its miss rate, for 3
16 4 State of the Art the same FPPW rate, is of 10%. While these results are worse than for the MIT data set, they still provide a big improvement over other descriptors. All these results can be seen in image 2.1. Figure 2.1: Detection performance of the HOG descriptor (from [1]) Conclusion The HOG descriptor provides a big improvement over previously used methods, which has led to its popularity as a starting point for recent projects in this field. Nevertheless, it misses a few key features, like accounting for image orientation. Its detection speed is also of concern, specially when applied to high resolution images and video. 2.2 CENTRIST The CENsus TRansform histogram descriptor first appeared in the paper CENTRIST: A Visual Descriptor for Scene Categorization [4] in As defined in this paper, "Census Transform (CT) is a non-parametric local transform originally designed for establishing correspondence between local patches. Census transform compares the intensity value of a pixel with its eight neighboring pixels (...). If the center pixel is bigger than (or equal to) one of its neighbors, a bit 1 is set in the corresponding location. Otherwise a bit 0 is set." [4] The result is a 3x3 binary matrix, with the central position empty, which can be translated into a base-10 number in the [0 255] range. This number represents the Census Transform value for the central pixel. Repeating the process for each pixel in the image, the resulting set of values can then be used as input to the classifier. The initial processing of the image using the Census Transform method allows the classifier to do an easier recognition of the scene, as it can ignore distracting elements, like textures and color, and focus on the more important geometric features and structural properties.
17 2.3 Viola-Jones Detection Results CENTRIST was compared with 2 other visual descriptors, SIFT [5] and Gist [6]. In one test, with both outdoor and indoor environments, CENTRIST s accuracy was of 83.88±0.76%, while Gist got a rate of 73.28±0.67%. As for SIFT, its highest discriminative power makes it inefficient for images with high variation. It got a rate of 57.24% false negative results, against 35.83% for CENTRIST Conclusion This method allows for a good evaluation of the type of scene. However, it does not implement object detection. 2.3 Viola-Jones This framework describes a method of detecting objects in a scene. Its mostly used implementation focuses on face detection [7], which, in the context of this project, makes it useful as a first detection algorithm. This procedure works by classifying images based on simple features. Using features instead of working directly on the pixels makes for a much faster processing, allowing the focus to be put on the quality of the results. By also implementing a cascade of classifiers, this method can achieve good detection rates (>85%) while providing low false positive rates (<10 5 ). As an example, a target rate of 0.9 can be obtained by using a 10 stage classifier. Each stage needs to have a detection rate of 0.99, which may seem difficult to achieve, but by having a large margin for error, it becomes much easier. More precisely, each stage can have a false positive rate of 0.3. The end result is: Detection rate: = False positive rate: = Detection Results Table 2.1 presents the detection accuracy of the Viola-Jones framework, compared to the Rowley- Baluja-Kanade [8] results, using the MIT-CMU test set, which contains 130 images and 507 faces. False positives Detector Viola-Jones 78.3% 85.2% 90.8% 91.8% 93.7% Rowley-Baluja-Kanade 83.2% 86.0% 89.2% 90.1% 89.9% Table 2.1: Detection rates for certain numbers of false positives As for its speed, on a 700 Mhz Pentium III processor, using 384 by 288 images, each one took seconds to process, which is about 15 times faster than the Rowley-Baluja-Kanade detector.
18 6 State of the Art Conclusion The Viola-Jones framework provides good and fast results for face detection, which can be useful as a first step for this project. By detecting the face first, the solution developed could then more easily detect the rest of the person, achieving better results in less time. 2.4 C 4 C 4 [2] is a detector based on CENTRIST which focuses on contour cues for its detection. The method works by first creating the Sobel gradients of the image, then computing the Census Transform values and creating a single integral image. This image is then resized and the brute-force scan is performed. The major performance advantage comes from only using one integral image, and from the fact that CENTRIST does not require normalization, unlike HOG Detection Results A comparison with HOG can be seen on image 2.2. Figure 2.2: Comparison between C 4 and HOG on the INRIA data set (from [2]) When comparing detection speed, C 4 can process a 640 by 480 video at 20 FPS using 1 core of a 2.8GHz CPU. The nearest comparable solution ran at 10 FPS, while also used parallel processing on a GPU Conclusion This implementation seems to be fast and accurate, by improving on previous work. However, it is still not as accurate as other methods, which is the most important factor, as CPU performance is constantly improving.
19 2.5 HOG-LBP HOG-LBP This method combines the framework described in 2.1 with a Local Binary Pattern, in a detector that also handles partial occlusion [9] by taking advantage of the LBP high discrimination, along with the HOG edge and local shape information capture. After computing the Histogram of Oriented Gradients and the Local Binary Pattern integral images, these are combined in an augmented feature vector. It is on this vector that the sliding window acts. By feeding the sliding window results to an SVM, each block can be scored. If the SVM scores the block with an ambiguous classification, an image segmentation algorithm is run, which segments the possible occlusion regions Detection Results The HOG-LBP method achieved a detection rate of 91.3% at 10 6 FPPW and 94.7% at 10 5 FPPW using the INRIA data set. This compares with HOG s rate of 90% at 10 4 FPPW. With a custom upper body data set, the improvement over HOG was of 20% at Conclusion Despite increasing the detector complexity, this method proved effective in detecting partially occluded subjects. This kind of situation will be the most common in this project, and this type of approach may be one of the more effective ones for a store environment.
20 8 State of the Art
21 Chapter 3 Planning This section presents an overview on how the project development will take place. It includes some of the expected challenges and results, an estimation of the development schedule and the tools that will be used. 3.1 Work plan Figure 3.1: Work plan estimate Note: Dates for Dissertation document delivery and presentation are not yet available. These estimates were based on previous years dates. 3.2 Challenges The biggest challenge perceived at this point will be understanding the initial code, how it works, and how each function affects the end result. After this has been accomplished, the development of the necessary algorithms for the detection to work in a complex environment will start. 9
22 10 Planning 3.3 Expected results At the end of this project, it is expected that the solution provided allows for an accurate human detection in a complex store environment. This solution should account for partial subject occlusion, either by objects or other subjects, and its results should fall within a reasonable false positive and false negative range, based on similar solutions. 3.4 Tools The development will be done on the Windows and Linux operating systems, using an appropriate Integrated Development Environment, which will be chosen based on the project that will serve as a basis for this work, thus minimizing the risk of errors and avoiding the need to configure a different IDE from scratch.
23 Chapter 4 Conclusion Effective and reliable human detection on still images or video is presently one of the most challenging aspects in the field of computer vision. The state of the art presented in this document shows some of the most relevant work currently behind developed in this area, and related to the subject of this project. By focusing on these approaches to the problem, a new solution is expected to be developed and implemented in the following months, bringing forward another small contribution to help improve the human detection systems. 11
24 12 Conclusion
25 References [1] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, CVPR IEEE Computer Society Conference on, volume 1, pages vol. 1, Cited on pages v, 3, and 4. [2] Jianxin Wu, C. Geyer, and J.M. Rehg. Real-time human detection using contour cues. In Robotics and Automation (ICRA), 2011 IEEE International Conference on, pages , doi: /icra Cited on pages v and 6. [3] Douglas McCormick. Face recognition failed to find boston bombers, April URL: face-recognition-failed-to-find-boston-bombers. Cited on page 1. [4] Jianxin Wu and J.M. Rehg. Centrist: A visual descriptor for scene categorization. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 33(8): , doi: /TPAMI Cited on page 4. [5] David G Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91 110, Cited on page 5. [6] Aude Oliva and Antonio Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42: , Cited on page 5. [7] Paul Viola and Michael Jones. Robust real-time object detection. In International Journal of Computer Vision, Cited on page 5. [8] Henry A Rowley, Shumeet Baluja, and Takeo Kanade. Neural network-based face detection. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 20(1):23 38, Cited on page 5. [9] Xiaoyu Wang, Tony X Han, and Shuicheng Yan. An hog-lbp human detector with partial occlusion handling. In Computer Vision, 2009 IEEE 12th International Conference on, pages IEEE, Cited on page 7. 13
Discriminative classifiers for image recognition
Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study
More informationFeature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking
Feature descriptors Alain Pagani Prof. Didier Stricker Computer Vision: Object and People Tracking 1 Overview Previous lectures: Feature extraction Today: Gradiant/edge Points (Kanade-Tomasi + Harris)
More informationWindow based detectors
Window based detectors CS 554 Computer Vision Pinar Duygulu Bilkent University (Source: James Hays, Brown) Today Window-based generic object detection basic pipeline boosting classifiers face detection
More informationHuman Detection. A state-of-the-art survey. Mohammad Dorgham. University of Hamburg
Human Detection A state-of-the-art survey Mohammad Dorgham University of Hamburg Presentation outline Motivation Applications Overview of approaches (categorized) Approaches details References Motivation
More informationHOG-based Pedestriant Detector Training
HOG-based Pedestriant Detector Training evs embedded Vision Systems Srl c/o Computer Science Park, Strada Le Grazie, 15 Verona- Italy http: // www. embeddedvisionsystems. it Abstract This paper describes
More informationObject Detection Design challenges
Object Detection Design challenges How to efficiently search for likely objects Even simple models require searching hundreds of thousands of positions and scales Feature design and scoring How should
More informationObject Category Detection: Sliding Windows
04/10/12 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical
More informationMultiple-Person Tracking by Detection
http://excel.fit.vutbr.cz Multiple-Person Tracking by Detection Jakub Vojvoda* Abstract Detection and tracking of multiple person is challenging problem mainly due to complexity of scene and large intra-class
More information2D Image Processing Feature Descriptors
2D Image Processing Feature Descriptors Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Overview
More informationHuman Motion Detection and Tracking for Video Surveillance
Human Motion Detection and Tracking for Video Surveillance Prithviraj Banerjee and Somnath Sengupta Department of Electronics and Electrical Communication Engineering Indian Institute of Technology, Kharagpur,
More informationHistograms of Oriented Gradients for Human Detection p. 1/1
Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs INRIA Rhône-Alpes Grenoble, France Funding: acemedia, LAVA,
More informationA New Strategy of Pedestrian Detection Based on Pseudo- Wavelet Transform and SVM
A New Strategy of Pedestrian Detection Based on Pseudo- Wavelet Transform and SVM M.Ranjbarikoohi, M.Menhaj and M.Sarikhani Abstract: Pedestrian detection has great importance in automotive vision systems
More informationCS4495/6495 Introduction to Computer Vision. 8C-L1 Classification: Discriminative models
CS4495/6495 Introduction to Computer Vision 8C-L1 Classification: Discriminative models Remember: Supervised classification Given a collection of labeled examples, come up with a function that will predict
More informationHuman detection using local shape and nonredundant
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Human detection using local shape and nonredundant binary patterns
More informationhttps://en.wikipedia.org/wiki/the_dress Recap: Viola-Jones sliding window detector Fast detection through two mechanisms Quickly eliminate unlikely windows Use features that are fast to compute Viola
More informationLinear combinations of simple classifiers for the PASCAL challenge
Linear combinations of simple classifiers for the PASCAL challenge Nik A. Melchior and David Lee 16 721 Advanced Perception The Robotics Institute Carnegie Mellon University Email: melchior@cmu.edu, dlee1@andrew.cmu.edu
More informationPedestrian Detection and Tracking in Images and Videos
Pedestrian Detection and Tracking in Images and Videos Azar Fazel Stanford University azarf@stanford.edu Viet Vo Stanford University vtvo@stanford.edu Abstract The increase in population density and accessibility
More informationMobile Human Detection Systems based on Sliding Windows Approach-A Review
Mobile Human Detection Systems based on Sliding Windows Approach-A Review Seminar: Mobile Human detection systems Njieutcheu Tassi cedrique Rovile Department of Computer Engineering University of Heidelberg
More informationFAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO
FAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO Makoto Arie, Masatoshi Shibata, Kenji Terabayashi, Alessandro Moro and Kazunori Umeda Course
More informationClassification and Detection in Images. D.A. Forsyth
Classification and Detection in Images D.A. Forsyth Classifying Images Motivating problems detecting explicit images classifying materials classifying scenes Strategy build appropriate image features train
More informationRecent Researches in Automatic Control, Systems Science and Communications
Real time human detection in video streams FATMA SAYADI*, YAHIA SAID, MOHAMED ATRI AND RACHED TOURKI Electronics and Microelectronics Laboratory Faculty of Sciences Monastir, 5000 Tunisia Address (12pt
More informationPreviously. Window-based models for generic object detection 4/11/2011
Previously for generic object detection Monday, April 11 UT-Austin Instance recognition Local features: detection and description Local feature matching, scalable indexing Spatial verification Intro to
More informationObject detection using non-redundant local Binary Patterns
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Object detection using non-redundant local Binary Patterns Duc Thanh
More informationPerson Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab
Person Detection in Images using HoG + Gentleboost Rahul Rajan June 1st July 15th CMU Q Robotics Lab 1 Introduction One of the goals of computer vision Object class detection car, animal, humans Human
More informationAffine-invariant scene categorization
University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 2014 Affine-invariant scene categorization Xue
More informationPedestrian Detection with Occlusion Handling
Pedestrian Detection with Occlusion Handling Yawar Rehman 1, Irfan Riaz 2, Fan Xue 3, Jingchun Piao 4, Jameel Ahmed Khan 5 and Hyunchul Shin 6 Department of Electronics and Communication Engineering, Hanyang
More informationMulti-Object Tracking Based on Tracking-Learning-Detection Framework
Multi-Object Tracking Based on Tracking-Learning-Detection Framework Songlin Piao, Karsten Berns Robotics Research Lab University of Kaiserslautern Abstract. This paper shows the framework of robust long-term
More informationDetecting Object Instances Without Discriminative Features
Detecting Object Instances Without Discriminative Features Edward Hsiao June 19, 2013 Thesis Committee: Martial Hebert, Chair Alexei Efros Takeo Kanade Andrew Zisserman, University of Oxford 1 Object Instance
More informationResearch on Robust Local Feature Extraction Method for Human Detection
Waseda University Doctoral Dissertation Research on Robust Local Feature Extraction Method for Human Detection TANG, Shaopeng Graduate School of Information, Production and Systems Waseda University Feb.
More informationHUMAN POSTURE DETECTION WITH THE HELP OF LINEAR SVM AND HOG FEATURE ON GPU
International Journal of Computer Engineering and Applications, Volume IX, Issue VII, July 2015 HUMAN POSTURE DETECTION WITH THE HELP OF LINEAR SVM AND HOG FEATURE ON GPU Vaibhav P. Janbandhu 1, Sanjay
More informationSkin and Face Detection
Skin and Face Detection Linda Shapiro EE/CSE 576 1 What s Coming 1. Review of Bakic flesh detector 2. Fleck and Forsyth flesh detector 3. Details of Rowley face detector 4. Review of the basic AdaBoost
More informationClassifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped
More informationA Hybrid Face Detection System using combination of Appearance-based and Feature-based methods
IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.5, May 2009 181 A Hybrid Face Detection System using combination of Appearance-based and Feature-based methods Zahra Sadri
More informationAn Implementation on Histogram of Oriented Gradients for Human Detection
An Implementation on Histogram of Oriented Gradients for Human Detection Cansın Yıldız Dept. of Computer Engineering Bilkent University Ankara,Turkey cansin@cs.bilkent.edu.tr Abstract I implemented a Histogram
More informationHand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction
Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction Chieh-Chih Wang and Ko-Chih Wang Department of Computer Science and Information Engineering Graduate Institute of Networking
More informationFace Detection and Alignment. Prof. Xin Yang HUST
Face Detection and Alignment Prof. Xin Yang HUST Many slides adapted from P. Viola Face detection Face detection Basic idea: slide a window across image and evaluate a face model at every location Challenges
More informationHuman-Robot Interaction
Human-Robot Interaction Elective in Artificial Intelligence Lecture 6 Visual Perception Luca Iocchi DIAG, Sapienza University of Rome, Italy With contributions from D. D. Bloisi and A. Youssef Visual Perception
More informationTemplates and Background Subtraction. Prof. D. Stricker Doz. G. Bleser
Templates and Background Subtraction Prof. D. Stricker Doz. G. Bleser 1 Surveillance Video: Example of multiple people tracking http://www.youtube.com/watch?v=inqv34bchem&feature=player_embedded As for
More informationThe Population Density of Early Warning System Based On Video Image
International Journal of Research in Engineering and Science (IJRES) ISSN (Online): 2320-9364, ISSN (Print): 2320-9356 Volume 4 Issue 4 ǁ April. 2016 ǁ PP.32-37 The Population Density of Early Warning
More informationGeneric Object-Face detection
Generic Object-Face detection Jana Kosecka Many slides adapted from P. Viola, K. Grauman, S. Lazebnik and many others Today Window-based generic object detection basic pipeline boosting classifiers face
More informationCategory vs. instance recognition
Category vs. instance recognition Category: Find all the people Find all the buildings Often within a single image Often sliding window Instance: Is this face James? Find this specific famous building
More informationLarge-Scale Traffic Sign Recognition based on Local Features and Color Segmentation
Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,
More informationAnnouncements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14
Announcements Computer Vision I CSE 152 Lecture 14 Homework 3 is due May 18, 11:59 PM Reading: Chapter 15: Learning to Classify Chapter 16: Classifying Images Chapter 17: Detecting Objects in Images Given
More informationStudy of Viola-Jones Real Time Face Detector
Study of Viola-Jones Real Time Face Detector Kaiqi Cen cenkaiqi@gmail.com Abstract Face detection has been one of the most studied topics in computer vision literature. Given an arbitrary image the goal
More informationPEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE
PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE Hongyu Liang, Jinchen Wu, and Kaiqi Huang National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Science
More informationPedestrian Detection with Improved LBP and Hog Algorithm
Open Access Library Journal 2018, Volume 5, e4573 ISSN Online: 2333-9721 ISSN Print: 2333-9705 Pedestrian Detection with Improved LBP and Hog Algorithm Wei Zhou, Suyun Luo Automotive Engineering College,
More informationHISTOGRAMS OF ORIENTATIO N GRADIENTS
HISTOGRAMS OF ORIENTATIO N GRADIENTS Histograms of Orientation Gradients Objective: object recognition Basic idea Local shape information often well described by the distribution of intensity gradients
More informationRecap Image Classification with Bags of Local Features
Recap Image Classification with Bags of Local Features Bag of Feature models were the state of the art for image classification for a decade BoF may still be the state of the art for instance retrieval
More informationLearning Visual Semantics: Models, Massive Computation, and Innovative Applications
Learning Visual Semantics: Models, Massive Computation, and Innovative Applications Part II: Visual Features and Representations Liangliang Cao, IBM Watson Research Center Evolvement of Visual Features
More informationBayes Risk. Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Discriminative vs Generative Models. Loss functions in classifiers
Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Examine each window of an image Classify object class within each window based on a training set images Example: A Classification Problem Categorize
More informationObject Category Detection: Sliding Windows
03/18/10 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Goal: Detect all instances of objects Influential Works in Detection Sung-Poggio
More informationImplementation of a Pedestrian Detection Device based on CENTRIST for an Embedded Environment
, pp.123-127 http://dx.doi.org/10.14257/astl.2014.46.29 Implementation of a Pedestrian Detection Device based on CENTRIST for an Embedded Environment Yun-Seop Hwang 1, Jae-Chang Kwak 2, Kwang-Yeob Lee
More informationCS231A Section 6: Problem Set 3
CS231A Section 6: Problem Set 3 Kevin Wong Review 6 -! 1 11/09/2012 Announcements PS3 Due 2:15pm Tuesday, Nov 13 Extra Office Hours: Friday 6 8pm Huang Common Area, Basement Level. Review 6 -! 2 Topics
More informationExploiting scene constraints to improve object detection algorithms for industrial applications
Exploiting scene constraints to improve object detection algorithms for industrial applications PhD Public Defense Steven Puttemans Promotor: Toon Goedemé 2 A general introduction Object detection? Help
More informationClassifiers for Recognition Reading: Chapter 22 (skip 22.3)
Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Examine each window of an image Classify object class within each window based on a training set images Slide credits for this chapter: Frank
More informationBoosting Sex Identification Performance
Boosting Sex Identification Performance Shumeet Baluja, 2 Henry Rowley shumeet@google.com har@google.com Google, Inc. 2 Carnegie Mellon University, Computer Science Department Abstract This paper presents
More informationHuman detections using Beagle board-xm
Human detections using Beagle board-xm CHANDAN KUMAR 1 V. AJAY KUMAR 2 R. MURALI 3 1 (M. TECH STUDENT, EMBEDDED SYSTEMS, DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING, VIJAYA KRISHNA INSTITUTE
More informationPeople detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features
People detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features M. Siala 1, N. Khlifa 1, F. Bremond 2, K. Hamrouni 1 1. Research Unit in Signal Processing, Image Processing
More informationCategorization by Learning and Combining Object Parts
Categorization by Learning and Combining Object Parts Bernd Heisele yz Thomas Serre y Massimiliano Pontil x Thomas Vetter Λ Tomaso Poggio y y Center for Biological and Computational Learning, M.I.T., Cambridge,
More informationFROM VIDEO STREAMS IN THE WILD
SEMANTIC FACE SEGMENTATION FROM VIDEO STREAMS IN THE WILD Student: Deividas Skiparis Supervisors: Pascal Landry (Imersivo) and Sergio Escalera (UB) In Collaboration With Imersivo SL 7/5/2017 2 Problem
More informationDetection of a Single Hand Shape in the Foreground of Still Images
CS229 Project Final Report Detection of a Single Hand Shape in the Foreground of Still Images Toan Tran (dtoan@stanford.edu) 1. Introduction This paper is about an image detection system that can detect
More informationBeyond Bags of features Spatial information & Shape models
Beyond Bags of features Spatial information & Shape models Jana Kosecka Many slides adapted from S. Lazebnik, FeiFei Li, Rob Fergus, and Antonio Torralba Detection, recognition (so far )! Bags of features
More informationFast Human Detection Using a Cascade of Histograms of Oriented Gradients
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Fast Human Detection Using a Cascade of Histograms of Oriented Gradients Qiang Zhu, Shai Avidan, Mei-Chen Yeh, Kwang-Ting Cheng TR26-68 June
More informationDesigning Applications that See Lecture 7: Object Recognition
stanford hci group / cs377s Designing Applications that See Lecture 7: Object Recognition Dan Maynes-Aminzade 29 January 2008 Designing Applications that See http://cs377s.stanford.edu Reminders Pick up
More informationRapid Face and Object Detection in ios
Rapid Face and Object Detection in ios Instructor - Simon Lucey 16-623 - Designing Computer Vision Apps Today Background on Rapid Face Detection Facial Feature Detection in ios. Dlib for Object Detection.
More informationProgress Report of Final Year Project
Progress Report of Final Year Project Project Title: Design and implement a face-tracking engine for video William O Grady 08339937 Electronic and Computer Engineering, College of Engineering and Informatics,
More informationClassification of objects from Video Data (Group 30)
Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time
More informationObject and Class Recognition I:
Object and Class Recognition I: Object Recognition Lectures 10 Sources ICCV 2005 short courses Li Fei-Fei (UIUC), Rob Fergus (Oxford-MIT), Antonio Torralba (MIT) http://people.csail.mit.edu/torralba/iccv2005
More informationA Texture-based Method for Detecting Moving Objects
A Texture-based Method for Detecting Moving Objects Marko Heikkilä University of Oulu Machine Vision Group FINLAND Introduction The moving object detection, also called as background subtraction, is one
More informationDeformable Part Models
CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones
More informationDetecting and Reading Text in Natural Scenes
October 19, 2004 X. Chen, A. L. Yuille Outline Outline Goals Example Main Ideas Results Goals Outline Goals Example Main Ideas Results Given an image of an outdoor scene, goals are to: Identify regions
More informationPrincipal Component Analysis and Neural Network Based Face Recognition
Principal Component Analysis and Neural Network Based Face Recognition Qing Jiang Mailbox Abstract People in computer vision and pattern recognition have been working on automatic recognition of human
More informationFace detection and recognition. Many slides adapted from K. Grauman and D. Lowe
Face detection and recognition Many slides adapted from K. Grauman and D. Lowe Face detection and recognition Detection Recognition Sally History Early face recognition systems: based on features and distances
More informationLecture 10 Detectors and descriptors
Lecture 10 Detectors and descriptors Properties of detectors Edge detectors Harris DoG Properties of detectors SIFT Shape context Silvio Savarese Lecture 10-26-Feb-14 From the 3D to 2D & vice versa P =
More informationTri-modal Human Body Segmentation
Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4
More informationLocal Image Features
Local Image Features Ali Borji UWM Many slides from James Hayes, Derek Hoiem and Grauman&Leibe 2008 AAAI Tutorial Overview of Keypoint Matching 1. Find a set of distinctive key- points A 1 A 2 A 3 B 3
More informationFace Detection using Hierarchical SVM
Face Detection using Hierarchical SVM ECE 795 Pattern Recognition Christos Kyrkou Fall Semester 2010 1. Introduction Face detection in video is the process of detecting and classifying small images extracted
More informationVisual Object Recognition
Visual Object Recognition Lecture 3: Descriptors Per-Erik Forssén, docent Computer Vision Laboratory Department of Electrical Engineering Linköping University 2015 2014 Per-Erik Forssén Lecture 3: Descriptors
More informationSURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image
SURF CSED441:Introduction to Computer Vision (2015S) Lecture6: SURF and HOG Bohyung Han CSE, POSTECH bhhan@postech.ac.kr Speed Up Robust Features (SURF) Simplified version of SIFT Faster computation but
More informationVisual Detection and Species Classification of Orchid Flowers
14-22 MVA2015 IAPR International Conference on Machine Vision Applications, May 18-22, 2015, Tokyo, JAPAN Visual Detection and Species Classification of Orchid Flowers Steven Puttemans & Toon Goedemé KU
More informationDetecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds
9 1th International Conference on Document Analysis and Recognition Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds Weihan Sun, Koichi Kise Graduate School
More informationFace and Nose Detection in Digital Images using Local Binary Patterns
Face and Nose Detection in Digital Images using Local Binary Patterns Stanko Kružić Post-graduate student University of Split, Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture
More informationSEMANTIC SEGMENTATION AS IMAGE REPRESENTATION FOR SCENE RECOGNITION. Ahmed Bassiouny, Motaz El-Saban. Microsoft Advanced Technology Labs, Cairo, Egypt
SEMANTIC SEGMENTATION AS IMAGE REPRESENTATION FOR SCENE RECOGNITION Ahmed Bassiouny, Motaz El-Saban Microsoft Advanced Technology Labs, Cairo, Egypt ABSTRACT We introduce a novel approach towards scene
More informationCategory-level localization
Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object
More informationTraffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers
Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers A. Salhi, B. Minaoui, M. Fakir, H. Chakib, H. Grimech Faculty of science and Technology Sultan Moulay Slimane
More informationSelection of Scale-Invariant Parts for Object Class Recognition
Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract
More informationRTSP Based Video Surveillance System Using IP Camera for Human Detection in OpenCV
RTSP Based Video Surveillance System Using IP Camera for Human Detection in OpenCV K. Bapayya 1,K. Sujitha 2, Mr. SD. Akthar Basha 3 1 Asst. Professor, Department of ECE, CVR College of Engineering, Hyderabad-501510
More informationHuman detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah
Human detection using histogram of oriented gradients Srikumar Ramalingam School of Computing University of Utah Reference Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,
More informationComputer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia
Application Object Detection Using Histogram of Oriented Gradient For Artificial Intelegence System Module of Nao Robot (Control System Laboratory (LSKK) Bandung Institute of Technology) A K Saputra 1.,
More informationImage Analysis. Window-based face detection: The Viola-Jones algorithm. iphoto decides that this is a face. It can be trained to recognize pets!
Image Analysis 2 Face detection and recognition Window-based face detection: The Viola-Jones algorithm Christophoros Nikou cnikou@cs.uoi.gr Images taken from: D. Forsyth and J. Ponce. Computer Vision:
More informationFace/Flesh Detection and Face Recognition
Face/Flesh Detection and Face Recognition Linda Shapiro EE/CSE 576 1 What s Coming 1. Review of Bakic flesh detector 2. Fleck and Forsyth flesh detector 3. Details of Rowley face detector 4. The Viola
More informationSUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS
SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS Cognitive Robotics Original: David G. Lowe, 004 Summary: Coen van Leeuwen, s1460919 Abstract: This article presents a method to extract
More informationMULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION
MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION Panca Mudjirahardjo, Rahmadwati, Nanang Sulistiyanto and R. Arief Setyawan Department of Electrical Engineering, Faculty of
More informationAn Object Detection System using Image Reconstruction with PCA
An Object Detection System using Image Reconstruction with PCA Luis Malagón-Borja and Olac Fuentes Instituto Nacional de Astrofísica Óptica y Electrónica, Puebla, 72840 Mexico jmb@ccc.inaoep.mx, fuentes@inaoep.mx
More informationMultiple Kernel Learning for Emotion Recognition in the Wild
Multiple Kernel Learning for Emotion Recognition in the Wild Karan Sikka, Karmen Dykstra, Suchitra Sathyanarayana, Gwen Littlewort and Marian S. Bartlett Machine Perception Laboratory UCSD EmotiW Challenge,
More informationImage Processing Pipeline for Facial Expression Recognition under Variable Lighting
Image Processing Pipeline for Facial Expression Recognition under Variable Lighting Ralph Ma, Amr Mohamed ralphma@stanford.edu, amr1@stanford.edu Abstract Much research has been done in the field of automated
More informationAn Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012
An Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012, 19.01.2012 INSTITUTE FOR ANTHROPOMATICS, FACIAL IMAGE PROCESSING AND ANALYSIS YIG University of the State of Baden-Wuerttemberg
More informationHuman Detection and Tracking for Video Surveillance: A Cognitive Science Approach
Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach Vandit Gajjar gajjar.vandit.381@ldce.ac.in Ayesha Gurnani gurnani.ayesha.52@ldce.ac.in Yash Khandhediya khandhediya.yash.364@ldce.ac.in
More informationLOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS
8th International DAAAM Baltic Conference "INDUSTRIAL ENGINEERING - 19-21 April 2012, Tallinn, Estonia LOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS Shvarts, D. & Tamre, M. Abstract: The
More information