Mobile Human Detection Systems based on Sliding Windows Approach-A Review

Size: px
Start display at page:

Download "Mobile Human Detection Systems based on Sliding Windows Approach-A Review"

Transcription

1 Mobile Human Detection Systems based on Sliding Windows Approach-A Review Seminar: Mobile Human detection systems Njieutcheu Tassi cedrique Rovile Department of Computer Engineering University of Heidelberg Germany, Mannheim Abstract In this paper a comprehensive review of human detection based on sliding windows technique is presented. The image data generated by a fixed camera mounted on a mobile agent is densely sampled, generating a large number of detection windows, which are separately presented to a linear Support Vector Machine (SVM) for classification. The classifier input vectors are Histogram of Oriented Gradients (HOG) features extracted from each detection window and used as human descriptor. In order to reduce the high computational cost due to the process of a large number of detection windows, magnitude and entropy filters are employed to discard unlikely windows or parts of the image with no relevant information to the human detection task. The experiments performed show that a high number of detection windows can be discarded by the proposed filters. But, a slight reduction on recall is observed, due to the fact that some windows cover only parts of a person, diminishing the accuracy. Keywords Sliding window; HOG features; SVM Classifier; Magnitde and Entropy Filter. I. INTRODUCTION Over the recent years, a large number of cameras available generate a huge amount of data, requiring the application processing the data to understand the scene. A useful question to ask is whether the generated image data contains one or more instances of a certain object: a car, a dog, a person, and so forth. Algorithms that answer this question in case of a human are called human detectors and are crucial for diverse application areas including pedestrian detection, video surveillance and monitoring, person tracking, action and activity recognition, person re-identification and human machine interaction. Therefore, human detection has become one of the most active and attractive research topics in the area of computer vision and pattern recognition. In this study, we focus on detecting humans and do not consider recognition of their activities. This paper is organized as follow: In section II, we formulate the human detection problem and present our solution approach. We briefly discuss related work in section III and give a detailed description of each stage of the detection process chain in section IV. Experiments results are evaluated in section V and in section VI, we summarize the main conclusion. II. PROBLEM FORMULATION AND SOLUTION APPROACH A. Problem formulation Detecting humans in images is one of the important challenges in computer vision. This is due to factors such as the large variation of appearance, changes in illumination, low quality of the acquired data and the different size of the human in the image. At that, human bodies are non-rigid and highly articulated. This implies that we have to deal with different poses and postures. Additionally, it is not possible to take advantage of specific textures and colour information due to the variability of worn cloths. However, besides all these challenges, low computational cost, high detection rate and reliable detection are needed to fulfill the requirement of most applications. B. Solution approach Here, an overview of our single detection window method is given. A flow chart of the human detection process is illustrated in Figure 1. The idea is to generate a set of images (S) with different resolutions based on the down sampling method. All generated images are then densely scanned with a sliding window, ensuring that all humans are covered. The scanning stage provides a set of windows (W) with a large number of detection windows. In order to reduce the number of detection windows, a filtering stage based on magnitude or entropy filter is applied. This stage reduces the search space and keep only potential Regions Of Interest (ROI) to be presented to the classifier. The output of the filtering stage is a set of selected windows (M), with M << W. Each selected windows is separately classified. Therefore, Histogram of Oriented Gradient (HOG) features vector (V) is extracted over each window and passed to the linear Support Vector Machine (SVM) as input. The linear SVM classifies each window as belongs to the human or non-human class, which are members of the set Y.

2 Figure 1: Overview of proposed methods. III. RELATED WORK Several human detection approaches have been proposed in the past years to address the referred problems. The key purpose of this paper is to provide a comprehensive review on studies conducted in the area of human detection based on sliding windows. The first need is a feature set that allows the human form to be discriminated cleanly. Therefore, the Histogram of Oriented Gradient (HOG) feature presented by Dalal and Triggs [1] is used as human descriptor. The development of methods reducing the computational cost is also desirable. One way of achieving that is to apply a filtering stage before the features extraction stage as proposed by Artur et al. [2]. This study is made based on the aforementioned papers. IV. METHODS This section gives details of our methods and highlights the need of each proposed method. A. Downsampling In order to remain scale-invariant, human detection algorithms utilize the rescaling of the input image frame. Furthermore, applying this technique allow us to deal with different human heights in the image data due to their distance to the camera. By decreasing the sampling rate with a fixed scale factor k, the number of samples that represent the original signal and the size of the input image frame are reduced. However, when a signal is down sampled in frequency domain, the high frequency portion of the signal will be aliased with the low frequency portion. In order to avoid this, the original image needs to be preprocessed with an alias (low pass) filter to remove the high frequency portion, so that aliasing will not occur. The process of image down sampling is illustrated in Figure 2, where I(m,n) denotes the input image data matrix, f(m,n) preprocessed image data matrix, d(m,n) down sampled image data matrix, m number of rows and n number of columns of the image data matrix. Keep in mind that the size of d(m,n) is less than I(m,n). Successively down sampling the input image and using the output as input for new images with low resolution, yields to a scale pyramid, which builds the set of images that will be segmented in the next step as explained below. B. Image segmentation The first step of any detector based on sliding window consists of generating a set of detection window based on the sliding windows approach, which is widely used in object recognition tasks. The sliding windows then searches for the corresponding humans in all scales of the image by sampling all images from the scale pyramid with a moving window of variable or fixed size according to the requirements of the application under consideration. For the human detection task, a moving window of fixed size (128x64 pixels) is used to densely scan the image as proposed by Dallal and Triggs [1], ensuring that all humans are covered. As these windows are generated in a wide range of scales and strides, we have a set of overlapping windows presenting redundancy, which highlights the need of the next method. C. Detection windows Filtering In order to reduce the amount of data processes by the human detection system, a filtering stage is applied. Here, candidate windows are presented to an optional filter, which selects a subset of generated detection windows that will be presented to a classifier and discard the remaining windows. The filtering stage does not perform any type of features extraction processing on the discarded windows and therefore provides a computational cost reduction. The following filtering approaches are used in our evaluation, the entropy and magnitude filters [2]. 1) Entropy filter The main idea behind this filter is to extract histogram of gradient orientation over each detection window. Windows with histogram presenting low entropy are rejected and those with high entropy are selected for further processing. The flow diagram of this filter is illustrated in Figure 3. The threshold value is experimentally set. Figure 3: Flow diagram of the entropy filter. Figure 2: Image down sampling flow.

3 2) Magnitude Filter This filter computes the average of the gradient magnitude within a detection window and uses it as a cue for selection as illustrated in Figure 4. The threshold value of this filter is different from the threshold value of the entropy filter and is also experimentally set. Hence, the gradient magnitude is a feature used to create Histogram of Oriented Gradient (HOG) features. Therefore, after this filtering stage, there is no extra computational cost. X R C R C R C Y R C R C R C The gradient is then transformed to polar coordinates, with the angle constrained between 0 and 180 degrees. The magnitude µ and positive orientation θ are obtained as stated in equation (3) and (4), where tan -1 2 is the four quadrant inverse tangent, which yields values between - and +. µ X Y θ tan X Y mod Figure 4: Flow diagram of the magnitude filter. D. Features Extraction However, before presenting the selected detection windows to a classifier, the first need is a feature set that allows the human form to be discriminated cleanly, even in cluttered backgrounds under different illumination. Nevertheless, a robust feature makes the classifier s job as easy as possible. Therefore, Histogram of Oriented Gradient (HOG) is used as human descriptor. Details of the HOG extraction step and effects of the parameters choices on detector performance are covered by Dalal and Triggs [1] in their work. Though, some details of the computation of each step of the feature processing chain may be found in the paper of Tomasi [3]. The following description of the features extraction chain is fleshed out using parameter values from the paper quoted above, keeping in mind that different imaging situations may need different parameters values. The input is assumed to be a window I from the set of selected detection windows (M). 1) Gradient computation Detector performance is sensitive by the way in which gradients are computed. Here, we approximate the two components I X and I Y of the gradient at pixel intensity I(R,C) of the sub-image represented by a selected detection window I by central differences as stated in equations (1) and (2), where R denotes the row and C column index of the corresponding pixel of the image data matrix. For colour images, we calculate seperate gradients for each of the three colour channel and take the one with the largest gradient magnitude as the pixel s gradient within its orientation. 2) Cell orientation histograms This step is the fundamental nonlinearity of the descriptor. The gradient image is divided into adjacent, non-overlapping cells of CxC pixels (C = 9). In each cell, histogram of gradient magnitude is computed based on the orientation of the gradient element centred on it. Hence, each pixel within a cell calculates a weighted vote. The votes are accumulated into B orientation bins (B = 9). The B orientation bins are evenly spaced over 0 to 180 (unsigned gradient). The vote is a function of the gradient magnitude representing soft presence/absence of an edge at the pixel. To reduce aliasing, votes are interpolated bilinearly between the neighbouring bin centres in both orientations and positions. Specifically, the bins are numbered 0 through B-1 and have width W = (180 /B). The bin with index i has boundaries [w i, w i+1 ] and center c i = w i+1/2. A pixel with magnitude µ and orientation θ contributes a vote v j to bin with index j as stated in equation (5) and (6). The resulting cell histogram is a vector of B non negative elements. j µ C j+1 - θ W j θ mod 3) Block normalization Due to local variations in illumination and foregroundbackground contrast, cells are grouped into overlapping blocks of C x C cells (C = 2) and each block is separately contrast normalized, in order to reduce the effect of changes in contrast between images of the same object and preserve some information carry by gradient magnitude in cells within the same block. To achieve that, the four cell histograms in each block is carried into a single block feature b and normalized by its Euclidean norm as stated in equation (7), where ɛ is a small positive constant that prevents division by zero. The final

4 features vector is then the vector of all components of the normalized cell responses from all of the blocks within a selected detection window. The HOG features vector h can also be normalized according to equation (8), in order to make the features vector independent to overall sub-image contrast. b = (b / ( (II b II 2 + ɛ)) (7) h = (h / ( (II h II 2 + ɛ)) (8) E. Features Classification After the features extraction stage, a linear Support Vector Machine (SVM) is used, in order to classifier features vector of each selected detection window. The classification problem is formulated as finding the mapping of each features vector as belonging to the human or non-human class. The linear SVM algorithm learns by example to assign labels to the features vector. Therefore, it was trained in offline in a supervised manner with data from the INRIA person dataset. Figure 5: Tradeoff between scale factor and number of windows generated for a 640x480 image. V. EXPERIMENTAL EVALUATION In this section, we evaluate the effect of the scaling factor on detector performance. In addition, we evaluate several aspects of the proposed filters and present their results. It goes without saying that the following plots were made based on experiments performed by Dallal and Triggs [1] and Artur et al. [2]. A. Scaling factor evaluation In their first experiment, Artur et al. [2] evaluate the impact of the scaling factor on the number of detection windows generated, as well as the miss rate obtained by the detector. As showed in Figure 5 the number of detection windows grows quickly, while diminishing the scale factor. For instance, decreasing the scale factor k from 1,15 to 1,01, increase the number of detection windows like about fifteen time. Indeed, decreasing the scaling factor also decreases the miss rate achieved by the detector at 10 0 False Positive Per Image (FPPI) as presented in Figure 6. This result indicates that denser sampling yields to a lower miss rate with a large number of generated detection windows. Therefore, enabling the usage of small scaling factor, implies the need of a filtering stage, in order to discard a large number of generated detection windows and reduce the computational cost of the features extraction stage. Figure 6: Miss-rate of detector at 10 0 False Positive Per Image (FPPI) with different scale factor. B. Windows filtering evaluation In their experiment Artur et al. [2] also evaluate the results achieved by the entropy and magnitude filters by using a scale factor k = 1,15 and assuming that an ideal detector was to be used after the filtering stage. Both filters was able to reject nearly 30% of the generated detection windows, while preserving approximately the same recall rate as obtained without detection windows rejection as showed in Figure 8. As presented in Figure 7, using the entropy filter and increasing the percentage of discarded windows from round 30% to approximately 50%, increases the miss rate achieved by the detector at 10 0 False Positive Per Image (FPPI). But, this remains nearly constant, when applying the magnitude

5 filter. After analyze of Figure 7 and Figure 8, it comes out that the best result is achieved with the magnitude filter, which is able to discard about 54% of detection windows, with a slight increase on miss rate at 10 0 False Positive Per Image (FPPI) and reduction on recall. filter was not specified and make an evaluation of the detector performance difficult. In Figure 10, the detector performance obtained by Dalal and Triggs [1] without a filtering stage is presented. Owing to different metrics used to plot the performance of the presented detectors, a comparison between both is hard. Nevertheless, the detector with filtering stage performed poorly, when evaluating the selected detection windows, due to the random nature of the filter. Some selected windows might be slightly dislocated from a person s body, which needs to be fixed before presenting them to the linear Support Vector Machine (SVM) classifier. Figure 7: Miss-rate at 10 0 False Positive Per Image (FPPI) by applying a filter on the detector. Figure 9: Performance obtained by detector using magnitude or entropy filter. Figure 8: Relationship between rejection percentage and recall achieved by filters. C. Detectors performance evaluation We will like to compare the performance of the detector with filtering stage and without filtering stage. Therefore, the detection miss rate at 10 0 False Positive Per Image (FPPI) results obtained by Artur et al. [2] using a magnitude or entropy filter were presented in Figure 9. The number of detection windows discarded by the magnitude or entropy Figure 10: Performance obtained by detector without filter.

6 VI. CONCLUSION AND FUTURE WORK This study proposed a review of mobile human detection systems based on sliding window approach, where the image data captures from a fixed camera mounted on a mobile agent is densely scanned in all scale and location, in order to cover all humans in the image. As experimentally showed, using a small scaling factor improves detector performance, but increase the number of detection windows, which also increases the computational cost of the Histogram of Oriented Gradient (HOG) features extraction process. Hence, a filtering stage based on magnitude or entropy filter is used to reduce the number of generated windows, yielding to a computational cost reduction with a slight reduction on recall. Though, after performing a quantitative analysis on the number of rejected windows achieved by both filters and the influence of this on recall, it turned out that the magnitude is better than the entropy filter. Compared to the application of a detector method without a filtering stage, experimental evaluation showed that the detector with a filtering stage performs poorly, since some positives selected windows were wrongly classified by the linear Support Vector Machine (SVM), due to a slight dislocation of the selected windows from person s body. As future work we intend to evaluate the detector performance, when adjusting the detection window to person location. We also intend to apply a filtering stage based on motion detection and employ the proposed filters after classification, in order to remove possible false positives generated by the linear SVM. References [1] N. D. a. B. Triggs, "Histograms of Oriented Gradients for Human Detection," INRIA Rhône-Alps, 655 avenue de l Europe, Montbonnot 38334, France, [2] V. H. C. d. M. W. R. S. Artur Jordao Lima Correia, A Study of Filtering Approaches for Sliding Window Pedestrian Detection, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil, [3] C. Tomasi, Histograms of Oriented Gradients.

Human detection based on Sliding Window Approach

Human detection based on Sliding Window Approach Human detection based on Sliding Window Approach Heidelberg University Institute of Computer Engeneering Seminar: Mobile Human Detection Systems Name: Njieutcheu Tassi Cedrique Rovile Matr.Nr: 3348513

More information

Histograms of Oriented Gradients

Histograms of Oriented Gradients Histograms of Oriented Gradients Carlo Tomasi September 18, 2017 A useful question to ask of an image is whether it contains one or more instances of a certain object: a person, a face, a car, and so forth.

More information

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah Human detection using histogram of oriented gradients Srikumar Ramalingam School of Computing University of Utah Reference Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

More information

Human Motion Detection and Tracking for Video Surveillance

Human Motion Detection and Tracking for Video Surveillance Human Motion Detection and Tracking for Video Surveillance Prithviraj Banerjee and Somnath Sengupta Department of Electronics and Electrical Communication Engineering Indian Institute of Technology, Kharagpur,

More information

Histograms of Oriented Gradients for Human Detection p. 1/1

Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs INRIA Rhône-Alpes Grenoble, France Funding: acemedia, LAVA,

More information

Histogram of Oriented Gradients for Human Detection

Histogram of Oriented Gradients for Human Detection Histogram of Oriented Gradients for Human Detection Article by Navneet Dalal and Bill Triggs All images in presentation is taken from article Presentation by Inge Edward Halsaunet Introduction What: Detect

More information

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm Group 1: Mina A. Makar Stanford University mamakar@stanford.edu Abstract In this report, we investigate the application of the Scale-Invariant

More information

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia Application Object Detection Using Histogram of Oriented Gradient For Artificial Intelegence System Module of Nao Robot (Control System Laboratory (LSKK) Bandung Institute of Technology) A K Saputra 1.,

More information

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION Panca Mudjirahardjo, Rahmadwati, Nanang Sulistiyanto and R. Arief Setyawan Department of Electrical Engineering, Faculty of

More information

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking Feature descriptors Alain Pagani Prof. Didier Stricker Computer Vision: Object and People Tracking 1 Overview Previous lectures: Feature extraction Today: Gradiant/edge Points (Kanade-Tomasi + Harris)

More information

Object Detection Design challenges

Object Detection Design challenges Object Detection Design challenges How to efficiently search for likely objects Even simple models require searching hundreds of thousands of positions and scales Feature design and scoring How should

More information

Category vs. instance recognition

Category vs. instance recognition Category vs. instance recognition Category: Find all the people Find all the buildings Often within a single image Often sliding window Instance: Is this face James? Find this specific famous building

More information

https://en.wikipedia.org/wiki/the_dress Recap: Viola-Jones sliding window detector Fast detection through two mechanisms Quickly eliminate unlikely windows Use features that are fast to compute Viola

More information

2D Image Processing Feature Descriptors

2D Image Processing Feature Descriptors 2D Image Processing Feature Descriptors Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Overview

More information

Classification of objects from Video Data (Group 30)

Classification of objects from Video Data (Group 30) Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time

More information

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image SURF CSED441:Introduction to Computer Vision (2015S) Lecture6: SURF and HOG Bohyung Han CSE, POSTECH bhhan@postech.ac.kr Speed Up Robust Features (SURF) Simplified version of SIFT Faster computation but

More information

Object Category Detection: Sliding Windows

Object Category Detection: Sliding Windows 04/10/12 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical

More information

An Implementation on Histogram of Oriented Gradients for Human Detection

An Implementation on Histogram of Oriented Gradients for Human Detection An Implementation on Histogram of Oriented Gradients for Human Detection Cansın Yıldız Dept. of Computer Engineering Bilkent University Ankara,Turkey cansin@cs.bilkent.edu.tr Abstract I implemented a Histogram

More information

PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE

PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE Hongyu Liang, Jinchen Wu, and Kaiqi Huang National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Science

More information

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab Person Detection in Images using HoG + Gentleboost Rahul Rajan June 1st July 15th CMU Q Robotics Lab 1 Introduction One of the goals of computer vision Object class detection car, animal, humans Human

More information

Classification and Detection in Images. D.A. Forsyth

Classification and Detection in Images. D.A. Forsyth Classification and Detection in Images D.A. Forsyth Classifying Images Motivating problems detecting explicit images classifying materials classifying scenes Strategy build appropriate image features train

More information

Category-level localization

Category-level localization Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object

More information

Efficient Acquisition of Human Existence Priors from Motion Trajectories

Efficient Acquisition of Human Existence Priors from Motion Trajectories Efficient Acquisition of Human Existence Priors from Motion Trajectories Hitoshi Habe Hidehito Nakagawa Masatsugu Kidode Graduate School of Information Science, Nara Institute of Science and Technology

More information

Research on Robust Local Feature Extraction Method for Human Detection

Research on Robust Local Feature Extraction Method for Human Detection Waseda University Doctoral Dissertation Research on Robust Local Feature Extraction Method for Human Detection TANG, Shaopeng Graduate School of Information, Production and Systems Waseda University Feb.

More information

A Study of Filtering Approaches for Sliding Window Pedestrian Detection

A Study of Filtering Approaches for Sliding Window Pedestrian Detection A Study of Filtering Approaches for Sliding Window Pedestrian Detection Artur Jorda o Lima Correia, Victor Hugo Cunha de Melo, William Robson Schwartz Department of Computer Science, Universidade Federal

More information

Tri-modal Human Body Segmentation

Tri-modal Human Body Segmentation Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4

More information

HOG-based Pedestriant Detector Training

HOG-based Pedestriant Detector Training HOG-based Pedestriant Detector Training evs embedded Vision Systems Srl c/o Computer Science Park, Strada Le Grazie, 15 Verona- Italy http: // www. embeddedvisionsystems. it Abstract This paper describes

More information

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,

More information

Human detections using Beagle board-xm

Human detections using Beagle board-xm Human detections using Beagle board-xm CHANDAN KUMAR 1 V. AJAY KUMAR 2 R. MURALI 3 1 (M. TECH STUDENT, EMBEDDED SYSTEMS, DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING, VIJAYA KRISHNA INSTITUTE

More information

Seminar Heidelberg University

Seminar Heidelberg University Seminar Heidelberg University Mobile Human Detection Systems Pedestrian Detection by Stereo Vision on Mobile Robots Philip Mayer Matrikelnummer: 3300646 Motivation Fig.1: Pedestrians Within Bounding Box

More information

Object detection using non-redundant local Binary Patterns

Object detection using non-redundant local Binary Patterns University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Object detection using non-redundant local Binary Patterns Duc Thanh

More information

FAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO

FAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO FAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO Makoto Arie, Masatoshi Shibata, Kenji Terabayashi, Alessandro Moro and Kazunori Umeda Course

More information

Computer Vision for HCI. Topics of This Lecture

Computer Vision for HCI. Topics of This Lecture Computer Vision for HCI Interest Points Topics of This Lecture Local Invariant Features Motivation Requirements, Invariances Keypoint Localization Features from Accelerated Segment Test (FAST) Harris Shi-Tomasi

More information

HISTOGRAMS OF ORIENTATIO N GRADIENTS

HISTOGRAMS OF ORIENTATIO N GRADIENTS HISTOGRAMS OF ORIENTATIO N GRADIENTS Histograms of Orientation Gradients Objective: object recognition Basic idea Local shape information often well described by the distribution of intensity gradients

More information

Pedestrian Detection and Tracking in Images and Videos

Pedestrian Detection and Tracking in Images and Videos Pedestrian Detection and Tracking in Images and Videos Azar Fazel Stanford University azarf@stanford.edu Viet Vo Stanford University vtvo@stanford.edu Abstract The increase in population density and accessibility

More information

CS 231A Computer Vision (Fall 2012) Problem Set 3

CS 231A Computer Vision (Fall 2012) Problem Set 3 CS 231A Computer Vision (Fall 2012) Problem Set 3 Due: Nov. 13 th, 2012 (2:15pm) 1 Probabilistic Recursion for Tracking (20 points) In this problem you will derive a method for tracking a point of interest

More information

Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection

Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection Hyunghoon Cho and David Wu December 10, 2010 1 Introduction Given its performance in recent years' PASCAL Visual

More information

Object Recognition II

Object Recognition II Object Recognition II Linda Shapiro EE/CSE 576 with CNN slides from Ross Girshick 1 Outline Object detection the task, evaluation, datasets Convolutional Neural Networks (CNNs) overview and history Region-based

More information

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS Cognitive Robotics Original: David G. Lowe, 004 Summary: Coen van Leeuwen, s1460919 Abstract: This article presents a method to extract

More information

Face and Nose Detection in Digital Images using Local Binary Patterns

Face and Nose Detection in Digital Images using Local Binary Patterns Face and Nose Detection in Digital Images using Local Binary Patterns Stanko Kružić Post-graduate student University of Split, Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture

More information

Deformable Part Models

Deformable Part Models CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones

More information

Scale Invariant Feature Transform

Scale Invariant Feature Transform Scale Invariant Feature Transform Why do we care about matching features? Camera calibration Stereo Tracking/SFM Image moiaicing Object/activity Recognition Objection representation and recognition Image

More information

International Journal Of Global Innovations -Vol.4, Issue.I Paper Id: SP-V4-I1-P17 ISSN Online:

International Journal Of Global Innovations -Vol.4, Issue.I Paper Id: SP-V4-I1-P17 ISSN Online: IMPLEMENTATION OF EMBEDDED HUMAN TRACKING SYSTEM USING DM3730 DUALCORE PROCESSOR #1 DASARI ALEKHYA M.TECH Student, #2 Dr. SYED ABUDHAGIR.U Associate Professor, Dept of ECE B.V.RAJU INSTITUTE OF TECHNOLOGY,

More information

CS 231A Computer Vision (Winter 2018) Problem Set 3

CS 231A Computer Vision (Winter 2018) Problem Set 3 CS 231A Computer Vision (Winter 2018) Problem Set 3 Due: Feb 28, 2018 (11:59pm) 1 Space Carving (25 points) Dense 3D reconstruction is a difficult problem, as tackling it from the Structure from Motion

More information

Study of Viola-Jones Real Time Face Detector

Study of Viola-Jones Real Time Face Detector Study of Viola-Jones Real Time Face Detector Kaiqi Cen cenkaiqi@gmail.com Abstract Face detection has been one of the most studied topics in computer vision literature. Given an arbitrary image the goal

More information

ELEC Dr Reji Mathew Electrical Engineering UNSW

ELEC Dr Reji Mathew Electrical Engineering UNSW ELEC 4622 Dr Reji Mathew Electrical Engineering UNSW Review of Motion Modelling and Estimation Introduction to Motion Modelling & Estimation Forward Motion Backward Motion Block Motion Estimation Motion

More information

CS 223B Computer Vision Problem Set 3

CS 223B Computer Vision Problem Set 3 CS 223B Computer Vision Problem Set 3 Due: Feb. 22 nd, 2011 1 Probabilistic Recursion for Tracking In this problem you will derive a method for tracking a point of interest through a sequence of images.

More information

GPU-based pedestrian detection for autonomous driving

GPU-based pedestrian detection for autonomous driving Procedia Computer Science Volume 80, 2016, Pages 2377 2381 ICCS 2016. The International Conference on Computational Science GPU-based pedestrian detection for autonomous driving V. Campmany 1,2, S. Silva

More information

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality

More information

Object Category Detection. Slides mostly from Derek Hoiem

Object Category Detection. Slides mostly from Derek Hoiem Object Category Detection Slides mostly from Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical template matching with sliding window Part-based Models

More information

Object Tracking using HOG and SVM

Object Tracking using HOG and SVM Object Tracking using HOG and SVM Siji Joseph #1, Arun Pradeep #2 Electronics and Communication Engineering Axis College of Engineering and Technology, Ambanoly, Thrissur, India Abstract Object detection

More information

Multiple-Person Tracking by Detection

Multiple-Person Tracking by Detection http://excel.fit.vutbr.cz Multiple-Person Tracking by Detection Jakub Vojvoda* Abstract Detection and tracking of multiple person is challenging problem mainly due to complexity of scene and large intra-class

More information

Face detection and recognition. Detection Recognition Sally

Face detection and recognition. Detection Recognition Sally Face detection and recognition Detection Recognition Sally Face detection & recognition Viola & Jones detector Available in open CV Face recognition Eigenfaces for face recognition Metric learning identification

More information

C. Premsai 1, Prof. A. Kavya 2 School of Computer Science, School of Computer Science Engineering, Engineering VIT Chennai, VIT Chennai

C. Premsai 1, Prof. A. Kavya 2 School of Computer Science, School of Computer Science Engineering, Engineering VIT Chennai, VIT Chennai Traffic Sign Detection Via Graph-Based Ranking and Segmentation Algorithm C. Premsai 1, Prof. A. Kavya 2 School of Computer Science, School of Computer Science Engineering, Engineering VIT Chennai, VIT

More information

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58 Image Features: Local Descriptors Sanja Fidler CSC420: Intro to Image Understanding 1/ 58 [Source: K. Grauman] Sanja Fidler CSC420: Intro to Image Understanding 2/ 58 Local Features Detection: Identify

More information

Scale Invariant Feature Transform

Scale Invariant Feature Transform Why do we care about matching features? Scale Invariant Feature Transform Camera calibration Stereo Tracking/SFM Image moiaicing Object/activity Recognition Objection representation and recognition Automatic

More information

Car Detecting Method using high Resolution images

Car Detecting Method using high Resolution images Car Detecting Method using high Resolution images Swapnil R. Dhawad Department of Electronics and Telecommunication Engineering JSPM s Rajarshi Shahu College of Engineering, Savitribai Phule Pune University,

More information

Histogram of Oriented Gradients (HOG) for Object Detection

Histogram of Oriented Gradients (HOG) for Object Detection Histogram of Oriented Gradients (HOG) for Object Detection Navneet DALAL Joint work with Bill TRIGGS and Cordelia SCHMID Goal & Challenges Goal: Detect and localise people in images and videos n Wide variety

More information

Selective Search for Object Recognition

Selective Search for Object Recognition Selective Search for Object Recognition Uijlings et al. Schuyler Smith Overview Introduction Object Recognition Selective Search Similarity Metrics Results Object Recognition Kitten Goal: Problem: Where

More information

Fast Human Detection with Cascaded Ensembles. Berkin Bilgiç

Fast Human Detection with Cascaded Ensembles. Berkin Bilgiç Fast Human Detection with Cascaded Ensembles by Berkin Bilgiç Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the degree of Master

More information

DEPARTMENT OF INFORMATICS

DEPARTMENT OF INFORMATICS DEPARTMENT OF INFORMATICS TECHNISCHE UNIVERSITÄT MÜNCHEN Bachelor s Thesis in Informatics Pedestrian detection in urban environments based on vision and depth data Andreas Kreutz DEPARTMENT OF INFORMATICS

More information

Introduction. Introduction. Related Research. SIFT method. SIFT method. Distinctive Image Features from Scale-Invariant. Scale.

Introduction. Introduction. Related Research. SIFT method. SIFT method. Distinctive Image Features from Scale-Invariant. Scale. Distinctive Image Features from Scale-Invariant Keypoints David G. Lowe presented by, Sudheendra Invariance Intensity Scale Rotation Affine View point Introduction Introduction SIFT (Scale Invariant Feature

More information

Robotics Programming Laboratory

Robotics Programming Laboratory Chair of Software Engineering Robotics Programming Laboratory Bertrand Meyer Jiwon Shin Lecture 8: Robot Perception Perception http://pascallin.ecs.soton.ac.uk/challenges/voc/databases.html#caltech car

More information

Vision. OCR and OCV Application Guide OCR and OCV Application Guide 1/14

Vision. OCR and OCV Application Guide OCR and OCV Application Guide 1/14 Vision OCR and OCV Application Guide 1.00 OCR and OCV Application Guide 1/14 General considerations on OCR Encoded information into text and codes can be automatically extracted through a 2D imager device.

More information

Combining PGMs and Discriminative Models for Upper Body Pose Detection

Combining PGMs and Discriminative Models for Upper Body Pose Detection Combining PGMs and Discriminative Models for Upper Body Pose Detection Gedas Bertasius May 30, 2014 1 Introduction In this project, I utilized probabilistic graphical models together with discriminative

More information

An Optimized Sliding Window Approach to Pedestrian Detection

An Optimized Sliding Window Approach to Pedestrian Detection An Optimized Sliding Window Approach to Pedestrian Detection Victor Hugo Cunha de Melo, Samir Leão, David Menotti, William Robson Schwartz Computer Science Department, Universidade Federal de Minas Gerais,

More information

Crowd Event Recognition Using HOG Tracker

Crowd Event Recognition Using HOG Tracker Crowd Event Recognition Using HOG Tracker Carolina Gárate Piotr Bilinski Francois Bremond Pulsar Pulsar Pulsar INRIA INRIA INRIA Sophia Antipolis, France Sophia Antipolis, France Sophia Antipolis, France

More information

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014 SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014 SIFT SIFT: Scale Invariant Feature Transform; transform image

More information

Part-Based Skew Estimation for Mathematical Expressions

Part-Based Skew Estimation for Mathematical Expressions Soma Shiraishi, Yaokai Feng, and Seiichi Uchida shiraishi@human.ait.kyushu-u.ac.jp {fengyk,uchida}@ait.kyushu-u.ac.jp Abstract We propose a novel method for the skew estimation on text images containing

More information

Chapter 3 Image Registration. Chapter 3 Image Registration

Chapter 3 Image Registration. Chapter 3 Image Registration Chapter 3 Image Registration Distributed Algorithms for Introduction (1) Definition: Image Registration Input: 2 images of the same scene but taken from different perspectives Goal: Identify transformation

More information

A New Strategy of Pedestrian Detection Based on Pseudo- Wavelet Transform and SVM

A New Strategy of Pedestrian Detection Based on Pseudo- Wavelet Transform and SVM A New Strategy of Pedestrian Detection Based on Pseudo- Wavelet Transform and SVM M.Ranjbarikoohi, M.Menhaj and M.Sarikhani Abstract: Pedestrian detection has great importance in automotive vision systems

More information

IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES

IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES Pin-Syuan Huang, Jing-Yi Tsai, Yu-Fang Wang, and Chun-Yi Tsai Department of Computer Science and Information Engineering, National Taitung University,

More information

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1 Feature Detection Raul Queiroz Feitosa 3/30/2017 Feature Detection 1 Objetive This chapter discusses the correspondence problem and presents approaches to solve it. 3/30/2017 Feature Detection 2 Outline

More information

Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds

Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds 9 1th International Conference on Document Analysis and Recognition Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds Weihan Sun, Koichi Kise Graduate School

More information

Region-based Segmentation and Object Detection

Region-based Segmentation and Object Detection Region-based Segmentation and Object Detection Stephen Gould Tianshi Gao Daphne Koller Presented at NIPS 2009 Discussion and Slides by Eric Wang April 23, 2010 Outline Introduction Model Overview Model

More information

Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs and Adaptive Motion Frame Method

Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs and Adaptive Motion Frame Method Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs

More information

Ensemble of Bayesian Filters for Loop Closure Detection

Ensemble of Bayesian Filters for Loop Closure Detection Ensemble of Bayesian Filters for Loop Closure Detection Mohammad Omar Salameh, Azizi Abdullah, Shahnorbanun Sahran Pattern Recognition Research Group Center for Artificial Intelligence Faculty of Information

More information

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Si Chen The George Washington University sichen@gwmail.gwu.edu Meera Hahn Emory University mhahn7@emory.edu Mentor: Afshin

More information

Feature Descriptors. CS 510 Lecture #21 April 29 th, 2013

Feature Descriptors. CS 510 Lecture #21 April 29 th, 2013 Feature Descriptors CS 510 Lecture #21 April 29 th, 2013 Programming Assignment #4 Due two weeks from today Any questions? How is it going? Where are we? We have two umbrella schemes for object recognition

More information

An Object Detection System using Image Reconstruction with PCA

An Object Detection System using Image Reconstruction with PCA An Object Detection System using Image Reconstruction with PCA Luis Malagón-Borja and Olac Fuentes Instituto Nacional de Astrofísica Óptica y Electrónica, Puebla, 72840 Mexico jmb@ccc.inaoep.mx, fuentes@inaoep.mx

More information

Computationally Efficient Serial Combination of Rotation-invariant and Rotation Compensating Iris Recognition Algorithms

Computationally Efficient Serial Combination of Rotation-invariant and Rotation Compensating Iris Recognition Algorithms Computationally Efficient Serial Combination of Rotation-invariant and Rotation Compensating Iris Recognition Algorithms Andreas Uhl Department of Computer Sciences University of Salzburg, Austria uhl@cosy.sbg.ac.at

More information

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science. Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 People Detection Some material for these slides comes from www.cs.cornell.edu/courses/cs4670/2012fa/lectures/lec32_object_recognition.ppt

More information

Evaluation of Hardware Oriented MRCoHOG using Logic Simulation

Evaluation of Hardware Oriented MRCoHOG using Logic Simulation Evaluation of Hardware Oriented MRCoHOG using Logic Simulation Yuta Yamasaki 1, Shiryu Ooe 1, Akihiro Suzuki 1, Kazuhiro Kuno 2, Hideo Yamada 2, Shuichi Enokida 3 and Hakaru Tamukoh 1 1 Graduate School

More information

A Cascade of Feed-Forward Classifiers for Fast Pedestrian Detection

A Cascade of Feed-Forward Classifiers for Fast Pedestrian Detection A Cascade of eed-orward Classifiers for ast Pedestrian Detection Yu-ing Chen,2 and Chu-Song Chen,3 Institute of Information Science, Academia Sinica, aipei, aiwan 2 Dept. of Computer Science and Information

More information

Selection of Scale-Invariant Parts for Object Class Recognition

Selection of Scale-Invariant Parts for Object Class Recognition Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract

More information

Recent Researches in Automatic Control, Systems Science and Communications

Recent Researches in Automatic Control, Systems Science and Communications Real time human detection in video streams FATMA SAYADI*, YAHIA SAID, MOHAMED ATRI AND RACHED TOURKI Electronics and Microelectronics Laboratory Faculty of Sciences Monastir, 5000 Tunisia Address (12pt

More information

Outline 7/2/201011/6/

Outline 7/2/201011/6/ Outline Pattern recognition in computer vision Background on the development of SIFT SIFT algorithm and some of its variations Computational considerations (SURF) Potential improvement Summary 01 2 Pattern

More information

Ceiling Analysis of Pedestrian Recognition Pipeline for an Autonomous Car Application

Ceiling Analysis of Pedestrian Recognition Pipeline for an Autonomous Car Application Ceiling Analysis of Pedestrian Recognition Pipeline for an Autonomous Car Application Henry Roncancio, André Carmona Hernandes and Marcelo Becker Mobile Robotics Lab (LabRoM) São Carlos School of Engineering

More information

EE795: Computer Vision and Intelligent Systems

EE795: Computer Vision and Intelligent Systems EE795: Computer Vision and Intelligent Systems Spring 2012 TTh 17:30-18:45 FDH 204 Lecture 09 130219 http://www.ee.unlv.edu/~b1morris/ecg795/ 2 Outline Review Feature Descriptors Feature Matching Feature

More information

Postprint.

Postprint. http://www.diva-portal.org Postprint This is the accepted version of a paper presented at 14th International Conference of the Biometrics Special Interest Group, BIOSIG, Darmstadt, Germany, 9-11 September,

More information

Features Points. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Features Points. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE) Features Points Andrea Torsello DAIS Università Ca Foscari via Torino 155, 30172 Mestre (VE) Finding Corners Edge detectors perform poorly at corners. Corners provide repeatable points for matching, so

More information

Pedestrian Detection using Infrared images and Histograms of Oriented Gradients

Pedestrian Detection using Infrared images and Histograms of Oriented Gradients Intelligent Vehicles Symposium 26, June 3-5, 26, Tokyo, Japan 6- Pedestrian Detection using Infrared images and Histograms of Oriented Gradients F. Suard, A. Rakotomamonjy, A. Bensrhair Lab. PSI CNRS FRE

More information

Sparse coding for image classification

Sparse coding for image classification Sparse coding for image classification Columbia University Electrical Engineering: Kun Rong(kr2496@columbia.edu) Yongzhou Xiang(yx2211@columbia.edu) Yin Cui(yc2776@columbia.edu) Outline Background Introduction

More information

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped

More information

Human-Robot Interaction

Human-Robot Interaction Human-Robot Interaction Elective in Artificial Intelligence Lecture 6 Visual Perception Luca Iocchi DIAG, Sapienza University of Rome, Italy With contributions from D. D. Bloisi and A. Youssef Visual Perception

More information

Robust PDF Table Locator

Robust PDF Table Locator Robust PDF Table Locator December 17, 2016 1 Introduction Data scientists rely on an abundance of tabular data stored in easy-to-machine-read formats like.csv files. Unfortunately, most government records

More information

Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction

Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction Chieh-Chih Wang and Ko-Chih Wang Department of Computer Science and Information Engineering Graduate Institute of Networking

More information

Face Detection for Skintone Images Using Wavelet and Texture Features

Face Detection for Skintone Images Using Wavelet and Texture Features Face Detection for Skintone Images Using Wavelet and Texture Features 1 H.C. Vijay Lakshmi, 2 S. Patil Kulkarni S.J. College of Engineering Mysore, India 1 vijisjce@yahoo.co.in, 2 pk.sudarshan@gmail.com

More information

Removing Shadows from Images

Removing Shadows from Images Removing Shadows from Images Zeinab Sadeghipour Kermani School of Computing Science Simon Fraser University Burnaby, BC, V5A 1S6 Mark S. Drew School of Computing Science Simon Fraser University Burnaby,

More information