OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES

Size: px
Start display at page:

Download "OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES"

Transcription

1 Image Processing & Communication, vol. 18,no. 1, pp DOI: /v x 31 OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES RAFAŁ KOZIK ADAM MARCHEWKA Institute of Telecommunications, University of Technology & Life Sciences in Bydgoszcz, ul. Kaliskiego 7, Bydgoszcz, Poland Abstract. In this paper an object recognition algorithm for mobile devices is presented. The algorithm is based on a hierarchical approach for visual information coding proposed by Riesenhuber and Poggio [1] and later extended by Serre et al. [2]. The proposed method adapts an efficient algorithm to extract the information about local gradients. This allows the algorithm to approximate the behaviour of simple cells layer of Riesenhuber and Poggio model. Moreover, it is also demonstrated that the proposed algorithm can be successfully deployed on a low-cost Android smartphone. 1 Introduction This paper is a continuation of author s recent research on object detection algorithms dedicate for mobile devices. The tentative results have been presented in [4]. In contrast to previous work, modifications to overall method architecture have been proposed. Moreover, the experiments described in this paper have been conducted for extended database of images. The proposed approach follows the idea of HMAX model proposed by Riesenhuber and Poggio [1]. It adapts the hierarchical fed-forward processing approach. However, modifications are introduced in order to reduce computational complexity of the original algorithm. Moreover, in contrast to Mutch and Lowe [10] model an additional layer that mimics the "retina codding" mechanism is added. The results showed that this step increases the robustness of the proposed method. However, it is considered as an optional step of visual information processing, because it introduces additional computational burden that can increase the response time (the frame rate drops significantly) of at low-cost mobile devices. In contrast to original HMAX model, a different method for calculating the S 1 layer response is proposed. The responses of 4 Gabor filters are approximated with algorithm that mimic the behaviour of HOG descriptors. The HOG [3] descriptors are commonly use for object recognition problems. The image is decomposed into small cells. In each cell an histogram of oriented gradients is used. Typically for HOG features the result is normalised and the descriptor is returned for each cell. However, in contrast to original HOG descriptor proposed by Dalal and Triggs, in this method (in order to simplify the computations) the block of cells are not overlapping, and

2 32 R. Kozik, A. Marchewka the input image is not normalised. Moreover, in previous work [7] the S 2 and C 2 layers were replaced with machine-learned classifier. Such approach significantly simplifies the learning process and decreases the amount of computations. However, this impacts the invariance to shape changes of a recognised object and the feature vectors presented to classifier are relatively of grater dimensionality. 2 The HMAX model overview The HMAX Visual Cortex model proposed by Riesenhuber and Poggio [1] exploits a hierarchical structure for the image processing and coding. It is arranged in several layers that process information in a bottom-up manner. The lowest layer is fed with a grayscale image. The higher layers of the model are either called "S" or "C". These names correspond to simple the (S) and complex (C) cells discovered by Hubel and Wiesel [9]. Both type of cells are located in the striate cortex (called V1), which is the part of visual cortex that lies in the most posterior area of the occipital lobe. The simple cells located in "S" layers apply local filters that responses form a vector of texture features. As noted by Hubel and Wiesel the individual cell in the cortex respond to the presence of edges. They also discovered that these cells are sensitive to edge orientation (some of cells fire only when a given orientation of an edge is observed). The complex cells located in "C" layers calculate in a limited range of a previous layer the strongest responses of a given type (orientation). That way more complex combination of simple features are obtained combined from three simple features. The hierarchical HMAX model proposed by Mutch and Lowe [10] is a modification of the model presented by Serre et al. in [2]. It introduces two layers of simple cells (S 1 and S 2 ) and two layers of complex cells (see Fig. 1). It uses set of filters designed to emulate V1 simple cells. The Fig. 1: The structure of a hierarchical model proposed by Mutch and Lowe [10]. (used symbols: I - image, S - simple cells, C - complex cells, GF - Gabor Filters, F - prototype features vectors, - convolution operation) simple layers are computed using a hard max filter [11]. It means that the "C" cells responses are the maximum values of the associated "S" cells. As it is shown in the Fig. 1 the images are processed by the subsequent simple and complex cells layers and reduced to feature vectors, which are further used in the classification process. The set of features (F ) is shared across all images and object categories. Features are computed hierarchically in subsequent layers built from the previous one by alternating the template matching and the max pooling operations. The S 1 layer in the Mutch and Lowe [10] model adapts the 2D Gabor filters computed for four orientations (horizontal, vertical, and two diagonals) at each possible position and scale. The Gabor filters are in size, and are described by: G(x, y) = e (X2 +γy 2 )/(2σ 2) cos( 2π λ ) (1) where X = x cos φ y sin φ and Y = x sin φ+y cos φ; x, y < 5; 5 >, and φ < 0; π >. The aspect ration (γ), effective width (σ), and wavelength (λ) are set to 0.3, 4.5 and 5.6 respectively. The response of a patch of pixels X to a particular filter G is computed using the formula (2). Xi G i R(X, G) = abs( ) (2) X 2 i The complex cells located in C 1 layer pool associated units in the S 1 layer. For each orientation, the S 1 re-

3 Image Processing & Communication, vol. 18, no. 1, pp sponses are convolved with a max filter, that is of size in x, y dimension (position) and has 2 units of deep in scale. As it is shown in the Fig. 1, the intermediate S 2 layer is formed by convolving the C 1 layer response with a set of intermediate-level features (depicted as F in Fig. 1). The set of intermediate-level features is established during the learning phase. For a given set of learning images C 1 responses are computed. The most meaningful features are selected using SVM weighting. Mutch and Lowe [10] suggested to sub-sample the C 1 responses before feature selection. Therefore, authors select at random positions and scales of the C 1 layer patches of size 4 4, 8 8, 12 12, and Selected and weighted features compose so called prototypes that are used in the Mutch and Lowe model as filters which responses create the S 2 layer. The C 2 layer composes a global feature vector which particular element corresponds to the maximum response to a given prototype patch. In order to identify the visual object on the basis of feature vector a classifier is learnt (e.g. SVM). 3 The proposed method In order to approximate the S 1 layer behaviour of the HMAX model the algorithms that mimics the HOG descriptors is used. However, to make this step as simple as possible (due to the low computation power of mobile devices) only gradient orientation binning schema is adapted. In fact, the intention here is no to compute HOG descriptor, but to approximate the response of four Gabor filters used in original HMAX model. In the S 1 layer there are N M 4 simple cells arranged in a grid of size N M blocks. In each block there are 4 cells. Each cell is assigned a receptive field (pixels inside the block). Each cell activates depend on a stimuli. In this case there are four possible stimulus, namely vertical, horizontal, left diagonal, and right diagonal edges. As a result the S 1 simple cells layer output has dimensionality of a size 4 (x, y, scale and 4 cells). In order to compute responses of all four cells inside a given block (receptive field), an Algorithm 1 is applied. Algorithm 1 Algorithm for calculating S 1 response Require: Grayscaled image I of W H size Ensure: S 1 layer of size N M 4 Assign G min the low-threshold for gradient magnitude. for each pixel (x, y) in image I do Compute horizontal G x and vertical G y gradients using Prewitt operator Compute gradient magnitude G x,y in point (x, y) n x N W ; m y M W if G < G min then go to next pixel else active get_cell_type(g x, G y ) S 1 [n, m, active] S 1 [n, m, active] + G x,y end if end for The algorithm computes the responses of all cells using only one iteration over the whole input image I. For each pixel at position (x, y) a vertical and horizontal gradients are computed (G x and G y ). Given the pixel position (x, y) and gradient vector [G x, G y ] the algorithm indicates the block position (n, m) and type of cell (active) that response has to be incremented by G. In order to classify given gradient vector [G x, G y ] as horizontal, diagonal or vertical the get_cell_type(, ) function uses simple schema for voting. If a point ( G x, G y ) is located between line y = 0.3 x and y = 3.3 x it is classified as a diagonal. If G y is positive then the vector is classified as a right diagonal (otherwise it is a left diagonal). In case the point ( G x, G y ) is located above line y = 3.3 x the gradient vector is classified as vertical and as horizontal when it lies below line y = 0.3 x. The processing in S 1 layer is applied for three scales (an original image, and two images scaled by a factor of 0.7 and 1.5). This is a necessary operation to achieve the response of C 1 layer. The C 1 complex layer response is

4 34 R. Kozik, A. Marchewka computed using max-pooling filter, that is applied to S 1 layer. This layer allows the algorithm to achieve scale invariance. The output of the C 1 layer cells is used in order to select features that are meaningful for a given task of an object detection. Firstly, randomly selected images of an object are reduced to a features vectors by a consecutive layers of the proposed model. Afterwards, the PCA is applied to calculate the eigenvectors. Each eigenvector is used as a single feature in a features vector. The response of S 2 layer is obtained using eigenvectors to compute the dot product with the C 1 layer. Finally, an classifier is learned to recognise an object using S 2 response layer. For this paper different machinelearnt algorithms are investigated. 4 Experiments and Results was possible to achieve 95,6% of detection rate while having 2.3% of false positives. It can be noticed that adding additional trees does not improve the effectiveness. The early prototype of proposed algorithm was deployed on an Android device. The code was written in pure Java and tested on Samsung Galaxy Ace device. This device is equipped with 800 MHz CPU, 278 MB of RAM and Android 2.3 operating system. Current version implements brute force Nearest Neighbour classifier, operates only for one scale (PC scans three scale - an original image, and two images scaled by a factor of 0.7 and 1.5), and achieves about 10 FPS when less than 10 training samples are provided. Some examples of object detection are shown in Fig. 3. During the testing it was noticed that the algorithm is able to recognise an object on a cluttered background even if only few learning samples are provided. In contrast to the previous work [4], the data base of images has been extended and new experiments have been conducted, which aimed at evaluating the effectiveness of different classifiers. In this work tree-based and rule-based classification approaches have been compared, namely N (5,10,15) Random Trees, J48 and PART (previously used by author in [5, 6, 4]) algorithms. The comparison has been presented in Fig.2. For the experiments the dataset was randomly split for training and testing samples (in proportion 70% to 30% respectively). For training samples the C 1 layers responses were calculated and eigenvectors were extracted. The eigenvectors were used to calculate the S 2 layers responses that were further used to learn classifier. During the testing phase the eigenvectors were again used in order to extract S 2 layers responses for testing samples. The whole procedure was repeated 10 times ( the dataset was randomly reshuffled before split) and the results were averaged. The best results were obtained for 10 Random Trees. It Fig. 3: Example of object detection (mug and doors) with proposed algorithm deployed on an Android device 5 Conclusions In this paper an algorithm for object recognition dedicated form mobile devices was presented. The algorithm is based on a hierarchical approach for visual information coding proposed by Riesenhuber and Poggio [1] and later

5 False Positive Rate Image Processing & Communication, vol. 18, no. 1, pp ,04 0,035 0,03 0,025 0,02 0,015 0,01 0, random trees 5 random trees 15 random trees J48 PART Expon. (10 random trees) Expon. (5 random trees) Expon. (15 random trees) Expon. (J48) Expon. (PART) 0 0,78 0,8 0,82 0,84 0,86 0,88 0,9 0,92 0,94 0,96 0,98 Detection Rate Fig. 2: Different classifiers effectiveness extended by Serre et al. [2]. The experiments show that the introduced algorithm allows for efficient feature extraction and a visual information coding. Moreover, it was shown that it is also possible to deploy proposed approach on a low-cost mobile device. The results are promising and show that described in this paper algorithm can be further investigated as one of assistive solutions dedicated for visually impaired people. References [1] M. Riesenhuber, T. Poggio, Hierarchical models of object recognition in cortex, Nat Neurosci, Vol. 2, pp , 1999 [2] T. Serre, G. Kreiman, M. Kouh, C. Cadieu, U. Knoblich T. Poggio, A quantitative theory of immediate visual recognition, In:Progress in Brain Research, Computational Neuroscience: Theoretical Insights into Brain Function, Vol. 165, pp , 2007 [3] N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, Computer Vision and Pattern Recognition, CVPR 2005, IEEE Computer Society Conference on, Vol.1, pp , June 2005 [4] R. Kozik, A Simplified Visual Cortex Model for Efficient Image Codding and Object Recognition, Image Processing and Communications Challenges 5, Advances in Intelligent Systems and Computing, pp , 2013 [5] R. Kozik, Rapid Threat Detection for Stereovision Mobility Aid System, In: T. Czachorski et al. (Eds.): Man-Machine Interactions 2, AISC 103, pp , 2011 [6] R. Kozik, Stereovision system for visually impaired. Burduk, Robert (ed.) et al., Computer recognition systems 4, Advances in Intelligent and Soft Computing 95, pp , 2011 [7] R. Kozik, A Proposal of Biologically Inspired Hi-

6 36 R. Kozik, A. Marchewka erarchical Approach to Object Recognition, Journal of Medical Informatics & Technologies, Vol. 22/2013, ISSN , pp , 2013 [8] R. Kozik, A Simplified Visual Cortex Model for Efficient Image Codding and Object Recognition, Image Processing and Communications Challenges 5, Advances in Intelligent Systems and Computing S. Choras, Ryszard, pp , 2013 [9] D.H. Hubel, T.N. Wiesel Receptive fields, binocular interaction and functional architecture in the cat s visual cortex, J Physiol, Vol. 160, pp , 1962 [10] J. Mutch, D.G. Lowe, Object class recognition and localization using sparse features with limited receptive fields,international Journal of Computer Vision (IJCV), Vol. 80, No. 1, pp , October 2008 [11] MAX pooling, php/pooling

Developing Open Source code for Pyramidal Histogram Feature Sets

Developing Open Source code for Pyramidal Histogram Feature Sets Developing Open Source code for Pyramidal Histogram Feature Sets BTech Project Report by Subodh Misra subodhm@iitk.ac.in Y648 Guide: Prof. Amitabha Mukerjee Dept of Computer Science and Engineering IIT

More information

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science V1-inspired orientation selective filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this

More information

Biologically-Inspired Translation, Scale, and rotation invariant object recognition models

Biologically-Inspired Translation, Scale, and rotation invariant object recognition models Rochester Institute of Technology RIT Scholar Works Theses Thesis/Dissertation Collections 2007 Biologically-Inspired Translation, Scale, and rotation invariant object recognition models Myung Woo Follow

More information

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2D Gabor functions and filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this presentation

More information

Modeling Visual Cortex V4 in Naturalistic Conditions with Invari. Representations

Modeling Visual Cortex V4 in Naturalistic Conditions with Invari. Representations Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant and Sparse Image Representations Bin Yu Departments of Statistics and EECS University of California at Berkeley Rutgers University, May

More information

Object Detection Design challenges

Object Detection Design challenges Object Detection Design challenges How to efficiently search for likely objects Even simple models require searching hundreds of thousands of positions and scales Feature design and scoring How should

More information

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2D Gabor functions and filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this presentation

More information

MORPH-II: Feature Vector Documentation

MORPH-II: Feature Vector Documentation MORPH-II: Feature Vector Documentation Troy P. Kling NSF-REU Site at UNC Wilmington, Summer 2017 1 MORPH-II Subsets Four different subsets of the MORPH-II database were selected for a wide range of purposes,

More information

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking Feature descriptors Alain Pagani Prof. Didier Stricker Computer Vision: Object and People Tracking 1 Overview Previous lectures: Feature extraction Today: Gradiant/edge Points (Kanade-Tomasi + Harris)

More information

Human Vision Based Object Recognition Sye-Min Christina Chan

Human Vision Based Object Recognition Sye-Min Christina Chan Human Vision Based Object Recognition Sye-Min Christina Chan Abstract Serre, Wolf, and Poggio introduced an object recognition algorithm that simulates image processing in visual cortex and claimed to

More information

Exploration of Hardware Acceleration for a Neuromorphic Visual Classification System

Exploration of Hardware Acceleration for a Neuromorphic Visual Classification System Exploration of Hardware Acceleration for a Neuromorphic Visual Classification System Ikenna J. Okafor, McNair Scholar The Pennsylvania State University McNair Faculty Research Advisors: Kevin M. Irick,

More information

An Implementation on Histogram of Oriented Gradients for Human Detection

An Implementation on Histogram of Oriented Gradients for Human Detection An Implementation on Histogram of Oriented Gradients for Human Detection Cansın Yıldız Dept. of Computer Engineering Bilkent University Ankara,Turkey cansin@cs.bilkent.edu.tr Abstract I implemented a Histogram

More information

https://en.wikipedia.org/wiki/the_dress Recap: Viola-Jones sliding window detector Fast detection through two mechanisms Quickly eliminate unlikely windows Use features that are fast to compute Viola

More information

2D Image Processing Feature Descriptors

2D Image Processing Feature Descriptors 2D Image Processing Feature Descriptors Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Overview

More information

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah Human detection using histogram of oriented gradients Srikumar Ramalingam School of Computing University of Utah Reference Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

More information

Human Motion Detection and Tracking for Video Surveillance

Human Motion Detection and Tracking for Video Surveillance Human Motion Detection and Tracking for Video Surveillance Prithviraj Banerjee and Somnath Sengupta Department of Electronics and Electrical Communication Engineering Indian Institute of Technology, Kharagpur,

More information

Object Category Detection: Sliding Windows

Object Category Detection: Sliding Windows 04/10/12 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical

More information

Online Learning for Object Recognition with a Hierarchical Visual Cortex Model

Online Learning for Object Recognition with a Hierarchical Visual Cortex Model Online Learning for Object Recognition with a Hierarchical Visual Cortex Model Stephan Kirstein, Heiko Wersing, and Edgar Körner Honda Research Institute Europe GmbH Carl Legien Str. 30 63073 Offenbach

More information

The SIFT (Scale Invariant Feature

The SIFT (Scale Invariant Feature The SIFT (Scale Invariant Feature Transform) Detector and Descriptor developed by David Lowe University of British Columbia Initial paper ICCV 1999 Newer journal paper IJCV 2004 Review: Matt Brown s Canonical

More information

Computational Models of V1 cells. Gabor and CORF

Computational Models of V1 cells. Gabor and CORF 1 Computational Models of V1 cells Gabor and CORF George Azzopardi Nicolai Petkov 2 Primary visual cortex (striate cortex or V1) 3 Types of V1 Cells Hubel and Wiesel, Nobel Prize winners Three main types

More information

Sketchable Histograms of Oriented Gradients for Object Detection

Sketchable Histograms of Oriented Gradients for Object Detection Sketchable Histograms of Oriented Gradients for Object Detection No Author Given No Institute Given Abstract. In this paper we investigate a new representation approach for visual object recognition. The

More information

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers A. Salhi, B. Minaoui, M. Fakir, H. Chakib, H. Grimech Faculty of science and Technology Sultan Moulay Slimane

More information

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images Marc Aurelio Ranzato Yann LeCun Courant Institute of Mathematical Sciences New York University - New York, NY 10003 Abstract

More information

Discriminative classifiers for image recognition

Discriminative classifiers for image recognition Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study

More information

Automated Canvas Analysis for Painting Conservation. By Brendan Tobin

Automated Canvas Analysis for Painting Conservation. By Brendan Tobin Automated Canvas Analysis for Painting Conservation By Brendan Tobin 1. Motivation Distinctive variations in the spacings between threads in a painting's canvas can be used to show that two sections of

More information

Deep Convolutional Neural Networks. Nov. 20th, 2015 Bruce Draper

Deep Convolutional Neural Networks. Nov. 20th, 2015 Bruce Draper Deep Convolutional Neural Networks Nov. 20th, 2015 Bruce Draper Background: Fully-connected single layer neural networks Feed-forward classification Trained through back-propagation Example Computer Vision

More information

Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds

Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds 9 1th International Conference on Document Analysis and Recognition Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds Weihan Sun, Koichi Kise Graduate School

More information

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images Marc Aurelio Ranzato Yann LeCun Courant Institute of Mathematical Sciences New York University - New York, NY 10003 Abstract

More information

Deep Learning. Deep Learning. Practical Application Automatically Adding Sounds To Silent Movies

Deep Learning. Deep Learning. Practical Application Automatically Adding Sounds To Silent Movies http://blog.csdn.net/zouxy09/article/details/8775360 Automatic Colorization of Black and White Images Automatically Adding Sounds To Silent Movies Traditionally this was done by hand with human effort

More information

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION Panca Mudjirahardjo, Rahmadwati, Nanang Sulistiyanto and R. Arief Setyawan Department of Electrical Engineering, Faculty of

More information

Object recognition in images by human vision and computer vision Chen, Q.; Dijkstra, J.; de Vries, B.

Object recognition in images by human vision and computer vision Chen, Q.; Dijkstra, J.; de Vries, B. Object recognition in images by human vision and computer vision Chen, Q.; Dijkstra, J.; de Vries, B. Published in: Conference proceeding of 10th International Conference on Design & Decision Support Systems

More information

Face Detection Using Convolutional Neural Networks and Gabor Filters

Face Detection Using Convolutional Neural Networks and Gabor Filters Face Detection Using Convolutional Neural Networks and Gabor Filters Bogdan Kwolek Rzeszów University of Technology W. Pola 2, 35-959 Rzeszów, Poland bkwolek@prz.rzeszow.pl Abstract. This paper proposes

More information

Robust PDF Table Locator

Robust PDF Table Locator Robust PDF Table Locator December 17, 2016 1 Introduction Data scientists rely on an abundance of tabular data stored in easy-to-machine-read formats like.csv files. Unfortunately, most government records

More information

Analysis of features vectors of images computed on the basis of hierarchical models of object recognition in cortex

Analysis of features vectors of images computed on the basis of hierarchical models of object recognition in cortex Analysis of features vectors of images computed on the basis of hierarchical models of object recognition in cortex Ankit Agrawal(Y7057); Jyoti Meena(Y7184) November 22, 2010 SE 367 : Introduction to Cognitive

More information

Car Detecting Method using high Resolution images

Car Detecting Method using high Resolution images Car Detecting Method using high Resolution images Swapnil R. Dhawad Department of Electronics and Telecommunication Engineering JSPM s Rajarshi Shahu College of Engineering, Savitribai Phule Pune University,

More information

A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection Stanley M Bileschi

A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection Stanley M Bileschi Computer Science and Artificial Intelligence Laboratory Technical Report MIT-CSAIL-TR-2008-019 CBCL-271 April 9, 2008 A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection

More information

Sparse Models in Image Understanding And Computer Vision

Sparse Models in Image Understanding And Computer Vision Sparse Models in Image Understanding And Computer Vision Jayaraman J. Thiagarajan Arizona State University Collaborators Prof. Andreas Spanias Karthikeyan Natesan Ramamurthy Sparsity Sparsity of a vector

More information

Augmented Reality VU. Computer Vision 3D Registration (2) Prof. Vincent Lepetit

Augmented Reality VU. Computer Vision 3D Registration (2) Prof. Vincent Lepetit Augmented Reality VU Computer Vision 3D Registration (2) Prof. Vincent Lepetit Feature Point-Based 3D Tracking Feature Points for 3D Tracking Much less ambiguous than edges; Point-to-point reprojection

More information

CS 523: Multimedia Systems

CS 523: Multimedia Systems CS 523: Multimedia Systems Angus Forbes creativecoding.evl.uic.edu/courses/cs523 Today - Convolutional Neural Networks - Work on Project 1 http://playground.tensorflow.org/ Convolutional Neural Networks

More information

Overview. Object Recognition. Neurobiology of Vision. Invariances in higher visual cortex

Overview. Object Recognition. Neurobiology of Vision. Invariances in higher visual cortex 3 / 27 4 / 27 Overview Object Recognition Mark van Rossum School of Informatics, University of Edinburgh January 15, 2018 Neurobiology of Vision Computational Object Recognition: What s the Problem? Fukushima

More information

Histogram of Oriented Gradients for Human Detection

Histogram of Oriented Gradients for Human Detection Histogram of Oriented Gradients for Human Detection Article by Navneet Dalal and Bill Triggs All images in presentation is taken from article Presentation by Inge Edward Halsaunet Introduction What: Detect

More information

A Novel Extreme Point Selection Algorithm in SIFT

A Novel Extreme Point Selection Algorithm in SIFT A Novel Extreme Point Selection Algorithm in SIFT Ding Zuchun School of Electronic and Communication, South China University of Technolog Guangzhou, China zucding@gmail.com Abstract. This paper proposes

More information

Object Tracking using HOG and SVM

Object Tracking using HOG and SVM Object Tracking using HOG and SVM Siji Joseph #1, Arun Pradeep #2 Electronics and Communication Engineering Axis College of Engineering and Technology, Ambanoly, Thrissur, India Abstract Object detection

More information

Tongue Recognition From Images

Tongue Recognition From Images Tongue Recognition From Images Ryszard S. Choraś Institute of Telecommunications and Computer Science UTP University of Sciences and Technology 85-796 Bydgoszcz, Poland Email: choras@utp.edu.pl Abstract

More information

Recap Image Classification with Bags of Local Features

Recap Image Classification with Bags of Local Features Recap Image Classification with Bags of Local Features Bag of Feature models were the state of the art for image classification for a decade BoF may still be the state of the art for instance retrieval

More information

Learning Feature Hierarchies for Object Recognition

Learning Feature Hierarchies for Object Recognition Learning Feature Hierarchies for Object Recognition Koray Kavukcuoglu Computer Science Department Courant Institute of Mathematical Sciences New York University Marc Aurelio Ranzato, Kevin Jarrett, Pierre

More information

Category vs. instance recognition

Category vs. instance recognition Category vs. instance recognition Category: Find all the people Find all the buildings Often within a single image Often sliding window Instance: Is this face James? Find this specific famous building

More information

Histogram of Oriented Phase (HOP): A New Descriptor Based on Phase Congruency

Histogram of Oriented Phase (HOP): A New Descriptor Based on Phase Congruency University of Dayton ecommons Electrical and Computer Engineering Faculty Publications Department of Electrical and Computer Engineering 5-2016 Histogram of Oriented Phase (HOP): A New Descriptor Based

More information

Tri-modal Human Body Segmentation

Tri-modal Human Body Segmentation Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4

More information

Facial Expression Classification with Random Filters Feature Extraction

Facial Expression Classification with Random Filters Feature Extraction Facial Expression Classification with Random Filters Feature Extraction Mengye Ren Facial Monkey mren@cs.toronto.edu Zhi Hao Luo It s Me lzh@cs.toronto.edu I. ABSTRACT In our work, we attempted to tackle

More information

Window based detectors

Window based detectors Window based detectors CS 554 Computer Vision Pinar Duygulu Bilkent University (Source: James Hays, Brown) Today Window-based generic object detection basic pipeline boosting classifiers face detection

More information

Illumination-Robust Face Recognition based on Gabor Feature Face Intrinsic Identity PCA Model

Illumination-Robust Face Recognition based on Gabor Feature Face Intrinsic Identity PCA Model Illumination-Robust Face Recognition based on Gabor Feature Face Intrinsic Identity PCA Model TAE IN SEOL*, SUN-TAE CHUNG*, SUNHO KI**, SEONGWON CHO**, YUN-KWANG HONG*** *School of Electronic Engineering

More information

Bio-inspired Binocular Disparity with Position-Shift Receptive Field

Bio-inspired Binocular Disparity with Position-Shift Receptive Field Bio-inspired Binocular Disparity with Position-Shift Receptive Field Fernanda da C. e C. Faria, Jorge Batista and Helder Araújo Institute of Systems and Robotics, Department of Electrical Engineering and

More information

Classification of objects from Video Data (Group 30)

Classification of objects from Video Data (Group 30) Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time

More information

Massively Parallel Multiclass Object Recognition

Massively Parallel Multiclass Object Recognition Vision, Modeling, and Visualization (2010) Massively Parallel Multiclass Object Recognition Helmut Sedding, Ferdinand Deger, Holger Dammertz, Jan Bouecke, Hendrik P. A. Lensch Ulm University, Germany Abstract

More information

Introduction to Deep Learning

Introduction to Deep Learning ENEE698A : Machine Learning Seminar Introduction to Deep Learning Raviteja Vemulapalli Image credit: [LeCun 1998] Resources Unsupervised feature learning and deep learning (UFLDL) tutorial (http://ufldl.stanford.edu/wiki/index.php/ufldl_tutorial)

More information

Robotics Programming Laboratory

Robotics Programming Laboratory Chair of Software Engineering Robotics Programming Laboratory Bertrand Meyer Jiwon Shin Lecture 8: Robot Perception Perception http://pascallin.ecs.soton.ac.uk/challenges/voc/databases.html#caltech car

More information

A New Algorithm for Shape Detection

A New Algorithm for Shape Detection IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 19, Issue 3, Ver. I (May.-June. 2017), PP 71-76 www.iosrjournals.org A New Algorithm for Shape Detection Hewa

More information

ANALYSIS DIRECTIONAL FEATURES IN IMAGES USING GABOR FILTERS

ANALYSIS DIRECTIONAL FEATURES IN IMAGES USING GABOR FILTERS ANALYSIS DIRECTIONAL FEATURES IN IMAGES USING GABOR FILTERS Z.-Q. Liu*, R. M. Rangayyan*, and C. B. Frank** Departments of Electrical Engineering* and Surgery**, The University of Calgary Calgary, Alberta,

More information

Mobile Human Detection Systems based on Sliding Windows Approach-A Review

Mobile Human Detection Systems based on Sliding Windows Approach-A Review Mobile Human Detection Systems based on Sliding Windows Approach-A Review Seminar: Mobile Human detection systems Njieutcheu Tassi cedrique Rovile Department of Computer Engineering University of Heidelberg

More information

HISTOGRAMS OF ORIENTATIO N GRADIENTS

HISTOGRAMS OF ORIENTATIO N GRADIENTS HISTOGRAMS OF ORIENTATIO N GRADIENTS Histograms of Orientation Gradients Objective: object recognition Basic idea Local shape information often well described by the distribution of intensity gradients

More information

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia Application Object Detection Using Histogram of Oriented Gradient For Artificial Intelegence System Module of Nao Robot (Control System Laboratory (LSKK) Bandung Institute of Technology) A K Saputra 1.,

More information

Obtaining Feature Correspondences

Obtaining Feature Correspondences Obtaining Feature Correspondences Neill Campbell May 9, 2008 A state-of-the-art system for finding objects in images has recently been developed by David Lowe. The algorithm is termed the Scale-Invariant

More information

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image SURF CSED441:Introduction to Computer Vision (2015S) Lecture6: SURF and HOG Bohyung Han CSE, POSTECH bhhan@postech.ac.kr Speed Up Robust Features (SURF) Simplified version of SIFT Faster computation but

More information

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech Convolutional Neural Networks Computer Vision Jia-Bin Huang, Virginia Tech Today s class Overview Convolutional Neural Network (CNN) Training CNN Understanding and Visualizing CNN Image Categorization:

More information

Enhanced object recognition in cortex-like machine vision

Enhanced object recognition in cortex-like machine vision Enhanced object recognition in cortex-like machine vision Aristeidis Tsitiridis 1, Peter WT Yuen 1, Izzati Ibrahim 1, Umar Soori 1, Tong Chen 1, Kan Hong 1, Zhengjie Wang 2, David James 1 and Mark Richardson

More information

EXPRESSION-INDEPENDENT FACE RECOGNITION USING BIOLOGICALLY INSPIRED FEATURES

EXPRESSION-INDEPENDENT FACE RECOGNITION USING BIOLOGICALLY INSPIRED FEATURES Abstract EXPRESSION-INDEPENDEN FACE RECOGNIION USING BIOLOGICALLY INSPIRED FEAURES REZA EBRAHIMPOUR, AHMAD JAHANI, ALI AMIRI AND MASOOM NAZARI Brain & Intelligent Systems Research Lab,Department of Electrical

More information

Advanced Video Content Analysis and Video Compression (5LSH0), Module 4

Advanced Video Content Analysis and Video Compression (5LSH0), Module 4 Advanced Video Content Analysis and Video Compression (5LSH0), Module 4 Visual feature extraction Part I: Color and texture analysis Sveta Zinger Video Coding and Architectures Research group, TU/e ( s.zinger@tue.nl

More information

Hand Posture Recognition Using Convolutional Neural Network

Hand Posture Recognition Using Convolutional Neural Network Hand Posture Recognition Using Convolutional Neural Network Dennis Núñez Fernández 2 and Bogdan Kwolek 1 1 AGH University of Science and Technology, 30 Mickiewicza, 30-059 Krakow, Poland http://home.agh.edu.pl/~bkw/contact.html

More information

Visual object classification by sparse convolutional neural networks

Visual object classification by sparse convolutional neural networks Visual object classification by sparse convolutional neural networks Alexander Gepperth 1 1- Ruhr-Universität Bochum - Institute for Neural Dynamics Universitätsstraße 150, 44801 Bochum - Germany Abstract.

More information

Object-Based Saliency Maps Harry Marr Computing BSc 2009/2010

Object-Based Saliency Maps Harry Marr Computing BSc 2009/2010 Object-Based Saliency Maps Harry Marr Computing BSc 2009/2010 The candidate confirms that the work submitted is their own and the appropriate credit has been given where reference has been made to the

More information

Speeding up the Detection of Line Drawings Using a Hash Table

Speeding up the Detection of Line Drawings Using a Hash Table Speeding up the Detection of Line Drawings Using a Hash Table Weihan Sun, Koichi Kise 2 Graduate School of Engineering, Osaka Prefecture University, Japan sunweihan@m.cs.osakafu-u.ac.jp, 2 kise@cs.osakafu-u.ac.jp

More information

Histograms of Oriented Gradients for Human Detection p. 1/1

Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs INRIA Rhône-Alpes Grenoble, France Funding: acemedia, LAVA,

More information

Sparsity-Regularized HMAX for Visual Recognition

Sparsity-Regularized HMAX for Visual Recognition 1 Sparsity-Regularized HMAX for Visual Recognition Xiaolin Hu 1,, Jianwei Zhang 2, Jianmin Li 1, Bo Zhang 1 1 State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for

More information

TEXTURE CLASSIFICATION METHODS: A REVIEW

TEXTURE CLASSIFICATION METHODS: A REVIEW TEXTURE CLASSIFICATION METHODS: A REVIEW Ms. Sonal B. Bhandare Prof. Dr. S. M. Kamalapur M.E. Student Associate Professor Deparment of Computer Engineering, Deparment of Computer Engineering, K. K. Wagh

More information

Modern Object Detection. Most slides from Ali Farhadi

Modern Object Detection. Most slides from Ali Farhadi Modern Object Detection Most slides from Ali Farhadi Comparison of Classifiers assuming x in {0 1} Learning Objective Training Inference Naïve Bayes maximize j i logp + logp ( x y ; θ ) ( y ; θ ) i ij

More information

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction Preprocessing The goal of pre-processing is to try to reduce unwanted variation in image due to lighting,

More information

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Tetsu Matsukawa Koji Suzuki Takio Kurita :University of Tsukuba :National Institute of Advanced Industrial Science and

More information

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1 Feature Detection Raul Queiroz Feitosa 3/30/2017 Feature Detection 1 Objetive This chapter discusses the correspondence problem and presents approaches to solve it. 3/30/2017 Feature Detection 2 Outline

More information

Human detections using Beagle board-xm

Human detections using Beagle board-xm Human detections using Beagle board-xm CHANDAN KUMAR 1 V. AJAY KUMAR 2 R. MURALI 3 1 (M. TECH STUDENT, EMBEDDED SYSTEMS, DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING, VIJAYA KRISHNA INSTITUTE

More information

Introduction to visual computation and the primate visual system

Introduction to visual computation and the primate visual system Introduction to visual computation and the primate visual system Problems in vision Basic facts about the visual system Mathematical models for early vision Marr s computational philosophy and proposal

More information

Face Recognition Based On Granular Computing Approach and Hybrid Spatial Features

Face Recognition Based On Granular Computing Approach and Hybrid Spatial Features Face Recognition Based On Granular Computing Approach and Hybrid Spatial Features S.Sankara vadivu 1, K. Aravind Kumar 2 Final Year Student of M.E, Department of Computer Science and Engineering, Manonmaniam

More information

Fuzzy based Multiple Dictionary Bag of Words for Image Classification

Fuzzy based Multiple Dictionary Bag of Words for Image Classification Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2196 2206 International Conference on Modeling Optimisation and Computing Fuzzy based Multiple Dictionary Bag of Words for Image

More information

Motion Estimation and Optical Flow Tracking

Motion Estimation and Optical Flow Tracking Image Matching Image Retrieval Object Recognition Motion Estimation and Optical Flow Tracking Example: Mosiacing (Panorama) M. Brown and D. G. Lowe. Recognising Panoramas. ICCV 2003 Example 3D Reconstruction

More information

Filtering, scale, orientation, localization, and texture. Nuno Vasconcelos ECE Department, UCSD (with thanks to David Forsyth)

Filtering, scale, orientation, localization, and texture. Nuno Vasconcelos ECE Department, UCSD (with thanks to David Forsyth) Filtering, scale, orientation, localization, and texture Nuno Vasconcelos ECE Department, UCSD (with thanks to David Forsyth) Beyond edges we have talked a lot about edges while they are important, it

More information

A new approach to reference point location in fingerprint recognition

A new approach to reference point location in fingerprint recognition A new approach to reference point location in fingerprint recognition Piotr Porwik a) and Lukasz Wieclaw b) Institute of Informatics, Silesian University 41 200 Sosnowiec ul. Bedzinska 39, Poland a) porwik@us.edu.pl

More information

Face and Nose Detection in Digital Images using Local Binary Patterns

Face and Nose Detection in Digital Images using Local Binary Patterns Face and Nose Detection in Digital Images using Local Binary Patterns Stanko Kružić Post-graduate student University of Split, Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture

More information

Object Recognition II

Object Recognition II Object Recognition II Linda Shapiro EE/CSE 576 with CNN slides from Ross Girshick 1 Outline Object detection the task, evaluation, datasets Convolutional Neural Networks (CNNs) overview and history Region-based

More information

Study of Viola-Jones Real Time Face Detector

Study of Viola-Jones Real Time Face Detector Study of Viola-Jones Real Time Face Detector Kaiqi Cen cenkaiqi@gmail.com Abstract Face detection has been one of the most studied topics in computer vision literature. Given an arbitrary image the goal

More information

Histograms of Oriented Gradients

Histograms of Oriented Gradients Histograms of Oriented Gradients Carlo Tomasi September 18, 2017 A useful question to ask of an image is whether it contains one or more instances of a certain object: a person, a face, a car, and so forth.

More information

Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong)

Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong) Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong) References: [1] http://homepages.inf.ed.ac.uk/rbf/hipr2/index.htm [2] http://www.cs.wisc.edu/~dyer/cs540/notes/vision.html

More information

FAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO

FAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO FAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO Makoto Arie, Masatoshi Shibata, Kenji Terabayashi, Alessandro Moro and Kazunori Umeda Course

More information

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT Oct. 15, 2013 Prof. Ronald Fearing Electrical Engineering and Computer Sciences University of California, Berkeley (slides courtesy of Prof. John Wawrzynek)

More information

A Robust Method for Circle / Ellipse Extraction Based Canny Edge Detection

A Robust Method for Circle / Ellipse Extraction Based Canny Edge Detection International Journal of Research Studies in Science, Engineering and Technology Volume 2, Issue 5, May 2015, PP 49-57 ISSN 2349-4751 (Print) & ISSN 2349-476X (Online) A Robust Method for Circle / Ellipse

More information

Feature Descriptors. CS 510 Lecture #21 April 29 th, 2013

Feature Descriptors. CS 510 Lecture #21 April 29 th, 2013 Feature Descriptors CS 510 Lecture #21 April 29 th, 2013 Programming Assignment #4 Due two weeks from today Any questions? How is it going? Where are we? We have two umbrella schemes for object recognition

More information

International Journal Of Global Innovations -Vol.4, Issue.I Paper Id: SP-V4-I1-P17 ISSN Online:

International Journal Of Global Innovations -Vol.4, Issue.I Paper Id: SP-V4-I1-P17 ISSN Online: IMPLEMENTATION OF EMBEDDED HUMAN TRACKING SYSTEM USING DM3730 DUALCORE PROCESSOR #1 DASARI ALEKHYA M.TECH Student, #2 Dr. SYED ABUDHAGIR.U Associate Professor, Dept of ECE B.V.RAJU INSTITUTE OF TECHNOLOGY,

More information

HOG-based Pedestriant Detector Training

HOG-based Pedestriant Detector Training HOG-based Pedestriant Detector Training evs embedded Vision Systems Srl c/o Computer Science Park, Strada Le Grazie, 15 Verona- Italy http: // www. embeddedvisionsystems. it Abstract This paper describes

More information

Coupling of Evolution and Learning to Optimize a Hierarchical Object Recognition Model

Coupling of Evolution and Learning to Optimize a Hierarchical Object Recognition Model Coupling of Evolution and Learning to Optimize a Hierarchical Object Recognition Model Georg Schneider, Heiko Wersing, Bernhard Sendhoff, and Edgar Körner Honda Research Institute Europe GmbH Carl-Legien-Strasse

More information

Decorrelated Local Binary Pattern for Robust Face Recognition

Decorrelated Local Binary Pattern for Robust Face Recognition International Journal of Advanced Biotechnology and Research (IJBR) ISSN 0976-2612, Online ISSN 2278 599X, Vol-7, Special Issue-Number5-July, 2016, pp1283-1291 http://www.bipublication.com Research Article

More information

GPU Accelerated Number Plate Localization in Crowded Situation

GPU Accelerated Number Plate Localization in Crowded Situation International Journal of Advances in Intelligent Informatics ISSN: 2442-6571 Vol 1, No 3, November 2015, pp. 150-157 150 GPU Accelerated Number Plate Localization in Crowded Situation Adhi Prahara a,1,*,

More information