OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES
|
|
- Susanna Reynolds
- 6 years ago
- Views:
Transcription
1 Image Processing & Communication, vol. 18,no. 1, pp DOI: /v x 31 OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES RAFAŁ KOZIK ADAM MARCHEWKA Institute of Telecommunications, University of Technology & Life Sciences in Bydgoszcz, ul. Kaliskiego 7, Bydgoszcz, Poland Abstract. In this paper an object recognition algorithm for mobile devices is presented. The algorithm is based on a hierarchical approach for visual information coding proposed by Riesenhuber and Poggio [1] and later extended by Serre et al. [2]. The proposed method adapts an efficient algorithm to extract the information about local gradients. This allows the algorithm to approximate the behaviour of simple cells layer of Riesenhuber and Poggio model. Moreover, it is also demonstrated that the proposed algorithm can be successfully deployed on a low-cost Android smartphone. 1 Introduction This paper is a continuation of author s recent research on object detection algorithms dedicate for mobile devices. The tentative results have been presented in [4]. In contrast to previous work, modifications to overall method architecture have been proposed. Moreover, the experiments described in this paper have been conducted for extended database of images. The proposed approach follows the idea of HMAX model proposed by Riesenhuber and Poggio [1]. It adapts the hierarchical fed-forward processing approach. However, modifications are introduced in order to reduce computational complexity of the original algorithm. Moreover, in contrast to Mutch and Lowe [10] model an additional layer that mimics the "retina codding" mechanism is added. The results showed that this step increases the robustness of the proposed method. However, it is considered as an optional step of visual information processing, because it introduces additional computational burden that can increase the response time (the frame rate drops significantly) of at low-cost mobile devices. In contrast to original HMAX model, a different method for calculating the S 1 layer response is proposed. The responses of 4 Gabor filters are approximated with algorithm that mimic the behaviour of HOG descriptors. The HOG [3] descriptors are commonly use for object recognition problems. The image is decomposed into small cells. In each cell an histogram of oriented gradients is used. Typically for HOG features the result is normalised and the descriptor is returned for each cell. However, in contrast to original HOG descriptor proposed by Dalal and Triggs, in this method (in order to simplify the computations) the block of cells are not overlapping, and
2 32 R. Kozik, A. Marchewka the input image is not normalised. Moreover, in previous work [7] the S 2 and C 2 layers were replaced with machine-learned classifier. Such approach significantly simplifies the learning process and decreases the amount of computations. However, this impacts the invariance to shape changes of a recognised object and the feature vectors presented to classifier are relatively of grater dimensionality. 2 The HMAX model overview The HMAX Visual Cortex model proposed by Riesenhuber and Poggio [1] exploits a hierarchical structure for the image processing and coding. It is arranged in several layers that process information in a bottom-up manner. The lowest layer is fed with a grayscale image. The higher layers of the model are either called "S" or "C". These names correspond to simple the (S) and complex (C) cells discovered by Hubel and Wiesel [9]. Both type of cells are located in the striate cortex (called V1), which is the part of visual cortex that lies in the most posterior area of the occipital lobe. The simple cells located in "S" layers apply local filters that responses form a vector of texture features. As noted by Hubel and Wiesel the individual cell in the cortex respond to the presence of edges. They also discovered that these cells are sensitive to edge orientation (some of cells fire only when a given orientation of an edge is observed). The complex cells located in "C" layers calculate in a limited range of a previous layer the strongest responses of a given type (orientation). That way more complex combination of simple features are obtained combined from three simple features. The hierarchical HMAX model proposed by Mutch and Lowe [10] is a modification of the model presented by Serre et al. in [2]. It introduces two layers of simple cells (S 1 and S 2 ) and two layers of complex cells (see Fig. 1). It uses set of filters designed to emulate V1 simple cells. The Fig. 1: The structure of a hierarchical model proposed by Mutch and Lowe [10]. (used symbols: I - image, S - simple cells, C - complex cells, GF - Gabor Filters, F - prototype features vectors, - convolution operation) simple layers are computed using a hard max filter [11]. It means that the "C" cells responses are the maximum values of the associated "S" cells. As it is shown in the Fig. 1 the images are processed by the subsequent simple and complex cells layers and reduced to feature vectors, which are further used in the classification process. The set of features (F ) is shared across all images and object categories. Features are computed hierarchically in subsequent layers built from the previous one by alternating the template matching and the max pooling operations. The S 1 layer in the Mutch and Lowe [10] model adapts the 2D Gabor filters computed for four orientations (horizontal, vertical, and two diagonals) at each possible position and scale. The Gabor filters are in size, and are described by: G(x, y) = e (X2 +γy 2 )/(2σ 2) cos( 2π λ ) (1) where X = x cos φ y sin φ and Y = x sin φ+y cos φ; x, y < 5; 5 >, and φ < 0; π >. The aspect ration (γ), effective width (σ), and wavelength (λ) are set to 0.3, 4.5 and 5.6 respectively. The response of a patch of pixels X to a particular filter G is computed using the formula (2). Xi G i R(X, G) = abs( ) (2) X 2 i The complex cells located in C 1 layer pool associated units in the S 1 layer. For each orientation, the S 1 re-
3 Image Processing & Communication, vol. 18, no. 1, pp sponses are convolved with a max filter, that is of size in x, y dimension (position) and has 2 units of deep in scale. As it is shown in the Fig. 1, the intermediate S 2 layer is formed by convolving the C 1 layer response with a set of intermediate-level features (depicted as F in Fig. 1). The set of intermediate-level features is established during the learning phase. For a given set of learning images C 1 responses are computed. The most meaningful features are selected using SVM weighting. Mutch and Lowe [10] suggested to sub-sample the C 1 responses before feature selection. Therefore, authors select at random positions and scales of the C 1 layer patches of size 4 4, 8 8, 12 12, and Selected and weighted features compose so called prototypes that are used in the Mutch and Lowe model as filters which responses create the S 2 layer. The C 2 layer composes a global feature vector which particular element corresponds to the maximum response to a given prototype patch. In order to identify the visual object on the basis of feature vector a classifier is learnt (e.g. SVM). 3 The proposed method In order to approximate the S 1 layer behaviour of the HMAX model the algorithms that mimics the HOG descriptors is used. However, to make this step as simple as possible (due to the low computation power of mobile devices) only gradient orientation binning schema is adapted. In fact, the intention here is no to compute HOG descriptor, but to approximate the response of four Gabor filters used in original HMAX model. In the S 1 layer there are N M 4 simple cells arranged in a grid of size N M blocks. In each block there are 4 cells. Each cell is assigned a receptive field (pixels inside the block). Each cell activates depend on a stimuli. In this case there are four possible stimulus, namely vertical, horizontal, left diagonal, and right diagonal edges. As a result the S 1 simple cells layer output has dimensionality of a size 4 (x, y, scale and 4 cells). In order to compute responses of all four cells inside a given block (receptive field), an Algorithm 1 is applied. Algorithm 1 Algorithm for calculating S 1 response Require: Grayscaled image I of W H size Ensure: S 1 layer of size N M 4 Assign G min the low-threshold for gradient magnitude. for each pixel (x, y) in image I do Compute horizontal G x and vertical G y gradients using Prewitt operator Compute gradient magnitude G x,y in point (x, y) n x N W ; m y M W if G < G min then go to next pixel else active get_cell_type(g x, G y ) S 1 [n, m, active] S 1 [n, m, active] + G x,y end if end for The algorithm computes the responses of all cells using only one iteration over the whole input image I. For each pixel at position (x, y) a vertical and horizontal gradients are computed (G x and G y ). Given the pixel position (x, y) and gradient vector [G x, G y ] the algorithm indicates the block position (n, m) and type of cell (active) that response has to be incremented by G. In order to classify given gradient vector [G x, G y ] as horizontal, diagonal or vertical the get_cell_type(, ) function uses simple schema for voting. If a point ( G x, G y ) is located between line y = 0.3 x and y = 3.3 x it is classified as a diagonal. If G y is positive then the vector is classified as a right diagonal (otherwise it is a left diagonal). In case the point ( G x, G y ) is located above line y = 3.3 x the gradient vector is classified as vertical and as horizontal when it lies below line y = 0.3 x. The processing in S 1 layer is applied for three scales (an original image, and two images scaled by a factor of 0.7 and 1.5). This is a necessary operation to achieve the response of C 1 layer. The C 1 complex layer response is
4 34 R. Kozik, A. Marchewka computed using max-pooling filter, that is applied to S 1 layer. This layer allows the algorithm to achieve scale invariance. The output of the C 1 layer cells is used in order to select features that are meaningful for a given task of an object detection. Firstly, randomly selected images of an object are reduced to a features vectors by a consecutive layers of the proposed model. Afterwards, the PCA is applied to calculate the eigenvectors. Each eigenvector is used as a single feature in a features vector. The response of S 2 layer is obtained using eigenvectors to compute the dot product with the C 1 layer. Finally, an classifier is learned to recognise an object using S 2 response layer. For this paper different machinelearnt algorithms are investigated. 4 Experiments and Results was possible to achieve 95,6% of detection rate while having 2.3% of false positives. It can be noticed that adding additional trees does not improve the effectiveness. The early prototype of proposed algorithm was deployed on an Android device. The code was written in pure Java and tested on Samsung Galaxy Ace device. This device is equipped with 800 MHz CPU, 278 MB of RAM and Android 2.3 operating system. Current version implements brute force Nearest Neighbour classifier, operates only for one scale (PC scans three scale - an original image, and two images scaled by a factor of 0.7 and 1.5), and achieves about 10 FPS when less than 10 training samples are provided. Some examples of object detection are shown in Fig. 3. During the testing it was noticed that the algorithm is able to recognise an object on a cluttered background even if only few learning samples are provided. In contrast to the previous work [4], the data base of images has been extended and new experiments have been conducted, which aimed at evaluating the effectiveness of different classifiers. In this work tree-based and rule-based classification approaches have been compared, namely N (5,10,15) Random Trees, J48 and PART (previously used by author in [5, 6, 4]) algorithms. The comparison has been presented in Fig.2. For the experiments the dataset was randomly split for training and testing samples (in proportion 70% to 30% respectively). For training samples the C 1 layers responses were calculated and eigenvectors were extracted. The eigenvectors were used to calculate the S 2 layers responses that were further used to learn classifier. During the testing phase the eigenvectors were again used in order to extract S 2 layers responses for testing samples. The whole procedure was repeated 10 times ( the dataset was randomly reshuffled before split) and the results were averaged. The best results were obtained for 10 Random Trees. It Fig. 3: Example of object detection (mug and doors) with proposed algorithm deployed on an Android device 5 Conclusions In this paper an algorithm for object recognition dedicated form mobile devices was presented. The algorithm is based on a hierarchical approach for visual information coding proposed by Riesenhuber and Poggio [1] and later
5 False Positive Rate Image Processing & Communication, vol. 18, no. 1, pp ,04 0,035 0,03 0,025 0,02 0,015 0,01 0, random trees 5 random trees 15 random trees J48 PART Expon. (10 random trees) Expon. (5 random trees) Expon. (15 random trees) Expon. (J48) Expon. (PART) 0 0,78 0,8 0,82 0,84 0,86 0,88 0,9 0,92 0,94 0,96 0,98 Detection Rate Fig. 2: Different classifiers effectiveness extended by Serre et al. [2]. The experiments show that the introduced algorithm allows for efficient feature extraction and a visual information coding. Moreover, it was shown that it is also possible to deploy proposed approach on a low-cost mobile device. The results are promising and show that described in this paper algorithm can be further investigated as one of assistive solutions dedicated for visually impaired people. References [1] M. Riesenhuber, T. Poggio, Hierarchical models of object recognition in cortex, Nat Neurosci, Vol. 2, pp , 1999 [2] T. Serre, G. Kreiman, M. Kouh, C. Cadieu, U. Knoblich T. Poggio, A quantitative theory of immediate visual recognition, In:Progress in Brain Research, Computational Neuroscience: Theoretical Insights into Brain Function, Vol. 165, pp , 2007 [3] N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, Computer Vision and Pattern Recognition, CVPR 2005, IEEE Computer Society Conference on, Vol.1, pp , June 2005 [4] R. Kozik, A Simplified Visual Cortex Model for Efficient Image Codding and Object Recognition, Image Processing and Communications Challenges 5, Advances in Intelligent Systems and Computing, pp , 2013 [5] R. Kozik, Rapid Threat Detection for Stereovision Mobility Aid System, In: T. Czachorski et al. (Eds.): Man-Machine Interactions 2, AISC 103, pp , 2011 [6] R. Kozik, Stereovision system for visually impaired. Burduk, Robert (ed.) et al., Computer recognition systems 4, Advances in Intelligent and Soft Computing 95, pp , 2011 [7] R. Kozik, A Proposal of Biologically Inspired Hi-
6 36 R. Kozik, A. Marchewka erarchical Approach to Object Recognition, Journal of Medical Informatics & Technologies, Vol. 22/2013, ISSN , pp , 2013 [8] R. Kozik, A Simplified Visual Cortex Model for Efficient Image Codding and Object Recognition, Image Processing and Communications Challenges 5, Advances in Intelligent Systems and Computing S. Choras, Ryszard, pp , 2013 [9] D.H. Hubel, T.N. Wiesel Receptive fields, binocular interaction and functional architecture in the cat s visual cortex, J Physiol, Vol. 160, pp , 1962 [10] J. Mutch, D.G. Lowe, Object class recognition and localization using sparse features with limited receptive fields,international Journal of Computer Vision (IJCV), Vol. 80, No. 1, pp , October 2008 [11] MAX pooling, php/pooling
Developing Open Source code for Pyramidal Histogram Feature Sets
Developing Open Source code for Pyramidal Histogram Feature Sets BTech Project Report by Subodh Misra subodhm@iitk.ac.in Y648 Guide: Prof. Amitabha Mukerjee Dept of Computer Science and Engineering IIT
More informationNicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science
V1-inspired orientation selective filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this
More informationBiologically-Inspired Translation, Scale, and rotation invariant object recognition models
Rochester Institute of Technology RIT Scholar Works Theses Thesis/Dissertation Collections 2007 Biologically-Inspired Translation, Scale, and rotation invariant object recognition models Myung Woo Follow
More informationNicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science
2D Gabor functions and filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this presentation
More informationModeling Visual Cortex V4 in Naturalistic Conditions with Invari. Representations
Modeling Visual Cortex V4 in Naturalistic Conditions with Invariant and Sparse Image Representations Bin Yu Departments of Statistics and EECS University of California at Berkeley Rutgers University, May
More informationObject Detection Design challenges
Object Detection Design challenges How to efficiently search for likely objects Even simple models require searching hundreds of thousands of positions and scales Feature design and scoring How should
More informationNicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science
2D Gabor functions and filters for image processing and computer vision Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science 2 Most of the images in this presentation
More informationMORPH-II: Feature Vector Documentation
MORPH-II: Feature Vector Documentation Troy P. Kling NSF-REU Site at UNC Wilmington, Summer 2017 1 MORPH-II Subsets Four different subsets of the MORPH-II database were selected for a wide range of purposes,
More informationFeature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking
Feature descriptors Alain Pagani Prof. Didier Stricker Computer Vision: Object and People Tracking 1 Overview Previous lectures: Feature extraction Today: Gradiant/edge Points (Kanade-Tomasi + Harris)
More informationHuman Vision Based Object Recognition Sye-Min Christina Chan
Human Vision Based Object Recognition Sye-Min Christina Chan Abstract Serre, Wolf, and Poggio introduced an object recognition algorithm that simulates image processing in visual cortex and claimed to
More informationExploration of Hardware Acceleration for a Neuromorphic Visual Classification System
Exploration of Hardware Acceleration for a Neuromorphic Visual Classification System Ikenna J. Okafor, McNair Scholar The Pennsylvania State University McNair Faculty Research Advisors: Kevin M. Irick,
More informationAn Implementation on Histogram of Oriented Gradients for Human Detection
An Implementation on Histogram of Oriented Gradients for Human Detection Cansın Yıldız Dept. of Computer Engineering Bilkent University Ankara,Turkey cansin@cs.bilkent.edu.tr Abstract I implemented a Histogram
More informationhttps://en.wikipedia.org/wiki/the_dress Recap: Viola-Jones sliding window detector Fast detection through two mechanisms Quickly eliminate unlikely windows Use features that are fast to compute Viola
More information2D Image Processing Feature Descriptors
2D Image Processing Feature Descriptors Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Overview
More informationHuman detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah
Human detection using histogram of oriented gradients Srikumar Ramalingam School of Computing University of Utah Reference Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,
More informationHuman Motion Detection and Tracking for Video Surveillance
Human Motion Detection and Tracking for Video Surveillance Prithviraj Banerjee and Somnath Sengupta Department of Electronics and Electrical Communication Engineering Indian Institute of Technology, Kharagpur,
More informationObject Category Detection: Sliding Windows
04/10/12 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical
More informationOnline Learning for Object Recognition with a Hierarchical Visual Cortex Model
Online Learning for Object Recognition with a Hierarchical Visual Cortex Model Stephan Kirstein, Heiko Wersing, and Edgar Körner Honda Research Institute Europe GmbH Carl Legien Str. 30 63073 Offenbach
More informationThe SIFT (Scale Invariant Feature
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor developed by David Lowe University of British Columbia Initial paper ICCV 1999 Newer journal paper IJCV 2004 Review: Matt Brown s Canonical
More informationComputational Models of V1 cells. Gabor and CORF
1 Computational Models of V1 cells Gabor and CORF George Azzopardi Nicolai Petkov 2 Primary visual cortex (striate cortex or V1) 3 Types of V1 Cells Hubel and Wiesel, Nobel Prize winners Three main types
More informationSketchable Histograms of Oriented Gradients for Object Detection
Sketchable Histograms of Oriented Gradients for Object Detection No Author Given No Institute Given Abstract. In this paper we investigate a new representation approach for visual object recognition. The
More informationTraffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers
Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers A. Salhi, B. Minaoui, M. Fakir, H. Chakib, H. Grimech Faculty of science and Technology Sultan Moulay Slimane
More informationA Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images
A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images Marc Aurelio Ranzato Yann LeCun Courant Institute of Mathematical Sciences New York University - New York, NY 10003 Abstract
More informationDiscriminative classifiers for image recognition
Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study
More informationAutomated Canvas Analysis for Painting Conservation. By Brendan Tobin
Automated Canvas Analysis for Painting Conservation By Brendan Tobin 1. Motivation Distinctive variations in the spacings between threads in a painting's canvas can be used to show that two sections of
More informationDeep Convolutional Neural Networks. Nov. 20th, 2015 Bruce Draper
Deep Convolutional Neural Networks Nov. 20th, 2015 Bruce Draper Background: Fully-connected single layer neural networks Feed-forward classification Trained through back-propagation Example Computer Vision
More informationDetecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds
9 1th International Conference on Document Analysis and Recognition Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds Weihan Sun, Koichi Kise Graduate School
More informationA Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images
A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images Marc Aurelio Ranzato Yann LeCun Courant Institute of Mathematical Sciences New York University - New York, NY 10003 Abstract
More informationDeep Learning. Deep Learning. Practical Application Automatically Adding Sounds To Silent Movies
http://blog.csdn.net/zouxy09/article/details/8775360 Automatic Colorization of Black and White Images Automatically Adding Sounds To Silent Movies Traditionally this was done by hand with human effort
More informationMULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION
MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION Panca Mudjirahardjo, Rahmadwati, Nanang Sulistiyanto and R. Arief Setyawan Department of Electrical Engineering, Faculty of
More informationObject recognition in images by human vision and computer vision Chen, Q.; Dijkstra, J.; de Vries, B.
Object recognition in images by human vision and computer vision Chen, Q.; Dijkstra, J.; de Vries, B. Published in: Conference proceeding of 10th International Conference on Design & Decision Support Systems
More informationFace Detection Using Convolutional Neural Networks and Gabor Filters
Face Detection Using Convolutional Neural Networks and Gabor Filters Bogdan Kwolek Rzeszów University of Technology W. Pola 2, 35-959 Rzeszów, Poland bkwolek@prz.rzeszow.pl Abstract. This paper proposes
More informationRobust PDF Table Locator
Robust PDF Table Locator December 17, 2016 1 Introduction Data scientists rely on an abundance of tabular data stored in easy-to-machine-read formats like.csv files. Unfortunately, most government records
More informationAnalysis of features vectors of images computed on the basis of hierarchical models of object recognition in cortex
Analysis of features vectors of images computed on the basis of hierarchical models of object recognition in cortex Ankit Agrawal(Y7057); Jyoti Meena(Y7184) November 22, 2010 SE 367 : Introduction to Cognitive
More informationCar Detecting Method using high Resolution images
Car Detecting Method using high Resolution images Swapnil R. Dhawad Department of Electronics and Telecommunication Engineering JSPM s Rajarshi Shahu College of Engineering, Savitribai Phule Pune University,
More informationA Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection Stanley M Bileschi
Computer Science and Artificial Intelligence Laboratory Technical Report MIT-CSAIL-TR-2008-019 CBCL-271 April 9, 2008 A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection
More informationSparse Models in Image Understanding And Computer Vision
Sparse Models in Image Understanding And Computer Vision Jayaraman J. Thiagarajan Arizona State University Collaborators Prof. Andreas Spanias Karthikeyan Natesan Ramamurthy Sparsity Sparsity of a vector
More informationAugmented Reality VU. Computer Vision 3D Registration (2) Prof. Vincent Lepetit
Augmented Reality VU Computer Vision 3D Registration (2) Prof. Vincent Lepetit Feature Point-Based 3D Tracking Feature Points for 3D Tracking Much less ambiguous than edges; Point-to-point reprojection
More informationCS 523: Multimedia Systems
CS 523: Multimedia Systems Angus Forbes creativecoding.evl.uic.edu/courses/cs523 Today - Convolutional Neural Networks - Work on Project 1 http://playground.tensorflow.org/ Convolutional Neural Networks
More informationOverview. Object Recognition. Neurobiology of Vision. Invariances in higher visual cortex
3 / 27 4 / 27 Overview Object Recognition Mark van Rossum School of Informatics, University of Edinburgh January 15, 2018 Neurobiology of Vision Computational Object Recognition: What s the Problem? Fukushima
More informationHistogram of Oriented Gradients for Human Detection
Histogram of Oriented Gradients for Human Detection Article by Navneet Dalal and Bill Triggs All images in presentation is taken from article Presentation by Inge Edward Halsaunet Introduction What: Detect
More informationA Novel Extreme Point Selection Algorithm in SIFT
A Novel Extreme Point Selection Algorithm in SIFT Ding Zuchun School of Electronic and Communication, South China University of Technolog Guangzhou, China zucding@gmail.com Abstract. This paper proposes
More informationObject Tracking using HOG and SVM
Object Tracking using HOG and SVM Siji Joseph #1, Arun Pradeep #2 Electronics and Communication Engineering Axis College of Engineering and Technology, Ambanoly, Thrissur, India Abstract Object detection
More informationTongue Recognition From Images
Tongue Recognition From Images Ryszard S. Choraś Institute of Telecommunications and Computer Science UTP University of Sciences and Technology 85-796 Bydgoszcz, Poland Email: choras@utp.edu.pl Abstract
More informationRecap Image Classification with Bags of Local Features
Recap Image Classification with Bags of Local Features Bag of Feature models were the state of the art for image classification for a decade BoF may still be the state of the art for instance retrieval
More informationLearning Feature Hierarchies for Object Recognition
Learning Feature Hierarchies for Object Recognition Koray Kavukcuoglu Computer Science Department Courant Institute of Mathematical Sciences New York University Marc Aurelio Ranzato, Kevin Jarrett, Pierre
More informationCategory vs. instance recognition
Category vs. instance recognition Category: Find all the people Find all the buildings Often within a single image Often sliding window Instance: Is this face James? Find this specific famous building
More informationHistogram of Oriented Phase (HOP): A New Descriptor Based on Phase Congruency
University of Dayton ecommons Electrical and Computer Engineering Faculty Publications Department of Electrical and Computer Engineering 5-2016 Histogram of Oriented Phase (HOP): A New Descriptor Based
More informationTri-modal Human Body Segmentation
Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4
More informationFacial Expression Classification with Random Filters Feature Extraction
Facial Expression Classification with Random Filters Feature Extraction Mengye Ren Facial Monkey mren@cs.toronto.edu Zhi Hao Luo It s Me lzh@cs.toronto.edu I. ABSTRACT In our work, we attempted to tackle
More informationWindow based detectors
Window based detectors CS 554 Computer Vision Pinar Duygulu Bilkent University (Source: James Hays, Brown) Today Window-based generic object detection basic pipeline boosting classifiers face detection
More informationIllumination-Robust Face Recognition based on Gabor Feature Face Intrinsic Identity PCA Model
Illumination-Robust Face Recognition based on Gabor Feature Face Intrinsic Identity PCA Model TAE IN SEOL*, SUN-TAE CHUNG*, SUNHO KI**, SEONGWON CHO**, YUN-KWANG HONG*** *School of Electronic Engineering
More informationBio-inspired Binocular Disparity with Position-Shift Receptive Field
Bio-inspired Binocular Disparity with Position-Shift Receptive Field Fernanda da C. e C. Faria, Jorge Batista and Helder Araújo Institute of Systems and Robotics, Department of Electrical Engineering and
More informationClassification of objects from Video Data (Group 30)
Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time
More informationMassively Parallel Multiclass Object Recognition
Vision, Modeling, and Visualization (2010) Massively Parallel Multiclass Object Recognition Helmut Sedding, Ferdinand Deger, Holger Dammertz, Jan Bouecke, Hendrik P. A. Lensch Ulm University, Germany Abstract
More informationIntroduction to Deep Learning
ENEE698A : Machine Learning Seminar Introduction to Deep Learning Raviteja Vemulapalli Image credit: [LeCun 1998] Resources Unsupervised feature learning and deep learning (UFLDL) tutorial (http://ufldl.stanford.edu/wiki/index.php/ufldl_tutorial)
More informationRobotics Programming Laboratory
Chair of Software Engineering Robotics Programming Laboratory Bertrand Meyer Jiwon Shin Lecture 8: Robot Perception Perception http://pascallin.ecs.soton.ac.uk/challenges/voc/databases.html#caltech car
More informationA New Algorithm for Shape Detection
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 19, Issue 3, Ver. I (May.-June. 2017), PP 71-76 www.iosrjournals.org A New Algorithm for Shape Detection Hewa
More informationANALYSIS DIRECTIONAL FEATURES IN IMAGES USING GABOR FILTERS
ANALYSIS DIRECTIONAL FEATURES IN IMAGES USING GABOR FILTERS Z.-Q. Liu*, R. M. Rangayyan*, and C. B. Frank** Departments of Electrical Engineering* and Surgery**, The University of Calgary Calgary, Alberta,
More informationMobile Human Detection Systems based on Sliding Windows Approach-A Review
Mobile Human Detection Systems based on Sliding Windows Approach-A Review Seminar: Mobile Human detection systems Njieutcheu Tassi cedrique Rovile Department of Computer Engineering University of Heidelberg
More informationHISTOGRAMS OF ORIENTATIO N GRADIENTS
HISTOGRAMS OF ORIENTATIO N GRADIENTS Histograms of Orientation Gradients Objective: object recognition Basic idea Local shape information often well described by the distribution of intensity gradients
More informationComputer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia
Application Object Detection Using Histogram of Oriented Gradient For Artificial Intelegence System Module of Nao Robot (Control System Laboratory (LSKK) Bandung Institute of Technology) A K Saputra 1.,
More informationObtaining Feature Correspondences
Obtaining Feature Correspondences Neill Campbell May 9, 2008 A state-of-the-art system for finding objects in images has recently been developed by David Lowe. The algorithm is termed the Scale-Invariant
More informationSURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image
SURF CSED441:Introduction to Computer Vision (2015S) Lecture6: SURF and HOG Bohyung Han CSE, POSTECH bhhan@postech.ac.kr Speed Up Robust Features (SURF) Simplified version of SIFT Faster computation but
More informationConvolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech
Convolutional Neural Networks Computer Vision Jia-Bin Huang, Virginia Tech Today s class Overview Convolutional Neural Network (CNN) Training CNN Understanding and Visualizing CNN Image Categorization:
More informationEnhanced object recognition in cortex-like machine vision
Enhanced object recognition in cortex-like machine vision Aristeidis Tsitiridis 1, Peter WT Yuen 1, Izzati Ibrahim 1, Umar Soori 1, Tong Chen 1, Kan Hong 1, Zhengjie Wang 2, David James 1 and Mark Richardson
More informationEXPRESSION-INDEPENDENT FACE RECOGNITION USING BIOLOGICALLY INSPIRED FEATURES
Abstract EXPRESSION-INDEPENDEN FACE RECOGNIION USING BIOLOGICALLY INSPIRED FEAURES REZA EBRAHIMPOUR, AHMAD JAHANI, ALI AMIRI AND MASOOM NAZARI Brain & Intelligent Systems Research Lab,Department of Electrical
More informationAdvanced Video Content Analysis and Video Compression (5LSH0), Module 4
Advanced Video Content Analysis and Video Compression (5LSH0), Module 4 Visual feature extraction Part I: Color and texture analysis Sveta Zinger Video Coding and Architectures Research group, TU/e ( s.zinger@tue.nl
More informationHand Posture Recognition Using Convolutional Neural Network
Hand Posture Recognition Using Convolutional Neural Network Dennis Núñez Fernández 2 and Bogdan Kwolek 1 1 AGH University of Science and Technology, 30 Mickiewicza, 30-059 Krakow, Poland http://home.agh.edu.pl/~bkw/contact.html
More informationVisual object classification by sparse convolutional neural networks
Visual object classification by sparse convolutional neural networks Alexander Gepperth 1 1- Ruhr-Universität Bochum - Institute for Neural Dynamics Universitätsstraße 150, 44801 Bochum - Germany Abstract.
More informationObject-Based Saliency Maps Harry Marr Computing BSc 2009/2010
Object-Based Saliency Maps Harry Marr Computing BSc 2009/2010 The candidate confirms that the work submitted is their own and the appropriate credit has been given where reference has been made to the
More informationSpeeding up the Detection of Line Drawings Using a Hash Table
Speeding up the Detection of Line Drawings Using a Hash Table Weihan Sun, Koichi Kise 2 Graduate School of Engineering, Osaka Prefecture University, Japan sunweihan@m.cs.osakafu-u.ac.jp, 2 kise@cs.osakafu-u.ac.jp
More informationHistograms of Oriented Gradients for Human Detection p. 1/1
Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs INRIA Rhône-Alpes Grenoble, France Funding: acemedia, LAVA,
More informationSparsity-Regularized HMAX for Visual Recognition
1 Sparsity-Regularized HMAX for Visual Recognition Xiaolin Hu 1,, Jianwei Zhang 2, Jianmin Li 1, Bo Zhang 1 1 State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for
More informationTEXTURE CLASSIFICATION METHODS: A REVIEW
TEXTURE CLASSIFICATION METHODS: A REVIEW Ms. Sonal B. Bhandare Prof. Dr. S. M. Kamalapur M.E. Student Associate Professor Deparment of Computer Engineering, Deparment of Computer Engineering, K. K. Wagh
More informationModern Object Detection. Most slides from Ali Farhadi
Modern Object Detection Most slides from Ali Farhadi Comparison of Classifiers assuming x in {0 1} Learning Objective Training Inference Naïve Bayes maximize j i logp + logp ( x y ; θ ) ( y ; θ ) i ij
More informationComputer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction
Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction Preprocessing The goal of pre-processing is to try to reduce unwanted variation in image due to lighting,
More informationPreliminary Local Feature Selection by Support Vector Machine for Bag of Features
Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Tetsu Matsukawa Koji Suzuki Takio Kurita :University of Tsukuba :National Institute of Advanced Industrial Science and
More informationFeature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1
Feature Detection Raul Queiroz Feitosa 3/30/2017 Feature Detection 1 Objetive This chapter discusses the correspondence problem and presents approaches to solve it. 3/30/2017 Feature Detection 2 Outline
More informationHuman detections using Beagle board-xm
Human detections using Beagle board-xm CHANDAN KUMAR 1 V. AJAY KUMAR 2 R. MURALI 3 1 (M. TECH STUDENT, EMBEDDED SYSTEMS, DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING, VIJAYA KRISHNA INSTITUTE
More informationIntroduction to visual computation and the primate visual system
Introduction to visual computation and the primate visual system Problems in vision Basic facts about the visual system Mathematical models for early vision Marr s computational philosophy and proposal
More informationFace Recognition Based On Granular Computing Approach and Hybrid Spatial Features
Face Recognition Based On Granular Computing Approach and Hybrid Spatial Features S.Sankara vadivu 1, K. Aravind Kumar 2 Final Year Student of M.E, Department of Computer Science and Engineering, Manonmaniam
More informationFuzzy based Multiple Dictionary Bag of Words for Image Classification
Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2196 2206 International Conference on Modeling Optimisation and Computing Fuzzy based Multiple Dictionary Bag of Words for Image
More informationMotion Estimation and Optical Flow Tracking
Image Matching Image Retrieval Object Recognition Motion Estimation and Optical Flow Tracking Example: Mosiacing (Panorama) M. Brown and D. G. Lowe. Recognising Panoramas. ICCV 2003 Example 3D Reconstruction
More informationFiltering, scale, orientation, localization, and texture. Nuno Vasconcelos ECE Department, UCSD (with thanks to David Forsyth)
Filtering, scale, orientation, localization, and texture Nuno Vasconcelos ECE Department, UCSD (with thanks to David Forsyth) Beyond edges we have talked a lot about edges while they are important, it
More informationA new approach to reference point location in fingerprint recognition
A new approach to reference point location in fingerprint recognition Piotr Porwik a) and Lukasz Wieclaw b) Institute of Informatics, Silesian University 41 200 Sosnowiec ul. Bedzinska 39, Poland a) porwik@us.edu.pl
More informationFace and Nose Detection in Digital Images using Local Binary Patterns
Face and Nose Detection in Digital Images using Local Binary Patterns Stanko Kružić Post-graduate student University of Split, Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture
More informationObject Recognition II
Object Recognition II Linda Shapiro EE/CSE 576 with CNN slides from Ross Girshick 1 Outline Object detection the task, evaluation, datasets Convolutional Neural Networks (CNNs) overview and history Region-based
More informationStudy of Viola-Jones Real Time Face Detector
Study of Viola-Jones Real Time Face Detector Kaiqi Cen cenkaiqi@gmail.com Abstract Face detection has been one of the most studied topics in computer vision literature. Given an arbitrary image the goal
More informationHistograms of Oriented Gradients
Histograms of Oriented Gradients Carlo Tomasi September 18, 2017 A useful question to ask of an image is whether it contains one or more instances of a certain object: a person, a face, a car, and so forth.
More informationBiometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong)
Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong) References: [1] http://homepages.inf.ed.ac.uk/rbf/hipr2/index.htm [2] http://www.cs.wisc.edu/~dyer/cs540/notes/vision.html
More informationFAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO
FAST HUMAN DETECTION USING TEMPLATE MATCHING FOR GRADIENT IMAGES AND ASC DESCRIPTORS BASED ON SUBTRACTION STEREO Makoto Arie, Masatoshi Shibata, Kenji Terabayashi, Alessandro Moro and Kazunori Umeda Course
More informationEECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline
EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT Oct. 15, 2013 Prof. Ronald Fearing Electrical Engineering and Computer Sciences University of California, Berkeley (slides courtesy of Prof. John Wawrzynek)
More informationA Robust Method for Circle / Ellipse Extraction Based Canny Edge Detection
International Journal of Research Studies in Science, Engineering and Technology Volume 2, Issue 5, May 2015, PP 49-57 ISSN 2349-4751 (Print) & ISSN 2349-476X (Online) A Robust Method for Circle / Ellipse
More informationFeature Descriptors. CS 510 Lecture #21 April 29 th, 2013
Feature Descriptors CS 510 Lecture #21 April 29 th, 2013 Programming Assignment #4 Due two weeks from today Any questions? How is it going? Where are we? We have two umbrella schemes for object recognition
More informationInternational Journal Of Global Innovations -Vol.4, Issue.I Paper Id: SP-V4-I1-P17 ISSN Online:
IMPLEMENTATION OF EMBEDDED HUMAN TRACKING SYSTEM USING DM3730 DUALCORE PROCESSOR #1 DASARI ALEKHYA M.TECH Student, #2 Dr. SYED ABUDHAGIR.U Associate Professor, Dept of ECE B.V.RAJU INSTITUTE OF TECHNOLOGY,
More informationHOG-based Pedestriant Detector Training
HOG-based Pedestriant Detector Training evs embedded Vision Systems Srl c/o Computer Science Park, Strada Le Grazie, 15 Verona- Italy http: // www. embeddedvisionsystems. it Abstract This paper describes
More informationCoupling of Evolution and Learning to Optimize a Hierarchical Object Recognition Model
Coupling of Evolution and Learning to Optimize a Hierarchical Object Recognition Model Georg Schneider, Heiko Wersing, Bernhard Sendhoff, and Edgar Körner Honda Research Institute Europe GmbH Carl-Legien-Strasse
More informationDecorrelated Local Binary Pattern for Robust Face Recognition
International Journal of Advanced Biotechnology and Research (IJBR) ISSN 0976-2612, Online ISSN 2278 599X, Vol-7, Special Issue-Number5-July, 2016, pp1283-1291 http://www.bipublication.com Research Article
More informationGPU Accelerated Number Plate Localization in Crowded Situation
International Journal of Advances in Intelligent Informatics ISSN: 2442-6571 Vol 1, No 3, November 2015, pp. 150-157 150 GPU Accelerated Number Plate Localization in Crowded Situation Adhi Prahara a,1,*,
More information