Cephalometric Landmarks Localization Based on Histograms of Oriented Gradients

Size: px
Start display at page:

Download "Cephalometric Landmarks Localization Based on Histograms of Oriented Gradients"

Transcription

1 Cephalometric Landmarks Localization Based on Histograms of Oriented Gradients Ali A. Pouyan and M. Farshbaf, Members, IEEE School of Computer Engineering Shahrood University of Technology Shahrood, Iran Abstract In this paper, we explored the use of certain image features, block-wise histograms of local orientations. They are used in many current object recognition algorithms, for the task of locating cephalometric landmarks on X-ray images. After reviewing existing cephalometric landmark detection systems, we show experimentally that grids of Histograms of Oriented Gradients (HOG) descriptors significantly perform well on this task. The influence of bin and detection window size on performance for three landmarks has been studied. It has been shown that, fine orientation binning, large enough detection windows which contain all features around landmark are all important for good estimation of landmark position. Key words cephalometric landmark detection, pattern recognition, classification, histograms of oriented gradients. 1. INTRODUCTION Cephalometry, being the scientific measurement of the head s dimensions on x-ray images, is widely used in orthodontics, orthopedics, and other areas of oral and maxillofacial surgery to assess craniofacial growth and plan treatment. It is generally based on a set of agreed feature points called cephalometric landmarks. Once the landmarks are located, the measurement and analysis of various angular and linear parameters can be easily performed. Therefore, locating craniofacial landmarks on x-rays is of great importance for cephalometric analysis. However, manual landmarking is a tedious and time consuming process which takes about 15 to 20 minutes on each case for an experienced surgeon. Computerized landmarks localization is an interesting way to release these difficulties [1, 2, 3]. The number of defined landmarks is quite large. It is noticeable that many of these landmarks are rarely used. Reference [4]defined a smaller set of landmarks containing the most frequently used landmarks by orthodontists in what is called cephalometric evaluation[5]. The positions of some of these landmarks are demonstrated in Fig.1. Table 1 presents the definition of landmarks used in this paper. To determine the effectiveness of HOG features 3 landmarks have been selected. One easy landmark, Gnathion, one of medium difficulty, Incisal Upper Incisor Tip and a hard landmark, Sella. Detecting these landmarks in images is a challenging task owning to human variable skull appearance and other variations such as rotation, caused by the patient being improperly posed while taking the X-ray, or by the improper placement of the X-ray film during digitization. Brightness variety, controlled by the radiographer, and by the f-number of the camera aperture causes another challenge in landmark detection [6]. The first need to find the landmarks is a robust feature set that allows the landmark regions be discriminated clearly even under difficult illumination. We study the issue of locally normalized Histograms of Oriented Gradient (HOG) descriptors for providing a first estimation of the region that the landmark can be found within. The estimated position of the landmark is also detected by referring to a predefined point in the detection window. We make a detailed study around the effects of various implementation choices for detection window sizes and orientation bin sizes on detector performance. In this paper Support Vector Machines (SVM)are used as a base line classifier. The performance of this approach is evaluated on the cephalograms data set provided by [7]. We briefly discuss works on cephalometric landmark detection methods besides a brief description of HOG in section 2, describe our data set in section 3, give an overview of our method in section 4 and present a detailed description and experimental evaluation of each stage of the process in section 5. The main conclusion is summarized in section PREVIOUS WORKS There have been previous attempts to automate cephalometric analysis with the aim of reducing the time required to obtain an analysis, improving the accuracy of landmark identification and reducing the errors due to clinician subjectivity. These methods have been categorized in 4 classes [8]: The first class called handcrafted algorithms, are knowledge based methods that locate landmarks on relevant edges. Reference [9] located landmarks on edges after noise filtering and image enhancement. Based on their works some of the methods in this class have been improved by using a resolution pyramid to extract relevant lines in a given region [10, 11, 12]. References [13, 14, 15, 16, 17, 18] presented similar edge tracking methods. PCNN was used by [19] to highlight regions and then line following is employed to find the position of the landmarks. All of these edge tracking methods are dependent on the quality of the x-ray and give good results for landmarks which are on or near to edges in the image. The second class of researchers used mathematical or statistical models to reduce the search area. Reference [20] used sub-image matching based on gray-scale mathematical morphology. Reference [7] used

2 image based on extracting features from face profile and the algorithm found landmarks by mapping. Reference [32] modeled the size of skull, rotation and translation by some feature points. Edge detection and NN was employed to estimate the possible coordinates of landmarks. Then a modified ASM is applied to locate the exact location of landmarks. Our method, in this paper, can be classified in the 4th class of researches. Figure 1.Location of landmarks used in [7]. Table 1.Definition of landmarks used in this paper. Landmark Definition Sella The Midpoint of the hyperphysical fossa Gnathion The most anterior and inferior point of the Incisal Upper Incisor Tip bony chin Tip of the crown of the most anterior maxillary central incisor Active Shape Model. Special spectroscopy was used by [21] to establish the statistical gray model. In [22] improved the work of [20] by using a line detection module to search for the most significant lines then utilized mathematical morphology approach similar to the one used by [20]. The third class of researchers used Neural Networks, genetic algorithms and fuzzy systems to locate landmarks. References [6, 24] used a combination of Neural Networks and genetic algorithms. Reference [5] used a combination of Neural Networks and fuzzy systems. Reference [24] used SVM in conjunction with Projected Principal-Edge Distribution (PPED) representation as feature vectors, to detect 16 landmarks. Template matching and fuzzy decision making was used in [25], developing an application of iterative self-organizing data (ISODATA) pattern recognition technique together with a fuzzy logic decision making algorithm in order to increase accuracy of the results. Recently the 4th class of researchers uses a combination of those three classes. References [27,2] divided every training shape to 10 regions, and for every region, Principal component Analysis (PCA) is employed to characterize its shape and gray profile statistics. For an input image, some reference landmarks are recognized and the input shape is divided using the landmarks, and then landmarks are located by an active shape model. Reference[27] introduced a method based on Partial Least Square Regression (PLSR) to extract the relation between selected points coordinates on X-ray images and the expected location of set of landmarks. CNN was used by [28, 29] for highlighting the region. In these approaches identification of each landmark s coordinates have been done by appropriate CNN templates, then landmark-specific algorithms have been applied to pinpoint the landmark location. In [1] estimated the initial location for each landmark based on the similarities in human anatomical structure. A modification on using Active Shape Models (ASM) was applied by[8, 30]. The possible coordinates of landmarks were estimated by using LVQ for every new image, and active shape model used for pinpoint the exact location of landmarks. In[31]points were registered on the reference 3. DATA SET AND METHODOLOGY We evaluated out our method on the dataset provided by [7] that is reported containing 63 pretreatment cephalograms with good quality and condition. It is provided that the cephalograms were scanned at 100 dpi and 750 x 950 pixels with 256 gray levels. The position of 16 deferent landmarks on the cephalograms is specified in a separated accompanying file. In the proposed method a leave one out scheme for choosing training and testing sets from the 63 available cephalograms. Leave one out method of testing is statistically acceptable, makes maximum use of the training set and provides a maximum number of tested images. To reduce searching time and finding a first estimation of the landmark location, input images have been scaled to 1/8 their original size. The scaled down image is demonstrated in Fig.2. A detection window with step size of two pixels scans the whole image to find a first estimation of the landmark window. We have defined a specific point for each landmark in the detection window. In training steps this point is located on the landmark location and then the detection window is cropped. The position of this point on the detection window is landmark specific. A general candidate for landmark point can be the central point of the window. This strategy of choosing the landmark point for some landmarks that reside on the boundary of the cephalograms, such as Gnathion, causes the window to pass the boundary of the image. For these landmarks we can choose a point in the window that by residing that window on the landmark over that point whole of the window resides in the image boundaries. For example in Gnathion landmark the window is located somehow the landmark is horizontally centered in the window and is about 32 pixels upper from the bottom of the window at original scale, see Fig.2.c. In training steps, a set of 2 patches with specific dimensions for each landmark (demonstrated in Tabel.2)are sampled randomly from goal free regions in each image, which finally is equal to 2 62 patches. These patches provided the initial negative set. Goal free regions are supposed to be the regions around the landmark window that have overlapping equals to half or less in each dimension with the landmark window. Also a set of 62 patches with the same dimensions in the training step containing the landmark window constructs the positive samples set. In the test steps, tiling the detection window with a dense ( in fact, overlapping) grid of HOG descriptors and using the combined feature vector in a conventional SVM based window classifier gives our region detector chain in each level.

3 4. OUTLINE OF THE PROPOSED METHOD This section provides an overview of our recognition chain. Following steps are performed in a detection evaluation: Scaling down images to 1/8 their original size. Leave one image out and make a train set containing other 62 images in the data set. Extracting HOG features, as it is described in the next sub-section, and train SVM on the positive and negative patched extracted from the train set. Test the system on the left out image and found a first estimation of the landmark position on this scale. Perform all of these steps on all images in data set. At first landmark window detection is started on images with1/8 scale of original images. By scaling down the images, the problem of translation or shift in the image is solved since it guarantees that the distances between image components made shorter and the variety becomes less effective. After extracting the HOG features, whichh is described in the next section in more details, the image is feed to an SVM classifier. The SVM is trained with negative and positive sample patches described in section 3. After training the model we feed the input example to the trained model. In this step HOG features are extracted and the feature vector is constructed for the input window. Then the model classifies the input image as negative or positive sample. We used SVMperf [33, 34], a version of SVM light, as the classifier. This classifier assigns a score to the image as the result of classification. In this research the detection window with the most positive score is chosen as the final location of the landmark window. Then the detection process described above is repeated for each image. In the following sub sections the feature extraction procedure is explained in more details. For evaluating the effectiveness of these feature vectors we used an rbf SVM trained with SVMperf [37, 36]. It s claimed that it trains conventional linear classification SVMs optimizing error rate in time that is linear in the size of the training data through an alternative, but equivalent formulation of the training problem. 4.1 Feature Extraction HOG is a feature set that is based on evaluating well- orientations normalized local histograms of image gradient in a dense grid. In practice the image window is divided to small spatial regions ("cells"). For each cell, a local 1D histogram of gradient directions is accumulated. Then all histogram entries of cells in the windoww are combined to make the final feature vector (Fig.3). It is recommended in[35] that for better invariance to illumination, shadowing and etc. contrast normalize the local responses before using them. This can be done by accumulating a measure of histogram "energy" over somewhat larger spatial regions called ("blocks") and using the results to normalize all the cells in the block. For extracting the HOG features, we used the Basic derivation filters with no smoothing before, as in [35]. c) a) b) d) Figure 2.a) A scaled down sample cephalogram from the dataset, 1/8 of its original scale. b,c,d ) detection windows and their predfined location of landmarks Sella, Gnathion and Incisal The basic derivation filters are specified by: [h x ]=[h y ] t = [1 0-1] (1) Table 2.detection window size for each landmark. All the sizes are in scaled image to 1/8 its original size. Landmark Sella Gnathion Upper Incisor Tip window size 20 x 28 24x20 28 x 28 After that, the next step that is called the fundamental nonlinearity of the descriptor[35]is applied. Each pixel calculates a weighted vote for an edge orientation histogram channel based on the orientation of the gradient element centered on it, and the votes are accumulated into orientation bins over local spatial regions that are called cells. After that block normalization is performed. Reference [35] evaluated a number of different normalization schemes. Most of them are based on grouping cells into larger spatial blocks and contrast normalizing each block separately. The final descriptor is then the vector of all components of the normalized cell responses from all of the blocks in the detection window. In fact, they typically overlap the blocks so that each scalar cell response contributes several components to the final descriptor vector, each normalized with respect to a different block. We used the class of blocks named R-HOG introduced in [35].They are computed in dense grids at a single scale and used as part of a larger code vector that implicitly encodes spatial position relative to the detection window. The square R-HOG's, i.e. 2 2 grids of 4 4 pixels cells each containing 9 orientation bins is employed here. In reference [35] they used such a descriptor as the default for their detector but cell size was 8 8 pixels. As introduced in [35] and [36], it is useful to down-weight pixels near the edges of the block by applying a Gaussian spatial window to each pixel before accumulating orientation votes into cells. The purpose of this Gaussian window is to avoid sudden changes in the descriptor with small changes in the position of the block. Finally, the feature vector is modified by a normalization

4 Orientation Voting Figure 3. HOG feature Extraction Overlapping Blocks method[35,36], to reduce the effects of illumination change. Here L2 norm is used as in[35,36]. 5. EXPERIMENTAL RESULTS We now give details of our implementations for the proposed approach and systematically study the effect of various choices of window size and bin size on recognizer performance. Throughout this section we refer result to our default detector which has the following properties, described below: Gray scale with no image enhancement; [-1, 0, 1] gradient filter with no smoothing; Linear gradient voting into 9 orientation bins in (unsigned gradients) ; 8 8 pixel blocks of four 4 4 pixel cells; Gaussian spatial window with sigma=8 pixel; L2-Hys (Lowe style clipped L2 norm) block normalization; block spacing stride 4 pixels (hence 4 fold coverage of each cell); SVMperf classifier with rbf kernel and C=2;detection window size is landmark specific and is introduced in Table 2. All the experiments are done on the scale that is one eighth of the original size. If the detected landmark location is in a circle of 4 pixel radios around the landmark location in that scale the detection window is categorized as true positive and if the detected landmarks outside this circle are categorized as false negatives. A comparison between effects of variations of different window sizes in the performance of the proposed system is shown in Fig.4. It is demonstrated in Fig.4 that decreasing or increasing the window size decreases the performance within 10 percent at 1% false alarm rate (i.e. increases miss probability about 10 percent for each landmark at 1% false alarm rate). Fig.5 and Fig.6 demonstrate the effect of changing bin size on performance of the proposed system. According to Fig.5,6 increasing the number of orientation bins improves performance significantly up to 9 bins, but makes little difference beyond this. This is for bins spaced over 0 180, i.e. the sign of the gradient is ignored(fig.6).including signed gradients (orientation range 0 360, as in the original SIFT descriptor) decreases the performance, even when the number of bins is also doubled to preserve the original orientation resolution (Fig.6). Table 3 determines some results about the experiments down on different landmarks with the default landmark specific window size. It can be conceived from the results that in the experimented image scale (1/8 of the original image resolution) the proposed method can detect the landmark window within 4 pixel of the actual position of the window in 85 percent of evaluations. This could be a Table 3. Landmark detection rate in different distances to real position (scale 1/8) Landmark Window Detection Rate (%) Within Average Distance 1pix 2 pix 3 pix 4 pix 5 pix Sella pixels Gnathion pixels Incisal Upper Incisor Tip pixels reliable estimation of the landmark location for further post processing. 6. CONCLUSIONS AND FUTURE WORK We have shown that using locally normalized histogram of gradient orientations features similar to SIFT descriptors [36] in a dense overlapping grid gives very good results for landmark window estimation. We studied the influence of some of descriptor parameters and concluded that fine-scale gradients, fine orientation binning, relatively coarse spatial binning, and high-quality local contrast normalization in overlapping descriptor blocks are all important for good performance. The results provided in this paper determine the ability of the system to estimate the location of landmark very well. But in Cephalometry literature the accepted detected landmark must be within 2mm of the real location of the landmark. We are working on extending the method by applying a post processing around the detected landmark position in this paper that can pinpoint the exact position of the landmark on the original image resolution. The method is also based on employing the HOG features and its results will be published elsewhere soon. Another improvement to the system could be using a strategy for choosing SVM output in classification stage. In the proposed paper the highest output was chosen as the best answer but sometimes the highest output causes misclassification. That s because the similarities between the input window and the learned samples are very small. In these situations highest output strategy does not work very well and sometimes results in wrong estimations of landmark location. We are also working on other configurations of the HOG and SVM to make it more effective and reducing the miss classification rate while increasing the detection rate. REFERENCES [1] H. Mohseni and S. Kasaei, "Automatic Localization of Cephalometric Landmarks," 2007 IEEE International Symposium on Signal Processing and Information Technology, 2007, pp [2] W. Yue, D. Yin, C. Li, G. Wang, and T. Xu, "Automated 2-D Cephalometric Analysis on X-ray Images by a Model-Based Approach," IEEE Transactions on Biomedical Engineering, vol. 53,no.8, 2006, pp [3] L.M. Finlay, "Craniometry and Cephalometry: A History Prior to the Advent of Radiography," Journal of Angle Orthodontics, vol. 50, no. 4, 1980, pp [4] T. Rakosi, An Atlas and Manual of Cephalometric Radiology, London: Wolfe Medical, 1982.

5 a)sella b)gnathion Figure 4. Effects of window size variation on performance of system on 3 landmarks. c) Upper Incisor a)sella b)gnathion c) Upper Incisor Figure 5. Effects of bin size variation on performance of system on 3 landmarks. (unsigned bin 0-180) a)sella b)gnathion c) Upper Incisor Figure 6. Effects of bin size variation on performance of system on 3 landmarks. (signed bin 0-360) [5] I. El-feghi, M.A. Sid-ahmed, and M. Ahmadi, "Automatic localization of craniofacial landmarks for Rate assisted cephalometry," Pattern Recognition, vol. 37, no.3, 2004, pp [6] Y.T. Chen, K.S. Cheng, and J.K. Liu, "Improving cephalogram analysis through feature subimageextraction.," IEEE engineering in medicine and biology magazine, vol. 18,no.1, 1999, pp [7] T.J. Hutton, S. Cunningham, and P. Hammond, "An evaluation of active shape models for the automatic identification of cephalometric landmarks," European Journal Of Orthodontics, vol. 22, no.5, 2000, pp [8] R. Kafieh, A. Mehri, and S. Sadri, "Automatic landmark detection in cephalometry using a modified Active Shape Model with sub image matching," International Conference on Machine Vision, ICMV 2007, 2007, pp [9] A.D. Levy-Mandel, A.N. Venetsanopoulos, and J.K. Tsotsos, "Knowledge-Based Landmarking of Cephalograms," Computers and Biomedical Research, vol. 19, no. 3, 1986, pp [10] W. Tong, S.T. Nugent, G.M. Jensen, and D.F. Fay, "An algorithm for locating landmarks on dental X-rays," Engineering in Medicineand Biology Society, IEEE, 1989, pp [11] S. Parthasaraty, S.T. Nugent, P.G. Gregson, and D.F. Fay, "Automatic landmarking of cephalograms,"computers and Biomedical research, vol. 22, no. 3, 1989, pp [12] W. Tong, S.T. Nugent, P.H.Gregson, G.M. Jensen, and D.F. Fay, "Landmarking of cephalograms using a microcomputer system," Computers and Biomedical research, vol. 23, no. 4, 1990, pp [13] D.N. Davis and C.J.Taylor, "An intelligent segmentation system for lateral skull x-ray images," Proceedings of the Fifth Alvey Vision Conference,1989, pp [14] D.B. Forsyth and D.N. Davis, "Assessment of an automated cephalometric analysis system,"theeuropean Journal of Orthodontics, vol. 18, no. 5, 1996, pp [15] D.N. Davis and C.J. Taylor, "An Intelligent Visual Task System For Lateral Skull X-ray Images,"Proceedings of the British Machine Vision Conference, 1990, pp

6 [16] J. Ren,D.Liu, D. Feng, J. Shao, R. Zhao, Y. Liao, Z. Lin. Science, "A Knowledge-based Automatic Cephalometric Analysis Method," Proceedings of the 20th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, [17] D.N. Davis and C.J. Taylor, "Blackboard architecture for automating cephalometric analysis,"informatics for Health and Social Care, vol. 16, no. 2, 1991, pp [18] B. Romaniuk, M. Desvignes, M. Revenu, and M.J. Deshayes, G. Image, F. Caen, "Contour tracking by minimal cost path approach.application to cephalometry,"2004 International Conference on Image Processing, 2004, pp [19] A. Innes, V. Ciesielski, J. Mamutil, and S. John, "Landmark detection for cephalometric radiology images using pulse coupled neural networks," Int. Conf. on Artificial Intelligence, 2002, pp [20] J. Cardillo and M.A. Sid-ahmed, "An image processing system for the automatic extraction of craniofacial landmarks," Nuclear Science Symposium and Medical Imaging Conference,IEEE, vol. 3, 1991, pp [21] D.J. Rudolph, P.M. Sinclair, and J.M.Coggins "Automatic computerized radiographic identification of cephalometric landmarks,"american Journal of Orthodontics and Dentofacial Orthopedics, vol. 113, no. 2, 1998, pp [22] V. Grau, M.C. Juan, C. Monserrat, and C. Knoll, "Automatic localization of cephalometric landmarks," Journal of Biomedical Informatics, vol. 34, no. 3, 2001, pp [23] Y. Chen, K. Cheng, and J. Liul, "Feature subimage extraction for cephalogram landmarking," Proceedings of the 20th Annual International Conference of the IEEE Engineering in Medicine and Biology Society., vol. 20, 1998, pp [24] S. Chakrabartty, M. Yagi, T. Shibata, and G. Cauwenberghs, Robust cephalometric landmark identification using support vector machines," 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003.Proceedings. (ICASSP '03)., pp. II [25] S. Sanei, P.Sanei, M.Zahabsaniei, "Fuzzy detection of craniofacial landmarks," 6th International Conference on Image Processing and its Applications, 1997, pp [26] W. Yue, D. Yin, C.H. Li, G. Wang,andT.Xu, "Locating Large-Scale Craniofacial Feature Points on X-ray Images for Automated Cephalometric Analysis," IEEE, Los Alamitos, [27] I. El-Fegh, Y. Alginahi, M.A.Sid-Ahmed and M. Ahmadi, "Craniofacial Landmarks Extraction by Partial Least Squares Regression,"Proceedings of the 2004 International Symposium on Circuits and Systems, ISCAS'04, [28] D. Giordano, R. Leonardi, F. Maiorana, G. Cristaldi, M.L. Distefano, I. Informatica, U. Catania, and V.A. Doria, "Automatic Landmarking of Cephalograms by Cellular Neural Networks," Artificial Intelligence in Medicine, 2005, pp [29] R. Leonardi, D. Giordano, and F. Maiorana, "An evaluation of cellular neural networks for the automatic identification of cephalometric landmarks on digital images," Journal of biomedicine & biotechnology, vol. 2009, [30] R. Kafieh, S. Sadri, A. Mehri, and H. Raji, "Using a Combination of Model Based and Intelligent methods in Automatic Landmark Detection in Cephalometry," Innovations in Information Technologies (IIT), 2007, pp [31] I. El-Fegh, M. Galhood, M. Sid-Ahmed, and M. Ahmadi, "Automated 2-D cephalometric analysis of X-ray by image registration approach based on least square approximation.," Annual International Conference of the IEEE Engineering in Medicine and Biology Society., vol. 2008, 2008, pp [32] R. Kafieh, S. Sadri, A. Mehri, and H. Raji, "Discrimination of Bony Structures in Cephalograms for Automatic Landmark Detection," CSICC, 2008, pp [33] T. Joachims, "A Support Vector Method for Multivariate Performance Measures,"Proceedings of the 22nd international conference on Machine learning, 2005, pp [34] T. Joachims, "Training Linear SVMs in Linear Time,"Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, 2006, pp [35] N. Dalal and B. Triggs, "Histograms of oriented gradients for human detection," IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005.CVPR 2005, vol. 1, 2005, pp [36] D.G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," International Journal of Computer Vision, vol. 60, no. 2, 2004, pp

Automatic Landmark Detection in Cephalometry Using a Modified Active Shape Model with Sub Image Matching

Automatic Landmark Detection in Cephalometry Using a Modified Active Shape Model with Sub Image Matching Automatic Landmark Detection in Cephalometry Using a Modified Active Shape Model with Sub Image Matching Rahele Kafieh, Alireza mehri Department of biomedical engineering Isfahahan University of medicine

More information

Available online at ScienceDirect. Procedia Computer Science 37 (2014 )

Available online at   ScienceDirect. Procedia Computer Science 37 (2014 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 37 (2014 ) 293 300 The 4th International Conference on Current and Future Trends of Information and Communication Technologies

More information

Automatic Cephalometric Analysis

Automatic Cephalometric Analysis Review Article Automatic Cephalometric Analysis A Systematic Review Rosalia Leonardi a ; Daniela Giordano b ; Francesco Maiorana c ; Concetto Spampinato d ABSTRACT Objective: To describe the techniques

More information

Human Motion Detection and Tracking for Video Surveillance

Human Motion Detection and Tracking for Video Surveillance Human Motion Detection and Tracking for Video Surveillance Prithviraj Banerjee and Somnath Sengupta Department of Electronics and Electrical Communication Engineering Indian Institute of Technology, Kharagpur,

More information

Epithelial rosette detection in microscopic images

Epithelial rosette detection in microscopic images Epithelial rosette detection in microscopic images Kun Liu,3, Sandra Ernst 2,3, Virginie Lecaudey 2,3 and Olaf Ronneberger,3 Department of Computer Science 2 Department of Developmental Biology 3 BIOSS

More information

Automatic Land Marking on Lateral Cephalogram Using Single Fixed View Appearance Model for Gender Identification

Automatic Land Marking on Lateral Cephalogram Using Single Fixed View Appearance Model for Gender Identification www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 2. Issue 10 October 2013 Page No. 3075-3080 Automatic Land Marking on Lateral Cephalogram Using Single Fixed

More information

Classification of objects from Video Data (Group 30)

Classification of objects from Video Data (Group 30) Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time

More information

Mobile Human Detection Systems based on Sliding Windows Approach-A Review

Mobile Human Detection Systems based on Sliding Windows Approach-A Review Mobile Human Detection Systems based on Sliding Windows Approach-A Review Seminar: Mobile Human detection systems Njieutcheu Tassi cedrique Rovile Department of Computer Engineering University of Heidelberg

More information

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION Panca Mudjirahardjo, Rahmadwati, Nanang Sulistiyanto and R. Arief Setyawan Department of Electrical Engineering, Faculty of

More information

Car Detecting Method using high Resolution images

Car Detecting Method using high Resolution images Car Detecting Method using high Resolution images Swapnil R. Dhawad Department of Electronics and Telecommunication Engineering JSPM s Rajarshi Shahu College of Engineering, Savitribai Phule Pune University,

More information

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia Application Object Detection Using Histogram of Oriented Gradient For Artificial Intelegence System Module of Nao Robot (Control System Laboratory (LSKK) Bandung Institute of Technology) A K Saputra 1.,

More information

A Novel Extreme Point Selection Algorithm in SIFT

A Novel Extreme Point Selection Algorithm in SIFT A Novel Extreme Point Selection Algorithm in SIFT Ding Zuchun School of Electronic and Communication, South China University of Technolog Guangzhou, China zucding@gmail.com Abstract. This paper proposes

More information

Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds

Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds 9 1th International Conference on Document Analysis and Recognition Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds Weihan Sun, Koichi Kise Graduate School

More information

Histogram of Oriented Gradients for Human Detection

Histogram of Oriented Gradients for Human Detection Histogram of Oriented Gradients for Human Detection Article by Navneet Dalal and Bill Triggs All images in presentation is taken from article Presentation by Inge Edward Halsaunet Introduction What: Detect

More information

An Implementation on Histogram of Oriented Gradients for Human Detection

An Implementation on Histogram of Oriented Gradients for Human Detection An Implementation on Histogram of Oriented Gradients for Human Detection Cansın Yıldız Dept. of Computer Engineering Bilkent University Ankara,Turkey cansin@cs.bilkent.edu.tr Abstract I implemented a Histogram

More information

Fully automatic cephalometric evaluation using Random Forest regression-voting

Fully automatic cephalometric evaluation using Random Forest regression-voting Fully automatic cephalometric evaluation using Random Forest regression-voting Claudia Lindner and Tim F. Cootes Centre for Imaging Sciences, University of Manchester, UK Abstract. Cephalometric analysis

More information

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality

More information

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah Human detection using histogram of oriented gradients Srikumar Ramalingam School of Computing University of Utah Reference Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

More information

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking Feature descriptors Alain Pagani Prof. Didier Stricker Computer Vision: Object and People Tracking 1 Overview Previous lectures: Feature extraction Today: Gradiant/edge Points (Kanade-Tomasi + Harris)

More information

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm Group 1: Mina A. Makar Stanford University mamakar@stanford.edu Abstract In this report, we investigate the application of the Scale-Invariant

More information

MULTI-VIEW FACE DETECTION AND POSE ESTIMATION EMPLOYING EDGE-BASED FEATURE VECTORS

MULTI-VIEW FACE DETECTION AND POSE ESTIMATION EMPLOYING EDGE-BASED FEATURE VECTORS MULTI-VIEW FACE DETECTION AND POSE ESTIMATION EMPLOYING EDGE-BASED FEATURE VECTORS Daisuke Moriya, Yasufumi Suzuki, and Tadashi Shibata Masakazu Yagi and Kenji Takada Department of Frontier Informatics,

More information

CS229: Action Recognition in Tennis

CS229: Action Recognition in Tennis CS229: Action Recognition in Tennis Aman Sikka Stanford University Stanford, CA 94305 Rajbir Kataria Stanford University Stanford, CA 94305 asikka@stanford.edu rkataria@stanford.edu 1. Motivation As active

More information

Face Detection using Hierarchical SVM

Face Detection using Hierarchical SVM Face Detection using Hierarchical SVM ECE 795 Pattern Recognition Christos Kyrkou Fall Semester 2010 1. Introduction Face detection in video is the process of detecting and classifying small images extracted

More information

Histograms of Oriented Gradients for Human Detection p. 1/1

Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs INRIA Rhône-Alpes Grenoble, France Funding: acemedia, LAVA,

More information

Semi-Automatic Detection of Cervical Vertebrae in X-ray Images Using Generalized Hough Transform

Semi-Automatic Detection of Cervical Vertebrae in X-ray Images Using Generalized Hough Transform Semi-Automatic Detection of Cervical Vertebrae in X-ray Images Using Generalized Hough Transform Mohamed Amine LARHMAM, Saïd MAHMOUDI and Mohammed BENJELLOUN Faculty of Engineering, University of Mons,

More information

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Tetsu Matsukawa Koji Suzuki Takio Kurita :University of Tsukuba :National Institute of Advanced Industrial Science and

More information

Object detection using non-redundant local Binary Patterns

Object detection using non-redundant local Binary Patterns University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Object detection using non-redundant local Binary Patterns Duc Thanh

More information

A Comparison of SIFT, PCA-SIFT and SURF

A Comparison of SIFT, PCA-SIFT and SURF A Comparison of SIFT, PCA-SIFT and SURF Luo Juan Computer Graphics Lab, Chonbuk National University, Jeonju 561-756, South Korea qiuhehappy@hotmail.com Oubong Gwun Computer Graphics Lab, Chonbuk National

More information

HISTOGRAMS OF ORIENTATIO N GRADIENTS

HISTOGRAMS OF ORIENTATIO N GRADIENTS HISTOGRAMS OF ORIENTATIO N GRADIENTS Histograms of Orientation Gradients Objective: object recognition Basic idea Local shape information often well described by the distribution of intensity gradients

More information

2D Image Processing Feature Descriptors

2D Image Processing Feature Descriptors 2D Image Processing Feature Descriptors Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Overview

More information

Lecture 10 Detectors and descriptors

Lecture 10 Detectors and descriptors Lecture 10 Detectors and descriptors Properties of detectors Edge detectors Harris DoG Properties of detectors SIFT Shape context Silvio Savarese Lecture 10-26-Feb-14 From the 3D to 2D & vice versa P =

More information

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image SURF CSED441:Introduction to Computer Vision (2015S) Lecture6: SURF and HOG Bohyung Han CSE, POSTECH bhhan@postech.ac.kr Speed Up Robust Features (SURF) Simplified version of SIFT Faster computation but

More information

Action Recognition & Categories via Spatial-Temporal Features

Action Recognition & Categories via Spatial-Temporal Features Action Recognition & Categories via Spatial-Temporal Features 华俊豪, 11331007 huajh7@gmail.com 2014/4/9 Talk at Image & Video Analysis taught by Huimin Yu. Outline Introduction Frameworks Feature extraction

More information

Chapter 3 Image Registration. Chapter 3 Image Registration

Chapter 3 Image Registration. Chapter 3 Image Registration Chapter 3 Image Registration Distributed Algorithms for Introduction (1) Definition: Image Registration Input: 2 images of the same scene but taken from different perspectives Goal: Identify transformation

More information

A NEW FEATURE BASED IMAGE REGISTRATION ALGORITHM INTRODUCTION

A NEW FEATURE BASED IMAGE REGISTRATION ALGORITHM INTRODUCTION A NEW FEATURE BASED IMAGE REGISTRATION ALGORITHM Karthik Krish Stuart Heinrich Wesley E. Snyder Halil Cakir Siamak Khorram North Carolina State University Raleigh, 27695 kkrish@ncsu.edu sbheinri@ncsu.edu

More information

Object Tracking using HOG and SVM

Object Tracking using HOG and SVM Object Tracking using HOG and SVM Siji Joseph #1, Arun Pradeep #2 Electronics and Communication Engineering Axis College of Engineering and Technology, Ambanoly, Thrissur, India Abstract Object detection

More information

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,

More information

Histograms of Oriented Gradients

Histograms of Oriented Gradients Histograms of Oriented Gradients Carlo Tomasi September 18, 2017 A useful question to ask of an image is whether it contains one or more instances of a certain object: a person, a face, a car, and so forth.

More information

III. VERVIEW OF THE METHODS

III. VERVIEW OF THE METHODS An Analytical Study of SIFT and SURF in Image Registration Vivek Kumar Gupta, Kanchan Cecil Department of Electronics & Telecommunication, Jabalpur engineering college, Jabalpur, India comparing the distance

More information

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN 2016 International Conference on Artificial Intelligence: Techniques and Applications (AITA 2016) ISBN: 978-1-60595-389-2 Face Recognition Using Vector Quantization Histogram and Support Vector Machine

More information

A Rapid Automatic Image Registration Method Based on Improved SIFT

A Rapid Automatic Image Registration Method Based on Improved SIFT Available online at www.sciencedirect.com Procedia Environmental Sciences 11 (2011) 85 91 A Rapid Automatic Image Registration Method Based on Improved SIFT Zhu Hongbo, Xu Xuejun, Wang Jing, Chen Xuesong,

More information

Three possible approaches may be used to

Three possible approaches may be used to ORIGINAL ARTICLE Automatic computerized radiographic identification of cephalometric landmarks D.J. Rudolph, DDS, PhD, a P.M. Sinclair, DDS, MSD, b and J.M. Coggins, PhD c Los Angeles, Calif., and Chapel

More information

Sketchable Histograms of Oriented Gradients for Object Detection

Sketchable Histograms of Oriented Gradients for Object Detection Sketchable Histograms of Oriented Gradients for Object Detection No Author Given No Institute Given Abstract. In this paper we investigate a new representation approach for visual object recognition. The

More information

Ensemble of Bayesian Filters for Loop Closure Detection

Ensemble of Bayesian Filters for Loop Closure Detection Ensemble of Bayesian Filters for Loop Closure Detection Mohammad Omar Salameh, Azizi Abdullah, Shahnorbanun Sahran Pattern Recognition Research Group Center for Artificial Intelligence Faculty of Information

More information

Evaluation of Local Space-time Descriptors based on Cuboid Detector in Human Action Recognition

Evaluation of Local Space-time Descriptors based on Cuboid Detector in Human Action Recognition International Journal of Innovation and Applied Studies ISSN 2028-9324 Vol. 9 No. 4 Dec. 2014, pp. 1708-1717 2014 Innovative Space of Scientific Research Journals http://www.ijias.issr-journals.org/ Evaluation

More information

Tensor Decomposition of Dense SIFT Descriptors in Object Recognition

Tensor Decomposition of Dense SIFT Descriptors in Object Recognition Tensor Decomposition of Dense SIFT Descriptors in Object Recognition Tan Vo 1 and Dat Tran 1 and Wanli Ma 1 1- Faculty of Education, Science, Technology and Mathematics University of Canberra, Australia

More information

Selective Search for Object Recognition

Selective Search for Object Recognition Selective Search for Object Recognition Uijlings et al. Schuyler Smith Overview Introduction Object Recognition Selective Search Similarity Metrics Results Object Recognition Kitten Goal: Problem: Where

More information

IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES

IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES Pin-Syuan Huang, Jing-Yi Tsai, Yu-Fang Wang, and Chun-Yi Tsai Department of Computer Science and Information Engineering, National Taitung University,

More information

A New Algorithm for Shape Detection

A New Algorithm for Shape Detection IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 19, Issue 3, Ver. I (May.-June. 2017), PP 71-76 www.iosrjournals.org A New Algorithm for Shape Detection Hewa

More information

Automated Canvas Analysis for Painting Conservation. By Brendan Tobin

Automated Canvas Analysis for Painting Conservation. By Brendan Tobin Automated Canvas Analysis for Painting Conservation By Brendan Tobin 1. Motivation Distinctive variations in the spacings between threads in a painting's canvas can be used to show that two sections of

More information

CS 231A Computer Vision (Fall 2012) Problem Set 3

CS 231A Computer Vision (Fall 2012) Problem Set 3 CS 231A Computer Vision (Fall 2012) Problem Set 3 Due: Nov. 13 th, 2012 (2:15pm) 1 Probabilistic Recursion for Tracking (20 points) In this problem you will derive a method for tracking a point of interest

More information

Computer-Aided Diagnosis in Abdominal and Cardiac Radiology Using Neural Networks

Computer-Aided Diagnosis in Abdominal and Cardiac Radiology Using Neural Networks Computer-Aided Diagnosis in Abdominal and Cardiac Radiology Using Neural Networks Du-Yih Tsai, Masaru Sekiya and Yongbum Lee Department of Radiological Technology, School of Health Sciences, Faculty of

More information

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes 2009 10th International Conference on Document Analysis and Recognition Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes Alireza Alaei

More information

Pedestrian Detection and Tracking in Images and Videos

Pedestrian Detection and Tracking in Images and Videos Pedestrian Detection and Tracking in Images and Videos Azar Fazel Stanford University azarf@stanford.edu Viet Vo Stanford University vtvo@stanford.edu Abstract The increase in population density and accessibility

More information

Deformable Part Models

Deformable Part Models CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones

More information

PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE

PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE Hongyu Liang, Jinchen Wu, and Kaiqi Huang National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Science

More information

Human detections using Beagle board-xm

Human detections using Beagle board-xm Human detections using Beagle board-xm CHANDAN KUMAR 1 V. AJAY KUMAR 2 R. MURALI 3 1 (M. TECH STUDENT, EMBEDDED SYSTEMS, DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING, VIJAYA KRISHNA INSTITUTE

More information

An Intelligent Visual Task System For Lateral Skull X-ray Images

An Intelligent Visual Task System For Lateral Skull X-ray Images An Intelligent Visual Task System For Lateral Skull X-ray Images D.N. Davis and C.J. Taylor Wolfson Image Analysis Unit Department of Medical Biophysics University of Manchester Oxford Road MANCHESTER

More information

Outline 7/2/201011/6/

Outline 7/2/201011/6/ Outline Pattern recognition in computer vision Background on the development of SIFT SIFT algorithm and some of its variations Computational considerations (SURF) Potential improvement Summary 01 2 Pattern

More information

Video annotation based on adaptive annular spatial partition scheme

Video annotation based on adaptive annular spatial partition scheme Video annotation based on adaptive annular spatial partition scheme Guiguang Ding a), Lu Zhang, and Xiaoxu Li Key Laboratory for Information System Security, Ministry of Education, Tsinghua National Laboratory

More information

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers A. Salhi, B. Minaoui, M. Fakir, H. Chakib, H. Grimech Faculty of science and Technology Sultan Moulay Slimane

More information

Object Recognition Algorithms for Computer Vision System: A Survey

Object Recognition Algorithms for Computer Vision System: A Survey Volume 117 No. 21 2017, 69-74 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Object Recognition Algorithms for Computer Vision System: A Survey Anu

More information

SCALE INVARIANT FEATURE TRANSFORM (SIFT)

SCALE INVARIANT FEATURE TRANSFORM (SIFT) 1 SCALE INVARIANT FEATURE TRANSFORM (SIFT) OUTLINE SIFT Background SIFT Extraction Application in Content Based Image Search Conclusion 2 SIFT BACKGROUND Scale-invariant feature transform SIFT: to detect

More information

Robust PDF Table Locator

Robust PDF Table Locator Robust PDF Table Locator December 17, 2016 1 Introduction Data scientists rely on an abundance of tabular data stored in easy-to-machine-read formats like.csv files. Unfortunately, most government records

More information

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011 Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition

More information

Comment Extraction from Blog Posts and Its Applications to Opinion Mining

Comment Extraction from Blog Posts and Its Applications to Opinion Mining Comment Extraction from Blog Posts and Its Applications to Opinion Mining Huan-An Kao, Hsin-Hsi Chen Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan

More information

Local Features: Detection, Description & Matching

Local Features: Detection, Description & Matching Local Features: Detection, Description & Matching Lecture 08 Computer Vision Material Citations Dr George Stockman Professor Emeritus, Michigan State University Dr David Lowe Professor, University of British

More information

Tri-modal Human Body Segmentation

Tri-modal Human Body Segmentation Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4

More information

N.Priya. Keywords Compass mask, Threshold, Morphological Operators, Statistical Measures, Text extraction

N.Priya. Keywords Compass mask, Threshold, Morphological Operators, Statistical Measures, Text extraction Volume, Issue 8, August ISSN: 77 8X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Combined Edge-Based Text

More information

Discriminative classifiers for image recognition

Discriminative classifiers for image recognition Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study

More information

Part-based and local feature models for generic object recognition

Part-based and local feature models for generic object recognition Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza

More information

CS 223B Computer Vision Problem Set 3

CS 223B Computer Vision Problem Set 3 CS 223B Computer Vision Problem Set 3 Due: Feb. 22 nd, 2011 1 Probabilistic Recursion for Tracking In this problem you will derive a method for tracking a point of interest through a sequence of images.

More information

Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction

Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction Chieh-Chih Wang and Ko-Chih Wang Department of Computer Science and Information Engineering Graduate Institute of Networking

More information

Co-occurrence Histograms of Oriented Gradients for Pedestrian Detection

Co-occurrence Histograms of Oriented Gradients for Pedestrian Detection Co-occurrence Histograms of Oriented Gradients for Pedestrian Detection Tomoki Watanabe, Satoshi Ito, and Kentaro Yokoi Corporate Research and Development Center, TOSHIBA Corporation, 1, Komukai-Toshiba-cho,

More information

Image Processing. Image Features

Image Processing. Image Features Image Processing Image Features Preliminaries 2 What are Image Features? Anything. What they are used for? Some statements about image fragments (patches) recognition Search for similar patches matching

More information

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1 Feature Detection Raul Queiroz Feitosa 3/30/2017 Feature Detection 1 Objetive This chapter discusses the correspondence problem and presents approaches to solve it. 3/30/2017 Feature Detection 2 Outline

More information

Developing Open Source code for Pyramidal Histogram Feature Sets

Developing Open Source code for Pyramidal Histogram Feature Sets Developing Open Source code for Pyramidal Histogram Feature Sets BTech Project Report by Subodh Misra subodhm@iitk.ac.in Y648 Guide: Prof. Amitabha Mukerjee Dept of Computer Science and Engineering IIT

More information

Edge Detection for Dental X-ray Image Segmentation using Neural Network approach

Edge Detection for Dental X-ray Image Segmentation using Neural Network approach Volume 1, No. 7, September 2012 ISSN 2278-1080 The International Journal of Computer Science & Applications (TIJCSA) RESEARCH PAPER Available Online at http://www.journalofcomputerscience.com/ Edge Detection

More information

Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions

Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions Akitsugu Noguchi and Keiji Yanai Department of Computer Science, The University of Electro-Communications, 1-5-1 Chofugaoka,

More information

Human-Robot Interaction

Human-Robot Interaction Human-Robot Interaction Elective in Artificial Intelligence Lecture 6 Visual Perception Luca Iocchi DIAG, Sapienza University of Rome, Italy With contributions from D. D. Bloisi and A. Youssef Visual Perception

More information

Scale Invariant Feature Transform

Scale Invariant Feature Transform Scale Invariant Feature Transform Why do we care about matching features? Camera calibration Stereo Tracking/SFM Image moiaicing Object/activity Recognition Objection representation and recognition Image

More information

Object and Action Detection from a Single Example

Object and Action Detection from a Single Example Object and Action Detection from a Single Example Peyman Milanfar* EE Department University of California, Santa Cruz *Joint work with Hae Jong Seo AFOSR Program Review, June 4-5, 29 Take a look at this:

More information

COMBINING NEURAL NETWORKS FOR SKIN DETECTION

COMBINING NEURAL NETWORKS FOR SKIN DETECTION COMBINING NEURAL NETWORKS FOR SKIN DETECTION Chelsia Amy Doukim 1, Jamal Ahmad Dargham 1, Ali Chekima 1 and Sigeru Omatu 2 1 School of Engineering and Information Technology, Universiti Malaysia Sabah,

More information

Face Recognition using SURF Features and SVM Classifier

Face Recognition using SURF Features and SVM Classifier International Journal of Electronics Engineering Research. ISSN 0975-6450 Volume 8, Number 1 (016) pp. 1-8 Research India Publications http://www.ripublication.com Face Recognition using SURF Features

More information

Generic Face Alignment Using an Improved Active Shape Model

Generic Face Alignment Using an Improved Active Shape Model Generic Face Alignment Using an Improved Active Shape Model Liting Wang, Xiaoqing Ding, Chi Fang Electronic Engineering Department, Tsinghua University, Beijing, China {wanglt, dxq, fangchi} @ocrserv.ee.tsinghua.edu.cn

More information

SIFT - scale-invariant feature transform Konrad Schindler

SIFT - scale-invariant feature transform Konrad Schindler SIFT - scale-invariant feature transform Konrad Schindler Institute of Geodesy and Photogrammetry Invariant interest points Goal match points between images with very different scale, orientation, projective

More information

Development in Object Detection. Junyuan Lin May 4th

Development in Object Detection. Junyuan Lin May 4th Development in Object Detection Junyuan Lin May 4th Line of Research [1] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection, CVPR 2005. HOG Feature template [2] P. Felzenszwalb,

More information

Final Project Face Detection and Recognition

Final Project Face Detection and Recognition Final Project Face Detection and Recognition Submission Guidelines: 1. Follow the guidelines detailed in the course website and information page.. Submission in pairs is allowed for all students registered

More information

CS 4495 Computer Vision A. Bobick. CS 4495 Computer Vision. Features 2 SIFT descriptor. Aaron Bobick School of Interactive Computing

CS 4495 Computer Vision A. Bobick. CS 4495 Computer Vision. Features 2 SIFT descriptor. Aaron Bobick School of Interactive Computing CS 4495 Computer Vision Features 2 SIFT descriptor Aaron Bobick School of Interactive Computing Administrivia PS 3: Out due Oct 6 th. Features recap: Goal is to find corresponding locations in two images.

More information

Component-based Face Recognition with 3D Morphable Models

Component-based Face Recognition with 3D Morphable Models Component-based Face Recognition with 3D Morphable Models B. Weyrauch J. Huang benjamin.weyrauch@vitronic.com jenniferhuang@alum.mit.edu Center for Biological and Center for Biological and Computational

More information

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213) Recognition of Animal Skin Texture Attributes in the Wild Amey Dharwadker (aap2174) Kai Zhang (kz2213) Motivation Patterns and textures are have an important role in object description and understanding

More information

arxiv: v2 [cs.cv] 17 Nov 2014

arxiv: v2 [cs.cv] 17 Nov 2014 WINDOW-BASED DESCRIPTORS FOR ARABIC HANDWRITTEN ALPHABET RECOGNITION: A COMPARATIVE STUDY ON A NOVEL DATASET Marwan Torki, Mohamed E. Hussein, Ahmed Elsallamy, Mahmoud Fayyaz, Shehab Yaser Computer and

More information

Sparse coding for image classification

Sparse coding for image classification Sparse coding for image classification Columbia University Electrical Engineering: Kun Rong(kr2496@columbia.edu) Yongzhou Xiang(yx2211@columbia.edu) Yin Cui(yc2776@columbia.edu) Outline Background Introduction

More information

Multiview Pedestrian Detection Based on Online Support Vector Machine Using Convex Hull

Multiview Pedestrian Detection Based on Online Support Vector Machine Using Convex Hull Multiview Pedestrian Detection Based on Online Support Vector Machine Using Convex Hull Revathi M K 1, Ramya K P 2, Sona G 3 1, 2, 3 Information Technology, Anna University, Dr.Sivanthi Aditanar College

More information

Research Article International Journals of Advanced Research in Computer Science and Software Engineering ISSN: X (Volume-7, Issue-7)

Research Article International Journals of Advanced Research in Computer Science and Software Engineering ISSN: X (Volume-7, Issue-7) International Journals of Advanced Research in Computer Science and Software Engineering ISSN: 2277-128X (Volume-7, Issue-7) Research Article July 2017 Technique for Text Region Detection in Image Processing

More information

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Si Chen The George Washington University sichen@gwmail.gwu.edu Meera Hahn Emory University mhahn7@emory.edu Mentor: Afshin

More information

Beyond Bags of Features

Beyond Bags of Features : for Recognizing Natural Scene Categories Matching and Modeling Seminar Instructed by Prof. Haim J. Wolfson School of Computer Science Tel Aviv University December 9 th, 2015

More information

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped

More information

C. Premsai 1, Prof. A. Kavya 2 School of Computer Science, School of Computer Science Engineering, Engineering VIT Chennai, VIT Chennai

C. Premsai 1, Prof. A. Kavya 2 School of Computer Science, School of Computer Science Engineering, Engineering VIT Chennai, VIT Chennai Traffic Sign Detection Via Graph-Based Ranking and Segmentation Algorithm C. Premsai 1, Prof. A. Kavya 2 School of Computer Science, School of Computer Science Engineering, Engineering VIT Chennai, VIT

More information