Scenery Character Detection with Environmental Context
|
|
- Elizabeth Manning
- 5 years ago
- Views:
Transcription
1 cenery haracter Detection with Environmental ontext Yasuhiro Kunishige, Feng Yaokai, and eiichi Uchida Kyushu University, Fukuoka, 9-9, Japan Abstract For scenery detection, we introduce environmental context, which is modeled by scene components, such as sky and building. Environmental context is expected to regulate the probability of existence at a specific region in a scenery image. For example, if a region looks like a part of a building, the region has a higher probability than another region like a part of the sky. In this paper, environmental context is represented by state-of-the-art texture and color s and utilized in two different ways. Through experimental results, it was clearly shown that the environmental context has an effect of improving detection accuracy. Keywords-scenery detection, environmental context,, I. INTRODUTION The main contribution of this paper is to show the usefulness of environmental context, which is modeled by scene components, such as sky, ground, building, etc., for scenery detection. onsider a small region (a block or a connected component ()) in a scenery image. If the region (and its surroundings) look like a part of the sky, we can say that a existence probability at this region is low. This is because s never exist in the sky. This simple example indicates that environmental context is useful for detecting scenery s. Past trials on scenery detection (e.g., [] []) have never utilized environmental context. They tried to employ various s just for representing areas. Unfortunately, the past trials suffer from many false detections due to complicated non- areas. One naive strategy to suppress the false detections is to apply more strict conditions to the detectors; however, this strategy will result in the situation that various s (such as decorated s) are overlooked. To summarize, there is a severe trade-off when we try to detect s only by s. Our trials will utilize not only s for representing s but also s for representing environmental context, in order to relax the trade-off. Among several methodologies of utilizing environmental context in scenery detection, we will examine two methods, later called Method and Method. The former is a rather straightforward method where environmental context s are concatenated with s and then a /non- discrimination is performed. The latter is a more elaborated method where the likelihoods of all scene component categories (such as sky ) are first calculated and the discrimination is performed using the likelihoods as s. Experimental results will show the better performance of those methods over the simple method without environmental context, which is called Method later. The remainder of this paper is organized as follows. After a brief description of the preprocessing step in ection II, the extraction of not only s but also environmental s is described in ection III. Then, the three /non- discrimination methods using those s are described in ection IV. In ection V, the performance of detection are evaluated qualitatively and quantitatively through experiments using real ground-truthed scenery images. Finally, ection VI draws our conclusion and future work. II. PREPROEING In this paper, scenery detection is formulated as a /non- discrimination problem at each connected component (). Accordingly, the first-step is the decomposition of the entire image into non-overlapping s. is a better unit for our detection problem than a fixed-size block because the size of a (and environmental context) varies largely in the target scenery image and any fixed-size block cannot deal with this large variation. For the decomposition, trinarization by Niblack s method [] is used. Its two thresholds are determined at each pixel by local mean and standard deviation values. Precisely speaking, we use an improved version [], [] of the Niblack s original method. This improvement is effective to increase the robustness against noises and shadings around s. Note that very small s (less than pixel size) are always treated as non-s because they are unreadable even if they are actually s. III. FEATURE EXTRATION A. Extraction of haracter Features Many researcher have proposed their own s which are expected to be suitable for evaluating the likelihood that a target (or block, or pixel) is a. In this paper, by referring very recent trials [], [], the following twelve s (-) are extracted for each : the ratio between areas of the and the entire image, the ratio between width of the and the entire image, the aspect ratio of the, contour roughness, the number of holes, the ratio between area and squared contour length of the. the ratio between areas of the and its bounding
2 method (dim) scenery image trinarization connected component Figure. (dim) environ. context (99dim) extraction +envion. method (dim) method score (dim) discrimination Overview of the proposed method. There are three different methods to be compared. detection result box, mean of stroke width, standard deviation of stroke width, the ratio between areas of a dilated and the entire image, the ratio between contour lengths of original and the dilated s, and edge contrast. B. Extraction of Environmental ontext Features Again, the main purpose is to introduce environmental context into the detection problem. For this purpose, we introduce 99 environmental context s of each. Among them, s are texton s [] of the target. Texton s have been recently proposed for describing textures. In [9], [], it was proved that they can provide a good performance of environmental objects, such as sky, road, green, etc. Texton s are based on responses from 9 Gaussian filters, Laplacian of Gaussian (LoG) filter, and Gaussian derivative filters. Then for each response the mean, standard derivation, Kurtosis, and skewness within the are calculated. ( = (9++).) Hereafter, the s from the Gaussian filters, the LoG filter, and the Gaussian derivative filters are denoted as E-, E-, E-, respectively. It is important to note that on the above calculation, -pixel margin is added around the. By this margin, we can incorporate environmental context around the target. The remaining s are employed to represent non-texture istics of environmental context. pecifically, s are L*a*b* color s, s are positional s, and is an area. The color s are extracted as the mean, standard derivation, Kurtosis, and skewness for each of L* (E-), a* (E9-9), and b* (E9-9) values of the target. The positional s (E9,9) are the x and y-coordinates of gravity point of the. The area (E99) is the number of pixels of. Note that the positional s are employed because they can represent an environmental context []; for example, sky will be generally located in an upper part of the image. IV. HARATER/NON-HARATER DIRIMINATION WITH ENVIRONMENTAL ONTEXT A. Random Forest The three discrimination methods (Methods ) commonly employ []. As shown in Fig., Test sample Random subset tree Training data tree K lass lass Voting Freq. Figure. Random. Random sampling Random subset K lass is an ensemble of K classifiers. Each classifier is a decision tree and trained by a subset of whole training data. During the training of each decision tree, effective s are selected automatically. The final classification result is determined by the majority voting of K decision results. In this paper, K and the depth of each tree are fixed at and, respectively. There are several merits of for our detection task. First, because of its sampling strategy and ensemble framework, is robust to outliers in training data. ince there are very special shapes in scenery images, overfitting will become serious without this robustness. econd, its selection mechanism is suitable for the task because we do not know useful s in advance. Third, we can investigate the importance of each on the discrimination by. (In ection V, we will observe the importance of each introduced in ection III.) Other more general merits are its fast computation and multi-class recognition ability. B. Three Discrimination Methodologies Method uses only the s (-). A is trained with those s, that is, in the -dimensional space. If votes for are more than votes for non- among K = votes, the target is detected as a. Detection results by Method will indicate an up-to-date accuracy of scenery detection, where environmental context is not considered. In other words, we can consider Method
3 Table I DETETION RATE. PARENTHEIZED VALUE ARE PIXEL-WIE DETETION RATE. Recall Precision F-value Method... (.) (.) (.) Method... (.) (.) (.9) Method...9 (.) (.9) (.) Figure. Ground-truth of scene component category. better better as one of the best past trials. Method uses not only the s (-) but also the environmental context s (E-99). A is trained in -dimensional space. Detection results by Method will indicate whether the environmental context s are useful. Method is the most elaborated for utilizing environmental contexts more explicitly. The likelihoods of several scene component categories are first calculated and then used as values for the discrimination. Precisely, Method is organized in the following two-step manner. ) As the first step, we calculate seven score s, which are comprised of one score and six scene component score s. The former is derived as the number of votes for in the of Method. The latter six s are derived by another trained to classify the into one of six scene component categories, sky, green, sign(board), ground, building, and others. For example, the sky score is derived as the number of votes for the sky category in the. ) As the second step, the final /non- discrimination is done by a single and the above seven score s. V. EXPERIMENTAL REULT A. Dataset A scenery image dataset were prepared for our experiments. Using Google Image earch TM, top photo images (each of which contains some s and has a size around ) were first collected. The keywords used in the search were park and sign. Those images were then decomposed into a training dataset ( images) and a test dataset ( images). For each image, a ground-truth was attached manually. pecifically, for each, label or non- label was assigned by a human operator. For a which contains both and non- regions, non label was assigned. For Method, the scene component category was also assigned to the. Figure shows an example of the ground-truth of environmental context. Precision[%] Method Method Method Recall[%] (a) -wise evaluation Figure. B. Quantitative Evaluation Method Method Method Recall[%] (b) Pixel-wise evaluation Detection accuracy. Table I and Fig. show the detection accuracy of the three methods. Figure is an RO curve showing the change of recall and precision according to a parameter k ( k K = ) of the ; if the number of votes for (against non- ) exceeds k, the target is detected as a region. The evaluation has been done in two ways; -wise evaluation and pixel-wise evaluation. The results by the -wise evaluation clearly show the superiority of Methods and over Method. Again, Method is one of the best conventional detection methods, where the environmental context is not used. Thus, the superiority confirms that the environmental context is very useful to scenery detection. Methods and have no significant difference by wise evaluation. This fact is interesting at the following point; Method uses -dimensional score s and its final decision totally relies on these seven s. This indicates that score s, the likelihoods of the main scene components, are very compact and sufficient representations of the environment for scenery detection. The pixel-wise evaluation showed that Method is better than Method ; this indicates that Method failed at some larger s. A typical example of this failure will be shown in ection V-. This is also the reason why Method could not outperform Method by the pixel-wise evaluation as shown in Fig. (b). Figure shows an importance for each at the /non- discrimination by. The importance was evaluated by the Out of Bag method, where the to be evaluated is replaced by another
4 (a) Method (b) Method Figure. E 9 E 9 E 99 E 9-9 E 9-9 E - E - E - E - E 9- E - E - E - E - E 9- E - E - E - E - E 9- E - E - E - E - E 9- E - E - 9 Importance (x-) Importance (x-) 9 Importance (x-) (c) Method Importance of each on discrimination by. (a) (b) (c) (d) Original Method Figure. Method Method Detection results. (a) (b) Figure. Visualization of score value. From left to right, original image, and,, sky, green, sign(board), ground, and building s.
5 and the accuracy degradation by the replacement is measured. If a larger degradation is observed, the is more important. Figure (a) shows that no has very low importance and the ratio between areas of the and its bounding box () has the top importance among the s. A more interesting fact is that as shown in Fig. (b), both of the s and the environmental context s are important in Method. This fact supports that Method utilizes the environmental context s for better detection accuracy than Method. Figure (c) shows that Method utilizes all of the seven score s and especially, green () and signboard () s are important for scenery detection.. Qualitative Evaluation Figure shows detection results by Methods on four scenery images. In Fig. (a) and (b), Method produced many false detections around sky and green regions. Those false detections decrease drastically by using environmental context. Figure (a) is a visualization of the score of Fig. (b). The false detections by Method are found in the sky region (Fig. (b)) and the region has high sky values (Fig. (a)). This fact indicates that the high sky score values could suppress the false detection and thus environmental context is useful to improve detection performance. Figure (c) is an example that environmental context could not improve the result. In this scenery image, thin grasses are overlapped on a signboard. Accordingly, most s of s are severely broken and far different from s of normal s. In the present framework, if a of a is broken, it is difficult to recognize it as a correctly, even with environmental context s. Figure (d) is an example showing a difference between Methods and. Both methods could successfully remove false detections in the green region, Method, however, wrongly detect the of a signboard region as a (large). One reason of this false detection is existence of the edges caused by the s on the signboard. Method was badly affected by these edges. In contrast, as shown in Fig. (b), Method could give larger signboard values correctly for this signboard region and finally give a correct detection result as a signboard. VI. ONLUION Environmental context, which is modeled by scene components, was utilized for scenery detection. The environmental context was represented by many s, which are mainly of texture s and color s, and utilized in two different methods. One method utilized the environmental context s directly in /non discrimination. The other utilized them as scores showing the likelihoods with respect to environmental components, such as sky and green. Both methods could show clear superiority over the detection method without environmental s in experimental results. Future work will focus on the following points. (i) A better segmentation method is necessary since it affects detection performance severely. (ii) The environmental context s of s around the target should be utilized. (iii) Another kind of context s, such as geometric context [] (which can estimate flat areas) and visual saliency [], will be useful because areas with higher flatness and/or saliency will have a higher probability of existing s. AKNOWLEDGMENT The authors greatly thank Prof. K. Kise and Prof. M. Iwamura (Osaka Pref. Univ., Japan) and Prof.. Omachi (Tohoku Univ., Japan) for their precious comments. This research was partially supported by JT, RET. REFERENE [] X. hen, and A. L. Yuille, Detecting and reading text in natural scenes, VPR,. []. Lucas, et al., IDAR robust reading competitions: entries, results, and future directions, IJDAR,. [] K. Zhu, F. Qi, R. Jiang, and L. Xu, Automatic detection and segmentation in natural scene images, J. Zhejiang University, cience A, vol., No.,. [] L. Xu, H. Nagayoshi, and H. ako, Kanji detection from complex real scene images based on properties, DA,. [] H. Goto, Redefining the DT-based for scene text detection, IJDAR,. [] W. Niblack, An Introduction to Digital Image Processing, Prentice Hall, 9. [] L. L. Winger, J. A. Robinson, and M. E. Jernigan, Lowcomplexity extraction in low-contrast scene images, IJPRAI,. [] T. Leung, and J. Malik, Representing and recognizing the visual appearance of materials using three-dimensional textons, IJV,. [9] J. hotton, J. Winn,. Rother, and A. riminisi, TextonBoost: Joint appearance, shape and context modeling for multi-class object recognition and segmentation, EV,. [] Y. Kang, K. Yamaguchi, T. Naito, and Y. Ninomiya, Road scene labeling using fm module and D bag of texton, IV-dRR, 9. [] L. Breiman, Random s, Machine Learning,. [] D. Hoiem, A. A. Efros, and M. Hebert, Geometric context from a single image, IV,. [] L. Itti,. Koch, and E. Niebur, A model of saliency-based visual attention for rapid scene analysis, PAMI, 99.
Conspicuous Character Patterns
Conspicuous Character Patterns Seiichi Uchida Kyushu Univ., Japan Ryoji Hattori Masakazu Iwamura Kyushu Univ., Japan Osaka Pref. Univ., Japan Koichi Kise Osaka Pref. Univ., Japan Shinichiro Omachi Tohoku
More informationPart-Based Skew Estimation for Mathematical Expressions
Soma Shiraishi, Yaokai Feng, and Seiichi Uchida shiraishi@human.ait.kyushu-u.ac.jp {fengyk,uchida}@ait.kyushu-u.ac.jp Abstract We propose a novel method for the skew estimation on text images containing
More informationAn ICA based Approach for Complex Color Scene Text Binarization
An ICA based Approach for Complex Color Scene Text Binarization Siddharth Kherada IIIT-Hyderabad, India siddharth.kherada@research.iiit.ac.in Anoop M. Namboodiri IIIT-Hyderabad, India anoop@iiit.ac.in
More informationLinear combinations of simple classifiers for the PASCAL challenge
Linear combinations of simple classifiers for the PASCAL challenge Nik A. Melchior and David Lee 16 721 Advanced Perception The Robotics Institute Carnegie Mellon University Email: melchior@cmu.edu, dlee1@andrew.cmu.edu
More informationSummarization of Egocentric Moving Videos for Generating Walking Route Guidance
Summarization of Egocentric Moving Videos for Generating Walking Route Guidance Masaya Okamoto and Keiji Yanai Department of Informatics, The University of Electro-Communications 1-5-1 Chofugaoka, Chofu-shi,
More informationToward Part-based Document Image Decoding
2012 10th IAPR International Workshop on Document Analysis Systems Toward Part-based Document Image Decoding Wang Song, Seiichi Uchida Kyushu University, Fukuoka, Japan wangsong@human.ait.kyushu-u.ac.jp,
More informationA Hierarchical Visual Saliency Model for Character Detection in Natural Scenes
A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes Renwu Gao 1, Faisal Shafait 2, Seiichi Uchida 3, and Yaokai Feng 3 1 Information Sciene and Electrical Engineering, Kyushu
More informationAUTONOMOUS IMAGE EXTRACTION AND SEGMENTATION OF IMAGE USING UAV S
AUTONOMOUS IMAGE EXTRACTION AND SEGMENTATION OF IMAGE USING UAV S Radha Krishna Rambola, Associate Professor, NMIMS University, India Akash Agrawal, Student at NMIMS University, India ABSTRACT Due to the
More informationAnalysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009
Analysis: TextonBoost and Semantic Texton Forests Daniel Munoz 16-721 Februrary 9, 2009 Papers [shotton-eccv-06] J. Shotton, J. Winn, C. Rother, A. Criminisi, TextonBoost: Joint Appearance, Shape and Context
More informationTexton Clustering for Local Classification using Scene-Context Scale
Texton Clustering for Local Classification using Scene-Context Scale Yousun Kang Tokyo Polytechnic University Atsugi, Kanakawa, Japan 243-0297 Email: yskang@cs.t-kougei.ac.jp Sugimoto Akihiro National
More informationCOSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor
COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality
More informationImage Resizing Based on Gradient Vector Flow Analysis
Image Resizing Based on Gradient Vector Flow Analysis Sebastiano Battiato battiato@dmi.unict.it Giovanni Puglisi puglisi@dmi.unict.it Giovanni Maria Farinella gfarinellao@dmi.unict.it Daniele Ravì rav@dmi.unict.it
More informationA Method of Annotation Extraction from Paper Documents Using Alignment Based on Local Arrangements of Feature Points
A Method of Annotation Extraction from Paper Documents Using Alignment Based on Local Arrangements of Feature Points Tomohiro Nakai, Koichi Kise, Masakazu Iwamura Graduate School of Engineering, Osaka
More informationBagging for One-Class Learning
Bagging for One-Class Learning David Kamm December 13, 2008 1 Introduction Consider the following outlier detection problem: suppose you are given an unlabeled data set and make the assumptions that one
More informationSaliency Detection for Videos Using 3D FFT Local Spectra
Saliency Detection for Videos Using 3D FFT Local Spectra Zhiling Long and Ghassan AlRegib School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA ABSTRACT
More informationTri-modal Human Body Segmentation
Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4
More informationScene Text Detection Using Machine Learning Classifiers
601 Scene Text Detection Using Machine Learning Classifiers Nafla C.N. 1, Sneha K. 2, Divya K.P. 3 1 (Department of CSE, RCET, Akkikkvu, Thrissur) 2 (Department of CSE, RCET, Akkikkvu, Thrissur) 3 (Department
More informationEstimating Human Pose in Images. Navraj Singh December 11, 2009
Estimating Human Pose in Images Navraj Singh December 11, 2009 Introduction This project attempts to improve the performance of an existing method of estimating the pose of humans in still images. Tasks
More informationPEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE
PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE Hongyu Liang, Jinchen Wu, and Kaiqi Huang National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Science
More informationReal-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH
2011 International Conference on Document Analysis and Recognition Real-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH Kazutaka Takeda,
More informationRSRN: Rich Side-output Residual Network for Medial Axis Detection
RSRN: Rich Side-output Residual Network for Medial Axis Detection Chang Liu, Wei Ke, Jianbin Jiao, and Qixiang Ye University of Chinese Academy of Sciences, Beijing, China {liuchang615, kewei11}@mails.ucas.ac.cn,
More informationCombining PGMs and Discriminative Models for Upper Body Pose Detection
Combining PGMs and Discriminative Models for Upper Body Pose Detection Gedas Bertasius May 30, 2014 1 Introduction In this project, I utilized probabilistic graphical models together with discriminative
More informationDESIGN AND EVALUATION OF MACHINE LEARNING MODELS WITH STATISTICAL FEATURES
EXPERIMENTAL WORK PART I CHAPTER 6 DESIGN AND EVALUATION OF MACHINE LEARNING MODELS WITH STATISTICAL FEATURES The evaluation of models built using statistical in conjunction with various feature subset
More informationPatch-Based Image Classification Using Image Epitomes
Patch-Based Image Classification Using Image Epitomes David Andrzejewski CS 766 - Final Project December 19, 2005 Abstract Automatic image classification has many practical applications, including photo
More informationShort Survey on Static Hand Gesture Recognition
Short Survey on Static Hand Gesture Recognition Huu-Hung Huynh University of Science and Technology The University of Danang, Vietnam Duc-Hoang Vo University of Science and Technology The University of
More informationActiveStereoNet: End-to-End Self-Supervised Learning for Active Stereo Systems (Supplementary Materials)
ActiveStereoNet: End-to-End Self-Supervised Learning for Active Stereo Systems (Supplementary Materials) Yinda Zhang 1,2, Sameh Khamis 1, Christoph Rhemann 1, Julien Valentin 1, Adarsh Kowdle 1, Vladimir
More informationNTHU Rain Removal Project
People NTHU Rain Removal Project Networked Video Lab, National Tsing Hua University, Hsinchu, Taiwan Li-Wei Kang, Institute of Information Science, Academia Sinica, Taipei, Taiwan Chia-Wen Lin *, Department
More informationNearest Clustering Algorithm for Satellite Image Classification in Remote Sensing Applications
Nearest Clustering Algorithm for Satellite Image Classification in Remote Sensing Applications Anil K Goswami 1, Swati Sharma 2, Praveen Kumar 3 1 DRDO, New Delhi, India 2 PDM College of Engineering for
More informationAutomatic Photo Popup
Automatic Photo Popup Derek Hoiem Alexei A. Efros Martial Hebert Carnegie Mellon University What Is Automatic Photo Popup Introduction Creating 3D models from images is a complex process Time-consuming
More informationLearning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009
Learning and Inferring Depth from Monocular Images Jiyan Pan April 1, 2009 Traditional ways of inferring depth Binocular disparity Structure from motion Defocus Given a single monocular image, how to infer
More informationInvariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction
Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction Stefan Müller, Gerhard Rigoll, Andreas Kosmala and Denis Mazurenok Department of Computer Science, Faculty of
More informationText Detection in Indoor/Outdoor Scene Images
Text Detection in Indoor/Outdoor Scene Images B. Gatos, I. Pratikakis, K. Kepene and S.J. Perantonis Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center
More informationLogical Templates for Feature Extraction in Fingerprint Images
Logical Templates for Feature Extraction in Fingerprint Images Bir Bhanu, Michael Boshra and Xuejun Tan Center for Research in Intelligent Systems University of Califomia, Riverside, CA 9252 1, USA Email:
More informationCharacter Recognition
Character Recognition 5.1 INTRODUCTION Recognition is one of the important steps in image processing. There are different methods such as Histogram method, Hough transformation, Neural computing approaches
More informationSolution: filter the image, then subsample F 1 F 2. subsample blur subsample. blur
Pyramids Gaussian pre-filtering Solution: filter the image, then subsample blur F 0 subsample blur subsample * F 0 H F 1 F 1 * H F 2 { Gaussian pyramid blur F 0 subsample blur subsample * F 0 H F 1 F 1
More informationDetection of Cut-And-Paste in Document Images
Detection of Cut-And-Paste in Document Images Ankit Gandhi and C. V. Jawahar Center for Visual Information Technology, IIIT-Hyderabad, India Email: ankit.gandhiug08@students.iiit.ac.in, jawahar@iiit.ac.in
More informationRobust Phase-Based Features Extracted From Image By A Binarization Technique
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 4, Ver. IV (Jul.-Aug. 2016), PP 10-14 www.iosrjournals.org Robust Phase-Based Features Extracted From
More informationDet De e t cting abnormal event n s Jaechul Kim
Detecting abnormal events Jaechul Kim Purpose Introduce general methodologies used in abnormality detection Deal with technical details of selected papers Abnormal events Easy to verify, but hard to describe
More informationA novel template matching method for human detection
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 A novel template matching method for human detection Duc Thanh Nguyen
More informationColor Image Segmentation
Color Image Segmentation Yining Deng, B. S. Manjunath and Hyundoo Shin* Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 93106-9560 *Samsung Electronics Inc.
More informationImage Classification Using Wavelet Coefficients in Low-pass Bands
Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August -7, 007 Image Classification Using Wavelet Coefficients in Low-pass Bands Weibao Zou, Member, IEEE, and Yan
More informationFine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes
2009 10th International Conference on Document Analysis and Recognition Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes Alireza Alaei
More informationSegmentation. Bottom up Segmentation Semantic Segmentation
Segmentation Bottom up Segmentation Semantic Segmentation Semantic Labeling of Street Scenes Ground Truth Labels 11 classes, almost all occur simultaneously, large changes in viewpoint, scale sky, road,
More informationHuman Motion Detection and Tracking for Video Surveillance
Human Motion Detection and Tracking for Video Surveillance Prithviraj Banerjee and Somnath Sengupta Department of Electronics and Electrical Communication Engineering Indian Institute of Technology, Kharagpur,
More informationEE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm
EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm Group 1: Mina A. Makar Stanford University mamakar@stanford.edu Abstract In this report, we investigate the application of the Scale-Invariant
More informationN.Priya. Keywords Compass mask, Threshold, Morphological Operators, Statistical Measures, Text extraction
Volume, Issue 8, August ISSN: 77 8X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Combined Edge-Based Text
More informationRESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE
RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE K. Kaviya Selvi 1 and R. S. Sabeenian 2 1 Department of Electronics and Communication Engineering, Communication Systems, Sona College
More informationAN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION
AN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION WILLIAM ROBSON SCHWARTZ University of Maryland, Department of Computer Science College Park, MD, USA, 20742-327, schwartz@cs.umd.edu RICARDO
More informationFabric Defect Detection Based on Computer Vision
Fabric Defect Detection Based on Computer Vision Jing Sun and Zhiyu Zhou College of Information and Electronics, Zhejiang Sci-Tech University, Hangzhou, China {jings531,zhouzhiyu1993}@163.com Abstract.
More informationA Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images
A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images Karthik Ram K.V & Mahantesh K Department of Electronics and Communication Engineering, SJB Institute of Technology, Bangalore,
More informationCursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network
Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Utkarsh Dwivedi 1, Pranjal Rajput 2, Manish Kumar Sharma 3 1UG Scholar, Dept. of CSE, GCET, Greater Noida,
More informationI. INTRODUCTION. Figure-1 Basic block of text analysis
ISSN: 2349-7637 (Online) (RHIMRJ) Research Paper Available online at: www.rhimrj.com Detection and Localization of Texts from Natural Scene Images: A Hybrid Approach Priyanka Muchhadiya Post Graduate Fellow,
More informationHand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction
Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction Chieh-Chih Wang and Ko-Chih Wang Department of Computer Science and Information Engineering Graduate Institute of Networking
More informationMedical images, segmentation and analysis
Medical images, segmentation and analysis ImageLab group http://imagelab.ing.unimo.it Università degli Studi di Modena e Reggio Emilia Medical Images Macroscopic Dermoscopic ELM enhance the features of
More informationClassification. Vladimir Curic. Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University
Classification Vladimir Curic Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University Outline An overview on classification Basics of classification How to choose appropriate
More informationSmall Object Segmentation Based on Visual Saliency in Natural Images
J Inf Process Syst, Vol.9, No.4, pp.592-601, December 2013 http://dx.doi.org/10.3745/jips.2013.9.4.592 pissn 1976-913X eissn 2092-805X Small Object Segmentation Based on Visual Saliency in Natural Images
More informationTraffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers
Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers A. Salhi, B. Minaoui, M. Fakir, H. Chakib, H. Grimech Faculty of science and Technology Sultan Moulay Slimane
More informationA New Algorithm for Shape Detection
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 19, Issue 3, Ver. I (May.-June. 2017), PP 71-76 www.iosrjournals.org A New Algorithm for Shape Detection Hewa
More informationSelf Lane Assignment Using Smart Mobile Camera For Intelligent GPS Navigation and Traffic Interpretation
For Intelligent GPS Navigation and Traffic Interpretation Tianshi Gao Stanford University tianshig@stanford.edu 1. Introduction Imagine that you are driving on the highway at 70 mph and trying to figure
More informationExploring Curve Fitting for Fingers in Egocentric Images
Exploring Curve Fitting for Fingers in Egocentric Images Akanksha Saran Robotics Institute, Carnegie Mellon University 16-811: Math Fundamentals for Robotics Final Project Report Email: asaran@andrew.cmu.edu
More informationCS 223B Computer Vision Problem Set 3
CS 223B Computer Vision Problem Set 3 Due: Feb. 22 nd, 2011 1 Probabilistic Recursion for Tracking In this problem you will derive a method for tracking a point of interest through a sequence of images.
More informationCS 231A Computer Vision (Fall 2012) Problem Set 3
CS 231A Computer Vision (Fall 2012) Problem Set 3 Due: Nov. 13 th, 2012 (2:15pm) 1 Probabilistic Recursion for Tracking (20 points) In this problem you will derive a method for tracking a point of interest
More informationMorphological Change Detection Algorithms for Surveillance Applications
Morphological Change Detection Algorithms for Surveillance Applications Elena Stringa Joint Research Centre Institute for Systems, Informatics and Safety TP 270, Ispra (VA), Italy elena.stringa@jrc.it
More informationWITH the increasing use of digital image capturing
800 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 3, MARCH 2011 A Hybrid Approach to Detect and Localize Texts in Natural Scene Images Yi-Feng Pan, Xinwen Hou, and Cheng-Lin Liu, Senior Member, IEEE
More informationHuman Upper Body Pose Estimation in Static Images
1. Research Team Human Upper Body Pose Estimation in Static Images Project Leader: Graduate Students: Prof. Isaac Cohen, Computer Science Mun Wai Lee 2. Statement of Project Goals This goal of this project
More informationEnhanced Hemisphere Concept for Color Pixel Classification
2016 International Conference on Multimedia Systems and Signal Processing Enhanced Hemisphere Concept for Color Pixel Classification Van Ng Graduate School of Information Sciences Tohoku University Sendai,
More information3 October, 2013 MVA ENS Cachan. Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos
Machine Learning for Computer Vision 1 3 October, 2013 MVA ENS Cachan Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos Iasonas.kokkinos@ecp.fr Department of Applied Mathematics Ecole Centrale
More informationCan Similar Scenes help Surface Layout Estimation?
Can Similar Scenes help Surface Layout Estimation? Santosh K. Divvala, Alexei A. Efros, Martial Hebert Robotics Institute, Carnegie Mellon University. {santosh,efros,hebert}@cs.cmu.edu Abstract We describe
More informationUsing the Kolmogorov-Smirnov Test for Image Segmentation
Using the Kolmogorov-Smirnov Test for Image Segmentation Yong Jae Lee CS395T Computational Statistics Final Project Report May 6th, 2009 I. INTRODUCTION Image segmentation is a fundamental task in computer
More informationApparel Classifier and Recommender using Deep Learning
Apparel Classifier and Recommender using Deep Learning Live Demo at: http://saurabhg.me/projects/tag-that-apparel Saurabh Gupta sag043@ucsd.edu Siddhartha Agarwal siagarwa@ucsd.edu Apoorve Dave a1dave@ucsd.edu
More informationStacked Denoising Autoencoders for Face Pose Normalization
Stacked Denoising Autoencoders for Face Pose Normalization Yoonseop Kang 1, Kang-Tae Lee 2,JihyunEun 2, Sung Eun Park 2 and Seungjin Choi 1 1 Department of Computer Science and Engineering Pohang University
More informationDeformable Part Models
CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones
More informationUnsupervised discovery of category and object models. The task
Unsupervised discovery of category and object models Martial Hebert The task 1 Common ingredients 1. Generate candidate segments 2. Estimate similarity between candidate segments 3. Prune resulting (implicit)
More informationRecognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)
Recognition of Animal Skin Texture Attributes in the Wild Amey Dharwadker (aap2174) Kai Zhang (kz2213) Motivation Patterns and textures are have an important role in object description and understanding
More informationGeneric Face Alignment Using an Improved Active Shape Model
Generic Face Alignment Using an Improved Active Shape Model Liting Wang, Xiaoqing Ding, Chi Fang Electronic Engineering Department, Tsinghua University, Beijing, China {wanglt, dxq, fangchi} @ocrserv.ee.tsinghua.edu.cn
More informationConvolution Neural Networks for Chinese Handwriting Recognition
Convolution Neural Networks for Chinese Handwriting Recognition Xu Chen Stanford University 450 Serra Mall, Stanford, CA 94305 xchen91@stanford.edu Abstract Convolutional neural networks have been proven
More informationAutomatic Shadow Removal by Illuminance in HSV Color Space
Computer Science and Information Technology 3(3): 70-75, 2015 DOI: 10.13189/csit.2015.030303 http://www.hrpub.org Automatic Shadow Removal by Illuminance in HSV Color Space Wenbo Huang 1, KyoungYeon Kim
More informationRoom Reconstruction from a Single Spherical Image by Higher-order Energy Minimization
Room Reconstruction from a Single Spherical Image by Higher-order Energy Minimization Kosuke Fukano, Yoshihiko Mochizuki, Satoshi Iizuka, Edgar Simo-Serra, Akihiro Sugimoto, and Hiroshi Ishikawa Waseda
More informationBinarization of Color Character Strings in Scene Images Using K-means Clustering and Support Vector Machines
2011 International Conference on Document Analysis and Recognition Binarization of Color Character Strings in Scene Images Using K-means Clustering and Support Vector Machines Toru Wakahara Kohei Kita
More informationFeature Selection for Image Retrieval and Object Recognition
Feature Selection for Image Retrieval and Object Recognition Nuno Vasconcelos et al. Statistical Visual Computing Lab ECE, UCSD Presented by Dashan Gao Scalable Discriminant Feature Selection for Image
More informationCHAPTER 6 DETECTION OF MASS USING NOVEL SEGMENTATION, GLCM AND NEURAL NETWORKS
130 CHAPTER 6 DETECTION OF MASS USING NOVEL SEGMENTATION, GLCM AND NEURAL NETWORKS A mass is defined as a space-occupying lesion seen in more than one projection and it is described by its shapes and margin
More informationA Technique for Classification of Printed & Handwritten text
123 A Technique for Classification of Printed & Handwritten text M.Tech Research Scholar, Computer Engineering Department, Yadavindra College of Engineering, Punjabi University, Guru Kashi Campus, Talwandi
More informationDetecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution
Detecting Salient Contours Using Orientation Energy Distribution The Problem: How Does the Visual System Detect Salient Contours? CPSC 636 Slide12, Spring 212 Yoonsuck Choe Co-work with S. Sarma and H.-C.
More informationImage Segmentation Techniques for Object-Based Coding
Image Techniques for Object-Based Coding Junaid Ahmed, Joseph Bosworth, and Scott T. Acton The Oklahoma Imaging Laboratory School of Electrical and Computer Engineering Oklahoma State University {ajunaid,bosworj,sacton}@okstate.edu
More informationTask analysis based on observing hands and objects by vision
Task analysis based on observing hands and objects by vision Yoshihiro SATO Keni Bernardin Hiroshi KIMURA Katsushi IKEUCHI Univ. of Electro-Communications Univ. of Karlsruhe Univ. of Tokyo Abstract In
More informationECS 289H: Visual Recognition Fall Yong Jae Lee Department of Computer Science
ECS 289H: Visual Recognition Fall 2014 Yong Jae Lee Department of Computer Science Plan for today Questions? Research overview Standard supervised visual learning building Category models Annotators tree
More informationUlas Bagci
CAP5415-Computer Vision Lecture 14-Decision Forests for Computer Vision Ulas Bagci bagci@ucf.edu 1 Readings Slide Credits: Criminisi and Shotton Z. Tu R.Cipolla 2 Common Terminologies Randomized Decision
More informationExperimentation on the use of Chromaticity Features, Local Binary Pattern and Discrete Cosine Transform in Colour Texture Analysis
Experimentation on the use of Chromaticity Features, Local Binary Pattern and Discrete Cosine Transform in Colour Texture Analysis N.Padmapriya, Ovidiu Ghita, and Paul.F.Whelan Vision Systems Laboratory,
More informationAdaptive Skin Color Classifier for Face Outline Models
Adaptive Skin Color Classifier for Face Outline Models M. Wimmer, B. Radig, M. Beetz Informatik IX, Technische Universität München, Germany Boltzmannstr. 3, 87548 Garching, Germany [wimmerm, radig, beetz]@informatik.tu-muenchen.de
More informationISSN Vol.03,Issue.02, June-2015, Pages:
WWW.IJITECH.ORG ISSN 2321-8665 Vol.03,Issue.02, June-2015, Pages:0077-0082 Evaluation of Ancient Documents and Images by using Phase Based Binarization K. SRUJANA 1, D. C. VINOD R KUMAR 2 1 PG Scholar,
More informationDiscrete Optimization of Ray Potentials for Semantic 3D Reconstruction
Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction Marc Pollefeys Joined work with Nikolay Savinov, Christian Haene, Lubor Ladicky 2 Comparison to Volumetric Fusion Higher-order ray
More informationBagging and Boosting Algorithms for Support Vector Machine Classifiers
Bagging and Boosting Algorithms for Support Vector Machine Classifiers Noritaka SHIGEI and Hiromi MIYAJIMA Dept. of Electrical and Electronics Engineering, Kagoshima University 1-21-40, Korimoto, Kagoshima
More informationThe Vehicle Logo Location System based on saliency model
ISSN 746-7659, England, UK Journal of Information and Computing Science Vol. 0, No. 3, 205, pp. 73-77 The Vehicle Logo Location System based on saliency model Shangbing Gao,2, Liangliang Wang, Hongyang
More informationBus Detection and recognition for visually impaired people
Bus Detection and recognition for visually impaired people Hangrong Pan, Chucai Yi, and Yingli Tian The City College of New York The Graduate Center The City University of New York MAP4VIP Outline Motivation
More informationLarge-Scale Traffic Sign Recognition based on Local Features and Color Segmentation
Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,
More informationExtracting Layers and Recognizing Features for Automatic Map Understanding. Yao-Yi Chiang
Extracting Layers and Recognizing Features for Automatic Map Understanding Yao-Yi Chiang 0 Outline Introduction/ Problem Motivation Map Processing Overview Map Decomposition Feature Recognition Discussion
More informationA System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation
A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation K. Roy, U. Pal and B. B. Chaudhuri CVPR Unit; Indian Statistical Institute, Kolkata-108; India umapada@isical.ac.in
More informationA Novel Approach to Image Segmentation for Traffic Sign Recognition Jon Jay Hack and Sidd Jagadish
A Novel Approach to Image Segmentation for Traffic Sign Recognition Jon Jay Hack and Sidd Jagadish Introduction/Motivation: As autonomous vehicles, such as Google s self-driving car, have recently become
More informationImproving the Efficiency of Fast Using Semantic Similarity Algorithm
International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year
More informationSeparation of Overlapping Text from Graphics
Separation of Overlapping Text from Graphics Ruini Cao, Chew Lim Tan School of Computing, National University of Singapore 3 Science Drive 2, Singapore 117543 Email: {caorn, tancl}@comp.nus.edu.sg Abstract
More information