Study on the Signboard Region Detection in Natural Image


pp.179-184
http://dx.doi.org/10.14257/astl.2016.140.34

Daeyeong Lim 1, Youngbaik Kim 2, Incheol Park 1, Jihoon Seung 1, Kilto Chong 1,*

1 1567 Baekje-daero, Deokjin-gu, Jeonju-si, Jeollabuk-do 54896, Republic of Korea
2 KEPCO Engineering and Construction Company, Inc., Republic of Korea
1 tamudylim@gmail.com, 2 ybkkim@kepco-enc.com, 1 incp486@naver.com, 1 jeehun@nate.com, 1,* kitchong@jbnu.ac.kr

Abstract. Computer vision research that acquires information from images or video first needs a technique for finding the region of interest (ROI). In this paper, we review prior ROI detection techniques and obtain the ROI adaptively to address the problems of the conventional methods. A signboard detection method is proposed that searches for Hough lines, determines the area of interest, and reconstructs it while taking the warped image into consideration.

Keywords: Signboard image, Natural scenes, Hough transform

1 Introduction

To acquire the necessary information from images or video, computer vision research first needs a technique for finding the ROI. In this paper, an ROI detection method for building signboards is studied. Text in a natural image can provide the clearest and most meaningful information available. In particular, a signboard includes information such as the business name and telephone number. Since the characteristics of a building can be determined by classifying this information, it can serve as the main information for autonomous navigation of an unmanned robot [1]. We focus on the preprocessing step, which has the greatest influence on detecting the candidate signboard area, and compare existing techniques with the proposed technique.

2 Signboard ROI Detection

2.1 Difficulty in Recognizing Signs in Natural Images

Similar studies on sign recognition include license plate recognition, traffic sign recognition, and container identification. In those studies, the target has a known size and color as well as an approximate mounting location. Signboards, on the other hand, vary in size, location, and color. Because of these characteristics, signboard image recognition presents various obstacles.

ISSN: 2287-1233 ASTL Copyright 2016 SERSC

Common difficulties in recognition are the effects of shadows and lighting, as well as shape distortion caused by tilting the camera during shooting. These difficulties can be addressed through color analysis and structural analysis. Signboard recognition in particular faces various types of obstacles. Typical examples are electric poles, wires, trees, cars, and doors. Since most natural images contain these obstacles, they must be considered. Horizontal edge, vertical edge, point, and line information are used to recognize a signboard. Trees contain complex horizontal, vertical, and point information, which makes them difficult to classify as non-signboard areas. Wires, cars, doors, and similar objects are difficult to classify as non-signboard areas because of light reflection and color information. Therefore, existing methods for solving the problems inherent in natural images are reviewed, and their performance is compared with that of the proposed method.

2.2 Related Works

Fig. 1. Block diagram of the proposed method.

The procedure of the proposed signboard ROI detection is shown in Fig. 1. There is an image preprocessing step for image binarization, which is introduced in Section 2.2. The subsequent steps are covered in Sections 2.3 through 2.4.

2.2.1 Horizontal Edge

The Sobel mask is used for the preprocessing method based on the horizontal edge. Sobel is an operator that detects edges by differentiating the image to compute its gradient. Because the mask averages neighboring pixel values, it is relatively robust to noise [2]. If there are no obstacles or interference in the natural image, the signboard can be separated quickly and accurately using the horizontal edge of the Sobel mask. However, if there is an obstacle or interference, the Sobel result includes edge information from the non-signboard area, which makes it difficult to separate the background from the signboard.
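The horizontal-edge preprocessing of Section 2.2.1 can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function name `horizontal_edges`, the threshold value, and the edge-replicating border handling are assumptions made for illustration. The only part taken from the text is the use of a Sobel mask that differentiates in the vertical direction, which is what responds to horizontal edges.

```python
import numpy as np

# Sobel mask that differentiates along the vertical axis; it responds to horizontal edges.
SOBEL_Y = np.array([[-1, -2, -1],
                    [ 0,  0,  0],
                    [ 1,  2,  1]], dtype=float)


def horizontal_edges(gray, threshold=100.0):
    """Return a boolean map of horizontal edges in a 2-D gray-scale array.

    `threshold` is a hypothetical value chosen for illustration only.
    """
    gray = np.asarray(gray, dtype=float)
    h, w = gray.shape
    # Replicate border pixels so the output has the same size as the input.
    padded = np.pad(gray, 1, mode="edge")
    response = np.zeros((h, w))
    # Accumulate the weighted, shifted copies of the image (3x3 correlation).
    for dy in range(3):
        for dx in range(3):
            response += SOBEL_Y[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return np.abs(response) >= threshold
```

On an image whose upper half is dark and lower half is bright, the map marks the two rows around the transition, while a purely vertical transition produces no response.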

2.2.2 Color Distance

The color information of the image is important for sign recognition. Many studies use the HSV color model. In natural images, a sign can be characterized as a region of similar colors. The similarity of colors is measured by the color distance. Using the H, S, and V values, the color distance is calculated as follows.

$CD = \mathrm{Hue}^2 + \mathrm{Saturation}^2 + \mathrm{Value}^2$

The histogram of the image is obtained from the CD values computed above [3].

Fig. 2. Histogram after CD application.

Fig. 3. Histogram after fuzzy c-mean application.

Based on the obtained histogram, non-signboard areas can be separated by keeping the portion of the histogram that holds a large amount of information. This method works only when the signboard occupies at least half of the image and the color distance between the background and the signboard is large.

2.2.3 Fuzzy C-mean

The fuzzy c-mean (FCM) algorithm is used to cluster image color information using feature vectors. The FCM algorithm has the following objective function.

$J_{FCM}(U, V) = \sum_{i=1}^{c} \sum_{k=1}^{N} u_{ik}^{p} \lVert x_k - v_i \rVert^2$

Here $u_{ik}$ is the membership of pixel $x_k$ in cluster $i$, $v_i$ is the centroid of cluster $i$, $p$ is the fuzziness exponent, $c$ is the number of clusters, and $N$ is the number of pixels. The performance index $J_{FCM}(U, V)$ measures the weighted sum of distances between the cluster centers and the elements of the corresponding fuzzy clusters, and it is minimized when pixels are close to the centroids of their clusters. Such pixels are assigned high membership values, and the remaining pixels are assigned low membership values [4]. After performing the fuzzy c-mean, the histogram is classified into two cluster values, as shown in Fig. 3.

2.2.4 Proposed Method

The proposed preprocessing technique is based on morphology. Morphology analyzes the geometric shape of an image by means of erode and dilate operations.
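As a rough illustration of the erode and dilate operations on a gray-scale image, the sketch below implements both with NumPy. It assumes the standard gray-scale definitions, in which dilation takes the maximum of f(x - x', y - y') + b(x', y') and erosion takes the minimum of f(x + x', y + y') - b(x', y') over the offsets of the structuring element b. The function names, the centered origin of b, the requirement that b has odd dimensions, and the rule that positions outside the image are ignored are all assumptions made for this example, not details taken from the paper.

```python
import numpy as np


def gray_dilate(f, b):
    """Gray-scale dilation: max over offsets of f(y - y', x - x') + b(y', x')."""
    f = np.asarray(f, dtype=float)
    b = np.asarray(b, dtype=float)  # assumed to have odd height and width
    ry, rx = b.shape[0] // 2, b.shape[1] // 2
    h, w = f.shape
    # Pad with -inf so positions outside the image can never win the maximum.
    padded = np.pad(f, ((ry, ry), (rx, rx)), constant_values=-np.inf)
    out = np.full((h, w), -np.inf)
    for j in range(b.shape[0]):
        for i in range(b.shape[1]):
            dy, dx = j - ry, i - rx  # offset (y', x') relative to the centre of b
            window = padded[ry - dy:ry - dy + h, rx - dx:rx - dx + w]
            out = np.maximum(out, window + b[j, i])
    return out


def gray_erode(f, b):
    """Gray-scale erosion: min over offsets of f(y + y', x + x') - b(y', x')."""
    f = np.asarray(f, dtype=float)
    b = np.asarray(b, dtype=float)  # assumed to have odd height and width
    ry, rx = b.shape[0] // 2, b.shape[1] // 2
    h, w = f.shape
    # Pad with +inf so positions outside the image can never win the minimum.
    padded = np.pad(f, ((ry, ry), (rx, rx)), constant_values=np.inf)
    out = np.full((h, w), np.inf)
    for j in range(b.shape[0]):
        for i in range(b.shape[1]):
            dy, dx = j - ry, i - rx
            window = padded[ry + dy:ry + dy + h, rx + dx:rx + dx + w]
            out = np.minimum(out, window - b[j, i])
    return out
```

With a flat 3x3 structuring element (all zeros), dilation spreads a single bright pixel into a 3x3 block, and eroding that block shrinks it back to the original pixel.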

The preprocessing process is as follows.

Erode → Splitting and Merging → Dilate → Splitting and Merging

Dilation of the gray-scale image f by the structuring element b is denoted by $f \oplus b$ and is defined as follows.

$(f \oplus b)(x, y) = \max\{ f(x - x', y - y') + b(x', y') \mid (x', y') \in D_b \}$

Erosion of the gray-scale image f by the structuring element b is denoted by $f \ominus b$ and is defined as follows.

$(f \ominus b)(x, y) = \min\{ f(x + x', y + y') - b(x', y') \mid (x', y') \in D_b \}$

That is, the structuring element value is subtracted from the image pixel value at each offset position, and the minimum of the results is taken.

The splitting and merging steps can be summarized as follows.

Step 1. Split into four disjoint quadrants any region $R_i$ for which $P(R_i)$ = FALSE.
Step 2. When no further splitting is possible, merge any adjacent regions $R_i$ and $R_j$ for which $P(R_i \cup R_j)$ = TRUE.
Step 3. Stop when no further merging is possible.

2.3 Hough Transform

In the Hough transform, we consider a point (x, y) and all the lines that pass through it. Infinitely many lines pass through the point (x, y), and all of them satisfy the slope-intercept equation y = ax + b for some values of a and b. If we rewrite this equation as b = -ax + y and consider the ab-plane, we obtain the equation of a single line for a fixed pair (x, y).

For line detection with the Hough transform, ten lines were extracted, each described by two points, and line components shorter than about 17% of the image width were excluded. Among the ten detected line components, adjacent lines and intersecting lines were removed. Fig. 4 shows the results of applying the four techniques to natural images that include many horizontal components such as doors, windows, and bricks. It can be confirmed that the proposed method effectively detects the upper and lower lines of the signboard area.
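The ab-plane voting described in Section 2.3 can be sketched as follows. This is an illustrative sketch under stated assumptions rather than the authors' implementation: the function name `hough_lines_ab`, the slope range of -0.5 to 0.5 (suitable only for near-horizontal lines), the use of the vote count as a proxy for line length, and the intercept gap `min_gap` used to drop adjacent lines are all hypothetical choices. The limit of ten lines and the 17% width ratio come from the text; removal of intersecting lines is not implemented here.

```python
import numpy as np


def hough_lines_ab(edges, slopes=None, max_lines=10, min_len_ratio=0.17, min_gap=5):
    """Detect near-horizontal lines y = a*x + b by voting in the ab-plane.

    Returns a list of (a, b) pairs ordered by decreasing vote count.
    """
    edges = np.asarray(edges, dtype=bool)
    h, w = edges.shape
    if slopes is None:
        # Assumed slope range; vertical lines cannot be represented in this form.
        slopes = np.linspace(-0.5, 0.5, 41)
    ys, xs = np.nonzero(edges)

    # Intercepts b = -a*x + y can be negative, so shift them by `offset`.
    offset = int(np.ceil(np.abs(slopes).max() * w))
    n_bins = h + 2 * offset
    acc = np.zeros((len(slopes), n_bins), dtype=int)
    for i, a in enumerate(slopes):
        b_idx = np.rint(-a * xs + ys).astype(int) + offset
        acc[i] = np.bincount(b_idx, minlength=n_bins)

    # Assumption: votes approximate line length in pixels for near-horizontal lines.
    min_votes = min_len_ratio * w
    lines = []
    for flat in np.argsort(acc, axis=None)[::-1]:
        i, j = np.unravel_index(flat, acc.shape)
        if acc[i, j] < min_votes or len(lines) == max_lines:
            break
        b = int(j) - offset
        # Skip lines whose intercept is close to an already accepted line.
        if any(abs(b - b0) < min_gap for _, b0 in lines):
            continue
        lines.append((float(slopes[i]), b))
    return lines
```

Sorting the returned pairs by intercept b would then give the uppermost and lowermost candidate lines, which matches the y-intercept ordering used for image division in Section 2.4.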

Fig. 4. Hough transform results using (a) horizontal edge, (b) color distance, (c) fuzzy c-mean, and (d) the proposed method.

2.4 Image Division and Distortion Correction

Image division is performed by storing the part of the image that lies between two straight lines, using the linear equations calculated from the Hough transform. The lines were sorted by y-intercept in order to distinguish the uppermost and lowermost straight lines. Distortion was corrected by a geometric transformation that projects the warped image [5]. Four points on both sides of the image were used for the projection. In addition, the signboard part can be detected within the candidate area using the vertical-axis component of the gray image.

3 Experiment Results

A total of 107 natural images were used to test the proposed method, and the results were compared with those of the three existing methods. The performance indices are subdivided, and the final results are somewhat lower because each method exhibits 0% detection except under certain conditions.

Table 1. The performance analysis.

Algorithm               Accuracy (%)   Processing time (s)
Horizontal edge (HE)    49.8131        0.0153
Color distance (CD)     76.0280        0.0421
Fuzzy c-mean (FC)       63.1776        4.1671
Proposed method (PM)    87.5701        0.0428

The experimental results show that PM has the highest accuracy, 87.5701%, with an average processing time of 0.0428 s per natural image. FC reaches 63.1776% accuracy but is considered inadequate for real-time application because of its much longer computation time. HE is fast, but it is difficult to use alone unless the signboard is constrained to lie at the center of the image. CD, whose computation time is similar to that of PM, was 11.5421 percentage points less accurate. Nevertheless, fusing CD with PM could improve PM's accuracy, because CD detected five of the ten images that PM could not detect.

4 Conclusion

In this paper, we propose a preprocessing technique for natural images that enables signboards to be recognized through the Hough transform. Because the preprocessing method has a great influence on signboard recognition, the effectiveness of the proposed method is verified by comparison with existing methods. Experimental results show that the proposed method has high accuracy and a processing speed suitable for real-time use. The proposed method can detect the signboard area without restrictions on the size and position of the signboard. Sign recognition technology can provide information not only for the navigation of autonomous mobile robots but also to users. As shown in the experimental results, a performance improvement can be expected from combining CD with PM. In the future, we plan to increase the recognition rate for signboards in a wider variety of natural images and to continue studying word recognition through text detection.

Acknowledgements. This research was financially supported by the Ministry of Trade, Industry and Energy (MOTIE) and Korea Institute for Advancement of Technology (KIAT) through the International Cooperative R&D program (n046200012) and the Brain Korea 21 PLUS Project, National Research Foundation (NRF) of Korea. It was also supported by the NRF grant funded by the Korea government (MEST) (No. 2013R1A2A2A01068127) and (No. 2013R1A1A2A10009458) as well as by Basic Science Research Program (NRF) funded by the Ministry of Science.

References

1. Park, K.H., Lee, G.S.: A sign board region detection using Hough Transform. pp. 57-59, vol. 12, No. 1, kmms (2009)
2. Park, J., Lee, G.: Automatic detection and recognition of Korean text in outdoor signboard images. pp. 1728-1739. Elsevier, Pattern Recognition Letters 31 (2010)
3. Adam, A., Ioannidis, C.: Automatic Road-sign detection and classification based on support vector machines and HOG descriptors. pp. 1-7, vol. II-5, ISPRS (2014)
4. Mahajan, S.M., Dubey, Y.K.: Color Image Segmentation using Kernalized Fuzzy C-means Clustering. pp. 1142-1146, IEEE (2015)
5. Milevskiy, I., Ha, J.Y.: A Fast Algorithm for Korean Text Extraction and Segmentation from Subway Signboard Images Utilizing Smartphone Sensor. pp. 161-166, vol. 5, No. 3, JCSE (2011)