Extracting Characters From Books Based On The OCR Technology

Similar documents
An algorithm of lips secondary positioning and feature extraction based on YCbCr color space SHEN Xian-geng 1, WU Wei 2

Multi-Step Segmentation Method Based on Adaptive Thresholds for Chinese Calligraphy Characters

Research on QR Code Image Pre-processing Algorithm under Complex Background

An Efficient Character Segmentation Algorithm for Printed Chinese Documents

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

Word Matching of handwritten scripts

An adaptive container code character segmentation algorithm Yajie Zhu1, a, Chenglong Liang2, b

Time Stamp Detection and Recognition in Video Frames

A Method of Face Detection Based On Improved YCBCR Skin Color Model Fan Jihui1, a, Wang Hongxing2, b

Study on Image Position Algorithm of the PCB Detection

Video Inter-frame Forgery Identification Based on Optical Flow Consistency

An Improved DCT Based Color Image Watermarking Scheme Xiangguang Xiong1, a

Research of Traffic Flow Based on SVM Method. Deng-hong YIN, Jian WANG and Bo LI *

Scene Text Detection Using Machine Learning Classifiers

A Technique for Classification of Printed & Handwritten text

Image Segmentation Based on Watershed and Edge Detection Techniques

Enhanced Image. Improved Dam point Labelling

Numerical Recognition in the Verification Process of Mechanical and Electronic Coal Mine Anemometer

2 Proposed Methodology

Project Report for EE7700

Analysis of TFT-LCD Point Defect Detection System Based on Machine Vision

A Robust Color Image Watermarking Using Maximum Wavelet-Tree Difference Scheme

Implementation Of Harris Corner Matching Based On FPGA

An Adaptive Threshold LBP Algorithm for Face Recognition

Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques

ABJAD: AN OFF-LINE ARABIC HANDWRITTEN RECOGNITION SYSTEM

Face Recognition Technology Based On Image Processing Chen Xin, Yajuan Li, Zhimin Tian

IMAGE S EGMENTATION AND TEXT EXTRACTION: APPLICATION TO THE EXTRACTION OF TEXTUAL INFORMATION IN SCENE IMAGES

OCR For Handwritten Marathi Script

Nitesh Kumar Singh, Avinash verma, Anurag kumar

HIKVISION. Automatic Number Plate Recognition Technology. A universal and efficient algorithm for global ANPR application

Artifacts and Textured Region Detection

A New Technique of Extraction of Edge Detection Using Digital Image Processing

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

N.Priya. Keywords Compass mask, Threshold, Morphological Operators, Statistical Measures, Text extraction

The Application of Image Processing to Solve Occlusion Issue in Object Tracking

International Journal of Advance Research in Engineering, Science & Technology

An embedded system of Face Recognition based on ARM and HMM

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

A Model-based Line Detection Algorithm in Documents

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

An Efficient Character Segmentation Based on VNP Algorithm

A Fast Method of Vehicle Logo Location Honglin Li

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM

An Objective Evaluation Methodology for Handwritten Image Document Binarization Techniques

Study on Intelligent Transportation Moving Targets Detection Based on. background Image Difference Method

6. Applications - Text recognition in videos - Semantic video analysis

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

A Tentative Study on Ward Monitoring System based on Zigbee Technology Jifeng Liang

A Background Modeling Approach Based on Visual Background Extractor Taotao Liu1, a, Lin Qi2, b and Guichi Liu2, c

Journal of Chemical and Pharmaceutical Research, 2015, 7(3): Research Article

Introducing Robotics Vision System to a Manufacturing Robotics Course

Texture Sensitive Image Inpainting after Object Morphing

Fingerprint Ridge Distance Estimation: Algorithms and the Performance*

Optical Character Recognition Based Speech Synthesis System Using LabVIEW

Keyword Spotting in Document Images through Word Shape Coding

Segmentation of Characters of Devanagari Script Documents

Skew Detection and Correction of Document Image using Hough Transform Method

Indian Multi-Script Full Pin-code String Recognition for Postal Automation

Temperature Calculation of Pellet Rotary Kiln Based on Texture

Locating 1-D Bar Codes in DCT-Domain

Implementation and Advanced Results on the Non-Interrupted Skeletonization Algorithm

Critique: Efficient Iris Recognition by Characterizing Key Local Variations

Counting Particles or Cells Using IMAQ Vision

(Refer Slide Time 00:17) Welcome to the course on Digital Image Processing. (Refer Slide Time 00:22)

Handwritten Script Recognition at Block Level

Application of Geometry Rectification to Deformed Characters Recognition Liqun Wang1, a * and Honghui Fan2

Your Flowchart Secretary: Real-Time Hand-Written Flowchart Converter

An Efficient Image Sharpening Filter for Enhancing Edge Detection Techniques for 2D, High Definition and Linearly Blurred Images

RECOGNIZING TYPESET DOCUMENTS USING WALSH TRANSFORMATION. Attila Fazekas and András Hajdu University of Debrecen 4010, Debrecen PO Box 12, Hungary

Recognition of Unconstrained Malayalam Handwritten Numeral

Serial Communication Based on LabVIEW for the Development of an ECG Monitor

Mobile Camera Based Calculator

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION

Journal of Applied Research and Technology ISSN: Centro de Ciencias Aplicadas y Desarrollo Tecnológico.

Carmen Alonso Montes 23rd-27th November 2015

MOVING OBJECT DETECTION USING BACKGROUND SUBTRACTION ALGORITHM USING SIMULINK

Research on the Application of Digital Images Based on the Computer Graphics. Jing Li 1, Bin Hu 2

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Mobile Camera Based Text Detection and Translation

Bit-Plane Decomposition Steganography Using Wavelet Compressed Video

Resolution Improvement Processing of Post-stack Seismic Data and Example Analysis - Taking NEB Gas Field As an Example in Indonesia

Historical Handwritten Document Image Segmentation Using Background Light Intensity Normalization

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution

I. INTRODUCTION. Figure-1 Basic block of text analysis

Keywords Connected Components, Text-Line Extraction, Trained Dataset.

Quaternion-based color difference measure for removing impulse noise in color images

MATRIX BASED INDEXING TECHNIQUE FOR VIDEO DATA

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Restoring Warped Document Image Based on Text Line Correction

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera

International Journal of Modern Engineering and Research Technology

Palmprint Recognition Using Transform Domain and Spatial Domain Techniques

Wavelength Estimation Method Based on Radon Transform and Image Texture

EXTRACTING TEXT FROM VIDEO

Tables Part I. Session 45: Creating Tables Session 46: Modifying the Layout of Tables Session 47: Formatting the Design of Tables. Unit.

Investigation of Algorithms for Calculating Target Region Area

Iterative Removing Salt and Pepper Noise based on Neighbourhood Information

Research Fellow, Korea Institute of Civil Engineering and Building Technology, Korea (*corresponding author) 2

High Resolution Remote Sensing Image Classification based on SVM and FCM Qin LI a, Wenxing BAO b, Xing LI c, Bin LI d

Transcription:

2016 International Conference on Engineering and Advanced Technology (ICEAT-16) Extracting Characters From Books Based On The OCR Technology Mingkai Zhang1, a, Xiaoyi Bao1, b,xin Wang1, c, Jifeng Ding1, d and Xiaoke Xu1, e 1 School of Dalian,Nationalities University, Liaoning 116600, China; a b c d 974569651@qq.com, 934952212@qq.com, 1134724200@qq.com, djf@dlnu.edu.cn Keywords: image pre-processing, character segmentation, feature extraction. Abstract. Since the 1990s,information based on image, video and audio frequency become popular trends during daily information exchange. Information identification about images and videos which based on the Optical Character Recognition(OCR)have become more and more popular in various enterprise. Compared with traditional manual entry, OCR is not only save manpower cost but also optimize the resource configurations. 1. Introduction With the development of communication technology, multimedia technological such as image, video and audio frequency is now coming into every field of our life. More and more people try to recognize text information from these multimedia and show them out to copy, store and export. Nowadays we plan to achieve the method to extract information in order to the next step for audio processing. So how to extract these information is a vital technology for this. In the digital image processing field, text image has abundant edges and grains. There are both advantages and disadvantages between the two characteristics. As edge feature, the algorithm speed is faster but the complex background make the final result worse. As texture feature, universal property is widely but it has a higher complexity of algorithm. This article in predecessor s research foundation, finds a algorithm which aims to scripts format and direction of rules to extract the information we need in text. 2.Overall design The hardware of the system is collected by the image acquisition circuit, then analyze, process, recognize and understand the image by main chip, then display by the display circuit. The image acquisition circuit is a OV2640 camera, The camera has high sensitivity, high flexibility, support JPEG output, etc.. Main chip is STM32F407,it is a chip that the most advanced in 32 series,it has a DSP that the highest frequency band is168m. Display circuit is a 4.3 inch color screen,this module uses the TFTLCD panel that can display 16 colors true color image,software is processing the image though Image gray, Binarization, Image denoising, Character cutting 3.Image pre-processing When the system test text image, it will locate and extract the text area of text image. Firstly we need to do is image pre-processing. Pre-processing can be divided into image gray, binarization, denoisingand character segmentation. Image gray. Before processing the image, we need to transform the color image collected by camera into gray scale consist of 256 colors. In this way, computer can process the image better. In RGB model, when R=G=B, color express one of gray color and the number R=G=B called gray level, therefore, each pixel only need to one byte to put gray level(intensity or brightness value)which range form 0-255. Copyright 2017, the Authors. Published by Atlantis Press. This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/). 388

Binarization.Binarization image is gray, the pixels set to a value of 0 (black) or 255 (white), Text i mage binarization is separated from the background of strokes. Omitting irrelevant information for f acilitate subsequent processing. Then the binarization threshold is very important, The larger thresh old will retain more information,but a lot of useless information may be retained to cause interferen ce; The smaller threshold will lost a lot of useful information,cause the information is not complete when extract the text.so how to decide the binarization threshold is the important of the algorithm. Binarization algorithm has many kinds,the flow diagram is as follows. start According to the gray level histogram, record the maximum and minimum value of the gray Initial threshold =Gary value of the minimum*weight coefficient + Gary value of the maximum*weight coefficient Segmentation image, the average value of the gray scale of the two segmented regions are obtained New threshold= gray scale1 *weight coefficient1 + maximum2*weight coefficient2 New threshold to replace the old threshold Whether the new threshold is equal NO YES Using the threshold image segmentation Output binary image finish Figure 1. Binarization flow diagram Image denoising. The factors which hinder the information acquisition called pattern noise. These noises can be brought during spread or quantization. For collecting text image s system, the noise is more brought by the collecting process, which called multiplicative noise. What image denoising do is eliminating disturb factors that infect the right information. The figure diagram is as flows. 389

Connected domain unit Merge adjacent connected connected domain elements Width of computing unit Appropriate Too Small Merge into other units Find a cell containing the adhesion character Searching for the best segmentation position by crossing algorithm Removal of unreasonable segmentation position Determine the final segmentation location Complete segmentation Figure 2. Denoising flow diagram Character cutting. The contents of text image are not single character but combine with many characters include: Chinese characters, punctuation and charts. Character cutting is significant part of paper reading system because the direction test of text image is achieves by using a single character direction. If character in text can regard as the basis, we must divide whole text image into single character area. So the text requires character segmentation. There are two methods of character cutting. One of it is based on characters architectural feature. This feature use characters height, width pitch and stroke structure feature to cut the character. Another method is based on character recognition, this method can recognize character while cutting. And confirm the final position according to the result of recognition. The second method need combine with recognition algorithm so we adopt the first algorithm. 4.Extract of Chinese character features Every Chinese character feature s has different applicable environment. We should choose different Chinese character feature for different systems to reach the best result. Thirteen feature point extraction method.thirteen feature point extraction is a method that extracting thirteen feature points from each character. This algorithm has better adaptation. First step is to divide texts into eight areas randomly and make statistic of the black pixel number, then we can obtain eight proper values. Second step is to record the middle of two row and the middle of the two row, regard them as four feature vectors. It means draw two lines at horizontal s 1/3 and vertical 2/3 and record the number of black pixels of the two lines and the image of the text to get four character values. 390

The last feature value is the sum of all black pixels of the text image. We get thirteen feature point, that is the thirteen feature point extraction method. 5.Recognition algorithm Correlation matching method.correlation matching method is a statistical recognition method. The advantage of statistical method is that it is easy to build the feature extraction and template, strong anti-interference capability, it makes that it insensitive to local noise. The disadvantage is that the ability to distinguish similar words is weak, It is sensitive to the change of the style of writing. The figure diagram is as flows. 6.Systematic measurement and analysis Figure 3. Test result The system has the highest recognition rate for the training template.it can reach 100%, For any interception of the better quality of the text image, Recognition rate is also good, but the recognition rate is low for the text image collected by the camera 7. Summary With the development of information society, Chinese character recognition technology is becoming more and more important. We designed this system on the basis of summarizing the previous work done by our predecessors. The system has achieved the expected requirements in recognition rate and recognition effect. The problem that needs to be solved is to improve the pretreatment technology of Chinese character recognition system. References [1]Takashi M.A survey of image Processing and computer vision software systems on work stations. IAPR/TCS,1992.1~35 [2]KitehenL,Rosenfeld A.Gray-levele comer detection [J]. Pattem Recognition Letters, 1982,l:95~102 [3] R.Stefaneli.Some Parallel thinning algorithms for digital Pictures. J. Ass. ComPuter [M]. Mach. 18, 1971,255~264 [4] Keghbati H K, An Overview of Data ComPression Technique. ComPuter, 1982,14(4):71~76 391

[5]Mori K, LMasuda. Advances in Recognition of Chinese characters, Proe. Of 5thInten. eonf. On Pattern Recognition, 1980:692~720. [6] ]Whilchello A P, Yan H. Linking broken character borders with variable sized marks to improve recognition. Pattern Recognition, 1996, 29(8): 1429 ~ 1433. [7] Yu T, Tang Y Y. The feature extraction of Chinese character based on Contour Information. Proceedings of the Fifth International Conference on Document Analysis and Recognition, 1999:637~640 [8] Aftabul l, Md R H, Razwanur R et al. Designing ANN using sensitivity and hypothesis correlation testing. 10th International Conference on Computer and Information Teehnology,2008:l~6. [9] Huang L, Huang X, Multiresolutiong Recognition of Offiine Handwritten Chinese Characters with Wavelet Transform,ICDAR,2001,63 [10] Wang X F, Ding X Q, Liu C S, Optinimized Gabor Filter Based Feature Extraction for Character Recognition, ICPR,2002,223 392