A Multimodal Framework for the Recognition of Ancient Tamil Handwritten Characters in Palm Manuscript Using Boolean Bitmap Pattern of Image Zoning


E. K. Vellingiriraj, Asst. Professor, and Dr. P. Balasubramanie, Professor
Department of Computer Science & Engineering, Kongu Engineering College, Perundurai
girirajek@rediffmail.com

Abstract: Tamil is one of the oldest languages in the world and has a rich literature. In ancient times, writers, especially in Tamilnadu, used palm leaves to record their writing. A very good example of the use of palm leaf manuscripts to preserve history is the Tamil grammar book Tolkappiyam, which was written during the 4th century B.C. The ancient literature includes many palm leaf manuscripts that contain Sangam works, classics, Saiva, Vaishnava and Jain works, medical works, food, astronomy & astrology, vaastu & Kaama shastra, jewellery, music, dance & drama, medicine, Siddha and so on. Over 3,500 Tamil manuscripts are available in the Saraswathi Mahal Library in Thanjavur, Tamilnadu, India. Only a few of the palm leaf manuscripts in this library have been digitized; many remain to be digitized to enable quick reference in the future. The objective of the proposed research is to develop a model that can recognize Tamil characters from palm manuscripts and convert them into text format. In the field of handwritten character recognition, image zoning is a widespread technique for feature extraction because it is considered able to cope with the variability of handwritten patterns.

1. Introduction: Tamil handwritten character recognition is one of the most difficult tasks in pattern recognition. Several of its difficulties can be addressed with image processing techniques: separating each character, and recognizing the character forms and writing styles used in different centuries. Many researchers have applied a variety of techniques to the complex problems of Tamil handwritten character recognition.
Optical character recognition (OCR) is the process of recognizing printed or handwritten text in document images and converting it into text format. Many algorithms in the literature perform this conversion for a specific language. Many researchers have worked on Tamil OCR, but they have been unable to obtain more than 90% accuracy. Recognizing ancient Tamil characters is one step harder still. The objective of the current research is to recognize ancient Tamil characters from old palm manuscripts and convert them into text format.

2. Related Work: Following [1], text lines are extracted from the image of a palm manuscript. Conventional text line extraction methods can be roughly classified into four categories:
i. Projection based methods
ii. Hough transform based methods
iii. Bottom-up grouping methods
iv. Segmentation based methods
Most work on text line segmentation can also be roughly categorized as either bottom-up or top-down. In the top-down methodology, a document page is first segmented into zones, each zone is then segmented into lines, and so on. Projection based methods are among the most successful top-down algorithms for printed documents, and they can be applied to handwriting only if the gaps between neighboring handwritten lines are sufficient. Projection based methods allow the efficient extraction of text lines [2], [3]. We use this method for palm manuscripts.
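As an illustration of the projection based idea described above, the following is a minimal sketch and not the authors' implementation. It assumes the page has already been binarized into rows of 0s and 1s (1 = dark pixel); the function names and the `min_ink` parameter are hypothetical. Each row's dark pixels are counted, and consecutive rows that contain ink are grouped into one text line.

```python
from typing import List, Tuple

BoolImage = List[List[int]]  # assumed encoding: 1 = dark (ink) pixel, 0 = light pixel


def horizontal_projection(image: BoolImage) -> List[int]:
    """Count the dark pixels in every row of a binarized page image."""
    return [sum(row) for row in image]


def extract_text_lines(image: BoolImage, min_ink: int = 1) -> List[Tuple[int, int]]:
    """Return (first_row, last_row) for each run of rows with at least min_ink dark pixels."""
    lines: List[Tuple[int, int]] = []
    start = None
    for y, ink in enumerate(horizontal_projection(image)):
        if ink >= min_ink and start is None:
            start = y                      # a text line begins
        elif ink < min_ink and start is not None:
            lines.append((start, y - 1))   # a gap is reached, the line ends
            start = None
    if start is not None:                  # a line that touches the bottom edge
        lines.append((start, len(image) - 1))
    return lines


# Toy page: two "text lines" separated by an empty row.
page = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
print(extract_text_lines(page))  # [(1, 2), (4, 4)]
```

The sketch only works when an empty row separates neighboring lines, which is the same limitation the text notes for handwriting with insufficient gaps.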

3. Methodology: This section gives the details of the proposed system design, beginning with the overall framework of the ancient Tamil handwritten character recognition system.

3.1 System Architecture Overview:

Fig. 1: Framework of the Ancient Tamil Handwritten character recognition in Palm manuscripts

3.2 System Structure Chart: Based on the system framework in the previous section, the Tamil palm leaf image is converted into Tamil text format. The framework comprises i) scanning, ii) preprocessing, iii) feature extraction, iv) recognition and v) text conversion.

Fig. 2: Structure chart of Ancient Tamil handwritten characters recognition by image zoning using the Boolean matrix (chart nodes: Tamil Palm Leaf Recognition; Scanning; Preprocessing; Feature Extraction; Recognition; Display Result; Cropping; Resizing; Thicken; Binarization; to Boolean Matrix; Grouping; Pattern Matching; Text Conversion)

i. Scanning: In the first stage, Tamil palm leaf manuscripts belonging to different centuries are collected from various places in Tamilnadu. These manuscripts are scanned with a 4800 dpi scanner and stored in JPEG format.

ii. Preprocessing: The image preprocessing module prepares a palm manuscript handwritten character image for the feature extraction module. This stage consists of five sub-processes: a) image cropping, b) segmentation, c) image re-sizing, d) image thickening and e) binarization. Each of these sub-processes is described below:

a) Cropping: This process crops each word. The scanned palm leaf image contains white space, and this white space is used to crop the words.
b) Segmentation: There are three types of segmentation: line segmentation, word segmentation and character segmentation. Researchers have used various segmentation techniques, such as threshold techniques, region based methods, edge based methods, graph based methods, clustering methods, compression based methods, histogram methods, the watershed transformation and model based methods. Here we use edge detection to segment the characters. Edge detection is the name for a set of mathematical methods that aim to identify points in a digital image at which the image brightness changes sharply or, more formally, has discontinuities. The Marr-Hildreth algorithm is one such method of detecting edges in digital images.
c) Re-sizing: The segmented characters differ in size, so all characters must be brought to the same size. In the proposed method, each character image is re-sized to 100 x 100 pixels.
d) Thickening: Each dark pixel of the re-sized character is thickened by darkening its nearest pixels. Using the nearest algorithm, a thin character is turned into a thicker one by darkening the nearest pixels within a target range.
e) Binarization: Each character is stored in a Boolean matrix of 0s and 1s. Using the image zoning technique [5], dark pixels are stored as 1s and light pixels as 0s.

iii. Feature extraction: The feature extraction module extracts the basic components of Tamil characters. It has three sub-modules: a) conversion into Boolean matrix, b) grouping and c) pattern matching, the details of which are given below:
a) Conversion into Boolean matrix: Each character from the palm leaf is converted into a Boolean matrix.
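The re-sizing, thickening and binarization sub-processes that turn a segmented character into a Boolean matrix can be sketched as below. This is a rough illustration under stated assumptions, not the paper's implementation: the input is assumed to be a grayscale image given as a list of rows (0 = black, 255 = white), re-sizing uses nearest-neighbour sampling, thickening takes the darkest value within `reach` pixels, and the threshold of 128 is arbitrary. All function and parameter names are hypothetical.

```python
from typing import List

Gray = List[List[int]]   # assumed grayscale encoding: 0 = black ... 255 = white
Bool = List[List[int]]   # 1 = dark pixel, 0 = light pixel


def resize(img: Gray, size: int = 100) -> Gray:
    """Nearest-neighbour re-sizing to size x size pixels (sampling rule is an assumption)."""
    h, w = len(img), len(img[0])
    return [[img[y * h // size][x * w // size] for x in range(size)]
            for y in range(size)]


def thicken(img: Gray, reach: int = 1) -> Gray:
    """Darken every pixel to the darkest value found within `reach` pixels of it."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            neighbours = [img[j][i]
                          for j in range(max(0, y - reach), min(h, y + reach + 1))
                          for i in range(max(0, x - reach), min(w, x + reach + 1))]
            row.append(min(neighbours))
        out.append(row)
    return out


def binarize(img: Gray, threshold: int = 128) -> Bool:
    """Store dark pixels as 1 and light pixels as 0 (threshold value is an assumption)."""
    return [[1 if p < threshold else 0 for p in row] for row in img]


def to_boolean_matrix(char_img: Gray) -> Bool:
    """Apply the three steps in the order the text lists them."""
    return binarize(thicken(resize(char_img)))


# A 4 x 4 toy "character": one dark vertical stroke in the second column.
stroke = [[255, 0, 255, 255] for _ in range(4)]
matrix = to_boolean_matrix(stroke)
print(len(matrix), len(matrix[0]))   # 100 100
```

In the toy example the stroke covers 25 columns after re-sizing and 27 after thickening with `reach=1`, since one neighbouring column on each side is darkened.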
Similarly, all the actual Tamil character sets (Fig. 4) are also converted into Boolean matrices, because palm leaf manuscripts were written by hand and may be unreadable to different people, so it is necessary to make them readable.

Fig. 4: The Tamil character sets

b) Grouping: Palm manuscripts were used in different centuries, and the words that appear in them differ in style and stroke (Fig. 5). Each individual script is stored in a Boolean matrix.

Fig. 5: Tamil scripts in different centuries

c) Pattern Matching: The stored Boolean matrix of the original text is compared, within a range, with the newly preprocessed Tamil character.

iv. Recognition: Pattern matching identifies the similar character among the predefined Boolean matrices. The matching Boolean matrix is then converted into the equivalent Unicode Tamil character.

4. Modeling: A character model is a record of all the character sets that have equal Boolean matrices. A combination of two Boolean matrices can also be equal to a single character. For example, the equivalent Boolean matrices are given below:

(figure: one Boolean matrix + a second Boolean matrix = the Boolean matrix of a single character)
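The pattern matching, recognition and modeling steps above can be illustrated with a small sketch. It rests on assumptions that the text does not spell out: the "range" is read here as a tolerance on the number of differing cells, and the combination of two Boolean matrices is read as a cell-by-cell OR. The 3 x 3 templates are toy data that do not depict real glyph shapes, and the function names are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Bool = List[List[int]]


def mismatches(a: Bool, b: Bool) -> int:
    """Number of cells in which two equally sized Boolean matrices differ."""
    return sum(x != y for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def combine(a: Bool, b: Bool) -> Bool:
    """Overlay two Boolean matrices cell by cell (logical OR, an assumption)."""
    return [[x | y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def recognize(n: Bool, templates: Dict[str, Bool], tolerance: int = 1) -> Optional[str]:
    """Return the Unicode text of the closest template, or None if none is within tolerance."""
    best: Optional[Tuple[int, str]] = None
    for text, m in templates.items():
        d = mismatches(n, m)
        if d <= tolerance and (best is None or d < best[0]):
            best = (d, text)
    return best[1] if best else None


# Toy templates keyed by the Unicode text they stand for (shapes are placeholders).
KA = [[1, 1, 0], [1, 0, 0], [1, 1, 0]]
AA_SIGN = [[0, 0, 1], [0, 0, 1], [0, 0, 1]]
templates = {"க": KA, "ா": AA_SIGN, "கா": combine(KA, AA_SIGN)}

# A handwritten sample that differs from the combined template in one cell.
sample = [[1, 1, 1], [1, 0, 1], [1, 1, 0]]
print(recognize(sample, templates))  # கா
```

Keying the templates by Unicode strings means the recognition result is already the text to emit, which mirrors the conversion of the matching Boolean matrix into Unicode Tamil described above.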

A simple algorithm that can be used for character matching is as follows. M is the Boolean matrix of each character, taken one by one, in all character sets of the Tamil scripts (sets 1 to 67). N is the Boolean matrix of the current handwritten character in the palm manuscript.
1. Predefine M for all Boolean matrices.
2. Check the current character's Boolean matrix N against M.
3. If, within the range, both Boolean matrices are set to 1, the character is identified (set no. 1 to 55).
4. If not, check part of the Boolean matrix against the entire predefined matrix (set no. 1 to 30). If the Boolean matrix matches, then check all the other parts of the Boolean matrix (sets 56, 57, 58, 63, 64, 65).

5. Conclusion: In this paper, we have proposed a simple method for converting ancient Tamil handwritten scripts into text format. Thousands of Tamil palm manuscripts are yet to be digitized. The aim of this paper is to convert the palm manuscript image into digitized text format. However, our method has difficulty handling cases such as cursive Tamil script, the merging of two Boolean matrices, and holes in the palm manuscript image. These are basic issues that can be overcome through future extensions of character recognition.

References:
[1] Hyung Il Koo and Nam Ik Cho, "Text-Line Extraction in Handwritten Chinese Documents Based on an Energy Minimization Framework," IEEE Trans. on Image Processing, vol. 21, no. 3, Mar.
[2] G. Nagy, S. Seth, and M. Viswanathan, "A Prototype Document Image Analysis System for Technical Journals," Computer, vol. 25, no. 7, Jul.
[3] F. Shafait, D. Keysers, and T. M. Breuel, "Performance Evaluation and Benchmarking of Six-Page Segmentation Algorithms," IEEE Trans. Pattern Anal. Mach. Intell., vol. 30, no. 6, Jun.
[4] Y. Liang, M. C. Fairhurst, and R. M. Guest, "A Synthesised Word Approach to Word Retrieval in Handwritten Documents," Pattern Recognition, Elsevier, vol. 45, June.
[5] Giuseppe Pirlo and Donato Impedovo, "Adaptive Membership Functions for Handwritten Character Recognition by Voronoi-Based Image Zoning," IEEE Trans. on Image Processing, vol. 21, no. 9, Sep.
[6] Chomtip Pornpanomchai, Verachag Wongsawangtham, Satheanpong Jeungudomporn, and Nannaphat Chatsumpun, "Thai Handwritten Character Recognition by Genetic Algorithm (THCRGA)," IACSIT Journal of Engineering and Technology, vol. 3, no. 2, Apr.
[7] Qiu-Feng Wang, Fei Yin, and Cheng-Lin Liu, "Handwritten Chinese Text Recognition by Integrating Multiple Contexts," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 34, no. 8, Aug.
[8] Tiji M Jose and Amitabh Wahi, "Recognition of Tamil Handwritten Characters using Daubechies Wavelet Transforms and Feed-forward Back Propagation Network," IJCA, vol. 64, no. 8, Feb.
[9] Jin Chen and Daniel Lopresti, "Model Based Ruling Line Detection in Noisy Handwritten Documents," Pattern Recognition Letters, Elsevier.
[10] A. Bharath and Sriganesh Madhvanath, "HMM-Based Lexicon-Driven and Lexicon-Free Word Recognition for Online Handwritten Indic Scripts," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 34, no. 4, Apr.


More information

Indian Multi-Script Full Pin-code String Recognition for Postal Automation

Indian Multi-Script Full Pin-code String Recognition for Postal Automation 2009 10th International Conference on Document Analysis and Recognition Indian Multi-Script Full Pin-code String Recognition for Postal Automation U. Pal 1, R. K. Roy 1, K. Roy 2 and F. Kimura 3 1 Computer

More information

Multi-font Numerals Recognition for Urdu Script based Languages

Multi-font Numerals Recognition for Urdu Script based Languages Multi-font Numerals Recognition for Urdu Script based Languages Muhammad Imran Razzak, S.A. Hussain, Abdel Belaïd, Muhammad Sher To cite this version: Muhammad Imran Razzak, S.A. Hussain, Abdel Belaïd,

More information

Segmentation Framework for Multi-Oriented Text Detection and Recognition

Segmentation Framework for Multi-Oriented Text Detection and Recognition Segmentation Framework for Multi-Oriented Text Detection and Recognition Shashi Kant, Sini Shibu Department of Computer Science and Engineering, NRI-IIST, Bhopal Abstract - Here in this paper a new and

More information

Odia Offline Character Recognition using DWT Features

Odia Offline Character Recognition using DWT Features IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735. PP 31-37 www.iosrjournals.org Odia Offline Character Recognition using DWT Features Bhabani Dash

More information

Dynamic Stroke Information Analysis for Video-Based Handwritten Chinese Character Recognition

Dynamic Stroke Information Analysis for Video-Based Handwritten Chinese Character Recognition Dynamic Stroke Information Analysis for Video-Based Handwritten Chinese Character Recognition Feng Lin and Xiaoou Tang Department of Information Engineering The Chinese University of Hong Kong Shatin,

More information

OFFLINE SIGNATURE VERIFICATION USING SUPPORT LOCAL BINARY PATTERN

OFFLINE SIGNATURE VERIFICATION USING SUPPORT LOCAL BINARY PATTERN OFFLINE SIGNATURE VERIFICATION USING SUPPORT LOCAL BINARY PATTERN P.Vickram, Dr. A. Sri Krishna and D.Swapna Department of Computer Science & Engineering, R.V. R & J.C College of Engineering, Guntur ABSTRACT

More information

Text Extraction from Natural Scene Images and Conversion to Audio in Smart Phone Applications

Text Extraction from Natural Scene Images and Conversion to Audio in Smart Phone Applications Text Extraction from Natural Scene Images and Conversion to Audio in Smart Phone Applications M. Prabaharan 1, K. Radha 2 M.E Student, Department of Computer Science and Engineering, Muthayammal Engineering

More information

Image Normalization and Preprocessing for Gujarati Character Recognition

Image Normalization and Preprocessing for Gujarati Character Recognition 334 Image Normalization and Preprocessing for Gujarati Character Recognition Jayashree Rajesh Prasad Department of Computer Engineering, Sinhgad College of Engineering, University of Pune, Pune, Mahaashtra

More information

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE OPTICAL HANDWRITTEN DEVNAGARI CHARACTER RECOGNITION USING ARTIFICIAL NEURAL NETWORK APPROACH JYOTI A.PATIL Ashokrao Mane Group of Institution, Vathar Tarf Vadgaon, India. DR. SANJAY R. PATIL Ashokrao Mane

More information

A Technique for Classification of Printed & Handwritten text

A Technique for Classification of Printed & Handwritten text 123 A Technique for Classification of Printed & Handwritten text M.Tech Research Scholar, Computer Engineering Department, Yadavindra College of Engineering, Punjabi University, Guru Kashi Campus, Talwandi

More information

An Integrated Skew Detection And Correction Using Fast Fourier Transform And DCT

An Integrated Skew Detection And Correction Using Fast Fourier Transform And DCT An Integrated Skew Detection And Correction Using Fast Fourier Transform And DCT Mandip Kaur, Simpel Jindal Abstract: Skew detection and correction is very important task before pre-processing of an image

More information

Line and Word Segmentation Approach for Printed Documents

Line and Word Segmentation Approach for Printed Documents Line and Word Segmentation Approach for Printed Documents Nallapareddy Priyanka Computer Vision and Pattern Recognition Unit Indian Statistical Institute, 203 B.T. Road, Kolkata-700108, India Srikanta

More information

II. WORKING OF PROJECT

II. WORKING OF PROJECT Handwritten character Recognition and detection using histogram technique Tanmay Bahadure, Pranay Wekhande, Manish Gaur, Shubham Raikwar, Yogendra Gupta ABSTRACT : Cursive handwriting recognition is a

More information

Handwritten Character Recognition System using Chain code and Correlation Coefficient

Handwritten Character Recognition System using Chain code and Correlation Coefficient Handwritten Character Recognition System using Chain code and Correlation Coefficient Ravi Sheth1, N C Chauhan2, Mahesh M Goyani3, Kinjal A Mehta4 1Information Technology Dept., A.D Patel Institute of

More information

Image Text Extraction and Recognition using Hybrid Approach of Region Based and Connected Component Methods

Image Text Extraction and Recognition using Hybrid Approach of Region Based and Connected Component Methods Image Text Extraction and Recognition using Hybrid Approach of Region Based and Connected Component Methods Ms. N. Geetha 1 Assistant Professor Department of Computer Applications Vellalar College for

More information

Locating Tables in Scanned Documents for Reconstructing and Republishing

Locating Tables in Scanned Documents for Reconstructing and Republishing Locating Tables in Scanned Documents for Reconstructing and Republishing MAC Akmal Jahan Post Graduate Institute of Science University of Peradeniya Peradeniya, Sri Lanka Abstract Pool of knowledge available

More information

Unique Journal of Engineering and Advanced Sciences Available online: Research Article

Unique Journal of Engineering and Advanced Sciences Available online:  Research Article ISSN 2348-375X Unique Journal of Engineering and Advanced Sciences Available online: www.ujconline.net Research Article DETECTION AND RECOGNITION OF THE TEXT THROUGH CONNECTED COMPONENT CLUSTERING AND

More information

Seminar. Topic: Object and character Recognition

Seminar. Topic: Object and character Recognition Seminar Topic: Object and character Recognition Tse Ngang Akumawah Lehrstuhl für Praktische Informatik 3 Table of content What's OCR? Areas covered in OCR Procedure Where does clustering come in Neural

More information

Extracting Characters From Books Based On The OCR Technology

Extracting Characters From Books Based On The OCR Technology 2016 International Conference on Engineering and Advanced Technology (ICEAT-16) Extracting Characters From Books Based On The OCR Technology Mingkai Zhang1, a, Xiaoyi Bao1, b,xin Wang1, c, Jifeng Ding1,

More information

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Ajay K. Talele Department of Electronics Dr..B.A.T.U. Lonere. Sanjay L Nalbalwar

More information

N.Priya. Keywords Compass mask, Threshold, Morphological Operators, Statistical Measures, Text extraction

N.Priya. Keywords Compass mask, Threshold, Morphological Operators, Statistical Measures, Text extraction Volume, Issue 8, August ISSN: 77 8X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Combined Edge-Based Text

More information

DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS

DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS Sushilkumar N. Holambe Dr. Ulhas B. Shinde Shrikant D. Mali Persuing PhD at Principal

More information

LITERATURE REVIEW. For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script.

LITERATURE REVIEW. For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script. LITERATURE REVIEW For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script. The study of recognition for handwritten Devanagari compound character

More information

Scene Text Detection Using Machine Learning Classifiers

Scene Text Detection Using Machine Learning Classifiers 601 Scene Text Detection Using Machine Learning Classifiers Nafla C.N. 1, Sneha K. 2, Divya K.P. 3 1 (Department of CSE, RCET, Akkikkvu, Thrissur) 2 (Department of CSE, RCET, Akkikkvu, Thrissur) 3 (Department

More information

Test Segmentation of MRC Document Compression and Decompression by Using MATLAB

Test Segmentation of MRC Document Compression and Decompression by Using MATLAB Test Segmentation of MRC Document Compression and Decompression by Using MATLAB N.Rajeswari 1, S.Rathnapriya 2, S.Nijandan 3 Assistant Professor/EEE, GRT Institute of Engineering & Technology, Tamilnadu,

More information

A Simple Text-line segmentation Method for Handwritten Documents

A Simple Text-line segmentation Method for Handwritten Documents A Simple Text-line segmentation Method for Handwritten Documents M.Ravi Kumar Assistant professor Shankaraghatta-577451 R. Pradeep Shankaraghatta-577451 Prasad Babu Shankaraghatta-5774514th B.S.Puneeth

More information

Recognition of Unconstrained Malayalam Handwritten Numeral

Recognition of Unconstrained Malayalam Handwritten Numeral Recognition of Unconstrained Malayalam Handwritten Numeral U. Pal, S. Kundu, Y. Ali, H. Islam and N. Tripathy C VPR Unit, Indian Statistical Institute, Kolkata-108, India Email: umapada@isical.ac.in Abstract

More information

Implementation and Advanced Results on the Non-Interrupted Skeletonization Algorithm

Implementation and Advanced Results on the Non-Interrupted Skeletonization Algorithm Implementation and Advanced Results on the n-interrupted Skeletonization Algorithm Khalid Saeed, Mariusz Rybnik, Marek Tabedzki Computer Engineering Department Faculty of Computer Science Bialystok University

More information

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language I.J. Information Technology and Computer Science, 2013, 05, 38-43 Published Online April 2013 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijitcs.2013.05.05 A New Technique for Segmentation of Handwritten

More information

Neural Network Classifier for Isolated Character Recognition

Neural Network Classifier for Isolated Character Recognition Neural Network Classifier for Isolated Character Recognition 1 Ruby Mehta, 2 Ravneet Kaur 1 M.Tech (CSE), Guru Nanak Dev University, Amritsar (Punjab), India 2 M.Tech Scholar, Computer Science & Engineering

More information

An Efficient Method to Extract Digital Text From Scanned Image Text

An Efficient Method to Extract Digital Text From Scanned Image Text An Efficient Method to Extract Digital Text From Scanned Image Text Jenick Johnson ECE Dept., Christ the King Engineering College Coimbatore-641104, Tamil Nadu, India Suresh Babu. V ECE Dept., Christ the

More information

Document Image Segmentation using Discriminative Learning over Connected Components

Document Image Segmentation using Discriminative Learning over Connected Components Document Image Segmentation using Discriminative Learning over Connected Components Syed Saqib Bukhari Technical University of bukhari@informatik.unikl.de Mayce Ibrahim Ali Al Azawi Technical University

More information

MURDOCH RESEARCH REPOSITORY.

MURDOCH RESEARCH REPOSITORY. MURDOCH RESEARCH REPOSITORY http://researchrepository.murdoch.edu.au This is the author's final version of the work, as accepted for publication following peer review but without the publisher's layout

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK HANDWRITTEN DEVANAGARI CHARACTERS RECOGNITION THROUGH SEGMENTATION AND ARTIFICIAL

More information

A System to Automatically Index Genealogical Microfilm Titleboards Introduction Preprocessing Method Identification

A System to Automatically Index Genealogical Microfilm Titleboards Introduction Preprocessing Method Identification A System to Automatically Index Genealogical Microfilm Titleboards Samuel James Pinson, Mark Pinson and William Barrett Department of Computer Science Brigham Young University Introduction Millions of

More information

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient International Journal of Computational Intelligence Research ISSN 0973-1873 Volume 13, Number 6 (2017), pp. 1387-1396 Research India Publications http://www.ripublication.com Isolated Curved Gurmukhi Character

More information

Localization, Extraction and Recognition of Text in Telugu Document Images

Localization, Extraction and Recognition of Text in Telugu Document Images Localization, Extraction and Recognition of Text in Telugu Document Images Atul Negi Department of CIS University of Hyderabad Hyderabad 500046, India atulcs@uohyd.ernet.in K. Nikhil Shanker Department

More information

Handwritten character and word recognition using their geometrical features through neural networks

Handwritten character and word recognition using their geometrical features through neural networks Handwritten character and word recognition using their geometrical features through neural networks Sudarshan Sawant 1, Prof. Seema Baji 2 1 Student, Department of electronics and Tele-communications,

More information