A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts

Size: px
Start display at page:

Download "A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts"

Transcription

1 25 A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts Rohit Sachdeva, Asstt. Prof., Computer Science Department, Multani Mal Modi College, Patiala Dharam Veer Sharma, Asstt. Prof., Department of Computer Science, Punjabi University, Patiala ABSTRACT According to the 8th schedule of Indian constitution, there are 22 official languages and 122 regional languages prevalent in India. In the last few decades, the recognition of these scripts has been prominent area of research. Among these scripts most of the recognition research work has been done for Bangla, Devanagari, Gujrati, Gurumukhi and Telugu scripts etc. Commercial OCRs were available for various scripts like Latin, Japanese, Chinese, Roman, Arabic scripts. OCR systems for few Indian scripts are available and others are in the stage of development for preserving manuscripts and ancient literatures written in different Indian scripts and making digital libraries for the documents. Further, overall accuracy of the recognition, feature extraction and classification are crucial phases. This paper attempts to give a brief summary of various feature extraction and classification methods used for recognition process of Brahmi Northern Indian scripts by the researchers in last few decades. Keywords Optical Character Recognition (OCR), Brahmi Northern Indian Script, Character Recognition, Feature Extraction, Classification. INTRODUCTION The first idea of the concept of the OCR was given by Tausheck [1] and Handel [2]. Early version of OCR research was to recognize characters from images of machine printed text with single size and single font. Optical Character Recognition (OCR) is an approach that converts printed, typewritten or handwritten into editable form, which can be further used as per necessity. OCRs are divided into two sub categories: type written or printed text and Hand written text. Handwritten Text OCR can further divided into two sub categories: Online Recognition and Offline Recognition. Later is a process which means that after the completion of writing or printing, the recognition starts. During the last three decades, character recognition research has been prominent area of research field. India is a multi-lingual or multi script country. According to the 8 th schedule of Indian constitution there are 22 official languages and 122 regional languages. Most of the Indian scripts have their origin from Brahmi script, through with certain alterations. Brahmi is the ancestor of hundreds of languages predominantly used in the Indian sub-continent as well as in South-East and East Asia. In India, Brahmi script divided zonally into two sub scripts: Northern and Southern Script. Bengala, Devanagari, Gujrati, Gurmukhi and Oriya are Northern Scripts and Kannada, Malayalam, Tamil and Telugu are Southern Scripts. Development of OCR system for these Indic Scripts has several application areas such as preserving manuscripts and ancient literatures written in different Indian scripts and making digital libraries. OCR system comprises the following steps: 1. Image Digitation 2. Preprocessing 3. Feature Extraction 4. Classification 5. Post-Processing 1. Image Digitation: Image of the source document is firstly scanned and stored in some image file in the form of bitmaps. This is also called digitization. With the aid of scanning method, digital image of the source document is captured. 2. Pre-processing: In pre-processing, some pre-processing of the image, having text or data which has to be recognized, is required for improving the recognition accuracy. The preprocessing activities may include the following: noise removal, document level skew detection and correction, binarization of the digitized image, size normalization and segmentation at all the levels i.e. line, word and character.

2 26 Figure 1- Process of OCR 3. Feature Extraction: The main objective of feature extraction is to capture the vital characteristics of the symbols. It is the crucial and most vital stage of recognition process. Under this step, the features are extracted from segmented symbols. So, to attain the high recognition percentage, selection of feature extraction method becomes vital factor. The extracted features may be structural, statistical or moments based. Some feature extraction methods are Contour Profile, Deformable Templates, Moments calculation (Ex- Geometrical, Hu-moments, Zernike), Projection Histogram, Template matching, Zoning etc. 4. Classification: In the classification step, the features extracted from previous steps i.e. feature extraction step are used to recognize the text segment according to the preset rules. Classification is usually done by comparing the feature vectors corresponding to the input character with the representative(s) of each character class, using a distance metric. It is the procedure of assigning the detected data to their corresponding class with respect to groups with homogeneous characteristics, with the aim of cultivated numerous objects from each other within the image. It is carried out on the basis of stored features in the feature database, such as global and structural features etc. On the basis of decision rule, classification divides the feature space into several classes. Various classification procedures used in earlier developed Optical character recognition systems are Bayesian Classification, Decision Tree Classification, K-Nearest Neighbors, Neural Network, and Support Vector Machine. 5. Post-Processing: Post-processing step involves grouping of symbols. The process of performing the association of symbols into strings is referred to as grouping. FEATURE EXTRACTION AND CLASSIFICATION METHODS USED IN CHARACTER RECOGNITION FOR BRAHMI NORTHERN INDIAN SCRIPTS Bangla Chaudhuri et al. [3] represented a bilingual OCR system which is used for recognize Bangla and Devanagari script. A headline deletion process was used for character segmentation. For easier recognition, a text line was divided into three sub zones. For the recognition of basic and modified characters, structural feature and binary tree classifier was used and for the recognition of compound character, a hybrid method combining structural and run based template features was used. For Bangla script Jalal et al. made very prominent effort. They presented the system which used Bounded rectangle calculation, Chain code generation; Slope distribution generation features extraction methods along with neural network classifier. Authors claimed that their system attained 96% of accuracy.

3 27 U. Bhattacharya et al. [5] have given a system for recognition of Handwritten Bangla Characters. They used local chain-code histograms for obtaining features of input character and MLP classifier. They claimed that their system achieved 92.14% accuracy on testing sets and 94.65% on training sets. Devanagari M. K. Sinha et al.[6] proposed a template based OCR system for handwritten Devananagri documents. In term of primitives and relationships, the system stores structural descriptors for each symbol of the script. They used structural feature method along with decision tree classifier. They claimed their system achieved 90% accuracy. VeenaBansal et al. [7] presented hybrid classifier-based complete OCR for printed Hindi text written in Devanagari script. This system also supports the touching characters and compound characters in noisy environment. For the character segmentation, a projection profile technique was used by them. The system used following multiple features extraction method such as coverage of the region of the core strip, Horizontal zero crossings, Moments, Number of positions of the vertex points, Structural descriptors of the characters for classification, Vertical bar feature along with hybrid classifiers. At the character level overall accuracy attained by system was 93%. Reena Bajaj et al. [8] have proposed system, to recognize handwritten numeral of Devanagri script. They suggested a method for recognition of handwritten Devnagari numerals using density, moment of curve and descriptive component feature with MLP classifiers. Gujrati Antani et al.[9] have proposed the classification of a subset of printed or digitized Gujrati characters. Euclidean Minimum Distance, Hamming Distance classifier and K- Nearest Neighbour classifier have been used for classification with template matching. But a very low recognition rate of 67 percent is reported. Yajnik et al.[10] have developed a system for classification of Sets of printed Gujarati characters and modifiers using ANN architectures by considering linear activation functions in the output layer. Printed Guajarati text features were extracted in terms of wavelet coefficients. They have used two Multi-Layer Perceptron (MLP) networks which are used for classification of alphabets in middle zone and lower zone separately. These networks achieve percent accuracy for middle zone and percent accuracy for lower zone alphabets and modifiers. Prachi et al. [11] proposed a Gujrati OCR system for the recognition of basic characters in printed Gujarati text. Principal Component Analysis (PCA) was used to extract the features of printed Guajarati characters. For the classification of characters based on features Hopfield Neural classifier had been used by them. The system attained the 93.25% accuracy. Gurumukhi G. S. Lehal et al.[12] proposed a OCR system for Gurumukhi script. They used Local features such as branches, concave/convex parts, joints, number of endpoints etc. and Global features such as connectivity, number of holes and projection profiles etc. along with hybrid classification technique such as binary decision tree and Nearest Neighbour classifiers. They achieved a recognition rate of 91.6%. Dharam Veer Sharma et al.[13] used zoning feature with hybrid classification technique using K-Nearest Nehighbor and Support Vector Machine classifier, But a very low recognition rate of 67 percent is reported. Geeta et al. [14] proposed an OCR system for Gurumukhi numerals. They used Zone Distance features and Support Vector Machine classifier. They stated that their system attained 99.73% accuracy. Oriya B Chaudhuri et al. [15] presented a model for Oriya script OCR. They used Directional as well as Global Features and classified them using Decision tree classifier. They attained 96.03% accuracy at character level. For off-line unconstrained Oriya handwritten numerals a system was proposed by Roy et al.[16]. They used histograms of direction chain code of the contour points of the numerals as features and a neural network based classifier. They attained % accuracy. CONCLUSION WITH COMPARISON TABLE A brief study of feature extraction and classification methods used for character recognition of Brahmi Northern Indian Scripts shown in a tabular form is given below.

4 28 Sr No 1 2 Languages Feature Extraction Methods Classification Methods Bangla Devanagari 3 Gujrati 4 Gurmukhi 5 Oriya Structural and template feature[3] Bounded rectangle calculation, Chain code generation; Slope distribution generation[4] For basic and modified characters - Decision tree Classifier For Compound characters Hybrid Approach Neural Network 96 Chain Code Histogram[5] Multi-Layer Perceptron (MLP) Structural feature[6] Decision tree Classifier 90 Statistical [7] Hybrid Classifier 93 Density, moment of curve and descriptive component feature[8] Template Matching[9] wavelet coefficients[10] Multi-Layer Perceptron (MLP) Euclidean Minimum Distance, Hamming Distance classifier and K-Nearest Neighbour Two level Multi-Layer Perceptron (MLP) Principal Component Analysis (PCA)[11] Hopfield Neural classifier Local Features and global Binary Decision Tree and K- Features[12] Nearest Neighbour classifiers 95 Zoning[13] (Handwritten) K-Nearest Nehighbor and Support Vector Machine 72.7 Zone Distance [14] (Numerals) Support Vector Machine Directional and Global [15] Decision Tree Histograms of Direction Chain code of the contour points[16] Decision Tree Recognition Rate(in %) This study will definitely help developers and research scholars in the concerned area. OCR for these scripts that works under all possible circumstances and gives highly precise results, is the area which needs to be explored more to get more precise results. By using the hybrid methods, higher recognition rate could be attained REFERENCES [1] G. Tauschek, Reading machine, U.S. Patent , Dec [2] P. W. Handel, Statistical machine, U.S. Patent , June 1933 [3] B. B. Chaudhuri and U. Pal, An OCR System to Read Two Indian Language Scripts: Bangla and Devnagari (Hindi), IEEE vol.2, Aug [4] Jalal UddinMahtnud, Mohammed FerozRaihan and ChowdhuryMofizurRahman, A Complete OCR System for Continuous Bengali Characters, IEEE Vol. Oct [5] U. Bhattacharya1, M. Shridhar, and S.K. Parui1, On Recognition of Handwritten Bangla Characters, in proceeding of the Indian Conference on Computer Vision, pp , [6] M. K. Sinha, Mahabala., Machine Recognition of Devnagari Script, IEEE T. SYST. MAN Cyb., vol. 9, pp ,1979. [7] VeenaBansla and R M K Sinha, A Complete OCR for printed Hindi Text in Devanagari Script, IEEE pp , [8] Reena Bajaj, LipikaDey and SantanuChaudhury, Devnagari numeral recognition by combining decision of multiple connectionist classifiers, Vol. 27, Part 1, pp , 2002 [9] S. Antani, L. Agnihotri, Gujarati Character Recognition, Proc. of the 5th ICDAR, pp , [10] Yajnik, S. R. Mohan, Identification of Gujarati Characters Using Wavelets and Neural Networks, in the proceeding of the International Conference on Artificial Intelligence and Soft Computing, pp , 2006 [11] PrachiSolanki, Malay Bhatt, Printed Gujarati Script OCR using Hopfield Neural Network, International Journal of Computer Applications, Volume 69 No.13,pp 33-37, 2013.

5 29 [12] G. S. Lehal and Chandan Singh, Feature Extraction and Classification for OCR of Gurmukhi Script. Vivek, Vol. 12(2), pp. 2-12, 1999 [13] Dharam Veer Sharma, PuneetJhajj, Recognition of Isolated Handwritten Characters in Gurmukhi Script, International Journal of Computer Applications ( ), Volume 4 No.8,2010 [14] Gita Sinha, Rajneesh Rani, RenuDhir, Handwritten Gurmukhi Numeral Recognition using Zone-based Hybrid Feature Extraction Techniques, International Journal of Computer Applications( ), Volume 47- No. 21 June [15] S. Mohanti, Pattern Recognition in Alphabets of Oriya Language Using Kohonen Neural Network, International Journal Pattern Recognition Artificial Intelligence, Vol. 12, pp , [16] B. B. Chaudhuri, U. Pal, M Mitra, Automatic recognition of printed Oriya script, IEEE , 2001.

Optical Character Recognition

Optical Character Recognition Chapter 2 Optical Character Recognition 2.1 Introduction Optical Character Recognition (OCR) is one of the challenging areas of pattern recognition. It gained popularity among the research community due

More information

Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier

Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier N. Sharma, U. Pal*, F. Kimura**, and S. Pal Computer Vision and Pattern Recognition Unit, Indian Statistical Institute

More information

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network 139 Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network Harmit Kaur 1, Simpel Rani 2 1 M. Tech. Research Scholar (Department of Computer Science & Engineering), Yadavindra College

More information

A Technique for Offline Handwritten Character Recognition

A Technique for Offline Handwritten Character Recognition A Technique for Offline Handwritten Character Recognition 1 Shilpy Bansal, 2 Mamta Garg, 3 Munish Kumar 1 Lecturer, Department of Computer Science Engineering, BMSCET, Muktsar, Punjab 2 Assistant Professor,

More information

PCA-based Offline Handwritten Character Recognition System

PCA-based Offline Handwritten Character Recognition System Smart Computing Review, vol. 3, no. 5, October 2013 346 Smart Computing Review PCA-based Offline Handwritten Character Recognition System Munish Kumar 1, M. K. Jindal 2, and R. K. Sharma 3 1 Computer Science

More information

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION Binod Kumar Prasad * * Bengal College of Engineering and Technology, Durgapur, W.B., India. Rajdeep Kundu 2 2 Bengal College

More information

Segmentation Based Optical Character Recognition for Handwritten Marathi characters

Segmentation Based Optical Character Recognition for Handwritten Marathi characters Segmentation Based Optical Character Recognition for Handwritten Marathi characters Madhav Vaidya 1, Yashwant Joshi 2,Milind Bhalerao 3 Department of Information Technology 1 Department of Electronics

More information

LITERATURE REVIEW. For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script.

LITERATURE REVIEW. For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script. LITERATURE REVIEW For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script. The study of recognition for handwritten Devanagari compound character

More information

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Galaxy Bansal Dharamveer Sharma ABSTRACT Segmentation of handwritten words is a challenging task primarily because of structural features

More information

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (IJECIERD) ISSN 2249-684X Vol.2, Issue 3 Sep 2012 27-37 TJPRC Pvt. Ltd., HANDWRITTEN GURMUKHI

More information

DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS

DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS Sushilkumar N. Holambe Dr. Ulhas B. Shinde Shrikant D. Mali Persuing PhD at Principal

More information

Segmentation of Characters of Devanagari Script Documents

Segmentation of Characters of Devanagari Script Documents WWJMRD 2017; 3(11): 253-257 www.wwjmrd.com International Journal Peer Reviewed Journal Refereed Journal Indexed Journal UGC Approved Journal Impact Factor MJIF: 4.25 e-issn: 2454-6615 Manpreet Kaur Research

More information

Handwritten Character Recognition: A Comprehensive Review on Geometrical Analysis

Handwritten Character Recognition: A Comprehensive Review on Geometrical Analysis IOSR Journal of Computer Engineering (IOSRJCE) eissn: 22780661,pISSN: 22788727, Volume 17, Issue 2, Ver. IV (Mar Apr. 2015), PP 8388 www.iosrjournals.org Handwritten Character Recognition: A Comprehensive

More information

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient International Journal of Computational Intelligence Research ISSN 0973-1873 Volume 13, Number 6 (2017), pp. 1387-1396 Research India Publications http://www.ripublication.com Isolated Curved Gurmukhi Character

More information

Degraded Text Recognition of Gurmukhi Script. Doctor of Philosophy. Manish Kumar

Degraded Text Recognition of Gurmukhi Script. Doctor of Philosophy. Manish Kumar Degraded Text Recognition of Gurmukhi Script A Thesis Submitted in fulfilment of the requirements for the award of the degree of Doctor of Philosophy Submitted by Manish Kumar (Registration No. 9000351)

More information

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 17 (2014), pp. 1839-1845 International Research Publications House http://www. irphouse.com Recognition of

More information

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script Arwinder Kaur 1, Ashok Kumar Bathla 2 1 M. Tech. Student, CE Dept., 2 Assistant Professor, CE Dept.,

More information

Complementary Features Combined in a MLP-based System to Recognize Handwritten Devnagari Character

Complementary Features Combined in a MLP-based System to Recognize Handwritten Devnagari Character Journal of Information Hiding and Multimedia Signal Processing 2011 ISSN 2073-4212 Ubiquitous International Volume 2, Number 1, January 2011 Complementary Features Combined in a MLP-based System to Recognize

More information

Handwritten Gurumukhi Character Recognition Using Zoning Density and Background Directional Distribution Features

Handwritten Gurumukhi Character Recognition Using Zoning Density and Background Directional Distribution Features Handwritten Gurumukhi Character Recognition Using Zoning Density and Background Directional Distribution Features Kartar Singh Siddharth #1, Renu Dhir #2, Rajneesh Rani #3 # Department of Computer Science

More information

OCR For Handwritten Marathi Script

OCR For Handwritten Marathi Script International Journal of Scientific & Engineering Research Volume 3, Issue 8, August-2012 1 OCR For Handwritten Marathi Script Mrs.Vinaya. S. Tapkir 1, Mrs.Sushma.D.Shelke 2 1 Maharashtra Academy Of Engineering,

More information

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network International Journal of Computer Science & Communication Vol. 1, No. 1, January-June 2010, pp. 91-95 Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network Raghuraj

More information

Handwritten Script Recognition at Block Level

Handwritten Script Recognition at Block Level Chapter 4 Handwritten Script Recognition at Block Level -------------------------------------------------------------------------------------------------------------------------- Optical character recognition

More information

Indian Multi-Script Full Pin-code String Recognition for Postal Automation

Indian Multi-Script Full Pin-code String Recognition for Postal Automation 2009 10th International Conference on Document Analysis and Recognition Indian Multi-Script Full Pin-code String Recognition for Postal Automation U. Pal 1, R. K. Roy 1, K. Roy 2 and F. Kimura 3 1 Computer

More information

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier Hetal R. Thaker Atmiya Institute of Technology & science, Kalawad

More information

Chapter Review of HCR

Chapter Review of HCR Chapter 3 [3]Literature Review The survey of literature on character recognition showed that some of the researchers have worked based on application requirements like postal code identification [118],

More information

Segmentation of Isolated and Touching characters in Handwritten Gurumukhi Word using Clustering approach

Segmentation of Isolated and Touching characters in Handwritten Gurumukhi Word using Clustering approach Segmentation of Isolated and Touching characters in Handwritten Gurumukhi Word using Clustering approach Akashdeep Kaur Dr.Shaveta Rani Dr. Paramjeet Singh M.Tech Student (Associate Professor) (Associate

More information

Comparative Performance Analysis of Feature(S)- Classifier Combination for Devanagari Optical Character Recognition System

Comparative Performance Analysis of Feature(S)- Classifier Combination for Devanagari Optical Character Recognition System Comparative Performance Analysis of Feature(S)- Classifier Combination for Devanagari Optical Character Recognition System Jasbir Singh Department of Computer Science Punjabi University Patiala, India

More information

Gabor Features Based Script Identification of Lines within a Bilingual/Trilingual Document

Gabor Features Based Script Identification of Lines within a Bilingual/Trilingual Document , pp.1-12 http://dx.doi.org/10.14257/ijast.2014.66.01 Gabor Features Based Script Identification of Lines within a Bilingual/Trilingual Document Rajneesh Rani 1, Renu Dhir 1 and Gurpreet Singh Lehal 2

More information

Problems in Extraction of Date Field from Gurmukhi Documents

Problems in Extraction of Date Field from Gurmukhi Documents 115 Problems in Extraction of Date Field from Gurmukhi Documents Gursimranjeet Kaur 1, Simpel Rani 2 1 M.Tech. Scholar Yadwindra College of Engineering, Talwandi Sabo, Punjab, India sidhus702@gmail.com

More information

A Review on Different Character Segmentation Techniques for Handwritten Gurmukhi Scripts

A Review on Different Character Segmentation Techniques for Handwritten Gurmukhi Scripts WWJMRD2017; 3(10): 162-166 www.wwjmrd.com International Journal Peer Reviewed Journal Refereed Journal Indexed Journal UGC Approved Journal Impact Factor MJIF: 4.25 e-issn: 2454-6615 Manas Kaur Research

More information

Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE

Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE Rupali Vitthalrao Suryawanshi Department of Electronics Engineering, Bharatratna Indira Gandhi College,

More information

Handwritten Numeral Recognition of Kannada Script

Handwritten Numeral Recognition of Kannada Script Handwritten Numeral Recognition of Kannada Script S.V. Rajashekararadhya Department of Electrical and Electronics Engineering CEG, Anna University, Chennai, India svr_aradhya@yahoo.co.in P. Vanaja Ranjan

More information

Handwritten Hindi Character Recognition System Using Edge detection & Neural Network

Handwritten Hindi Character Recognition System Using Edge detection & Neural Network Handwritten Hindi Character Recognition System Using Edge detection & Neural Network Tanuja K *, Usha Kumari V and Sushma T M Acharya Institute of Technology, Bangalore, India Abstract Handwritten recognition

More information

International Journal of Computer Engineering and Applications,

International Journal of Computer Engineering and Applications, International Journal of Computer Engineering and Applications, Volume XII, Issue I, Jan. 18, www.ijcea.com ISSN 2321-3469 TEXT RECOGNITION USING DIGITAL IMAGE PROCESSING TECHNIQUES S.Banu Chitra 1, Dr

More information

A survey on optical character recognition for Bangla and Devanagari scripts

A survey on optical character recognition for Bangla and Devanagari scripts Sādhanā Vol. 38, Part 1, February 2013, pp. 133 168. c Indian Academy of Sciences A survey on optical character recognition for Bangla and Devanagari scripts 1. Introduction SOUMEN BAG 1 and GAURAV HARIT

More information

Word-wise Hand-written Script Separation for Indian Postal automation

Word-wise Hand-written Script Separation for Indian Postal automation Word-wise Hand-written Script Separation for Indian Postal automation K. Roy U. Pal Dept. of Comp. Sc. & Engg. West Bengal University of Technology, Sector 1, Saltlake City, Kolkata-64, India Abstract

More information

Review of Automatic Handwritten Kannada Character Recognition Technique Using Neural Network

Review of Automatic Handwritten Kannada Character Recognition Technique Using Neural Network Review of Automatic Handwritten Kannada Character Recognition Technique Using Neural Network 1 Mukesh Kumar, 2 Dr.Jeeetendra Sheethlani 1 Department of Computer Science SSSUTMS, Sehore Abstract Data processing

More information

A two-stage approach for segmentation of handwritten Bangla word images

A two-stage approach for segmentation of handwritten Bangla word images A two-stage approach for segmentation of handwritten Bangla word images Ram Sarkar, Nibaran Das, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri #, Dipak Kumar Basu Computer Science & Engineering Department,

More information

Line and Word Segmentation Approach for Printed Documents

Line and Word Segmentation Approach for Printed Documents Line and Word Segmentation Approach for Printed Documents Nallapareddy Priyanka Computer Vision and Pattern Recognition Unit Indian Statistical Institute, 203 B.T. Road, Kolkata-700108, India Srikanta

More information

Morphological Approach for Segmentation of Scanned Handwritten Devnagari Text

Morphological Approach for Segmentation of Scanned Handwritten Devnagari Text Abstract In this paper we present a system towards the of Hindi Handwritten Devnagari Text. Segmentation of script is essential for handwritten script recognition. This system deals with of (matras) and

More information

Feature Extraction and Classification for OCR of Gurmukhi Script

Feature Extraction and Classification for OCR of Gurmukhi Script Feature Extraction and Classification for OCR of Gurmukhi Script G. S. Lehal and Chandan Singh Department Of Computer Science & Engineering Punjabi University, Patiala 4700, INDIA E-mail : gslehal@mailcity.com,

More information

MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION

MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION S. M. Mali Department of Computer Science, MAEER S Arts, Commerce and Science College, Pune Shankarmali007@gmail.com Abstract In this paper,

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK HANDWRITTEN DEVANAGARI CHARACTERS RECOGNITION THROUGH SEGMENTATION AND ARTIFICIAL

More information

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Utkarsh Dwivedi 1, Pranjal Rajput 2, Manish Kumar Sharma 3 1UG Scholar, Dept. of CSE, GCET, Greater Noida,

More information

Handwritten Character Recognition A Review

Handwritten Character Recognition A Review International Journal of Scientific and Research Publications, Volume 5, Issue 3, March 2015 1 Handwritten Character Recognition A Review Surya Nath R S *, Afseena S ** * Computer Science, College of Engineering

More information

SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT

SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT ABSTRACT Rupak Bhattacharyya et al. (Eds) : ACER 2013, pp. 11 24, 2013. CS & IT-CSCP 2013 Fakruddin Ali Ahmed Department of Computer

More information

Recognition of Unconstrained Malayalam Handwritten Numeral

Recognition of Unconstrained Malayalam Handwritten Numeral Recognition of Unconstrained Malayalam Handwritten Numeral U. Pal, S. Kundu, Y. Ali, H. Islam and N. Tripathy C VPR Unit, Indian Statistical Institute, Kolkata-108, India Email: umapada@isical.ac.in Abstract

More information

Handwritten character and word recognition using their geometrical features through neural networks

Handwritten character and word recognition using their geometrical features through neural networks Handwritten character and word recognition using their geometrical features through neural networks Sudarshan Sawant 1, Prof. Seema Baji 2 1 Student, Department of electronics and Tele-communications,

More information

Devanagari Handwriting Recognition and Editing Using Neural Network

Devanagari Handwriting Recognition and Editing Using Neural Network Devanagari Handwriting Recognition and Editing Using Neural Network Sohan Lal Sahu RSR Rungta College of Engineering & Technology (RSR-RCET), Bhilai 490024 Abstract- Character recognition plays an important

More information

HCR Using K-Means Clustering Algorithm

HCR Using K-Means Clustering Algorithm HCR Using K-Means Clustering Algorithm Meha Mathur 1, Anil Saroliya 2 Amity School of Engineering & Technology Amity University Rajasthan, India Abstract: Hindi is a national language of India, there are

More information

FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT

FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT International Journal of Information Technology, Modeling and Computing (IJITMC) Vol. 2, No. 1, February 2014 FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT Shuchi Kapoor 1 and Vivek

More information

Handwritten Marathi Character Recognition on an Android Device

Handwritten Marathi Character Recognition on an Android Device Handwritten Marathi Character Recognition on an Android Device Tanvi Zunjarrao 1, Uday Joshi 2 1MTech Student, Computer Engineering, KJ Somaiya College of Engineering,Vidyavihar,India 2Associate Professor,

More information

Handwritten Devanagari Character Recognition

Handwritten Devanagari Character Recognition Handwritten Devanagari Character Recognition Akhil Deshmukh, Rahul Meshram, Sachin Kendre, Kunal Shah Department of Computer Engineering Sinhgad Institute of Technology (SIT) Lonavala University of Pune,

More information

Multi-Oriented Gujarati Characters Recognition: A Review

Multi-Oriented Gujarati Characters Recognition: A Review Multi-Oriented Gujarati Characters Recognition: A Review Nikisha B. Jariwala Asst. Professor Smt. Tanuben & Dr. Manubhai Trivedi College of Information Science, Surat, Gujarat, India nikisha_jariwala@yahoo.co.in

More information

CHAPTER 2 LITERATURE REVIEW

CHAPTER 2 LITERATURE REVIEW CHAPTER 2 LITERATURE REVIEW 2.1 Introduction There is a great need for OCR related research in Indian languages, even though there are many technical challenges as well as the lack of a commercial market

More information

Chapter 2. Literature Survey and Objectives. 2.1 Literature Survey

Chapter 2. Literature Survey and Objectives. 2.1 Literature Survey Chapter 2 Literature Survey and Objectives 2.1 Literature Survey In India, there are 18 official (Indian constitution accepted) languages. Two or more of these languages may be written in one script. Twelve

More information

LECTURE 6 TEXT PROCESSING

LECTURE 6 TEXT PROCESSING SCIENTIFIC DATA COMPUTING 1 MTAT.08.042 LECTURE 6 TEXT PROCESSING Prepared by: Amnir Hadachi Institute of Computer Science, University of Tartu amnir.hadachi@ut.ee OUTLINE Aims Character Typology OCR systems

More information

A HYBRID FEATURE EXTRACTION AND RECOGNITION TECHNIQUE FOR OFFLINE DEVNAGRI HADWRITING

A HYBRID FEATURE EXTRACTION AND RECOGNITION TECHNIQUE FOR OFFLINE DEVNAGRI HADWRITING A HYBRID FEATURE EXTRACTION AND RECOGNITION TECHNIQUE FOR OFFLINE DEVNAGRI HADWRITING Poonam Sharma Department of Computer Science The NorthCap University Email-Id: poonamsharma@ncuindia.edu Shivani Sihmar

More information

Devanagari Isolated Character Recognition by using Statistical features

Devanagari Isolated Character Recognition by using Statistical features Devanagari Isolated Character Recognition by using Statistical features ( Foreground Pixels Distribution, Zone Density and Background Directional Distribution feature and SVM Classifier) Mahesh Jangid

More information

Research Article Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition

Research Article Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition Applied Computational Intelligence and Soft Computing Volume 2012, Article ID 871834, 5 pages doi:10.1155/2012/871834 Research Article Development of Comprehensive Devnagari Numeral and Character base

More information

Multiple Classifier Combination for Off-line Handwritten Devnagari Character Recognition

Multiple Classifier Combination for Off-line Handwritten Devnagari Character Recognition Multiple Combination for Off-line Handwritten Devnagari Character Recognition Sandhya Arora Department of CSE & T Meghnad Saha nstitute of Technology Kolkata-700107 sandhyabhagat@yahoo.com Debotosh Bhattacharjee,

More information

Marathi Handwritten Numeral Recognition using Fourier Descriptors and Normalized Chain Code

Marathi Handwritten Numeral Recognition using Fourier Descriptors and Normalized Chain Code Marathi Handwritten Numeral Recognition using Fourier Descriptors and Normalized Chain Code G. G. Rajput Department of Computer Science Gulbarga University, Gulbarga 585106 Karnataka, India S. M. Mali

More information

Text-Line Extraction from Handwritten Document images using Histogram and Connected Component Analysis

Text-Line Extraction from Handwritten Document images using Histogram and Connected Component Analysis Text-Line Extraction from Handwritten Document images using Histogram and Connected Component Analysis G. G. Rajput Rani Channamma University Belagavi, Karnataka Suryakant B. Ummapure Dept. of Computer

More information

Handwritten Devanagari Character Recognition Model Using Neural Network

Handwritten Devanagari Character Recognition Model Using Neural Network Handwritten Devanagari Character Recognition Model Using Neural Network Gaurav Jaiswal M.Sc. (Computer Science) Department of Computer Science Banaras Hindu University, Varanasi. India gauravjais88@gmail.com

More information

Off-line Recognition of Hand-written Bengali Numerals using Morphological Features

Off-line Recognition of Hand-written Bengali Numerals using Morphological Features Off-line Recognition of Hand-written Bengali Numerals using Morphological Features Pulak Purkait and Bhabatosh Chanda ECSU, Indian Statistical Institute, Kolkata, India {pulak r, chanda}@isical.ac.in Abstract

More information

Character Recognition Using Matlab s Neural Network Toolbox

Character Recognition Using Matlab s Neural Network Toolbox Character Recognition Using Matlab s Neural Network Toolbox Kauleshwar Prasad, Devvrat C. Nigam, Ashmika Lakhotiya and Dheeren Umre B.I.T Durg, India Kauleshwarprasad2gmail.com, devnigam24@gmail.com,ashmika22@gmail.com,

More information

Online Bangla Handwriting Recognition System

Online Bangla Handwriting Recognition System 1 Online Bangla Handwriting Recognition System K. Roy Dept. of Comp. Sc. West Bengal University of Technology, BF 142, Saltlake, Kolkata-64, India N. Sharma, T. Pal and U. Pal Computer Vision and Pattern

More information

Image Normalization and Preprocessing for Gujarati Character Recognition

Image Normalization and Preprocessing for Gujarati Character Recognition 334 Image Normalization and Preprocessing for Gujarati Character Recognition Jayashree Rajesh Prasad Department of Computer Engineering, Sinhgad College of Engineering, University of Pune, Pune, Mahaashtra

More information

Performance Comparison of Devanagari Handwritten Numerals Recognition

Performance Comparison of Devanagari Handwritten Numerals Recognition Performance Comparison of Devanagari Handwritten Numerals Recognition Mahesh Jangid Kartar Singh Department of CSE Dr. B R Ambedkar NIT Jalandhar (India) Renu Dhir Department of CSE Dr. B R Ambedkar NIT

More information

Handwritten Hindi Numerals Recognition System

Handwritten Hindi Numerals Recognition System CS365 Project Report Handwritten Hindi Numerals Recognition System Submitted by: Akarshan Sarkar Kritika Singh Project Mentor: Prof. Amitabha Mukerjee 1 Abstract In this project, we consider the problem

More information

Building Multi Script OCR for Brahmi Scripts: Selection of Efficient Features

Building Multi Script OCR for Brahmi Scripts: Selection of Efficient Features Building Multi Script OCR for Brahmi Scripts: Selection of Efficient Features Md. Abul Hasnat Center for Research on Bangla Language Processing (CRBLP) Center for Research on Bangla Language Processing

More information

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes 2009 10th International Conference on Document Analysis and Recognition Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes Alireza Alaei

More information

SEGMENTATION OF BROKEN CHARACTERS OF HANDWRITTEN GURMUKHI SCRIPT

SEGMENTATION OF BROKEN CHARACTERS OF HANDWRITTEN GURMUKHI SCRIPT 95 SEGMENTATION OF BROKEN CHARACTERS OF HANDWRITTEN GURMUKHI SCRIPT Bharti Mehta Department of Computer Engineering Yadavindra college of Engineering Talwandi Sabo (Bathinda) bhartimehta13@gmail.com Abstract:

More information

2015, IJARCSSE All Rights Reserved Page 665

2015, IJARCSSE All Rights Reserved Page 665 Volume 5, Issue 12, December 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Script Identification

More information

A Wavelet Based Recognition System for Printed Malayalam Characters

A Wavelet Based Recognition System for Printed Malayalam Characters A Wavelet Based Recognition System for Printed Malayalam Characters M. Abdul Rahiman 1, M. S. Rajasree 2 1 Asst Professor, Department of Computer Science & Engg LBS Institute of Technology for Women, Trivandrum,

More information

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE OPTICAL HANDWRITTEN DEVNAGARI CHARACTER RECOGNITION USING ARTIFICIAL NEURAL NETWORK APPROACH JYOTI A.PATIL Ashokrao Mane Group of Institution, Vathar Tarf Vadgaon, India. DR. SANJAY R. PATIL Ashokrao Mane

More information

Character Segmentation for Telugu Image Document using Multiple Histogram Projections

Character Segmentation for Telugu Image Document using Multiple Histogram Projections Global Journal of Computer Science and Technology Graphics & Vision Volume 13 Issue 5 Version 1.0 Year 2013 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc.

More information

An Improvement Study for Optical Character Recognition by using Inverse SVM in Image Processing Technique

An Improvement Study for Optical Character Recognition by using Inverse SVM in Image Processing Technique An Improvement Study for Optical Character Recognition by using Inverse SVM in Image Processing Technique I Dinesh KumarVerma, II Anjali Khatri I Assistant Professor (ECE) PDM College of Engineering, Bahadurgarh,

More information

A Study to Recognize Printed Gujarati Characters Using Tesseract OCR

A Study to Recognize Printed Gujarati Characters Using Tesseract OCR A Study to Recognize Printed Gujarati Characters Using Tesseract OCR Milind Kumar Audichya 1, Jatinderkumar R. Saini 2 1, 2 Computer Science, Gujarat Technological University Abstract: Optical Character

More information

Date Field Extraction from Gurmukhi Handwritten Documents

Date Field Extraction from Gurmukhi Handwritten Documents Advances in Computational Sciences and Technology ISSN 0973-6107 Volume 10, Number 6 (2017) pp. 1595-1606 Research India Publications http://www.ripublication.com Date Field Extraction from Gurmukhi Handwritten

More information

Department of Studies in Computer Science, Karnataka State Open University, Mysore, India 2

Department of Studies in Computer Science, Karnataka State Open University, Mysore, India 2 Volume 5, Issue 12, December 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com K-means Clustering

More information

Keywords Handwritten alphabet recognition, local binary pattern (LBP), feature Descriptor, nearest neighbor classifier.

Keywords Handwritten alphabet recognition, local binary pattern (LBP), feature Descriptor, nearest neighbor classifier. Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Efficient

More information

Opportunities and Challenges of Handwritten Sanskrit Character Recognition System

Opportunities and Challenges of Handwritten Sanskrit Character Recognition System Opportunities and Challenges of Handwritten System Shailendra Kumar Singh Research Scholar, CSE Department SLIET Longowal, Sangrur, Punjab, India Sks.it2012@gmail.com Manoj Kumar Sachan Assosiate Professor,

More information

Online Handwritten Devnagari Word Recognition using HMM based Technique

Online Handwritten Devnagari Word Recognition using HMM based Technique Online Handwritten Devnagari Word using HMM based Technique Prachi Patil Master of Engineering Dept. of Electronics & Telecommunication Dr. D. Y. Patil SOE, Pune, India Saniya Ansari Professor Dept. of

More information

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS 8.1 Introduction The recognition systems developed so far were for simple characters comprising of consonants and vowels. But there is one

More information

USE OF JPEG ALGORITHM IN HANDWRITTEN DEVNAGRI NUMERAL RECOGNITION

USE OF JPEG ALGORITHM IN HANDWRITTEN DEVNAGRI NUMERAL RECOGNITION USE OF JPEG ALGORITHM IN HANDWRITTEN DEVNAGRI NUMERAL RECOGNITION Gajanan Birajdar 1 and Mansi Subhedar 2 Department of Electronics and Telecommunication Engineering, SIES Graduate School of Technology,

More information

A Recognition System for Devnagri and English Handwritten Numerals

A Recognition System for Devnagri and English Handwritten Numerals A Recognition System for Devnagri and English Handwritten Numerals G S Lehal 1 and Nivedan Bhatt 2 1 Department of Computer Science & Engineering, Thapar Institute of Engineering & Technology, Patiala,

More information

Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN

Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN Shamim Ahmed 1, Mohammod Abul Kashem 2 1 M.S. Student, Department of Computer Science and Engineering, Dhaka University of Engineering

More information

A Technique for Classification of Printed & Handwritten text

A Technique for Classification of Printed & Handwritten text 123 A Technique for Classification of Printed & Handwritten text M.Tech Research Scholar, Computer Engineering Department, Yadavindra College of Engineering, Punjabi University, Guru Kashi Campus, Talwandi

More information

Chapter 2. OCR System: A Literature Survey. 2.1 History of machine recognition of scripts

Chapter 2. OCR System: A Literature Survey. 2.1 History of machine recognition of scripts Chapter 2 OCR System: A Literature Survey 2.1 History of machine recognition of scripts The overwhelming volume of paper-based data in corporations and offices challenges their ability to manage documents

More information

Automatic Recognition of Offline Handwritten Urdu Digits In Unconstrained Environment Using Daubechies Wavelet Transforms

Automatic Recognition of Offline Handwritten Urdu Digits In Unconstrained Environment Using Daubechies Wavelet Transforms IOSR Journal of Engineering (IOSRJEN) e-issn: 2250-3021, p-issn: 2278-8719 Vol. 3, Issue 9 (September. 2013), V2 PP 50-56 Automatic Recognition of Offline Handwritten Urdu Digits In Unconstrained Environment

More information

Cloud Based Mobile Business Card Reader in Tamil

Cloud Based Mobile Business Card Reader in Tamil Cloud Based Mobile Business Card Reader in Tamil Tamizhselvi. S.P, Vijayalakshmi Muthuswamy, S. Abirami Department of Information Science and Technology, CEG Campue, Anna University tamizh8306@gmail.com,

More information

Word level Script Identification from Bangla and Devanagri Handwritten Texts mixed with Roman Script

Word level Script Identification from Bangla and Devanagri Handwritten Texts mixed with Roman Script HTTPS://SITES.GOOGLE.COM/SITE/JOURNALOFCOMPUTING/ Word level Script Identification from Bangla and Devanagri Handwritten Texts mixed with Roman Script Ram Sarkar, Nibaran Das, Subhadip Basu, Mahantapas

More information

Character Recognition of High Security Number Plates Using Morphological Operator

Character Recognition of High Security Number Plates Using Morphological Operator Character Recognition of High Security Number Plates Using Morphological Operator Kamaljit Kaur * Department of Computer Engineering, Baba Banda Singh Bahadur Polytechnic College Fatehgarh Sahib,Punjab,India

More information

Mixture of Printed and Handwritten Kannada Numeral Recognition Using Normalized Chain Code and Wavelet Transform

Mixture of Printed and Handwritten Kannada Numeral Recognition Using Normalized Chain Code and Wavelet Transform Mixture Printed and Handwritten Numeral Recognition Using Normalized Chain Code and Wavelet Transform Shashikala Parameshwarappa 1, B.V.Dhandra 2 1 Department Computer Science and Engineering College,

More information

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation K. Roy, U. Pal and B. B. Chaudhuri CVPR Unit; Indian Statistical Institute, Kolkata-108; India umapada@isical.ac.in

More information

I. INTRODUCTION. Keywords Script separation, Indian script, Bilingual (English-Oriya) OCR, Horizontal profiles, nearest neighbour.

I. INTRODUCTION. Keywords Script separation, Indian script, Bilingual (English-Oriya) OCR, Horizontal profiles, nearest neighbour. A Comparative Analysis of Classifiers Accuracies for Bilingual Printed Documents (Oriya-English) Sanghamitra Mohanty, Himadri Nandini Das Bebartta P.G. Department of Computer Science and Application, Utkal

More information

Bangla/English Script Identification Based on Analysis of Connected Component Profiles

Bangla/English Script Identification Based on Analysis of Connected Component Profiles Bangla/English Script Identification Based on Analysis of Connected Component Profiles Lijun Zhou 1,YueLu 1,2,andChewLimTan 3 1 Department of Computer Science and Technology, East China Normal University,

More information

DATABASE DEVELOPMENT OF HISTORICAL DOCUMENTS: SKEW DETECTION AND CORRECTION

DATABASE DEVELOPMENT OF HISTORICAL DOCUMENTS: SKEW DETECTION AND CORRECTION DATABASE DEVELOPMENT OF HISTORICAL DOCUMENTS: SKEW DETECTION AND CORRECTION S P Sachin 1, Banumathi K L 2, Vanitha R 3 1 UG, Student of Department of ECE, BIET, Davangere, (India) 2,3 Assistant Professor,

More information

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language I.J. Information Technology and Computer Science, 2013, 05, 38-43 Published Online April 2013 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijitcs.2013.05.05 A New Technique for Segmentation of Handwritten

More information