Recent Advances In Telecommunications, Informatics And Educational Technologies


Geometric Feature Extraction from Urdu Ligatures

NAILA KHAN, AWAIS ADNAN, SADIA BASAR
Department of Computer Science, Institute of Management Sciences, 1-A, Sector E-5, Phase VII, Hayatabad, Peshawar, Pakistan

ABSTRACT: This research aims at the extraction of geometric features from Urdu ligatures. Although structural features are robust, their extraction and analysis is an exceptionally complex and time-consuming task. Extraction and analysis are much simpler for geometric features, which are independent of language, script and font. Twelve geometric features are extracted from the ligature images: height, width, aspect ratio, density function, perimeter, area, perimeter to area ratio, horizontal projection profile, vertical projection profile, start point, end point and the slope between the start and end points.

Keywords: Features, Geometric, Ligature, Structural, Urdu

1. Introduction

Urdu is the national language of Pakistan [1-3]. There are 60 to 80 million Urdu speakers across the world [4]. Urdu script is very similar to the Arabic script and is written in the Nastalique calligraphic style [5-7]. The Nastalique font is highly cursive and context sensitive in nature [2, 8-10]. This cursiveness presents numerous challenges for character segmentation [8, 11]. Urdu words are composed of ligatures and isolated characters (see Fig. 1). A ligature is a sub-component of a word and can also be regarded as a sub-word [6, 12].

2. Urdu Script

Several characteristics distinguish the Urdu script. The sub-headings below summarize the most important ones.

2.1. Urdu Language Writing System

Arabic, Urdu and Persian are written in the Perso-Arabic script and are therefore very similar at the written level. Urdu uses a revised and extended set of the Arabic and Persian alphabets [13].
Any Urdu language learner is able to read the Arabic script, even without understanding the meaning of what is written. As shown in Fig. 1, the Urdu alphabet has a total of 38 characters [14]. Of these 38 characters, 28 are similar to the Arabic alphabet [15]. The shape of a character in Urdu varies according to its position in the word, so the script is extremely context sensitive. A character has different shapes when written in isolation or at the start, middle or end of a word [15]. In Urdu a word is composed of ligatures, and ligatures are combinations of characters [16]. In addition, blank spaces are not regarded as a separation or boundary between words. Diacritic marks accompany characters to indicate proper pronunciation. Urdu is written in the well-known Nastalique calligraphic style, whereas Arabic is written in the Nasakh font. The Nastalique calligraphic style is very complex and context sensitive in nature [17].

Figure 1: Urdu character set

2.2. Urdu Script Composition

Urdu script is composed of the following sub-components [16].

Isolated Character: A character in its isolated form.
Ligature: Two or more joined characters. It is also known as a sub-word.
Word: A combination of isolated characters and ligatures.
Sentence: A collection of words and isolated characters.

Any component level can be used in script recognition systems. Words and ligatures are mostly preferred because character recognition introduces segmentation complexities [8].

2.3. Bi-Directional Writing System

Urdu is a bidirectional writing system. Urdu numbers are written from left to right, while its words are written from right to left [18].

Figure 2: Urdu script writing direction

2.4. Diacritics

Urdu characters are surrounded by special marks known as diacritics. The diacritics surround the main body of a character and lie above or below it. There are three types of diacritics:

1. Nuqta (Dot)
2. Aerab
3. Superscript ط

The placement and number of nuqtas distinguish several characters in the Urdu alphabet. Nuqtas can be placed below or above the associated character, and a character carries from one to a maximum of three of them. In total, 17 characters in the Urdu alphabet are accompanied by nuqtas.

Figure 3: Showing the dots (nuqtas) accompanied by Urdu characters

Characters represent consonants and diacritics serve as vowel marks. Diacritics are also known as Aerab. Aerab help in the pronunciation of Urdu characters. They are optional and are written only when there is a need to remove confusion about the pronunciation [18]. Aerab change the sound of the letter (see Fig. 4).

Figure 4: Aerab in Urdu language

A retroflex consonant is spoken when the tongue has a curled, flat or concave shape. These consonants were not present in the Persian or Arabic alphabet. Three characters in Urdu are known as retroflex consonants. The retroflex consonants are created by placing the ط superscript on three Urdu characters. These Urdu characters are known as dental consonants. Dental consonants, as shown in Fig. 5, are spoken when the tongue is pressed against the upper teeth.

Figure 5: Retroflex consonants

3. Structural Features

3.1. Structural Features Depiction

Structural features include the topological structure of the characters [19].
Characters can be defined by their morphological/structural features such as start point, end point, branch point, cross point, branches, filled simple loop, open loop, double/complex loop, hedges, cusp etc. [17-22]. Structural features describe the physical makeup of the characters (see Fig. 6). They are highly effective for recognition and classification purposes [23, 24].

3.2. Complications Associated With Structural Features

There are numerous complications associated with structural features. Extraction of structural features is an extremely complex task [24]. There are no general rules or approaches for extracting them [24]. Structural features can be extracted from the skeleton of different symbols or characters [21, 24, 25]. Structural features have very little tolerance to rotation [24].

Designers and programmers are placed in an uncomfortable situation, since there are no general rules for identifying and extracting structural features. For each character, the features related to it are unknown and not fully defined. There is a lot of overhead in knowledge acquisition if the structural features are not established in advance [24]. Developing a classification and recognition system that uses structural features is therefore challenging.

Figure 6: Structural features of some Urdu characters

4. Pre-Geometric Features Extraction

4.1. Corpora

For the purpose of geometric feature extraction, the 2430 most frequently used Urdu ligatures were collected. Because of the limited time available for the current research, the ligatures were collected directly from the Center for Language Engineering website [25]. The Center for Language Engineering is an organization that conducts research and development in various regional languages of Pakistan [25]. According to the Center for Language Engineering, the ligatures were extracted from a corpus of size 19.3 million. The domains from which the corpus was drawn include sports/games, news, finance, culture/entertainment, consumer information and personal communications [25].

4.2. Step-wise Solution For Geometric Feature Extraction

1. All 2430 ligatures are organized within bitmap images, where each bitmap holds 21 or fewer ligatures (see Fig. 7).
2. Each bitmap image is converted into pure black and white. The ligatures, i.e. the foreground, are represented by white pixels, while black pixels represent the background. The conversion into foreground and background is achieved using thresholding.
3. The images are segmented horizontally and vertically using the horizontal projection profile and the vertical projection profile respectively. This produces the segmented ligature images.
4. Trimming is carried out to remove unwanted pixels from the top and bottom of each ligature image. The result is a ligature image fit for geometric feature extraction.
5. The final shape of each ligature image is rectangular, since the image is two dimensional, as shown in Fig. 7. Hence, geometric features for a rectangular shape are measured and extracted.
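Steps 2 to 4 above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the input is assumed to be a grayscale image stored as a list of rows with dark ink on light paper, the cutoff of 128 is arbitrary, and the function names (`threshold`, `runs`, `segment`, `trim`) are hypothetical.

```python
def threshold(gray, cutoff=128):
    """Binarize: dark ink becomes 1 (white foreground), paper becomes 0 (background)."""
    return [[1 if px < cutoff else 0 for px in row] for row in gray]

def row_profile(img):
    """Horizontal projection profile: number of foreground pixels in each row."""
    return [sum(row) for row in img]

def col_profile(img):
    """Vertical projection profile: number of foreground pixels in each column."""
    return [sum(col) for col in zip(*img)]

def runs(profile):
    """Return (start, end) pairs, end exclusive, of consecutive non-zero entries."""
    spans, start = [], None
    for i, value in enumerate(profile):
        if value and start is None:
            start = i
        elif not value and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans

def segment(img):
    """Cut the bitmap into bands along empty rows, then each band along empty columns."""
    pieces = []
    for top, bottom in runs(row_profile(img)):
        band = img[top:bottom]
        for left, right in runs(col_profile(band)):
            pieces.append([row[left:right] for row in band])
    return pieces

def trim(img):
    """Remove all-background rows from the top and bottom of a segmented piece."""
    filled = [i for i, total in enumerate(row_profile(img)) if total]
    return img[filled[0]:filled[-1] + 1] if filled else []

# Tiny example: two ink blobs on one text line.
gray = [
    [255, 255, 255, 255, 255],
    [255,   0, 255, 255,   0],
    [255,   0, 255, 255, 255],
    [255, 255, 255, 255, 255],
]
ligatures = [trim(piece) for piece in segment(threshold(gray))]
```

Cutting at every empty column would split a ligature whose stroke contains an internal gap, so a practical system would need a rule for merging such pieces; that rule is not described in the text and is left out here.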
The features extracted from each ligature image are height, width, area, perimeter, aspect ratio, density function, ratio of area to perimeter, horizontal histogram, vertical histogram, start point, end point and the slope between the start and end points.

a. Height, width, area, perimeter, aspect ratio, density function and ratio of area to perimeter are extracted from the trimmed ligature image.
b. The trimmed ligature images are resized to 32 x 32 before the horizontal and vertical histogram features are extracted.
c. Prior to the extraction of the start and end points, all nuqtas (dots) are removed from the ligature images. Once the nuqtas are removed, the images are trimmed again. The start and end points are then located and the slope between them is found.

a) Sample image b) Thresholded image
Figure 7: Sample of segmented and trimmed ligature images

5. Geometric Features Proposition

The proposed method does not take into account the details of the ligature strokes and their structure. The proposed geometric features are extremely simple to extract and analyze. The geometric features suggested for extraction are:

1. Width: Measurement from side to side of the ligature image.
2. Height: Measurement of the tallness of the ligature image.
3. Aspect ratio: Height divided by width of the ligature image.
4. Density function: Total number of pixels covered by the ligature stroke within the ligature image.
5. Perimeter: Sum of the sides of the ligature image.
6. Area: Product of width and height.
7. Perimeter to area ratio: Division of the perimeter by the area of the ligature image.
8. Horizontal projection profile: Sum of pixel intensities along each row of the ligature image.
9. Vertical projection profile: Sum of pixel intensities along each column of the ligature image.
10. Start point: The first pixel, scanning from top to bottom along the left-hand border of the ligature image, where the ligature stroke touches the border.
11. End point: The first pixel, scanning from top to bottom along the right-hand border of the ligature image, where the ligature stroke touches the border.
12. Slope between start and end point: The slope of the line connecting the start point with the end point.

6. Geometric Features Extraction

6.1. Width

Width is the measurement of a geometric shape from one side to the other. Each ligature image takes up a different amount of horizontal space, as shown in Fig. 8. Equation (1) is used for calculating the width of each segmented ligature image.

Figure 8: The width of few segmented and trimmed Urdu ligatures

6.2. Height

Height is the measurement of tallness. Each ligature is formed from characters of varying heights. Once the ligature is trimmed, the height can be calculated. Refer to (2) to find the height of a ligature image. Fig. 9 shows the height feature for a few Urdu ligature images. Both height and width are vital geometric features, because other geometric features can be derived from the relationship between the two.

Figure 9: The height of few segmented and trimmed Urdu ligatures

6.3. Aspect Ratio

The aspect ratio is defined as the measure of steepness of a line connecting two points on the coordinate plane. In other words, the aspect ratio of a line is the ratio between y and x: the value of y increases by some amount as the value of x increases, and the ratio remains constant anywhere on the line. For a ligature image, the aspect ratio is calculated from two values, the height and the width. Aspect ratio is sometimes also known as the slope.

$$\text{Aspect ratio} = \frac{\text{Height}}{\text{Width}} \qquad (3)$$

Equation (3) gives the relationship between height and width that is used to find the aspect ratio of a ligature image. Fig. 10 shows the ligature images with the least aspect ratio. Fig. 10 (a) has the least aspect ratio of all the 2430 ligatures analyzed.

Figure 10: Ligature images with least aspect ratio

Fig. 11 shows the ligature images with the maximum aspect ratio. Fig. 11 (a) to Fig. 11 (e) display the maximum aspect ratios for different ligatures in descending order, so if Fig. 11 is read from right to left the aspect ratio increases. The ligature images with the maximum aspect ratio have slopes that are almost vertical along the y-axis.

Figure 11: Ligature images with maximum aspect ratio

6.4. Density Function

The density function relates the total number of pixels in the image to the pixels covered by the main character in the image.
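The size-based features in the list of Section 5 (items 1 to 7) follow directly from the dimensions of the trimmed binary image. The sketch below is an illustration under assumptions rather than the paper's code: the image is a list of rows of 0/1 values, density is taken as the raw stroke-pixel count from item 4 (whether the paper normalizes it by the image area is left open), and the ratio uses perimeter divided by area as in item 7, although Section 6.7 words it the other way round. The function name `size_features` is hypothetical.

```python
def size_features(img):
    """Compute the rectangle-based features of a trimmed binary ligature image."""
    height = len(img)             # number of rows
    width = len(img[0])           # number of columns
    stroke_pixels = sum(sum(row) for row in img)  # foreground (value 1) pixels
    perimeter = 2 * width + 2 * height            # bounding rectangle perimeter
    area = width * height                         # bounding rectangle area
    return {
        "width": width,
        "height": height,
        "aspect_ratio": height / width,           # height over width, as in item 3
        "density": stroke_pixels,                 # assumption: raw count, item 4
        "perimeter": perimeter,
        "area": area,
        "perimeter_to_area": perimeter / area,    # assumption: item 7 definition
    }

features = size_features([
    [0, 1, 1],
    [1, 1, 0],
])
```

Because every value except the density depends only on width and height, two ligatures with the same bounding box differ only in the density entry; the projection and start/end point features are what separate such cases.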

Equation (4) is used for calculating the density function for a ligature image.

6.5. Perimeter

Perimeter is defined as the sum of all sides of a geometric shape or polygon. It is the measure of the length of a shape around its outermost extremities (see Fig. 12). Finding the perimeter of a quadrilateral is very simple, and it is simplest for a square, whose sides are all of equal length. For rectangles, the perimeter can be calculated using (5).

$$\text{Perimeter} = 2x + 2y \qquad (5)$$

A rectangle has two side lengths, x and y (which can be referred to as the width and height), and therefore its perimeter is equal to 2x + 2y. Every square is a rectangle, but a rectangle is a square only when x and y are equal.

Figure 12: The perimeter calculated for some Urdu ligature images

6.6. Area

The area of a shape is the total amount of space that it covers. The ligature images are rectangular, and the area of a rectangle is easily calculated by multiplying its height and width, as shown in (6). Fig. 14 shows the perimeter calculated for four ligature images.

$$\text{Area} = \text{Width} \times \text{Height} \qquad (6)$$

6.7. Perimeter to Area Ratio

The perimeter to area ratio is the area divided by its perimeter, refer to (7). The perimeter and area calculated in earlier sections are used to calculate the perimeter to area ratio.

6.8. Horizontal Projection Profile

A histogram is a graphical representation showing the distribution of data values [15]. Here it is used to show how the pixels of a ligature image are distributed horizontally. The horizontal histogram consists of peaks and valleys that reflect this distribution. In the proposed research an important step is performed before finding the horizontal and vertical projection profiles: all ligature images are resized to 32 x 32. Fig. 15 shows the horizontal and vertical projection profiles calculated for the ligature faen.
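The horizontal and vertical projection profiles, and the reduction of each profile to a single peak index, can be sketched as follows. The sketch assumes a binary image given as a list of rows that has already been resized (resizing is omitted), reports 1-based indices on the assumption that the paper counts rows and columns from 1, and assumes the peak column is used for the vertical profile in the same way as the peak row is for the horizontal one. The function names are hypothetical.

```python
def horizontal_profile(img):
    """Sum of pixel values along each row, top to bottom."""
    return [sum(row) for row in img]

def vertical_profile(img):
    """Sum of pixel values along each column, left to right."""
    return [sum(col) for col in zip(*img)]

def peak_index(profile):
    """1-based index of the largest entry; the first one wins when values tie."""
    return profile.index(max(profile)) + 1

img = [
    [0, 1, 0],
    [1, 1, 0],
    [1, 1, 0],
]
h = horizontal_profile(img)   # one sum per row
v = vertical_profile(img)     # one sum per column
row_feature = peak_index(h)
col_feature = peak_index(v)
```

`list.index` returns the first position of the maximum, which gives the top-most row (or left-most column) when several share the peak value.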
Figure 13: The area calculated for some Urdu ligature images

Figure 14: Horizontal and vertical projection profiles for ligature faen.

For the horizontal projection profile, the image rows are scanned from top to bottom, and for each row the sum of its pixels is calculated. A row that contains many white (on) pixels appears as a peak in the horizontal projection profile. If all pixels in a row are black (off), the row represents the background and appears as a valley. Once the horizontal histogram is computed, its maximum (peak) value is found. The index of the row that has the maximum (peak) value is used as a geometric feature for classification and recognition purposes. If more than one row has the same maximum (peak) value, the first such row from top to bottom is selected as the feature. For the ligature image faen, which has 32 rows and 32 columns, the row index selected as the geometric feature is shown in Fig. 16.

Figure 15: Horizontal projection profile values along with rows index for ligature faen

In Fig. 15 the maximum peak value of the horizontal histogram is 15, and the row of the image that holds this peak value is row 28. Hence, row index 28 is used as the geometric feature for the horizontal projection profile of the ligature image faen.

6.9. Vertical Projection Profile

The vertical histogram of a ligature image is computed by taking the sum of pixels along each column [15]. The vertical projection profile is computed in the same way as the horizontal projection profile, with one difference: the sums run along the columns. The vertical projection profile for the ligature faen can be seen in the corresponding figure.

6.10. Start Point

The start point and end point cannot be located easily in images whose ligatures are accompanied by nuqtas (dots). The nuqtas lead to erroneous data, so the dots are removed from the ligature images. To find the start and end points, as well as the slope between them, these steps were followed:

1. Nuqtas (dots) are removed from the image.
2. The image is trimmed to remove unwanted pixels.
3. The final trimmed image is resized to 64 x 64.
4. Start and end points are extracted and the slope is calculated.

Fig. 16 depicts the steps taken to identify the start point, the end point and the slope between them. In Fig. 17 the directional arrow shows that the steps were carried out from left to right.

Figure 16: Steps for finding the start point, end point and slope between them

The start point and end point are two important features. To calculate the start and end points and find the slope between them, the ligature image is considered to lie in the Cartesian coordinate plane (see Fig. 17), where X covers the columns of the image and Y covers the rows.
Figure 17: Cartesian coordinate plane assumed for ligature image

The start point is the pixel that has a value of 1 (white) and lies in the left-most column of the ligature image. This is the pixel where the ligature stroke touches the left-hand border of the image. If the ligature stroke touches the left-hand border at multiple points, the column is analyzed from top to bottom and the first pixel found with value 1 is taken. The position of the start point is given by the pixel coordinates P(X1, Y1), where Y1 is the row value and X1 the column value. The value of X1 is fixed to 1. The first column of the ligature image is scanned from top to bottom, and the row at which a pixel is on, i.e. 1 (white), is taken as the value of Y1. The intersection of X1 and Y1 gives the required pixel P(X1, Y1). The start point for a ligature image can be seen in Fig. 18. The value of Y1 is used as a geometric feature and stored for future reference.
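The border scan described above, together with the end point and slope defined in items 11 and 12 of Section 5, can be sketched as follows. The sketch assumes a trimmed binary image (list of rows, 1 = stroke) from which the nuqtas have already been removed and whose stroke touches both side borders; resizing is omitted, so X2 is simply the image width. Coordinates are 1-based to match X1 = 1, and the function names are hypothetical.

```python
def first_on_row(img, col):
    """1-based row of the first stroke pixel in a 0-based column, scanning top to bottom."""
    for r, row in enumerate(img):
        if row[col] == 1:
            return r + 1
    return None  # the stroke does not touch this column

def start_end_slope(img):
    """Return start point P(X1, Y1), end point P(X2, Y2) and the signed slope between them."""
    width = len(img[0])
    x1, x2 = 1, width                      # left-most and right-most columns
    y1 = first_on_row(img, 0)
    y2 = first_on_row(img, width - 1)
    if y1 is None or y2 is None:
        raise ValueError("stroke must touch both side borders (trim the image first)")
    if x1 == x2:
        raise ValueError("image must be at least two columns wide")
    slope = (y2 - y1) / (x2 - x1)          # change in rows over change in columns
    return (x1, y1), (x2, y2), slope

start, end, slope = start_end_slope([
    [0, 0, 1],
    [1, 0, 1],
    [1, 1, 1],
])
```

Since row numbers grow downward, a stroke that rises from left to right yields a negative value here; replacing both differences with their absolute values gives a non-negative slope.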

Figure 18: Finding start point for a ligature image

6.11. End Point

The end point is the pixel that has a value of 1 (white) and lies in the right-most column of the ligature image. This is the pixel where the ligature touches the right-hand border of the image. If the ligature touches the border at multiple points, the column is analyzed from top to bottom and the first pixel found is taken. The position of the pixel is given as P(X2, Y2). The value of X2 is fixed to 64. The last column of the image is scanned from top to bottom, and the row at which a pixel is on, i.e. 1 (white), is the value of Y2. The combination P(X2, Y2) gives the pixel where the ligature touches the right-hand border of the image. The value of Y2 is used as a geometric feature. The end point for a ligature image can be seen in Fig. 19.

Figure 19: Finding end point for a ligature image

6.12. Slope between Start Point and End Point

A straight line is created when the start point is connected to the end point. The slope of the line is found using the values X1, Y1, X2 and Y2 [26]. Equation (8) shows the formula for the slope between the start and end points. The formula is similar to the aspect ratio used earlier, i.e. change in height divided by change in width: the numerator is the difference of the Y values, while the denominator is the difference of the X values.

$$\text{Slope} = \frac{Y_2 - Y_1}{X_2 - X_1} \qquad (8)$$

An important point here is that the slope may be negative. Because Y is a row index that increases downward, the slope is negative when the line slants upward from left to right (see Fig. 20) and positive when the line slants downward from left to right (see Fig. 21).

Figure 20: Negative slope for a ligature image

Figure 21: Positive slope for a ligature image

To avoid negative values, the absolute difference between Y1 and Y2 and the absolute difference between X1 and X2 are used when computing the slope.

7. Conclusion

Urdu is an extremely cursive and context sensitive language. Extracting structural/topological features is challenging, complex and time consuming. The geometric features proposed here are easier to understand, analyze, extract and use. These features can be used in various pattern recognition software, such as optical character recognition systems.

References

[1] T. Nawaz, S. A. H. S. Naqvi, H. ur Rehman, and A. Faiz, "Optical character recognition system for urdu (naskh font) using pattern matching technique," International Journal of Image Processing (IJIP), vol. 3, p. 92.
[2] S. T. Javed, S. Hussain, A. Maqbool, S. Asloob, S. Jamil, and H. Moin, "Segmentation free nastalique urdu ocr," in Proceedings of World Academy of Science, Engineering and Technology, 2010.
[3] S. Hussain, "Resources for Urdu Language Processing," in IJCNLP, 2008.
[4] S. Sardar and A. Wahab, "Optical character recognition system for Urdu," in Information and Emerging Technologies (ICIET), 2010 International Conference on, 2010.

[5] O. Mukhtar, S. Setlur, and V. Govindaraju, "Experiments on urdu text recognition," in Guide to OCR for Indic Scripts, ed: Springer, 2010.
[6] S. A. Sattar, S. Haque, M. K. Pathan, and Q. Gee, "Implementation challenges for nastaliq character recognition," in Wireless Networks, Information Processing and Systems, ed: Springer, 2009.
[7] M. A. U. Rehman, "A new scale invariant optimized chain code for nastaliq character representation," in Computer Modeling and Simulation, ICCMS'10. Second International Conference on, 2010.
[8] G. S. Lehal, "Ligature Segmentation for Urdu OCR," in Document Analysis and Recognition (ICDAR), International Conference on, 2013.
[9] S. A. Husain, A. Sajjad, and F. Anwar, "Online Urdu Character Recognition System," in MVA, 2007.
[10] M. W. Sagheer, C. L. He, N. Nobile, and C. Y. Suen, "Holistic Urdu handwritten word recognition using support vector machine," in Pattern Recognition (ICPR), International Conference on, 2010.
[11] H. Malik and M. A. Fahiem, "Segmentation of printed urdu scripts using structural features," in Visualisation, VIZ'09. Second International Conference in, 2009.
[12] S. Hussain and M. Afzal, "Urdu computing standards: Urdu zabta takhti (uzt) 1.01," in Multi Topic Conference, IEEE INMIC Technology for the 21st Century. Proceedings. IEEE International, 2001.
[13] S. A. Husain, "A Multi-tier Holistic approach for Urdu Nastaliq Recognition," Multi Topic Conference, Abstracts 2002, p. 84.
[14] N. Shahzad, B. Paulson, and T. Hammond, "Urdu Qaeda: Recognition System for Isolated Urdu Characters," in IUI 2009 Workshop on Sketch Recognition, Sanibel Island, Florida.
[15] U. Pal and A. Sarkar, "Recognition of Printed Urdu Script," presented at the Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2.
[16] S. A. Sattar, S.-u. Haque, and M. K. Pathan, "A Finite State Model For Urdu Nastalique Optical Character Recognition," International Journal Of Computer Science And Network Security, vol. 9, September.
[17] M. I. Razzak, S. A. Husain, A. A. Mirza, and A. Belaid, "Fuzzy Based Preprocessing Using Fusion Of Online And Offline Trait For Online Urdu Script Based Languages Character Recognition," International Journal Of Innovative Computing, Information And Control, vol. 8, May.
[18] S. T. Javed, S. Hussain, A. Maqbool, S. Asloob, S. Jamil, and H. Moin, "Segmentation Free Nastalique Urdu OCR," World Academy Of Science, Engineering And Technology, vol. 70.
[19] L. Eikvil, "Optical Character Recognition," Norsk Regnesentral, P.B. 114 Blindern, N-0314 Oslo, 1993.
[20] S. A. Husain, A. Sajjad, and F. Anwar, "Online Urdu Character Recognition System," in MVA2007 IAPR Conference on Machine Vision Applications.
[21] A. O. M. Saleh, "A method of coding handwritten Arabic characters and its application to context-free grammar," Pattern Recognition Letters, vol. 15.
[22] M. I. Razzak, F. Anwar, S. A. Husain, A. Belaid, and M. Sher, "HMM and fuzzy logic: A hybrid approach for online Urdu script-based languages' character recognition," Know.-Based Syst., vol. 23.
[23] L. Heutte, T. Paquet, J. V. Moreau, Y. Lecourtier, and C. Olivier, "A structural/statistical feature based vector for handwritten character recognition," Pattern Recognition Letters, vol. 19.
[24] M. S. Khorsheed, "Recognising handwritten Arabic manuscripts using a single hidden Markov model," Pattern Recognition Letters, vol. 24.
[25] S. Alma'adeed, C. Higgins, and D. Elliman, "Off-line recognition of handwritten Arabic words using multiple hidden Markov models," Knowledge-Based Systems, vol. 17.
[26] S. Malik and S. A. Khan, "Urdu Online Handwriting Recognition," Emerging Technologies, Proceedings of the IEEE Symposium, vol. 17.


A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script Arwinder Kaur 1, Ashok Kumar Bathla 2 1 M. Tech. Student, CE Dept., 2 Assistant Professor, CE Dept.,

More information

Nastaleeq: A challenge accepted by Omega

Nastaleeq: A challenge accepted by Omega Nastaleeq: A challenge accepted by Omega Atif Gulzar, Shafiq ur Rahman Center for Research in Urdu Language Processing, National University of Computer and Emerging Sciences, Lahore, Pakistan atif dot

More information

Optical Character Recognition System for Arabic Text Using Cursive Multi-Directional Approach

Optical Character Recognition System for Arabic Text Using Cursive Multi-Directional Approach Journal of Computer Science 3 (7): 549-555, 2007 ISSN 1549-3636 2007 Science Publications Optical Character Recognition System for Arabic Text Using Cursive Multi-Directional Approach 1 Mansoor Al-A'ali

More information

Handwritten Devanagari Character Recognition Model Using Neural Network

Handwritten Devanagari Character Recognition Model Using Neural Network Handwritten Devanagari Character Recognition Model Using Neural Network Gaurav Jaiswal M.Sc. (Computer Science) Department of Computer Science Banaras Hindu University, Varanasi. India gauravjais88@gmail.com

More information

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language I.J. Information Technology and Computer Science, 2013, 05, 38-43 Published Online April 2013 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijitcs.2013.05.05 A New Technique for Segmentation of Handwritten

More information

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Ajay K. Talele Department of Electronics Dr..B.A.T.U. Lonere. Sanjay L Nalbalwar

More information

Word-wise Hand-written Script Separation for Indian Postal automation

Word-wise Hand-written Script Separation for Indian Postal automation Word-wise Hand-written Script Separation for Indian Postal automation K. Roy U. Pal Dept. of Comp. Sc. & Engg. West Bengal University of Technology, Sector 1, Saltlake City, Kolkata-64, India Abstract

More information

Anale. Seria Informatică. Vol. XVII fasc Annals. Computer Science Series. 17 th Tome 1 st Fasc. 2019

Anale. Seria Informatică. Vol. XVII fasc Annals. Computer Science Series. 17 th Tome 1 st Fasc. 2019 EVALUATION OF AN OPTICAL CHARACTER RECOGNITION MODEL FOR YORUBA TEXT 1 Abimbola Akintola, 2 Tunji Ibiyemi, 3 Amos Bajeh 1,3 Department of Computer Science, University of Ilorin, Nigeria 2 Department of

More information
