Pattern Recognition Letters

Size: px
Start display at page:

Download "Pattern Recognition Letters"

Transcription

1 Pattern Recognition Letters 32 (2011) Contents lists available at SciVerse ScienceDirect Pattern Recognition Letters journal homepage: An improved contour-based thinning method for character images Soumen Bag a,, Gaurav Harit b a Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur , India b Department of Computer Science and Engineering, Indian Institute of Technology Rajasthan, Jodhpur , India article info abstract Article history: Received 3 September 2010 Available online 11 September 2011 Communicated by A.M. Alimi Keywords: Ambiguous region Contour Medial axis Shape characteristics Skeleton segment Thinning Digital skeleton of character images, generated by thinning method, has a wide range of applications for shape analysis and classification. But thinning of character images is a big challenge. Removal of spurious strokes or deformities in thinning is a difficult problem. In this paper, we propose a contour-based thinning method used for performing skeletonization of printed noisy isolated character images. In this method, we use shape characteristics of text to get skeleton of nearly same as the true character shape. This approach helps to preserve the local features and true shapes of the character images. As a by-product of our thinning approach, the skeleton also gets segmented into strokes in vector form. Hence further stroke segmentation is not required. Experiment is done on printed English, Bengali, Hindi, and Tamil characters and we obtain much better results comparing with other thinning methods without any post-processing. Ó 2011 Elsevier B.V. All rights reserved. 1. Introduction Thinning of shape has a wide range of application in image processing, machine vision, and pattern recognition. But removal of spurious strokes or shape deformation in thinning is a difficult problem. In the past several decades many thinning algorithms have been developed considering all these problems (Lam et al., 1992; Vincze and K}ovári, 2009). They are broadly classified into two groups: raster scan-based and medial axis-based. Raster scan-based methods are classified into two other categories: sequential and parallel. Sequential algorithms consider one pixel at a time and visit the pixel by raster scanning (Arcelli and di Baja, 1989) or contour following (Arcelli, 1981). Parallel thinning algorithms are based on iterative processing and they consider a pixel for removal based on the results of previous iteration only (Datta and Parui, 1994; Huang et al., 2003; Leung et al., 2000; Zhang and Suen, 1984; Zhu and Zhang, 2008). Many of the raster scan-based character thinning methods can not preserve the local properties or features of the character images properly. As a result, they give slightly shape distorted output. Medial axis-based methods generate a central or median line of pattern directly in one pass without examining all the individual pixels (Martinez-Perez et al., 1987). They also give slight distorted result at some local regions. Here we attempt to minimize local distortions by making use of shape characteristics of text. Next we begin a brief description about few prominent raster scanbased thinning algorithms. Corresponding author. Tel.: addresses: bagsoumen@gmail.com (S. Bag), gharit@iitj.ac.in (G. Harit). Datta and Parui (1994) have proposed a parallel thinning algorithm which preserves connectivity and produces skeleton of one pixel thickness. Each iteration of the algorithm is divided into four sub-iterations. These sub-iterations use two 1 3 and two 3 1 templates for removing boundary pixels along east, west, north, and south directions, respectively. The method uses one 3 3 window to avoid the removal of critical point (which alters the connectivity) and end point (which shortens a leg of the skeleton). Leung et al. (2000) have introduced a contour following thinning method having no sub-iteration. They have used a lookup table to avoid the use of multiple templates for removing boundary pixels. The lookup table has 256 entries and each entry contains three fields (neighbor number, weight number, and connection number). The number of entries depends on the different possibilities of 8-connectivity of a pixel. The determination of the possibility of pixel removal depends on the values of the patterns of the spatial contour pixels. To improve the pixel connectivity of thinned character images, Huang et al. (2003) have introduced a new set of templates (three 4 3, one 4 4, and three 3 4). This algorithm considers all possible patterns of 8-connectivity of a pixel (similar to Leung et al. (2000)) and generates template-based elimination rule for deleting boundary pixels. The pixel deletion is done based on the number of black pixels in the 8-neighbor connectivity. Additionally, it compensates information loss by integrating the contour and skeleton of pattern. The information loss is detected based on the ratio of the skeleton and contour pixels. If the value is less than a predefined threshold then the thinned image is replaced by the contour image /$ - see front matter Ó 2011 Elsevier B.V. All rights reserved. doi: /j.patrec

2 S. Bag, G. Harit / Pattern Recognition Letters 32 (2011) Fig. 1. Distortions of different thinning methods: (a) input image; (b) thinned image (Red circle indicates distortion at junction and end points; Red arrow indicates spurious strokes at curvature region). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) The main contribution of Zhu and Zhang (2008) method is that it does not consider all pixels equally for performing character thinning. It is based on the substitution of pixels of strokes or curves which are most valuable parts for character recognition. This method uses a set of thinning templates, similar to Datta and Parui (1994), for boundary pixel removal and handling corners or junction points. This method has tried to preserve the shape of character images such as Chinese characters, English alphabets, and numerals after thinning. All the above discussed algorithms perform boundary pixel deletion using various thinning templates or values from lookup tables. Next we discuss medial axis-based thinning algorithm. For performing thinning of regular shape, Martinez-Perez et al. (1987) have introduced a medial axis-based thinning method. This approach does not use thinning templates or lookup tables for performing thinning. They have generated medial points in between two parallel contour segments and repeat the procedure for all remaining parallel contour segments. But they have not applied this concept for curve. Finally, we give a brief description of a thinning method which is used for skeleton simplification. Telea et al. (2004) have introduced a method to simplify skeleton structure of an image by removing few skeleton branches. The simplification is done by analyzing the quasi-stable points of the Bayesian energy function, parameterized by boundary of contour and internal structure of the skeleton. The experimental results show that it gives multi-scale skeleton at various abstract levels. Apart from these common methods, some new techniques (such as graphical method, wavelet, neural networks, critical kernels, and pulse coupled neural network) have been introduced into image thinning. Melhi et al. (2001) have proposed a method for thinning binary handwritten text images by generating graphical representations of words within the image. Tang and You (2003) have introduced a wavelet-based scheme to extract skeleton of ribbon-like shape. You and Tang (2007) have presented a scheme of extracting the skeleton of English and Chinese characters based on wavelet transform. Krishnapuram and Chen (1993) have applied recurrent neural networks to image thinning. Altuwaijri and Bayoumi (1995) have used a self-organizing neural network to thin digital images. Gu et al. (2004) and Shang et al. (2007) have proposed a binary image thinning algorithm by using the autowaves generated by pulse coupled neural network (PCNN). Bertrand and Couprie (2006) have introduced a new 2D parallel thinning algorithm using critical kernels. All the above mentioned algorithms can not retain the shape of the character images at the junction and end points properly and produce spurious strokes at high curvature region (Fig. 1). From that point of view, we address a contour-based thinning method for generating proper skeleton of isolated character images. The algorithm is non iterative and template free. The main advantage is that it uses shape characteristics of text to determine the areas within the image region to stop thinning partially. This approach helps to preserve the local features and true shape of the character. Additionally, it produces a set of vectorized strokes with the thin skeleton as by-product. These resultant skeletons have stronger ability to oppose shape deformation and more convenience to shape analysis and classification. Fig. 2. System architecture of the proposed method.

3 1838 S. Bag, G. Harit / Pattern Recognition Letters 32 (2011) This paper is organized as follows. Section 2 describes the methodology of the proposed method. Section 3 contains the experimental results of the proposed method and comparison with other thinning methods. This paper concludes with some remarks on the proposed method and future work in Section Proposed contour-based thinning algorithm In this section, we describe the proposed thinning strategy based on contour extraction and medial axis generation for text alphabets. Fig. 2 shows the system architecture of the proposed method. The advantages of our proposed technique are: Shape distortion at junction and end points and many of the spurious thinning branches which inevitably occur when applying other rasterized thinning algorithms do not occur in our proposed algorithm. It provides a vectorized output of the thin skeleton. This vectorized output is a collection of strokes. Hence further stroke segmentation is not required. Our approach is most suitable for thinning noisy printed text alphabets. Particularly it gives correct output even in the presence of changing width of the strokes Contour extraction and ordering Given a scanned document page, we binarize it using the Otsu s algorithm (Otsu, 1979). Currently we are working with documents with all text content, hence we identify the alphabets as connected components in binary images. This works for English (Casey and Lecolinet, 1996) and Tamil alphabets (Shanthi and Duraiswamy, 2005); however, for Bengali or Hindi documents the entire word gets identified as a single connected component because of the matra/shiro-rekha which connects the individual characters. For this case we separate out the individual akshara within a word by using the character segmentation methods reported in (Garain and Chaudhuri, 2002; Pal and Datta, 2003) for Bengali and in (Bansal and Sinha, 2002; Ma and Doermann, 2003) for Hindi. Given an isolated alphabet, its boundary contour is extracted. We detect a boundary pixel on the alphabet using a 3 3 mask centered on every black pixel of the character. If there is even a single background (white) pixel within the mask it would imply that the center black pixel is flagged as a boundary pixel. Starting from this first boundary pixel we traverse the boundary using a connected component aggregation algorithm, i.e., we go on aggregating the black pixels which are adjacent and also on the boundary. This produces a vector representation of the boundary contour. The result of contour extraction for Fig. 4(a) is shown in Fig. 4(b). After that, we perform the ordering of the contour pixels which helps us for traversing the contour in a particular direction. This boundary contour is then segmented into straight line or curved segments which we refer to as contour-strokes. a start point s is denoted C s. The other end i.e., the end point is allowed to extend to the adjacent contour pixel x 1, then to the next one x 2 in sequence and so on. Our objective is to find a suitable contour boundary pixel for the end point of C s. For each possible end point location e i, we get a hypothesized contour segment cse i. The orientation h i of this segment is the slope of the line joining the points s and e i. We analyze the change Dh i = h i h i 1. For a sequence of hypothesized end point positions e i =...,x k 1,x k,x k+1,..., we note the trend of Dh i being positive, negative, or zero. We form groups of successive pixels for which Dh i remains positive, negative, or zero. In other words the pixels forming a group are successive and have a common trend for Dh i. For each such group, say G p, which spans pixels, say x p1 to x p2, we compute the total change in orientation, Dh Gp = h p2 h p1. We also note the total number of pixels N Gp in the group G p. Since our objective is to identify a candidate end point e for the hypothesized contour segment C s which attempt to choose a suitable position for e from the available end points of the pixel groups G p s which have been identified with respect to the start point s of the contour segment C s. If Dh Gp > h threshold (value is set to ±10 ) for any group G p, then take the end point of C s as the end point of G p 1.IfG p is the first pixel group from the start point s, then restrict G p to extend up to only that pixel where Dh Gp = h threshold. If N Gp > N threshold (value is set to 0.5 times of the average height of the characters) for any group G p, then count the total number of pixels N s in C s up to the end pixel of G p 1.IfN s > N threshold then take the end point of G p 1 as the end point of C s. In the above formulation G p 1 denotes the group just preceding G p. Once we get the candidate end point for the hypothesized contour segment C s, we do a refinement of its location on the contour. In other words we search for its most appropriate position in the vicinity of the identified candidate position. This search is based on analyzing local variance of orientations in the forward and backward direction from each contour pixel position in the vicinity of the candidate end point. The position where both the forward and the backward variances are minimum/low is taken as the most appropriate position of the end point of C s. The result of contour segmentation for Fig. 4(b) is shown in Fig. 4(c) where the identified contour-strokes are shown in a different color. Each contour-stroke corresponds to a straight line stroke or a curved stroke within the character. Note that our notion of a stroke is different from the one used in handwriting recognition. Given that the character boundary contour has been now segmented into contour-strokes, as the next step we get the medial axis for the character Medial axis generation We obtain the medial axis by finding the pixel pairs hp i ; P 0 i i on the opposite boundary contour partitioned into contour segments (Fig. 3). To obtain the medial axis we need to identify the parallel 2.2. Contour segmentation In this section, we describe our algorithm to convert the vectorized boundary contour into component segments straight lines or curves. The novelty of this algorithm lies in its search-based identification of contour segmentation points. Given a vectorized contour the candidate segmentation points are identified by analyzing the incremental changes in the orientation as pixels are added at one end of a hypothesized contour segment. The orientation of a contour segment is the slope of the line joining its hypothesized end points. A hypothesized contour segment initiated from Fig. 3. Closest points on opposite contour segments.

4 S. Bag, G. Harit / Pattern Recognition Letters 32 (2011) contour-strokes on the boundary contour. While doing so we incorporate the most obvious text-specific knowledge: the distance between the parallel contour-strokes should be small (since the pen-width is generally small), and all the pixels in-between the two parallel contour-strokes should be black i.e., belong to the character itself and not the background. Processing starts from any contour-stroke, say C s. We process the pixels {x 1,x 2,...,x Ns } on the contour segment C s. 1. For a pixel x i on C s compute the local orientation. " # h local x i ¼ 1 2 Avg hðx i ; x iþj Þþ Avg hðx i ; x i j Þ j¼1to5 j¼1to5 where h(x i,x i+j ) refers to the orientation of the line segment joining pixel x i to the jth pixel following down the contour (i.e., forward direction), and h(x i,x i j ) refers to the orientation of the line segment joining pixel x i to the jth pixel preceding up the contour (i.e., backward direction). The averaging operation is denoted by Avg. 2. Compute the direction? x i perpendicular to the local orientation h local x i. 3. Starting from pixel x i on C s, traverse pixels along the direction of? x i till a border pixel (not on contour C s ) is reached. Let this border pixel be denoted as x 0 i. 4. Compute the local orientation h x 0 at the pixel x 0 local i i. 5. Compute the distance d xi x 0 between the points x i i and x 0 i. 6. If h local x i is nearly same as h x 0 (differ by ±10 ) and d xi x 0 < n local i i (value is set to 1.4 times of pen width), then mark the mid point of the line joining x i and x 0 i as a pixel on the medial axis. If the two conditions are not satisfied then the medial pixel is not marked since this is likely to be the junction point of two or more pen strokes, hence the ambiguity has to be resolved later. 7. Flag off the pixels x i and x 0 i as been processed and move to the pixel x i+1 and repeat the above steps. The above steps are repeated for all contour strokes. For each contour stroke we consider only the pixels which have not been flagged as already processed. The output of this step is given in Fig. 4(d). We see from the result that the medial axis of the character has come up as skeleton segments. These skeleton segments now need to be extrapolated to join the neighboring segments while ensuring that the extrapolation is within the character region Extrapolating skeleton segments in ambiguous regions We consider the skeleton segments of a character as forming the set S¼ S s 1 ; Se 1 ; S s 2 ; Se 2 ;...; S s m ; Se m, where S s i and S e i denote the start and end points of the ith skeleton segment. The extreme points now need to be extended so that the neighboring skeleton segments can be joined together. The steps to identify the neighboring skeleton segments are as follows: 1. For each extreme point of a skeleton segment find the close extreme points belonging to other skeleton segments. For this purpose we consider that the distance between the two extreme points should be less than e (value is set to 0.45 times of the average height of characters) and that the line joining the two extreme points should be within the character region. 2. If a given extreme point has two or more extreme points of other segments close enough then we make a proximity set, e.g., P¼ S s a ; Ss b ; Se c would indicate proximal extreme point of three skeleton segments a, b, and c. Each proximity set would correspond to extreme points which belong to the junction region belonging to multiple pen strokes. Identifying the medial axis points in the junction region of pen strokes gives skeleton points which tend to distort the true shape of the character. Our approach avoids identifying the medial axis points in such ambiguous regions and instead tries to extrapolate the extreme points of the skeleton segments in a proximity group such that the result would be very close to the true shape of the character. This is the major contribution of this work. The steps for extrapolation are as follows: 1. In a given proximity set P if any two skeleton segments have the orientation differ by ±10 then any one of the two close extreme points is extended to meet the other one. 2. If there is a skeleton segment whose orientation does not match with any other segment in the same proximity set, then its extreme point is extended till it meets the skeleton segment (extrapolated from other segments) or a junction (formed by extrapolating from other segments in P) on the skeleton segment or to the extreme point of the other skeleton segment. Of the three cases mentioned which one applies depends on the number of extreme points which are part of a proximity set. The result of extrapolation is showed in Fig. 4(e). Note that extrapolation result is almost the true shape of the character and does not produce spurious segments. Our approach gives correct thinning results even for very thick strokes. 3. Experimental results and discussion We collected characters from several heterogeneous printed documents in different scripts, such as English, Bengali, Hindi, and Tamil. The number of characters in the testing set is 2315 (450 for English, 690 for Bengali, 652 for Hindi, and 523 for Tamil). All the characters were collected in a systematic manner from printed pages scanned on a HP scanjet 5590 scanner at 300 dpi. We tested our proposed method on a randomly selected set of character images of different scripts. We find our approach is robust, and good results are achieved for characters of different scripts. Some results are shown in Fig. 5. There are two outputs of the algorithm: the set of vectorized skeleton segments (Fig. 5(d)) and the final skeleton image (Fig. 5(e)) formed by performing the extrapolation of the skeleton segments. All the programs are written in C++ using OpenCV 2.0 in the UNIX platform Comparison of experimental results with other thinning methods Fig. 4. Thinning results of printed character images: (a) Input image; (b) contour image; (c) segmented contour-strokes of different colors; (d) skeleton segments; (e) skeleton image. We applied Datta and Parui (1994), Huang et al. (2003), and Telea et al. (2004) thinning methods on our own dataset, and

5 1840 S. Bag, G. Harit / Pattern Recognition Letters 32 (2011) Fig. 5. Experimental results: (a) input image; (b) contour image; (c) segmented contour-strokes of different colors; (d) skeleton segments; (e) skeleton image. Fig. 6. Comparison of experimental results among different thinning methods: (a) printed character; (b) Datta and Parui; (c) Huang et al.; (d) Telea et al.; (e) proposed method. compared the results with our proposed algorithm. Fig. 6 shows a comparison of experimental results among these four thinning methods. For quantitative comparison of experimental results, we introduce three types of thinning distortions: (1) shape distortion at junction and end points, (2) spurious strokes at high curvature regions, and (3) recession of stroke length. We formulate simple measures for quantifying the amount of different types of distortions present in the thinned characters. The shape distortion at junction points is measured as a fraction of the number of junction points which were distorted to the total number of junction points in all the thinned characters obtained using the method. A junction point is considered as distorted if the branches emanating from the junction point have local deformations just near the junction point. The distorted branches are identified by manually observing the thinned characters obtained by applying the particular thinning method. Shape distortion at junction points # distorted junction points ¼ total # junction points In the same way we quantify the shape distortion at the end points as a fraction of the number of end points which have shape distortion of the stroke to the total number of end points for all thinning images in the experimental data set. We quantify the distortion due to spurious strokes as the fraction of the number of spurious strokes at high curvature regions to the total number of strokes. ð1þ # spurious strokes at high curvature regions Spurious strokes measure ¼ total # strokes ð2þ Some thinning methods yield thinned characters having stroke length shorter than that of the same stroke in the given image. We measure the recession of stroke length as follows: Recession of stroke length ¼ X p Y p X p where X p represents total number of object pixels of the thinned image generated by Datta and Parui method and Y p denotes the total number of object pixels in the thinned image as produced by any other method. We observed that Datta and Parui method does not lead to any recession of stroke lengths. So, this type of distortion has been formulated with reference to the thinned image generated by Datta and Parui method. We see that Datta and Parui and Huang et al. thinning results suffer from high shape distortion at the junction and end points and produce large number of unwanted strokes which are not acceptable for stroke segmentation and character recognition. Telea et al. (2004) thinning results suffer from recession of skeleton length and distortion at junction points. Table 1 shows a comparative study of different thinning distortions among these three thinning methods and our proposed method. In this table, we have used four different labels (i.e., High, Medium, Low, and No) to represent the ranges of quantitative values (in %) of different thinning distortions. The ranges are: High: %, Medium: 31 70%, Low: 1 30%, and No: 0%. Finally, we conclude that our method is ð3þ

6 S. Bag, G. Harit / Pattern Recognition Letters 32 (2011) Table 1 Comparative study of different thinning distortions among four thinning methods. Method Shape distortion at Spurious Junction End strokes points points Recession of stroke length Datta and Parui High High High No Huang et al. High Medium High No Telea et al. Medium No No Medium Proposed method No No No No et al., and Telea et al. All skeletonization algorithms are written in C++ and run in a Intel Core 2 Duo 2.20 GHz computer. Table 2 shows a comparison of the average computational times (in ms) taken to thin a character using these four thinning methods. We see that the proposed method is much faster than other three methods. This is because the proposed approach is non-iterative and template free. It provides a direct strategy to find the medial axis, while other methods determine the medial axis by removing the boundary using different templates and lookup tables. 4. Conclusion This paper improves the performance of existent character thinning algorithms. The main challenge of thinning character images is to preserve the shape of characters after thinning. The proposed algorithm avoids shape distortion by detecting the ambiguous regions and avoiding the identification of medial axis points within these regions. The resultant skeleton maintains the pixel connectivity and is very close to the medial axis. Additionally, our algorithm provides strokes of character images as an intermediate result during the thinning process. We have compared our experimental results with three other thinning algorithms and have observed better performance compared to all other algorithms. The proposed thinning approach has a good potential for applications to improve the performance of shape analysis and classification problems. In future, we shall use this method as an important preprocessing step for designing Indian language OCR systems. References Fig. 7. Comparison of experimental results with You and Tang method: (a) input image; (b) You and Tang (2007) method; (c) proposed method. Table 2 Average times in milliseconds taken to thin characters using four different thinning methods. Language Datta and Parui Huang et al. Telea et al. Proposed method English Bengali Hindi Tamil free from all these thinning distortions and gives much better results for thinning character images by maintaining all the basic thinning properties as well. The major improvement is that the deformation of the skeletal structure at the junction and end points of the skeletal branches is not there with our results. We compared our method with a wavelet-based skeletonization method proposed by You and Tang (2007). In their method English and Chinese character sets are used for experimental purpose. As our method is language specific and we tested our method on English and other three Indian languages, so we took the input images and skeleton images of English alphabets, numerals, and words from their reported dataset for experimental purpose. We observe that the performance of our method is good and comparable with their method. Fig. 7 shows the comparison of experimental results in-between our proposed method and You Tang method Comparison of computational time We compared the computational time of the proposed method with three other thinning methods, i,e., Datta and Parui, Huang Altuwaijri, M., Bayoumi, M., A new thinning algorithm for Arabic characters using self-organizing neural network. In: Proc. IEEE Internat. Symposium on Circuits and Systems, pp Arcelli, C., Pattern thinning by contour tracing. Comput. Graphics Image Process. 17 (2), Arcelli, C., di Baja, G.S., A one-pass two-operation process to detect the skeletal pixels on the 4-distance transform. IEEE Trans. Pattern Anal. Machine Intell. 11 (4), Bansal, V., Sinha, R.M.K., Segmentation of touching and fused Devanagari characters. Pattern Recognition 35, Bertrand, G., Couprie, M., New 2D parallel thinning algorithms based on critical kernels. In: Proc. Internat. Workshop on Combinatorial Image Analysis, pp Casey, R.G., Lecolinet, E., A survey of methods and strategies in character segmentation. IEEE Trans. Pattern Anal. Machine Intell. 18 (7), Datta, A., Parui, S.K., A robust parallel thinning algorithm for binary images. Pattern Recognition 27 (9), Garain, U., Chaudhuri, B.B., Segmentation of touching characters in printed Devnagari and Bangla scripts using fuzzy multifactorial analysis. IEEE Trans. Systems, Man, Cybernet. Part C: Appl. Rev. 32 (4), Gu, X., Yu, D., Zhang, L., Image thinning using pulse coupled neural network. Pattern Recognition Lett. 25, Huang, L., Wan, G., Liu, C., An improved parallel thinning algorithm. In: Proc. Internat. Conf. on Document Analysis and Recognition, vol. 2, pp Krishnapuram, R., Chen, F., Implementation of parallel thinning algorithms using recurrent neural networks. IEEE Trans. Neural Netw. 4, Lam, L., Lee, S.W., Suen, C.Y., Thinning methodologies A comprehensive survey. IEEE Trans. Pattern Anal. Machine Intell. 14 (9), Leung, W., Ng, C.M., Yu, P.C., Contour following parallel thinning for simple binary images. In: Proc. IEEE Internat. Conf. on Systems, Man, and Cybernetics, vol. 3, pp Ma, H., Doermann, D., Adaptive Hindi OCR using generalized hausdorff image comparison. ACM Trans. Asian Language Inform. Process. 2, Martinez-Perez, M.P., Jimenez, J., Navalon, J.L., A thinning algorithm based on contours. Comput. Vision, Graphics. Image Process. 39 (2), Melhi, M., Ipson, S.S., Booth, W., A novel triangulation procedure for thinning hand-written text. Pattern Recognition Lett. 22, Otsu, N., A threshold selection method from gray-level histogram. IEEE Trans. Systems, Man, Cybernet. 9 (1), Pal, U., Datta, S., Segmentation of Bangla unconstrained handwritten text. In: Proc. Internat. Conf. on Document Analysis and Recognition, pp Shang, L., Yi, Z., Ji, L., Binary image thinning using autowaves generated by PCNN. Neural Process. Lett. 25, Shanthi, N., Duraiswamy, K., Preprocessing algorithms for the recognition of Tamil handwritten characters. In: Proc. Internat. CALIBER, pp Tang, Y.Y., You, X.G., Skeletonization of ribbon-like shapes based on a new wavelet function. IEEE Trans. Pattern Anal. Machine Intell. 25,

7 1842 S. Bag, G. Harit / Pattern Recognition Letters 32 (2011) Telea, A., Sminchisescu, C., Dickinson, S., Optimal inference for hierarchical skeleton abstraction. In: Proc. Internat. Conf. on Pattern Recognition, vol. 4, pp Vincze, M., K}ovári, B., Comparative survey of thinning algorithms. In: Proc. Internat. Symposium of Hungarian Researchers on Computational Intelligence and Informatics, pp You, X., Tang, Y.Y., Wavelet-based approach to character skeleton. IEEE Trans. Image Process. 16 (5), Zhang, T.Y., Suen, C.Y., A fast parallel algorithm for thinning digital patterns. Comm. ACM 27 (3), Zhu, X., Zhang, S., A shape-adaptive thinning method for binary images. In: Proc. Internat. Conf. on Cyberworlds,

Skeletonization Algorithm for Numeral Patterns

Skeletonization Algorithm for Numeral Patterns International Journal of Signal Processing, Image Processing and Pattern Recognition 63 Skeletonization Algorithm for Numeral Patterns Gupta Rakesh and Kaur Rajpreet Department. of CSE, SDDIET Barwala,

More information

Segmentation of Characters of Devanagari Script Documents

Segmentation of Characters of Devanagari Script Documents WWJMRD 2017; 3(11): 253-257 www.wwjmrd.com International Journal Peer Reviewed Journal Refereed Journal Indexed Journal UGC Approved Journal Impact Factor MJIF: 4.25 e-issn: 2454-6615 Manpreet Kaur Research

More information

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION Binod Kumar Prasad * * Bengal College of Engineering and Technology, Durgapur, W.B., India. Rajdeep Kundu 2 2 Bengal College

More information

A New Algorithm for Detecting Text Line in Handwritten Documents

A New Algorithm for Detecting Text Line in Handwritten Documents A New Algorithm for Detecting Text Line in Handwritten Documents Yi Li 1, Yefeng Zheng 2, David Doermann 1, and Stefan Jaeger 1 1 Laboratory for Language and Media Processing Institute for Advanced Computer

More information

International Journal of Signal Processing, Image Processing and Pattern Recognition Vol.9, No.2 (2016) Figure 1. General Concept of Skeletonization

International Journal of Signal Processing, Image Processing and Pattern Recognition Vol.9, No.2 (2016) Figure 1. General Concept of Skeletonization Vol.9, No.2 (216), pp.4-58 http://dx.doi.org/1.1425/ijsip.216.9.2.5 Skeleton Generation for Digital Images Based on Performance Evaluation Parameters Prof. Gulshan Goyal 1 and Ritika Luthra 2 1 Associate

More information

Simulation of Zhang Suen Algorithm using Feed- Forward Neural Networks

Simulation of Zhang Suen Algorithm using Feed- Forward Neural Networks Simulation of Zhang Suen Algorithm using Feed- Forward Neural Networks Ritika Luthra Research Scholar Chandigarh University Gulshan Goyal Associate Professor Chandigarh University ABSTRACT Image Skeletonization

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION CHAPTER 1 INTRODUCTION 1.1 Introduction Pattern recognition is a set of mathematical, statistical and heuristic techniques used in executing `man-like' tasks on computers. Pattern recognition plays an

More information

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network International Journal of Computer Science & Communication Vol. 1, No. 1, January-June 2010, pp. 91-95 Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network Raghuraj

More information

A two-stage approach for segmentation of handwritten Bangla word images

A two-stage approach for segmentation of handwritten Bangla word images A two-stage approach for segmentation of handwritten Bangla word images Ram Sarkar, Nibaran Das, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri #, Dipak Kumar Basu Computer Science & Engineering Department,

More information

Handwritten Devanagari Character Recognition Model Using Neural Network

Handwritten Devanagari Character Recognition Model Using Neural Network Handwritten Devanagari Character Recognition Model Using Neural Network Gaurav Jaiswal M.Sc. (Computer Science) Department of Computer Science Banaras Hindu University, Varanasi. India gauravjais88@gmail.com

More information

RESEARCH ON OPTIMIZATION OF IMAGE USING SKELETONIZATION TECHNIQUE WITH ADVANCED ALGORITHM

RESEARCH ON OPTIMIZATION OF IMAGE USING SKELETONIZATION TECHNIQUE WITH ADVANCED ALGORITHM 881 RESEARCH ON OPTIMIZATION OF IMAGE USING SKELETONIZATION TECHNIQUE WITH ADVANCED ALGORITHM Sarita Jain 1 Sumit Rana 2 Department of CSE 1 Department of CSE 2 Geeta Engineering College 1, Panipat, India

More information

FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT

FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT International Journal of Information Technology, Modeling and Computing (IJITMC) Vol. 2, No. 1, February 2014 FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT Shuchi Kapoor 1 and Vivek

More information

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Galaxy Bansal Dharamveer Sharma ABSTRACT Segmentation of handwritten words is a challenging task primarily because of structural features

More information

Recognition of Unconstrained Malayalam Handwritten Numeral

Recognition of Unconstrained Malayalam Handwritten Numeral Recognition of Unconstrained Malayalam Handwritten Numeral U. Pal, S. Kundu, Y. Ali, H. Islam and N. Tripathy C VPR Unit, Indian Statistical Institute, Kolkata-108, India Email: umapada@isical.ac.in Abstract

More information

4. RESULTS AND COMPARASIONS

4. RESULTS AND COMPARASIONS 80 4. RESULTS AND COMPARASIONS 4.1 Introduction The previous chapter describes the methodology of the proposed algorithm. In this chapter we discuss the results obtained by the proposed algorithms. We

More information

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation K. Roy, U. Pal and B. B. Chaudhuri CVPR Unit; Indian Statistical Institute, Kolkata-108; India umapada@isical.ac.in

More information

OCR For Handwritten Marathi Script

OCR For Handwritten Marathi Script International Journal of Scientific & Engineering Research Volume 3, Issue 8, August-2012 1 OCR For Handwritten Marathi Script Mrs.Vinaya. S. Tapkir 1, Mrs.Sushma.D.Shelke 2 1 Maharashtra Academy Of Engineering,

More information

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier Hetal R. Thaker Atmiya Institute of Technology & science, Kalawad

More information

A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts

A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts 25 A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts Rohit Sachdeva, Asstt. Prof., Computer Science Department, Multani Mal

More information

A Model-based Line Detection Algorithm in Documents

A Model-based Line Detection Algorithm in Documents A Model-based Line Detection Algorithm in Documents Yefeng Zheng, Huiping Li, David Doermann Laboratory for Language and Media Processing Institute for Advanced Computer Studies University of Maryland,

More information

Available online at ScienceDirect. Procedia Computer Science 45 (2015 )

Available online at  ScienceDirect. Procedia Computer Science 45 (2015 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 45 (2015 ) 205 214 International Conference on Advanced Computing Technologies and Applications (ICACTA- 2015) Automatic

More information

New Removal Operators for Surface Skeletonization

New Removal Operators for Surface Skeletonization New Removal Operators for Surface Skeletonization Carlo Arcelli, Gabriella Sanniti di Baja, and Luca Serino Institute of Cybernetics "E.Caianiello", CNR, Pozzuoli (Naples), Italy (c.arcelli, g.sannitidibaja,

More information

An Efficient Character Segmentation Based on VNP Algorithm

An Efficient Character Segmentation Based on VNP Algorithm Research Journal of Applied Sciences, Engineering and Technology 4(24): 5438-5442, 2012 ISSN: 2040-7467 Maxwell Scientific organization, 2012 Submitted: March 18, 2012 Accepted: April 14, 2012 Published:

More information

Input sensitive thresholding for ancient Hebrew manuscript

Input sensitive thresholding for ancient Hebrew manuscript Pattern Recognition Letters 26 (2005) 1168 1173 www.elsevier.com/locate/patrec Input sensitive thresholding for ancient Hebrew manuscript Itay Bar-Yosef * Department of Computer Science, Ben Gurion University,

More information

Layout Segmentation of Scanned Newspaper Documents

Layout Segmentation of Scanned Newspaper Documents , pp-05-10 Layout Segmentation of Scanned Newspaper Documents A.Bandyopadhyay, A. Ganguly and U.Pal CVPR Unit, Indian Statistical Institute 203 B T Road, Kolkata, India. Abstract: Layout segmentation algorithms

More information

INTERNATIONAL RESEARCH JOURNAL OF MULTIDISCIPLINARY STUDIES

INTERNATIONAL RESEARCH JOURNAL OF MULTIDISCIPLINARY STUDIES STUDIES & SPPP's, Karmayogi Engineering College, Pandharpur Organize National Conference Special Issue March 2016 Neuro-Fuzzy System based Handwritten Marathi System Numerals Recognition 1 Jayashri H Patil(Madane),

More information

THE description and representation of the shape of an object

THE description and representation of the shape of an object Enhancement of Shape Description and Representation by Slope Ali Salem Bin Samma and Rosalina Abdul Salam Abstract Representation and description of object shapes by the slopes of their contours or borders

More information

A New Method for Skeleton Pruning

A New Method for Skeleton Pruning A New Method for Skeleton Pruning Laura Alejandra Pinilla-Buitrago, José Fco. Martínez-Trinidad, and J.A. Carrasco-Ochoa Instituto Nacional de Astrofísica, Óptica y Electrónica Departamento de Ciencias

More information

Spotting Words in Latin, Devanagari and Arabic Scripts

Spotting Words in Latin, Devanagari and Arabic Scripts Spotting Words in Latin, Devanagari and Arabic Scripts Sargur N. Srihari, Harish Srinivasan, Chen Huang and Shravya Shetty {srihari,hs32,chuang5,sshetty}@cedar.buffalo.edu Center of Excellence for Document

More information

Character Recognition

Character Recognition Character Recognition 5.1 INTRODUCTION Recognition is one of the important steps in image processing. There are different methods such as Histogram method, Hough transformation, Neural computing approaches

More information

Segmentation Based Optical Character Recognition for Handwritten Marathi characters

Segmentation Based Optical Character Recognition for Handwritten Marathi characters Segmentation Based Optical Character Recognition for Handwritten Marathi characters Madhav Vaidya 1, Yashwant Joshi 2,Milind Bhalerao 3 Department of Information Technology 1 Department of Electronics

More information

Indian Multi-Script Full Pin-code String Recognition for Postal Automation

Indian Multi-Script Full Pin-code String Recognition for Postal Automation 2009 10th International Conference on Document Analysis and Recognition Indian Multi-Script Full Pin-code String Recognition for Postal Automation U. Pal 1, R. K. Roy 1, K. Roy 2 and F. Kimura 3 1 Computer

More information

Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier

Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier N. Sharma, U. Pal*, F. Kimura**, and S. Pal Computer Vision and Pattern Recognition Unit, Indian Statistical Institute

More information

Handwriting segmentation of unconstrained Oriya text

Handwriting segmentation of unconstrained Oriya text Sādhanā Vol. 31, Part 6, December 2006, pp. 755 769. Printed in India Handwriting segmentation of unconstrained Oriya text N TRIPATHY and U PAL Computer Vision and Pattern Recognition Unit, Indian Statistical

More information

1. INTRODUCTION. AMS Subject Classification. 68U10 Image Processing

1. INTRODUCTION. AMS Subject Classification. 68U10 Image Processing ANALYSING THE NOISE SENSITIVITY OF SKELETONIZATION ALGORITHMS Attila Fazekas and András Hajdu Lajos Kossuth University 4010, Debrecen PO Box 12, Hungary Abstract. Many skeletonization algorithms have been

More information

Handwritten Hindi Character Recognition System Using Edge detection & Neural Network

Handwritten Hindi Character Recognition System Using Edge detection & Neural Network Handwritten Hindi Character Recognition System Using Edge detection & Neural Network Tanuja K *, Usha Kumari V and Sushma T M Acharya Institute of Technology, Bangalore, India Abstract Handwritten recognition

More information

AN EFFICIENT BINARIZATION TECHNIQUE FOR FINGERPRINT IMAGES S. B. SRIDEVI M.Tech., Department of ECE

AN EFFICIENT BINARIZATION TECHNIQUE FOR FINGERPRINT IMAGES S. B. SRIDEVI M.Tech., Department of ECE AN EFFICIENT BINARIZATION TECHNIQUE FOR FINGERPRINT IMAGES S. B. SRIDEVI M.Tech., Department of ECE sbsridevi89@gmail.com 287 ABSTRACT Fingerprint identification is the most prominent method of biometric

More information

A New Approach to Detect and Extract Characters from Off-Line Printed Images and Text

A New Approach to Detect and Extract Characters from Off-Line Printed Images and Text Available online at www.sciencedirect.com Procedia Computer Science 17 (2013 ) 434 440 Information Technology and Quantitative Management (ITQM2013) A New Approach to Detect and Extract Characters from

More information

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script Arwinder Kaur 1, Ashok Kumar Bathla 2 1 M. Tech. Student, CE Dept., 2 Assistant Professor, CE Dept.,

More information

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes 2009 10th International Conference on Document Analysis and Recognition Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes Alireza Alaei

More information

Hidden Loop Recovery for Handwriting Recognition

Hidden Loop Recovery for Handwriting Recognition Hidden Loop Recovery for Handwriting Recognition David Doermann Institute of Advanced Computer Studies, University of Maryland, College Park, USA E-mail: doermann@cfar.umd.edu Nathan Intrator School of

More information

Word-wise Hand-written Script Separation for Indian Postal automation

Word-wise Hand-written Script Separation for Indian Postal automation Word-wise Hand-written Script Separation for Indian Postal automation K. Roy U. Pal Dept. of Comp. Sc. & Engg. West Bengal University of Technology, Sector 1, Saltlake City, Kolkata-64, India Abstract

More information

Segmentation of Bangla Handwritten Text

Segmentation of Bangla Handwritten Text Thesis Report Segmentation of Bangla Handwritten Text Submitted By: Sabbir Sadik ID:09301027 Md. Numan Sarwar ID: 09201027 CSE Department BRAC University Supervisor: Professor Dr. Mumit Khan Date: 13 th

More information

Extract an Essential Skeleton of a Character as a Graph from a Character Image

Extract an Essential Skeleton of a Character as a Graph from a Character Image Extract an Essential Skeleton of a Character as a Graph from a Character Image Kazuhisa Fujita University of Electro-Communications 1-5-1 Chofugaoka, Chofu, Tokyo, 182-8585 Japan k-z@nerve.pc.uec.ac.jp

More information

A Decision Tree Based Method to Classify Persian Handwritten Numerals by Extracting Some Simple Geometrical Features

A Decision Tree Based Method to Classify Persian Handwritten Numerals by Extracting Some Simple Geometrical Features A Decision Tree Based Method to Classify Persian Handwritten Numerals by Extracting Some Simple Geometrical Features Hamidreza Alvari, Seyed Mehdi Hazrati Fard, and Bahar Salehi Abstract Automatic recognition

More information

HCR Using K-Means Clustering Algorithm

HCR Using K-Means Clustering Algorithm HCR Using K-Means Clustering Algorithm Meha Mathur 1, Anil Saroliya 2 Amity School of Engineering & Technology Amity University Rajasthan, India Abstract: Hindi is a national language of India, there are

More information

Off-line Recognition of Hand-written Bengali Numerals using Morphological Features

Off-line Recognition of Hand-written Bengali Numerals using Morphological Features Off-line Recognition of Hand-written Bengali Numerals using Morphological Features Pulak Purkait and Bhabatosh Chanda ECSU, Indian Statistical Institute, Kolkata, India {pulak r, chanda}@isical.ac.in Abstract

More information

Image representation. 1. Introduction

Image representation. 1. Introduction Image representation Introduction Representation schemes Chain codes Polygonal approximations The skeleton of a region Boundary descriptors Some simple descriptors Shape numbers Fourier descriptors Moments

More information

Discrete 3D Tools Applied to 2D Grey-Level Images

Discrete 3D Tools Applied to 2D Grey-Level Images Discrete 3D Tools Applied to 2D Grey-Level Images Gabriella Sanniti di Baja 1, Ingela Nyström 2, and Gunilla Borgefors 3 1 Institute of Cybernetics "E.Caianiello", CNR, Pozzuoli, Italy gsdb@imagm.cib.na.cnr.it

More information

Time Stamp Detection and Recognition in Video Frames

Time Stamp Detection and Recognition in Video Frames Time Stamp Detection and Recognition in Video Frames Nongluk Covavisaruch and Chetsada Saengpanit Department of Computer Engineering, Chulalongkorn University, Bangkok 10330, Thailand E-mail: nongluk.c@chula.ac.th

More information

Extracting medial curves on 3D images

Extracting medial curves on 3D images Pattern Recognition Letters 23 (2002) 895 904 www.elsevier.com/locate/patrec Extracting medial curves on 3D images Cherng-Min Ma *, Shu-Yen Wan, Her-Kun Chang Department of Information Management, College

More information

Offline Tamil Handwritten Character Recognition using Chain Code and Zone based Features

Offline Tamil Handwritten Character Recognition using Chain Code and Zone based Features Offline Tamil Handwritten Character Recognition using Chain Code and Zone based Features M. Antony Robert Raj 1, S. Abirami 2 Department of Information Science and Technology Anna University, Chennai 600

More information

RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE

RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE K. Kaviya Selvi 1 and R. S. Sabeenian 2 1 Department of Electronics and Communication Engineering, Communication Systems, Sona College

More information

Restoring Warped Document Image Based on Text Line Correction

Restoring Warped Document Image Based on Text Line Correction Restoring Warped Document Image Based on Text Line Correction * Dep. of Electrical Engineering Tamkang University, New Taipei, Taiwan, R.O.C *Correspondending Author: hsieh@ee.tku.edu.tw Abstract Document

More information

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Utkarsh Dwivedi 1, Pranjal Rajput 2, Manish Kumar Sharma 3 1UG Scholar, Dept. of CSE, GCET, Greater Noida,

More information

International Journal of Computer & Organization Trends (IJCOT) Volume 6 Issue 4 July to August 2016

International Journal of Computer & Organization Trends (IJCOT) Volume 6 Issue 4 July to August 2016 To Propose an Improvement in Zhang-Suen Algorithm using Genetic Algorithm for Image Thinning Simrat Kaur Malik 1, Amrit Kaur 2 ¹Student, Department of Electronics and Communication, Punjabi University,

More information

Image Normalization and Preprocessing for Gujarati Character Recognition

Image Normalization and Preprocessing for Gujarati Character Recognition 334 Image Normalization and Preprocessing for Gujarati Character Recognition Jayashree Rajesh Prasad Department of Computer Engineering, Sinhgad College of Engineering, University of Pune, Pune, Mahaashtra

More information

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 2, ISSUE 1 JAN-2015

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 2, ISSUE 1 JAN-2015 Offline Handwritten Signature Verification using Neural Network Pallavi V. Hatkar Department of Electronics Engineering, TKIET Warana, India Prof.B.T.Salokhe Department of Electronics Engineering, TKIET

More information

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language I.J. Information Technology and Computer Science, 2013, 05, 38-43 Published Online April 2013 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijitcs.2013.05.05 A New Technique for Segmentation of Handwritten

More information

Online Bangla Handwriting Recognition System

Online Bangla Handwriting Recognition System 1 Online Bangla Handwriting Recognition System K. Roy Dept. of Comp. Sc. West Bengal University of Technology, BF 142, Saltlake, Kolkata-64, India N. Sharma, T. Pal and U. Pal Computer Vision and Pattern

More information

Texture recognition of medical images with the ICM method

Texture recognition of medical images with the ICM method Nuclear Instruments and Methods in Physics Research A 525 (2004) 387 391 Texture recognition of medical images with the ICM method Jason M. Kinser*, Guisong Wang George Mason University, Institute for

More information

Handwritten Script Recognition at Block Level

Handwritten Script Recognition at Block Level Chapter 4 Handwritten Script Recognition at Block Level -------------------------------------------------------------------------------------------------------------------------- Optical character recognition

More information

A Fast Caption Detection Method for Low Quality Video Images

A Fast Caption Detection Method for Low Quality Video Images 2012 10th IAPR International Workshop on Document Analysis Systems A Fast Caption Detection Method for Low Quality Video Images Tianyi Gui, Jun Sun, Satoshi Naoi Fujitsu Research & Development Center CO.,

More information

Handwritten Devanagari Character Recognition

Handwritten Devanagari Character Recognition Handwritten Devanagari Character Recognition Akhil Deshmukh, Rahul Meshram, Sachin Kendre, Kunal Shah Department of Computer Engineering Sinhgad Institute of Technology (SIT) Lonavala University of Pune,

More information

PCA-based Offline Handwritten Character Recognition System

PCA-based Offline Handwritten Character Recognition System Smart Computing Review, vol. 3, no. 5, October 2013 346 Smart Computing Review PCA-based Offline Handwritten Character Recognition System Munish Kumar 1, M. K. Jindal 2, and R. K. Sharma 3 1 Computer Science

More information

An Objective Evaluation Methodology for Handwritten Image Document Binarization Techniques

An Objective Evaluation Methodology for Handwritten Image Document Binarization Techniques An Objective Evaluation Methodology for Handwritten Image Document Binarization Techniques K. Ntirogiannis, B. Gatos and I. Pratikakis Computational Intelligence Laboratory, Institute of Informatics and

More information

Recognition-based Segmentation of Nom Characters from Body Text Regions of Stele Images Using Area Voronoi Diagram

Recognition-based Segmentation of Nom Characters from Body Text Regions of Stele Images Using Area Voronoi Diagram Author manuscript, published in "International Conference on Computer Analysis of Images and Patterns - CAIP'2009 5702 (2009) 205-212" DOI : 10.1007/978-3-642-03767-2 Recognition-based Segmentation of

More information

Data Hiding in Binary Text Documents 1. Q. Mei, E. K. Wong, and N. Memon

Data Hiding in Binary Text Documents 1. Q. Mei, E. K. Wong, and N. Memon Data Hiding in Binary Text Documents 1 Q. Mei, E. K. Wong, and N. Memon Department of Computer and Information Science Polytechnic University 5 Metrotech Center, Brooklyn, NY 11201 ABSTRACT With the proliferation

More information

Scene Text Detection Using Machine Learning Classifiers

Scene Text Detection Using Machine Learning Classifiers 601 Scene Text Detection Using Machine Learning Classifiers Nafla C.N. 1, Sneha K. 2, Divya K.P. 3 1 (Department of CSE, RCET, Akkikkvu, Thrissur) 2 (Department of CSE, RCET, Akkikkvu, Thrissur) 3 (Department

More information

Scanner Parameter Estimation Using Bilevel Scans of Star Charts

Scanner Parameter Estimation Using Bilevel Scans of Star Charts ICDAR, Seattle WA September Scanner Parameter Estimation Using Bilevel Scans of Star Charts Elisa H. Barney Smith Electrical and Computer Engineering Department Boise State University, Boise, Idaho 8375

More information

OTCYMIST: Otsu-Canny Minimal Spanning Tree for Born-Digital Images

OTCYMIST: Otsu-Canny Minimal Spanning Tree for Born-Digital Images OTCYMIST: Otsu-Canny Minimal Spanning Tree for Born-Digital Images Deepak Kumar and A G Ramakrishnan Medical Intelligence and Language Engineering Laboratory Department of Electrical Engineering, Indian

More information

Topic 6 Representation and Description

Topic 6 Representation and Description Topic 6 Representation and Description Background Segmentation divides the image into regions Each region should be represented and described in a form suitable for further processing/decision-making Representation

More information

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Ajay K. Talele Department of Electronics Dr..B.A.T.U. Lonere. Sanjay L Nalbalwar

More information

Devanagari Isolated Character Recognition by using Statistical features

Devanagari Isolated Character Recognition by using Statistical features Devanagari Isolated Character Recognition by using Statistical features ( Foreground Pixels Distribution, Zone Density and Background Directional Distribution feature and SVM Classifier) Mahesh Jangid

More information

On the use of Lexeme Features for writer verification

On the use of Lexeme Features for writer verification On the use of Lexeme Features for writer verification Anurag Bhardwaj, Abhishek Singh, Harish Srinivasan and Sargur Srihari Center of Excellence for Document Analysis and Recognition (CEDAR) University

More information

CoE4TN4 Image Processing

CoE4TN4 Image Processing CoE4TN4 Image Processing Chapter 11 Image Representation & Description Image Representation & Description After an image is segmented into regions, the regions are represented and described in a form suitable

More information

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 1

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 1 Minutiae Points Extraction using Biometric Fingerprint- Enhancement Vishal Wagh 1, Shefali Sonavane 2 1 Computer Science and Engineering Department, Walchand College of Engineering, Sangli, Maharashtra-416415,

More information

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images Karthik Ram K.V & Mahantesh K Department of Electronics and Communication Engineering, SJB Institute of Technology, Bangalore,

More information

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE OPTICAL HANDWRITTEN DEVNAGARI CHARACTER RECOGNITION USING ARTIFICIAL NEURAL NETWORK APPROACH JYOTI A.PATIL Ashokrao Mane Group of Institution, Vathar Tarf Vadgaon, India. DR. SANJAY R. PATIL Ashokrao Mane

More information

Speeding up Online Character Recognition

Speeding up Online Character Recognition K. Gupta, S. V. Rao, P. Viswanath, Speeding up Online Character Recogniton, Proceedings of Image and Vision Computing New Zealand 27, pp. 1, Hamilton, New Zealand, December 27. Speeding up Online Character

More information

Unique Journal of Engineering and Advanced Sciences Available online: Research Article

Unique Journal of Engineering and Advanced Sciences Available online:  Research Article ISSN 2348-375X Unique Journal of Engineering and Advanced Sciences Available online: www.ujconline.net Research Article DETECTION AND RECOGNITION OF THE TEXT THROUGH CONNECTED COMPONENT CLUSTERING AND

More information

Hand Written Telugu Character Recognition Using Bayesian Classifier

Hand Written Telugu Character Recognition Using Bayesian Classifier Hand Written Telugu Character Recognition Using Bayesian Classifier K.Mohana Lakshmi 1,K.Venkatesh 2,G.Sunaina 3, D.Sravani 4, P.Dayakar 5 ECE Deparment, JNTUH, CMR Technical Campus, Hyderabad, India.

More information

Morphological Approach for Segmentation of Scanned Handwritten Devnagari Text

Morphological Approach for Segmentation of Scanned Handwritten Devnagari Text Abstract In this paper we present a system towards the of Hindi Handwritten Devnagari Text. Segmentation of script is essential for handwritten script recognition. This system deals with of (matras) and

More information

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS 8.1 Introduction The recognition systems developed so far were for simple characters comprising of consonants and vowels. But there is one

More information

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (IJECIERD) ISSN 2249-684X Vol.2, Issue 3 Sep 2012 27-37 TJPRC Pvt. Ltd., HANDWRITTEN GURMUKHI

More information

A Document Image Analysis System on Parallel Processors

A Document Image Analysis System on Parallel Processors A Document Image Analysis System on Parallel Processors Shamik Sural, CMC Ltd. 28 Camac Street, Calcutta 700 016, India. P.K.Das, Dept. of CSE. Jadavpur University, Calcutta 700 032, India. Abstract This

More information

Character Segmentation for Telugu Image Document using Multiple Histogram Projections

Character Segmentation for Telugu Image Document using Multiple Histogram Projections Global Journal of Computer Science and Technology Graphics & Vision Volume 13 Issue 5 Version 1.0 Year 2013 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc.

More information

Keywords Connected Components, Text-Line Extraction, Trained Dataset.

Keywords Connected Components, Text-Line Extraction, Trained Dataset. Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Language Independent

More information

Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN

Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN Shamim Ahmed 1, Mohammod Abul Kashem 2 1 M.S. Student, Department of Computer Science and Engineering, Dhaka University of Engineering

More information

A Review on Different Character Segmentation Techniques for Handwritten Gurmukhi Scripts

A Review on Different Character Segmentation Techniques for Handwritten Gurmukhi Scripts WWJMRD2017; 3(10): 162-166 www.wwjmrd.com International Journal Peer Reviewed Journal Refereed Journal Indexed Journal UGC Approved Journal Impact Factor MJIF: 4.25 e-issn: 2454-6615 Manas Kaur Research

More information

Iterative Removing Salt and Pepper Noise based on Neighbourhood Information

Iterative Removing Salt and Pepper Noise based on Neighbourhood Information Iterative Removing Salt and Pepper Noise based on Neighbourhood Information Liu Chun College of Computer Science and Information Technology Daqing Normal University Daqing, China Sun Bishen Twenty-seventh

More information

Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE

Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE Rupali Vitthalrao Suryawanshi Department of Electronics Engineering, Bharatratna Indira Gandhi College,

More information

Finger Print Enhancement Using Minutiae Based Algorithm

Finger Print Enhancement Using Minutiae Based Algorithm Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 8, August 2014,

More information

A Simplistic Way of Feature Extraction Directed towards a Better Recognition Accuracy

A Simplistic Way of Feature Extraction Directed towards a Better Recognition Accuracy International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 3, Issue 7 (September 2012), PP. 43-49 A Simplistic Way of Feature Extraction Directed

More information

Line Net Global Vectorization: an Algorithm and Its Performance Evaluation

Line Net Global Vectorization: an Algorithm and Its Performance Evaluation Line Net Global Vectorization: an Algorithm and Its Performance Evaluation Jiqiang Song 1, Feng Su 1, Jibing Chen 1, Chiewlan Tai 2, and Shijie Cai 1 1 Department of Computer Science of Nanjing University,

More information

A Recognition System for Devnagri and English Handwritten Numerals

A Recognition System for Devnagri and English Handwritten Numerals A Recognition System for Devnagri and English Handwritten Numerals G S Lehal 1 and Nivedan Bhatt 2 1 Department of Computer Science & Engineering, Thapar Institute of Engineering & Technology, Patiala,

More information

Separation of Overlapping Text from Graphics

Separation of Overlapping Text from Graphics Separation of Overlapping Text from Graphics Ruini Cao, Chew Lim Tan School of Computing, National University of Singapore 3 Science Drive 2, Singapore 117543 Email: {caorn, tancl}@comp.nus.edu.sg Abstract

More information

Signature Recognition by Pixel Variance Analysis Using Multiple Morphological Dilations

Signature Recognition by Pixel Variance Analysis Using Multiple Morphological Dilations Signature Recognition by Pixel Variance Analysis Using Multiple Morphological Dilations H B Kekre 1, Department of Computer Engineering, V A Bharadi 2, Department of Electronics and Telecommunication**

More information

II. WORKING OF PROJECT

II. WORKING OF PROJECT Handwritten character Recognition and detection using histogram technique Tanmay Bahadure, Pranay Wekhande, Manish Gaur, Shubham Raikwar, Yogendra Gupta ABSTRACT : Cursive handwriting recognition is a

More information

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network 139 Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network Harmit Kaur 1, Simpel Rani 2 1 M. Tech. Research Scholar (Department of Computer Science & Engineering), Yadavindra College

More information