Pattern Recognition 42 (2009) Contents lists available at ScienceDirect. Pattern Recognition


Handwritten Chinese text line segmentation by clustering with distance metric learning

Fei Yin, Cheng-Lin Liu

National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences, 95 Zhongguancun East Road, Beijing, PR China

E-mail addresses: fyin@nlpr.ia.ac.cn (F. Yin), liucl@nlpr.ia.ac.cn (C.-L. Liu).

Article history: Received 7 August 2008. Received in revised form 21 November 2008. Accepted 20 December 2008.

Keywords: Handwritten text line segmentation; Clustering; Minimal spanning tree (MST); Distance metric learning; Hypervolume reduction

Abstract

Separating text lines in unconstrained handwritten documents remains a challenge because handwritten text lines are often non-uniformly skewed and curved, and the space between lines is not obvious. In this paper, we propose a novel text line segmentation algorithm based on minimal spanning tree (MST) clustering with distance metric learning. Given a distance metric, the connected components (CCs) of the document image are grouped into a tree structure, from which text lines are extracted by dynamically cutting the edges using a new hypervolume reduction criterion and a straightness measure. By learning the distance metric in a supervised manner on a dataset of pairs of CCs, the proposed algorithm is made robust enough to handle various documents with multi-skewed and curved text lines. In experiments on a database of 803 unconstrained handwritten Chinese document images containing a total of 8,169 lines, the proposed algorithm achieved a correct line detection rate of 98.02% and compared favorably to other competitive algorithms.

© 2009 Elsevier Ltd. All rights reserved.

1. Introduction

Text line segmentation from document images is one of the major problems in document image analysis. It provides crucial information for the tasks of text block segmentation, character segmentation and recognition, and text string recognition. Whereas the difficulty of machine-printed document analysis lies mainly in complex layout structure and degraded image quality, handwritten document analysis is difficult mainly because of the irregularity of layout and character shapes that results from the variability of writing styles. For unconstrained handwritten documents, text line segmentation and character segmentation-recognition remain unsolved, although enormous effort has been devoted to them and great advances have been made.

Text line segmentation of handwritten documents is much more difficult than that of printed documents. Unlike printed documents, which have approximately straight and parallel text lines, handwritten documents often have lines that are non-uniformly skewed and curved. Moreover, the spaces between handwritten text lines are often not obvious compared to the spaces between characters within a line, and some text lines may interfere with each other. Therefore, many text line detection techniques, such as projection analysis [1-7] and K-nearest neighbor connected components (CCs) grouping [12-14], are not able to segment handwritten text lines successfully. Fig. 1 shows an example of an unconstrained handwritten Chinese document with segmentation results from the X-Y cut algorithm [1], the stroke skew correction algorithm [6], the Docstrum algorithm [12] and the piece-wise projection algorithm [5]. In this case, the X-Y cut algorithm and the stroke skew correction algorithm succeed in detecting the text lines but fail to locate their boundaries. The Docstrum algorithm locates the boundaries of text lines very well, but fails to detect some lines correctly (the first and fourth lines in Fig. 1(c)) because of the anomalous size of characters. Although the piece-wise projection algorithm overcomes the aforementioned errors, it fails to segment some small CCs (the first and eighth lines in Fig. 1(d)).

Many efforts have been devoted to the difficult problem of handwritten text line segmentation [1-28]. The methods can be roughly categorized into three classes: top-down, bottom-up, and hybrid. Top-down methods partition the document image recursively into text regions, text lines, and words/characters under the assumption of straight lines. Bottom-up methods group small units of the image (pixels, CCs, characters, words, etc.) into text lines and then text regions. Bottom-up grouping can be viewed as a clustering process, which aggregates image components according to proximity and does not rely on the assumption of straight lines. Hybrid methods combine bottom-up grouping and top-down partitioning in different ways.

Fig. 1. An example of a handwritten document with text lines segmented by the X-Y cut algorithm (a), the stroke skew correction algorithm (b), the Docstrum algorithm (c) and the piece-wise projection algorithm (d).

All three approaches have their disadvantages. Top-down methods do not perform well on curved and overlapping text lines. The performance of bottom-up grouping relies on heuristic rules or artificial parameters, such as the between-component distance metric used for clustering. Hybrid methods, in turn, are computationally complicated, and the design of a robust combination scheme is non-trivial.

In this paper, we propose an effective bottom-up method for text line segmentation in unconstrained handwritten Chinese documents. Our approach is based on minimal spanning tree (MST) clustering of CCs, with the distance metric between CCs designed by supervised learning. The number of clusters, namely the number of text lines, is decided automatically by a new hypervolume reduction criterion. Except for some empirical parameters in the pre-processing of CCs and the post-processing of text lines, the clustering algorithm itself has no artificial parameters. An experimental comparison of clustering with a learned metric against clustering with an artificially designed metric shows that supervised metric learning substantially improves the accuracy of text line segmentation. The proposed method was also compared with other state-of-the-art methods in experiments on a large database of handwritten Chinese documents, and its superiority was demonstrated. By customizing the between-component features and training on documents of specific languages, we suggest that the proposed method is also applicable to documents in other languages.

The rest of this paper is organized as follows. In Section 2, we give a brief review of related work. An overall description of our clustering-based text line segmentation method is given in Section 3, and the distance metric learning scheme is elaborated in Section 4. In Section 5, we present the hypervolume reduction criterion and the straightness measure for text line grouping. Experimental results are presented in Section 6, followed by concluding remarks.

2. Previous works

The structure of a document image is a hierarchy of text regions, text lines, words, characters and CCs. Text lines can be extracted by top-down region partitioning, bottom-up component aggregation, or a hybrid scheme. Some representative segmentation methods are reviewed below.

The X-Y cut algorithm [1,2] is a typical projection-based top-down segmentation method. It uses horizontal and vertical projection histograms alternately along the X and Y axes to partition the document image into a hierarchical tree structure in which each leaf node represents a text line. Because it assumes parallel text lines and significant between-line gaps, this method performs well only on printed documents.

Some modified projection-based methods have been proposed to deal with slightly curved handwritten text lines. The piece-wise projection approaches partition the document image into several vertical strips [3-5]. The text lines, assumed to be approximately straight within a strip, are extracted from each strip according to horizontal projection profiles and then connected using heuristic rules. Su et al. [6] use the horizontal stroke histogram to detect the skew of handwritten Chinese documents and segment the text lines with the projection histogram along the estimated skew angle. Weliwitage et al. [7] describe a modified projection-based method called cut text minimization (CTM), in which an optimization technique minimizes the number of text pixels cut while tracking the boundary between two lines after the start points of lines have been found by projection. Similarly, Liwicki et al. [8] propose another improved projection-based method combined with slant detection, in which dynamic programming is used to find the paths between two consecutive lines.

From a different viewpoint, some researchers proposed water reservoir based top-down methods [9-11]. They assume that hypothetical water flows, from both left and right sides of the image frame, face obstruction from

characters of text lines, and the strips of area left un-wetted on the image frame are labeled for extracting text lines. A common trait of most top-down methods is that their performance relies on the assumption of well-separable text lines: approximately straight and parallel, either globally or locally within a region.

The Docstrum method of O'Gorman [12] is typical of bottom-up grouping. It merges neighboring CCs using rules based on the geometric relationship between K nearest neighbor units, and performs well on printed documents as well as on handwritten documents with slightly curved lines. Following similar ideas, the Voronoi diagram combined with heuristic rules is used in [14] to merge CCs into text lines. In [13], each CC is represented by the vertical coordinates of its bounding box, and the CCs are grouped by weighted k-means clustering under the spatial constraints of valid address lines. Likforman-Sulem and Faure [15] develop a method based on perceptual grouping, in which text lines are iteratively constructed by grouping neighboring CCs according to three Gestalt criteria: proximity, similarity and direction continuity. Although this method can integrate local constraints with a global measure, it cannot be applied to poorly structured documents.

Nicola et al. [16] use the artificial intelligence concept of a production system to search for an optimal alignment of CCs into text lines. Defining the initial state as the set of CCs in the un-segmented document and a possible alignment of the CCs (text lines) as the goal state, they give two operators (merge or do not merge a component with its adjacent text lines) for traversing states under a best path search framework. The reliance of this method on heuristic rules makes it ineffective for unconstrained handwritten documents. The grouping of components into text lines has also been treated using MST clustering [17,18]: the CCs are grouped by MST with a hand-crafted distance metric, and the edges between text lines are then deleted using heuristic rules. The performance of this approach relies on both the distance metric between components and the heuristic rules.

The Hough transform has also been applied to handwritten text line detection, with the gravity centers [19,20] or minima points [21] of CCs as the points to be fitted. Sometimes the CCs are split into equally spaced blocks to be voted in the Hough domain [22]. In general, Hough transform based methods need a sophisticated post-processing procedure to extract the lines and involve a high computational burden.

From a different viewpoint, some researchers proposed smearing-based bottom-up methods. Shi et al. use an adaptive local connectivity map (ALCM) [23] or a fuzzy run-length [24], in which the value of each pixel is the sum of all pixels in the original image within a specified horizontal distance. After thresholding the smeared image, the CCs represent probable regions of text lines. Kennard and Barrett use a similar method, slightly extended, to deal with freeform handwritten historical documents [25]. All the smearing-based methods, like other bottom-up ones, involve parameters to be tuned by hand.

Nicolas et al. treat document image segmentation as a labeling problem [26]. They partition the document image into an $n \times m$ grid, construct a Markov random field (MRF) model on the grid, and then label the grid pixels into a set of states. Their results show that this method does not perform robustly on handwritten documents.

The level set based method proposed by Li et al. [27] is an effective hybrid approach for unconstrained handwritten documents. After converting a binary image to gray scale using a continuous anisotropic Gaussian kernel, the level set method is exploited to determine the boundary between neighboring text lines. Although high accuracies of text line segmentation were reported, this method suffers from high computational complexity.

3. Clustering-based text line segmentation

In this section, we describe the rationale of our approach and the MST clustering algorithm. The distance metric learning and text line grouping techniques are elaborated in Sections 4 and 5, respectively. The performance of MST clustering relies on the metric of distance between image components. After clustering, the resulting tree is carefully cut into subtrees, each corresponding to a text line.

Fig. 2. Hierarchical structure of a document image.

3.1. Rationale

A document image can be viewed as a hierarchical structure as in Fig. 2: it consists of text lines, each text line consists of CCs, and a CC is made of black runs or pixels. Equivalently, a text line can be viewed as a cluster of stroke pixels or CCs. We prefer using CCs as the basic units of clustering because CCs are easy to detect and the number of CCs is much smaller than the number of stroke pixels. An important feature of this clustering problem is that all clusters (text lines) have irregular boundaries. We use the MST algorithm for clustering the CCs into text lines because it does not assume a spherical cluster structure in the underlying data, as many other clustering algorithms do.

Two important issues in clustering are the distance metric between units and the criterion for determining the number of clusters. A good metric for clustering CCs should meet the condition that the distance between two neighboring components in the same text line is smaller than that between components in different lines. In documents with close or interfering text lines, the Euclidean distance does not satisfy this condition. We previously used a hand-crafted distance metric [28], which works fairly well but not well enough. We hereby design a better metric by supervised learning on labeled pairs of CCs.
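To make the MST grouping step concrete, here is a minimal sketch of Kruskal's algorithm over a complete graph of components. It is an illustration only, not the authors' implementation: the function names, the toy centroids, and the use of plain Euclidean distance (standing in for the learned metric of Section 4) are all assumptions.

```python
from itertools import combinations
from math import hypot


def kruskal_mst(points, dist):
    """Return the MST edges (i, j, weight) of the complete graph over points."""
    parent = list(range(len(points)))

    def find(i):
        # Find the root of i's set, halving the path on the way up.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # All pairwise edges, sorted by increasing weight.
    edges = sorted(
        (dist(points[i], points[j]), i, j)
        for i, j in combinations(range(len(points)), 2)
    )
    tree = []
    for w, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:  # the edge joins two different subtrees: keep it
            parent[ri] = rj
            tree.append((i, j, w))
    return tree


def euclidean(p, q):
    # Assumption: a stand-in for the learned metric between two CCs.
    return hypot(p[0] - q[0], p[1] - q[1])


# Hypothetical centroids of six CCs forming two horizontal text lines.
centroids = [(0, 0), (1, 0), (2, 0), (0, 5), (1, 5), (2, 5)]
mst = kruskal_mst(centroids, euclidean)
```

In this toy layout the tree contains four within-line edges and a single between-line edge, which is the edge a later cutting stage would need to remove.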
By labeling some pairs as close (within the same text line) and others as distant (between lines), a distance metric can be learned automatically to meet the goal of small distances within a text line and large distances between lines. Under a learned distance metric, the tree generated by the MST algorithm has the desired characteristic that neighboring CCs of the same text line are connected and each line corresponds to a subtree (Fig. 3). However, the branches (paths between terminal and branching nodes) do not correspond to text lines perfectly because of the variability of text line layout. We hence use a second-stage clustering procedure to dynamically cut the edges of the tree into groups corresponding to text lines. The criterion for selecting the edge to cut and the criterion for stopping (which determines the number of clusters) are both important in this second stage. Simply deleting the longest edge is not reliable because the edges between different lines (red lines in Fig. 3) are not always longer than those within the same line. Our approach is to select the edge to cut such that the sum of hypervolumes of clusters

is reduced maximally, and to stop clustering when the measure of straightness of text lines reaches a maximum. From the above description, the framework of our approach can be depicted as in Fig. 4.

Fig. 3. MST of a document image.

Fig. 4. The framework of our approach.

3.2. Clustering algorithm

Our algorithm starts with a binary document image. In pre-processing, the CCs are labeled using a fast algorithm based on contour tracing [29]. Small components with few black pixels are considered noise and are removed. We then estimate the dominant character size from the component-size histogram obtained using the method in [12]. Empirically, components with height or width larger than three times the dominant character height are split vertically or horizontally using the touching character splitting method in [30], because they are most likely to contain touching characters and such big components affect the result of MST clustering.

Finally, each component is viewed as a node in a graph (the document graph). Each pair of nodes is linked by an edge weighted by the distance between them. The distance metric is designed to strengthen within-line links and weaken between-line links. From the weighted document graph, an MST is built using Kruskal's algorithm [31]. In the resulting tree, most edges correspond to within-line links and some correspond to between-line links. Since Kruskal's MST algorithm is well known and can easily be found in the literature, we do not give its details in this paper.

4. Distance metric learning

As many clustering algorithms rely critically on the distance metric between pairs of input units, some recent studies have addressed metric learning from data [32-34]. To improve the performance of fuzzy c-means clustering, an evolutionary algorithm was used to optimize the scales of the dimensions of the input data set [32]. Domeniconi [33] proposed a variant of the k-means algorithm in which individual Euclidean metric weights are learned for each cluster. Xing et al. [34] combined gradient descent and iterative projections to learn a Mahalanobis metric for k-means clustering. Inspired by these works, we design our distance metric for text line segmentation by supervised learning.

In our work, the definition of the distance between CCs is the key to making the generated MST place the components of the same text line in the same subtree and those of different lines in different subtrees. In Fig. 5, we give an example of MST clustering based on a hand-crafted metric [28] and on the learned metric proposed in this paper. In the figure, the between-line edges are marked with blue lines. We can observe that while the learned metric groups the CCs of the same text line in the same subtree (Fig. 5(b)), the hand-crafted metric splits some text lines into multiple subtrees (Fig. 5(a)).

4.1. Problem formulation

For supervised learning of the distance metric between CCs, we need training samples of component pairs labeled as within-line and between-line. To obtain them, we annotated some training document images using our ground-truthing tool GTLC (Ground-truthing tool for handwritten Chinese Text Lines and Characters) [42], which labels text lines and characters by automated transcript alignment and hand correction. Let $C = \{x_1, x_2, \ldots, x_n\}$ be the collection of CCs in a training document, where $n$ is the number of components. We obtain two sets of component pairs as the samples for metric learning:

$$S = \{(x_i, x_j) \mid x_i \text{ and } x_j \text{ belong to the same line}\},$$
$$D = \{(x_i, x_j) \mid x_i \text{ and } x_j \text{ belong to different lines}\}.$$

Since only spatially neighboring components are linked in the MST, we can discard many component pairs from the sample set to accelerate metric learning. To do this, we construct the area Voronoi diagram [14] of the training document, which represents the spatial adjacency between the components. A component $x_i$ is a neighbor of another component $x_j$ only if they are adjacent in the Voronoi diagram. The pairs that are not adjacent are removed from $S$ and $D$.

The aim of metric learning is to make the distance between components in $S$ small and the distance between components in $D$ large under the learned metric. Hence, we formulate metric learning as a convex programming problem [35]:

$$\min_{A \in \mathbb{R}^{m \times m}} \sum_{(x_i, x_j) \in S} \|x_i - x_j\|_A^2$$
$$\text{s.t.} \quad A \succeq 0, \quad \sum_{(x_i, x_j) \in D} \|x_i - x_j\|_A^2 \geq 1,$$

where $A \in \mathbb{R}^{m \times m}$ ($m$ is the dimensionality of the feature space characterizing component pairs) defines the distance metric:

$$d(x_i, x_j) = d_A(x_i, x_j) = \|x_i - x_j\|_A = \sqrt{v_{ij}^T A v_{ij}},$$

and $v_{ij}$ is the feature vector characterizing the relationship between components $x_i$ and $x_j$. $A$ is determined by solving the convex programming problem.

Fig. 5. The results of MST clustering with hand-crafted metric (a) and learned metric (b).

4.2. Feature space

The features characterizing the relationship between two components $x_i$ and $x_j$ are integral to the distance metric and strongly influence the performance of clustering. Below is the list of features (eight in total) that we use.

(1) Normalized horizontal and vertical distances between the centroids of two components.
The horizontal/vertical distance between the centroids of two CCs measures their spatial closeness. To generalize across documents (with differing font size and imaging resolution), this distance is normalized with respect to the character size (divided by the estimated dominant character size).

(2) Normalized horizontal and vertical overlapping degree.
If two components overlap horizontally (align vertically), the normalized horizontal overlap degree can be computed by [30]:

$$novlp_x = \frac{1}{2}\left(\frac{ovlp}{W_1} + \frac{ovlp}{W_2}\right) - \frac{dist}{span},$$

where $ovlp$ is the overlapping width of the two bounding boxes, $W_1$ and $W_2$ are the widths of the bounding boxes, $dist$ is the horizontal distance between the centers of the two bounding boxes, and $span$ is the spanning width of the two bounding boxes (Fig. 6). The normalized vertical overlap degree is computed similarly from the heights of the two bounding boxes.

Fig. 6. Definition of normalized horizontal overlap.

(3) Normalized horizontal and vertical minimum run-length.
The horizontal minimum run-length ($MRL_x$) is the horizontal run-length between vertically overlapping (horizontally aligned) CCs, wherein the minimum horizontal distance between black runs is taken as the distance measure (Fig. 7). It is likewise normalized with respect to the dominant character size of the document image. The vertical minimum run-length ($MRL_y$) is computed similarly and normalized with respect to the dominant character size.

Fig. 7. An example of minimum run-length (MRL).

(4) Height and width ratio of merged components.

Suppose two CCs are merged; the height ratio is then computed by:

$$R_{hei} = \frac{\max(H_1, H_2)}{span},$$

where $H_1$ and $H_2$ are the heights of the bounding boxes, and $span$ is the spanning height of the two bounding boxes. The width ratio is computed similarly from the widths of the two CCs.

5. Text line grouping

Although the learned distance metric encourages the components of the same text line to be connected in a subtree, some components from different lines are still connected. Because the lengths of between-line edges (distances between components) are not necessarily longer than those of within-line edges, correctly recognizing and cutting the between-line edges is non-trivial. Although several algorithms [36-39] have been proposed for this problem, they do not perform satisfactorily in our case of handwritten Chinese text line segmentation. The cutting results for the image in Fig. 3 using the algorithms of [37,39] are shown in Fig. 8, where many cutting or connection errors occur; these are marked with blue circles.

Our MST-based text line grouping process consists of two phases: in initial grouping, the MST is cut into subtrees using a hypervolume reduction criterion and a straightness measure; in post-processing, some remaining text line errors are corrected using heuristic rules.

5.1. Initial grouping

We use a criterion based on hypervolume [40] for selecting the edges of the MST to cut. After some edges are cut, each subtree corresponds to a cluster of CCs. The sum of hypervolumes of the clusters is computed to evaluate the partition:

$$F_v = \sum_{i=1}^{k} [\det(C_i)]^{1/2},$$

where $\det(C_i)$ is the determinant of the covariance matrix $C_i$ of cluster $i$, computed from the constituent black pixels of the CCs in the cluster. Initially, all the components in the MST are considered a single cluster, and every edge is deleted tentatively to split the cluster into two clusters (subtrees).
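The hypervolume sum $F_v$ defined above can be illustrated with a small pure-Python sketch. The pixel blocks and function names are hypothetical, and the population covariance (division by the number of pixels) is an assumption, since the text does not state which normalization is used.

```python
from math import sqrt


def covariance_det(pixels):
    """Determinant of the 2x2 covariance matrix of a list of (x, y) pixels."""
    n = len(pixels)
    mx = sum(x for x, _ in pixels) / n
    my = sum(y for _, y in pixels) / n
    sxx = sum((x - mx) ** 2 for x, _ in pixels) / n
    syy = sum((y - my) ** 2 for _, y in pixels) / n
    sxy = sum((x - mx) * (y - my) for x, y in pixels) / n
    return sxx * syy - sxy * sxy


def hypervolume_sum(clusters):
    """F_v: sum over clusters of sqrt(det(C_i)); clamps tiny negative dets."""
    return sum(sqrt(max(covariance_det(c), 0.0)) for c in clusters)


# Two hypothetical text lines, each a 20 x 3 block of black pixels.
line1 = [(x, y) for x in range(20) for y in range(3)]
line2 = [(x, y + 10) for x in range(20) for y in range(3)]

fv_one_cluster = hypervolume_sum([line1 + line2])  # both lines in one cluster
fv_two_clusters = hypervolume_sum([line1, line2])  # after cutting between them
reduction = fv_one_cluster - fv_two_clusters
```

Cutting the edge that separates the two blocks gives a large positive reduction, because each block alone has a much smaller vertical spread than the merged cluster.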
The edge whose deletion yields the maximal reduction of the $F_v$ measure is selected for deletion, so that the total $F_v$ measure of the document is minimized. We call this the maximum hypervolume reduction criterion, denoted by:

$$edge_{deleted} = \arg\max_{edge} \Delta F_v = \arg\max_{edge} [F_v(S_k) - F_v(S_{k+1})],$$

where $S_k = \{T_1, T_2, \ldots, T_k\}$ denotes the partition into $k$ disjoint subtrees ($S_1$ denotes the initial MST).

The $F_v$ measure cannot be used to determine the number of clusters, since it always decreases as the number of clusters increases. Fortunately, it is reasonable to assume rectangular shapes for the text lines (if a text line is curvilinear, it can be divided into several sublines that are approximately straight). We conjecture that when the number of clusters (partitioned text lines) is appropriate, a measure of straightness of the text lines reaches a maximum. We compute the total straightness measure as:

$$F_s = \sum_{i=1}^{k} \left(\frac{\lambda_{i1}}{\lambda_{i2}}\right)^2,$$

where $k$ is the number of clusters (partitioned text lines), and $\lambda_{i1}$ and $\lambda_{i2}$ ($\lambda_{i1} \geq \lambda_{i2}$) are the eigenvalues of the covariance matrix of cluster $i$. Our experiments demonstrate that the $F_s$ measure performs well in finding the number of clusters: the number of clusters at which $F_s$ is maximal fits well the actual number of text lines. An example is shown in Fig. 9. By iteratively deleting edges according to $F_v$, the total $F_v$ measure and the $F_s$ measure for an increasing number of clusters are shown in Fig. 9(a) and (b), respectively. We can see that $k = 5$, corresponding to the maximum of $F_s$, gives a preferable partition of text lines (Fig. 9(c)).

Fig. 8. Results of edge cutting for the image in Fig. 3 using the algorithm in [37] (a) and the algorithm in [39] (b). Cutting/connection errors are marked with blue circles. (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article.)

Fig. 9. Partitioning criteria for the document in Fig. 3. (a) $F_v$ as a function of the number of clusters; (b) $F_s$ as a function of the number of clusters; (c) the partitioned text lines.

5.2. Post-processing

After initial grouping, most text lines have been grouped correctly, but a few errors may still exist. For example, some lines are split into several pieces because of large within-line horizontal gaps, or some CCs are falsely grouped into other lines. Most of these errors can be corrected using heuristic rules similar to those in [27]. The post-processing procedure has the following steps:

(1) Estimate the orientation of the initial text lines using the least mean squared-error method, and estimate their height and width using the method in [30].

(2) If the length of a text line is shorter than 1/10 of the image width, or it contains fewer than three CCs and its height is larger than half of the average height of all lines, it is labeled as an isolated line. The other text lines are labeled as unprocessed lines.

(3) If the height of an unprocessed line is smaller than half of the average height of all lines, all of its CCs are labeled as isolated CCs. In an unprocessed line, if the distance between the centroid of a CC and the midline (which crosses the centroid of the text line and has the same orientation) is larger than half of the height of the text line, the CC is also labeled as an isolated CC.
(4) Select the longest unprocessed line and merge it with another unprocessed line if all of the following conditions are met: (a) the difference of their orientations is less than 15°; (b) the horizontal gap between their bounding boxes is less than 1/10 of the image width; (c) their bounding boxes overlap by more than 50% of the average height in the direction orthogonal to the average orientation. Mark the merged line as a processed line.

(5) Iterate Step 4 until there is no unprocessed line.

(6) Merge an isolated CC into the i-th text line if the distance between the centroid of the CC and the midline of the text line is smaller than the height of the text line. If no text line can be found to merge the isolated CC into, the CC is labeled as noise and deleted.

(7) Similarly to Step 6, merge the CCs of an isolated line into a processed line; if no text line can be found for a CC, it is kept in the isolated line, and if the isolated line still has CCs after all merges, it is labeled as a processed line.

6. Experimental results

We evaluated the performance of our algorithm on a large database of unconstrained handwritten Chinese documents and compared it with some existing reference algorithms. In the following, we briefly describe the database and evaluation methodology, outline the reference algorithms, and then present the experimental results.

6.1. Database

A large database of unconstrained Chinese handwritten documents, HIT-MW [41], was collected by Harbin Institute of Technology and is publicly available for free use. The database contains 853 text forms written by more than 780 writers. There are 8,677 text lines in total and each line has characters on average. Each document was scanned at a resolution of 300 DPI. A typical image size is approximately pixels, and each image contains 530 CCs on average. Fig. 10 shows two images from this database. Since the images in the HIT-MW database are not labeled at the CC level (only a part of the images have been segmented into text lines), we annotated all 853 document images using our ground-truthing tool GTLC [42].

Fig. 10. Example documents in the HIT-MW database.

6.2. Evaluation methodology

Several evaluation schemes have been proposed for document image segmentation [43-45], but they were designed for printed documents or graphics and evaluate performance based on bounding boxes. Bounding boxes are not appropriate for measuring handwritten text lines, which are often curved and multi-skewed. Therefore, we evaluate performance by counting the number of matches between the pixels of detected text lines and the pixels in the ground-truth data. Similarly to [27], we calculate the MatchScore matrix between detected text lines and ground-truthed lines:

$$MatchScore(i, j) = \frac{T(G_i \cap R_j)}{T(G_i \cup R_j)},$$

where $G_i$ is the set of pixels of the $i$-th ground-truthed text line, $R_j$ is the set of pixels of the $j$-th detected text line, and $T(S)$ is the cardinality of set $S$. The Hungarian algorithm is used to find a one-to-one correspondence between the detected text lines and the ground-truthed ones [46]. Since the number of lines in the two sets may differ, either a detected line or a ground-truthed line is allowed to be matched with a dummy line.

The performance is evaluated at the text line level. If a ground-truthed line and the corresponding detected line share at least 95% of pixels, the detected text line is claimed to be correct.
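As a rough illustration of this evaluation, the sketch below computes the MatchScore matrix on pixel sets and finds a one-to-one matching. Several things here are assumptions rather than the paper's procedure: a brute-force search over permutations replaces the Hungarian algorithm (so it only suits tiny inputs), the 95% criterion is interpreted as MatchScore >= 0.95, and `row` and the sample lines are hypothetical.

```python
from itertools import permutations


def match_score(g, r):
    """MatchScore(i, j): size of intersection over size of union of pixel sets."""
    union = len(g | r)
    return len(g & r) / union if union else 0.0


def best_assignment(ground_truth, detected):
    """One-to-one matching that maximizes the total MatchScore.

    Brute force over permutations, standing in for the Hungarian algorithm.
    The shorter list is padded with empty dummy lines.
    """
    n = max(len(ground_truth), len(detected))
    gt = ground_truth + [set()] * (n - len(ground_truth))
    det = detected + [set()] * (n - len(detected))
    scores = [[match_score(g, r) for r in det] for g in gt]
    best = max(permutations(range(n)),
               key=lambda p: sum(scores[i][p[i]] for i in range(n)))
    return [(i, best[i], scores[i][best[i]]) for i in range(n)]


def row(y, x0, x1):
    """Hypothetical helper: pixels of a one-pixel-high horizontal segment."""
    return {(x, y) for x in range(x0, x1)}


ground_truth = [row(0, 0, 100), row(10, 0, 100)]
detected = [row(10, 0, 100), row(0, 0, 97), row(20, 0, 5)]  # last one is spurious

pairs = best_assignment(ground_truth, detected)
# Count real ground-truth lines whose matched detection passes the threshold.
correct = sum(1 for i, _, s in pairs if i < len(ground_truth) and s >= 0.95)
```

Here both ground-truth lines find a detected line above the threshold, while the spurious detected line is matched to a dummy ground-truth line with score 0.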
The percentage of correctly detected text lines out of the groundtruthed lines gives the correct detection rate (recall rate), and the percentage of false lines out of the detected lines gives the error rate Reference algorithms In addition to comparison with our previous clustering algorithm with hand-crafted metric [28], we compared the hypervolume reduction criterion in text line grouping with other criteria in [37,39]. Then, we compared the performance of the proposed algorithm not only with two algorithms X Y cut [1] and Docstrum [12] that were designed for printed documents, but also with two algorithms stroke skew correction [6] and piece-wise projection [5] that were designed for segmenting handwritten text lines. The hand-crafted metric was formed using a subset of the features described in Section 4.2: it is the weighted combination of the horizontal minimum runlength and the Euclidean distance between the centroids of two CCs, with the weight determined by the normalized vertical overlapping degree. This empirical combination was found to perform fairly well. For fair comparison with other methods, we also optimize the weighting parameter on some ground-truthed document images. After MST clustering, the algorithms in [37,39] cut edges in difference ways. The one in [37] finds a global threshold of edge length according to the edge length histogram of the linkage graph, then, all the edges with length over the threshold are cut. The authors of [37] demonstrated that this method was more efficient than the algorithm in [36]. The algorithm in [39] measures each hypothesized cluster (subtree) using the standard deviation of edge lengths within the subtree. The edge selected to cut is the one that reduces the average standard deviation maximally. This is similar to our method of hypervolume reduction but it uses deviation of edge lengths instead of hypervolume. 
As a typical top-down method, the X-Y cut algorithm [1] builds a structural tree of the document by recursively analyzing the horizontal and vertical projection profiles of partitioned regions. The Docstrum algorithm [12] builds the document structure bottom-up by merging neighboring CCs. In our experiments, we used a public domain implementation of the X-Y cut and Docstrum algorithms [45]. The stroke skew correction algorithm [6] estimates the skew angle of text lines from the horizontal stroke histogram and then segments text lines using projection profiles after deskewing. The piece-wise projection algorithm [5] obtains an initial set of text lines from the piece-wise projection profiles; any obstructing CCs are then assigned to the line above or below according to a probability evaluated under a Gaussian assumption or a distance metric. This algorithm was shown to perform very well in segmenting English and Arabic text lines.

To obtain the best performance from the above algorithms (MST clustering with hand-crafted metric, X-Y cut, Docstrum, piece-wise projection), we use the Nelder-Mead simplex search method [47] to optimize their free parameters on ground-truthed data, as Mao et al. did in [45,48]. The hand-crafted metric has one weighting parameter to optimize. The X-Y cut algorithm and the Docstrum

algorithm each have four parameters optimized by simplex search, as done in [45]. For the piece-wise projection algorithm, the authors of [5] did not mention any free parameters. In our implementation, however, we found that two parameters need to be determined: the minimal difference and the minimal distance between the neighboring peak and valley of the projection histogram. The other strip-based projection methods were not evaluated in our experiments because they rely on many heuristic rules and free parameters, which are hard to tune for optimal performance.

Table 1
Correct rates of text line detection using learned and hand-crafted metrics.

Metric                                      Correct detection
Learned metric                              1051 (95.02%)
Hand-crafted metric (optimized weight)      1008 (91.14%)
Hand-crafted metric (empirical weight)      975 (88.16%)

Table 2
Correct rates of text line detection using different clustering criteria.

Criterion                                   Correct detection
Hypervolume reduction                       1051 (95.02%)
Criterion in [37]                           823 (74.41%)
Criterion in [39]                           341 (30.83%)

Performance comparison

We conducted three experiments: comparing metric learning with the hand-crafted metric, comparing hypervolume reduction in text line grouping with other MST clustering criteria, and comparing the proposed text line segmentation algorithm with four existing methods.

Comparing metric learning with hand-crafted metric

The example of Fig. 5 in Section 4 demonstrates that distance metric learning can clearly improve the performance of MST clustering of handwritten documents. To evaluate the performance quantitatively, we selected 150 images with complex layout from the HIT-MW database: 50 documents were used for distance metric learning and for optimizing the weighting parameter of the hand-crafted metric, and the remaining 100 documents, containing 1,106 text lines, were used for evaluation.
Previously, the weight of the hand-crafted metric was determined from the normalized vertical overlapping degree [28]. In this experiment, all the processing steps except the distance metric are the same. The correct rates of text line detection by MST clustering with the learned metric and the hand-crafted metric (with optimized weight and empirical weight) are shown in Table 1. We can see that distance metric learning improves the performance of text line segmentation significantly. Optimizing the weighting parameter of the hand-crafted metric also improves the performance considerably compared to the empirical weight.

Table 3
Correct rates and error rates of text line detection on 803 images.

Method                               Detected lines   Correct detection   Error rate (%)
Proposed (with post-processing)                       (98.02%)            2.47
Proposed (w/o post-processing)                        (95.75%)            7.37
X-Y cut [1]                                           (45.07%)
Docstrum [12]                                         (65.38%)
Stroke skew correction [6]                            (55.34%)
Piece-wise projection [5]                             (92.07%)            7.72

Comparing MST clustering criteria

To compare the text line segmentation performance of the proposed hypervolume reduction criterion with the other MST clustering criteria in [37,39], we used the same 150 images as in Section 6.4.1: 50 for metric learning and 100 for evaluation. For the three criteria compared, all the processing steps except the tree partitioning procedure are the same. The correct rates of text line detection on the 100 test images are shown in Table 2. We can see that the hypervolume reduction criterion yields the best performance. Since the criterion in [37] finds a global threshold of edge length, between-line edges shorter than the threshold cannot be deleted to separate the linked text lines. For the criterion in [39], we observed that the local minimum of the standard deviation reduction function always gives fewer clusters than the real number of text lines. This causes many text lines to be merged with each other.
On the contrary, our algorithm based on the hypervolume reduction criterion combined with the straightness measure of text lines mostly finds the correct number of real clusters.

Comparison with existing methods

To compare the performance of the proposed MST clustering-based text line segmentation algorithm with the X-Y cut, Docstrum, stroke skew correction and piece-wise projection algorithms, we randomly selected 50 images (containing 508 text lines) from the HIT-MW database for training (distance metric learning and parameter tuning), and used the remaining 803 images (containing 8,169 text lines) for evaluation. The correct rates (recall rates, the percentage of correctly detected lines out of all ground-truthed ones) and error rates (the percentage of erroneous text lines out of all detected ones) are shown in Table 3. To assess the effect of post-processing in the proposed clustering-based method, we evaluated the algorithm both with and without post-processing. From Table 3, we can see that post-processing is effective in improving the correct rate of text line segmentation. However, even without post-processing, the proposed method still yields a higher correct rate and a lower error rate than the four existing algorithms. Among the existing methods, the piece-wise projection algorithm yields the best performance. We observe that the X-Y cut and Docstrum algorithms tend to extract more text lines, but many of them do not match the ground-truthed lines. The stroke skew correction algorithm is designed for segmenting handwritten Chinese text lines, but it outperforms only the X-Y cut algorithm. The piece-wise projection algorithm achieves competitive results because it can tolerate the multi-skew and curvature of text lines to a certain extent. Fig. 11 shows the segmentation results of a document image using the X-Y cut, Docstrum, stroke skew correction, piece-wise projection and the proposed algorithm.
From the figure, we can see that only the proposed algorithm segments the text lines entirely correctly, while the X-Y cut, Docstrum and stroke skew correction algorithms generate many false text lines. Though the piece-wise projection algorithm finds almost all text lines, some small CCs are falsely segmented, such as in the last text line in Fig. 11(d). Overall, the proposed algorithm performs very well on unconstrained handwritten Chinese documents with multi-skewed and curved text lines.

The proposed clustering-based algorithm was implemented in C++. The overall processing time for an image with size of pixels and containing 1000 CCs (about characters) is about 2.5 seconds on a personal computer with CPU of Pentium GHz and 1 GB memory. This speed is acceptable.

We could not compare our method with the level set based method of Li et al. [27] because that method is non-trivial to implement and their image database is not available for our evaluation. Our algorithm and that of Li et al. were each compared with the X-Y

cut and Docstrum algorithms, and as a result both yielded significantly higher correct detection rates than the X-Y cut and Docstrum algorithms. Nevertheless, the level set based method turns out to be more computationally demanding: according to [27], the segmentation of an image of pixels costs about 20 seconds on a CPU of 1.6 GHz and 1 GB memory.

Fig. 11. Segmentation results of a document image by X-Y cut (a), Docstrum (b), stroke skew correction (c), piece-wise projection (d) and the proposed algorithm (e).

Error analysis

Although the proposed clustering-based method with metric learning performs well, it still makes some text line detection errors. The errors are mostly of two types: (1) error line splitting (ELS): a real text line is split into two or more lines (corresponding to multiple clusters); (2) error line merging (ELM): two or more real text lines are merged into a single cluster. We observed that ELS errors occur when characters are inserted in a line, such as those marked by blue circles in Fig. 12. ELM errors are mainly caused by the overlapping of two neighboring text lines, especially the touching of characters, such as that marked by a blue circle in Fig. 13. In this case, since the two text lines are connected by only a few touching characters, a post-processing procedure is necessary to separate them vertically.
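The two error types above can be counted automatically once ground-truthed and detected lines are available as pixel sets. The following sketch is an assumption-laden illustration, not the procedure used in this paper: a ground-truthed line is flagged as split when at least two detected lines each cover a sizeable share of its pixels, and a detected line is flagged as merged when it covers a sizeable share of at least two ground-truthed lines. The 20% share (`min_share`) and the function name are hypothetical choices.

```python
def classify_errors(gt_lines, det_lines, min_share=0.2):
    """Return (splits, merges) as lists of (line index, overlapping indices)."""
    def covers(g, r):
        # True if detected line r holds at least min_share of the pixels of g.
        return bool(g) and len(g & r) >= min_share * len(g)

    # ELS: one ground-truthed line spread over several detected lines.
    splits = []
    for i, g in enumerate(gt_lines):
        parts = [j for j, r in enumerate(det_lines) if covers(g, r)]
        if len(parts) >= 2:
            splits.append((i, parts))

    # ELM: one detected line swallowing several ground-truthed lines.
    merges = []
    for j, r in enumerate(det_lines):
        sources = [i for i, g in enumerate(gt_lines) if covers(g, r)]
        if len(sources) >= 2:
            merges.append((j, sources))
    return splits, merges


# Toy example: line 0 is split in two halves, lines 1 and 2 are merged.
gt = [{(x, y) for x in range(100)} for y in (0, 10, 20)]
det = [{(x, 0) for x in range(50)},
       {(x, 0) for x in range(50, 100)},
       gt[1] | gt[2]]
splits, merges = classify_errors(gt, det)
```

Here `splits` reports ground-truthed line 0 as covered by detected lines 0 and 1, and `merges` reports detected line 2 as covering ground-truthed lines 1 and 2. A small stray component assigned to the wrong line stays below the share threshold and is not flagged, which is a deliberate simplification.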

Fig. 12. An example of error line splitting.

Fig. 13. An example of error line merging.

7. Conclusion

We propose a new method for text line segmentation in unconstrained handwritten Chinese document images based on minimum spanning tree (MST) clustering with distance metric learning. This bottom-up method is able to segment multi-skewed, curved and slightly overlapping text lines. Except for some empirical parameters in the pre-processing of connected components (CCs) and the post-processing of text lines (which are easy to determine and do not influence the performance critically), this algorithm has no artificial parameters in clustering. In MST clustering, the metric of distance between CCs is learned on a dataset of pairs of components labeled as within-line or between-line. This avoids artificial tuning of the metric and improves the clustering performance significantly. The number of clusters is automatically determined by cutting the edges of the generated tree using a hypervolume reduction criterion and a straightness measure. The proposed algorithm was evaluated on a large database of unconstrained handwritten Chinese documents, and was demonstrated to be superior to some previous algorithms. In future work, the algorithm is to be further improved by refining the features of the distance metric and the post-processing procedure, and to be evaluated on document images of various languages by customizing the between-component features and training with document images of specific languages.

Acknowledgments

The authors would like to thank Tonghua Su for authorizing us to use the HIT-MW database, Zhenglong Li for discussions on distance metric learning, and Gang Liu and Yi Li for their suggestions on the experiments. This research was supported by the National Natural Science Foundation of China (NSFC) under grant nos and

References

[1] G. Nagy, S. Seth, M.
Viswanathan, A prototype document image analysis system for technical journals, Computer 25 (7) (1992)
[2] J. He, A.C. Downton, User-assisted archive document analysis for digital library construction, in: Proceedings of the Seventh International Conference on Document Analysis and Recognition, vol. 1, 2003, pp.
[3] A. Zahour, B. Taconet, P. Mercy, S. Ramdane, Arabic handwritten text-line extraction, in: Proceedings of the Sixth International Conference on Document Analysis and Recognition, 2001, pp.
[4] U. Pal, S. Datta, Segmentation of Bangla unconstrained handwritten text, in: Proceedings of the Seventh International Conference on Document Analysis and Recognition, vol. 2, 2003, pp.
[5] M. Arivazhagan, H. Srinivasan, S. Srihari, A statistical approach to line segmentation in handwritten documents, in: Document Recognition and Retrieval XIV, Proceedings of the SPIE, 2007, pp. 6500T
[6] T. Su, T. Zhang, H. Huang, Y. Zhou, Skew detection for Chinese handwriting by horizontal stroke histogram, in: Proceedings of the Ninth International Conference on Document Analysis and Recognition, 2007, pp.
[7] C. Weliwitage, A.L. Harvey, A.B. Jennings, Handwritten document offline text line segmentation, in: Proceedings of Digital Image Computing: Techniques and Applications, 2005, pp.
[8] M. Liwicki, E. Indermuehle, H. Bunke, On-line handwritten text line detection using dynamic programming, in: Proceedings of the Ninth International Conference on Document Analysis and Recognition, 2007, pp.
[9] S. Basu, C. Chaudhuri, M. Kundu, M. Nasipuri, D.K. Basu, Text line extraction from multi-skewed handwritten documents, Pattern Recognition 40 (6) (2007)
[10] U. Pal, P.P. Roy, Multioriented and curved text lines extraction from Indian documents, IEEE Transactions on Systems, Man and Cybernetics, Part B 34 (4) (2004)
[11] U. Pal, P.P. Roy, Text line extraction from India document, in: Proceedings of the Fifth International Conference on Advances in Pattern Recognition, 2003, pp.
[12] L. O'Gorman, The document spectrum for page layout analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence 15 (11) (1993)
[13] F. Kimura, Y. Miyake, M. Shridhar, Handwritten ZIP code recognition using lexicon free word recognition algorithm, in: Proceedings of the Third International Conference on Document Analysis and Recognition, 1995, pp.
[14] K. Kise, A. Sato, M. Iwata, Segmentation of page images using the area Voronoi diagram, Computer Vision and Image Understanding 70 (3) (1998)
[15] L. Likforman-Sulem, C. Faure, Extracting lines on handwritten document by perceptual grouping, in: Advances in Handwriting and Drawing: A Multidisciplinary Approach, 1994, pp.
[16] S. Nicola, T. Paquet, L. Heutte, Text line segmentation in handwritten document using a production system, in: Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition, 2004, pp.
[17] I.S.I. Abuhaiba, S. Datta, M.J.J. Holt, Line extraction and stroke ordering of text pages, in: Proceedings of the Third International Conference on Document Analysis and Recognition, vol. 1, 1995, pp.
[18] A. Simon, J.-C. Pret, A.P. Johnson, A fast algorithm for bottom-up document layout analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence 19 (3) (1997)
[19] B. Yu, A. Jain, A robust and fast skew detection algorithm for generic document, Pattern Recognition 29 (10) (1996)
[20] Y. Pu, Z. Shi, A natural learning algorithm based on Hough transform for text lines extraction in handwritten document, in: Proceedings of the Sixth International Workshop on Frontiers in Handwriting Recognition, 1998, pp.
[21] L. Likforman-Sulem, A. Hanimyan, C. Faure, A Hough based algorithm for extracting text lines in handwritten documents, in: Proceedings of the Third International Conference on Document Analysis and Recognition, 1995, pp.
[22] G. Louloudis, B. Gatos, I. Pratikakis, K. Halatis, A block-based Hough transform mapping for text line detection in handwritten document, in: Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition, 2006, pp.
[23] Z. Shi, S. Setlur, V. Govindaraju, Text extraction from gray scale historical document image using adaptive local connectivity map, in: Proceedings of the Eighth International Conference on Document Analysis and Recognition, vol. 2, 2005, pp.


More information

CS 231A Computer Vision (Fall 2012) Problem Set 3

CS 231A Computer Vision (Fall 2012) Problem Set 3 CS 231A Computer Vision (Fall 2012) Problem Set 3 Due: Nov. 13 th, 2012 (2:15pm) 1 Probabilistic Recursion for Tracking (20 points) In this problem you will derive a method for tracking a point of interest

More information

A Touching Character Database from Chinese Handwriting for Assessing Segmentation Algorithms

A Touching Character Database from Chinese Handwriting for Assessing Segmentation Algorithms 2012 International Conference on Frontiers in Handwriting Recognition A Touching Character Database from Chinese Handwriting for Assessing Segmentation Algorithms Liang Xu, Fei Yin, Qiu-Feng Wang, Cheng-Lin

More information

Clustering CS 550: Machine Learning

Clustering CS 550: Machine Learning Clustering CS 550: Machine Learning This slide set mainly uses the slides given in the following links: http://www-users.cs.umn.edu/~kumar/dmbook/ch8.pdf http://www-users.cs.umn.edu/~kumar/dmbook/dmslides/chap8_basic_cluster_analysis.pdf

More information

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script Arwinder Kaur 1, Ashok Kumar Bathla 2 1 M. Tech. Student, CE Dept., 2 Assistant Professor, CE Dept.,

More information

Segmentation of Characters of Devanagari Script Documents

Segmentation of Characters of Devanagari Script Documents WWJMRD 2017; 3(11): 253-257 www.wwjmrd.com International Journal Peer Reviewed Journal Refereed Journal Indexed Journal UGC Approved Journal Impact Factor MJIF: 4.25 e-issn: 2454-6615 Manpreet Kaur Research

More information

A Block-Based Hough Transform Mapping for Text Line Detection in Handwritten Documents

A Block-Based Hough Transform Mapping for Text Line Detection in Handwritten Documents A Bloc-Based Hough Transform Mapping for Text Line Detection in Handwritten Documents G. Louloudis, B. Gatos, I. Pratiais, K. Halatsis To cite this version: G. Louloudis, B. Gatos, I. Pratiais, K. Halatsis.

More information

Handwritten Word Recognition using Conditional Random Fields

Handwritten Word Recognition using Conditional Random Fields Handwritten Word Recognition using Conditional Random Fields Shravya Shetty Harish Srinivasan Sargur Srihari Center of Excellence for Document Analysis and Recognition (CEDAR) Department of Computer Science

More information

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images Karthik Ram K.V & Mahantesh K Department of Electronics and Communication Engineering, SJB Institute of Technology, Bangalore,

More information

Skew Detection for Complex Document Images Using Fuzzy Runlength

Skew Detection for Complex Document Images Using Fuzzy Runlength Skew Detection for Complex Document Images Using Fuzzy Runlength Zhixin Shi and Venu Govindaraju Center of Excellence for Document Analysis and Recognition(CEDAR) State University of New York at Buffalo,

More information

An Efficient Character Segmentation Based on VNP Algorithm

An Efficient Character Segmentation Based on VNP Algorithm Research Journal of Applied Sciences, Engineering and Technology 4(24): 5438-5442, 2012 ISSN: 2040-7467 Maxwell Scientific organization, 2012 Submitted: March 18, 2012 Accepted: April 14, 2012 Published:

More information

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION 6.1 INTRODUCTION Fuzzy logic based computational techniques are becoming increasingly important in the medical image analysis arena. The significant

More information

A two-stage approach for segmentation of handwritten Bangla word images

A two-stage approach for segmentation of handwritten Bangla word images A two-stage approach for segmentation of handwritten Bangla word images Ram Sarkar, Nibaran Das, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri #, Dipak Kumar Basu Computer Science & Engineering Department,

More information

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Galaxy Bansal Dharamveer Sharma ABSTRACT Segmentation of handwritten words is a challenging task primarily because of structural features

More information

Clustering. CS294 Practical Machine Learning Junming Yin 10/09/06

Clustering. CS294 Practical Machine Learning Junming Yin 10/09/06 Clustering CS294 Practical Machine Learning Junming Yin 10/09/06 Outline Introduction Unsupervised learning What is clustering? Application Dissimilarity (similarity) of objects Clustering algorithm K-means,

More information

CS 223B Computer Vision Problem Set 3

CS 223B Computer Vision Problem Set 3 CS 223B Computer Vision Problem Set 3 Due: Feb. 22 nd, 2011 1 Probabilistic Recursion for Tracking In this problem you will derive a method for tracking a point of interest through a sequence of images.

More information

Available online at ScienceDirect. Procedia Computer Science 45 (2015 )

Available online at  ScienceDirect. Procedia Computer Science 45 (2015 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 45 (2015 ) 205 214 International Conference on Advanced Computing Technologies and Applications (ICACTA- 2015) Automatic

More information

OTCYMIST: Otsu-Canny Minimal Spanning Tree for Born-Digital Images

OTCYMIST: Otsu-Canny Minimal Spanning Tree for Born-Digital Images OTCYMIST: Otsu-Canny Minimal Spanning Tree for Born-Digital Images Deepak Kumar and A G Ramakrishnan Medical Intelligence and Language Engineering Laboratory Department of Electrical Engineering, Indian

More information

Query-Sensitive Similarity Measure for Content-Based Image Retrieval

Query-Sensitive Similarity Measure for Content-Based Image Retrieval Query-Sensitive Similarity Measure for Content-Based Image Retrieval Zhi-Hua Zhou Hong-Bin Dai National Laboratory for Novel Software Technology Nanjing University, Nanjing 2193, China {zhouzh, daihb}@lamda.nju.edu.cn

More information

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language I.J. Information Technology and Computer Science, 2013, 05, 38-43 Published Online April 2013 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijitcs.2013.05.05 A New Technique for Segmentation of Handwritten

More information

Text Area Detection from Video Frames

Text Area Detection from Video Frames Text Area Detection from Video Frames 1 Text Area Detection from Video Frames Xiangrong Chen, Hongjiang Zhang Microsoft Research China chxr@yahoo.com, hjzhang@microsoft.com Abstract. Text area detection

More information

A Model-based Line Detection Algorithm in Documents

A Model-based Line Detection Algorithm in Documents A Model-based Line Detection Algorithm in Documents Yefeng Zheng, Huiping Li, David Doermann Laboratory for Language and Media Processing Institute for Advanced Computer Studies University of Maryland,

More information

A Generalized Method to Solve Text-Based CAPTCHAs

A Generalized Method to Solve Text-Based CAPTCHAs A Generalized Method to Solve Text-Based CAPTCHAs Jason Ma, Bilal Badaoui, Emile Chamoun December 11, 2009 1 Abstract We present work in progress on the automated solving of text-based CAPTCHAs. Our method

More information

Prototype Selection for Handwritten Connected Digits Classification

Prototype Selection for Handwritten Connected Digits Classification 2009 0th International Conference on Document Analysis and Recognition Prototype Selection for Handwritten Connected Digits Classification Cristiano de Santana Pereira and George D. C. Cavalcanti 2 Federal

More information

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network International Journal of Computer Science & Communication Vol. 1, No. 1, January-June 2010, pp. 91-95 Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network Raghuraj

More information

University of Florida CISE department Gator Engineering. Clustering Part 4

University of Florida CISE department Gator Engineering. Clustering Part 4 Clustering Part 4 Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida, Gainesville DBSCAN DBSCAN is a density based clustering algorithm Density = number of

More information

Scene Text Detection Using Machine Learning Classifiers

Scene Text Detection Using Machine Learning Classifiers 601 Scene Text Detection Using Machine Learning Classifiers Nafla C.N. 1, Sneha K. 2, Divya K.P. 3 1 (Department of CSE, RCET, Akkikkvu, Thrissur) 2 (Department of CSE, RCET, Akkikkvu, Thrissur) 3 (Department

More information

Lesson 3. Prof. Enza Messina

Lesson 3. Prof. Enza Messina Lesson 3 Prof. Enza Messina Clustering techniques are generally classified into these classes: PARTITIONING ALGORITHMS Directly divides data points into some prespecified number of clusters without a hierarchical

More information

Text Line Detection for Heterogeneous Documents

Text Line Detection for Heterogeneous Documents Text Line Detection for Heterogeneous Documents Markus Diem, Florian Kleber and Robert Sablatnig Computer Vision Lab Vienna University of Technology Email: diem@caa.tuwien.ac.at Abstract Text line detection

More information

Online Bangla Handwriting Recognition System

Online Bangla Handwriting Recognition System 1 Online Bangla Handwriting Recognition System K. Roy Dept. of Comp. Sc. West Bengal University of Technology, BF 142, Saltlake, Kolkata-64, India N. Sharma, T. Pal and U. Pal Computer Vision and Pattern

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION CHAPTER 1 INTRODUCTION 1.1 Introduction Pattern recognition is a set of mathematical, statistical and heuristic techniques used in executing `man-like' tasks on computers. Pattern recognition plays an

More information

Text lines and snippets extraction for 19th century handwriting documents layout analysis

Text lines and snippets extraction for 19th century handwriting documents layout analysis Author manuscript, published in "2009 10th International Conference on Document Analysis and Recognition, Barcelona : Spain (2009)" Text lines and snippets extraction for 19th century handwriting documents

More information

Clustering Part 4 DBSCAN

Clustering Part 4 DBSCAN Clustering Part 4 Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida, Gainesville DBSCAN DBSCAN is a density based clustering algorithm Density = number of

More information

Slant Correction using Histograms

Slant Correction using Histograms Slant Correction using Histograms Frank de Zeeuw Bachelor s Thesis in Artificial Intelligence Supervised by Axel Brink & Tijn van der Zant July 12, 2006 Abstract Slant is one of the characteristics that

More information

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 11, November 2014,

More information

RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE

RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE K. Kaviya Selvi 1 and R. S. Sabeenian 2 1 Department of Electronics and Communication Engineering, Communication Systems, Sona College

More information

A System to Automatically Index Genealogical Microfilm Titleboards Introduction Preprocessing Method Identification

A System to Automatically Index Genealogical Microfilm Titleboards Introduction Preprocessing Method Identification A System to Automatically Index Genealogical Microfilm Titleboards Samuel James Pinson, Mark Pinson and William Barrett Department of Computer Science Brigham Young University Introduction Millions of

More information

Tracing and Straightening the Baseline in Handwritten Persian/Arabic Text-line: A New Approach Based on Painting-technique

Tracing and Straightening the Baseline in Handwritten Persian/Arabic Text-line: A New Approach Based on Painting-technique Tracing and Straightening the Baseline in Handwritten Persian/Arabic Text-line: A New Approach Based on Painting-technique P. Nagabhushan and Alireza Alaei 1,2 Department of Studies in Computer Science,

More information

Image Segmentation Based on Watershed and Edge Detection Techniques

Image Segmentation Based on Watershed and Edge Detection Techniques 0 The International Arab Journal of Information Technology, Vol., No., April 00 Image Segmentation Based on Watershed and Edge Detection Techniques Nassir Salman Computer Science Department, Zarqa Private

More information

Enhancing Clustering Results In Hierarchical Approach By Mvs Measures

Enhancing Clustering Results In Hierarchical Approach By Mvs Measures International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 10, Issue 6 (June 2014), PP.25-30 Enhancing Clustering Results In Hierarchical Approach

More information

HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation

HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation 009 10th International Conference on Document Analysis and Recognition HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation Yaregal Assabie and Josef Bigun School of Information Science,

More information

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Ajay K. Talele Department of Electronics Dr..B.A.T.U. Lonere. Sanjay L Nalbalwar

More information

A Fast Caption Detection Method for Low Quality Video Images

A Fast Caption Detection Method for Low Quality Video Images 2012 10th IAPR International Workshop on Document Analysis Systems A Fast Caption Detection Method for Low Quality Video Images Tianyi Gui, Jun Sun, Satoshi Naoi Fujitsu Research & Development Center CO.,

More information

Clustering. Informal goal. General types of clustering. Applications: Clustering in information search and analysis. Example applications in search

Clustering. Informal goal. General types of clustering. Applications: Clustering in information search and analysis. Example applications in search Informal goal Clustering Given set of objects and measure of similarity between them, group similar objects together What mean by similar? What is good grouping? Computation time / quality tradeoff 1 2

More information

Segmentation of Bangla Handwritten Text

Segmentation of Bangla Handwritten Text Thesis Report Segmentation of Bangla Handwritten Text Submitted By: Sabbir Sadik ID:09301027 Md. Numan Sarwar ID: 09201027 CSE Department BRAC University Supervisor: Professor Dr. Mumit Khan Date: 13 th

More information

Determining Document Skew Using Inter-Line Spaces

Determining Document Skew Using Inter-Line Spaces 2011 International Conference on Document Analysis and Recognition Determining Document Skew Using Inter-Line Spaces Boris Epshtein Google Inc. 1 1600 Amphitheatre Parkway, Mountain View, CA borisep@google.com

More information

12/12 A Chinese Words Detection Method in Camera Based Images Qingmin Chen, Yi Zhou, Kai Chen, Li Song, Xiaokang Yang Institute of Image Communication

12/12 A Chinese Words Detection Method in Camera Based Images Qingmin Chen, Yi Zhou, Kai Chen, Li Song, Xiaokang Yang Institute of Image Communication A Chinese Words Detection Method in Camera Based Images Qingmin Chen, Yi Zhou, Kai Chen, Li Song, Xiaokang Yang Institute of Image Communication and Information Processing, Shanghai Key Laboratory Shanghai

More information

EE 701 ROBOT VISION. Segmentation

EE 701 ROBOT VISION. Segmentation EE 701 ROBOT VISION Regions and Image Segmentation Histogram-based Segmentation Automatic Thresholding K-means Clustering Spatial Coherence Merging and Splitting Graph Theoretic Segmentation Region Growing

More information

Character Segmentation for Telugu Image Document using Multiple Histogram Projections

Character Segmentation for Telugu Image Document using Multiple Histogram Projections Global Journal of Computer Science and Technology Graphics & Vision Volume 13 Issue 5 Version 1.0 Year 2013 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc.

More information

Writer Recognizer for Offline Text Based on SIFT

Writer Recognizer for Offline Text Based on SIFT Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 5, May 2015, pg.1057

More information