Shape Retrieval Based on Dynamic Programming

Size: px
Start display at page:

Download "Shape Retrieval Based on Dynamic Programming"

Transcription

1 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 9, NO. 1, JANUARY # can be ound by solving the ollowing system o equations: z =k H i # k = z =k H i^ i : (16) 3 Repeat 1 and 2 until cluster membership is unchanged. Since the number o objects is speciied in the beginning, the value o variance o noise does not aect the inal result and we do not have to speciy. The convergence in the second stage is guaranteed as the cost unction always decreases. Actually, i starting rom the same initial set o cluster centers, the second stage gives the same result as the two-step iterative procedure used in [2] does. 1 However, our algorithm requires much less computations, considering that ^ i and H i are already obtained rom the least-squares minimization o S i. Note also, that the above algorithm merges nonadjacent regions as well, which is not the case or some methods [1], [5], [9]. IV. RESULTS In Figs. 2 and 3, we show the results o applying the proposed merging algorithm on standard test sequences. The initial segmentation was obtained with the morphological multiscale technique [7]. The results or table tennis and lower garden sequences were obtained with optic low matching. As measurements y(x) we used the dense optic low ield, computed rom two successive images using the hierarchical method in [3]. For the calendar sequence, we used the model, which is based on the linearized intensity matching equation [6]. The improvements due to the use o the new similarity measure are conirmed by comparison with Fig. 2(c) and (d) in which we show the results o the existing methods in [2] and [9] applied or the same set o initial regions. More elaborate evaluation can be ound in [6]. [3] J. R. Bergen, P. Anandan, K. J. Hanna, and R. Hingorani, Hierarchical model-based motion estimation, in Proc. 2nd Eur. Con. Computer Vision, 1992, pp [4] P. J. Huber, Robust Statistic. New York: Wiley, [5] F. Moscheni, S. Bhattacharjee, and M. Kunt, Spatiotemporal segmentation based on region merging, IEEE Trans. Pattern Anal. Machine Intell., vol. 20, no. 9, pp , Sept [6] Intell. Sensory Inorm. Syst. Grp., Tech. Rep. 4,, Univ. Amsterdam, The Netherlands, available at rein/isis.html, Dec [7] P. Salembier, Morphological multiscale segmentation or image coding, Signal Process., vol. 38, pp , Sept [8] J. Y. A. Wang and E. H. Adelson, Representing moving images with layers, IEEE Trans. Image Processing, vol. 3, pp , Sept [9] L. Wu, J. Benois-Pineau, Ph. Delagnes, and D. Barba, Spatio temporal segmentation o image sequences or object-oriented low bit-rate image coding, Signal Process: Image Commun., vol. 8, pp , Sept Shape Retrieval Based on Dynamic Programming Evangelos Milios and Euripides G. M. Petrakis Abstract We propose a shape matching algorithm or deormed shapes based on dynamic programming. Our algorithm is capable o grouping together segments at iner scales in order to come up with appropriate correspondences with segments at coarser scales. We illustrate the eectiveness o our algorithm in retrieval o shapes by content on two dierent two-dimensional (2 D) datasets, one o static hand gesture shapes and another o marine lie shapes. We also demonstrate the superiority o our approach over traditional approaches to shape matching and retrieval, such as Fourier descriptors and geometric and sequential moments. Our evaluation is based on human relevance judgments ollowing a well-established methodology rom the inormation retrieval ield. Index Terms Dynamic programming, image database, query by example, relevance judgments, shape retrieval. V. CONCLUSION We have proposed a new criterion or similarity o regions movement in a video scene based on a statistical test or equality o motion parameters. The uncertainty in parameter estimation is incorporated in an optimal way. Using this measure, we have developed a new merging algorithm consisting o two stages. The agglomerative merging in the irst stage provides a good starting point or the second stage in which the regions are merged according to a K-means like algorithm. The improved perormance over existing methods has been demonstrated on real sequences. As extracted objects and their motion parameters are accurate, they can be used or content-based video retrieval in digital libraries. REFERENCES [1] G. Adiv, Determining 3-D motion and structure rom optical lows generated by several moving objects, IEEE Trans. Pattern Anal. Machine Intell., vol. PAMI 7, pp , [2] Y. Altunbasak, P. E. Eren, and A. M. Tekalp, Region-based parametric motion segmentation using color inormation, Graph. Models Image Process., vol. 60, pp , Jan We note, by the way, that in [2, Sec. IV-C] the mixed use o intensity matching and optic low matching does not guarantee convergence. I. INTRODUCTION Object recognition is an important problem in computer vision and has received considerable attention in the literature. Most approaches to object recognition are model-based [1], emphasizing the accuracy o recognition. They are limited to speciic image types and require that all shapes are preprocessed and labeled prior to storage. However, the increasing amounts o image data in many application domains has generated additional interest or real-time management and retrieval o shapes [2], [3]. There, the emphasis is not only on accuracy, but also Manuscript received December 1, 1998; revised July 27, This work was supported by a grant rom the Natural Sciences and Engineering Research Council o Canada. This work was perormed while E. G. M. Petrakis was visiting York University. The associate editor coordinating the review o this manuscript and approving it or publication was Pro. Thomas S. Huang. E. Milios was with the Department o Computer Science, York University, Toronto, Ont. M3J 1P3, Canada ( eem@cs.yorku.ca). He is now with the Faculty o Computer Science, Dalhousie University, Haliax, N.S. B3J 2X4, Canada ( eem@cs.dal.ca). E. G. M. Petrakis is with the Department o Electronic and Computer Engineering, Technical University o Crete, Chania, Crete, Greece ( petrakis@ced.tuc.gr). Publisher Item Identiier S (00) /00$ IEEE

2 142 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 9, NO. 1, JANUARY 2000 on eiciency (i.e., speed) o retrieval. Due to large image numbers, less emphasis may be given to preprocessing and labeling. The perormance o any object recognition system ultimately depends on the types o shape representations used and on the matching algorithms applied [4]. An important class o methods relies on symbolic entities obtained rom the representation o the shape s inlection points at various scales and is reerred to as curvature scale space (CSS) representation [5]. In [6], matching is perormed through interval trees, which are computed by tracking the CSS representation rom coarser to iner scales. In [7], matching is based on the maxima o CSS curves. Recently, matching in scale-space has been addressed with dynamic programming (DP) [8]. Regarding image database retrieval by shape content, [2] reports experiments with traditional shape representation and matching methods (i.e., Fourier descriptors, moment-based methods, combinations o such methods) on 500 trademark images. More recently, the eectiveness o such shape methods in conjunction with color eatures is investigated in [3] using 1100 trademark images taken rom two dierent datasets. In this work, we ocus on shape content and shape similarity retrievals. The contributions o this work are the ollowing. We propose a shape matching algorithm that is particularly eective or shape retrieval. We establish the superiority o our method over traditional shape matching methods such as Fourier descriptors and sequential and geometric moments. We tested our algorithm on a data set o 980 two-dimensional (2 D) hand gesture shapes and on a marine lie database with 1100 shapes. We introduce to the computer vision community a well established method rom inormation retrieval or the empirical evaluation o retrieval results obtained by many competing methods. We assume that shapes have already been extracted rom images in the orm o closed sequences o points. Automatic shape extraction rom images (or example via region segmentation or edge ollowing) is a nontrivial problem, and it is outside the scope o this paper. For our hand gesture dataset, contours are extracted rom images by taking the polygonal approximation o the hand boundary ater thresholding. The shapes o the marine dataset are already available in the desired orm. The major problem with segmented representations is that small perturbations to the shape can yield large changes in the segmentation. Thereore, the matching algorithm must be robust to segmentation changes. A standard ix is to represent the shape at multiple scales o resolution (smoothing), and either use a ull scale-space representation or matching [6], or like [8] and our approach, have the algorithm choose the appropriate scales or dierent parts o the shape. The rest o this work is organized as ollows: Work related to our proposed algorithm is discussed in Section II. Our shape matching algorithm is presented in Section III. The database set-up, the evaluation criteria along with the experimental results are presented in Section IV. Conclusions and issues or uture research are discussed in Section V. II. RELATED WORK Our algorithm is a substantial extension o the DP algorithm o [8], in the ollowing ways. We propose that the computation o the CSS representation be removed rom the matching algorithm. The CSS representation has two drawbacks: First, it tends to diuse the eects o a eature ar away rom its location as coarser scales are considered. This may be undesirable i such eatures have perceptual signiicance. Second, it is computationally expensive. Our algorithm is not only aster but also achieves matching accuracy comparable to that o the original ormulation [9]. Merging o neighboring segments in [8] is allowed only i such segments merge at some scale (not necessarily the same or the two shapes) in the CSS representation. We present a ormulation o the algorithm that does not use scale space to restrict search or segment merges. Our algorithm allows all segment merges, and relies only on the minimization o the overall matching cost to select the merges. We have implemented a dierent set o cost measures rom the original algorithm and have demonstrated their improved perormance [9]. The algorithm in [8] perorms a best-only search in a DP table as it looks or minimum-cost paths in the DP ramework. In [10], we have identiied instances, in which a best-only search strategy ails to ind a valid match between two shapes, although one exists [9]. We have extended the algorithm to perorm k-best search and we have demonstrated that or a small k (e.g., 5), the additional space and time requirements are modest and the algorithm can solve matching problems where the original one ails (see also Section III-E). III. SHAPE MATCHING The shape matching algorithm that lies at the core o our methodology takes in two shapes and computes: 1) their distance and 2) the correspondences between similar parts o the two shapes. In retrievals, only the distances are used. However, the correspondences help assess the plausibility o the distance computation, i necessary. A. Deinitions Let A and B be the two shapes to be matched and let A = a 0;a 1; 111;a N01 and B = b 0;b 1; 111;b M01 be the convex/concave segment sequences o the two shapes, with a i being the segment between two consecutive inlection points (i.e., points o change o curvature) p i and p i+1; similarly or b j. Then, a(i0nji); n 0, is the sequence o segments a i0n; a i0n+1; 111;a i ; similarly or b(j 0 mjj); m 0. The algorithm searches or segment correspondences at various levels o shape detail by allowing matching o merged sequences o consecutive segments, i this leads to the minimization o a cost unction. Each merging is a recursive application o the grammar rules CV C! C and VCV! V, where C and V denote convex and concave segments respectively [11]. The number o merged segments is always odd. A complete match is a correspondence between groups o consecutive segments in order, such that no segments are let unassociated. The goal is to ind the best association o segments in shape A to segments in shape B. This is ormulated as a minimization problem that is solved eiciently by dynamic programming: A table o partial costs is built and the optimal matching is searched in the orm o a path in the DP table that minimizes a total dissimilarity cost. The DP table has 2N columns and 2M rows, corresponding to segments o shape A and shape B, respectively repeated twice (to orce the algorithm to consider all possible relative rotations between the two shapes). All subscripts below are modulo N and M, respectively. A link between cells (i w01; j w01) and (i w ;j w ) denotes the matching o segments a(i w01 +1ji w ) with b(j w01 +1jj w ).Apath is a linked sequence o cells (i 0;j 0); (i 1;j 1); 111; (i w;jw), not necessarily adjacent, indicating a partial match. A complete match has (i 0 ;j 0 ) = (i w ;j w ). Function (a(i w01 +1ji w );b(j w01 +1jj w )) represents

3 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 9, NO. 1, JANUARY associations, which are not possible. To describe the illing o the k wth option cost g[k w ] o cell(i w ;j w ) o the DP table, we need to compute the ollowing (n; m; k) or all possible n; m, and k, and select the k wth smallest value (n; m; k) =g(i w 0 2n 0 1; j w 0 2m 0 1; k) + (a(i w 0 2nji w); b(jw 0 2mjj w)) ; n; m 0; 0 k<k: (1) Fig. 1. Example o a DP table. The cost in the above equation is the cost o association o segments a(i w 0 2nji w ) with b(j w 0 2mjj w ) and consists o a merging cost component and a dissimilarity cost component (see Section III-C below). Additional constraints on acceptable pairs (n; m) are that either t 1 = t 2 =0(a complete match has been ound), or t 1 > 0; t 2 > 0 (there exist unmatched segments in both shapes). the dissimilarity cost between its two arguments and is deined in Section III-C. Each cell cell(i; j) in the DP table contains the cost array g[k w] and associated bookkeeping data t 1 [k w ], t 2 [k w ], index[k w ], g n [k w ], g m [k w ], where k w varies rom 0 to K 0 1 and reers to the k w th best path (k wth option), or partial match, up to and including cell(i w;j w). Each cell keeps up to K best paths. Speciically, g[k w ] holds the cost o the path, t 1 [k w ] and t 2 [k w ] hold the number o unmatched segments in shapes A and B respectively or the path and index[k w], gn[kw] and g m [k w ] hold the back links or the path and allow the backward tracing o the path. I g n[kw] =nw and g m[kw] =mw or cell(i w;j w), then the previous cell and option in the path is cell(i w01; j w01) = cell(i w 0 2n w 0 1; j w 0 2m w 0 1) and index[k w ], respectively, where n w and m w are nonnegative integers. Notice that, i w01 = i w 0 2n w 0 1 and j w01 = j w 0 2m w 0 1, since the number o segments which are merged is always odd. Fig. 1 illustrates the lower hal o a DP table computed or the matching o two shapes with eight segments each. Two incomplete paths are shown ending at cell(10; 6). Numbers in cells indicate accumulated costs. The DP table consists o the initial value area (let hal) and the calculation area (right hal). In the initial value area all g terms are initialized to zero, t 1 to N and t 2 to M, implying that each o these cells can act as the irst cell in a path. The calculation area is computed and inally the optimal path is searched. A path is complete when its corresponding t 1 and t 2 at the inal cell o the path both become zero simultaneously. B. Algorithm The main idea in the algorithm is to ill the DP table cells by scanning orward and then search or the optimal complete path by tracing backward. We outline the algorithm as ollows. or do or do ill the K options in cell check i a complete path has been ound; end or end or ind the complete path with the lowest cost; retrace segment matches by ollowing the backward links. The or loop or j w does not run over all the indicated values, but only over those values that do not involve convex to concave segment C. Cost Components The cost term in (1) can be rewritten as (a(i w01 + 1ji w );b(j w01 +1jj w )). Following the notation o [8], this cost term consists o three additive components: (a(i w01 +1ji w );b(j w01 +1jj w )) = Merging Cost (a(i w01 +1ji w )) + Merging Cost (b(j w01 +1jj w )) + Dissim Cost (a(i w01 +1ji w );b(j w01 +1jj w )) where represents the relative importance o the merging and dissimilarity costs. In this work, was set to one. The irst two terms in (2) represent the cost o merging segments a(i w01 +1ji w ) in shape A and segments b(j w01 +1jj w ) in shape B, respectively, while the last term is the cost o associating the merged a(i w01 +1ji w ) with the merged b(j w01 +1jj w ). Requirements or reliable cost computation are the ollowing. Merging should ollow the process grammar rules [11] (i.e., each allowable merging should be a recursive application o the grammar rules CV C ) C and VCV ) V ). This is enorced by the DP algorithm. Merging a visually prominent segment (i.e., a large segment with high curvature) into a merged segment o the opposite type (convex or concave) should incur a high cost. To speciy this requirement, we need to deine visual prominence in geometric terms. The partial cost components arising rom dierent eatures o the shape should be combined into a total cost in a meaningul way. The heuristic cost computations that ollow attempt to satisy the above requirements. First, we deine geometric quantities (eatures) needed in the speciication o visual prominence o a segment according to Fig. 2. Rotation Angle i angle traversed by the tangent to the segment rom inlection point p i to inlection point p i+1 and shows how strongly a segment is curved. Length l i length o segment a i. Area a i area enclosed between the chord and the arc between the inlection points p i and p i+1. (2)

4 144 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 9, NO. 1, JANUARY 2000 For being length or area: 0 c =10 C segs o group For being rotation angle: all segs o shape : (7) Fig. 2. Geometric quantities or deining the prominence o a segment. 1) Dissimilarity Cost Computation: We assign a higher cost to segments (or groups o segments) with large dierences in more than one eature: Dissim Cost = W max d g (3) all eatures where = l;,ora. We choose the max operation instead o product (as in [8]) because in the product a small cost in terms o one eature can cancel the eect o a high cost in terms o another eature, something that may lead to a visually implausible outcome. The max operation addresses this problem. Factor W equals the number o eatures or which d is greater than 0:75 2 max d g, where = l; ; a. Thus, i all three eatures have uniormly large d, then the dissimilarity cost is multiplied by three. The term d, or = is deined as d = 2A 0 2B (4) 2 A +2 B where 2 A = i s=i s, and +1 2B = j s=j +1 s, and s being the rotation angle o segment with index s o shape A and shape B, respectively. The term d, or being l (length) or area (a), is deined as d = A F A 0 B F B (5) where F A = N01 s, s=0 A = i s=i +1 s o shape A and similarly or F B, B o shape B. 2) Merging Cost Computation: Let the types o the segments being merged be CV C 111VC. The opposite case is obtained by switching C and V. The merging cost is deined as ollows: Merging Cost = max w c g (6) all eatures where subscript reers to a eature (length, area or rotation angle). We choose the above maximization ormula instead o sum o products o terms comparing consecutive segments (as in [8]) or the ollowing reasons: we used max instead o product, because in a product a small cost in terms o one eature can cancel the eect o a high cost in terms o another. The reason or abandoning the sum o consecutive segments is because it implies that the plausibility o merging several segments can be reduced to the similarity o consecutive segments, which may not necessarily be true. Consider, or example, the case o a short and lat segment next to a long and curved one. In this case, it is plausible to merge the two, while the merging cost in [8] will be high. Another drawback o the use o a sum is that the merging cost increases with the number o segments merged, even i several very short segments are being merged into a large one. c =10 Csegs o group 0 Csegs o group + : (8) The intuition behind these ormulae is that they measure the visual prominence o the eatures o the absorbed segments (o type V ) relative to the absorbing segments (o type C). All costs c are within the interval [0, 2]. Cost c is close to 0 i the convex segments visually dominate the concave ones (hence it is plausible to absorb the concave ones), while it is close to two i the concave segments visually dominate the convex ones (hence it is not plausible to perorm the merge, thereore the merging cost should be high). For being any eature (length, area, rotation angle) the weight term is w = N 2 V segs o shape : (9) The intuition behind the weight term is to measure the visual prominence o the absorbed segments within the shape as a whole. Factor N=2 is heuristic. D. Matching Examples Fig. 3 illustrates segment correspondences (indicated by consecutive lines connecting the starting and ending points o the associated segments) obtained by matching hand silhouettes (let) and ish silhouettes (right). One o the shapes has been scaled to 50% o the original. Notice that the algorithm tried to come-up with plausible segment associations by matching groups o segments which, in the case o the ish shapes are due to shape detail. E. K-Option DP Search The algorithm in [8], except that it uses the CSS representation to constrain the search or possible matches, is a special case o the above K-option DP algorithm or K = 1. The motivation or introducing the K-option method is that, by using only one option at each cell, the optimal path is sometimes missed. This happens because the (n; m) pair chosen at cell(i; j) is the one which leads to the least cost path only up to cell(i; j). This path may not be the one that leads to the ormation o a complete path. It is possible that ater the entire DP table is computed, there exist only incomplete paths, in which case the algorithm ails to produce a match. Also, the best choice at cell(i; j) may lead to an expensive match o the remaining unmatched segments. This is illustrated in Fig. 1, which shows the DP table or matching two shapes with eight and our segments, respectively. Cells in the table are marked as ollows: (4, 0) as B, (6, 0) as C, (8, 2) as E, (9, 4) as A, and (10, 6) as D. At each cell, the number in bold represents the cost o the path up to that cell using the K =1option (dashed line). Numbers in italics represent the cost along the alternate path (K = 2, dotted

5 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 9, NO. 1, JANUARY (a) (b) Fig. 3. Segment associations reported by the matching algorithm on representative matches rom the gestures and the marine lie databases. line). Alternate path BEAD is ignored by the K =1option algorithm because o a least cost choice made at A. This problem is addressed by the K-option DP method where the choice o a best subpath is not made at cell A but deerred until a later point when more inormation is available. IV. SHAPE RETRIEVAL In our experiments we used the ollowing datasets. GESTURES 1 : Consists o 980 synthetic shapes that are generated rom 17 original hand gestures. We took the 17 shapes in pairs and, or each such pair, we produced a number o blended shapes by transorming one shape to the other using the shape morphing algorithm o [12]. To evaluate our method we took the 17 original shapes as queries. SQUID 2 : Consists o 1100 shapes o marine species. We careully selected 20 shapes rom the SQUID database and we used them as queries. Each shape is represented by its contour. All contours are preprocessed to contain between 80 and 100 points. The ish silhouettes o the SQUID database contain much iner contour detail than the hand silhouettes o the GESTURES database. The experiments are designed to illustrate the superiority o our shape matching algorithm over traditional methods o shape matching 1 We have made our database available on the Internet at http: // 2 SQUID is available rom and retrieval. All measurements below correspond to averages over 17 and 20 queries, respectively. A. Comparison with Other Methods The competitors to our method are as ollows. Fourier Descriptors [13]: This is known to be one o the most successul methods or the recognition o closed shapes. We computed the irst (lower order) 20 coeicients o the Fourier transorm. Sequential Moments [14]: This is one o the most eective moment-based methods or closed shapes. For each shape, a representation o 4 moment coeicients is computed rom its bounding contour. Geometric Moments [15]: This is the original and the most characteristic representative o a wide class o methods based on area moments. A representation o seven moment coeicients o the shape is computed rom the area it occupies. Additional reasons or choosing these methods or our evaluation are: a1) they are translation, rotation and scale invariant (the same as our method) and 2) they all ilter out shape detail so that, they can detect shape similarity at a coarse view-scale. Our method has these advantages too but, in addition, matching with our method which is local (as opposed to global matching with the above methods) reporting all associations between similar segments choosing (possibly) dierent scales or dierent parts o the shapes depending on noise or shape detail. Fourier descriptors and sequential and geometric moments are precomputed and stored in separate iles in the database along with the original contours. When a query is given, its representation is computed and it is matched with similar representations stored in the database. This typically takes less than less than 5 s on a SUN Ultra 1 or

6 146 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 9, NO. 1, JANUARY 2000 Fig. 4. Precision-recall diagram or the GESTURES database corresponding to (a) our shape matching algorithm, (b) Fourier descriptors, (c) sequential moments, and (d) geometric moments. each data set. For our method, no precomputed inormation is stored. Instead, the actual shape contours are used to search the database. For this reason, our method is the slowest requiring approximately 1 s per shape match on a Pentium PC, 200 MHz. Certain optimizations that could speed-up our method are possible, such as the precomputation and storage o the convex and concave segments o all shapes in the database, and the selection o a number o options K that ensures optimal tradeo between accuracy and amount o computation. B. Evaluation Criteria We used human relevance judgments to compute the eectiveness o each method. Two shapes (i.e., a query and a stored shape) are considered similar i a human judges that they represent the same igure. To measure eectiveness, or each candidate method we computed: 1) Precision: deined as the percentage o similar shapes retrieved with respect to the total number o retrieved shapes. 2) Recall: deined as the percentage o similar shapes retrieved with respect to the total number o similar shapes in the database. Because we don t have the resources to compare every query with each database shape (i.e., this would require, or each method, = visual judgments or the GESTURES dataset and = visual judgments or the SQUID dataset!), or each query, we merged the answers obtained by all candidate methods and we considered this as the database which is manually inspected or relevant entries. Notice however, that this method does not allow or absolute judgments such as method A misses 10% o the total similar answers in the database. It provides, however, a air basis or comparisons between methods allowing judgements such as method A returns 5% ewer correct answers than method B. C. Experimental Results Each query retrieves the best 50 shapes. For answer sets containing between one and 50 entries, we computed the average values o precision and recall. Precision and recall values are represented in precision-recall plots: The horizontal axis corresponds to recall and the vertical axis corresponds to precision. Each method is represented by a curve. Each point in such a curve is the average over 17 queries (or the GESTURES database) and 20 queries (or the SQUID database) respectively. The total number o points in each curve is 50 (i.e., we compute precision and recall or answers containing between one and Fig. 5. Precision-recall diagram or the SQUID database corresponding to: (a) our shape matching algorithm, (b) Fourier descriptors, (c) sequential moments, and (d) geometric moments. 50 shapes). Thereore, the top-let point o the diagram corresponds to the precision/recall values or the best answer (best match), while the bottom right point corresponds to the precision/recall values or the entire answer set with 50 retrieved shapes. Fig. 4 illustrates the precision-recall diagram or the GESTURES database. For small answer sets returning up to 12 shapes (corresponding to the let-most 12 points o a curve) our method and Fourier descriptors perorm about equally in terms o both, precision and recall. Notice that, or such small answer sets, both methods achieve precision close to one, that is, their answers are almost 100% correct. For larger answer sets, our method perorms clearly better than any other method, achieving up to 25% better recall and 20% better precision than the second best method (Fourier descriptors). This result demonstrates that our method is very well suited or image retrieval where one is (typically) interested in retrieving more than ten or 20 and up to 50 shapes. Fig. 5 illustrates the precision-recall diagram or the SQUID database. Our method perorms clearly better than any other method, achieving up to 18% better recall and 15% better precision. An important observation is that all methods achieved lower values o precision and recall than those achieved on the GESTURES database. Presumably, all methods are sensitive to noise and contour detail which mainly exist in the shapes o the SQUID database. We see that our method outperorms the others both the GESTURES and the SQUID databases. Although it is slower than its competitors, the selection o a method or shape retrieval should be based mainly on eectiveness (i.e., quality o results) rather than on eiciency (i.e., speed). V. CONCLUSIONS We propose a shape matching algorithm or handling shape similarity retrievals in image databases. Our algorithm is based on dynamic programming, perorms implicitly at multiple scales and allows the matching o deormed shapes. We demonstrate the superiority o our approach over traditional approaches to shape matching and retrieval (Fourier descriptors, geometric and sequential moments) using two dierent datasets with 980 and 1100 shapes, respectively. We also introduce to the Computer Vision community a well-established methodology or the evaluation o the retrieval results obtained by more than one competing methods.

7 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 9, NO. 1, JANUARY Current research is directed toward extending our matching algorithm or open curves. Future work includes the experimentation with more datasets and methods, and the handling o combined queries involving more than one eature (e.g., shape, color, text). Recently, we have proposed an indexing mechanism or our method which achieves up to three orders o magnitude speed-up over sequential scanning with very high accuracy, providing also or clustering visualization and browsing o a data set [9]. ACKNOWLEDGMENT The authors would like to thank P. Elinas, or his help in the experiments, Z. Rao or valuable discussions and or a shape morphing program to generate the GESTURES database, M. Roussou and P. Economopoulos or implementing the shape matching algorithm in C++, J. Baid, who introduced the K-option idea, and Pro. Mokhtarian, Centre or Vision, Speech, and Signal Processing Laboratory, University o Surrey, U.K., or providing the SQUID database or our experiments. REFERENCES [1] R. T. Chin and C. R. Dyer, Model-based recognition in robot vision, ACM Comput. Surv., vol. 18, pp , [2] B. M. Mehtre, M. S. Kankhanhalli, and W. F. Lee, Shape measures or content based image retrieval: A comparison, Inorm. Process. Manage., vol. 33, pp , [3] A. K. Jain and A. Vailaya, Shape-based retrieval: A case study with trademark image databases, Pattern Recognit., vol. 31, pp , [4] S. Loncaric, A survey o shape analysis techniques, Pattern Recognit., vol. 31, pp , [5] A. Witkin, Scale space iltering, in Proc. 8th YCA1, Karlsruhe, West Germany, 1983, pp [6] F. Mokhtarian and A. Mackworth, Scale-based description o plannar curves and two-dimensional shapes, IEEE Trans. Pattern Anal. Machine Intell., vol. PAMI 8, pp , [7] F. Mokhtarian, Silhouette-based object recognition through curvature scale space, IEEE Trans. Pattern Anal. Machine Intell., vol. 17, May [8] N. Ueda and S. Suzuki, Learning visual models rom shape contours using multiscale convex/concave structure matching, IEEE Trans. Pattern Anal. Machine Intell., vol. 15, pp , Apr [9] E. Milios and E. Petrakis, Eicient shape matching and retrieval at multiple scales, Dept. Comput. Sci., York Univ., Toronto, Ont., Canada, Dec [10] J. Baid and E. Milios, Deormed shape recognition using dynamic programming, Dept. Comput. Sci., York Univ., Toronto, Ont., Canada, Tech. Rep., [11] E. Milios, Shape matching using curvature processes, Comput. Vis., Graph., Image Process., vol. 47, pp , [12] T. W. Sederberg and E. Greenwood, A physical based approach to 2-D shape bending, Comput. Graph., vol. 26, pp , [13] T. P. Wallace and P. A. Wintz, An eicient three-dimensional aircrat recognition algorithm using normalized Fourier descriptors, Comput. Graph. Image Process., vol. 13, pp , [14] L. Gupta and M. D. Srinath, Contour sequence moments or the classiication o closed planar shapes, Pattern Recognit., vol. 20, pp , [15] M.-K. Hu, Visual pattern recognition by moment invariants, IRE Trans. Inorm. Theory, vol. IT-8, pp , Automatic Text Detection and Tracking in Digital Video Huiping Li, David Doermann, and Omid Kia Abstract Text that appears in a scene or is graphically added to video can provide an important supplemental source o index inormation as well as clues or decoding the video s structure and or classiication. In this work, we present algorithms or detecting and tracking text in digital video. Our system implements a scale-space eature extractor that eeds an artiicial neural processor to detect text blocks. Our text tracking scheme consists o two modules: a sum o squared dierence (SSD) based module to ind the initial position and a contour-based module to reine the position. Experiments conducted with a variety o video sources show that our scheme can detect and track text robustly. Index Terms Digital libraries, neural network, text detection, text tracking, video indexing. I. INTRODUCTION The continued prolieration o large amounts o digital video has increased demand or true content based indexing and retrieval systems. Traditionally, content has been indexed primarily by manual annotation [1], closed caption [2], or transcribed audio [3], but some work has also been done on the content analysis o the video itsel. One area where signiicant progress is being made is in the detection and recognition o text. Text which either appears in a scene or is graphically added to video provides an important supplemental source o index inormation. For example, sports scores, product names, scene locations, speaker names, movie credits, program introductions and special announcements oten appear in the image text and supplement or summarize the visual content, but may not be present in the transcript. Searches can easily be reined i access to this textual content is available. At a high level, text in digital video can be divided into two classes, scene text and graphic text. Scene text appears within the scene and is captured by the camera. Examples o scene text include street signs, billboards, text on trucks and writing on shirts. Graphic text, on the other hand, is text that is mechanically added to video rames to supplement the visual and audio content. Since it is purposeully added it is oten more structured and closely related to the subject than scene text. In some domains such as sports, however, scene text can be used to uniquely identiy objects (participants in the clip). Most related previous work has ocused on the extraction o graphic text [4] [6]. Although scene text is oten diicult to detect and extract due to its virtually unlimited range o poses, sizes, shapes and colors, it is important in applications such as navigation, surveillance, video classiication, or analysis o sporting events. Text oten spans tens or even hundreds o rames in digital video. Exploiting the temporal coherence o text by tracking it is useul not Manuscript received December 11, 1998; revised June 21, The associate editor coordinating the review o this manuscript and approving it or publication was Dr. Hong Jiang Zhang. H. Li and D. S. Doermann are with the Language and Media Processing Laboratory, Center or Automation Research, University o Maryland, College Park, MD USA ( huiping@car.umd.edu; doermann@car.umd.edu). O. Kia is with the National Institute o Standards and Technology (NIST), Gaithersburg, MD USA. Publisher Item Identiier S (00) /00$ IEEE

Efficient Retrieval by Shape Content

Efficient Retrieval by Shape Content Efficient Retrieval by Shape Content Euripides G.M. Petrakis Dept. of Electr. and Comp. Engineering Technical University of Crete, Chanea, Greece petrakis@ced.tuc.gr Evangelos Milios y Department of Computer

More information

Matching and Retrieval of Distorted and Occluded Shapes Using Dynamic Programming

Matching and Retrieval of Distorted and Occluded Shapes Using Dynamic Programming Matching and Retrieval of Distorted and Occluded Shapes Using Dynamic Programming Euripides G.M. Petrakis Aristeidis Diplaros Evangelos Milios April 10, 2002 Abstract We propose an approach for matching

More information

MAPI Computer Vision. Multiple View Geometry

MAPI Computer Vision. Multiple View Geometry MAPI Computer Vision Multiple View Geometry Geometry o Multiple Views 2- and 3- view geometry p p Kpˆ [ K R t]p Geometry o Multiple Views 2- and 3- view geometry Epipolar Geometry The epipolar geometry

More information

ROBUST FACE DETECTION UNDER CHALLENGES OF ROTATION, POSE AND OCCLUSION

ROBUST FACE DETECTION UNDER CHALLENGES OF ROTATION, POSE AND OCCLUSION ROBUST FACE DETECTION UNDER CHALLENGES OF ROTATION, POSE AND OCCLUSION Phuong-Trinh Pham-Ngoc, Quang-Linh Huynh Department o Biomedical Engineering, Faculty o Applied Science, Hochiminh University o Technology,

More information

Classification Method for Colored Natural Textures Using Gabor Filtering

Classification Method for Colored Natural Textures Using Gabor Filtering Classiication Method or Colored Natural Textures Using Gabor Filtering Leena Lepistö 1, Iivari Kunttu 1, Jorma Autio 2, and Ari Visa 1, 1 Tampere University o Technology Institute o Signal Processing P.

More information

Binary Morphological Model in Refining Local Fitting Active Contour in Segmenting Weak/Missing Edges

Binary Morphological Model in Refining Local Fitting Active Contour in Segmenting Weak/Missing Edges 0 International Conerence on Advanced Computer Science Applications and Technologies Binary Morphological Model in Reining Local Fitting Active Contour in Segmenting Weak/Missing Edges Norshaliza Kamaruddin,

More information

Neighbourhood Operations

Neighbourhood Operations Neighbourhood Operations Neighbourhood operations simply operate on a larger neighbourhood o piels than point operations Origin Neighbourhoods are mostly a rectangle around a central piel Any size rectangle

More information

Method estimating reflection coefficients of adaptive lattice filter and its application to system identification

Method estimating reflection coefficients of adaptive lattice filter and its application to system identification Acoust. Sci. & Tech. 28, 2 (27) PAPER #27 The Acoustical Society o Japan Method estimating relection coeicients o adaptive lattice ilter and its application to system identiication Kensaku Fujii 1;, Masaaki

More information

A New Approach to Computation of Curvature Scale Space Image for Shape Similarity Retrieval

A New Approach to Computation of Curvature Scale Space Image for Shape Similarity Retrieval A New Approach to Computation of Curvature Scale Space Image for Shape Similarity Retrieval Farzin Mokhtarian, Sadegh Abbasi and Josef Kittler Centre for Vision Speech and Signal Processing Department

More information

CS485/685 Computer Vision Spring 2012 Dr. George Bebis Programming Assignment 2 Due Date: 3/27/2012

CS485/685 Computer Vision Spring 2012 Dr. George Bebis Programming Assignment 2 Due Date: 3/27/2012 CS8/68 Computer Vision Spring 0 Dr. George Bebis Programming Assignment Due Date: /7/0 In this assignment, you will implement an algorithm or normalizing ace image using SVD. Face normalization is a required

More information

Gesture Recognition using a Probabilistic Framework for Pose Matching

Gesture Recognition using a Probabilistic Framework for Pose Matching Gesture Recognition using a Probabilistic Framework or Pose Matching Ahmed Elgammal Vinay Shet Yaser Yacoob Larry S. Davis Computer Vision Laboratory University o Maryland College Park MD 20742 USA elgammalvinayyaserlsd

More information

Reflection and Refraction

Reflection and Refraction Relection and Reraction Object To determine ocal lengths o lenses and mirrors and to determine the index o reraction o glass. Apparatus Lenses, optical bench, mirrors, light source, screen, plastic or

More information

Automatic Video Segmentation for Czech TV Broadcast Transcription

Automatic Video Segmentation for Czech TV Broadcast Transcription Automatic Video Segmentation or Czech TV Broadcast Transcription Jose Chaloupka Laboratory o Computer Speech Processing, Institute o Inormation Technology and Electronics Technical University o Liberec

More information

Concavity. Notice the location of the tangents to each type of curve.

Concavity. Notice the location of the tangents to each type of curve. Concavity We ve seen how knowing where a unction is increasing and decreasing gives a us a good sense o the shape o its graph We can reine that sense o shape by determining which way the unction bends

More information

Chapter 3 Image Enhancement in the Spatial Domain

Chapter 3 Image Enhancement in the Spatial Domain Chapter 3 Image Enhancement in the Spatial Domain Yinghua He School o Computer Science and Technology Tianjin University Image enhancement approaches Spatial domain image plane itsel Spatial domain methods

More information

Research on Image Splicing Based on Weighted POISSON Fusion

Research on Image Splicing Based on Weighted POISSON Fusion Research on Image Splicing Based on Weighted POISSO Fusion Dan Li, Ling Yuan*, Song Hu, Zeqi Wang School o Computer Science & Technology HuaZhong University o Science & Technology Wuhan, 430074, China

More information

3D Hand and Fingers Reconstruction from Monocular View

3D Hand and Fingers Reconstruction from Monocular View 3D Hand and Fingers Reconstruction rom Monocular View 1. Research Team Project Leader: Graduate Students: Pro. Isaac Cohen, Computer Science Sung Uk Lee 2. Statement o Project Goals The needs or an accurate

More information

Piecewise polynomial interpolation

Piecewise polynomial interpolation Chapter 2 Piecewise polynomial interpolation In ection.6., and in Lab, we learned that it is not a good idea to interpolate unctions by a highorder polynomials at equally spaced points. However, it transpires

More information

Hierarchical Representation of 2-D Shapes using Convex Polygons: a Contour-Based Approach

Hierarchical Representation of 2-D Shapes using Convex Polygons: a Contour-Based Approach Hierarchical Representation of 2-D Shapes using Convex Polygons: a Contour-Based Approach O. El Badawy, M. S. Kamel Pattern Analysis and Machine Intelligence Laboratory, Department of Systems Design Engineering,

More information

9.8 Graphing Rational Functions

9.8 Graphing Rational Functions 9. Graphing Rational Functions Lets begin with a deinition. Deinition: Rational Function A rational unction is a unction o the orm P where P and Q are polynomials. Q An eample o a simple rational unction

More information

Relaxing the 3L algorithm for an accurate implicit polynomial fitting

Relaxing the 3L algorithm for an accurate implicit polynomial fitting Relaxing the 3L algorithm or an accurate implicit polynomial itting Mohammad Rouhani Computer Vision Center Ediici O, Campus UAB 08193 Bellaterra, Barcelona, Spain rouhani@cvc.uab.es Angel D. Sappa Computer

More information

Abstract. I. Introduction

Abstract. I. Introduction 46th AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics & Materials Conerence 8 - April 005, Austin, Texas AIAA 005-83 M Intuitive Design Selection Using Visualized n-dimensional Pareto Frontier G.

More information

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction Stefan Müller, Gerhard Rigoll, Andreas Kosmala and Denis Mazurenok Department of Computer Science, Faculty of

More information

Using a Projected Subgradient Method to Solve a Constrained Optimization Problem for Separating an Arbitrary Set of Points into Uniform Segments

Using a Projected Subgradient Method to Solve a Constrained Optimization Problem for Separating an Arbitrary Set of Points into Uniform Segments Using a Projected Subgradient Method to Solve a Constrained Optimization Problem or Separating an Arbitrary Set o Points into Uniorm Segments Michael Johnson May 31, 2011 1 Background Inormation The Airborne

More information

A Robust Two Feature Points Based Depth Estimation Method 1)

A Robust Two Feature Points Based Depth Estimation Method 1) Vol.31, No.5 ACTA AUTOMATICA SINICA September, 2005 A Robust Two Feature Points Based Depth Estimation Method 1) ZHONG Zhi-Guang YI Jian-Qiang ZHAO Dong-Bin (Laboratory of Complex Systems and Intelligence

More information

MATRIX ALGORITHM OF SOLVING GRAPH CUTTING PROBLEM

MATRIX ALGORITHM OF SOLVING GRAPH CUTTING PROBLEM UDC 681.3.06 MATRIX ALGORITHM OF SOLVING GRAPH CUTTING PROBLEM V.K. Pogrebnoy TPU Institute «Cybernetic centre» E-mail: vk@ad.cctpu.edu.ru Matrix algorithm o solving graph cutting problem has been suggested.

More information

A Proposed Approach for Solving Rough Bi-Level. Programming Problems by Genetic Algorithm

A Proposed Approach for Solving Rough Bi-Level. Programming Problems by Genetic Algorithm Int J Contemp Math Sciences, Vol 6, 0, no 0, 45 465 A Proposed Approach or Solving Rough Bi-Level Programming Problems by Genetic Algorithm M S Osman Department o Basic Science, Higher Technological Institute

More information

Affine-invariant shape matching and recognition under partial occlusion

Affine-invariant shape matching and recognition under partial occlusion Title Affine-invariant shape matching and recognition under partial occlusion Author(s) Mai, F; Chang, CQ; Hung, YS Citation The 17th IEEE International Conference on Image Processing (ICIP 2010), Hong

More information

BI-DIRECTIONAL AFFINE MOTION COMPENSATION USING A CONTENT-BASED, NON-CONNECTED, TRIANGULAR MESH

BI-DIRECTIONAL AFFINE MOTION COMPENSATION USING A CONTENT-BASED, NON-CONNECTED, TRIANGULAR MESH BI-DIRECTIONAL AFFINE MOTION COMPENSATION USING A CONTENT-BASED, NON-CONNECTED, TRIANGULAR MESH Marc Servais, Theo Vlachos and Thomas Davies University of Surrey, UK; and BBC Research and Development,

More information

Lagrangian relaxations for multiple network alignment

Lagrangian relaxations for multiple network alignment Noname manuscript No. (will be inserted by the editor) Lagrangian relaxations or multiple network alignment Eric Malmi Sanjay Chawla Aristides Gionis Received: date / Accepted: date Abstract We propose

More information

Video Alignment. Literature Survey. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin

Video Alignment. Literature Survey. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin Literature Survey Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin Omer Shakil Abstract This literature survey compares various methods

More information

A fast and area-efficient FPGA-based architecture for high accuracy logarithm approximation

A fast and area-efficient FPGA-based architecture for high accuracy logarithm approximation A ast and area-eicient FPGA-based architecture or high accuracy logarithm approximation Dimitris Bariamis, Dimitris Maroulis, Dimitris K. Iakovidis Department o Inormatics and Telecommunications University

More information

A Method of Sign Language Gesture Recognition Based on Contour Feature

A Method of Sign Language Gesture Recognition Based on Contour Feature Proceedings o the World Congress on Engineering and Computer Science 014 Vol I WCECS 014, -4 October, 014, San Francisco, USA A Method o Sign Language Gesture Recognition Based on Contour Feature Jingzhong

More information

3-D TERRAIN RECONSTRUCTION WITH AERIAL PHOTOGRAPHY

3-D TERRAIN RECONSTRUCTION WITH AERIAL PHOTOGRAPHY 3-D TERRAIN RECONSTRUCTION WITH AERIAL PHOTOGRAPHY Bin-Yih Juang ( 莊斌鎰 ) 1, and Chiou-Shann Fuh ( 傅楸善 ) 3 1 Ph. D candidate o Dept. o Mechanical Engineering National Taiwan University, Taipei, Taiwan Instructor

More information

Lab 9 - GEOMETRICAL OPTICS

Lab 9 - GEOMETRICAL OPTICS 161 Name Date Partners Lab 9 - GEOMETRICAL OPTICS OBJECTIVES Optics, developed in us through study, teaches us to see - Paul Cezanne Image rom www.weidemyr.com To examine Snell s Law To observe total internal

More information

Distribution Fields with Adaptive Kernels for Large Displacement Image Alignment

Distribution Fields with Adaptive Kernels for Large Displacement Image Alignment MEARS et al.: DISTRIBUTION FIELDS WITH ADAPTIVE KERNELS 1 Distribution Fields with Adaptive Kernels or Large Displacement Image Alignment Benjamin Mears bmears@cs.umass.edu Laura Sevilla Lara bmears@cs.umass.edu

More information

Video Alignment. Final Report. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin

Video Alignment. Final Report. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin Final Report Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin Omer Shakil Abstract This report describes a method to align two videos.

More information

Comparison of Two Interactive Search Refinement Techniques

Comparison of Two Interactive Search Refinement Techniques Comparison o Two Interactive Search Reinement Techniques Olga Vechtomova Department o Management Sciences University o Waterloo 200 University Avenue West, Waterloo, Canada ovechtom@engmail.uwaterloo.ca

More information

ITU - Telecommunication Standardization Sector. G.fast: Far-end crosstalk in twisted pair cabling; measurements and modelling ABSTRACT

ITU - Telecommunication Standardization Sector. G.fast: Far-end crosstalk in twisted pair cabling; measurements and modelling ABSTRACT ITU - Telecommunication Standardization Sector STUDY GROUP 15 Temporary Document 11RV-22 Original: English Richmond, VA. - 3-1 Nov. 211 Question: 4/15 SOURCE 1 : TNO TITLE: G.ast: Far-end crosstalk in

More information

Local Image Registration: An Adaptive Filtering Framework

Local Image Registration: An Adaptive Filtering Framework Local Image Registration: An Adaptive Filtering Framework Gulcin Caner a,a.murattekalp a,b, Gaurav Sharma a and Wendi Heinzelman a a Electrical and Computer Engineering Dept.,University of Rochester, Rochester,

More information

AUTOMATIC OBJECT DETECTION IN VIDEO SEQUENCES WITH CAMERA IN MOTION. Ninad Thakoor, Jean Gao and Huamei Chen

AUTOMATIC OBJECT DETECTION IN VIDEO SEQUENCES WITH CAMERA IN MOTION. Ninad Thakoor, Jean Gao and Huamei Chen AUTOMATIC OBJECT DETECTION IN VIDEO SEQUENCES WITH CAMERA IN MOTION Ninad Thakoor, Jean Gao and Huamei Chen Computer Science and Engineering Department The University of Texas Arlington TX 76019, USA ABSTRACT

More information

An Efficient Single Chord-based Accumulation Technique (SCA) to Detect More Reliable Corners

An Efficient Single Chord-based Accumulation Technique (SCA) to Detect More Reliable Corners An Efficient Single Chord-based Accumulation Technique (SCA) to Detect More Reliable Corners Mohammad Asiful Hossain, Abdul Kawsar Tushar, and Shofiullah Babor Computer Science and Engineering Department,

More information

The Graph of an Equation Graph the following by using a table of values and plotting points.

The Graph of an Equation Graph the following by using a table of values and plotting points. Calculus Preparation - Section 1 Graphs and Models Success in math as well as Calculus is to use a multiple perspective -- graphical, analytical, and numerical. Thanks to Rene Descartes we can represent

More information

Model-based segmentation and recognition from range data

Model-based segmentation and recognition from range data Model-based segmentation and recognition from range data Jan Boehm Institute for Photogrammetry Universität Stuttgart Germany Keywords: range image, segmentation, object recognition, CAD ABSTRACT This

More information

EE 264: Image Processing and Reconstruction. Image Motion Estimation II. EE 264: Image Processing and Reconstruction. Outline

EE 264: Image Processing and Reconstruction. Image Motion Estimation II. EE 264: Image Processing and Reconstruction. Outline Peman Milanar Image Motion Estimation II Peman Milanar Outline. Introduction to Motion. Wh Estimate Motion? 3. Global s. Local Motion 4. Block Motion Estimation 5. Optical Flow Estimation Basics 6. Optical

More information

Querying Complex Spatio-Temporal Sequences in Human Motion Databases

Querying Complex Spatio-Temporal Sequences in Human Motion Databases Querying Complex Spatio-Temporal Sequences in Human Motion Databases Yueguo Chen #, Shouxu Jiang, Beng Chin Ooi #, Anthony KH Tung # # School o Computing, National University o Singapore Computing, Law

More information

A Cylindrical Surface Model to Rectify the Bound Document Image

A Cylindrical Surface Model to Rectify the Bound Document Image A Cylindrical Surace Model to Rectiy the Bound Document Image Huaigu Cao, Xiaoqing Ding, Changsong Liu Department o Electronic Engineering, Tsinghua University State Key Laboratory o Intelligent Technology

More information

NeTra-V: Towards an Object-based Video Representation

NeTra-V: Towards an Object-based Video Representation Proc. of SPIE, Storage and Retrieval for Image and Video Databases VI, vol. 3312, pp 202-213, 1998 NeTra-V: Towards an Object-based Video Representation Yining Deng, Debargha Mukherjee and B. S. Manjunath

More information

A SAR IMAGE REGISTRATION METHOD BASED ON SIFT ALGORITHM

A SAR IMAGE REGISTRATION METHOD BASED ON SIFT ALGORITHM A SAR IMAGE REGISTRATION METHOD BASED ON SIFT ALGORITHM W. Lu a,b, X. Yue b,c, Y. Zhao b,c, C. Han b,c, * a College o Resources and Environment, University o Chinese Academy o Sciences, Beijing, 100149,

More information

Multi-scale Techniques for Document Page Segmentation

Multi-scale Techniques for Document Page Segmentation Multi-scale Techniques for Document Page Segmentation Zhixin Shi and Venu Govindaraju Center of Excellence for Document Analysis and Recognition (CEDAR), State University of New York at Buffalo, Amherst

More information

Reducing the Bandwidth of a Sparse Matrix with Tabu Search

Reducing the Bandwidth of a Sparse Matrix with Tabu Search Reducing the Bandwidth o a Sparse Matrix with Tabu Search Raael Martí a, Manuel Laguna b, Fred Glover b and Vicente Campos a a b Dpto. de Estadística e Investigación Operativa, Facultad de Matemáticas,

More information

What is Clustering? Clustering. Characterizing Cluster Methods. Clusters. Cluster Validity. Basic Clustering Methodology

What is Clustering? Clustering. Characterizing Cluster Methods. Clusters. Cluster Validity. Basic Clustering Methodology Clustering Unsupervised learning Generating classes Distance/similarity measures Agglomerative methods Divisive methods Data Clustering 1 What is Clustering? Form o unsupervised learning - no inormation

More information

SUPER RESOLUTION IMAGE BY EDGE-CONSTRAINED CURVE FITTING IN THE THRESHOLD DECOMPOSITION DOMAIN

SUPER RESOLUTION IMAGE BY EDGE-CONSTRAINED CURVE FITTING IN THE THRESHOLD DECOMPOSITION DOMAIN SUPER RESOLUTION IMAGE BY EDGE-CONSTRAINED CURVE FITTING IN THE THRESHOLD DECOMPOSITION DOMAIN Tsz Chun Ho and Bing Zeng Department o Electronic and Computer Engineering The Hong Kong University o Science

More information

PATH PLANNING OF UNMANNED AERIAL VEHICLE USING DUBINS GEOMETRY WITH AN OBSTACLE

PATH PLANNING OF UNMANNED AERIAL VEHICLE USING DUBINS GEOMETRY WITH AN OBSTACLE Proceeding o International Conerence On Research, Implementation And Education O Mathematics And Sciences 2015, Yogyakarta State University, 17-19 May 2015 PATH PLANNING OF UNMANNED AERIAL VEHICLE USING

More information

Mobile Robot Static Path Planning Based on Genetic Simulated Annealing Algorithm

Mobile Robot Static Path Planning Based on Genetic Simulated Annealing Algorithm Mobile Robot Static Path Planning Based on Genetic Simulated Annealing Algorithm Wang Yan-ping 1, Wubing 2 1. School o Electric and Electronic Engineering, Shandong University o Technology, Zibo 255049,

More information

Monocular 3D human pose estimation with a semi-supervised graph-based method

Monocular 3D human pose estimation with a semi-supervised graph-based method Monocular 3D human pose estimation with a semi-supervised graph-based method Mahdieh Abbasi Computer Vision and Systems Laboratory Department o Electrical Engineering and Computer Engineering Université

More information

Automated Modelization of Dynamic Systems

Automated Modelization of Dynamic Systems Automated Modelization o Dynamic Systems Ivan Perl, Aleksandr Penskoi ITMO University Saint-Petersburg, Russia ivan.perl, aleksandr.penskoi@corp.imo.ru Abstract Nowadays, dierent kinds o modelling settled

More information

Optical flow estimation on image sequences with differently exposed frames. Tomas Bengtsson Tomas McKelvey Konstantin Lindström

Optical flow estimation on image sequences with differently exposed frames. Tomas Bengtsson Tomas McKelvey Konstantin Lindström Optical low estimation on image sequences with dierently exposed rames Tomas Bengtsson Tomas McKelvey Konstantin Lindström Optical Engineering 54(9), 093103 (September 015) Optical low estimation on image

More information

Classifier Evasion: Models and Open Problems

Classifier Evasion: Models and Open Problems . In Privacy and Security Issues in Data Mining and Machine Learning, eds. C. Dimitrakakis, et al. Springer, July 2011, pp. 92-98. Classiier Evasion: Models and Open Problems Blaine Nelson 1, Benjamin

More information

Triangular Mesh Segmentation Based On Surface Normal

Triangular Mesh Segmentation Based On Surface Normal ACCV2002: The 5th Asian Conference on Computer Vision, 23--25 January 2002, Melbourne, Australia. Triangular Mesh Segmentation Based On Surface Normal Dong Hwan Kim School of Electrical Eng. Seoul Nat

More information

Automated Planning for Feature Model Configuration based on Functional and Non-Functional Requirements

Automated Planning for Feature Model Configuration based on Functional and Non-Functional Requirements Automated Planning or Feature Model Coniguration based on Functional and Non-Functional Requirements Samaneh Soltani 1, Mohsen Asadi 1, Dragan Gašević 2, Marek Hatala 1, Ebrahim Bagheri 2 1 Simon Fraser

More information

GEOMETRICAL OPTICS OBJECTIVES

GEOMETRICAL OPTICS OBJECTIVES Geometrical Optics 207 Name Date Partners OBJECTIVES OVERVIEW GEOMETRICAL OPTICS To examine Snell s Law and observe total internal relection. To understand and use the lens equations. To ind the ocal length

More information

Compressed Sensing Image Reconstruction Based on Discrete Shearlet Transform

Compressed Sensing Image Reconstruction Based on Discrete Shearlet Transform Sensors & Transducers 04 by IFSA Publishing, S. L. http://www.sensorsportal.com Compressed Sensing Image Reconstruction Based on Discrete Shearlet Transorm Shanshan Peng School o Inormation Science and

More information

9.3 Transform Graphs of Linear Functions Use this blank page to compile the most important things you want to remember for cycle 9.

9.3 Transform Graphs of Linear Functions Use this blank page to compile the most important things you want to remember for cycle 9. 9. Transorm Graphs o Linear Functions Use this blank page to compile the most important things you want to remember or cycle 9.: Sec Math In-Sync by Jordan School District, Utah is licensed under a 6 Function

More information

A Robust Wipe Detection Algorithm

A Robust Wipe Detection Algorithm A Robust Wipe Detection Algorithm C. W. Ngo, T. C. Pong & R. T. Chin Department of Computer Science The Hong Kong University of Science & Technology Clear Water Bay, Kowloon, Hong Kong Email: fcwngo, tcpong,

More information

CS443: Digital Imaging and Multimedia Binary Image Analysis. Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University

CS443: Digital Imaging and Multimedia Binary Image Analysis. Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University CS443: Digital Imaging and Multimedia Binary Image Analysis Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University Outlines A Simple Machine Vision System Image segmentation by thresholding

More information

Water-Filling: A Novel Way for Image Structural Feature Extraction

Water-Filling: A Novel Way for Image Structural Feature Extraction Water-Filling: A Novel Way for Image Structural Feature Extraction Xiang Sean Zhou Yong Rui Thomas S. Huang Beckman Institute for Advanced Science and Technology University of Illinois at Urbana Champaign,

More information

An Analytic Model for Embedded Machine Vision: Architecture and Performance Exploration

An Analytic Model for Embedded Machine Vision: Architecture and Performance Exploration 419 An Analytic Model or Embedded Machine Vision: Architecture and Perormance Exploration Chan Kit Wai, Prahlad Vadakkepat, Tan Kok Kiong Department o Electrical and Computer Engineering, 4 Engineering

More information

Chapter 11 Arc Extraction and Segmentation

Chapter 11 Arc Extraction and Segmentation Chapter 11 Arc Extraction and Segmentation 11.1 Introduction edge detection: labels each pixel as edge or no edge additional properties of edge: direction, gradient magnitude, contrast edge grouping: edge

More information

Object Tracking with Dynamic Feature Graph

Object Tracking with Dynamic Feature Graph Object Tracking with Dynamic Feature Graph Feng Tang and Hai Tao Department o Computer Engineering, University o Caliornia, Santa Cruz {tang,tao}@soe.ucsc.edu Abstract Two major problems or model-based

More information

Motion Estimation for Video Coding Standards

Motion Estimation for Video Coding Standards Motion Estimation for Video Coding Standards Prof. Ja-Ling Wu Department of Computer Science and Information Engineering National Taiwan University Introduction of Motion Estimation The goal of video compression

More information

Shape Similarity Measurement for Boundary Based Features

Shape Similarity Measurement for Boundary Based Features Shape Similarity Measurement for Boundary Based Features Nafiz Arica 1 and Fatos T. Yarman Vural 2 1 Department of Computer Engineering, Turkish Naval Academy 34942, Tuzla, Istanbul, Turkey narica@dho.edu.tr

More information

Shape Descriptor using Polar Plot for Shape Recognition.

Shape Descriptor using Polar Plot for Shape Recognition. Shape Descriptor using Polar Plot for Shape Recognition. Brijesh Pillai ECE Graduate Student, Clemson University bpillai@clemson.edu Abstract : This paper presents my work on computing shape models that

More information

Classifier Evasion: Models and Open Problems

Classifier Evasion: Models and Open Problems Classiier Evasion: Models and Open Problems Blaine Nelson 1, Benjamin I. P. Rubinstein 2, Ling Huang 3, Anthony D. Joseph 1,3, and J. D. Tygar 1 1 UC Berkeley 2 Microsot Research 3 Intel Labs Berkeley

More information

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 7, NO. 2, APRIL 1997 429 Express Letters A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation Jianhua Lu and

More information

An Efficient Configuration Methodology for Time-Division Multiplexed Single Resources

An Efficient Configuration Methodology for Time-Division Multiplexed Single Resources An Eicient Coniguration Methodology or Time-Division Multiplexed Single Resources Benny Akesson 1, Anna Minaeva 1, Přemysl Šůcha 1, Andrew Nelson 2 and Zdeněk Hanzálek 1 1 Czech Technical University in

More information

KANGAL REPORT

KANGAL REPORT Individual Penalty Based Constraint handling Using a Hybrid Bi-Objective and Penalty Function Approach Rituparna Datta Kalyanmoy Deb Mechanical Engineering IIT Kanpur, India KANGAL REPORT 2013005 Abstract

More information

THIN LENSES: BASICS. There are at least three commonly used symbols for object and image distances:

THIN LENSES: BASICS. There are at least three commonly used symbols for object and image distances: THN LENSES: BASCS BJECTVE: To study and veriy some o the laws o optics applicable to thin lenses by determining the ocal lengths o three such lenses ( two convex, one concave) by several methods. THERY:

More information

where ~n = ( ) t is the normal o the plane (ie, Q ~ t ~n =,8 Q ~ ), =( ~ X Y Z ) t and ~t = (t X t Y t Z ) t are the camera rotation and translation,

where ~n = ( ) t is the normal o the plane (ie, Q ~ t ~n =,8 Q ~ ), =( ~ X Y Z ) t and ~t = (t X t Y t Z ) t are the camera rotation and translation, Multi-Frame Alignment o Planes Lihi Zelnik-Manor Michal Irani Dept o Computer Science and Applied Math The Weizmann Institute o Science Rehovot, Israel Abstract Traditional plane alignment techniques are

More information

Improving Alignment of Faces for Recognition

Improving Alignment of Faces for Recognition Improving Alignment o Faces or Recognition Md. Kamrul Hasan Département de génie inormatique et génie logiciel École Polytechnique de Montréal, Québec, Canada md-kamrul.hasan@polymtl.ca Christopher J.

More information

Fingerprint Classification Using Orientation Field Flow Curves

Fingerprint Classification Using Orientation Field Flow Curves Fingerprint Classification Using Orientation Field Flow Curves Sarat C. Dass Michigan State University sdass@msu.edu Anil K. Jain Michigan State University ain@msu.edu Abstract Manual fingerprint classification

More information

Study and Analysis of Edge Detection and Implementation of Fuzzy Set. Theory Based Edge Detection Technique in Digital Images

Study and Analysis of Edge Detection and Implementation of Fuzzy Set. Theory Based Edge Detection Technique in Digital Images Study and Analysis o Edge Detection and Implementation o Fuzzy Set Theory Based Edge Detection Technique in Digital Images Anju K S Assistant Proessor, Department o Computer Science Baselios Mathews II

More information

Intelligent Optimization Methods for High-Dimensional Data Classification for Support Vector Machines

Intelligent Optimization Methods for High-Dimensional Data Classification for Support Vector Machines Intelligent Inormation Management, 200, 2, ***-*** doi:0.4236/iim.200.26043 Published Online June 200 (http://www.scirp.org/journal/iim) Intelligent Optimization Methods or High-Dimensional Data Classiication

More information

THE FINANCIAL CALCULATOR

THE FINANCIAL CALCULATOR Starter Kit CHAPTER 3 Stalla Seminars THE FINANCIAL CALCULATOR In accordance with the AIMR calculator policy in eect at the time o this writing, CFA candidates are permitted to use one o two approved calculators

More information

Unsupervised Learning of Probabilistic Models for Robot Navigation

Unsupervised Learning of Probabilistic Models for Robot Navigation Unsupervised Learning o Probabilistic Models or Robot Navigation Sven Koenig Reid G. Simmons School o Computer Science, Carnegie Mellon University Pittsburgh, PA 15213-3891 Abstract Navigation methods

More information

2 DETERMINING THE VANISHING POINT LOCA- TIONS

2 DETERMINING THE VANISHING POINT LOCA- TIONS IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL.??, NO.??, DATE 1 Equidistant Fish-Eye Calibration and Rectiication by Vanishing Point Extraction Abstract In this paper we describe

More information

5.2 Properties of Rational functions

5.2 Properties of Rational functions 5. Properties o Rational unctions A rational unction is a unction o the orm n n1 polynomial p an an 1 a1 a0 k k1 polynomial q bk bk 1 b1 b0 Eample 3 5 1 The domain o a rational unction is the set o all

More information

CS4670: Computer Vision

CS4670: Computer Vision CS4670: Computer Vision Noah Snavely Lecture 6: Feature matching and alignment Szeliski: Chapter 6.1 Reading Last time: Corners and blobs Scale-space blob detector: Example Feature descriptors We know

More information

MORPHOLOGICAL EDGE DETECTION AND CORNER DETECTION ALGORITHM USING CHAIN-ENCODING

MORPHOLOGICAL EDGE DETECTION AND CORNER DETECTION ALGORITHM USING CHAIN-ENCODING MORPHOLOGICAL EDGE DETECTION AND CORNER DETECTION ALGORITHM USING CHAIN-ENCODING Neeta Nain, Vijay Laxmi, Ankur Kumar Jain & Rakesh Agarwal Department of Computer Engineering Malaviya National Institute

More information

Efficient Tuning of SVM Hyperparameters Using Radius/Margin Bound and Iterative Algorithms

Efficient Tuning of SVM Hyperparameters Using Radius/Margin Bound and Iterative Algorithms IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 13, NO. 5, SEPTEMBER 2002 1225 Efficient Tuning of SVM Hyperparameters Using Radius/Margin Bound and Iterative Algorithms S. Sathiya Keerthi Abstract This paper

More information

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images Karthik Ram K.V & Mahantesh K Department of Electronics and Communication Engineering, SJB Institute of Technology, Bangalore,

More information

AN 608: HST Jitter and BER Estimator Tool for Stratix IV GX and GT Devices

AN 608: HST Jitter and BER Estimator Tool for Stratix IV GX and GT Devices AN 608: HST Jitter and BER Estimator Tool or Stratix IV GX and GT Devices July 2010 AN-608-1.0 The high-speed communication link design toolkit (HST) jitter and bit error rate (BER) estimator tool is a

More information

Shape Retrieval with Flat Contour Segments

Shape Retrieval with Flat Contour Segments Shape Retrieval with Flat Contour Segments Dalong Li 1, Steven Simske Intelligent Enterprise Technologies Laboratory HP Laboratories Palo Alto HPL-2002-250 September 9 th, 2002* image database, image retrieval,

More information

Three-Dimensional Object Representations Chapter 8

Three-Dimensional Object Representations Chapter 8 Three-Dimensional Object Representations Chapter 8 3D Object Representation A surace can be analticall generated using its unction involving the coordinates. An object can be represented in terms o its

More information

I. INTRODUCTION. Figure-1 Basic block of text analysis

I. INTRODUCTION. Figure-1 Basic block of text analysis ISSN: 2349-7637 (Online) (RHIMRJ) Research Paper Available online at: www.rhimrj.com Detection and Localization of Texts from Natural Scene Images: A Hybrid Approach Priyanka Muchhadiya Post Graduate Fellow,

More information

A Quantitative Comparison of 4 Algorithms for Recovering Dense Accurate Depth

A Quantitative Comparison of 4 Algorithms for Recovering Dense Accurate Depth A Quantitative Comparison o 4 Algorithms or Recovering Dense Accurate Depth Baozhong Tian and John L. Barron Dept. o Computer Science University o Western Ontario London, Ontario, Canada {btian,barron}@csd.uwo.ca

More information

9. Reviewing Printed Circuit Board Schematics with the Quartus II Software

9. Reviewing Printed Circuit Board Schematics with the Quartus II Software November 2012 QII52019-12.1.0 9. Reviewing Printed Circuit Board Schematics with the Quartus II Sotware QII52019-12.1.0 This chapter provides guidelines or reviewing printed circuit board (PCB) schematics

More information

MORPHOLOGICAL BOUNDARY BASED SHAPE REPRESENTATION SCHEMES ON MOMENT INVARIANTS FOR CLASSIFICATION OF TEXTURES

MORPHOLOGICAL BOUNDARY BASED SHAPE REPRESENTATION SCHEMES ON MOMENT INVARIANTS FOR CLASSIFICATION OF TEXTURES International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 125-130 MORPHOLOGICAL BOUNDARY BASED SHAPE REPRESENTATION SCHEMES ON MOMENT INVARIANTS FOR CLASSIFICATION

More information

3D colonic polyp segmentation using dynamic deformable surfaces

3D colonic polyp segmentation using dynamic deformable surfaces 3D colonic polyp segmentation using dynamic deormable suraces Jianhua Yao 1, Ronald M. Summers 1 1 Diagnostic Radiology Department, Clinical Center National Institute o Health ABSTRACT An improved 3D method

More information

A Research on Moving Human Body Detection Based on the Depth Images of Kinect

A Research on Moving Human Body Detection Based on the Depth Images of Kinect Sensors & Transducers 2014 by IFSA Publishing, S. L. http://www.sensorsportal.com A Research on Moving Human Body Detection Based on the Depth Images o Kinect * Xi an Zhu, Jiaqi Huo Institute o Inormation

More information