A review of branch and band algorithms for geometric and statistical layout analysis

Size: px
Start display at page:

Download "A review of branch and band algorithms for geometric and statistical layout analysis"

Transcription

1 A revew of branch and band algorthms for geometrc and statstcal layout analyss Thomas M. Breuel To cte ths verson: Thomas M. Breuel. A revew of branch and band algorthms for geometrc and statstcal layout analyss. Jun 2004, <sc_ > HAL Id: sc_ Submtted on 7 Dec 2004 HAL s a mult-dscplnary open access archve for the depost and dssemnaton of scentfc research documents, whether they are publshed or not. The documents may come from teachng and research nsttutons n France or abroad, or from publc or prvate research centers. L archve ouverte plurdscplnare HAL, est destnée au dépôt et à la dffuson de documents scentfques de nveau recherche, publés ou non, émanant des établssements d ensegnement et de recherche franças ou étrangers, des laboratores publcs ou prvés.

2 A Revew of Branch-and-Bound Algorthms for Geometrc and Statstcal Layout Analyss Thomas M. Breuel Unversty of Kaserslautern and DFKI Résumé : Many dfferent approaches to the geometrc and statstcal analyss of document layouts have been proposed n the lterature. The development of practcal branchand-bound algorthms for solvng geometrc matchng problems under nose and uncertanty has enabled the formulaton of new classes of geometrc layout analyss methods based on globally optmal maxmum lkelhood nterpretatons for well-defned models of the spatal statstcs of document mages. I revew ths approach to geometrc layout analyss usng text lne fndng and column fndng n the presence of nose and uncertanty as examples and compare the approach wth selected other statstcal and geometrc layout analyss methods. Mots-clés : document layout analyss, geometrc matchng, text lne fndng, branch-and-bound algorthms, global optmzaton 1 Introducton In addton to ther purely textual content, rendered documents contan a wealth of nformaton n the geometrc arrangement of the text and fgures on the page the page layout. Examples of propertes encoded n the page layout are nformaton about whch text corresponds to the ttle, author, page number, and abstract of a document, the order n whch the body text s to be read (the readng order), and major logcal dvsons n the body text. Recoverng ths nformaton s the problem of document layout analyss. Document layout nformaton has a varety of uses. It s a key step n the converson of scanned documents nto machne readable form ; that s, what we typcally thnk of optcal character recognton (OCR) actually comprses both layout analyss and recognton of ndvdual characters. In fact, even the recognton of characters n OCR depends on correct document layout analyss, snce the nterpretaton of certan characters s affected by ther poston relatve to the text lne and snce statstcal language models depend on the correct readng order of the text. Furthermore, the user of an OCR system usually expects to obtan not just a vector graphcs fle wth thousands of characters placed at specfc locaton n the mage, but nstead an edtable and structured text fle that contans text n ts correct readng order and correctly dentfes actual lne and paragraph breaks. But whle OCR s perhaps the most mportant use of document layout analyss, t s not the only one. Document databases need to extract nformaton that permts ndexng and retreval of documents. Ths nformaton can often only be derved from the layout of the text (as opposed to the textual content or even font propertes). For example, ttles and authors of scentfc papers tend to be prnted at the top of the frst page, centered, wth the ttle mmedately precedng the author and separated from the rest of the text by whtespace, propertes that are recoverable by document layout analyss. Another applcaton of document layout analyss s magebased reformattng and reflow of documents, a technque that allows the dsplay of scanned documents on small-screen devces wthout OCR errors and whle preservng the appearance of the orgnal document [BRE 02b]. 2 Layout Prmtves The actual layout of a document s the result of the applcaton of complex, nteractng rules about where to place text on the page. Some of those rules are consequences of propertes of the human vsual system and attempts to make text more readable (e.g., keepng lne lengths below a certan number of characters per lne), others are the results of physcal constrants (e.g., page sze), constrants mposed by tradtonal type settng equpment (e.g., the use of straght and parallel text lnes), conventon (e.g., where page numbers and ttles go), as well as stylstc and artstc consderatons. Whle layouts can become enormously complex, almost all layouts tend to be composed of a number of recurrng prmtves. The most mportant of these are text lnes, text columns, sectons, and paragraphs. Furthermore, these prmtves have a number of common geometrc relatonshps between them, defned by ther relatve sze, spacng, algnment, and justfcaton. We call the extracton of these prmtves physcal document layout analyss. We refer to the extracton of hgher-level propertes of a document (lke ttles, authors, page numbers, etc.) as logcal document layout analyss. Logcal document layout analyss generally makes use of physcal document layout analyss to acheve ts goals. Ths work deals prmarly wth physcal document layout analyss, that s, the relable extracton of prmtves lke text lnes and text columns, and the geometrc relatonshp between those prmtves. 3 Prevous Methods A large number of physcal document layout analyss technques have been proposed n the lterature ([CAT 98] provdes a good overvew). In order to perform ther functon, layout analyss technques make assumptons (explctly or

3 mplctly) about the geometrc propertes of the layouts that they are appled to. Commonly made assumptons, by dfferent systems, are that text lnes are parallel to each other, that text on the page s composed of characters of approxmately the same font sze, that paragraph boundares are straght and perpendcular to text lnes, that paragraph boundares are rectangular, that larger logcal dvsons correspond to greater physcal spacng of text, that paragraphs of text contan only lmted amounts of whtespace, and many more. Such assumptons are approxmate expressons of the physcal, stylstc, and perceptual constrants on layout analyss. In practce, none of the assumptons made by current layout analyss systems result n human-lke performance on real documents ; that s, for each of those assumptons, there wll be some classes of documents that volate those assumptons, yet are easly nterpretable by a human reader. A complete taxonomy of the assumptons made by dfferent document layout analyss methods goes beyond the scope of ths paper, but we wll try to examne mportant dfferences as they arse. Whle the models revewed n ths paper does not permt the nterpretaton of all documents that a human reader could easly nterpret, t does relax some commonly made assumptons, whch makes t applcable to a wder class of document layouts. Frst, many prevous physcal layout analyss methods explctly or mplctly assume that text lnes on a page are (mostly) parallel to each other. For example, projecton methods and Fourer transform methods for skew estmaton assume that most of the text on the page s at a sngle orentaton and then attempt to correct page rotaton ( skew ) and transform all text lnes nto lnes that are orented at multples of 90 o relatve to the page boundares ; text lnes are then dentfed n a second step n ths corrected mage (see [HAR 94, BAI 87, SRI 89, BAI 87, OKU 99] for further detals). Ths assumpton of a global page rotaton of an otherwse rectlnear layout means that such methods are not applcable to some mportant classes of documents, such as those captured wth dgtal cameras and hence subject to perspectve dstortons. Some prevous methods for fndng text lnes do not requre rectlnear layouts, but they have had to sacrfce precse geometrc models of text lnes (n terms of baselnes, descenders, and ascenders) for smpler methods usng proxmty of connected components [O G 93, KIS 98] text lne fnder descrbed here does not make pror assumptons about text lne parallelsm, but stll models the geometry of each text lne precsely. Another assumpton commonly made by other layout analyss systems s that of a physcally represented layout herarchy. That s, layout components lke sectons, paragraphs, and text lnes form a herarchy based on ncluson : sectons are composed of paragraphs, and paragraphs are composed of text lnes. It seems plausble to take a top-down or bottomup approach to recoverng these layout components. That s, such methods frst dentfy sectons, then paragraphs, and fnally text lnes. Examples of such approaches are approaches lke X-Y cuts [NAG 84] that successvely splt the page nto smaller and smaller unts top-down, and approaches that use proxmty of connected components [O G 93, KIS 98] or mathematcal morphology [CHE 95] group elements on the page nto successvely hgher level groupng n a bottom up manner. However, n practce, the logcal layout herarchy does not appear to be very well represented n smple geometrc propertes of the physcal layout ; hgher-level layout propertes do not, n general correspond to larger amounts of whtespace or other large-scale features. Instead, logcal layout components lke paragraphs and sectons are often represented only through subtle means lke ndentaton or changes n font style (e.g., bold secton headngs), whle physcal layout structures lke text columns or page breaks that do not form part of the logcal layout herarchy often have much smpler geometrc representatons on the page. The layout analyss technques descrbed here are not organzed around a logcal layout herarchy ; nstead, they concentrate on dentfyng the vsually most salent propertes of the page layout ts columnar structure and ts text lnes. Logcal layout propertes lke paragraphs and sectons can then be recovered from that nformaton. 4 Statstcal and Geometrc Models 4.1 Text Lnes Probably both for typesettng and for perceptual reasons, text n many wrtng systems s set along straght lnes. For Latn scrpts, whch we focus on n ths work, each character rests ether on a baselne or on a lne of descenders (Fgure 1). The top of each character reaches ether up to the x-heght, or to the lne of ascenders. Note that the locaton of each character on the baselne or lne of descenders s only approxmate typographc conventons resultng from perceptual phenomena dctate that n well-desgned fonts, the actual locaton of ndvdual characters dffers slghtly from ther precse locaton relatve to the baselne. As s commonly done, we approxmate the shape of each character by ts (axs-algned) boundng box and use the bottom center of the boundng box as a reference pont [BAI 87]. If there s no rotaton present at all, ths reference pont wll le on the baselne or lne of descenders f the character does ; for small page rotatons, the poston of the reference pont wll dffer from the actual locaton of the character relatve to the baselne, but for the range of page rotatons encountered n practce, the dfference s slght. The amount of dfference depends, however, on the character shape, and t wll be larger for characters whose boundng box s defned by straght, off-center lnes (e.g., H ) than for characters lke T or O. Devatons from the baselne, errors resultng from usng the boundng box approxmaton, mage nose, and scanner quantzaton all result n some nose n the locaton of reference ponts relatve to the baselne or lne of descenders. We approxmate ths nose by a Gaussan dstrbuton, but bound the maxmum allowable devaton ; that s, beyond a certan devaton from a gven baselne, t s more lkely that the character n queston belongs to a dfferent baselne or background nose rather than belongng to the same baselne wth a large devaton. The text lne model shown n Fgure 1 s parameterzed by two lne parameters for the baselne (we use angle θ and dstance from orgn, r), plus three parameters for the sze of the

4 FIG. 1 The text lne model used by the algorthm. descenders, the x-heght, and the ascenders. However, every character rests ether on the baselne or lne of descenders. Therefore, n order to dentfy all characters contrbutng to a text lne, t s suffcent to model the baselne and the lne of descenders, a three-parameter problem. 1 Ignorng the lne of descenders for a moment, the lkelhood wth whch a character s found at some pont p gven a set of lne parameters (θ, r) s then (up to normalzaton factors) : P (p θ, r) max(β, G σ (d(p, l θ,r ))) (1) Here, l θ,r s the baselne, d(p, l θ,r ) s the dstance of p from the baselne, G σ (x) s a Gaussan dstrbuton wth zero mean and standard devaton σ, and β s a background probablty. Ths model s analogous to the model for geometrc matchng descrbed by [III 97], where the reader can fnd a more extensve justfcaton. Takng nto account the lne of descenders, we arrve at a mxture dstrbuton P (p θ, r, d d ) max(β, (1 λ) G σ (d(p, l θ,r )) + λ G σ (d(p, l θ,r dd ))) (2) = max(β, (1 λ) G σ (d(p, l θ,r )), λ G σ (d(p, l θ,r dd ))) (3) Here, d d s the dstance of the lne of descenders from the baselne, and λ s the frequency wth whch descenders occur. We have assumed that the ranges for whch the dstrbutons around the baselne and around the lne of descenders are greater than the background dstrbuton are well separated, whch has allowed us to replace the sum n Equaton 2 wth the max n Equaton 3. Wth the above statstcal model, we can now formulate text lne fndng as that of fndng a maxmum lkelhood soluton for our statstcal model : ˆl θ,r,dd = arg max P (p θ, r, d d ) (4) For the purpose of fndng maxmum lkelhood solutons, t s easer to take the logarthm of ths equaton (the logarthm 1 Modelng the lne of ascenders and the x-heght can be used to reject characters from a match that accdentally fall on the baselne or the lne of descenders. That can mprove robustness for very nosy documents or documents wth unusual layouts. s a monotonc functon) : ˆl θ,r,dd = arg max = arg max log P (p θ, r, d d ) (5) max(log β, log(1 λ) + log G σ (d(p, l θ,r )), log λ + log G σ (d(p, l θ,r ))) (6) = arg max max(0, log(1 λ) + log G σ (d(p, l θ,r )) log β, log λ + log G σ (d(p, l θ,r )) log β) (7) By choosng parameters approprately and wrtng x = d(p, l θ,r ) ths reduces to a partcularly smple form : ˆl θ,r,dd = arg max max(0, c 1 c 2 x 2, c 3 c 4 x 2 ) (8) Here, the parameters c 1,..., c 4 depend on β, σ, and λ. Ths s the optmzaton that we need to perform n order to fnd the maxmum lkelhood soluton to the text lne fndng problem under our statstcal model. It s closely related to robust least square fttng n robust statstcs [HUB 81]. Optmzng these knds of functons s dffcult wth tradtonal methods, such as gradent descent, because they have many local mnma and large flat regons. However, below, we wll see how we can fnd globally optmal solutons to such optmzaton problems. As stated above, the approach assumes that there s only a sngle text lne on the page plus background nose. However, for each reference pont that contrbutes to the total lkelhood for a partcular choce of lne parameters, we not only get a contrbuton to the total lkelhood, but also an ndcaton of whether t was consdered part of the text lne or whether t contrbuted as a background pont (β). Ths means that once we have found a maxmum lkelhood soluton, we can dvde the reference ponts that contrbuted to t nto those that contrbuted as background ponts and those that dd not. The reference ponts that dd not contrbute as background ponts are then counted as beng part of the lne, whle the background ponts can be re-examned for the next-best maxmum lkelhood soluton 2 2 A formal justfcaton of ths approach goes beyond the scope of ths

5 FIG. 2 A smple column model. The red rectangles are boundng boxes for connected components, the green dots are the reference ponts used for each boundng box, and the blue lne s a robust lnear ft to the reference ponts. 4.2 Column Boundares At frst sght, the problem of fndng text columns s qute smlar to the problem of fndng text lnes. That s, text s typeset n columns so that the leftmost character n each column algns on a straght lne wth the other leftmost characters on each text lne that s part of the column. We can therefore pck the bottom left corner of each character as an algnment pont and perform a lne matchng operaton smlar to that used for text lne fndng. The lkelhood model and optmzaton problems are also analogous to that for text lne fndng, except that modelng of baselnes s not requred. Ths approach s llustrated n Fgure 2. Such an approach has many of the advantages of the text lne fndng algorthm mentoned above : t can fnd column boundares even f they are not parallel to each other (e.g., n perspectvely dstorted document mages), and t does not rely on any global propertes of the page. An example of the applcaton of a combned text lne and column fndng method can be found n Fgure 3, where t s appled to the problem of removng perspectve dstortons from mages captured wth a hand-held camera. However, whle useful n some nstances, unlke the text lne model, such a column model s not satsfactory for complex documents because of the statstcs of real documents. One smple way of understandng that dfference s the followng observaton. If we pck out an arbtrary connected component correspondng to a character, t s wth hgh probablty part of some text lne. In contrast, only a small fracton of all characters are actually part of a column boundary (namely, only those characters at the begnnng of a text lne). Therefore, there s almost no possblty for false postves durng text lne fndng, snce there are very few connected components not part of text lnes, and those components are unlkely to be algned lnearly, whle there s a sgnfcant probablty of false postves durng text lne fndng, namely when characters that are otherwse not part of any column boundary accdentally algn lnearly. In order to recognze text columns relably, we need to take advantage of addtonal nformaton that helps us dstngush paper. actual text columns from accdental algnments of characters. One property that suggests tself s the use of whtespace. In fact, many algorthms for geometrc layout analyss rely only on whtespace for detectng column boundares. Examples are the use of X-Y segmentatons [NAG 84], and whtespace covers [BAI 94, BAI 90]. However, n our experence, whtespace covers by themselves, wthout the use of addtonal nformaton, are not a relable method for column detecton ether. But combnng the computaton of whtespace covers wth the computaton of columnar algnments turns out to result n a method that emprcally recognzes column boundares wth lower error rates than ether whtespace-only or algnment-only methods alone. There are several ways of formalzng the combnaton of whtespace covers wth measurng the algnment of connected components as a column boundary. Frst, we can start wth the maxmum whtespace rectangle formulaton as descrbed by Bard et al. [BAI 94, BAI 90] and add a requrement that a mnmal number of connected components be present around the outsde of that rectangle. Alternatvely, we can start wth a column fnder based on lnear algnment of reference ponts, as n Fgure 2 and add a requrement that sgnfcant amounts of whtespace be present next to the column boundary. For hstorcal reasons maxmal rectangle whtespace covers had already proven to be farly relable at detectng column boundares and snce we developed smple and effcent algorthms for computng them we adopted the frst approach. Ths approach and nsght then leads to a computatonal geometry problem for fndng column separators. In the case of axs-algned document layouts, we can state ths problem formally as follows (also llustrated n Fgure 5. Defnton 1 Axs-Algned Maxmum Whtespace Rectangle wth Halo Problem. The nput to the algorthm s a rectangular outer bound B, a collecton of axs-algned rectangular obstacles R, a dstance ɛ, and a threshold count h. The output of the algorthm s a rectangle M wth maxmal area satsfyng (1) M B, (2) for all R, area(m R ) = 0, and (3) the number of rectangular obstacles R wthn a dstance

6 FIG. 3 Applcaton of the text lne and column fnders to the problem of removng perspectve dstortons of documents captured wth hand-held cameras. The applcaton s enabled by the fact that the text lne and column fnders do not make any assumptons about parallelsm among text lnes or columns. FIG. 5 Maxmal whtespace rectangles wth halo. See the text for detals. of ɛ of M s greater than h, #{R : dst(r, M) ɛ} h. Ths defnton can be generalzed easly to non-axs algned rectangles, although (as we wll see) the resultng algorthm dffers sgnfcantly. Furthermore, n practce, t s useful to lmt the aspect rato of the maxmal rectangles found by the algorthm. Note that our reasonng for statng ths approach to column fndng has a rather dfferent motvaton from our approach to text lne fndng. In the case of text lne fndng, we started wth a clearly defned geometrc model of text lnes (characters restng ether on the baselne or lne of descenders), chose an error model, and derved the maxmum lkelhood soluton. In the case of column fndng, whle we were motvated by a smlar model algnment of characters along a lne we ultmately chose an algorthm based on maxmal whtespace rectangles because such algorthms have been found to work well n prevous work [BAI 94, BAI 90] ; the constrant of lnear algnment of the connected components correspondng to the frst character of each text lne adjacent to the column separator comes n through the halo requrement. Ths requrement of havng a halo of connected components under a gven error bounds corresponds to a bounded unform error nose model, as opposed to the Gaussan error model used wth the text lne fnder. 5 Branch-and-Bound Algorthms In the prevous secton, we have dscussed the knds of statstcal and geometrc models that correspond to text lnes and column boundares. However, we have so far left open the queston of how to fnd optmal solutons under those models ; wthout practcal algorthms, such models would smply not be very useful. In fact, the use of the models descrbed n the prevous secton has only been enabled through the development of a new class of practcal optmzaton algorthms. These algorthms, descrbed below, combne deas from branch-and-bound geometrc matchng algorthms n computer vson [BRE 92] and nterval arthmetc optmzaton [HAN 80]. Furthermore, they ncorporate an mportant optmzaton that we refer to as matchlsts [BRE 92]. The basc dea behnd these algorthms s to formulate the search for an optmal soluton as a search over a multd-

7 FIG. 4 A column model that takes nto account whtespace. The red rectangles are boundng boxes for connected components, the green dots are the reference ponts used for each boundng box, and the blue lne s a robust lnear ft to the reference ponts. The blue area to the left of the blue lne s the columnar whtespace. mensonal parameter space. In the case of fndng maxmum lkelhood text lne solutons, the parameter space s threedmensonal : t conssts of a range of angles [θ, θ], a range of dstances from the orgn [r, r], and a range of dstances for the lne of descenders from the baselne [d d, d d ]. In the case of fndng non-axs algned rectangles wth halo, the parameter space s four dmensonal 3 and conssts of a range of rectangle centers [x, x] and [y, y], a range of orentatons [θ, θ], and an aspect rato [r, r]. In both cases, the objectve functons (Equaton 8 and the area functon subject to the constrants n Defnton 1) are ll-suted to the usual gradent-based optmzaton methods because they contan large flat regons and many local mnma. In order to fnd good solutons to these problems, we need global optmzaton technques. The combnaton of branch-and-bound methods together wth nterval arthmetc has proven to be a powerful tool for these knds of problems [HAN 80, BRE 03b]. The dea s that we subdvde the parameter space nto hyper-rectangular subregons and compute upper and lower bounds on the value of the objectve functon over each subregon. Ths can be accomplshed usng nterval arthmetc. Interval arthmetc allows us to take an objectve functon n algebrac notaton and replace the arthmetc operatons occurrng n that objectve functon wth ther nterval equvalents. (Ths can be accomplshed automatcally n languages lke C++ through overloadng.) The resultng nterval objectve functon then computes upper and lower bounds on the value of the objectve functon over each hyper-rectangle n parameter space. Those bounds are not guaranteed to be tght, but they are guaranteed to be convergent [JAU 01] nformally, as the hyper-rectangle n parameter space becomes smaller, the bounds are guaranteed to approxmate the true value of the objectve functon closer and closer. In order to make ths approach practcal for these applca- 3 Such rectangles are actually descrbed by fve parameters, but the requrement that the rectangles are maxmal reduces the parameter space that needs to be searched to four dmensons, gven a partcular set of obstacles. tons, we need to apply an mportant optmzaton. Observe that the objectve functon for text lne fndng, gven by Equaton 8 s a sum of ndvdual contrbutons. Furthermore, we have rewrtten that objectve functon so that the contrbuton of background features to the total sum s zero. The nterval equvalent of ths objectve functon wll be evaluated repeatedly n the nner loop of the search algorthm. But t can be shown easly that f a contrbuton from some feature pont p s zero for a hyper-rectangle n parameter space, t wll reman zero for any subrectangle of that hyper-rectangle. All ponts makng zero contrbutons therefore need not be consdered durng further evaluatons of the objectve functon for subrectangles n parameter space. We keep track of the ponts makng non-zero contrbutons on a smple data structure, the matchlst. As we consder subrectangles of a hyper-rectangle n parameter space, ponts are removed from the matchlst and need not be evaluated any further n the computaton of the objectve functon for such subrectangles. Usng these nsghts, we can now wrte down pseudo-code for the global optmzaton algorthms for these geometrc problems (for brevty, we use the term regon nstead of hyper-rectangle n parameter space ) ; ths s shown n Fgure 6. Actual code for ths problem s, n fact, very close to def globally_optmze(regon,ponts): q = objectve_functon(regon,ponts) queue.enqueue(q,regon,ponts) whle not queue.s_empty(): (q,regon,ponts) = queue.dequeue_max() f accurate_enough(regon): return regon subregons = splt(regon) for subregon n subregons: subponts = [lst of pont n ponts f pont contrbutes to objectve_functon] subq = objectve_functon(subregon,subponts) queue.enqueue(subq,subregon,subponts) FIG. 6 Pseudo-code for fndng globally optmal solutons to the maxmum lkelhood problems descrbed n the text. the pseudo-code shown n Fgure 6. A queston that remans about these algorthms s how ef-

8 fcent they are. Commonly used methods for analyzng the performance of geometrc and numercal algorthms determne asymptotc complextes n terms of elementary arthmetc operatons. Such analyses are both dffcult and not very useful for these classes of algorthms. They are dffcult because the complexty of these algorthms depends on the desred accuracy of the computed results. Increasng the desred accuracy, the asymptotc complexty of the algorthm can be shown to be lnear n the number of solutons and the nput, but n practce, the algorthms are usually used wth a fxed, fnte accuracy dctated by the problem requrements. Furthermore, the problem nstances themselves are lmted n complexty : snce they are derved from scanned documents and there are physcal and perceptual constrants on the sze and complexty of such documents, asymptotc complexty s of less nterest than actual complexty on the range of problem complextes found n real-world applcatons. Havng sad that, we fnd average runnng tmes of approxmately 1 second for the text lne fnder (fndng all text lnes n a document mage) and less than 0.25 seconds for the whte space cover on modern PC hardware on the documents n the Unversty of Washngton Database 3 (UW3), wth approxmately lnear scalng of runnng tmes n terms of the number of connected components over the range of document complextes found n the UW3 database. 6 Dscusson Ths paper has revewed well-defned models of the spatal statstcs of document layouts and algorthms for fndng maxmum lkelhood solutons under such models. These approaches to layout analyss dffers from prevous methods for fndng geometrc layout prmtves n a number of mportant ways. Frst, they are guaranteed to fnd globally optmal solutons under the specfc geometrc and statstcal model chosen. That makes ther results reproducble and determnstc ; f the algorthm fals to come up wth a correct soluton, we know that ths must be a problem wth the statstcal model tself, no smply a falure of the algorthm to fnd the optmal soluton to the model. Emprcally, ths smplfes debuggng and parameter estmaton for layout analyss systems based on these methods. Second, the algorthms are expressed n terms of welldefned models of spatal statstcs and errors ; the dstrbutonal parameters (error bounds, frequency of background features) can be estmated from sample mages and have ntutve nterpretatons. At the same tme, the models also ncorporate strong pror knowledge about the geometry of document mages (e.g., the fact that text lnes are usually straght). Ths s n contrast to other well-founded statstcal models of page layout proposed recently [LIA 99, LIA 01] ; those models also take advantage of statstcal constrants, but they rely on more general statstcs of the relatve spatal dstrbutons of layout elements, makng both the parameter estmaton and the nference problem consderably harder. It s also n contrast wth projecton-based methods for layout analyss [BAI 87, NAG 84], whose crtera for determnng the locaton of text lnes or paragraphs are generally not drectly related to spatal statstcs of the locaton of characters on the page. Thrd, the methods presented here can operate locally ; that s, ndvdual text lnes and column separators can be found ndependently of each other. Ths allows us to apply the methods both to novel document types (e.g., documents wth text at varous dfferent orentatons) and to standard document types captured n novel ways (e.g., document capture wth hand-held cameras, resultng n perspectve dstorton). In contrast, many commonly used methods for geometrc layout analyss rely on global layout propertes. For example, methods lke projecton-based text lne fndng and layout analyss generally assume rectlnear layouts and the ablty to fnd a sngle, global page rotaton that wll deskew the page. There are some methods for geometrc layout analyss that do not make such global assumptons, such as the Vorono-based technque of [KIS 98] or closely related methods based on lnkng nearby connected components. However, whle they are able, n prncple, to operate on pages that have been perspectvely dstorted or contan text at varous dfferent orentatons, those methods rely on much weaker geometrc models of text lnes than the methods presented n ths paper. That suggests that they wll be more susceptble to false postve errors n the detecton of text lnes ; however, ths remans to be demonstrated on actual datasets. In ths short revew, we have focused on motvatng partcular models of spatal statstcs for layout analyss and outlned an approach to nference based on branch-and-bound methods that makes usng such models feasble for real document analyss problems. For detaled performance data n specfc applcatons, the reader s referred to the lterature. For data on the accuracy of text lne fndng and deskewng usng these methods, [BRE 02c] contans an evaluaton on the UW2 database. For data on the relablty wth whch the method fnds column separators, the reader s referred to [BRE 02a]. Results on non-axs algned column separator dentfcaton can be found n [BRE 03a]. More complete document analyss systems takng advantage of the novel capabltes of these methods have also been descrbed, ncludng a system for mage-based reflowng of document mages [BRE 02b]. However, the reader should keep n mnd that these methods can also be used as a drop-n replacement for commonly used exstng methods n document analyss, often yeldng some mprovement n performance, relablty, or functonalty over exstng methods. For example, the text lne fnder descrbed above can be used n place of projecton-based text lne fnders for relable skew estmaton and text lne dentfcaton. And the maxmal empty rectangle methods outlned above can be used as a drop-n replacement for the whtespace cover approach descrbed by [BAI 94, BAI 90]. Future work on these algorthms dvdes nto fundamental algorthmc mprovements on the one hand, and the exploraton of addtonal applcaton areas on the other. On the algorthmc sde, one open problem s the applcaton of these optmzaton methods to hgher-dmensonal parameter spaces. Currently parameter spaces beyond four or fve dmensons lead to mpractcally long computaton tmes. However, our lab s currently examnng methods for speedng up search that may allow us to extend that lmt for

9 many knds of objectve functons. We are also examnng other approaches to copng wth problems such as matchng smoothly curved text lnes (as they occur, for example, on photographc mages of curved book pages). On the applcaton sde, one of the most mportant areas s the use of these methods wth non-latn scrpts, languages, and page layouts. Furthermore, a detaled performance evaluaton and comparson between other recently proposed layout analyss technques [LIA 99, LIA 01, KIS 98] and these methods remans to be done. In order to facltate more wdespread adopton of these methods, we are makng avalable sample mplementatons for research purposes ; please contact the author for more nformaton. Références [BAI 87] BAIRD H. S., The Skew Angle of Prnted Documents, Proc., 1987 Conf. of the Socety of Photographc Scentsts and Engneers, Rochester, New York, [BAI 90] BAIRD H. S., JONES S. E., FORTUNE S. J., Image Segmentaton by Shape-Drected Covers, Proceedngs of the Tenth Internatonal Conference on Pattern Recognton, Atlantc Cty, New Jersey, 1990, pp [BAI 94] BAIRD H. S., Background Structure n Document Images, H. Bunke, P. S. P. Wang, & H. S. Bard (Eds.), Document Image Analyss, World Scentfc, Sngapore, 1994, pp [BRE 92] BREUEL T. M., Fast Recognton usng Adaptve Subdvsons of Transformaton Space, Proceedngs IEEE Conf. on Computer Vson and Pattern Recognton, 1992, pp [BRE 02a] BREUEL T. M., Two Algorthms for Geometrc Layout Analyss, Proceedngs of the Workshop on Document Analyss Systems, Prnceton, NJ, USA, [BRE 02b] BREUEL T. M., JANSSEN W. C., POPAT K., BAIRD H. S., Paper-to-PDA, Proceedngs of the Internatonal Conference on Pattern Recognton (ICPR 02), Quebec Cty, Quebec, Canada, [BRE 02c] BREUEL T., Robust Least Square Baselne Fndng usng a Branch and Bound Algorthm, Proceedngs of the SPIE - The Internatonal Socety for Optcal Engneerng, 2002, page (n press). [BRE 03a] BREUEL T. M., An Algorthm for Fndng Maxmal Whtespace Rectangles at Arbtrary Orentatons, Internatonal Conference on Document Analyss and Recognton, [BRE 03b] BREUEL T. M., On the Use of Interval Arthmetc n Geometrc Branch-and-Bound Algorthms, Pattern Recognton Letters,, [CAT 98] CATTONI R., COIANIZ T., MESSELODI S., MO- DENA C. M., Geometrc Layout Analyss Technques for Document Image Understandng : a Revew, rapport, 1998, IRST, Trento, Italy. [CHE 95] CHEN S., Document Layout Analyss Usng Recursve Morphologcal Transforms, PhD thess, Ph.D. thess, Unv. of Washngton, [HAN 80] HANSEN E., Global optmzaton usng nterval analyss the mult-dmensonal case, Numersche Mathematk, vol. 34, 1980, pp [HAR 94] HARALICK R. M., Document Image Understandng : Geometrc and Logcal Layout, CVPR94 : IEEE Computer Socety Conference on Computer Vson and Pattern Recognton, 1994, pp [HUB 81] HUBER P., Robust Statstcs, Sons, New York, John Wley and [III 97] III W. W., Statstcal approaches to feature-based object recognton., Internatonal Journal of Computer Vson, vol. 21, n o 1/2, 1997, pp [JAU 01] JAULIN L., KIEFFER M., DIDRIT O., WALTER E., Appled Interval Analyss, Sprnger Verlag, Berln, [KIS 98] KISE K., SATO A., IWATA M., Segmentaton of page mages usng the area Vorono Dagram, Computer Vson and Image Understandng, vol. 70, n o 3, 1998, pp [LIA 99] LIANG J., PHILLIPS I., HARALICK R., A Unfed Methodology for Document Structure Analyss, Workshop on Document Layout Interpretaton and ts Applcatons (DLIA), [LIA 01] LIANG J., PHILIPS I. T., HARALICK R. M., An Optmzaton Methodology for Document Structure Extracton on Latn Character Documents, Pattern Analyss and Machne Intellgence,, 2001, pp [NAG 84] NAGY G., SETH S., Herarchcal Representaton Of Optcally Scanned Documents, 7ICPR, vol. 84, 1984, pp [O G 93] O GORMAN L., The Document Spectrum for Page Layout Analyss, IEEE Transactons on Pattern Analyss and Machne Intellgence, vol. 15, n o 11, 1993, pp [OKU 99] OKUN O., PIETIKAINEN M., SAUVOLA J., Robust document skew detecton based on lne extracton, Proc. of the 11th Scandnavan Conference on Image Analyss (SCIA 99), June 7-11, Kangerlussuaq, Greenland, 1999, pp [SRI 89] SRIHARI S., GOVINDARAJU V., Analyss of textual mages usng the Hough transform, Machne Vson and Applcatons, vol. 2, 1989.

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

3D vector computer graphics

3D vector computer graphics 3D vector computer graphcs Paolo Varagnolo: freelance engneer Padova Aprl 2016 Prvate Practce ----------------------------------- 1. Introducton Vector 3D model representaton n computer graphcs requres

More information

Fitting & Matching. Lecture 4 Prof. Bregler. Slides from: S. Lazebnik, S. Seitz, M. Pollefeys, A. Effros.

Fitting & Matching. Lecture 4 Prof. Bregler. Slides from: S. Lazebnik, S. Seitz, M. Pollefeys, A. Effros. Fttng & Matchng Lecture 4 Prof. Bregler Sldes from: S. Lazebnk, S. Setz, M. Pollefeys, A. Effros. How do we buld panorama? We need to match (algn) mages Matchng wth Features Detect feature ponts n both

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Accounting for the Use of Different Length Scale Factors in x, y and z Directions

Accounting for the Use of Different Length Scale Factors in x, y and z Directions 1 Accountng for the Use of Dfferent Length Scale Factors n x, y and z Drectons Taha Soch (taha.soch@kcl.ac.uk) Imagng Scences & Bomedcal Engneerng, Kng s College London, The Rayne Insttute, St Thomas Hosptal,

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Biostatistics 615/815

Biostatistics 615/815 The E-M Algorthm Bostatstcs 615/815 Lecture 17 Last Lecture: The Smplex Method General method for optmzaton Makes few assumptons about functon Crawls towards mnmum Some recommendatons Multple startng ponts

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CPS 0 Introducton to Computer Scence Lecture Notes Chapter : Algorthm Desgn How should we present algorthms? Natural languages lke Englsh, Spansh, or French whch are rch n nterpretaton and meanng are not

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Shape Representaton Robust to the Sketchng Order Usng Dstance Map and Drecton Hstogram Department of Computer Scence Yonse Unversty Kwon Yun CONTENTS Revew Topc Proposed Method System Overvew Sketch Normalzaton

More information

Two Dimensional Projective Point Matching

Two Dimensional Projective Point Matching Two Dmensonal Projectve Pont Matchng Jason Denton & J. Ross Beverdge Colorado State Unversty Computer Scence Department Ft. Collns, CO 80523 denton@cs.colostate.edu Abstract Pont matchng s the task of

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng, Natonal Unversty of Sngapore {shva, phanquyt, tancl }@comp.nus.edu.sg

More information

Multi-stable Perception. Necker Cube

Multi-stable Perception. Necker Cube Mult-stable Percepton Necker Cube Spnnng dancer lluson, Nobuuk Kaahara Fttng and Algnment Computer Vson Szelsk 6.1 James Has Acknowledgment: Man sldes from Derek Hoem, Lana Lazebnk, and Grauman&Lebe 2008

More information

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline mage Vsualzaton mage Vsualzaton mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and Analyss outlne mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and

More information

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1) Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection 2009 10th Internatonal Conference on Document Analyss and Recognton A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng,

More information

Vectorization of Image Outlines Using Rational Spline and Genetic Algorithm

Vectorization of Image Outlines Using Rational Spline and Genetic Algorithm 01 Internatonal Conference on Image, Vson and Computng (ICIVC 01) IPCSIT vol. 50 (01) (01) IACSIT Press, Sngapore DOI: 10.776/IPCSIT.01.V50.4 Vectorzaton of Image Outlnes Usng Ratonal Splne and Genetc

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

MATHEMATICS FORM ONE SCHEME OF WORK 2004

MATHEMATICS FORM ONE SCHEME OF WORK 2004 MATHEMATICS FORM ONE SCHEME OF WORK 2004 WEEK TOPICS/SUBTOPICS LEARNING OBJECTIVES LEARNING OUTCOMES VALUES CREATIVE & CRITICAL THINKING 1 WHOLE NUMBER Students wll be able to: GENERICS 1 1.1 Concept of

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

A Comparison and Evaluation of Three Different Pose Estimation Algorithms In Detecting Low Texture Manufactured Objects

A Comparison and Evaluation of Three Different Pose Estimation Algorithms In Detecting Low Texture Manufactured Objects Clemson Unversty TgerPrnts All Theses Theses 12-2011 A Comparson and Evaluaton of Three Dfferent Pose Estmaton Algorthms In Detectng Low Texture Manufactured Objects Robert Krener Clemson Unversty, rkrene@clemson.edu

More information

Image Alignment CSC 767

Image Alignment CSC 767 Image Algnment CSC 767 Image algnment Image from http://graphcs.cs.cmu.edu/courses/15-463/2010_fall/ Image algnment: Applcatons Panorama sttchng Image algnment: Applcatons Recognton of object nstances

More information

AP PHYSICS B 2008 SCORING GUIDELINES

AP PHYSICS B 2008 SCORING GUIDELINES AP PHYSICS B 2008 SCORING GUIDELINES General Notes About 2008 AP Physcs Scorng Gudelnes 1. The solutons contan the most common method of solvng the free-response questons and the allocaton of ponts for

More information

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following. Complex Numbers The last topc n ths secton s not really related to most of what we ve done n ths chapter, although t s somewhat related to the radcals secton as we wll see. We also won t need the materal

More information

Analysis of Continuous Beams in General

Analysis of Continuous Beams in General Analyss of Contnuous Beams n General Contnuous beams consdered here are prsmatc, rgdly connected to each beam segment and supported at varous ponts along the beam. onts are selected at ponts of support,

More information

3D Virtual Eyeglass Frames Modeling from Multiple Camera Image Data Based on the GFFD Deformation Method

3D Virtual Eyeglass Frames Modeling from Multiple Camera Image Data Based on the GFFD Deformation Method NICOGRAPH Internatonal 2012, pp. 114-119 3D Vrtual Eyeglass Frames Modelng from Multple Camera Image Data Based on the GFFD Deformaton Method Norak Tamura, Somsangouane Sngthemphone and Katsuhro Ktama

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

K-means and Hierarchical Clustering

K-means and Hierarchical Clustering Note to other teachers and users of these sldes. Andrew would be delghted f you found ths source materal useful n gvng your own lectures. Feel free to use these sldes verbatm, or to modfy them to ft your

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

PRÉSENTATIONS DE PROJETS

PRÉSENTATIONS DE PROJETS PRÉSENTATIONS DE PROJETS Rex Onlne (V. Atanasu) What s Rex? Rex s an onlne browser for collectons of wrtten documents [1]. Asde ths core functon t has however many other applcatons that make t nterestng

More information

Signature and Lexicon Pruning Techniques

Signature and Lexicon Pruning Techniques Sgnature and Lexcon Prunng Technques Srnvas Palla, Hansheng Le, Venu Govndaraju Centre for Unfed Bometrcs and Sensors Unversty at Buffalo {spalla2, hle, govnd}@cedar.buffalo.edu Abstract Handwrtten word

More information

A Robust Method for Estimating the Fundamental Matrix

A Robust Method for Estimating the Fundamental Matrix Proc. VIIth Dgtal Image Computng: Technques and Applcatons, Sun C., Talbot H., Ourseln S. and Adraansen T. (Eds.), 0- Dec. 003, Sydney A Robust Method for Estmatng the Fundamental Matrx C.L. Feng and Y.S.

More information

5 The Primal-Dual Method

5 The Primal-Dual Method 5 The Prmal-Dual Method Orgnally desgned as a method for solvng lnear programs, where t reduces weghted optmzaton problems to smpler combnatoral ones, the prmal-dual method (PDM) has receved much attenton

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law)

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law) Machne Learnng Support Vector Machnes (contans materal adapted from talks by Constantn F. Alfers & Ioanns Tsamardnos, and Martn Law) Bryan Pardo, Machne Learnng: EECS 349 Fall 2014 Support Vector Machnes

More information

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem Ecent Computaton of the Most Probable Moton from Fuzzy Correspondences Moshe Ben-Ezra Shmuel Peleg Mchael Werman Insttute of Computer Scence The Hebrew Unversty of Jerusalem 91904 Jerusalem, Israel Emal:

More information

An Accurate Evaluation of Integrals in Convex and Non convex Polygonal Domain by Twelve Node Quadrilateral Finite Element Method

An Accurate Evaluation of Integrals in Convex and Non convex Polygonal Domain by Twelve Node Quadrilateral Finite Element Method Internatonal Journal of Computatonal and Appled Mathematcs. ISSN 89-4966 Volume, Number (07), pp. 33-4 Research Inda Publcatons http://www.rpublcaton.com An Accurate Evaluaton of Integrals n Convex and

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

UB at GeoCLEF Department of Geography Abstract

UB at GeoCLEF Department of Geography   Abstract UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

MOTION BLUR ESTIMATION AT CORNERS

MOTION BLUR ESTIMATION AT CORNERS Gacomo Boracch and Vncenzo Caglot Dpartmento d Elettronca e Informazone, Poltecnco d Mlano, Va Ponzo, 34/5-20133 MILANO boracch@elet.polm.t, caglot@elet.polm.t Keywords: Abstract: Pont Spread Functon Parameter

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

Face Recognition University at Buffalo CSE666 Lecture Slides Resources: Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural

More information

Chapter 6 Programmng the fnte element method Inow turn to the man subject of ths book: The mplementaton of the fnte element algorthm n computer programs. In order to make my dscusson as straghtforward

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

EXTENDED BIC CRITERION FOR MODEL SELECTION

EXTENDED BIC CRITERION FOR MODEL SELECTION IDIAP RESEARCH REPORT EXTEDED BIC CRITERIO FOR ODEL SELECTIO Itshak Lapdot Andrew orrs IDIAP-RR-0-4 Dalle olle Insttute for Perceptual Artfcal Intellgence P.O.Box 59 artgny Valas Swtzerland phone +4 7

More information

Exterior Orientation using Coplanar Parallel Lines

Exterior Orientation using Coplanar Parallel Lines Exteror Orentaton usng Coplanar Parallel Lnes Frank A. van den Heuvel Department of Geodetc Engneerng Delft Unversty of Technology Thsseweg 11, 69 JA Delft, The Netherlands Emal: F.A.vandenHeuvel@geo.tudelft.nl

More information

Review of approximation techniques

Review of approximation techniques CHAPTER 2 Revew of appromaton technques 2. Introducton Optmzaton problems n engneerng desgn are characterzed by the followng assocated features: the objectve functon and constrants are mplct functons evaluated

More information

Circuit Analysis I (ENGR 2405) Chapter 3 Method of Analysis Nodal(KCL) and Mesh(KVL)

Circuit Analysis I (ENGR 2405) Chapter 3 Method of Analysis Nodal(KCL) and Mesh(KVL) Crcut Analyss I (ENG 405) Chapter Method of Analyss Nodal(KCL) and Mesh(KVL) Nodal Analyss If nstead of focusng on the oltages of the crcut elements, one looks at the oltages at the nodes of the crcut,

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

Detection of hand grasping an object from complex background based on machine learning co-occurrence of local image feature

Detection of hand grasping an object from complex background based on machine learning co-occurrence of local image feature Detecton of hand graspng an object from complex background based on machne learnng co-occurrence of local mage feature Shnya Moroka, Yasuhro Hramoto, Nobutaka Shmada, Tadash Matsuo, Yoshak Shra Rtsumekan

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning Computer Anmaton and Vsualsaton Lecture 4. Rggng / Sknnng Taku Komura Overvew Sknnng / Rggng Background knowledge Lnear Blendng How to decde weghts? Example-based Method Anatomcal models Sknnng Assume

More information

Explicit Formulas and Efficient Algorithm for Moment Computation of Coupled RC Trees with Lumped and Distributed Elements

Explicit Formulas and Efficient Algorithm for Moment Computation of Coupled RC Trees with Lumped and Distributed Elements Explct Formulas and Effcent Algorthm for Moment Computaton of Coupled RC Trees wth Lumped and Dstrbuted Elements Qngan Yu and Ernest S.Kuh Electroncs Research Lab. Unv. of Calforna at Berkeley Berkeley

More information

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated. Some Advanced SP Tools 1. umulatve Sum ontrol (usum) hart For the data shown n Table 9-1, the x chart can be generated. However, the shft taken place at sample #21 s not apparent. 92 For ths set samples,

More information

Report on On-line Graph Coloring

Report on On-line Graph Coloring 2003 Fall Semester Comp 670K Onlne Algorthm Report on LO Yuet Me (00086365) cndylo@ust.hk Abstract Onlne algorthm deals wth data that has no future nformaton. Lots of examples demonstrate that onlne algorthm

More information