On the Computation of Document Frequency Statistics from Spoken Corpora using Factor Automata

Size: px
Start display at page:

Download "On the Computation of Document Frequency Statistics from Spoken Corpora using Factor Automata"

Transcription

1 INTERSPEECH 3 On the Computaton of Document Frequency Statstcs from Spoken Corpora usng Factor Automata Doğan Can, Shrkanth S. Narayanan, Department of Computer Scence and Department of Electrcal Engneerng, Unversty of Southern Calforna, Los Angeles, CA dogancan@usc.edu, shr@sp.usc.edu Abstract Factor automaton s an effcent data structure for representng all factors (substrngs) of a set of strngs (e.g. a fnte-state automaton). Ths noton can be generalzed to weghted automata by assocatng a weght to each factor. In ths paper, we consder the problem of computng expected document frequency (DF), and TF-IDF statstcs for all substrngs seen n a collecton of word lattces by means of factor automata. We present an algorthm whch transforms an acyclc weghted automaton, e.g. an ASR lattce, to a weghted factor automaton where the path weght of each factor represents the total weght assocated by the nput automaton to the set of strngs ncludng that factor at least once. We show how ths automaton can be used to effcently construct other types of weghted factor automata representng DF and TF-IDF statstcs for all factors seen n a large speech corpus. Compared to the state-of-the-art n computng these statstcs from spoken documents, our approach ) generalzes the statstcs from sngle words to contguous substrngs, ) provdes sgnfcant gans n terms of average run-tme and storage requrements and ) constructs effcent nverted ndex structures for retreval of such statstcs. Experments on a Turksh data set corroborate our clams. Index Terms: Factor Automata, Weghted Fnte-State Transducers, Lattce Indexng, Spoken Document Retreval. Introducton Speech retreval (SR) s a key technology whch ntegrates automatc speech recognton (ASR) and nformaton retreval (IR) to provde large scale access to spoken content. Computng document smlarty s a fundamental need n SR just as n the case of text retreval. Inverse document frequency (IDF) [, ] s an mportant term specfcty measure used n almost every IR system n some form. In ts most basc form, t s defned as log of the fracton of documents that nclude a term. IDF computaton nvolves computng document frequency (DF), smply defned as the number of documents that nclude a term. Term frequency (TF) s another mportant measure, whch n ts most basc form s defned as the number of occurrences of a term n a document. The more general class of term weghtng schemes known as TF-IDF, whch nvolves multplyng an IDF measure by a TF measure, consttutes the bass for quantfyng document smlarty n almost every IR system. Whle t s farly straghtforward to compute TF and DF measures from textual documents, the same cannot be sad for spoken ones snce the computaton should be carred out over probablty dstrbutons Supported by the Intellgence Advanced Research Projects Actvty (IARPA) va Department of Defense U.S. Army Research Laboratory (DoD / ARL) contract number W9NF--C-. over strngs, e.g. ASR lattces. Approxmatng TF and DF measures by the statstcs of -best ASR output s a common strategy used n SR. However, as shown by [3], proper estmaton of these measures over ASR lattces provdes sgnfcant gans n performance over the -best baselne. Factor automaton (Secton.3) s an effcent data structure for representng all factors (substrngs) of a set of strngs (e.g. a fnte-state automaton). Ths noton can be generalzed to weghted fnte-state automata [, 5] by assocatng a weght to each factor (Secton 3). In ths paper, we consder the problem of computng TF, DF, and TF-IDF measures n the expected sense for all factors seen n a collecton of word sequence lattces by means of weghted factor automata. We frst ntroduce the concept of factor weght of a strng x, defned as the total weght assocated by a weghted automaton to the set of strngs whch nclude x as a substrng at least once (Secton 3). Then, we present a novel algorthm (Secton 3.) for transformng an acyclc probablstc automaton, e.g. an ASR lattce, nto a term probablty factor automaton whch s an deal ndex structure for spoken utterance retreval applcatons. We then show how ths automaton can be used to effcently construct other weghted factor automata representng DF (Secton 3.), IDF and TF-IDF (Secton 3.3) statstcs for all factors of a collecton of ASR lattces. Fnally we provde experments evaluatng the average run-tme and storage requrements of our approach and conclude wth a bref dscusson (Secton ).. Prelmnares Ths secton ntroduces the defntons and notaton related to weghted transducers and automata [6], as well as the defntons for TF, DF and TF-IDF measures as used n ths paper... Semrngs Defnton A semrng s a 5-tuple (K,,,, ) where (K,, ) s a commutatve monod, (K,, ) s a monod, dstrbutes over and s an annhlator for. Table : Common semrngs. SEMIRING SET Boolean {, } Real R + {+ } + Max-tmes R + {+ } max Log R {, + } log + + Tropcal R {, + } mn + + a log b = log(e a + e b ) Copyrght 3 ISCA August 3, Lyon, France

2 .. Weghted Fnte-State Transducers and Automata Defnton A weghted fnte-state transducer T over a semrng (K,,,, ) s an 8-tuple T = (Σ,, Q, I, F, E, λ, ρ) where: Σ, are respectvely the fnte nput and output alphabets; Q s a fnte set of states; I, F Q are respectvely the set of ntal and fnal states; E Q (Σ {ε}) ( {ε}) K Q s a fnte set of arcs; λ I K, ρ F K are respectvely the ntal and fnal weght functons. Gven an arc e E, we denote by [e] ts nput label, o[e] ts output label, w[e] ts weght, p[e] ts orgn state and n[e] ts destnaton state. A path π = e e k s an element of E wth consecutve arcs satsfyng n[e ] = p[e ], =,..., k. We extend n and p to paths by settng n[π] = n[e k ] and p[π] = p[e ]. The labelng and the weght functons can also be extended to paths by defnng [π] = [e ] [e k ], o[π] = o[e ] o[e k ] and w[π] = w[e ] w[e k ]. We denote by Π(q, q ) the set of paths from q to q and by Π(q, x, y, q ) the set of paths from q to q wth nput label x Σ and output label y. These defntons can be extended to subsets S, S Q, e.g. Π(S, x, y, S ) = Π(q, x, y, q ). q S,q S An acceptng path n a transducer T s a path n Π(I, F ). A strng x s accepted by T f there exsts an acceptng path π labeled wth x on the nput sde. T s unambguous f for any strng x Σ there s at most one acceptng path labeled wth x on the nput sde. T s determnstc f t has at most one ntal state and at any state no two outgong transtons share the same nput label. Let π and π be two paths of a transducer T wth the same nput and output labels: [π] = [π ] and o[π] = o[π ]. We say that π = e e n and π = e e n are dentcal f they have the same number of transtons (n = n ) wth the same labels: [e k ] = [e k] and o[e k ] = o[e k] for k =,..., n. A transducer T s sad to be normalzed, f any two paths π and π wth the same nput and output labels are dentcal. The weght assocated by a transducer T to any nput-output strng par (x, y) Σ s gven by T (x, y) = π Π(I,x,y,F ) λ(p[π]) w[π] ρ(n[π]) and T (x, y) s defned to be when Π(I, x, y, F ) =. We denote by s[a] the -sum of the weghts of all acceptng paths of A when t s defned and n K. s[a] can be vewed as the shortest-dstance from the ntal states to the fnal states. A weghted fnte-state automaton A can be defned as a weghted fnte-state transducer wth dentcal nput and output labels. The weght assocated by A to (x, x) s denoted by A (x). Smlarly, n the graphcal representaton of weghted automata, output labels are omtted. A weghted automaton A defned over the probablty semrng (R +, +,,, ) s sad to be probablstc f for any state q Q, π Π(q,q) w[π], the sum of the weghts of all cycles at q, s well-defned and n R + and x Σ A (x) =..3. Factor Automata Defnton 3 Gven two strngs x, y Σ, x s a factor (substrng) of y f y = uxv for some u, v Σ. More generally, x s a factor of a language L Σ f x s a factor of some strng y L. The factor automaton S(y) of a strng y s the mnmal determnstc fnte-state automaton recognzng exactly the set of factors of y. S(y) can be bult n lnear tme and ts sze s lnear n the sze of the nput strng y [7, 8]. We denote by S(A) the mnmal determnstc automaton recognzng the set of factors of a fnte-state automaton A, that s the set of factors of the strngs accepted by A. A recent work [9] showed that the sze of the factor automaton S(A) s lnear n the sze of the nput automaton A = Q + E and provded an algorthm for the constructon of S(A) n lnear tme when A s unweghted... Term Frequency, Document Frequency, TF-IDF We defne the term frequency T F (x, ) as the number of occurrences of a term x n document D, and the document frequency DF (x) as the fracton of documents ncludng the term x n a collecton of documents {D =,, n}: DF (x) = { x D}. () n TF-IDF can be defned n a number of ways by choosng a TF and an IDF defnton from a multtude of optons. Here we use: T F IDF (x, ) = T F (x, ) log DF (x). () 3. Factor Automata of Weghted Automata Factor automaton (FA) noton can be generalzed to weghted automata by assocatng a weght to each factor. When the nput automata are probablstc, there are several meanngful statstcs that can be used as a weght. The weghted ndex structure descrbed n [] uses expected term frequences derved from probablstc automata. In general, any weght satsfyng the semrng condtons can be used. For nstance, the tmed factor transducer structure descrbed n [5] uses (probablty, start tme, end tme) trplets derved from tme algned ASR lattces. The unweghted FA can be thought as a weghted automaton over the Boolean semrng where the path weghts represent factor occurrence,.e. each factor of the nput automaton s assgned weght, any other strng s assgned weght. The natural extenson of ths occurrence concept to probablstc automata s smply the probablty of occurrence (at least once). Gven a probablstc nput automaton, the probablty of occurrence of a factor s smply the sum of probabltes assgned to each acceptng path ncludng that factor. More formally, for a weghted fnte-state automaton A over the semrng K, we defne the factor weght A (x) of a strng x Σ as the -sum of the weghts of all acceptng paths ncludng x as a factor: A (x) = π Π(I,F ) u,v Σ [π]=uxv λ(p[π]) w[π] ρ(n[π]) (3) Over the Boolean semrng, factor weght s smply the logcal dsjuncton of Boolean path weghts and reduces to factor occurrence. Over the real (equvalently log) semrng, factor weght corresponds to the sum ( log -sum) of path weghts, whch s smply the probablty ( log probablty) of occurrence of a factor (at least once) when the nput automaton s probablstc. Next secton presents a recpe based on weghted transducer algorthms [6] for transformng an acyclc probablstc automaton nto a term probablty (TP) factor automaton where the path weght of each factor represents ts probablty of occurrence,.e. factor weght over the real semrng. TP factor automaton has an deal weght structure for spoken utterance retreval [, ] applcatons where the goal s to retreve all utterances 7

3 a/ b/ b/ a/ 3/ ε/.5 b/ a/ ε/ a/.5 3 / a/ /.5 b/ b/.5 /.5 a/.5 3/ (a) (b) (c) ε/ ε/.5 ε:α/ a:a/ ε:α3/ b:b/ ε:α/ a:a/ ε:α/ a:/ b:/ b:/.5 ε:/.5 3 ε:/.5 ε:/.5 ε:/ a:/ 5 ε:/ ε:/ 7 8 ε:/ ε:/ ε:/ / δ:β/ δ3:β3/ (d) 6/ δ:β/ δ:β/ Fgure : Constructon of S from A. (a) Input automaton (A) over the real semrng, (b) after preprocessng (B), (c) resultng factor automaton (S), (d) after factor generaton (T ), (e) after factor normalzaton and determnzaton over the max-tmes semrng (N). ε:/.5 ε:/.5 a:/.5 (e) 6 ε:/ a:/ 9 contanng a query term n a large spoken archve. We should note that, unlke TF whch can be expressed as a sum over the posteror probabltes of factor occurrences [], TP measure gven by (3) s a sum over the probabltes of acceptng paths ncludng a factor. Ths subtle dfference makes the TP computaton sgnfcantly harder snce we have to make sure that the acceptng paths ncludng a factor multple tmes are counted only once. The man dea s to construct a mappng from any factor to the acceptng paths ncludng t and elmnatng the path replcatons caused by factors occurrng multple tmes n a path before dong the probablty summaton. 3.. Term Probablty Factor Automaton Let A denote an acyclc automaton over the log (real) semrng. Fgure a gves such an automaton over the real semrng. We can derve the TP factor automaton S from A n four steps: 3... Preprocessng We assume the nput s an acyclc probablstc automaton over the log (real) semrng. When that s not the case, we can use the general weght-pushng algorthm [6] to convert nput weghts to posteror probabltes. We also requre the nput to satsfy three specfc condtons whch are crucal for the correctness of the algorthm: ) t should be unambguous, ) no two arcs comng nto a state should have the same label and ) t should be ε- free except for the arcs orgnatng from the ntal states. All of these condtons can be satsfed by reversng the nput automaton, applyng weghted epslon removal and determnzaton on the reverse machne, and then reversng the resultng automaton [6]. Fgure b llustrates the result of these operatons Generaton Let B = (Σ, Σ, Q, I, F, E, λ, ρ) denote the automaton obtaned after preprocessng A. We generate factors of B (Fgure d) smply by creatng a unque ntal state q I / Q, a unque fnal state q F / Q, and two new arcs for each nonntal state q Q I: (q I, ε, α q,, q) and (q, δ q, β q,, q F ). Here, δ q s a unque dsambguaton symbol. α q (or β q) s a unque non-termnal symbol representng the prefx transducer U q = ({ε}, Σ, Q, I, {q}, E, λ, ) (resp. suffx transducer V q = ({ε}, Σ, Q, {q}, F, E,, ρ)) derved from B by settng q as the sole fnal (resp. ntal) state and replacng the nput symbols wth ε,.e. E = {(p[e], [e], ε, w[e], n[e]) e E}. Techncally, the (root) transducer T obtaned after these changes and the set of fnte state transducers {U q q Q I} {V q q Q I} defne a recursve transton network (RTN) []. Due to the unque dsambguaton symbols {δ q q Q I} on the nput sde of the new arcs nto the unque fnal state q F and the fact that B s unambguous wth no two arcs comng nto a state havng the same label, T s a functonal transducer. Optonally, we can flter the generated factors by composng a fnte-state grammar G on the nput sde of T []. The default flter rejects empty factors,.e. any path π n T for whch the only non-ε nput label s the dsambguaton symbol at the end Normalzaton We obtan a normalzed transducer N from T by applyng weghted transducer determnzaton, replacng the dsambguaton symbols wth ε symbols, replacng the non-termnal symbols [] wth the correspondng prefx and suffx transducers and fnally applyng weghted epslon removal. N defnes a mappng from each factor x of B to the set of paths n B that nclude x as a factor. Weght of a path π n N equals B (o[π]), the weght assocated by B to the path defned by the output symbols o[π]. Furthermore, all paths n N wth the same nput and output labels are dentcal and all dentcal paths n N have the same weght by constructon snce B s unambguous and the path weghts n N are determned solely by the output labels. We merge the dentcal paths n N by vewng t as an acceptor,.e. encodng nput-output labels as a sngle label, and applyng weghted automata determnzaton over the tropcal (max-tmes) semrng. Snce tropcal semrng s dempotent, determnzaton smply elmnates the repeatng paths whch are generated when a factor occurs multple tmes on an acceptng path of B. Note that t s crucal to use an dempotent semrng at ths pont snce otherwse repeatng paths are merged by -summng dentcal path weghts whch n general produces a sum dfferent from the nput path weghts. Fgure e llustrates the normalzed transducer (wth the dashed arc) and the fnal result after encoded determnzaton (wthout the dashed arc) Optmzaton At ths step, we project N to ts nput labels (representng factors) and apply weghted ε-removal and mnmzaton [6] over 8

4 the log semrng to obtan the TP factor automaton S. Note that we can drectly apply mnmzaton snce the machne s already determnstc. Fgure c llustrates the result of optmzaton. 3.. Document Frequency Factor Automaton Usng TP factor automata, we can effcently compute and store expected document frequences for each factor of a collecton of probablstc automata, e.g. ASR lattces. For each nput automaton A of the collecton {A =,..., n}, we construct a TP factor automaton S over the log semrng. The document frequency factor automaton S DF of the entre collecton s constructed smply by takng the unon U of ndvdual TP factor automata, concatenatng a smple automaton N = ({ε}, {ε}, {, }, {}, {}, {(, ε, ε, log(n), )},, ) representng the sze of the collecton and U, and fnally applyng weghted ε-removal, determnzaton, and mnmzaton over the log semrng Term Frequency and TF-IDF Factor Automaton We can use the algorthm descrbed n [] to construct a term frequency factor automaton S TF for each nput automaton A n the collecton. These automata can be combned wth the DF factor automaton S DF of the entre collecton to construct the TF-IDF factor automata S TF,TF IDF n whch path weghts represent (TF, TF-IDF) pars. The desred relaton between these automata can be expressed as: S TF,TF IDF (x) = ( S TF (x), S TF (x) log S DF (x)) Ths operaton can be carred out wth the weghted ntersecton operaton over a specal semrng structure known as the expectaton (or entropy) semrng [, 3]. Expectaton semrng s defned as follows: E = ((R {, + }) (R {, + }),,, (, ), (, )) (x, y ) (x, y ) = (x + x, y + y ) (x, y ) (x, y ) = (x x, x y + x y ) Let log A denote the weghted automaton derved from A by replacng each weght w R + by log w and let Φ (A) and Φ (A) denote the weghted automata over the expectaton semrng derved from A by replacng each weght w by the par (w, ) and (, w) respectvely. The factor automata representng TF, DF and (TF, TF-IDF) statstcs satsfy the followng dentty n the expectaton semrng: S TF,TF IDF = Φ (S TF ) Φ ( log S DF ) Here the second term on the rght can be recognzed as the factor automaton representng IDF statstcs. Hence, TF-IDF computaton reduces to weghted automata ntersecton n the expectaton semrng. Consder the vector space model defned over factors,.e. each dmenson corresponds to a factor. Inner product computaton n ths vector space between a query factor automaton Φ (SQ TF ) representng term frequences over the expectaton semrng and each factor automaton {S TF,TF IDF =,..., n} can be carred out by ntersectng the two automata and then performng a sngle-source shortestdstance computaton [] over the entropy semrng: < Φ (S TF Q ), S TF,TF IDF >= s[φ (S TF Q ) S TF,TF IDF ] One applcaton for ths nner product s the computaton of cosne smlarty between two spoken documents, e.g. ASR lattces correspondng to a voced query and a spoken document. Table : Runtme Results: TP Factor Automata vs. Baselne Max length 3 6 all log (# factors) Baselne tme (s) TP FA tme (s) Table 3: Factor Automata Comparson FA Type UW TF TP DF TF-IDF Σ S (M) 6 7 On dsk (MB) Tme (mn) Experments and Dscusson We conducted experments on the tranng subset of the Turksh language pack provded by the IARPA Babel program whch ncludes 8 hours of conversatonal telephone speech. Lattces were generated wth a speaker dependent DNN ASR system that was traned on the same data set usng IBM s Attla toolkt. All lattces were pruned to a logarthmc beam wdth of 5. Estmatng document frequences of sngle-word factors n a collecton of lattces has prevously been addressed n [3]. Ther recpe for computng probablty of occurrence conssts of composng the nput lattce wth a smple fnte-state flter that rejects the paths ncludng the target word, computng the total probablty of the remanng paths and complementng. Computaton s carred out one factor at a tme,.e. factors are enumerated and each one s processed ndependently. We mplemented a generalzed verson of ths recpe whch can be used wth mult-word factors for our baselne results. Both ths baselne algorthm and the factor automata constructon algorthms n consderaton were mplemented usng the OpenFst Lbrary [5]. Table gves a runtme comparson between the baselne and the TP factor automata constructon algorthm. We randomly selected lattces from our data set (total sze: #states + #arcs = 8K, dsk sze:.mb) and compared the total runtme whle changng the maxmum factor length wth a fntestate length restrcton flter []. Runtme complexty of the baselne method s exponental n the maxmum factor length (or lnear n the number of factors) due to the enumeraton of factors. Proposed method takes advantage of weghted transducer algorthms to do the computaton jontly for all factors. Table 3 compares the total runtme and storage requrements for varous factor automata. For these experments, we used the entre tranng set whch ncludes 88K lattces (total sze: #states + #arcs = 33M, dsk sze: 8MB) and a maxmum factor length of 3. Frst column (UW) represents the unweghted factor automata obtaned by removng all weghts from the nput lattces. Storage requrements seem to be comparable for the types of factor automata n consderaton. The runtme for the constructon of DF factor automaton excludes the tme spent for the constructon of TP factor automata. Smlarly the runtme for the constructon of TF-IDF factor automata excludes the tme spent for the constructon of TF and DF factor automata. Just lke TF factor automaton constructon [], TP factor automaton constructon algorthm of the prevous secton s worst case exponental n the nput sze and may blow up for some nput. The runtme reported n Table 3 s domnated by the tme spent on a small fracton of nput lattces for whch the algorthm took several mnutes to complete. The dfference n the average runtme characterstcs calls for further study. 9

5 5. References [] S. Robertson, Understandng nverse document frequency: On theoretcal arguments for df, Journal of Documentaton, vol. 6, p.,. [] D. Harman, The hstory of df and ts nfluences on r and other felds, n Chartng a New Course: Natural Language Processng and Informaton Retreval, ser. The Kluwer Internatonal Seres on Informaton Retreval, J. Tat, Ed. Sprnger Netherlands, 5, vol. 6, pp [3] D. Karakos, M. Dredze, K. W. Church, A. Jansen, and S. Khudanpur, Estmatng document frequences n a speech corpus. n ASRU, D. Nahamoo and M. Pcheny, Eds. IEEE,, pp. 7. [] C. Allauzen, M. Mohr, and M. Saraclar, General ndexaton of weghted automata: Applcaton to spoken utterance retreval, n Proc. HLT-NAACL Workshop on Interdscplnary Approaches to Speech Indexng and Retreval, Boston, MA, USA,, pp. 33. [5] D. Can and M. Saraclar, Lattce ndexng for spoken term detecton, Audo, Speech, and Language Processng, IEEE Transactons on, vol. 9, no. 8, pp , nov.. [6] M. Mohr, Weghted automata algorthms, n Handbook of Weghted Automata, ser. Monographs n Theoretcal Computer Scence. An EATCS Seres, M. Droste, W. Kuch, and H. Vogler, Eds. Sprnger Berln Hedelberg, 9, pp [7] A. Blumer, J. Blumer, A. Ehrenfeucht, D. Haussler, M. T. Chen, and J. Seferas, The smallest automaton recognsng the subwords of a text, Theoretcal Computer Scence, vol., pp. 3 55, 985. [8] M. Crochemore, Transducers and repettons, Theoretcal Computer Scence, vol. 5, no., pp , 986. [9] M. Mohr, P. Moreno, and E. Wensten, General suffx automaton constructon algorthm and space bounds, Theoretcal Computer Scence, vol., no. 37, pp , 9. [] M. Saraclar and R. Sproat, Lattce-based search for spoken utterance retreval, n Proc. HLT-NAACL, Boston, MA, USA,, pp [] C. Allauzen and M. Rley, A pushdown transducer extenson for the openfst lbrary, n Proceedngs of the 7th nternatonal conference on Implementaton and Applcaton of Automata, ser. CIAA. Berln, Hedelberg: Sprnger-Verlag,, pp [] J. Esner, Expectaton semrngs: Flexble EM for fnte-state transducers, n Proceedngs of the ESSLLI Workshop on Fnte- State Methods n Natural Language Processng (FSMNLP), G. van Noord, Ed., Helsnk, Aug., extended abstract (5 pages). [3] C. Cortes, M. Mohr, A. Rastog, and M. Rley, On the computaton of the relatve entropy of probablstc automata, Int. J. Found. Comput. Sc., vol. 9, no., pp. 9, 8. [] M. Mohr, Semrng frameworks and algorthms for shortestdstance problems, Journal of Automata, Languages and Combnatorcs, vol. 7, no. 3, pp. 3 35,. [5] C. Allauzen, M. Rley, J. Schalkwyk, W. Skut, and M. Mohr, OpenFst: A general and effcent weghted fnte-state transducer lbrary, n Proceedngs of the Nnth Internatonal Conference on Implementaton and Applcaton of Automata, (CIAA 7), ser. Lecture Notes n Computer Scence, vol Sprnger, 7, pp. 3,

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

such that is accepted of states in , where Finite Automata Lecture 2-1: Regular Languages be an FA. A string is the transition function,

such that is accepted of states in , where Finite Automata Lecture 2-1: Regular Languages be an FA. A string is the transition function, * Lecture - Regular Languages S Lecture - Fnte Automata where A fnte automaton s a -tuple s a fnte set called the states s a fnte set called the alphabet s the transton functon s the ntal state s the set

More information

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1) Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach Data Representaton n Dgtal Desgn, a Sngle Converson Equaton and a Formal Languages Approach Hassan Farhat Unversty of Nebraska at Omaha Abstract- In the study of data representaton n dgtal desgn and computer

More information

UB at GeoCLEF Department of Geography Abstract

UB at GeoCLEF Department of Geography   Abstract UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,

More information

Private Information Retrieval (PIR)

Private Information Retrieval (PIR) 2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market

More information

Concurrent models of computation for embedded software

Concurrent models of computation for embedded software Concurrent models of computaton for embedded software and hardware! Researcher overvew what t looks lke semantcs what t means and how t relates desgnng an actor language actor propertes and how to represent

More information

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION 1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

Related-Mode Attacks on CTR Encryption Mode

Related-Mode Attacks on CTR Encryption Mode Internatonal Journal of Network Securty, Vol.4, No.3, PP.282 287, May 2007 282 Related-Mode Attacks on CTR Encrypton Mode Dayn Wang, Dongda Ln, and Wenlng Wu (Correspondng author: Dayn Wang) Key Laboratory

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

Description of NTU Approach to NTCIR3 Multilingual Information Retrieval

Description of NTU Approach to NTCIR3 Multilingual Information Retrieval Proceedngs of the Thrd NTCIR Workshop Descrpton of NTU Approach to NTCIR3 Multlngual Informaton Retreval Wen-Cheng Ln and Hsn-Hs Chen Department of Computer Scence and Informaton Engneerng Natonal Tawan

More information

Available online at Available online at Advanced in Control Engineering and Information Science

Available online at   Available online at   Advanced in Control Engineering and Information Science Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

Optimal Workload-based Weighted Wavelet Synopses

Optimal Workload-based Weighted Wavelet Synopses Optmal Workload-based Weghted Wavelet Synopses Yoss Matas School of Computer Scence Tel Avv Unversty Tel Avv 69978, Israel matas@tau.ac.l Danel Urel School of Computer Scence Tel Avv Unversty Tel Avv 69978,

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Chapter 6 Programmng the fnte element method Inow turn to the man subject of ths book: The mplementaton of the fnte element algorthm n computer programs. In order to make my dscusson as straghtforward

More information

A fast algorithm for color image segmentation

A fast algorithm for color image segmentation Unersty of Wollongong Research Onlne Faculty of Informatcs - Papers (Arche) Faculty of Engneerng and Informaton Scences 006 A fast algorthm for color mage segmentaton L. Dong Unersty of Wollongong, lju@uow.edu.au

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

A Fuzzy Image Matching Algorithm with Linguistic Spatial Queries

A Fuzzy Image Matching Algorithm with Linguistic Spatial Queries Fuzzy Matchng lgorthm wth Lngustc Spatal Queres TZUNG-PEI HONG, SZU-PO WNG, TIEN-HIN WNG, EEN-HIN HIEN epartment of Electrcal Engneerng, Natonal Unversty of Kaohsung Insttute of Informaton Management,

More information

The Shortest Path of Touring Lines given in the Plane

The Shortest Path of Touring Lines given in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 262 The Open Cybernetcs & Systemcs Journal, 2015, 9, 262-267 The Shortest Path of Tourng Lnes gven n the Plane Open Access Ljuan Wang 1,2, Dandan He

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

b * -Open Sets in Bispaces

b * -Open Sets in Bispaces Internatonal Journal of Mathematcs and Statstcs Inventon (IJMSI) E-ISSN: 2321 4767 P-ISSN: 2321-4759 wwwjmsorg Volume 4 Issue 6 August 2016 PP- 39-43 b * -Open Sets n Bspaces Amar Kumar Banerjee 1 and

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

An Approach in Coloring Semi-Regular Tilings on the Hyperbolic Plane

An Approach in Coloring Semi-Regular Tilings on the Hyperbolic Plane An Approach n Colorng Sem-Regular Tlngs on the Hyperbolc Plane Ma Louse Antonette N De Las Peñas, mlp@mathscmathadmueduph Glenn R Lago, glago@yahoocom Math Department, Ateneo de Manla Unversty, Loyola

More information

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements Module 3: Element Propertes Lecture : Lagrange and Serendpty Elements 5 In last lecture note, the nterpolaton functons are derved on the bass of assumed polynomal from Pascal s trangle for the fled varable.

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

Analysis of Continuous Beams in General

Analysis of Continuous Beams in General Analyss of Contnuous Beams n General Contnuous beams consdered here are prsmatc, rgdly connected to each beam segment and supported at varous ponts along the beam. onts are selected at ponts of support,

More information

A NOTE ON FUZZY CLOSURE OF A FUZZY SET

A NOTE ON FUZZY CLOSURE OF A FUZZY SET (JPMNT) Journal of Process Management New Technologes, Internatonal A NOTE ON FUZZY CLOSURE OF A FUZZY SET Bhmraj Basumatary Department of Mathematcal Scences, Bodoland Unversty, Kokrajhar, Assam, Inda,

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS by XUNYU PAN (Under the Drecton of Suchendra M. Bhandarkar) ABSTRACT In modern tmes, more and more

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Integrated Expression-Invariant Face Recognition with Constrained Optical Flow

Integrated Expression-Invariant Face Recognition with Constrained Optical Flow Integrated Expresson-Invarant Face Recognton wth Constraned Optcal Flow Chao-Kue Hseh, Shang-Hong La 2, and Yung-Chang Chen Department of Electrcal Engneerng, Natonal Tsng Hua Unversty, Tawan 2 Department

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

Load-Balanced Anycast Routing

Load-Balanced Anycast Routing Load-Balanced Anycast Routng Chng-Yu Ln, Jung-Hua Lo, and Sy-Yen Kuo Department of Electrcal Engneerng atonal Tawan Unversty, Tape, Tawan sykuo@cc.ee.ntu.edu.tw Abstract For fault-tolerance and load-balance

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

A Post Randomization Framework for Privacy-Preserving Bayesian. Network Parameter Learning

A Post Randomization Framework for Privacy-Preserving Bayesian. Network Parameter Learning A Post Randomzaton Framework for Prvacy-Preservng Bayesan Network Parameter Learnng JIANJIE MA K.SIVAKUMAR School Electrcal Engneerng and Computer Scence, Washngton State Unversty Pullman, WA. 9964-75

More information

Transaction-Consistent Global Checkpoints in a Distributed Database System

Transaction-Consistent Global Checkpoints in a Distributed Database System Proceedngs of the World Congress on Engneerng 2008 Vol I Transacton-Consstent Global Checkponts n a Dstrbuted Database System Jang Wu, D. Manvannan and Bhavan Thurasngham Abstract Checkpontng and rollback

More information

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following. Complex Numbers The last topc n ths secton s not really related to most of what we ve done n ths chapter, although t s somewhat related to the radcals secton as we wll see. We also won t need the materal

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Cordial and 3-Equitable Labeling for Some Star Related Graphs

Cordial and 3-Equitable Labeling for Some Star Related Graphs Internatonal Mathematcal Forum, 4, 009, no. 31, 1543-1553 Cordal and 3-Equtable Labelng for Some Star Related Graphs S. K. Vadya Department of Mathematcs, Saurashtra Unversty Rajkot - 360005, Gujarat,

More information

Semi - - Connectedness in Bitopological Spaces

Semi - - Connectedness in Bitopological Spaces Journal of AL-Qadsyah for computer scence an mathematcs A specal Issue Researches of the fourth Internatonal scentfc Conference/Second صفحة 45-53 Sem - - Connectedness n Btopologcal Spaces By Qays Hatem

More information

Array transposition in CUDA shared memory

Array transposition in CUDA shared memory Array transposton n CUDA shared memory Mke Gles February 19, 2014 Abstract Ths short note s nspred by some code wrtten by Jeremy Appleyard for the transposton of data through shared memory. I had some

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007 Syntheszer 1.0 A Varyng Coeffcent Meta Meta-Analytc nalytc Tool Employng Mcrosoft Excel 007.38.17.5 User s Gude Z. Krzan 009 Table of Contents 1. Introducton and Acknowledgments 3. Operatonal Functons

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CPS 0 Introducton to Computer Scence Lecture Notes Chapter : Algorthm Desgn How should we present algorthms? Natural languages lke Englsh, Spansh, or French whch are rch n nterpretaton and meanng are not

More information

Bridges and cut-vertices of Intuitionistic Fuzzy Graph Structure

Bridges and cut-vertices of Intuitionistic Fuzzy Graph Structure Internatonal Journal of Engneerng, Scence and Mathematcs (UGC Approved) Journal Homepage: http://www.jesm.co.n, Emal: jesmj@gmal.com Double-Blnd Peer Revewed Refereed Open Access Internatonal Journal -

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

Pruning Training Corpus to Speedup Text Classification 1

Pruning Training Corpus to Speedup Text Classification 1 Prunng Tranng Corpus to Speedup Text Classfcaton Jhong Guan and Shugeng Zhou School of Computer Scence, Wuhan Unversty, Wuhan, 430079, Chna hguan@wtusm.edu.cn State Key Lab of Software Engneerng, Wuhan

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

A Hybrid Genetic Algorithm for Routing Optimization in IP Networks Utilizing Bandwidth and Delay Metrics

A Hybrid Genetic Algorithm for Routing Optimization in IP Networks Utilizing Bandwidth and Delay Metrics A Hybrd Genetc Algorthm for Routng Optmzaton n IP Networks Utlzng Bandwdth and Delay Metrcs Anton Redl Insttute of Communcaton Networks, Munch Unversty of Technology, Arcsstr. 21, 80290 Munch, Germany

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

A Bilinear Model for Sparse Coding

A Bilinear Model for Sparse Coding A Blnear Model for Sparse Codng Davd B. Grmes and Rajesh P. N. Rao Department of Computer Scence and Engneerng Unversty of Washngton Seattle, WA 98195-2350, U.S.A. grmes,rao @cs.washngton.edu Abstract

More information

Can We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search

Can We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search Can We Beat the Prefx Flterng? An Adaptve Framework for Smlarty Jon and Search Jannan Wang Guolang L Janhua Feng Department of Computer Scence and Technology, Tsnghua Natonal Laboratory for Informaton

More information

CHAPTER 2 DECOMPOSITION OF GRAPHS

CHAPTER 2 DECOMPOSITION OF GRAPHS CHAPTER DECOMPOSITION OF GRAPHS. INTRODUCTION A graph H s called a Supersubdvson of a graph G f H s obtaned from G by replacng every edge uv of G by a bpartte graph,m (m may vary for each edge by dentfyng

More information

Syntactic Tree-based Relation Extraction Using a Generalization of Collins and Duffy Convolution Tree Kernel

Syntactic Tree-based Relation Extraction Using a Generalization of Collins and Duffy Convolution Tree Kernel Syntactc Tree-based Relaton Extracton Usng a Generalzaton of Collns and Duffy Convoluton Tree Kernel Mahdy Khayyaman Seyed Abolghasem Hassan Abolhassan Mrroshandel Sharf Unversty of Technology Sharf Unversty

More information

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article Avalable onlne www.jocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(6):2512-2520 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 Communty detecton model based on ncremental EM clusterng

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

Fuzzy Modeling of the Complexity vs. Accuracy Trade-off in a Sequential Two-Stage Multi-Classifier System

Fuzzy Modeling of the Complexity vs. Accuracy Trade-off in a Sequential Two-Stage Multi-Classifier System Fuzzy Modelng of the Complexty vs. Accuracy Trade-off n a Sequental Two-Stage Mult-Classfer System MARK LAST 1 Department of Informaton Systems Engneerng Ben-Guron Unversty of the Negev Beer-Sheva 84105

More information

Biostatistics 615/815

Biostatistics 615/815 The E-M Algorthm Bostatstcs 615/815 Lecture 17 Last Lecture: The Smplex Method General method for optmzaton Makes few assumptons about functon Crawls towards mnmum Some recommendatons Multple startng ponts

More information

Document Representation and Clustering with WordNet Based Similarity Rough Set Model

Document Representation and Clustering with WordNet Based Similarity Rough Set Model IJCSI Internatonal Journal of Computer Scence Issues, Vol. 8, Issue 5, No 3, September 20 ISSN (Onlne): 694-084 www.ijcsi.org Document Representaton and Clusterng wth WordNet Based Smlarty Rough Set Model

More information

OPL: a modelling language

OPL: a modelling language OPL: a modellng language Carlo Mannno (from OPL reference manual) Unversty of Oslo, INF-MAT60 - Autumn 00 (Mathematcal optmzaton) ILOG Optmzaton Programmng Language OPL s an Optmzaton Programmng Language

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

A fault tree analysis strategy using binary decision diagrams

A fault tree analysis strategy using binary decision diagrams Loughborough Unversty Insttutonal Repostory A fault tree analyss strategy usng bnary decson dagrams Ths tem was submtted to Loughborough Unversty's Insttutonal Repostory by the/an author. Addtonal Informaton:

More information