Shadow Document Methods of Results Merging

Size: px
Start display at page:

Download "Shadow Document Methods of Results Merging"

Transcription

1 Shadow Documet Methods of Results Mergig Shegli Wu ad Fabio Crestai Departmet of Computer ad Iformatio Scieces Uiversity of Strathclyde, Glasgow, UK ABSTRACT I distributed iformatio retrieval systems, documet overlaps occur frequetly across results from differet databases. This is especially the case for meta-search egies which merge results from several geeral-purpose web search egies. This paper addresses the problem of mergig results which cotai overlaps i order to achieve better performace. Several algorithms for mergig results are proposed, which take advatage of the use of duplicate documets i two ways: oe correlates scores from differet results; the other regards duplicates as icreasig evidece of beig relevat to the give query. A variety of experimets have demostrated that these methods are effective. Categories ad Subject Descriptors H.3.3 [Iformatio Search ad Retrieval]: [Distributed Iformatio Retrieval, Result Mergig] Geeral Terms Algorithms Keywords Iformatio Retrieval, Data Fusio 1. INTRODUCTION With the widespread use of wide area etworks ad especially the Iteret, o-lie iformatio systems proliferate very rapidly. Users ofte fid it ecessary to search differet databases to satisfy a iformatio eed. I additio, for iformatio that is proprietary, costs moey, or its publisher wishes to cotrol it carefully, it caot be collected ad idexed i a cetralised database eviromet. I such cases, distributed iformatio retrieval (DIR) systems become a alterative solutio, ad oe i which mergig of results is a aspect which demads careful attetio. Permissio to make digital or hard copies of all or part of this work for persoal or classroom use is grated without fee provided that copies are ot made or distributed for profit or commercial advatage ad that copies bear this otice ad the full citatio o the first page. To copy otherwise, to republish, to post o servers or to redistribute to lists, requires prior specific permissio ad/or a fee. SAC 04, March 14-17, 2004, Nicosia, Cyprus. Copyright 2004 ACM /03/04...$5.00. For mergig of results, three cases ca be idetified depedig o the degree of overlap amog the selected databases for a give query [11]: 1. the databases are pairwise disjoit or early disjoit; 2. the databases overlap but are ot idetical; 3. the databases are idetical. Both cases 1 ad 3 have bee discussed extesively by may researchers, for example, i [2, 4, 5, 10, 14, 16, 17, 19]. However, results mergig algorithms suitable for Case 2 remais a ope questio [11]. Previously, meta search egies treated this i two typical ways. Oe regards duplicates as redudat iformatio, which are discarded whe we fid out that they exist. Such a method was used by Metacrawler i its early stage [14]. The other is usig some methods such as liear combiatio, which are suitable for Case 3. The latter method was used by Profusio [7]. Neither of the above solutios is a good solutio: the first does ot use the iformatio implied by the duplicates for better performace, while the secod igores the differece betwee idetical databases ad partially overlappig databases. I case 2, the results mergig problem is differet ad eve more complicated tha that i case 1 or case 3. I this paper, we preset a group of algorithms which are suitable for case 2. Our solutio itegrates the techiques used for case 1 ad case 3. Experimetal results are preseted for compariso of these algorithms. The rest of the paper is orgaised as follows. I Sectio 2 we review some related work o data fusio with totally idetical databases ad results mergig without overlappig. Sectio 3 presets a group of methods for mergig results from overlappig databases. The experimetal settig ad results are preseted i Sectio 4. Sectio 5 cocludes the paper. 2. RELATED WORK There has bee a lot of research o result mergig with idetical databases, which is called data fusio i may publicatios. Fox ad Shaw [6] proposed a umber of combiatio techiques icludig operators like Mi, Max, Comb- Sum ad CombMNZ. CombSum sets the score of each documet i the combiatio to the sum of the scores obtaied by the idividual database, while i CombMNZ the score of each documet was obtaied by multiplyig this sum by the umber of databases which had o-zero scores. Note that summig (CombSum) is equivalet to averagig while

2 CombMNZ is equivalet to weighted averagig. Lee [10] studied this further with six differet servers. His cotributio was to ormalise each iformatio retrieval server o a per-query basis which improved results substatially. Lee showed that CombMNZ worked best, followed by CombSum while operators like Mi ad Max were the worst. I additio, liear combiatio is valuable alterative. I Vogt ad Cottrell [17] the relevace of a documet to a query is computed by combiig both a score that captures the quality of each database ad a score that captures the quality of the documet with respect to the query. Formally, if q is a query, d is a documet, s is the umber of databases, ad w = (w 1, w 2,..., w s) are the database scores, the the overall relevace p of d P i the cotext of the combied list is give s by p(w, d, q) = i=1 w ip i (d, q). For o-overlappig results from differet databases, Roudrobi is a simple mergig method, which takes oe documet i tur from each available result set. However, the effectiveess of such a method depeds o the performaces of compoet database servers. If all the results have similar effectiveess, the Roud-robi performs well; if some of the results are very poor, the Roud-robi becomes very poor as well. Voorhees et al. [16] demostrated a way of improvig the above Roud-robi method. By ruig some traiig queries, they estimated the performace of each database. Whe a ew query is ecoutered, they retrieve a certai umber of documets from each database based o its estimated performace. Calla et al. [2] proposed a mergig strategy based o the scores achieved by both database ad documet. The score for each database is calculated by a database selectio algorithm, CORI, which idicates the goodess of a database with respect to a give query amog a set of available databases. The score of a documet is the value that documet obtais from that database, which idicates the goodess of that documet to a give query amog a set of retrieved documets from that database. Calve ad Savoy [3] proposed a mergig strategy based o logistic regressio. Some traiig queries are eeded for settig up the model. Their experimets show that the logistic regressio approach is sigificatly better tha Roud-robi, raw-score, ad ormalised raw-score approaches. Si ad Calla [15] used a liear regressio model for results mergig, with a assumptio that there is a cetralised sample database which stores a certai umber of documets from each database. The DIR broker rus the same query o the sample database as differet database servers do, picks up those duplicate documets i the results for the sample database ad for each database, ad ormalises the scores of these documets i all results by a liear regressio aalysis with the documet scores from the sample database servig as a stadard. Their experimetal results show that their method is slightly better tha CORI. Fially, let us review the results mergig policies used by some meta search egies. Metacrawler [14] is probably oe of the first meta search egies that were developed i the world wide web. Its policy was relatively simple, elimiatig duplicate URLs first the mergig the results i a Roud-robi fashio. Profusio [7], aother early meta search egie, used the liear combiatio algorithm. Savvy Search [8] focused primarily o the problem of learig ad idetifyig the right set of search egies to which to issue the queries, ad to a lesser extet o how the retured results are merged. Iquirus [9] took a differet approach to the problem. Istead of relyig o the egie s rakig mechaism, It retrieves the full cotets of the web pages retured ad rak them more like a traditioal search egie usig iformatio retrieval techiques applicable to full documets oly. Similar to Iquirus, Mearf [13] retrieved the full cotets of the web pages retured, the re-raked them based o the aalytical results o cotets ad/or liks of these web pages. Although regressio aalysis has bee used by Calve ad Savoy ad Si ad Calla, we use this method i a differet situatio, where partial overlaps exist betwee differet databases. I our case, o traiig is eeded as i Calve ad Savoy s work, ad either a cetralised sample database or cetralised iformatio retrieval system are eeded as i Si ad Calla s work. O the other had, how to merge documets which appear i oly oe result with those which appear i several differet results is a ew issue, which has ot bee cosidered before. 3. MERGING RESULTS FROM PARTIALLY OVERLAPPING DATABASES I this paper, we assume that we have a group of databases which are partially overlappig. For a give query, we obtai results from these databases. Except that, o descriptio iformatio about these databases is required. I additio, we do ot cosider dowloadig all the documets from differet databases for re-rakig. Usually, we do ot eed to check the full texts of all the documets i order to idetify may of the duplicates. For example, i most cases the URL ca be used as a idetifier for every documet i web search egies. I aother example we may estimate that two documets are the same by comparig their meta-data, such as the titles, authors, publishers, ad so o. Besides, oe assumptio we make is that all documets i the results have scores which have bee obtaied from their database servers. From overlappig databases, we have more iformatio (duplicates) to merge results tha from pairwise disjoit databases, i which o database has ay commo documets with ay other database. We may use this iformatio i two differet ways. First, the duplicate documet pairs i two differet results may be helpful for correlatig the scores of all the documets cotaied i two differet results. As i [3, 15], regressio aalysis is a useful tool for achievig this. O the other had, a documet appearig i more tha oe result ca be regarded as icreasig evidece of beig relevat to the give query. However, how to merge documets which occur oly i oe result with those which occur i more tha oe result is a ew issue. Some of the techiques [6, 10, 17] employed i data fusio ca be used here. However, there are some importat differeces which eed to be cosidered. I data fusio oe assumptio is: the more ofte a documet occurs i differet results, the the more likely it is a relevat documet to the give query. However, whe differet database servers retrieve from databases with some overlaps with each other, the above assumptio ca ot be applied directly. For example, suppose for a give query Q, database server A retrieves documets d 1 ad d 3, database server B retrieves documets d 2 ad d 3. Thus, d 3 occurs

3 i both results of A ad B, but d 1 ad d 2 oly occur oce. Does that suggest d 3 is more likely to be relevat to the query tha d 1 ad d 2? We are uable to reach such a coclusio uless we kow specifically that d 1 is i B s database ad d 2 is i A s database. However, we may ot have this iformatio ad i Sectio 3.2 we discuss this i more detail. 3.1 Regressio Aalysis For scores of the same documets i two differet results, we assume that a liear relatioship exists betwee them (x i ad y i ): y i = β 0 + β 1x i + ɛ i The least squares poit estimates b 1 ad b 0 of parameters β 1 ad β 0 i the simple liear regressio model are calculated by usig the formulae [1]: ad P i=1 b 1 = P xiyi (P xi)(p i=1 i=1 x2 i (P i=1 x i) 2 b 0 = P i=1 y i b 1( P i=1 )x i i=1 yi) If oly two systems are ivolved, the we eed oly do it oce. If ( > 2) database servers are ivolved, we assume that for each of them there are overlaps with at least oe other database. If this caot be satisfied for ay database, we ca use the method proposed by Si ad Calla [15], already metios i Sectio 2. We ca do 1 regressio aalyses, each with a pair of databases. The followig algorithm ca be used for selectig the pairs: 1. assig every retrieval system to a isolated ode; 2. check all pairs of odes which have o coectio (direct or idirect) ad have the most duplicate documets; make a coectio betwee these two odes; 3. repeat Step 2 util all odes are coected. For every selected pair of systems, we carry out a regressio aalysis for them. After 1 pairs, we ca get a ormalised score for every documet i every system without ay icosistecy. We assume that the more duplicate documets are ivolved, the more accurate the regressio aalysis is, that is why we select the pair with the most duplicate documets i the above algorithm. 3.2 Shadow Documet Methods After performig regressio aalysis for each pair of results, we ormalise those scores from differet databases ad make them comparable. Now the remaiig problem is how to deal with the duplicate documets for better performace. Oe solutio is to take the maximum score amog these duplicate documets as the score of that documet i the fused result, aother solutio is to take the miimal score, the third is to average them. The we ca merge these documets by descedig order o scores. These methods will be referred to as Max, Mi, ad Avg. Obviously these methods do ot use overlaps as icreasig evidece of relevace to the query. Next let us itroduce some more methods. Suppose we have two database servers A ad B. If documet d oly occurs i A s result but ot i B s result, there are two possibilities: (1) d is ot stored i B s database; (2) d is ot retrieved by B. I case 1, we guess that if d was i B s database, the B would assig d a score similar to s a (d). I case 2, d must have obtaied a very low score sice it does ot occur i the result. The possibility of case 2 is due to the extet of overlaps existig i A ad B. However, for ay particular documet, we caot judge which situatio is happeig. We itroduce a method which works like this. For a give query, if documet d occurs i both A ad B s results the we sum these two scores as d s total score; if d oly occurs i A s result with a score s a (d), we preted that there is a shadow documet of it i B, ad assig a score to the (shadow) documet d i B. We do the same for those documets which oly occur i B s result but ot i A s result. The we ca use the same method as above (sum) to merge all the results. More specifically, if documet d oly occurs i A s result, d s total score is calculated as s a (d)(2 k 1 ), where k 1 is a coefficiet (0 k 1 1). The situatio is similar for documets oly occurrig i B s result. To assig a desirable value to k 1 is importat. We leave this to be determied empirically by the experimets. Some modificatios ca be doe to the above method. Oe way is to let the coefficiet k 1 be related to the overlappig rate of two databases. We eed some statistical iformatio (e.g., size, overlappig rate) about these databases, ad methods such as the oe i [18] ca be used. However, to collect such iformatio usually takes a lot of effort. Here we assume that this iformatio is ot available. For a give query, we obtai differet results from differet databases. We calculate the umber of duplicate documets betwee each pair of them, the use this iformatio as the idicator of the overlappig rate of these two databases. Suppose is the umber of documets icluded i both results ad m is the umber of duplicate documets. A higher overlappig rate suggests that there are more duplicate documets i both databases. We ca rely o this to modify the above method. If documet d oly occurs i A, the d s total score is calculated as s a(d)(2 k 2 (1 m/)), where k 2 is a coefficiet (0 k 2 1). The same as k 1, we will carry out some experimets to empirically obtai a desirable value for k 2. I the followig these two methods will be referred to as Shadow Documet Method (SDM) 1 ad 2. Obviously, we ca easily exted these SDMs with more tha 2 database servers. 4. EXPERIMENTAL STUDY We caot fully report here the results of our extesive aalysis of the performace of the proposed methods. We will oly report the most iterestig results. 4.1 Experimetal Settigs Two groups of full-text databases were used for the experimets. Each group icluded 3 databases. All of them were subsets of the first two CDROMs of the TREC Collectio (TREC-12). The characteristics of these databases are preseted i Table 1. I group 1, WSJ idicates the Wall Street Joural collectio of articles ad the umbers idicate the years. Documets i group 1 are therefore quite homogeeous. I group 2, each database icludes the same

4 Group Databases Num. of docs Retrieval model WSJ(88-91) vector space 1 WSJ(87-89) okapi WSJ(91-92) laguage model AP(88-89) vector space 2 WSJ(87-89) okapi FR(88-89) laguage model Table 1: Databases used i the experimetatio Average precisio at 8 raks SDM 1 SDM documets from WSJ90 (6103 documets). I additio to those idicated, comig from Associated Press ews wires ad the Federal Register. Documets i this group are more heterogeeous. All groups have cosiderable overlap amog the databases, with group 1 havig by far the largest (over 50, 000 documets i commo). I both groups, a differet retrieval model was used for each database. The Lemur iformatio retrieval system, developed at Caregie Mello Uiversity, was used [12]. Lemur provides three optios as the retrieval model: vector space, okapi, ad laguage model. Users ca select oe of the models for their retrieval tasks. We demostrated the effectiveess of our algorithms for both heavily ad lightly overlappig situatios. I all cases, 100 queries (TREC topics ) were used for the experimets. 4.2 Results with Heavy Overlap Three databases, which are idicated i Table 1 as group 1, were used for the experimet. For each query, each iformatio retrieval system retrieves 1000 documets with scores. Differet mergig methods were experimeted with for these result sets. The experimetal results are show i Table 2, i which differet methods are compared with Roud-robi, ad the differece betwee them is show i paretheses. For the two SDMs, k 1 is set to be 0.05 ad k 2 is set to be 0.1, which are the optimum empirical values observed i our experimets. The effect of coefficiets o the SDMs will be discussed later. The experimetal results show that all methods are better tha Roud-robi. I particular, Ave, Max, ad Mi are very similar i performace, ad the two SDMs are very similar i performace as well. Because we use the duplicate documets for the regressio aalysis to ormalise the scores i differet results, this makes Ave, Max, ad Mi much similar. They outperform Roud-robi about 7%. SDMs are the best of all the methods. They outperform Roud-robi about 9%, ad they are slightly better tha Ave, Max, ad Mi (e.g., they are better tha Max by about 2%). Differet coefficiets were tested for the two SDMs. The experimetal results are show i Figure 1. For SDM1, the maximum value occurs whe k 1 is betwee 0.05 ad 0.1, the it goes dow quite quickly. For SDM2, the curve is very flat from the begiig to the ed. Because i may cases, the overlappig rate for a query result is aroud 300/1000, a value of 0.8 for k 2 is roughly equivalet to a value of 0.24 for k 1. However, it seems that i both cases, a very small coefficiet may improve the performace as effectively as a larger oe does. 4.3 Results with Light Overlap I this experimet, oly light overlaps exist over differet The value of coefficiet Figure 1: Performace of two SDMs with differet coefficiets ad with heavy overlaps amog databases. Average precisio at 8 raks The value of coefficiet Figure 2: Performace of two SDMs with differet coefficiets ad with light overlaps amog databases. databases. Three databases which are idicated as group 2 i Table 1 were used. Very ofte, there are oly about duplicate documets i differet results. I such a case, all 5 mergig methods behave very similarly, as expected. The reaso for that is the umber of duplicate documets is very small, so that it may ot be possible for the two SDMs to outperform Mi, Max, or Avg. The experimetal results are show i Tables 3. The results show that all mergig methods are cosiderably better (over 10% improvemet) tha Roud-robi. Figure 2 shows the experimetal results for two SDMs whe differet coefficiets are assiged. The SDM1 curve decreases whe the value of coefficiet icreases, while SDM2 is almost totally flat all the way from the begiig to the ed. I the above we have preseted experimetal results of two SDMs i two typical situatios: either heavy overlap or light overlap exists i differet results. SDM1 gets its maximum performace whe k 1 is betwee 0.05 ad 0.1 for heavy overlaps, while it gets its maximum performace whe k 1 takes a value of 0.01 or eve less for light overlaps. For SDM2, a value of 0.1 seems to fit well for both situatios. It suggests that for SDM1, it is better to select a differet value of k 1 for differet overlap rates, while this has bee cosidered i SDM2, a fixed value of 0.1 for k 2 is very likely to be suitable for all situatios. SDM 1 SDM 2

5 Docs Rak Roud-Robi Avg Max Mi SDM1 (k=0.05) SDM2 (k=0.1) (+11.0%) (+10.1%) (+11.0%) (+14.0%) (+14.5%) (+7.4%) (+7.7%) (+8.4%) (+10.0%) (+9.8%) (+9.6%) (+9.0%) (+9.7%) (+9.8%) (+9.7%) (+7.3%) (+7.2%) (+7.1%) (+10.3%) (+9.7%) (+8.1%) (+6.9%) (+7.9%) (+9.0%) (+9.1%) (+7.7%) (+6.3%) (+7.3%) (+7.9%) (+8.1%) (+6.0%) (+6.1%) (+5.9%) (+7.3%) (+7.6%) (+4.4%) (+4.3%) (+3.8%) (+6.0%) (+6.0%) Table 2: Precisio compariso of differet methods with heavy overlaps amog databases. Roud-robi serves as baselie. Docs Rak Roud-Robi Avg Max Mi SDM1 (k=0.05) SDM2 (k=0.1) (+11.5%) (+10.2%) (+10.66%) (+11.1%) (+9.7%) (+14.3%) (+14.3%) (+14.6%) (+14.1%) (+14.8%) (+17.9%) (+17.5%) (+17.5%) (+17.5%) (+17.5%) (+14.8%) (+14.7%) (+15.0%) (+14.7%) (+16.1%) (+16.7%) (+16.4%) (+16.4%) (+16.1%) (+16.4%) (+16.3%) (+15.9%) (+13.4%) (+13.4%) (+13.6%) (+16.2%) (+16.0%) (+16.2%) (+16.2%) (+14.1%) (+16.1%) (+15.8%) (+15.8%) (+16.0%) (+14.8%) Table 3: Precisio compariso of differet methods with light overlaps amog databases. Roud-robi serves as baselie. 4.4 Compariso with CombMNZ ad Comb- Sum Whe differet databases have cosiderable overlaps, is it a good solutio to use mergig algorithms such as CombMNZ ad CombSum? It is a iterestig questio, therefore, we coducted a experimet to evaluate the performaces of CombMNZ ad CombSum i such a situatio. As before, we use the three databases i group 1. Here we use two differet ormalisatio methods: liear regressio ad liear [0,1] ormalisatio [10]. We have described liear regressio i Sectio 3.1. Liear [0,1] ormalisatio is the usual method used for CombMNZ ad CombSum before. For a list of documets from a database, we use the followig formula to calculate the score for ay documet i the list: orm score = (score mi score)/(max score mi score) where max score deotes the highest score i the documet list, mi score deotes the lowest score i the documet list, score deotes the score of a give documet, ad orm score is the ormalised score of the give documet. Thus, the documet with the highest score is assiged a score of 1, the documet with the lowest score is assiged a score of 0, ay other documet is assiged a score betwee 0 ad 1. Experimetal results of CombMNZ ad CombSum are show i Table 4, For all of them, SDM1 with k 1 = 0.05 serves as the baselie. The results show that both CombMNZ ad CombSum are cosiderably worse tha SDM1. Usig either liear [0,1] ormalisatio or liear regressio ormalisatio, both CombMNZ ad CombSum are about 10% to 20% worse tha SDM Ehaced Shadow Documet Methods If we have more iformatio about the databases with regards to documet presece, ca we improve the performace of SDM1 ad SDM2? Suppose for ay give documet, we could kow if it is stored i a give database. Such a situatio may happe for some corporate etworks where documets could be distributed i differet databases with very specific domais. I this case we could idetify the source of ay documet by checkig its idetifier, ad at the same time, we could kow if a documets have bee stored i more tha oe site. Let us discuss this with two databases A ad B. For a give query Q, if a documet d oly occurs i the result from A but ot B, oe of two possible thigs must have happeed: (a) d is ot stored i B; (b) d is stored i B, but is ot retrieved. If the DIR system is able to distiguish which situatio has happeed, the we ca modify the two SDM methods i the followig way: if d is ot stored i B, the we just treat it as before; if d is stored i B but ot retrieved, the the documet d oly gets its score (ormalised) from A, but ot ay extra score from B. We refer to this modified SDMs methods as Ehaced SDM (ESDM), sice they make use of additioal iformatio. We carried out a experimetatio o ESDM1 ad ESDM2, with databases i group 1, comparig them with SDM1 ad SDM2. The results showed that o the top 5 ad 10 documets, both ESDM1 ad ESDM2 are better tha their couterparts. However, this situatio chages for higher documet raks. After 15, ESDM1 ad ESDM2 perform at the same level as SDM1 ad SDM2. 5. CONCLUSIONS I this paper we have preseted several methods for results mergig whe overlaps exist betwee differet results i a distributed iformatio retrieval system. These methods use liear regressio aalysis ad shadow documet mergig methods (SDM) to improve the performace of result mergig. Experimets have bee coducted i two typical

6 Docs Rak SDM1 (k=0.05) CombMNZ (Reg.) CombSum (Reg.) CombMNZ (Li.[0,1]) CombSum (Li.[0,1]) (-10.3%) (-10.3%) (-11.8%) (-11.8%) (-13.8%) (-13.8%) (-14.5%) (-13.6%) (-17.0%) (-15.9%) (-14.3%) (-12.1%) (-17.9%) (-16.2%) (-18.0%) (-15.0%) (-17.9%) (-15.8%) (-17.2%) (-13.4%) (-19.0%) (-16.3%) (-17.8%) (-13.1%) (-23.1%) (-20.0%) (-20.4%) (-14.6%) (-26.6%) (-21.8%) (-22.0%) (-11.2%) Table 4: Precisios of CombMNZ ad CombSum with heavy overlaps i databases. baselie. SDM1 serves as the situatios: heavy overlappig ad light overlappig. I the former situatio, the two SDMs perform slightly better tha Mi, Max, ad Avg. With heavy overlappig, we compared the performaces of CombMNZ ad CombSum ad SDM. The experimetal result shows that both CombMNZ ad CombSum are 10% to 20% worse tha SDM. Sice overlap happes frequetly i geeral web search egies, the methods proposed i this paper are very desirable for mergig web search results. Compared with the methods preseted i Sectio 2 our methods ca achieve very good results, but without the effort of dowloadig web pages ad local retrieval process. 6. REFERENCES [1] B. L. Bowerma ad R. T. O Coell. Liear Statistical Models: A Applied Approach. PWS-KENT Publishig Compay, [2] J. K. Calla, Z. Lu, ad W. Croft. Searchig distributed collectios with iferece etworks. I Proceedigs of the 18th Aual Iteratioal ACM SIGIR Coferece, pages 21 28, Seattle, USA, July [3] A. L. Calve ad J. Savoy. Database mergig strategy based o logistic regressio. Iformatio Processig ad Maagemet, 36(3): , [4] N. Craswell, P. Bailer, ad D. Hawkig. Server selectio o the world wide web. I Proceedigs of the Fifth ACM Coferece o Digital Libraries, pages 37 46, Sa Atoio, CA, USA, Jue [5] D. Dreiliger ad A. Howe. Experieces with selectiog search egies usig metasearch. ACM Trasactio o Iformatio Systems, 15(3): , [6] E. A. Fox ad J. Shaw. Combiatio of multiple searchs. I The Secod Text REtrieval Coferece (TREC-2), pages , Gaitherburg, MD, USA, August [7] S. Gauch, G. Wag, ad M. Gomez. Profusio: Itelliget fusio from multiple, distributed search egies. Joural of Uiversal Computer Sciece, 2(9): , September [8] A. Howe ad D. Dreiliger. Savvysearch: A meta-search egie that lears which search egies to query. AI Magazie, 18(2):19 25, [9] S. Lawrece ad C. L. Giles. Cotext ad page aalysis for improved web search. IEEE Iteret Computig, 2(4):38 46, [10] J. H. Lee. Aalysis of multiple evidece combiatio. I Proceedigs of the 20th Aual Iteratioal ACM SIGIR Coferece, pages , Philadelphia, Pesylvaia, USA, July [11] W. Meg, C. Yu, ad K. Liu. Buildig efficiet ad effective metasearch egies. ACM Computig Surveys, 34(1):48 89, March [12] P. Ogilvie ad J. Calla. Experimets usig the lemur toolkit. I Proceedigs of the 2001 Text REtrieval Coferece, pages , Gaithersburg, Marylad, USA, November [13] B. Ozteki, G. Karypis, ad V. Kumar. Expert agreemet ad cotet baed rerakig i a meta search eviromet usig mearf. I Proceedigs of the 11th Iteratioal World Wide Web Coferece, pages , Hoolulu, Hawaii, USA, May [14] E. Selberg ad O. Etzioi. The metacrawler architecture for resource aggregio o the web. IEEE Expert, 12(1):8 14, [15] L. Si ad J. Calla. Usig sampled data ad regressio to merge search egig results. I Proceedigs of the 25th Aual Iteratioal ACM SIGIR Coferece, pages 19 26, Tempere, Filad, August [16] E. M. Voorhees, N. K. Gupta, ad B. Johso-Laird. Learig collectio fusio strategies. I Proceedigs of the 18th Aual Iteratioal ACM SIGIR Coferece, pages , Seattle, Washigto, USA, July [17] C. C. Vort ad G. A. Cottrell. A fusio via a liear combiatio of scores. Iformatio Retrieval, 1(3): , [18] S. Wu, F. Gibb, ad F. Crestai. Experimets with documet archive size detectio. I Proceedigs of the 25th Europea Coferece o Iformatio Retrieval Research, pages , Pisa, Italy, April [19] C. Yu, G. Meg, K. Liu, W. Wu, ad N. Rishe. Efficiet ad effective metasearch for a large umber of text databases. I Proceedigs of the Eighth ACM Iteratioal Coferece o Iformatio ad Kowledge Maagemet, pages , Kasas City, USA, November 2001.

3D Model Retrieval Method Based on Sample Prediction

3D Model Retrieval Method Based on Sample Prediction 20 Iteratioal Coferece o Computer Commuicatio ad Maagemet Proc.of CSIT vol.5 (20) (20) IACSIT Press, Sigapore 3D Model Retrieval Method Based o Sample Predictio Qigche Zhag, Ya Tag* School of Computer

More information

Searching a Russian Document Collection Using English, Chinese and Japanese Queries

Searching a Russian Document Collection Using English, Chinese and Japanese Queries Searchig a Russia Documet Collectio Usig Eglish, Chiese ad Japaese Queries Fredric C. Gey (gey@ucdata.berkeley.edu) UC Data Archive & Techical Assistace Uiversity of Califoria, Berkeley, CA 94720 USA ABSTRACT.

More information

Ones Assignment Method for Solving Traveling Salesman Problem

Ones Assignment Method for Solving Traveling Salesman Problem Joural of mathematics ad computer sciece 0 (0), 58-65 Oes Assigmet Method for Solvig Travelig Salesma Problem Hadi Basirzadeh Departmet of Mathematics, Shahid Chamra Uiversity, Ahvaz, Ira Article history:

More information

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design College of Computer ad Iformatio Scieces Departmet of Computer Sciece CSC 220: Computer Orgaizatio Uit 11 Basic Computer Orgaizatio ad Desig 1 For the rest of the semester, we ll focus o computer architecture:

More information

Improving Template Based Spike Detection

Improving Template Based Spike Detection Improvig Template Based Spike Detectio Kirk Smith, Member - IEEE Portlad State Uiversity petra@ee.pdx.edu Abstract Template matchig algorithms like SSE, Covolutio ad Maximum Likelihood are well kow for

More information

EFFECT OF QUERY FORMATION ON WEB SEARCH ENGINE RESULTS

EFFECT OF QUERY FORMATION ON WEB SEARCH ENGINE RESULTS Iteratioal Joural o Natural Laguage Computig (IJNLC) Vol. 2, No., February 203 EFFECT OF QUERY FORMATION ON WEB SEARCH ENGINE RESULTS Raj Kishor Bisht ad Ila Pat Bisht 2 Departmet of Computer Sciece &

More information

15-859E: Advanced Algorithms CMU, Spring 2015 Lecture #2: Randomized MST and MST Verification January 14, 2015

15-859E: Advanced Algorithms CMU, Spring 2015 Lecture #2: Randomized MST and MST Verification January 14, 2015 15-859E: Advaced Algorithms CMU, Sprig 2015 Lecture #2: Radomized MST ad MST Verificatio Jauary 14, 2015 Lecturer: Aupam Gupta Scribe: Yu Zhao 1 Prelimiaries I this lecture we are talkig about two cotets:

More information

Evaluation scheme for Tracking in AMI

Evaluation scheme for Tracking in AMI A M I C o m m u i c a t i o A U G M E N T E D M U L T I - P A R T Y I N T E R A C T I O N http://www.amiproject.org/ Evaluatio scheme for Trackig i AMI S. Schreiber a D. Gatica-Perez b AMI WP4 Trackig:

More information

Pruning and Summarizing the Discovered Time Series Association Rules from Mechanical Sensor Data Qing YANG1,a,*, Shao-Yu WANG1,b, Ting-Ting ZHANG2,c

Pruning and Summarizing the Discovered Time Series Association Rules from Mechanical Sensor Data Qing YANG1,a,*, Shao-Yu WANG1,b, Ting-Ting ZHANG2,c Advaces i Egieerig Research (AER), volume 131 3rd Aual Iteratioal Coferece o Electroics, Electrical Egieerig ad Iformatio Sciece (EEEIS 2017) Pruig ad Summarizig the Discovered Time Series Associatio Rules

More information

Python Programming: An Introduction to Computer Science

Python Programming: An Introduction to Computer Science Pytho Programmig: A Itroductio to Computer Sciece Chapter 1 Computers ad Programs 1 Objectives To uderstad the respective roles of hardware ad software i a computig system. To lear what computer scietists

More information

Algorithms for Disk Covering Problems with the Most Points

Algorithms for Disk Covering Problems with the Most Points Algorithms for Disk Coverig Problems with the Most Poits Bi Xiao Departmet of Computig Hog Kog Polytechic Uiversity Hug Hom, Kowloo, Hog Kog csbxiao@comp.polyu.edu.hk Qigfeg Zhuge, Yi He, Zili Shao, Edwi

More information

How do we evaluate algorithms?

How do we evaluate algorithms? F2 Readig referece: chapter 2 + slides Algorithm complexity Big O ad big Ω To calculate ruig time Aalysis of recursive Algorithms Next time: Litterature: slides mostly The first Algorithm desig methods:

More information

The isoperimetric problem on the hypercube

The isoperimetric problem on the hypercube The isoperimetric problem o the hypercube Prepared by: Steve Butler November 2, 2005 1 The isoperimetric problem We will cosider the -dimesioal hypercube Q Recall that the hypercube Q is a graph whose

More information

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming Lecture Notes 6 Itroductio to algorithm aalysis CSS 501 Data Structures ad Object-Orieted Programmig Readig for this lecture: Carrao, Chapter 10 To be covered i this lecture: Itroductio to algorithm aalysis

More information

Polynomial Functions and Models. Learning Objectives. Polynomials. P (x) = a n x n + a n 1 x n a 1 x + a 0, a n 0

Polynomial Functions and Models. Learning Objectives. Polynomials. P (x) = a n x n + a n 1 x n a 1 x + a 0, a n 0 Polyomial Fuctios ad Models 1 Learig Objectives 1. Idetify polyomial fuctios ad their degree 2. Graph polyomial fuctios usig trasformatios 3. Idetify the real zeros of a polyomial fuctio ad their multiplicity

More information

Big-O Analysis. Asymptotics

Big-O Analysis. Asymptotics Big-O Aalysis 1 Defiitio: Suppose that f() ad g() are oegative fuctios of. The we say that f() is O(g()) provided that there are costats C > 0 ad N > 0 such that for all > N, f() Cg(). Big-O expresses

More information

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem A Improved Shuffled Frog-Leapig Algorithm for Kapsack Problem Zhoufag Li, Ya Zhou, ad Peg Cheg School of Iformatio Sciece ad Egieerig Hea Uiversity of Techology ZhegZhou, Chia lzhf1978@126.com Abstract.

More information

A Note on Least-norm Solution of Global WireWarping

A Note on Least-norm Solution of Global WireWarping A Note o Least-orm Solutio of Global WireWarpig Charlie C. L. Wag Departmet of Mechaical ad Automatio Egieerig The Chiese Uiversity of Hog Kog Shati, N.T., Hog Kog E-mail: cwag@mae.cuhk.edu.hk Abstract

More information

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis Itro to Algorithm Aalysis Aalysis Metrics Slides. Table of Cotets. Aalysis Metrics 3. Exact Aalysis Rules 4. Simple Summatio 5. Summatio Formulas 6. Order of Magitude 7. Big-O otatio 8. Big-O Theorems

More information

Identification of the Swiss Z24 Highway Bridge by Frequency Domain Decomposition Brincker, Rune; Andersen, P.

Identification of the Swiss Z24 Highway Bridge by Frequency Domain Decomposition Brincker, Rune; Andersen, P. Aalborg Uiversitet Idetificatio of the Swiss Z24 Highway Bridge by Frequecy Domai Decompositio Bricker, Rue; Aderse, P. Published i: Proceedigs of IMAC 2 Publicatio date: 22 Documet Versio Publisher's

More information

performance to the performance they can experience when they use the services from a xed location.

performance to the performance they can experience when they use the services from a xed location. I the Proceedigs of The First Aual Iteratioal Coferece o Mobile Computig ad Networkig (MobiCom 9) November -, 99, Berkeley, Califoria USA Performace Compariso of Mobile Support Strategies Rieko Kadobayashi

More information

IMP: Superposer Integrated Morphometrics Package Superposition Tool

IMP: Superposer Integrated Morphometrics Package Superposition Tool IMP: Superposer Itegrated Morphometrics Package Superpositio Tool Programmig by: David Lieber ( 03) Caisius College 200 Mai St. Buffalo, NY 4208 Cocept by: H. David Sheets, Dept. of Physics, Caisius College

More information

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON Roberto Lopez ad Eugeio Oñate Iteratioal Ceter for Numerical Methods i Egieerig (CIMNE) Edificio C1, Gra Capitá s/, 08034 Barceloa, Spai ABSTRACT I this work

More information

One advantage that SONAR has over any other music-sequencing product I ve worked

One advantage that SONAR has over any other music-sequencing product I ve worked *gajedra* D:/Thomso_Learig_Projects/Garrigus_163132/z_productio/z_3B2_3D_files/Garrigus_163132_ch17.3d, 14/11/08/16:26:39, 16:26, page: 647 17 CAL 101 Oe advatage that SONAR has over ay other music-sequecig

More information

Accuracy Improvement in Camera Calibration

Accuracy Improvement in Camera Calibration Accuracy Improvemet i Camera Calibratio FaJie L Qi Zag ad Reihard Klette CITR, Computer Sciece Departmet The Uiversity of Aucklad Tamaki Campus, Aucklad, New Zealad fli006, qza001@ec.aucklad.ac.z r.klette@aucklad.ac.z

More information

Module 8-7: Pascal s Triangle and the Binomial Theorem

Module 8-7: Pascal s Triangle and the Binomial Theorem Module 8-7: Pascal s Triagle ad the Biomial Theorem Gregory V. Bard April 5, 017 A Note about Notatio Just to recall, all of the followig mea the same thig: ( 7 7C 4 C4 7 7C4 5 4 ad they are (all proouced

More information

Bayesian approach to reliability modelling for a probability of failure on demand parameter

Bayesian approach to reliability modelling for a probability of failure on demand parameter Bayesia approach to reliability modellig for a probability of failure o demad parameter BÖRCSÖK J., SCHAEFER S. Departmet of Computer Architecture ad System Programmig Uiversity Kassel, Wilhelmshöher Allee

More information

n n B. How many subsets of C are there of cardinality n. We are selecting elements for such a

n n B. How many subsets of C are there of cardinality n. We are selecting elements for such a 4. [10] Usig a combiatorial argumet, prove that for 1: = 0 = Let A ad B be disjoit sets of cardiality each ad C = A B. How may subsets of C are there of cardiality. We are selectig elemets for such a subset

More information

On-line cursive letter recognition using sequences of local minima/maxima. Robert Powalka

On-line cursive letter recognition using sequences of local minima/maxima. Robert Powalka O-lie cursive letter recogitio usig sequeces of local miima/maxima Summary Robert Powalka 19 th August 1993 This report presets the desig ad implemetatio of a o-lie cursive letter recogizer usig sequeces

More information

FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS

FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS Prosejit Bose Evagelos Kraakis Pat Mori Yihui Tag School of Computer Sciece, Carleto Uiversity {jit,kraakis,mori,y

More information

Numerical Methods Lecture 6 - Curve Fitting Techniques

Numerical Methods Lecture 6 - Curve Fitting Techniques Numerical Methods Lecture 6 - Curve Fittig Techiques Topics motivatio iterpolatio liear regressio higher order polyomial form expoetial form Curve fittig - motivatio For root fidig, we used a give fuctio

More information

Analysis of Documents Clustering Using Sampled Agglomerative Technique

Analysis of Documents Clustering Using Sampled Agglomerative Technique Aalysis of Documets Clusterig Usig Sampled Agglomerative Techique Omar H. Karam, Ahmed M. Hamad, ad Sheri M. Moussa Abstract I this paper a clusterig algorithm for documets is proposed that adapts a samplig-based

More information

Lecturers: Sanjam Garg and Prasad Raghavendra Feb 21, Midterm 1 Solutions

Lecturers: Sanjam Garg and Prasad Raghavendra Feb 21, Midterm 1 Solutions U.C. Berkeley CS170 : Algorithms Midterm 1 Solutios Lecturers: Sajam Garg ad Prasad Raghavedra Feb 1, 017 Midterm 1 Solutios 1. (4 poits) For the directed graph below, fid all the strogly coected compoets

More information

CIS 121 Data Structures and Algorithms with Java Fall Big-Oh Notation Tuesday, September 5 (Make-up Friday, September 8)

CIS 121 Data Structures and Algorithms with Java Fall Big-Oh Notation Tuesday, September 5 (Make-up Friday, September 8) CIS 11 Data Structures ad Algorithms with Java Fall 017 Big-Oh Notatio Tuesday, September 5 (Make-up Friday, September 8) Learig Goals Review Big-Oh ad lear big/small omega/theta otatios Practice solvig

More information

Sectio 4, a prototype project of settig field weight with AHP method is developed ad the experimetal results are aalyzed. Fially, we coclude our work

Sectio 4, a prototype project of settig field weight with AHP method is developed ad the experimetal results are aalyzed. Fially, we coclude our work 200 2d Iteratioal Coferece o Iformatio ad Multimedia Techology (ICIMT 200) IPCSIT vol. 42 (202) (202) IACSIT Press, Sigapore DOI: 0.7763/IPCSIT.202.V42.0 Idex Weight Decisio Based o AHP for Iformatio Retrieval

More information

. Written in factored form it is easy to see that the roots are 2, 2, i,

. Written in factored form it is easy to see that the roots are 2, 2, i, CMPS A Itroductio to Programmig Programmig Assigmet 4 I this assigmet you will write a java program that determies the real roots of a polyomial that lie withi a specified rage. Recall that the roots (or

More information

Basic allocator mechanisms The course that gives CMU its Zip! Memory Management II: Dynamic Storage Allocation Mar 6, 2000.

Basic allocator mechanisms The course that gives CMU its Zip! Memory Management II: Dynamic Storage Allocation Mar 6, 2000. 5-23 The course that gives CM its Zip Memory Maagemet II: Dyamic Storage Allocatio Mar 6, 2000 Topics Segregated lists Buddy system Garbage collectio Mark ad Sweep Copyig eferece coutig Basic allocator

More information

Computer Science Foundation Exam. August 12, Computer Science. Section 1A. No Calculators! KEY. Solutions and Grading Criteria.

Computer Science Foundation Exam. August 12, Computer Science. Section 1A. No Calculators! KEY. Solutions and Grading Criteria. Computer Sciece Foudatio Exam August, 005 Computer Sciece Sectio A No Calculators! Name: SSN: KEY Solutios ad Gradig Criteria Score: 50 I this sectio of the exam, there are four (4) problems. You must

More information

6.854J / J Advanced Algorithms Fall 2008

6.854J / J Advanced Algorithms Fall 2008 MIT OpeCourseWare http://ocw.mit.edu 6.854J / 18.415J Advaced Algorithms Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 18.415/6.854 Advaced Algorithms

More information

Lecture 6. Lecturer: Ronitt Rubinfeld Scribes: Chen Ziv, Eliav Buchnik, Ophir Arie, Jonathan Gradstein

Lecture 6. Lecturer: Ronitt Rubinfeld Scribes: Chen Ziv, Eliav Buchnik, Ophir Arie, Jonathan Gradstein 068.670 Subliear Time Algorithms November, 0 Lecture 6 Lecturer: Roitt Rubifeld Scribes: Che Ziv, Eliav Buchik, Ophir Arie, Joatha Gradstei Lesso overview. Usig the oracle reductio framework for approximatig

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 18 Strategies for Query Processig Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio DBMS techiques to process a query Scaer idetifies

More information

Ontology-based Decision Support System with Analytic Hierarchy Process for Tour Package Selection

Ontology-based Decision Support System with Analytic Hierarchy Process for Tour Package Selection 2017 Asia-Pacific Egieerig ad Techology Coferece (APETC 2017) ISBN: 978-1-60595-443-1 Otology-based Decisio Support System with Aalytic Hierarchy Process for Tour Pacage Selectio Tie-We Sug, Chia-Jug Lee,

More information

Administrative UNSUPERVISED LEARNING. Unsupervised learning. Supervised learning 11/25/13. Final project. No office hours today

Administrative UNSUPERVISED LEARNING. Unsupervised learning. Supervised learning 11/25/13. Final project. No office hours today Admiistrative Fial project No office hours today UNSUPERVISED LEARNING David Kauchak CS 451 Fall 2013 Supervised learig Usupervised learig label label 1 label 3 model/ predictor label 4 label 5 Supervised

More information

CSC165H1 Worksheet: Tutorial 8 Algorithm analysis (SOLUTIONS)

CSC165H1 Worksheet: Tutorial 8 Algorithm analysis (SOLUTIONS) CSC165H1, Witer 018 Learig Objectives By the ed of this worksheet, you will: Aalyse the ruig time of fuctios cotaiig ested loops. 1. Nested loop variatios. Each of the followig fuctios takes as iput a

More information

Python Programming: An Introduction to Computer Science

Python Programming: An Introduction to Computer Science Pytho Programmig: A Itroductio to Computer Sciece Chapter 6 Defiig Fuctios Pytho Programmig, 2/e 1 Objectives To uderstad why programmers divide programs up ito sets of cooperatig fuctios. To be able to

More information

Data diverse software fault tolerance techniques

Data diverse software fault tolerance techniques Data diverse software fault tolerace techiques Complemets desig diversity by compesatig for desig diversity s s limitatios Ivolves obtaiig a related set of poits i the program data space, executig the

More information

Combination Labelings Of Graphs

Combination Labelings Of Graphs Applied Mathematics E-Notes, (0), - c ISSN 0-0 Available free at mirror sites of http://wwwmaththuedutw/ame/ Combiatio Labeligs Of Graphs Pak Chig Li y Received February 0 Abstract Suppose G = (V; E) is

More information

The Closest Line to a Data Set in the Plane. David Gurney Southeastern Louisiana University Hammond, Louisiana

The Closest Line to a Data Set in the Plane. David Gurney Southeastern Louisiana University Hammond, Louisiana The Closest Lie to a Data Set i the Plae David Gurey Southeaster Louisiaa Uiversity Hammod, Louisiaa ABSTRACT This paper looks at three differet measures of distace betwee a lie ad a data set i the plae:

More information

CS 683: Advanced Design and Analysis of Algorithms

CS 683: Advanced Design and Analysis of Algorithms CS 683: Advaced Desig ad Aalysis of Algorithms Lecture 6, February 1, 2008 Lecturer: Joh Hopcroft Scribes: Shaomei Wu, Etha Feldma February 7, 2008 1 Threshold for k CNF Satisfiability I the previous lecture,

More information

BAAN IVc/BaanERP. Conversion Guide Oracle7 to Oracle8

BAAN IVc/BaanERP. Conversion Guide Oracle7 to Oracle8 BAAN IVc/BaaERP A publicatio of: Baa Developmet B.V. P.O.Box 143 3770 AC Bareveld The Netherlads Prited i the Netherlads Baa Developmet B.V. 1999. All rights reserved. The iformatio i this documet is subject

More information

New HSL Distance Based Colour Clustering Algorithm

New HSL Distance Based Colour Clustering Algorithm The 4th Midwest Artificial Itelligece ad Cogitive Scieces Coferece (MAICS 03 pp 85-9 New Albay Idiaa USA April 3-4 03 New HSL Distace Based Colour Clusterig Algorithm Vasile Patrascu Departemet of Iformatics

More information

MOTIF XF Extension Owner s Manual

MOTIF XF Extension Owner s Manual MOTIF XF Extesio Ower s Maual Table of Cotets About MOTIF XF Extesio...2 What Extesio ca do...2 Auto settig of Audio Driver... 2 Auto settigs of Remote Device... 2 Project templates with Iput/ Output Bus

More information

Lecture 1: Introduction and Strassen s Algorithm

Lecture 1: Introduction and Strassen s Algorithm 5-750: Graduate Algorithms Jauary 7, 08 Lecture : Itroductio ad Strasse s Algorithm Lecturer: Gary Miller Scribe: Robert Parker Itroductio Machie models I this class, we will primarily use the Radom Access

More information

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved. Chapter 1 Itroductio to Computers ad C++ Programmig Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Overview 1.1 Computer Systems 1.2 Programmig ad Problem Solvig 1.3 Itroductio to C++ 1.4 Testig

More information

A Boolean Query Processing with a Result Cache in Mediator Systems

A Boolean Query Processing with a Result Cache in Mediator Systems A Boolea Query Processig with a Result Cache i Mediator Systems Jae-heo Cheog ad Sag-goo Lee * Departmet of Computer Sciece Seoul Natioal Uiversity Sa 56-1 Shillim-dog Kwaak-gu, Seoul Korea {cjh, sglee}cygus.su.ac.kr

More information

Lecture 28: Data Link Layer

Lecture 28: Data Link Layer Automatic Repeat Request (ARQ) 2. Go ack N ARQ Although the Stop ad Wait ARQ is very simple, you ca easily show that it has very the low efficiecy. The low efficiecy comes from the fact that the trasmittig

More information

Appendix D. Controller Implementation

Appendix D. Controller Implementation COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Appedix D Cotroller Implemetatio Cotroller Implemetatios Combiatioal logic (sigle-cycle); Fiite state machie (multi-cycle, pipelied);

More information

What are we going to learn? CSC Data Structures Analysis of Algorithms. Overview. Algorithm, and Inputs

What are we going to learn? CSC Data Structures Analysis of Algorithms. Overview. Algorithm, and Inputs What are we goig to lear? CSC316-003 Data Structures Aalysis of Algorithms Computer Sciece North Carolia State Uiversity Need to say that some algorithms are better tha others Criteria for evaluatio Structure

More information

1&1 Next Level Hosting

1&1 Next Level Hosting 1&1 Next Level Hostig Performace Level: Performace that grows with your requiremets Copyright 1&1 Iteret SE 2017 1ad1.com 2 1&1 NEXT LEVEL HOSTING 3 Fast page loadig ad short respose times play importat

More information

CIS 121 Data Structures and Algorithms with Java Spring Stacks and Queues Monday, February 12 / Tuesday, February 13

CIS 121 Data Structures and Algorithms with Java Spring Stacks and Queues Monday, February 12 / Tuesday, February 13 CIS Data Structures ad Algorithms with Java Sprig 08 Stacks ad Queues Moday, February / Tuesday, February Learig Goals Durig this lab, you will: Review stacks ad queues. Lear amortized ruig time aalysis

More information

Text Summarization using Neural Network Theory

Text Summarization using Neural Network Theory Iteratioal Joural of Computer Systems (ISSN: 2394-065), Volume 03 Issue 07, July, 206 Available at http://www.ijcsolie.com/ Simra Kaur Jolly, Wg Cdr Ail Chopra 2 Departmet of CSE, Ligayas Uiversity, Faridabad

More information

An Efficient Algorithm for Graph Bisection of Triangularizations

An Efficient Algorithm for Graph Bisection of Triangularizations A Efficiet Algorithm for Graph Bisectio of Triagularizatios Gerold Jäger Departmet of Computer Sciece Washigto Uiversity Campus Box 1045 Oe Brookigs Drive St. Louis, Missouri 63130-4899, USA jaegerg@cse.wustl.edu

More information

Which movie we can suggest to Anne?

Which movie we can suggest to Anne? ECOLE CENTRALE SUPELEC MASTER DSBI DECISION MODELING TUTORIAL COLLABORATIVE FILTERING AS A MODEL OF GROUP DECISION-MAKING You kow that the low-tech way to get recommedatios for products, movies, or etertaiig

More information

The Magma Database file formats

The Magma Database file formats The Magma Database file formats Adrew Gaylard, Bret Pikey, ad Mart-Mari Breedt Johaesburg, South Africa 15th May 2006 1 Summary Magma is a ope-source object database created by Chris Muller, of Kasas City,

More information

ANN WHICH COVERS MLP AND RBF

ANN WHICH COVERS MLP AND RBF ANN WHICH COVERS MLP AND RBF Josef Boští, Jaromír Kual Faculty of Nuclear Scieces ad Physical Egieerig, CTU i Prague Departmet of Software Egieerig Abstract Two basic types of artificial eural etwors Multi

More information

Multi-Threading. Hyper-, Multi-, and Simultaneous Thread Execution

Multi-Threading. Hyper-, Multi-, and Simultaneous Thread Execution Multi-Threadig Hyper-, Multi-, ad Simultaeous Thread Executio 1 Performace To Date Icreasig processor performace Pipeliig. Brach predictio. Super-scalar executio. Out-of-order executio. Caches. Hyper-Threadig

More information

Lecture 5. Counting Sort / Radix Sort

Lecture 5. Counting Sort / Radix Sort Lecture 5. Coutig Sort / Radix Sort T. H. Corme, C. E. Leiserso ad R. L. Rivest Itroductio to Algorithms, 3rd Editio, MIT Press, 2009 Sugkyukwa Uiversity Hyuseug Choo choo@skku.edu Copyright 2000-2018

More information

R(n) ^ 0.4. RBR.RBR CORI.RBR Ideal(0).RBR

R(n) ^ 0.4. RBR.RBR CORI.RBR Ideal(0).RBR Comparig the Performace of Database Selectio Algorithms James C. Frech 1, Alliso L. Powell 1, Jamie Calla 2, Charles L. Viles 3 Travis Emmitt 1, Kevi J. Prey 1,Yu Mou 2 1 Dept. of Computer Sciece Uiversity

More information

Chapter 3 Classification of FFT Processor Algorithms

Chapter 3 Classification of FFT Processor Algorithms Chapter Classificatio of FFT Processor Algorithms The computatioal complexity of the Discrete Fourier trasform (DFT) is very high. It requires () 2 complex multiplicatios ad () complex additios [5]. As

More information

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method A ew Morphological 3D Shape Decompositio: Grayscale Iterframe Iterpolatio Method D.. Vizireau Politehica Uiversity Bucharest, Romaia ae@comm.pub.ro R. M. Udrea Politehica Uiversity Bucharest, Romaia mihea@comm.pub.ro

More information

1.2 Binomial Coefficients and Subsets

1.2 Binomial Coefficients and Subsets 1.2. BINOMIAL COEFFICIENTS AND SUBSETS 13 1.2 Biomial Coefficiets ad Subsets 1.2-1 The loop below is part of a program to determie the umber of triagles formed by poits i the plae. for i =1 to for j =

More information

DETECTION OF LANDSLIDE BLOCK BOUNDARIES BY MEANS OF AN AFFINE COORDINATE TRANSFORMATION

DETECTION OF LANDSLIDE BLOCK BOUNDARIES BY MEANS OF AN AFFINE COORDINATE TRANSFORMATION Proceedigs, 11 th FIG Symposium o Deformatio Measuremets, Satorii, Greece, 2003. DETECTION OF LANDSLIDE BLOCK BOUNDARIES BY MEANS OF AN AFFINE COORDINATE TRANSFORMATION Michaela Haberler, Heribert Kahme

More information

Using VTR Emulation on Avid Systems

Using VTR Emulation on Avid Systems Usig VTR Emulatio o Avid Systems VTR emulatio allows you to cotrol a sequece loaded i the Record moitor from a edit cotroller for playback i the edit room alog with other sources. I this sceario the edit

More information

On Nonblocking Folded-Clos Networks in Computer Communication Environments

On Nonblocking Folded-Clos Networks in Computer Communication Environments O Noblockig Folded-Clos Networks i Computer Commuicatio Eviromets Xi Yua Departmet of Computer Sciece, Florida State Uiversity, Tallahassee, FL 3306 xyua@cs.fsu.edu Abstract Folded-Clos etworks, also referred

More information

Customer Portal Quick Reference User Guide

Customer Portal Quick Reference User Guide Customer Portal Quick Referece User Guide Overview This user guide is iteded for FM Approvals customers usig the Approval Iformatio Maagemet (AIM) customer portal to track their active projects. AIM is

More information

MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fitting)

MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fitting) MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fittig) I this chapter, we will eamie some methods of aalysis ad data processig; data obtaied as a result of a give

More information

Term Project Report. This component works to detect gesture from the patient as a sign of emergency message and send it to the emergency manager.

Term Project Report. This component works to detect gesture from the patient as a sign of emergency message and send it to the emergency manager. CS2310 Fial Project Loghao Li Term Project Report Itroductio I this project, I worked o expadig exercise 4. What I focused o is makig the real gesture recogizig sesor ad desig proper gestures ad recogizig

More information

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Descriptive Statistics

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Descriptive Statistics ENGI 44 Probability ad Statistics Faculty of Egieerig ad Applied Sciece Problem Set Descriptive Statistics. If, i the set of values {,, 3, 4, 5, 6, 7 } a error causes the value 5 to be replaced by 50,

More information

1 Graph Sparsfication

1 Graph Sparsfication CME 305: Discrete Mathematics ad Algorithms 1 Graph Sparsficatio I this sectio we discuss the approximatio of a graph G(V, E) by a sparse graph H(V, F ) o the same vertex set. I particular, we cosider

More information

Optimal Mapped Mesh on the Circle

Optimal Mapped Mesh on the Circle Koferece ANSYS 009 Optimal Mapped Mesh o the Circle doc. Ig. Jaroslav Štigler, Ph.D. Bro Uiversity of Techology, aculty of Mechaical gieerig, ergy Istitut, Abstract: This paper brigs out some ideas ad

More information

CSE 2320 Notes 8: Sorting. (Last updated 10/3/18 7:16 PM) Idea: Take an unsorted (sub)array and partition into two subarrays such that.

CSE 2320 Notes 8: Sorting. (Last updated 10/3/18 7:16 PM) Idea: Take an unsorted (sub)array and partition into two subarrays such that. CSE Notes 8: Sortig (Last updated //8 7:6 PM) CLRS 7.-7., 9., 8.-8. 8.A. QUICKSORT Cocepts Idea: Take a usorted (sub)array ad partitio ito two subarrays such that p q r x y z x y y z Pivot Customarily,

More information

Pattern Recognition Systems Lab 1 Least Mean Squares

Pattern Recognition Systems Lab 1 Least Mean Squares Patter Recogitio Systems Lab 1 Least Mea Squares 1. Objectives This laboratory work itroduces the OpeCV-based framework used throughout the course. I this assigmet a lie is fitted to a set of poits usig

More information

Math 3201 Notes Chapter 4: Rational Expressions & Equations

Math 3201 Notes Chapter 4: Rational Expressions & Equations Learig Goals: See p. tet.. Equivalet Ratioal Epressios ( classes) Read Goal p. 6 tet. Math 0 Notes Chapter : Ratioal Epressios & Equatios. Defie ad give a eample of a ratioal epressio. p. 6. Defie o-permissible

More information

Load balanced Parallel Prime Number Generator with Sieve of Eratosthenes on Cluster Computers *

Load balanced Parallel Prime Number Generator with Sieve of Eratosthenes on Cluster Computers * Load balaced Parallel Prime umber Geerator with Sieve of Eratosthees o luster omputers * Soowook Hwag*, Kyusik hug**, ad Dogseug Kim* *Departmet of Electrical Egieerig Korea Uiversity Seoul, -, Rep. of

More information

Counting the Number of Minimum Roman Dominating Functions of a Graph

Counting the Number of Minimum Roman Dominating Functions of a Graph Coutig the Number of Miimum Roma Domiatig Fuctios of a Graph SHI ZHENG ad KOH KHEE MENG, Natioal Uiversity of Sigapore We provide two algorithms coutig the umber of miimum Roma domiatig fuctios of a graph

More information

G r a d e. 5 M a t h e M a t i c s. shape and space

G r a d e. 5 M a t h e M a t i c s. shape and space G r a d e 5 M a t h e M a t i c s shape ad space Grade 5: Shape ad Space (Measuremet) (5.SS.1) Edurig Uderstadigs: there is o direct relatioship betwee perimeter ad area. Geeral Outcome: Use direct or

More information

Sorting in Linear Time. Data Structures and Algorithms Andrei Bulatov

Sorting in Linear Time. Data Structures and Algorithms Andrei Bulatov Sortig i Liear Time Data Structures ad Algorithms Adrei Bulatov Algorithms Sortig i Liear Time 7-2 Compariso Sorts The oly test that all the algorithms we have cosidered so far is compariso The oly iformatio

More information

Morgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5

Morgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5 Morga Kaufma Publishers 26 February, 28 COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter 5 Set-Associative Cache Architecture Performace Summary Whe CPU performace icreases:

More information

Redundancy Allocation for Series Parallel Systems with Multiple Constraints and Sensitivity Analysis

Redundancy Allocation for Series Parallel Systems with Multiple Constraints and Sensitivity Analysis IOSR Joural of Egieerig Redudacy Allocatio for Series Parallel Systems with Multiple Costraits ad Sesitivity Aalysis S. V. Suresh Babu, D.Maheswar 2, G. Ragaath 3 Y.Viaya Kumar d G.Sakaraiah e (Mechaical

More information

BAYESIAN WITH FULL CONDITIONAL POSTERIOR DISTRIBUTION APPROACH FOR SOLUTION OF COMPLEX MODELS. Pudji Ismartini

BAYESIAN WITH FULL CONDITIONAL POSTERIOR DISTRIBUTION APPROACH FOR SOLUTION OF COMPLEX MODELS. Pudji Ismartini Proceedig of Iteratioal Coferece O Research, Implemetatio Ad Educatio Of Mathematics Ad Scieces 014, Yogyakarta State Uiversity, 18-0 May 014 BAYESIAN WIH FULL CONDIIONAL POSERIOR DISRIBUION APPROACH FOR

More information

Chapter 9. Pointers and Dynamic Arrays. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Chapter 9. Pointers and Dynamic Arrays. Copyright 2015 Pearson Education, Ltd.. All rights reserved. Chapter 9 Poiters ad Dyamic Arrays Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Overview 9.1 Poiters 9.2 Dyamic Arrays Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Slide 9-3

More information

Intermediate Statistics

Intermediate Statistics Gait Learig Guides Itermediate Statistics Data processig & display, Cetral tedecy Author: Raghu M.D. STATISTICS DATA PROCESSING AND DISPLAY Statistics is the study of data or umerical facts of differet

More information

Creating Exact Bezier Representations of CST Shapes. David D. Marshall. California Polytechnic State University, San Luis Obispo, CA , USA

Creating Exact Bezier Representations of CST Shapes. David D. Marshall. California Polytechnic State University, San Luis Obispo, CA , USA Creatig Exact Bezier Represetatios of CST Shapes David D. Marshall Califoria Polytechic State Uiversity, Sa Luis Obispo, CA 93407-035, USA The paper presets a method of expressig CST shapes pioeered by

More information

ISSN (Print) Research Article. *Corresponding author Nengfa Hu

ISSN (Print) Research Article. *Corresponding author Nengfa Hu Scholars Joural of Egieerig ad Techology (SJET) Sch. J. Eg. Tech., 2016; 4(5):249-253 Scholars Academic ad Scietific Publisher (A Iteratioal Publisher for Academic ad Scietific Resources) www.saspublisher.com

More information

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Pseudocode ( 1.1) High-level descriptio of a algorithm More structured

More information

Cubic Polynomial Curves with a Shape Parameter

Cubic Polynomial Curves with a Shape Parameter roceedigs of the th WSEAS Iteratioal Coferece o Robotics Cotrol ad Maufacturig Techology Hagzhou Chia April -8 00 (pp5-70) Cubic olyomial Curves with a Shape arameter MO GUOLIANG ZHAO YANAN Iformatio ad

More information

condition w i B i S maximum u i

condition w i B i S maximum u i ecture 10 Dyamic Programmig 10.1 Kapsack Problem November 1, 2004 ecturer: Kamal Jai Notes: Tobias Holgers We are give a set of items U = {a 1, a 2,..., a }. Each item has a weight w i Z + ad a utility

More information

Big-O Analysis. Asymptotics

Big-O Analysis. Asymptotics Big-O Aalysis 1 Defiitio: Suppose that f() ad g() are oegative fuctios of. The we say that f() is O(g()) provided that there are costats C > 0 ad N > 0 such that for all > N, f() Cg(). Big-O expresses

More information

HADOOP: A NEW APPROACH FOR DOCUMENT CLUSTERING

HADOOP: A NEW APPROACH FOR DOCUMENT CLUSTERING Y.K. Patil* Iteratioal Joural of Advaced Research i ISSN: 2278-6244 IT ad Egieerig Impact Factor: 4.54 HADOOP: A NEW APPROACH FOR DOCUMENT CLUSTERING Prof. V.S. Nadedkar** Abstract: Documet clusterig is

More information

Ch 9.3 Geometric Sequences and Series Lessons

Ch 9.3 Geometric Sequences and Series Lessons Ch 9.3 Geometric Sequeces ad Series Lessos SKILLS OBJECTIVES Recogize a geometric sequece. Fid the geeral, th term of a geometric sequece. Evaluate a fiite geometric series. Evaluate a ifiite geometric

More information