Optimized Query Planning of Continuous Aggregation Queries in Dynamic Data Dissemination Networks

Size: px
Start display at page:

Download "Optimized Query Planning of Continuous Aggregation Queries in Dynamic Data Dissemination Networks"

Transcription

1 WWW 007 / Trac: Performance and Scalablty Sesson: Scalable Systems for Dynamc Content Otmzed Query Plannng of Contnuous Aggregaton Queres n Dynamc Data Dssemnaton Networs Rajeev Guta IBM Inda Research Lab New Delh, Inda grajeev@n.bm.com Krth Ramamrtham Indan Insttute of Technology Mumba, Inda rth@cse.tb.ac.n ABSTRACT Contnuous ueres are used to montor changes to tme varyng data and to rovde results useful for onlne decson mang. Tycally a user desres to obtan the value of some aggregaton functon over dstrbuted data tems, for examle, to now (a) the average of temeratures sensed by a set of sensors (b) the value of ndex of md-ca stocs. In these ueres a clent secfes a coherency reurement as art of the uery. In ths aer we resent a low-cost, scalable technue to answer contnuous aggregaton ueres usng a content dstrbuton networ of dynamc data tems. In such a networ of data aggregators, each data aggregator serves a set of data tems at secfc coherences. Just as varous fragments of a dynamc web-age are served by one or more nodes of a content dstrbuton networ, our technue nvolves decomosng a clent uery nto sub-ueres and executng sub-ueres on judcously chosen data aggregators wth ther ndvdual sub-uery ncoherency bounds. We rovde a technue of gettng the otmal uery lan (.e., set of subueres and ther chosen data aggregators) whch satsfes clent uery s coherency reurement wth least cost, measured n terms of the number of refresh messages sent from aggregators to the clent. For estmatng uery executon cost, we buld a contnuous uery cost model whch can be used to estmate the number of messages reured to satsfy the clent secfed ncoherency bound. Performance results usng real-world traces show that our cost based uery lannng leads to ueres beng executed usng less than one thrd the number of messages reured by exstng schemes. Categores and Subject Descrtors H.3.5 [Onlne Informaton Servces]: Web Based Servces General Terms Algorthms, Management, Measurement, Performance, Desgn. Keywords Content dstrbuton networs, Dynamc data, contnuous aggregaton ueres, data coherency, uery dssemnaton cost. 1. INTRODUCTION Many data ntensve alcatons delvered over the Web suffer from erformance and scalablty ssues. Content dstrbuton networs (CDNs) solved the roblem for statc content usng Coyrght s held by the Internatonal World Wde Web Conference Commttee (IW3C). Dstrbuton of these aers s lmted to classroom use and ersonal use by others. WWW 007, May 8 1, 007, Banff, Alberta, Canada. ACM /07/0005. caches at the edge nodes of the networs. CDNs contnue to evolve to serve more and more dynamc alcatons [1, ]. A dynamcally generated web age s usually assembled usng a number of statc or dynamcally generated fragments. The statc fragments are served from the local caches whereas dynamc fragments are created ether by usng the cached data or by fetchng the data tems from the orgn data sources. One mortant ueston for satsfyng clent reuests through a networ of nodes s how to select the best node(s) to satsfy the reuest. For statc ages content reuested, roxmty to the clent and load on the nodes are the arameters generally used to select the arorate node [3, 4]. In dynamc CDNs, whle selectng the node(s) to satsfy the clent reuest, the central ste (to-level CDN node) has to ensure that age/data served meets clent s coherency reurements also. Technues to effcently serve fast changng data tems wth guaranteed ncoherency bounds have been roosed n the lterature [5, 6]. Such dynamc data dssemnaton networs can be used to dssemnate data such as stoc uotes, temerature data from sensors, traffc nformaton, and networ montorng data. In ths aer we roose a method to effcently answer aggregaton ueres nvolvng such data tems. In data dssemnaton schemes roosed n lterature [5, 6], a herarchcal networ of data aggregators s emloyed such that each data aggregator serves the data tem at some guaranteed ncoherency bound. Incoherency of a data tem at a gven node s defned as the dfference n value of the data tem at the data source and the value at that node. Although CDNs use ageurge [8] based coherency management, we assume that n dynamc data dssemnaton networs, these messages carry the new data values thereby an nvaldaton message becomes a refresh message. For mantanng a certan ncoherency bound, a data aggregator gets data udates from the data source or some hgher level data aggregator so that the data ncoherency s not more than the data ncoherency bound. In a herarchcal data dssemnaton networ a hgher level aggregator guarantees a tghter ncoherency bound comared to a lower level aggregator. Thus data refreshes are ushed from the data sources to the clents through the networ of aggregators. Dssemnaton networs for varous data tems (ossbly from dfferent data sources) can be overlad over a sngle networ of data aggregators as shown n Fgure 1. Thus, from a data dssemnaton caablty ont of vew, each data aggregator (DA) s characterzed by a set of (s, c ) ars, where s s the data tem whch the DA can dssemnate at an ncoherency bound c. Examle 1: In a networ of data aggregators managng data tems S 1 -S 4, varous aggregators can be characterzed as- D1: {(S 1, 0.5), (S 3, 0.)} D: {(S 1, 1.0), (S, 0.1), (S 4, 0.)} 31

2 WWW 007 / Trac: Performance and Scalablty Aggregator D1 can serve values of S 1 wth an ncoherency bound greater than or eual to 0.5 whereas D can dssemnate the same data tem at a looser ncoherency bound of 1.0 or more. Usually, clent s nterested n an aggregaton of these dynamc data tems at a certan ncoherency bound. These contnuous ueres are used to montor changes n dynamc data and rovde results useful for onlne decson mang. For generatng the result of a uery, data from multle sources s reured. As a result, the uery has to be evaluated ether at data aggregators [9] or at the clent. In ths aer we assume exstence of data dssemnaton networ of multle data tems to answer a class of ueres termed, contnuous ncoherency bounded weghted aggregaton ueres, whch are formally defned next. 1.1 Contnuous Incoherency-Bounded Weghted Aggregaton Queres A contnuous weghted aggregaton uery can be formally wrtten as: V s ( t) n = = s = 1 ( t) w V s s the value of a clent uery nvolvng n data tems wth the weght of the th data tem beng w, 1 n. s (t) s the value of the th data tem at the data source at tme t. Such a uery encomasses SQL aggregaton oerators SUM and AVG besdes general weghted aggregaton ueres such as ortfolo ueres, nvolvng aggregaton of stoc rces, weghted wth number of shares of stocs n the ortfolo. Due to sace lmtatons we are not resentng executon schemes for other aggregaton ueres such as MIN/MAX. Interested readers are referred to [10] for the extended verson of ths aer. Let the value of th data tem, n Euaton (1), nown to the clent/da be d (t). Then the data ncoherency s gven by s (t)- d (t). For a data tem whch needs to be dssemnated at an ncoherency bound C the data refresh s sent to the clent or lower level DA, f the s (t)-d (t) s more than C. If user secfed ncoherency bound for the uery s C, then the dssemnaton networ has to ensure that: n (1) ( s ( t) d ( t)) w C () = 1 Whenever data values at sources change such that uery ncoherency bound s volated, the udated value(s) s dssemnated to the clent. If the networ of aggregators can ensure that the th data tem has ncoherency bound C then the followng condton ensures that the uery ncoherency bound C s satsfed: n C w = 1 C A clent secfed uery ncoherency bound needs to be translated nto ncoherency bounds for ndvdual data tems or sub-ueres such that Euaton (3) s satsfed. It should be noted that Euaton (3) s suffcent condton for satsfyng the uery ncoherency bound but not the necessary. Ths way of translatng the uery ncoherency bound nto the sub-uery ncoherency bounds s reured f data s transferred between varous nodes usng only ush based mechansm. Such a translaton s not reured n ether a ull based mechansm as shown n our earler aer [9] or combnatons of ush and ull. In ths aer we (3) Sesson: Scalable Systems for Dynamc Content consder only ush based data dssemnaton among servers, DAs and clents. Next we resent the summary of our aroach towards executng the contnuous mult-data weghted addtve aggregaton uery wth the objectve of mnmzaton of number of refreshes from data aggregators to the clent. Our technue can be used for varous oular alcatons where dfferent clents reure aggregaton of multle data tems at ther ndvdual ncoherency bounds. Montorng stoc ortfolos s one such oular alcaton whch we use for erformance measurements. Sources Networ of data aggregators Clents Fgure 1: Data dssemnaton networ for multle data tems 1. Summary of Aroach and Contrbutons Consder a clent uery Q1=50 S S S 3 wth a reured ncoherency bound of 80 (n a stoc ortfolo S 1, S, S 3 can be dfferent stocs and ncoherency bound can be $80).We want to execute ths uery over data aggregators gven n Examle1 mnmzng number of refreshes. There are varous otons for the clent to get the data tems: The clent may get the data tems S 1, S and S 3 searately. The uery ncoherency bounds can be dvded among data tems n varous ways whle satsfyng Euaton 3. In ths aer, we show that gettng data tems ndeendently s a costly oton. Ths strategy gnores facts that the clent s nterested only n the aggregated value of the data tems and varous aggregators can dssemnate more than one data tem. If a sngle DA can dssemnate all three data tems reured to answer the clent uery, the DA can construct a comoste data tem corresondng to the clent uery (S =50 S S S 3 ) and dssemnate the result to the clent so that the uery ncoherency bound s not volated. It s obvous that f we get the uery result from a sngle DA, the number of refreshes wll be mnmum (as n ths case data tem udates may cancel out each other, thereby eeng the uery result wthn the ncoherency bound). As dfferent data aggregators dssemnate dfferent subsets of data tems, no data aggregator may have all the data tems reured to execute the clent uery whch s ndeed the case n Examle1. Further, even f an aggregator can dssemnate all the data tems, t may not be able to satsfy the uery coherency reurements. In such cases the uery has to be executed wth data from multle aggregators. Another oton s to dvde the uery nto a number of subueres and get ther values from ndvdual DAs. In that case, the clent uery result s obtaned by combnng the results of 3

3 WWW 007 / Trac: Performance and Scalablty more than one sub-uery. For the DAs gven n Examle1, the uery Q1 can be dvded n two alternatve ways: lan1: D1 {50 S S 3 }; D {S } lan: D1 {S 3 }; D {50 S 1, + 00 S }.e., n lan1 result of sub-uery 50 S S 3 s dssemnated by D1 and that of S (or 00 S ) by D. Combnng them at the clent gves the uery result. Selectng the otmal lan among varous otons s not-trval. As a thumb-rule, we should be selectng the lan wth lesser number of sub-ueres. But that s not guaranteed to be the lan wth the least number of messages. Further, we should select the sub-ueres such that udates to varous data tems aearng n a sub-uery have more chances of cancelng each other as that wll reduce the need for refresh to the clent (Euaton ). In the above examle, f udates to S 1 and S 3 are such that when S 1 ncreases, S 3 decreases, and vce-versa, then selectng lan1 may be benefcal. We gve an algorthm to select the uery lan based on these observatons. Whle solvng the above roblem of selectng the otmal lan we ensure that each data tem for a clent uery s dssemnated by one and only one data aggregator. Although a uery can be dvded n such a way that a sngle data tem s served by multle DAs (e.g., 50 S S S 3 s dvded nto two sub-ueres 50 S S and 70 S S 3 ); but n dong so the same data tem needs to be rocessed at multle aggregators, ncreasng the unnecessary rocessng load. By dvdng the clent uery nto dsjont sub-ueres we ensure that a data tem udate s rocessed only once for each uery (For examle, n case of ad data subscrtons t s not rudent to get the same data tem from the multle sources). The uery ncoherency bound needs to be dvded among subuery ncoherency bounds such that, besdes satsfyng the clent coherency reurements, the chosen DA (where the subuery s to be executed) s caable of satsfyng the allocated sub-uery ncoherency bound. For examle, n lan1 allocated ncoherency bound to the sub-uery 50S S 3 should be greater than 55 (=50* *0.) as that s the tghtest ncoherency bound whch the aggregator D1 can satsfy. We rove that the number of refreshes deends on the dvson of the uery ncoherency bounds among sub-uery ncoherency bounds. Smlar result was roved for data ncoherency bounds n [11]. Thus, what we need s a method of (a) otmally dvdng clent uery nto sub-ueres and (b) assgnng ncoherency bounds to them; such that (c) selected sub-ueres can be executed at chosen DAs and (d) total uery executon cost, n terms of number of refreshes, s mnmzed. We rove that the roblem of choosng sub-ueres whle mnmzng uery executon cost s an NPhard roblem. We gve effcent aroxmaton algorthms to choose the set of sub-ueres and ther corresondng ncoherency bounds for a gven clent uery. In contrast, all related wor n ths area [11, 1] roose gettng ndvdual data tems from the aggregators whch, as we show n ths aer, leads to large number of refreshes. For solvng the above roblem of otmally dvdng the clent uery nto sub-ueres, we frst need a method to estmate uery executon cost for varous alternatve otons. A method for estmatng the uery executon cost s another mortant contrbuton of ths aer. As we dvde the clent uery nto sub-ueres such that each sub-uery gets Sesson: Scalable Systems for Dynamc Content executed at dfferent aggregator nodes, the uery executon cost (.e., number of refreshes) s the sum of the executon costs of ts consttuent sub-ueres. We model the sub-uery executon cost as a functon of followng arameters: (a) Dssemnaton costs of the ndvdual data tems nvolved. The data dssemnaton cost s deendent on data dynamcs and ncoherency bound assocated wth the data. We model the data dynamcs usng a data synoss model, and the effect of the ncoherency bound usng an ncoherency bound model. These two models are combned to get the estmate of the data dssemnaton cost. (b) A correlaton measure of data dynamcs, uantfyng the chance that the udates of two data tems wll cancel each other out such that a sub-uery of ther sum wll ncur less refreshes than dssemnatng the ndvdual data changes. We use cosne smlarty between data tems for ths urose. Ths arameter s wdely used n nformaton retreval doman [0]. Through extensve smulatons we show that: Our method of dvdng uery nto sub-ueres and executng them at ndvdual DAs reures less than one thrd of the number of refreshes reured n the exstng schemes. For effcent executon, more dynamc data tem should be art of sub-uery nvolvng larger number of data tems. Our method of executng ueres over dynamc data dssemnaton networ s ractcal snce t can be mlemented usng a mechansm smlar to URL-rewrtng [4] n CDNs. Just le n a CDN, the clent sends ts uery to the central ste. For gettng arorate aggregators (edge nodes) to answer the clent uery (web age), the central ste has to frst determne whch data aggregators have the data tems reured for the clent uery. If the clent uery can not be answered by a sngle data aggregator, the uery s dvded nto sub-ueres (fragments) and each subuery s assgned to a sngle data aggregator. In case of a CDN, web age s dvson nto fragments s a age desgn ssue, whereas, for contnuous aggregaton ueres, ths ssue has to be handled on er-uery bass by consderng data dssemnaton caabltes of data aggregators as reresented n Examle Outlne of the Paer We gve a formal mathematcal defnton of the uery lan selecton roblem n Secton. Query cost model for a mult-data ncoherency bounded aggregaton uery s develoed n Secton 3. The uery cost model uses the data dssemnaton model resented n Secton 3.1 and cosne smlarty measure whch s exlaned n Secton 3.. In Secton 4, we frst rove that the otmzaton roblem resented n Secton s NP-hard then we gve aroxmate algorthms for the roblem. In Secton 5, erformance evaluaton done usng real-world traces s resented to show that our sub-uery based uery evaluaton scheme executes the clent uery at less than one thrd cost comared to other nown schemes. Related wor s resented n Secton 6 and the aer concludes n Secton 7.. QUERY PLAN SELECTION PROBLEM In ths secton, we gve a formal defnton of the otmzaton roblem descrbed n the revous secton. We are gven a set D of data aggregators, set S of data tems and one-to-many mang f: D(S, C) where C s a sub-set of real number reresentng ncoherency bounds for varous data tems (n the set S ) at 33

4 WWW 007 / Trac: Performance and Scalablty aggregators n D. Each ncomng clent uery over the data tems set S S has corresondng weghts gven as a set W. Thus the uery can be reresented as set of tules of data_tem, weght,.e., = {s, w } wth the uery ncoherency bound C. We need to erform the followng two tass such that the number of refreshes to the clent s mnmum: Tas1: Dvde the clent uery = {s, w } nto sub-ueres = { s, w } so that =.e., although dfferent sub-ueres may be executed at dfferent aggregators, combnng ther results gves the value of the clent uery. Tas: Allocate each sub-uery, wth ts ncoherency bound C, to data aggregators. Whle fulfllng the followng condtons: Condton1: Query ncoherency bound s satsfed,.e., C C. The sub-uery should be assgned to a data aggregator d D ff: Condton: The chosen aggregator should have all the data tems aearng n the sub-uery.e. S ( ) S ( f ( d )). Here ndcates roject oerator n relatonal algebra. Condton3: Data ncoherency bounds at the selected data aggregator c j = C ( σ ( f ( d )) s= s ( j) should be such that C X where s (j) s the j th data tem aearng n the subuery and X s the tghtest ncoherency bound the aggregator can ensure for the gven sub-uery. X can be calculated as: X =c j w j. Here ndcates select oerator n relatonal algebra. 3. QUERY COST MODEL Before develong the uery cost model we frst summarze the model to estmate the number of refreshes reured to dssemnate a data tem at certan ncoherency bound. For smulaton exerments we use data tems from sensor networ and stoc data domans as exlaned n our revous wor [9]. Stoc traces of 45 stocs were obtaned by erodcally ollng htt://fnance.yahoo.com. Sensor networ data used were temerature and wnd sensor data from Georges Ban Cruses Albatross Shboard [13]. Due to aucty of sace we resent results usng stoc data only but smlar results were obtaned for sensor data as well [14]. For detaled analyss and smulaton results, readers can refer to the extended verson of the aer [10]. 3.1 Data Dssemnaton Cost Cost of dssemnatng a data tem at a certan gven ncoherency bound C can be estmated by combnng two models: Incoherency bound model s used for estmatng deendency of data dssemnaton cost over the desred ncoherency bound. As er ths model, we have shown n [10] that the number of data refreshes s nversely roortonal to the suare of the ncoherency bound (1/C ). Smlar result was earler reorted n [5] where the data dynamcs was modeled as a randomwal rocess. Data dssemnaton cost 1/C (4) Sesson: Scalable Systems for Dynamc Content Data Synoss Model s used for estmatng the effect of data dynamcs on number of data refreshes. We defne a data dynamcs measure called, sumdff, to obtan a synoss of the data for redctng the dssemnaton cost. The number of udate messages for a data tem s lely to be hgher f the data tem changes more n a gven tme wndow. Thus we hyothesze that cost of data dssemnaton for a data tem wll be roortonal to sumdff, defned as: R s = s s 1 (5) where s and s -1 are the samled values of the data tem at th and (-1) th tme nstances (consecutve tcs). In [10] we corroborate the above hyothess usng smulaton over a large number of data tems. Pearson roduct moment correlaton coeffcent (PPMCC) [19] values, used for uantfyng lnearty between data sumdff and number of refreshes reured to mantan a fxed ncoherency bound, were found to be between 0.90 and 0.96 for varous values of ncoherency bounds. Sumdff value for a data tem can be calculated at the data source by tang runnng average of dfference between data values at the consecutve tcs. A data aggregator can also estmate the sumdff value by nterolatng the dssemnated values. Thus, the estmated dssemnaton cost for data tem S, dssemnated wth an ncoherency bound C, s roortonal to R s /C. Next we use ths result for develong the uery cost model. 3. Query Dssemnaton Cost Consder a case where a uery conssts of two data tems P and Q wth weghts w and w resectvely; and we want to estmate ts dssemnaton cost. If data tems are dssemnated searately, the uery sumdff wll be: R w R + w R = w + w (6) data = 1 1 Instead, f the aggregator uses the nformaton that clent s nterested n a uery over P and Q (rather than ther ndvdual values), t maes a comoste data tem w +w and dssemnates that data tem then the uery sumdff wll be: Ruery = w ( 1 ) + w ( 1 ) R uery s clearly less than or eual comared to R data. Thus we need to estmate the sumdff of an aggregaton uery (.e., R uery ) gven the sumdff values of ndvdual data tems (.e., R and R ). Only data aggregators are n oston to calculate R uery as dfferent data tems may be from dfferent sources. We develo the uery dssemnaton model n two stages Quantfyng correlaton between dynamcs of data From Euatons (6) and (7) we can see that f two data tems are correlated such that f value of one data tem ncreases, that of the other data tem also ncreases, then R uery wll be closer to R data whereas f the data tems are nversely correlated then R uery wll be less comared to R data. Thus, ntutvely, we can reresent the relatonsh between R uery and sumdff values of the ndvdual data tems usng a correlaton measure assocated wth the ar of data tems. Secfcally, f s the correlaton measure then R uery can be wrtten as: R uery (7) ( w R + w R + ρw R w R ) (8) 34

5 WWW 007 / Trac: Performance and Scalablty The correlaton measure s defned such that 1 +1, so, R uery wll always be less than w R +w R (as exlaned earler) and always be more than w R w R. The correlaton measure can be nterreted as cosne smlarty [0] between two streams reresented by data tems P and Q. Cosne smlarty s a wdely used measure n nformaton retreval doman where documents are reresented usng a vector-sace model and document smlarty s measured usng cosne of angle between two document reresentatons. For data streams P and Q, can be calculated as: ( 1)( 1) ρ = (9) ( ) ( ) Query normalzaton Suose we want to comare the cost of two ueres: a SUM uery nvolvng two data tems and an AVG uery nvolvng the same data tems. Let the uery ncoherency bound for the SUM and the AVG ueres be C 1 =C and C =C, resectvely. From Euaton (8), sumdff of the SUM uery wll be double that of the AVG uery (as the weght of each data tem n the SUM uery s double of that n the AVG uery). Hence, uery evaluaton cost (as er R /C ) of the SUM uery wll be half that of the AVG uery (as SUM uery ncoherency bound s double). But, ntutvely, dssemnatng the SUM of two data tems, at double the ncoherency bound should reure the same number of messages as ther AVG. Thus, there s a need to normalze uery costs. From a uery executon cost ont of vew, a uery wth weghts w and ncoherency bound C s same as uery wth weghts w and ncoherency bound.c. So, whle normalzng we need to ensure that both, uery weghts and ncoherency bounds, are multled by the same factor. Normalzed uery sumdff s gven by: R uery ( w R + wr + ρw RwR ) = (10) ( w + w + ρw w ).e., the value of the normalzng factor should be 1/ w + w + ρw w. The value of the ncoherency bound has to be adjusted by the same factor. Normalzaton ensures that ueres wth arbtrary values of weghts can be comared for executon cost estmates. From Euatons (9 and 10) the value of uery sumdff can be estmated at a data aggregator node f t has all the reured data tems dssemnated to t. An aggregator can use nterolated values of data tems to estmate as t s not (always) lely to have all the data udates. In the extended verson of the aer [10] we resent an effcent method (usng [1]) to calculate whch can also be used when the corresondng data tems are not beng dssemnated by the same data aggregator. Euaton (10) can be extended to get uery sumdff for any general weghted aggregaton uery gven by Euaton (1) as: R Q n w R + ρjww jr R = 1 = 1 j = 1, j = n n n w + ρ w w = 1 = 1 j = 1, j n n j j j (11) Sesson: Scalable Systems for Dynamc Content 3..3 Valdatng the uery cost model To valdate the uery cost model we erformed smulatons by constructng more than 50 weghted aggregaton ueres usng the stoc data wth each uery consstng of 3-7 data tems wth data weghts unformly dstrbuted between 1 and 10. For each uery the number of refreshes was counted for varous normalzed ncoherency bounds between 0.01 and 0.5. Fgure shows that the number of messages s roortonal to the normalzed uery sumdff as calculated usng Euaton (11) f ther normalzed ncoherency bounds are same. In ths case PPMCC value s found to be 95%. Smlarly, Fgure 3 shows the deendence of the number of refreshes on 1/C to rove that the relatonsh that holds between them for sngle data tem also holds for a uery wth multle data tems. The uery cost model can be used n varous alcatons of uery assgnment, load balancng, otmal order of rocessng, etc. In the next secton, we use ths uery cost model for our uery lan roblem to otmally dvde a clent uery nto sub-ueres and execute t over a networ of data aggregators so that the number of refreshes can be mnmzed Fgure : Varaton of uery cost wth uery sumdff (Normalzed C=0.3) Fgure 3: Number of refreshes for varyng uery ncoherency bounds 4. EXCEUCTING QUERIES USING SUB- QUERIES For executng an ncoherency bounded contnuous uery, a uery lan s reured whch ncludes the set of sub-ueres, ther ndvdual ncoherency bounds and data aggregators whch can execute these sub-ueres. We need to fnd the otmal uery executon lan whch satsfes clent coherency reurement wth the least number of refreshes. As exlaned n Secton 1, what we need s a mechansm to: Tas 1: Dvde the aggregaton uery nto sub-ueres; and 35

6 WWW 007 / Trac: Performance and Scalablty Tas : Allocate the uery ncoherency bound among them. whle satsfyng the followng condtons dentfed n Secton : condton 1. Query ncoherency bound s satsfed. condton. The chosen DA should be able to rovde all the data tems aearng n the sub-uery assgned to t. condton 3. Data ncoherency bounds at the chosen DA should be such that the sub-uery ncoherency bound can be satsfed at the chosen DA. Objectve : Number of refreshes should be mnmzed. Let the clent uery be dvded nto N sub-ueres { : 1N}; wth R beng sumdff of th sub-uery and C beng ncoherency bound assgned to t. As gven s Secton 3, the dssemnaton cost of a sub-uery s estmated to be roortonal to R /C. Thus uery cost estmate s gven by: N Z = ( R / C (1) = 1 Whle allocatng sub-uery ncoherency bounds we need to ensure that the uery coherency reurement C s satsfed (condton1);.e., N C C = 1 ) (13) For satsfyng condton, sub-ueres should be such that all ts data tems can be dssemnated by the chosen DA. Let X be the tghtest ncoherency bound (defned n Secton ) the chosen DA can satsfy for. For the condton3, we have to ensure that C X for each sub-uery and ts assgned data aggregator. Z needs to be mnmzed for mnmzng the number of refreshes as er the objectve. Before attemtng the hard roblem of otmzng Z, let us frst consder a smler roblem where values of C are gven. In ths smler roblem we dvde the clent uery nto sub-ueres to mnmze the estmated executon cost (Z) wthout consderng the otmal dvson of the uery ncoherency bound nto sub-uery ncoherency bounds. Besdes worng as a ste towards a soluton for the whole roblem ths case can also be used where allocaton of ncoherency bounds to sub-ueres s done ndeendent of the data dynamcs. For examle, t may be re-decded that ncoherency bounds for all data tems wll be the same. Thus, for a gven uery and ts ncoherency bounds, the sub-uery ncoherency bounds can be obtaned. Next we rove that ths smler verson of the roblem s NP-hard. 4.1 Fndng Otmal Query Plan s NP-hard For rovng that the roblem s NP-hard, we use reducton from 3-dmensonal matchng (3DM) roblem. For a gven clent uery and DA, let us frst defne maxmal sub-uery as the largest art of the uery whch can be dssemnated by the DA (.e., the maxmal sub-uery has all the uery data tems whch the DA can dssemnate at the reured ncoherency bound). For examle, for a clent uery 0S 1 +5S +35S 3 wth ncoherency bound 80; let the re-decded ncoherency bound for each data tem be 1. For the data aggregators D1 and D gven n Examle 1, the maxmal sub-uery for D1 wll be 1 =0S 1 +35S 3, whereas for D t wll be =0 S 1 + 5S. 3DM Problem: Gven three sets X, Y and Z, each wth N elements, and a set M X Y Z, s there a subset M1M such that M1 =N and no two elements of M1 agree n any coordnate? We use a slghtly dfferent (decson) verson of the otmzaton roblem to reduce the 3DM roblem. To solve the 3DM roblem we reduce t to a SUM uery of 3N tems: N The SUM uery: ( x + y + z ) for x X, y Y, z Z. Sesson: Scalable Systems for Dynamc Content = 1 We assume that all the data tems have the same sumdff values of 1; cosne smlarty between all the data tems s 0; and all data tems are allocated an ncoherency bound of 1. For each element (x, y j, z ) M, we assume the exstence of a data aggregator dssemnatng these three data tems only. In the decson verson of otmal lan roblem we as whether there exsts a uery lan wth uery cost estmate value N/3. If a uery lan wth cost estmate value N/3 exsts; t mles that the uery lan has N ueres wth 3 tems each (that wll lead to uery cost value of 1/3 er sub-uery as er Euaton (11) whereas any other combnaton of sub-ueres wll lead to more cost). Three data tems from each chosen data aggregators form a trlet for the set M1 whch solves 3DM. Because of sace constrants we are not gvng the comlete roof of NPhardness of the orgnal roblem. In general, there s no nown aroxmate algorthm for such a roblem. It should be noted that erformng Tas1 for achevng the objectve s NP-hard, so we gve two greedy heurstcs n next two sub-sectons; whereas Tas can be erformed otmally wth condtons1-3 whle achevng the objectve. In our aroach, we frst try to erform Tas1, whle satsfyng as many condtons as ossble, and then otmally erform Tas whle satsfyng all the condtons. 4. Mnmum Cost Heurstc Fgure 4 shows the outlne of greedy heurstcs where dfferent crtera () can be used to select sub-ueres. In ths secton we descrbe the case where the estmate of uery executon cost s mnmzed n each ste of the algorthm (mn-cost) whereas n the next secton we resent the case where gan due to executng a uery usng sub-ueres s maxmzed (max-gan) Query Plan wth Pre-decded Incoherency Bound Allocaton For the gven clent uery () and mang between data aggregators and the corresondng {data-tem, data ncoherency bound} ars (f: D(S, C)) maxmal sub-ueres can be obtaned for each data aggregator. Let A be the set of such maxmal subueres. In ths set, each uery a A can be dssemnated by a desgnated data aggregator at the assgned ncoherency bound. For each sub-uery a A, ts Sumdff R a s calculated usng Euaton 11. Usng the set A and sub-uery sumdffs, we use the algorthm outlned n Fgure 4 to get the set of sub-ueres mnmzng the uery cost. In ths Fgure each sub-uery a A s reresented by the set of data tems covered by t. As we need to mnmze the uery cost, a sub-uery wth mnmum cost er data tem s chosen n each teraton of the algorthm.e., crtera mnmze (R a /C a a ). All data tems covered by the selected subuery are removed from all the remanng sub-ueres n A before erformng the next teraton. 36

7 WWW 007 / Trac: Performance and Scalablty Result whle A choose a sub-uery a A wth crtera Result Result a A A-{a} for each data element e a for each b A b b-{e} f b = A A-{b} else calculate sumdff for modfed b return Result Fgure 4: Greedy algorthm for uery lan selecton 4.. Otmzng uery executon cost Now we consder the overall roblem to select the otmal set of sub-ueres whle smultaneously dvdng the uery ncoherency bound among them. In ths case we get the set of maxmal ueres (A) wthout consderng the mnmum ncoherency bounds that the data aggregators can satsfy (.e., condton3). In ths algorthm we frst get the otmal set of sub-ures wthout consderng the condton3 and then allocate ncoherency bound among them usng condton1 (Euaton (13)) and condton3. Lagrange multler scheme can be used to solve for ncoherency bounds (from Euatons 1 & 13) so that Z s mnmzed: C = R N = 1 C /( R ) (14).e., wthout the constrants of condton3, sub-uery ncoherency bounds should be allocated n roorton to R. Usng Euatons (1) and (14) we get: Z N 1/ 3 1 R / 3 = 1 = C (15) From Euaton (15), t s clear that for mnmzng the uery executon cost we should select the set of sub-ueres so that 3 R s mnmzed. We can do that by usng crtera 1/ mnmze ( R a / a ) n the algorthm descrbed n Fgure 4. Once we get the otmal set of sub-ueres we can use the Euaton (13) and condton3 (C X ) to otmally allocate the uery ncoherency bound among them. Ths allocaton roblem can be solved by varous convex otmzaton technues avalable n the lterature such as gradent descent method, barrer method etc. We used gradent descent method (fmncon functon n MATLAB) to solve ths non-lnear otmzaton roblem to get the values of ndvdual sub-uery ncoherency bounds. But ths method of frst selectng sub-ueres and then allocatng the ncoherency bounds has a roblem whch s descrbed next Satsfablty of Condton 3 In the soluton descrbed n the revous secton, we select the set of sub-ueres (and corresondng DAs) and then allocate the uery ncoherency bound among them usng convex otmzaton technues. But the roblem of ncoherency bound allocaton among chosen DAs may not have any feasble soluton. There may be stuatons where, although the data dssemnaton networ s able to satsfy the uery coherency reurements but once the set of sub-ueres (and corresondng DAs) s selected the ncoherency bound allocaton s not ossble. Examle : Consder a clent uery 50S S + 150S 3 wth the ncoherency bound of 80 and data dssemnaton networ consstng of two aggregators D1 and D as gven n Examle 1. There are (at-least) two ossble uery lans to answer the above uery: Plan1: D1 (50S S 3 ); D (S ) Plan: D1 (S 3 ); D (50S S ) In Secton 4.. we are selectng sub-ueres havng mnmum R, thus based on data dynamcs t s ossble that we select lan as the otmal lan. But from the data ncoherency bounds that aggregators D1 and D can ensure, we see that t s not ossble for lan to satsfy the clent secfed ncoherency bound as mnmum ncoherency bound that can be satsfed by the selected aggregators (X=50*1 +00* *0. =100) s greater than the uery ncoherency bound (=80). Thus although there exsts a lan (lan1) whch can satsfy the clent uery ncoherency bound, whle mnmzng the uery executon cost the above method cannot ensure that such a lan wll be selected. What we need s a comromse between the uery satsfablty and erformance. In Secton 4.. we are selectng the sub-ueres wthout consderng the data ncoherency bounds for the selected data aggregators. We correct that by selectng sub-ueres usng αx a ( Ra + ) as substtute objectve functon nstead of CR a R a. The second term ensures that whle selectng the otmal lan we refer the data aggregators havng tghter data ncoherency bounds (lower values of X a ) thus hgher chances of satsfyng the uery. The tunng arameter () can used to balance the objectves of mnmzng uery executon cost through subueres selecton and meetng the uery coherency reurements. We use X a / CRa n the second term as, accordng to Euaton (14), otmal ncoherency bound allocaton s lely to be done roortonal to CR a. In Secton 5., we measure effects of the tunng arameter on the uery satsfablty. 4.3 Maxmum Gan Heurstc In ths secton we resent an algorthm whch, nstead of mnmzng the estmated uery executon cost, maxmzes the estmated gans of executng clent uery usng sub-ueres. In ths algorthm, for each sub-uery, we calculate the relatve gan of executng t by fndng the sumdff dfference between cases when each data tem s obtaned searately and when all the data tems are aggregated as a sngle sub-uery. Thus, the relatve gan for a sub-uery w +w can be wrtten as: G uery Sesson: Scalable Systems for Dynamc Content = ( w R ( w R + w R + w R + ρw ) 1 w R R ) (16) Ths algorthm can be mlemented by usng crtera maxmze (G uery / a ) to get the set of sub-ueres and corresondng DAs. Then we use the convex otmzaton method outlned n Secton 4. to allocate ncoherency bounds among sub-ueres. To tacle the uery satsfablty ssue the uery gan Euaton (16) s modfed to: 37

8 WWW 007 / Trac: Performance and Scalablty G + w ' α( ) uery = Guery CRuery w X X (17) where X s mnmum ncoherency bound that can be satsfed for the data tem P; C s uery ncoherency bound and R uery s the uery sumdff (= ( w R + wr + ρw wrr ) ). Reasons for selectng the artcular substtute objectve functon are same as ones outlned n Secton In the next Secton, through erformance results, we show that ths algorthm erforms better than the mn-cost heurstc. 5. PERFORMANCE EVALUATION For erformance evaluaton we smulated the data dssemnaton networs of 5 stoc data tems over 5 aggregator nodes such that each aggregator can dssemnate combnatons of u to 10 data tems wth data ncoherency bounds chosen unformly between $0.005 and 0.0. Then we created 500 ortfolo ueres such that each uery has u to 10 randomly (unformly) selected data tems wth weghts varyng between and 10. These ueres were executed wth ncoherency bounds between 0.3 and 1.0 (.e., % of the uery value). In the frst set of exerments, we et the data ncoherency bounds at the data aggregators very low so that uery satsfablty can be ensured. 5.1 Comarson of algorthms For comarson wth our algorthms, resented n the revous secton, we consder varous other uery lan otons. Each uery can be executed by dssemnatng ndvdual data tems or by gettng sub-uery values from DAs. Set of sub-ueres can be selected usng sumdff based aroaches or any other random selecton. Sub-uery (or data) ncoherency bound can ether be re-decded or otmally allocated. Varous combnatons of these dmensons are covered n the followng algorthms: 1. No sub-uery, eual data ncoherency bound (naïve): In ths algorthm, the clent uery s executed wth each data tem beng dssemnated ndeendent of other data tems n the uery. Incoherency bound s dvded eually among the data tems. Ths algorthm acts as a baselne algorthm.. No sub-uery, otmal ncoherency bound (otc): In ths algorthm also data tems are dssemnated searately but ncoherency bound s dvded among data tems so that total number of refreshes can be mnmzed. Ths algorthm s smlar to the one resented n [11]. Here, the ncoherency bound s allocated dynamcally usng Euaton (14). 3. Random sub-uery selecton (random): In ths case, subueres are generated by randomly selectng one data aggregators and allocatng t the maxmal sub-uery consstng of uery data tems whch the aggregator can dssemnate. Then the rocess s reeated for the remanng data tems untl the whole uery s covered. Ths algorthm s desgned to see how the sub-uery selecton based on uery sumdff (Secton 4) wors n comarson to random selecton of sub-ueres. 4. Sub-uery selecton whle mnmzng sumdff (mn-cost): Ths algorthm s descrbed n Secton Sub-uery selecton whle maxmzng gan (max-gan): Ths algorthm s descrbed n Secton 4.3. Sesson: Scalable Systems for Dynamc Content Fgure 5 shows average number of refreshes reured for uery ncoherency bounds of $0.3, $0.5 and $0.8. The naïve algorthm reures more than three tmes the number of messages comared to mn-cost and max-gan algorthms. For ncoherency bound of $0.8 each uery reures 104 messages f t s executed just by otmzng ncoherency bound (otc) comared to 55 when we select the uery lan usng the max-gan algorthm. Further, although the otmzaton roblem s smlar to the coverng a set of data tems (uery) usng ts sub-sets (sub-ueres) for whch the greedy mn-cost algorthm s consdered to be most effcent [7], we see that max-gan algorthm reures 0-5% less messages comared to the mn-cost aroach. Reasons for max-gan algorthm erformng better than other algorthms are exlored n the next set of exerments. Although here we resented results for stoc traces (man-made data) smlar results were obtaned for sensor traces (natural data) as well. Fgure 5: Performance evaluaton of algorthms 5. Effect of Algorthmc Parameters These set of exerments were erformed to get an nsght nto varous characterstcs of our sub-uery selecton method whch lead t to erform better comared to other otons. We consder effects of three arameters on sub-uery selecton and, n turn on uery erformance: data dynamcs, correlaton between data dynamcs and uery satsfablty arameter Effect of data dynamcs In ths set of exerments, we wanted to see whether there s any defnte relatonsh between data dynamcs and sub-uery sze n whch that data tem aears. In ths exerment wth 10 data tems, 45 DAs were smulated such that each DA can dssemnate a dfferent set of data tems. Then 100 ueres were created each wth 3 data tems. In the otmal uery lan, each uery wll be executed wth two sub-ueres: one consstng of data tems and another wth sngle data tem (lan wth three one tem subueres wll be trvally neffcent). As the uery has only 3 data tems, only 3 such uery lans are ossble. We smulated all these otons to get the best uery lan. Fgure 6 shows varaton of average sub-uery sze n whch a artcular data tem aears versus sumdff value of the data tem. We can see that f a data tem s more dynamc, n the otmal lan, t s more lely to be art of larger sub-uery. Ths s an mortant observaton as t ndcates that for effcent uery evaluaton more dynamc data tems should be art of a larger sub-uery. Ths henomenon can be exlaned by the fact that by executng a uery as a combnaton of sub-ueres wll always be more effcent comared to gettng the data tems ndeendently. By combnng 38

9 WWW 007 / Trac: Performance and Scalablty Fgure 6: Effect of data sumdff on sub-uery sze more dynamc data tems we are lely to gan more. For comarson we also show the curve for the sub-uery selecton based on max-gan algorthm. It can be seen that sub-uery selecton usng max-gan s aroxmately same as that selected by the otmal soluton. By usng max-gan algorthm we acheve our objectve of dssemnatng more dynamc data tems as art of larger sub-ueres. For the max-gan algorthm, smlar results were obtaned for larger uery szes as well. In comarson, n the mn-cost algorthm most dynamc data tem s more lely to be dssemnated as sngle tem uery. Ths haens because the sumdff value of a more dynamc data tem wll be hgh thus n each ste of the mn-cost algorthm (Fgure 4), there s less chance of selectng a sub-uery wth more dynamc data tem. Thus, t s very lely that the hghly dynamc data tem wll be dssemnated as a sngle tem sub-uery resultng n bad erformance of the clent uery. Stll the mn-cost algorthm erforms better comared to random algorthm as t tres to execute the uery wth lesser number of sub-ueres. 5.. Effect of correlaton between data dynamcs To measure the effects of correlaton between data dynamcs (cosne smlarty) on the uery erformance, we comared the uery erformance wth the case when all the data tems are assumed to be ndeendent (.e., =0). For erformng these exerments we constructed 10 synthetc data traces so that values of for varous data tem ars were dstrbuted unformly between -1 and +1. Then 45 DAs were smulated so that each DA can dssemnate data tems. 100 ueres were generated, each wth 4 data tems. In ths case, each uery wll get executed wth sub-ueres of data tems each. Combnaton of sub-ueres wll be decded based on correlaton between data tems (sumdff values of all the data tems were the same). Table 1 comares the Table1: Effect of correlaton on number of refreshes Incoherency Bound Avg. number of msgs when s consdered Avg. number of msgs when s assumed to be results when cosne smlarty s taen nto account and when cosne smlarty s assumed to be 0 for all data tem ars. It can be seen that by consderng cosne smlarty number of refreshes reduce by aroxmately 1%. Ths result ndcates that for subuery selecton data dynamcs may be more mortant factor than the cosne smlarty between the data tems. Sesson: Scalable Systems for Dynamc Content 5..3 Effect of uery satsfablty arameter To smulate the stuaton where selected aggregators may not be able to satsfy the uery ncoherency bounds, we modfed the smulaton set u used n Secton 5.1 to set the mnmum data ncoherency bounds whch DAs can satsfy to be between.015 and Value of was vared between 0-0. The case =0 corresonds to the algorthm wthout dealng wth the uery satsfablty. Fgure 7 shows uery executon cost and number of unanswerable ueres as the value of s vared. As shown n the fgure as the value of s ncreased, ercentage of the unsatsfed ueres decreased for varous values of uery ncoherency bounds. Due to changed data ncoherency bounds of DAs, we found that 0% of ueres can not be satsfed even by the data aggregators wth tghtest data ncoherency bounds. Thus, whle resentng the results, we remove those ueres. At the uery ncoherency bound of $0.8, 40% are ueres can not be satsfed by the otmally Fgure 7: Effect of on uery satsfablty selected data aggregators but as we ncrease the value of to 10, only 3% ueres are unanswered. Such a value can be chosen to balance the erformance and satsfablty of ueres. For examle, a dynamc CDN may am at uery satsfablty of 95% for a gven dstrbuton of uery ncoherency bounds. If at any tme uery satsfablty s below the target, value of can be ncreased whereas n case of over achevng the target, the value of can be decreased to mrove the uery erformance Summary of erformance results Followng features of the uery lannng algorthm mrove erformance: Dvdng the uery nto sub-ueres and executng them at secfcally chosen data aggregators. Decdng the uery lan usng data sumdff based mechansm secfcally by maxmzng sub-uery gans. Includng more dynamc data as art of a larger sub-uery. We also showed that the max-gan algorthm s very close to the otmal algorthm n selectng sub-ueres based on data dynamcs. 6. RELATED WORK Varous mechansms for effcently mantanng ncoherency bounded aggregaton ueres over contnuously changng data tems are roosed n the lterature [11, 1, 16]. Our wor dstngushes tself by beng sub-uery based evaluaton to mnmze number of refreshes. In [11], authors roose usng data flters at the sources; nstead we assgn ncoherency bounds to sub-ueres whch reduce the number of refreshes for uery evaluaton, as exlaned n Secton 39

10 WWW 007 / Trac: Performance and Scalablty 5. Further, we roose that more dynamc data tems should be executed as art of larger sub-uery. In [], authors resent technue of reorganzng a data dssemnaton networ when clent reurements change. Instead, we try to answer the clent uery usng the exstng networ. Reorganzng aggregators s a longer term actvty whereas uery lannng can be done for short as well as long runnng ueres on more dynamc bass. Pull based data dssemnaton technues, where clents or data aggregators ull data tems such that uery reurements are met, are descrbed n [9,16]. For mnmzng the number of ulls, both model the ndvdual data tems and redct data values. In comarson, we consder the stuaton where dfferent sub-ueres, nvolvng multle data tems, can be evaluated at dfferent nodes. Further, ncoherency bound s aled over the sub-uery rather than to ndvdual data tems, leadng to effcent evaluaton of the uery. Satal and temoral correlatons between sensor data are used to reduce data refresh nstances n [17, 18]. We also consder correlaton n terms of cosne smlarty between data tems, but we use t for dvdng clent uery nto sub-ueres. Our wor can be extended by usng temoral and satal roertes of data tems for redctng ther correlaton measures. A method of assgnng clents data ueres to aggregators n a content dstrbuton networ s gven n [1]. We do for clent ueres consstng of multle data tems what [1] does for clent reurng ndvdual data tems. 7. CONCLUSIONS Ths aer resents a cost based aroach to mnmze the number of refreshes reured to execute an ncoherency bounded contnuous uery. For otmal executon we dvde the uery nto sub-ueres and evaluate each sub-uery at a chosen aggregator. Performance results show that by our method the uery can be executed usng less than one thrd the messages reured for exstng schemes. Further we showed that by executng ueres such that more dynamc data tems are art of a larger sub-uery we can mrove erformance. Our method of uery executon can be mlemented usng schemes smlar to that used n CDNs. Our uery cost model can also be used for other uroses such as load balancng varous aggregators, otmal uery executon lan at an aggregator node, etc. Usng the cost model for other alcatons and develong the cost model for more comlex ueres s our future wor. Acnowledgement: We would le to than Venatesan Charavarthy and Vnaya Pandt for helful dscusson on varous algorthms. 8. REFERENCES [1] A. Davs, J. Parh and W. Wehl. Edge Comutng: Extendng Enterrse Alcatons to the Edge of the Internet. WWW 004 [] D. VanderMeer, A. Datta, K. Dutta, H. Thomas and K. Ramamrtham. Proxy-Based Acceleraton of Dynamcally Generated Content on the World Wde Web. ACM Transactons on Database Systems (TODS) Vol. 9, June 004. [3] J. Dlley, B. Maggs, J. Parh, H. Proo, R. Staraman and B. Wehl. Globally Dstrbuted Content Delvery, IEEE Internet Comutng Set 00. Sesson: Scalable Systems for Dynamc Content [4] S. Rangarajan, S. Muerjee and P. Rodrguez. User Secfc Reuest Redrecton n a Content Delvery Networ, 8 th Intl. Worsho on Web Content Cachng and Dstrbuton (IWCW), 003. [5] S. Shah, K. Ramamrtham, and P. Shenoy. Mantanng Coherency of Dynamc Data n Cooeratng Reostores. VLDB 00. [6] Dynama: Cachng Technology for Dynmac Content Revealed. [7] D. S. Hochbaum. Aroxmaton algorthms for the set coverng and vertex cover roblems. SIAM Journal on Comutng, vol. 11 (3), 198. [8] Zongmng Fe. A Novel Aroach to Managng Consstency n Content Dstrbuton. WCW 001 [9] R. Guta, A. Pur, and K. Ramamrtham. Executng Incoherency Bounded Contnuous Queres at Web Data Aggregators. WWW 005. [10] Otmzed Executon of Contnuous Queres, APS 006, [11] C. Olston, J. Jang, and J. Wdom. Adatve Flter for Contnuous Queres over Dstrbuted Data Streams. SIGMOD 003. [1] S. Shah, K. Ramamrtham, and C. Ravshanar. Clent Assgnment n Content Dssemnaton Networs for Dynamc Data. VLDB 005. [13] NEFSC Scentfc Comuter System htt://sole.wh.who.edu/~jmannng//cruse/serve1.cg [14] Query cost model valdaton for sensor data. [15] D. S. Hochbaum. Aroxmaton algorthms for the set coverng and vertex cover roblems. SIAM Journal on Comutng, vol. 11 (3), 198. [16] S. Zhu and C. Ravshanar. Stochastc Consstency and Scalable Pull-Based Cachng for Erratc Data Sources. VLDB 004. [17] D. Chu, A. Deshande, J. Hellersten, W. Hong. Aroxmate Data Collecton n Sensor Networs usng Probablstc Models. ICDE 006. [18] A. Deshande, C. Guestrn, S. R. Madden, J. M. Hellersten, and W. Hong. Model-Drven Data Acuston n Sensor Networs. VLDB, 004. [19] Pearson Product moment correlaton coeffcent. htt:// / [0] Lam, W. and Ho, C.Y. Usng a Generalzed Instance Set for Automatc Text Categorzaton. SIGIR, [1] G. Cormode and M. Garofalas. Setchng Streams through the Net: Dstrbuted Aroxmate Query Tracng. VLDB 005. [] S. Agrawal, K. Ramamrtham and S. Shah. Constructon of a Temoral Coherency Preservng Dynamc Data Dssemnaton Networ. RTSS

IMRT workflow. Optimization and Inverse planning. Intensity distribution IMRT IMRT. Dose optimization for IMRT. Bram van Asselen

IMRT workflow. Optimization and Inverse planning. Intensity distribution IMRT IMRT. Dose optimization for IMRT. Bram van Asselen IMRT workflow Otmzaton and Inverse lannng 69 Gy Bram van Asselen IMRT Intensty dstrbuton Webb 003: IMRT s the delvery of radaton to the atent va felds that have non-unform radaton fluence Purose: Fnd a

More information

Application of Genetic Algorithms in Graph Theory and Optimization. Qiaoyan Yang, Qinghong Zeng

Application of Genetic Algorithms in Graph Theory and Optimization. Qiaoyan Yang, Qinghong Zeng 3rd Internatonal Conference on Materals Engneerng, Manufacturng Technology and Control (ICMEMTC 206) Alcaton of Genetc Algorthms n Grah Theory and Otmzaton Qaoyan Yang, Qnghong Zeng College of Mathematcs,

More information

On the Two-level Hybrid Clustering Algorithm

On the Two-level Hybrid Clustering Algorithm On the Two-level Clusterng Algorthm ng Yeow Cheu, Chee Keong Kwoh, Zongln Zhou Bonformatcs Research Centre, School of Comuter ngneerng, Nanyang Technologcal Unversty, Sngaore 639798 ezlzhou@ntu.edu.sg

More information

Available online at ScienceDirect. Procedia Computer Science 94 (2016 )

Available online at  ScienceDirect. Procedia Computer Science 94 (2016 ) Avalable onlne at www.scencedrect.com ScenceDrect Proceda Comuter Scence 94 (2016 ) 176 182 The 13th Internatonal Conference on Moble Systems and Pervasve Comutng (MobSPC 2016) An Effcent QoS-aware Web

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

THE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY

THE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY Proceedngs of the 20 Internatonal Conference on Machne Learnng and Cybernetcs, Guln, 0-3 July, 20 THE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY JUN-HAI ZHAI, NA LI, MENG-YAO

More information

A Scheduling Algorithm of Periodic Messages for Hard Real-time Communications on a Switched Ethernet

A Scheduling Algorithm of Periodic Messages for Hard Real-time Communications on a Switched Ethernet IJCSNS Internatonal Journal of Comuter Scence and Networ Securty VOL.6 No.5B May 26 A Schedulng Algorthm of Perodc Messages for Hard eal-tme Communcatons on a Swtched Ethernet Hee Chan Lee and Myung Kyun

More information

Region Segmentation Readings: Chapter 10: 10.1 Additional Materials Provided

Region Segmentation Readings: Chapter 10: 10.1 Additional Materials Provided Regon Segmentaton Readngs: hater 10: 10.1 Addtonal Materals Provded K-means lusterng tet EM lusterng aer Grah Parttonng tet Mean-Shft lusterng aer 1 Image Segmentaton Image segmentaton s the oeraton of

More information

Load-Balanced Anycast Routing

Load-Balanced Anycast Routing Load-Balanced Anycast Routng Chng-Yu Ln, Jung-Hua Lo, and Sy-Yen Kuo Department of Electrcal Engneerng atonal Tawan Unversty, Tape, Tawan sykuo@cc.ee.ntu.edu.tw Abstract For fault-tolerance and load-balance

More information

ACCURATE BIT ALLOCATION AND RATE CONTROL FOR DCT DOMAIN VIDEO TRANSCODING

ACCURATE BIT ALLOCATION AND RATE CONTROL FOR DCT DOMAIN VIDEO TRANSCODING ACCUATE BIT ALLOCATION AND ATE CONTOL FO DCT DOMAIN VIDEO TANSCODING Zhjun Le, Ncolas D. Georganas Multmeda Communcatons esearch Laboratory Unversty of Ottawa, Ottawa, Canada {lezj, georganas}@ mcrlab.uottawa.ca

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

Resource and Virtual Function Status Monitoring in Network Function Virtualization Environment

Resource and Virtual Function Status Monitoring in Network Function Virtualization Environment Journal of Physcs: Conference Seres PAPER OPEN ACCESS Resource and Vrtual Functon Status Montorng n Network Functon Vrtualzaton Envronment To cte ths artcle: MS Ha et al 2018 J. Phys.: Conf. Ser. 1087

More information

A new Algorithm for Lossless Compression applied to two-dimensional Static Images

A new Algorithm for Lossless Compression applied to two-dimensional Static Images A new Algorthm for Lossless Comresson aled to two-dmensonal Statc Images JUAN IGNACIO LARRAURI Deartment of Technology Industral Unversty of Deusto Avda. Unversdades, 4. 48007 Blbao SPAIN larrau@deusto.es

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS46: Mnng Massve Datasets Jure Leskovec, Stanford Unversty http://cs46.stanford.edu /19/013 Jure Leskovec, Stanford CS46: Mnng Massve Datasets, http://cs46.stanford.edu Perceptron: y = sgn( x Ho to fnd

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming CS 4/560 Desgn and Analyss of Algorthms Kent State Unversty Dept. of Math & Computer Scence LECT-6 Dynamc Programmng 2 Dynamc Programmng Dynamc Programmng, lke the dvde-and-conquer method, solves problems

More information

5 The Primal-Dual Method

5 The Primal-Dual Method 5 The Prmal-Dual Method Orgnally desgned as a method for solvng lnear programs, where t reduces weghted optmzaton problems to smpler combnatoral ones, the prmal-dual method (PDM) has receved much attenton

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION 24 CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION The present chapter proposes an IPSO approach for multprocessor task schedulng problem wth two classfcatons, namely, statc ndependent tasks and

More information

TIME-EFFICIENT NURBS CURVE EVALUATION ALGORITHMS

TIME-EFFICIENT NURBS CURVE EVALUATION ALGORITHMS TIME-EFFICIENT NURBS CURVE EVALUATION ALGORITHMS Kestuts Jankauskas Kaunas Unversty of Technology, Deartment of Multmeda Engneerng, Studentu st. 5, LT-5368 Kaunas, Lthuana, kestuts.jankauskas@ktu.lt Abstract:

More information

Image Segmentation. Image Segmentation

Image Segmentation. Image Segmentation Image Segmentaton REGION ORIENTED SEGMENTATION Let R reresent the entre mage regon. Segmentaton may be vewed as a rocess that arttons R nto n subregons, R, R,, Rn,such that n= R = R.e., the every xel must

More information

Lecture Note 08 EECS 4101/5101 Instructor: Andy Mirzaian. All Nearest Neighbors: The Lifting Method

Lecture Note 08 EECS 4101/5101 Instructor: Andy Mirzaian. All Nearest Neighbors: The Lifting Method Lecture Note 08 EECS 4101/5101 Instructor: Andy Mrzaan Introducton All Nearest Neghbors: The Lftng Method Suose we are gven aset P ={ 1, 2,..., n }of n onts n the lane. The gven coordnates of the -th ont

More information

MODULE - 9 LECTURE NOTES 1 FUZZY OPTIMIZATION

MODULE - 9 LECTURE NOTES 1 FUZZY OPTIMIZATION Water Resources Systems Plannng an Management: vance Tocs Fuzzy Otmzaton MODULE - 9 LECTURE NOTES FUZZY OPTIMIZTION INTRODUCTION The moels scusse so far are crs an recse n nature. The term crs means chotonomous.e.,

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 35, NO. 3, MARCH

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 35, NO. 3, MARCH IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 35, NO. 3, MARCH 07 7 A Game Theoretc Aroach for Dstrbuted Resource Allocaton and Orchestraton of Softwarzed Networks Salvatore D Oro, Student Member,

More information

Outline. Seamless Image Stitching in the Gradient Domain. Related Approaches. Image Stitching. Introduction Related Work

Outline. Seamless Image Stitching in the Gradient Domain. Related Approaches. Image Stitching. Introduction Related Work Outlne Seamless Image Sttchng n the Gradent Doman Anat Levn, Assaf Zomet, Shmuel Peleg and Yar Wess ECCV 004 Presenter: Pn Wu Oct 007 Introducton Related Work GIST: Gradent-doman Image Sttchng GIST GIST

More information

Consistency and Update in Mobile Overlay Networks. Zhou SU, Jiro KATTO, Yasuhiko YASUDA, and Yu CHEN, Members, IEEE

Consistency and Update in Mobile Overlay Networks. Zhou SU, Jiro KATTO, Yasuhiko YASUDA, and Yu CHEN, Members, IEEE 1 Consstency and Update n Moble Overlay Networs Zhou SU, Jro KATTO, Yasuho YASUDA, and Yu CHEN, Members, IEEE Contact Emal: zhousu@asag.waseda.p Abstract Wth the development whereby moble content dstrbuton

More information

Support Strong Consistency for Mobile Dynamic Contents Delivery Network

Support Strong Consistency for Mobile Dynamic Contents Delivery Network Nnth IEEE Internatonal Symposum on Multmeda 27 Support Strong Consstency for Moble Dynamc Contents Delvery Networ Zhou Su, Jro Katto and Yasuho Yasuda Graduate School of Scence and Engneerng, Waseda Unversty,

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,

More information

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1) Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A

More information

Recognition of Identifiers from Shipping Container Images Using Fuzzy Binarization and Enhanced Fuzzy Neural Network

Recognition of Identifiers from Shipping Container Images Using Fuzzy Binarization and Enhanced Fuzzy Neural Network Recognton of Identfers from Shng Contaner Images Usng uzzy Bnarzaton and Enhanced uzzy Neural Networ Kwang-Bae Km Det. of Comuter Engneerng, Slla Unversty, Korea gbm@slla.ac.r Abstract. In ths aer, we

More information

Bayesian Networks: Independencies and Inference. What Independencies does a Bayes Net Model?

Bayesian Networks: Independencies and Inference. What Independencies does a Bayes Net Model? Bayesan Networks: Indeendences and Inference Scott Daves and Andrew Moore Note to other teachers and users of these sldes. Andrew and Scott would be delghted f you found ths source materal useful n gvng

More information

QoS-Based Service Provision Schemes and Plan Durability in Service Composition

QoS-Based Service Provision Schemes and Plan Durability in Service Composition QoS-Based Servce Provson Schemes and Plan Durablty n Servce Comoston Koramt Pchanaharee and Twtte Senvongse Deartment of Comuter Engneerng, Faculty of Engneerng, Chulalongkorn Unversty Phyatha Road, Pathumwan,

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

11. APPROXIMATION ALGORITHMS

11. APPROXIMATION ALGORITHMS Copng wth NP-completeness 11. APPROXIMATION ALGORITHMS load balancng center selecton prcng method: vertex cover LP roundng: vertex cover generalzed load balancng knapsack problem Q. Suppose I need to solve

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

I 2 C: A Holistic Approach to Prolong the Sensor Network Lifetime

I 2 C: A Holistic Approach to Prolong the Sensor Network Lifetime : A Holstc Aroach to Prolong the Sensor Network Lfetme Yang Peng, Z L, Da Qao, and Wensheng Zhang Iowa State Unversty, Ames, IA 511 {yangeng, zl, da, wzhang}@astate.edu Abstract We resent a novel holstc

More information

A NEW HEURISTIC APPROACH FOR DEMAND RESPONSIVE TRANSPORTATION SYSTEMS

A NEW HEURISTIC APPROACH FOR DEMAND RESPONSIVE TRANSPORTATION SYSTEMS A NEW HEURISTIC APPROACH FOR DEMAND RESPONSIVE TRANSPORTATION SYSTEMS Ru Jorge Gomes Faculdade de Engenhara da Unversdade do Porto Rua Dr. Roberto Fras, 4200-465 Porto, Portugal rjgomes@fe.u.t Jorge Pnho

More information

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network A New Token Allocaton Algorthm for TCP Traffc n Dffserv Network A New Token Allocaton Algorthm for TCP Traffc n Dffserv Network S. Sudha and N. Ammasagounden Natonal Insttute of Technology, Truchrappall,

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

A note on Schema Equivalence

A note on Schema Equivalence note on Schema Equvalence.H.M. ter Hofstede and H.. Proer and Th.P. van der Wede E.Proer@acm.org PUBLISHED S:.H.M. ter Hofstede, H.. Proer, and Th.P. van der Wede. Note on Schema Equvalence. Techncal Reort

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

GSLM Operations Research II Fall 13/14

GSLM Operations Research II Fall 13/14 GSLM 58 Operatons Research II Fall /4 6. Separable Programmng Consder a general NLP mn f(x) s.t. g j (x) b j j =. m. Defnton 6.. The NLP s a separable program f ts objectve functon and all constrants are

More information

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches Proceedngs of the Internatonal Conference on Cognton and Recognton Fuzzy Flterng Algorthms for Image Processng: Performance Evaluaton of Varous Approaches Rajoo Pandey and Umesh Ghanekar Department of

More information

Fuzzy Multi Objective Transportation Model. Based on New Ranking Index on. Generalized LR Fuzzy Numbers

Fuzzy Multi Objective Transportation Model. Based on New Ranking Index on. Generalized LR Fuzzy Numbers Aled Mathematcal Scences, Vol 8, 4, no 8, 6849-6879 HIKARI Ltd, wwwm-hkarcom htt://dxdoorg/988/ams4486 Fuzzy Mult Obectve Transortaton Model Based on New Rankng Index on Generalzed LR Fuzzy Numbers Y L

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Efficient Caching of Video Content to an Architecture of Proxies according to a Frequency-Based Cache Management Policy

Efficient Caching of Video Content to an Architecture of Proxies according to a Frequency-Based Cache Management Policy Effcent Cachng of Vdeo Content to an Archtecture of Proxes accordng to a Frequency-Based Cache Management Polcy Anna Satsou, Mchael Pateraks Laboratory of Informaton and Comuter Networks Deartment of Electronc

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

IX Price and Volume Measures: Specific QNA-ANA Issues

IX Price and Volume Measures: Specific QNA-ANA Issues IX Prce and Volume Measures: Secfc QNA-ANA Issues A. Introducton 9.. Ths chater addresses a selected set of ssues for constructng tme seres of rce and volume measures that are of secfc mortance for the

More information

Broadcast Time Synchronization Algorithm for Wireless Sensor Networks Chaonong Xu 1)2)3), Lei Zhao 1)2), Yongjun Xu 1)2) and Xiaowei Li 1)2)

Broadcast Time Synchronization Algorithm for Wireless Sensor Networks Chaonong Xu 1)2)3), Lei Zhao 1)2), Yongjun Xu 1)2) and Xiaowei Li 1)2) Broadcast Tme Synchronzaton Algorthm for Wreless Sensor Networs Chaonong Xu )2)3), Le Zhao )2), Yongun Xu )2) and Xaowe L )2) ) Key Laboratory of Comuter Archtecture, Insttute of Comutng Technology Chnese

More information

Human Action Recognition Using Discriminative Models in the Learned Hierarchical Manifold Space

Human Action Recognition Using Discriminative Models in the Learned Hierarchical Manifold Space Human Acton Recognton Usng Dscrmnatve Models n the Learned Herarchcal Manfold Sace Le Han, We Lang*, nxao Wu and Yunde Ja School of Comuter Scence and Technology, Beng Insttute of Technology 5 South Zhongguancun

More information

Environmental Modelling & Software

Environmental Modelling & Software Envronmental Modellng & Software 90 (2017) 68e77 Contents lsts avalable at ScenceDrect Envronmental Modellng & Software ournal homeage: www.elsever.com/locate/envsoft Addng satal flexblty to source-recetor

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

A MULTILEVEL DECISION MAKING MODEL FOR THE SUPPLIER SELECTION PROBLEM IN A FUZZY SITUATION

A MULTILEVEL DECISION MAKING MODEL FOR THE SUPPLIER SELECTION PROBLEM IN A FUZZY SITUATION OPERATIONS RESEARCH AND DECISIONS No. 4 2017 DOI: 10.5277/ord170401 Ahmad Yusuf ADHAMI 1 Syed Mohd MUNEEB 1 Mohammad Asm NOMANI 2 A MULTILEVEL DECISION MAKING MODEL FOR THE SUPPLIER SELECTION PROBLEM IN

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

An efficient iterative source routing algorithm

An efficient iterative source routing algorithm An effcent teratve source routng algorthm Gang Cheng Ye Tan Nrwan Ansar Advanced Networng Lab Department of Electrcal Computer Engneerng New Jersey Insttute of Technology Newar NJ 7 {gc yt Ansar}@ntedu

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

Decision Strategies for Rating Objects in Knowledge-Shared Research Networks

Decision Strategies for Rating Objects in Knowledge-Shared Research Networks Decson Strateges for Ratng Objects n Knowledge-Shared Research etwors ALEXADRA GRACHAROVA *, HAS-JOACHM ER **, HASSA OUR ELD ** OM SUUROE ***, HARR ARAKSE *** * nsttute of Control and System Research,

More information

Index Terms-Software effort estimation, principle component analysis, datasets, neural networks, and radial basis functions.

Index Terms-Software effort estimation, principle component analysis, datasets, neural networks, and radial basis functions. ISO 9001:2008 Certfed Internatonal Journal of Engneerng and Innovatve Technology (IJEIT The Effect of Dmensonalty Reducton on the Performance of Software Cost Estmaton Models Ryadh A.K. Mehd College of

More information

Needed Information to do Allocation

Needed Information to do Allocation Complexty n the Database Allocaton Desgn Must tae relatonshp between fragments nto account Cost of ntegrty enforcements Constrants on response-tme, storage, and processng capablty Needed Informaton to

More information

Greedy Technique - Definition

Greedy Technique - Definition Greedy Technque Greedy Technque - Defnton The greedy method s a general algorthm desgn paradgm, bult on the follong elements: confguratons: dfferent choces, collectons, or values to fnd objectve functon:

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

UB at GeoCLEF Department of Geography Abstract

UB at GeoCLEF Department of Geography   Abstract UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

Gentle Early Detection Algorithm

Gentle Early Detection Algorithm Gentle Early etecton Algorthm Hussen Abdel-jaber 1, Mahmoud Abdeljaber 2, Hussen Abu Mansour 1, Malak El-Amr 1 1 Faculty of Comuter Studes, eartment of Informaton Technology and Comutng, Arab Oen Unversty,

More information

Overview. CSC 2400: Computer Systems. Pointers in C. Pointers - Variables that hold memory addresses - Using pointers to do call-by-reference in C

Overview. CSC 2400: Computer Systems. Pointers in C. Pointers - Variables that hold memory addresses - Using pointers to do call-by-reference in C CSC 2400: Comuter Systems Ponters n C Overvew Ponters - Varables that hold memory addresses - Usng onters to do call-by-reference n C Ponters vs. Arrays - Array names are constant onters Ponters and Strngs

More information

Semi-Supervised Biased Maximum Margin Analysis for Interactive Image Retrieval

Semi-Supervised Biased Maximum Margin Analysis for Interactive Image Retrieval IP-06850-00.R3 Sem-Suervsed Based Maxmum Margn Analyss for Interactve Image Retreval Lnng Zhang,, Student Member, IEEE, Lo Wang, Senor Member, IEEE and Wes Ln 3, Senor Member, IEEE School of Electrcal

More information

A Base Station-Coordinated Contention Resolution for IEEE PMP Networks

A Base Station-Coordinated Contention Resolution for IEEE PMP Networks A Base Staton-Coordnated Contenton Resoluton for IEEE 802.6 PMP Networks Wenyan Lu, Weja Ja,2, Wenfeng Du, and Ldong Ln 2 School of Informaton Scence & Engneerng, Central South Unversty, Changsha, 40083,

More information

Ontology based data warehouses federation management system

Ontology based data warehouses federation management system Ontolog based data warehouses federaton management sstem Naoual MOUHNI 1, Abderrafaa EL KALAY 2 1 Deartment of mathematcs and comuter scences, Unverst Cad Aad, Facult of scences and technologes Marrakesh,

More information

Introduction. 1. Mathematical formulation. 1.1 Standard formulation

Introduction. 1. Mathematical formulation. 1.1 Standard formulation Comact mathematcal formulaton for Grah Parttonng Marc BOULLE France Telecom R&D 2, Avenue Perre Marzn 22300 Lannon France marc.boulle@francetelecom.com Abstract. The grah arttonng roblem conssts of dvdng

More information

Keywords: Cloud Computing, Resource Allocation, Queueing Theory, Performance Measures.

Keywords: Cloud Computing, Resource Allocation, Queueing Theory, Performance Measures. Volume, Issue 9, Setember ISSN: 77 8X Internatonal Journal of Advanced Research n Comuter Scence and Software Engneerng Research Paer Avalable onlne at: wwwjarcssecom Resource Allocaton n Cloud Comutng

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Advances. Photometric Stereo

Advances. Photometric Stereo Advances n Photometrc tereo uggero Pntus Tutor Prof. assmo Vanz Contents ) ntroducton g.3 ) mage formaton and reflectance ma g.5 3) Changng from two to four lghts g. 4) Basc theory gradent extracton g.

More information

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007 Syntheszer 1.0 A Varyng Coeffcent Meta Meta-Analytc nalytc Tool Employng Mcrosoft Excel 007.38.17.5 User s Gude Z. Krzan 009 Table of Contents 1. Introducton and Acknowledgments 3. Operatonal Functons

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

A Clustering Algorithm for Chinese Adjectives and Nouns 1

A Clustering Algorithm for Chinese Adjectives and Nouns 1 Clusterng lgorthm for Chnese dectves and ouns Yang Wen, Chunfa Yuan, Changnng Huang 2 State Key aboratory of Intellgent Technology and System Deptartment of Computer Scence & Technology, Tsnghua Unversty,

More information

NOVEL APPROACH FOR MOVING HUMAN DETECTION AND TRACKING IN STATIC CAMERA VIDEO SEQUENCES

NOVEL APPROACH FOR MOVING HUMAN DETECTION AND TRACKING IN STATIC CAMERA VIDEO SEQUENCES THE PUBLIHING HOUE PROCEEDING OF THE ROMANIAN ACADEMY, eres A, OF THE ROMANIAN ACADEMY Volume 3, Number 3/202,. 269 277 NOVEL APPROACH FOR MOVING HUMAN DETECTION AND TRACKING IN TATIC CAMERA VIDEO EQUENCE

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

Skew Estimation in Document Images Based on an Energy Minimization Framework

Skew Estimation in Document Images Based on an Energy Minimization Framework Skew Estmaton n Document Images Based on an Energy Mnmzaton Framework Youbao Tang 1, Xangqan u 1, e Bu 2, and Hongyang ang 3 1 School of Comuter Scence and Technology, Harbn Insttute of Technology, Harbn,

More information

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated. Some Advanced SP Tools 1. umulatve Sum ontrol (usum) hart For the data shown n Table 9-1, the x chart can be generated. However, the shft taken place at sample #21 s not apparent. 92 For ths set samples,

More information

TPL-Aware Displacement-driven Detailed Placement Refinement with Coloring Constraints

TPL-Aware Displacement-driven Detailed Placement Refinement with Coloring Constraints TPL-ware Dsplacement-drven Detaled Placement Refnement wth Colorng Constrants Tao Ln Iowa State Unversty tln@astate.edu Chrs Chu Iowa State Unversty cnchu@astate.edu BSTRCT To mnmze the effect of process

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

Quantifying Responsiveness of TCP Aggregates by Using Direct Sequence Spread Spectrum CDMA and Its Application in Congestion Control

Quantifying Responsiveness of TCP Aggregates by Using Direct Sequence Spread Spectrum CDMA and Its Application in Congestion Control Quantfyng Responsveness of TCP Aggregates by Usng Drect Sequence Spread Spectrum CDMA and Its Applcaton n Congeston Control Mehd Kalantar Department of Electrcal and Computer Engneerng Unversty of Maryland,

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Ramsey numbers of cubes versus cliques

Ramsey numbers of cubes versus cliques Ramsey numbers of cubes versus clques Davd Conlon Jacob Fox Choongbum Lee Benny Sudakov Abstract The cube graph Q n s the skeleton of the n-dmensonal cube. It s an n-regular graph on 2 n vertces. The Ramsey

More information

CHAPTER 2 DECOMPOSITION OF GRAPHS

CHAPTER 2 DECOMPOSITION OF GRAPHS CHAPTER DECOMPOSITION OF GRAPHS. INTRODUCTION A graph H s called a Supersubdvson of a graph G f H s obtaned from G by replacng every edge uv of G by a bpartte graph,m (m may vary for each edge by dentfyng

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Advanced LEACH: A Static Clustering-based Heteroneous Routing Protocol for WSNs

Advanced LEACH: A Static Clustering-based Heteroneous Routing Protocol for WSNs Advanced LEACH: A Statc Clusterng-based Heteroneous Routng Protocol for WSNs A. Iqbal 1, M. Akbar 1, N. Javad 1, S. H. Bouk 1, M. Ilah 1, R. D. Khan 2 1 COMSATS Insttute of Informaton Technology, Islamabad,

More information

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. [Type text] [Type text] [Type text] ISSN : 0974-74 Volume 0 Issue BoTechnology 04 An Indan Journal FULL PAPER BTAIJ 0() 04 [684-689] Revew on Chna s sports ndustry fnancng market based on market -orented

More information

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law)

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law) Machne Learnng Support Vector Machnes (contans materal adapted from talks by Constantn F. Alfers & Ioanns Tsamardnos, and Martn Law) Bryan Pardo, Machne Learnng: EECS 349 Fall 2014 Support Vector Machnes

More information

Multilayer Neural Networks and Nearest Neighbor Classifier Performances for Image Annotation

Multilayer Neural Networks and Nearest Neighbor Classifier Performances for Image Annotation (IJACSA) Internatonal Journal of Advanced Comuter Scence and Alcatons, Vol. 3, No. 11, 01 Multlayer Neural Networs and Nearest Neghbor Classfer erformances for Image Annotaton Mustaha OUJAOURA Laboratory

More information