Efficiently Supporting Multiple Similarity Queries for Mining in Metric Databases

Size: px
Start display at page:

Download "Efficiently Supporting Multiple Similarity Queries for Mining in Metric Databases"

Transcription

1 Efficiently Supporting Multiple Siilarity Queries for Mining in Metric Databases Bernhard Braunüller, Martin Ester, Hans-Peter Kriegel, Jörg Sander Institute for Coputer Science, University of Munich Oettingenstr. 67, D München, Gerany eail: {braunue ester kriegel Abstract Metric databases are databases where a etric distance function is defined for pairs of database objects. In such databases, siilarity queries in the for of range queries or k-nearest neighbor queries are the ost iportant queries. In traditional query processing, single queries are issued independently by different users. In any data ining applications, however, the database is typically explored by iteratively asking siilarity queries for answers of previous siilarity queries. In this paper, we introduce a generic schee for such data ining algoriths and we develop a ethod to transfor such algoriths in a way that they can use ultiple siilarity queries, i.e. sets of queries issued siultaneously. We investigate two orthogonal approaches, reducing I/O cost as well as CPU cost, to speed-up the processing of ultiple siilarity queries. The proposed techniques apply to any type of siilarity query and to an ipleentation based on an index or using a sequential scan. Parallelization yields an additional ipressive speed-up. An extensive perforance evaluation confirs the efficiency of our approach and we conclude that ultiple siilarity queries should be provided as a basic DBMS operation in order to support any data ining applications in etric databases. 1 Introduction Metric databases are databases where a etric distance function is defined for pairs of database objects. A proinent special case are databases of objects fro a vector space, that is objects with nueric attributes. For exaple, ultiedia objects [Fal+ 94] are typically represented by a large nuber of nueric features such as shape descriptors or color histogras. In any scientific applications, e.g. in astronoy [Hog 97], autoatic facilities easure a large nuber of nueric values for each database object such as the aplitude eitted in soe frequency band. On the other hand, in a database onitoring WWW accesses, the objects ay odel URLs and these objects are not fro a vector space but a etric distance function can be supplied. Siilarity between database objects is expressed by the distance function such that a low distance corresponds to a high degree of siilarity whereas two objects with a large distance are considered to be rather dissiilar. Siilarity queries [WSB 98], e.g. range queries or k-nearest neighbor queries, are the ost iportant queries in etric databases. Such queries play a ajor role in applications such as ultiedia systes, decision support systes and data ining. A lot of research on analyzing large databases - anually and autoatically - has been conducted. Data exploration is the process of anually exploring a database [KLTW 96]. A user starts at a given database object and fro there he or she interactively navigates through the database, for exaple by iteratively retrieving all siilar objects. That is, the answers of previous queries ay be used as query objects for new siilarity queries. Knowledge discovery in databases (KDD) has been defined as the non-trivial process of discovering valid, novel, potentially useful, and ultiately understandable patterns fro data [FPS 96]. The core step of the KDD process 1

2 is the step of data ining, i.e. the application of appropriate algoriths that autoatically produce a particular enueration of patterns over the data. For exaple, a density-based clustering algorith such as DBSCAN [EKSX 96] starts fro soe object and repeatedly retrieves the neighborhood of objects which have been retrieved by previous queries as long as the density in this neighborhood is large enough. In traditional query processing, single queries are issued independently by different users. In anual data exploration as well as in autoatic data ining, however, any siilarity queries ust be answered in a single application. We define ultiple queries as sets of queries issued siultaneously. Clearly, ultiple queries provide uch ore potential for query optiization than single queries. In this paper, we investigate two orthogonal approaches to speed-up the processing of ultiple siilarity queries in etric databases: reduce I/O cost (that is, the nuber of disk accesses) and reduce CPU cost (that is, the nuber of distance calculations). Furtherore, we explore the potential of parallelization. The proposed techniques can be cobined in order to obtain a axiu perforance of processing ultiple siilarity queries. The rest of this paper is organized as follows. In section 2, we briefly review the standard ethods of processing single siilarity queries and we introduce soe basic notions and algoriths. Section 3 introduces an algorithic schee typical for any data ining applications and discusses several instances of this schee. Furtherore, we develop a ethod to transfor such algoriths so that they issue sets of ultiple queries. The concepts of ultiple siilarity queries are introduced in ore detail in section 4. Section 5 presents several techniques for efficiently supporting ultiple siilarity queries. An extensive perforance evaluation of the proposed techniques on two real databases is presented in section 6. Section 7 suarizes the contributions of this paper and outlines soe issues for future research. 2 Processing Single Siilarity Queries Let the database objects be drawn fro a set of Objects and let dist be a etric distance function for pairs of objects, i.e. dist: Objects Objects R +, and dist satisfies the following conditions. O 1, O 2, O 3 Objects: 1) dist(o 1, O 2 ) = 0 O 1 = O 2. (identity) 2) dist(o 1, O 2 ) = dist(o 2, O 1 ). (syetry) 3) dist(o 1,O 3 ) dist(o 1,O 2 ) + dist(o 2,O 3 ). ( -inequality) Often, the Euclidean distance or a weighted Euclidean distance is used as the distance function but, depending on the application, other distance functions ay be ore appropriate. For instance, quadratic for distance functions were successfully applied for an iage database using color histogras as features [SK 97]. Definition 1: (siilarity query) Let DB Objects be a database and let Q Objects be a query object. Let T denote the type of the siilarity query and let si T : Objects Objects Boolean be a predicate defining the siilarity of pairs of objects wrt. to the type T. A siilarity query, denoted as DB.siilarity_query(Q,T), returns the following database objects: DB.siilarity_query( Q, T) = { O DB si T ( OQ, )}. The specification of the query type T consists of three coponents: - T.range: a real nuber specifying a axiu distance between Q and an answer. - T.cardinality: an integer nuber defining the axiu cardinality of the set of answers. 2

3 - T.kind: a string indicating how to cobine the range condition and the cardinality condition. The well-known types of siilarity queries are obtained by different specializations of the query type. Definition 2: (range query) A range query with respect to a database DB Objects and a query object Q Objects is a siilarity query with T.range = ε, T.cardinality = + and T.kind = range which returns the following subset of database objects: DB.siilarity_query( Q, T) = { O DB dist( O, Q) ε}. Definition 3: (k-nearest neighbor query) A k-nearest neighbor query with respect to a database DB Objects and a query object Q Objects is a siilarity query with T.range = +, T.cardinality = k and T.kind = k-nearest neighbor which returns a set NN Q () k DB that contains k objects fro the database and for which the following condition holds: P NN Q () k, O DB NN Q () k : dist( P, Q) dist( O, Q). Other types of siilarity queries have been proposed in the literature. For instance, we ay be interested in the k-nearest neighbors but only in those within a specified range. In order to speed-up siilarity query processing, any spatial index structures (for good surveys see [Sa 89], [GG 98]) have been developed which are applicable for the iportant special case where the database objects are fro a vector space. For instance, the R-tree [Gut 84] generalizes the one-diensional B-tree to d-diensional data spaces, that is an R-tree anages d-diensional hyper-rectangles instead of one-diensional nueric keys. The R-tree and its variants such as the R*-tree [BKSS 90] are efficient only for relatively sall nubers of diensions d. Recently, index structures have been designed which are also efficient for soe larger values of d. For instance, the X-tree [BKK 96] is siilar to an R*-tree but introduces the concept of supernodes, i.e. nodes of variable size in the directory of the tree. Directory nodes are erged into one supernode, i.e. directory nodes are not split, if there is a high probability that all parts of the node have to be searched anyway for ost queries. In [Kei 97] and [WSB 98], it is shown that under the assuption of uniforly distributed data, above a certain diensionality no index structure can process a nearest neighbor query efficiently. Thus, it is suggested to use the sequential scan which obtains at least the benefits of sequential rather than rando disk I/O. In the VA-file [WSB 98], clever bit encodings of the data are used to speed-up the scan. The above index structures are only applicable for vector spaces. The ore general case of etric databases, however, is also iportant in applications such as WWW access log databases. Then, the database objects ay be sessions grouping all log entries with identical IP address and user id within a given axiu tie gap [MJHS 96]. General etric databases require other types of index structures, the so called etric trees. In these etric trees the triangle inequality is used to prune the search tree while processing a siilarity query. Most of these structures are static in the sense that they do not allow dynaic insertions and deletions of objects. A recent paper [CPZ 97] has introduced a dynaic etric index structure, the M-tree, which is a balanced tree that can be anaged on secondary eory. The leaf nodes of an M-tree store all the database objects. Directory nodes store so-called routing objects and associated covering radii to guide the search operations. 3

4 Query processing using a sequential scan is straightforward: all objects ust be visited to answer a query. When using an index it ay be possible to exclude large proportions of the data fro a search. To answer a range query using a tree-based index, the set of approxiations (e.g. hyper-rectangles in case of an R-tree) intersecting the query region is deterined recursively starting fro the root. In a directory node, the entries intersecting the query region are deterined and then their referenced child nodes are searched until the data pages are reached. The procedure is ore sophisticated for a k-nearest neighbor query because we do not know the k-nearest neighbor distance in advance. The algorith proposed by [HS 95] has been proven in [BBKK 97] to iniize the nuber of pages read fro disk. This algorith processes the data pages in ascending order of distance fro the query point and does not load those pages with an approxiation further away than the k-nearest neighbor found so far. To conclude this section, we present an algorith for processing single siilarity queries. This algorith is applicable for any type of siilarity query and it can be ipleented either by using an index structure or by perforing a sequential scan. Figure 1 presents the algorith as a ethod of the class DB (database). It takes two arguents, a query object Q and a query type T, and returns a list of objects answering the siilarity query. DB::siilarity_query(object Q; type T) Answers := initialize_answer_list(); deterine_relevant_data_pages(q, T); QueryDist :=T.Range; while Self.unprocessed_pages() do NextPage := read_next_page_fro_disk(); for each object O in NextPage do Distance := dist(o,q); if Distance QueryDist then Answers.insert(O); // in ascending order of dist(a,q) if Answers.cardinality() > T.Cardinality() then Answers.reove_last_eleent(); QueryDist := adapt_query_dist(distance,querydist,t); Self.prune_pages(QueryDist); return Answers; Figure 1: Algorith siilarity_query We discuss the ost iportant details of this algorith. The ethod DB::deterine_relevant_data_pages(Q,T) is based on the algorith presented in [BBKK 97] and it constructs a sequence of the physical addresses of all data pages which ay contain answers of the siilarity query specified by Q and T. Note that the resulting sequence is anaged as a private attribute of the class DB which is read by the ethod DB.unprocessed_pages() and which is updated by DB.read_next_page_fro_disk(). If the ipleentation akes use of an index structure, then a subset of all data pages of DB ay be recognized as irrelevant and thus is not returned. Otherwise, if a sequential scan is perfored, all data pages of DB are relevant. In both cases, the relevant data pages are ordered according to their physical address such that the nuber of disk seeks is iniized. DB.adapt_query_dist(Distance,QueryDist,T) changes the QueryDist only in the case of a k-nearest neighbor query but not in the case of a range query. DB.prune_pages(QueryDist) reoves all eleents page fro the internal DB attribute of relevant data pages satisfying dist(page,q) > QueryDist. Clearly, this ethod perfors no operation if QueryDist has not been adapted. 4

5 3 Data Mining Using Multiple Siilarity Queries In any data ining applications the database is explored by iteratively considering the neighborhood of soe start objects. In this section, we introduce a generic schee for such data ining algoriths and discuss soe typical instances of the schee. Furtherore, we develop a ethod to transfor such algoriths in a way that they can use ultiple siilarity queries instead of single siilarity queries. 3.1 Iterative Neighborhood Exploration Many data ining algoriths start fro a set of specified database objects and iteratively consider the neighborhood of the visited objects. The neighborhood of a given object is defined as the set of siilar database objects with respect to a siilarity query. We introduce a generic schee for such algoriths which we call ExploreNeighborhoods. Figure 2 depicts the algorithic schee in pseudo code notation where DB denotes a database, StartObjects denotes a subset of DB and SiType specifies the type of siilarity query. The dots in the arguent list of soe functions indicate additional arguents which ay be necessary for different instances of this algorithic schee. ExploreNeighborhoods(DB, StartObjects, SiType,...) ControlList := StartObjects; while ( condition_check(controllist,...) = TRUE ) do Object := ControlList.choose(...); proc_1(object,...); Answers := DB.siilarity_query(Object, SiType); proc_2(answers,...); ControlList := ( ControlList» filter(answers,...) ) - {Object}; end while; Figure 2: Algorithic schee ExploreNeighborhoods Starting fro the objects in the set StartObjects, the algorith repeatedly retrieves the neighborhood of objects taken fro the ContolList as long as the function condition_check returns TRUE for the ControlList. In the ost siple for, the function checks whether ControlList is not epty. If the neighborhood of objects should only be investigated up to a certain depth, then an additional paraeter for the nuber of steps that have to be perfored can be used in the function condition_check. The control structure of the ain loop works as follows: objects are selected fro the ControlList, one at a tie, and a siilarity query is perfored for this object. The procedures proc_1 and proc_2 perfor soe processing on the selected object as well as on the answers of the siilarity query that will vary fro task to task. Before repeating the loop, the ControlList is updated. Soe or all of the answers which are not yet processed are siply inserted into the ControlList. The function filter(answers,...) reoves fro the set of answers at least those objects which have already been in the ControlList in previous states of the algorith, if any exists. This ust be done to guarantee the terination of the algorith. 3.2 Instances of Iterative Neighborhood Exploration In the following, we discuss several typical instances of the ExploreNeighborhoods schee in order to show that any data ining applications follow this schee: 5

6 - Manual Data Exploration When anually exploring a ultiedia database, for exaple, in proc_2 the answers are visualized and the user ay store the ultiedia objects considered to be interesting. In the filter the user ay prune answers which are too dissiilar fro the initial start objects. - Spatial Association Rules [KH 95] introduces spatial association rules describing associations between objects based on spatial neighborhood relations. For instance, a rule ay be discovered stating that 80% of the selected towns are close to soe water such as a lake or river. In this algorith, the set StartObjects is equal to the set of all database objects of a specified type such as town. SiType corresponds to the type of spatial neighborhood such as intersects which is given by the user. proc_2 calculates the support, that is the relative frequency, of the retrieved pairs of objects and the filter passes all of these pairs which have at least a specified iniu support. - Density-Based Clustering DBSCAN [EKSX 96] is a typical density-based clustering algorith. To find a cluster, DBSCAN starts with soe database object o and retrieves all objects density-reachable fro o with respect to two paraeters Eps and MinPts. Initially, ControlList contains an arbitrary database object and range queries with a range of Eps are used as siilarity queries. proc_2 counts the answers and the filter passes all answers which have not yet been assigned to soe cluster if the cardinality of the set of answers is at least MinPts. - Siultaneous Classification of a Set of Objects In an astronoy database [Hog 97], for exaple, all new stars observed by a telescope during the night are processed and added to the database the next day. Part of this processing is to classify the set of new stars, that is to assign each of the to one of the well-known classes. k-nearest neighbor classifiers [Mit 97] are effective for this task and they issue a k-nearest neighbor query for each of the objects to be classified. In this case, proc_1 is epty. proc_2 finds the ajority class in the set of k-nearest neighbors and filter always returns an epty list, that is no additional query objects are generated. - Spatial Trend Detection A spatial trend has been defined as a regular change of one or ore non-spatial attributes when oving away fro a given start object o [EFKS 98]. Neighborhood paths starting fro o odel the oveent and a regression analysis is perfored on the respective attribute values for the objects of a neighborhood path to describe the regularity of change. For this data ining task, the ExploreNeighborhoods loop is additionally controlled by the nuber of steps (i.e. the length of a neighborhood path) and the procedures proc_1 and proc_2 perfor the regression analysis on the paths. - Proxiity Analysis The goal of proxiity analysis is to explain the existence of soe cluster of objects by using the features of neighboring objects. [KN 96] presents an algorith which can efficiently find the top-k objects that are closest to a given cluster. A second algorith takes these k objects as input and finds the features that are coon to ost of the. Characteristic properties such as ost of the clusters are close to private schools and parks ay be discovered. In this case, StartObjects contains all the objects of the specified cluster. proc_2 considers the features of the k-nearest neighbors and returns the ost coon ones. The filter returns an epty list iplying that no additional query objects are added. 6

7 3.3 Transforation into Multiple Query For An instance of Iterative Neighborhood Exploration can benefit fro a ultiple siilarity query because the algorithic schee can be transfored into a ultiple query for such that it uses ultiple siilarity queries instead of single siilarity queries. If the evaluation of a ultiple siilarity query for query objects can be perfored ore efficiently than the evaluation of the corresponding single siilarity queries - which is possible as we will see in the next sections - the runtie of the whole class of ExploreNeighborhoods-algoriths will be iproved. We assue the following ethod to be available: ListOfAnswerSetsDB::ultiple_siilarity_query(ListOfObjects, SiTypes); Let the siilarity queries for all of the query objects in ListOfObjects be soehow perfored siultaneously and let the generated answers for all of these queries be stored in soe internal buffer of the DBMS. If each of the queries is copletely answered after the call of ultiple_siilarity_query, successive calls containing queries which were already asked in a previous call of the ethod then can just pick the answers fro the buffer. Note, however, that we do not require the ultiple siilarity query to generate a coplete set of answers for each of the posed queries. One call of a ultiple siilarity query ust only guarantee that the answers for the first query object are coplete. We will discuss this weak specification of a ultiple siilarity query in ore detail in the next section. The intuitive eaning is that if we ask a siilarity query for the first object, we can additionally infor the DBMS that the siilarity queries for the other query objects will probably be asked later and the DBMS ay use this inforation to iprove the overall runtie for the set of queries by retrieving (soe of) the respective answers in advance. The transfored algorithic schee called ExploreNeighborhoodsMultiple is presented in figure 3. As we can see, the reforulation can be done in a purely syntactical way. Thus, a query optiizer can autoatically use ultiple siilarity queries to efficiently process an ExploreNeighborhoods-algorith if a ultiple siilarity query is available as a basic DBMS operation. Obviously, the algorithic schee ExploreNeighborhoodsMultiple perfors exactly the sae task as the original ExploreNeighborhoods schee. The only differences are that a set of objects is selected fro the ContolList instead of selecting a single object and a ultiple siilarity query is perfored instead of a single siilarity query. However, in one execution of the ain loop, the algorith processes only the first eleent of the set of selected objects and its corresponding set of answers. ExploreNeighborhoodsMultiple(DB, StartObjects, SiTypes,...) ControlList := StartObjects; while ( condition_check(controllist,...) = TRUE ) do ListOfObjects := ControlList.choose_ultiple(...); // ListOfObjects = [object 1,..., object ] proc_1(listofobjects.first(),...); SetOfAnswers:= DB.ultiple_siilarity_query(ListOfObjects, SiTypes); // SetOfAnswers=[answers 1,...,answers ], SiTypes=[SiType 1,...,SiType ] proc_2(setofanswers.first(),...); ControlList := (ControlList filter(setofanswers.first(),...)) - {ListOfObjects.first()}; end while; Figure 3: Algorithic schee ExploreNeighborhoodsMultiple 7

8 4 Multiple Siilarity Queries In this section, the notion of a ultiple siilarity query is presented in ore detail. Definition 4: (ultiple siilarity query) Let DB Objects be a database containing n objects. Let Queries = [Q 1, Q 2,..., Q ] be a sequence of query objects Q i Objects and let SiTypes = [T 1,..., T ] be the corresponding sequence of query types. A ultiple siilarity query, denoted by DB.ultiple_siilarity_query(Queries, SiTypes), returns a sequence Answers =[A 1,..., A ], containing for each eleent Q i in Queries a corresponding set A i of objects of DB where the following holds: 1.) A 1 = DB.siilarity_query(Q 1, T 1 ), and 2.) A i DB.siilarity_query(Q i, T i ) for all 2 i. Only for the first query, all answers ust be deterined in a single call of a ultiple siilarity query. The reaining queries ay be answered copletely or partially, depending on the ipleentation of the ultiple siilarity query. We will argue in the next subsection that an increental ipleentation, i.e. only the first query is answered copletely and other queries are answered partially in a single call of the ethod, ay be ore efficient if we consider the overall run-tie of an ExploreNeighborhoodsMultiple algorith. Using ultiple siilarity queries instead of single siilarity queries we ay spend less I/O tie and less CPU tie for a set of queries. First, we read a single page only once for the whole set of queries. Second, knowing a whole set of query objects in advance, we can use the distances between these query objects to replace expensive distance coputations by significantly cheaper distance coparisons using the triangle inequality. Our algorith for a ultiple siilarity query is depicted in figure 4. The only parts that differ fro the algorith for a single siilarity query (see figure 1) - besides the obvious handling of ultiple query objects and types - are as follows: - restore_fro_buffer([q 1,..,Q ],[T 1,..,T ]) - buffer_answers([answers 1,..,Answers ]) In the beginning, we have to restore (partial) answers fro an internal buffer -if available- and we have to store generated answers into this buffer at the end. - deterine_relevant_data_pages([q 1,..., Q ], [T 1,..., T ]) This procedure returns the set of all data pages relevant for Q 1 and, additionally, it returns soe or all of the relevant data pages for the reaining query objects, that is relevant_pages( Q 1 ) deterine_relevant_pages( [ Q 1,, Q ], [ T 1,, T ]) relevant_pages( Q i ) - First, a subset of the set of all queries is chosen which should be copletely answered. If the ipleentation is based on the linear scan, each data page is relevant. If using an index structure such as the X-tree, the set of all data pages which cannot be excluded fro the search for at least one of the selected queries is deterined fro the directory of the tree. Note that our ipleentation of a ultiple siilarity query on top of an index structure converges to the ethod for the linear scan when the page selectivity of the index decreases, e.g. with increasing diension of the data space. In the worst case, the index has no selectivity at all, which eans i = 1 8

9 DB::ultiple_siilarity_query(objects [Q 1,..., Q ]; types [T 1,..., T ]) [Answers 1,..., Answers ] := Self.restore_fro_buffer([Q 1,..., Q ], [T 1,..., T ]); Self.deterine_relevant_data_pages([Q 1,..., Q ], [T 1,..., T ]); for i fro 1 to do QueryDists i := T i.range; for i fro 1 to, for j fro i+1 to do QObjDists ij := dist(q i, Q j ); while Self.unprocessed_pages() do NextPage := Self.read_next_page_fro_disk(); for each object O in NextPage do for i fro 1 to do AvoidingDists i := UNDEFINED; for i fro 1 to do if Self.page_is_relevant(NextPage, Q i ) then if not avoid_dist_coputation(o,q i,qobjdists,avoidingdists) then Distance := dist(o, Q i ); AvoidingDists i := Distance; if Distance QueryDists i then Answers i.insert(o); // in ascending order of dist(a,q) if Answers i.cardinality() > T i.cardinality() then Answers i.reove_last_eleent(); QueryDists i := Self.adapt_query_dist(Distance,QueryDists i,t i ); Self.prune_pages(QueryDists i ); Self.buffer_answers([Answers 1,..., Answers ]); return [Answers 1,..., Answers ]; Figure 4: Algorith ultiple_siilarity_query that no data page can be excluded fro the siilarity search. The details of deterining the relevant data pages are presented in section avoid_dist_coputation(o,q i,qobjdists,avoidingdists) We calculate the inter-object distances for all pairs of query objects and store the into QObjDists. Distances which ust be calculated are teporarily stored into AvoidingDists. These distances and the QObjDists are needed for the application of the triangle inequality perfored by avoid_dist_coputation(o,q i,qobjdists, AvoidingDists). The details of avoiding distance calculations are presented in section Efficient Support for Multiple Siilarity Queries In this section, we present techniques that significantly reduce the aount of disk I/O as well as the nuber of CPU operations needed to evaluate a ultiple siilarity query copared to a set of single siilarity queries. Furtherore, we briefly discuss how to achieve a further perforance gain when using parallelization techniques for the processing of ultiple siilarity queries. Note that there is an upper liit for the nuber of ultiple siilarity queries which can be processed siultaneously. This liit is deterined by the aount of ain eory available to buffer the answers and by the coputational overhead for calculating the inter-object 9

10 distances between all pairs of query objects. Therefore, we assue that a total nuber of M siilarity queries is processed in M ---- consecutive blocks of ultiple queries. Let C i = i C I/O + i C CPU be the cost for siultaneously processing i siilarity queries. Then, the cost for evaluating M queries using single siilarity queries is equal to M C 1, the cost for evaluating M queries using ultiple siilarity queries is equal to M---- C. Consequently, for a ultiple siilarity query to iprove the efficiency of single siilarity queries, the following condition ust hold: C < C Reducing I/O Cost We discuss the algorith ultiple_siilarity_query fro an I/O cost point of view for two different ipleentations - one on top of the linear scan and another one on top of an index structure such as the X-tree. When perforing a linear scan, the ultiple siilarity query perfors a condition check for all query objects while perforing a single scan over the database and returns a sequence of answers for each query object. If the diension d of the data space is very high, the scan ay actually be the ost efficient ethod to answer siilarity queries because, in general, the perforance of index structures degenerates with increasing diension d. When using a tree-like index structure (e.g. X-tree) to answer a single siilarity query, a set of data pages which cannot be excluded fro the search is deterined fro the directory of the tree. These pages are then exained and the answers to the query are deterined. To answer a ultiple siilarity query for a set Q = [Q 1,..., Q ] of query objects, we propose a siilar procedure. First, we deterine the data pages to be read as if answering only the siilarity query for Q 1. However, when processing these pages, we do not only collect the answers in the neighborhood of Q 1 but we also collect answers for the Q i (i=2,..., ) if the pages loaded for Q 1 are also relevant for Q i. After this first step, the query for Q 1 is copletely finished and the answers for all the other objects are partially deterined. To deterine the coplete answers for the other query objects [Q 2,..., Q ] we have to call the ethod repeatedly for [Q 2,..., Q ], [Q 3,..., Q ],..., [Q ]. However, in subsequent calls the partial answers are first restored fro the internal buffer. For instance, the second call for [Q 2,..., Q ] will only consider data pages which are relevant for Q 2 but which have not been processed in the first call. This increental processing of a ultiple siilarity query has the advantage that (partial) answers to all of the queries can be presented to a user at a very early stage of the evaluation. Furtherore, the increental approach is very efficient if an ExploreNeigborhoods-algorith dynaically adds new query objects when processing the answers obtained for previous query objects. Let us assue a first call of DB.ultiple_siilarity_query([Q 1,...,Q ], [T 1,..., T ]). Furtherore, after this call let soe answers A 1,..., A k for the query object Q 1 be inserted into the ControlList of the ExploreNeigborhoods-algorith and let these objects be inserted into the sequence Q of query objects at the beginning of the second execution of the ain loop. Then, the ultiple siilarity query is executed for Q = [Q 2,..., Q, A 1,..., A k ] iplying that now all data pages are considered which have not been processed for object Q 1 but have to be loaded for object Q 2. It is very likely for an ExploreNeighborhoods-algorith that soe of these pages ust also be considered for soe of the objects A i (i=1,..., k). Then, the answers for the objects A i are (partially) collected fro the current data pages deterined by the object Q 2. These pages will not be loaded again when A i becoes the first eleent of Q. If we use a non- 10

11 increental evaluation of a ultiple siilarity query we have to load these pages again, resulting in an overall higher nuber of disk I/Os. For ultiple siilarity queries Q 1,..., Q the I/O cost is proportional to relevant_pages( Q i ) where S denotes the cardinality of a set S. Obviously, an I/O speed-up is achieved if (and only if) there are data pages which are relevant for ore than one query object - ore forally: if i = 1 relevant_pages( Q i ) < i = 1 relevant_pages( Q i ). C I/O C I/O In the case of the linear scan, it holds that =, because relevant_pages(q 1 ) =... = 1 C I/O i = 1 relevant_pages(q ), and therefore, the condition C I/O < 1 C I/O is obviously satisfied. The average I/O cost 1 C I/O for one query object is and the speed-up factor for a ultiple siilarity query copared to single siilarity queries (with respect to disk I/O) is exactly equal to. In the case of a tree-like index structure, the ratio i = 1 relevant_pages( Q i ) relevant_pages( Q i ) which deterines the actual speed-up factor cannot be analytically derived. However, in higher diensions it is very likely that a data page is relevant for ore than one query object, especially if the queries are dynaically generated by an ExploreNeighborhoods-algorith. Therefore, we assue that the condition < is also satisfied in this case, even though we expect the gain of a ultiple siilarity query on top of a tree-like index to be saller copared to an ipleentation for the sequential scan. Note that the perforance of a ultiple siilarity query with respect to the I/O cost is never worse than the perforance of a single query. i = 1 C I/O 1 C I/O 5.2 Reducing CPU Cost The basic idea for reducing the CPU cost is to use the triangle inequality to avoid distance coputations which are the ost expensive operations when evaluating a siilarity query. The proposed approach akes use of the fact that a distance calculation is typically uch ore expensive than a distance coparison. To apply the triangle inequality to avoid distance calculations, we need to know the distances for each pair of query objects (Q i,q j ) which have to be calculated and stored in advance. This coputational overhead is (up to a certain value of depending on the nuber n of database objects) relatively sall copared to the savings of distance coputations by such a preprocessing. Intuitively, there are two cases where the calculation of dist(q j,o) can be avoided for a query object Q j and a database object O if we already know the distance between Q j and a second query object Q i and the distance dist(q i,o) has already been calculated: first, the query objects Q i and Q j are close to each other and dist(q i,o) is large; second, the query objects Q i and Q j have a large distance fro each other and dist(q i,o) is sall. To outline the proposed ethod ore forally, we first define the notion of an avoidable distance calculation in the context of ultiple siilarity queries. 11

12 Definition 5: (avoidable distance calculation) Let Queries=[Q 1,..., Q ] be the query objects for a ultiple siilarity query, O DB, and let l, 1 l <, be a natural nuber. Furtherore, let the values of dist(q i, Q j ) be known for all 1 i, 1 j, and let dist(q i,o) be known for all 1 i l. Let QueryDist(Q i ), 1 i, denote the query distance of Q i in a current execution step of the ultiple siilarity query. Then, we call the calculation of dist(q l+1,o) avoidable with respect to Queries if we can conclude that dist( Q l + 1, O) QueryDist( Q l + 1 ) without having to calculate dist(q l+1,o). To show that a distance calculation is avoidable, we apply the triangle inequality - satisfied by the etric distance function dist - to the triangle defined by two query objects Q 1 and Q 2 and a database object O. We obtain the following three inequalities which hold siultaneously: (1) (2) dist( O, Q 1 ) dist( O, Q 2 ) + dist( Q 2, Q 1 ) dist( O, Q 2 ) dist( O, Q 1 ) + dist( Q 1, Q 2 ) (3) dist( Q 1, Q 2 ) dist( Q 1, O) + dist( O, Q 2 ). Inequality (1) can be used to show the avoidability of the calculation of dist(q 2,O) because it yields a lower bound for dist(q 2,O). This is foralized in the following lea. Lea 1. Let Q 1, Q 2 Objects be query objects and let O Objects be a database object. Let dist be a etric distance function dist: Objects Objects R +. If dist( O, Q 1 ) dist( Q 2, Q 1 ) + QueryDist( Q 2 ) holds, then it follows that dist( Q 2, O) QueryDist( Q 2 ) Proof. We reforulate inequality (1) as follows: dist( O, Q 2 ) dist( O, Q 1 ) dist( Q 2, Q 1 ). By assuption, dist( O, Q 1 ) dist( Q 2, Q 1 ) + QueryDist( Q 2 ). Then, dist( O, Q 1 ) dist( Q 2, Q 1 ) QueryDist( Q 2 ). By exploiting the syetry of dist we derive: dist( Q 2, O) dist( O, Q 1 ) dist( Q 2, Q 1 ) QueryDist( Q 2 ) Figure 5 (left) illustrates a situation where lea 1 holds and the calculation of dist(q 2,O) can be avoided. Inequality (2) is not useful for the purpose of avoiding distance calculations because it yields an upper bound dist(q 1,O) dist(q 1,Q 2 ) + + QueryDist(Q 2 ) dist(q 1,Q 2 ) dist(q 1,O) + + QueryDist(Q 2 ) Q 1 Q 2 Q 1 Q 2 O O QueryDist(Q 2 ) calculated in advance calculated for a previous query calculation can be avoided O: database object Q 1, Q 2 : query objects Figure 5: Illustration of lea 1 and lea 2 12

13 and not a lower bound for dist(q 2,O). Inequality (3), however, can be used analogously to inequality (1) and yields the following lea. Lea 2. Let Q 1, Q 2 Objects be query objects and let O Objects be a database object. Let dist be a etric distance function dist: Objects Objects R +. If dist( Q 2, Q 1 ) dist( O, Q 1 ) + QueryDist( Q 2 ) holds, then it follows that dist( Q 2, O) QueryDist( Q 2 ). Proof. Analogous to proof of lea 1. Figure 5 (right) depicts a case where lea 2 can be applied to avoid the calculation of dist(q 2,O). To conclude the two above leata, figure 6 illustrates the area of database objects O for which the calculation of the distance fro a query object Q 2 can be avoided. For exaple, the calculation of dist(q 2, O 1 ) can be avoided because dist( O 1, Q 1 ) dist( Q 2, Q 1 ) QueryDist( Q 2 ) holds, and the calculation of dist(q 2,O 2 ) can also be avoided because dist( O 2, Q 1 ) dist( Q 2, Q 1 ) + QueryDist( Q 2 ) holds. O 1 qq 2 2 o 3 o 2 q 1 O 2 Q 1 Area of objects O for which calculation of dist(q 2,O) can be avoided Q i : query objects O i : database objects QueryDist(Q 2 ) calculated in advance calculated for a previous query calculation can be avoided Figure 6: Area of database objects for which calculation of dist can be avoided The CPU cost for processing ultiple siilarity queries is given by the following forula ( 1) C CPU = tie( dist) + avoiding_tries tie( coparison) + not_avoided tie(dist) 2 where avoiding_tries denotes the nuber of (successful or not successful) applications of triangle inequalities and not_avoided denotes the nuber of distance calculations which actually have to be perfored. Obviously, this forula contains several application dependent paraeters which can only be deterined experientally. In the worst case, if no distance calculations can be avoided at all, it holds that C CPU. However, we observed that is significantly saller than C CPU if is sall copared to the database size (see section 6 for details). 5.3 Potentials for Parallelization In this section, we will briefly discuss the ipleentation of a ultiple siilarity query on top of a parallel query processor for a shared nothing environent. In such an environent, the data is distributed aong s servers such that the sae siilarity query is perfored on each server in parallel. However, each process has to 1 C CPU > 1 C CPU 13

14 look only at its local part of the data which is s ties saller than the whole database. The counication overhead in this setting is very sall so that a speed-up (copared to a sequential ipleentation) in the order of s C C s can be expected, i.e. the cost for perforing ultiple siilarity queries is reduced to s. The ipleentation of such a parallel query processor is trivial for the linear scan. For a parallel ipleentation, for exaple, of the X-tree see [Ber+ 97]. The transition fro one coputer to s coputers of the sae type also akes s-ties the ain eory available. If we use these additional resources when perforing ultiple siilarity queries in parallel, we can gain a rearkable speed-up factor for M s siilarity queries which is larger than the nuber s of achines. This effect is due to the fact that we can increase the nuber of query objects to be processed siultaneously if we have ore eory to buffer the answers. In a parallel environent each process produces only one s-th of the answer set for a query object on the average. Therefore, instead of evaluating M siilarity queries in blocks of queries on a single achine we can now use blocks of s queries. This eans that the cost for evaluating M queries using parallel ultiple si- M M ilarity queries is equal to copared to ---- C for the sequential ipleentation. Consequently, the speed-up factor for a parallel ultiple query versus a sequential ultiple query is larger than s s s if C s < s C holds. Fro sections 5.1 and 5.2 we know that we can expect this condition to be satisfied at s C CPU least for the I/O cost. We cannot prove that the condition < s C CPU holds for the CPU cost but section 6.2 deonstrates this experientally. Note, however, that even if this condition did not hold, we still would have the noral speed-up factor of s when using parallelization. C 6 Perforance Evaluation We perfored an extensive experiental evaluation of our technique for ultiple siilarity queries using real databases. The first database, part of the so-called Tycho catalogue [Hog 97], was provided by the European Space Agency (ESA) and contains 20-d feature vectors of 1,000,000 stars and galaxies. The second dataset was a large iage database containing 64-d color histogras of 112,000 iages fro TV snapshots. We investigated two extree instances of iterative neighborhood exploration discussed in section 3.2, one with independent queries and another one with highly dependent queries: On the Astronoy database, we tested siultaneous classification of a set of objects. M objects fro the database were chosen randoly and a k-nearest neighbor query was perfored for each of these query objects. On the iage database, we siulated anual data exploration by a nuber of c concurrent users in the following way. We randoly selected a first query object for each of the users and perfored a k-nearest neighbor query for each of the obtaining a total of c k answers. Then we perfored the following loop. While each of the hypothetic users chose one fro his k current answers, for each of the current answers we prefetched their k-nearest neighbors. After restricting the set of answers to the answers of the objects chosen by the users, we continued the loop with these new query objects etc. Thus, in each loop we generated = c k new query objects for which we perfored k-nearest neighbor queries. 14

15 We experiented with a broad range of k values and found that the average cost per k-nearest neighbor query was quite robust to the value of k. All the results reported in the following were obtained for k = 10 (Astronoy database) and k = 20 (iage database) which are typical paraeter values for the respective applications. All experients were perfored on Intel Pentiu II (300 MHz) based workstations running Linux 6.0, each workstation equipped with 128 MBytes of ain eory. Both, the linear scan and the X-tree were ipleented in C++. The block size of the X-tree was set to 32 KBytes and the buffer size was set to 10% of the X-tree size. 6.1 Reduction of I/O Cost We begin by studying the effect of our technique for ultiple siilarity queries on the I/O cost. Figure 7 depicts the average I/O cost per siilarity query with respect to the nuber of ultiple siilarity queries for the Astronoy database as well as for the iage database. For a single siilarity query, the X-tree outperfors the linear scan by a factor of 4.5 and 3.1. For = 100 query objects, however, the average I/O cost of the X-tree is 1.5 and 3.6 ties the average I/O cost of the linear scan. While the enorous reduction of I/O cost (a factor of nearly ) is expected for the linear scan, it is worth noticing that also the average I/O cost of the X-tree is reduced by a factor of 8.7 and 15 for 100 ultiple siilarity queries SCAN (Astronoy DB) X-tree (Astronoy DB) SCAN (Iage DB) X-tree (Iage DB) Average I/O cost (sec) Nuber of ultiple siilarity queries ( ) Figure 7: Average I/O cost per siilarity query 6.2 Reduction of CPU Cost The aount of CPU cost which can be saved when a data object is disqualified on the basis of the triangle inequality depends on the diensionality of the database since the CPU cost for a distance calculation increases with the diensionality whereas the CPU cost for evaluating the triangle inequality is constant. We easured the following average runties on our test databases. On 20-d data objects the CPU cost for calculating the Euclidean distance ( 4,3µsec ) was 52 ties the CPU cost for evaluating a triangle inequality ( 0,082µsec ) and on the 64-d data objects the factor was 155 ( 12,7µsec versus 0,082µsec ). We easured the average CPU cost per query for 10, 20, 40, 50 and 100 ultiple siilarity queries (cf. figure 8). For the linear scan, the average CPU cost for a siilarity query decreases fro 4.3 sec to 0.6 sec on the Astronoy database when increasing fro 1 to 100. This corresponds to a reduction of the CPU cost by a factor of 7.1. On the iage database, the factor of the CPU cost reduction is even 28. This effect can be ex- 15

16 Average CPU cost (sec) SCAN (Astronoy DB) X-tree (Astronoy DB) SCAN (Iage DB) X-tree (Iage DB) Nuber of ultiple siilarity queries ( ) Figure 8: Average CPU cost per siilarity query plained when considering the distribution of the databases: the Astronoy database is alost uniforly distributed, the iage database, however, is highly clustered. The linear scan profits fro clustered databases for the following reason: if the distance coputation for one data object fro a cluster can be avoided it is likely that the distance coputation for all other data objects lying in the sae cluster can also be avoided. For the X-tree, the effect of applying the triangle inequality is less than for the linear scan, it is 2.1 on the Astronoy database as well as on the iage database. The reason for this saller perforance gain of the X-tree is the fact that due to its indexing properties, the X-tree solely investigates data objects which are close to query objects. Since data objects which have a large distance to the query objects - and therefore a high probability to be excluded fro the distance calculation for ost of the query objects - are not considered, the potential for CPU cost reduction is less than for the linear scan. 6.3 Reduction of the total query cost We now consider the effect of our technique for ultiple siilarity queries on the total query cost and deterine the achieved speed-up. For both databases, figure 9 shows the average total query cost as the su of the average I/O cost and the average CPU cost. This can be done since the cost for anaging the query process can Average total query cost (sec) SCAN (Astronoy DB) X-tree (Astronoy DB) SCAN (Iage DB) X-tree (Iage DB) Nuber of ultiple siilarity queries ( ) Figure 9: Average total query cost per siilarity query 16

17 be neglected copared to the I/O cost and CPU cost. As expected, the average total query cost decreases with increasing for the linear scan and the X-tree. An iportant observation we ade is that for 20 (Astronoy database) and 100 (iage database) the total query cost is doinated by the CPU cost when perforing a linear scan. The average query cost of the X-tree was I/O bound for 100. Since the perforance gain is higher for the linear scan, the linear scan outperfors the X-tree for 10 (Astronoy database) and 100 (iage database). Figure 10 depicts the corresponding speed-up. When coparing = 100 to = 1, the linear scan achieves a speed-up of 28 on the Astronoy database and 68 on the iage database. For the X-tree, this speed-up is less due to the saller benefits fro the triangle inequality and the saller reduction of I/O cost. However, we still observe a speed-up of 7.2 on the Astronoy database and 12.1 on the iage database. Note that the speed-up factors are always higher on the iage database. Siilar to section 6.2, this effect can be explained with the distribution of the databases. Speed-up 70 SCAN (Astronoy DB) X-tree (Astronoy DB) 55 SCAN (Iage DB) 50 X-tree (Iage DB) Nuber of ultiple siilarity queries ( ) Figure 10: Speed-up with respect to 6.4 Effects of parallelization We also investigated the achievable speed-up when applying our technique for ultiple siilarity queries on top of a parallel query processor. The setting we used was a shared nothing environent with a TCP/IP network interconnecting 16 servers. For the ipleentation details of a parallel X-tree see [Ber+ 97]. For both databases, we perfored = 100 ultiple k-nearest neighbor queries on a single server and while we increased the nuber of servers (s = 4, 8, 16) we proportionally increased ( = 400, 800, 1600). Our technique of parallelization increases in order to exploit the fact that s ties the ain eory becoes available (see section 5.3). Figure 11 depicts the achieved speed-up per siilarity query coparing parallel ultiple siilarity queries to sequential ultiple siilarity queries. On the Astronoy database, the parallel linear scan achieves a superlinear speed-up using up to 8 servers and a near linear speed-up of 13.4 using 16 servers. For larger server nubers, i.e. also larger nubers of queries, two effects decrease the speed-up: (1) the cost for the coputation of the distance for each pair of query objects and (2) the cost for applying the triangle inequalities for each database object which - in the worst case - is also quadratic in. The second effect is less iportant for the X-tree because the X-tree visits only a considerably 17

Detection of Outliers and Reduction of their Undesirable Effects for Improving the Accuracy of K-means Clustering Algorithm

Detection of Outliers and Reduction of their Undesirable Effects for Improving the Accuracy of K-means Clustering Algorithm Detection of Outliers and Reduction of their Undesirable Effects for Iproving the Accuracy of K-eans Clustering Algorith Bahan Askari Departent of Coputer Science and Research Branch, Islaic Azad University,

More information

The optimization design of microphone array layout for wideband noise sources

The optimization design of microphone array layout for wideband noise sources PROCEEDINGS of the 22 nd International Congress on Acoustics Acoustic Array Systes: Paper ICA2016-903 The optiization design of icrophone array layout for wideband noise sources Pengxiao Teng (a), Jun

More information

Optimal Route Queries with Arbitrary Order Constraints

Optimal Route Queries with Arbitrary Order Constraints IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL.?, NO.?,? 20?? 1 Optial Route Queries with Arbitrary Order Constraints Jing Li, Yin Yang, Nikos Maoulis Abstract Given a set of spatial points DS,

More information

Clustering. Cluster Analysis of Microarray Data. Microarray Data for Clustering. Data for Clustering

Clustering. Cluster Analysis of Microarray Data. Microarray Data for Clustering. Data for Clustering Clustering Cluster Analysis of Microarray Data 4/3/009 Copyright 009 Dan Nettleton Group obects that are siilar to one another together in a cluster. Separate obects that are dissiilar fro each other into

More information

A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing

A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing A Trajectory Splitting Model for Efficient Spatio-Teporal Indexing Slobodan Rasetic Jörg Sander Jaes Elding Mario A. Nasciento Departent of Coputing Science University of Alberta Edonton, Alberta, Canada

More information

Identifying Converging Pairs of Nodes on a Budget

Identifying Converging Pairs of Nodes on a Budget Identifying Converging Pairs of Nodes on a Budget Konstantina Lazaridou Departent of Inforatics Aristotle University, Thessaloniki, Greece konlaznik@csd.auth.gr Evaggelia Pitoura Coputer Science and Engineering

More information

Energy-Efficient Disk Replacement and File Placement Techniques for Mobile Systems with Hard Disks

Energy-Efficient Disk Replacement and File Placement Techniques for Mobile Systems with Hard Disks Energy-Efficient Disk Replaceent and File Placeent Techniques for Mobile Systes with Hard Disks Young-Jin Ki School of Coputer Science & Engineering Seoul National University Seoul 151-742, KOREA youngjk@davinci.snu.ac.kr

More information

Theoretical Analysis of Local Search and Simple Evolutionary Algorithms for the Generalized Travelling Salesperson Problem

Theoretical Analysis of Local Search and Simple Evolutionary Algorithms for the Generalized Travelling Salesperson Problem Theoretical Analysis of Local Search and Siple Evolutionary Algoriths for the Generalized Travelling Salesperson Proble Mojgan Pourhassan ojgan.pourhassan@adelaide.edu.au Optiisation and Logistics, The

More information

Image Filter Using with Gaussian Curvature and Total Variation Model

Image Filter Using with Gaussian Curvature and Total Variation Model IJECT Vo l. 7, Is s u e 3, Ju l y - Se p t 016 ISSN : 30-7109 (Online) ISSN : 30-9543 (Print) Iage Using with Gaussian Curvature and Total Variation Model 1 Deepak Kuar Gour, Sanjay Kuar Shara 1, Dept.

More information

A Novel Fast Constructive Algorithm for Neural Classifier

A Novel Fast Constructive Algorithm for Neural Classifier A Novel Fast Constructive Algorith for Neural Classifier Xudong Jiang Centre for Signal Processing, School of Electrical and Electronic Engineering Nanyang Technological University Nanyang Avenue, Singapore

More information

Real-Time Detection of Invisible Spreaders

Real-Time Detection of Invisible Spreaders Real-Tie Detection of Invisible Spreaders MyungKeun Yoon Shigang Chen Departent of Coputer & Inforation Science & Engineering University of Florida, Gainesville, FL 3, USA {yoon, sgchen}@cise.ufl.edu Abstract

More information

The Internal Conflict of a Belief Function

The Internal Conflict of a Belief Function The Internal Conflict of a Belief Function Johan Schubert Abstract In this paper we define and derive an internal conflict of a belief function We decopose the belief function in question into a set of

More information

Data & Knowledge Engineering

Data & Knowledge Engineering Data & Knowledge Engineering 7 (211) 17 187 Contents lists available at ScienceDirect Data & Knowledge Engineering journal hoepage: www.elsevier.co/locate/datak An approxiate duplicate eliination in RFID

More information

Gromov-Hausdorff Distance Between Metric Graphs

Gromov-Hausdorff Distance Between Metric Graphs Groov-Hausdorff Distance Between Metric Graphs Jiwon Choi St Mark s School January, 019 Abstract In this paper we study the Groov-Hausdorff distance between two etric graphs We copute the precise value

More information

Investigation of The Time-Offset-Based QoS Support with Optical Burst Switching in WDM Networks

Investigation of The Time-Offset-Based QoS Support with Optical Burst Switching in WDM Networks Investigation of The Tie-Offset-Based QoS Support with Optical Burst Switching in WDM Networks Pingyi Fan, Chongxi Feng,Yichao Wang, Ning Ge State Key Laboratory on Microwave and Digital Counications,

More information

Geometry. The Method of the Center of Mass (mass points): Solving problems using the Law of Lever (mass points). Menelaus theorem. Pappus theorem.

Geometry. The Method of the Center of Mass (mass points): Solving problems using the Law of Lever (mass points). Menelaus theorem. Pappus theorem. Noveber 13, 2016 Geoetry. The Method of the enter of Mass (ass points): Solving probles using the Law of Lever (ass points). Menelaus theore. Pappus theore. M d Theore (Law of Lever). Masses (weights)

More information

Geo-activity Recommendations by using Improved Feature Combination

Geo-activity Recommendations by using Improved Feature Combination Geo-activity Recoendations by using Iproved Feature Cobination Masoud Sattari Middle East Technical University Ankara, Turkey e76326@ceng.etu.edu.tr Murat Manguoglu Middle East Technical University Ankara,

More information

Oblivious Routing for Fat-Tree Based System Area Networks with Uncertain Traffic Demands

Oblivious Routing for Fat-Tree Based System Area Networks with Uncertain Traffic Demands Oblivious Routing for Fat-Tree Based Syste Area Networks with Uncertain Traffic Deands Xin Yuan Wickus Nienaber Zhenhai Duan Departent of Coputer Science Florida State University Tallahassee, FL 3306 {xyuan,nienaber,duan}@cs.fsu.edu

More information

Computer Aided Drafting, Design and Manufacturing Volume 26, Number 2, June 2016, Page 13

Computer Aided Drafting, Design and Manufacturing Volume 26, Number 2, June 2016, Page 13 Coputer Aided Drafting, Design and Manufacturing Volue 26, uber 2, June 2016, Page 13 CADDM 3D reconstruction of coplex curved objects fro line drawings Sun Yanling, Dong Lijun Institute of Mechanical

More information

Verifying the structure and behavior in UML/OCL models using satisfiability solvers

Verifying the structure and behavior in UML/OCL models using satisfiability solvers IET Cyber-Physical Systes: Theory & Applications Review Article Verifying the structure and behavior in UML/OCL odels using satisfiability solvers ISSN 2398-3396 Received on 20th October 2016 Revised on

More information

On the Computation and Application of Prototype Point Patterns

On the Computation and Application of Prototype Point Patterns On the Coputation and Application of Prototype Point Patterns Katherine E. Tranbarger Freier 1 and Frederic Paik Schoenberg 2 Abstract This work addresses coputational probles related to the ipleentation

More information

1 Extended Boolean Model

1 Extended Boolean Model 1 EXTENDED BOOLEAN MODEL It has been well-known that the Boolean odel is too inflexible, requiring skilful use of Boolean operators to obtain good results. On the other hand, the vector space odel is flexible

More information

Different criteria of dynamic routing

Different criteria of dynamic routing Procedia Coputer Science Volue 66, 2015, Pages 166 173 YSC 2015. 4th International Young Scientists Conference on Coputational Science Different criteria of dynaic routing Kurochkin 1*, Grinberg 1 1 Kharkevich

More information

Utility-Based Resource Allocation for Mixed Traffic in Wireless Networks

Utility-Based Resource Allocation for Mixed Traffic in Wireless Networks IEEE IFOCO 2 International Workshop on Future edia etworks and IP-based TV Utility-Based Resource Allocation for ixed Traffic in Wireless etworks Li Chen, Bin Wang, Xiaohang Chen, Xin Zhang, and Dacheng

More information

Joint Measurement- and Traffic Descriptor-based Admission Control at Real-Time Traffic Aggregation Points

Joint Measurement- and Traffic Descriptor-based Admission Control at Real-Time Traffic Aggregation Points Joint Measureent- and Traffic Descriptor-based Adission Control at Real-Tie Traffic Aggregation Points Stylianos Georgoulas, Panos Triintzios and George Pavlou Centre for Counication Systes Research, University

More information

An Efficient Approach for Content Delivery in Overlay Networks

An Efficient Approach for Content Delivery in Overlay Networks An Efficient Approach for Content Delivery in Overlay Networks Mohaad Malli, Chadi Barakat, Walid Dabbous Projet Planète, INRIA-Sophia Antipolis, France E-ail:{alli, cbarakat, dabbous}@sophia.inria.fr

More information

A Low-Cost Multi-Failure Resilient Replication Scheme for High Data Availability in Cloud Storage

A Low-Cost Multi-Failure Resilient Replication Scheme for High Data Availability in Cloud Storage 216 IEEE 23rd International Conference on High Perforance Coputing A Low-Cost Multi-Failure Resilient Replication Schee for High Data Availability in Cloud Storage Jinwei Liu* and Haiying Shen *Departent

More information

Designing High Performance Web-Based Computing Services to Promote Telemedicine Database Management System

Designing High Performance Web-Based Computing Services to Promote Telemedicine Database Management System Designing High Perforance Web-Based Coputing Services to Proote Teleedicine Database Manageent Syste Isail Hababeh 1, Issa Khalil 2, and Abdallah Khreishah 3 1: Coputer Engineering & Inforation Technology,

More information

Performance Analysis of RAID in Different Workload

Performance Analysis of RAID in Different Workload Send Orders for Reprints to reprints@benthascience.ae 324 The Open Cybernetics & Systeics Journal, 2015, 9, 324-328 Perforance Analysis of RAID in Different Workload Open Access Zhang Dule *, Ji Xiaoyun,

More information

Solving the Damage Localization Problem in Structural Health Monitoring Using Techniques in Pattern Classification

Solving the Damage Localization Problem in Structural Health Monitoring Using Techniques in Pattern Classification Solving the Daage Localization Proble in Structural Health Monitoring Using Techniques in Pattern Classification CS 9 Final Project Due Dec. 4, 007 Hae Young Noh, Allen Cheung, Daxia Ge Introduction Structural

More information

Keyword Search in Spatial Databases: Towards Searching by Document

Keyword Search in Spatial Databases: Towards Searching by Document IEEE International Conference on Data Engineering Keyword Search in Spatial Databases: Towards Searching by Docuent Dongxiang Zhang #1, Yeow Meng Chee 2, Anirban Mondal 3, Anthony K. H. Tung #4, Masaru

More information

EE 364B Convex Optimization An ADMM Solution to the Sparse Coding Problem. Sonia Bhaskar, Will Zou Final Project Spring 2011

EE 364B Convex Optimization An ADMM Solution to the Sparse Coding Problem. Sonia Bhaskar, Will Zou Final Project Spring 2011 EE 364B Convex Optiization An ADMM Solution to the Sparse Coding Proble Sonia Bhaskar, Will Zou Final Project Spring 20 I. INTRODUCTION For our project, we apply the ethod of the alternating direction

More information

Defining and Surveying Wireless Link Virtualization and Wireless Network Virtualization

Defining and Surveying Wireless Link Virtualization and Wireless Network Virtualization 1 Defining and Surveying Wireless Link Virtualization and Wireless Network Virtualization Jonathan van de Belt, Haed Ahadi, and Linda E. Doyle The Centre for Future Networks and Counications - CONNECT,

More information

Collaborative Web Caching Based on Proxy Affinities

Collaborative Web Caching Based on Proxy Affinities Collaborative Web Caching Based on Proxy Affinities Jiong Yang T J Watson Research Center IBM jiyang@usibco Wei Wang T J Watson Research Center IBM ww1@usibco Richard Muntz Coputer Science Departent UCLA

More information

Analysing Real-Time Communications: Controller Area Network (CAN) *

Analysing Real-Time Communications: Controller Area Network (CAN) * Analysing Real-Tie Counications: Controller Area Network (CAN) * Abstract The increasing use of counication networks in tie critical applications presents engineers with fundaental probles with the deterination

More information

Scheduling Parallel Real-Time Recurrent Tasks on Multicore Platforms

Scheduling Parallel Real-Time Recurrent Tasks on Multicore Platforms IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL., NO., NOV 27 Scheduling Parallel Real-Tie Recurrent Tasks on Multicore Platfors Risat Pathan, Petros Voudouris, and Per Stenströ Abstract We

More information

Closing The Performance Gap between Causal Consistency and Eventual Consistency

Closing The Performance Gap between Causal Consistency and Eventual Consistency Closing The Perforance Gap between Causal Consistency and Eventual Consistency Jiaqing Du Călin Iorgulescu Aitabha Roy Willy Zwaenepoel EPFL ABSTRACT It is well known that causal consistency is ore expensive

More information

MULTI-INDEX VOTING FOR ASYMMETRIC DISTANCE COMPUTATION IN A LARGE-SCALE BINARY CODES. Chih-Yi Chiu, Yu-Cyuan Liou, and Sheng-Hao Chou

MULTI-INDEX VOTING FOR ASYMMETRIC DISTANCE COMPUTATION IN A LARGE-SCALE BINARY CODES. Chih-Yi Chiu, Yu-Cyuan Liou, and Sheng-Hao Chou MULTI-INDEX VOTING FOR ASYMMETRIC DISTANCE COMPUTATION IN A LARGE-SCALE BINARY CODES Chih-Yi Chiu, Yu-Cyuan Liou, and Sheng-Hao Chou Departent of Coputer Science and Inforation Engineering, National Chiayi

More information

Improve Peer Cooperation using Social Networks

Improve Peer Cooperation using Social Networks Iprove Peer Cooperation using Social Networks Victor Ponce, Jie Wu, and Xiuqi Li Departent of Coputer Science and Engineering Florida Atlantic University Boca Raton, FL 33431 Noveber 5, 2007 Corresponding

More information

Boosted Detection of Objects and Attributes

Boosted Detection of Objects and Attributes L M M Boosted Detection of Objects and Attributes Abstract We present a new fraework for detection of object and attributes in iages based on boosted cobination of priitive classifiers. The fraework directly

More information

A Discrete Spring Model to Generate Fair Curves and Surfaces

A Discrete Spring Model to Generate Fair Curves and Surfaces A Discrete Spring Model to Generate Fair Curves and Surfaces Atsushi Yaada Kenji Shiada 2 Tootake Furuhata and Ko-Hsiu Hou 2 Tokyo Research Laboratory IBM Japan Ltd. LAB-S73 623-4 Shiotsurua Yaato Kanagawa

More information

COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL

COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL 1 Te-Wei Chiang ( 蔣德威 ), 2 Tienwei Tsai ( 蔡殿偉 ), 3 Jeng-Ping Lin ( 林正平 ) 1 Dept. of Accounting Inforation Systes, Chilee Institute

More information

THE rapid growth and continuous change of the real

THE rapid growth and continuous change of the real IEEE TRANSACTIONS ON SERVICES COMPUTING, VOL. 8, NO. 1, JANUARY/FEBRUARY 2015 47 Designing High Perforance Web-Based Coputing Services to Proote Teleedicine Database Manageent Syste Isail Hababeh, Issa

More information

A Learning Framework for Nearest Neighbor Search

A Learning Framework for Nearest Neighbor Search A Learning Fraework for Nearest Neighbor Search Lawrence Cayton Departent of Coputer Science University of California, San Diego lcayton@cs.ucsd.edu Sanjoy Dasgupta Departent of Coputer Science University

More information

Mapping Data in Peer-to-Peer Systems: Semantics and Algorithmic Issues

Mapping Data in Peer-to-Peer Systems: Semantics and Algorithmic Issues Mapping Data in Peer-to-Peer Systes: Seantics and Algorithic Issues Anastasios Keentsietsidis Marcelo Arenas Renée J. Miller Departent of Coputer Science University of Toronto {tasos,arenas,iller}@cs.toronto.edu

More information

Enhancing Real-Time CAN Communications by the Prioritization of Urgent Messages at the Outgoing Queue

Enhancing Real-Time CAN Communications by the Prioritization of Urgent Messages at the Outgoing Queue Enhancing Real-Tie CAN Counications by the Prioritization of Urgent Messages at the Outgoing Queue ANTÓNIO J. PIRES (1), JOÃO P. SOUSA (), FRANCISCO VASQUES (3) 1,,3 Faculdade de Engenharia da Universidade

More information

Generating Mechanisms for Evolving Software Mirror Graph

Generating Mechanisms for Evolving Software Mirror Graph Journal of Modern Physics, 2012, 3, 1050-1059 http://dx.doi.org/10.4236/jp.2012.39139 Published Online Septeber 2012 (http://www.scirp.org/journal/jp) Generating Mechaniss for Evolving Software Mirror

More information

Ranking Spatial Data by Quality Preferences

Ranking Spatial Data by Quality Preferences 1 Ranking Spatial Data by Quality Preferences Man Lung Yiu, Hua Lu, Nikos Maoulis, and Michail Vaitis Abstract A spatial preference query ranks objects based on the qualities of features in their spatial

More information

Using Imperialist Competitive Algorithm in Optimization of Nonlinear Multiple Responses

Using Imperialist Competitive Algorithm in Optimization of Nonlinear Multiple Responses International Journal of Industrial Engineering & Production Research Septeber 03, Volue 4, Nuber 3 pp. 9-35 ISSN: 008-4889 http://ijiepr.iust.ac.ir/ Using Iperialist Copetitive Algorith in Optiization

More information

Brian Noguchi CS 229 (Fall 05) Project Final Writeup A Hierarchical Application of ICA-based Feature Extraction to Image Classification Brian Noguchi

Brian Noguchi CS 229 (Fall 05) Project Final Writeup A Hierarchical Application of ICA-based Feature Extraction to Image Classification Brian Noguchi A Hierarchical Application of ICA-based Feature Etraction to Iage Classification Introduction Iage classification poses one of the greatest challenges in the achine vision and achine learning counities.

More information

Modeling Parallel Applications Performance on Heterogeneous Systems

Modeling Parallel Applications Performance on Heterogeneous Systems Modeling Parallel Applications Perforance on Heterogeneous Systes Jaeela Al-Jaroodi, Nader Mohaed, Hong Jiang and David Swanson Departent of Coputer Science and Engineering University of Nebraska Lincoln

More information

Integrating fast mobility in the OLSR routing protocol

Integrating fast mobility in the OLSR routing protocol Integrating fast obility in the OLSR routing protocol Mounir BENZAID 1,2, Pascale MINET 1 and Khaldoun AL AGHA 1,2 1 INRIA, Doaine de Voluceau - B.P.105, 78153 Le Chesnay Cedex, FRANCE ounir.benzaid, pascale.inet@inria.fr

More information

Rule Extraction using Artificial Neural Networks

Rule Extraction using Artificial Neural Networks Rule Extraction using Artificial Neural Networks S. M. Karuzzaan 1 Ahed Ryadh Hasan 2 Abstract Artificial neural networks have been successfully applied to a variety of business application probles involving

More information

OPTIMAL COMPLEX SERVICES COMPOSITION IN SOA SYSTEMS

OPTIMAL COMPLEX SERVICES COMPOSITION IN SOA SYSTEMS Key words SOA, optial, coplex service, coposition, Quality of Service Piotr RYGIELSKI*, Paweł ŚWIĄTEK* OPTIMAL COMPLEX SERVICES COMPOSITION IN SOA SYSTEMS One of the ost iportant tasks in service oriented

More information

Distributed Multicast Tree Construction in Wireless Sensor Networks

Distributed Multicast Tree Construction in Wireless Sensor Networks Distributed Multicast Tree Construction in Wireless Sensor Networks Hongyu Gong, Luoyi Fu, Xinzhe Fu, Lutian Zhao 3, Kainan Wang, and Xinbing Wang Dept. of Electronic Engineering, Shanghai Jiao Tong University,

More information

ELEVATION SURFACE INTERPOLATION OF POINT DATA USING DIFFERENT TECHNIQUES A GIS APPROACH

ELEVATION SURFACE INTERPOLATION OF POINT DATA USING DIFFERENT TECHNIQUES A GIS APPROACH ELEVATION SURFACE INTERPOLATION OF POINT DATA USING DIFFERENT TECHNIQUES A GIS APPROACH Kulapraote Prathuchai Geoinforatics Center, Asian Institute of Technology, 58 Moo9, Klong Luang, Pathuthani, Thailand.

More information

A Learning Framework for Nearest Neighbor Search

A Learning Framework for Nearest Neighbor Search A Learning Fraework for Nearest Neighbor Search Lawrence Cayton Departent of Coputer Science University of California, San Diego lcayton@cs.ucsd.edu Sanjoy Dasgupta Departent of Coputer Science University

More information

A Hybrid Network Architecture for File Transfers

A Hybrid Network Architecture for File Transfers JOURNAL OF IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 6, NO., JANUARY 9 A Hybrid Network Architecture for File Transfers Xiuduan Fang, Meber, IEEE, Malathi Veeraraghavan, Senior Meber,

More information

Automatic Graph Drawing Algorithms

Automatic Graph Drawing Algorithms Autoatic Graph Drawing Algoriths Susan Si sisuz@turing.utoronto.ca Deceber 7, 996. Ebeddings of graphs have been of interest to theoreticians for soe tie, in particular those of planar graphs and graphs

More information

A Measurement-Based Model for Parallel Real-Time Tasks

A Measurement-Based Model for Parallel Real-Time Tasks A Measureent-Based Model for Parallel Real-Tie Tasks Kunal Agrawal 1 Washington University in St. Louis St. Louis, MO, USA kunal@wustl.edu https://orcid.org/0000-0001-5882-6647 Sanjoy Baruah 2 Washington

More information

Comparative Evaluation of Color-Based Video Signatures in the Presence of Various Distortion Types

Comparative Evaluation of Color-Based Video Signatures in the Presence of Various Distortion Types Coparative Evaluation of Color-Based Video Signatures in the Presence of Various Distortion Types Aritz Sánchez de la Fuente, Patrick Ndjiki-Nya, Karsten Sühring, Tobias Hinz, Karsten üller, and Thoas

More information

I ve Seen Enough : Incrementally Improving Visualizations to Support Rapid Decision Making

I ve Seen Enough : Incrementally Improving Visualizations to Support Rapid Decision Making I ve Seen Enough : Increentally Iproving Visualizations to Support Rapid Decision Making Sajjadur Rahan Marya Aliakbarpour 2 Ha Kyung Kong Eric Blais 3 Karrie Karahalios,4 Aditya Paraeswaran Ronitt Rubinfield

More information

Planet Scale Software Updates

Planet Scale Software Updates Planet Scale Software Updates Christos Gkantsidis 1, Thoas Karagiannis 2, Pablo Rodriguez 1, and Milan Vojnović 1 1 Microsoft Research 2 UC Riverside Cabridge, UK Riverside, CA, USA {chrisgk,pablo,ilanv}@icrosoft.co

More information

λ-harmonious Graph Colouring Lauren DeDieu

λ-harmonious Graph Colouring Lauren DeDieu λ-haronious Graph Colouring Lauren DeDieu June 12, 2012 ABSTRACT In 198, Hopcroft and Krishnaoorthy defined a new type of graph colouring called haronious colouring. Haronious colouring is a proper vertex

More information

Smarter Balanced Assessment Consortium Claims, Targets, and Standard Alignment for Math

Smarter Balanced Assessment Consortium Claims, Targets, and Standard Alignment for Math Sarter Balanced Assessent Consortiu s, s, Stard Alignent for Math The Sarter Balanced Assessent Consortiu (SBAC) has created a hierarchy coprised of clais targets that together can be used to ake stateents

More information

Deterministic Voting in Distributed Systems Using Error-Correcting Codes

Deterministic Voting in Distributed Systems Using Error-Correcting Codes IEEE TRASACTIOS O PARALLEL AD DISTRIBUTED SYSTEMS, VOL. 9, O. 8, AUGUST 1998 813 Deterinistic Voting in Distributed Systes Using Error-Correcting Codes Lihao Xu and Jehoshua Bruck, Senior Meber, IEEE Abstract

More information

Efficient Estimation of Inclusion Coefficient using HyperLogLog Sketches

Efficient Estimation of Inclusion Coefficient using HyperLogLog Sketches Efficient Estiation of Inclusion Coefficient using HyperLogLog Sketches Azade Nazi, Bolin Ding, Vivek Narasayya, Surajit Chaudhuri Microsoft Research {aznazi, bolind, viveknar, surajitc}@icrosoft.co ABSTRACT

More information

Fair Energy-Efficient Sensing Task Allocation in Participatory Sensing with Smartphones

Fair Energy-Efficient Sensing Task Allocation in Participatory Sensing with Smartphones Advance Access publication on 24 February 2017 The British Coputer Society 2017. All rights reserved. For Perissions, please eail: journals.perissions@oup.co doi:10.1093/cojnl/bxx015 Fair Energy-Efficient

More information

CS 361 Meeting 8 9/24/18

CS 361 Meeting 8 9/24/18 CS 36 Meeting 8 9/4/8 Announceents. Hoework 3 due Friday. Review. The closure properties of regular languages provide a way to describe regular languages by building the out of sipler regular languages

More information

A Fast Multi-Objective Genetic Algorithm for Hardware-Software Partitioning In Embedded System Design

A Fast Multi-Objective Genetic Algorithm for Hardware-Software Partitioning In Embedded System Design A Fast Multi-Obective Genetic Algorith for Hardware-Software Partitioning In Ebedded Syste Design 1 M.Jagadeeswari, 2 M.C.Bhuvaneswari 1 Research Scholar, P.S.G College of Technology, Coibatore, India

More information

IN many interactive multimedia streaming applications, such

IN many interactive multimedia streaming applications, such 840 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 8, NO. 5, MAY 06 A Geoetric Approach to Server Selection for Interactive Video Streaing Yaochen Hu, Student Meber, IEEE, DiNiu, Meber, IEEE, and Zongpeng Li, Senior

More information

TensorFlow and Keras-based Convolutional Neural Network in CAT Image Recognition Ang LI 1,*, Yi-xiang LI 2 and Xue-hui LI 3

TensorFlow and Keras-based Convolutional Neural Network in CAT Image Recognition Ang LI 1,*, Yi-xiang LI 2 and Xue-hui LI 3 2017 2nd International Conference on Coputational Modeling, Siulation and Applied Matheatics (CMSAM 2017) ISBN: 978-1-60595-499-8 TensorFlow and Keras-based Convolutional Neural Network in CAT Iage Recognition

More information

Design Optimization of Mixed Time/Event-Triggered Distributed Embedded Systems

Design Optimization of Mixed Time/Event-Triggered Distributed Embedded Systems Design Optiization of Mixed Tie/Event-Triggered Distributed Ebedded Systes Traian Pop, Petru Eles, Zebo Peng Dept. of Coputer and Inforation Science, Linköping University {trapo, petel, zebpe}@ida.liu.se

More information

Adaptive Holistic Scheduling for In-Network Sensor Query Processing

Adaptive Holistic Scheduling for In-Network Sensor Query Processing Adaptive Holistic Scheduling for In-Network Sensor Query Processing Hejun Wu and Qiong Luo Departent of Coputer Science and Engineering Hong Kong University of Science & Technology Clear Water Bay, Kowloon,

More information

A wireless sensor network for visual detection and classification of intrusions

A wireless sensor network for visual detection and classification of intrusions A wireless sensor network for visual detection and classification of intrusions ANDRZEJ SLUZEK 1,3, PALANIAPPAN ANNAMALAI 2, MD SAIFUL ISLAM 1 1 School of Coputer Engineering, 2 IntelliSys Centre Nanyang

More information

Guillotine subdivisions approximate polygonal subdivisions: Part III { Faster polynomial-time approximation schemes for

Guillotine subdivisions approximate polygonal subdivisions: Part III { Faster polynomial-time approximation schemes for Guillotine subdivisions approxiate polygonal subdivisions: Part III { Faster polynoial-tie approxiation schees for geoetric network optiization Joseph S. B. Mitchell y April 19, 1997; Last revision: May

More information

Shortest Path Determination in a Wireless Packet Switch Network System in University of Calabar Using a Modified Dijkstra s Algorithm

Shortest Path Determination in a Wireless Packet Switch Network System in University of Calabar Using a Modified Dijkstra s Algorithm International Journal of Engineering and Technical Research (IJETR) ISSN: 31-869 (O) 454-4698 (P), Volue-5, Issue-1, May 16 Shortest Path Deterination in a Wireless Packet Switch Network Syste in University

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN International Journal of Scientific & Engineering Research, Volue 4, Issue 0, October-203 483 Design an Encoding Technique Using Forbidden Transition free Algorith to Reduce Cross-Talk for On-Chip VLSI

More information

RECONFIGURABLE AND MODULAR BASED SYNTHESIS OF CYCLIC DSP DATA FLOW GRAPHS

RECONFIGURABLE AND MODULAR BASED SYNTHESIS OF CYCLIC DSP DATA FLOW GRAPHS RECONFIGURABLE AND MODULAR BASED SYNTHESIS OF CYCLIC DSP DATA FLOW GRAPHS AWNI ITRADAT Assistant Professor, Departent of Coputer Engineering, Faculty of Engineering, Hasheite University, P.O. Box 15459,

More information

3 Conference on Inforation Sciences and Systes, The Johns Hopkins University, March, 3 Sensitivity Characteristics of Cross-Correlation Distance Metric and Model Function F. Porikli Mitsubishi Electric

More information

QUERY ROUTING OPTIMIZATION IN SENSOR COMMUNICATION NETWORKS

QUERY ROUTING OPTIMIZATION IN SENSOR COMMUNICATION NETWORKS QUERY ROUTING OPTIMIZATION IN SENSOR COMMUNICATION NETWORKS Guofei Jiang and George Cybenko Institute for Security Technology Studies and Thayer School of Engineering Dartouth College, Hanover NH 03755

More information

MAPPING THE DATA FLOW MODEL OF COMPUTATION INTO AN ENHANCED VON NEUMANN PROCESSOR * Peter M. Maurer

MAPPING THE DATA FLOW MODEL OF COMPUTATION INTO AN ENHANCED VON NEUMANN PROCESSOR * Peter M. Maurer MAPPING THE DATA FLOW MODEL OF COMPUTATION INTO AN ENHANCED VON NEUMANN PROCESSOR * Peter M. Maurer Departent of Coputer Science and Engineering University of South Florida Tapa, FL 33620 Abstract -- The

More information

A Broadband Spectrum Sensing Algorithm in TDCS Based on ICoSaMP Reconstruction

A Broadband Spectrum Sensing Algorithm in TDCS Based on ICoSaMP Reconstruction MATEC Web of Conferences 73, 03073(08) https://doi.org/0.05/atecconf/087303073 SMIMA 08 A roadband Spectru Sensing Algorith in TDCS ased on I Reconstruction Liu Yang, Ren Qinghua, Xu ingzheng and Li Xiazhao

More information

Reconstruction of Time Series using Optimal Ordering of ICA Components

Reconstruction of Time Series using Optimal Ordering of ICA Components Reconstruction of Tie Series using Optial Ordering of ICA Coponents Ar Goneid and Abear Kael Departent of Coputer Science & Engineering, The Aerican University in Cairo, Cairo, Egypt e-ail: goneid@aucegypt.edu

More information

Structural Balance in Networks. An Optimizational Approach. Andrej Mrvar. Faculty of Social Sciences. University of Ljubljana. Kardeljeva pl.

Structural Balance in Networks. An Optimizational Approach. Andrej Mrvar. Faculty of Social Sciences. University of Ljubljana. Kardeljeva pl. Structural Balance in Networks An Optiizational Approach Andrej Mrvar Faculty of Social Sciences University of Ljubljana Kardeljeva pl. 5 61109 Ljubljana March 23 1994 Contents 1 Balanced and clusterable

More information

COMPUTER GENERATED HOLOGRAMS Optical Sciences 627 W.J. Dallas (Monday, August 23, 2004, 12:38 PM) PART III: CHAPTER ONE DIFFUSERS FOR CGH S

COMPUTER GENERATED HOLOGRAMS Optical Sciences 627 W.J. Dallas (Monday, August 23, 2004, 12:38 PM) PART III: CHAPTER ONE DIFFUSERS FOR CGH S COPUTER GEERATED HOLOGRAS Optical Sciences 67 W.J. Dallas (onday, August 3, 004, 1:38 P) PART III: CHAPTER OE DIFFUSERS FOR CGH S Part III: Chapter One Page 1 of 8 Introduction Hologras for display purposes

More information

QoS and Sensible Routing Decisions

QoS and Sensible Routing Decisions QoS and Sensible Routing Decisions Erol Gelenbe Dept. of Electrical & Electronic Engineering Iperial College London SW7 2BT e.gelenbe@iperial.ac.uk Abstract Network Quality of Service (QoS) criteria of

More information

Privacy-preserving String-Matching With PRAM Algorithms

Privacy-preserving String-Matching With PRAM Algorithms Privacy-preserving String-Matching With PRAM Algoriths Report in MTAT.07.022 Research Seinar in Cryptography, Fall 2014 Author: Sander Sii Supervisor: Peeter Laud Deceber 14, 2014 Abstract In this report,

More information

HIGH PERFORMANCE PRE-SEGMENTATION ALGORITHM FOR SONAR IMAGES

HIGH PERFORMANCE PRE-SEGMENTATION ALGORITHM FOR SONAR IMAGES HIGH PERFORMANCE PRE-SEGMENTATION ALGORITHM FOR SONAR IMAGES Benjain Lehann*, Konstantinos Siantidis*, Dieter Kraus** *ATLAS ELEKTRONIK GbH Sebaldsbrücker Heerstraße 235 D-28309 Breen, GERMANY Eail: benjain.lehann@atlas-elektronik.co

More information

A Beam Search Method to Solve the Problem of Assignment Cells to Switches in a Cellular Mobile Network

A Beam Search Method to Solve the Problem of Assignment Cells to Switches in a Cellular Mobile Network A Bea Search Method to Solve the Proble of Assignent Cells to Switches in a Cellular Mobile Networ Cassilda Maria Ribeiro Faculdade de Engenharia de Guaratinguetá - DMA UNESP - São Paulo State University

More information

Multipath Selection and Channel Assignment in Wireless Mesh Networks

Multipath Selection and Channel Assignment in Wireless Mesh Networks Multipath Selection and Channel Assignent in Wireless Mesh Networs Soo-young Jang and Chae Y. Lee Dept. of Industrial and Systes Engineering, KAIST, 373-1 Kusung-dong, Taejon, Korea Tel: +82-42-350-5916,

More information

Heterogeneous Radial Basis Function Networks

Heterogeneous Radial Basis Function Networks Proceedings of the International Conference on Neural Networks (ICNN ), vol. 2, pp. 23-2, June. Heterogeneous Radial Basis Function Networks D. Randall Wilson, Tony R. Martinez e-ail: randy@axon.cs.byu.edu,

More information

Efficient Constraint Evaluation Algorithms for Hierarchical Next-Best-View Planning

Efficient Constraint Evaluation Algorithms for Hierarchical Next-Best-View Planning Efficient Constraint Evaluation Algoriths for Hierarchical Next-Best-View Planning Ko-Li Low National University of Singapore lowl@cop.nus.edu.sg Anselo Lastra University of North Carolina at Chapel Hill

More information

POSITION-PATCH BASED FACE HALLUCINATION VIA LOCALITY-CONSTRAINED REPRESENTATION. Junjun Jiang, Ruimin Hu, Zhen Han, Tao Lu, and Kebin Huang

POSITION-PATCH BASED FACE HALLUCINATION VIA LOCALITY-CONSTRAINED REPRESENTATION. Junjun Jiang, Ruimin Hu, Zhen Han, Tao Lu, and Kebin Huang IEEE International Conference on ultiedia and Expo POSITION-PATCH BASED FACE HALLUCINATION VIA LOCALITY-CONSTRAINED REPRESENTATION Junjun Jiang, Ruiin Hu, Zhen Han, Tao Lu, and Kebin Huang National Engineering

More information

ACNS: Adaptive Complementary Neighbor Selection in BitTorrent-like Applications

ACNS: Adaptive Complementary Neighbor Selection in BitTorrent-like Applications ACNS: Adaptive Copleentary Neighbor Selection in BitTorrent-like Applications Zhenbao Zhou 1, 2, Zhenyu Li 1, 2, Gaogang Xie 1 1 Institute of Coputing Technology, Chinese Acadey of Sciences, Beijing 100190,

More information

Classification of Benign and Malignant DCE-MRI Breast Tumors by Analyzing the Most Suspect Region

Classification of Benign and Malignant DCE-MRI Breast Tumors by Analyzing the Most Suspect Region Classification of Benign and Malignant DCE-MRI Breast Tuors by Analyzing the Most Suspect Region Sylvia Glaßer 1, Uli Nieann 1, Uta Prei 2, Bernhard Prei 1, Myra Spiliopoulou 3 1 Departent for Siulation

More information

A Periodic Dynamic Load Balancing Method

A Periodic Dynamic Load Balancing Method 2016 3 rd International Conference on Engineering Technology and Application (ICETA 2016) ISBN: 978-1-60595-383-0 A Periodic Dynaic Load Balancing Method Taotao Fan* State Key Laboratory of Inforation

More information

Verdict Accuracy of Quick Reduct Algorithm using Clustering, Classification Techniques for Gene Expression Data

Verdict Accuracy of Quick Reduct Algorithm using Clustering, Classification Techniques for Gene Expression Data Verdict Accuracy of Quick Reduct Algorith using Clustering, Classification Techniques for Gene Expression Data T.Chandrasekhar 1, K.Thangavel 2 and E.N.Sathishkuar 3 1 Departent of Coputer Science, eriyar

More information

Efficient Learning of Generalized Linear and Single Index Models with Isotonic Regression

Efficient Learning of Generalized Linear and Single Index Models with Isotonic Regression Efficient Learning of Generalized Linear and Single Index Models with Isotonic Regression Sha M. Kakade Microsoft Research and Wharton, U Penn skakade@icrosoft.co Varun Kanade SEAS, Harvard University

More information

Effective Tracking of the Players and Ball in Indoor Soccer Games in the Presence of Occlusion

Effective Tracking of the Players and Ball in Indoor Soccer Games in the Presence of Occlusion Effective Tracking of the Players and Ball in Indoor Soccer Gaes in the Presence of Occlusion Soudeh Kasiri-Bidhendi and Reza Safabakhsh Airkabir Univerisity of Technology, Tehran, Iran {kasiri, safa}@aut.ac.ir

More information