Spatial Processing using Oracle Table Functions

Size: px
Start display at page:

Download "Spatial Processing using Oracle Table Functions"

Transcription

1 Spatial Processing using Oracle Table Functions Ravi Kanth V Kothuri, Siva Ravada and Weisheng Xu Spatial Technologies, NEDC, Oracle Corporation, Nashua NH Ravi.Kothuri, Siva.Ravada, Abstract Spatial joins and spatial index creation are two of the most expensive operations in Oracle Spatial. Since spatial indexing is implemented in extensible indexing framework where queries only return rows from a single table, spatial joins could not be effectively and efficiently implemented in Oracle8i and prior releases. On the other hand, spatial index creation involves much computation or I/O that could be easily parallelized. In this paper, we describe how Oracle Spatial applies parallel and pipelined table function technology to perform fast spatial joins and parallel index creation. This technology has been introduced in Oracle9i and allows users to iteratively return subsets of result rows to be used in the from clause of a SQL query. We present our experiences with these implementations and examine the performance on real datasets. 1 Introduction Spatial searching is a fundamental primitive in nontraditional databases such as GIS, CAD/CAM and multimedia applications. With the rapid proliferation of these databases in the past decade, extensive research has been conducted on the design of efficient data structures to enable fast spatial searching. Several data structures have been developed in this context. These include Quadtrees [23, 24, 29], R-trees [8, 25], hb-trees [15], TV-trees [14], SS-trees [31], and SR-trees [11]. Subsequent research has improved these basic structures further by proposing new techniques for query processing [3, 4, 6, 9, 12, 16, 18, 19, 28], faster and better index creation [7, 13, 27, 30], and better splitstrategies in dynamic updates [1, 2]. These techniques are especially effective for low-dimensional spatial data such as those in GIS and CAD/CAM applications. Commercial database vendors like IBM, and Oracle have also started implementing these indexing techniques to cater to the large and diverse GIS and CAD/CAM application markets. Oracle Spatial supports two spatial indexes: Linear Quadtrees and R-trees [22]. The Linear Quadtree (or Quadtree for short) computes tile approximations for data geometries at index creation time and creates B-tree indexes on the encoded tile approximations. On the other hand, R-trees construct a hierarchical structure using the MBRs of data geometries. A framework for optimizing most query operations in Quadtree and R-tree indexes has been developed in prior work [21, 22]. This work, however, does not address two grey areas that are still time-consuming: (1) index creation, and (2) R-tree spatial joins, which used a nested-loop join for lack of support for joins in Oracle extensible indexing. In this paper, we describe how to improve the performance of these two operations using parallel and pipelined table functions of Oracle9i [17]. For spatial joins, table functions can be easily used to pipeline the result rows after a join of both indexes. Using table functions for index creation in quadtrees, data is divided into smaller subsets and the subsets tessellated in parallel. Likewise in R-trees, subtrees are constructed on subsets of data in parallel and merged at the end. In this paper, we present performance improvements for index creation and spatial joins using this approach. Spatial index-based joins on real GIS datasets are faster by a factor of 6 times in comparison to a nested-loop join. Index creation improves by a factor of 2.6 on 4 processors. In summary, this work complements prior work on spatial query optimization and provides useful insight into implementation of domain-specific indexes in commercial databases. The rest of the paper is organized as follows. Section 2 gives a brief overview of table functions in Oracle9i. Section 3 describes Oracle Spatial functionality. Section 4 describes implementation of R-tree Spatial Joins using table functions. We compare the performance of spatial joins using nested-loop and index-based scan methods and discuss some related issues. Section 5 describes parallel index creation using parallel table functions. We present some results from creating spatial indexes on real datasets. The final section summarizes the results. 851

2 2 Parallel and Pipelined Table Functions in Oracle Most applications such as data warehousing require a transient table or collection that can be operated as regular database tables. To support such processing, most commercial database vendors including IBM and Oracle have implemented table functions. Table functions return a collection-type instance that can be cast to a table of appropriate columns and queried using regular SQL queries. In Oracle9i [17], table functions allow for iteratively fetching result rows and for parallel processing of the computation and row fetching. Several applications such as Oracle Spatial, OLAP, and Oracle Data Mining have implemented their functionality using this support for table functions. Table functions are functions that can produce a set of rows as output. In other words, table functions return a collection type instance (nested table and VARRAY datatypes). Users can use a table function in place of a regular table in the FROM clause of a SQL statement as in the following example: select * from TABLE(spatial_join(tab1, col1, tab2, col2, intersect )); The spatial join function is a table function that could return the rowids of the tables tab1, tab2 whenever the geometries in columns col1, col2 satisfy a specified relationship such as intersection. The function could be implemented either in C/Java (or PL/SQL 1 ) using a start-fetchclose methodology to perform the function (or part of it) in the start routine, iteratively return the result rows in the fetch routine and release memory resources in the close routine. Note that such iterative fetching of result rows (referred to as pipelining here) is essentialto supporttablefunctionsthat return a large set of rows that cannot fit in memory. In addition to pipelining of result rows in table functions, parallel execution of a function is supported by allowing functions to directly accept a set of rows (a cursor) corresponding to a sub-query operand providing a mechanism that allows a set of input rows to be partitioned across multiple instances of a parallel function Given this model for supporting pipelining of results and parallelizing an operation, in the next section we describe its application in supporting R-tree spatial joins and parallel index creation. 1 A different approach is used in PL/SQL 3 Oracle Spatial Oracle Spatial models 2-4 dimensional spatial data using an sdo geometry data type. For the 2-dimensional case, this data type models all the spatial data types defined by the Open GIS Consortium (OGC) and caters to most data occurring in GIS, CAD/CAM applications. Supported spatial data includes simple primitive elements such as points, lines, curves, polygons (with and without holes), and complex elements that are made up of a combination of primitive elements. The sdo geometry data type is implemented as an Oracle object datatype. This approach extends all the benefits of Oracle s object-relational database technology including replication to spatial data. Quadtree and R-tree indexes on spatial data are implemented using the extensible indexing framework of Oracle [5, 20]. This framework allows for the creation of new domain-specific indexes and associated query operators and provides for the integration of user-specified query, update and index creation routines inside Oracle server. Oracle Spatial supports a spatial index indextype for indexing spatial data. Quadtree and R-tree indexes are supported as part of this spatial index indextype. Since these indexes are implemented as part of the extensible indexing framework, spatial indexes can be easily created on sdo geometry columns of database tables using an extended SQL syntax. As part of such index creation, the corresponding spatial index creation routines are executed and the constructed spatial index is stored in the database as a spatial index table. The index table stores index information such as R-tree nodes in the case of R-trees and Quadtree tiles in the case of Quadtrees. The metadata for the entire index is stored as a row in a separate metadata table. This metadata includes the name of the index table storing the index, dimensionality, root pointer fanout parameters for an R-tree and the tiling level parameter for a Quadtree index. In addition to SQL-level index creation, inserts and updates to database tables that have a spatial index also automatically trigger an update of the corresponding spatial indexes. In addition to these advantages, extensible indexing also ensures statement or session-level concurrency and table-level recovery. To query the constructed spatial indexes, new predicates, referred to as operators, are defined. These operators can be included in the where clause of a SQL statement to select data that satisfy a specified query criterion with respect to a specified query window. Such operators are executed using index-associated procedures for query processing and allow for incremental processing of queries (see [20, 5, 22] for more details). Queries have been optimized in prior work [21]. In the next sections, we examine how to improve the performance of R-tree joins and index creation using pipelined and parallel table functions. 852

3 4 R-tree Spatial Joins Spatial joins select pairs of rows from two tables based on their spatial interaction. For example, a query could identify the number of pairs of geometries from the cities and rivers tables that intersect each other as follows: select count(*) from city_table a, river_table b where sdo_relate( a.city_geom, b.river_geom, intersect )= TRUE ; There are two ways to compute such joins: First approach is to iterate on the first table (cities) performing a spatial query on the second table (rivers) using each geometry in the first table. This is the nested-loop approach. The second approach is to traverse the associated spatial indexes on both the tables together [10, 26] and identify interacting geometries This is referred to as index-based spatial join approach. As with all other B-tree joins, index-based spatial join approach is faster than nested-loop join approach. However, until Oracle9i, there is no efficient mechanism to return pairs of rowids (rowids of first and second table) in Oracle. Currently, spatial joins can be rewritten and evaluated using table functions of Oracle9i as follows. The table names and geometry column names along with the interaction-type can be passed in to a spatial join function that returns the pair of rowids of the interacting geometries from the indexes of the two tables. select count(*) from city_table a, river_table b where (a.rowid, b.rowid) in (select rid1, rid2 from TABLE(spatial_join( city_table, city_geom, river_table, river_geom, intersect ))); 4.1 Parallelizing Spatial Join The drawback of the above approach is that it only has a single input stream and does not use the parallelism available through Oracle table function technology. For instance, if the two indexes are rooted at R1 and S1 as shown in Figure 1, the above approach will invoke one join operation of the trees rooted at R1 and S1. To better avail of the tablefunction-level parallelism, we modify our approach to perform a spatial-join of subtrees of the R-tree indexes. To this end, we descend each index by a certain level and identify the roots of the subtrees at that level and join the subtrees. For instance, if we descend by one level in Figure 1, this will result in 4 joins of the following subtree pairs:,, and. In general, we descend both trees as far below as to get appropriate number of subtree-joins. The spatial-join function is modified to include a cursor returning a set of R-tree subtree roots as follows: select count(*) from city a, river b where (a.rowid, b.rowid) in (select rid1, rid2 from TABLE(spatial_join( CURSOR(select * from table(subtree_root( city_table_index, level)), table(subtree_root( river_table_index, level))), city_table, city_geom, river_table, river_geom, intersect ))); 4.2 Evaluation using Pipelined Table Functions Spatial join is evaluated using the start-fetch-close interface of Oracle pipelined table functions. In the start method, the metadata of the two R-tree indexes that need to be joined is loaded and the subtree roots of the R-tree indexes (that are passed in as parameters to the spatial join function) are pushed onto a stack. In each fetch call, the spatial join processing is resumed using the contents of the stack and as many result join rowids are determined as specified in the fetch call by joining the two R-tree indexes. Once there are no more result rowids to be returned, the fetch call returns an empty collection and the memory resources are cleaned up in the subsequent close call. Next we describe the join processing in each fetch call in more detail. Since the data are arbitrarily complex geometry data, the join has to be evaluated in a 2-stage fashion. First the indexbased MBRs are compared for intersection with each other. An array of candidate pairs of geometries are computed using the two indexes. The size of this array is determined by existing memory resources. Once the candidate array is processed, the array is filled by resuming the index-based join of the two R-trees. Each candidate pair of geometries in the array are processed by first fetching the exact geometries from the two tables and then comparing them using a secondary (geometry-geometry) filter. Shekhar et al. [26] note that the right order of fetching the geometries is important for performance and the problem is NP-complete. Instead of a random order of fetching the geometries, sorting the candidate pair based on the first rowid is much better and expected to be within 20% of the best approximate solutions. We adopt this approach in Oracle Spatial. 853

4 R1 S1 R11 R12 S11 S12 Index of First Table (cities) Index of Second Table(rivers) Join Pairs of Subtrees for Parallelism (R11, S11), (R11, S12), (R12, S11), (R12, S12) Figure 1. Joining Two Spatial Indexes. 4.3 Experiments We examine the performance of index-based spatial-join on two real datasets: Counties and Star-clusters. We describe these datasets and compare the performance of the nested-loop and index-based spatial join for each of these datasets in turn. These experiments are conducted using alpha version of Oracle10i on a Sun 400MHz 4-CPU machine with 1GB memory. The first dataset contains the geometries for the 3230 counties in the Unites States. This data is joined with itself by specifying either intersection (distance of 0) or by specifying a distance. Table 1 reports the results. Distance Result Nested Spatial Index Size Loop Join s 144.7s s 221.9s s 271.8s s 331.4s Table 1. Comparison of join times using Nested-loop Join, Spatial-index Join for Counties data. Spatial-index Join is 33-55% faster. Next we examined the join performance for different sizes of the dataset and using parallel processing using the second dataset. The second dataset is 250K data about star locations/clusters in a cross-section of the sky (customer data publicly available). We varied the dataset size from 25 to 250K by choosing subsets of the original 250K data. We performed a self-join of each subset and examined the performance of spatial using (1) nested-loop based evaluation, (2) index-based join on 1 processor (I1), and (3) index-based join on 2 processors (I2). Table 2 shows the join query response time for each dataset size. For small dataset sizes of 25 polygons, the nested-loop method performs the same as the index-based join on one processor. This is because of the relatively small size of the dataset and the result sets. However, as we increase the dataset size, the result set size increases and we observe that the nested-loop method is nearly 6 times slower in most cases compared to the index-based join(i1). The gains from parallel processing are nearly 50% for most dataset sizes. Data Result Nested Index Index size size loop Join(1) Join(2) s 6.2s 3.47s s 3.5s 2.23s s 10.3s 7.2s s 83s 70s s 864s 676s Table 2. Comparison of join times using Nested-loop-based Join, Index-based Join on 1 and 2 processors for different dataset sizes. Index-based join using table functions is nearly 6 times faster. 854

5 Geometry Table Table fn Partitioning Tesselate Tesselate Index Table cessors. We used the US Block-groups data consisting of about 230K arbitrarily-shaped complex polygon geometries. Table 3 illustrates these results from 1 to 4 processors for both Quadtree and R-tree indexes. Since the geometries are large and complex, the Quadtree creation time is high compared to R-trees. We observe that index creation speeds up by a factor of 2.6 on 4 processors for Quadtree. Compared to that, R-tree creation does not involve expensive tessellation and is faster even in the sequential case and speeds up by a factor of 1.8. Tesselate Figure 2. Parallelizing Quadtree Index Creation. 5 Parallel Index Creation In this section, we describe how Quadtree and R-tree index creation can be parallelized using table functions. Quadtree index creation consists of the following steps: 1. For each data geometry, tessellate the geometry into tiles and store these tiles in an index table. 2. Construct B-tree indexes on the codes for the tiles. In order to parallelize the index creation operation, we need to parallelize the tessellation of geometries and create parallel B-tree indexes. The latter part is performed by specifying the parallel clause of a B-tree index statement in Oracle. In order to parallelize the tessellation which happens to be a substantial portion of the index creation time for large complex polygon geometries, we use a table function that takes as input a cursor for fetching the geometries and tessellates these geometries (and inserts the tiles in a specified table). This process is illustrated in Figure 2. Since parallel table functions partition the input cursor based on the specified operation-level parallelism, the tessellation process is performed in parallel on subsets of the input geometries from the table. Analogous to Quadtree construction, R-tree creation is also parallelized by using parallel table functions (1) to load the geometry data and compute minimum bounding rectangles, and (2) to cluster subtrees in parallel. 5.1 Experiments In this section, we describe some experimental results to compare the index creation performance on multiple pro- Number of Quadtree Creation R-tree Creation Processors time time s 454s s 296s s 258s Table 3. Parallel Quadtree and R-tree creation times using table functions: Speedup of up to 2.6 on 4 processors. 6 Conclusions In this paper, we examined how to improve two expensive operations in Oracle Spatial: R-tree spatial joins and parallel index creation. We described how parallel and pipelined table function technology can be used to perform spatial joins efficiently using the two associated R-tree indexes. We also examined the effect of using table functions to parallelize index creation. Both operations improved in performance by several factors compared to prior versions that did not support table functions. This demonstrates the effectiveness of parallel and pipelined table functions as a building technology tool to efficiently support complex domain-specific operations such as spatial joins and index creation. References [1] N. Beckmann, H. Kriegel, R. Schneider, and B. Seeger. The R* tree: An efficient and robust access method for points and rectangles. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages , [2] S. Berchtold, D. A. Keim, and H. P. Kreigel. The X-tree: An index structure for high dimensional data. Procȯf the Int. Conf. on Very Large Data Bases, [3] S. Berchtold, D. A. Keim, H.-P.Kriegel, and T. Seidl. A new technique for nearest neighbor search in high-dimensional space. IEEE Trans. on Knowledge and Data Engineering, 12(1):45 57,

6 [4] T. Brinkhoff, H. Horn, H. P. Kriegel, and R. Schneider. A storage and access architecture for efficient query processing in spatial database systems. In Symposium on Large Spatial Databases (SSD 93), LNCS 692, [5] S. Defazio, A. Daoud, L. A. Smith, and J. Srinivasan. Integrating ir and rdbms using cooperative indexing. In Proc. of ACM SIGIR Conf. on Information Retrieval, pages 84 92, [6] H. Ferhatosmanoglu, E. Tuncel, D. Agrawal, and A. E. Abbadi. Approximate nearest neighbor searching in multimedia databases. In Proc. Int. Conf. on Data Engineering, pages , [7] Y. J. Garcia, S. T. Leutenegger, and M. A. Lopez. A greedy algorithm for bulk loading R-trees. In Proc. of ACM GIS, [8] A. Guttman. R-trees: A dynamic index structure for spatial searching. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 47 57, [9] G. Hjaltson and H. Samet. Ranking in spatial databases. In Symposium on Spatial Databases (SSD), [10] Y.-W. Huang, N. Jing, and E. A. Rundensteiner. Spatial joins using r-trees: Breadth-first traversal with global optimizations. In Procȯf the Int. Conf. on Very Large Data Bases, pages , [11] N. Katayama and S. Satoh. The SR-tree: An index structure for high-dimensional nearest-neighbor queries. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages , May [12] M. Kornacker, C. Mohan, and J. Hellerstein. Concurrency and recovery in GiST. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 62 72, Tucson, Arizon, June [13] S. T. Leutenegger, M. A. Lopez, and J. M. Edgington. STR: A simple and efficient algorithm for R-tree packing. In Proc. Int. Conf. on Data Engineering, [14] K.-I. Lin, H. V. Jagdish, and C. Faloutsos. The TV-tree: An index structure for high-dimensional data. VLDB Journal, 3: , [15] D. B. Lomet and B. Salzberg. The hb-tree: A multiattribute indexing method with good guaranteed performance. Proc. ACM Symp. on Transactions of Database Systems, 15(4): , December [16] B. C. Ooi, C. Yu, K. L. Tan, and H. V. Jagadish. Indexing the distance: an efficient method to knn processing. In Procȯf the Int. Conf. on Very Large Data Bases, [17] Oracle Press. Parallel and Pipelined Table Functions. In Oracle9i SQL Reference Documentation, [18] D. Papadis, T. Sellis, Y. Theodoridis, and M. Egenhofer. Topological relations in the world of minimum bounding rectangles: a study with r-trees. In Proc. ACM SIGMOD Int. Conf. on Management of Data, pages , [19] K. V. Ravi Kanth, D. Agrawal, Amr El Abbadi, and Ambuj K. Singh. Dimensionality reduction for similarity searching in dynamic databases. In Proc. ACM SIGMOD Int. Conf. on Management of Data, [20] K. V. Ravi Kanth, Siva Ravada, J. Sharma, and J. Banerjee. Indexing medium-dimensionality data in oracle. In Proc. ACM SIGMOD Int. Conf. on Management of Data, [21] Ravi Kanth V Kothuri and Siva Ravada. Efficient processing of large spatial queries using interior approximations. In Symposium on Spatial and Temporal Databases (SSTD), [22] Ravi Kanth V Kothuri, Siva Ravada, and Daniel Abugov. Quadtree and r-trees in oracle spatial: A comparison using gis data. In Proc. ACM SIGMOD Int. Conf. on Management of Data, [23] H. Samet. Recent developments in linear quadtree-based geographic information systems. Image and Vision Computing, 5(3): , Aug [24] H. Samet. The design and analysis of spatial data structures. Addison-Wesley Publishing Co., [25] T. Sellis, N. Roussopoulos, and C. Faloutsos. The r -tree: A dynamic index for multi-dimensional objects. Procȯf the Int. Conf. on Very Large Data Bases, 13: , [26] S. Shekhar, C. Lu, S. Chawla, and S. Ravada. Efficient join index based join processing; a clustering approach. IEEE Trans. on Knowledge and Data Engineering. [27] Y. Theodoridis and T. K. Sellis. Optimization issues in r-tree construction. In Geographic Information Systems (IGIS), pages , [28] Y. Theodoridis and T. K. Sellis. A model for the prediction of r-tree performance. In Proc. ACM Symp. on Principles of Database Systems, [29] F. Wang. Relational-linear quadtree approach for twodimensional spatial representation and manipulation. IEEE Trans. on Knowledge and Data Engineering, 3(1): , Mar [30] D. White and R. Jain. Algorithms and strategies for similarity retrieval. Proc. of the SPIE Conference, [31] D. White and R. Jain. Similarity indexing with the SS-tree. Proc. Int. Conf. on Data Engineering, pages ,

Efficient Processing of Large Spatial Queries Using Interior Approximations

Efficient Processing of Large Spatial Queries Using Interior Approximations Efficient Processing of Large Spatial Queries Using Interior Approximations Ravi K. Kothuri and Siva Ravada Spatial Technologies, NEDC Oracle Corporation, Nashua NH 03062 {Ravi.Kothuri,Siva.Ravada}@oracle.com

More information

User accesses business site. Recommendations Engine. Recommendations to user 3 Data Mining for Personalization

User accesses business site. Recommendations Engine. Recommendations to user 3 Data Mining for Personalization Personalization and Location-based Technologies for E-Commerce Applications K. V. Ravi Kanth, and Siva Ravada Spatial Technologies, NEDC, Oracle Corporation, Nashua NH 03062. fravi.kothuri, Siva.Ravadag@oracle.com

More information

X-tree. Daniel Keim a, Benjamin Bustos b, Stefan Berchtold c, and Hans-Peter Kriegel d. SYNONYMS Extended node tree

X-tree. Daniel Keim a, Benjamin Bustos b, Stefan Berchtold c, and Hans-Peter Kriegel d. SYNONYMS Extended node tree X-tree Daniel Keim a, Benjamin Bustos b, Stefan Berchtold c, and Hans-Peter Kriegel d a Department of Computer and Information Science, University of Konstanz b Department of Computer Science, University

More information

Fast Similarity Search for High-Dimensional Dataset

Fast Similarity Search for High-Dimensional Dataset Fast Similarity Search for High-Dimensional Dataset Quan Wang and Suya You Computer Science Department University of Southern California {quanwang,suyay}@graphics.usc.edu Abstract This paper addresses

More information

Indexing Non-uniform Spatial Data

Indexing Non-uniform Spatial Data Indexing Non-uniform Spatial Data K. V. Ravi Kanth Divyakant Agrawal Amr El Abbadi Ambuj K. Singh Department of Computer Science University of California at Santa Barbara Santa Barbara, CA 93106 Abstract

More information

An index structure for efficient reverse nearest neighbor queries

An index structure for efficient reverse nearest neighbor queries An index structure for efficient reverse nearest neighbor queries Congjun Yang Division of Computer Science, Department of Mathematical Sciences The University of Memphis, Memphis, TN 38152, USA yangc@msci.memphis.edu

More information

Using Natural Clusters Information to Build Fuzzy Indexing Structure

Using Natural Clusters Information to Build Fuzzy Indexing Structure Using Natural Clusters Information to Build Fuzzy Indexing Structure H.Y. Yue, I. King and K.S. Leung Department of Computer Science and Engineering The Chinese University of Hong Kong Shatin, New Territories,

More information

V Simpósio Brasileiro de Geoinformática (GEOINFO 2003), Campos do Jordão (SP), Efficient Query Processing on the Relational Quadtree

V Simpósio Brasileiro de Geoinformática (GEOINFO 2003), Campos do Jordão (SP), Efficient Query Processing on the Relational Quadtree V Simpósio Brasileiro de Geoinformática (GEOINFO 23), Campos do Jordão (SP), 23. Efficient Query Processing on the Relational Quadtree HANS-PETER KRIEGEL, PETER KUNATH, MARTIN PFEIFLE, MATTHIAS RENZ University

More information

High Expressive Spatio-temporal Relations

High Expressive Spatio-temporal Relations From: FLAIRS-02 Proceedings. Copyright 2002, AAAI (www.aaai.org). All rights reserved. High Expressive Spatio-temporal Relations R. CHBEIR, Y. AMGHAR, A. FLORY LISI INSA de Lyon 20, Avenue A. Einstein

More information

So, we want to perform the following query:

So, we want to perform the following query: Abstract This paper has two parts. The first part presents the join indexes.it covers the most two join indexing, which are foreign column join index and multitable join index. The second part introduces

More information

SPATIAL RANGE QUERY. Rooma Rathore Graduate Student University of Minnesota

SPATIAL RANGE QUERY. Rooma Rathore Graduate Student University of Minnesota SPATIAL RANGE QUERY Rooma Rathore Graduate Student University of Minnesota SYNONYMS Range Query, Window Query DEFINITION Spatial range queries are queries that inquire about certain spatial objects related

More information

Experimental Evaluation of Spatial Indices with FESTIval

Experimental Evaluation of Spatial Indices with FESTIval Experimental Evaluation of Spatial Indices with FESTIval Anderson Chaves Carniel 1, Ricardo Rodrigues Ciferri 2, Cristina Dutra de Aguiar Ciferri 1 1 Department of Computer Science University of São Paulo

More information

Visualizing and Animating Search Operations on Quadtrees on the Worldwide Web

Visualizing and Animating Search Operations on Quadtrees on the Worldwide Web Visualizing and Animating Search Operations on Quadtrees on the Worldwide Web František Brabec Computer Science Department University of Maryland College Park, Maryland 20742 brabec@umiacs.umd.edu Hanan

More information

V Simpósio Brasileiro de Geoinformática (GEOINFO 2003), Campos do Jordão (SP), Stochastic Driven Relational R-Tree

V Simpósio Brasileiro de Geoinformática (GEOINFO 2003), Campos do Jordão (SP), Stochastic Driven Relational R-Tree V Simpósio Brasileiro de Geoinformática (GEOINFO 003, Campos do Jordão (SP, 003. Stochastic Driven Relational R-Tree HANS-PETER KRIEGEL, PETER KUNATH, MARTIN PFEIFLE, MARCO PÖTKE, MATTHIAS RENZ, PETRA-MARIA

More information

Spatiotemporal Access to Moving Objects. Hao LIU, Xu GENG 17/04/2018

Spatiotemporal Access to Moving Objects. Hao LIU, Xu GENG 17/04/2018 Spatiotemporal Access to Moving Objects Hao LIU, Xu GENG 17/04/2018 Contents Overview & applications Spatiotemporal queries Movingobjects modeling Sampled locations Linear function of time Indexing structure

More information

Two Ellipse-based Pruning Methods for Group Nearest Neighbor Queries

Two Ellipse-based Pruning Methods for Group Nearest Neighbor Queries Two Ellipse-based Pruning Methods for Group Nearest Neighbor Queries ABSTRACT Hongga Li Institute of Remote Sensing Applications Chinese Academy of Sciences, Beijing, China lihongga lhg@yahoo.com.cn Bo

More information

Boolean Bounding Predicates for Spatial Access Methods

Boolean Bounding Predicates for Spatial Access Methods Boolean Bounding Predicates for Spatial Access Methods (Extended Abstract) Megan Thomas and Joseph M. Hellerstein 1 Computer Science Division, University of California, Berkeley {mct, jmh}@cs.berkeley.edu

More information

Introduction to Spatial Database Systems

Introduction to Spatial Database Systems Introduction to Spatial Database Systems by Cyrus Shahabi from Ralf Hart Hartmut Guting s VLDB Journal v3, n4, October 1994 Data Structures & Algorithms 1. Implementation of spatial algebra in an integrated

More information

Striped Grid Files: An Alternative for Highdimensional

Striped Grid Files: An Alternative for Highdimensional Striped Grid Files: An Alternative for Highdimensional Indexing Thanet Praneenararat 1, Vorapong Suppakitpaisarn 2, Sunchai Pitakchonlasap 1, and Jaruloj Chongstitvatana 1 Department of Mathematics 1,

More information

Efficient Spatial Query Processing in Geographic Database Systems

Efficient Spatial Query Processing in Geographic Database Systems Efficient Spatial Query Processing in Geographic Database Systems Hans-Peter Kriegel, Thomas Brinkhoff, Ralf Schneider Institute for Computer Science, University of Munich Leopoldstr. 11 B, W-8000 München

More information

Improving the Query Performance of High-Dimensional Index Structures by Bulk Load Operations

Improving the Query Performance of High-Dimensional Index Structures by Bulk Load Operations Improving the Query Performance of High-Dimensional Index Structures by Bulk Load Operations Stefan Berchtold, Christian Böhm 2, and Hans-Peter Kriegel 2 AT&T Labs Research, 8 Park Avenue, Florham Park,

More information

ISSUES IN SPATIAL DATABASES AND GEOGRAPHICAL INFORMATION SYSTEMS (GIS) HANAN SAMET

ISSUES IN SPATIAL DATABASES AND GEOGRAPHICAL INFORMATION SYSTEMS (GIS) HANAN SAMET zk0 ISSUES IN SPATIAL DATABASES AND GEOGRAPHICAL INFORMATION SYSTEMS (GIS) HANAN SAMET COMPUTER SCIENCE DEPARTMENT AND CENTER FOR AUTOMATION RESEARCH AND INSTITUTE FOR ADVANCED COMPUTER STUDIES UNIVERSITY

More information

A Parallel Access Method for Spatial Data Using GPU

A Parallel Access Method for Spatial Data Using GPU A Parallel Access Method for Spatial Data Using GPU Byoung-Woo Oh Department of Computer Engineering Kumoh National Institute of Technology Gumi, Korea bwoh@kumoh.ac.kr Abstract Spatial access methods

More information

Indexing Techniques 3 rd Part

Indexing Techniques 3 rd Part Indexing Techniques 3 rd Part Presented by: Tarik Ben Touhami Supervised by: Dr. Hachim Haddouti CSC 5301 Spring 2003 Outline! Join indexes "Foreign column join index "Multitable join index! Indexing techniques

More information

AN OVERVIEW OF SPATIAL INDEXING WITHIN RDBMS

AN OVERVIEW OF SPATIAL INDEXING WITHIN RDBMS AN OVERVIEW OF SPATIAL INDEXING WITHIN RDBMS ADD SUBTITLE IN ALL CAPS DAVID DEHAAN SQL ANYWHERE QUERY PROCESSING TEAM SYBASE THURSDAY 9 FEB 2012 CHERITON SCHOOL OF COMPUTER SCIENCE, CS 448/648 OUTLINE

More information

Clustering For Similarity Search And Privacyguaranteed Publishing Of Hi-Dimensional Data Ashwini.R #1, K.Praveen *2, R.V.

Clustering For Similarity Search And Privacyguaranteed Publishing Of Hi-Dimensional Data Ashwini.R #1, K.Praveen *2, R.V. Clustering For Similarity Search And Privacyguaranteed Publishing Of Hi-Dimensional Data Ashwini.R #1, K.Praveen *2, R.V.Krishnaiah *3 #1 M.Tech, Computer Science Engineering, DRKIST, Hyderabad, Andhra

More information

Extending Rectangle Join Algorithms for Rectilinear Polygons

Extending Rectangle Join Algorithms for Rectilinear Polygons Extending Rectangle Join Algorithms for Rectilinear Polygons Hongjun Zhu, Jianwen Su, and Oscar H. Ibarra University of California at Santa Barbara Abstract. Spatial joins are very important but costly

More information

Constrained Nearest Neighbor Queries

Constrained Nearest Neighbor Queries Constrained Nearest Neighbor Queries Hakan Ferhatosmanoglu, Ioanna Stanoi, Divyakant Agrawal, and Amr El Abbadi Computer Science Department, University of California at Santa Barbara {hakan,ioana,agrawal,amr}@csucsbedu

More information

Processing a multimedia join through the method of nearest neighbor search

Processing a multimedia join through the method of nearest neighbor search Information Processing Letters 82 (2002) 269 276 Processing a multimedia join through the method of nearest neighbor search Harald Kosch a,, Solomon Atnafu b a Institute of Information Technology, University

More information

Chapter 1, Introduction

Chapter 1, Introduction CSI 4352, Introduction to Data Mining Chapter 1, Introduction Young-Rae Cho Associate Professor Department of Computer Science Baylor University What is Data Mining? Definition Knowledge Discovery from

More information

Distributed k-nn Query Processing for Location Services

Distributed k-nn Query Processing for Location Services Distributed k-nn Query Processing for Location Services Jonghyeong Han 1, Joonwoo Lee 1, Seungyong Park 1, Jaeil Hwang 1, and Yunmook Nah 1 1 Department of Electronics and Computer Engineering, Dankook

More information

Spatial Data Management

Spatial Data Management Spatial Data Management [R&G] Chapter 28 CS432 1 Types of Spatial Data Point Data Points in a multidimensional space E.g., Raster data such as satellite imagery, where each pixel stores a measured value

More information

Using the Holey Brick Tree for Spatial Data. in General Purpose DBMSs. Northeastern University

Using the Holey Brick Tree for Spatial Data. in General Purpose DBMSs. Northeastern University Using the Holey Brick Tree for Spatial Data in General Purpose DBMSs Georgios Evangelidis Betty Salzberg College of Computer Science Northeastern University Boston, MA 02115-5096 1 Introduction There is

More information

DeLiClu: Boosting Robustness, Completeness, Usability, and Efficiency of Hierarchical Clustering by a Closest Pair Ranking

DeLiClu: Boosting Robustness, Completeness, Usability, and Efficiency of Hierarchical Clustering by a Closest Pair Ranking In Proc. 10th Pacific-Asian Conf. on Advances in Knowledge Discovery and Data Mining (PAKDD'06), Singapore, 2006 DeLiClu: Boosting Robustness, Completeness, Usability, and Efficiency of Hierarchical Clustering

More information

A Real Time GIS Approximation Approach for Multiphase Spatial Query Processing Using Hierarchical-Partitioned-Indexing Technique

A Real Time GIS Approximation Approach for Multiphase Spatial Query Processing Using Hierarchical-Partitioned-Indexing Technique International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2017 IJSRCSEIT Volume 2 Issue 6 ISSN : 2456-3307 A Real Time GIS Approximation Approach for Multiphase

More information

Hardware Acceleration for Spatial Selections and Joins

Hardware Acceleration for Spatial Selections and Joins Acceleration for Spatial Selections and Joins Chengyu Sun Divyakant Agrawal Amr El Abbadi Department of Computer Science University of California, Santa Barbara Email:{cysun, agrawal, amr}@cs.ucsb.edu

More information

Indexing High-Dimensional Data for Content-Based Retrieval in Large Databases

Indexing High-Dimensional Data for Content-Based Retrieval in Large Databases Indexing High-Dimensional Data for Content-Based Retrieval in Large Databases Manuel J. Fonseca, Joaquim A. Jorge Department of Information Systems and Computer Science INESC-ID/IST/Technical University

More information

Spatial Data Management

Spatial Data Management Spatial Data Management Chapter 28 Database management Systems, 3ed, R. Ramakrishnan and J. Gehrke 1 Types of Spatial Data Point Data Points in a multidimensional space E.g., Raster data such as satellite

More information

On Nearest Neighbor Indexing of Nonlinear Trajectories

On Nearest Neighbor Indexing of Nonlinear Trajectories On Nearest Neighbor Indexing of Nonlinear Trajectories Charu C. Aggarwal IBM T. J. Watson Research Center 19 Skyline Drive Hawthorne, NY 10532 charu@us.ibm.com Dakshi Agrawal IBM T. J. Watson Research

More information

Benchmarking the UB-tree

Benchmarking the UB-tree Benchmarking the UB-tree Michal Krátký, Tomáš Skopal Department of Computer Science, VŠB Technical University of Ostrava, tř. 17. listopadu 15, Ostrava, Czech Republic michal.kratky@vsb.cz, tomas.skopal@vsb.cz

More information

Nearest Neighbor Search on Vertically Partitioned High-Dimensional Data

Nearest Neighbor Search on Vertically Partitioned High-Dimensional Data Nearest Neighbor Search on Vertically Partitioned High-Dimensional Data Evangelos Dellis, Bernhard Seeger, and Akrivi Vlachou Department of Mathematics and Computer Science, University of Marburg, Hans-Meerwein-Straße,

More information

Surrounding Join Query Processing in Spatial Databases

Surrounding Join Query Processing in Spatial Databases Surrounding Join Query Processing in Spatial Databases Lingxiao Li (B), David Taniar, Maria Indrawan-Santiago, and Zhou Shao Monash University, Melbourne, Australia lli278@student.monash.edu, {david.taniar,maria.indrawan,joe.shao}@monash.edu

More information

A Pivot-based Index Structure for Combination of Feature Vectors

A Pivot-based Index Structure for Combination of Feature Vectors A Pivot-based Index Structure for Combination of Feature Vectors Benjamin Bustos Daniel Keim Tobias Schreck Department of Computer and Information Science, University of Konstanz Universitätstr. 10 Box

More information

Summary. 4. Indexes. 4.0 Indexes. 4.1 Tree Based Indexes. 4.0 Indexes. 19-Nov-10. Last week: This week:

Summary. 4. Indexes. 4.0 Indexes. 4.1 Tree Based Indexes. 4.0 Indexes. 19-Nov-10. Last week: This week: Summary Data Warehousing & Data Mining Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de Last week: Logical Model: Cubes,

More information

Spatial Index Keyword Search in Multi- Dimensional Database

Spatial Index Keyword Search in Multi- Dimensional Database Spatial Index Keyword Search in Multi- Dimensional Database Sushma Ahirrao M. E Student, Department of Computer Engineering, GHRIEM, Jalgaon, India ABSTRACT: Nearest neighbor search in multimedia databases

More information

What we have covered?

What we have covered? What we have covered? Indexing and Hashing Data warehouse and OLAP Data Mining Information Retrieval and Web Mining XML and XQuery Spatial Databases Transaction Management 1 Lecture 6: Spatial Data Management

More information

Search K Nearest Neighbors on Air

Search K Nearest Neighbors on Air Search K Nearest Neighbors on Air Baihua Zheng 1, Wang-Chien Lee 2, and Dik Lun Lee 1 1 Hong Kong University of Science and Technology Clear Water Bay, Hong Kong {baihua,dlee}@cs.ust.hk 2 The Penn State

More information

Data Warehousing & Data Mining

Data Warehousing & Data Mining Data Warehousing & Data Mining Wolf-Tilo Balke Kinda El Maarry Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de Summary Last week: Logical Model: Cubes,

More information

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

An Overview of various methodologies used in Data set Preparation for Data mining Analysis An Overview of various methodologies used in Data set Preparation for Data mining Analysis Arun P Kuttappan 1, P Saranya 2 1 M. E Student, Dept. of Computer Science and Engineering, Gnanamani College of

More information

CMSC724: Access Methods; Indexes 1 ; GiST

CMSC724: Access Methods; Indexes 1 ; GiST CMSC724: Access Methods; Indexes 1 ; GiST Amol Deshpande University of Maryland, College Park March 14, 2011 1 Partially based on notes from Joe Hellerstein Outline 1 Access Methods 2 B+-Tree 3 Beyond

More information

Optimization of Queries in Distributed Database Management System

Optimization of Queries in Distributed Database Management System Optimization of Queries in Distributed Database Management System Bhagvant Institute of Technology, Muzaffarnagar Abstract The query optimizer is widely considered to be the most important component of

More information

Development of Efficient & Optimized Algorithm for Knowledge Discovery in Spatial Database Systems

Development of Efficient & Optimized Algorithm for Knowledge Discovery in Spatial Database Systems Development of Efficient & Optimized Algorithm for Knowledge Discovery in Spatial Database Systems Kapil AGGARWAL, India Key words: KDD, SDBS, neighborhood graph, neighborhood path, neighborhood index

More information

The Effects of Dimensionality Curse in High Dimensional knn Search

The Effects of Dimensionality Curse in High Dimensional knn Search The Effects of Dimensionality Curse in High Dimensional knn Search Nikolaos Kouiroukidis, Georgios Evangelidis Department of Applied Informatics University of Macedonia Thessaloniki, Greece Email: {kouiruki,

More information

Distance Browsing in Spatial Databases

Distance Browsing in Spatial Databases Distance Browsing in Spatial Databases GÍSLI R. HJALTASON and HANAN SAMET University of Maryland We compare two different techniques for browsing through a collection of spatial objects stored in an R-tree

More information

Similarity Search in Time Series Databases

Similarity Search in Time Series Databases Similarity Search in Time Series Databases Maria Kontaki Apostolos N. Papadopoulos Yannis Manolopoulos Data Enginering Lab, Department of Informatics, Aristotle University, 54124 Thessaloniki, Greece INTRODUCTION

More information

Quadrant-Based MBR-Tree Indexing Technique for Range Query Over HBase

Quadrant-Based MBR-Tree Indexing Technique for Range Query Over HBase Quadrant-Based MBR-Tree Indexing Technique for Range Query Over HBase Bumjoon Jo and Sungwon Jung (&) Department of Computer Science and Engineering, Sogang University, 35 Baekbeom-ro, Mapo-gu, Seoul 04107,

More information

An Introduction to Spatial Databases

An Introduction to Spatial Databases An Introduction to Spatial Databases R. H. Guting VLDB Journal v3, n4, October 1994 Speaker: Giovanni Conforti Outline: a rather old (but quite complete) survey on Spatial DBMS Introduction & definition

More information

Code Transformation of DF-Expression between Bintree and Quadtree

Code Transformation of DF-Expression between Bintree and Quadtree Code Transformation of DF-Expression between Bintree and Quadtree Chin-Chen Chang*, Chien-Fa Li*, and Yu-Chen Hu** *Department of Computer Science and Information Engineering, National Chung Cheng University

More information

3. Data Storage and Index Structures

3. Data Storage and Index Structures 3 Data Storage and Index Structures 640 K ought to be enough for anybody Bill Gates One important property of DBMSs is their ability to handle large amounts of data In order to store and retrieve this

More information

NOVEL CACHE SEARCH TO SEARCH THE KEYWORD COVERS FROM SPATIAL DATABASE

NOVEL CACHE SEARCH TO SEARCH THE KEYWORD COVERS FROM SPATIAL DATABASE NOVEL CACHE SEARCH TO SEARCH THE KEYWORD COVERS FROM SPATIAL DATABASE 1 Asma Akbar, 2 Mohammed Naqueeb Ahmad 1 M.Tech Student, Department of CSE, Deccan College of Engineering and Technology, Darussalam

More information

Indexing and selection of data items in huge data sets by constructing and accessing tag collections

Indexing and selection of data items in huge data sets by constructing and accessing tag collections Indexing and selection of data items in huge data sets by constructing and accessing tag collections Sébastien Ponce CERN, Geneva LHCb Experiment sebastien.ponce@cern.ch tel +1-41-22-767-2143 Roger D.

More information

Multidimensional Data and Modelling - DBMS

Multidimensional Data and Modelling - DBMS Multidimensional Data and Modelling - DBMS 1 DBMS-centric approach Summary: l Spatial data is considered as another type of data beside conventional data in a DBMS. l Enabling advantages of DBMS (data

More information

An Efficient Approach for Color Pattern Matching Using Image Mining

An Efficient Approach for Color Pattern Matching Using Image Mining An Efficient Approach for Color Pattern Matching Using Image Mining * Manjot Kaur Navjot Kaur Master of Technology in Computer Science & Engineering, Sri Guru Granth Sahib World University, Fatehgarh Sahib,

More information

An Efficient Density Based Incremental Clustering Algorithm in Data Warehousing Environment

An Efficient Density Based Incremental Clustering Algorithm in Data Warehousing Environment An Efficient Density Based Incremental Clustering Algorithm in Data Warehousing Environment Navneet Goyal, Poonam Goyal, K Venkatramaiah, Deepak P C, and Sanoop P S Department of Computer Science & Information

More information

On Processing Location Based Top-k Queries in the Wireless Broadcasting System

On Processing Location Based Top-k Queries in the Wireless Broadcasting System On Processing Location Based Top-k Queries in the Wireless Broadcasting System HaRim Jung, ByungKu Cho, Yon Dohn Chung and Ling Liu Department of Computer Science and Engineering, Korea University, Seoul,

More information

Motivation. Powerful Database Primitives to Support High Performance Data Mining. Christian Böhm University for Health Informatics and Technology

Motivation. Powerful Database Primitives to Support High Performance Data Mining. Christian Böhm University for Health Informatics and Technology University for Health Informatics and Technology Powerful Database Primitives to Support High Performance Data Mining Tutorial, IEEE Int. Conf. on Data Mining, Dec/09/2002 Motivation 2 High Performance

More information

Database support for concurrent digital mock up

Database support for concurrent digital mock up Proceedings of the Tenth International IFIP TC5 WG-5.2; WG-5.3 Conference PROLAMAT 1998 Database support for concurrent digital mock up S. Berchtold, H. P. Kriegel, M. Pötke Institute for Computer Science,

More information

Space-partitioning Trees in PostgreSQL: Realization and Performance

Space-partitioning Trees in PostgreSQL: Realization and Performance Space-partitioning Trees in PostgreSQL: Realization and Performance Mohamed Y. Eltabakh Ramy Eltarras Walid G. Aref Computer Science Department, Purdue University {meltabak, rhassan, aref}@cs.purdue.edu

More information

Supporting Fuzzy Keyword Search in Databases

Supporting Fuzzy Keyword Search in Databases I J C T A, 9(24), 2016, pp. 385-391 International Science Press Supporting Fuzzy Keyword Search in Databases Jayavarthini C.* and Priya S. ABSTRACT An efficient keyword search system computes answers as

More information

Clustering for Mining in Large Spatial Databases

Clustering for Mining in Large Spatial Databases Published in Special Issue on Data Mining, KI-Journal, ScienTec Publishing, Vol. 1, 1998 Clustering for Mining in Large Spatial Databases Martin Ester, Hans-Peter Kriegel, Jörg Sander, Xiaowei Xu In the

More information

Optimal Dimension Order: A Generic Technique for the Similarity Join

Optimal Dimension Order: A Generic Technique for the Similarity Join 4th Int. Conf. on Data Warehousing and Knowledge Discovery (DaWaK) Aix-en-Provence, France, 2002. Optimal Dimension Order: A Generic Technique for the Similarity Join Christian Böhm 1, Florian Krebs 2,

More information

Introduction to Spatial Database Systems. Outline

Introduction to Spatial Database Systems. Outline Introduction to Spatial Database Systems by Cyrus Shahabi from Ralf Hart Hartmut Guting s VLDB Journal v3, n4, October 1994 1 Outline Introduction & definition Modeling Querying Data structures and algorithms

More information

Best Keyword Cover Search

Best Keyword Cover Search Vennapusa Mahesh Kumar Reddy Dept of CSE, Benaiah Institute of Technology and Science. Best Keyword Cover Search Sudhakar Babu Pendhurthi Assistant Professor, Benaiah Institute of Technology and Science.

More information

Computing Data Cubes Using Massively Parallel Processors

Computing Data Cubes Using Massively Parallel Processors Computing Data Cubes Using Massively Parallel Processors Hongjun Lu Xiaohui Huang Zhixian Li {luhj,huangxia,lizhixia}@iscs.nus.edu.sg Department of Information Systems and Computer Science National University

More information

Speeding up Bulk-Loading of Quadtrees

Speeding up Bulk-Loading of Quadtrees Speeding up Bulk-Loading of Quadtrees 0 Gísli R. Hjaltason 1 Hanan Samet 1 Yoram J. Sussmann 2 Computer Science Department, Center for Automation Research, and Institute for Advanced Computer Studies,

More information

SILC: Efficient Query Processing on Spatial Networks

SILC: Efficient Query Processing on Spatial Networks Hanan Samet hjs@cs.umd.edu Department of Computer Science University of Maryland College Park, MD 20742, USA Joint work with Jagan Sankaranarayanan and Houman Alborzi Proceedings of the 13th ACM International

More information

Oracle 9i Application Development and Tuning

Oracle 9i Application Development and Tuning Index 2NF, NOT 3NF or BCNF... 2:17 A Anomalies Present in this Relation... 2:18 Anomalies (Specific) in this Relation... 2:4 Application Design... 1:28 Application Environment... 1:1 Application-Specific

More information

Database Systems: Design, Implementation, and Management Tenth Edition. Chapter 8 Advanced SQL

Database Systems: Design, Implementation, and Management Tenth Edition. Chapter 8 Advanced SQL Database Systems: Design, Implementation, and Management Tenth Edition Chapter 8 Advanced SQL Objectives In this chapter, you will learn: How to use the advanced SQL JOIN operator syntax About the different

More information

Oracle Spatial. User s Guide and Reference. Release September 2000 Part Number A

Oracle Spatial. User s Guide and Reference. Release September 2000 Part Number A Oracle Spatial User s Guide and Reference Release 8.1.7 September 2000 Part Number A85337-01 Oracle Spatial User s Guide and Reference Part Number A85337-01 Release 8.1.7 Copyright 1997, 2000, Oracle Corporation.

More information

Extending High-Dimensional Indexing Techniques Pyramid and iminmax(θ) : Lessons Learned

Extending High-Dimensional Indexing Techniques Pyramid and iminmax(θ) : Lessons Learned Extending High-Dimensional Indexing Techniques Pyramid and iminmax(θ) : Lessons Learned Karthik Ganesan Pillai, Liessman Sturlaugson, Juan M. Banda, and Rafal A. Angryk Montana State University, Bozeman,

More information

Roadmap DB Sys. Design & Impl. Reference. Detailed roadmap. Spatial Access Methods - problem. z-ordering - Detailed outline.

Roadmap DB Sys. Design & Impl. Reference. Detailed roadmap. Spatial Access Methods - problem. z-ordering - Detailed outline. 15-721 D Sys. Design & Impl. Z-ordering Christos Faloutsos www.cs.cmu.edu/~christos Roadmap 1) Roots: System R and Ingres 2) Implementation: buffering, indexing, q-opt 3) Transactions: locking, recovery

More information

CMPUT 391 Database Management Systems. Spatial Data Management. University of Alberta 1. Dr. Jörg Sander, 2006 CMPUT 391 Database Management Systems

CMPUT 391 Database Management Systems. Spatial Data Management. University of Alberta 1. Dr. Jörg Sander, 2006 CMPUT 391 Database Management Systems CMPUT 391 Database Management Systems Spatial Data Management University of Alberta 1 Spatial Data Management Shortcomings of Relational Databases and ORDBMS Modeling Spatial Data Spatial Queries Space-Filling

More information

Indexing Fast Moving Objects for KNN Queries Based on Nearest. Landmarks

Indexing Fast Moving Objects for KNN Queries Based on Nearest. Landmarks Indexing Fast Moving Objects for KNN Queries Based on Nearest Landmarks Dan Lin 1 Rui Zhang 1 Aoying Zhou 2 1 Department of Computer Science The National University of Singapore, Singapore {lindan, zhangru1}@comp.nus.edu.sg

More information

SEQUENTIAL PATTERN MINING FROM WEB LOG DATA

SEQUENTIAL PATTERN MINING FROM WEB LOG DATA SEQUENTIAL PATTERN MINING FROM WEB LOG DATA Rajashree Shettar 1 1 Associate Professor, Department of Computer Science, R. V College of Engineering, Karnataka, India, rajashreeshettar@rvce.edu.in Abstract

More information

Search Space Reductions for Nearest-Neighbor Queries

Search Space Reductions for Nearest-Neighbor Queries Search Space Reductions for Nearest-Neighbor Queries Micah Adler 1 and Brent Heeringa 2 1 Department of Computer Science, University of Massachusetts, Amherst 140 Governors Drive Amherst, MA 01003 2 Department

More information

Introduction to Indexing R-trees. Hong Kong University of Science and Technology

Introduction to Indexing R-trees. Hong Kong University of Science and Technology Introduction to Indexing R-trees Dimitris Papadias Hong Kong University of Science and Technology 1 Introduction to Indexing 1. Assume that you work in a government office, and you maintain the records

More information

Module 4: Tree-Structured Indexing

Module 4: Tree-Structured Indexing Module 4: Tree-Structured Indexing Module Outline 4.1 B + trees 4.2 Structure of B + trees 4.3 Operations on B + trees 4.4 Extensions 4.5 Generalized Access Path 4.6 ORACLE Clusters Web Forms Transaction

More information

Principles of Data Management. Lecture #14 (Spatial Data Management)

Principles of Data Management. Lecture #14 (Spatial Data Management) Principles of Data Management Lecture #14 (Spatial Data Management) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Today s Notable News v Project

More information

Query Optimization in Distributed Databases. Dilşat ABDULLAH

Query Optimization in Distributed Databases. Dilşat ABDULLAH Query Optimization in Distributed Databases Dilşat ABDULLAH 1302108 Department of Computer Engineering Middle East Technical University December 2003 ABSTRACT Query optimization refers to the process of

More information

High Performance Clustering Based on the Similarity Join

High Performance Clustering Based on the Similarity Join Proc.9th Int. Conf. on Information and Knowledge Management CIKM 2000, Washington, DC. High Performance Clustering Based on the Similarity Join Christian Böhm, Bernhard Braunmüller, Markus Breunig, Hans-Peter

More information

DBSCAN. Presented by: Garrett Poppe

DBSCAN. Presented by: Garrett Poppe DBSCAN Presented by: Garrett Poppe A density-based algorithm for discovering clusters in large spatial databases with noise by Martin Ester, Hans-peter Kriegel, Jörg S, Xiaowei Xu Slides adapted from resources

More information

XZ-Ordering: A Space-Filling Curve for Objects with Spatial Extension

XZ-Ordering: A Space-Filling Curve for Objects with Spatial Extension 6th Int. Symposium on Large Spatial Databases (SSD), 1999, Hong Kong, China XZ-Ordering: A Space-Filling Curve for Objects with Spatial Extension Christian Böhm 1, Gerald Klump 1 and Hans-Peter Kriegel

More information

Distance-based Outlier Detection: Consolidation and Renewed Bearing

Distance-based Outlier Detection: Consolidation and Renewed Bearing Distance-based Outlier Detection: Consolidation and Renewed Bearing Gustavo. H. Orair, Carlos H. C. Teixeira, Wagner Meira Jr., Ye Wang, Srinivasan Parthasarathy September 15, 2010 Table of contents Introduction

More information

Time Series:Similarity Search and its Applications

Time Series:Similarity Search and its Applications Time Series:Similarity Search and its Applications Tripti Negi and Veena Bansal Abstract Identify companied with similar growth pattern. A model for identifying similar time series has been developed.

More information

Oracle Spatial. User s Guide and Reference. Release June 2001 Part No. A

Oracle Spatial. User s Guide and Reference. Release June 2001 Part No. A Oracle Spatial User s Guide and Reference Release 9.0.1 June 2001 Part No. A88805-01 Oracle Spatial User s Guide and Reference, Release 9.0.1 Part No. A88805-01 Copyright 1997, 2001, Oracle Corporation.

More information

Appropriate Item Partition for Improving the Mining Performance

Appropriate Item Partition for Improving the Mining Performance Appropriate Item Partition for Improving the Mining Performance Tzung-Pei Hong 1,2, Jheng-Nan Huang 1, Kawuu W. Lin 3 and Wen-Yang Lin 1 1 Department of Computer Science and Information Engineering National

More information

A Unit of SequelGate Innovative Technologies Pvt. Ltd. All Training Sessions are Completely Practical & Real-time

A Unit of SequelGate Innovative Technologies Pvt. Ltd. All Training Sessions are Completely Practical & Real-time SQL Basics & PL-SQL Complete Practical & Real-time Training Sessions A Unit of SequelGate Innovative Technologies Pvt. Ltd. ISO Certified Training Institute Microsoft Certified Partner Training Highlights

More information

Approximate Continuous K Nearest Neighbor Queries for Continuous Moving Objects with Pre-Defined Paths

Approximate Continuous K Nearest Neighbor Queries for Continuous Moving Objects with Pre-Defined Paths Approximate Continuous K Nearest Neighbor Queries for Continuous Moving Objects with Pre-Defined Paths Yu-Ling Hsueh, Roger Zimmermann, and Meng-Han Yang Computer Science Department University of Southern

More information

Object-Relational Management of Complex Geographical Objects

Object-Relational Management of Complex Geographical Objects Object-Relational Management of Complex Geographical Objects Hans-Peter Kriegel, Peter Kunath, Martin Pfeifle, Matthias Renz University of Munich Oettingenstrasse 67 D-80538 Munich, GERMANY +49-89-280990

More information

Database Systems: Design, Implementation, and Management Tenth Edition. Chapter 8 Advanced SQL

Database Systems: Design, Implementation, and Management Tenth Edition. Chapter 8 Advanced SQL Database Systems: Design, Implementation, and Management Tenth Edition Chapter 8 Advanced SQL SQL Join Operators Join operation merges rows from two tables and returns the rows with one of the following:

More information