XML Query Processing and Optimization
|
|
- Dwain Wilkerson
- 5 years ago
- Views:
Transcription
1 XML Query Processing and Optimization Bartley D. Richardson Department of Electrical & Computer Engineering and Computer Science University of Cincinnati December 16, 2005
2 Outline Background XML As A Data Model XML Overview OEM Overview Indexing Over XML B+, XR, and XB Trees DataGuides ToXin Constraint Sequencing XML Query Styles Proposed Research Characterize Parameters and Features Performance Study Create User Tools B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
3 Traditional Query Optimization SQL query travels through two levels of optimization Final product is an access plan for the query B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
4 Traditional Logical Optimization Query converted to relational algebra (RA) tree Transformations are performed on the tree Figure: RA Logical Optimization B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
5 Traditional Physical Optimization Takes into account file organization Chooses algorithms for operators Figure: RA Physical Optimization B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
6 XML Overview Syntactically similar to HTML Information stored in content as well as structure Data represented is self-describing Generally does not require outside schema Presents new challenges for query processing and optimization B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
7 XQuery Designed to meet requirements set by W3C for an XML query language Includes a well-defined grammar FLWR expressions (For, Let, Where, Return) No built-in optimization techniques Current version (XQuery 1.0) is extension of XPath 2.0 XQuery not used in proposal but will be leveraged as an acceptable interface to XML documents and used to extract information B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
8 XPath Simplest type of query in XML Expressions resemble UNIX directory structure Parent-child relationship denoted by a slash (/) Ancestor-descendant relationship denoted by a double-slash (//) Text in brackets ([ ]) acts as a filter on data to be returned XPath Example /bookstore/book[price>35]/title B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
9 XML Example FoodDrink XML Example <FoodDrink> <restaurant id= R001 > <name>chili s</name> <phone> </phone> <owner>g. Peppard</owner> </restaurant> <restaurant id= R002 > <name>maggiano s</name> <owner>g. Peppard</owner> <manager>crowley/manager> </restaurant> <bar id= B001 > <name>crowley</name> <style>irish</style> </bar> </FoodDrink> B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
10 OEM Overview XML data can be represented by the Object Exchange Model (OEM) [PGMW95] Self-describing Retains simplicity of relational models Allows flexibility of object-oriented models B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
11 OEM Representation Figure: Corresponding FoodDrink OEM Representation B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
12 Challenges for XML Query Processing/Optimization XML queries themselves are more complicated New kinds of indexes and traversal strategies Many kinds of indexes Many choices on how to store and traverse trees Results in more query execution strategies Complicated database statistics due to number of possibilities for tree shapes B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
13 Node Labeling Nodes of OEM must be labeled for B + -, XR-, and XB-trees Use extended preorder traversal [LM01] (an extension of Dietz s original numbering scheme [Die82]) Labels each node with a pair of numbers <order,size> Allows insertions to be made into the tree without need for global reordering B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
14 OEM Example with EPT Labeling Figure: OEM with Extended Preorder Traversal Node Labeling B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
15 B + -Trees Used in relational database systems to implement dynamic multilevel index [EN00] Does not require reorganization of entire file to maintain performance Due to hierarchical nature, used for processing XML structural joins Performance later improved upon by XR- and XB-trees B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
16 XR-Trees A B + -tree that is built on start points of element intervals Designed for strictly nested XML data Allows all ancestors and descendants for a given element to be identified optimally Lacks capability to handle highly recursive XML elements efficiently [LLHC04] B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
17 XB-Trees Combines structural features of both the B + -tree and R-tree Indexes pre-assigned intervals of elements in the tree (similar to one-dimensional R-tree) Constructs the index on start points of the intervals (similar to B + -tree) Main difference is that size value of <order,size> must be propagated up index Developed for use in processing holistic twig joins [BKS02] B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
18 A Sample XB-Tree Figure: Sample XB-tree B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
19 DataGuide Definition Concise and accurate summary of database structure [GW97] Dynamically generated Conforms to the data One OEM source database may have multiple DataGuides Use and effectiveness for indexing XML documents on their own has not been addressed Can be used in conjunction with other indexes VIndex (Value Index) LIndex (Link Index) BIndex (Parent/Child pairs) PIndex (Path Index) B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
20 DataGuide Example Figure: DataGuide for OEM Example B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
21 ToXin Developed within the ToX (Toronto XML Engine) project [RM01] Seeks to exploit overall path structure of XML databases in all stages of query processing Index consists of two main structures Value Index: stores XML nodes and corresponding values Path Index: Consists of index tree (a DataGuide) and set of instance tables (used to identify parent-child relationships) Includes redundant information (potentially costly for large databases) B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
22 ToXin Example Figure: Sample ToXin Tree and Tables B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
23 Constraint Sequencing Operates by encoding the entire tree at once [WM05] Index is built that allows selection of an object or path by matching subsequences Encoded information represented by adding prefixes (forward prefix) to value nodes that encode their path along the tree Similar to extended preorder traversal, but uses depth-first traversal of tree to assign the value order(x) Example Sequence <&1,&1&2,&1&2&5,&1&2&6,&1&2&7,... > <&1,&1&2,&1&2(Chili s),&1&2( ),&1&2(g. Peppard)... > B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
24 Constraint Sequencing Open Questions Challenges Encoding tree without introducing false hits or dismissals Authors have identified these problems and developed techniques to overcome Open Questions Appears efficient, but no full comparison to other indexing techniques Options for underlying data structure have not been investigated B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
25 Query Styles Overview Various styles of queries can be classified in traditional terms (Select, Project, Join) Select Simple Path Complex Path Project Value Object Join Structural Value-based B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
26 Query Styles Overview Various styles of queries can be classified in traditional terms (Select, Project, Join) Select Simple Path Complex Path Project Value Object Join Structural Value-based B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
27 Query Styles Overview Various styles of queries can be classified in traditional terms (Select, Project, Join) Select Simple Path Complex Path Project Value Object Join Structural Value-based B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
28 Query Styles Overview Various styles of queries can be classified in traditional terms (Select, Project, Join) Select Simple Path Complex Path Project Value Object Join Structural Value-based B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
29 Select: Simple Path Give me the phone number of Chili s. Chili s ]/phone B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
30 Select: Complex Path Give me the name of all restaurants owned by G. Peppard. G. Peppard ]/name B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
31 Project: Value Give me the name of all establishments. //name B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
32 Project: Object Give me all information about all restaurants. //restaurant B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
33 Join: Structural Give me the name of all restaurants owned by G. Peppard. G. Peppard ]/name B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
34 Join: Value-based Name of all places where manager name is the same as place name B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
35 Database Statistics Statistics kept for relational databases are straightforward Due to tree structure of XML, more parameters (statistics) to model for cost-based optimizers XML optimizer can start evaluating a path expression from any point in the path McHugh and Widom [MW99] identify salient features and develop a formal model More statistics may be necessary, but this model will be used as a starting point B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
36 Database Statistics - Model Statistics maintained for each subpath p with a length of at most k include the total number of atomic objects (for each atomic type) that are reachable via p, the minimum and maximum values (for each atomic type) of all atomic objects reachable via p, the total number of instances of path p ( p ), the total number of distinct objects reachable via p ( p d ), the total number of l-labeled subobjects reachable via p ( p l ), and the total number of l-labeled edges to any instance of p ( p l ). B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
37 Database Statistics - Fan In/Out Statistics can help describe the shapes of query trees presented to an XML database. Fan-out for any given path expression is computed by p p l p d Fan-in is computed by p pl p d. B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
38 Proposed Research Overview Our proposed research includes the following items Classification of XML query styles Investigation into possible tree shapes (database statistics) for XML documents and queries Performance study across all parameters Leverage performance study to create user tools B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
39 XML Query Styles Have a foundation to identify and classify XML query styles presented in the literature This allows for current and future research to be classified and referenced using a common framework As the area evolves, more styles may become necessary B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
40 Remaining Research Objectives of this research still remaining include: Determine if database statistics available are enough to effectively describe all tree shapes and distinguish trees from one another Conduct a performance study across all parameters (indexing techniques, data structures, query styles, tree shapes) Create user tools that take a wide, generic approach to automating query optimization B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
41 Remaining Research Objectives of this research still remaining include: Determine if database statistics available are enough to effectively describe all tree shapes and distinguish trees from one another Conduct a performance study across all parameters (indexing techniques, data structures, query styles, tree shapes) Create user tools that take a wide, generic approach to automating query optimization B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
42 Remaining Research Objectives of this research still remaining include: Determine if database statistics available are enough to effectively describe all tree shapes and distinguish trees from one another Conduct a performance study across all parameters (indexing techniques, data structures, query styles, tree shapes) Create user tools that take a wide, generic approach to automating query optimization B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
43 Current XML Optimization Work We are aware of two projects that perform XML query optimization Timber: System proposed out of University of Michigan Stanford: Similar project out of Stanford University Both projects are limited to their own local indexing research Do not provide a wide enough scope for a general XML query optimizer Some of their work may be able to be leveraged on our proposed research B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
44 Questions Questions? B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
45 Extended Preorder Traversal There are three conditions placed on the values for order and size. 1 For tree node y and its parent x, [order(y), order(y) + size(y)] contained in [order(x), order(x) + size(x)] 2 For two sibling nodes x and y, order(x) + size(x) < order(y) if x is predecessor of y in preorder traversal 3 For any node x, size(x) y size(y) for all y s that are a direct child of x. B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
46 References I Serge Abiteboul, Peter Buneman, and Dan Suciu. Data on the Web. Morgan Kaufmann Publishers Inc., ANSI. American National Standards Institute: The database language SQL. Document ANSI X3.135, Nicolas Bruno, Nick Koudas, and Divesh Srivastava. Holistic twig joins: Optimal xml pattern matching. In Proceedings of the 2002 ACM SIGMOD International Conference on Management of data, pages , Madison, Wisconsin, USA, June B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
47 References II R. G. G. Cattell, Douglas K. Barry, Dirk Bartels, Mark Berler, Jeff Eastman, Sophie Gamerman, David Jordan, Adam Springer, Henry Strickland, and Drew Wade. The Object Database Standard: ODMG 2.0. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, E. F. Codd. A relational model of data for large shared data banks. Communications of the ACM, 13(6): , B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
48 References III Shu-Yao Chien, Zografoula Vagena, Donghui Zhang, and Vassilis J. Tsotras. Efficient structural joins on indexed xml documents. In Proceedings of the 28th International Conference on Very Large Data Bases (VLDB 02), pages , Hong Kong, China, August Paul F. Dietz. Maintaining order in a linked list. In Proceedings of the 14th Annual ACM Symposium on Theory of Computing (STOC 82), pages , San Francisco, CA, United States, May B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
49 References IV Ramez Elmasri and Shamkant B. Navathe. Fundamentals of Database Systems. Addison-Wesley, 3rd edition, Roy Goldman and Jennifer Widom. Dataguides: Enabling query formulation and optimization in semistructured databases. In Proceedings of 23rd International Conference on Very Large Data Bases (VLDB 97), pages , San Francisco, CA, USA, August Su Cheng Haw and G. S. V. Radha Krishna Rao. Query optimization techniques for XML databases. International Journal of Information Technology, 2(1):97 104, B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
50 References V Haifeng Jiang, Hongjun Lu, Wei Wang, and Beng Chin Ooi. XR-Tree: Indexing XML data for efficient structural joins. In Proceedings of the 19th International Conference on Data Engineering (ICDE 03), pages , Bangalore, India, March Hanyu Li, Mong-Li Lee, Wynne Hsu, and Chao Chen. An evaluation of xml indexes for structural join. SIGMOD Record, 33(3):28 33, September Quanzhong Li and Bongki Moon. Indexing and querying XML data for regular path expressions. In Proceedings of the 27th International Conference on Very Large Data Bases (VLDB 01), pages , San Francisco, CA, United States, September B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
51 References VI Jason McHugh and Jennifer Widom. Query optimization for XML. In The VLDB Journal, pages , Edinburgh, U.K., September Yannis Papakonstantinou, Hector Garcia-Molina, and Jennifer Widom. Object exchange across heterogeneous information sources. In Proceedings of the 11th International Conference on Data Engineering (ICDE 95), pages , Taipei, Taiwan, March IEEE Computer Society. B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
52 References VII Flavio Rizzolo and Alberto O. Mendelzon. Indexing XML data with ToXin. In Proceedings of the 4th International Workshop on the Web and Databases (WebDB 2001), pages 49 54, Santa Barbara, CA, USA, May Haixun Wang and Xiaofeng Meng. On the sequencing of tree structures for xml indexing. In Proceedings of the 21st International Conference on Data Engineering (ICDE 05), pages , Tokyo, Japan, April B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
53 References VIII Yuqing Wu, Jignesh Patel, and H. Jagadish. Structural join order selection for XML query optimization. In Proceedings of the 19th International Conference on Data Engineering (ICDE 03), pages , Bangalore, India, March Extensible Markup Language (XML). XPath amoeller/xml/linking/xpath20.html. XQuery 1.0: An XML Query Language. B. Richardson (ECECS-UC) XML Query Processing and Optimization December 16, / 57
Indexing XML Data with ToXin
Indexing XML Data with ToXin Flavio Rizzolo, Alberto Mendelzon University of Toronto Department of Computer Science {flavio,mendel}@cs.toronto.edu Abstract Indexing schemes for semistructured data have
More informationSome aspects of references behaviour when querying XML with XQuery
Some aspects of references behaviour when querying XML with XQuery c B.Khvostichenko boris.khv@pobox.spbu.ru B.Novikov borisnov@acm.org Abstract During the XQuery query evaluation, the query output is
More informationA Two-Step Approach for Tree-structured XPath Query Reduction
A Two-Step Approach for Tree-structured XPath Query Reduction Minsoo Lee, Yun-mi Kim, and Yoon-kyung Lee Abstract XML data consists of a very flexible tree-structure which makes it difficult to support
More informationFull-Text and Structural XML Indexing on B + -Tree
Full-Text and Structural XML Indexing on B + -Tree Toshiyuki Shimizu 1 and Masatoshi Yoshikawa 2 1 Graduate School of Information Science, Nagoya University shimizu@dl.itc.nagoya-u.ac.jp 2 Information
More informationAn Efficient XML Index Structure with Bottom-Up Query Processing
An Efficient XML Index Structure with Bottom-Up Query Processing Dong Min Seo, Jae Soo Yoo, and Ki Hyung Cho Department of Computer and Communication Engineering, Chungbuk National University, 48 Gaesin-dong,
More informationSemistructured Data Store Mapping with XML and Its Reconstruction
Semistructured Data Store Mapping with XML and Its Reconstruction Enhong CHEN 1 Gongqing WU 1 Gabriela Lindemann 2 Mirjam Minor 2 1 Department of Computer Science University of Science and Technology of
More informationLabeling and Querying Dynamic XML Trees
Labeling and Querying Dynamic XML Trees Jiaheng Lu, Tok Wang Ling School of Computing, National University of Singapore 3 Science Drive 2, Singapore 117543 {lujiahen,lingtw}@comp.nus.edu.sg Abstract With
More informationA FRAMEWORK FOR EFFICIENT DATA SEARCH THROUGH XML TREE PATTERNS
A FRAMEWORK FOR EFFICIENT DATA SEARCH THROUGH XML TREE PATTERNS SRIVANI SARIKONDA 1 PG Scholar Department of CSE P.SANDEEP REDDY 2 Associate professor Department of CSE DR.M.V.SIVA PRASAD 3 Principal Abstract:
More informationEstimating the Selectivity of XML Path Expression with predicates by Histograms
Estimating the Selectivity of XML Path Expression with predicates by Histograms Yu Wang 1, Haixun Wang 2, Xiaofeng Meng 1, and Shan Wang 1 1 Information School, Renmin University of China, Beijing 100872,
More informationCompression of the Stream Array Data Structure
Compression of the Stream Array Data Structure Radim Bača and Martin Pawlas Department of Computer Science, Technical University of Ostrava Czech Republic {radim.baca,martin.pawlas}@vsb.cz Abstract. In
More informationStoring and Maintaining Semistructured Data Efficiently in an Object-Relational Database
Storing and Maintaining Semistructured Data Efficiently in an Object-Relational Database Yuanying Mo National University of Singapore moyuanyi@comp.nus.edu.sg Tok Wang Ling National University of Singapore
More informationOutline. Approximation: Theory and Algorithms. Ordered Labeled Trees in a Relational Database (II/II) Nikolaus Augsten. Unit 5 March 30, 2009
Outline Approximation: Theory and Algorithms Ordered Labeled Trees in a Relational Database (II/II) Nikolaus Augsten 1 2 3 Experimental Comparison of the Encodings Free University of Bozen-Bolzano Faculty
More informationEfficient Query Optimization Of XML Tree Pattern Matching By Using Holistic Approach
P P IJISET - International Journal of Innovative Science, Engineering & Technology, Vol. 2 Issue 7, July 2015. Efficient Query Optimization Of XML Tree Pattern Matching By Using Holistic Approach 1 Miss.
More informationSymmetrically Exploiting XML
Symmetrically Exploiting XML Shuohao Zhang and Curtis Dyreson School of E.E. and Computer Science Washington State University Pullman, Washington, USA The 15 th International World Wide Web Conference
More informationAccelerating XML Structural Matching Using Suffix Bitmaps
Accelerating XML Structural Matching Using Suffix Bitmaps Feng Shao, Gang Chen, and Jinxiang Dong Dept. of Computer Science, Zhejiang University, Hangzhou, P.R. China microf_shao@msn.com, cg@zju.edu.cn,
More informationEvaluating XPath Queries
Chapter 8 Evaluating XPath Queries Peter Wood (BBK) XML Data Management 201 / 353 Introduction When XML documents are small and can fit in memory, evaluating XPath expressions can be done efficiently But
More informationOptimize Twig Query Pattern Based on XML Schema
JOURNAL OF SOFTWARE, VOL. 8, NO. 6, JUNE 2013 1479 Optimize Twig Query Pattern Based on XML Schema Hui Li Beijing University of Technology, Beijing, China Email: xiaodadaxiao2000@163.com HuSheng Liao and
More informationLabeling Scheme and Structural Joins for Graph-Structured XML Data
Labeling Scheme and Structural Joins for Graph-Structured XML Data Hongzhi Wang 1,2, Wei Wang 1,3, Xuemin Lin 1,3, and Jianzhong Li 2 1 University of New South Wales, Australia {hongzhiw, weiw, lxue}@cse.unsw.edu.au
More informationIntegrating Path Index with Value Index for XML data
Integrating Path Index with Value Index for XML data Jing Wang 1, Xiaofeng Meng 2, Shan Wang 2 1 Institute of Computing Technology, Chinese Academy of Sciences, 100080 Beijing, China cuckoowj@btamail.net.cn
More informationPathStack : A Holistic Path Join Algorithm for Path Query with Not-predicates on XML Data
PathStack : A Holistic Path Join Algorithm for Path Query with Not-predicates on XML Data Enhua Jiao, Tok Wang Ling, Chee-Yong Chan School of Computing, National University of Singapore {jiaoenhu,lingtw,chancy}@comp.nus.edu.sg
More informationTwigList: Make Twig Pattern Matching Fast
TwigList: Make Twig Pattern Matching Fast Lu Qin, Jeffrey Xu Yu, and Bolin Ding The Chinese University of Hong Kong, China {lqin,yu,blding}@se.cuhk.edu.hk Abstract. Twig pattern matching problem has been
More informationAn Implementation of Tree Pattern Matching Algorithms for Enhancement of Query Processing Operations in Large XML Trees
An Implementation of Tree Pattern Matching Algorithms for Enhancement of Query Processing Operations in Large XML Trees N. Murugesan 1 and R.Santhosh 2 1 PG Scholar, 2 Assistant Professor, Department of
More informationBrowsing in the tsimmis System. Stanford University. into requests the source can execute. The data returned by the source is converted back into the
Information Translation, Mediation, and Mosaic-Based Browsing in the tsimmis System SIGMOD Demo Proposal (nal version) Joachim Hammer, Hector Garcia-Molina, Kelly Ireland, Yannis Papakonstantinou, Jerey
More informationAn Improved Prefix Labeling Scheme: A Binary String Approach for Dynamic Ordered XML
An Improved Prefix Labeling Scheme: A Binary String Approach for Dynamic Ordered XML Changqing Li and Tok Wang Ling Department of Computer Science, National University of Singapore {lichangq, lingtw}@comp.nus.edu.sg
More informationFast XML Structural Join Algorithms by Partitioning
Fast XML Structural Join Algorithms by Partitioning Nan Tang Jeffrey Xu Yu Kam-Fai Wong The Chinese University of Hong Kong, Hong Kong, China Jianxin Li Swinburne University of Technology email: {ntang,yu,kfwong}@se.cuhk.edu.hk
More informationXML: Extensible Markup Language
XML: Extensible Markup Language CSC 375, Fall 2015 XML is a classic political compromise: it balances the needs of man and machine by being equally unreadable to both. Matthew Might Slides slightly modified
More informationAspects of an XML-Based Phraseology Database Application
Aspects of an XML-Based Phraseology Database Application Denis Helic 1 and Peter Ďurčo2 1 University of Technology Graz Insitute for Information Systems and Computer Media dhelic@iicm.edu 2 University
More informationTwigStack + : Holistic Twig Join Pruning Using Extended Solution Extension
Vol. 8 No.2B 2007 603-609 Article ID: + : Holistic Twig Join Pruning Using Extended Solution Extension ZHOU Junfeng 1,2, XIE Min 1, MENG Xiaofeng 1 1 School of Information, Renmin University of China,
More informationInvestigation into Indexing XML Data Techniques
Investigation into Indexing XML Data Techniques Alhadi Klaib, Joan Lu Department of Informatics University of Huddersfield Huddersfield, UK Abstract- The rapid development of XML technology improves the
More informationInteractive Query and Search in Semistructured Databases æ
Interactive Query and Search in Semistructured Databases Roy Goldman, Jennifer Widom Stanford University froyg,widomg@cs.stanford.edu www-db.stanford.edu Abstract Semistructured graph-based databases have
More informationAnswering XML Twig Queries with Automata
Answering XML Twig Queries with Automata Bing Sun, Bo Zhou, Nan Tang, Guoren Wang, Ge Yu, and Fulin Jia Northeastern University, Shenyang, China {sunb,wanggr,yuge,dbgroup}@mail.neu.edu.cn Abstract. XML
More informationCHAPTER 3 LITERATURE REVIEW
20 CHAPTER 3 LITERATURE REVIEW This chapter presents query processing with XML documents, indexing techniques and current algorithms for generating labels. Here, each labeling algorithm and its limitations
More informationExperimental Evaluation of Query Processing Techniques over Multiversion XML Documents
Experimental Evaluation of Query Processing Techniques over Multiversion XML Documents Adam Woss Computer Science University of California, Riverside awoss@cs.ucr.edu Vassilis J. Tsotras Computer Science
More informationStoring and Querying XML Documents Without Using Schema Information
Storing and Querying XML Documents Without Using Schema Information Kanda Runapongsa Department of Computer Engineering Khon Kaen University, Thailand krunapon@kku.ac.th Jignesh M. Patel Department of
More informationThe Research on Coding Scheme of Binary-Tree for XML
Available online at www.sciencedirect.com Procedia Engineering 24 (2011 ) 861 865 2011 International Conference on Advances in Engineering The Research on Coding Scheme of Binary-Tree for XML Xiao Ke *
More informationData Centric Integrated Framework on Hotel Industry. Bridging XML to Relational Database
Data Centric Integrated Framework on Hotel Industry Bridging XML to Relational Database Introduction extensible Markup Language (XML) is a promising Internet standard for data representation and data exchange
More informationNew Path Based Index Structure for Processing CAS Queries over XML Database
CIT. Journal of Computing and Information Technology, Vol. 25, No. 3, September 2017, 211 225 doi: 10.20532/cit.2017.1003557 211 New Path Based Index Structure for Processing CAS Queries over XML Database
More informationTwigINLAB: A Decomposition-Matching-Merging Approach To Improving XML Query Processing
American Journal of Applied Sciences 5 (9): 99-25, 28 ISSN 546-9239 28 Science Publications TwigINLAB: A Decomposition-Matching-Merging Approach To Improving XML Query Processing Su-Cheng Haw and Chien-Sing
More informationA New Way of Generating Reusable Index Labels for Dynamic XML
A New Way of Generating Reusable Index Labels for Dynamic XML P. Jayanthi, Dr. A. Tamilarasi Department of CSE, Kongu Engineering College, Perundurai 638 052, Erode, Tamilnadu, India. Abstract XML now
More informationADT 2009 Other Approaches to XQuery Processing
Other Approaches to XQuery Processing Stefan Manegold Stefan.Manegold@cwi.nl http://www.cwi.nl/~manegold/ 12.11.2009: Schedule 2 RDBMS back-end support for XML/XQuery (1/2): Document Representation (XPath
More informationAggregate Query Processing of Streaming XML Data
ggregate Query Processing of Streaming XML Data Yaw-Huei Chen and Ming-Chi Ho Department of Computer Science and Information Engineering National Chiayi University {ychen, s0920206@mail.ncyu.edu.tw bstract
More informationQuickXDB: A Prototype of a Native XML QuickXDB: Prototype of Native XML DBMS DBMS
QuickXDB: A Prototype of a Native XML QuickXDB: Prototype of Native XML DBMS DBMS Petr Lukáš, Radim Bača, and Michal Krátký Petr Lukáš, Radim Bača, and Michal Krátký Department of Computer Science, VŠB
More informationKeyword search in relational databases. By SO Tsz Yan Amanda & HON Ka Lam Ethan
Keyword search in relational databases By SO Tsz Yan Amanda & HON Ka Lam Ethan 1 Introduction Ubiquitous relational databases Need to know SQL and database structure Hard to define an object 2 Query representation
More informationRELATIONAL STORAGE FOR XML RULES
RELATIONAL STORAGE FOR XML RULES A. A. Abd El-Aziz Research Scholar Dept. of Information Science & Technology Anna University Email: abdelazizahmed@auist.net Professor A. Kannan Dept. of Information Science
More informationIndex Structures for Matching XML Twigs Using Relational Query Processors
Index Structures for Matching XML Twigs Using Relational Query Processors Zhiyuan Chen University of Maryland at Baltimore County zhchen@umbc.com Nick Koudas AT&T Labs Research koudas@research.att.com
More informationDesign of Index Schema based on Bit-Streams for XML Documents
Design of Index Schema based on Bit-Streams for XML Documents Youngrok Song 1, Kyonam Choo 3 and Sangmin Lee 2 1 Institute for Information and Electronics Research, Inha University, Incheon, Korea 2 Department
More informationCreating a Mediated Schema Based on Initial Correspondences
Creating a Mediated Schema Based on Initial Correspondences Rachel A. Pottinger University of Washington Seattle, WA, 98195 rap@cs.washington.edu Philip A. Bernstein Microsoft Research Redmond, WA 98052-6399
More informationTwigX-Guide: An Efficient Twig Pattern Matching System Extending DataGuide Indexing and Region Encoding Labeling
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 25, 603-617 (2009) Short Paper TwigX-Guide: An Efficient Twig Pattern Matching System Extending DataGuide Indexing and Region Encoding Labeling Department
More informationA FRACTIONAL NUMBER BASED LABELING SCHEME FOR DYNAMIC XML UPDATING
A FRACTIONAL NUMBER BASED LABELING SCHEME FOR DYNAMIC XML UPDATING Meghdad Mirabi 1, Hamidah Ibrahim 2, Leila Fathi 3,Ali Mamat 4, and Nur Izura Udzir 5 INTRODUCTION 1 Universiti Putra Malaysia, Malaysia,
More informationQuerying Tree-Structured Data Using Dimension Graphs
Querying Tree-Structured Data Using Dimension Graphs Dimitri Theodoratos 1 and Theodore Dalamagas 2 1 Dept. of Computer Science New Jersey Institute of Technology Newark, NJ 07102 dth@cs.njit.edu 2 School
More informationElement Algebra. 1 Introduction. M. G. Manukyan
Element Algebra M. G. Manukyan Yerevan State University Yerevan, 0025 mgm@ysu.am Abstract. An element algebra supporting the element calculus is proposed. The input and output of our algebra are xdm-elements.
More informationSchema-Based XML-to-SQL Query Translation Using Interval Encoding
2011 Eighth International Conference on Information Technology: New Generations Schema-Based XML-to-SQL Query Translation Using Interval Encoding Mustafa Atay Department of Computer Science Winston-Salem
More informationPAPER Full-Text and Structural Indexing of XML Documents on B + -Tree
IEICE TRANS. INF. & SYST., VOL.E89 D, NO.1 JANUARY 2006 237 PAPER Full-Text and Structural Indexing of XML Documents on B + -Tree Toshiyuki SHIMIZU a), Nonmember and Masatoshi YOSHIKAWA b), Member SUMMARY
More informationEfficient Integration of Structure Indexes of XML
Efficient Integration of Structure Indexes of XML Taro L. Saito Shinichi Morishita University of Tokyo, Japan, {leo, moris}@cb.k.u-tokyo.ac.jp Abstract. Several indexing methods have been proposed to encode
More informationToX The Toronto XML Engine
ToX The Toronto XML Engine Denilson Barbosa 1 Attila Barta 1 Alberto Mendelzon 1 George Mihaila 2 Flavio Rizzolo 1 Patricia Rodriguez-Gianolli 1 1 Department of Computer Science University of Toronto {dmb,atibarta,mendel,flavio,prg}@cs.toronto.edu
More informationAn Extended Byte Carry Labeling Scheme for Dynamic XML Data
Available online at www.sciencedirect.com Procedia Engineering 15 (2011) 5488 5492 An Extended Byte Carry Labeling Scheme for Dynamic XML Data YU Sheng a,b WU Minghui a,b, * LIU Lin a,b a School of Computer
More informationPath Query Reduction and Diffusion for Distributed Semi-structured Data Retrieval+
Path Query Reduction and Diffusion for Distributed Semi-structured Data Retrieval+ Jaehyung Lee, Yon Dohn Chung, Myoung Ho Kim Division of Computer Science, Department of EECS Korea Advanced Institute
More informationOutline. Depth-first Binary Tree Traversal. Gerênciade Dados daweb -DCC922 - XML Query Processing. Motivation 24/03/2014
Outline Gerênciade Dados daweb -DCC922 - XML Query Processing ( Apresentação basedaem material do livro-texto [Abiteboul et al., 2012]) 2014 Motivation Deep-first Tree Traversal Naïve Page-based Storage
More informationIndices in XML databases. Hadj Mahboubi. University of Lyon (ERIC Lyon 2) 5 avenue Pierre Mendès-France, Bron Cedex, France
Indices in XML databases Hadj Mahboubi University of Lyon (ERIC Lyon 2) 5 avenue Pierre Mendès-France, 69676 Bron Cedex, France Phone: +33 478 773 111 Fax: +33 478 772 375 hadj.mahboubi@eric.univ-lyon2.fr
More informationA Survey Of Algorithms Related To Xml Based Pattern Matching
A Survey Of Algorithms Related To Xml Based Pattern Matching Dr.R.Sivarama Prasad 1, D.Bujji Babu 2, Sk.Habeeb 3, Sd.Jasmin 4 1 Coordinator,International Business Studies, Acharya Nagarjuna University,Guntur,A.P,India,
More informationWrapper 2 Wrapper 3. Information Source 2
Integration of Semistructured Data Using Outer Joins Koichi Munakata Industrial Electronics & Systems Laboratory Mitsubishi Electric Corporation 8-1-1, Tsukaguchi Hon-machi, Amagasaki, Hyogo, 661, Japan
More informationA NEW WATERMARKING TECHNIQUE FOR SECURE DATABASE
Online Journal, www.ijcea.com A NEW WATERMARKING TECHNIQUE FOR SECURE DATABASE Jun Ziang Pinn 1 and A. Fr. Zung 2 1,2 P. S. University for Technology, Harbin 150001, P. R. China ABSTRACT Digital multimedia
More informationOptimising XML-Based Web Information Systems
Optimising XML-Based Web Information Systems Colm Noonan and Mark Roantree Interoperable Systems Group, Dublin City University, Ireland - {mark,cnoonan}@computing.dcu.ie Abstract. Many Web Information
More informationNovel Materialized View Selection in a Multidimensional Database
Graphic Era University From the SelectedWorks of vijay singh Winter February 10, 2009 Novel Materialized View Selection in a Multidimensional Database vijay singh Available at: https://works.bepress.com/vijaysingh/5/
More informationFolder(Inbox) Message Message. Body
Rening OEM to Improve Features of Query Languages for Semistructured Data Pavel Hlousek Charles University, Faculty of Mathematics and Physics, Prague, Czech Republic Abstract. Semistructured data can
More informationMETAXPath. Utah State University. From the SelectedWorks of Curtis Dyreson. Curtis Dyreson, Utah State University Michael H. Böhen Christian S.
Utah State University From the SelectedWorks of Curtis Dyreson December, 2001 METAXPath Curtis Dyreson, Utah State University Michael H. Böhen Christian S. Jensen Available at: https://works.bepress.com/curtis_dyreson/11/
More informationClassifying Elements for XML Query Transformation
Classifying Elements for XML Query Transformation c Ke Geng University of Auckland, New Zealand ke@cs.auckland.ac.nz Abstract Research into XML query transformation has become important with the increased
More informationEstimating Result Size and Execution Times for Graph Queries
Estimating Result Size and Execution Times for Graph Queries Silke Trißl 1 and Ulf Leser 1 Humboldt-Universität zu Berlin, Institut für Informatik, Unter den Linden 6, 10099 Berlin, Germany {trissl,leser}@informatik.hu-berlin.de
More informationManaging Changes to Schema of Data Sources in a Data Warehouse
Association for Information Systems AIS Electronic Library (AISeL) AMCIS 2001 Proceedings Americas Conference on Information Systems (AMCIS) December 2001 Managing Changes to Schema of Data Sources in
More informationSearching SNT in XML Documents Using Reduction Factor
Searching SNT in XML Documents Using Reduction Factor Mary Posonia A Department of computer science, Sathyabama University, Tamilnadu, Chennai, India maryposonia@sathyabamauniversity.ac.in http://www.sathyabamauniversity.ac.in
More informationCBSL A Compressed Binary String Labeling Scheme for Dynamic Update of XML Documents
CIT. Journal of Computing and Information Technology, Vol. 26, No. 2, June 2018, 99 114 doi: 10.20532/cit.2018.1003955 99 CBSL A Compressed Binary String Labeling Scheme for Dynamic Update of XML Documents
More informationStoring and Querying Multiversion XML Documents using Durable Node Numbers
Storing and Querying Multiversion XML Documents using Durable Node Numbers Shu-Yao Chien Dept. of CS UCLA csy@cs.ucla.edu Vassilis J. Tsotras Dept. of CS&E UC Riverside tsotras@cs.ucr.edu Carlo Zaniolo
More informationIndex-Driven XQuery Processing in the exist XML Database
Index-Driven XQuery Processing in the exist XML Database Wolfgang Meier wolfgang@exist-db.org The exist Project XML Prague, June 17, 2006 Outline 1 Introducing exist 2 Node Identification Schemes and Indexing
More informationRelational Storage for XML Rules
Relational Storage for XML Rules A. A. Abd El-Aziz Research Scholar Dept. of Information Science & Technology Anna University Email: abdelazizahmed@auist.net A. Kannan Professor Dept. of Information Science
More informationTDDD43. Theme 1.2: XML query languages. Fang Wei- Kleiner h?p:// TDDD43
Theme 1.2: XML query languages Fang Wei- Kleiner h?p://www.ida.liu.se/~ Query languages for XML Xpath o Path expressions with conditions o Building block of other standards (XQuery, XSLT, XLink, XPointer,
More informationMQEB: Metadata-based Query Evaluation of Bi-labeled XML data
MQEB: Metadata-based Query Evaluation of Bi-labeled XML data Rajesh Kumar A and P Sreenivasa Kumar Department of Computer Science and Engineering Indian Institute of Technology Madras Chennai 600036, India.
More informationA Cloud Computing Implementation of XML Indexing Method Using Hadoop
A Cloud Computing Implementation of XML Indexing Method Using Hadoop Wen-Chiao Hsu 1, I-En Liao 2,**, and Hsiao-Chen Shih 3 1,2,3 Department of Computer Science and Engineering National Chung-Hsing University,
More informationLecture2: Database Environment
College of Computer and Information Sciences - Information Systems Dept. Lecture2: Database Environment 1 IS220 : D a t a b a s e F u n d a m e n t a l s Topics Covered Data abstraction Schemas and Instances
More informationCopyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 27-1
Slide 27-1 Chapter 27 XML: Extensible Markup Language Chapter Outline Introduction Structured, Semi structured, and Unstructured Data. XML Hierarchical (Tree) Data Model. XML Documents, DTD, and XML Schema.
More informationEffective Schema-Based XML Query Optimization Techniques
Effective Schema-Based XML Query Optimization Techniques Guoren Wang and Mengchi Liu School of Computer Science Carleton University, Canada {wanggr, mengchi}@scs.carleton.ca Bing Sun, Ge Yu, and Jianhua
More informationADT 2010 ADT XQuery Updates in MonetDB/XQuery & Other Approaches to XQuery Processing
1 XQuery Updates in MonetDB/XQuery & Other Approaches to XQuery Processing Stefan Manegold Stefan.Manegold@cwi.nl http://www.cwi.nl/~manegold/ MonetDB/XQuery: Updates Schedule 9.11.1: RDBMS back-end support
More informationEfficient Indexing and Querying in XML Databases
Efficient Indexing and Querying in XML Databases Ankita Atrey ankita.atrey2012@vit.ac.in School of Computing Science and Engineering, VIT University, Vellore, India Vinay Rawal rawalvin@cse.iitk.ac.in
More informationSFilter: A Simple and Scalable Filter for XML Streams
SFilter: A Simple and Scalable Filter for XML Streams Abdul Nizar M., G. Suresh Babu, P. Sreenivasa Kumar Indian Institute of Technology Madras Chennai - 600 036 INDIA nizar@cse.iitm.ac.in, sureshbabuau@gmail.com,
More informationDatabase Fundamentals Chapter 1
Database Fundamentals Chapter 1 Class 01: Database Fundamentals 1 What is a Database? The ISO/ANSI SQL Standard does not contain a definition of the term database. In fact, the term is never mentioned
More informationA New Method of Generating Index Label for Dynamic XML Data
Journal of Computer Science 7 (3): 421-426, 2011 ISSN 1549-3636 2011 Science Publications A New Method of Generating Index Label for Dynamic XML Data Jayanthi Paramasivam and Tamilarasi Angamuthu Department
More informationI. Khalil Ibrahim, V. Dignum, W. Winiwarter, E. Weippl, Logic Based Approach to Semantic Query Transformation for Knowledge Management Applications,
I. Khalil Ibrahim, V. Dignum, W. Winiwarter, E. Weippl, Logic Based Approach to Semantic Query Transformation for Knowledge Management Applications, Proc. of the International Conference on Knowledge Management
More informationA System for Storing, Retrieving, Organizing and Managing Web Services Metadata Using Relational Database *
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 6, No 1 Sofia 2006 A System for Storing, Retrieving, Organizing and Managing Web Services Metadata Using Relational Database
More informationRelational Index Support for XPath Axes
Relational Index Support for XPath Axes Leo Yuen and Chung Keung Poon Department of Computer Science City University of Hong Kong {leo,ckpoon}@cs.cityu.edu.hk Abstract. In this paper, we designed efficient
More informationQuerying Spatiotemporal XML Using DataFoX
Querying Spatiotemporal XML Using DataFoX Yi Chen Peter Revesz Computer Science and Engineering Department University of Nebraska-Lincoln Lincoln, NE 68588, USA {ychen,revesz}@cseunledu Abstract We describe
More informationData Warehousing Alternatives for Mobile Environments
Data Warehousing Alternatives for Mobile Environments I. Stanoi D. Agrawal A. El Abbadi Department of Computer Science University of California Santa Barbara, CA 93106 S. H. Phatak B. R. Badrinath Department
More informationNested Intervals Tree Encoding with Continued Fractions
Nested Intervals Tree Encoding with Continued Fractions VADIM TROPASHKO Oracle Corp There is nothing like abstraction To take away your intuition Shai Simonson http://aduniorg/courses/discrete/ We introduce
More informationGrouping in XML. Abstract. XML permits repeated and missing sub-elements, and missing
Grouping in XML Stelios Paparizos 1, Shurug Al-Khalifa 1, H. V. Jagadish 1, Laks Lakshmanan 2, Andrew Nierman 1, Divesh Srivastava 3, and Yuqing Wu 1 1 University of Michigan, Ann Arbor, MI, USA fspapariz,
More informationAssociation Rule Mining from XML Data
144 Conference on Data Mining DMIN'06 Association Rule Mining from XML Data Qin Ding and Gnanasekaran Sundarraj Computer Science Program The Pennsylvania State University at Harrisburg Middletown, PA 17057,
More informationRank-aware XML Data Model and Algebra: Towards Unifying Exact Match and Similar Match in XML
Proceedings of the 7th WSEAS International Conference on Multimedia, Internet & Video Technologies, Beijing, China, September 15-17, 2007 253 Rank-aware XML Data Model and Algebra: Towards Unifying Exact
More information10/24/12. What We Have Learned So Far. XML Outline. Where We are Going Next. XML vs Relational. What is XML? Introduction to Data Management CSE 344
What We Have Learned So Far Introduction to Data Management CSE 344 Lecture 12: XML and XPath A LOT about the relational model Hand s on experience using a relational DBMS From basic to pretty advanced
More informationOpen Access The Three-dimensional Coding Based on the Cone for XML Under Weaving Multi-documents
Send Orders for Reprints to reprints@benthamscience.ae 676 The Open Automation and Control Systems Journal, 2014, 6, 676-683 Open Access The Three-dimensional Coding Based on the Cone for XML Under Weaving
More informationA Commit Scheduler for XML Databases
A Commit Scheduler for XML Databases Stijn Dekeyser and Jan Hidders University of Antwerp Abstract. The hierarchical and semistructured nature of XML data may cause complicated update-behavior. Updates
More informationEfficient Evaluation of Generalized Path Pattern Queries on XML Data
Efficient Evaluation of Generalized Path Pattern Queries on XML Data Xiaoying Wu NJIT, USA xw43@njit.edu Stefanos Souldatos NTUA, Greece stef@dblab.ece.ntua.gr Theodore Dalamagas NTUA, Greece dalamag@dblab.ece.ntua.gr
More informationA MODEL FOR ADVANCED QUERY CAPABILITY DESCRIPTION IN MEDIATOR SYSTEMS
A MODEL FOR ADVANCED QUERY CAPABILITY DESCRIPTION IN MEDIATOR SYSTEMS Alberto Pan, Paula Montoto and Anastasio Molano Denodo Technologies, Almirante Fco. Moreno 5 B, 28040 Madrid, Spain Email: apan@denodo.com,
More informationInferring Structure in Semistructured Data
Inferring Structure in Semistructured Data SVETLOZAR NESTOROV æ SERGE ABITEBOUL y RAJEEV MOTWANI z Department of Computer Science Stanford University Stanford, CA 94305-9040 fevtimov,abiteboug@db.stanford.edu,
More information