Design of Index Schema based on Bit-Streams for XML Documents

Size: px
Start display at page:

Download "Design of Index Schema based on Bit-Streams for XML Documents"

Transcription

1 Design of Index Schema based on Bit-Streams for XML Documents Youngrok Song 1, Kyonam Choo 3 and Sangmin Lee 2 1 Institute for Information and Electronics Research, Inha University, Incheon, Korea 2 Department of Electronic Engineering, Inha University, Incheon, Korea 3 Computing and Information Center, Incheon University, Incheon, Korea gateway32@inha.ac.kr, kyonam@incheon.ac.kr, sanglee@inha.ac.kr Abstract In this paper, the structural information between tree nodes of XML documents is represented without any structural changes of the tree by converting that number information added to the tree into bit streams. It is also shown that other structural information can be retrieved and added to index schema in the process. In this process, we confirm that the response time can be minimized by conducting a bit operation between the bit streams with index schema in use, and accurate results can be reached by searching only with information of record sets by nodes in index files. Keywords: Index schema, XML, Database structure 1. Introduction As demand for using XML(eXtensible Markup Language) documents and the quantity of documents are increased, the amount of information needed to define the documents by XML standard is rapidly increasing as well. It has become more difficult to locate needed information, so methods to search and manage XML documents information more efficiently are necessary [1-4], and recent studies have been actively under way to store XML documents information in storage media such as databases[5-7]. These studies aim at supporting efficient route search of a single large volume XML document or several XML documents with the same structure. Thus, when using the aforementioned indexing techniques to find the wanted routes from several XML documents with different structures, we need to compose each index for each XML document and investigate every index. Though indexing methods have been suggested to complement the defects, such methods still have the drawback of lowered performance when the ancestordescendant relationship appears in the middle of the route, not in the root [8]. In this paper, query processing in diverse forms is given flexibility by adding the structural information field that is needed for the structure of the existing index files when additional structural information is added to XML documents. As a result, the given query analysis time is reduced, and thus response time to output return is reduced by using query schema based on index schema in order to improve the query processing efficiency. 2. Index Schema based on Bit-Streams This paper has gone through the following processes to achieve an index schema as in Figure 1. 1) Build up XML documents into trees by using DOM (document object model) trees. 2) Give sequential numbers to the built-up trees by the node of each level. 3) Trees given numbers are rebuilt-up. 4) Data to be stored in index files is acquired from the trees. 5) Make bit streams by using the number given to each node. 6) Store 131

2 the data gained from [4), 5)] in the index files. Figure 2 is a DTD(Document Type Definition) used for delivery between buyers and sellers. Figure 1. Index Schema Figure 2. DTD of Delivery XML Documents 2.1. XML Tree Numbering Technique In Figure 3, the node of each level is sequentially given a number, and the bits that can express each node are assigned in as the same number as that of the node of each level, among the numbers given to nodes of trees, those starting with bit 0 are excluded, and the numbers should be given from bit 1. In this way, the sequential numbers are given from the root node to the least significant nodes. After the operations are completed, the only bit stream of each route is generated when the assigned numbers are connected with the root node as the reference point. The value itself shows the super-sub relationship, the structural information of the entire node in the same tree such as parent-children, ancestor-descendant and sibling nodes from the root node up to the node where the route ends. Figure 3. Example Tree of Delivery XML Documents Given Bits 132

3 Bit stream per each node International Journal of Software Engineering and Its Applications 2.2. Technique of Converting Bit Stream of XML Structural Information Figure 3 shows that the bit stream value that is given to each node is gained by successively visiting each from the root node, and with this bit stream, the bit value by level is stored from the least significant bit in the assigned fixed bit space. The bit values from L 0 to L 7 expressed in table 1 show the bit stream value concerning the hierarchical relationship by node in Figure 3. The bit stream values correspond with all the nodes, one to one, and they are the only and unique values. Moreover, as this value is the unique value, if the bit value by each node name is known, then the whole XML document can be restored. The entire size of the bit stream is 64 bits, which means that the fixed space is assigned. Yet, if the level of an XML document goes up or the node number by level increases, then the value will exceed the fixed bit at 64 bit. In that case, it can be solved by assigning a space bigger than 64 bits. Table 1. Bit Stream Assigned to Each Node Level L 0 L 1 L 2 L 3 L 4 L 5 L 6 L i : i times repetition of bit Building up Index Schema based on Database Table In the index schema table, the names of each of the nodes, unique bit stream values, bit stream values of parent node and level values that are needed for query analysis when user's queries come up are stored. Table 2 shows the schema structure of the index files and an example of an index schema table that shows the attribute structural information of a node. Table 2. Table Field of Index Schema Table Field Roles Example N_name names of XML tree nodes Tel B_value bit stream values of each node B Tb_len certain node length of the whole bit stream 6 B_len bit stream length assigned temporarily to the present node 3 Level the level of the present node on XML tree 2 P_value bit stream value of the parent node of the present node Ctype shows if the present node is a primitive or an attribute 1 Position shows the order of the present node as a child node to the parent node 3 D_num the number of XML document 1 133

4 Data the value each node has Table 3 is a table that writes the basic document of Figure 3 based upon the DTD of Figure 2 and stores the information gained from the XML tree. Here, the record that is set by node stored in the database is the only value. Thus, the query processing method of using record sets, which is suggested in Table 3, is much more useful than the joining method based on the association between tables in which fragmented data are stored and the case of performing its additional operations in order to find out result sets. Now, it is necessary to find out how to use the structural information in Table 3. First, if a query is about a particular document, then the query can be more easily addressed by referring to the document information D_num field. Even when the query does not have any document information, the query forms that can be expressed by combinations of node information of different documents can be avoided in advance by looking at the returned results of referring to D_num field. For example, suppose the search query about (2) iname node in Table 3 is given, and looks at the result search processes. First, (2) iname node can be searched and located from table 3 by using the document information that is gained from the given query analysis(d_num field), primitives composing query and name of attributes(n_name field), values about attributes and primitives(data field) and level information(level field). At this point, B, bit stream value of iname node(b_value field) shows the location of iname node on the XML document trees. Again, the host nodes can also be identified through the bit stream(p_value field) of the parent node. Because the bit stream values of (1) item node in Table 3 are identical to B, the bit stream values of host nodes of (2) iname, it is possible to know that (1) item is the parent node of (2) iname. When such operations are repeated, any node from (2) iname to the root node can be searched. If the increase in the amount of documents may cause bit streams of the parent nodes to agree with each other, but it is possible to identify a parent node by comparing the level information, names of primitives and attributes and data. Moreover, if search query for sibling nodes of (2) iname is given, all node information that have the same value of the D_num field and Level field of (2) iname node makes sibling nodes, without particular join operations. If different structural information is added that cannot be stored in the index file structure suggested in documents, expansibility of index algorithm is guaranteed by adding a field in which new structural information can be stored without modifying the existing index file structures. Table 3. Example of Index Structure based on Conceptual Database Structure N_name B_value Tb_len B_len Level Data P_value Ctype Position D_num delivery none receiver none name B Leo date tel B ellipsis (1) item B none B iname 00000B6B xml B payment B true B (2) iname B ellipsis XML Guide B

5 3. Experiments and Results Experiment data was arranged and the performance was evaluated in order to determine the validity of index and query schema to process the XML documents presented so far. The accuracy test was conducted about the general query types by using 1,000 all different XML documents, and another test was carried out to compare the performances between the experiments of existing XRel [7] and INRIA [9] and the methods suggested in this paper. The index algorithm suggested by Chien[6] focuses on the index techniques centered on three points. That is, to effectively process XML queries, it takes 1) quickly testing the structural relationship of ancestor-descendant (parent-child) between the two given primitives, 2) quickly finding out the candidate list that satisfies the structural relationship in 1), and 3) effectively drawing the pairs that satisfy the structural relationship from the candidate list, the result of 2). This paper also suggested the algorithm concerning the new structure search of 1) and 2) mentioned above. However, Chien proved its efficiency with B+ trees in stage 3), but this paper presents a more efficient method of storing and searching the index algorithm that suits the processes of 1) and 2). As the queries are already optimized after the processes of 1) and 2), the result of one to one corresponding can be directly obtained from database in stage 3). However, in comparison with Chien's method, this paper shows a slight difference. As Chien concentrates on the structural relationship index, he does not consider the characteristics of the storage media such as databases. Rather, he simply stores and searches the structural information with B+ tree. However, there is a close relationship between the index structure and the storage structure in this paper, it is not appropriate to be compared with Chien's method the only indexes XML documents. The index schema of XML documents suggested in this paper was conducted 1,000 times by repeating the queries with the XPath expression depth of 10 by using experiment data, and the results are in figure 4. As shown in figure 4, the accuracy was more than 94%, though the number of primitives, attributes and texts in expressions increases. The error rate was less than 6%, the reason is that with the increased complexity of documents, the number of nodes by level in XML document trees increases. Thus, as the length of bit stream that can express structural information of each node exceeds 64 bits, query processing operations fail, otherwise, inappropriate results that were returned though query processing were successful. Figure 4. Accuracy of Retrieval Results for XML Queries 135

6 4. Conclusion This paper proposed an effective index and query schema to search large amounts of XML document structural information. The query accuracy test with 1,000 various XML documents and the query-response time measured in the experiment over Shakespeare plays whose documents are deep and complicated were conducted, and this proved the efficiency of the suggested index and query methods. Therefore, it can be said that using the index and query schema suggested in this paper shows that the accuracy of drawing results of the query search in the query accuracy performance test is more than 94%. The query expressions were made of queries in the forms with general and complicated structures that can occur in all XML document structures. One of the factors affecting the query accuracy performance test experiment is the bit stream overflow that the lengths of bit streams expressing each node by level exceed 64 bits as the complexity of documents increases. To solve this problem, the 64 bit bit-stream length should be expanded. Acknowledgements This work was supported by Key Research Institute Program through the National Research Foundation of Korea(NRF) funded by the Ministry of Education, Science and Technology( ) References [1] T. Dao, An indexing model for structured document to support queries on content, structure and attributes, Proc. of IEEE ADL, (1998), pp [2] T. Milo and D. Suciu, Index structures for path expressions, Proc. of 7th International Conference on Database Theory, (1999), pp [3] B. F. Cooper, N. Sample, M. J. Franklin, G. R. Hjaltason and M. Shadmon, A fast index for semistructured data, Proc. of the 27th VLDB Conference, (2001), pp [4] Y. R. Song, K. N. Choo, Y. S. Woo, H. K. Min and W. U. Lee, Multi-indexing system for news stories based on XML documents, Lecture Notes in Computer Science, vol. 3815, (2005), pp [5] C. Zhang, J. Naughton, D. DeWitt, Q. Luo and G. Lohman, On supporting containment queries in relational database management systems, Proc. of the 2001 ACM SIGMOD International Conference on Management of Data, vol. 30, (2001), pp [6] S. Y. Chien, Z. Vagena, D. Zhang, V. J. Tsotras and C. Zaniolo, Efficient structural joins on indexed XML documents, Proc. of the 28th VLDB Conference, (2002), pp [7] M. Yoshikawa, T. Amagasa, T. Shimura and S. Uemura, XRel: a path-based approach to storage and retrieval of XML documents using relational databases, ACM Transaction on Internet Technology, vol. 1, (2001), pp [8] C. W. Chung, J. K. Min and K. S. Shim, APEX: an adaptive path index for XML data, Proc. of the 2002 ACM SIGMOD International Conference on Management of Data, (2002), pp [9] D. Florescu and D. Kossmann, A performance evaluation of alternative mapping schemes for storing XML data in a relational database, INRIA Technical Report 3680, (1999). 136

An Efficient XML Index Structure with Bottom-Up Query Processing

An Efficient XML Index Structure with Bottom-Up Query Processing An Efficient XML Index Structure with Bottom-Up Query Processing Dong Min Seo, Jae Soo Yoo, and Ki Hyung Cho Department of Computer and Communication Engineering, Chungbuk National University, 48 Gaesin-dong,

More information

A FRACTIONAL NUMBER BASED LABELING SCHEME FOR DYNAMIC XML UPDATING

A FRACTIONAL NUMBER BASED LABELING SCHEME FOR DYNAMIC XML UPDATING A FRACTIONAL NUMBER BASED LABELING SCHEME FOR DYNAMIC XML UPDATING Meghdad Mirabi 1, Hamidah Ibrahim 2, Leila Fathi 3,Ali Mamat 4, and Nur Izura Udzir 5 INTRODUCTION 1 Universiti Putra Malaysia, Malaysia,

More information

Full-Text and Structural XML Indexing on B + -Tree

Full-Text and Structural XML Indexing on B + -Tree Full-Text and Structural XML Indexing on B + -Tree Toshiyuki Shimizu 1 and Masatoshi Yoshikawa 2 1 Graduate School of Information Science, Nagoya University shimizu@dl.itc.nagoya-u.ac.jp 2 Information

More information

Data Centric Integrated Framework on Hotel Industry. Bridging XML to Relational Database

Data Centric Integrated Framework on Hotel Industry. Bridging XML to Relational Database Data Centric Integrated Framework on Hotel Industry Bridging XML to Relational Database Introduction extensible Markup Language (XML) is a promising Internet standard for data representation and data exchange

More information

Estimating the Selectivity of XML Path Expression with predicates by Histograms

Estimating the Selectivity of XML Path Expression with predicates by Histograms Estimating the Selectivity of XML Path Expression with predicates by Histograms Yu Wang 1, Haixun Wang 2, Xiaofeng Meng 1, and Shan Wang 1 1 Information School, Renmin University of China, Beijing 100872,

More information

Schema-Based XML-to-SQL Query Translation Using Interval Encoding

Schema-Based XML-to-SQL Query Translation Using Interval Encoding 2011 Eighth International Conference on Information Technology: New Generations Schema-Based XML-to-SQL Query Translation Using Interval Encoding Mustafa Atay Department of Computer Science Winston-Salem

More information

Integrating Path Index with Value Index for XML data

Integrating Path Index with Value Index for XML data Integrating Path Index with Value Index for XML data Jing Wang 1, Xiaofeng Meng 2, Shan Wang 2 1 Institute of Computing Technology, Chinese Academy of Sciences, 100080 Beijing, China cuckoowj@btamail.net.cn

More information

PAPER Full-Text and Structural Indexing of XML Documents on B + -Tree

PAPER Full-Text and Structural Indexing of XML Documents on B + -Tree IEICE TRANS. INF. & SYST., VOL.E89 D, NO.1 JANUARY 2006 237 PAPER Full-Text and Structural Indexing of XML Documents on B + -Tree Toshiyuki SHIMIZU a), Nonmember and Masatoshi YOSHIKAWA b), Member SUMMARY

More information

Relational Index Support for XPath Axes

Relational Index Support for XPath Axes Relational Index Support for XPath Axes Leo Yuen and Chung Keung Poon Department of Computer Science City University of Hong Kong {leo,ckpoon}@cs.cityu.edu.hk Abstract. In this paper, we designed efficient

More information

An Extended Byte Carry Labeling Scheme for Dynamic XML Data

An Extended Byte Carry Labeling Scheme for Dynamic XML Data Available online at www.sciencedirect.com Procedia Engineering 15 (2011) 5488 5492 An Extended Byte Carry Labeling Scheme for Dynamic XML Data YU Sheng a,b WU Minghui a,b, * LIU Lin a,b a School of Computer

More information

Evaluating XPath Queries

Evaluating XPath Queries Chapter 8 Evaluating XPath Queries Peter Wood (BBK) XML Data Management 201 / 353 Introduction When XML documents are small and can fit in memory, evaluating XPath expressions can be done efficiently But

More information

A Clustering-based Scheme for Labeling XML Trees

A Clustering-based Scheme for Labeling XML Trees 84 IJCSNS International Journal of Computer Science and Network Security, VOL.6 No.9A, September 2006 A Clustering-based Scheme for Labeling XML Trees Sadegh Soltan, and Masoud Rahgozar, University of

More information

PathStack : A Holistic Path Join Algorithm for Path Query with Not-predicates on XML Data

PathStack : A Holistic Path Join Algorithm for Path Query with Not-predicates on XML Data PathStack : A Holistic Path Join Algorithm for Path Query with Not-predicates on XML Data Enhua Jiao, Tok Wang Ling, Chee-Yong Chan School of Computing, National University of Singapore {jiaoenhu,lingtw,chancy}@comp.nus.edu.sg

More information

An approach to the model-based fragmentation and relational storage of XML-documents

An approach to the model-based fragmentation and relational storage of XML-documents An approach to the model-based fragmentation and relational storage of XML-documents Christian Süß Fakultät für Mathematik und Informatik, Universität Passau, D-94030 Passau, Germany Abstract A flexible

More information

Storing and Querying XML Documents Without Using Schema Information

Storing and Querying XML Documents Without Using Schema Information Storing and Querying XML Documents Without Using Schema Information Kanda Runapongsa Department of Computer Engineering Khon Kaen University, Thailand krunapon@kku.ac.th Jignesh M. Patel Department of

More information

Path-based XML Relational Storage Approach

Path-based XML Relational Storage Approach Available online at www.sciencedirect.com Physics Procedia 33 (2012 ) 1621 1625 2012 International Conference on Medical Physics and Biomedical Engineering Path-based XML Relational Storage Approach Qi

More information

An Improved Prefix Labeling Scheme: A Binary String Approach for Dynamic Ordered XML

An Improved Prefix Labeling Scheme: A Binary String Approach for Dynamic Ordered XML An Improved Prefix Labeling Scheme: A Binary String Approach for Dynamic Ordered XML Changqing Li and Tok Wang Ling Department of Computer Science, National University of Singapore {lichangq, lingtw}@comp.nus.edu.sg

More information

A Two-Step Approach for Tree-structured XPath Query Reduction

A Two-Step Approach for Tree-structured XPath Query Reduction A Two-Step Approach for Tree-structured XPath Query Reduction Minsoo Lee, Yun-mi Kim, and Yoon-kyung Lee Abstract XML data consists of a very flexible tree-structure which makes it difficult to support

More information

SFilter: A Simple and Scalable Filter for XML Streams

SFilter: A Simple and Scalable Filter for XML Streams SFilter: A Simple and Scalable Filter for XML Streams Abdul Nizar M., G. Suresh Babu, P. Sreenivasa Kumar Indian Institute of Technology Madras Chennai - 600 036 INDIA nizar@cse.iitm.ac.in, sureshbabuau@gmail.com,

More information

A FRAMEWORK FOR EFFICIENT DATA SEARCH THROUGH XML TREE PATTERNS

A FRAMEWORK FOR EFFICIENT DATA SEARCH THROUGH XML TREE PATTERNS A FRAMEWORK FOR EFFICIENT DATA SEARCH THROUGH XML TREE PATTERNS SRIVANI SARIKONDA 1 PG Scholar Department of CSE P.SANDEEP REDDY 2 Associate professor Department of CSE DR.M.V.SIVA PRASAD 3 Principal Abstract:

More information

An Efficient XML Index Technique with Relative Position Coordinate

An Efficient XML Index Technique with Relative Position Coordinate An Efficient XML Index Technique with Relative Position Coordinate Tacgon Kim, Wooseang Kim Dept. of Computer Science, Kwangwoon Universicy Wolye-dong, Nowon-gu, Seoul, Korea Abstract: - Recently, a lot

More information

The Research on Coding Scheme of Binary-Tree for XML

The Research on Coding Scheme of Binary-Tree for XML Available online at www.sciencedirect.com Procedia Engineering 24 (2011 ) 861 865 2011 International Conference on Advances in Engineering The Research on Coding Scheme of Binary-Tree for XML Xiao Ke *

More information

An Implementation of Tree Pattern Matching Algorithms for Enhancement of Query Processing Operations in Large XML Trees

An Implementation of Tree Pattern Matching Algorithms for Enhancement of Query Processing Operations in Large XML Trees An Implementation of Tree Pattern Matching Algorithms for Enhancement of Query Processing Operations in Large XML Trees N. Murugesan 1 and R.Santhosh 2 1 PG Scholar, 2 Assistant Professor, Department of

More information

TwigINLAB: A Decomposition-Matching-Merging Approach To Improving XML Query Processing

TwigINLAB: A Decomposition-Matching-Merging Approach To Improving XML Query Processing American Journal of Applied Sciences 5 (9): 99-25, 28 ISSN 546-9239 28 Science Publications TwigINLAB: A Decomposition-Matching-Merging Approach To Improving XML Query Processing Su-Cheng Haw and Chien-Sing

More information

CSE 530A. B+ Trees. Washington University Fall 2013

CSE 530A. B+ Trees. Washington University Fall 2013 CSE 530A B+ Trees Washington University Fall 2013 B Trees A B tree is an ordered (non-binary) tree where the internal nodes can have a varying number of child nodes (within some range) B Trees When a key

More information

Labeling and Querying Dynamic XML Trees

Labeling and Querying Dynamic XML Trees Labeling and Querying Dynamic XML Trees Jiaheng Lu, Tok Wang Ling School of Computing, National University of Singapore 3 Science Drive 2, Singapore 117543 {lujiahen,lingtw}@comp.nus.edu.sg Abstract With

More information

Investigation into Indexing XML Data Techniques

Investigation into Indexing XML Data Techniques Investigation into Indexing XML Data Techniques Alhadi Klaib, Joan Lu Department of Informatics University of Huddersfield Huddersfield, UK Abstract- The rapid development of XML technology improves the

More information

Indexing XML Data with ToXin

Indexing XML Data with ToXin Indexing XML Data with ToXin Flavio Rizzolo, Alberto Mendelzon University of Toronto Department of Computer Science {flavio,mendel}@cs.toronto.edu Abstract Indexing schemes for semistructured data have

More information

The Study of Genetic Algorithm-based Task Scheduling for Cloud Computing

The Study of Genetic Algorithm-based Task Scheduling for Cloud Computing The Study of Genetic Algorithm-based Task Scheduling for Cloud Computing Sung Ho Jang, Tae Young Kim, Jae Kwon Kim and Jong Sik Lee School of Information Engineering Inha University #253, YongHyun-Dong,

More information

Aggregate Query Processing of Streaming XML Data

Aggregate Query Processing of Streaming XML Data ggregate Query Processing of Streaming XML Data Yaw-Huei Chen and Ming-Chi Ho Department of Computer Science and Information Engineering National Chiayi University {ychen, s0920206@mail.ncyu.edu.tw bstract

More information

Lecture2: Database Environment

Lecture2: Database Environment College of Computer and Information Sciences - Information Systems Dept. Lecture2: Database Environment 1 IS220 : D a t a b a s e F u n d a m e n t a l s Topics Covered Data abstraction Schemas and Instances

More information

Efficient Query Optimization Of XML Tree Pattern Matching By Using Holistic Approach

Efficient Query Optimization Of XML Tree Pattern Matching By Using Holistic Approach P P IJISET - International Journal of Innovative Science, Engineering & Technology, Vol. 2 Issue 7, July 2015. Efficient Query Optimization Of XML Tree Pattern Matching By Using Holistic Approach 1 Miss.

More information

XML Filtering Technologies

XML Filtering Technologies XML Filtering Technologies Introduction Data exchange between applications: use XML Messages processed by an XML Message Broker Examples Publish/subscribe systems [Altinel 00] XML message routing [Snoeren

More information

ISSN: [Lakshmikandan* et al., 6(3): March, 2017] Impact Factor: 4.116

ISSN: [Lakshmikandan* et al., 6(3): March, 2017] Impact Factor: 4.116 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY AN EFFICIENT EFFECTIVE DYNAMIC XML DATA BROADCASTING METHOD IN MOBILE WIRELESS NETWORK USING XPATH QUERIES Mr. A.Lakshmikandan

More information

UPDATING MULTIDIMENSIONAL XML DOCUMENTS 1)

UPDATING MULTIDIMENSIONAL XML DOCUMENTS 1) UPDATING MULTIDIMENSIONAL XML DOCUMENTS ) Nikolaos Fousteris, Manolis Gergatsoulis, Yannis Stavrakas Department of Archive and Library Science, Ionian University, Ioannou Theotoki 72, 4900 Corfu, Greece.

More information

Chapter 13 XML: Extensible Markup Language

Chapter 13 XML: Extensible Markup Language Chapter 13 XML: Extensible Markup Language - Internet applications provide Web interfaces to databases (data sources) - Three-tier architecture Client V Application Programs Webserver V Database Server

More information

A Novel Replication Strategy for Efficient XML Data Broadcast in Wireless Mobile Networks

A Novel Replication Strategy for Efficient XML Data Broadcast in Wireless Mobile Networks JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 32, 309-327 (2016) A Novel Replication Strategy for Efficient XML Data Broadcast in Wireless Mobile Networks ALI BORJIAN BOROUJENI 1 AND MEGHDAD MIRABI 2

More information

A Structural Numbering Scheme for XML Data

A Structural Numbering Scheme for XML Data A Structural Numbering Scheme for XML Data Alfred M. Martin WS2002/2003 February/March 2003 Based on workout made during the EDBT 2002 Workshops Dao Dinh Khal, Masatoshi Yoshikawa, and Shunsuke Uemura

More information

Open Access The Three-dimensional Coding Based on the Cone for XML Under Weaving Multi-documents

Open Access The Three-dimensional Coding Based on the Cone for XML Under Weaving Multi-documents Send Orders for Reprints to reprints@benthamscience.ae 676 The Open Automation and Control Systems Journal, 2014, 6, 676-683 Open Access The Three-dimensional Coding Based on the Cone for XML Under Weaving

More information

XPath. Lecture 36. Robb T. Koether. Wed, Apr 16, Hampden-Sydney College. Robb T. Koether (Hampden-Sydney College) XPath Wed, Apr 16, / 28

XPath. Lecture 36. Robb T. Koether. Wed, Apr 16, Hampden-Sydney College. Robb T. Koether (Hampden-Sydney College) XPath Wed, Apr 16, / 28 XPath Lecture 36 Robb T. Koether Hampden-Sydney College Wed, Apr 16, 2014 Robb T. Koether (Hampden-Sydney College) XPath Wed, Apr 16, 2014 1 / 28 1 XPath 2 Executing XPath Expressions 3 XPath Expressions

More information

A Dynamic Labeling Scheme using Vectors

A Dynamic Labeling Scheme using Vectors A Dynamic Labeling Scheme using Vectors Liang Xu, Zhifeng Bao, Tok Wang Ling School of Computing, National University of Singapore {xuliang, baozhife, lingtw}@comp.nus.edu.sg Abstract. The labeling problem

More information

XML: Extensible Markup Language

XML: Extensible Markup Language XML: Extensible Markup Language CSC 375, Fall 2015 XML is a classic political compromise: it balances the needs of man and machine by being equally unreadable to both. Matthew Might Slides slightly modified

More information

TwigList: Make Twig Pattern Matching Fast

TwigList: Make Twig Pattern Matching Fast TwigList: Make Twig Pattern Matching Fast Lu Qin, Jeffrey Xu Yu, and Bolin Ding The Chinese University of Hong Kong, China {lqin,yu,blding}@se.cuhk.edu.hk Abstract. Twig pattern matching problem has been

More information

Effective Schema-Based XML Query Optimization Techniques

Effective Schema-Based XML Query Optimization Techniques Effective Schema-Based XML Query Optimization Techniques Guoren Wang and Mengchi Liu School of Computer Science Carleton University, Canada {wanggr, mengchi}@scs.carleton.ca Bing Sun, Ge Yu, and Jianhua

More information

L-Tree: a Dynamic Labeling Structure for Ordered XML Data

L-Tree: a Dynamic Labeling Structure for Ordered XML Data L-Tree: a Dynamic Labeling Structure for Ordered XML Data Yi Chen, George A. Mihaila, Rajesh Bordawekar, and Sriram Padmanabhan University of Pennsylvania, yicn@seas.upenn.edu IBM T.J. Watson Research

More information

A System for Storing, Retrieving, Organizing and Managing Web Services Metadata Using Relational Database *

A System for Storing, Retrieving, Organizing and Managing Web Services Metadata Using Relational Database * BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 6, No 1 Sofia 2006 A System for Storing, Retrieving, Organizing and Managing Web Services Metadata Using Relational Database

More information

A Persistent Labelling Scheme for XML and tree Databases 1

A Persistent Labelling Scheme for XML and tree Databases 1 A Persistent Labelling Scheme for XML and tree Databases 1 Alban Gabillon Majirus Fansi 2 Université de Pau et des Pays de l'adour IUT des Pays de l'adour LIUPPA/CSYSEC 40000 Mont-de-Marsan, France alban.gabillon@univ-pau.fr

More information

Trees : Part 1. Section 4.1. Theory and Terminology. A Tree? A Tree? Theory and Terminology. Theory and Terminology

Trees : Part 1. Section 4.1. Theory and Terminology. A Tree? A Tree? Theory and Terminology. Theory and Terminology Trees : Part Section. () (2) Preorder, Postorder and Levelorder Traversals Definition: A tree is a connected graph with no cycles Consequences: Between any two vertices, there is exactly one unique path

More information

A Hybrid Routing Algorithm for an Efficient Shortest Path Decision in Network Routing

A Hybrid Routing Algorithm for an Efficient Shortest Path Decision in Network Routing A Hybrid Routing Algorithm for an Efficient Shortest Path Decision in Network Routing Taehwan Cho, Kyeongseob Kim, Wanoh Yoon and Sangbang Choi* Department of Electronics Engineering, Inha University,

More information

Outline. Depth-first Binary Tree Traversal. Gerênciade Dados daweb -DCC922 - XML Query Processing. Motivation 24/03/2014

Outline. Depth-first Binary Tree Traversal. Gerênciade Dados daweb -DCC922 - XML Query Processing. Motivation 24/03/2014 Outline Gerênciade Dados daweb -DCC922 - XML Query Processing ( Apresentação basedaem material do livro-texto [Abiteboul et al., 2012]) 2014 Motivation Deep-first Tree Traversal Naïve Page-based Storage

More information

A Structural Numbering Scheme for XML Data

A Structural Numbering Scheme for XML Data A Structural Numbering Scheme for XML Data Dao Dinh Kha 1, Masatoshi Yoshikawa 1,2, and Shunsuke Uemura 1 1 Graduate School of Information Science Nara Institute of Science and Technology 8916-5 Takayama,

More information

CBSL A Compressed Binary String Labeling Scheme for Dynamic Update of XML Documents

CBSL A Compressed Binary String Labeling Scheme for Dynamic Update of XML Documents CIT. Journal of Computing and Information Technology, Vol. 26, No. 2, June 2018, 99 114 doi: 10.20532/cit.2018.1003955 99 CBSL A Compressed Binary String Labeling Scheme for Dynamic Update of XML Documents

More information

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis T O Y H Informatics 1: Data & Analysis Lecture 11: Navigating XML using XPath Ian Stark School of Informatics The University of Edinburgh Tuesday 26 February 2013 Semester 2 Week 6 E H U N I V E R S I

More information

Compression of the Stream Array Data Structure

Compression of the Stream Array Data Structure Compression of the Stream Array Data Structure Radim Bača and Martin Pawlas Department of Computer Science, Technical University of Ostrava Czech Republic {radim.baca,martin.pawlas}@vsb.cz Abstract. In

More information

International Journal of Advanced Research in Computer Science and Software Engineering

International Journal of Advanced Research in Computer Science and Software Engineering ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: Twig Pattern Matching Algorithms for XML D.BUJJI BABU 1 Dr. R.SIVA

More information

Ecient XPath Axis Evaluation for DOM Data Structures

Ecient XPath Axis Evaluation for DOM Data Structures Ecient XPath Axis Evaluation for DOM Data Structures Jan Hidders Philippe Michiels University of Antwerp Dept. of Math. and Comp. Science Middelheimlaan 1, BE-2020 Antwerp, Belgium, fjan.hidders,philippe.michielsg@ua.ac.be

More information

Efficient Integration of Structure Indexes of XML

Efficient Integration of Structure Indexes of XML Efficient Integration of Structure Indexes of XML Taro L. Saito Shinichi Morishita University of Tokyo, Japan, {leo, moris}@cb.k.u-tokyo.ac.jp Abstract. Several indexing methods have been proposed to encode

More information

Storing DTD-Conscious XML Data in XEDY

Storing DTD-Conscious XML Data in XEDY Storing DTD-Conscious XL Data in XEDY Sourav S. Bhowmick 1,TayKhimWee 1, Erwin Leonardi 1, and Sanjay adria 2 1 School of Computer Engineering, anyang Technological University Singapore assourav@ntu.edu.sg

More information

A Survey Of Algorithms Related To Xml Based Pattern Matching

A Survey Of Algorithms Related To Xml Based Pattern Matching A Survey Of Algorithms Related To Xml Based Pattern Matching Dr.R.Sivarama Prasad 1, D.Bujji Babu 2, Sk.Habeeb 3, Sd.Jasmin 4 1 Coordinator,International Business Studies, Acharya Nagarjuna University,Guntur,A.P,India,

More information

MAXDOR: Mapping XML Document into Relational Database

MAXDOR: Mapping XML Document into Relational Database 108 The Open Information Systems Journal, 2009, 3, 108-122 MAXDOR: Mapping XML Document into Relational Database Ibrahim Dweib*, Ayman Awadi and Joan Lu Open Access School of Computing and Engineering,

More information

DDE: From Dewey to a Fully Dynamic XML Labeling Scheme

DDE: From Dewey to a Fully Dynamic XML Labeling Scheme : From Dewey to a Fully Dynamic XML Labeling Scheme Liang Xu, Tok Wang Ling, Huayu Wu, Zhifeng Bao School of Computing National University of Singapore {xuliang,lingtw,wuhuayu,baozhife}@compnusedusg ABSTRACT

More information

Concurrency in XML. Concurrency control is a method used to ensure that database transactions are executed in a safe manner.

Concurrency in XML. Concurrency control is a method used to ensure that database transactions are executed in a safe manner. Concurrency in XML Concurrency occurs when two or more execution flows are able to run simultaneously. Concurrency control is a method used to ensure that database transactions are executed in a safe manner.

More information

Indices in XML databases. Hadj Mahboubi. University of Lyon (ERIC Lyon 2) 5 avenue Pierre Mendès-France, Bron Cedex, France

Indices in XML databases. Hadj Mahboubi. University of Lyon (ERIC Lyon 2) 5 avenue Pierre Mendès-France, Bron Cedex, France Indices in XML databases Hadj Mahboubi University of Lyon (ERIC Lyon 2) 5 avenue Pierre Mendès-France, 69676 Bron Cedex, France Phone: +33 478 773 111 Fax: +33 478 772 375 hadj.mahboubi@eric.univ-lyon2.fr

More information

Tree-Pattern Queries on a Lightweight XML Processor

Tree-Pattern Queries on a Lightweight XML Processor Tree-Pattern Queries on a Lightweight XML Processor MIRELLA M. MORO Zografoula Vagena Vassilis J. Tsotras Research partially supported by CAPES, NSF grant IIS 0339032, UC Micro, and Lotus Interworks Outline

More information

Hamilton paths & circuits. Gray codes. Hamilton Circuits. Planar Graphs. Hamilton circuits. 10 Nov 2015

Hamilton paths & circuits. Gray codes. Hamilton Circuits. Planar Graphs. Hamilton circuits. 10 Nov 2015 Hamilton paths & circuits Def. A path in a multigraph is a Hamilton path if it visits each vertex exactly once. Def. A circuit that is a Hamilton path is called a Hamilton circuit. Hamilton circuits Constructing

More information

Web scraping and crawling, open data, markup languages and data shaping. Paolo Boldi Dipartimento di Informatica Università degli Studi di Milano

Web scraping and crawling, open data, markup languages and data shaping. Paolo Boldi Dipartimento di Informatica Università degli Studi di Milano Web scraping and crawling, open data, markup languages and data shaping Paolo Boldi Dipartimento di Informatica Università degli Studi di Milano Data Analysis Three steps Data Analysis Three steps In every

More information

Computational Optimization ISE 407. Lecture 16. Dr. Ted Ralphs

Computational Optimization ISE 407. Lecture 16. Dr. Ted Ralphs Computational Optimization ISE 407 Lecture 16 Dr. Ted Ralphs ISE 407 Lecture 16 1 References for Today s Lecture Required reading Sections 6.5-6.7 References CLRS Chapter 22 R. Sedgewick, Algorithms in

More information

A Data Model for Temporal XML Documents

A Data Model for Temporal XML Documents A Data Model for Temporal XML Documents Toshiyuki Amagasa Masatoshi Yoshikawa Shunsuke Uemura Graduate School of Information Science Nara Institute of Science and Technology 8916 5 Takayama, Ikoma 630

More information

Index Structures for Matching XML Twigs Using Relational Query Processors

Index Structures for Matching XML Twigs Using Relational Query Processors Index Structures for Matching XML Twigs Using Relational Query Processors Zhiyuan Chen University of Maryland at Baltimore County zhchen@umbc.com Nick Koudas AT&T Labs Research koudas@research.att.com

More information

An Adaptive Query Processing Method according to System Environments in Database Broadcasting Systems

An Adaptive Query Processing Method according to System Environments in Database Broadcasting Systems An Query Processing Method according to System Environments in Database Broadcasting Systems M. KASHITA T. TERADA T. HARA Graduate School of Engineering, Cybermedia Center, Graduate School of Information

More information

Querying Tree-Structured Data Using Dimension Graphs

Querying Tree-Structured Data Using Dimension Graphs Querying Tree-Structured Data Using Dimension Graphs Dimitri Theodoratos 1 and Theodore Dalamagas 2 1 Dept. of Computer Science New Jersey Institute of Technology Newark, NJ 07102 dth@cs.njit.edu 2 School

More information

A Survey on Keyword Diversification Over XML Data

A Survey on Keyword Diversification Over XML Data ISSN (Online) : 2319-8753 ISSN (Print) : 2347-6710 International Journal of Innovative Research in Science, Engineering and Technology An ISO 3297: 2007 Certified Organization Volume 6, Special Issue 5,

More information

Motivation for B-Trees

Motivation for B-Trees 1 Motivation for Assume that we use an AVL tree to store about 20 million records We end up with a very deep binary tree with lots of different disk accesses; log2 20,000,000 is about 24, so this takes

More information

Symmetrically Exploiting XML

Symmetrically Exploiting XML Symmetrically Exploiting XML Shuohao Zhang and Curtis Dyreson School of E.E. and Computer Science Washington State University Pullman, Washington, USA The 15 th International World Wide Web Conference

More information

Designing Views to Answer Queries under Set, Bag,and BagSet Semantics

Designing Views to Answer Queries under Set, Bag,and BagSet Semantics Designing Views to Answer Queries under Set, Bag,and BagSet Semantics Rada Chirkova Department of Computer Science, North Carolina State University Raleigh, NC 27695-7535 chirkova@csc.ncsu.edu Foto Afrati

More information

Bottom-Up Evaluation of Twig Join Pattern Queries in XML Document Databases

Bottom-Up Evaluation of Twig Join Pattern Queries in XML Document Databases Bottom-Up Evaluation of Twig Join Pattern Queries in XML Document Databases Yangjun Chen Department of Applied Computer Science University of Winnipeg Winnipeg, Manitoba, Canada R3B 2E9 y.chen@uwinnipeg.ca

More information

A New Way of Generating Reusable Index Labels for Dynamic XML

A New Way of Generating Reusable Index Labels for Dynamic XML A New Way of Generating Reusable Index Labels for Dynamic XML P. Jayanthi, Dr. A. Tamilarasi Department of CSE, Kongu Engineering College, Perundurai 638 052, Erode, Tamilnadu, India. Abstract XML now

More information

RELATIONAL STORAGE FOR XML RULES

RELATIONAL STORAGE FOR XML RULES RELATIONAL STORAGE FOR XML RULES A. A. Abd El-Aziz Research Scholar Dept. of Information Science & Technology Anna University Email: abdelazizahmed@auist.net Professor A. Kannan Dept. of Information Science

More information

XML Storage and Indexing

XML Storage and Indexing XML Storage and Indexing Web Data Management and Distribution Serge Abiteboul Ioana Manolescu Philippe Rigaux Marie-Christine Rousset Pierre Senellart Web Data Management and Distribution http://webdam.inria.fr/textbook

More information

Trees. Carlos Moreno uwaterloo.ca EIT https://ece.uwaterloo.ca/~cmoreno/ece250

Trees. Carlos Moreno uwaterloo.ca EIT https://ece.uwaterloo.ca/~cmoreno/ece250 Carlos Moreno cmoreno @ uwaterloo.ca EIT-4103 https://ece.uwaterloo.ca/~cmoreno/ece250 Standard reminder to set phones to silent/vibrate mode, please! Announcements Part of assignment 3 posted additional

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 27-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 27-1 Slide 27-1 Chapter 27 XML: Extensible Markup Language Chapter Outline Introduction Structured, Semi structured, and Unstructured Data. XML Hierarchical (Tree) Data Model. XML Documents, DTD, and XML Schema.

More information

Uses for Trees About Trees Binary Trees. Trees. Seth Long. January 31, 2010

Uses for Trees About Trees Binary Trees. Trees. Seth Long. January 31, 2010 Uses for About Binary January 31, 2010 Uses for About Binary Uses for Uses for About Basic Idea Implementing Binary Example: Expression Binary Search Uses for Uses for About Binary Uses for Storage Binary

More information

Semistructured Data Store Mapping with XML and Its Reconstruction

Semistructured Data Store Mapping with XML and Its Reconstruction Semistructured Data Store Mapping with XML and Its Reconstruction Enhong CHEN 1 Gongqing WU 1 Gabriela Lindemann 2 Mirjam Minor 2 1 Department of Computer Science University of Science and Technology of

More information

Schemaless Approach of Mapping XML Document into Relational Database

Schemaless Approach of Mapping XML Document into Relational Database Schemaless Approach of Mapping XML Document into Relational Database Ibrahim Dweib 1, Ayman Awadi 2, Seif Elduola Fath Elrhman 1, Joan Lu 1 University of Huddersfield 1 Alkhoja Group 2 ibrahim_thweib@yahoo.c

More information

Answering XML Twig Queries with Automata

Answering XML Twig Queries with Automata Answering XML Twig Queries with Automata Bing Sun, Bo Zhou, Nan Tang, Guoren Wang, Ge Yu, and Fulin Jia Northeastern University, Shenyang, China {sunb,wanggr,yuge,dbgroup}@mail.neu.edu.cn Abstract. XML

More information

Trees. Tree Structure Binary Tree Tree Traversals

Trees. Tree Structure Binary Tree Tree Traversals Trees Tree Structure Binary Tree Tree Traversals The Tree Structure Consists of nodes and edges that organize data in a hierarchical fashion. nodes store the data elements. edges connect the nodes. The

More information

TwigX-Guide: An Efficient Twig Pattern Matching System Extending DataGuide Indexing and Region Encoding Labeling

TwigX-Guide: An Efficient Twig Pattern Matching System Extending DataGuide Indexing and Region Encoding Labeling JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 25, 603-617 (2009) Short Paper TwigX-Guide: An Efficient Twig Pattern Matching System Extending DataGuide Indexing and Region Encoding Labeling Department

More information

On Label Stream Partition for Efficient Holistic Twig Join

On Label Stream Partition for Efficient Holistic Twig Join On Label Stream Partition for Efficient Holistic Twig Join Bo Chen 1, Tok Wang Ling 1,M.TamerÖzsu2, and Zhenzhou Zhu 1 1 School of Computing, National University of Singapore {chenbo, lingtw, zhuzhenz}@comp.nus.edu.sg

More information

STORING-UPDATING AND QUERYING MULTIDIMENSIONAL XML DOCUMENTS USING RELATIONAL DATABASES 1

STORING-UPDATING AND QUERYING MULTIDIMENSIONAL XML DOCUMENTS USING RELATIONAL DATABASES 1 ISBN: 978-972-8924-44-7 2007 IADIS STORING-UPDATING AND QUERYING MULTIDIMENSIONAL XML DOCUMENTS USING RELATIONAL DATABASES 1 Nikolaos Fousteris, Yannis Stavrakas, Manolis Gergatsoulis Department of Archive

More information

Searching SNT in XML Documents Using Reduction Factor

Searching SNT in XML Documents Using Reduction Factor Searching SNT in XML Documents Using Reduction Factor Mary Posonia A Department of computer science, Sathyabama University, Tamilnadu, Chennai, India maryposonia@sathyabamauniversity.ac.in http://www.sathyabamauniversity.ac.in

More information

Lecture 32. No computer use today. Reminders: Homework 11 is due today. Project 6 is due next Friday. Questions?

Lecture 32. No computer use today. Reminders: Homework 11 is due today. Project 6 is due next Friday. Questions? Lecture 32 No computer use today. Reminders: Homework 11 is due today. Project 6 is due next Friday. Questions? Friday, April 1 CS 215 Fundamentals of Programming II - Lecture 32 1 Outline Introduction

More information

PACD: A BITMAP-BASED FRAMEWORK FOR PROCESSING XML DATA

PACD: A BITMAP-BASED FRAMEWORK FOR PROCESSING XML DATA PACD: A BITMAP-BASED FRAMEWORK FOR PROCESSING XML DATA Mohammed Al-Badawi 1, Barry Eaglestone 2, Siobhán North 1 1 Department of Computer Science, The University of Sheffield, Sheffield, UK m.badawi@dcs.shef.ac.uk,s.north@dcs.shef.ac.uk

More information

Storing and Maintaining Semistructured Data Efficiently in an Object-Relational Database

Storing and Maintaining Semistructured Data Efficiently in an Object-Relational Database Storing and Maintaining Semistructured Data Efficiently in an Object-Relational Database Yuanying Mo National University of Singapore moyuanyi@comp.nus.edu.sg Tok Wang Ling National University of Singapore

More information

XML Systems & Benchmarks

XML Systems & Benchmarks XML Systems & Benchmarks Christoph Staudt Peter Chiv Saarland University, Germany July 1st, 2003 Main Goals of our talk Part I Show up how databases and XML come together Make clear the problems that arise

More information

Some aspects of references behaviour when querying XML with XQuery

Some aspects of references behaviour when querying XML with XQuery Some aspects of references behaviour when querying XML with XQuery c B.Khvostichenko boris.khv@pobox.spbu.ru B.Novikov borisnov@acm.org Abstract During the XQuery query evaluation, the query output is

More information

Data Abstractions. National Chiao Tung University Chun-Jen Tsai 05/23/2012

Data Abstractions. National Chiao Tung University Chun-Jen Tsai 05/23/2012 Data Abstractions National Chiao Tung University Chun-Jen Tsai 05/23/2012 Concept of Data Structures How do we store some conceptual structure in a linear memory? For example, an organization chart: 2/32

More information

Kikori-KS: An Effective and Efficient Keyword Search System for Digital Libraries in XML

Kikori-KS: An Effective and Efficient Keyword Search System for Digital Libraries in XML Kikori-KS An Effective and Efficient Keyword Search System for Digital Libraries in XML Toshiyuki Shimizu 1, Norimasa Terada 2, and Masatoshi Yoshikawa 1 1 Graduate School of Informatics, Kyoto University

More information

An XML Routing Synopsis for Unstructured P2P Networks

An XML Routing Synopsis for Unstructured P2P Networks An XML Routing Synopsis for Unstructured P2P Networks Qiang Wang University of Waterloo q6wang@uwaterloo.ca Abhay Kumar Jha IIT, Bombay abhaykj@cse.iitb.ac.in M. Tamer Özsu University of Waterloo tozsu@uwaterloo.ca

More information

Top-k Keyword Search Over Graphs Based On Backward Search

Top-k Keyword Search Over Graphs Based On Backward Search Top-k Keyword Search Over Graphs Based On Backward Search Jia-Hui Zeng, Jiu-Ming Huang, Shu-Qiang Yang 1College of Computer National University of Defense Technology, Changsha, China 2College of Computer

More information

Efficient XML Storage based on DTM for Read-oriented Workloads

Efficient XML Storage based on DTM for Read-oriented Workloads fficient XML Storage based on DTM for Read-oriented Workloads Graduate School of Information Science, Nara Institute of Science and Technology Makoto Yui Jun Miyazaki, Shunsuke Uemura, Hirokazu Kato International

More information