Folder(Inbox) Message Message. Body

Size: px
Start display at page:

Download "Folder(Inbox) Message Message. Body"

Transcription

1 Rening OEM to Improve Features of Query Languages for Semistructured Data Pavel Hlousek Charles University, Faculty of Mathematics and Physics, Prague, Czech Republic Abstract. Semistructured data can be modeled by the graph-oriented data model OEM. Due to its general properties, associated query languages become too complex because they work with cycles in data graphs. On the other hand, there are applications that require manipulating only tree-structured fragments of semistructured data (part-subpart relationships), and preservation of its other relationships. Therefore we rene the OEM model to distinguish between two types of edges. This renement enables to simplify semantics of query expressions and improve default structuring of the query result by keeping a minimal structural context, which the specied nodes had in the source data tree. We document the proposed notions on the language MailQL whose prototype is under development. Introduction Semistructured data can be modeled by the graph-oriented data model OEM (introduced in [Widom et al, 1995 ]), which allows cycles in the data graph. Because query languages over this data need to count with possibility of cycles, they become complex and complicated. Also the size of an answer to a query may contain data that are useless for the user, because with each node there is its whole subgraph returned. This is because data is held by nodes without outgoing edges in OEM model. On the other hand, data is often of tree structure in terms of part-subpart relationship, and cycles in the graph are then caused by references, which represent sort of added information. Therefore, in section, we rene the OEM model, not to lose notion of part-subpart relationships in the cyclic graph. This aects answers to queries, which return with each node its subgraph that represents only its subparts. Because part-subpart relationships form a tree, we do not have to bother with cycles any more, and can make our query language much simpler. This can be very suitable for query languages for XML data, where part-subpart may correspond to the element-subelement relationship and cycles are realized through IDREFS attribute. The fact, that we can work with data as if it were a tree, creates many possibilities for query languages over semistructured data. An outstanding one is presented in section. It illustrates default structuring of result nodes keeping a minimal structural context, these nodes had in the source data graph. Other similar languages, like Lorel [Abiteboul et al, 1996 ], XML-QL [Deutsch et al, 1998 ] or UnQL [Buneman et al, 1996 ], force the user to use some \construct" clause, where he/she explicitly species the result structure. And if they provide some default structuring, then they simply return set of nodes. In the following text, we use an message base as an example of semistructured data, and associated query language MailQL, which was developed as the author's master thesis [Hlousek, 2000 ]. In the message base, part-subpart relationships are e.g. folder-subfolder, folder-message, message- elds, and cycles are caused e.g. by message threads (described later). There are many other areas in semistructured data, where part-subpart relationships can be found: image and its subregions, XML data with elements containing elements, etc. Data Model In this section, we dene OEM model and its renement. OEM Model Object Exchange Model (OEM), rst appearing in TSIMMIS project [Widom et al, 1995 ], is the de facto standard in modeling semistructured data. The following denition of OEM is borrowed from [Abitebout et al, 2000 ].

2 Denition 1: An OEM object is a quadruple (label, oid, type, value), where label is a character string, oid is the object's unique identier, and type is either complex or some identier denoting an atomic type (like integer, string, etc.). When type is complex, then the object is called a complex object, and value is a set of oids. Otherwise, the object is an atomic object, and value is an atomic value of that type. Thus data represented by OEM model is kept by OEM objects of atomic type, which are referred to by OEM objects of complex type. OEM data is usually understood as an oriented graph with labeled nodes, where OEM objects correspond to nodes, and for each node n representing complex OEM object o = (label; oid; complex; value), there are edges leading from n to nodes that represent OEM objects in o's value eld. Figure 1 shows how this model can describe part of our tree-like message base. It is also apparent from the picture, that we use very common modication of the OEM model { labels are attached to edges, rather than nodes. Thus we take our message base as an oriented edge-labeled data graph. Folder(Inbox) Folder(Private) Message Message Field(From) Field(To) Body Figure 1. Part of message base modeled by OEM. The complex nodes are distinguished from atomic ones by empty circles. Denition 2: We say, that m is l-suboject of n, if there exists an edge labeled l leading from n to m. Henceforth, OEM objects are simply called objects. We should also note, that in the following text, we mix terms node and object, while according to OEM denition a node represents an object, and vice versa. Rening OEM Model Now let us turn our attention to our example of message base. Let us enrich our message base model that consists so far only of part-subpart relationships like folder-message, with edges providing us with some added information. Let us add to our model message threads. Message thread of a message involves all replies to message itself and all replies to its replies, etc., as well as all preceding messages, i.e. messages which have our message among its replies, or replies to its replies, etc. In message base model a message thread would be expressed by edges labeled thread, leading to all nodes that represent a message in the message's thread. We mentioned earlier, that with each node, there is its subgraph returned as an answer to a query in other OEM oriented languages. So with each message there must be always all messages from its thread returned. This OEM specic behaviour does not satisfy us, because we might be concerned in the messages only, not caring about their message threads, which might be in some cases very large, and so they could wastefully enlarge the size of the result. Therefore we rene the OEM model to distinguish between two types of edges. This renement is described by Def. 3. Denition 3: (i) Core edges are edges describing the tree structure of data, given by part-subpart relationships. (ii) Secondary edges are edges describing added information. Core (secondary) paths are oriented paths consisting only of core (secondary) edges. Def. 3 says that core edges are edges describing part-subpart relationships (e.g. edges from message to its elds), while secondary edges are edges describing other than part-subpart relationships (e.g. edges from message to its message thread). Denition 4: By core data tree of data graph G = (V; E) we denote its subgraph G 0 = (V; E 0 ), where E 0 E consists of all core edges.

3 We should note, that data graph and core data tree share the same nodes. The only dierence is that core data tree is composed exclusively from core edges, and therefore it is always a tree, because core edges take place where the part-subpart relationship holds, while the complete data graph can contain cycles, which are caused by presence of secondary edges. Here we made a simplication talking about data tree. Instead of tree, we might more generally talk about rooted acyclic graph, but there is no major consequence for this article in distinguishing these two, but number of roots. Henceforth, by root of a data tree we will understand any root in rooted acyclic graph. Now back to our example, by enriching the tree-like message base model with edges that provide some additional information, the data graph is no longer tree: it is a cyclic graph. But using Def. 3 and Def. 4, we do not lose notion of the previous tree in the data graph (due to core edges). Figure 2 illustrates this situation on our message base model enriched with thread-edges. Message Message Thread Thread From Body To Attachment Figure 2. Part of message base with core edges (drawn with thicker lines) and secondary edges (drawn with thinner lines). Message-nodes have their outgoing edges heavily reduced to simplify the gure, normally, there would be many more of them. The introduced renement was done to achieve this feature: let the query language work with the core data tree instead of working with the whole data graph. Specically, with each specied node there will be its subgraph from core data tree in the result. So asking for a message, there will not be all messages from its thread in the result, like in OEM. Furthermore, the result, just before shipped to the user, will be provided with those secondary edges, where both nodes of each secondary edge were preserved. Thus those messages from message's thread, that got into result, will remain connected with thread-edges. Path Expressions Now, when our semistructured data model is rened, we need a way to navigate through the data graph. Nowadays, the most suitable tool seem to be path expressions. Their presence in languages over semistructured data is almost a rule, because of their navigational syntax. If o is object and l is label, then by expression o:l we denote set of l-subobjects of o. We should notice, that o:l always denotes set of objects. This semantics of path expressions is typical for semistructured data. Simple Path Expressions. Simple path expression is expression r:l 1 : : : : l n, where r is root node and l 1 ; : : : ; l n are edge labels. Data path is a sequence of o 0 ; l 1 ; o 1 ; : : : ; l n ; o n, where o i are objects, and for each i there is an edge between o i?1 and o i labeled l i. According to these denitions we can see, that there can be more than one data path that satises some simple path expression. Semantics of simple path expressions is very intuitive. We will explain it on example of root.a.b. Expression root denotes the starting object. Expression root.a denotes set X of all objects, for which there exists an edge leading to them from root and labeled A. Expression root.a.b denotes then set Y of all objects, for which there exists an edge leading to them from any object in X and labeled B. Thus each simple path expression denotes a set of objects, even in case that there is no data path satisfying it. In such a case, path expression denotes an empty set. General Path Expressions. General path expressions enhance the power of simple path expressions by enabling use of wild cards and regular expressions in path expressions. With wild cards we can substitute either an edge label (using %) or a sequence of edge labels (using ). Thus expression root.%.b means root.any label.b, and expression root..z means root.any path.z.

4 Regular expressions enable use of path expressions such as root(.aj.c).b, which matches two simple path expressions root.a.b and root.c.b, and so results in union of two sets of objects. Usually there are many more constructs that are typical for regular expressions, and usage of wild cards could be widen much more as well. But to talk about these is not the goal of our paper. Common Prex. Using path expressions, we often refer to common prex of two or more path expressions. First we dene predicate IsPrex, denition of common prex follows. We should note, that by common prex we always mean the longest common prex. Denition 5: Let pe 1 and pe 2 be path expressions. Predicate IsP refix(pe 1 ; pe 2 ) is true if pe 1 = X:l 1 : : : : l n and pe 2 = X:l 1 : : : : l n+m, where n 0 and m 0. Denition 6: Let pe 1 and pe 2 be path expressions. By common prex we denote path expression pe = X:l 1 : : : :l k, where both IsP refix(pe; pe 1 ) and IsP refix(pe; pe 2 ) are true, and there is no path expression pe 0 = pe:l k+1 such that both IsP refix(pe 0 ; pe 1 ) and IsP refix(pe 0 ; pe 2 ) are true. Example Syntax As introduced earlier, MailQL is a query language over an message base, which we use here as an example of semistructured data modeled by rened OEM model. MailQL queries borrow their syntax from OQL [Cattel et al, 1997 ] and its semistructured data oriented successor Lorel [Abiteboul et al, 1996 ]. SELECT list of path expressions FROM list of aliases for path expressions WHERE boolean expression Example (1) shows a simple query in MailQL, which returns elds From and Date of all messages that contain string 'MailQL' in its Subject eld. SELECT m.from, m.date FROM Inbox.Message: m WHERE m.subject CONTAINS 'MailQL' (1) Note, that FROM clause in a MailQL query plays a dierent role than in other syntactically similar languages: here it only denes aliases (m) for path expressions (Inbox.Message). Before the query is executed, all occurances of aliases are substituted by appropriate path expressions in the SELECT and WHERE clause, therefore the FROM clause is no longer needed after that. Automatic Construction of the Result Structure In this section, we will introduce an interesting use of OEM renement. So far, we considered what should be returned with specied nodes. It means we were inspecting the part of the core data tree on the path from a node to leaves. Now we will switch our attention to the other part: the path from a node to the root of the core data tree. The question is: In what structural relationships should the nodes, specied in SELECT clause of a query, be? Current languages for semistructured data usually provide some \construct" clause which explicitly denes the structure of result. Some of them provide default structuring as well, which means returning a set. But having the core data tree, we can improve default structuring by keeping a minimal structural context, which the specied nodes had in the source data tree. Simply said, path expressions from SELECT clause specify nodes in data graph. All these nodes will be returned (with their subgraphs, as described in ). And we also want all these nodes to stay in the same structural relationships to each other as they did in the source data tree. This could be surely realized even so, that we would preserve whole paths leading to these nodes from the root of the core data tree. But our solution vertically reduces these paths and keeps from them only \interesting" nodes { nodes, in which the path forks to reach specied nodes. Compared with other languages for semistructured data, we might miss strong result restructuring features, but we think, that in many cases the user needs to see minimal structural context of data, which is just what our language provides. Furthermore, all introduced features of our language, could be incorporated into any query language with strong formatting options and thus provide default structuring of result.

5 Minimal Structural Context HLOUSEK: SEMISTRUCTURED DATA, OEM REFINEMENT Let us rst formalize vertical reduction. Denition 7: Let T = (V; E) be a tree, where V is a set of nodes, and E is a set of edges. We say that tree T 0 = (V 0 ; E 0 ) is vertically reduced tree of T, if both following conditions are true: i) V 0 V ii) 8e 0 = (v 0 1 ; v0 2) 2 E 0 it is true, that either (a) e 0 2 E (an edge preserved from T ); or (b) there exists sequence e 1 ; : : : ; e n of edges from E n E 0 and sequence v k2 ; : : : ; v kn of nodes from V n V 0, such that v 0 1 ; e 1; v k2 ; : : : ; v kn ; e n ; v 0 2 is path in T. Denition 7 says, that T 0 was obtained from T by leaving out some nodes in such a way, that if there was a path between two nodes in T, and if both nodes were preserved in T 0, than there must exist a path between them in T 0, as well. Now let us turn our attention to \interesting" nodes. As it was declared in the beginning of this section, the only nodes, that are preserved from paths leading to queried data, are those nodes, where these paths fork. Given two path expressions, we can determine the path expression of nodes, where the paths fork, by nding out common prex of these two path expressions. So by \interesting" nodes we understand those nodes, that are represented by common prexes of path expressions from SELECT clause. To describe the minimal structural context of result nodes, we use data structure called result structure tree (RST), which helps us specify the (vertically reduced) structure of result. Result structure tree is a tree RST = (V; E) with mapping PEI (path expression inx) dened for all its nodes. Each node is mapped by PEI to a part of path expression in such a way, that common prex is empty for each pair of siblings in RST. PEI of root node always maps to an empty path expression. 1 To get the structure of result for a certain query, we take all path expressions from SELECT clause of the query and construct appropriate RST from them. The construction is directed by common prexes of path expressions. We start with the root node, which is always present and is always assigned an empty path expression. First path expression will be represented by child node of root, where PEI of that node will be the path expression itself. When adding every other path expression to RST, we check for common prex with all path expressions, that are already represented by RST. If there is non empty common prex, then some splitting must take place in order not to violate denition of RST. Figure 3 in three steps illustrates, how RST is created for set of path expressions finbox.private.message.to, Inbox.Private.Message.From, Inbox.Private.Nameg. v root v root v root Root.Inbox.Private.Message.To Root.Inbox.Private.Message Root.Inbox.Private To From Message Name To From Figure 3. Example of RST construction. Dotted lines indicate node splitting, text attached to node is the value of the node's PEI. Now inner nodes of RST, which represent common prexes of a path expression in SELECT clause of a query, represent path expressions of \interesting" nodes we spoke above { these are the ones which express the minimal structural context of returned data { while the leaf nodes represent the data, which are to be returned. 1 If the data tree forms a rooted acyclic graph, then RST forms a rooted acyclic graph as well, but it has always just one root, which represents an empty path expression. So if there are more roots getting to result from the core data rooted acyclic graph, then they are represented as children of root in RST.

6 Henceforth, we will denote the structure of result using XML-like syntax (accordingly to [Bray et al, 1998 ]). So the nal structure from Fig. 3 will be written down like this: f<inbox.private> f<name>...</>g f<message> f<to>...</>g f<from>...</>g </Message>g </Inbox.Private>g RST to Result Once we have RST representing the result structure, we can easily generate result tree. Each node from RST may correspond to several nodes from the core data tree (e.g. Inbox.Message denotes a set of messages). Thus result for RST in Fig. 3 can look in XML syntax like this: <Inbox.Private> <Name>Private</> <Message> <To>julius.satinsky@theatre.sk</> <From>fan@home.cz</> </Message> <Message> <To>julius.satinsky@theatre.sk</> <To>milan.lasica@theatre.sk</> <From>fan@home.cz</> </Message> </Inbox.Private> Conclusions Leaning on the fact, that most semistructured data (e.g. XML data) is of tree structure in terms of part-subpart relationship, we rened OEM model not to lose notion of this tree behind a complex data graph with cycles. This renement allows us, and generally all languages over semistructured data, to work with data as if it were a tree, and so allows us not to bother with complexity caused by cycles in the data graph. As another use of our OEM renement, we introduced basics of a query language, that provides default structuring of a query result based on keeping minimal structural context, as a dierent approach to default result structuring of languages over semistructured data with some \construct" clause. The query language presented here (originally MailQL) was designed and implemented for an message base as a part of master thesis and is further under development during author's postgraduate study. References Abiteboul, Quass, McHugh, Widom, and Wiener, The lorel query language for semistructured data, International Journal on Digital Libraries, 1(1):68{88, 1996, Serge Abiteboul and Dan Suciu, Data on the Web: From Relations to Semistructured Data and XML, Data Management Systems, Morgan Kaufmann, rst edition, Peter Buneman, Susan Davidson, Gerd Hillebrand, and Dan Suciu, A query language and optimization techniques for unstructured data, In H. V. Jagadish and Inderpal Singh Mumick, SIGMOD, pages 505{516, ACM Press, Tim Bray, Jean Paoli, and C. M. Sperberg-McQueen, Extensible markup language (XML) 1.0, February 1998, R.G.G. Cattell et al, The Object Database Standard: ODMG 2.0, Morgan Kaufmann Publishers, Inc., Alin Deutsch, Mary Fernandez, Daniela Florescu, Alon Levy, and Dan Suciu, XML-QL: A query language for XML, August 1998, Pavel Hlousek, MailQL: A query language for an message base, Master's thesis, Charles University, Prague, 2000, In czech. J. Widom, H. Garcia-Molina, and Y. Papakonstantinou, Object exchange across heterogeneous information sources, In Proceedings of the Eleventh International Conference on Data Engineering, pages 251{260, Taipei, Taiwan, March 1995.

Semistructured Data Store Mapping with XML and Its Reconstruction

Semistructured Data Store Mapping with XML and Its Reconstruction Semistructured Data Store Mapping with XML and Its Reconstruction Enhong CHEN 1 Gongqing WU 1 Gabriela Lindemann 2 Mirjam Minor 2 1 Department of Computer Science University of Science and Technology of

More information

Edinburgh Research Explorer

Edinburgh Research Explorer Edinburgh Research Explorer Beyond XML Query Languages Citation for published version: Buneman, P, Deutsch, A, Fan, W, Liefke, H, Sahuguet, A & Tan, W-C 1998, Beyond XML Query Languages. in Query Language

More information

Schemas for Integration and Translation of. Structured and Semi-Structured Data?

Schemas for Integration and Translation of. Structured and Semi-Structured Data? Schemas for Integration and Translation of Structured and Semi-Structured Data? Catriel Beeri 1 and Tova Milo 2 1 Hebrew University beeri@cs.huji.ac.il 2 Tel Aviv University milo@math.tau.ac.il 1 Introduction

More information

Interactive Query and Search in Semistructured Databases æ

Interactive Query and Search in Semistructured Databases æ Interactive Query and Search in Semistructured Databases Roy Goldman, Jennifer Widom Stanford University froyg,widomg@cs.stanford.edu www-db.stanford.edu Abstract Semistructured graph-based databases have

More information

Path Query Reduction and Diffusion for Distributed Semi-structured Data Retrieval+

Path Query Reduction and Diffusion for Distributed Semi-structured Data Retrieval+ Path Query Reduction and Diffusion for Distributed Semi-structured Data Retrieval+ Jaehyung Lee, Yon Dohn Chung, Myoung Ho Kim Division of Computer Science, Department of EECS Korea Advanced Institute

More information

Wrapper 2 Wrapper 3. Information Source 2

Wrapper 2 Wrapper 3. Information Source 2 Integration of Semistructured Data Using Outer Joins Koichi Munakata Industrial Electronics & Systems Laboratory Mitsubishi Electric Corporation 8-1-1, Tsukaguchi Hon-machi, Amagasaki, Hyogo, 661, Japan

More information

Browsing in the tsimmis System. Stanford University. into requests the source can execute. The data returned by the source is converted back into the

Browsing in the tsimmis System. Stanford University. into requests the source can execute. The data returned by the source is converted back into the Information Translation, Mediation, and Mosaic-Based Browsing in the tsimmis System SIGMOD Demo Proposal (nal version) Joachim Hammer, Hector Garcia-Molina, Kelly Ireland, Yannis Papakonstantinou, Jerey

More information

Storing and Maintaining Semistructured Data Efficiently in an Object-Relational Database

Storing and Maintaining Semistructured Data Efficiently in an Object-Relational Database Storing and Maintaining Semistructured Data Efficiently in an Object-Relational Database Yuanying Mo National University of Singapore moyuanyi@comp.nus.edu.sg Tok Wang Ling National University of Singapore

More information

Indexing XML Data with ToXin

Indexing XML Data with ToXin Indexing XML Data with ToXin Flavio Rizzolo, Alberto Mendelzon University of Toronto Department of Computer Science {flavio,mendel}@cs.toronto.edu Abstract Indexing schemes for semistructured data have

More information

Some aspects of references behaviour when querying XML with XQuery

Some aspects of references behaviour when querying XML with XQuery Some aspects of references behaviour when querying XML with XQuery c B.Khvostichenko boris.khv@pobox.spbu.ru B.Novikov borisnov@acm.org Abstract During the XQuery query evaluation, the query output is

More information

A taxonomy of race. D. P. Helmbold, C. E. McDowell. September 28, University of California, Santa Cruz. Santa Cruz, CA

A taxonomy of race. D. P. Helmbold, C. E. McDowell. September 28, University of California, Santa Cruz. Santa Cruz, CA A taxonomy of race conditions. D. P. Helmbold, C. E. McDowell UCSC-CRL-94-34 September 28, 1994 Board of Studies in Computer and Information Sciences University of California, Santa Cruz Santa Cruz, CA

More information

A Distributed Query Engine for XML-QL

A Distributed Query Engine for XML-QL A Distributed Query Engine for XML-QL Paramjit Oberoi and Vishal Kathuria University of Wisconsin-Madison {param,vishal}@cs.wisc.edu Abstract: This paper describes a distributed Query Engine for executing

More information

Introduction to Semistructured Data and XML

Introduction to Semistructured Data and XML Introduction to Semistructured Data and XML Chapter 27, Part D Based on slides by Dan Suciu University of Washington Database Management Systems, R. Ramakrishnan 1 How the Web is Today HTML documents often

More information

Inferring Structure in Semistructured Data

Inferring Structure in Semistructured Data Inferring Structure in Semistructured Data SVETLOZAR NESTOROV æ SERGE ABITEBOUL y RAJEEV MOTWANI z Department of Computer Science Stanford University Stanford, CA 94305-9040 fevtimov,abiteboug@db.stanford.edu,

More information

such internal data dependencies can be formally specied. A possible approach to specify

such internal data dependencies can be formally specied. A possible approach to specify Chapter 6 Specication and generation of valid data unit instantiations In this chapter, we discuss the problem of generating valid data unit instantiations. As valid data unit instantiations must adhere

More information

Inferring Structure in Semistructured Data

Inferring Structure in Semistructured Data Inferring Structure in Semistructured Data SVETLOZAR NESTOROV SERGE ABITEBOUL RAJEEV MOTWANI æ Department of Computer Science Stanford University Stanford, CA 94305-9040 fevtimov,abiteboug@db.stanford.edu,

More information

Querying Semantically Tagged Documents. on the World-Wide Web? Yehoshua Sagiv 2. U.C. Berkeley, Berkeley CA

Querying Semantically Tagged Documents. on the World-Wide Web? Yehoshua Sagiv 2. U.C. Berkeley, Berkeley CA Querying Semantically Tagged Documents on the World-Wide Web? Ziv Bar-Yossef 1,Yaron Kanza 2,Yakov Kogan 2,Werner Nutt 3,and Yehoshua Sagiv 2 1 Department of Electrical Engineering and Computer Science,

More information

Introduction to Semistructured Data and XML. Overview. How the Web is Today. Based on slides by Dan Suciu University of Washington

Introduction to Semistructured Data and XML. Overview. How the Web is Today. Based on slides by Dan Suciu University of Washington Introduction to Semistructured Data and XML Based on slides by Dan Suciu University of Washington CS330 Lecture April 8, 2003 1 Overview From HTML to XML DTDs Querying XML: XPath Transforming XML: XSLT

More information

Querying XML Data. Mary Fernandez. AT&T Labs Research David Maier. Oregon Graduate Institute

Querying XML Data. Mary Fernandez. AT&T Labs Research David Maier. Oregon Graduate Institute Querying XML Data Alin Deutsch Univ. of Pennsylvania adeutsch@gradient.cis.upenn.edu Alon Levy University of Washington, Seattle alon@cs.washington.edu Mary Fernandez AT&T Labs Research mff@research.att.com

More information

where is a constant, 0 < <. In other words, the ratio between the shortest and longest paths from a node to a leaf is at least. An BB-tree allows ecie

where is a constant, 0 < <. In other words, the ratio between the shortest and longest paths from a node to a leaf is at least. An BB-tree allows ecie Maintaining -balanced Trees by Partial Rebuilding Arne Andersson Department of Computer Science Lund University Box 8 S-22 00 Lund Sweden Abstract The balance criterion dening the class of -balanced trees

More information

Parallel Program Graphs and their. (fvivek dependence graphs, including the Control Flow Graph (CFG) which

Parallel Program Graphs and their. (fvivek dependence graphs, including the Control Flow Graph (CFG) which Parallel Program Graphs and their Classication Vivek Sarkar Barbara Simons IBM Santa Teresa Laboratory, 555 Bailey Avenue, San Jose, CA 95141 (fvivek sarkar,simonsg@vnet.ibm.com) Abstract. We categorize

More information

UPDATING MULTIDIMENSIONAL XML DOCUMENTS 1)

UPDATING MULTIDIMENSIONAL XML DOCUMENTS 1) UPDATING MULTIDIMENSIONAL XML DOCUMENTS ) Nikolaos Fousteris, Manolis Gergatsoulis, Yannis Stavrakas Department of Archive and Library Science, Ionian University, Ioannou Theotoki 72, 4900 Corfu, Greece.

More information

Semistructured Data: The Tsimmis Experience. Stanford University.

Semistructured Data: The Tsimmis Experience. Stanford University. Semistructured Data: The Tsimmis Experience Joachim Hammer, Jason McHugh, and Hector Garcia-Molina Department of Computer Science Stanford University Stanford, CA 94305{9040 U.S.A. fjoachim,mchughj,hectorg@db.stanford.edu

More information

Extending E-R for Modelling XML Keys

Extending E-R for Modelling XML Keys Extending E-R for Modelling XML Keys Martin Necasky Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic martin.necasky@mff.cuni.cz Jaroslav Pokorny Faculty of Mathematics and

More information

Figure 1. Algorithm for mapping DTD to ER.

Figure 1. Algorithm for mapping DTD to ER. Integrating XML Data with Relational Databases Wang-Chien Lee Gail Mitchell Xin Zhang Λ GTE Laboratories Incorporated 40 Sylvan Road, Waltham, MA 02451, USA E-Mail: fwlee, gmitchell, xzhangg@gte.com Abstract

More information

Querying Spatiotemporal XML Using DataFoX

Querying Spatiotemporal XML Using DataFoX Querying Spatiotemporal XML Using DataFoX Yi Chen Peter Revesz Computer Science and Engineering Department University of Nebraska-Lincoln Lincoln, NE 68588, USA {ychen,revesz}@cseunledu Abstract We describe

More information

Abstract. We address the problem of query rewriting for TSL, a language for querying semistructured data. We

Abstract. We address the problem of query rewriting for TSL, a language for querying semistructured data. We Paper Number P044 Query Rewriting for Semistructured Data Yannis Papakonstantinou y University of California, San Diego yannis@cs.ucsd.edu Vasilis Vassalos z Stanford University vassalos@cs.stanford.edu

More information

XML Query Languages. Content. Slide 1 Norbert Gövert. January 11, XML documents as trees. Slide 2. Overview on XML query languages XQL

XML Query Languages. Content. Slide 1 Norbert Gövert. January 11, XML documents as trees. Slide 2. Overview on XML query languages XQL XML Query Languages Slide 1 Norbert Gövert January 11, 2001 Content Slide 2 XML documents as trees Overview on XML query languages XQL XIRQL: IR extension for XQL 1 XML documents as trees Slide 3

More information

Target-driven merging of Taxonomies

Target-driven merging of Taxonomies Target-driven merging of Taxonomies Salvatore Raunich, Erhard Rahm University of Leipzig Germany {raunich, rahm}@informatik.uni-leipzig.de Abstract The proliferation of ontologies and taxonomies in many

More information

is to provide a seamless blend of browsing and querying modes, in the spirit of Garlic's Pesto interface [CHMW96]. Fig. 2 shows a snapshot of query co

is to provide a seamless blend of browsing and querying modes, in the spirit of Garlic's Pesto interface [CHMW96]. Fig. 2 shows a snapshot of query co View Denition and DTD Inference for XML Bertram Ludascher Yannis Papakonstantinou Pavel Velikhov Victor Vianu ludaesch@sdsc.edu fyannis,pvelikho,vianug@cs.ucsd.edu 1 Introduction We present a query language

More information

composer E work T "1685" "BWV82" "Ich habe genug"

composer E work T 1685 BWV82 Ich habe genug Keys for XML Peter Buneman Susan Davidson y Wenfei Fan z Carmem Hara x Wang-Chiew Tan { (Draft of August 9, 2000) Abstract We discuss the denition of keys for XML documents, paying particular attention

More information

fied by a regular expression [4,7,9,11,23,16]. However, this kind of navigational queries is not completely satisfactory since in many cases we would

fied by a regular expression [4,7,9,11,23,16]. However, this kind of navigational queries is not completely satisfactory since in many cases we would Electronic Notes in Theoretical Computer Science 50 No. 3 (2001) Proc. GT-VMT 2001 URL: http://www.elsevier.nl/locate/entcs/volume50.html 10 pages Graph Grammars for Querying Graph-like Data S. Flesca,

More information

A Boolean Expression. Reachability Analysis or Bisimulation. Equation Solver. Boolean. equations.

A Boolean Expression. Reachability Analysis or Bisimulation. Equation Solver. Boolean. equations. A Framework for Embedded Real-time System Design? Jin-Young Choi 1, Hee-Hwan Kwak 2, and Insup Lee 2 1 Department of Computer Science and Engineering, Korea Univerity choi@formal.korea.ac.kr 2 Department

More information

METAXPath. Utah State University. From the SelectedWorks of Curtis Dyreson. Curtis Dyreson, Utah State University Michael H. Böhen Christian S.

METAXPath. Utah State University. From the SelectedWorks of Curtis Dyreson. Curtis Dyreson, Utah State University Michael H. Böhen Christian S. Utah State University From the SelectedWorks of Curtis Dyreson December, 2001 METAXPath Curtis Dyreson, Utah State University Michael H. Böhen Christian S. Jensen Available at: https://works.bepress.com/curtis_dyreson/11/

More information

. The problem: ynamic ata Warehouse esign Ws are dynamic entities that evolve continuously over time. As time passes, new queries need to be answered

. The problem: ynamic ata Warehouse esign Ws are dynamic entities that evolve continuously over time. As time passes, new queries need to be answered ynamic ata Warehouse esign? imitri Theodoratos Timos Sellis epartment of Electrical and Computer Engineering Computer Science ivision National Technical University of Athens Zographou 57 73, Athens, Greece

More information

Efficient Query Evaluation on Distributed Graph with Hadoop Environment

Efficient Query Evaluation on Distributed Graph with Hadoop Environment 30 (2013 ) Efficient Query Evaluation on Distributed Graph with Hadoop Environment Le-Duc Tung, Quyet Nguyen-Van, Zhenjiang Hu Graph has emerged as a powerful data structure to describe the variety of

More information

XML-QE: A Query Engine for XML Data Soures

XML-QE: A Query Engine for XML Data Soures XML-QE: A Query Engine for XML Data Soures Bruce Jackson, Adiel Yoaz {brucej, adiel}@cs.wisc.edu 1 1. Introduction XML, short for extensible Markup Language, may soon be used extensively for exchanging

More information

Ontology Structure of Elements for Web-based Natural Disaster Preparedness Systems

Ontology Structure of Elements for Web-based Natural Disaster Preparedness Systems Association for Information Systems AIS Electronic Library (AISeL) AMCIS 2007 Proceedings Americas Conference on Information Systems (AMCIS) December 2007 Ontology Structure of Elements for Web-based Natural

More information

Distributed Query Evaluation on Semistructured Data

Distributed Query Evaluation on Semistructured Data Distributed Query Evaluation on Semistructured Data DAN SUCIU University of Washington Semistructured data is modeled as a rooted, labeled graph. The simplest kinds of queries on such data are those which

More information

Integrating Path Index with Value Index for XML data

Integrating Path Index with Value Index for XML data Integrating Path Index with Value Index for XML data Jing Wang 1, Xiaofeng Meng 2, Shan Wang 2 1 Institute of Computing Technology, Chinese Academy of Sciences, 100080 Beijing, China cuckoowj@btamail.net.cn

More information

Introduction to XML. Yanlei Diao UMass Amherst April 17, Slides Courtesy of Ramakrishnan & Gehrke, Dan Suciu, Zack Ives and Gerome Miklau.

Introduction to XML. Yanlei Diao UMass Amherst April 17, Slides Courtesy of Ramakrishnan & Gehrke, Dan Suciu, Zack Ives and Gerome Miklau. Introduction to XML Yanlei Diao UMass Amherst April 17, 2008 Slides Courtesy of Ramakrishnan & Gehrke, Dan Suciu, Zack Ives and Gerome Miklau. 1 Structure in Data Representation Relational data is highly

More information

A Framework for Processing Complex Document-centric XML with Overlapping Structures Ionut E. Iacob and Alex Dekhtyar

A Framework for Processing Complex Document-centric XML with Overlapping Structures Ionut E. Iacob and Alex Dekhtyar A Framework for Processing Complex Document-centric XML with Overlapping Structures Ionut E. Iacob and Alex Dekhtyar ABSTRACT Management of multihierarchical XML encodings has attracted attention of a

More information

Ecient XPath Axis Evaluation for DOM Data Structures

Ecient XPath Axis Evaluation for DOM Data Structures Ecient XPath Axis Evaluation for DOM Data Structures Jan Hidders Philippe Michiels University of Antwerp Dept. of Math. and Comp. Science Middelheimlaan 1, BE-2020 Antwerp, Belgium, fjan.hidders,philippe.michielsg@ua.ac.be

More information

Controlled Access and Dissemination of XML Documents

Controlled Access and Dissemination of XML Documents Controlled Access and Dissemination of XML Documents Elisa Bertino Silvana Castano Elena Ferrari Dip. di Scienze dell'informazione Universita degli Studi di Milano Via Comelico, 39/41 20135 Milano, Italy

More information

SAMOS: an Active Object{Oriented Database System. Stella Gatziu, Klaus R. Dittrich. Database Technology Research Group

SAMOS: an Active Object{Oriented Database System. Stella Gatziu, Klaus R. Dittrich. Database Technology Research Group SAMOS: an Active Object{Oriented Database System Stella Gatziu, Klaus R. Dittrich Database Technology Research Group Institut fur Informatik, Universitat Zurich fgatziu, dittrichg@ifi.unizh.ch to appear

More information

Fixpoint Path Queries

Fixpoint Path Queries Fixpoint Path Queries N. Bidoit and M. Ykhlef LaBRI (U.R.A. 1304 du CNRS) Université Bordeaux I 351, Cours de la Libération, F-33405 Talence email : {Nicole.Bidoit, Mourad.Ykhlef}@labri.u-bordeaux.fr Abstract

More information

For our sample application we have realized a wrapper WWWSEARCH which is able to retrieve HTML-pages from a web server and extract pieces of informati

For our sample application we have realized a wrapper WWWSEARCH which is able to retrieve HTML-pages from a web server and extract pieces of informati Meta Web Search with KOMET Jacques Calmet and Peter Kullmann Institut fur Algorithmen und Kognitive Systeme (IAKS) Fakultat fur Informatik, Universitat Karlsruhe Am Fasanengarten 5, D-76131 Karlsruhe,

More information

Extending XML-RL With Update

Extending XML-RL With Update Extending XML-RL With Update Guoren Wang, Mengchi Liu, and Li Lu School of Computer Science, Carleton University Ottawa, Ontario, Canada, K1S 5B6 {wanggr, mengchi}@scs.carleton.ca, llu@math.carleton.ca

More information

UNIVERSITÄT PADERBORN. ComponentTools

UNIVERSITÄT PADERBORN. ComponentTools UNIVERSITÄT PADERBORN ComponentTools Component Library Concept Project Group ComponentTools pg-components@uni-paderborn.de Alexander Gepting, Joel Greenyer, Andreas Maas, Sebastian Munkelt, Csaba Pales,

More information

Fragmentation of XML Documents

Fragmentation of XML Documents Fragmentation of XML Documents Hui Ma 1, Klaus-Dieter Schewe 2 1 Victoria University of Wellington, School of Engineering and Computer Science, Wellington, New Zealand hui.ma@ecs.vuw.ac.nz 2 Software Competence

More information

A Commit Scheduler for XML Databases

A Commit Scheduler for XML Databases A Commit Scheduler for XML Databases Stijn Dekeyser and Jan Hidders University of Antwerp Abstract. The hierarchical and semistructured nature of XML data may cause complicated update-behavior. Updates

More information

XML Query Processing and Optimization

XML Query Processing and Optimization XML Query Processing and Optimization Bartley D. Richardson Department of Electrical & Computer Engineering and Computer Science University of Cincinnati December 16, 2005 Outline Background XML As A Data

More information

Overview of the Integration Wizard Project for Querying and Managing Semistructured Data in Heterogeneous Sources

Overview of the Integration Wizard Project for Querying and Managing Semistructured Data in Heterogeneous Sources In Proceedings of the Fifth National Computer Science and Engineering Conference (NSEC 2001), Chiang Mai University, Chiang Mai, Thailand, November 2001. Overview of the Integration Wizard Project for

More information

Optimum Alphabetic Binary Trees T. C. Hu and J. D. Morgenthaler Department of Computer Science and Engineering, School of Engineering, University of C

Optimum Alphabetic Binary Trees T. C. Hu and J. D. Morgenthaler Department of Computer Science and Engineering, School of Engineering, University of C Optimum Alphabetic Binary Trees T. C. Hu and J. D. Morgenthaler Department of Computer Science and Engineering, School of Engineering, University of California, San Diego CA 92093{0114, USA Abstract. We

More information

XML-based Components for Federating Multiple Heterogeneous Data Sources

XML-based Components for Federating Multiple Heterogeneous Data Sources XML-based Components for Federating Multiple Heterogeneous Data Sources Georges Gardarin, Fei Sha, Tuyet-Tram Dang-Ngoc To cite this version: Georges Gardarin, Fei Sha, Tuyet-Tram Dang-Ngoc. XML-based

More information

Sara Cohen Yaron Kanza Yakov Kogan. using query languages. Thus, one of the greatest challenges, provided by XML,

Sara Cohen Yaron Kanza Yakov Kogan. using query languages. Thus, one of the greatest challenges, provided by XML, EquiX Easy Querying in XML Databases Sara Cohen Yaron Kanza Yakov Kogan Werner Nutt y Yehoshua Sagiv Alexander Serebrenik 1 Introduction The Web is explored by many users, but only a few of them have experience

More information

Deferred Incremental Refresh of XML Materialized Views: Algorithms and Performance Evaluation *

Deferred Incremental Refresh of XML Materialized Views: Algorithms and Performance Evaluation * Deferred Incremental Refresh of XML Materialized Views: Algorithms and Performance Evaluation * Hyunchul Kang Hosang Sung ChanHo Moon Dept. of Computer Science and Engineering Chung-Ang University Seoul,

More information

perspective, logic programs do have a notion of control ow, and the in terms of the central control ow the program embodies.

perspective, logic programs do have a notion of control ow, and the in terms of the central control ow the program embodies. Projections of Logic Programs Using Symbol Mappings Ashish Jain Department of Computer Engineering and Science Case Western Reserve University Cleveland, OH 44106 USA email: jain@ces.cwru.edu Abstract

More information

Subsumption for XML Types. Abstract. XML data is often used (validated, stored, queried, etc) with respect to dierent types.

Subsumption for XML Types. Abstract. XML data is often used (validated, stored, queried, etc) with respect to dierent types. Subsumption for XML Types Gabriel M. Kuper Bell Laboratories kuper@research.bell-labs.com Jer^ome Simeon Bell Laboratories simeon@research.bell-labs.com Abstract XML data is often used (validated, stored,

More information

Copyright (C) 1997, 1998 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for

Copyright (C) 1997, 1998 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for Copyright (C) 1997, 1998 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided

More information

A Universal Model for XML Information Retrieval

A Universal Model for XML Information Retrieval A Universal Model for XML Information Retrieval Maria Izabel M. Azevedo 1, Lucas Pantuza Amorim 2, and Nívio Ziviani 3 1 Department of Computer Science, State University of Montes Claros, Montes Claros,

More information

PathStack : A Holistic Path Join Algorithm for Path Query with Not-predicates on XML Data

PathStack : A Holistic Path Join Algorithm for Path Query with Not-predicates on XML Data PathStack : A Holistic Path Join Algorithm for Path Query with Not-predicates on XML Data Enhua Jiao, Tok Wang Ling, Chee-Yong Chan School of Computing, National University of Singapore {jiaoenhu,lingtw,chancy}@comp.nus.edu.sg

More information

Local Constraints in Semistructured Data Schemas

Local Constraints in Semistructured Data Schemas Local Constraints in Semistructured Data Schemas Andrea Calì, Diego Calvanese, Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza Via Salaria 113, 00198 Roma,

More information

User Interface Modelling Based on the Graph Transformations of Conceptual Data Model

User Interface Modelling Based on the Graph Transformations of Conceptual Data Model User Interface Modelling Based on the Graph Transformations of Conceptual Data Model Martin Molhanec Department of e-technology, Faculty of Electrical Engineering Czech Technical University in Prague Technická

More information

Catching the Boat with Strudel: Experiences with a Web-Site Management System. Jaewoo Kang. Savera Systems.

Catching the Boat with Strudel: Experiences with a Web-Site Management System. Jaewoo Kang. Savera Systems. Catching the Boat with Strudel: Experiences with a Web-Site Management System Mary Fernandez AT&T Labs m@research.att.com Daniela Florescu Inria Roquencourt dana@rodin.inria.fr Jaewoo Kang Savera Systems

More information

Context-Sensitive Search and Exploration of XML Text. University of Maryland, College Park University of Maryland, College Park MD MD 20742

Context-Sensitive Search and Exploration of XML Text. University of Maryland, College Park University of Maryland, College Park MD MD 20742 Context-Sensitive Search and Exploration of XML Text Thomas Baby Sudarshan S. Chawathe thomas@cs.umd.edu chaw@cs.umd.edu Department of Computer Science Institute for Advanced Computer Studies University

More information

2 Data Reduction Techniques The granularity of reducible information is one of the main criteria for classifying the reduction techniques. While the t

2 Data Reduction Techniques The granularity of reducible information is one of the main criteria for classifying the reduction techniques. While the t Data Reduction - an Adaptation Technique for Mobile Environments A. Heuer, A. Lubinski Computer Science Dept., University of Rostock, Germany Keywords. Reduction. Mobile Database Systems, Data Abstract.

More information

Dependent Object Types - A foundation for Scala's type system

Dependent Object Types - A foundation for Scala's type system Dependent Object Types - A foundation for Scala's type system Draft of January 14, 2010 Do Not Distrubute Martin Odersky, Georey Alan Washburn EPFL Abstract. 1 Introduction This paper presents a proposal

More information

teacher research teach "Joe" "Joe"

teacher research teach Joe Joe On XML Integrity Constraints in the Presence of DTDs Wenfei Fan Bell Laboratories 600 Mountain Avenue Murray Hill, NJ 07974, USA wenfei@research.bell-labs.com Leonid Libkin Department of Computer Science

More information

SXML: an XML document as an S-expression

SXML: an XML document as an S-expression SXML: an XML document as an S-expression Kirill Lisovsky, Dmitry Lizorkin Institute for System Programming RAS, Moscow State University lisovsky@acm.org lizorkin@hotbox.ru Abstract This article is the

More information

FROM XML SCHEMA TO OBJECT-RELATIONAL DATABASE AN XML SCHEMA-DRIVEN MAPPING ALGORITHM

FROM XML SCHEMA TO OBJECT-RELATIONAL DATABASE AN XML SCHEMA-DRIVEN MAPPING ALGORITHM FROM XML SCHEMA TO OBJECT-RELATIONAL DATABASE AN XML SCHEMA-DRIVEN MAPPING ALGORITHM Irena Mlynkova, Jaroslav Pokorny Charles University, Faculty of Mathematics and Physics, Department of Software Engineering

More information

Choosing a Data Model and Query Language for Provenance

Choosing a Data Model and Query Language for Provenance Choosing a Data Model and Query Language for Provenance The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters. Citation Accessed Citable

More information

Structured Information Retrieval in XML documents

Structured Information Retrieval in XML documents Structured Information Retrieval in documents Evangelos Kotsakis Joint Research Center (CCR), TP261, I-21020 Ispra (VA), Italy kotsakis@acm.org ABSTRACT Query languages that take advantage of the document

More information

Querying Tree-Structured Data Using Dimension Graphs

Querying Tree-Structured Data Using Dimension Graphs Querying Tree-Structured Data Using Dimension Graphs Dimitri Theodoratos 1 and Theodore Dalamagas 2 1 Dept. of Computer Science New Jersey Institute of Technology Newark, NJ 07102 dth@cs.njit.edu 2 School

More information

Query Languages and Tools for XML Documents and Databases

Query Languages and Tools for XML Documents and Databases Query Languages and Tools for XML Documents and Databases Sonali Sheth John A. Miller Computer Science Department 415 GSRC University of Georgia Athens, GA 30602-7404 jam@cs.uga.edu 1 Abstract XML (extensible

More information

Chapter 13 XML: Extensible Markup Language

Chapter 13 XML: Extensible Markup Language Chapter 13 XML: Extensible Markup Language - Internet applications provide Web interfaces to databases (data sources) - Three-tier architecture Client V Application Programs Webserver V Database Server

More information

r (1,1) r (2,4) r (2,5) r(2,6) r (1,3) r (1,2)

r (1,1) r (2,4) r (2,5) r(2,6) r (1,3) r (1,2) Routing on Trees via Matchings? Alan Roberts 1, Antonis Symvonis 1 and Louxin Zhang 2 1 Department of Computer Science, University of Sydney, N.S.W. 2006, Australia 2 Department of Computer Science, University

More information

a declarative semantics for Web queries is dened by incorporating Web access into an existing DOOD framework. This allows to integrate Web data with a

a declarative semantics for Web queries is dened by incorporating Web access into an existing DOOD framework. This allows to integrate Web data with a 5th Intl. Conf. on Deductive and Object-Oriented Databases (DOOD'97), December 8{12, 1997, Montreux, Switzerland. LNCS 1341, Springer. On a Declarative Semantics for Web Queries Rainer Himmeroder? Georg

More information

A Formal Approach to Program Modification. Lindsay Groves

A Formal Approach to Program Modification. Lindsay Groves A Formal Approach to Program Modication Lindsay Groves School of Mathematical and Computing Sciences Victoria University of Wellington, New Zealand lindsay@mcs.vuw.ac.nz Abstract This paper presents a

More information

Why and Where: A Characterization of Data Provenance

Why and Where: A Characterization of Data Provenance University of Pennsylvania ScholarlyCommons Departmental Papers (CIS) Department of Computer & Information Science January 2001 Why and Where: A Characterization of Data Provenance Peter Buneman University

More information

XViews: XML views of relational schemas

XViews: XML views of relational schemas SDSC TR-1999-3 XViews: XML views of relational schemas Chaitanya Baru San Diego Supercomputer Center, University of California San Diego La Jolla, CA 92093, USA baru@sdsc.edu October 7, 1999 San Diego

More information

The Niagara Internet Query System

The Niagara Internet Query System The Niagara Internet Query System Jeffrey Naughton, David DeWitt, David Maier, Ashraf Aboulnaga, Jianjun Chen, Leonidas Galanis, Jaewoo Kang, Rajasekar Krishnamurthy, Qiong Luo, Naveen Prakash Ravishankar

More information

SSDDM: Distance Metric for Graph-based Semistructured

SSDDM: Distance Metric for Graph-based Semistructured SSDDM: Distance Metric for Graph-based Semistructured Data István Soós, Tihamér Levendovszky, Hassan Charaf Department of Automation and Applied Informatics Budapest University of Technology and Economics

More information

A Model and a Visual Query Language for Structured Text. handle structure. language. These indices have been studied in literature and their

A Model and a Visual Query Language for Structured Text. handle structure. language. These indices have been studied in literature and their A Model and a Visual Query Language for Structured Text Ricardo Baeza-Yates Gonzalo Navarro Depto. de Ciencias de la Computacion, Universidad de Chile frbaeza,gnavarrog@dcc.uchile.cl Jesus Vegas Pablo

More information

Students Courses Students Courses. Taking. Taking: Taking. Enrolled: Enrolled. Enrolled. CName. Name. "Smith" "Chem3" "Jones"

Students Courses Students Courses. Taking. Taking: Taking. Enrolled: Enrolled. Enrolled. CName. Name. Smith Chem3 Jones Path Constraints on Semistructured and Structured Data Abstract Peter Buneman University of Pennsylvania peter@central.cis.upenn.edu Wenfei Fan y University of Pennsylvania wfan@saul.cis.upenn.edu Scott

More information

o12 references o24 references o29 first last

o12 references o24 references o29 first last Semistructured Data and XML Dan Suciu AT&T Labs suciu@research.att.com 1 Introduction Today much of the existing electronic data lies outside of database management system: it lies in structured documents

More information

A MODEL FOR ADVANCED QUERY CAPABILITY DESCRIPTION IN MEDIATOR SYSTEMS

A MODEL FOR ADVANCED QUERY CAPABILITY DESCRIPTION IN MEDIATOR SYSTEMS A MODEL FOR ADVANCED QUERY CAPABILITY DESCRIPTION IN MEDIATOR SYSTEMS Alberto Pan, Paula Montoto and Anastasio Molano Denodo Technologies, Almirante Fco. Moreno 5 B, 28040 Madrid, Spain Email: apan@denodo.com,

More information

Server 1 Server 2 CPU. mem I/O. allocate rec n read elem. n*47.0. n*20.0. select. n*1.0. write elem. n*26.5 send. n*

Server 1 Server 2 CPU. mem I/O. allocate rec n read elem. n*47.0. n*20.0. select. n*1.0. write elem. n*26.5 send. n* Information Needs in Performance Analysis of Telecommunication Software a Case Study Vesa Hirvisalo Esko Nuutila Helsinki University of Technology Laboratory of Information Processing Science Otakaari

More information

Annex A (Informative) Collected syntax The nonterminal symbols pointer-type, program, signed-number, simple-type, special-symbol, and structured-type

Annex A (Informative) Collected syntax The nonterminal symbols pointer-type, program, signed-number, simple-type, special-symbol, and structured-type Pascal ISO 7185:1990 This online copy of the unextended Pascal standard is provided only as an aid to standardization. In the case of dierences between this online version and the printed version, the

More information

STRUCTURE-BASED QUERY EXPANSION FOR XML SEARCH ENGINE

STRUCTURE-BASED QUERY EXPANSION FOR XML SEARCH ENGINE STRUCTURE-BASED QUERY EXPANSION FOR XML SEARCH ENGINE Wei-ning Qian, Hai-lei Qian, Li Wei, Yan Wang and Ao-ying Zhou Computer Science Department Fudan University Shanghai 200433 E-mail: wnqian@fudan.edu.cn

More information

An Improved Prefix Labeling Scheme: A Binary String Approach for Dynamic Ordered XML

An Improved Prefix Labeling Scheme: A Binary String Approach for Dynamic Ordered XML An Improved Prefix Labeling Scheme: A Binary String Approach for Dynamic Ordered XML Changqing Li and Tok Wang Ling Department of Computer Science, National University of Singapore {lichangq, lingtw}@comp.nus.edu.sg

More information

size, runs an existing induction algorithm on the rst subset to obtain a rst set of rules, and then processes each of the remaining data subsets at a

size, runs an existing induction algorithm on the rst subset to obtain a rst set of rules, and then processes each of the remaining data subsets at a Multi-Layer Incremental Induction Xindong Wu and William H.W. Lo School of Computer Science and Software Ebgineering Monash University 900 Dandenong Road Melbourne, VIC 3145, Australia Email: xindong@computer.org

More information

to automatically generate parallel code for many applications that periodically update shared data structures using commuting operations and/or manipu

to automatically generate parallel code for many applications that periodically update shared data structures using commuting operations and/or manipu Semantic Foundations of Commutativity Analysis Martin C. Rinard y and Pedro C. Diniz z Department of Computer Science University of California, Santa Barbara Santa Barbara, CA 93106 fmartin,pedrog@cs.ucsb.edu

More information

Integrating Information by Outerjoins. Dept. of Computer Science. Stanford University ABSTRACT

Integrating Information by Outerjoins. Dept. of Computer Science. Stanford University ABSTRACT Integrating Information by Outerjoins and Full Disjunctions 1 Anand Rajaraman Jerey D. Ullman Dept. of Computer Science Stanford University ABSTRACT Our motivation is the piecing together tidbits of information

More information

Correctness Criteria Beyond Serializability

Correctness Criteria Beyond Serializability Correctness Criteria Beyond Serializability Mourad Ouzzani Cyber Center, Purdue University http://www.cs.purdue.edu/homes/mourad/ Brahim Medjahed Department of Computer & Information Science, The University

More information

PCO ASPs IUT. Tester. ASPs PCO. PDUs. Test System TCP. ASPs PCO. PDUs IUT. Service Provider. Lower Tester Control Function TCP

PCO ASPs IUT. Tester. ASPs PCO. PDUs. Test System TCP. ASPs PCO. PDUs IUT. Service Provider. Lower Tester Control Function TCP Accepted for Computer Networks & ISDN Systems: Special Issue on Protocol Testing TTCN: Towards a Formal Semantics and Validation of Test Suites Finn Kristoersen Thomas Walter y Abstract TTCN (Tree and

More information

Binary Decision Diagrams

Binary Decision Diagrams Logic and roof Hilary 2016 James Worrell Binary Decision Diagrams A propositional formula is determined up to logical equivalence by its truth table. If the formula has n variables then its truth table

More information

Khoral Research, Inc. Khoros is a powerful, integrated system which allows users to perform a variety

Khoral Research, Inc. Khoros is a powerful, integrated system which allows users to perform a variety Data Parallel Programming with the Khoros Data Services Library Steve Kubica, Thomas Robey, Chris Moorman Khoral Research, Inc. 6200 Indian School Rd. NE Suite 200 Albuquerque, NM 87110 USA E-mail: info@khoral.com

More information

XML in Databases. Albrecht Schmidt. al. Albrecht Schmidt, Aalborg University 1

XML in Databases. Albrecht Schmidt.   al. Albrecht Schmidt, Aalborg University 1 XML in Databases Albrecht Schmidt al@cs.auc.dk http://www.cs.auc.dk/ al Albrecht Schmidt, Aalborg University 1 What is XML? (1) Where is the Life we have lost in living? Where is the wisdom we have lost

More information

DATA MODELS FOR SEMISTRUCTURED DATA

DATA MODELS FOR SEMISTRUCTURED DATA Chapter 2 DATA MODELS FOR SEMISTRUCTURED DATA Traditionally, real world semantics are captured in a data model, and mapped to the database schema. The real world semantics are modeled as constraints and

More information