ARQo: The Architecture for an ARQ Static Query Optimizer

Size: px
Start display at page:

Download "ARQo: The Architecture for an ARQ Static Query Optimizer"

Transcription

1 ARQo: The Architecture for an ARQ Static Query Optimizer Markus Stocker, Andy Seaborne Digital Media Systems Laboratory HP Laboratories Bristol HPL June 26, 2007* semantic web, SPARQL, query optimization In this paper we describe the architecture of ARQo, a first approach for SPARQL static query optimization in ARQ. Specifically, we focus on static optimization of BasicGraphPattern (BGP) for in-memory models. Static query optimization is intended as a query rewriting process where the set of triple patterns defined for a BGP are rewritten according to a specific order. We propose a rewriting process according to the estimated execution cost of joined triple patterns in increasing order. Specifically, the estimated execution cost is a function of multiple parameters such as the estimated selectivity of joined triple patterns, the availability of indexes or pre-calculated result sets. * Internal Accession Date Only Approved for External Publication Copyright 2007 Hewlett-Packard Development Company, L.P.

2 ARQo: The Architecture for an ARQ Static Query Optimizer Markus Stocker and Andy Seaborne HP Labs Bristol Abstract. In this paper we describe the architecture of ARQo, a rst approach for SPARQL static query optimization in ARQ. Specically, we focus on static optimization of BasicGraphPattern (BGP) for in-memory models. Static query optimization is intended as a query rewriting process where the set of triple patterns dened for a BGP are rewritten according to a specic order. We propose a rewriting process according to the estimated execution cost of joined triple patterns in increasing order. Specically, the estimated execution cost is a function of multiple parameters such as the estimated selectivity of joined triple patterns, the availability of indexes or precalculated result sets. 1 Introduction Query optimization is an on-going research area for Relational Database Management Systems (RDBMS). Generally, database systems which provide a declarative language to describe what a query should return, require advanced optimization techniques to enable query evaluation within user friendly time frames. Modern database systems implement a number of optimization techniques on different abstraction levels. For instance, the evaluation of Query Execution Plans (QEP) is a fundamental optimization and sophisticated techniques greatly aect the query execution performance. In this paper we present and discuss ARQo, a static query optimizer for ARQ 1. Specically, we focus on static optimization of SPARQL [7] BasicGraph- Patterns (BGP) for in-memory models. While SPARQL query engines such as ARQ and Sesame 2 generally implement index structures for in-memory models to allow fast retrieval of triples, they still lack advanced static ( i.e. query rewriting) optimization techniques to optimize the query at the QEP level. ARQo is a rst proposal to address the problem of SPARQL static query optimization in ARQ. The goal is to nd the QEP which minimizes the execution costs. Thereby, execution costs are a function of multiple parameters which may be arbitrary combined depending whether or not the required information is Contact in case of questions

3 available within the query execution context. For instance, index structures or pre-calculated result sets are examples of such parameters. ARQo implements an extensible architecture for QEP cost estimation. Essentially, the framework enables the implementation of cost estimation techniques and implements a generic strategy to search for the most promising QEP, the QEP which is expected to evaluate the query fastest. Therefore, we formalize the concept of BasicGraphPattern (BGP), QEP plan space and QEP to meet the needs of the architecture. Moreover, we extend the framework with cost estimation techniques which may be used within dierent ARQ deployments depending on the specic settings. Specically, we intend to develop (1) a generic technique based on variable counting which is especially convenient if specialized information about the underlying ontology is unavailable. Further, we intend to implement (2) a more sophisticated technique based on the concept of joined triple pattern selectivity introduced by Bernstein et al. [1]. We intend to implement the approaches of Selectivity Estimation Index (SEI) and Query Pattern Index (QPI) for the estimation of joined triple pattern selectivity which the authors describe in [1]. The remainder of this paper is structured as follows. In Section 2, we briey review some related work. In Section 3, we formalize the concepts required for the ARQo architecture, especially the BGP, the QEP and the plan space. In Section 4, we describe the proposed architecture in more details, focusing on the selected approach to identify the most promising QEP. Finally, in Section 5, we present and discuss the QEP cost estimation extensions we intend to provide. 2 Related Work Static query optimization is an on-going research eld especially for relational database management systems. Years of research have enabled modern database systems to powerful and ne tuned infrastructures which allow interactive querying for most applications. In the literature (IBM System R [8], INGRES [9] POSTGRES 3 ) the problem of nding the best QEP is mainly solved using two dierent approaches, query decomposition or exhaustive search. Query decomposition is a heuristic greedy algorithm that proceeds in a stepwise fashion. The major drawback of this approach is that potentially good plans are ignored by the algorithm. An exhaustive search of the plan space identies every join combination, i.e. all plans for processing two-way joins are considered. The drawback of this approach is the required time for planning, which is bigger than for the query decomposition approach since the algorithm has to scan a potentially huge plan space. The benet of using an exhaustive search approach is that good plans are not overlooked. IBM System R [8] is probably the most prominent and the rst example of a relational database system which considers static query optimization techniques

4 Listing 1.1. Example BasicGraphPattern (LUBM Query 2) 1?X r d f : type ub : GraduateStudent. 2?Y r d f : type ub : U n i v e r s i t y. 3?Z r d f : type ub : Department. 4?X ub : memberof?z. 5?Z ub : suborganizationof?y. 6?X ub : undergraduatedegreefrom?y. The static optimizer of System R implements an exhaustive search of the plan space where the cost of a plan is calculated based on the required disk accesses and, thus, on the selectivity of conditions [6]. The Sesame open source RDF framework 4 uses some general query optimization techniques based on query rewriting according to the specicity of triple patterns. Essentially, the specicity is a function of the number and type of variables dened for the subject, the predicate and the object of a triple pattern. The query planner selects the triple patterns according to their decreasing specicity. Harth et al. propose in [4] an optimized index structure for RDF [2] which is extended with statistical information about the data set in the form of occurrence counts of access patterns. This allows ecient lookup for the result set size of an access pattern and, thus, static query optimization according to the selectivity of access patterns. In our approach, we provide selectivity information about joined patterns, which is fundamental since the selectivity of two patterns may be low when considered independently but the joined pattern may be highly selective. 3 ARQo Formalization In this section we formalize the concepts required for ARQo and the goal of SPARQL static query optimization of BasicGraphPattern (BGP) for in-memory models. To clarify the idea, we provide some gures based on the BGP of Listing 1.1 (i.e. the WHERE clause of the Lehigh University Benchmark (LUBM) [3]). Given a BGP, B, we dene B to be a graph G which is described by the set G of undirected connected graphs. The elements g G are the components of G. For each pair g i, g j G, g i, g j are disconnected. For SPARQL, an undirected connected graph g G is an ordered pair g := (N, E), where N is a set, whose elements are triple patterns (i.e. the nodes of g) and E is a set of unordered pairs of distinct triple patterns which are joined by a common variable (i.e. the edges of g). In Figure 1, we display the undirected connected graph g 1 G for the BGP of Listing 1.1. Since the triple patterns of the BGP are all joined together, G contains only one element, i.e. the connected graph g 1. Note, that the numbering 4 3

5 Fig. 1. Undirected connected graph g 1 G used for the nodes of g 1 corresponds to the numbering of the triple patterns of the BGP in Listing 1.1. For any two nodes (i, j) g 1, there is a path between i and j, if the triple patterns corresponding to i and j are joined together by one or more variables. In SPARQL, the execution order of pairwise disconnected graphs g G is arbitrary, since the overall result set corresponds to the Cartesian product of the result sets for any g G. Thus, the problem of statically optimizing B is a sub-problem of optimizing each g G. Denition 1. The size of g G is the number of nodes of g. Denition 2. A QEP for g G is an ordered set of the nodes of g. Denition 3. For g G we dene the set P as the QEP space of g. The size of P is the total number of QEPs p P, i.e. the number of ordered permutations of the nodes of g. Given the size N of g G, the size of P is N! (for an uniprocessor machine). Thus, even for simple SPARQL queries with a few joined triple patterns, the expanded QEP space P is huge. Therefore, we need an ecient way how to identify the most promising QEP. A QEP for B is described by an unordered set, Q, whose elements p P are QEPs. The size of Q equals the size of G. Thus, the QEP for B is described by the QEP of each component of G. Therefore, we state the optimization problem as the search for the most promising QEP p within the plan space P for each undirected connected graph g G, building, hence, the unordered set Q of QEPs for the components of G, where G is the graph which reects the BGP B. Note, that a QEP p P can be represented as a Directed Acyclic Graph (DAG). We dene D as the set of DAGs d for a g G. For any d D, the unordered set E of joined triple patterns (i.e. edges) of g G is transformed to a set A of directed edges (i.e. arcs). In Figure 2, we show the DAG corresponding to the QEP which executes the BGP of Listing 1.1 top-down. For any two nodes (i, j) d 1, there is a directed 4

6 Fig. 2. DAG d 1 D path between i and j, if the triple patterns corresponding to i and j are joined together by one or more variables and i is executed rst. Hence, there is a clear relationship between the set P of QEPs and the set D of DAGs. More precisely, we can state the following function, f : QEP DAG (1) which is injective, not surjective and not bijective. Thus, a QEP can be uniquely mapped to a DAG, but a DAG is an abstraction for one or more QEPs. Following Equation 1, a QEP may be described as a materialization of an undirected connected graph g G in the form of a directed acyclic graph d D. 4 ARQo Architecture In this section we concisely describe the architecture of ARQo. Essentially, ARQo is a plug-in module for ARQ with three main components: (1) the optimization core, a (2) heuristic broker and an (3) extensible pool of heuristic techniques. 4.1 The Optimization Core The optimization core implements the main technique used to statically optimize SPARQL Basic Graph Patterns (BGP). As described in Section 3, we intend to abstract a BGP as a graph G, where G is a set G of undirected connected graphs, i.e. the components of G. We, hence, optimize each g G, i.e. we search for the most promising materialization of g in the form of a DAG. This task is mainly solved by the following steps. Given a g G, we (1) provide g with estimated execution costs for the nodes and edges. Thereby, costs are estimated by heuristics which are implemented in ARQo and are extensible for user specic purposes. Then, we (2) identify the Minimum Spanning Tree (MST) of g by means of an algorithm which 5

7 returns an ordered set T of joined triple patterns (i.e. edges). For instance, the Kruskal MST algorithm [5] identies the edges of the MST in increasing weight. The elements t T are ordered according to the total cost for each element t, i.e. the edges of g. The total cost of t is calculated as a function of the costs of the edge and the related nodes. Thus, we dene the partial order relation (T, ) with the properties of reexivity, antisymmetry and transitivity. Moreover, we dene an order relation (N, ) for the elements t T, where N is the set of nodes for a specic t T. Thus, we get a set T with edges t which are ordered by increasing estimated total cost, where each t T is a set N of the nodes of t. The set N is a pair of nodes ordered by increasing estimated costs. Finally, we (3) extract the nodes from T to a set Q by selecting the nodes according to the two order relations (T, ) and (N, ). The set Q features a list of distinct nodes, i.e. triple patterns and reects the most promising QEP. Note that the elements q Q, i.e. the nodes of g, represent a specic QEP and, thus, dene a DAG. We may describe Q as a materialization of g in the form of the DAG specied by the elements q Q. In fact, the order relation of Q, which is specied by (T, ) and (N, ), allows to construct the corresponding DAG by selecting the head and tail nodes q Q according to the order relation of Q and the join relationship between node pairings , , , ,874 1, , ,874 2, , Fig. 3. MST for the undirected connected graph g 1 G of Listing 1.1 In Figure 3, we show the undirected connected graph g 1 G for the BGP of Listing 1.1. For each node and edge, we provide the selectivities of the corresponding (joined) triple pattern(s). Further, we highlight the MST. Based on this solution, we may build the ordered set T of edges with the elements, T = {(5, 6), (5, 3), (5, 2), (1, 4), (6, 4)} Finally, the ordered set Q of distinct nodes which reects the most promising QEP contains the elements, Q = {5, 6, 3, 2, 1, 4} 6

8 Fig. 4. DAG of the most promising QEP Listing 1.2. Optimized BasicGraphPattern (LUBM Query 2)?Z ub : suborganizationof?y.?x ub : undergraduatedegreefrom?y.?z r d f : type ub : Department.?Y r d f : type ub : U n i v e r s i t y.?x r d f : type ub : GraduateStudent.?X ub : memberof?z. In Figure 4 we display the DAG which corresponds to the most promising QEP and reects the selected sequence of distinct nodes (i.e. triple patterns) of the set Q, whereas in Listing 1.2 we list the reordered optimized BGP. 4.2 The Heuristic Broker The heuristic broker is a component which selects the best heuristic cost estimation for joined triple pattern according to specic settings. Such settings may be the availability of specialized statistical information about the underlying ontology, cached result sets for joined triple patterns or specialized access paths for triple pattern evaluation. The main goal of the broker is to provide the optimization core with the best available cost estimation heuristic according to the settings, or with the heuristic specically selected by the user. 4.3 The Extensible Pool of Heuristic Techniques The extensible pool of heuristic techniques implements a number of dierent heuristics to estimate the execution cost of (joined) triple patterns. Such heuristics are required by the optimization core to select the most promising QEP. In fact, they provide the estimated execution costs for the nodes and edges of each g G. We implement the third component as a typical framework which allows to extend specic, user-dened, techniques. In Section 5, we describe in more details the techniques we intend to provide out-of-the-box. 7

9 5 ARQo Extensions In this section, we briey introduce the two main heuristics for (joined) triple pattern cost estimation we intend to develop for ARQo. 5.1 Variable Counting The rst heuristic is a simple variable counting technique. The goal is to provide a simple optimization technique also if specialized statistical information about the underlying ontology is not available to ARQo. This heuristic will provide the optimization core with a constant cost of 1.0 for edges. However, the cost of a node will be a function of the number and type of variables dened in the corresponding triple pattern. In fact, a higher execution cost will be assigned to triple patterns with more variables. Moreover, a variable subject leads to a higher execution cost as a variable predicate or object. 5.2 Selectivity Estimation A second heuristic is an implementation of the Selectivity Estimation Index (SEI) and the Query Pattern Index (QPI) rst introduced by Bernstein et al. and described in [1]. The SEI is used to estimate the cost execution of triple patterns, i.e. nodes, whereas the QPI is used to estimate the cost of joins, i.e. edges. Based on specialized statistical indexes about the underlying ontology, the SEI and QPI are expected to result into improved SPARQL static query optimization (as illustrated by the evaluation performed by Bernstein et al. in [1]). Next, we concisely describe the SEI and the QPI and present an approach how both may be combined to estimate more accurately the selectivity of joined triple patterns dened in SPARQL queries. SEI: The Selectivity Estimation Index. The SEI is used to estimate the selectivity [8] of a triple pattern. Given a triple pattern T, the formula sel(t ) = sel(s) sel(p ) sel(o) estimates the selectivity of T. Thus, the selectivity of a triple pattern is calculated as the multiplication of the selectivity for the building blocks of a triple pattern, i.e. the subject S, the predicate P and the object O. We refer to Bernstein et al. [1] for a detailed description of the SEI technique. QPI: The Query Pattern Index. The QPI provides information about the selectivity of joined triple patterns, more precisely, the selectivity of two joined triple patterns. The ontology schema is used to automatically generate the QPI. In fact, the rdfs:domain and rdfs:range properties of rdf:property instances, allow to identify pairs of rdf:property instances which can be joined because of a matching class for the rdfs:domain and rdfs:range properties. 8

10 For a detailed discussion about the QPI technique we refer to Bernstein et al. 5 [1]. Combining the SEI and the QPI. The QPI is essentially a list of two joined triple patterns with variables dened for the subject and the object 6. However, SPARQL queries do not always dene variables for the subject and the object of joined triple patterns. Thus, the QPI enables the retrieval of the selectivity for joined patterns which are an upper bound for the selectivity of the pattern dened in the query. For instance, consider the following SPARQL query (we list only the WHERE clause)?x rdf:type ub:graduatestudent.?x ub:undergraduatedegreefrom?y. The QPI allows a lookup for the selectivity of?x rdf:type?y.?x ub:undergraduatedegreefrom?z. which only approximates the real selectivity of the joined pattern dened in the SPARQL query. In fact, the QPI selectivity of the pattern is an upper bound for the selectivity of any possible query pattern which can be built for the two joined patterns. Thus, we know that the selectivity of a SPARQL query pattern is potentially lower. We intend to use the SEI selectivity estimation to estimate the selectivity for a given subject and object. The QPI selectivity may be reduced using the SEI selectivity estimation by a specic formula. 5.3 Other Heuristics Of course we may think of many other cost estimation heuristic. For instance, we could extend ARQ with a cache for the selectivity of executed query patterns. This incremental cache could be exploited by a corresponding heuristic which considers the cached selectivities while estimating the cost of (joined) triple patterns. As another example, we could extend ARQ with a cache for result sets of query patterns and implement a heuristic which assigns lower execution costs to triple patterns whose result sets are cached. As an example for a user specic setting, if someone provides a simple index about the occurrences of distinct predicates, we may implement a simple heuristic which considers this specic index to estimate the execution cost of triple patterns. 5 The QPI technique is not yet published. However, the authors have submitted both techniques as contribution to the ISWC Note, that the variables are of the set (?x,?y,?z), i.e. the joined triple pattern denes a common variable where the class type matches for the rdfs:domain and rdfs:range properties 9

11 6 Future Work and Conclusions As future work, we intend to implement the proposed ARQo architecture with its three components based on the description provided in this paper. We will investigate dierent cost estimation techniques which allow QEP evaluation also for specialized settings which may include caching of result sets or learning systems which incrementally store selectivity information of query patterns. Finally, we intend to extensively evaluate the performance of ARQ extended with ARQo on dierent data sets and retrieval tasks. We believe, ARQo will provide ARQ a robust static query optimization module for basic graph patterns and in-memory models which is extensible and adaptable to specic settings but will lead to improved performance also for generic ARQ deployments. References 1. Abraham Bernstein, Christoph Kiefer, and Markus Stocker. OptARQ: A SPARQL Optimization Approach based on Triple Pattern Selectivity Estimation. Technical Report i , Department of Informatics, University of Zurich, Dan Brickley and R.V. Guha. Rdf vocabulary description language 1.0: Rdf schema. Technical report, W3C, Y. Guo, Z. Pan, and J. Hein. LUBM: A Benchmark for OWL Knowledge Base Systems. Journal of Web Semantics, 3(2):158182, Andreas Harth and Stefan Decker. Optimized index structures for querying rdf from the web. In 3rd Latin American Web Congress, Jr. Joseph B. Kruskal. On the shortest spanning subtree of a graph and the traveling salesman problem. In Proceedings of the American Mathematical Society, volume 7, pages 4850, Gregory Piatetsky-Shapiro and Charles Connell. Accurate estimation of the number of tuples satisfying a condition. In SIGMOD '84: Proceedings of the 1984 ACM SIGMOD international conference on Management of data, pages , New York, NY, USA, ACM Press. 7. Eric Prud'hommeaux and Andy Seaborne. Sparql query language for rdf. Technical report, W3C, P. Griths Selinger, M. M. Astrahan, D. D. Chamberlin, R. A. Lorie, and T. G. Price. Access path selection in a relational database management system. In SIG- MOD '79: Proceedings of the 1979 ACM SIGMOD international conference on Management of data, pages 2334, New York, NY, USA, ACM Press. 9. Eugene Wong and Karel Yousse. Decomposition a strategy for query processing. ACM Trans. Database Syst., 1(3):223241,

Efficient Optimization of Sparql Basic Graph Pattern

Efficient Optimization of Sparql Basic Graph Pattern Efficient Optimization of Sparql Basic Graph Pattern Ms.M.Manju 1, Mrs. R Gomathi 2 PG Scholar, Department of CSE, Bannari Amman Institute of Technology, Sathyamangalam, Tamilnadu, India 1 Associate Professor/Senior

More information

SPARQL Basic Graph Pattern Optimization Using Selectivity Estimation

SPARQL Basic Graph Pattern Optimization Using Selectivity Estimation SPARQL Basic Graph Pattern Optimization Using Selectivity Estimation Markus Stocker HP Laboratories Bristol United Kingdom markus.stocker@gmail.com Christoph Kiefer Department of Informatics University

More information

SPARQL BGP Optimization For native RDF graph implementations

SPARQL BGP Optimization For native RDF graph implementations SPARQL BGP Optimization For native RDF graph implementations Markus Stocker, HP Laboratories Bristol Manchester, 23. October 2007 About me Markus Stocker Born in Switzerland, 1979, Ascona Languages: De,

More information

A General Approach to Query the Web of Data

A General Approach to Query the Web of Data A General Approach to Query the Web of Data Xin Liu 1 Department of Information Science and Engineering, University of Trento, Trento, Italy liu@disi.unitn.it Abstract. With the development of the Semantic

More information

Index Support for SPARQL

Index Support for SPARQL Index Support for SPARQL Ralf Heese, Ulf Leser, Bastian Quilitz, and Christian Rothe Humboldt-Universität zu Berlin Department of Computer Science (rheese leser quilitz rothe)@informatik.hu-berlin.de Abstract.

More information

Distributed minimum spanning tree problem

Distributed minimum spanning tree problem Distributed minimum spanning tree problem Juho-Kustaa Kangas 24th November 2012 Abstract Given a connected weighted undirected graph, the minimum spanning tree problem asks for a spanning subtree with

More information

SPARQL Query Optimization on Top of DHTs

SPARQL Query Optimization on Top of DHTs SPARQL Query Optimization on Top of DHTs Zoi Kaoudi, Kostis Kyzirakos, and Manolis Koubarakis Dept. of Informatics and Telecommunications National and Kapodistrian University of Athens, Greece Abstract.

More information

An Overview of Cost-based Optimization of Queries with Aggregates

An Overview of Cost-based Optimization of Queries with Aggregates An Overview of Cost-based Optimization of Queries with Aggregates Surajit Chaudhuri Hewlett-Packard Laboratories 1501 Page Mill Road Palo Alto, CA 94304 chaudhuri@hpl.hp.com Kyuseok Shim IBM Almaden Research

More information

FedX: A Federation Layer for Distributed Query Processing on Linked Open Data

FedX: A Federation Layer for Distributed Query Processing on Linked Open Data FedX: A Federation Layer for Distributed Query Processing on Linked Open Data Andreas Schwarte 1, Peter Haase 1,KatjaHose 2, Ralf Schenkel 2, and Michael Schmidt 1 1 fluid Operations AG, Walldorf, Germany

More information

OSDBQ: Ontology Supported RDBMS Querying

OSDBQ: Ontology Supported RDBMS Querying OSDBQ: Ontology Supported RDBMS Querying Cihan Aksoy 1, Erdem Alparslan 1, Selçuk Bozdağ 2, İhsan Çulhacı 3, 1 The Scientific and Technological Research Council of Turkey, Gebze/Kocaeli, Turkey 2 Komtaş

More information

Design and Implementation of an RDF Triple Store

Design and Implementation of an RDF Triple Store Design and Implementation of an RDF Triple Store Ching-Long Yeh and Ruei-Feng Lin Department of Computer Science and Engineering Tatung University 40 Chungshan N. Rd., Sec. 3 Taipei, 04 Taiwan E-mail:

More information

Semantic Processing of Sensor Event Stream by Using External Knowledge Bases

Semantic Processing of Sensor Event Stream by Using External Knowledge Bases Semantic Processing of Sensor Event Stream by Using External Knowledge Bases Short Paper Kia Teymourian and Adrian Paschke Freie Universitaet Berlin, Berlin, Germany {kia, paschke}@inf.fu-berlin.de Abstract.

More information

Dewayne E. Perry. Abstract. An important ingredient in meeting today's market demands

Dewayne E. Perry. Abstract. An important ingredient in meeting today's market demands Maintaining Consistent, Minimal Congurations Dewayne E. Perry Software Production Research, Bell Laboratories 600 Mountain Avenue, Murray Hill, NJ 07974 USA dep@research.bell-labs.com Abstract. An important

More information

A Relaxed Approach to RDF Querying

A Relaxed Approach to RDF Querying A Relaxed Approach to RDF Querying Carlos A. Hurtado churtado@dcc.uchile.cl Department of Computer Science Universidad de Chile Alexandra Poulovassilis, Peter T. Wood {ap,ptw}@dcs.bbk.ac.uk School of Computer

More information

Checking: Trading Space for Time. Divesh Srivastava. AT&T Research. 10]). Given a materialized view dened using database

Checking: Trading Space for Time. Divesh Srivastava. AT&T Research. 10]). Given a materialized view dened using database Materialized View Maintenance and Integrity Constraint Checking: Trading Space for Time Kenneth A. Ross Columbia University kar@cs.columbia.edu Divesh Srivastava AT&T Research divesh@research.att.com S.

More information

Using RDF to Model the Structure and Process of Systems

Using RDF to Model the Structure and Process of Systems Using RDF to Model the Structure and Process of Systems Marko A. Rodriguez Jennifer H. Watkins Johan Bollen Los Alamos National Laboratory {marko,jhw,jbollen}@lanl.gov Carlos Gershenson New England Complex

More information

. The problem: ynamic ata Warehouse esign Ws are dynamic entities that evolve continuously over time. As time passes, new queries need to be answered

. The problem: ynamic ata Warehouse esign Ws are dynamic entities that evolve continuously over time. As time passes, new queries need to be answered ynamic ata Warehouse esign? imitri Theodoratos Timos Sellis epartment of Electrical and Computer Engineering Computer Science ivision National Technical University of Athens Zographou 57 73, Athens, Greece

More information

View-Size Estimation in Self-Organizing Databases (extended abstract)

View-Size Estimation in Self-Organizing Databases (extended abstract) View-Size Estimation in Self-Organizing Databases (extended abstract) Kyoung-Hwa Kim and Rada Chirkova Computer Science Department North Carolina State University Raleigh, NC 27695, USA {kkim,rychirko}@ncsu.edu

More information

Finding Similarity and Comparability from Merged Hetero Data of the Semantic Web by Using Graph Pattern Matching

Finding Similarity and Comparability from Merged Hetero Data of the Semantic Web by Using Graph Pattern Matching Finding Similarity and Comparability from Merged Hetero Data of the Semantic Web by Using Graph Pattern Matching Hiroyuki Sato, Kyoji Iiduka, Takeya Mukaigaito, and Takahiko Murayama Information Sharing

More information

Lecture 2 - Graph Theory Fundamentals - Reachability and Exploration 1

Lecture 2 - Graph Theory Fundamentals - Reachability and Exploration 1 CME 305: Discrete Mathematics and Algorithms Instructor: Professor Aaron Sidford (sidford@stanford.edu) January 11, 2018 Lecture 2 - Graph Theory Fundamentals - Reachability and Exploration 1 In this lecture

More information

A Boolean Expression. Reachability Analysis or Bisimulation. Equation Solver. Boolean. equations.

A Boolean Expression. Reachability Analysis or Bisimulation. Equation Solver. Boolean. equations. A Framework for Embedded Real-time System Design? Jin-Young Choi 1, Hee-Hwan Kwak 2, and Insup Lee 2 1 Department of Computer Science and Engineering, Korea Univerity choi@formal.korea.ac.kr 2 Department

More information

Controlling Access to RDF Graphs

Controlling Access to RDF Graphs Controlling Access to RDF Graphs Giorgos Flouris 1, Irini Fundulaki 1, Maria Michou 1, and Grigoris Antoniou 1,2 1 Institute of Computer Science, FORTH, Greece 2 Computer Science Department, University

More information

On Distributed Querying of Linked Data

On Distributed Querying of Linked Data On Distributed Querying of Linked Data Martin Svoboda, Jakub Stárka, and Irena Mlýnková Martin Svoboda, Jakub Stárka, and Irena Mlýnková XML and Web Engineering Research Group Faculty of Mathematics XML

More information

Enhancing Integrated Layer Processing using Common Case. Anticipation and Data Dependence Analysis. Extended Abstract

Enhancing Integrated Layer Processing using Common Case. Anticipation and Data Dependence Analysis. Extended Abstract Enhancing Integrated Layer Processing using Common Case Anticipation and Data Dependence Analysis Extended Abstract Philippe Oechslin Computer Networking Lab Swiss Federal Institute of Technology DI-LTI

More information

Semantic Cardinality Estimation for Mapping andselecting Queries. in Complex Object Bases y. Marian H. Nodine, Mitch Cherniack

Semantic Cardinality Estimation for Mapping andselecting Queries. in Complex Object Bases y. Marian H. Nodine, Mitch Cherniack Semantic Cardinality Estimation for Mapping andselecting Queries in Complex Object Bases y Marian H. Nodine, Mitch Cherniack Brown University, Providence, RI 02912 email: fmhn,mfcg@cs.brown.edu and Mark

More information

Benchmarking RDF Production Tools

Benchmarking RDF Production Tools Benchmarking RDF Production Tools Martin Svihla and Ivan Jelinek Czech Technical University in Prague, Karlovo namesti 13, Praha 2, Czech republic, {svihlm1, jelinek}@fel.cvut.cz, WWW home page: http://webing.felk.cvut.cz

More information

Graph Databases. Guilherme Fetter Damasio. University of Ontario Institute of Technology and IBM Centre for Advanced Studies IBM Corporation

Graph Databases. Guilherme Fetter Damasio. University of Ontario Institute of Technology and IBM Centre for Advanced Studies IBM Corporation Graph Databases Guilherme Fetter Damasio University of Ontario Institute of Technology and IBM Centre for Advanced Studies Outline Introduction Relational Database Graph Database Our Research 2 Introduction

More information

An RDF Model for Multi-Level Hypertext in Digital Libraries

An RDF Model for Multi-Level Hypertext in Digital Libraries An RDF Model for Multi-Level Hypertext in Digital Libraries Gudrun Fischer, Norbert Fuhr University of Dortmund, Germany 1 Multi-level hypertext in digital libraries A core concept of the Semantic Web

More information

THE EFFECT OF JOIN SELECTIVITIES ON OPTIMAL NESTING ORDER

THE EFFECT OF JOIN SELECTIVITIES ON OPTIMAL NESTING ORDER THE EFFECT OF JOIN SELECTIVITIES ON OPTIMAL NESTING ORDER Akhil Kumar and Michael Stonebraker EECS Department University of California Berkeley, Ca., 94720 Abstract A heuristic query optimizer must choose

More information

Flexible querying for SPARQL

Flexible querying for SPARQL Flexible querying for SPARQL A. Calì, R. Frosini, A. Poulovassilis, P. T. Wood Department of Computer Science and Information Systems, Birkbeck, University of London London Knowledge Lab Overview of the

More information

A SEMANTIC MATCHMAKER SERVICE ON THE GRID

A SEMANTIC MATCHMAKER SERVICE ON THE GRID DERI DIGITAL ENTERPRISE RESEARCH INSTITUTE A SEMANTIC MATCHMAKER SERVICE ON THE GRID Andreas Harth Yu He Hongsuda Tangmunarunkit Stefan Decker Carl Kesselman DERI TECHNICAL REPORT 2004-05-18 MAY 2004 DERI

More information

perspective, logic programs do have a notion of control ow, and the in terms of the central control ow the program embodies.

perspective, logic programs do have a notion of control ow, and the in terms of the central control ow the program embodies. Projections of Logic Programs Using Symbol Mappings Ashish Jain Department of Computer Engineering and Science Case Western Reserve University Cleveland, OH 44106 USA email: jain@ces.cwru.edu Abstract

More information

International Journal of Foundations of Computer Science c World Scientic Publishing Company DFT TECHNIQUES FOR SIZE ESTIMATION OF DATABASE JOIN OPERA

International Journal of Foundations of Computer Science c World Scientic Publishing Company DFT TECHNIQUES FOR SIZE ESTIMATION OF DATABASE JOIN OPERA International Journal of Foundations of Computer Science c World Scientic Publishing Company DFT TECHNIQUES FOR SIZE ESTIMATION OF DATABASE JOIN OPERATIONS KAM_IL SARAC, OMER E GEC_IO GLU, AMR EL ABBADI

More information

Logical reconstruction of RDF and ontology languages

Logical reconstruction of RDF and ontology languages Logical reconstruction of RDF and ontology languages Jos de Bruijn 1, Enrico Franconi 2, and Sergio Tessaris 2 1 Digital Enterprise Research Institute, University of Innsbruck, Austria jos.debruijn@deri.org

More information

Optimizing Queries with Aggregate Views. Abstract. Complex queries, with aggregates, views and nested subqueries

Optimizing Queries with Aggregate Views. Abstract. Complex queries, with aggregates, views and nested subqueries Optimizing Queries with Aggregate Views Surajit Chaudhuri 1 and Kyuseok Shim 2 1 Hewlett-Packard Laboratories, 1501 Page Mill Road, Palo Alto, CA 94304, USA 2 IBM Almaden Research Center, 650 Harry Road,

More information

A Linear Time Algorithm for the Minimum Spanning Tree Problem. on a Planar Graph. Tomomi MATSUI. (January 1994 )

A Linear Time Algorithm for the Minimum Spanning Tree Problem. on a Planar Graph. Tomomi MATSUI. (January 1994 ) A Linear Time Algorithm for the Minimum Spanning Tree Problem on a Planar Graph Tomomi MATSUI (January 994 ) Department of Mathematical Engineering and Information Physics Faculty of Engineering, University

More information

Expressive Querying of Semantic Databases with Incremental Query Rewriting

Expressive Querying of Semantic Databases with Incremental Query Rewriting Expressive Querying of Semantic Databases with Incremental Query Rewriting Alexandre Riazanov, UNB Saint John joint work with Marcelo A. T. Aragão, Manchester Univ. and Central Bank of Brazil AWOSS 10.2,

More information

Grid Resources Search Engine based on Ontology

Grid Resources Search Engine based on Ontology based on Ontology 12 E-mail: emiao_beyond@163.com Yang Li 3 E-mail: miipl606@163.com Weiguang Xu E-mail: miipl606@163.com Jiabao Wang E-mail: miipl606@163.com Lei Song E-mail: songlei@nudt.edu.cn Jiang

More information

contribution of this paper is to demonstrate that rule orderings can also improve eciency by reducing the number of rule applications. In eect, since

contribution of this paper is to demonstrate that rule orderings can also improve eciency by reducing the number of rule applications. In eect, since Rule Ordering in Bottom-Up Fixpoint Evaluation of Logic Programs Raghu Ramakrishnan Divesh Srivastava S. Sudarshan y Computer Sciences Department, University of Wisconsin-Madison, WI 53706, U.S.A. Abstract

More information

Event Object Boundaries in RDF Streams A Position Paper

Event Object Boundaries in RDF Streams A Position Paper Event Object Boundaries in RDF Streams A Position Paper Robin Keskisärkkä and Eva Blomqvist Department of Computer and Information Science Linköping University, Sweden {robin.keskisarkka eva.blomqvist}@liu.se

More information

OWL a glimpse. OWL a glimpse (2) requirements for ontology languages. requirements for ontology languages

OWL a glimpse. OWL a glimpse (2) requirements for ontology languages. requirements for ontology languages OWL a glimpse OWL Web Ontology Language describes classes, properties and relations among conceptual objects lecture 7: owl - introduction of#27# ece#720,#winter# 12# 2# of#27# OWL a glimpse (2) requirements

More information

Select (SumSal > Budget) Join (DName) E4: Aggregate (SUM Salary by DName) Emp. Select (SumSal > Budget) Aggregate (SUM Salary by DName, Budget)

Select (SumSal > Budget) Join (DName) E4: Aggregate (SUM Salary by DName) Emp. Select (SumSal > Budget) Aggregate (SUM Salary by DName, Budget) ACM SIGMOD Conference, June 1996, pp.447{458 Materialized View Maintenance and Integrity Constraint Checking: Trading Space for Time Kenneth A. Ross Columbia University kar@cs.columbia.edu Divesh Srivastava

More information

Practical Aspects of Query Rewriting for OWL 2

Practical Aspects of Query Rewriting for OWL 2 Practical Aspects of Query Rewriting for OWL 2 Héctor Pérez-Urbina, Ian Horrocks, and Boris Motik Oxford University Computing Laboratory, Oxford, England {hector.perez-urbina,ian.horrocks,boris.motik}@comlab.ox.ac.uk

More information

Towards a better insight of RDF triples Ontology-guided Storage system abilities

Towards a better insight of RDF triples Ontology-guided Storage system abilities Towards a better insight of RDF triples Ontology-guided Storage system abilities Olivier Curé 1, David Faye 1,2, Guillaume Blin 1 1 Université Paris-Est, LIGM - UMR CNRS 8049, France {ocure, gblin}@univ-mlv.fr

More information

A faceted lightweight ontology for Earthquake Engineering Research Projects and Experiments

A faceted lightweight ontology for Earthquake Engineering Research Projects and Experiments Eng. Md. Rashedul Hasan email: md.hasan@unitn.it Phone: +39-0461-282571 Fax: +39-0461-282521 SERIES Concluding Workshop - Joint with US-NEES JRC, Ispra, May 28-30, 2013 A faceted lightweight ontology for

More information

The minimum spanning tree problem

The minimum spanning tree problem The minimum spanning tree problem MST is a minimum cost connection problem on graphs The graph can model the connection in a (hydraulic, electric, telecommunication) network: the nodes are the points that

More information

The Minimum Spanning Tree Problem on a Planar Graph. Tomomi MATSUI. (February 1994 ) Department of Mathematical Engineering and Information Physics

The Minimum Spanning Tree Problem on a Planar Graph. Tomomi MATSUI. (February 1994 ) Department of Mathematical Engineering and Information Physics The Minimum Spanning Tree Problem on a Planar Graph Tomomi MATSUI (February 994 ) Department of Mathematical Engineering and Information Physics Faculty of Engineering, University of Tokyo Bunkyo-ku, Tokyo

More information

Egemen Tanin, Tahsin M. Kurc, Cevdet Aykanat, Bulent Ozguc. Abstract. Direct Volume Rendering (DVR) is a powerful technique for

Egemen Tanin, Tahsin M. Kurc, Cevdet Aykanat, Bulent Ozguc. Abstract. Direct Volume Rendering (DVR) is a powerful technique for Comparison of Two Image-Space Subdivision Algorithms for Direct Volume Rendering on Distributed-Memory Multicomputers Egemen Tanin, Tahsin M. Kurc, Cevdet Aykanat, Bulent Ozguc Dept. of Computer Eng. and

More information

Decoupled Software Pipelining in LLVM

Decoupled Software Pipelining in LLVM Decoupled Software Pipelining in LLVM 15-745 Final Project Fuyao Zhao, Mark Hahnenberg fuyaoz@cs.cmu.edu, mhahnenb@andrew.cmu.edu 1 Introduction 1.1 Problem Decoupled software pipelining [5] presents an

More information

Stanford University. cult time understanding the set of supported mediator

Stanford University. cult time understanding the set of supported mediator Computing Capabilities of Mediators Ramana Yerneni, Chen Li, Hector Garcia-Molina, Jerey Ullman Department of Computer Science Stanford University fyerneni,chenli,hector,ullmang@cs.stanford.edu Abstract

More information

Stanford University. Abstract. In data-integration systems, the queries supported by a mediator are aected by the queryprocessing

Stanford University. Abstract. In data-integration systems, the queries supported by a mediator are aected by the queryprocessing Computing Capabilities of Mediators Ramana Yerneni, Chen Li, Hector Garcia-Molina, Jerey Ullman Department of Computer Science Stanford University contact: yerneni@cs.stanford.edu Abstract In data-integration

More information

Cluster-based Instance Consolidation For Subsequent Matching

Cluster-based Instance Consolidation For Subsequent Matching Jennifer Sleeman and Tim Finin, Cluster-based Instance Consolidation For Subsequent Matching, First International Workshop on Knowledge Extraction and Consolidation from Social Media, November 2012, Boston.

More information

FedX: Optimization Techniques for Federated Query Processing on Linked Data. ISWC 2011 October 26 th. Presented by: Ziv Dayan

FedX: Optimization Techniques for Federated Query Processing on Linked Data. ISWC 2011 October 26 th. Presented by: Ziv Dayan FedX: Optimization Techniques for Federated Query Processing on Linked Data ISWC 2011 October 26 th Presented by: Ziv Dayan Andreas Schwarte 1, Peter Haase 1, Katja Hose 2, Ralf Schenkel 2, and Michael

More information

Lecture 9 - Matrix Multiplication Equivalences and Spectral Graph Theory 1

Lecture 9 - Matrix Multiplication Equivalences and Spectral Graph Theory 1 CME 305: Discrete Mathematics and Algorithms Instructor: Professor Aaron Sidford (sidford@stanfordedu) February 6, 2018 Lecture 9 - Matrix Multiplication Equivalences and Spectral Graph Theory 1 In the

More information

Towards Semantic Data Mining

Towards Semantic Data Mining Towards Semantic Data Mining Haishan Liu Department of Computer and Information Science, University of Oregon, Eugene, OR, 97401, USA ahoyleo@cs.uoregon.edu Abstract. Incorporating domain knowledge is

More information

The Logic of the Semantic Web. Enrico Franconi Free University of Bozen-Bolzano, Italy

The Logic of the Semantic Web. Enrico Franconi Free University of Bozen-Bolzano, Italy The Logic of the Semantic Web Enrico Franconi Free University of Bozen-Bolzano, Italy What is this talk about 2 What is this talk about A sort of tutorial of RDF, the core semantic web knowledge representation

More information

2. RDF Semantic Web Basics Semantic Web

2. RDF Semantic Web Basics Semantic Web 2. RDF Semantic Web Basics Semantic Web Prof. Dr. Bernhard Humm Faculty of Computer Science Hochschule Darmstadt University of Applied Sciences Summer semester 2011 1 Agenda Semantic Web Basics Literature

More information

Cost based Query Ordering over OWL Ontologies

Cost based Query Ordering over OWL Ontologies Cost based Query Ordering over OWL Ontologies Ilianna Kollia 1,2 and Birte Glimm 1 1 University of Ulm, Germany, birte.glimm@uni-ulm.de 2 National Technical University of Athens, Greece, ilianna2@mail.ntua.gr

More information

A Framework for Performance Study of Semantic Databases

A Framework for Performance Study of Semantic Databases A Framework for Performance Study of Semantic Databases Xianwei Shen 1 and Vincent Huang 2 1 School of Information and Communication Technology, KTH- Royal Institute of Technology, Kista, Sweden 2 Services

More information

Virtual Multi-homing: On the Feasibility of Combining Overlay Routing with BGP Routing

Virtual Multi-homing: On the Feasibility of Combining Overlay Routing with BGP Routing Virtual Multi-homing: On the Feasibility of Combining Overlay Routing with BGP Routing Zhi Li, Prasant Mohapatra, and Chen-Nee Chuah University of California, Davis, CA 95616, USA {lizhi, prasant}@cs.ucdavis.edu,

More information

Web site Image database. Web site Video database. Web server. Meta-server Meta-search Agent. Meta-DB. Video query. Text query. Web client.

Web site Image database. Web site Video database. Web server. Meta-server Meta-search Agent. Meta-DB. Video query. Text query. Web client. (Published in WebNet 97: World Conference of the WWW, Internet and Intranet, Toronto, Canada, Octobor, 1997) WebView: A Multimedia Database Resource Integration and Search System over Web Deepak Murthy

More information

A Genetic Algorithm for Database Query Optimization. University of Wisconsin West Dayton Street. that satises

A Genetic Algorithm for Database Query Optimization. University of Wisconsin West Dayton Street. that satises A Genetic Algorithm for Database Query Optimization Kristin Bennett Michael C. Ferris Computer Sciences Department University of Wisconsin 1210 West Dayton Street Madison, Wisconsin 53706 Yannis E. Ioannidis

More information

Sindice Widgets: Lightweight embedding of Semantic Web capabilities into existing user applications.

Sindice Widgets: Lightweight embedding of Semantic Web capabilities into existing user applications. Sindice Widgets: Lightweight embedding of Semantic Web capabilities into existing user applications. Adam Westerski, Aftab Iqbal, and Giovanni Tummarello Digital Enterprise Research Institute, NUI Galway,Ireland

More information

A Multidatabase Platform to Support. Prototyping Environments. Omar Boucelma, Jean-Claude Franchitti, and Roger King.

A Multidatabase Platform to Support. Prototyping Environments. Omar Boucelma, Jean-Claude Franchitti, and Roger King. A Multidatabase Platform to Support Prototyping Environments Omar Boucelma, Jean-Claude Franchitti, and Roger King Department of Computer Science University of Colorado at Boulder Campus Box 430 Boulder,

More information

Conclusions and Future Work. We introduce a new method for dealing with the shortage of quality benchmark circuits

Conclusions and Future Work. We introduce a new method for dealing with the shortage of quality benchmark circuits Chapter 7 Conclusions and Future Work 7.1 Thesis Summary. In this thesis we make new inroads into the understanding of digital circuits as graphs. We introduce a new method for dealing with the shortage

More information

Towards Equivalences for Federated SPARQL Queries

Towards Equivalences for Federated SPARQL Queries Towards Equivalences for Federated SPARQL Queries Carlos Buil-Aranda 1? and Axel Polleres 2?? 1 Department of Computer Science, Pontificia Universidad Católica, Chile cbuil@ing.puc.cl 2 Vienna University

More information

Join Query Optimization Techniques for Complex Event Processing Applications

Join Query Optimization Techniques for Complex Event Processing Applications Join Query Optimization Techniques for Complex Event Processing Applications ABSTRACT Ilya Kolchinsky Technion, Israel Institute of Technology Haifa 32000 Israel ikolchin@cs.technion.ac.il Complex event

More information

Introduction to Mathematical Programming IE406. Lecture 16. Dr. Ted Ralphs

Introduction to Mathematical Programming IE406. Lecture 16. Dr. Ted Ralphs Introduction to Mathematical Programming IE406 Lecture 16 Dr. Ted Ralphs IE406 Lecture 16 1 Reading for This Lecture Bertsimas 7.1-7.3 IE406 Lecture 16 2 Network Flow Problems Networks are used to model

More information

A Comparison of MapReduce Join Algorithms for RDF

A Comparison of MapReduce Join Algorithms for RDF A Comparison of MapReduce Join Algorithms for RDF Albert Haque 1 and David Alves 2 Research in Bioinformatics and Semantic Web Lab, University of Texas at Austin 1 Department of Computer Science, 2 Department

More information

Assignment 4. Overview. Prof. Stewart Weiss. CSci 335 Software Design and Analysis III Assignment 4

Assignment 4. Overview. Prof. Stewart Weiss. CSci 335 Software Design and Analysis III Assignment 4 Overview This assignment combines several dierent data abstractions and algorithms that we have covered in class, including priority queues, on-line disjoint set operations, hashing, and sorting. The project

More information

Linked Data Querying through FCA-based Schema Indexing

Linked Data Querying through FCA-based Schema Indexing Linked Data Querying through FCA-based Schema Indexing Dominik Brosius and Steffen Staab Institute for Web Science and Technologies, University of Koblenz-Landau, {dbrosius, staab}@uni-koblenz.de Abstract.

More information

The SPARQL Query Graph Model for Query Optimization

The SPARQL Query Graph Model for Query Optimization The SPARQL Query Graph Model for Query Optimization Olaf Hartig and Ralf Heese Humboldt-Universität zu Berlin Department of Computer Science (hartig rheese)@informatik.hu-berlin.de Abstract. The Semantic

More information

Translation of Sparql to SQL Based on Integrity Constraint

Translation of Sparql to SQL Based on Integrity Constraint Journal of Computational Information Systems 7:2 (2011) 394-402 Available at http://www.jofcis.com Translation of Sparql to SQL Based on Integrity Constraint Xianji CUI 1, 2, Dantong OUYANG 1, 2,, Yuxin

More information

Effective Semantic Search over Huge RDF Data

Effective Semantic Search over Huge RDF Data Effective Semantic Search over Huge RDF Data 1 Dinesh A. Zende, 2 Chavan Ganesh Baban 1 Assistant Professor, 2 Post Graduate Student Vidya Pratisthan s Kamanayan Bajaj Institute of Engineering & Technology,

More information

Database Fundamentals Chapter 1

Database Fundamentals Chapter 1 Database Fundamentals Chapter 1 Class 01: Database Fundamentals 1 What is a Database? The ISO/ANSI SQL Standard does not contain a definition of the term database. In fact, the term is never mentioned

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Integrating Complex Financial Workflows in Oracle Database Xavier Lopez Seamus Hayes Oracle PolarLake, LTD 2 Copyright 2011, Oracle

More information

Persistent Storage System for Efficient Management of OWL Web Ontology *

Persistent Storage System for Efficient Management of OWL Web Ontology * Persistent Storage System for Efficient Management of OWL Web Ontology * Dongwon Jeong 1, Myounghoi Choi 1, Yang-Seung Jeon 2, Youn-Hee Han 3, Laurence T. Yang 4, Young-Sik Jeong 2, and Sung-Kook Han 2

More information

Chapter 9 Graph Algorithms

Chapter 9 Graph Algorithms Introduction graph theory useful in practice represent many real-life problems can be if not careful with data structures Chapter 9 Graph s 2 Definitions Definitions an undirected graph is a finite set

More information

Using ontologies function management

Using ontologies function management for Using ontologies function management Caroline Domerg, Juliette Fabre and Pascal Neveu 22th July 2010 O. Corby C.Faron-Zucker E.Gennari A. Granier I. Mirbel V. Negre A. Tireau Semantic Web tools Ontology

More information

Grouping multiple RDF graphs in the collections

Grouping multiple RDF graphs in the collections Grouping multiple RDF graphs in the collections Dominik Tomaszuk 1 and Henryk Rybi«ski 2 1 Faculty of Mathematics and Informatics, University of Bialystok, Poland dtomaszuk@ii.uwb.edu.pl 2 Institute of

More information

Transforming Data from into DataPile RDF Structure into RDF

Transforming Data from into DataPile RDF Structure into RDF Transforming Data from DataPile Structure Transforming Data from into DataPile RDF Structure into RDF Jiří Jiří Dokulil Charles Faculty of University, Mathematics Faculty and Physics, of Mathematics Charles

More information

Oracle Spatial and Graph: Benchmarking a Trillion Edges RDF Graph ORACLE WHITE PAPER NOVEMBER 2016

Oracle Spatial and Graph: Benchmarking a Trillion Edges RDF Graph ORACLE WHITE PAPER NOVEMBER 2016 Oracle Spatial and Graph: Benchmarking a Trillion Edges RDF Graph ORACLE WHITE PAPER NOVEMBER 2016 Introduction One trillion is a really big number. What could you store with one trillion facts?» 1000

More information

Heuristics-based Query Reordering for Federated Queries in SPARQL 1.1 and SPARQL-LD

Heuristics-based Query Reordering for Federated Queries in SPARQL 1.1 and SPARQL-LD Heuristics-based Query Reordering for Federated Queries in SPARQL 1.1 and SPARQL-LD Thanos Yannakis 1, Pavlos Fafalios 2, and Yannis Tzitzikas 1 1 Computer Science Department, University of Crete, and

More information

Chapter 9 Graph Algorithms

Chapter 9 Graph Algorithms Chapter 9 Graph Algorithms 2 Introduction graph theory useful in practice represent many real-life problems can be if not careful with data structures 3 Definitions an undirected graph G = (V, E) is a

More information

FOUNDATIONS OF SEMANTIC WEB TECHNOLOGIES

FOUNDATIONS OF SEMANTIC WEB TECHNOLOGIES FOUNDATIONS OF SEMANTIC WEB TECHNOLOGIES Semantics of RDF(S) Sebastian Rudolph Dresden, 25 April 2014 Content Overview & XML Introduction into RDF RDFS Syntax & Intuition Tutorial 1 RDFS Semantics RDFS

More information

Proposal for Implementing Linked Open Data on Libraries Catalogue

Proposal for Implementing Linked Open Data on Libraries Catalogue Submitted on: 16.07.2018 Proposal for Implementing Linked Open Data on Libraries Catalogue Esraa Elsayed Abdelaziz Computer Science, Arab Academy for Science and Technology, Alexandria, Egypt. E-mail address:

More information

P a r r o t. P r i m e

P a r r o t. P r i m e To appear in World Automation Congress (WAC'98) Prole Aware Retrieval Optimizer for Continuous Media Cyrus Shahabi, Ali Esmail Dashti, and Shahram Ghandeharizadeh Integrated Media Systems Center and Computer

More information

Benchmarking Reasoners for Multi-Ontology Applications

Benchmarking Reasoners for Multi-Ontology Applications Benchmarking Reasoners for Multi-Ontology Applications Ameet N Chitnis, Abir Qasem and Jeff Heflin Lehigh University, 19 Memorial Drive West, Bethlehem, PA 18015 {anc306, abq2, heflin}@cse.lehigh.edu Abstract.

More information

Ontological Modeling: Part 2

Ontological Modeling: Part 2 Ontological Modeling: Part 2 Terry Halpin LogicBlox This is the second in a series of articles on ontology-based approaches to modeling. The main focus is on popular ontology languages proposed for the

More information

Autolink. A Tool for the Automatic and Semi-Automatic Test Generation

Autolink. A Tool for the Automatic and Semi-Automatic Test Generation Autolink A Tool for the Automatic and Semi-Automatic Test Generation Michael Schmitt, Beat Koch, Jens Grabowski and Dieter Hogrefe University of Lubeck, Institute for Telematics, Ratzeburger Allee 160,

More information

Techniques. IDSIA, Istituto Dalle Molle di Studi sull'intelligenza Articiale. Phone: Fax:

Techniques. IDSIA, Istituto Dalle Molle di Studi sull'intelligenza Articiale. Phone: Fax: Incorporating Learning in Motion Planning Techniques Luca Maria Gambardella and Marc Haex IDSIA, Istituto Dalle Molle di Studi sull'intelligenza Articiale Corso Elvezia 36 - CH - 6900 Lugano Phone: +41

More information

R2RML by Assertion: A Semi-Automatic Tool for Generating Customised R2RML Mappings

R2RML by Assertion: A Semi-Automatic Tool for Generating Customised R2RML Mappings R2RML by Assertion: A Semi-Automatic Tool for Generating Customised R2RML Mappings Luís Eufrasio T. Neto 1, Vânia Maria P. Vidal 1, Marco A. Casanova 2, José Maria Monteiro 1 1 Federal University of Ceará,

More information

Introduction to Optimization

Introduction to Optimization Introduction to Optimization Greedy Algorithms October 28, 2016 École Centrale Paris, Châtenay-Malabry, France Dimo Brockhoff Inria Saclay Ile-de-France 2 Course Overview Date Fri, 7.10.2016 Fri, 28.10.2016

More information

7.3 Spanning trees Spanning trees [ ] 61

7.3 Spanning trees Spanning trees [ ] 61 7.3. Spanning trees [161211-1348 ] 61 7.3 Spanning trees We know that trees are connected graphs with the minimal number of edges. Hence trees become very useful in applications where our goal is to connect

More information

An Entity Based RDF Indexing Schema Using Hadoop And HBase

An Entity Based RDF Indexing Schema Using Hadoop And HBase An Entity Based RDF Indexing Schema Using Hadoop And HBase Fateme Abiri Dept. of Computer Engineering Ferdowsi University Mashhad, Iran Abiri.fateme@stu.um.ac.ir Mohsen Kahani Dept. of Computer Engineering

More information

SWSE: Objects before documents!

SWSE: Objects before documents! Provided by the author(s) and NUI Galway in accordance with publisher policies. Please cite the published version when available. Title SWSE: Objects before documents! Author(s) Harth, Andreas; Hogan,

More information

RDFMatView: Indexing RDF Data using Materialized SPARQL queries

RDFMatView: Indexing RDF Data using Materialized SPARQL queries RDFMatView: Indexing RDF Data using Materialized SPARQL queries Roger Castillo, Christian Rothe, and Ulf Leser Humboldt University of Berlin {castillo,rothe,leser}@informatik.hu-berlin.de http://www.hu-berlin.de/

More information

Semantics-Aware Querying of Web-Distributed RDF(S) Repositories

Semantics-Aware Querying of Web-Distributed RDF(S) Repositories Semantics-Aware Querying of Web-Distributed RDF(S) Repositories Georgia D. Solomou, Dimitrios A. Koutsomitropoulos, Theodore S. Papatheodorou High Performance Systems Laboratory, School of Engineering

More information

Performance Measures for Multi-Graded Relevance

Performance Measures for Multi-Graded Relevance Performance Measures for Multi-Graded Relevance Christian Scheel, Andreas Lommatzsch, and Sahin Albayrak Technische Universität Berlin, DAI-Labor, Germany {christian.scheel,andreas.lommatzsch,sahin.albayrak}@dai-labor.de

More information

SWAD-Europe Deliverable 3.18: RDF Query Standardisation

SWAD-Europe Deliverable 3.18: RDF Query Standardisation SWAD-Europe Deliverable 3.18: RDF Query Standardisation Project name: Semantic Web Advanced Development for Europe (SWAD-Europe) Project Number: IST-2001-34732 Workpackage name: 3 Dissemination and Exploitation

More information