Comparing path-based and vertically-partitioned RDF databases
|
|
- Patricia Logan
- 5 years ago
- Views:
Transcription
1 11/4/2007 Comparing path-based and vertically-partitioned RDF databases Abstract Preetha Lakshmi & Chris Mueller CSCI 8715 Given the increasing prevalence of RDF data formats for storing and sharing data on the Semantic Web, efficient storage mechanisms for RDF data are also becoming increasingly important. We survey existing storage solutions for RDF data in an RDMS. Two recent and novel storage concepts open the door for significantly better querying efficiency. The first, proposed by Matono, et al (2005), models RDF data as a graph, then stores materialized path expressions for efficient querying. The second, proposed by Abadi, et al (2007), stores RDF triples in a vertically-partitioned column-oriented database, in which each RDF property is stored in a single table. Our objective is to compare these two storage methods to find relative strengths and weaknesses on a wide range of possible queries. Related Work The simplest way to store RDF data is in a triple store, essentially one large table with three columns for subject, predicate, and object. Variations on the triple-store have shown improvements in efficiency and reduced the number of self-joins needed when issuing complex queries. Data Normalization A normalized triple store attempts to improve the efficiency of the triple store. A Statements table, which stores RDF triples in three columns, as well as a Literals table and a Resources table make up the basic table schema. In RDF, literals refer to literal values, such as strings or integers, and Resources refer to URIs or a. The Statements table contains references to items in the Literals and Resources table, reducing disk space usage. A variation on this approach, the denormalized triple store attempts to limit the number of joins that would occur across the Statement, Resources, and Literals tables. Instead of always storing a reference to the Literals and Resource tables, the Statements table will hold the literal or resource within itself so long as that resource or literal is smaller than a certain limit, e.g. < 255 characters. Jena1 made use of normalized triple stores, and Jena2 makes use of denormalized triple stores. (Wilkinson 2003) Oracle also uses normalized tables (Alexander n.d.). Data normalization or denormalization can be a property of any RDF storage scheme. Dynamic Table Schema Another method for storing RDF data is to recreate the RDF schema in a dynamic table schema. With this approach, classes and properties in RDF are mirrored in tables. In the book/author example given above, separate tables would be created for books, authors, and titles. Relationships among the tables will express the RDF triples. One benefit of using this approach is that queries can be made against the RDF schema itself (Matono 2005). A similar dynamic table schema is used by Sesame. Research suggests this method can perform with reasonable efficiency, especially when properties of the underlying DBMS are exploited effectively, e.g. using the object-relational features of Postgres to support native subclassing (Broekstra 2002).
2 There are several problems with the dynamic table schema. It can be inefficient, especially when many self-joins are required during query execution. RDF data cannot be stored without knowing the RDF schema needed to create the table schema. Also, if the RDF schema changes, the table schema must be recreated. (Beckett 2003, Matono 2003) Property Tables In order to reduce the number of self-joins needed for queries, some applications have implemented property tables. In a property table, subjects along with similar properties are stored in denormalized tables like flat structures (Abadi 2007). All the data are stored in the same table and hence eliminates the need for joins. There can be significant overhead when using property tables if certain subjects do not have certain properties applicable to them, as the number of NULL values in a table will increase. This wastes the storage space. Postgres can prove to be better in case of NULL values because it just has a bit representation for NULL values. Another drawback to using property tables is that if queries require joins from many different property tables, or if the property tables were poorly configured, performance can actually be worse than a traditional triple-store. Jena2 implements property tables (Wilkinson 2003). The property class table is a variation on the property table. Instead of storing all the values a single table, triples are split up based on the availability of data so as to avoid storing of NULL values. This saves space. The selection of column names must be made very carefully. This makes implementation more dependent on the kind of data being stored and hence might require changes based on the input data. But a drawback here is, again, that if a query refers to properties from more than one table a join and a union might be required, performance can be degraded. Property class tables are implemented in Jena2 (Wilkinson 2003). Network Model Oracle differs from other RDF storage schemes and stores the RDF graph directly as a part of Oracle Spatial network data model (Wang 2003). Each triple is stored as a separate object. There are three main components to the table schema. RDF_VALUES are the values of all parts of the triples. RDF_NODES stores subjects and objects. RDF_LINKS stores all the link information and properties as well as reification information. Oracle supports bags, which are unordered groups of multiple properties, e.g. Book A was written by X,Y,Z; sequences, which are similar to bags, but with order imposed; as well as alts, which indicates alternative websites for a source, etc. Storage of this information is in the form of a graph. This facilitates making of inferences by traversing through the graph. Execution of RDF queries in Oracle is at the semantic level instead of the structural and syntactic level. (Alexander n.d.) Path-based Storage Matono, et al (2005) propose storage of RDF graph traversals in a path-based storage (PBS). Path types between nodes and the root nodes are stored in a path table. Instances of given path types are stored in a resource table, which stores a triple's object value, as well as a path_id that indicates the object's location in the graph with respect to a root node. Several schema-level tables are also introduced, which store information about RDF class hierarchy. See figure A for the full table schema.
3 Figure A: Path-based storage scheme Vertical Partitioning More recently, vertical partitioning (VP) has been proposed as an alternative to the schemes mentioned above (Abadi 2007). Here RDF values are stored in tables with two columns. Each table stores exactly one predicate, and rows in the table contain the subjects and objects that use that predicate. Multivalued properties can be captured easily by creating another row, and NULL values are avoided altogether. Since these tables are sorted and indexed based on a dictionary-encoded subject, the can be joined efficiently. Property inferences can be made easily by joining the necessary tables. If inferences are known to be frequent operations, they can be computed and stored in advance. VP can be done in relational databases such as Postgres, but column-oriented databases, such as C-Store, offer significant advantages in terms of storage and query response time for these vertically partitioned tables. Abadi, et. al, (2007) have shown two orders of magnitude of performance increase with VP in C-Store when compared with existing property-table schemes in other databases. Our Contribution Since the authors of PBS and the VP have shown their work to be experimentally better than standard storage models such as Jena's property tables. We observe which factors are most important when comparing PBS and VP across a wide range of query types. We have observed that, in general, PBS performs better when answering queries related to the RDF schema. VP is usually more efficient for retrieving RDF instance data, even when multiple joins are needed. We propose a modification to PBS to store additional information about path expressions in the resource table: a column for the root_node, which indicates the root node associated with the given path and resource. This enhanced path-based storage, or EPBS, would contain a new rdf_resource_en table as follows:
4 rdf_resource_en resourcename pathid root_node 'r1' 1 'r1' 'r2' 4 'r1' 'r3' 4 'r1' 'r4' 1 'r4' 'r5' 6 'r4' 'Picasso' 2 'r1' 'Pablo' 3 'r1' 'August' 2 'r4' 'Rodin' 3 'r4' 'Guernica' 5 'r1' 'Les Dem...' 5 'r1' 'The Thinker' 7 'r4' Problem Statement Given: A set of RDF triples; the vertical partitioning storage model proposed by Abadi, et al (2007); and the path-based storage model proposed by Matono, et al (2005). To find: Query plans for the various categories of queries under these two storage schemes. (Query plans indicate the number of joins and the I/O cost) Objective: To determine query types that perform comparatively better or worse in the path-based approach or in vertical partitioning. Constraint: To find the optimum storage method irrespective of the category of queries Assumptions There are no cycles in the RDF schema (this assumption was also made by the authors of the pathbased storage model). Also, inserts are performed far less frequently than select operations, and will not be considered. The RDF data (figure B) used will follow that in Matono, et al (2005).
5 Figure B: RDF Schema, showing RDF class information as well as instance data Proposed Approach Our approach is to use both experimental and analytical evaluation on the two RDF storage schemes. Executed the benchmark queries in Oracle and C-Store. Oracle was used to implement the path-based storage, and C-Store was used for vertical partitioning. The sample data and schema used are based on the schema used by Matono, et al (2005), in the figure below. SQL statements for queries and joins 1. Find the title of anything painted by anyone. Path-based approach: SELECT r.resourcename FROM path p, rdf_resource r WHERE p.pathid = r.pathid AND p.pathexp = '#title<#paints';
6 SELECT t.object FROM title t, paints p WHERE p.object = t.subject 2. Find the descendants of a given class, e.g. find all types of artists. Path-based approach: SELECT a.classname FROM class a WHERE a.pre > (SELECT b.pre FROM class b WHERE b.classname='artist') AND a.post < (SELECT c.post FROM class c WHERE c.classname='artist'); This cannot be done because the class hierarchy information is not stored. 3. List all the properties stored in this database Path-based approach: SELECT propertyname FROM property This requires a consistent naming pattern for property tables, e.g. prefixing all property tables with property:. SELECT tablenames FROM system_schema WHERE tablename like 'property:%' 4. Select all information about the node R4 This is not possible in path-based schema without resorting many self-joins on the triple table. Using the enhanced path-based schema, the following query can be issued: SELECT r.resourcename, p.pathexp FROM rdf_resource_en r, path p WHERE r.pathid = p.pathid AND r.root_resource = 'r4'; This query is similarly complex in the vertically partitioned scheme because we do not know the properties in advance that could be applied to the node R4. Therefore we must query all possible property tables. If RDF schema information is available, only the requisite tables would need to be queried. 5. Select all possible property values of sculptor Path-based approach:
7 SELECT propertyname FROM property WHERE domain='sculptor' This is not possible because the vertically partitioned model does not store schema-level information. 6. Give me all the titles of work by artists. Path-based approach: SELECT resourcename FROM rdfresource r, path p WHERE r.pathid=p.pathid AND p.pathexp lke '%title%' SELECT object FROM title; 7. Find the names of all sculptors. This is not possible in the path-based approach without resorting to multiple self-joins on the triple table. SELECT f.object, l.object FROM first f, last l, type t WHERE t.subject = f.subject AND f.subject = l.subject AND t.object = 'Sculptor' 8. List all the objects of type artifact. In the path-based approach this query would degenerate to querying the triple table because the pathid does not distinguish between a sculptor and a painter. SELECT subject FROM type WHERE object = 'Artifact'; 9. Diameter query for node R4. Select all nodes in the graph within one edge-length of R4. Using the enhanced path-based schema:
8 SELECT r.resourcename, p.pathexp FROM rdf_resource_en r, path p WHERE r.pathid = p.pathid AND r.root_resource = 'r4' AND p.pathexp NOT LIKE '%<%'; 10. Connectivity query: determine if any two arbitrary nodes are connected. In the artist tables example, it is possible to perform this query using only one join in EPBS. In the more generic case, where root nodes are not well-known, it is only possible to answer a connectivity query using a recursive join. If the graph is well-known in advance and the connections are materialized in the tables, the precise number of joins can be determined before query execution. Here we suggest a query for determining if 'Picasso' is connected to 'Guernica': SELECT 'Connection Exists' FROM rdf_resource_en r, rdf_resource_en r2 WHERE r.root_resource = r2.root_resource AND r.resourcename='picasso' and r2.resourcename='guernica' If all the paths connected to the main node are materialized and stored, all types of connection and diameter queries can be answered with one join. The main node is the node that has '' stored against the pathexp field of the path table. In vertical partitioning, this query can be answered only by joining across all tables. Given N tables in the schema, this would require N^N joins. Validation We executed the path-based model for query 1 using Oracle's query plan generator. The result is as follows: PLAN_TABLE_OUTPUT SELECT STATEMENT (15) 00:00:01 * 1 HASH JOIN (15) 00:00:01 * 2 TABLE ACCESS FULL PATH (0) 00:00:01 3 TABLE ACCESS FULL RDF_RESOURCE (0) 00:00:01 Similar query plans will be generated for all eight query types, in both Oracle (for path-based storage) and C-Store (for vertical partitioning). In certain cases, such as the path-based query for query 7, the result degenerates into many self-joins on the triple table. In such cases, we will not run a query plan for the query, as the result is obvious. Conclusions and Future Work In general, the path-based storage is more effective for retrieval of RDF schema information, including
9 queries about class inheritance or relationships between properties and classes. Queries 2, 3, and 5 are expected to perform better under the path-based storage. The vertically-partitioned approach, especially when implemented in a column oriented database such as C-Store, tends to perform better on queries regarding instance data. Queries 6, 7, and 8 would perform better for vertically partitioned databases. In query 1, both approaches would require a single join; without further analysis of disk I/O costs it is not possible to determine which approach would be better for this type of query. Some queries types are very expensive and inefficient in both storage schemes. Query 4 requires many joins in the path-based approach. In vertical partitioning, this query requires knowledge not inherently stored in the table structure, and we would need a higher-level interface to translate schema-level information into the appropriate SQL joins; otherwise we resort to joins across all tables. If we add level information or an additional identifier to the path table, we can answer more queries about instance level information, such as, find the artist who created this painting. References [1] D. J. Abadi, A. Marcus, S. Madden, K. Hollenbach. Scalable Semantic Web Data Management Using Vertical Partitioning. VLDB 07, September 23-28, [2] N. Alexander, X. Lopez, S. Ravada, S. Stephens, J. Wang. RDF Data Model in Oracle. w3d_rdf_data_model.pdf. Oracle Corporation. Last visited September [3] D. Beckett. SWAD Europe Deliverable 10.1: Scalability and Storage: Survey of Free Software / Open Source RDF Storage Systems. W3C Semantic Web Advanced Development for Europe (SWAD-Europe). July 31, rdf_scalable_storage_report/. Last visited September [4] D. Beckett, J. Grant. SWAD Europe Deliverable 10.2: Mapping Semantic Web Data with RDMSes. W3C Semantic Web Advanced Development for Europe (SWAD-Europe). January 31, Last visited September [5] T. Berners-Lee, J. Hendler, O. Lassila. The Semantic Web. Scientific American. May 17, [6] J. Broekstra, A. Kampman, F. van Harmelen. Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema. I. Horrocks and J. Hendler (Eds.): ISWC 2002, LNCS 2342, pp , [7] Jena A Semantic Web Framework for Java. Last accessed September [8] A. Matono, T. Amagasa, M. Yoshikawa, S. Uemura. A Path-based Relational RDF Database. 16 th Australasian Database Conference, January 31 February 3, Conferences in Research and Practice in Information Technology, Vol. 39, [9] openrdf.org, home of Sesame. Last accessed September [10] J. C. Wang. Oracle Spatial Network Data Model. Oracle Corporation Technical White Paper.
10 December g_network_model_twp.pdf. Last visited September [11] K. Wilkinson, C. Sayers, H. Kuno, D. Reynolds. Efficient RDF Storage and Retrieval in Jena2. Hewlett-Packard Technical Report, HPL December 16, Last visited September [12] [13]
Efficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked Data Marcin Wylot 1 Motivation and objectives of the research The proliferation of heterogeneous Linked Data on the Web requires data management
More informationBenchmarking RDF Production Tools
Benchmarking RDF Production Tools Martin Svihla and Ivan Jelinek Czech Technical University in Prague, Karlovo namesti 13, Praha 2, Czech republic, {svihlm1, jelinek}@fel.cvut.cz, WWW home page: http://webing.felk.cvut.cz
More informationScalable Semantic Web Data Management Using Vertical Partitioning
Scalable Semantic Web Data Management Using Vertical Partitioning Daniel J. Abadi MIT dna@csail.mit.edu Adam Marcus MIT marcua@csail.mit.edu Samuel R. Madden MIT madden@csail.mit.edu Kate Hollenbach MIT
More informationKKU Engineering Journal
KKU ENGINEERING JOURNAL October December 2014;41(4):537-545 Research Article KKU Engineering Journal http://www.en.kku.ac.th/enjournal/th/ Design and evaluation of a NoSQL database for storing and querying
More informationOSDBQ: Ontology Supported RDBMS Querying
OSDBQ: Ontology Supported RDBMS Querying Cihan Aksoy 1, Erdem Alparslan 1, Selçuk Bozdağ 2, İhsan Çulhacı 3, 1 The Scientific and Technological Research Council of Turkey, Gebze/Kocaeli, Turkey 2 Komtaş
More informationSemantic Web Technologies
1/57 Introduction and RDF Jos de Bruijn debruijn@inf.unibz.it KRDB Research Group Free University of Bolzano, Italy 3 October 2007 2/57 Outline Organization Semantic Web Limitations of the Web Machine-processable
More informationUsing RDF to Model the Structure and Process of Systems
Using RDF to Model the Structure and Process of Systems Marko A. Rodriguez Jennifer H. Watkins Johan Bollen Los Alamos National Laboratory {marko,jhw,jbollen}@lanl.gov Carlos Gershenson New England Complex
More informationEfficient Linked-List RDF Indexing in Parliament
Efficient Linked-List RDF Indexing in Parliament Dave Kolas, Ian Emmons, and Mike Dean BBN Technologies, Arlington, VA 22209, USA {dkolas,iemmons,mdean}@bbn.com Abstract. As the number and scale of Semantic
More informationAn RDF Storage and Query Framework with Flexible Inference Strategy
An RDF Storage and Query Framework with Flexible Inference Strategy Wennan Shen and Yuzhong Qu Department of Computer Science and Engineering, Southeast University, Nanjing 210096, P.R. China {wnshen,
More informationFedX: A Federation Layer for Distributed Query Processing on Linked Open Data
FedX: A Federation Layer for Distributed Query Processing on Linked Open Data Andreas Schwarte 1, Peter Haase 1,KatjaHose 2, Ralf Schenkel 2, and Michael Schmidt 1 1 fluid Operations AG, Walldorf, Germany
More informationA Tool for Storing OWL Using Database Technology
A Tool for Storing OWL Using Database Technology Maria del Mar Roldan-Garcia and Jose F. Aldana-Montes University of Malaga, Computer Languages and Computing Science Department Malaga 29071, Spain, (mmar,jfam)@lcc.uma.es,
More informationBenchmarking Database Representations of RDF/S Stores
Benchmarking Database Representations of RDF/S Stores Yannis Theoharis 1, Vassilis Christophides 1, Grigoris Karvounarakis 2 1 Computer Science Department, University of Crete and Institute of Computer
More informationColumn-Stores vs. Row-Stores: How Different Are They Really?
Column-Stores vs. Row-Stores: How Different Are They Really? Daniel Abadi, Samuel Madden, Nabil Hachem Presented by Guozhang Wang November 18 th, 2008 Several slides are from Daniel Abadi and Michael Stonebraker
More informationSWAD-Europe Deliverable 3.18: RDF Query Standardisation
SWAD-Europe Deliverable 3.18: RDF Query Standardisation Project name: Semantic Web Advanced Development for Europe (SWAD-Europe) Project Number: IST-2001-34732 Workpackage name: 3 Dissemination and Exploitation
More informationAnswering Aggregate Queries Over Large RDF Graphs
1 Answering Aggregate Queries Over Large RDF Graphs Lei Zou, Peking University Ruizhe Huang, Peking University Lei Chen, Hong Kong University of Science and Technology M. Tamer Özsu, University of Waterloo
More informationThe SPARQL Query Graph Model for Query Optimization
The SPARQL Query Graph Model for Query Optimization Olaf Hartig and Ralf Heese Humboldt-Universität zu Berlin Department of Computer Science (hartig rheese)@informatik.hu-berlin.de Abstract. The Semantic
More informationColumn Stores vs. Row Stores How Different Are They Really?
Column Stores vs. Row Stores How Different Are They Really? Daniel J. Abadi (Yale) Samuel R. Madden (MIT) Nabil Hachem (AvantGarde) Presented By : Kanika Nagpal OUTLINE Introduction Motivation Background
More informationSurvey of RDF Storage Managers
Survey of RDF Storage Managers Kiyoshi Nitta Yahoo JAPAN Research Tokyo, Japan knitta@yahoo-corp.jp Iztok Savnik University of Primorska & Institute Jozef Stefan, Slovenia iztok.savnik@upr.si Abstract
More informationIndex Tuning. Index. An index is a data structure that supports efficient access to data. Matching records. Condition on attribute value
Index Tuning AOBD07/08 Index An index is a data structure that supports efficient access to data Condition on attribute value index Set of Records Matching records (search key) 1 Performance Issues Type
More informationAn overview of RDB2RDF techniques and tools
An overview of RDB2RDF techniques and tools DERI Reading Group Presentation Nuno Lopes August 26, 2009 Main purpose of RDB2RDF WG... standardize a language for mapping Relational Database schemas into
More informationBenchmarking Database Representations of RDF/S Stores
Benchmarking Database Representations of RDF/S Stores Yannis Theoharis 1,2, Vassilis Christophides 1,2, and Grigoris Karvounarakis 3 1 Institute of Computer Science, FORTH, Vassilika Vouton, P.O.Box 1385,
More informationWeb Document Compaction by Compressing URI references in RDF and OWL Data
Web Document Compaction by Compressing URI references in RDF and OWL Data Kisung Lee Telematics USN Research Division Electronics and Telecommunications Research Institute 138, Gajeongno, Yuseong-gu Daejeon,
More informationThe case for generating URIs by hashing RDF content
The case for generating URIs by hashing RDF content Craig Sayers, Kave Eshghi Intelligent Enterprise Technology Laboratory HP Laboratories Palo Alto HPL-2002-216 August 22 nd, 2002* RDF, URI, hash, semantic
More informationRStar: An RDF Storage and Query System for Enterprise Resource Management
RStar: An RDF Storage and Query System for Enterprise Resource Management Li Ma, Zhong Su, Yue Pan, Li Zhang, Tao Liu IBM China Research Laboratory, No. 7, 5th Street, ShangDi, Beijing, 100085, P.R. China
More informationOntology Modeling and Storage System for Robot Context Understanding
Ontology Modeling and Storage System for Robot Context Understanding Eric Wang 1, Yong Se Kim 1, Hak Soo Kim 2, Jin Hyun Son 2, Sanghoon Lee 3, and Il Hong Suh 3 1 Creative Design and Intelligent Tutoring
More informationToday: RDF syntax. + conjunctive queries for OWL. KR4SW Winter 2010 Pascal Hitzler 3
Today: RDF syntax + conjunctive queries for OWL KR4SW Winter 2010 Pascal Hitzler 3 Today s Session: RDF Schema 1. Motivation 2. Classes and Class Hierarchies 3. Properties and Property Hierarchies 4. Property
More informationOptimal Query Processing in Semantic Web using Cloud Computing
Optimal Query Processing in Semantic Web using Cloud Computing S. Joan Jebamani 1, K. Padmaveni 2 1 Department of Computer Science and engineering, Hindustan University, Chennai, Tamil Nadu, India joanjebamani1087@gmail.com
More informationEvent Stores (I) [Source: DB-Engines.com, accessed on August 28, 2016]
Event Stores (I) Event stores are database management systems implementing the concept of event sourcing. They keep all state changing events for an object together with a timestamp, thereby creating a
More informationSemantic web. Tapas Kumar Mishra 11CS60R32
Semantic web Tapas Kumar Mishra 11CS60R32 1 Agenda Introduction What is semantic web Issues with traditional web search The Technology Stack Architecture of semantic web Meta Data Main Tasks Knowledge
More informationAccessing Relational Data with RDF Queries and Assertions
Accessing Relational Data with RDF Queries and Assertions Dmitry Borodaenko angdraug@debian.org Abstract. This paper presents a hybrid RDF storage model that combines relational data with arbitrary RDF
More informationThe Semantic Planetary Data System
The Semantic Planetary Data System J. Steven Hughes 1, Daniel J. Crichton 1, Sean Kelly 1, and Chris Mattmann 1 1 Jet Propulsion Laboratory 4800 Oak Grove Drive Pasadena, CA 91109 USA {steve.hughes, dan.crichton,
More informationCHAPTER 1 INTRODUCTION
1 CHAPTER 1 INTRODUCTION Most of today s Web content is intended for the use of humans rather than machines. While searching documents on the Web using computers, human interpretation is required before
More informationAn Indexing Scheme for a Scalable RDF Triple Store
An Indexing Scheme for a Scalable RDF Triple Store David Makepeace 1, David Wood 1, Paul Gearon 1, Tate Jones 1 and Tom Adams 1 1 Tucana Technologies, Inc., 11710 Plaza America Drive, Reston, VA, 20147,
More informationSemantics-Aware Querying of Web-Distributed RDF(S) Repositories
Semantics-Aware Querying of Web-Distributed RDF(S) Repositories Georgia D. Solomou, Dimitrios A. Koutsomitropoulos, Theodore S. Papatheodorou High Performance Systems Laboratory, School of Engineering
More informationCSE 544 Principles of Database Management Systems
CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 5 - DBMS Architecture and Indexing 1 Announcements HW1 is due next Thursday How is it going? Projects: Proposals are due
More informationPASSWORDS TREES AND HIERARCHIES. CS121: Relational Databases Fall 2017 Lecture 24
PASSWORDS TREES AND HIERARCHIES CS121: Relational Databases Fall 2017 Lecture 24 Account Password Management 2 Mentioned a retailer with an online website Need a database to store user account details
More informationFalcon-AO: Aligning Ontologies with Falcon
Falcon-AO: Aligning Ontologies with Falcon Ningsheng Jian, Wei Hu, Gong Cheng, Yuzhong Qu Department of Computer Science and Engineering Southeast University Nanjing 210096, P. R. China {nsjian, whu, gcheng,
More informationThe ρ Operator: Discovering and Ranking Associations on the Semantic Web
The ρ Operator: Discovering and Ranking Associations on the Semantic Web Kemafor Anyanwu and Amit Sheth Large Scale Distributed Information Systems Lab Department of Computer Science, University of Georgia
More informationFinding Similarity and Comparability from Merged Hetero Data of the Semantic Web by Using Graph Pattern Matching
Finding Similarity and Comparability from Merged Hetero Data of the Semantic Web by Using Graph Pattern Matching Hiroyuki Sato, Kyoji Iiduka, Takeya Mukaigaito, and Takahiko Murayama Information Sharing
More informationOrchestrating Music Queries via the Semantic Web
Orchestrating Music Queries via the Semantic Web Milos Vukicevic, John Galletly American University in Bulgaria Blagoevgrad 2700 Bulgaria +359 73 888 466 milossmi@gmail.com, jgalletly@aubg.bg Abstract
More information1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.
1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Integrating Complex Financial Workflows in Oracle Database Xavier Lopez Seamus Hayes Oracle PolarLake, LTD 2 Copyright 2011, Oracle
More informationAdministração e Optimização de Bases de Dados 2012/2013 Index Tuning
Administração e Optimização de Bases de Dados 2012/2013 Index Tuning Bruno Martins DEI@Técnico e DMIR@INESC-ID Index An index is a data structure that supports efficient access to data Condition on Index
More informationContents. G52IWS: The Semantic Web. The Semantic Web. Semantic web elements. Semantic Web technologies. Semantic Web Services
Contents G52IWS: The Semantic Web Chris Greenhalgh 2007-11-10 Introduction to the Semantic Web Semantic Web technologies Overview RDF OWL Semantic Web Services Concluding comments 1 See Developing Semantic
More informationNew Approach to Graph Databases
Paper PP05 New Approach to Graph Databases Anna Berg, Capish, Malmö, Sweden Henrik Drews, Capish, Malmö, Sweden Catharina Dahlbo, Capish, Malmö, Sweden ABSTRACT Graph databases have, during the past few
More informationAn Efficient Approach to Triple Search and Join of HDT Processing Using GPU
An Efficient Approach to Triple Search and Join of HDT Processing Using GPU YoonKyung Kim, YoonJoon Lee Computer Science KAIST Daejeon, South Korea e-mail: {ykkim, yjlee}@dbserver.kaist.ac.kr JaeHwan Lee
More informationSPARQL Back-end for Contextual Logic Agents
SPARQL Back-end for Contextual Logic Agents Cláudio Fernandes and Salvador Abreu Universidade de Évora Abstract. XPTO is a contextual logic system that can represent and query OWL ontologies from a contextual
More informationDesign and Implementation of an RDF Triple Store
Design and Implementation of an RDF Triple Store Ching-Long Yeh and Ruei-Feng Lin Department of Computer Science and Engineering Tatung University 40 Chungshan N. Rd., Sec. 3 Taipei, 04 Taiwan E-mail:
More informationCOMPUTER AND INFORMATION SCIENCE JENA DB. Group Abhishek Kumar Harshvardhan Singh Abhisek Mohanty Suhas Tumkur Chandrashekhara
JENA DB Group - 10 Abhishek Kumar Harshvardhan Singh Abhisek Mohanty Suhas Tumkur Chandrashekhara OUTLINE Introduction Data Model Query Language Implementation Features Applications Introduction Open Source
More informationAn RDF NetAPI. Andy Seaborne. Hewlett-Packard Laboratories, Bristol
An RDF NetAPI Andy Seaborne Hewlett-Packard Laboratories, Bristol andy_seaborne@hp.com Abstract. This paper describes some initial work on a NetAPI for accessing and updating RDF data over the web. The
More informationRIBStore: Data Management of RDF triples with RDFS Inferences
RIBStore: Data Management of RDF triples with RDFS Inferences Olivier Curé, David Célestin Faye, Guillaume Blin To cite this version: Olivier Curé, David Célestin Faye, Guillaume Blin. RIBStore: Data Management
More informationOracle Spatial and Graph: Benchmarking a Trillion Edges RDF Graph ORACLE WHITE PAPER NOVEMBER 2016
Oracle Spatial and Graph: Benchmarking a Trillion Edges RDF Graph ORACLE WHITE PAPER NOVEMBER 2016 Introduction One trillion is a really big number. What could you store with one trillion facts?» 1000
More informationTRIE BASED METHODS FOR STRING SIMILARTIY JOINS
TRIE BASED METHODS FOR STRING SIMILARTIY JOINS Venkat Charan Varma Buddharaju #10498995 Department of Computer and Information Science University of MIssissippi ENGR-654 INFORMATION SYSTEM PRINCIPLES RESEARCH
More informationComprehensive Structured Context Profiles (CSCP): Design and Experiences
Comprehensive Structured Context Profiles (CSCP): Design and Experiences Sven Buchholz, Thomas Hamann, and Gerald Hübsch Department of Computer Science, Dresden University of Technology {buchholz, hamann,
More informationDatabasesystemer, forår 2005 IT Universitetet i København. Forelæsning 8: Database effektivitet. 31. marts Forelæser: Rasmus Pagh
Databasesystemer, forår 2005 IT Universitetet i København Forelæsning 8: Database effektivitet. 31. marts 2005 Forelæser: Rasmus Pagh Today s lecture Database efficiency Indexing Schema tuning 1 Database
More informationDATABASE PERFORMANCE AND INDEXES. CS121: Relational Databases Fall 2017 Lecture 11
DATABASE PERFORMANCE AND INDEXES CS121: Relational Databases Fall 2017 Lecture 11 Database Performance 2 Many situations where query performance needs to be improved e.g. as data size grows, query performance
More informationRAPID: Enabling Scalable Ad-Hoc Analytics on the Semantic Web
RAPID: Enabling Scalable Ad-Hoc Analytics on the Semantic Web Radhika Sridhar, Padmashree Ravindra, Kemafor Anyanwu North Carolina State University {rsridha, pravind2, kogan}@ncsu.edu Abstract. As the
More informationArchitecting Object Applications for High Performance with Relational Databases
Architecting Object Applications for High Performance with Relational Databases Shailesh Agarwal 1 Christopher Keene 2 Arthur M. Keller 3 1.0 Abstract This paper presents an approach for architecting OO
More informationTrOWL: Tractable OWL 2 Reasoning Infrastructure
TrOWL: Tractable OWL 2 Reasoning Infrastructure Edward Thomas, Jeff Z. Pan, and Yuan Ren Department of Computing Science, University of Aberdeen, Aberdeen AB24 3UE, UK Abstract. The Semantic Web movement
More informationA General Approach to Query the Web of Data
A General Approach to Query the Web of Data Xin Liu 1 Department of Information Science and Engineering, University of Trento, Trento, Italy liu@disi.unitn.it Abstract. With the development of the Semantic
More informationAnnouncement. Reading Material. Overview of Query Evaluation. Overview of Query Evaluation. Overview of Query Evaluation 9/26/17
Announcement CompSci 516 Database Systems Lecture 10 Query Evaluation and Join Algorithms Project proposal pdf due on sakai by 5 pm, tomorrow, Thursday 09/27 One per group by any member Instructor: Sudeepa
More informationSemantic Exploitation of Engineering Models: An Application to Oilfield Models
Semantic Exploitation of Engineering Models: An Application to Oilfield Models Laura Silveira Mastella 1,YamineAït-Ameur 2,Stéphane Jean 2, Michel Perrin 1, and Jean-François Rainaud 3 1 Ecole des Mines
More informationOntology Development Tools and Languages: A Review
Ontology Development Tools and Languages: A Review Parveen 1, Dheeraj Kumar Sahni 2, Dhiraj Khurana 3, Rainu Nandal 4 1,2 M.Tech. (CSE), UIET, MDU, Rohtak, Haryana 3,4 Asst. Professor, UIET, MDU, Rohtak,
More informationSemantic Web Knowledge Representation in the Web Context. CS 431 March 24, 2008 Carl Lagoze Cornell University
Semantic Web Knowledge Representation in the Web Context CS 431 March 24, 2008 Carl Lagoze Cornell University Acknowledgements for various slides and ideas Ian Horrocks (Manchester U.K.) Eric Miller (W3C)
More informationXML Perspectives on RDF Querying: Towards integrated Access to Data and Metadata on the Web
XML Perspectives on RDF Querying: Towards integrated Access to Data and Metadata on the Web Tim Furche, François Bry, Oliver Bolzer Institute for Informatics, University of Munich http://www.pms.ifi.lmu.de
More informationEfficient Optimization of Sparql Basic Graph Pattern
Efficient Optimization of Sparql Basic Graph Pattern Ms.M.Manju 1, Mrs. R Gomathi 2 PG Scholar, Department of CSE, Bannari Amman Institute of Technology, Sathyamangalam, Tamilnadu, India 1 Associate Professor/Senior
More informationGRIN: A Graph Based RDF Index
GRIN: A Graph Based RDF Index Octavian Udrea University of Maryland College Park, MD 20742 Andrea Pugliese University of Calabria DEIS department Via P. Bucci, Rende, Italy V.S. Subrahmanian University
More informationCWI. Multimedia on the Semantic Web. Jacco van Ossenbruggen, Lynda Hardman, Frank Nack. Multimedia and Human-Computer Interaction CWI, Amsterdam
Multimedia on the Semantic Web Jacco van Ossenbruggen, Lynda Hardman, Frank Nack Multimedia and Human-Computer Interaction, Amsterdam Short history of the Web in three generations (see thesis for long
More informationPeer-to-Peer Systems. Chapter General Characteristics
Chapter 2 Peer-to-Peer Systems Abstract In this chapter, a basic overview is given of P2P systems, architectures, and search strategies in P2P systems. More specific concepts that are outlined include
More informationRekayasa Perangkat Lunak 2 (IN043): Pertemuan 8. Data Management Layer Design
Rekayasa Perangkat Lunak 2 (IN043): Pertemuan 8 Data Management Layer Design Data Management Layer Focus on how to manage data are stored that can be handled by the programs that run the system, including:
More informationSemantics. Matthew J. Graham CACR. Methods of Computational Science Caltech, 2011 May 10. matthew graham
Semantics Matthew J. Graham CACR Methods of Computational Science Caltech, 2011 May 10 semantic web The future of the Internet (Web 3.0) Decentralized platform for distributed knowledge A web of databases
More informationColumn-Stores vs. Row-Stores: How Different Are They Really?
Column-Stores vs. Row-Stores: How Different Are They Really? Daniel J. Abadi, Samuel Madden and Nabil Hachem SIGMOD 2008 Presented by: Souvik Pal Subhro Bhattacharyya Department of Computer Science Indian
More informationCHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS
82 CHAPTER 5 SEARCH ENGINE USING SEMANTIC CONCEPTS In recent years, everybody is in thirst of getting information from the internet. Search engines are used to fulfill the need of them. Even though the
More informationGraph Analytics in the Big Data Era
Graph Analytics in the Big Data Era Yongming Luo, dr. George H.L. Fletcher Web Engineering Group What is really hot? 19-11-2013 PAGE 1 An old/new data model graph data Model entities and relations between
More information7 Analysis of experiments
Natural Language Addressing 191 7 Analysis of experiments Abstract In this research we have provided series of experiments to identify any trends, relationships and patterns in connection to NL-addressing
More informationRDFMatView: Indexing RDF Data using Materialized SPARQL queries
RDFMatView: Indexing RDF Data using Materialized SPARQL queries Roger Castillo, Christian Rothe, and Ulf Leser Humboldt University of Berlin {castillo,rothe,leser}@informatik.hu-berlin.de http://www.hu-berlin.de/
More informationHYRISE In-Memory Storage Engine
HYRISE In-Memory Storage Engine Martin Grund 1, Jens Krueger 1, Philippe Cudre-Mauroux 3, Samuel Madden 2 Alexander Zeier 1, Hasso Plattner 1 1 Hasso-Plattner-Institute, Germany 2 MIT CSAIL, USA 3 University
More informationChapter 12: Indexing and Hashing. Basic Concepts
Chapter 12: Indexing and Hashing! Basic Concepts! Ordered Indices! B+-Tree Index Files! B-Tree Index Files! Static Hashing! Dynamic Hashing! Comparison of Ordered Indexing and Hashing! Index Definition
More informationBackward inference and pruning for RDF change detection using RDBMS
Article Backward inference and pruning for RDF change detection using RDBMS Journal of Information Science 39(2) 238 255 Ó The Author(s) 2012 Reprints and permission: sagepub. co.uk/journalspermissions.nav
More informationOBJECTIVES. How to derive a set of relations from a conceptual data model. How to validate these relations using the technique of normalization.
7.5 逻辑数据库设计 OBJECTIVES How to derive a set of relations from a conceptual data model. How to validate these relations using the technique of normalization. 2 OBJECTIVES How to validate a logical data model
More informationScalability Report on Triple Store Applications
Scalability Report on Triple Store Applications Ryan Lee July 14, 2004 ryanlee@w3.org http://simile.mit.edu/ Abstract This report examines a set of open source triple store systems suitable for The SIMILE
More informationChapter 12: Indexing and Hashing
Chapter 12: Indexing and Hashing Basic Concepts Ordered Indices B+-Tree Index Files B-Tree Index Files Static Hashing Dynamic Hashing Comparison of Ordered Indexing and Hashing Index Definition in SQL
More informationSemantic Web Fundamentals
Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2018/19 with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz January 7 th 2019 Overview What is Semantic Web? Technology
More informationSEPA SPARQL Event Processing Architecture
SEPA SPARQL Event Processing Architecture Enabling distributed, context aware and interoperable Dynamic Linked Data and Web of Things applications Luca Roffia (luca.roffia@unibo.it) Web of Things: members
More informationApproach for Mapping Ontologies to Relational Databases
Approach for Mapping Ontologies to Relational Databases A. Rozeva Technical University Sofia E-mail: arozeva@tu-sofia.bg INTRODUCTION Research field mapping ontologies to databases Research goal facilitation
More informationTowards a better insight of RDF triples Ontology-guided Storage system abilities
Towards a better insight of RDF triples Ontology-guided Storage system abilities Olivier Curé 1, David Faye 1,2, Guillaume Blin 1 1 Université Paris-Est, LIGM - UMR CNRS 8049, France {ocure, gblin}@univ-mlv.fr
More informationStriped Grid Files: An Alternative for Highdimensional
Striped Grid Files: An Alternative for Highdimensional Indexing Thanet Praneenararat 1, Vorapong Suppakitpaisarn 2, Sunchai Pitakchonlasap 1, and Jaruloj Chongstitvatana 1 Department of Mathematics 1,
More informationTowards the Semantic Desktop. Dr. Øyvind Hanssen University Library of Tromsø
Towards the Semantic Desktop Dr. Øyvind Hanssen University Library of Tromsø Agenda Background Enabling trends and technologies Desktop computing and The Semantic Web Online Social Networking and P2P Computing
More informationDB2 NoSQL Graph Store
DB2 NoSQL Graph Store Mario Briggs mario.briggs@in.ibm.com December 13, 2012 Agenda Introduction Some Trends: NoSQL Data Normalization Evolution Hybrid Data Comparing Relational, XML and RDF RDF Introduction
More informationScaling the Semantic Wall with AllegroGraph and TopBraid Composer. A Joint Webinar by TopQuadrant and Franz
Scaling the Semantic Wall with AllegroGraph and TopBraid Composer A Joint Webinar by TopQuadrant and Franz Dean Allemang Chief Scientist, TopQuadrant Inc. Jans Aasman CTO, Franz Inc. July 07 1 This Seminar
More informationBinary Encoded Attribute-Pairing Technique for Database Compression
Binary Encoded Attribute-Pairing Technique for Database Compression Akanksha Baid and Swetha Krishnan Computer Sciences Department University of Wisconsin, Madison baid,swetha@cs.wisc.edu Abstract Data
More informationReview on Managing RDF Graph Using MapReduce
Review on Managing RDF Graph Using MapReduce 1 Hetal K. Makavana, 2 Prof. Ashutosh A. Abhangi 1 M.E. Computer Engineering, 2 Assistant Professor Noble Group of Institutions Junagadh, India Abstract solution
More informationWITH continuous advances in the network and storage
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 22, NO. 2, FEBRUARY 2010 219 Exploring Correlated Subspaces for Efficient Query Processing in Sparse Databases Bin Cui, Senior Member, IEEE, Jiakui
More informationSemantic Web Fundamentals
Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2017/18 Vedran Sabol with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz December 11 th 2017 Overview What is Semantic
More informationIndex Support for SPARQL
Index Support for SPARQL Ralf Heese, Ulf Leser, Bastian Quilitz, and Christian Rothe Humboldt-Universität zu Berlin Department of Computer Science (rheese leser quilitz rothe)@informatik.hu-berlin.de Abstract.
More informationSemantic Web Systems Querying Jacques Fleuriot School of Informatics
Semantic Web Systems Querying Jacques Fleuriot School of Informatics 5 th February 2015 In the previous lecture l Serialising RDF in XML RDF Triples with literal Object edstaff:9888 foaf:name Ewan Klein.
More informationQuadrant-Based MBR-Tree Indexing Technique for Range Query Over HBase
Quadrant-Based MBR-Tree Indexing Technique for Range Query Over HBase Bumjoon Jo and Sungwon Jung (&) Department of Computer Science and Engineering, Sogang University, 35 Baekbeom-ro, Mapo-gu, Seoul 04107,
More informationGrid Resources Search Engine based on Ontology
based on Ontology 12 E-mail: emiao_beyond@163.com Yang Li 3 E-mail: miipl606@163.com Weiguang Xu E-mail: miipl606@163.com Jiabao Wang E-mail: miipl606@163.com Lei Song E-mail: songlei@nudt.edu.cn Jiang
More informationGraph Databases. Guilherme Fetter Damasio. University of Ontario Institute of Technology and IBM Centre for Advanced Studies IBM Corporation
Graph Databases Guilherme Fetter Damasio University of Ontario Institute of Technology and IBM Centre for Advanced Studies Outline Introduction Relational Database Graph Database Our Research 2 Introduction
More informationTime Complexity and Parallel Speedup to Compute the Gamma Summarization Matrix
Time Complexity and Parallel Speedup to Compute the Gamma Summarization Matrix Carlos Ordonez, Yiqun Zhang Department of Computer Science, University of Houston, USA Abstract. We study the serial and parallel
More informationImproved Processing of Path Query on RDF Data Using Suffix Array
Journal of Convergence Information Technology Volume 4, Number 3, September 2009 Improved Processing of Path Query on RDF Data Using Suffix Array Corresponding author Sung Wan Kim * Division of Computer,
More information