Center for Semantic Web Research.

Size: px
Start display at page:

Download "Center for Semantic Web Research."

Transcription

1 Center for Semantic Web Research

2 Three institutions PUC, Chile Marcelo Arenas (Boss) SW, data exchange, semistrucured data Juan Reutter SW, graph DBs, DLs Cristian Riveros data exchange, semistr. data Jorge Baier planning, search Carlos Buil SW Univ. of Talca Renzo Angles (SW) Univ. of Chile Pablo Barceló (Deputy) graph DBs, DB theory Claudio Gutierrez SW, graph DBs Jorge Pérez SW, data exchange Aidan Hogan SW, semistr. data Bárbara Poblete web mining, SNA Benjamín Bustos multimedia

3 Alberto Mendelzon Workshop (AMW)

4 AMW 2016 Panama City, 6-10 June, 2016 PC Chairs: Altigran Soares da Silva (UFAM), Reinhard Pichler (TU Wien) Invited speakers: Diego Calvanese (Data-driven verification) Juliana Freire (Urban data) Lise Getoor (Relational statistical learning) Raghu Ramakrishnan (Big data) Long & short submissions (due on Feb. 29th, 2016) AMW School (4 tutorials), 4-5 June, 2016 Very nice environment

5 Query Languages for Graph DBs: Bridging the Grap Between Theory and Practice Pablo Barceló DCC, Universidad de Chile Center for Semantic Web Research (

6 BACKGROUND AND OBJECTIVES

7 Graph databases Trendy applications: Social network analysis Semantic web Scientific databases Software bug localization Geo-routing

8 Graph databases Trendy applications: Social network analysis Semantic web Scientific databases Software bug localization Geo-routing More in general: Wherever connections are as important as data

9 What is a graph database? A data management system that exposes a graph data model. 1 1 Graph databases. Robinson, Webber, & Eifrem. O Reilly, 2013.

10 What is a graph database? A data management system that exposes a graph data model. 1 Several existing graph DB engines and query languages: DEX/Sparksee - basic algebra IBM System G - Gremlin Neo4J - Cypher Oracle PGX - PGQL RDF stores (Virtuoso, AllegroGraph, Oracle, IBM) - SPARQL 1 Graph databases. Robinson, Webber, & Eifrem. O Reilly, 2013.

11 What graph databases are good for? Flexible modelling of interconnected data Agile evolution of the data model Scalable evaluation of join-intensive queries

12 My personal story Since 2009: Working on theory of query languages for graph DBs Since 2015: Working group of LDBC for the design of such language

13 My personal story Since 2009: Working on theory of query languages for graph DBs Since 2015: Working group of LDBC for the design of such language Conclusion: Theory and practice are more connected than expected

14 Objectives Identify topics of common interest for theoreticians and developers Formalize relevant concepts (syntax, semantics, terminology, etc) Understand tradeoff expressiveness/efficiency

15 THE DATA MODEL: PROPERTY GRAPHS

16 The data model is important as it must be: Flexible enough to accomodate scenarios of practical interest Simple enough to allow for a clean presentation Expressive enough for theoretical issues to appear in full force

17 The data model is important as it must be: Flexible enough to accomodate scenarios of practical interest Simple enough to allow for a clean presentation Expressive enough for theoretical issues to appear in full force This is accomplished by the model of property graphs

18 A property graph Journal name: JACM journal Article name: jacmhopcroft74 year: 1974 corresponding: YES Author name: Robert Tarjan affil: Princeton Univ. Conference name: FOCS series Venue name: FOCS67 part of Article name: FOCSHopcroft67 year: 1967 Author name: John Hopcroft affil: Cornell Univ. series Venue name: PODS89 part of Article name: PODSUllman89 year: 1989 Author name: Jeff Ullman affil: Stanford Univ. Conference name: PODS series series corresponding: YES Venue name: PODS83 part of Article name: PODSFaginUV83 year: 1983 corresponding: YES Author name: Ron Fagin affil: IBM Almaden Venue name: PODS95 part of part of Article name: PODSVardi95 year: 1995 cooresponding: YES Author name: Moshe Vardi affil: Rice University Article cooresponding: YES Author name: PODSLibkin95 name: Leonid Libkin year: 1995 affil: U. of Edinburgh Journal name: IPL journal Article name: IPLLibkinW95 year: 1995 cooresponding: YES Author name: Limsoon Wong affil: NU Singapore

19 What is a property graph then? It is a graph It is directed It is labeled in nodes and edges Nodes and edges can be attributed

20 What is a property graph then? It is a graph It is directed It is labeled in nodes and edges Nodes and edges can be attributed

21 What is a property graph then? It is a graph It is directed It is labeled in nodes and edges Nodes and edges can be attributed

22 What is a property graph then? It is a graph It is directed It is labeled in nodes and edges Nodes and edges can be attributed

23 What is a property graph then? It is a graph It is directed It is labeled in nodes and edges Nodes and edges can be attributed

24 GRAPH PATTERNS

25 Graph patterns are: The basic unit for querying property graphs

26 Graph patterns are: The basic unit for querying property graphs Definition of graph pattern: A directed graph Nodes are given by variables x,y,z,... Edges are given by variables X,Y,Z,... Nodes and edges satisfy label and attribute constraints (selection) E.g., l(x) = Author, l(y) = & Y@corresponding = YES Some of the variables are selected (projection)

27 Graph patterns are: The basic unit for querying property graphs Definition of graph pattern: A directed graph Nodes are given by variables x,y,z,... Edges are given by variables X,Y,Z,... Nodes and edges satisfy label and attribute constraints (selection) E.g., l(x) = Author, l(y) = & Y@corresponding = YES Some of the variables are selected (projection)

28 Graph patterns are: The basic unit for querying property graphs Definition of graph pattern: A directed graph Nodes are given by variables x,y,z,... Edges are given by variables X,Y,Z,... Nodes and edges satisfy label and attribute constraints (selection) E.g., l(x) = Author, l(y) = & Y@corresponding = YES Some of the variables are selected (projection)

29 Graph patterns are: The basic unit for querying property graphs Definition of graph pattern: A directed graph Nodes are given by variables x,y,z,... Edges are given by variables X,Y,Z,... Nodes and edges satisfy label and attribute constraints (selection) E.g., l(x) = Author, l(y) = & Y@corresponding = YES Some of the variables are selected (projection)

30 Graph patterns are: The basic unit for querying property graphs Definition of graph pattern: A directed graph Nodes are given by variables x,y,z,... Edges are given by variables X,Y,Z,... Nodes and edges satisfy label and attribute constraints (selection) E.g., l(x) = Author, l(y) = & Y@corresponding = YES Some of the variables are selected (projection)

31 Graph patterns are: The basic unit for querying property graphs Definition of graph pattern: A directed graph Nodes are given by variables x,y,z,... Edges are given by variables X,Y,Z,... Nodes and edges satisfy label and attribute constraints (selection) E.g., l(x) = Author, l(y) = & Y@corresponding = YES Some of the variables are selected (projection)

32 Example of a graph pattern Find pairs of authors who coauthored a paper in PODS83: Author Venue name: PODS83 part of Article Author

33 More examples Get the common friends of Peter, John and Mary: Person name: Peter friend of Person friend of Person name: John friend of Person name: Mary

34 More examples Get the common friends of Peter, John and Mary: Person name: Peter friend of Person friend of Person name: John friend of Person name: Mary Find friends of John who are (1) mutual friends, and (2) have lovers that are colleagues Person lover of Person friend of Person name: John friend of works with friend of Person lover of Person

35 Evaluation of graph patterns 1. Find all matchings of the pattern over the property graph 2. Project over the variables in the output

36 Evaluation of graph patterns 1. Find all matchings of the pattern over the property graph 2. Project over the variables in the output

37 Evaluation of graph patterns 1. Find all matchings of the pattern over the property graph 2. Project over the variables in the output

38 An example of evaluation The property graph Journal name: JACM journal Article name: jacmhopcroft74 year: 1974 corresponding: YES Author name: Robert Tarjan affil: Princeton Univ. Conference name: FOCS series Venue name: FOCS67 part of Article name: FOCSHopcroft67 year: 1967 Author name: John Hopcroft affil: Cornell Univ. series Venue name: PODS89 part of Article name: PODSUllman89 year: 1989 Author name: Jeff Ullman affil: Stanford Univ. Conference name: PODS series series corresponding: YES Venue name: PODS83 part of Article name: PODSFaginUV83 year: 1983 corresponding: YES Author name: Ron Fagin affil: IBM Almaden Venue name: PODS95 part of part of Article name: PODSVardi95 year: 1995 cooresponding: YES Author name: Moshe Vardi affil: Rice University Article cooresponding: YES Author name: PODSLibkin95 name: Leonid Libkin year: 1995 affil: U. of Edinburgh Journal name: IPL journal Article name: IPLLibkinW95 year: 1995 cooresponding: YES Author name: Limsoon Wong affil: NU Singapore

39 An example of evaluation The graph pattern Author Venue name: PODS83 part of Article Author

40 An example of evaluation A matching Journal name: JACM journal Article name: jacmhopcroft74 year: 1974 corresponding: YES Author name: Robert Tarjan affil: Princeton Univ. Conference name: FOCS series Venue name: FOCS67 part of Article name: FOCSHopcroft67 year: 1967 Author name: John Hopcroft affil: Cornell Univ. series Venue name: PODS89 part of Article name: PODSUllman89 year: 1989 Author name: Jeff Ullman affil: Stanford Univ. Conference name: PODS series series corresponding: YES Venue name: PODS83 part of Article name: PODSFaginUV83 year: 1983 corresponding: YES Author name: Ron Fagin affil: IBM Almaden Venue name: PODS95 part of part of Article name: PODSVardi95 year: 1995 cooresponding: YES Author name: Moshe Vardi affil: Rice University Article cooresponding: YES Author name: PODSLibkin95 name: Leonid Libkin year: 1995 affil: U. of Edinburgh Journal name: IPL journal Article name: IPLLibkinW95 year: 1995 cooresponding: YES Author name: Limsoon Wong affil: NU Singapore

41 An example of evaluation Its projection Journal name: JACM journal Article name: jacmhopcroft74 year: 1974 corresponding: YES Author name: Robert Tarjan affil: Princeton Univ. Conference name: FOCS series Venue name: FOCS67 part of Article name: FOCSHopcroft67 year: 1967 Author name: John Hopcroft affil: Cornell Univ. series Venue name: PODS89 part of Article name: PODSUllman89 year: 1989 Author name: Jeff Ullman affil: Stanford Univ. Conference name: PODS series series corresponding: YES Venue name: PODS83 part of Article name: PODSFaginUV83 year: 1983 corresponding: YES Author name: Ron Fagin affil: IBM Almaden Venue name: PODS95 part of part of Article name: PODSVardi95 year: 1995 cooresponding: YES Author name: Moshe Vardi affil: Rice University Article cooresponding: YES Author name: PODSLibkin95 name: Leonid Libkin year: 1995 affil: U. of Edinburgh Journal name: IPL journal Article name: IPLLibkinW95 year: 1995 cooresponding: YES Author name: Limsoon Wong affil: NU Singapore

42 An example of evaluation Another matching Journal name: JACM journal Article name: jacmhopcroft74 year: 1974 corresponding: YES Author name: Robert Tarjan affil: Princeton Univ. Conference name: FOCS series Venue name: FOCS67 part of Article name: FOCSHopcroft67 year: 1967 Author name: John Hopcroft affil: Cornell Univ. series Venue name: PODS89 part of Article name: PODSUllman89 year: 1989 Author name: Jeff Ullman affil: Stanford Univ. Conference name: PODS series series corresponding: YES Venue name: PODS83 part of Article name: PODSFaginUV83 year: 1983 corresponding: YES Author name: Ron Fagin affil: IBM Almaden Venue name: PODS95 part of part of Article name: PODSVardi95 year: 1995 cooresponding: YES Author name: Moshe Vardi affil: Rice University Article cooresponding: YES Author name: PODSLibkin95 name: Leonid Libkin year: 1995 affil: U. of Edinburgh Journal name: IPL journal Article name: IPLLibkinW95 year: 1995 cooresponding: YES Author name: Limsoon Wong affil: NU Singapore

43 An example of evaluation Its projection Journal name: JACM journal Article name: jacmhopcroft74 year: 1974 corresponding: YES Author name: Robert Tarjan affil: Princeton Univ. Conference name: FOCS series Venue name: FOCS67 part of Article name: FOCSHopcroft67 year: 1967 Author name: John Hopcroft affil: Cornell Univ. series Venue name: PODS89 part of Article name: PODSUllman89 year: 1989 Author name: Jeff Ullman affil: Stanford Univ. Conference name: PODS series series corresponding: YES Venue name: PODS83 part of Article name: PODSFaginUV83 year: 1983 corresponding: YES Author name: Ron Fagin affil: IBM Almaden Venue name: PODS95 part of part of Article name: PODSVardi95 year: 1995 cooresponding: YES Author name: Moshe Vardi affil: Rice University Article cooresponding: YES Author name: PODSLibkin95 name: Leonid Libkin year: 1995 affil: U. of Edinburgh Journal name: IPL journal Article name: IPLLibkinW95 year: 1995 cooresponding: YES Author name: Limsoon Wong affil: NU Singapore

44 But what is a matching?

45 But what is a matching? A mapping from: nodes of the pattern to nodes of the graph, and edges of the pattern to edges of the graph which preserves the structure of the pattern in the graph

46 Different notions of matching Homomorphism: No restriction on the mapping nodes in the pattern can map to the same node in the graph

47 Different notions of matching Homomorphism: No restriction on the mapping nodes in the pattern can map to the same node in the graph Mostly studied in database theory (PODS)

48 Different notions of matching Isomorphism: Mapping is injective elements in the pattern map to elements in the graph

49 Different notions of matching Isomorphism: Mapping is injective elements in the pattern map to elements in the graph Mostly studied in database systems (SIGMOD)

50 Different notions of matching Edge-injective: Self-describing edges in the pattern map to edges in the graph

51 Different notions of matching Edge-injective: Self-describing edges in the pattern map to edges in the graph Implemented in some graph DB engines (Neo4J)

52 Is there a matching?

53 Is there a matching? An NP-complete problem

54 Is there a matching? An NP-complete problem How to address this problem?

55 Solution 1: Restriction on graph patterns In many applications, graph patterns are tame: Homomorphism/isomorphism can be solved efficiently for them

56 Solution 1: Restriction on graph patterns In many applications, graph patterns are tame: Homomorphism/isomorphism can be solved efficiently for them Tame: The underlying graph is almost acyclic Bounded treewidth (database/graph theory)

57 Solution 2: Heuristics for real-world datasets Structural optimization techniques for reducing search space: Join ordering, prunning, indexes (database systems)

58 Solution 2: Heuristics for real-world datasets Structural optimization techniques for reducing search space: Join ordering, prunning, indexes (database systems) Real databases have structure that can be exploited

59 Solution 3: Inexact evaluation Use weaker forms of matching that can be evaluated efficiently: Bisimulations, approximations (database theory/systems)

60 Solution 3: Inexact evaluation Use weaker forms of matching that can be evaluated efficiently: Bisimulations, approximations (database theory/systems) Compromise the quality of the answer in favor of efficiency

61 Solution 4: Use different notions of complexity Graphs and patterns are different beasts: Graphs are BIG, patterns are small

62 Solution 4: Use different notions of complexity Graphs and patterns are different beasts: Graphs are BIG, patterns are small Assume pattern is fixed (data complexity/database theory): Matching can be solved very efficiently

63 Solution modifiers on graph patterns Relational operations: Union Difference Cartesian product

64 Solution modifiers on graph patterns Relational operations: Union Difference Cartesian product The language becomes relational complete

65 OPTIONAL: An important solution modifier Allows to match parts of the data only if available Important in the context of semistructured data Developed by the RDF community Corresponds to left-outer join in relational algebra

66 An example with OPTIONAL The property graph Paper name: AKV85 year: 1985 Paper name: Saf85 year: 1985 published in published in evaluated in Conference name: STOC85 Review text: "This paper..."

67 An example with OPTIONAL The property graph Paper name: AKV85 year: 1985 Paper name: Saf85 year: 1985 published in published in evaluated in Conference name: STOC85 Review text: "This paper..." The pattern with optional (x,published in,y) OPTIONAL (y,evaluated in,z)

68 An example with OPTIONAL A matching Paper name: AKV85 year: 1985 Paper name: Saf85 year: 1985 published in published in evaluated in Conference name: STOC85 Review text: "This paper..." The pattern with optional (x,published in,y) OPTIONAL (x,evaluated in,z)

69 An example with OPTIONAL Another matching Paper name: AKV85 year: 1985 Paper name: Saf85 year: 1985 published in published in evaluated in Conference name: STOC85 Review text: "This paper..." The pattern with optional (x,published in,y) OPTIONAL (x,evaluated in,z)

70 Conclusion Graph patterns are a versatile and simple language for querying PGs Graph pattern evaluation comes in different flavors This problem is challenging (theory/practice) Different operators can be added in order to increase expressiveness

71 NAVIGATION

72 Graphs are there to be navigated Recall our property graph? Journal name: JACM journal Article name: jacmhopcroft74 year: 1974 corresponding: YES Author name: Robert Tarjan affil: Princeton Univ. Conference name: FOCS series Venue name: FOCS67 part of Article name: FOCSHopcroft67 year: 1967 Author name: John Hopcroft affil: Cornell Univ. series Venue name: PODS89 part of Article name: PODSUllman89 year: 1989 Author name: Jeff Ullman affil: Stanford Univ. Conference name: PODS series series corresponding: YES Venue name: PODS83 part of Article name: PODSFaginUV83 year: 1983 corresponding: YES Author name: Ron Fagin affil: IBM Almaden Venue name: PODS95 part of part of Article name: PODSVardi95 year: 1995 cooresponding: YES Author name: Moshe Vardi affil: Rice University Article cooresponding: YES Author name: PODSLibkin95 name: Leonid Libkin year: 1995 affil: U. of Edinburgh Journal name: IPL journal Article name: IPLLibkinW95 year: 1995 cooresponding: YES Author name: Limsoon Wong affil: NU Singapore

73 Graphs are there to be navigated Find pairs of authors linked by a coauthorship sequence Author ( 1 ) Author

74 Do practical query languages navigate? Very little: Check if there is a directed path between two nodes (DFS)

75 Do practical query languages navigate? Very little: Check if there is a directed path between two nodes (DFS) The previous query cannot be expressed (save for a few exceptions)

76 Do practical query languages navigate? Very little: Check if there is a directed path between two nodes (DFS) The previous query cannot be expressed (save for a few exceptions) No support for regular path queries (database theory, RDF, DL) Is there a directed path whose label satisfies a regex?

77 Are RPQs harder to evaluate?

78 Are RPQs harder to evaluate? Not really (in theory): Convert the regex into an automata Take the cross product of the property graph and the automata Check if there is a path from an initial to a final state (DFS)

79 Are RPQs harder to evaluate? Not really (in theory): Convert the regex into an automata Take the cross product of the property graph and the automata Check if there is a path from an initial to a final state (DFS) Cost is linear in the size of the data and the regex

80 But, what is the semantics? Is there a path or a simple path? Database theory concentrates on the former (why?) Graph DB engines implement the latter (why?)

81 But, what is the semantics? Is there a path or a simple path? Database theory concentrates on the former (why?) Graph DB engines implement the latter (why?)

82 But, what is the semantics? Is there a path or a simple path? Database theory concentrates on the former (why?) Graph DB engines implement the latter (why?)

83 But, what is the semantics? Is there a path or a simple path? Database theory concentrates on the former (why?) Graph DB engines implement the latter (why?) Our algorithm evaluates RPQs under arbitrary path semantics

84 But, what is the semantics? Is there a path or a simple path? Database theory concentrates on the former (why?) Graph DB engines implement the latter (why?) Our algorithm evaluates RPQs under arbitrary path semantics Is it possible to use it under a simple path semantics?

85 RPQs under simple paths Is there a simple path whose label satisfies a regex? This problem is NP-complete even if the regex is fixed! High complexity only dependent on the size of the data

86 RPQs under simple paths Is there a simple path whose label satisfies a regex? This problem is NP-complete even if the regex is fixed! High complexity only dependent on the size of the data

87 RPQs under simple paths Is there a simple path whose label satisfies a regex? This problem is NP-complete even if the regex is fixed! High complexity only dependent on the size of the data

88 RPQs under simple paths Is there a simple path whose label satisfies a regex? This problem is NP-complete even if the regex is fixed! High complexity only dependent on the size of the data Example: Consider the RPQ (aa) 1. It asks whether there is a simple path of even length from x to y 2. This problem is NP-complete

89 Adding RPQs to graph patterns Give rise to the class of conjunctive RPQs Has received considerable attention in theory (Essentially) unexplored from a practical point of view Challenging because of matching and RPQ evaluation

90 Adding RPQs to graph patterns Give rise to the class of conjunctive RPQs Has received considerable attention in theory (Essentially) unexplored from a practical point of view Challenging because of matching and RPQ evaluation

91 Adding RPQs to graph patterns Give rise to the class of conjunctive RPQs Has received considerable attention in theory (Essentially) unexplored from a practical point of view Challenging because of matching and RPQ evaluation

92 Adding RPQs to graph patterns Give rise to the class of conjunctive RPQs Has received considerable attention in theory (Essentially) unexplored from a practical point of view Challenging because of matching and RPQ evaluation

93 Are paths the only form of navigation?

94 Are paths the only form of navigation? No! We can allow stronger forms of recursion (DB theory, DL, MC): Navigate with branching (nested regexs, similar evaluation to RPQs) Allow transitive closure on top of conjunctive RPQs (regular queries, harder to evaluate) Allow arbitrary recursion (datalog, very hard to evaluate, non-parallelizable)

95 Are paths the only form of navigation? No! We can allow stronger forms of recursion (DB theory, DL, MC): Navigate with branching (nested regexs, similar evaluation to RPQs) Allow transitive closure on top of conjunctive RPQs (regular queries, harder to evaluate) Allow arbitrary recursion (datalog, very hard to evaluate, non-parallelizable)

96 Are paths the only form of navigation? No! We can allow stronger forms of recursion (DB theory, DL, MC): Navigate with branching (nested regexs, similar evaluation to RPQs) Allow transitive closure on top of conjunctive RPQs (regular queries, harder to evaluate) Allow arbitrary recursion (datalog, very hard to evaluate, non-parallelizable)

97 Are paths the only form of navigation? No! We can allow stronger forms of recursion (DB theory, DL, MC): Navigate with branching (nested regexs, similar evaluation to RPQs) Allow transitive closure on top of conjunctive RPQs (regular queries, harder to evaluate) Allow arbitrary recursion (datalog, very hard to evaluate, non-parallelizable)

98 Conclusions (really, questions) Are RPQs useful in practice? Can they be implemented? Under which semantics? What about conjunctive RPQs? Or even stronger forms of recursion?

99 Conclusions (really, questions) Are RPQs useful in practice? Can they be implemented? Under which semantics? What about conjunctive RPQs? Or even stronger forms of recursion?

100 Conclusions (really, questions) Are RPQs useful in practice? Can they be implemented? Under which semantics? What about conjunctive RPQs? Or even stronger forms of recursion?

101 Conclusions (really, questions) Are RPQs useful in practice? Can they be implemented? Under which semantics? What about conjunctive RPQs? Or even stronger forms of recursion?

102 Conclusions (really, questions) Are RPQs useful in practice? Can they be implemented? Under which semantics? What about conjunctive RPQs? Or even stronger forms of recursion?

103 Conclusions (really, questions) Are RPQs useful in practice? Can they be implemented? Under which semantics? What about conjunctive RPQs? Or even stronger forms of recursion?

104 RETURNING PATHS

105 Paths in the output Query languages such as Cypher & PGQL allow: Return one path Return one shortest path (DFS) Return all paths Return all shortest paths

106 Paths in the output Query languages such as Cypher & PGQL allow: Return one path Return one shortest path (DFS) Return all paths Return all shortest paths But, what does it mean to return all paths?... there can be infinitely many

107 Paths in the output Query languages such as Cypher & PGQL allow: Return one path Return one shortest path (DFS) Return all paths Return all shortest paths But, what does it mean to return all paths?... there can be infinitely many Systems return simple paths... but there can be exponentially many

108 Even more interesting for RPQs Return a shortest path whose label satisfies a regex And all shortest paths Return all paths whose label satisfies a regex And all paths

109 A solution from database theory Instead of returning all paths... return a compact representation of them

110 A solution from database theory Instead of returning all paths... return a compact representation of them Compact representation: A property graph with all paths in the output Can be constructed efficiently (in data) for arbitrary paths Impossible for simple paths

111 A solution from database theory Instead of returning all paths... return a compact representation of them Compact representation: A property graph with all paths in the output Can be constructed efficiently (in data) for arbitrary paths Impossible for simple paths

112 A solution from database theory Instead of returning all paths... return a compact representation of them Compact representation: A property graph with all paths in the output Can be constructed efficiently (in data) for arbitrary paths Impossible for simple paths

113 Conclusions Returning paths is difficult under all interpretations All paths can be compactly represented, but simple paths cannot The right semantics still needs to be settled

114 UNGROUPING

115 Paths can appear in the output

116 Paths can appear in the output We can ungroup them List the nodes that appear in them

117 Paths can appear in the output We can ungroup them List the nodes that appear in them We can check if a path visits all nodes Travelling salesman problem!

118 Paths can appear in the output We can ungroup them List the nodes that appear in them We can check if a path visits all nodes Travelling salesman problem!

119 Paths can appear in the output We can ungroup them List the nodes that appear in them We can check if a path visits all nodes Travelling salesman problem! Interaction with other operators expresses even more complex properties

120 A query language for paths Variables for paths and nodes Can check if a node belongs to a path Can check if the label of a path satisfies a regex Closure under quantification over nodes and paths

121 A query language for paths Variables for paths and nodes Can check if a node belongs to a path Can check if the label of a path satisfies a regex Closure under quantification over nodes and paths

122 A query language for paths Variables for paths and nodes Can check if a node belongs to a path Can check if the label of a path satisfies a regex Closure under quantification over nodes and paths

123 A query language for paths Variables for paths and nodes Can check if a node belongs to a path Can check if the label of a path satisfies a regex Closure under quantification over nodes and paths

124 A query language for paths Variables for paths and nodes Can check if a node belongs to a path Can check if the label of a path satisfies a regex Closure under quantification over nodes and paths

125 A query language for paths Variables for paths and nodes Can check if a node belongs to a path Can check if the label of a path satisfies a regex Closure under quantification over nodes and paths Complexity of evaluation is astronomical (in data) (Severely) restricting the language leads to efficiency

126 A query language for paths Variables for paths and nodes Can check if a node belongs to a path Can check if the label of a path satisfies a regex Closure under quantification over nodes and paths Complexity of evaluation is astronomical (in data) (Severely) restricting the language leads to efficiency

127 Question What ungropuing can do and should do?

128 FINAL THOUGHTS

129 Final thoughts Exciting times for studying graph DBs (theory/practice) Lots of fine tuning needed Some issues still unexplored: Comparing paths Ranking of answers Constraints

130 MANY THANKS

131 Bibliography Surveys: P. Wood. Query languages for graph databases. SIGMOD Record, P. Barceló. Querying graph databases. PODS, Regular path queries: A. Mendelzon and P. Wood. Finding regular simple paths in graph databases. SIAM J. on Comp, D. Calvanese, G. de Giacomo, M. Lenzerini, M. Vardi. Reasoning on regular path queries. SIGMOD Record, Conjunctive regular path queries: D. Calvanese, G. de Giacomo, M. Lenzerini, M. Vardi. Containment of conjunctive regular path queries with inverse. KR, 2000.

132 Bibliography Queries with more expressive recursion: P. Barceló, J. Pérez, J. Reutter. Relative expressiveness of nested regular expressions. AMW, J. Reutter, M. Romero, M. Vardi. Regular queries on graph databases. ICDT, Queries that compare nodes: L. Libkin, D,. Vrgoc. Regular path queries on graphs with data. ICDT, G. Fletcher, M. Gyssens, D. Leinders, et al. Relative expressive power of navigational querying on graphs. ICDT, P. Barceló, G. Fontaine, A. W. Lin. Expressive path queries over graphs with data. LMCS, Queries with paths as first-class citizens and negation: P. Barceló, L. Libkin, A. W. Lin, P. Wood. Expressive path queries over graph-structured data. ACM TODS, 2012.

Graph Databases. Advanced Topics in Foundations of Databases, University of Edinburgh, 2017/18

Graph Databases. Advanced Topics in Foundations of Databases, University of Edinburgh, 2017/18 Graph Databases Advanced Topics in Foundations of Databases, University of Edinburgh, 2017/18 Graph Databases and Applications Graph databases are crucial when topology is as important as the data Several

More information

Classical DB Questions on New Kinds of Data

Classical DB Questions on New Kinds of Data 1 Classical DB Questions on New Kinds of Data Marcelo Arenas & Pablo Barceló, Center for Semantic Web Research PUC Chile Universidad de Chile Evolution of data models 80s early 90s Overcome expressiveness

More information

Publications of Pablo Barceló by type

Publications of Pablo Barceló by type Publications of Pablo Barceló by type Type A 1. Pablo Barceló. Logical Foundations of Relational Data Exchange. SIGMOD Record 38(1):49-58, Mar 2009. ACM Press, New York, NY, USA. ISSN 0163-5808. 2. Leonid

More information

Querying Graph Databases 1/ 83

Querying Graph Databases 1/ 83 Querying Graph Databases 1/ 83 Graph DBs and applications Graph DBs are crucial when topology is as important as data itself. Renewed interest due to new applications: Semantic Web and RDF. Social networks.

More information

G-CORE: A Core for Future Graph Query Languages

G-CORE: A Core for Future Graph Query Languages G-CORE: A Core for Future Graph Query Languages Designed by the LDBC Graph Query Language Task Force Hannes Voigt hannes.voigt@tu-dresden.de http://bit.ly/gcorelanguage @LDBCouncil FOSDEM Graph Feb 3rd,

More information

Processing Regular Path Queries Using Views or What Do We Need for Integrating Semistructured Data?

Processing Regular Path Queries Using Views or What Do We Need for Integrating Semistructured Data? Processing Regular Path Queries Using Views or What Do We Need for Integrating Semistructured Data? Diego Calvanese University of Rome La Sapienza joint work with G. De Giacomo, M. Lenzerini, M.Y. Vardi

More information

Incomplete Data: What Went Wrong, and How to Fix It. PODS 2014 Incomplete Information 1/1

Incomplete Data: What Went Wrong, and How to Fix It. PODS 2014 Incomplete Information 1/1 Incomplete Data: What Went Wrong, and How to Fix It Leonid Libkin (University of Edinburgh) PODS 2014 Incomplete Information 1/1 Incomplete information It is everywhere. The more data we accumulate, the

More information

A TriAL: A navigational algebra for RDF triplestores

A TriAL: A navigational algebra for RDF triplestores A TriAL: A navigational algebra for RDF triplestores Navigational queries over RDF data are viewed as one of the main applications of graph query languages, and yet the standard model of graph databases

More information

ANDREAS PIERIS JOURNAL PAPERS

ANDREAS PIERIS JOURNAL PAPERS ANDREAS PIERIS School of Informatics, University of Edinburgh Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK apieris@inf.ed.ac.uk PUBLICATIONS (authors in alphabetical order) JOURNAL

More information

The Property Graph Database Model

The Property Graph Database Model The Property Graph Database Model Renzo Angles 1 Dept. of Computer Science, Universidad de Talca, Chile 2 Center for Semantic Web Research Abstract. Most of the current graph database systems have been

More information

Structural Characterization of Graph Navigational Languages (Extended Abstract)

Structural Characterization of Graph Navigational Languages (Extended Abstract) Structural Characterization of Graph Navigational Languages (Extended Abstract) Valeria Fionda 1 and Giuseppe Pirrò 2 1 DeMaCS, University of Calabria, Italy fionda@mat.unical.it 2 Institute for High Performance

More information

Querying from a Graph Database Perspective: the case of RDF

Querying from a Graph Database Perspective: the case of RDF Querying from a Database Perspective: the case of RDF Renzo Angles and Claudio Gutierrez Department of Computer Science Universidad de Chile {rangles, cgutierr}@dcc.uchile.cl Agenda Motivations, Problem

More information

On the Hardness of Counting the Solutions of SPARQL Queries

On the Hardness of Counting the Solutions of SPARQL Queries On the Hardness of Counting the Solutions of SPARQL Queries Reinhard Pichler and Sebastian Skritek Vienna University of Technology, Faculty of Informatics {pichler,skritek}@dbai.tuwien.ac.at 1 Introduction

More information

a standard database system

a standard database system user queries (RA, SQL, etc.) relational database Flight origin destination airline Airport code city VIE LHR BA VIE Vienna LHR EDI BA LHR London LGW GLA U2 LGW London LCA VIE OS LCA Larnaca a standard

More information

Structural characterizations of schema mapping languages

Structural characterizations of schema mapping languages Structural characterizations of schema mapping languages Balder ten Cate INRIA and ENS Cachan (research done while visiting IBM Almaden and UC Santa Cruz) Joint work with Phokion Kolaitis (ICDT 09) Schema

More information

Regular Expressions for Data Words

Regular Expressions for Data Words Regular Expressions for Data Words Leonid Libkin and Domagoj Vrgoč School of Informatics, University of Edinburgh Abstract. In data words, each position carries not only a letter form a finite alphabet,

More information

SPARQL with Property Paths

SPARQL with Property Paths SPARQL with Property Paths Egor V. Kostylev 1, Juan L. Reutter 2, Miguel Romero 3, and Domagoj Vrgoč 2 1 University of Oxford 2 PUC Chile and Center for Semantic Web Research 3 University of Chile and

More information

Revisiting Blank Nodes in RDF to Avoid the Semantic Mismatch with SPARQL

Revisiting Blank Nodes in RDF to Avoid the Semantic Mismatch with SPARQL Revisiting Blank Nodes in RDF to Avoid the Semantic Mismatch with SPARQL Marcelo Arenas 1, Mariano Consens 2, and Alejandro Mallea 1,3 1 Pontificia Universidad Católica de Chile 2 University of Toronto

More information

A Foundations of Modern Query Languages for Graph Databases 1

A Foundations of Modern Query Languages for Graph Databases 1 A Foundations of Modern Query Languages for Graph Databases 1 RENZO ANGLES, Universidad de Talca & Center for Semantic Web Research MARCELO ARENAS, Pontificia Universidad Católica de Chile & Center for

More information

Logical Foundations of Relational Data Exchange

Logical Foundations of Relational Data Exchange Logical Foundations of Relational Data Exchange Pablo Barceló Department of Computer Science, University of Chile pbarcelo@dcc.uchile.cl 1 Introduction Data exchange has been defined as the problem of

More information

FOUNDATIONS OF DATABASES AND QUERY LANGUAGES

FOUNDATIONS OF DATABASES AND QUERY LANGUAGES FOUNDATIONS OF DATABASES AND QUERY LANGUAGES Lecture 14: Database Theory in Practice Markus Krötzsch TU Dresden, 20 July 2015 Overview 1. Introduction Relational data model 2. First-order queries 3. Complexity

More information

Composition and Inversion of Schema Mappings

Composition and Inversion of Schema Mappings Composition and Inversion of Schema Mappings Marcelo Arenas Jorge Pérez Juan Reutter Cristian Riveros PUC Chile PUC Chile U. of Edinburgh Oxford University marenas@ing.puc.cl jperez@ing.puc.cl juan.reutter@ed.ac.uk

More information

G-CORE A Core for Future Graph Query Languages Designed by the LDBC Graph Query Language Task Force

G-CORE A Core for Future Graph Query Languages Designed by the LDBC Graph Query Language Task Force G-CORE A Core for Future Graph Query Languages Designed by the LDBC Graph Query Language Task Force RENZO ANGLES, Universidad de Talca MARCELO ARENAS, PUC Chile PABLO BARCELÓ, DCC, Universidad de Chile

More information

Database Theory VU , SS Introduction: Relational Query Languages. Reinhard Pichler

Database Theory VU , SS Introduction: Relational Query Languages. Reinhard Pichler Database Theory Database Theory VU 181.140, SS 2011 1. Introduction: Relational Query Languages Reinhard Pichler Institut für Informationssysteme Arbeitsbereich DBAI Technische Universität Wien 8 March,

More information

Database Theory VU , SS Introduction: Relational Query Languages. Reinhard Pichler

Database Theory VU , SS Introduction: Relational Query Languages. Reinhard Pichler Database Theory Database Theory VU 181.140, SS 2018 1. Introduction: Relational Query Languages Reinhard Pichler Institut für Informationssysteme Arbeitsbereich DBAI Technische Universität Wien 6 March,

More information

The Logic of the Semantic Web. Enrico Franconi Free University of Bozen-Bolzano, Italy

The Logic of the Semantic Web. Enrico Franconi Free University of Bozen-Bolzano, Italy The Logic of the Semantic Web Enrico Franconi Free University of Bozen-Bolzano, Italy What is this talk about 2 What is this talk about A sort of tutorial of RDF, the core semantic web knowledge representation

More information

Approximation Algorithms for Computing Certain Answers over Incomplete Databases

Approximation Algorithms for Computing Certain Answers over Incomplete Databases Approximation Algorithms for Computing Certain Answers over Incomplete Databases Sergio Greco, Cristian Molinaro, and Irina Trubitsyna {greco,cmolinaro,trubitsyna}@dimes.unical.it DIMES, Università della

More information

XML Data Exchange. Marcelo Arenas P. Universidad Católica de Chile. Joint work with Leonid Libkin (U. of Toronto)

XML Data Exchange. Marcelo Arenas P. Universidad Católica de Chile. Joint work with Leonid Libkin (U. of Toronto) XML Data Exchange Marcelo Arenas P. Universidad Católica de Chile Joint work with Leonid Libkin (U. of Toronto) Data Exchange in Relational Databases Data exchange has been extensively studied in the relational

More information

Foundations of SPARQL Query Optimization

Foundations of SPARQL Query Optimization Foundations of SPARQL Query Optimization Michael Schmidt, Michael Meier, Georg Lausen Albert-Ludwigs-Universität Freiburg Database and Information Systems Group 13 th International Conference on Database

More information

Recursion in SPARQL. Juan L. Reutter, Adrián Soto, and Domagoj Vrgoč. PUC Chile and Center for Semantic Web Research

Recursion in SPARQL. Juan L. Reutter, Adrián Soto, and Domagoj Vrgoč. PUC Chile and Center for Semantic Web Research Recursion in SPARQL Juan L. Reutter, Adrián Soto, and Domagoj Vrgoč PUC Chile and Center for Semantic Web Research Abstract. The need for recursive queries in the Semantic Web setting is becoming more

More information

Joint Entity Resolution

Joint Entity Resolution Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute

More information

Foundations of Schema Mapping Management

Foundations of Schema Mapping Management Foundations of Schema Mapping Management Marcelo Arenas Jorge Pérez Juan Reutter Cristian Riveros PUC Chile PUC Chile University of Edinburgh Oxford University marenas@ing.puc.cl jperez@ing.puc.cl juan.reutter@ed.ac.uk

More information

Introduction to Graph Data Management

Introduction to Graph Data Management Introduction to Graph Data Management Claudio Gutierrez Center for Semantic Web Research (CIWS) Department of Computer Science Universidad de Chile EDBT Summer School Palamos 2015 Joint Work With Renzo

More information

The Inverse of a Schema Mapping

The Inverse of a Schema Mapping The Inverse of a Schema Mapping Jorge Pérez Department of Computer Science, Universidad de Chile Blanco Encalada 2120, Santiago, Chile jperez@dcc.uchile.cl Abstract The inversion of schema mappings has

More information

arxiv: v3 [cs.db] 15 Jun 2017

arxiv: v3 [cs.db] 15 Jun 2017 A Foundations of Modern Query Languages for Graph Databases 1 RENZO ANGLES, Universidad de Talca & Center for Semantic Web Research MARCELO ARENAS, Pontificia Universidad Católica de Chile & Center for

More information

Property Path Query in SPARQL 1.1

Property Path Query in SPARQL 1.1 Property Path Query in SPARQL 1.1 Worarat Krathu Guohui Xiao Institute of Information Systems, Vienna University of Technology July 2012 Property Path Query in SPARQL 1.1 0/27 Overview Introduction Limitation

More information

Provable data privacy

Provable data privacy Provable data privacy Kilian Stoffel 1 and Thomas Studer 2 1 Université de Neuchâtel, Pierre-à-Mazel 7, CH-2000 Neuchâtel, Switzerland kilian.stoffel@unine.ch 2 Institut für Informatik und angewandte Mathematik,

More information

Foundations of Data Exchange and Metadata Management. Marcelo Arenas Ron Fagin Special Event - SIGMOD/PODS 2016

Foundations of Data Exchange and Metadata Management. Marcelo Arenas Ron Fagin Special Event - SIGMOD/PODS 2016 Foundations of Data Exchange and Metadata Management Marcelo Arenas Ron Fagin Special Event - SIGMOD/PODS 2016 The need for a formal definition We had a paper with Ron in PODS 2004 Back then I was a Ph.D.

More information

RELATIONAL DATABASE AND GRAPH DATABASE: A COMPARATIVE ANALYSIS

RELATIONAL DATABASE AND GRAPH DATABASE: A COMPARATIVE ANALYSIS RELATIONAL DATABASE AND GRAPH DATABASE: A COMPARATIVE ANALYSIS Surajit Medhi 1, Hemanta K. Baruah 2 1 Department of Computer Science, Gauhati University, Assam, India 2 Bodoland University, Assam, India

More information

Inverting Schema Mappings: Bridging the Gap between Theory and Practice

Inverting Schema Mappings: Bridging the Gap between Theory and Practice Inverting Schema Mappings: Bridging the Gap between Theory and Practice Marcelo Arenas Jorge Pérez Juan Reutter Cristian Riveros PUC Chile PUC Chile PUC Chile R&M Tech marenas@ing.puc.cl jperez@ing.puc.cl

More information

A Retrospective on Datalog 1.0

A Retrospective on Datalog 1.0 A Retrospective on Datalog 1.0 Phokion G. Kolaitis UC Santa Cruz and IBM Research - Almaden Datalog 2.0 Vienna, September 2012 2 / 79 A Brief History of Datalog In the beginning of time, there was E.F.

More information

View-based query processing: On the relationship between rewriting, answering and losslessness

View-based query processing: On the relationship between rewriting, answering and losslessness Theoretical Computer Science 371 (2007) 169 182 www.elsevier.com/locate/tcs View-based query processing: On the relationship between rewriting, answering and losslessness Diego Calvanese a,, Giuseppe De

More information

Graph Databases. Graph Databases. May 2015 Alberto Abelló & Oscar Romero

Graph Databases. Graph Databases. May 2015 Alberto Abelló & Oscar Romero Graph Databases 1 Knowledge Objectives 1. Describe what a graph database is 2. Explain the basics of the graph data model 3. Enumerate the best use cases for graph databases 4. Name two pros and cons of

More information

PGQL: a Property Graph Query Language

PGQL: a Property Graph Query Language PGQL: a Property Graph Query Language Oskar van Rest Sungpack Hong Jinha Kim Xuming Meng Hassan Chafi Oracle Labs June 24, 2016 Safe Harbor Statement The following is intended to outline our general product

More information

Certain Answers as Objects and Knowledge

Certain Answers as Objects and Knowledge Proceedings of the Fourteenth International Conference on Principles of Knowledge Representation and Reasoning Certain Answers as Objects and Knowledge Leonid Libkin School of Informatics, University of

More information

following syntax: R ::= > n j P j $i=n : C j :R j R 1 u R 2 C ::= > 1 j A j :C j C 1 u C 2 j 9[$i]R j (» k [$i]r) where i and j denote components of r

following syntax: R ::= > n j P j $i=n : C j :R j R 1 u R 2 C ::= > 1 j A j :C j C 1 u C 2 j 9[$i]R j (» k [$i]r) where i and j denote components of r Answering Queries Using Views in Description Logics Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini Dipartimento di Informatica e Sistemistica, Universit a di Roma La Sapienza" Via Salaria 113,

More information

Containment of Data Graph Queries

Containment of Data Graph Queries Containment of Data Graph Queries Egor V. Kostylev University of Oxford and University of Edinburgh egor.kostylev@cs.ox.ac.uk Juan L. Reutter PUC Chile jreutter@ing.puc.cl Domagoj Vrgoč University of Edinburgh

More information

Database Constraints and Homomorphism Dualities

Database Constraints and Homomorphism Dualities Database Constraints and Homomorphism Dualities Balder ten Cate 1, Phokion G. Kolaitis 1,2, and Wang-Chiew Tan 2,1 1 University of California Santa Cruz 2 IBM Research-Almaden Abstract. Global-as-view

More information

Data Exchange in the Relational and RDF Worlds

Data Exchange in the Relational and RDF Worlds Data Exchange in the Relational and RDF Worlds Marcelo Arenas Department of Computer Science Pontificia Universidad Católica de Chile This is joint work with Jorge Pérez, Juan Reutter, Cristian Riveros

More information

RDF* and SPARQL* An Alternative Approach to Statement-Level Metadata in RDF

RDF* and SPARQL* An Alternative Approach to Statement-Level Metadata in RDF RDF* and SPARQL* An Alternative Approach to Statement-Level Metadata in RDF Olaf Hartig @olafhartig Picture source:htp://akae.blogspot.se/2008/08/dios-mo-doc-has-construido-una-mquina.html 2 4 htp://tinkerpop.apache.org/docs/current/reference/#intro

More information

Formal languages and computation models

Formal languages and computation models Formal languages and computation models Guy Perrier Bibliography John E. Hopcroft, Rajeev Motwani, Jeffrey D. Ullman - Introduction to Automata Theory, Languages, and Computation - Addison Wesley, 2006.

More information

On the Codd Semantics of SQL Nulls

On the Codd Semantics of SQL Nulls On the Codd Semantics of SQL Nulls Paolo Guagliardo and Leonid Libkin School of Informatics, University of Edinburgh Abstract. Theoretical models used in database research often have subtle differences

More information

Finite Model Theory and Its Applications

Finite Model Theory and Its Applications Erich Grädel Phokion G. Kolaitis Leonid Libkin Maarten Marx Joel Spencer Moshe Y. Vardi Yde Venema Scott Weinstein Finite Model Theory and Its Applications With 35 Figures and 2 Tables Springer Contents

More information

Lecture 22 Acyclic Joins and Worst Case Join Results Instructor: Sudeepa Roy

Lecture 22 Acyclic Joins and Worst Case Join Results Instructor: Sudeepa Roy CompSci 516 ata Intensive Computing Systems Lecture 22 Acyclic Joins and Worst Case Join Results Instructor: Sudeepa Roy uke CS, Fall 2016 CompSci 516: ata Intensive Computing Systems Announcements Final

More information

millions of SQL Server users worldwide, this feature broadens enormously concepts behind the model; how relationships are handled and what are the

millions of SQL Server users worldwide, this feature broadens enormously concepts behind the model; how relationships are handled and what are the SQL Server 2017 introduced the extension for graph databases. As there are millions of SQL Server users worldwide, this feature broadens enormously the audience of potential users. But, what to expect

More information

Designing Views to Answer Queries under Set, Bag,and BagSet Semantics

Designing Views to Answer Queries under Set, Bag,and BagSet Semantics Designing Views to Answer Queries under Set, Bag,and BagSet Semantics Rada Chirkova Department of Computer Science, North Carolina State University Raleigh, NC 27695-7535 chirkova@csc.ncsu.edu Foto Afrati

More information

Outline. Database Theory. Prerequisites and Admission. Classes VU , SS 2018

Outline. Database Theory. Prerequisites and Admission. Classes VU , SS 2018 Database Theory Database Theory Outline Database Theory VU 181.140, SS 2018 0. General Information Reinhard Pichler Institut für Informationssysteme Arbeitsbereich DBAI Technische Universität Wien 6 March,

More information

XPath for DL-Lite Ontologies

XPath for DL-Lite Ontologies XPath for DL-Lite Ontologies Egor V. Kostylev 1, Juan L. Reutter 2, and Domagoj Vrgoč 3 1 University of Oxfordegor.kostylev@cs.ox.ac.uk 2 PUC Chilejreutter@ing.puc.cl 3 University of Edinburghdomagoj.vrgoc@ed.ac.uk

More information

SWARMGUIDE: Towards Multiple-Query Optimization in Graph Databases

SWARMGUIDE: Towards Multiple-Query Optimization in Graph Databases SWARMGUIDE: Towards Multiple-Query Optimization in Graph Databases Zahid Abul-Basher Nikolay Yakovets Parke Godfrey Mark Chignell University of Toronto, Toronto, ON, Canada {zahid,chignell}@mie.utoronto.ca

More information

Relative Expressive Power of Navigational Querying on Graphs

Relative Expressive Power of Navigational Querying on Graphs Relative Expressive Power of Navigational Querying on Graphs George H. L. Fletcher Eindhoven University of Technology g.h.l.fletcher@tue.nl Jan Van den Bussche Hasselt University & Transnational Univ.

More information

The GQL Manifesto. 1. References [DM ]

The GQL Manifesto. 1. References [DM ] The GQL Manifesto Title Author Status Date The GQL Manifesto Alastair Green, Individual Expert, Neo4j Inc. Discussion Paper Date of original publication, 13 May 2018, at https://gql.today Referenced in

More information

Graph Query Verification using Monadic 2 nd -Order Logic

Graph Query Verification using Monadic 2 nd -Order Logic 1 Graph Query Verification using Monadic 2 nd -Order Logic Graph Kazuhiro Inaba ( 稲葉一浩 ) kinaba@nii.ac.jp Oct 10, 2010 1 st PKU-NII International Joint Workshop on Advanced Software Engineering 2 Goal

More information

Data integration supports seamless access to autonomous, heterogeneous information

Data integration supports seamless access to autonomous, heterogeneous information Using Constraints to Describe Source Contents in Data Integration Systems Chen Li, University of California, Irvine Data integration supports seamless access to autonomous, heterogeneous information sources

More information

ArgQL: A Declarative Language for Querying Argumentative Dialogues

ArgQL: A Declarative Language for Querying Argumentative Dialogues ArgQL: A Declarative Language for Querying Argumentative Dialogues R U L E M L + R R : I N T E R N AT I O N A L J O I N T C O N F E R E N C E O N R U L E S A N D R E A S O N I N G D I M I T R A Z O G R

More information

Towards Equivalences for Federated SPARQL Queries

Towards Equivalences for Federated SPARQL Queries Towards Equivalences for Federated SPARQL Queries Carlos Buil-Aranda 1? and Axel Polleres 2?? 1 Department of Computer Science, Pontificia Universidad Católica, Chile cbuil@ing.puc.cl 2 Vienna University

More information

TriAL for RDF: Adapting Graph Query Languages for RDF Data

TriAL for RDF: Adapting Graph Query Languages for RDF Data TriAL for RDF: Adapting Graph Query Languages for RDF Data Leonid Libkin University of Edinburgh libkin@ed.ac.uk Juan Reutter University of Edinburgh and Catholic University of Chile juan.reutter@ed.ac.uk

More information

A Formal Framework for Comparing Linked Data Fragments

A Formal Framework for Comparing Linked Data Fragments A Formal Framework for Comparing Linked Data Fragments Olaf Hartig 1, Ian Letter 2, and Jorge Pérez 3 1 Dept. of Computer and Information Science (IDA), Linköping University, Sweden olaf.hartig@liu.se

More information

SQL. Dean Williamson, Ph.D. Assistant Vice President Institutional Research, Effectiveness, Analysis & Accreditation Prairie View A&M University

SQL. Dean Williamson, Ph.D. Assistant Vice President Institutional Research, Effectiveness, Analysis & Accreditation Prairie View A&M University SQL Dean Williamson, Ph.D. Assistant Vice President Institutional Research, Effectiveness, Analysis & Accreditation Prairie View A&M University SQL 1965: Maron & Levien propose Relational Data File 1968:

More information

A Study on Reverse Top-K Queries Using Monochromatic and Bichromatic Methods

A Study on Reverse Top-K Queries Using Monochromatic and Bichromatic Methods A Study on Reverse Top-K Queries Using Monochromatic and Bichromatic Methods S.Anusuya 1, M.Balaganesh 2 P.G. Student, Department of Computer Science and Engineering, Sembodai Rukmani Varatharajan Engineering

More information

Preferentially Annotated Regular Path Queries

Preferentially Annotated Regular Path Queries Preferentially Annotated Regular Path Queries Gosta Grahne Concordia U. Canada Alex Thomo U. of Victoria Canada Bill Wadge U. of Victoria Canada Regular Path Queries (RPQ s) Essentially regular expressions

More information

OWL and tractability. Based on slides from Ian Horrocks and Franz Baader. Combining the strengths of UMIST and The Victoria University of Manchester

OWL and tractability. Based on slides from Ian Horrocks and Franz Baader. Combining the strengths of UMIST and The Victoria University of Manchester OWL and tractability Based on slides from Ian Horrocks and Franz Baader Where are we? OWL Reasoning DL Extensions Scalability OWL OWL in practice PL/FOL XML RDF(S)/SPARQL Practical Topics Repetition: DL

More information

Unit 4 Relational Algebra (Using SQL DML Syntax): Data Manipulation Language For Relations Zvi M. Kedem 1

Unit 4 Relational Algebra (Using SQL DML Syntax): Data Manipulation Language For Relations Zvi M. Kedem 1 Unit 4 Relational Algebra (Using SQL DML Syntax): Data Manipulation Language For Relations 2014 Zvi M. Kedem 1 Relational Algebra And SQL SQL is based on relational algebra with many extensions Some necessary

More information

a standard database system

a standard database system user queries (RA, SQL, etc.) relational database Flight origin destination airline Airport code city VIE LHR BA VIE Vienna LHR EDI BA LHR London LGW GLA U2 LGW London LCA VIE OS LCA Larnaca a standard

More information

Semantics Preserving SQL-to-SPARQL Query Translation for Left Outer Join

Semantics Preserving SQL-to-SPARQL Query Translation for Left Outer Join Semantics Preserving SQL-to-SPARQL Query Translation for Left Outer Join BAHAJ Mohamed, Soussi Nassima Faculty of Science and Technologies, Settat Morocco mohamedbahaj@gmail.com sossinass@gmail.com ABSTRACT:

More information

A Model of Distributed Query Computation in Client-Server Scenarios on the Semantic Web

A Model of Distributed Query Computation in Client-Server Scenarios on the Semantic Web A Model of Distributed Query Computation in Client-Server Scenarios on the Semantic Web Olaf Hartig 1, Ian Letter 2, Jorge Pérez 3,4 1 Dept. of Computer and Information Science (IDA), Linköping University,

More information

ATFD: material for presentations, projects, and essays

ATFD: material for presentations, projects, and essays ATFD: material for presentations, projects, and essays Winter 2016 The references are taken from DBLP (http://dblp.uni-trier.de), the main bibliographical source for computer science research. You can

More information

Inference in Hierarchical Multidimensional Space

Inference in Hierarchical Multidimensional Space Proc. International Conference on Data Technologies and Applications (DATA 2012), Rome, Italy, 25-27 July 2012, 70-76 Related papers: http://conceptoriented.org/ Inference in Hierarchical Multidimensional

More information

Core Schema Mappings: Computing Core Solution with Target Dependencies in Data Exchange

Core Schema Mappings: Computing Core Solution with Target Dependencies in Data Exchange Core Schema Mappings: Computing Core Solution with Target Dependencies in Data Exchange S. Ravichandra, and D.V.L.N. Somayajulu Abstract Schema mapping is a declarative specification of the relationship

More information

Graph Database Model for Querying, Searching and Updating

Graph Database Model for Querying, Searching and Updating 2012 International Conference on Software and Computer Applications (ICSCA 2012) IPCSIT vol. 41 (2012) (2012) IACSIT Press, Singapore Graph Database Model for Querying, Searching and Updating Prashish

More information

CURRICULUM VITAE MARCELO ARENAS SHORT BIO RESEARCH INTERESTS EDUCATION PUBLICATIONS

CURRICULUM VITAE MARCELO ARENAS SHORT BIO RESEARCH INTERESTS EDUCATION PUBLICATIONS CURRICULUM VITAE MARCELO ARENAS Associate Professor, Department of Computer Science Pontificia Universidad Católica de Chile Phone: +56 2 23545821 Email: marenas@ing.puc.cl Http: web.ing.puc.cl/~marenas

More information

Data Integration: Logic Query Languages

Data Integration: Logic Query Languages Data Integration: Logic Query Languages Jan Chomicki University at Buffalo Datalog Datalog A logic language Datalog programs consist of logical facts and rules Datalog is a subset of Prolog (no data structures)

More information

Workshops. 1. SIGMM Workshop on Social Media. 2. ACM Workshop on Multimedia and Security

Workshops. 1. SIGMM Workshop on Social Media. 2. ACM Workshop on Multimedia and Security 1. SIGMM Workshop on Social Media SIGMM Workshop on Social Media is a workshop in conjunction with ACM Multimedia 2009. With the growing of user-centric multimedia applications in the recent years, this

More information

Updating data and knowledge bases

Updating data and knowledge bases Updating data and knowledge bases Inconsistency management in data and knowledge bases (2013) Antonella Poggi Sapienza Università di Roma Inconsistency management in data and knowledge bases (2013) Rome,

More information

An Archiving System for Managing Evolution in the Data Web

An Archiving System for Managing Evolution in the Data Web An Archiving System for Managing Evolution in the Web Marios Meimaris *, George Papastefanatos and Christos Pateritsas * Institute for the Management of Information Systems, Research Center Athena, Greece

More information

COMP718: Ontologies and Knowledge Bases

COMP718: Ontologies and Knowledge Bases 1/35 COMP718: Ontologies and Knowledge Bases Lecture 9: Ontology/Conceptual Model based Data Access Maria Keet email: keet@ukzn.ac.za home: http://www.meteck.org School of Mathematics, Statistics, and

More information

Negations in Refinement Type Systems

Negations in Refinement Type Systems Negations in Refinement Type Systems T. Tsukada (U. Tokyo) 14th March 2016 Shonan, JAPAN This Talk About refinement intersection type systems that refute judgements of other type systems. Background Refinement

More information

Function Symbols in Tuple-Generating Dependencies: Expressive Power and Computability

Function Symbols in Tuple-Generating Dependencies: Expressive Power and Computability Function Symbols in Tuple-Generating Dependencies: Expressive Power and Computability Georg Gottlob 1,2, Reinhard Pichler 1, and Emanuel Sallinger 2 1 TU Wien and 2 University of Oxford Tuple-generating

More information

On the Data Complexity of Consistent Query Answering over Graph Databases

On the Data Complexity of Consistent Query Answering over Graph Databases On the Data Complexity of Consistent Query Answering over Graph Databases Pablo Barceló and Gaëlle Fontaine Department of Computer Science University of Chile pbarcelo@dcc.uchile.cl, gaelle@dcc.uchile.cl

More information

}Optimization Formalisms for recursive queries. Module 11: Optimization of Recursive Queries. Module Outline Datalog

}Optimization Formalisms for recursive queries. Module 11: Optimization of Recursive Queries. Module Outline Datalog Module 11: Optimization of Recursive Queries 11.1 Formalisms for recursive queries Examples for problems requiring recursion: Module Outline 11.1 Formalisms for recursive queries 11.2 Computing recursive

More information

Relational Database Theory at Princeton and Beyond

Relational Database Theory at Princeton and Beyond Relational Database Theory at Princeton and Beyond David Maier Dept. of Computer Science and Engineering OGI School of Science & Technology at Oregon Health & Science University In the Beginning Dennis

More information

Logical reconstruction of RDF and ontology languages

Logical reconstruction of RDF and ontology languages Logical reconstruction of RDF and ontology languages Jos de Bruijn 1, Enrico Franconi 2, and Sergio Tessaris 2 1 Digital Enterprise Research Institute, University of Innsbruck, Austria jos.debruijn@deri.org

More information

}Optimization. Module 11: Optimization of Recursive Queries. Module Outline

}Optimization. Module 11: Optimization of Recursive Queries. Module Outline Module 11: Optimization of Recursive Queries Module Outline 11.1 Formalisms for recursive queries 11.2 Computing recursive queries 11.3 Partial transitive closures User Query Transformation & Optimization

More information

Panel: Pattern management challenges

Panel: Pattern management challenges Panel: Pattern management challenges Panos Vassiliadis Univ. of Ioannina, Dept. of Computer Science, 45110, Ioannina, Hellas E-mail: pvassil@cs.uoi.gr 1 Introduction The increasing opportunity of quickly

More information

Isomorphism of Graphs Which are k-separable*

Isomorphism of Graphs Which are k-separable* Reprinted from INFORMATION AND CONTROL All Rights Reserved by Academic Press, New York and Vol. 56. Nos. 1-2, 1983 Printed in Belgium Isomorphism of Graphs Which are k-separable* GARY L. MILLER Department

More information

Unit 4 Relational Algebra (Using SQL DML Syntax): Data Manipulation Language For Relations Zvi M. Kedem 1

Unit 4 Relational Algebra (Using SQL DML Syntax): Data Manipulation Language For Relations Zvi M. Kedem 1 Unit 4 Relational Algebra (Using SQL DML Syntax): Data Manipulation Language For Relations 2016 Zvi M. Kedem 1 Relational Algebra in Context User Level (View Level) Community Level (Base Level) Physical

More information

( A(x) B(x) C(x)) (A(x) A(y)) (C(x) C(y))

( A(x) B(x) C(x)) (A(x) A(y)) (C(x) C(y)) 1 Introduction Finite model theory studies the expressive power of logics on finite models. Classical model theory, on the other hand, concentrates on infinite structures: its origins are in mathematics,

More information

Uncertainty in Databases. Lecture 2: Essential Database Foundations

Uncertainty in Databases. Lecture 2: Essential Database Foundations Uncertainty in Databases Lecture 2: Essential Database Foundations Table of Contents 1 2 3 4 5 6 Table of Contents Codd s Vision Codd Catches On Top Academic Recognition Selected Publication Venues 1 2

More information

Mastro Studio: a system for Ontology-Based Data Management

Mastro Studio: a system for Ontology-Based Data Management Mastro Studio: a system for Ontology-Based Data Management Cristina Civili, Marco Console, Domenico Lembo, Lorenzo Lepore, Riccardo Mancini, Antonella Poggi, Marco Ruzzi, Valerio Santarelli, and Domenico

More information

A Typed Lambda Calculus for Input Sanitation

A Typed Lambda Calculus for Input Sanitation A Typed Lambda Calculus for Input Sanitation Nathan Fulton Carthage College nfulton@carthage.edu April 11, 2013 Abstract Programmers often wish to validate or sanitize user input. One common approach to

More information

GRAPH DB S & APPLICATIONS

GRAPH DB S & APPLICATIONS GRAPH DB S & APPLICATIONS DENIS VRDOLJAK GUNNAR KLEEMANN UC BERKELEY SCHOOL OF INFORMATION BERKELEY DATA SCIENCE GROUP, LLC PRESENTATION ROAD MAP Intro Background Examples Our Work Graph Databases Intro

More information