View-Based Tree-Language Rewritings

Similar documents
Minimum congestion spanning trees in bipartite and random graphs

Q1:Choose the correct answer:

XML Data Streams. XML Stream Processing. XML Stream Processing. Yanlei Diao. University of Massachusetts Amherst

Inverse Kinematics 1 1/29/2018

Select Operation (σ) It selects tuples that satisfy the given predicate from a relation (choose rows). Review : RELATIONAL ALGEBRA

Logic as a Query Language: from Frege to XML. Victor Vianu U.C. San Diego

KINEMATIC ANALYSIS OF VARIOUS ROBOT CONFIGURATIONS

Macrohomogenous Li-Ion-Battery Modeling - Strengths and Limitations

The Crank-Nicholson method for a nonlinear diffusion equation

1. Introduction. Abstract

1 The secretary problem

Calculations for multiple mixers are based on a formalism that uses sideband information and LO frequencies: ( ) sb

Processing Regular Path Queries Using Views or What Do We Need for Integrating Semistructured Data?

Shortest Paths in Directed Graphs

XPath with transitive closure

Exercises Computational Complexity

XML and Databases. Lecture 9 Properties of XPath. Sebastian Maneth NICTA and UNSW

Karen L. Collins. Wesleyan University. Middletown, CT and. Mark Hovey MIT. Cambridge, MA Abstract

XML Research for Formal Language Theorists

Towards Schema-Guided XML Query Induction

HOMEWORK #3 BME 473 ~ Applied Biomechanics Due during Week #10

Touring a Sequence of Polygons

Topics. Lecture 37: Global Optimization. Issues. A Simple Example: Copy Propagation X := 3 B > 0 Y := 0 X := 4 Y := Z + W A := 2 * 3X

Touring a Sequence of Polygons

A SIMPLE IMPERATIVE LANGUAGE THE STORE FUNCTION NON-TERMINATING COMMANDS

More on Polynomial Time and Space

Generic Traverse. CS 362, Lecture 19. DFS and BFS. Today s Outline

Using Bayesian Networks for Cleansing Trauma Data

Combined Radix-10 and Radix-16 Division Unit

Advanced Encryption Standard and Modes of Operation

Representations and Transformations. Objectives

Polygon Side Lengths NAME DATE TIME

Description of Traffic in ATM Networks by the First Erlang Formula

ISSN (Online), Volume 1, Special Issue 2(ICITET 15), March 2015 International Journal of Innovative Trends and Emerging Technologies

Shortest Paths Problem. CS 362, Lecture 20. Today s Outline. Negative Weights

An Intro to LP and the Simplex Algorithm. Primal Simplex

1.3 Functions and Equivalence Relations 1.4 Languages

DAROS: Distributed User-Server Assignment And Replication For Online Social Networking Applications

Edits in Xylia Validity Preserving Editing of XML Documents

Midterm 2 March 10, 2014 Name: NetID: # Total Score

Today s Outline. CS 561, Lecture 23. Negative Weights. Shortest Paths Problem. The presence of a negative cycle might mean that there is

COMP9319 Web Data Compression & Search. Cloud and data optimization XPath containment Distributed path expression processing

Repair of XML documents

AN ALGORITHM FOR RESTRICTED NORMAL FORM TO SOLVE DUAL TYPE NON-CANONICAL LINEAR FRACTIONAL PROGRAMMING PROBLEM

A note on degenerate and spectrally degenerate graphs

Infinite transducers on terms denoting graphs

Course Updates. Reminders: 1) Assignment #13 due Monday. 2) Mirrors & Lenses. 3) Review for Final: Wednesday, May 5th

Datum Transformations of NAV420 Reference Frames

AVL Tree. The height of the BST be as small as possible

Modeling of underwater vehicle s dynamics

Counting multiplicity over infinite alphabets

NP-Completeness. Algorithms

Regular Expressions for Data Words

Graph algorithms based on infinite automata: logical descriptions and usable constructions

Monadic Datalog Containment on Trees

New Rewritings and Optimizations for Regular Path Queries

KS3 Maths Assessment Objectives

Incorporating Speculative Execution into Scheduling of Control-flow Intensive Behavioral Descriptions

Pruning Game Tree by Rollouts

Web Science and additionality

Introduction to Database Systems CSE 414

Lecture 14: Minimum Spanning Tree I

1 The Knuth-Morris-Pratt Algorithm

Announcements. CSE332: Data Abstractions Lecture 19: Parallel Prefix and Sorting. The prefix-sum problem. Outline. Parallel prefix-sum

Operational Semantics Class notes for a lecture given by Mooly Sagiv Tel Aviv University 24/5/2007 By Roy Ganor and Uri Juhasz

Definitions Homework. Quine McCluskey Optimal solutions are possible for some large functions Espresso heuristic. Definitions Homework

An Improved Implementation of Elliptic Curve Digital Signature by Using Sparse Elements

Comparison of XPath Containment Algorithms


View-based query processing: On the relationship between rewriting, answering and losslessness

J. Carme, R. Gilleron, A. Lemay, J. Niehren. INRIA FUTURS, University of Lille 3

Applications of Tree Automata Theory Lecture I: Tree Automata

Where have all the cars gone? A model for determining traffic flow throughout a road network

XPath evaluation in linear time with polynomial combined complexity

Enhanced Regular Path Queries on Semistructured Databases

MAT 155: Describing, Exploring, and Comparing Data Page 1 of NotesCh2-3.doc

Interface Tracking in Eulerian and MMALE Calculations

Schema-Guided Query Induction

Trees, Automata and XML

8 : Learning Fully Observed Undirected Graphical Models

Simple strategy Computes shortest paths from the start note

Deterministic automata for extended regular expressions

Automata Theory for Reasoning about Actions

Automata for XML A survey

CORRECTNESS ISSUES AND LOOP INVARIANTS

Routing Definition 4.1

Interconnection Styles

Delaunay Triangulation: Incremental Construction

Semantic Characterizations of XPath

Universität Augsburg. Institut für Informatik. Approximating Optimal Visual Sensor Placement. E. Hörster, R. Lienhart.

Contents. shortest paths. Notation. Shortest path problem. Applications. Algorithms and Networks 2010/2011. In the entire course:

Drawing Lines in 2 Dimensions

Rewriting Ontology-Mediated Queries. Carsten Lutz University of Bremen

xy-monotone path existence queries in a rectilinear environment

Brief Announcement: Distributed 3/2-Approximation of the Diameter

Data Formats and APIs

CS 598CSC: Approximation Algorithms Lecture date: March 2, 2011 Instructor: Chandra Chekuri

( ) ( ) Completing the Square. Alg 3 1 Rational Roots Solving Polynomial Equations. A Perfect Square Trinomials

Performance Evaluation of an Advanced Local Search Evolutionary Algorithm

Quadrilaterals. Learning Objectives. Pre-Activity

Transcription:

View-Baed Tree-Language Rewriting Lak Lakhmanan, Alex Thomo Univerity of Britih Columbia, Canada Univerity of Vitoria, Canada

Importane of tree XML Semi-trutured textual format are very popular. <movie> <title>houe of ard</title> <year>2013</year> <harater> <name>frani</name> <ator>kevin Spaey</ator> </harater> <harater> <name>claire</name> <ator>robin Wright</ator> </harater> </movie> XML (Multi TB) ue torie: 1. Elevier Paper and book 2. JPMorgan Chae & Co Stok reearh data 3. JetBlue Airway Doument management Soure: MarkLogi XML Impating the Enterprie Tapping into the Power of XML: Five Sue Storie

Importane of tree JSON Semi-trutured textual format are very popular. "movie": { "title": "Houe of ard", "year": "2013", "harater": [ { "name": "Frani", "ator": "Kevin Spaey" }, { "name": "Claire", "ator": "Robin Wright" } ] } JSON (Multi TB) ue torie: 1. CouhDB 2. MongoDB 3. Jaql and Hive JSON SerDe for Hadoop Mantra: Log firt, ak quetion later

Tree viually movie title year harater harater Houe of Card 2013 name ator name ator Frani Kevin Spaey Claire Robin Wright

Another example

Importane of view (example) Big databae of movie in a uper-tree, eah movie being a ub-tree Query ak for all the movie ub-tree with a MAC. mall minority; number about 50. Reult materialized into a view. Tremendou help in anwering new querie, e.g. find ator playing a MAC. Rewrite into: find ator playing a MAC in a movie having a MAC anwer it on the materialized view.

Regular Expreion and Automata Return all movie ator _* m _* aˆ A pattern Automaton 0 0 m 0 1 1 1 1 2 ˆ 2 a 3

Revere Return all movie ator _* m _* aˆ A pattern Automaton a ˆ a a m m m m

Bottom-up Tree Automata Return all movie ator _* m _* aˆ Automaton * * * * a * * a ˆ a * m * m m m A pattern

Run

Bottom-up Tree Automata (II) Return all movie ator of MAC _* m _* MAC aˆ Automaton aˆ a aa * * ˆ * * * * aˆ * aˆ m * m m m a a A pattern

Bottom-up Tree Automata (IV) Return all movie having ome MAC _* mˆ _* MAC a Automaton * a * * * a * * a a * ˆ m * m m m A pattern

Run

Bottom-up Tree Automata (V) Regular tree language (RTA) the et of tree reognized by TA. loed under interetion and omplement Determiniti TA For any tree t, there an be at mot one aepting run of A on t. Power-wie, TA = DTA. Complement obtained from determiniti TA Interetion via a peial ontrution preerve determinim.

Querie Querie are regular et of tree over ˆ Containment Lemma Q Q Q 1 Q2 implie an 1 an 2

Star Operation

Filled Star Operation

Tranformation for avoiding marker overlap

Rewriting, and two et Maximally ontained rewriting: The bad et: The promiing et: Propoition.

Example with hain

Example with hain (II)

Invere of the tar operation Propoition. Compute where J and J are RTQ

Colored Alphabet Marker will be olor Blue for J Red for J

Colored Language et of all tree having one node blue et of all tree having one node red et of all tree having one node blue and another red a deendant of the blue node et of all tree having all node blak, exept root whih i red et of all tree having all node blak, exept for the root whih i blue and another node whih i red.

Colored Language (II)

Colored Language (III) over ame a p, but with blue node turned blak ame a p, but with red node turned blak over

Colored Language (IV) automaton for Similarly:

Rewriting Algorithm Compute: Theorem.

Rewriting Algorithm (II)

Complexity Propoition. polynomial time. an be omputed in Theorem. The MCR of Q uing V an be omputed in exponential time. Theorem. Computing the MCR of Q uing V i EXPTIME-hard.

Final Note Query automata formalim ued i equivalent in power to MSO (golden tandard) For peifying node-eleting querie. Color orrepond to Boolean marking J. Niehren, L. Planque, J.-M. Talbot, and S. Tion. N-ary querie by tree automata. DBPL, 2005 XPath rewriting i NP-hard. XPath i a ubla of our formalim. Our automata-baed algorithm an be ued a well for rewriting XPath querie.

K-ary querie Example: Find the 2-foret of ator tree pair for ator who have played the ame harater together in ome movie. Automaton * aˆ aˆ * * * * * * aˆ m * aˆ m m m A pattern

Run

Why i rewriting K-ary querie hallenging It ha been hown that k-ary querie an be enoded by unary querie T. Shwentik. On diving in tree. In MFCS, 2000. Done by going through MSO formula. Going from a k-ary query to an MSO enoding and then bak to automata inur non elementary omplexity. Therefore we need a another algorithm for rewriting k-ary querie that doen t go via MSO formula

Conluion Charaterized view-baed rewriting a olving a lang. equation Defined appropriate tree operator Defined olored language Gave automata ontrution Computed rewriting a a erie of operation on automata Charaterized the omplexity of omputing rewriting Tight lower bound provided Extended the reult to k-ary querie Common in XQuery

Thank You