Towards Integrating Work1low and Database Provenance

Size: px
Start display at page:

Download "Towards Integrating Work1low and Database Provenance"

Transcription

1 IPAW 12 Interna'onal Provenance and Annota'on Workshop Towards Integrating Work1low and Database Provenance Fernando Chiriga- and Juliana Freire Polytechnic Ins4tute of NYU

2 Database Provenance Fine- grained provenance Focus: deriva4on of a piece of data in a dataset Provenance for tuples in a rela4onal database Propaga4on of provenance through queries Which parts of the database D contributed to the piece of data t according to query Q? [Cheney et al., FTDB 2009] Employee Department Name DeptName Salary And CBE $50,000 DeptName CBE Address 6 MetroTech = Chris CSE $60,000 CSE 2 MetroTech Robert CSE $55,000 Ryan ECE $40,000 Name DeptName Salary Address And CBE $50,000 6 MetroTech Chris CSE $60,000 2 MetroTech Robert CSE $55,000 2 MetroTech

3 Work1low Provenance Coarse- grained provenance A directed graph describing a computa4onal task Ver4ces = modules = processing steps + parameters Edges = connec4ons between output and input ports Execu4on order determined by flow of data from output to input ports Record of the en4re history of the deriva4on of the output [Davidson and Freire, SIGMOD 2008] head.120.vtk resampled-head-vis.png

4 Work1lows + Databases: Challenges s 12 3 input output1 output1 output2 output2 output Run 12 output (Run 1) output (Run 2)

5 Work1lows + Databases: Challenges input s 1 s 2 output Run 21 output (Run 1) output (Run 2)

6 Work1lows + Databases: Challenges Workflows are func4onal: there are no states or side effects Outputs are a func'on of the inputs O3 = vtkdatasetmapper(input=o2) O2 = vtkcontourfilter(value=57,input=o1) O1 = vtkstructuredreader(input=i1) Databases follow a stateful model The models seem to be incompa4ble: accesses to databases break the stateless scien4fic workflow model How to support reproducibility? O1 O2 I1 O3 [Davidson and Freire, SIGMOD 2008] How to properly describe the provenance of workflows that access (or modify) databases?

7 Contributions Model for integra4ng database and workflow provenance Keeps track of each state of a database leverage transac4on temporal databases [Jensen, 2000] Uniquely iden4fies a database state using transac'on 'me Supports reproducibility and provenance querying Reflects func4onality available in commercial rela4onal databases Oracle RDBMS: hdp:// op'ons/total- recall/overview/index.html DB2: hdp:// temporal/index.html?ca=drs- Proof- of- concept implementa4on VisTrails workflow system Oracle RDBMS

8 Background and De1initions Stateless workflows A module is represented by the func4on input m M f m (I m ) = < O m > output1 output2 f GetInfo (input) = < output1> f ReadDB (output1) = < output2 > f ProcessData (output2) = < output > output

9 Background and De1initions Stateful databases Use the model of transac4on temporal databases Adapt the backlog schema [Jensen, JIS 1993] A backlog keeps track of changes in a rela4on It is append- only Maintain a sequence of states S(R) of a rela4on R S(R) = {(S 1 (R),T 1 (R)),...,(S n (R),T n (R))} The difference (delta) between two states is computed as: Δ j,i (R) = S j (R) S i (R), i < j

10 Example of Backlog Relation Emp BR Emp K Name Job T Op U S 3 S 2 S 1 1 Robert Research Researcher Assistant 10 I fchiriga4 2 Claire Assistant Director 10 I fchiriga4 31 Robert Eric Administra4ve Researcher Director 15 D jfreire 1 Robert Research Assistant 15 I jfreire 3 Eric Administra4ve Director 20 I fchiriga4 S(Emp) = {(S 1 (Emp),10),(S 2 (Emp),15),(S 3 (Emp), 20)} Δ 3,1 (Emp) = 1 Robert Researcher 15 D jfreire 1 Robert Research Assistant 15 I jfreire 3 Eric Administra4ve Director 20 I fchiriga4

11 Integrating Work1low and Database Provenance Key idea: capture informa4on about the database states Previously, we had: Now, we have: f m (I m ) = < O m > f m (I m,[r,t b (R)]) = < O m,[r,t a (R)] > state before the execu4on state amer the execu4on

12 Example of Integrated Model Δ 2,1 (R) Δ 3,2 (R) Δ 4,3 (R) S 1 (R) S 2 (R) S 3 (R) S 4 (R) I 11 I 21 m 11 m 21 I 12 I 13 O 21 m 12 m 13 m 22 O 12 O 13 m 14 m 23 W1 O 14 W2 O 23 f m12 (I 12,[R,T 1 (R)]) = < O 12,[R,Tf 1 m21 (R)] (I 21 >,[R,T 4 (R)]) = < O 21,[R,T 4 (R)] > f m13 (I 13,[R,T 1 (R)]) = < O 13,[R,T 2 (R)] >

13 Reproducibility Database states are tracked during workflow execu4on To reproduce, retrieve original state using recorded transac4on 4me, if necessary Δ 2,1 (R) Δ 3,2 (R) Δ 4,3 (R) S 1 (R) S 2 (R) S 3 (R) S 4 (R) Δ 5,4 (R) S 5 (R) I 11 I 21 m 11 m 21 I 12 I 13 O 21 m 12 m 13 O 12 O 13 m 22 f m21 (I 21,[R,T 4 (R)]) = < O 21,[R,T 4 (R)] > m 14 m 23 W1 O 14 W2 O 23

14 Querying Provenance The model supports different queries Lineage of an output Lineage of a database state How- provenance see paper for details Lineage of a tuple see paper for details

15 Lineage of an Output Δ 2,1 (R) Δ 3,2 (R) Δ 4,3 (R) S 1 (R) S 2 (R) S 3 (R) S 4 (R) I 11 I 21 m 11 m 21 I 12 I 13 O 21 m 12 m 13 m 22 O 12 O 13 m 14 m 23 W1 O 14 W2 O 23 lineage(o 14 ) = {W 1, I 11,[R,T 1 (R)]}

16 Lineage of a State Δ 2,1 (R) Δ 3,2 (R) Δ 4,3 (R) S 1 (R) S 2 (R) S 3 (R) S 4 (R) I 11 I 21 m 11 m 21 I 12 I 13 O 21 m 12 m 13 m 22 O 12 O 13 m 14 m 23 W1 O 14 W2 O 23 f m13 (I 13,[R,T 1 (R)]) = < O 13,[R,T 2 (R)] > lineage(s 2 (R)) = {W 1, f m13, I 13,[R,T 1 (R)]}

17 How- Provenance The model has informa4on about states S b and S a The model can retrieve the difference Δ a,b Through this difference, we know the opera4ons performed by the module We know exactly how the module modified the rela4on

18 Lineage of a Tuple ComputeLineage(t) : lineage(t) = {} b _ times = select T, Op from R B where K = k t for each workflow instance W : for each module m M : for each T in b _ times : if (T in f m 's output) and (T not in f m 's input) : lineage(t). add([ f m,op])

19 Implementation VisTrails Workflow- based data explora4on system Provides support for provenance Oracle RDBMS Total Recall feature Tracks the changes by using a history table Similar to a backlog rela4on SELECT column_name FROM table_name AS OF time S SELECT column_name FROM table_name VERSIONS BETWEEN time_1 AND time_2 Δ

20 VisTrails Total Recall Package SELECT name FROM mountaineers WHERE AS OF country= Brazil " SCN( ) WHERE country= Brazil " T b = T a =

21 Related Work [Acar et al., TaPP 2010] Propose a common provenance graph model Use DFL to support database queries and workflow steps [Amsterdamer et al., VLDB 2011] Propose a framework to integrate fine- grained database- style provenance into workflows Modules are Pig La4n programs Transla4on to nested rela4onal calculus expressions

22 Conclusion Proposed model to integrate workflow and database provenance Leverages func4onality of transac4on temporal databases to capture database states and uniquely iden4fy them Supports reproducibility Supports a rich set of provenance queries that straddle workflows and databases Future work Querying - - efficiency and interfaces Support DDL opera4ons

23 IPAW 12 Interna'onal Provenance and Annota'on Workshop Acknowledgements Thanks to: VisTrails Team, Dieter Gawlick and Venkatesh Radhakrishnan (Oracle) and Jan Van den Bussche This work is par4ally supported by the Na-onal Science Founda-on Any ques4ons? THANK YOU!

FACILITATING REPRODUCIBILITY AFTER THE FACT. Fernando Chiriga- ViDA Visualiza;on and Data Analysis Lab NYU Polytechnic School of Engineering

FACILITATING REPRODUCIBILITY AFTER THE FACT. Fernando Chiriga- ViDA Visualiza;on and Data Analysis Lab NYU Polytechnic School of Engineering FACILITATING REPRODUCIBILITY AFTER THE FACT Fernando Chiriga- ViDA Visualiza;on and Data Analysis Lab NYU Polytechnic School of Engineering Reproducibility? What? Why? No need for mo;va;on here. Reproducibility

More information

Ibis: A Provenance Manager for Mul5 Layer Systems. Christopher Olston & Anish Das Sarma Yahoo! Research

Ibis: A Provenance Manager for Mul5 Layer Systems. Christopher Olston & Anish Das Sarma Yahoo! Research Ibis: A Provenance Manager for Mul5 Layer Systems Christopher Olston & Anish Das Sarma Yahoo! Research Mo5va5on: Many Sub Systems workflow manager e.g. Oozie inges5on dataflow programming framework e.g.

More information

Scien&fic Experiments as Workflows and Scripts. Vanessa Braganholo

Scien&fic Experiments as Workflows and Scripts. Vanessa Braganholo Scien&fic Experiments as Workflows and Scripts Vanessa Braganholo The experiment life cycle Composition Concep&on Reuse Analysis Query Discovery Visualiza&on Provenance Data Distribu&on Monitoring Execution

More information

Fix- point engine in Z3. Krystof Hoder Nikolaj Bjorner Leonardo de Moura

Fix- point engine in Z3. Krystof Hoder Nikolaj Bjorner Leonardo de Moura μz Fix- point engine in Z3 Krystof Hoder Nikolaj Bjorner Leonardo de Moura Mo?va?on Horn EPR applica?ons (Datalog) Points- to analysis Security analysis Deduc?ve data- bases and knowledge bases (Yago)

More information

SQL- Updates, Asser0ons and Views

SQL- Updates, Asser0ons and Views SQL- Updates, Asser0ons and Views Data Defini0on, Constraints, and Schema Changes Used to CREATE, DROP, and ALTER the descrip0ons of the tables (rela0ons) of a database CREATE TABLE In SQL2, can use the

More information

CompSci Understanding Data: Theory and Applica>ons

CompSci Understanding Data: Theory and Applica>ons CompSci 590.6 Understanding Data: Theory and Applica>ons Lecture 8 Why- Not Queries (Query- based) Instructor: Sudeepa Roy Email: sudeepa@cs.duke.edu 1 Today s Paper(s) Why Not? Chapman- Jagadish SIGMOD

More information

Bioinforma)cs Resources - NoSQL -

Bioinforma)cs Resources - NoSQL - Bioinforma)cs Resources - NoSQL - Lecture & Exercises Prof. B. Rost, Dr. L. Richter, J. Reeb Ins)tut für Informa)k I12 Short SQL Recap schema typed data tables defined layout space consump)on is computable

More information

Distributed Data Management Summer Semester 2013 TU Kaiserslautern

Distributed Data Management Summer Semester 2013 TU Kaiserslautern Distributed Data Management Summer Semester 2013 TU Kaiserslautern Dr.- Ing. Sebas4an Michel smichel@mmci.uni- saarland.de Distributed Data Management, SoSe 2013, S. Michel 1 Lecture 4 PIG/HIVE Distributed

More information

A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System

A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System Ilkay Al(ntas and Daniel Crawl San Diego Supercomputer Center UC San Diego Jianwu Wang UMBC WorDS.sdsc.edu Computa3onal

More information

Robust Identification of Fuzzy Duplicates

Robust Identification of Fuzzy Duplicates Robust Identification of Fuzzy Duplicates ì Authors: Surajit Chaudhuri (Microso3 Research) Venkatesh Gan; (Microso3 Research) Rajeev Motwani (Stanford University) Publica;on: 21 st Interna;onal Conference

More information

biokepler: A Comprehensive Bioinforma2cs Scien2fic Workflow Module for Distributed Analysis of Large- Scale Biological Data

biokepler: A Comprehensive Bioinforma2cs Scien2fic Workflow Module for Distributed Analysis of Large- Scale Biological Data biokepler: A Comprehensive Bioinforma2cs Scien2fic Workflow Module for Distributed Analysis of Large- Scale Biological Data Ilkay Al/ntas 1, Jianwu Wang 2, Daniel Crawl 1, Shweta Purawat 1 1 San Diego

More information

Access Control for Enterprise Apps. Dominic Duggan Stevens Ins8tute of Technology Based on material by Lars Olson and Ross Anderson

Access Control for Enterprise Apps. Dominic Duggan Stevens Ins8tute of Technology Based on material by Lars Olson and Ross Anderson Access Control for Enterprise Apps Dominic Duggan Stevens Ins8tute of Technology Based on material by Lars Olson and Ross Anderson SQL ACCESS CONTROL 2 App vs Database Security Mul8ple users for Apps (A)

More information

Database Design CENG 351

Database Design CENG 351 Database Design Database Design Process Requirements analysis What data, what applica;ons, what most frequent opera;ons, Conceptual database design High level descrip;on of the data and the constraint

More information

Abstract Provenance Graphs: Anticipating and Exploiting Schema-Level Data Provenance

Abstract Provenance Graphs: Anticipating and Exploiting Schema-Level Data Provenance Abstract Provenance Graphs: Anticipating and Exploiting Schema-Level Data Provenance Daniel Zinn and Bertram Ludäscher fdzinn,ludaeschg@ucdavis.edu Abstract. Provenance graphs capture flow and dependency

More information

Introduc3on to Data Management

Introduc3on to Data Management ICS 101 Fall 2014 Introduc3on to Data Management Assoc. Prof. Lipyeow Lim Informa3on & Computer Science Department University of Hawaii at Manoa Lipyeow Lim - - University of Hawaii at Manoa 1 The Data

More information

Provenance Manager: PROV-man an Implementation of the PROV Standard. Ammar Benabadelkader Provenance Taskforce Budapest, 24 March 2014

Provenance Manager: PROV-man an Implementation of the PROV Standard. Ammar Benabadelkader Provenance Taskforce Budapest, 24 March 2014 Provenance Manager: PROV-man an Implementation of the PROV Standard Ammar Benabadelkader Provenance Taskforce Budapest, 24 March 2014 Outlines Motivation State-of-the-art PROV-man The Approach, the data

More information

Cloud Data Management System (CDMS)

Cloud Data Management System (CDMS) Cloud Management System (CMS) Wiqar Chaudry Solu9ons Engineer Senior Advisor CMS Overview he OpenStack cloud data management system features a canonical data modeling framework designed to broker context

More information

Vulnerability Analysis (III): Sta8c Analysis

Vulnerability Analysis (III): Sta8c Analysis Computer Security Course. Vulnerability Analysis (III): Sta8c Analysis Slide credit: Vijay D Silva 1 Efficiency of Symbolic Execu8on 2 A Sta8c Analysis Analogy 3 Syntac8c Analysis 4 Seman8cs- Based Analysis

More information

h7ps://bit.ly/citustutorial

h7ps://bit.ly/citustutorial Before We Start Setup a Citus Cloud account for the exercises: h7ps://bit.ly/citustutorial Designing a Mul

More information

ECS 165B: Database System Implementa6on Lecture 14

ECS 165B: Database System Implementa6on Lecture 14 ECS 165B: Database System Implementa6on Lecture 14 UC Davis April 28, 2010 Acknowledgements: por6ons based on slides by Raghu Ramakrishnan and Johannes Gehrke, as well as slides by Zack Ives. Class Agenda

More information

Toad for Oracle s AppDesigner

Toad for Oracle s AppDesigner Toad for Oracle s AppDesigner You can use AppDesigner to automate and control processes you perform regularly. Connection information, window settings, and queries can be saved, shared, scheduled and run

More information

Objec0ves. Gain understanding of what IDA Pro is and what it can do. Expose students to the tool GUI

Objec0ves. Gain understanding of what IDA Pro is and what it can do. Expose students to the tool GUI Intro to IDA Pro 31/15 Objec0ves Gain understanding of what IDA Pro is and what it can do Expose students to the tool GUI Discuss some of the important func

More information

RAD, Rules, and Compatibility: What's Coming in Kuali Rice 2.0

RAD, Rules, and Compatibility: What's Coming in Kuali Rice 2.0 software development simplified RAD, Rules, and Compatibility: What's Coming in Kuali Rice 2.0 Eric Westfall - Indiana University JASIG 2011 For those who don t know Kuali Rice consists of mul8ple sub-

More information

Fluxo. Improving the Responsiveness of Internet Services with Automa7c Cache Placement

Fluxo. Improving the Responsiveness of Internet Services with Automa7c Cache Placement Fluxo Improving the Responsiveness of Internet Services with Automac Cache Placement Alexander Rasmussen UCSD (Presenng) Emre Kiciman MSR Redmond Benjamin Livshits MSR Redmond Madanlal Musuvathi MSR Redmond

More information

1/5/11. Announcements. Topics Lab hour/discussion sec2on: Introduc4on (SDM, e Science, scien2fic workflows) Databases. Assignment 1

1/5/11. Announcements. Topics Lab hour/discussion sec2on: Introduc4on (SDM, e Science, scien2fic workflows) Databases. Assignment 1 1/5/11 Announcements Topics Lab hour/discussion sec2on: Introduc4on (SDM, e Science, scien2fic workflows) Databases next Thursday 2 3pm, 93 Hutchison Assignment 1 Basic concepts of rela2onal databases (crash

More information

Tiresias. The Database Oracle for How- To Queries. Alexandra Meliou Dan Suciu University of Massachuse9s Amherst. University of Washington

Tiresias. The Database Oracle for How- To Queries. Alexandra Meliou Dan Suciu University of Massachuse9s Amherst. University of Washington Tiresias The Database Oracle for How- To Queries Alexandra Meliou Dan Suciu University of Massachuse9s Amherst University of Washington University of Washington Database Group Hypothetical (What- if) Queries

More information

Generalizing Map- Reduce

Generalizing Map- Reduce Generalizing Map- Reduce 1 Example: A Map- Reduce Graph map reduce map... reduce reduce map 2 Map- reduce is not a solu;on to every problem, not even every problem that profitably can use many compute

More information

Agenda. About ECRIN Overview of ECRIN Ac4vi4es Increasing value

Agenda. About ECRIN Overview of ECRIN Ac4vi4es Increasing value Agenda About ECRIN Overview of ECRIN Ac4vi4es Increasing value ECRIN Overview A non- profit organisa4on with the legal status of European Research Infrastructure Consor4um (ERIC) Mission: support the conduct

More information

Query and Join Op/miza/on 11/5

Query and Join Op/miza/on 11/5 Query and Join Op/miza/on 11/5 Overview Recap of Merge Join Op/miza/on Logical Op/miza/on Histograms (How Es/mates Work. Big problem!) Physical Op/mizer (if we have /me) Recap on Merge Key (Simple) Idea

More information

Intelligent Systems Knowledge Representa6on

Intelligent Systems Knowledge Representa6on Intelligent Systems Knowledge Representa6on SCJ3553 Ar6ficial Intelligence Faculty of Computer Science and Informa6on Systems Universi6 Teknologi Malaysia Outline Introduc6on Seman6c Network Frame Conceptual

More information

IRS Use Case & Requirements

IRS Use Case & Requirements IRS Use Case & Requirements Shane Amante Level 3 Communica:ons, Inc. (Speaking on behalf of several Use Case and Requirement I- D s co- authors) IRS Use Case & Reqmt s DraHs Use Cases dra$- amante- irs-

More information

Overview of IPTV Forum Japan s Hybridcast Technical SpecificaAon

Overview of IPTV Forum Japan s Hybridcast Technical SpecificaAon The fourth Web and TV Workshop Overview of IPTV Forum Japan s Hybridcast Technical SpecificaAon Kinji Matsumura, NHK 1 What is Technology pladorm for broadcast and broadband hybrid service that uses HTML5

More information

Data Flow Analysis. Suman Jana. Adopted From U Penn CIS 570: Modern Programming Language Implementa=on (Autumn 2006)

Data Flow Analysis. Suman Jana. Adopted From U Penn CIS 570: Modern Programming Language Implementa=on (Autumn 2006) Data Flow Analysis Suman Jana Adopted From U Penn CIS 570: Modern Programming Language Implementa=on (Autumn 2006) Data flow analysis Derives informa=on about the dynamic behavior of a program by only

More information

Garlik are the online personal iden2ty experts Set up to give individuals and their families real power over the use of their informa2on in the

Garlik are the online personal iden2ty experts Set up to give individuals and their families real power over the use of their informa2on in the 1 2 Garlik are the online personal iden2ty experts Set up to give individuals and their families real power over the use of their informa2on in the digital world Garlik have assembled a world class Leadership

More information

Computer Programming-I. Developed by: Strawberry

Computer Programming-I. Developed by: Strawberry Computer Programming-I Objec=ve of CP-I The course will enable the students to understand the basic concepts of structured programming. What is programming? Wri=ng a set of instruc=ons that computer use

More information

Asynchronous and Fault-Tolerant Recursive Datalog Evalua9on in Shared-Nothing Engines

Asynchronous and Fault-Tolerant Recursive Datalog Evalua9on in Shared-Nothing Engines Asynchronous and Fault-Tolerant Recursive Datalog Evalua9on in Shared-Nothing Engines Jingjing Wang, Magdalena Balazinska, Daniel Halperin University of Washington Modern Analy>cs Requires Itera>on Graph

More information

Instructor: Amol Deshpande

Instructor: Amol Deshpande Instructor: Amol Deshpande amol@cs.umd.edu } Mo7va7on: Why study databases? } Background: 424 Summary } Administrivia Workload etc. } No laptop use allowed in the class!! 1 } There is a *HUGE* amount of

More information

Macro Assembler. Defini3on from h6p://www.computeruser.com

Macro Assembler. Defini3on from h6p://www.computeruser.com The Macro Assembler Macro Assembler Defini3on from h6p://www.computeruser.com A program that translates assembly language instruc3ons into machine code and which the programmer can use to define macro

More information

Review. Objec,ves. Example Students Table. Database Overview 3/8/17. PostgreSQL DB Elas,csearch. Databases

Review. Objec,ves. Example Students Table. Database Overview 3/8/17. PostgreSQL DB Elas,csearch. Databases Objec,ves PostgreSQL DB Elas,csearch Review Databases Ø What language do we use to query databases? March 8, 2017 Sprenkle - CSCI397 1 March 8, 2017 Sprenkle - CSCI397 2 Database Overview Store data in

More information

Simplified Computa/on and Generaliza/on of the Refined Process Structure Tree

Simplified Computa/on and Generaliza/on of the Refined Process Structure Tree The 7 th Interna/onal Workshop on Web Services and Formal Methods (WS- FM 2010) Simplified Computa/on and Generaliza/on of the Refined Process Structure Tree Artem Polyvyanyy, Jussi Vanhatalo, and Hagen

More information

CS 4604: Introduc0on to Database Management Systems. B. Aditya Prakash Lecture #4: SQL---Part 2

CS 4604: Introduc0on to Database Management Systems. B. Aditya Prakash Lecture #4: SQL---Part 2 CS 4604: Introduc0on to Database Management Systems B. Aditya Prakash Lecture #4: SQL---Part 2 Overview - detailed - SQL DML other parts: views modifications joins DDL constraints Prakash 2016 VT CS 4604

More information

Decision Support Systems

Decision Support Systems Decision Support Systems 2011/2012 Week 3. Lecture 6 Previous Class Dimensions & Measures Dimensions: Item Time Loca0on Measures: Quan0ty Sales TransID ItemName ItemID Date Store Qty T0001 Computer I23

More information

RISC-V, Rocket, and RoCC Spring 2017 James Mar2n

RISC-V, Rocket, and RoCC Spring 2017 James Mar2n RISC-V, Rocket, and RoCC Spring 2017 James Mar2n What s new in Lab 2: In lab 1, you built a SHA3 unit that operates in isola2on We would like Sha3Accel to act as an accelerator for a processor Lab 2 introduces

More information

Sql 2008 Copy Tables Structure And Database To Another

Sql 2008 Copy Tables Structure And Database To Another Sql 2008 Copy Tables Structure And Database To Another Copy NAV Database Structure to another Database along with Data in SQL @tablevar table(name varchar(300)) declare @columntablevar table(column_name

More information

Course Title: Introduction to Database Management System Course Code: CSIT116 Course Level: UG Course Credits:04 L T P/ S SW/F W

Course Title: Introduction to Database Management System Course Code: CSIT116 Course Level: UG Course Credits:04 L T P/ S SW/F W Course Title: Introduction to Database Management System Course Code: CSIT116 Course Level: UG Course Credits:04 Course Objectives: The objectives of this course is to: To expose the students to the fundamentals

More information

EAS- SEC: Framework for Securing Enterprise Business Applica;ons

EAS- SEC: Framework for Securing Enterprise Business Applica;ons Invest in security to secure investments EAS- SEC: Framework for Securing Enterprise Business Applica;ons Alexander Polyakov CTO ERPScan About ERPScan The only 360- degree SAP Security solu8on - ERPScan

More information

Keyword search in databases: the power of RDBMS

Keyword search in databases: the power of RDBMS Keyword search in databases: the power of RDBMS 1 Introduc

More information

Introduc.on to Databases

Introduc.on to Databases Introduc.on to Databases G6921 and G6931 Web Technologies Dr. Séamus Lawless Housekeeping Course Structure 1) Intro to the Web 2) HTML 3) HTML and CSS Essay Informa.on Session 4) Intro to Databases 5)

More information

School of Computing, Engineering and Information Sciences University of Northumbria. Set Operations

School of Computing, Engineering and Information Sciences University of Northumbria. Set Operations Set Operations Aim: To understand how to do the equivalent of the UNION, DIFFERENCE and INTERSECT set operations in SQL. Outline of Session: Do some example SQL queries to learn to differentiate between

More information

Approach starts with GEN and KILL sets

Approach starts with GEN and KILL sets b -Advanced-DFA Review of Data Flow Analysis State Propaga+on Computer Science 5-6 Fall Prof. L. J. Osterweil Material adapted from slides originally prepared by Prof. L. A. Clarke A technique for determining

More information

Detec%ng the Temporal Context of Queries. Oliver Kennedy, Ying Yang, Jan Chomicki, Ronny Fehling, Zhen Hua Liu, and Dieter Gawlick 09/01/2014

Detec%ng the Temporal Context of Queries. Oliver Kennedy, Ying Yang, Jan Chomicki, Ronny Fehling, Zhen Hua Liu, and Dieter Gawlick 09/01/2014 Detec%ng the Temporal Context of Queries Oliver Kennedy, Ying Yang, Jan Chomicki, Ronny Fehling, Zhen Hua Liu, and Dieter Gawlick 09/01/2014 Outline Mo.va.on Contextual Analysis Prac.cal Temporal Dependency

More information

DD2451 Parallel and Distributed Computing --- FDD3008 Distributed Algorithms

DD2451 Parallel and Distributed Computing --- FDD3008 Distributed Algorithms DD2451 Parallel and Distributed Computing --- FDD3008 Distributed Algorithms Lecture 8 Leader Election Mads Dam Autumn/Winter 2011 Previously... Consensus for message passing concurrency Crash failures,

More information

Proofs about Programs

Proofs about Programs Proofs about Programs Program Verification (Rosen, Sections 5.5) TOPICS Program Correctness Preconditions & Postconditions Program Verification Assignment Statements Conditional Statements Loops Composition

More information

Chapter 2: Intro to Relational Model

Chapter 2: Intro to Relational Model Chapter 2: Intro to Relational Model Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Example of a Relation attributes (or columns) tuples (or rows) 2.2 Attribute Types The

More information

Provenance Management in Databases under Schema Evolution

Provenance Management in Databases under Schema Evolution Provenance Management in Databases under Schema Evolution Shi Gao, Carlo Zaniolo Department of Computer Science University of California, Los Angeles 1 Provenance under Schema Evolution Modern information

More information

Ar#ficial Intelligence

Ar#ficial Intelligence Ar#ficial Intelligence Advanced Searching Prof Alexiei Dingli Gene#c Algorithms Charles Darwin Genetic Algorithms are good at taking large, potentially huge search spaces and navigating them, looking for

More information

DATA AND SCHEMA MODIFICATIONS CHAPTERS 4,5 (6/E) CHAPTER 8 (5/E)

DATA AND SCHEMA MODIFICATIONS CHAPTERS 4,5 (6/E) CHAPTER 8 (5/E) 1 DATA AND SCHEMA MODIFICATIONS CHAPTERS 4,5 (6/E) CHAPTER 8 (5/E) 2 LECTURE OUTLINE Updating Databases Using SQL Specifying Constraints as Assertions and Actions as Triggers Schema Change Statements in

More information

Sausalito: An Applica/on Server for RESTful Services in the Cloud. Ma;hias Brantner & Donald Kossmann 28msec Inc. h;p://sausalito.28msec.

Sausalito: An Applica/on Server for RESTful Services in the Cloud. Ma;hias Brantner & Donald Kossmann 28msec Inc. h;p://sausalito.28msec. Sausalito: An Applica/on Server for RESTful Services in the Cloud Ma;hias Brantner & Donald Kossmann 28msec Inc. h;p://sausalito.28msec.com/ Conclusion Integrate DBMS, Applica3on Server, and Web Server

More information

Performance and Scalability: Apriori Implementa6on

Performance and Scalability: Apriori Implementa6on Performance and Scalability: Apriori Implementa6on Apriori R. Agrawal and R. Srikant. Fast algorithms for mining associa6on rules. VLDB, 487 499, 1994 Reducing Number of Comparisons Candidate coun6ng:

More information

CSc 120. Introduc/on to Computer Programming II. 02: Problem Decomposi1on and Program Development. Adapted from slides by Dr.

CSc 120. Introduc/on to Computer Programming II. 02: Problem Decomposi1on and Program Development. Adapted from slides by Dr. CSc 120 Introduc/on to Computer Programming II Adapted from slides by Dr. Saumya Debray 02: Problem Decomposi1on and Program Development A common student lament "I have this big programming assignment.

More information

Rela+onal Algebra. Rela+onal Query Languages. CISC437/637, Lecture #6 Ben Cartere?e

Rela+onal Algebra. Rela+onal Query Languages. CISC437/637, Lecture #6 Ben Cartere?e Rela+onal Algebra CISC437/637, Lecture #6 Ben Cartere?e Copyright Ben Cartere?e 1 Rela+onal Query Languages A query language allows manipula+on and retrieval of data from a database The rela+onal model

More information

11/12/11. Objec&ves Overview. Databases, Data, and Informa&on. Objec&ves Overview. Databases, Data, and Informa&on. Databases, Data, and Informa&on

11/12/11. Objec&ves Overview. Databases, Data, and Informa&on. Objec&ves Overview. Databases, Data, and Informa&on. Databases, Data, and Informa&on Objec&ves Overview Define the term,, and explain how a interacts with and informa:on Define the term, integrity, and describe the quali:es of valuable informa:on Discuss the terms character, field, record,

More information

Let's play a public String ping() { counter++; return "SF hits ="+counter;

Let's play a public String ping() { counter++; return SF hits =+counter; Stateful beans Let's play a bit package session; import javax.ejb.stateful; @Stateful public class StatefulSessionBean implements StatefulSessionBeanRemote { int counter=0; @Override public String ping()

More information

Handle Server Performance Tuning

Handle Server Performance Tuning Handle Server Performance Tuning Jason Petrone Corporation for National Research Initiatives Full Screen Handle System Workshop 2 p. 1/14 Reducing Trips to Global Handle System

More information

Thinking Induc,vely. COS 326 David Walker Princeton University

Thinking Induc,vely. COS 326 David Walker Princeton University Thinking Induc,vely COS 326 David Walker Princeton University slides copyright 2017 David Walker permission granted to reuse these slides for non-commercial educa,onal purposes Administra,on 2 Assignment

More information

Informa)on Retrieval and Map- Reduce Implementa)ons. Mohammad Amir Sharif PhD Student Center for Advanced Computer Studies

Informa)on Retrieval and Map- Reduce Implementa)ons. Mohammad Amir Sharif PhD Student Center for Advanced Computer Studies Informa)on Retrieval and Map- Reduce Implementa)ons Mohammad Amir Sharif PhD Student Center for Advanced Computer Studies mas4108@louisiana.edu Map-Reduce: Why? Need to process 100TB datasets On 1 node:

More information

Lecture 1 Introduc-on

Lecture 1 Introduc-on Lecture 1 Introduc-on What would you get out of this course? Structure of a Compiler Op9miza9on Example 15-745: Introduc9on 1 What Do Compilers Do? 1. Translate one language into another e.g., convert

More information

Apache Storm. A framework for Parallel Data Stream Processing

Apache Storm. A framework for Parallel Data Stream Processing Apache Storm A framework for Parallel Data Stream Processing Storm Storm is a distributed real- ;me computa;on pla

More information

CS 465 Final Review. Fall 2017 Prof. Daniel Menasce

CS 465 Final Review. Fall 2017 Prof. Daniel Menasce CS 465 Final Review Fall 2017 Prof. Daniel Menasce Ques@ons What are the types of hazards in a datapath and how each of them can be mi@gated? State and explain some of the methods used to deal with branch

More information

The DCGS- A ontology suite Standard opera8ng procedures and ontology quality assurance Annota8on vs. Explica8on How the DCGS- A ontologies are being

The DCGS- A ontology suite Standard opera8ng procedures and ontology quality assurance Annota8on vs. Explica8on How the DCGS- A ontologies are being Ron Rudnicki November 12, 2013 The DCGS- A ontology suite Standard opera8ng procedures and ontology quality assurance Annota8on vs. Explica8on How the DCGS- A ontologies are being used for the explica8on

More information

CMPS 20 Game Design Experience

CMPS 20 Game Design Experience CMPS 20 Game Design Experience Winter 2013 Week 4/5: Design Patterns Computational Cinematics Studio // Center for Games and Playable Media http://games.soe.ucsc.edu/ccs // http://games.soe.ucsc.edu Arnav

More information

Computer Architecture. CSE 1019Y Week 16. Introduc>on to MARIE

Computer Architecture. CSE 1019Y Week 16. Introduc>on to MARIE Computer Architecture CSE 1019Y Week 16 Introduc>on to MARIE MARIE Simple model computer used in this class MARIE Machine Architecture that is Really Intui>ve and Easy Designed for educa>on only While

More information

SQL: A COMMERCIAL DATABASE LANGUAGE. Data Change Statements,

SQL: A COMMERCIAL DATABASE LANGUAGE. Data Change Statements, SQL: A COMMERCIAL DATABASE LANGUAGE Data Change Statements, Outline 1. Introduction 2. Data Definition, Basic Constraints, and Schema Changes 3. Basic Queries 4. More complex Queries 5. Aggregate Functions

More information

To understand the concept of candidate and primary keys and their application in table creation.

To understand the concept of candidate and primary keys and their application in table creation. CM0719: Database Modelling Seminar 5 (b): Candidate and Primary Keys Exercise Aims: To understand the concept of candidate and primary keys and their application in table creation. Outline of Activity:

More information

Spa$al Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University

Spa$al Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University Spa$al Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University Class Outlines Spatial Point Pattern Regional Data (Areal Data) Continuous Spatial Data (Geostatistical

More information

HSC QUEUE MODE OPERATION PLAN

HSC QUEUE MODE OPERATION PLAN HSC QUEUE MODE OPERATION PLAN Tae- Soo Pyo & Queue- mode team (Subaru Telescope) 2015. 1.13 Working Group Tae- Soo Pyo Ikuru Iwata Masatoshi Imanishi [ 9 members] Eric Jeschke Queue Mode Team Sherry Yeh

More information

Heterogeneous compute in the GATK

Heterogeneous compute in the GATK Heterogeneous compute in the GATK Mauricio Carneiro GSA Broad Ins

More information

UPCRC. Illiac. Gigascale System Research Center. Petascale computing. Cloud Computing Testbed (CCT) 2

UPCRC. Illiac. Gigascale System Research Center. Petascale computing. Cloud Computing Testbed (CCT) 2 Illiac UPCRC Petascale computing Gigascale System Research Center Cloud Computing Testbed (CCT) 2 www.parallel.illinois.edu Mul2 Core: All Computers Are Now Parallel We con'nue to have more transistors

More information

Stream and Complex Event Processing Discovering Exis7ng Systems:c- sparql

Stream and Complex Event Processing Discovering Exis7ng Systems:c- sparql Stream and Complex Event Processing Discovering Exis7ng Systems:c- sparql G. Cugola E. Della Valle A. Margara Politecnico di Milano cugola@elet.polimi.it dellavalle@elet.polimi.it Vrije Universiteit Amsterdam

More information

Network Access Transla0on - NAT

Network Access Transla0on - NAT Network Access Transla0on - NAT Foreword Those slides have been done by gathering a lot of informa0on on the net Ø Cisco tutorial Ø Lectures from other ins0tu0ons University of Princeton University of

More information

Self-Demo Guide. Oracle ilearning and HTML DB

Self-Demo Guide. Oracle ilearning and HTML DB 2003-2004 Self-Demo Guide Oracle ilearning and HTML DB The Oracle Academy allows a school to offer advanced Database and Java programming courses through the use of Oracle s infrastructure. The school

More information

CS 4604: Introduc0on to Database Management Systems

CS 4604: Introduc0on to Database Management Systems CS 4604: Introduc0on to Database Management Systems B. Aditya Prakash Lecture #1: Introduc/on Based on material by Profs. T. M. Murali and Christos Faloutsos Course Informa0on Instructor B. Aditya Prakash,

More information

Towards Automatic Optimization of MapReduce Programs (Position Paper) Shivnath Babu Duke University

Towards Automatic Optimization of MapReduce Programs (Position Paper) Shivnath Babu Duke University Towards Automatic Optimization of MapReduce Programs (Position Paper) Shivnath Babu Duke University Roadmap Call to action to improve automatic optimization techniques in MapReduce frameworks Challenges

More information

CS 4604: Introduc0on to Database Management Systems. B. Aditya Prakash Lecture #1: Introduc/on

CS 4604: Introduc0on to Database Management Systems. B. Aditya Prakash Lecture #1: Introduc/on CS 4604: Introduc0on to Database Management Systems B. Aditya Prakash Lecture #1: Introduc/on Course Informa0on Instructor B. Aditya Prakash, Torg 3160 F, badityap@cs.vt.edu Office Hours: 2:30-3:30pm Mondays

More information

ECSE 425 Lecture 25: Mul1- threading

ECSE 425 Lecture 25: Mul1- threading ECSE 425 Lecture 25: Mul1- threading H&P Chapter 3 Last Time Theore1cal and prac1cal limits of ILP Instruc1on window Branch predic1on Register renaming 2 Today Mul1- threading Chapter 3.5 Summary of ILP:

More information

CS 61C: Great Ideas in Computer Architecture. MIPS Instruc,on Representa,on II. Dan Garcia

CS 61C: Great Ideas in Computer Architecture. MIPS Instruc,on Representa,on II. Dan Garcia CS 61C: Great Ideas in Computer Architecture MIPS Instruc,on Representa,on II Dan Garcia 1 Review of Last Lecture Simplifying MIPS: Define instruc?ons to be same size as data word (one word) so that they

More information

Gradient Descent. Michail Michailidis & Patrick Maiden

Gradient Descent. Michail Michailidis & Patrick Maiden Gradient Descent Michail Michailidis & Patrick Maiden Outline Mo4va4on Gradient Descent Algorithm Issues & Alterna4ves Stochas4c Gradient Descent Parallel Gradient Descent HOGWILD! Mo4va4on It is good

More information

MANAGING DATA(BASES) USING SQL (NON-PROCEDURAL SQL, X401.9)

MANAGING DATA(BASES) USING SQL (NON-PROCEDURAL SQL, X401.9) Technology & Information Management Instructor: Michael Kremer, Ph.D. Class 4 Professional Program: Data Administration and Management MANAGING DATA(BASES) USING SQL (NON-PROCEDURAL SQL, X401.9) AGENDA

More information

Display and usage of Interna3onalized Registra3on Data. Dave Piscitello

Display and usage of Interna3onalized Registra3on Data. Dave Piscitello Display and usage of Interna3onalized Registra3on Data Dave Piscitello 1 SAC 037: Display and usage of Interna3onalized Registra3on Data Reference document can be found at h=p://www.icann.org/commi=ees/security/

More information

SQL Structured Query Language Introduction

SQL Structured Query Language Introduction SQL Structured Query Language Introduction Rifat Shahriyar Dept of CSE, BUET Tables In relational database systems data are represented using tables (relations). A query issued against the database also

More information

Application of Named Graphs Towards Custom Provenance Views

Application of Named Graphs Towards Custom Provenance Views Application of Named Graphs Towards Custom Provenance Views Tara Gibson, Karen Schuchardt, Eric Stephan Pacific Northwest National Laboratory Abstract Provenance capture as applied to execution oriented

More information

Provenance-aware Faceted Search in Drupal

Provenance-aware Faceted Search in Drupal Provenance-aware Faceted Search in Drupal Zhenning Shangguan, Jinguang Zheng, and Deborah L. McGuinness Tetherless World Constellation, Computer Science Department, Rensselaer Polytechnic Institute, 110

More information

Founda'ons of So,ware Engineering. Process: Agile Prac.ces Claire Le Goues

Founda'ons of So,ware Engineering. Process: Agile Prac.ces Claire Le Goues Founda'ons of So,ware Engineering Process: Agile Prac.ces Claire Le Goues 1 Learning goals Define agile as both a set of itera.ve process prac.ces and a business approach for aligning customer needs with

More information

CS6200 Informa.on Retrieval. David Smith College of Computer and Informa.on Science Northeastern University

CS6200 Informa.on Retrieval. David Smith College of Computer and Informa.on Science Northeastern University CS6200 Informa.on Retrieval David Smith College of Computer and Informa.on Science Northeastern University Indexing Process Indexes Indexes are data structures designed to make search faster Text search

More information

Analysis and Tes,ng of PLEXIL Plans

Analysis and Tes,ng of PLEXIL Plans Analysis and Tes,ng of PLEXIL Plans Jason Biatek, Michael W. Whalen, Sanjai Rayadurgam, Mats P.E. Heimdahl, Michael Lowry 04-06- 14 FormaliSE - Mike Whalen 1 Autonomy at NASA 04-06- 14 FormaliSE - Mike

More information

CS6312 DATABASE MANAGEMENT SYSTEMS LABORATORY L T P C

CS6312 DATABASE MANAGEMENT SYSTEMS LABORATORY L T P C CS6312 DATABASE MANAGEMENT SYSTEMS LABORATORY L T P C 0 0 3 2 LIST OF EXPERIMENTS: 1. Creation of a database and writing SQL queries to retrieve information from the database. 2. Performing Insertion,

More information

W1005 Intro to CS and Programming in MATLAB. Brief History of Compu?ng. Fall 2014 Instructor: Ilia Vovsha. hip://www.cs.columbia.

W1005 Intro to CS and Programming in MATLAB. Brief History of Compu?ng. Fall 2014 Instructor: Ilia Vovsha. hip://www.cs.columbia. W1005 Intro to CS and Programming in MATLAB Brief History of Compu?ng Fall 2014 Instructor: Ilia Vovsha hip://www.cs.columbia.edu/~vovsha/w1005 Computer Philosophy Computer is a (electronic digital) device

More information

Oracle Database Auditing

Oracle Database Auditing By Craig Moir craig@mydba.co.za http://www.mydba.co.za August 2012 Version 1 WHY AUDIT? Allows organizations to enforce the trust-but-verify security principle. Satisfying compliance regulations. Enables

More information