Towards Integrating Work1low and Database Provenance
|
|
- Derick Reginald Barnett
- 5 years ago
- Views:
Transcription
1 IPAW 12 Interna'onal Provenance and Annota'on Workshop Towards Integrating Work1low and Database Provenance Fernando Chiriga- and Juliana Freire Polytechnic Ins4tute of NYU
2 Database Provenance Fine- grained provenance Focus: deriva4on of a piece of data in a dataset Provenance for tuples in a rela4onal database Propaga4on of provenance through queries Which parts of the database D contributed to the piece of data t according to query Q? [Cheney et al., FTDB 2009] Employee Department Name DeptName Salary And CBE $50,000 DeptName CBE Address 6 MetroTech = Chris CSE $60,000 CSE 2 MetroTech Robert CSE $55,000 Ryan ECE $40,000 Name DeptName Salary Address And CBE $50,000 6 MetroTech Chris CSE $60,000 2 MetroTech Robert CSE $55,000 2 MetroTech
3 Work1low Provenance Coarse- grained provenance A directed graph describing a computa4onal task Ver4ces = modules = processing steps + parameters Edges = connec4ons between output and input ports Execu4on order determined by flow of data from output to input ports Record of the en4re history of the deriva4on of the output [Davidson and Freire, SIGMOD 2008] head.120.vtk resampled-head-vis.png
4 Work1lows + Databases: Challenges s 12 3 input output1 output1 output2 output2 output Run 12 output (Run 1) output (Run 2)
5 Work1lows + Databases: Challenges input s 1 s 2 output Run 21 output (Run 1) output (Run 2)
6 Work1lows + Databases: Challenges Workflows are func4onal: there are no states or side effects Outputs are a func'on of the inputs O3 = vtkdatasetmapper(input=o2) O2 = vtkcontourfilter(value=57,input=o1) O1 = vtkstructuredreader(input=i1) Databases follow a stateful model The models seem to be incompa4ble: accesses to databases break the stateless scien4fic workflow model How to support reproducibility? O1 O2 I1 O3 [Davidson and Freire, SIGMOD 2008] How to properly describe the provenance of workflows that access (or modify) databases?
7 Contributions Model for integra4ng database and workflow provenance Keeps track of each state of a database leverage transac4on temporal databases [Jensen, 2000] Uniquely iden4fies a database state using transac'on 'me Supports reproducibility and provenance querying Reflects func4onality available in commercial rela4onal databases Oracle RDBMS: hdp:// op'ons/total- recall/overview/index.html DB2: hdp:// temporal/index.html?ca=drs- Proof- of- concept implementa4on VisTrails workflow system Oracle RDBMS
8 Background and De1initions Stateless workflows A module is represented by the func4on input m M f m (I m ) = < O m > output1 output2 f GetInfo (input) = < output1> f ReadDB (output1) = < output2 > f ProcessData (output2) = < output > output
9 Background and De1initions Stateful databases Use the model of transac4on temporal databases Adapt the backlog schema [Jensen, JIS 1993] A backlog keeps track of changes in a rela4on It is append- only Maintain a sequence of states S(R) of a rela4on R S(R) = {(S 1 (R),T 1 (R)),...,(S n (R),T n (R))} The difference (delta) between two states is computed as: Δ j,i (R) = S j (R) S i (R), i < j
10 Example of Backlog Relation Emp BR Emp K Name Job T Op U S 3 S 2 S 1 1 Robert Research Researcher Assistant 10 I fchiriga4 2 Claire Assistant Director 10 I fchiriga4 31 Robert Eric Administra4ve Researcher Director 15 D jfreire 1 Robert Research Assistant 15 I jfreire 3 Eric Administra4ve Director 20 I fchiriga4 S(Emp) = {(S 1 (Emp),10),(S 2 (Emp),15),(S 3 (Emp), 20)} Δ 3,1 (Emp) = 1 Robert Researcher 15 D jfreire 1 Robert Research Assistant 15 I jfreire 3 Eric Administra4ve Director 20 I fchiriga4
11 Integrating Work1low and Database Provenance Key idea: capture informa4on about the database states Previously, we had: Now, we have: f m (I m ) = < O m > f m (I m,[r,t b (R)]) = < O m,[r,t a (R)] > state before the execu4on state amer the execu4on
12 Example of Integrated Model Δ 2,1 (R) Δ 3,2 (R) Δ 4,3 (R) S 1 (R) S 2 (R) S 3 (R) S 4 (R) I 11 I 21 m 11 m 21 I 12 I 13 O 21 m 12 m 13 m 22 O 12 O 13 m 14 m 23 W1 O 14 W2 O 23 f m12 (I 12,[R,T 1 (R)]) = < O 12,[R,Tf 1 m21 (R)] (I 21 >,[R,T 4 (R)]) = < O 21,[R,T 4 (R)] > f m13 (I 13,[R,T 1 (R)]) = < O 13,[R,T 2 (R)] >
13 Reproducibility Database states are tracked during workflow execu4on To reproduce, retrieve original state using recorded transac4on 4me, if necessary Δ 2,1 (R) Δ 3,2 (R) Δ 4,3 (R) S 1 (R) S 2 (R) S 3 (R) S 4 (R) Δ 5,4 (R) S 5 (R) I 11 I 21 m 11 m 21 I 12 I 13 O 21 m 12 m 13 O 12 O 13 m 22 f m21 (I 21,[R,T 4 (R)]) = < O 21,[R,T 4 (R)] > m 14 m 23 W1 O 14 W2 O 23
14 Querying Provenance The model supports different queries Lineage of an output Lineage of a database state How- provenance see paper for details Lineage of a tuple see paper for details
15 Lineage of an Output Δ 2,1 (R) Δ 3,2 (R) Δ 4,3 (R) S 1 (R) S 2 (R) S 3 (R) S 4 (R) I 11 I 21 m 11 m 21 I 12 I 13 O 21 m 12 m 13 m 22 O 12 O 13 m 14 m 23 W1 O 14 W2 O 23 lineage(o 14 ) = {W 1, I 11,[R,T 1 (R)]}
16 Lineage of a State Δ 2,1 (R) Δ 3,2 (R) Δ 4,3 (R) S 1 (R) S 2 (R) S 3 (R) S 4 (R) I 11 I 21 m 11 m 21 I 12 I 13 O 21 m 12 m 13 m 22 O 12 O 13 m 14 m 23 W1 O 14 W2 O 23 f m13 (I 13,[R,T 1 (R)]) = < O 13,[R,T 2 (R)] > lineage(s 2 (R)) = {W 1, f m13, I 13,[R,T 1 (R)]}
17 How- Provenance The model has informa4on about states S b and S a The model can retrieve the difference Δ a,b Through this difference, we know the opera4ons performed by the module We know exactly how the module modified the rela4on
18 Lineage of a Tuple ComputeLineage(t) : lineage(t) = {} b _ times = select T, Op from R B where K = k t for each workflow instance W : for each module m M : for each T in b _ times : if (T in f m 's output) and (T not in f m 's input) : lineage(t). add([ f m,op])
19 Implementation VisTrails Workflow- based data explora4on system Provides support for provenance Oracle RDBMS Total Recall feature Tracks the changes by using a history table Similar to a backlog rela4on SELECT column_name FROM table_name AS OF time S SELECT column_name FROM table_name VERSIONS BETWEEN time_1 AND time_2 Δ
20 VisTrails Total Recall Package SELECT name FROM mountaineers WHERE AS OF country= Brazil " SCN( ) WHERE country= Brazil " T b = T a =
21 Related Work [Acar et al., TaPP 2010] Propose a common provenance graph model Use DFL to support database queries and workflow steps [Amsterdamer et al., VLDB 2011] Propose a framework to integrate fine- grained database- style provenance into workflows Modules are Pig La4n programs Transla4on to nested rela4onal calculus expressions
22 Conclusion Proposed model to integrate workflow and database provenance Leverages func4onality of transac4on temporal databases to capture database states and uniquely iden4fy them Supports reproducibility Supports a rich set of provenance queries that straddle workflows and databases Future work Querying - - efficiency and interfaces Support DDL opera4ons
23 IPAW 12 Interna'onal Provenance and Annota'on Workshop Acknowledgements Thanks to: VisTrails Team, Dieter Gawlick and Venkatesh Radhakrishnan (Oracle) and Jan Van den Bussche This work is par4ally supported by the Na-onal Science Founda-on Any ques4ons? THANK YOU!
FACILITATING REPRODUCIBILITY AFTER THE FACT. Fernando Chiriga- ViDA Visualiza;on and Data Analysis Lab NYU Polytechnic School of Engineering
FACILITATING REPRODUCIBILITY AFTER THE FACT Fernando Chiriga- ViDA Visualiza;on and Data Analysis Lab NYU Polytechnic School of Engineering Reproducibility? What? Why? No need for mo;va;on here. Reproducibility
More informationIbis: A Provenance Manager for Mul5 Layer Systems. Christopher Olston & Anish Das Sarma Yahoo! Research
Ibis: A Provenance Manager for Mul5 Layer Systems Christopher Olston & Anish Das Sarma Yahoo! Research Mo5va5on: Many Sub Systems workflow manager e.g. Oozie inges5on dataflow programming framework e.g.
More informationScien&fic Experiments as Workflows and Scripts. Vanessa Braganholo
Scien&fic Experiments as Workflows and Scripts Vanessa Braganholo The experiment life cycle Composition Concep&on Reuse Analysis Query Discovery Visualiza&on Provenance Data Distribu&on Monitoring Execution
More informationFix- point engine in Z3. Krystof Hoder Nikolaj Bjorner Leonardo de Moura
μz Fix- point engine in Z3 Krystof Hoder Nikolaj Bjorner Leonardo de Moura Mo?va?on Horn EPR applica?ons (Datalog) Points- to analysis Security analysis Deduc?ve data- bases and knowledge bases (Yago)
More informationSQL- Updates, Asser0ons and Views
SQL- Updates, Asser0ons and Views Data Defini0on, Constraints, and Schema Changes Used to CREATE, DROP, and ALTER the descrip0ons of the tables (rela0ons) of a database CREATE TABLE In SQL2, can use the
More informationCompSci Understanding Data: Theory and Applica>ons
CompSci 590.6 Understanding Data: Theory and Applica>ons Lecture 8 Why- Not Queries (Query- based) Instructor: Sudeepa Roy Email: sudeepa@cs.duke.edu 1 Today s Paper(s) Why Not? Chapman- Jagadish SIGMOD
More informationBioinforma)cs Resources - NoSQL -
Bioinforma)cs Resources - NoSQL - Lecture & Exercises Prof. B. Rost, Dr. L. Richter, J. Reeb Ins)tut für Informa)k I12 Short SQL Recap schema typed data tables defined layout space consump)on is computable
More informationDistributed Data Management Summer Semester 2013 TU Kaiserslautern
Distributed Data Management Summer Semester 2013 TU Kaiserslautern Dr.- Ing. Sebas4an Michel smichel@mmci.uni- saarland.de Distributed Data Management, SoSe 2013, S. Michel 1 Lecture 4 PIG/HIVE Distributed
More informationA Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System
A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System Ilkay Al(ntas and Daniel Crawl San Diego Supercomputer Center UC San Diego Jianwu Wang UMBC WorDS.sdsc.edu Computa3onal
More informationRobust Identification of Fuzzy Duplicates
Robust Identification of Fuzzy Duplicates ì Authors: Surajit Chaudhuri (Microso3 Research) Venkatesh Gan; (Microso3 Research) Rajeev Motwani (Stanford University) Publica;on: 21 st Interna;onal Conference
More informationbiokepler: A Comprehensive Bioinforma2cs Scien2fic Workflow Module for Distributed Analysis of Large- Scale Biological Data
biokepler: A Comprehensive Bioinforma2cs Scien2fic Workflow Module for Distributed Analysis of Large- Scale Biological Data Ilkay Al/ntas 1, Jianwu Wang 2, Daniel Crawl 1, Shweta Purawat 1 1 San Diego
More informationApache Hive. CMSC 491 Hadoop-Based Distributed Compu<ng Spring 2016 Adam Shook
Apache Hive CMSC 491 Hadoop-Based Distributed Compu
More informationAccess Control for Enterprise Apps. Dominic Duggan Stevens Ins8tute of Technology Based on material by Lars Olson and Ross Anderson
Access Control for Enterprise Apps Dominic Duggan Stevens Ins8tute of Technology Based on material by Lars Olson and Ross Anderson SQL ACCESS CONTROL 2 App vs Database Security Mul8ple users for Apps (A)
More informationDatabase Design CENG 351
Database Design Database Design Process Requirements analysis What data, what applica;ons, what most frequent opera;ons, Conceptual database design High level descrip;on of the data and the constraint
More informationAbstract Provenance Graphs: Anticipating and Exploiting Schema-Level Data Provenance
Abstract Provenance Graphs: Anticipating and Exploiting Schema-Level Data Provenance Daniel Zinn and Bertram Ludäscher fdzinn,ludaeschg@ucdavis.edu Abstract. Provenance graphs capture flow and dependency
More informationIntroduc3on to Data Management
ICS 101 Fall 2014 Introduc3on to Data Management Assoc. Prof. Lipyeow Lim Informa3on & Computer Science Department University of Hawaii at Manoa Lipyeow Lim - - University of Hawaii at Manoa 1 The Data
More informationProvenance Manager: PROV-man an Implementation of the PROV Standard. Ammar Benabadelkader Provenance Taskforce Budapest, 24 March 2014
Provenance Manager: PROV-man an Implementation of the PROV Standard Ammar Benabadelkader Provenance Taskforce Budapest, 24 March 2014 Outlines Motivation State-of-the-art PROV-man The Approach, the data
More informationCloud Data Management System (CDMS)
Cloud Management System (CMS) Wiqar Chaudry Solu9ons Engineer Senior Advisor CMS Overview he OpenStack cloud data management system features a canonical data modeling framework designed to broker context
More informationVulnerability Analysis (III): Sta8c Analysis
Computer Security Course. Vulnerability Analysis (III): Sta8c Analysis Slide credit: Vijay D Silva 1 Efficiency of Symbolic Execu8on 2 A Sta8c Analysis Analogy 3 Syntac8c Analysis 4 Seman8cs- Based Analysis
More informationh7ps://bit.ly/citustutorial
Before We Start Setup a Citus Cloud account for the exercises: h7ps://bit.ly/citustutorial Designing a Mul
More informationECS 165B: Database System Implementa6on Lecture 14
ECS 165B: Database System Implementa6on Lecture 14 UC Davis April 28, 2010 Acknowledgements: por6ons based on slides by Raghu Ramakrishnan and Johannes Gehrke, as well as slides by Zack Ives. Class Agenda
More informationToad for Oracle s AppDesigner
Toad for Oracle s AppDesigner You can use AppDesigner to automate and control processes you perform regularly. Connection information, window settings, and queries can be saved, shared, scheduled and run
More informationObjec0ves. Gain understanding of what IDA Pro is and what it can do. Expose students to the tool GUI
Intro to IDA Pro 31/15 Objec0ves Gain understanding of what IDA Pro is and what it can do Expose students to the tool GUI Discuss some of the important func
More informationRAD, Rules, and Compatibility: What's Coming in Kuali Rice 2.0
software development simplified RAD, Rules, and Compatibility: What's Coming in Kuali Rice 2.0 Eric Westfall - Indiana University JASIG 2011 For those who don t know Kuali Rice consists of mul8ple sub-
More informationFluxo. Improving the Responsiveness of Internet Services with Automa7c Cache Placement
Fluxo Improving the Responsiveness of Internet Services with Automac Cache Placement Alexander Rasmussen UCSD (Presenng) Emre Kiciman MSR Redmond Benjamin Livshits MSR Redmond Madanlal Musuvathi MSR Redmond
More information1/5/11. Announcements. Topics Lab hour/discussion sec2on: Introduc4on (SDM, e Science, scien2fic workflows) Databases. Assignment 1
1/5/11 Announcements Topics Lab hour/discussion sec2on: Introduc4on (SDM, e Science, scien2fic workflows) Databases next Thursday 2 3pm, 93 Hutchison Assignment 1 Basic concepts of rela2onal databases (crash
More informationTiresias. The Database Oracle for How- To Queries. Alexandra Meliou Dan Suciu University of Massachuse9s Amherst. University of Washington
Tiresias The Database Oracle for How- To Queries Alexandra Meliou Dan Suciu University of Massachuse9s Amherst University of Washington University of Washington Database Group Hypothetical (What- if) Queries
More informationGeneralizing Map- Reduce
Generalizing Map- Reduce 1 Example: A Map- Reduce Graph map reduce map... reduce reduce map 2 Map- reduce is not a solu;on to every problem, not even every problem that profitably can use many compute
More informationAgenda. About ECRIN Overview of ECRIN Ac4vi4es Increasing value
Agenda About ECRIN Overview of ECRIN Ac4vi4es Increasing value ECRIN Overview A non- profit organisa4on with the legal status of European Research Infrastructure Consor4um (ERIC) Mission: support the conduct
More informationQuery and Join Op/miza/on 11/5
Query and Join Op/miza/on 11/5 Overview Recap of Merge Join Op/miza/on Logical Op/miza/on Histograms (How Es/mates Work. Big problem!) Physical Op/mizer (if we have /me) Recap on Merge Key (Simple) Idea
More informationIntelligent Systems Knowledge Representa6on
Intelligent Systems Knowledge Representa6on SCJ3553 Ar6ficial Intelligence Faculty of Computer Science and Informa6on Systems Universi6 Teknologi Malaysia Outline Introduc6on Seman6c Network Frame Conceptual
More informationIRS Use Case & Requirements
IRS Use Case & Requirements Shane Amante Level 3 Communica:ons, Inc. (Speaking on behalf of several Use Case and Requirement I- D s co- authors) IRS Use Case & Reqmt s DraHs Use Cases dra$- amante- irs-
More informationOverview of IPTV Forum Japan s Hybridcast Technical SpecificaAon
The fourth Web and TV Workshop Overview of IPTV Forum Japan s Hybridcast Technical SpecificaAon Kinji Matsumura, NHK 1 What is Technology pladorm for broadcast and broadband hybrid service that uses HTML5
More informationData Flow Analysis. Suman Jana. Adopted From U Penn CIS 570: Modern Programming Language Implementa=on (Autumn 2006)
Data Flow Analysis Suman Jana Adopted From U Penn CIS 570: Modern Programming Language Implementa=on (Autumn 2006) Data flow analysis Derives informa=on about the dynamic behavior of a program by only
More informationGarlik are the online personal iden2ty experts Set up to give individuals and their families real power over the use of their informa2on in the
1 2 Garlik are the online personal iden2ty experts Set up to give individuals and their families real power over the use of their informa2on in the digital world Garlik have assembled a world class Leadership
More informationComputer Programming-I. Developed by: Strawberry
Computer Programming-I Objec=ve of CP-I The course will enable the students to understand the basic concepts of structured programming. What is programming? Wri=ng a set of instruc=ons that computer use
More informationAsynchronous and Fault-Tolerant Recursive Datalog Evalua9on in Shared-Nothing Engines
Asynchronous and Fault-Tolerant Recursive Datalog Evalua9on in Shared-Nothing Engines Jingjing Wang, Magdalena Balazinska, Daniel Halperin University of Washington Modern Analy>cs Requires Itera>on Graph
More informationInstructor: Amol Deshpande
Instructor: Amol Deshpande amol@cs.umd.edu } Mo7va7on: Why study databases? } Background: 424 Summary } Administrivia Workload etc. } No laptop use allowed in the class!! 1 } There is a *HUGE* amount of
More informationMacro Assembler. Defini3on from h6p://www.computeruser.com
The Macro Assembler Macro Assembler Defini3on from h6p://www.computeruser.com A program that translates assembly language instruc3ons into machine code and which the programmer can use to define macro
More informationReview. Objec,ves. Example Students Table. Database Overview 3/8/17. PostgreSQL DB Elas,csearch. Databases
Objec,ves PostgreSQL DB Elas,csearch Review Databases Ø What language do we use to query databases? March 8, 2017 Sprenkle - CSCI397 1 March 8, 2017 Sprenkle - CSCI397 2 Database Overview Store data in
More informationSimplified Computa/on and Generaliza/on of the Refined Process Structure Tree
The 7 th Interna/onal Workshop on Web Services and Formal Methods (WS- FM 2010) Simplified Computa/on and Generaliza/on of the Refined Process Structure Tree Artem Polyvyanyy, Jussi Vanhatalo, and Hagen
More informationCS 4604: Introduc0on to Database Management Systems. B. Aditya Prakash Lecture #4: SQL---Part 2
CS 4604: Introduc0on to Database Management Systems B. Aditya Prakash Lecture #4: SQL---Part 2 Overview - detailed - SQL DML other parts: views modifications joins DDL constraints Prakash 2016 VT CS 4604
More informationDecision Support Systems
Decision Support Systems 2011/2012 Week 3. Lecture 6 Previous Class Dimensions & Measures Dimensions: Item Time Loca0on Measures: Quan0ty Sales TransID ItemName ItemID Date Store Qty T0001 Computer I23
More informationRISC-V, Rocket, and RoCC Spring 2017 James Mar2n
RISC-V, Rocket, and RoCC Spring 2017 James Mar2n What s new in Lab 2: In lab 1, you built a SHA3 unit that operates in isola2on We would like Sha3Accel to act as an accelerator for a processor Lab 2 introduces
More informationSql 2008 Copy Tables Structure And Database To Another
Sql 2008 Copy Tables Structure And Database To Another Copy NAV Database Structure to another Database along with Data in SQL @tablevar table(name varchar(300)) declare @columntablevar table(column_name
More informationCourse Title: Introduction to Database Management System Course Code: CSIT116 Course Level: UG Course Credits:04 L T P/ S SW/F W
Course Title: Introduction to Database Management System Course Code: CSIT116 Course Level: UG Course Credits:04 Course Objectives: The objectives of this course is to: To expose the students to the fundamentals
More informationEAS- SEC: Framework for Securing Enterprise Business Applica;ons
Invest in security to secure investments EAS- SEC: Framework for Securing Enterprise Business Applica;ons Alexander Polyakov CTO ERPScan About ERPScan The only 360- degree SAP Security solu8on - ERPScan
More informationKeyword search in databases: the power of RDBMS
Keyword search in databases: the power of RDBMS 1 Introduc
More informationIntroduc.on to Databases
Introduc.on to Databases G6921 and G6931 Web Technologies Dr. Séamus Lawless Housekeeping Course Structure 1) Intro to the Web 2) HTML 3) HTML and CSS Essay Informa.on Session 4) Intro to Databases 5)
More informationSchool of Computing, Engineering and Information Sciences University of Northumbria. Set Operations
Set Operations Aim: To understand how to do the equivalent of the UNION, DIFFERENCE and INTERSECT set operations in SQL. Outline of Session: Do some example SQL queries to learn to differentiate between
More informationApproach starts with GEN and KILL sets
b -Advanced-DFA Review of Data Flow Analysis State Propaga+on Computer Science 5-6 Fall Prof. L. J. Osterweil Material adapted from slides originally prepared by Prof. L. A. Clarke A technique for determining
More informationDetec%ng the Temporal Context of Queries. Oliver Kennedy, Ying Yang, Jan Chomicki, Ronny Fehling, Zhen Hua Liu, and Dieter Gawlick 09/01/2014
Detec%ng the Temporal Context of Queries Oliver Kennedy, Ying Yang, Jan Chomicki, Ronny Fehling, Zhen Hua Liu, and Dieter Gawlick 09/01/2014 Outline Mo.va.on Contextual Analysis Prac.cal Temporal Dependency
More informationDD2451 Parallel and Distributed Computing --- FDD3008 Distributed Algorithms
DD2451 Parallel and Distributed Computing --- FDD3008 Distributed Algorithms Lecture 8 Leader Election Mads Dam Autumn/Winter 2011 Previously... Consensus for message passing concurrency Crash failures,
More informationProofs about Programs
Proofs about Programs Program Verification (Rosen, Sections 5.5) TOPICS Program Correctness Preconditions & Postconditions Program Verification Assignment Statements Conditional Statements Loops Composition
More informationChapter 2: Intro to Relational Model
Chapter 2: Intro to Relational Model Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Example of a Relation attributes (or columns) tuples (or rows) 2.2 Attribute Types The
More informationProvenance Management in Databases under Schema Evolution
Provenance Management in Databases under Schema Evolution Shi Gao, Carlo Zaniolo Department of Computer Science University of California, Los Angeles 1 Provenance under Schema Evolution Modern information
More informationAr#ficial Intelligence
Ar#ficial Intelligence Advanced Searching Prof Alexiei Dingli Gene#c Algorithms Charles Darwin Genetic Algorithms are good at taking large, potentially huge search spaces and navigating them, looking for
More informationDATA AND SCHEMA MODIFICATIONS CHAPTERS 4,5 (6/E) CHAPTER 8 (5/E)
1 DATA AND SCHEMA MODIFICATIONS CHAPTERS 4,5 (6/E) CHAPTER 8 (5/E) 2 LECTURE OUTLINE Updating Databases Using SQL Specifying Constraints as Assertions and Actions as Triggers Schema Change Statements in
More informationSausalito: An Applica/on Server for RESTful Services in the Cloud. Ma;hias Brantner & Donald Kossmann 28msec Inc. h;p://sausalito.28msec.
Sausalito: An Applica/on Server for RESTful Services in the Cloud Ma;hias Brantner & Donald Kossmann 28msec Inc. h;p://sausalito.28msec.com/ Conclusion Integrate DBMS, Applica3on Server, and Web Server
More informationPerformance and Scalability: Apriori Implementa6on
Performance and Scalability: Apriori Implementa6on Apriori R. Agrawal and R. Srikant. Fast algorithms for mining associa6on rules. VLDB, 487 499, 1994 Reducing Number of Comparisons Candidate coun6ng:
More informationCSc 120. Introduc/on to Computer Programming II. 02: Problem Decomposi1on and Program Development. Adapted from slides by Dr.
CSc 120 Introduc/on to Computer Programming II Adapted from slides by Dr. Saumya Debray 02: Problem Decomposi1on and Program Development A common student lament "I have this big programming assignment.
More informationRela+onal Algebra. Rela+onal Query Languages. CISC437/637, Lecture #6 Ben Cartere?e
Rela+onal Algebra CISC437/637, Lecture #6 Ben Cartere?e Copyright Ben Cartere?e 1 Rela+onal Query Languages A query language allows manipula+on and retrieval of data from a database The rela+onal model
More information11/12/11. Objec&ves Overview. Databases, Data, and Informa&on. Objec&ves Overview. Databases, Data, and Informa&on. Databases, Data, and Informa&on
Objec&ves Overview Define the term,, and explain how a interacts with and informa:on Define the term, integrity, and describe the quali:es of valuable informa:on Discuss the terms character, field, record,
More informationLet's play a public String ping() { counter++; return "SF hits ="+counter;
Stateful beans Let's play a bit package session; import javax.ejb.stateful; @Stateful public class StatefulSessionBean implements StatefulSessionBeanRemote { int counter=0; @Override public String ping()
More informationHandle Server Performance Tuning
Handle Server Performance Tuning Jason Petrone Corporation for National Research Initiatives Full Screen Handle System Workshop 2 p. 1/14 Reducing Trips to Global Handle System
More informationThinking Induc,vely. COS 326 David Walker Princeton University
Thinking Induc,vely COS 326 David Walker Princeton University slides copyright 2017 David Walker permission granted to reuse these slides for non-commercial educa,onal purposes Administra,on 2 Assignment
More informationInforma)on Retrieval and Map- Reduce Implementa)ons. Mohammad Amir Sharif PhD Student Center for Advanced Computer Studies
Informa)on Retrieval and Map- Reduce Implementa)ons Mohammad Amir Sharif PhD Student Center for Advanced Computer Studies mas4108@louisiana.edu Map-Reduce: Why? Need to process 100TB datasets On 1 node:
More informationLecture 1 Introduc-on
Lecture 1 Introduc-on What would you get out of this course? Structure of a Compiler Op9miza9on Example 15-745: Introduc9on 1 What Do Compilers Do? 1. Translate one language into another e.g., convert
More informationApache Storm. A framework for Parallel Data Stream Processing
Apache Storm A framework for Parallel Data Stream Processing Storm Storm is a distributed real- ;me computa;on pla
More informationCS 465 Final Review. Fall 2017 Prof. Daniel Menasce
CS 465 Final Review Fall 2017 Prof. Daniel Menasce Ques@ons What are the types of hazards in a datapath and how each of them can be mi@gated? State and explain some of the methods used to deal with branch
More informationThe DCGS- A ontology suite Standard opera8ng procedures and ontology quality assurance Annota8on vs. Explica8on How the DCGS- A ontologies are being
Ron Rudnicki November 12, 2013 The DCGS- A ontology suite Standard opera8ng procedures and ontology quality assurance Annota8on vs. Explica8on How the DCGS- A ontologies are being used for the explica8on
More informationCMPS 20 Game Design Experience
CMPS 20 Game Design Experience Winter 2013 Week 4/5: Design Patterns Computational Cinematics Studio // Center for Games and Playable Media http://games.soe.ucsc.edu/ccs // http://games.soe.ucsc.edu Arnav
More informationComputer Architecture. CSE 1019Y Week 16. Introduc>on to MARIE
Computer Architecture CSE 1019Y Week 16 Introduc>on to MARIE MARIE Simple model computer used in this class MARIE Machine Architecture that is Really Intui>ve and Easy Designed for educa>on only While
More informationSQL: A COMMERCIAL DATABASE LANGUAGE. Data Change Statements,
SQL: A COMMERCIAL DATABASE LANGUAGE Data Change Statements, Outline 1. Introduction 2. Data Definition, Basic Constraints, and Schema Changes 3. Basic Queries 4. More complex Queries 5. Aggregate Functions
More informationTo understand the concept of candidate and primary keys and their application in table creation.
CM0719: Database Modelling Seminar 5 (b): Candidate and Primary Keys Exercise Aims: To understand the concept of candidate and primary keys and their application in table creation. Outline of Activity:
More informationSpa$al Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University
Spa$al Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University Class Outlines Spatial Point Pattern Regional Data (Areal Data) Continuous Spatial Data (Geostatistical
More informationHSC QUEUE MODE OPERATION PLAN
HSC QUEUE MODE OPERATION PLAN Tae- Soo Pyo & Queue- mode team (Subaru Telescope) 2015. 1.13 Working Group Tae- Soo Pyo Ikuru Iwata Masatoshi Imanishi [ 9 members] Eric Jeschke Queue Mode Team Sherry Yeh
More informationHeterogeneous compute in the GATK
Heterogeneous compute in the GATK Mauricio Carneiro GSA Broad Ins
More informationUPCRC. Illiac. Gigascale System Research Center. Petascale computing. Cloud Computing Testbed (CCT) 2
Illiac UPCRC Petascale computing Gigascale System Research Center Cloud Computing Testbed (CCT) 2 www.parallel.illinois.edu Mul2 Core: All Computers Are Now Parallel We con'nue to have more transistors
More informationStream and Complex Event Processing Discovering Exis7ng Systems:c- sparql
Stream and Complex Event Processing Discovering Exis7ng Systems:c- sparql G. Cugola E. Della Valle A. Margara Politecnico di Milano cugola@elet.polimi.it dellavalle@elet.polimi.it Vrije Universiteit Amsterdam
More informationNetwork Access Transla0on - NAT
Network Access Transla0on - NAT Foreword Those slides have been done by gathering a lot of informa0on on the net Ø Cisco tutorial Ø Lectures from other ins0tu0ons University of Princeton University of
More informationSelf-Demo Guide. Oracle ilearning and HTML DB
2003-2004 Self-Demo Guide Oracle ilearning and HTML DB The Oracle Academy allows a school to offer advanced Database and Java programming courses through the use of Oracle s infrastructure. The school
More informationCS 4604: Introduc0on to Database Management Systems
CS 4604: Introduc0on to Database Management Systems B. Aditya Prakash Lecture #1: Introduc/on Based on material by Profs. T. M. Murali and Christos Faloutsos Course Informa0on Instructor B. Aditya Prakash,
More informationTowards Automatic Optimization of MapReduce Programs (Position Paper) Shivnath Babu Duke University
Towards Automatic Optimization of MapReduce Programs (Position Paper) Shivnath Babu Duke University Roadmap Call to action to improve automatic optimization techniques in MapReduce frameworks Challenges
More informationCS 4604: Introduc0on to Database Management Systems. B. Aditya Prakash Lecture #1: Introduc/on
CS 4604: Introduc0on to Database Management Systems B. Aditya Prakash Lecture #1: Introduc/on Course Informa0on Instructor B. Aditya Prakash, Torg 3160 F, badityap@cs.vt.edu Office Hours: 2:30-3:30pm Mondays
More informationECSE 425 Lecture 25: Mul1- threading
ECSE 425 Lecture 25: Mul1- threading H&P Chapter 3 Last Time Theore1cal and prac1cal limits of ILP Instruc1on window Branch predic1on Register renaming 2 Today Mul1- threading Chapter 3.5 Summary of ILP:
More informationCS 61C: Great Ideas in Computer Architecture. MIPS Instruc,on Representa,on II. Dan Garcia
CS 61C: Great Ideas in Computer Architecture MIPS Instruc,on Representa,on II Dan Garcia 1 Review of Last Lecture Simplifying MIPS: Define instruc?ons to be same size as data word (one word) so that they
More informationGradient Descent. Michail Michailidis & Patrick Maiden
Gradient Descent Michail Michailidis & Patrick Maiden Outline Mo4va4on Gradient Descent Algorithm Issues & Alterna4ves Stochas4c Gradient Descent Parallel Gradient Descent HOGWILD! Mo4va4on It is good
More informationMANAGING DATA(BASES) USING SQL (NON-PROCEDURAL SQL, X401.9)
Technology & Information Management Instructor: Michael Kremer, Ph.D. Class 4 Professional Program: Data Administration and Management MANAGING DATA(BASES) USING SQL (NON-PROCEDURAL SQL, X401.9) AGENDA
More informationDisplay and usage of Interna3onalized Registra3on Data. Dave Piscitello
Display and usage of Interna3onalized Registra3on Data Dave Piscitello 1 SAC 037: Display and usage of Interna3onalized Registra3on Data Reference document can be found at h=p://www.icann.org/commi=ees/security/
More informationSQL Structured Query Language Introduction
SQL Structured Query Language Introduction Rifat Shahriyar Dept of CSE, BUET Tables In relational database systems data are represented using tables (relations). A query issued against the database also
More informationApplication of Named Graphs Towards Custom Provenance Views
Application of Named Graphs Towards Custom Provenance Views Tara Gibson, Karen Schuchardt, Eric Stephan Pacific Northwest National Laboratory Abstract Provenance capture as applied to execution oriented
More informationProvenance-aware Faceted Search in Drupal
Provenance-aware Faceted Search in Drupal Zhenning Shangguan, Jinguang Zheng, and Deborah L. McGuinness Tetherless World Constellation, Computer Science Department, Rensselaer Polytechnic Institute, 110
More informationNetSecOps: Policy-Driven, Knowledge-Centric, Holis<c Network Security Opera<ons
NetSecOps: Policy-Driven, Knowledge-Centric, Holis
More informationFounda'ons of So,ware Engineering. Process: Agile Prac.ces Claire Le Goues
Founda'ons of So,ware Engineering Process: Agile Prac.ces Claire Le Goues 1 Learning goals Define agile as both a set of itera.ve process prac.ces and a business approach for aligning customer needs with
More informationCS6200 Informa.on Retrieval. David Smith College of Computer and Informa.on Science Northeastern University
CS6200 Informa.on Retrieval David Smith College of Computer and Informa.on Science Northeastern University Indexing Process Indexes Indexes are data structures designed to make search faster Text search
More informationAnalysis and Tes,ng of PLEXIL Plans
Analysis and Tes,ng of PLEXIL Plans Jason Biatek, Michael W. Whalen, Sanjai Rayadurgam, Mats P.E. Heimdahl, Michael Lowry 04-06- 14 FormaliSE - Mike Whalen 1 Autonomy at NASA 04-06- 14 FormaliSE - Mike
More informationCS6312 DATABASE MANAGEMENT SYSTEMS LABORATORY L T P C
CS6312 DATABASE MANAGEMENT SYSTEMS LABORATORY L T P C 0 0 3 2 LIST OF EXPERIMENTS: 1. Creation of a database and writing SQL queries to retrieve information from the database. 2. Performing Insertion,
More informationW1005 Intro to CS and Programming in MATLAB. Brief History of Compu?ng. Fall 2014 Instructor: Ilia Vovsha. hip://www.cs.columbia.
W1005 Intro to CS and Programming in MATLAB Brief History of Compu?ng Fall 2014 Instructor: Ilia Vovsha hip://www.cs.columbia.edu/~vovsha/w1005 Computer Philosophy Computer is a (electronic digital) device
More informationOracle Database Auditing
By Craig Moir craig@mydba.co.za http://www.mydba.co.za August 2012 Version 1 WHY AUDIT? Allows organizations to enforce the trust-but-verify security principle. Satisfying compliance regulations. Enables
More information