Data Fusion. References

Size: px
Start display at page:

Download "Data Fusion. References"

Transcription

1 Data Fusion Helena Galhardas DEI/IST References Data Fusion, Jens Bleiholder and Felix Naumann, ACM Computing Surveys, Vol. 41, N.1, Slides VLDB 2009 tutorial on Data Fusion, Luna Dong and Felix Naumann Slides Data Quality and Data Cleansing course, Felix Naumann, Winter 2014/15 2 1

2 Definition of Data Fusion Process of consolidating multiple records representing the same real-world object into a single, consistent, and clean representation Also known as: data merging, consolidation, entity resolution, finding representatives/ survivors, instance-level conflict resolution 3 Example: Data Fusion amazon.com H. Melville $3.98 " ID MAX length MIN CONCAT Herman Melville Moby Dick $5.99!! bn.com 4 2

3 Why data fusion is required? Data conflicts exist and occur due to poor data quality Errors exist when collecting or entering data Sources are not updated, etc Several types of data conflicts: representation conflicts, e.g. dollar vs. euro key equivalence conflicts, i.e. same real world objects with different identifiers attribute value conflicts, i.e. instances corresponding to same real world objects and sharing an equivalent key, differ on other attributes 5 Another Example EmpId Name Surname Salary A-ribute conflicts arpa78 John Smith 2000 smith@abc.it eugi 98 Edward Monroe 1500 monroe@abc. it ghjk09 Anthony Wite 1250 white@abc.it treg23 Marianne Collins 1150 collins@abc.it EmpId Name Surname Salary arpa78 John Smith 2600 smith@abc.it Key conflicts eugi 98 Edward Monroe 1500 monroe@abc.it ghjk09 Anthony White 1250 white@abc.it dref43 Marianne Collins 1150 collins@abc.it 6 3

4 Data Conflict Elimination Error correction Reference tables Cities, countries, products... Similarity measures Standardization Domain-knowledge (meta data) Conventions (country/region-specific spelling) Ontologies Thesauri, dictionaries for homonyms, synonyms,... Outlier detection and elimination And data fusion 7 Outline Ø Data fusion in the data integration process Completeness and conciseness Foundations of data fusion Conflict resolution strategies and functions Conflict resolution operators Existing data fusion systems 8 4

5 Data Integration Source A Source B <Titel> Federated Database Systems </Titel> <Autor> Amit Sheth </Autor> <Autor> James Larson </Autor> <publicahon> <Htle> Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases </Htle> <author> Scheth & Larson </author> <year> 1990 </year> </publicahon> Matching Mapping Duplicate DetecHon Data Fusion 9 Data Integration Source A Source B <Titel> Federated Database Systems </Titel> <Autor> Amit Sheth </Autor> <Autor> James Larson </Autor> <publicahon> <Htle> Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases </Htle> <author> Scheth & Larson </author> <year> 1990 </year> </publicahon> <Htle> </Htle> <author> </author> <author> </author> <year> </year> Matching IntegraHon Matching Mapping Duplicate DetecHon Data Fusion 10 5

6 Source A Source B Data Integration <Titel> Federated Database Systems </Titel> <Autor> Amit Sheth </Autor> <Autor> James Larson </Autor> <publicahon> <Htle> Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases </Htle> <author> Scheth & Larson </author> <year> 1990 </year> </publicahon> TransformaHon queries or views <Htle> Federated Database Systems </Htle> <author> Amit Sheth </author> <author> James Larson </author> <Htle> Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases </Htle> <author> Scheth & Larson </author> <year> 1990 </year> XQuery XQuery Matching Mapping Duplicate DetecHon Data Fusion 11 Data Integration Source A Source B <Titel> Federated Database Systems </Titel> <Autor> Amit Sheth </Autor> <Autor> James Larson </Autor> <publicahon> <Htle> Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases </Htle> <author> Scheth & Larson </author> <year> 1990 </year> </publicahon> <Htle> Federated Database Systems </Htle> <author> Amit Sheth </author> <author> James Larson </author> <Htle> Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases </Htle> <author> Scheth & Larson </author> <year> 1990 </year> Matching Mapping Duplicate DetecHon Data Fusion 12 6

7 Data Integration Source A Source B <Titel> Federated Database Systems </Titel> <Autor> Amit Sheth </Autor> <Autor> James Larson </Autor> <publicahon> <Htle> Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases </Htle> <author> Scheth & Larson </author> <year> 1990 </year> </publicahon> <Htle> Federated Database Systems </Htle> <author> Amit Sheth </author> <author> James Larson </author> <Htle> Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases </Htle> <author> Scheth & Larson </author> <year> 1990 </year> Matching Mapping Duplicate DetecHon Data Fusion 13 Data Integration Source A Source B <Htle> Federated Database Systems </Htle> <author> Amit Sheth </author> <author> James Larson </author> <Htle> Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases </Htle> <author> Scheth & Larson </author> <year> 1990 </year> <Htle> Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases </Htle> <author> Amit Sheth </author> <author> James Larson </author> <year> 1990 </year> Preserve lineage Matching Mapping Duplicate DetecHon Data Fusion 14 7

8 Goals of data integration To increase the completeness By adding more data sources (more objects, more attributes) so that no object is forgotten in the result the conciseness by removing redundant data, by fusing duplicate entries and merging common attributes into one no object is represented twice and the data presented to the user has no contradictions of data that is available to users and applications 15 Completeness Measures the amount of data in a dataset both in terms of the number of tuples (extensional, data level) and the number of attributes (intensional, schema level) Extensional completeness: number of unique object representations in a dataset in relation to the overall number of unique objects in the real world (in all the sources); measures the % of real-world objects covered by the dataset Intensional completeness: number of unique attributes in a dataset in relation to the overall number of unique attributes available; an increase is achieved by integrating sources with additional attributes 16 8

9 Example Extensional completeness = 4/4 = 1 in the result of combining the two sources (4 distinct IDs in the sources) Extensional completeness of ¾ in each of the sources Extensional conciseness = 4/4 =1 17 Conciseness Measures the uniqueness of object representations in a data set Extensional conciseness: number of unique objects in a dataset in relation to the overall number of object representations in the dataset Intensional conciseness: measures the number of unique attributes in a dataset in relation to the overall number of attributes 18 9

10 Data conflict A data conflict exists if, for the same realworld object, semantically equivalent attributes, from one or more sources, do not agree on their value Two kinds of data conflicts: uncertainties and contradictions 19 Uncertainty and Contradiction Uncertainty: conflict between a non-null value and one or more null values that are all used to describe the same property of an object. Easy case Contradiction Non-NULL value vs. (different) non-null value Uncertainty Contradic5on Uncertainty 20 10

11 Semantics of NULL unknown There is a value, but I do not know it. E.g.: Unknown date-of-birth not applicable There is no meaningful value. E.g.: Spouse for singles withheld There is a value, but we are not authorized to see it. E.g.: Private phone line 21 Example 22 11

12 Challenges of Data Fusion Problem Given a duplicate, create a single object representation while resolving conflicting data values. 23 The Field of Data Fusion Data Fusion Conflict types ResoluHon strategies Operators ResoluHon funchons Uncertainty ContradicHon Join-based Possible worlds SubsumpHon AggregaHon Union-based Consistent answers ComplementaHon Advanced funchons Ignorance Avoidance ResoluHon Instance-based Metadata-based Instance-based Metadata-based 24 12

13 Overview Data fusion in the integration process Completeness and conciseness Foundations of data fusion Ø Conflict resolution strategies and functions Conflict resolution operators Existing data fusion systems 25 Classification of Resolution Strategies Conflict-ignoring strategies: do not make a decision as to what to do with conflicting data Ex: presenting both x and k values for attribute B of OID 3 Conflict-avoiding strategies: acknowledge the existence of possible conflicts, but do not detect and resolve single existing conflicts. Handle conflicting data by applying a unique decision equally to all data, such as preferring data from a special source. Ex: only the null value or x when preferring objects from S over T Conflict-resolution strategies: regard all data and metadata before deciding. Deciding: choose a value from all the already present values Mediating: choose a value that does not necessarily exist among the conflicting values 26 13

14 Possible Conflict Resolution Strategies 27 Classification of Conflict Resolution Functions conflict resoluhon strategies conflict ignorance Escalate instance based Coalesce ChooseDepending Concat conflict avoidance metadata based Choose instance based conflict resoluhon metadata based deciding mediahng deciding mediahng MIN, MAX Random Vote AVG, SUM MostRecent CommonAncestor MostAbstract MostSpecific 28 14

15 Conflict Resolution Functions Func5on Descrip5on Examples Min, Max, Sum, Count, Avg Standard aggregahon NumChildren, Salary, Height Random Random choice Shoe size Longest, Shortest Longest/shortest value First_name Choose(source) Value from a parhcular source DoB (DMV), CEO (SEC) ChooseDepending(val, col) Value depends on value chosen in other column city & zip, & employer Vote Majority decision RaHng Coalesce First non-null value First_name Group, Concat Group or concatenate all values Book_reviews MostRecent Most recent (up-to-date) value Address MostAbstract, MostSpecific, CommonAncestor Use a taxonomy / ontology LocaHon Escalate Export conflichng values gender 29 Overview Data fusion in the integration process Foundations of data fusion Conflict resolution strategies and functions Ø Conflict resolution operators Existing data fusion systems 30 15

16 The Field of Data Fusion Data Fusion Conflict types ResoluHon strategies Operators ResoluHon funchons Uncertainty ContradicHon Join-based Possible worlds SubsumpHon AggregaHon Union-based Consistent answers ComplementaHon Advanced funchons Ignorance Avoidance ResoluHon Instance-based Metadata-based Instance-based Metadata-based 31 Properties of operators Value preservation An operator that does not loose any value from the objects in the sources nor creates nor duplicates values Bag-union: value preserving Set union: non value-preserving Related to completeness Uniqueness When attributes that contain unique values in the sources also contain unique values in the result Equality join: uniqueness-preserving Union: non uniqueness-preserving Related to conciseness The ideal fusion operator should preserve as many values and objects as possible while enforcing or preserving uniqueness and resolving conflicts 32 16

17 Fusing tuples Source 1(A,B,C) Source 2(A,B,D) a, b, - a, b, - a, b, -, - a, b, -, - a, b, -, - IdenHcal tuples a, b, c a, b, - a, b, c, - a, b, -, - a, b, c, - Subsumed tuples a, b, c a, b, d a, b, c, - a, b, -, d a, b, c, d ComplemenHng tuples a, b, c a, e, d a, b, c, - a, e, -, d a, f(b,e), c, d ConflicHng tuples 33 Relational Operators and extensions Identical tuples UNION, OUTER UNION Subsumed tuples (uncertainty) MINIMUM UNION Complementing tuples (uncertainty) COMPLEMENT UNION, MERGE Conflicting tuples (contradiction) Relational approaches: Match, Group, Fuse, Other approaches Possible worlds, probabilistic answers, consistent answers 34 17

18 Union: Elimination of exact duplicates Minimum Union: elimination of subsumed tuples Outer union Subsumption Minimum Union A B C a b c e f g m n o A B D a b + = e f h m p A tuple t1 subsumes a tuple t2, if it has same schema, has less NULL-values, and coincides in all non-nullvalues. A B C D a b c a b e f g e f h m n o m p R A B C D a b c e f g e f h m n o m p 35 Complement Union Proposal Elimination of complementing tuples Outer union Complementation No known SQL rewriting A B C a b c e f g m n o A B D a b + = e f h m p A tuple t1 complements a tuple t2, if it has same schema and coincides in all non-null-values. Includes duplicate removal and subsumphon A B C D a b c a b e f g e f h m n o m p R A B C D a b c e f g h m n o m p 36 18

19 Grouping and Aggregation Outer union then group by real-world ID Aggregate all other columns using conflict resolving aggregate function Efficient implementations Catches inter- and intra-source duplicates Restricted to built-in aggregate-functions MAX, MIN, AVG, VAR, STDDEV, SUM, COUNT WITH OU AS ( ( SELECT A, B, C, NULL AS D FROM U1 ) UNION (ALL) ( SELECT A, B, NULL AS C, D FROM U2 ) ), SELECT A, MAX(B), MIN(C), SUM(D) FROM OU GROUP BY A 37 FUSE BY SQL extensions to resolve uncertainties and contradictions [BN05,BBB+05] FUSE FROM implies OUTER UNION Removes subsumed and duplicate tuples by default FUSE BY declares real-world ID RESOLVE specifies conflict resolution function from catalog Default: COALESCE Implemented on top of relational DBMS XXL SELECT ID, RESOLVE(Title, Choose(IMDB)), RESOLVE(Year, Max), RESOLVE(Director), RESOLVE(Rating), RESOLVE(Genre, Concat) FUSE FROM IMDB, Filmdienst FUSE BY (ID) ON ORDER Year DESC 38 19

20 Union, Outer Union Summary of Operators Duplicates Subsumed tuples Complemen5ng tuples Contradic5ons ü û û û Minimum Union ü ü û û Full DisjuncHon ü ü ü (inter-source) Complement Union ü ü ü û Merge ü ü (intersource) MatchJoin + CTQM ü (inter-source) ü ü ü û ü Group By ü ü ü ü Fuse By ü ü ü ü û 39 FuSem Tool to query and fuse data from diverse data sources [BDN07] Based on HumMer project [BBB+05]. Explore data and find interesting subsets Execute, explore and compare five different data fusion semantics, specified in their respective syntax: SQL (and extensions, such as Subsumption) Merge MatchJoin FuseBy ConQuer 40 20

21 Overview Data fusion in the integration process Foundations of data fusion Conflict resolution strategies and functions Conflict resolution operators Ø Existing data fusion systems 41 Commercial Data Integration Tools Source: Gartner Typical ETL tools support rule-based fusion that allow to remove uncertainties but not data conflicts IIS (IBM Information Server) SSIS (Microsoft s SQL Server Integration Services) This functionality is typically named survivorship or consolidaton 42 21

22 Research Data Integration/Cleaning Systems System Conflict types Strategy class Func5on Specifica5on MulHbase Hc, data ResoluHon Choose, Avg, Min, Max, Sum, Manually, in query Hermes Hc, data ResoluHon MostRecent, Choose Manually, in mediator Fusionplex Hc, object, data ResoluHon MostRecent, Min, Max, Avg, Manually, in query HumMer Hc, object, data ResoluHon MostAbstract, Vote, Min, ChooseDepen Ajax Hc, object, data ResoluHon Various Manually, in query Manually, in workflow definihon TSIMMIS Hc, data Avoidance Choose Manually, rules in mediator SIMS/Ariadne Hc, data Avoidance Choose AutomaHcally Infomix Hc, data Avoidance onlyconsistentvalue AutomaHcally Hippo Hc, object, data Avoidance onlyconsistentvalue AutomaHcally ConQuer Hc, object, data Avoidance onlyconsistentvalue AutomaHcally Rainbow Hc, object, data Avoidance onlyconsistentvalue AutomaHcally Pegasus Hc, data Ignorance Escalate Manually Nimble Unknown Ignorance Escalate Manually Carnot Hc Ignorance Escalate AutomaHcally InfoSleuth Hc Ignorance Escalate Unknown Poler s Wheel 43 Hc Ignorance Escalate Manually, transformahon CLEENEX Next Lecture 22

23 Figure 4 a) High completeness but not concise Without schema mapping information and knowledge of object identifiers b) Intensionally concise Knowledge about common attributes given by schema mapping c) Extensionally concise Knowledge about common objects d) Intensionally and extensionally concise Knowledge about common attributes and common objects 45 23

C. Batini & M. Scannapieco Data and Information Quality Book Figures. Chapter 10: Data Quality Issues in Data Integration Systems

C. Batini & M. Scannapieco Data and Information Quality Book Figures. Chapter 10: Data Quality Issues in Data Integration Systems C. Batini & M. Scannapieco Data and Information Quality Book Figures Chapter 10: Data Quality Issues in Data Integration Systems 1 Data Fusion Strategies conflict handling strategies conflict ignorance

More information

Data Quality in Databases. Felix Naumann Hasso-Plattner-Institut

Data Quality in Databases. Felix Naumann Hasso-Plattner-Institut Data Quality in Databases OpEN.SC Symposium 8.5.2009 Felix Naumann Hasso-Plattner-Institut Fachgebiet Informationssysteme The HPI Hasso Plattner Institut 2 Founded in 1998 as a Public Private Partnership

More information

Data Consolidation in Three Steps. Milano, May 9, 2008 Felix Naumann

Data Consolidation in Three Steps. Milano, May 9, 2008 Felix Naumann Data Consolidation in Three Steps Milano, May 9, 2008 Felix Naumann The HPI Hasso Plattner Institut 2 Founded in 1998 as a Public Private Partnership Hasso Plattner, co-founder of SAP, endowed over 200

More information

A Relational Operator Approach to Data Fusion

A Relational Operator Approach to Data Fusion A Relational Operator Approach to Data Fusion Jens Bleiholder Humboldt-Universität zu Berlin Unter den Linden 6, 10099 Berlin, Germany bleiho@informatik.hu-berlin.de Abstract Integrated information systems

More information

Information Quality in Integrated Information Systems

Information Quality in Integrated Information Systems Information Quality in Integrated Information Systems Institute for Infocomm Research, Singapore 15.3.2005 Felix Naumann Humboldt-Universität zu Berlin 15.3.2005 Felix Naumann - Humboldt-Universität zu

More information

Outline. Data Integration. Entity Matching/Identification. Duplicate Detection. More Resources. Duplicates Detection in Database Integration

Outline. Data Integration. Entity Matching/Identification. Duplicate Detection. More Resources. Duplicates Detection in Database Integration Outline Duplicates Detection in Database Integration Background HumMer Automatic Data Fusion System Duplicate Detection methods An efficient method using priority queue Approach based on Extended key Approach

More information

11/04/16. Data Profiling. Helena Galhardas DEI/IST. References

11/04/16. Data Profiling. Helena Galhardas DEI/IST. References Data Profiling Helena Galhardas DEI/IST References Slides Data Profiling course, Felix Naumann, Trento, July 2015 Z. Abedjan, L. Golab, F. Naumann, Profiling Relational Data A Survey, VLDBJ 2015 T. Papenbrock

More information

MCSA SQL SERVER 2012

MCSA SQL SERVER 2012 MCSA SQL SERVER 2012 1. Course 10774A: Querying Microsoft SQL Server 2012 Course Outline Module 1: Introduction to Microsoft SQL Server 2012 Introducing Microsoft SQL Server 2012 Getting Started with SQL

More information

CMP-3440 Database Systems

CMP-3440 Database Systems CMP-3440 Database Systems Relational DB Languages Relational Algebra, Calculus, SQL Lecture 05 zain 1 Introduction Relational algebra & relational calculus are formal languages associated with the relational

More information

Data Warehousing ETL. Esteban Zimányi Slides by Toon Calders

Data Warehousing ETL. Esteban Zimányi Slides by Toon Calders Data Warehousing ETL Esteban Zimányi ezimanyi@ulb.ac.be Slides by Toon Calders 1 Overview Picture other sources Metadata Monitor & Integrator OLAP Server Analysis Operational DBs Extract Transform Load

More information

Database Management Systems,

Database Management Systems, Database Management Systems SQL Query Language (3) 1 Topics Aggregate Functions in Queries count sum max min avg Group by queries Set Operations in SQL Queries Views 2 Aggregate Functions Tables are collections

More information

Data about data is database Select correct option: True False Partially True None of the Above

Data about data is database Select correct option: True False Partially True None of the Above Within a table, each primary key value. is a minimal super key is always the first field in each table must be numeric must be unique Foreign Key is A field in a table that matches a key field in another

More information

Tuple Merging in Probabilistic Databases

Tuple Merging in Probabilistic Databases Tuple Merging in Probabilistic Databases Fabian Panse and Norbert Ritter Universität Hamburg, Vogt-Kölln Straße 33, 22527 Hamburg, Germany {panse,ritter}@informatik.uni-hamburg.de http://vsis-www.informatik.uni-hamburg.de/

More information

Chapter 3: The Relational Database Model

Chapter 3: The Relational Database Model Chapter 3: The Relational Database Model Student: 1. The practical significance of taking the logical view of a database is that it serves as a reminder of the simple file concept of data storage. 2. You

More information

Multiplex: Integrating. Autonomous, Heterogeneous and Inconsistent Information Sources

Multiplex: Integrating. Autonomous, Heterogeneous and Inconsistent Information Sources Multiplex: Integrating Autonomous, Heterogeneous and Inconsistent Information Sources Amihai Motro Proceedings of NGITS 99 Fourth International Workshop on Next Generation Information Technologies and

More information

Relational Model. Rab Nawaz Jadoon DCS. Assistant Professor. Department of Computer Science. COMSATS IIT, Abbottabad Pakistan

Relational Model. Rab Nawaz Jadoon DCS. Assistant Professor. Department of Computer Science. COMSATS IIT, Abbottabad Pakistan Relational Model DCS COMSATS Institute of Information Technology Rab Nawaz Jadoon Assistant Professor COMSATS IIT, Abbottabad Pakistan Management Information Systems (MIS) Relational Model Relational Data

More information

Computing for Medicine (C4M) Seminar 3: Databases. Michelle Craig Associate Professor, Teaching Stream

Computing for Medicine (C4M) Seminar 3: Databases. Michelle Craig Associate Professor, Teaching Stream Computing for Medicine (C4M) Seminar 3: Databases Michelle Craig Associate Professor, Teaching Stream mcraig@cs.toronto.edu Relational Model The relational model is based on the concept of a relation or

More information

NULLs & Outer Joins. Objectives of the Lecture :

NULLs & Outer Joins. Objectives of the Lecture : Slide 1 NULLs & Outer Joins Objectives of the Lecture : To consider the use of NULLs in SQL. To consider Outer Join Operations, and their implementation in SQL. Slide 2 Missing Values : Possible Strategies

More information

12. MS Access Tables, Relationships, and Queries

12. MS Access Tables, Relationships, and Queries 12. MS Access Tables, Relationships, and Queries 12.1 Creating Tables and Relationships Suppose we want to build a database to hold the information for computers (also refer to parts in the text) and suppliers

More information

Lecture Query evaluation. Combining operators. Logical query optimization. By Marina Barsky Winter 2016, University of Toronto

Lecture Query evaluation. Combining operators. Logical query optimization. By Marina Barsky Winter 2016, University of Toronto Lecture 02.03. Query evaluation Combining operators. Logical query optimization By Marina Barsky Winter 2016, University of Toronto Quick recap: Relational Algebra Operators Core operators: Selection σ

More information

Lecture 8. Database Management and Queries

Lecture 8. Database Management and Queries Lecture 8 Database Management and Queries Lecture 8: Outline I. Database Components II. Database Structures A. Conceptual, Logical, and Physical Components III. Non-Relational Databases A. Flat File B.

More information

CSE 344 MAY 7 TH EXAM REVIEW

CSE 344 MAY 7 TH EXAM REVIEW CSE 344 MAY 7 TH EXAM REVIEW EXAMINATION STATIONS Exam Wednesday 9:30-10:20 One sheet of notes, front and back Practice solutions out after class Good luck! EXAM LENGTH Production v. Verification Practice

More information

Database Heterogeneity

Database Heterogeneity Database Heterogeneity Lecture 13 1 Outline Database Integration Wrappers Mediators Integration Conflicts 2 1 1. Database Integration Goal: providing a uniform access to multiple heterogeneous information

More information

[AVNICF-MCSASQL2012]: NICF - Microsoft Certified Solutions Associate (MCSA): SQL Server 2012

[AVNICF-MCSASQL2012]: NICF - Microsoft Certified Solutions Associate (MCSA): SQL Server 2012 [AVNICF-MCSASQL2012]: NICF - Microsoft Certified Solutions Associate (MCSA): SQL Server 2012 Length Delivery Method : 5 Days : Instructor-led (Classroom) Course Overview Participants will learn technical

More information

Lecture 3 SQL. Shuigeng Zhou. September 23, 2008 School of Computer Science Fudan University

Lecture 3 SQL. Shuigeng Zhou. September 23, 2008 School of Computer Science Fudan University Lecture 3 SQL Shuigeng Zhou September 23, 2008 School of Computer Science Fudan University Outline Basic Structure Set Operations Aggregate Functions Null Values Nested Subqueries Derived Relations Views

More information

Data Cleansing. LIU Jingyuan, Vislab WANG Yilei, Theoretical group

Data Cleansing. LIU Jingyuan, Vislab WANG Yilei, Theoretical group Data Cleansing LIU Jingyuan, Vislab WANG Yilei, Theoretical group What is Data Cleansing Data cleansing (data cleaning) is the process of detecting and correcting (or removing) errors or inconsistencies

More information

Data Manipulation Language (DML)

Data Manipulation Language (DML) In the name of Allah Islamic University of Gaza Faculty of Engineering Computer Engineering Department ECOM 4113 DataBase Lab Lab # 3 Data Manipulation Language (DML) El-masry 2013 Objective To be familiar

More information

Relational Database Management Systems for Epidemiologists: SQL Part I

Relational Database Management Systems for Epidemiologists: SQL Part I Relational Database Management Systems for Epidemiologists: SQL Part I Outline SQL Basics Retrieving Data from a Table Operators and Functions What is SQL? SQL is the standard programming language to create,

More information

Database Technology Introduction. Heiko Paulheim

Database Technology Introduction. Heiko Paulheim Database Technology Introduction Outline The Need for Databases Data Models Relational Databases Database Design Storage Manager Query Processing Transaction Manager Introduction to the Relational Model

More information

Querying Microsoft SQL Server

Querying Microsoft SQL Server Course Code: M20461 Vendor: Microsoft Course Overview Duration: 5 RRP: POA Querying Microsoft SQL Server Overview This 5-day instructor led course provides delegates with the technical skills required

More information

3/3/2008. Announcements. A Table with a View (continued) Fields (Attributes) and Primary Keys. Video. Keys Primary & Foreign Primary/Foreign Key

3/3/2008. Announcements. A Table with a View (continued) Fields (Attributes) and Primary Keys. Video. Keys Primary & Foreign Primary/Foreign Key Announcements Quiz will cover chapter 16 in Fluency Nothing in QuickStart Read Chapter 17 for Wednesday Project 3 3A due Friday before 11pm 3B due Monday, March 17 before 11pm A Table with a View (continued)

More information

Chapter 3B Objectives. Relational Set Operators. Relational Set Operators. Relational Algebra Operations

Chapter 3B Objectives. Relational Set Operators. Relational Set Operators. Relational Algebra Operations Chapter 3B Objectives Relational Set Operators Learn About relational database operators SELECT & DIFFERENCE PROJECT & JOIN UNION PRODUCT INTERSECT DIVIDE The Database Meta Objects the data dictionary

More information

Advance Database Management System

Advance Database Management System Advance Database Management System Conceptual Design Lecture- A simplified database design process Database Requirements UoD Requirements Collection and Analysis Functional Requirements A simplified database

More information

Data Fusion and Peer Data Management

Data Fusion and Peer Data Management Data Fusion and Peer Data Management Almaden, 13. January 2006 Felix Naumann naumann@informatik.hu-berlin.de Humboldt-Universität zu Berlin Humboldt-Universität zu Berlin VLDB 2003 19. Januar 2005 Armin

More information

Querying Microsoft SQL Server 2014

Querying Microsoft SQL Server 2014 Querying Microsoft SQL Server 2014 Course: 20461 Course Details Audience(s): IT Professional(s) Technology: Microsoft SQL Server 2014 Duration: 40 Hours ABOUT THIS COURSE This forty hours of instructor-led

More information

Applied Databases. Sebastian Maneth. Lecture 5 ER Model, normal forms. University of Edinburgh - January 25 th, 2016

Applied Databases. Sebastian Maneth. Lecture 5 ER Model, normal forms. University of Edinburgh - January 25 th, 2016 Applied Databases Lecture 5 ER Model, normal forms Sebastian Maneth University of Edinburgh - January 25 th, 2016 Outline 2 1. Entity Relationship Model 2. Normal Forms Keys and Superkeys 3 Superkey =

More information

AVANTUS TRAINING PTE LTD

AVANTUS TRAINING PTE LTD [MS20461]: Querying Microsoft SQL Server 2014 Length : 5 Days Audience(s) : IT Professionals Level : 300 Technology : SQL Server Delivery Method : Instructor-led (Classroom) Course Overview This 5-day

More information

Chapter 6 - Part II The Relational Algebra and Calculus

Chapter 6 - Part II The Relational Algebra and Calculus Chapter 6 - Part II The Relational Algebra and Calculus Copyright 2004 Ramez Elmasri and Shamkant Navathe Division operation DIVISION Operation The division operation is applied to two relations R(Z) S(X),

More information

20461: Querying Microsoft SQL Server 2014 Databases

20461: Querying Microsoft SQL Server 2014 Databases Course Outline 20461: Querying Microsoft SQL Server 2014 Databases Module 1: Introduction to Microsoft SQL Server 2014 This module introduces the SQL Server platform and major tools. It discusses editions,

More information

ETL and OLAP Systems

ETL and OLAP Systems ETL and OLAP Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, first semester

More information

HumMer and the three steps of information integration

HumMer and the three steps of information integration HumMer and the three steps of information integration Modena, 26th November 2005 Felix Naumann Jens Bleiholder naumann@informatik.hu-berlin.de Humboldt-Universität zu Berlin Humboldt-Universität zu Berlin

More information

MANAGING DATA(BASES) USING SQL (NON-PROCEDURAL SQL, X401.9)

MANAGING DATA(BASES) USING SQL (NON-PROCEDURAL SQL, X401.9) Technology & Information Management Instructor: Michael Kremer, Ph.D. Class 6 Professional Program: Data Administration and Management MANAGING DATA(BASES) USING SQL (NON-PROCEDURAL SQL, X401.9) AGENDA

More information

Querying Microsoft SQL Server 2008/2012

Querying Microsoft SQL Server 2008/2012 Querying Microsoft SQL Server 2008/2012 Course 10774A 5 Days Instructor-led, Hands-on Introduction This 5-day instructor led course provides students with the technical skills required to write basic Transact-SQL

More information

Preprocessing Short Lecture Notes cse352. Professor Anita Wasilewska

Preprocessing Short Lecture Notes cse352. Professor Anita Wasilewska Preprocessing Short Lecture Notes cse352 Professor Anita Wasilewska Data Preprocessing Why preprocess the data? Data cleaning Data integration and transformation Data reduction Discretization and concept

More information

20461: Querying Microsoft SQL Server

20461: Querying Microsoft SQL Server 20461: Querying Microsoft SQL Server Length: 5 days Audience: IT Professionals Level: 300 OVERVIEW This 5 day instructor led course provides students with the technical skills required to write basic Transact

More information

Ian Kenny. November 28, 2017

Ian Kenny. November 28, 2017 Ian Kenny November 28, 2017 Introductory Databases Relational Algebra Introduction In this lecture we will cover Relational Algebra. Relational Algebra is the foundation upon which SQL is built and is

More information

Chapter 3: Introduction to SQL. Chapter 3: Introduction to SQL

Chapter 3: Introduction to SQL. Chapter 3: Introduction to SQL Chapter 3: Introduction to SQL Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 3: Introduction to SQL Overview of The SQL Query Language Data Definition Basic Query

More information

Silberschatz, Korth and Sudarshan See for conditions on re-use

Silberschatz, Korth and Sudarshan See   for conditions on re-use Chapter 3: SQL Database System Concepts, 5th Ed. See www.db-book.com for conditions on re-use Chapter 3: SQL Data Definition Basic Query Structure Set Operations Aggregate Functions Null Values Nested

More information

Summary of Last Chapter. Course Content. Chapter 3 Objectives. Chapter 3: Data Preprocessing. Dr. Osmar R. Zaïane. University of Alberta 4

Summary of Last Chapter. Course Content. Chapter 3 Objectives. Chapter 3: Data Preprocessing. Dr. Osmar R. Zaïane. University of Alberta 4 Principles of Knowledge Discovery in Data Fall 2004 Chapter 3: Data Preprocessing Dr. Osmar R. Zaïane University of Alberta Summary of Last Chapter What is a data warehouse and what is it for? What is

More information

FedDW Global Schema Architect

FedDW Global Schema Architect UML based Design Tool for the Integration of Data Mart Schemas Dr. Stefan Berger Department of Business Informatics Data & Knowledge Engineering Johannes Kepler University Linz ACM 15 th DOLAP 12 November

More information

INDEX. 1 Basic SQL Statements. 2 Restricting and Sorting Data. 3 Single Row Functions. 4 Displaying data from multiple tables

INDEX. 1 Basic SQL Statements. 2 Restricting and Sorting Data. 3 Single Row Functions. 4 Displaying data from multiple tables INDEX Exercise No Title 1 Basic SQL Statements 2 Restricting and Sorting Data 3 Single Row Functions 4 Displaying data from multiple tables 5 Creating and Managing Tables 6 Including Constraints 7 Manipulating

More information

COURSE OUTLINE MOC 20461: QUERYING MICROSOFT SQL SERVER 2014

COURSE OUTLINE MOC 20461: QUERYING MICROSOFT SQL SERVER 2014 COURSE OUTLINE MOC 20461: QUERYING MICROSOFT SQL SERVER 2014 MODULE 1: INTRODUCTION TO MICROSOFT SQL SERVER 2014 This module introduces the SQL Server platform and major tools. It discusses editions, versions,

More information

Set theory is a branch of mathematics that studies sets. Sets are a collection of objects.

Set theory is a branch of mathematics that studies sets. Sets are a collection of objects. Set Theory Set theory is a branch of mathematics that studies sets. Sets are a collection of objects. Often, all members of a set have similar properties, such as odd numbers less than 10 or students in

More information

SQL Data Query Language

SQL Data Query Language SQL Data Query Language André Restivo 1 / 68 Index Introduction Selecting Data Choosing Columns Filtering Rows Set Operators Joining Tables Aggregating Data Sorting Rows Limiting Data Text Operators Nested

More information

Data Mining. Data preprocessing. Hamid Beigy. Sharif University of Technology. Fall 1395

Data Mining. Data preprocessing. Hamid Beigy. Sharif University of Technology. Fall 1395 Data Mining Data preprocessing Hamid Beigy Sharif University of Technology Fall 1395 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1395 1 / 15 Table of contents 1 Introduction 2 Data preprocessing

More information

SQL: Data Manipulation Language. csc343, Introduction to Databases Diane Horton Winter 2017

SQL: Data Manipulation Language. csc343, Introduction to Databases Diane Horton Winter 2017 SQL: Data Manipulation Language csc343, Introduction to Databases Diane Horton Winter 2017 Introduction So far, we have defined database schemas and queries mathematically. SQL is a formal language for

More information

SQL: csc343, Introduction to Databases Renée J. Miller and Fatemeh Nargesian and Sina Sina Meraji. Winter 2018

SQL: csc343, Introduction to Databases Renée J. Miller and Fatemeh Nargesian and Sina Sina Meraji. Winter 2018 SQL: csc343, Introduction to Databases Renée J. Miller and Fatemeh Nargesian and Sina Sina Meraji Winter 2018 Introduction So far, we have defined database schemas and queries mathematically. SQL is a

More information

Introduction to Computer Science and Business

Introduction to Computer Science and Business Introduction to Computer Science and Business This is the second portion of the Database Design and Programming with SQL course. In this portion, students implement their database design by creating a

More information

Schema Integration Methodologies for Multidatabases and the Relational Integration Model - Candidacy document

Schema Integration Methodologies for Multidatabases and the Relational Integration Model - Candidacy document Schema Integration Methodologies for Multidatabases and the Relational Integration Model - Candidacy document Ramon Lawrence Department of Computer Science University of Manitoba umlawren@cs.umanitoba.ca

More information

Relational model continued. Understanding how to use the relational model. Summary of board example: with Copies as weak entity

Relational model continued. Understanding how to use the relational model. Summary of board example: with Copies as weak entity COS 597A: Principles of Database and Information Systems Relational model continued Understanding how to use the relational model 1 with as weak entity folded into folded into branches: (br_, librarian,

More information

Data Warehousing. Jens Teubner, TU Dortmund Summer Jens Teubner Data Warehousing Summer

Data Warehousing. Jens Teubner, TU Dortmund Summer Jens Teubner Data Warehousing Summer Jens Teubner Data Warehousing Summer 2018 1 Data Warehousing Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Summer 2018 Jens Teubner Data Warehousing Summer 2018 160 Part VI ETL Process ETL Overview

More information

Introduction to Data Management CSE 344. Lectures 8: Relational Algebra

Introduction to Data Management CSE 344. Lectures 8: Relational Algebra Introduction to Data Management CSE 344 Lectures 8: Relational Algebra CSE 344 - Winter 2017 1 Announcements Homework 3 is posted Microsoft Azure Cloud services! Use the promotion code you received Due

More information

Overview Relational data model

Overview Relational data model Thanks to José and Vaida for most of the slides. Relational databases and MySQL Juha Takkinen juhta@ida.liu.se Outline 1. Introduction: Relational data model and SQL 2. Creating tables in Mysql 3. Simple

More information

Introduction to Data Management CSE 344. Lectures 8: Relational Algebra

Introduction to Data Management CSE 344. Lectures 8: Relational Algebra Introduction to Data Management CSE 344 Lectures 8: Relational Algebra CSE 344 - Winter 2016 1 Announcements Homework 3 is posted Microsoft Azure Cloud services! Use the promotion code you received Due

More information

UFCEKG 20 2 : Data, Schemas and Applications

UFCEKG 20 2 : Data, Schemas and Applications Lecture 11 UFCEKG 20 2 : Data, Schemas and Applications Lecture 11 Database Theory & Practice (5) : Introduction to the Structured Query Language (SQL) Origins & history Early 1970 s IBM develops Sequel

More information

Querying Microsoft SQL Server

Querying Microsoft SQL Server Querying Microsoft SQL Server Course 20461D 5 Days Instructor-led, Hands-on Course Description This 5-day instructor led course is designed for customers who are interested in learning SQL Server 2012,

More information

DATABASE TECHNOLOGY - 1MB025

DATABASE TECHNOLOGY - 1MB025 1 DATABASE TECHNOLOGY - 1MB025 Fall 2004 An introductory course on database systems http://user.it.uu.se/~udbl/dbt-ht2004/ alt. http://www.it.uu.se/edu/course/homepage/dbastekn/ht04/ Kjell Orsborn Uppsala

More information

Unit Assessment Guide

Unit Assessment Guide Unit Assessment Guide Unit Details Unit code Unit name Unit purpose/application ICTWEB425 Apply structured query language to extract and manipulate data This unit describes the skills and knowledge required

More information

INTERMEDIATE SQL GOING BEYOND THE SELECT. Created by Brian Duffey

INTERMEDIATE SQL GOING BEYOND THE SELECT. Created by Brian Duffey INTERMEDIATE SQL GOING BEYOND THE SELECT Created by Brian Duffey WHO I AM Brian Duffey 3 years consultant at michaels, ross, and cole 9+ years SQL user What have I used SQL for? ROADMAP Introduction 1.

More information

Index. Bitmap Heap Scan, 156 Bitmap Index Scan, 156. Rahul Batra 2018 R. Batra, SQL Primer,

Index. Bitmap Heap Scan, 156 Bitmap Index Scan, 156. Rahul Batra 2018 R. Batra, SQL Primer, A Access control, 165 granting privileges to users general syntax, GRANT, 170 multiple privileges, 171 PostgreSQL, 166 169 relational databases, 165 REVOKE command, 172 173 SQLite, 166 Aggregate functions

More information

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 3. Chapter 3: Data Preprocessing. Major Tasks in Data Preprocessing

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 3. Chapter 3: Data Preprocessing. Major Tasks in Data Preprocessing Data Mining: Concepts and Techniques (3 rd ed.) Chapter 3 1 Chapter 3: Data Preprocessing Data Preprocessing: An Overview Data Quality Major Tasks in Data Preprocessing Data Cleaning Data Integration Data

More information

CMPT 354: Database System I. Lecture 5. Relational Algebra

CMPT 354: Database System I. Lecture 5. Relational Algebra CMPT 354: Database System I Lecture 5. Relational Algebra 1 What have we learned Lec 1. DatabaseHistory Lec 2. Relational Model Lec 3-4. SQL 2 Why Relational Algebra matter? An essential topic to understand

More information

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 3

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 3 Data Mining: Concepts and Techniques (3 rd ed.) Chapter 3 Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign & Simon Fraser University 2011 Han, Kamber & Pei. All rights

More information

SQL. CS 564- Fall ACKs: Dan Suciu, Jignesh Patel, AnHai Doan

SQL. CS 564- Fall ACKs: Dan Suciu, Jignesh Patel, AnHai Doan SQL CS 564- Fall 2015 ACKs: Dan Suciu, Jignesh Patel, AnHai Doan MOTIVATION The most widely used database language Used to query and manipulate data SQL stands for Structured Query Language many SQL standards:

More information

DATABASTEKNIK - 1DL116

DATABASTEKNIK - 1DL116 1 DATABASTEKNIK - 1DL116 Spring 2004 An introductury course on database systems http://user.it.uu.se/~udbl/dbt-vt2004/ Kjell Orsborn Uppsala Database Laboratory Department of Information Technology, Uppsala

More information

DSE 203 DAY 1: REVIEW OF DBMS CONCEPTS

DSE 203 DAY 1: REVIEW OF DBMS CONCEPTS DSE 203 DAY 1: REVIEW OF DBMS CONCEPTS Data Models A specification that precisely defines The structure of the data The fundamental operations on the data The logical language to specify queries on the

More information

Data Mining. Data preprocessing. Hamid Beigy. Sharif University of Technology. Fall 1394

Data Mining. Data preprocessing. Hamid Beigy. Sharif University of Technology. Fall 1394 Data Mining Data preprocessing Hamid Beigy Sharif University of Technology Fall 1394 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1394 1 / 15 Table of contents 1 Introduction 2 Data preprocessing

More information

Oracle Database: SQL and PL/SQL Fundamentals NEW

Oracle Database: SQL and PL/SQL Fundamentals NEW Oracle Database: SQL and PL/SQL Fundamentals NEW Duration: 5 Days What you will learn This Oracle Database: SQL and PL/SQL Fundamentals training delivers the fundamentals of SQL and PL/SQL along with the

More information

Relational Model, Relational Algebra, and SQL

Relational Model, Relational Algebra, and SQL Relational Model, Relational Algebra, and SQL August 29, 2007 1 Relational Model Data model. constraints. Set of conceptual tools for describing of data, data semantics, data relationships, and data integrity

More information

DIRA : A FRAMEWORK OF DATA INTEGRATION USING DATA QUALITY

DIRA : A FRAMEWORK OF DATA INTEGRATION USING DATA QUALITY DIRA : A FRAMEWORK OF DATA INTEGRATION USING DATA QUALITY Reham I. Abdel Monem 1, Ali H. El-Bastawissy 2 and Mohamed M. Elwakil 3 1 Information Systems Department, Faculty of computers and information,

More information

"Charting the Course to Your Success!" MOC D Querying Microsoft SQL Server Course Summary

Charting the Course to Your Success! MOC D Querying Microsoft SQL Server Course Summary Course Summary Description This 5-day instructor led course provides students with the technical skills required to write basic Transact-SQL queries for Microsoft SQL Server 2014. This course is the foundation

More information

Chapter 3: SQL. Database System Concepts, 5th Ed. Silberschatz, Korth and Sudarshan See for conditions on re-use

Chapter 3: SQL. Database System Concepts, 5th Ed. Silberschatz, Korth and Sudarshan See  for conditions on re-use Chapter 3: SQL Database System Concepts, 5th Ed. See www.db-book.com for conditions on re-use Chapter 3: SQL Data Definition Basic Query Structure Set Operations Aggregate Functions Null Values Nested

More information

Oracle Database: SQL and PL/SQL Fundamentals Ed 2

Oracle Database: SQL and PL/SQL Fundamentals Ed 2 Oracle University Contact Us: Local: 1800 103 4775 Intl: +91 80 67863102 Oracle Database: SQL and PL/SQL Fundamentals Ed 2 Duration: 5 Days What you will learn This Oracle Database: SQL and PL/SQL Fundamentals

More information

The Extended Algebra. Duplicate Elimination. Sorting. Example: Duplicate Elimination

The Extended Algebra. Duplicate Elimination. Sorting. Example: Duplicate Elimination The Extended Algebra Duplicate Elimination 2 δ = eliminate duplicates from bags. τ = sort tuples. γ = grouping and aggregation. Outerjoin : avoids dangling tuples = tuples that do not join with anything.

More information

Relational Databases

Relational Databases Relational Databases Jan Chomicki University at Buffalo Jan Chomicki () Relational databases 1 / 49 Plan of the course 1 Relational databases 2 Relational database design 3 Conceptual database design 4

More information

Scaling Access to Heterogeneous Data Sources with DISCO

Scaling Access to Heterogeneous Data Sources with DISCO 808 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 10, NO. 5, SEPTEMBER/OCTOBER 1998 Scaling Access to Heterogeneous Data Sources with DISCO Anthony Tomasic, Louiqa Raschid, Member, IEEE, and

More information

20761 Querying Data with Transact SQL

20761 Querying Data with Transact SQL Course Overview The main purpose of this course is to give students a good understanding of the Transact-SQL language which is used by all SQL Server-related disciplines; namely, Database Administration,

More information

Chapter 3: SQL. Chapter 3: SQL

Chapter 3: SQL. Chapter 3: SQL Chapter 3: SQL Database System Concepts, 5th Ed. See www.db-book.com for conditions on re-use Chapter 3: SQL Data Definition Basic Query Structure Set Operations Aggregate Functions Null Values Nested

More information

Relational Model History. COSC 304 Introduction to Database Systems. Relational Model and Algebra. Relational Model Definitions.

Relational Model History. COSC 304 Introduction to Database Systems. Relational Model and Algebra. Relational Model Definitions. COSC 304 Introduction to Database Systems Relational Model and Algebra Dr. Ramon Lawrence University of British Columbia Okanagan ramon.lawrence@ubc.ca Relational Model History The relational model was

More information

DATABASE TECHNOLOGY - 1MB025

DATABASE TECHNOLOGY - 1MB025 1 DATABASE TECHNOLOGY - 1MB025 Fall 2005 An introductury course on database systems http://user.it.uu.se/~udbl/dbt-ht2005/ alt. http://www.it.uu.se/edu/course/homepage/dbastekn/ht05/ Kjell Orsborn Uppsala

More information

Chapter 3: Introduction to SQL

Chapter 3: Introduction to SQL Chapter 3: Introduction to SQL Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 3: Introduction to SQL Overview of the SQL Query Language Data Definition Basic Query

More information

Simple SQL Queries (2)

Simple SQL Queries (2) Simple SQL Queries (2) Review SQL the structured query language for relational databases DDL: data definition language DML: data manipulation language Create and maintain tables CMPT 354: Database I --

More information

Querying Microsoft SQL Server 2012/2014

Querying Microsoft SQL Server 2012/2014 Page 1 of 14 Overview This 5-day instructor led course provides students with the technical skills required to write basic Transact-SQL queries for Microsoft SQL Server 2014. This course is the foundation

More information

Querying Microsoft SQL Server

Querying Microsoft SQL Server 20461 - Querying Microsoft SQL Server Duration: 5 Days Course Price: $2,975 Software Assurance Eligible Course Description About this course This 5-day instructor led course provides students with the

More information

Relational Algebra and SQL

Relational Algebra and SQL Relational Algebra and SQL Computer Science E-66 Harvard University David G. Sullivan, Ph.D. Example Domain: a University We ll use relations from a university database. four relations that store info.

More information

Data Preprocessing. Slides by: Shree Jaswal

Data Preprocessing. Slides by: Shree Jaswal Data Preprocessing Slides by: Shree Jaswal Topics to be covered Why Preprocessing? Data Cleaning; Data Integration; Data Reduction: Attribute subset selection, Histograms, Clustering and Sampling; Data

More information

itrails: Pay-as-you-go Information Integration in Dataspaces

itrails: Pay-as-you-go Information Integration in Dataspaces itrails: Pay-as-you-go Information Integration in Dataspaces Marcos Vaz Salles Jens Dittrich Shant Karakashian Olivier Girard Lukas Blunschi ETH Zurich VLDB 2007 Outline Motivation itrails Experiments

More information

Querying Data with Transact SQL

Querying Data with Transact SQL Course 20761A: Querying Data with Transact SQL Course details Course Outline Module 1: Introduction to Microsoft SQL Server 2016 This module introduces SQL Server, the versions of SQL Server, including

More information

Course Modules for MCSA: SQL Server 2016 Database Development Training & Certification Course:

Course Modules for MCSA: SQL Server 2016 Database Development Training & Certification Course: Course Modules for MCSA: SQL Server 2016 Database Development Training & Certification Course: 20762C Developing SQL 2016 Databases Module 1: An Introduction to Database Development Introduction to the

More information

Oracle Database 11g: SQL and PL/SQL Fundamentals

Oracle Database 11g: SQL and PL/SQL Fundamentals Oracle University Contact Us: +33 (0) 1 57 60 20 81 Oracle Database 11g: SQL and PL/SQL Fundamentals Duration: 5 Days What you will learn In this course, students learn the fundamentals of SQL and PL/SQL

More information