Introduction Data Integration Summary. Data Integration. COCS 6421 Advanced Database Systems. Przemyslaw Pawluk. CSE, York University.
|
|
- Janice Allison
- 5 years ago
- Views:
Transcription
1 COCS 6421 Advanced Database Systems CSE, York University March 20, 2008
2 Agenda 1 Problem description Problems 2 3 Open questions and future work Conclusion Bibliography
3 Problem description Problems
4 Why to do it? Problem description Problems Many data sources and applications in one organization
5 Why to do it? Problem description Problems Many data sources and applications in one organization The need of fast access and understanding of information from different sources.
6 Why to do it? Problem description Problems Many data sources and applications in one organization The need of fast access and understanding of information from different sources. Elimination of bottlenecks in the data-flow (many points where data is transformed).
7 Goals of data integration Problem description Problems To answer queries in most efficient way. (When we know exactly the queries we want to answer and what data are available) To discover the knowledge. (When don t know the queries in advance or we don t know all data sources in advance) In both we need to analyze data and present results according to requirements of users.
8 Goals of data integration Problem description Problems To answer queries in most efficient way. (When we know exactly the queries we want to answer and what data are available) To discover the knowledge. (When don t know the queries in advance or we don t know all data sources in advance) In both we need to analyze data and present results according to requirements of users.
9 Goals of data integration Problem description Problems To answer queries in most efficient way. (When we know exactly the queries we want to answer and what data are available) To discover the knowledge. (When don t know the queries in advance or we don t know all data sources in advance) In both we need to analyze data and present results according to requirements of users.
10 Problems in data integration Problem description Problems Identification of the best data source How to get data from the source? Interface to integrate schemes Formulation of queries to different data sources Problems with data incomplete and incorrect data inconsistent data incomprehensible data duplicates different formats Security issues Quality of Data Quality of Service reliability efficiency...
11 Problems with data Example Problem description Problems Many data formats i.e. John Brown, Brown J., Brown John
12 Problems with data Example Problem description Problems Many data formats i.e. John Brown, Brown J., Brown John Two different phone no. or addresses where integrity constraint allows only one
13 Problems with data Example Problem description Problems Many data formats i.e. John Brown, Brown J., Brown John Two different phone no. or addresses where integrity constraint allows only one Zip codes, phone numbers, SIN etc. with or without
14 Problems with data Example Problem description Problems Many data formats i.e. John Brown, Brown J., Brown John Two different phone no. or addresses where integrity constraint allows only one Zip codes, phone numbers, SIN etc. with or without Bad news! AI cannot solve all problems. Knowledge of meaning of the data is required!
15
16 Integration Process
17 Elements of the integration process Understanding meta-data analysis discover inconsistencies and dependencies
18 Elements of the integration process Understanding meta-data analysis discover inconsistencies and dependencies Standardization target schema data repair (to achieve consistent data) data identification constraints (How to identify object?)
19 Elements of the integration process Understanding meta-data analysis discover inconsistencies and dependencies Standardization target schema data repair (to achieve consistent data) data identification constraints (How to identify object?) Specification build mapping(s)
20 Elements of the integration process Understanding meta-data analysis discover inconsistencies and dependencies Standardization target schema data repair (to achieve consistent data) data identification constraints (How to identify object?) Specification build mapping(s) Execution materialization (ETL) federation (data exchange or query reformulation) indexing
21 Techniques
22 Materialization DS1 DS2 ODS DS3 The information pipeline[1]
23 Materialization DS1 DS2 ODS ETL DS3 The information pipeline[1]
24 Materialization DS1 DS2 ODS ETL CDB DS3 The information pipeline[1]
25 Materialization DS1 DM1 DS2 ODS ETL CDB DM2 DS3 DM3 The information pipeline[1]
26 Materialization DS1 DM1 DS2 ODS ETL CDB DM2 Present. DS3 DM3 The information pipeline[1]
27 Federation DS1 DS2 DS3
28 Federation Query processor DS1 DS2 DS3
29 Federation Presentation Query processor DS1 DS2 DS3
30 Federation Presentation Query processor Do not materialize data Usually use query reformulation or unfolding DS1 DS2 DS3
31 Indexing XML1 XML2 XML3
32 Indexing Index XML1 XML2 XML3
33 Indexing Presentation Query processor Index XML1 XML2 XML3
34 Indexing Presentation Query processor Index Do not materialize data Use index to find proper document XML1 XML2 XML3
35
36 Views
37 Views in data integration Architecture based on global schema (GS) and set of sources Data in the data sources (DS) Global schema as virtual view of DS Each DS and GS can be expressed in a different language The goal is to define some mappings between DSs and GS Two approaches: Global as view Local as view
38 Global as view Global schema expressed in terms of data sources. Each element of the global schema is defined as a view over the sources.
39 Global as view Global schema expressed in terms of data sources. Each element of the global schema is defined as a view over the sources. Requirement! Stable set of data sources!
40 GAV benefits The benefits of GAV: Easier to understand Mapping explicitly tells how to retrieve data from DSs
41 Local as view Global schema defined independently from data-sources, and the relationships between the global schema and the sources are expressed by defining every source as a view over the global schema.
42 Local as view Global schema defined independently from data-sources, and the relationships between the global schema and the sources are expressed by defining every source as a view over the global schema. Requirement! Stable global schema!
43 LAV benefits The immediate benefits of LAV: Easier describing sources(doesn t require any knowledge about other DS) New DS can be easily added Describing precise constraints on the contents of DS and describing sources that have different relational structure than the GS is easier
44 GAV vs. LAV The LAV system can be transformed into a GAV one and vice-versa.
45 GAV vs. LAV The LAV system can be transformed into a GAV one and vice-versa. LAV to GAV can be used to derive the procedural specification from declarative one.
46 GAV vs. LAV The LAV system can be transformed into a GAV one and vice-versa. LAV to GAV can be used to derive the procedural specification from declarative one. GAV to LAV can be useful to derive a declarative characterization of the content of the sources starting from procedural specification.
47 Answering queries using views Depends on chosen technique GAV gives simple solution how to get the data. Just unfold! LAV uses view-based query rewriting or view-based query answering. More complicated than GAV, uses reasoning in the presence of incomplete information
48 Schema mapping
49 Schema mapping Data exchange How to move data from one schema to the other one? Data integration How to integrate data from many different schemes?
50 Schema mapping in data exchange problem
51 LAV and GAV Example In general elements of the set Σ can have a form φ(x) yψ(x, y) For example (Student(s) Enrolls(s, c)) t g(teaches(t, c) Grade(s, c, g)) In LAV there is only one element on the left side P(x) yψ(x, y) In GAV there is only one element on the right side φ(x) R(x)
52 Schema composing
53 Schema inverting in schema evolution
54 Schema mapping Example of changes in schemas Why evolution? new atribute add/remove constraint... Example new information about student or course required changes in the grading system...
55
56 Example banking system Our point of view: CRM system Questions: The purpose of the system The source/sources of data for the system?...
57 Example Banking systems Customers DB CRM System
58 Example Banking systems Customers DB CRM System Cards DB WebSys. Data Account DB Contracts
59 Example Banking systems Different formats and accuracy of data Different Database Management Systems Different owners (security levels)...
60 What is Master Data? Facts describing core business entities Provides business context Does not imply a particular style of usage Not everything is Master Data...
61 What Is Master Data Management? An approach that decouples master information from individual applications and unifies it. Central application- and process-neutral resource Ensures consistent, up-to-date master information across business processes, and systems Simplifies ongoing integration tasks and new application development
62 Problems...
63 Proposed solution Master Data Management brakes walls between the systems and provides fast access to the consistent information.
64 Open questions and future work Conclusion Bibliography
65 Open questions Open questions and future work Conclusion Bibliography Automatic schema mapping Reference reconciling Model management Peer-to-Peer Data Management
66 Conclusion Open questions and future work Conclusion Bibliography Data integration theoretical as well as business problem Some commercial solutions are available but not sufficient Many issues without solution = reach research field
67 Open questions and future work Conclusion Bibliography Bibliography
68 Open questions and future work Conclusion Bibliography Brazhnik, O., and Jones, J. F. Anatomy of data integration. J. of Biomedical Informatics 40, 3 (2007), Fagin, R., Kolaitis, P. G., Tan, W.-C., and Popa, L. Composing schema mappings: second-order dependencies to the rescue. In PODS 04: Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (New York, NY, USA, 2004), ACM, pp Haas, L. M. Beauty and the beast: The theory and practice of information integration. In ICDT (2007), T. Schwentick and D. Suciu, Eds., vol of Lecture Notes in Computer Science, Springer, pp
69 Open questions and future work Conclusion Bibliography Halevy, A., Rajaraman, A., and Ordille, J. Data integration: the teenage years. In VLDB 06: Proceedings of the 32nd international conference on Very large data bases (2006), VLDB Endowment, pp Halevy, A. Y. Answering queries using views: A survey. The VLDB Journal 10, 4 (2001), Kolaitis, P. G. Schema mappings, data exchange, and metadata management. In PODS 05: Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (New York, NY, USA, 2005), ACM, pp Lenzerini, M.
70 Open questions and future work Conclusion Bibliography Data integration: a theoretical perspective. In PODS 02: Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (New York, NY, USA, 2002), ACM, pp Pankowski, T. Integracja danych w teorii i praktyce przeglad problemow i rozwiazan. In Bazy Danych: Nowe Technologie (2007), WK, pp Ra, Y.-G., and Rundensteiner, E. A. A transparent schema-evolution system based on object-oriented view technology. IEEE Transactions on Knowledge and Data Engineering 9, 4 (1997), Shahri, H. H., and Shahri, S. H.
71 Open questions and future work Conclusion Bibliography Eliminating duplicates in information integration: An adaptive, extensible framework. IEEE Intelligent Systems 21, 5 (2006),
ISENS: A System for Information Integration, Exploration, and Querying of Multi-Ontology Data Sources
ISENS: A System for Information Integration, Exploration, and Querying of Multi-Ontology Data Sources Dimitre A. Dimitrov, Roopa Pundaleeka Tech-X Corp. Boulder, CO 80303, USA Email: {dad, roopa}@txcorp.com
More informationData Integration: Schema Mapping
Data Integration: Schema Mapping Jan Chomicki University at Buffalo and Warsaw University March 8, 2007 Jan Chomicki (UB/UW) Data Integration: Schema Mapping March 8, 2007 1 / 13 Data integration Data
More informationData Integration: Schema Mapping
Data Integration: Schema Mapping Jan Chomicki University at Buffalo and Warsaw University March 8, 2007 Jan Chomicki (UB/UW) Data Integration: Schema Mapping March 8, 2007 1 / 13 Data integration Jan Chomicki
More informationOntology-Based Schema Integration
Ontology-Based Schema Integration Zdeňka Linková Institute of Computer Science, Academy of Sciences of the Czech Republic Pod Vodárenskou věží 2, 182 07 Prague 8, Czech Republic linkova@cs.cas.cz Department
More informationINCONSISTENT DATABASES
INCONSISTENT DATABASES Leopoldo Bertossi Carleton University, http://www.scs.carleton.ca/ bertossi SYNONYMS None DEFINITION An inconsistent database is a database instance that does not satisfy those integrity
More informationKanata: Adaptation and Evolution in Data Sharing Systems
Kanata: Adaptation and Evolution in Data Sharing Systems Periklis Andritsos Ariel Fuxman Anastasios Kementsietsidis Renée J. Miller Yannis Velegrakis Department of Computer Science University of Toronto
More informationA Mapping Approach for Fully Virtual Data Integration System Processes
A Mapping Approach for Fully Virtual Data Integration System Processes Ali Z. El Qutaany 1 PhD Student, Faculty of Computers and Information, Cairo University Cairo, Egypt Osman M. Hegazi 2 Professor,
More informationData integration lecture 2
PhD course on View-based query processing Data integration lecture 2 Riccardo Rosati Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza {rosati}@dis.uniroma1.it Corso di Dottorato
More informationDatabase Technology Introduction. Heiko Paulheim
Database Technology Introduction Outline The Need for Databases Data Models Relational Databases Database Design Storage Manager Query Processing Transaction Manager Introduction to the Relational Model
More informationComposing Schema Mapping
Composing Schema Mapping An Overview Phokion G. Kolaitis UC Santa Cruz & IBM Research Almaden Joint work with R. Fagin, L. Popa, and W.C. Tan 1 Data Interoperability Data may reside at several different
More informationData integration supports seamless access to autonomous, heterogeneous information
Using Constraints to Describe Source Contents in Data Integration Systems Chen Li, University of California, Irvine Data integration supports seamless access to autonomous, heterogeneous information sources
More informationProvable data privacy
Provable data privacy Kilian Stoffel 1 and Thomas Studer 2 1 Université de Neuchâtel, Pierre-à-Mazel 7, CH-2000 Neuchâtel, Switzerland kilian.stoffel@unine.ch 2 Institut für Informatik und angewandte Mathematik,
More informationMetadata Services for Distributed Event Stream Processing Agents
Metadata Services for Distributed Event Stream Processing Agents Mahesh B. Chaudhari School of Computing, Informatics, and Decision Systems Engineering Arizona State University Tempe, AZ 85287-8809, USA
More information(Big Data Integration) : :
(Big Data Integration) : : 3 # $%&'! ()* +$,- 2/30 ()* + # $%&' = 3 : $ 2 : 17 ;' $ # < 2 6 ' $%&',# +'= > 0 - '? @0 A 1 3/30 3?. - B 6 @* @(C : E6 - > ()* (C :(C E6 1' +'= - ''3-6 F :* 2G '> H-! +'-?
More informationDataspaces: A New Abstraction for Data Management. Mike Franklin, Alon Halevy, David Maier, Jennifer Widom
Dataspaces: A New Abstraction for Data Management Mike Franklin, Alon Halevy, David Maier, Jennifer Widom Today s Agenda Why databases are great. What problems people really have Why databases are not
More informationInformation Integration
Information Integration Part 1: Basics of Relational Database Theory Werner Nutt Faculty of Computer Science Master of Science in Computer Science A.Y. 2012/2013 Integration in Data Management: Evolution
More informationData Integration and Data Warehousing Database Integration Overview
Data Integration and Data Warehousing Database Integration Overview Sergey Stupnikov Institute of Informatics Problems, RAS ssa@ipi.ac.ru Outline Information Integration Problem Heterogeneous Information
More informationArchiving and Maintaining Curated Databases
Archiving and Maintaining Curated Databases Heiko Müller University of Edinburgh, UK hmueller@inf.ed.ac.uk Abstract Curated databases represent a substantial amount of effort by a dedicated group of people
More informationOn the Role of Integrity Constraints in Data Integration
On the Role of Integrity Constraints in Data Integration Andrea Calì, Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza
More informationCSE-6490B Final Exam
February 2009 CSE-6490B Final Exam Fall 2008 p 1 CSE-6490B Final Exam In your submitted work for this final exam, please include and sign the following statement: I understand that this final take-home
More informationData Integration: A Theoretical Perspective
Data Integration: A Theoretical Perspective Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza Via Salaria 113, I 00198 Roma, Italy lenzerini@dis.uniroma1.it ABSTRACT
More informationThe Inverse of a Schema Mapping
The Inverse of a Schema Mapping Jorge Pérez Department of Computer Science, Universidad de Chile Blanco Encalada 2120, Santiago, Chile jperez@dcc.uchile.cl Abstract The inversion of schema mappings has
More informationData integration lecture 3
PhD course on View-based query processing Data integration lecture 3 Riccardo Rosati Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza {rosati}@dis.uniroma1.it Corso di Dottorato
More informationCSCI1270 Introduction to Database Systems
CSCI1270 Introduction to Database Systems with thanks to Prof. George Kollios, Boston University Prof. Mitch Cherniack, Brandeis University Prof. Avi Silberschatz, Yale University 1.1 What is a Database
More informationFrom ER Diagrams to the Relational Model. Rose-Hulman Institute of Technology Curt Clifton
From ER Diagrams to the Relational Model Rose-Hulman Institute of Technology Curt Clifton Review Entity Sets and Attributes Entity set: collection of things in the DB Attribute: property of an entity calories
More informationDescribing and Utilizing Constraints to Answer Queries in Data-Integration Systems
Describing and Utilizing Constraints to Answer Queries in Data-Integration Systems Chen Li Information and Computer Science University of California, Irvine, CA 92697 chenli@ics.uci.edu Abstract In data-integration
More informationDesigning Views to Answer Queries under Set, Bag,and BagSet Semantics
Designing Views to Answer Queries under Set, Bag,and BagSet Semantics Rada Chirkova Department of Computer Science, North Carolina State University Raleigh, NC 27695-7535 chirkova@csc.ncsu.edu Foto Afrati
More informationRelational model continued. Understanding how to use the relational model. Summary of board example: with Copies as weak entity
COS 597A: Principles of Database and Information Systems Relational model continued Understanding how to use the relational model 1 with as weak entity folded into folded into branches: (br_, librarian,
More informationBio/Ecosystem Informatics
Bio/Ecosystem Informatics Renée J. Miller University of Toronto DB research problem: managing data semantics R. J. Miller University of Toronto 1 Managing Data Semantics Semantics modeled by Schemas (structure
More informationDatabases Lectures 1 and 2
Databases Lectures 1 and 2 Timothy G. Griffin Computer Laboratory University of Cambridge, UK Databases, Lent 2009 T. Griffin (cl.cam.ac.uk) Databases Lectures 1 and 2 DB 2009 1 / 36 Re-ordered Syllabus
More informationOutline. Database Theory. Prerequisites and Admission. Classes VU , SS 2018
Database Theory Database Theory Outline Database Theory VU 181.140, SS 2018 0. General Information Reinhard Pichler Institut für Informationssysteme Arbeitsbereich DBAI Technische Universität Wien 6 March,
More informationA Data warehouse within a Federated database architecture
Association for Information Systems AIS Electronic Library (AISeL) AMCIS 1997 Proceedings Americas Conference on Information Systems (AMCIS) 8-15-1997 A Data warehouse within a Federated database architecture
More informationEdinburgh Research Explorer
Edinburgh Research Explorer Hippo: A System for Computing Consistent Answers to a Class of SQL Queries Citation for published version: Chomicki, J, Marcinkowski, J & Staworko, S 2004, Hippo: A System for
More informationDatabase Systems. Sven Helmer. Database Systems p. 1/567
Database Systems Sven Helmer Database Systems p. 1/567 Chapter 1 Introduction and Motivation Database Systems p. 2/567 Introduction What is a database system (DBS)? Obviously a system for storing and managing
More informationAspects of an XML-Based Phraseology Database Application
Aspects of an XML-Based Phraseology Database Application Denis Helic 1 and Peter Ďurčo2 1 University of Technology Graz Insitute for Information Systems and Computer Media dhelic@iicm.edu 2 University
More informationEvolution of XML Applications
Evolution of XML Applications University of Technology Sydney, Australia Irena Mlynkova 9.11. 2011 XML and Web Engineering Research Group Department of Software Engineering Faculty of Mathematics and Physics
More informationSTRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa
STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS LECTURE: 05 (A) DATA WAREHOUSING (DW) By: Dr. Tendani J. Lavhengwa lavhengwatj@tut.ac.za 1 My personal quote:
More informationProcessing Regular Path Queries Using Views or What Do We Need for Integrating Semistructured Data?
Processing Regular Path Queries Using Views or What Do We Need for Integrating Semistructured Data? Diego Calvanese University of Rome La Sapienza joint work with G. De Giacomo, M. Lenzerini, M.Y. Vardi
More informationOn the Integration of Autonomous Data Marts
On the Integration of Autonomous Data Marts Luca Cabibbo and Riccardo Torlone Dipartimento di Informatica e Automazione Università di Roma Tre {cabibbo,torlone}@dia.uniroma3.it Abstract We address the
More informationData Integration A Logic-Based Perspective
AI Magazine Volume 26 Number 1 (2005) ( AAAI) Articles Data Integration A Logic-Based Perspective Diego Calvanese and Giuseppe De Giacomo Data integration is the problem of combining data residing at different
More informationSchema Management. Abstract
Schema Management Periklis Andritsos Λ Ronald Fagin y Ariel Fuxman Λ Laura M. Haas y Mauricio A. Hernández y Ching-Tien Ho y Anastasios Kementsietsidis Λ Renée J. Miller Λ Felix Naumann y Lucian Popa y
More informationG64DBS Database Systems. Lecture 7 SQL SELECT. The Data Dictionary. Data Dictionaries. Different Sections of SQL (DDL) Different Sections of SQL (DCL)
G64DBS Database Systems Lecture 7 SQL SELECT Tim Brailsford Different Sections of SQL (DDL) The Data Definition Language (DDL): CREATE TABLE - creates a new database table ALTER TABLE - alters (changes)
More informationThe interaction of theory and practice in database research
The interaction of theory and practice in database research Ron Fagin IBM Research Almaden 1 Purpose of This Talk Encourage collaboration between theoreticians and system builders via two case studies
More informationA Deeper Look at Data Modeling. Shan-Hung Wu & DataLab CS, NTHU
A Deeper Look at Data Modeling Shan-Hung Wu & DataLab CS, NTHU Outline More about ER & Relational Models Weak Entities Inheritance Avoiding redundancy & inconsistency Functional Dependencies Normal Forms
More informationData Integration: A Logic-Based Perspective
Data Integration: A Logic-Based Perspective Diego Calvanese Faculty of Computer Science Free University of Bolzano/Bozen Piazza Domenicani 3, 39100 Bolzano, Italy calvanese@inf.unibz.it Giuseppe De Giacomo
More informationLearning from Semantically Heterogeneous Data
Learning from Semantically Heterogeneous Data Doina Caragea* Department of Computing and Information Sciences Kansas State University 234 Nichols Hall Manhattan, KS 66506 USA voice: +1 785-532-7908 fax:
More informationAdvances in Data Management - Web Data Integration A.Poulovassilis
Advances in Data Management - Web Data Integration A.Poulovassilis 1 1 Integrating Deep Web Data Traditionally, the web has made available vast amounts of information in unstructured form (i.e. text).
More informationData Schema Integration
Mustafa Jarrar Lecture Notes, Web Data Management (MCOM7348) University of Birzeit, Palestine 1 st Semester, 2013 Data Schema Integration Dr. Mustafa Jarrar University of Birzeit mjarrar@birzeit.edu www.jarrar.info
More informationAn Efficient Design and Implementation of a Heterogeneous Deductive Object-Oriented Database System
An Efficient Design and Implementation of a Heterogeneous Deductive Object-Oriented Database System Cyril S. Ku Department of Computer Science William Paterson University Wayne, NJ 07470, USA Suk-Chung
More informationEncyclopedia of Database Systems, Editors-in-chief: Özsu, M. Tamer; Liu, Ling, Springer, MAINTENANCE OF RECURSIVE VIEWS. Suzanne W.
Encyclopedia of Database Systems, Editors-in-chief: Özsu, M. Tamer; Liu, Ling, Springer, 2009. MAINTENANCE OF RECURSIVE VIEWS Suzanne W. Dietrich Arizona State University http://www.public.asu.edu/~dietrich
More informationQuery Rewriting Using Views in the Presence of Inclusion Dependencies
Query Rewriting Using Views in the Presence of Inclusion Dependencies Qingyuan Bai Jun Hong Michael F. McTear School of Computing and Mathematics, University of Ulster at Jordanstown, Newtownabbey, Co.
More informationMIS2502: Data Analytics Relational Data Modeling - 1. JaeHwuen Jung
MIS2502: Data Analytics Relational Data Modeling - 1 JaeHwuen Jung jaejung@temple.edu http://community.mis.temple.edu/jaejung Where we are Now we re here Data entry Transactional Database Data extraction
More informationA Novel Method for the Comparison of Graphical Data Models
3RD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DEVELOPMENT (ISD01 CROATIA) A Novel Method for the Comparison of Graphical Data Models Katarina Tomičić-Pupek University of Zagreb, Faculty of Organization
More informationCore Schema Mappings: Computing Core Solution with Target Dependencies in Data Exchange
Core Schema Mappings: Computing Core Solution with Target Dependencies in Data Exchange S. Ravichandra, and D.V.L.N. Somayajulu Abstract Schema mapping is a declarative specification of the relationship
More informationModelling Data Warehouses with Multiversion and Temporal Functionality
Modelling Data Warehouses with Multiversion and Temporal Functionality Waqas Ahmed waqas.ahmed@ulb.ac.be Université Libre de Bruxelles Poznan University of Technology July 9, 2015 ITBI DC Outline 1 Introduction
More informationEnabling Seamless Sharing of Data among Organizations Using the DaaS Model in a Cloud
Enabling Seamless Sharing of Data among Organizations Using the DaaS Model in a Cloud Addis Mulugeta Ethiopian Sugar Corporation, Addis Ababa, Ethiopia addismul@gmail.com Abrehet Mohammed Omer Department
More informationKeyword query interpretation over structured data
Keyword query interpretation over structured data Advanced Methods of Information Retrieval Elena Demidova SS 2018 Elena Demidova: Advanced Methods of Information Retrieval SS 2018 1 Recap Elena Demidova:
More informationOntology-based Integration and Refinement of Evaluation-Committee Data from Heterogeneous Data Sources
Indian Journal of Science and Technology, Vol 8(23), DOI: 10.17485/ijst/2015/v8i23/79342 September 2015 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Ontology-based Integration and Refinement of Evaluation-Committee
More informationDatabase Constraints and Homomorphism Dualities
Database Constraints and Homomorphism Dualities Balder ten Cate 1, Phokion G. Kolaitis 1,2, and Wang-Chiew Tan 2,1 1 University of California Santa Cruz 2 IBM Research-Almaden Abstract. Global-as-view
More informationCreating a Mediated Schema Based on Initial Correspondences
Creating a Mediated Schema Based on Initial Correspondences Rachel A. Pottinger University of Washington Seattle, WA, 98195 rap@cs.washington.edu Philip A. Bernstein Microsoft Research Redmond, WA 98052-6399
More informationKeyword query interpretation over structured data
Keyword query interpretation over structured data Advanced Methods of IR Elena Demidova Materials used in the slides: Jeffrey Xu Yu, Lu Qin, Lijun Chang. Keyword Search in Databases. Synthesis Lectures
More informationPresented by Kit Na Goh
Developing A Geo-Spatial Search Tool Using A Relational Database Implementation of the FGDC CSDGM Model Presented by Kit Na Goh Introduction Executive Order 12906 was issued on April 13, 1994 with the
More informationA Framework for Ontology Integration
A Framework for Ontology Integration Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza Via Salaria 113, 00198 Roma, Italy
More informationConsistent Query Answering: Opportunities and Limitations
Consistent Query Answering: Opportunities and Limitations Jan Chomicki Dept. Computer Science and Engineering University at Buffalo, SUNY Buffalo, NY 14260-2000, USA chomicki@buffalo.edu Abstract This
More informationSchema Exchange: a Template-based Approach to Data and Metadata Translation
Schema Exchange: a Template-based Approach to Data and Metadata Translation Paolo Papotti and Riccardo Torlone Università Roma Tre {papotti,torlone}@dia.uniroma3.it Abstract. In this paper we study the
More informationRewrite and Conquer: Dealing with Integrity Constraints in Data Integration
Rewrite and Conquer: Dealing with Integrity Constraints in Data Integration Andrea Calì, Diego Calvanese, Giuseppe De Giacomo, and Maurizio Lenzerini Abstract The work Data Integration under Integrity
More informationMIS2502: Data Analytics Relational Data Modeling. Jing Gong
MIS2502: Data Analytics Relational Data Modeling Jing Gong gong@temple.edu http://community.mis.temple.edu/gong Where we are Now we re here Data entry Transactional Database Data extraction Analytical
More informationDIRA : A FRAMEWORK OF DATA INTEGRATION USING DATA QUALITY
DIRA : A FRAMEWORK OF DATA INTEGRATION USING DATA QUALITY Reham I. Abdel Monem 1, Ali H. El-Bastawissy 2 and Mohamed M. Elwakil 3 1 Information Systems Department, Faculty of computers and information,
More information38050 Povo Trento (Italy), Via Sommarive 14 THE CODB ROBUST PEER-TO-PEER DATABASE SYSTEM
UNIVERSITY OF TRENTO DEPARTMENT OF INFORMATION AND COMMUNICATION TECHNOLOGY 38050 Povo Trento (Italy), Via Sommarive 14 http://www.dit.unitn.it THE CODB ROBUST PEER-TO-PEER DATABASE SYSTEM Enrico Franconi,
More informationANDREAS PIERIS JOURNAL PAPERS
ANDREAS PIERIS School of Informatics, University of Edinburgh Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK apieris@inf.ed.ac.uk PUBLICATIONS (authors in alphabetical order) JOURNAL
More informationCMPT 354: Database System I. Lecture 5. Relational Algebra
CMPT 354: Database System I Lecture 5. Relational Algebra 1 What have we learned Lec 1. DatabaseHistory Lec 2. Relational Model Lec 3-4. SQL 2 Why Relational Algebra matter? An essential topic to understand
More informationOptimized encodings for Consistent Query Answering via ASP from different perspectives
Optimized encodings for Consistent Query Answering via ASP from different perspectives Marco Manna, Francesco Ricca, and Giorgio Terracina Department of Mathematics, University of Calabria, Italy {manna,ricca,terracina}@mat.unical.it
More informationDatabase Fundamentals Chapter 1
Database Fundamentals Chapter 1 Class 01: Database Fundamentals 1 What is a Database? The ISO/ANSI SQL Standard does not contain a definition of the term database. In fact, the term is never mentioned
More informationThe Data Integration Research Group at UFPE
The Data Integration Research Group at UFPE Ana Carolina Salgado 1, Bernadette F. Lóscio 1, Maria da Conceição M. Batista 2 Rosalie B. Belian 3, Carlos Eduardo S. Pires 4, Damires Souza 5 1 Federal University
More informationCAS CS 460/660 Introduction to Database Systems. Fall
CAS CS 460/660 Introduction to Database Systems Fall 2017 1.1 About the course Administrivia Instructor: George Kollios, gkollios@cs.bu.edu MCS 283, Mon 2:30-4:00 PM and Tue 1:00-2:30 PM Teaching Fellows:
More informationI. Khalil Ibrahim, V. Dignum, W. Winiwarter, E. Weippl, Logic Based Approach to Semantic Query Transformation for Knowledge Management Applications,
I. Khalil Ibrahim, V. Dignum, W. Winiwarter, E. Weippl, Logic Based Approach to Semantic Query Transformation for Knowledge Management Applications, Proc. of the International Conference on Knowledge Management
More informationB.H.GARDI COLLEGE OF MASTER OF COMPUTER APPLICATION. Ch. 1 :- Introduction Database Management System - 1
Basic Concepts :- 1. What is Data? Data is a collection of facts from which conclusion may be drawn. In computer science, data is anything in a form suitable for use with a computer. Data is often distinguished
More informationData Integration 1. Giuseppe De Giacomo. Dipartimento di Informatica e Sistemistica Antonio Ruberti Università di Roma La Sapienza
Data Integration 1 Giuseppe De Giacomo Dipartimento di Informatica e Sistemistica Antonio Ruberti Università di Roma La Sapienza View-based query processing Diego Calvanese, Giuseppe De Giacomo, Georg
More informationSCHEMA MAPPING DESIGN SYSTEMS 1. Schema Mapping Design Systems: Example-Driven and Semantic Approaches. Kathryn Dahlgren. CSU Stanislaus CS4960
SCHEMA MAPPING DESIGN SYSTEMS 1 Schema Mapping Design Systems: Example-Driven and Semantic Approaches Kathryn Dahlgren CSU Stanislaus CS4960 Dr. Melanie Martin SCHEMA MAPPING DESIGN SYSTEMS 2 Introduction
More informationMDDQL-Stat: data querying and analysis through integration of intentional and extensional semantics.
University of Westminster Eprints http://eprints.wmin.ac.uk MDDQL-Stat: data querying and analysis through integration of intentional and extensional semantics. Epaminondas Kapetanios 1 David Baer Björn
More informationChris Moffatt Director of Technology, Ed-Fi Alliance
Chris Moffatt Director of Technology, Ed-Fi Alliance Review Background and Context Temporal ODS Project Project Overview Design and Architecture Demo Temporal Snapshot & Query Proof of Concept Discussion
More informationDatabase Systems Overview. Truong Tuan Anh CSE-HCMUT
Database Systems Overview Truong Tuan Anh CSE-HCMUT Outline File-based Approach and Database Approach Three-Schema Architecture and Data Independence Database Languages Data Models, Database Schema, Database
More informationConsistent Query Answering
Consistent Query Answering Opportunities and Limitations Jan Chomicki Dept. CSE University at Buffalo State University of New York http://www.cse.buffalo.edu/ chomicki 1 Integrity constraints Integrity
More informationInformation Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 05(b) : 23/10/2012 Data Mining: Concepts and Techniques (3 rd ed.) Chapter
More informationA Prospect of Websites Evaluation Tools Based on Event Logs
A Prospect of Websites Evaluation Tools Based on Event Logs Vagner Figuerêdo de Santana 1, and M. Cecilia C. Baranauskas 2 1 Institute of Computing, UNICAMP, Brazil, v069306@dac.unicamp.br 2 Institute
More informationDatabase Design. 6-1 Artificial, Composite, and Secondary UIDs. Copyright 2015, Oracle and/or its affiliates. All rights reserved.
Database Design 6-1 Objectives This lesson covers the following objectives: Define the different types of unique identifiers (UIDs) Define a candidate UID and explain why an entity can sometimes have more
More informationA SECURITY BASED DATA MINING APPROACH IN DATA GRID
45 A SECURITY BASED DATA MINING APPROACH IN DATA GRID S.Vidhya, S.Karthikeyan Abstract - Grid computing is the next logical step to distributed computing. Main objective of grid computing is an innovative
More informationA Distributed Event Stream Processing Framework for Materialized Views over Heterogeneous Data Sources
A Distributed Event Stream Processing Framework for Materialized Views over Heterogeneous Data Sources Mahesh B. Chaudhari #*1 # School of Computing, Informatics, and Decision Systems Engineering Arizona
More informationWho won the Universal Relation wars? Alberto Mendelzon University of Toronto
Who won the Universal Relation wars? Alberto Mendelzon University of Toronto Who won the Universal Relation wars? 1 1-a Who won the Universal Relation wars? The Good Guys. The Good Guys 2 3 Outline The
More informationA. Papadopoulos, G. Pallis, M. D. Dikaiakos. Identifying Clusters with Attribute Homogeneity and Similar Connectivity in Information Networks
A. Papadopoulos, G. Pallis, M. D. Dikaiakos Identifying Clusters with Attribute Homogeneity and Similar Connectivity in Information Networks IEEE/WIC/ACM International Conference on Web Intelligence Nov.
More informationIJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online):
IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online): 2321-0613 Tanzeela Khanam 1 Pravin S.Metkewar 2 1 Student 2 Associate Professor 1,2 SICSR, affiliated
More informationAlgebraic Model Management: A Survey
Algebraic Model Management: A Survey Patrick Schultz 1, David I. Spivak 1, and Ryan Wisnesky 2 1 Massachusetts Institute of Technology 2 Categorical Informatics, Inc. Abstract. We survey the field of model
More informationA Comprehensive Semantic Framework for Data Integration Systems
A Comprehensive Semantic Framework for Data Integration Systems Andrea Calì 1, Domenico Lembo 2, and Riccardo Rosati 2 1 Faculty of Computer Science Free University of Bolzano/Bozen, Italy cali@inf.unibz.it
More informationUpdates through Views
1 of 6 15 giu 2010 00:16 Encyclopedia of Database Systems Springer Science+Business Media, LLC 2009 10.1007/978-0-387-39940-9_847 LING LIU and M. TAMER ÖZSU Updates through Views Yannis Velegrakis 1 (1)
More informationApproximation Algorithms for Computing Certain Answers over Incomplete Databases
Approximation Algorithms for Computing Certain Answers over Incomplete Databases Sergio Greco, Cristian Molinaro, and Irina Trubitsyna {greco,cmolinaro,trubitsyna}@dimes.unical.it DIMES, Università della
More informationVALUE RECONCILIATION IN MEDIATORS OF HETEROGENEOUS INFORMATION COLLECTIONS APPLYING WELL-STRUCTURED CONTEXT SPECIFICATIONS
VALUE RECONCILIATION IN MEDIATORS OF HETEROGENEOUS INFORMATION COLLECTIONS APPLYING WELL-STRUCTURED CONTEXT SPECIFICATIONS D. O. Briukhov, L. A. Kalinichenko, N. A. Skvortsov, S. A. Stupnikov Institute
More informationStructural characterizations of schema mapping languages
Structural characterizations of schema mapping languages Balder ten Cate INRIA and ENS Cachan (research done while visiting IBM Almaden and UC Santa Cruz) Joint work with Phokion Kolaitis (ICDT 09) Schema
More informationXQuery Optimization Based on Rewriting
XQuery Optimization Based on Rewriting Maxim Grinev Moscow State University Vorob evy Gory, Moscow 119992, Russia maxim@grinev.net Abstract This paper briefly describes major results of the author s dissertation
More informationINTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)
INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) International Journal of Electronics and Communication Engineering & Technology (IJECET), ISSN 0976 ISSN 0976 6464(Print)
More informationDatabase Systems ( 資料庫系統 ) Practicum in Database Systems ( 資料庫系統實驗 ) 9/20 & 9/21, 2006 Lecture #1
Database Systems ( 資料庫系統 ) Practicum in Database Systems ( 資料庫系統實驗 ) 9/20 & 9/21, 2006 Lecture #1 1 Course Goals First course in database systems. Main Course (3 units) - Learn Use a relational database
More information