Bio/Ecosystem Informatics

1 Bio/Ecosystem Informatics
Renée J. Miller, University of Toronto
DB research problem: managing data semantics
R. J. Miller University of Toronto

2 Managing Data Semantics
- Semantics modeled by schemas (structure and constraints)
- Creating semantics: data design
- Managing semantics: understanding and reconciling different choices made in modeling data semantics

3 Managing Data Semantics
- Problem
  - Active area of DB research
  - Not specific to bio/ecosystem informatics
- Solutions
  - Tailored to data characteristics
  - Our solution is focused on curated data

4 Clio: Schema & View Management
- University of Toronto: Renée J. Miller, Periklis Andritsos, Ariel Fuxman, Tasos Kementsietsidis, Yannis Velegrakis
- IBM Almaden: Laura Haas, Ron Fagin, Mauricio Hernández, Howard Ho, Phokion Kolaitis, Lucian Popa, Ling-Ling Yan

5 Clio: Schema & View Management
- Manage datasets that may differ in:
  - Data models
  - Structures (table or nesting structure)
  - Semantics (constraints)
  - Content
- Example problems
  - Schema Mapping
  - Schema Translation (Wrapper Generation)
  - Schema Evolution
  - Schema Integration

6 Schema Mapping
[Diagram labels: source schema S, schema mapping, target schema T; "conforms to" (twice); data translation program (queries)]
- Clio facilitates: creation, management, and debugging of schema mappings
- Uses
  - Data exchange: create a (materialized) target instance
  - Data integration: translate queries on a (virtual) target into queries on the source(s)
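To make the two uses on this slide concrete, here is a minimal Python sketch. It is not Clio output (Clio generates queries, not Python), and the schemas, field names, and sample values are invented for illustration. It assumes one source relation and one target relation linked by a single mapping rule, and shows the same mapping used once to materialize a target instance (data exchange) and once to answer a target query directly against the source (data integration).

```python
from typing import List, NamedTuple


class SourceRecord(NamedTuple):
    """Hypothetical source schema S: one relation with three attributes."""
    record_id: str
    label: str
    origin: str


class TargetEntry(NamedTuple):
    """Hypothetical target schema T: keeps only an accession and a name."""
    accession: str
    name: str


# Assumed mapping rule: every SourceRecord(r, l, o) yields TargetEntry(r, l).

def exchange(source: List[SourceRecord]) -> List[TargetEntry]:
    """Data exchange: build a materialized target instance from the source."""
    return [TargetEntry(accession=s.record_id, name=s.label) for s in source]


def names_for_accession(source: List[SourceRecord], accession: str) -> List[str]:
    """Data integration: the target query 'names for this accession' is
    rewritten into a query on the source; no target instance is stored."""
    return [s.label for s in source if s.record_id == accession]


source = [
    SourceRecord("r1", "alpha", "siteA"),
    SourceRecord("r2", "beta", "siteB"),
]
target = exchange(source)
```

Under this mapping, querying the materialized `target` and calling `names_for_accession` on the source should give the same answers; the difference is only where the data lives when the query runs.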

7 Data Publishing & Exchange
[Figure labels: WWW, XML schema, Business partner, WebService, WSDL, Life Sciences Company, Relational Schema, Microarray data in DB2, SwissProt, PubMed, GenBank]

8 Illustration: Clio Schema Mapping

10 Illustration: Clio Schema Mapping
- Support nested structures
- Element correspondences
  - Human friendly
  - Automatic discovery
- Preserve data meaning
  - Discover data associations
  - Use constraints & schema
- Create new target values
- Produce correct grouping
- And produce XQuery
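Three of the points above (nested structures, new target values, correct grouping) can be illustrated with a small sketch. This is Python standing in for the XQuery that Clio produces, and everything in it is an assumption made for illustration: the flat (lab, sample) source rows, the nested target shape, and the `skolem` helper that invents a deterministic identifier for a target field with no source counterpart.

```python
from typing import Dict, List, Tuple


def skolem(*args: str) -> str:
    """Hypothetical helper: a deterministic placeholder for a target value
    that has no counterpart in the source (a Skolem-style term)."""
    return "id(" + ",".join(args) + ")"


def to_nested(rows: List[Tuple[str, str]]) -> List[Dict]:
    """Turn flat (lab, sample) rows into one nested entry per lab.

    Grouping: all samples of the same lab end up under a single entry.
    New target values: each entry gets a lab_id derived only from the lab
    name, so repeated rows for the same lab agree on the invented value.
    """
    groups: Dict[str, Dict] = {}
    for lab, sample in rows:
        entry = groups.setdefault(
            lab, {"lab": lab, "lab_id": skolem(lab), "samples": []}
        )
        entry["samples"].append(sample)
    # dict preserves insertion order, so labs appear in first-seen order.
    return list(groups.values())


rows = [("LabA", "s1"), ("LabB", "s2"), ("LabA", "s3")]
nested = to_nested(rows)
```

Making the invented identifier a function of the lab name alone is what keeps the grouping correct: a fresh value per source row would split one lab into several target entries.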

11 Why Should DB Researchers Care about Bio/Ecosystems?
- Great data
  - Rich, complex semantics
  - Exercises many facets of our models
  - Publicly available!!!
- Domain curators are expert DBAs
  - Provide important feedback

12 How Should DB Researchers Work with Bio/Ecosystem Scientists?
- Our model
  - Took an existing, general DM problem
  - Explored systems and foundational issues
  - Prototyped a tool
- Team of developers works closely with domain scientists
  - Provide feedback
- Arm's-length collaboration!!!

13 Overview
- Goal: interoperability between independent data sources
  - Schema Mapping
  - Data Translation (Data Exchange Problem)
- Challenges
  - Schemas can be arbitrarily different
  - Still, data must not lose its meaning during translation
- Maximum advantage of semantics embedded in schemas & data
  - Used in compilation
- Facilitate user specification of any additional semantics
  - As by-product user learns if semantics incorrect/incomplete
- Performed manually: complex user queries, programs, etc.
- Output: correct data translation program

Schema Management. Abstract

Schema Management. Abstract Schema Management Periklis Andritsos Λ Ronald Fagin y Ariel Fuxman Λ Laura M. Haas y Mauricio A. Hernández y Ching-Tien Ho y Anastasios Kementsietsidis Λ Renée J. Miller Λ Felix Naumann y Lucian Popa y

More information

Mapper An Efficient Data Transformation Operator

Mapper An Efficient Data Transformation Operator Mapper An Efficient Data Transformation Operator Paulo Carreira University of Lisbon 2 Context Source: IMPCRED CREDTID MERCH FT546083 RT546084 8 PCS LARGE CONTAINER MODULE X067 100 PCS COTTON CORDUROY

More information

The interaction of theory and practice in database research

The interaction of theory and practice in database research The interaction of theory and practice in database research Ron Fagin IBM Research Almaden 1 Purpose of This Talk Encourage collaboration between theoreticians and system builders via two case studies

More information

Simplifying Information Integration: Object-Based Flow-of-Mappings Framework for Integration

Simplifying Information Integration: Object-Based Flow-of-Mappings Framework for Integration Simplifying Information Integration: Object-Based Flow-of-Mappings Framework for Integration Howard Ho IBM Almaden Research Center Take Home Messages Clio (Schema Mapping for EII) has evolved to Clio 2010

More information

Model Management and Schema Mappings: Theory and Practice (Part II)

Model Management and Schema Mappings: Theory and Practice (Part II) IBM Research Model Management and Schema Mappings: Theory and Practice (Part II) Howard Ho IBM Almaden Research Center For VLDB07 Tutorial with Phil Bernstein, Microsoft Research 2006 IBM Corporation The

More information

DBAI-TR UMAP: A Universal Layer for Schema Mapping Languages

DBAI-TR UMAP: A Universal Layer for Schema Mapping Languages DBAI-TR-2012-76 UMAP: A Universal Layer for Schema Mapping Languages Florin Chertes and Ingo Feinerer Technische Universität Wien, Vienna, Austria Institut für Informationssysteme FlorinChertes@acm.org

More information

Kanata: Adaptation and Evolution in Data Sharing Systems

Kanata: Adaptation and Evolution in Data Sharing Systems Kanata: Adaptation and Evolution in Data Sharing Systems Periklis Andritsos Ariel Fuxman Anastasios Kementsietsidis Renée J. Miller Yannis Velegrakis Department of Computer Science University of Toronto

More information

BioFederator: A Data Federation System for Bioinformatics on the Web

BioFederator: A Data Federation System for Bioinformatics on the Web BioFederator: A Data Federation System for Bioinformatics on the Web Ahmed Radwan, Akmal Younis, Mauricio A. Hernandez, Howard Ho, Department of Electrical and Computer Engineering Lucian Popa, Shivkumar

More information

A Classification of Schema Mappings and Analysis of Mapping Tools

A Classification of Schema Mappings and Analysis of Mapping Tools A Classification of Schema Mappings and Analysis of Mapping Tools Frank Legler IBM Deutschland Entwicklung GmbH flegler@de.ibm.com Felix Naumann Hasso-Plattner-Institut, Potsdam naumann@hpi.uni-potsdam.de

More information

A Non intrusive Data driven Approach to Debugging Schema Mappings for Data Exchange

A Non intrusive Data driven Approach to Debugging Schema Mappings for Data Exchange 1. Problem and Motivation A Non intrusive Data driven Approach to Debugging Schema Mappings for Data Exchange Laura Chiticariu and Wang Chiew Tan UC Santa Cruz {laura,wctan}@cs.ucsc.edu Data exchange is

More information

Foundations of Data Exchange and Metadata Management. Marcelo Arenas Ron Fagin Special Event - SIGMOD/PODS 2016

Foundations of Data Exchange and Metadata Management. Marcelo Arenas Ron Fagin Special Event - SIGMOD/PODS 2016 Foundations of Data Exchange and Metadata Management Marcelo Arenas Ron Fagin Special Event - SIGMOD/PODS 2016 The need for a formal definition We had a paper with Ron in PODS 2004 Back then I was a Ph.D.

More information

Partly based on slides by AnHai Doan

Partly based on slides by AnHai Doan Partly based on slides by AnHai Doan New faculty member Find houses with 2 bedrooms priced under 200K realestate.com homeseekers.com homes.com 2 Find houses with 2 bedrooms priced under 200K mediated schema

More information

Clio Grows Up: From Research Prototype to Industrial Tool

Clio Grows Up: From Research Prototype to Industrial Tool Clio Grows Up: From Research Prototype to Industrial Tool Laura M. Haas IBM Silicon Valley Labs laura@almaden.ibm.com Mauricio A. Hernández IBM Almaden Research Center mauricio@almaden.ibm.com Lucian Popa

More information

Data Integration: Schema Mapping

Data Integration: Schema Mapping Data Integration: Schema Mapping Jan Chomicki University at Buffalo and Warsaw University March 8, 2007 Jan Chomicki (UB/UW) Data Integration: Schema Mapping March 8, 2007 1 / 13 Data integration Data

More information

Structural characterizations of schema mapping languages

Structural characterizations of schema mapping languages Structural characterizations of schema mapping languages Balder ten Cate INRIA and ENS Cachan (research done while visiting IBM Almaden and UC Santa Cruz) Joint work with Phokion Kolaitis (ICDT 09) Schema

More information

Data Integration: Schema Mapping

Data Integration: Schema Mapping Data Integration: Schema Mapping Jan Chomicki University at Buffalo and Warsaw University March 8, 2007 Jan Chomicki (UB/UW) Data Integration: Schema Mapping March 8, 2007 1 / 13 Data integration Jan Chomicki

More information

Setting up a CIDOC CRM Adoption and Use Strategy CIDOC CRM: Success Stories, Challenges and New Perspective

Setting up a CIDOC CRM Adoption and Use Strategy CIDOC CRM: Success Stories, Challenges and New Perspective Setting up a CIDOC CRM Adoption and Use Strategy CIDOC CRM: Success Stories, Challenges and New Perspective George Bruseker CIDOC 2017 Tblisi, Georgia 27/09/2017 Researcher, Interpreter Goal: A Semantic

More information

Function Symbols in Tuple-Generating Dependencies: Expressive Power and Computability

Function Symbols in Tuple-Generating Dependencies: Expressive Power and Computability Function Symbols in Tuple-Generating Dependencies: Expressive Power and Computability Georg Gottlob 1,2, Reinhard Pichler 1, and Emanuel Sallinger 2 1 TU Wien and 2 University of Oxford Tuple-generating

More information

Data publication and discovery with Globus

Data publication and discovery with Globus Data publication and discovery with Globus Questions and comments to outreach@globus.org The Globus data publication and discovery services make it easy for institutions and projects to establish collections,

More information

Foundations and Applications of Schema Mappings

Foundations and Applications of Schema Mappings Foundations and Applications of Schema Mappings Phokion G. Kolaitis University of California Santa Cruz & IBM Almaden Research Center The Data Interoperability Challenge Data may reside at several different

More information

DATA IS DEAD WITHOUT WHAT-IF MODELS

DATA IS DEAD WITHOUT WHAT-IF MODELS DATA IS DEAD WITHOUT WHAT-IF MODELS Peter J. Haas, Paul P. Maglio, Patricia G. Selinger, and Wang-Chiew Tan IBM Almaden Research Center Congratulations, Database Community! Transactions & Reports, IMS

More information

Resolving Schema and Value Heterogeneities for XML Web Querying

Resolving Schema and Value Heterogeneities for XML Web Querying Resolving Schema and Value Heterogeneities for Web ing Nancy Wiegand and Naijun Zhou University of Wisconsin 550 Babcock Drive Madison, WI 53706 wiegand@cs.wisc.edu, nzhou@wisc.edu Isabel F. Cruz and William

More information

ON SCHEMA DISCOVERY ICDM Renée J. Miller

ON SCHEMA DISCOVERY ICDM Renée J. Miller ON SCHEMA DISCOVERY ICDM 2011 Renée J. Miller What are Schemas? 2 Schema From the Greek "σχήμα meaning shape, or more generally, plan Structure and constraints the data (should) satisfy Attribute structure

More information

Introduction Data Integration Summary. Data Integration. COCS 6421 Advanced Database Systems. Przemyslaw Pawluk. CSE, York University.

Introduction Data Integration Summary. Data Integration. COCS 6421 Advanced Database Systems. Przemyslaw Pawluk. CSE, York University. COCS 6421 Advanced Database Systems CSE, York University March 20, 2008 Agenda 1 Problem description Problems 2 3 Open questions and future work Conclusion Bibliography Problem description Problems Why

More information

RONALD FAGIN. Computer Science Dept., IBM Almaden Research Center (formerly IBM Research Laboratory), San Jose, California, 1975 present

RONALD FAGIN. Computer Science Dept., IBM Almaden Research Center (formerly IBM Research Laboratory), San Jose, California, 1975 present Education RONALD FAGIN IBM Fellow IBM Almaden Research Center 650 Harry Road San Jose, California 95120-6099 Phone: 408-927-1726 Fax: 845-491-2916 Email: fagin@us.ibm.com Home page: http://www.almaden.ibm.com/cs/people/fagin/

More information

Creating a Mediated Schema Based on Initial Correspondences

Creating a Mediated Schema Based on Initial Correspondences Creating a Mediated Schema Based on Initial Correspondences Rachel A. Pottinger University of Washington Seattle, WA, 98195 rap@cs.washington.edu Philip A. Bernstein Microsoft Research Redmond, WA 98052-6399

More information

Composing Schema Mapping

Composing Schema Mapping Composing Schema Mapping An Overview Phokion G. Kolaitis UC Santa Cruz & IBM Research Almaden Joint work with R. Fagin, L. Popa, and W.C. Tan 1 Data Interoperability Data may reside at several different

More information

Data Exchange: Semantics and Query Answering

Data Exchange: Semantics and Query Answering Data Exchange: Semantics and Query Answering Ronald Fagin Phokion G. Kolaitis Renée J. Miller Lucian Popa IBM Almaden Research Center fagin,lucian @almaden.ibm.com University of California at Santa Cruz

More information

RONALD FAGIN. IBM Research Almaden (formerly IBM Research Laboratory), San Jose, California, 1975 present

RONALD FAGIN. IBM Research Almaden (formerly IBM Research Laboratory), San Jose, California, 1975 present RONALD FAGIN IBM Fellow IBM Research Almaden 650 Harry Road San Jose, California 95120-6099 Phone: 408-927-1726 Fax: 845-491-2916 Email: fagin@us.ibm.com Home page: http://researcher.ibm.com/person/us-fagin

More information

Development of an Ontology-Based Portal for Digital Archive Services

Development of an Ontology-Based Portal for Digital Archive Services Development of an Ontology-Based Portal for Digital Archive Services Ching-Long Yeh Department of Computer Science and Engineering Tatung University 40 Chungshan N. Rd. 3rd Sec. Taipei, 104, Taiwan chingyeh@cse.ttu.edu.tw

More information

The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse

The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse Stephen Hunt OSD CAPE Joint Data Support (SAIC) Stephen.Hunt.ctr@osd.mil The DoD Office of Security Review has cleared this report

More information

The Materials Data Facility

The Materials Data Facility The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials

More information

Orchid: Integrating Schema Mapping and ETL

Orchid: Integrating Schema Mapping and ETL Orchid: Integrating Schema Mapping and ETL (Exted Version) Stefan Dessloch, Mauricio A. Hernández, Ryan Wisnesky, Ahmed Radwan, Jindan Zhou Department of Computer Science, University of Kaiserslautern

More information

Labelling & Classification using emerging protocols

Labelling & Classification using emerging protocols Labelling & Classification using emerging protocols "wheels you don't have to reinvent & bandwagons you can jump on" Stephen McGibbon Lotus Development Assumptions The business rationale and benefits of

More information

Introduction to Federation Server

Introduction to Federation Server Introduction to Federation Server Alex Lee IBM Information Integration Solutions Manager of Technical Presales Asia Pacific 2006 IBM Corporation WebSphere Federation Server Federation overview Tooling

More information

Business Process Design based on Web Services: The C.O.S.M.O.S. Environment

Business Process Design based on Web Services: The C.O.S.M.O.S. Environment Business Process Design based on Web Services: The C.O.S.M.O.S. Environment LOUKAS GEORGIOU School of Informatics University of Wales-Bangor Dean Street Bangor Gwynedd, LL571UT UNITED KINGDOM ODYSSEAS

More information

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

MAPR DATA GOVERNANCE WITHOUT COMPROMISE MAPR TECHNOLOGIES, INC. WHITE PAPER JANUARY 2018 MAPR DATA GOVERNANCE TABLE OF CONTENTS EXECUTIVE SUMMARY 3 BACKGROUND 4 MAPR DATA GOVERNANCE 5 CONCLUSION 7 EXECUTIVE SUMMARY The MapR DataOps Governance

More information

Learning mappings and queries

Learning mappings and queries Learning mappings and queries Marie Jacob University Of Pennsylvania DEIS 2010 1 Schema mappings Denote relationships between schemas Relates source schema S and target schema T Defined in a query language

More information

The CASPAR Finding Aids

The CASPAR Finding Aids ABSTRACT The CASPAR Finding Aids Henri Avancini, Carlo Meghini, Loredana Versienti CNR-ISTI Area dell Ricerca di Pisa, Via G. Moruzzi 1, 56124 Pisa, Italy EMail: Full.Name@isti.cnr.it CASPAR is a EU co-funded

More information

Empowering DBA's with IBM Data Studio. Deb Jenson, Data Studio Product Manager,

Empowering DBA's with IBM Data Studio. Deb Jenson, Data Studio Product Manager, Empowering DBA's with IBM Data Studio Deb Jenson, Data Studio Product Manager, dejenson@us.ibm.com Disclaimer Copyright IBM Corporation [current year]. All rights reserved. U.S. Government Users Restricted

More information

Nested Mappings: Schema Mapping Reloaded

Nested Mappings: Schema Mapping Reloaded Nested Mappings: Schema Mapping Reloaded Ariel Fuxman University of Toronto afuxman@cs.toronto.edu Renee J. Miller University of Toronto miller@cs.toronto.edu Mauricio A. Hernandez IBM Almaden Research

More information

Developing Hypermedia Over an Information Repository

Developing Hypermedia Over an Information Repository Developing Hypermedia Over an Information Repository Panos Constantopoulos, Manos Theodorakis and Yannis Tzitzikas Department of Computer Science,University of Crete and Institute of Computer Science,

More information

Chapter 8 Web Services Objectives

Chapter 8 Web Services Objectives Chapter 8 Web Services Objectives Describe the Web services approach to the Service- Oriented Architecture concept Describe the WSDL specification and how it is used to define Web services Describe the

More information

IBM Rational Application Developer for WebSphere Software, Version 7.0

IBM Rational Application Developer for WebSphere Software, Version 7.0 Visual application development for J2EE, Web, Web services and portal applications IBM Rational Application Developer for WebSphere Software, Version 7.0 Enables installation of only the features you need

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Generating SPARQL Executable Mappings to Integrate Ontologies

Generating SPARQL Executable Mappings to Integrate Ontologies Generating SPARQL Executable Mappings to Integrate Ontologies Carlos R. Rivero, Inma Hernández, David Ruiz, and Rafael Corchuelo University of Sevilla, Spain {carlosrivero, inmahernandez, druiz, corchu}@us.es

More information

Predicates for Boolean web service policy languages Anne H. Anderson Sun Microsystems Laboratories Burlington, MA

Predicates for Boolean web service policy languages Anne H. Anderson Sun Microsystems Laboratories Burlington, MA Predicates for Boolean web service policy languages Anne H. Anderson Sun Microsystems Laboratories Burlington, MA Anne.Anderson@sun.com ABSTRACT Four of the web service policy languages that have been

More information

IBM DB2 11 DBA for z/os Certification Review Guide Exam 312

IBM DB2 11 DBA for z/os Certification Review Guide Exam 312 Introduction IBM DB2 11 DBA for z/os Certification Review Guide Exam 312 The purpose of this book is to assist you with preparing for the IBM DB2 11 DBA for z/os exam (Exam 312), one of the two required

More information

Database Management Systems

Database Management Systems DATABASE CONCEPTS & APPLICATIONS Database Management Systems A Database Management System (DBMS) is a software package designed to store and manage databases through database applications. User Database

More information

What s Out There and Where Do I find it: Enterprise Metacard Builder Resource Portal

What s Out There and Where Do I find it: Enterprise Metacard Builder Resource Portal What s Out There and Where Do I find it: Enterprise Metacard Builder Resource Portal Gary W. Allen, PhD Project Manager Joint Training Integration and Evaluation Center Orlando, FL William C. Riggs Senior

More information

Practical Database Design Methodology and Use of UML Diagrams Design & Analysis of Database Systems

Practical Database Design Methodology and Use of UML Diagrams Design & Analysis of Database Systems Practical Database Design Methodology and Use of UML Diagrams 406.426 Design & Analysis of Database Systems Jonghun Park jonghun@snu.ac.kr Dept. of Industrial Engineering Seoul National University chapter

More information

Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Chapter 1: Introduction Purpose of Database Systems Database Languages Relational Databases Database Design Data Models Database Internals Database Users and Administrators Overall

More information

Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Outline The Need for Databases Data Models Relational Databases Database Design Storage Manager Query

More information

DB2 for z/os: Programmer Essentials for Designing, Building and Tuning

DB2 for z/os: Programmer Essentials for Designing, Building and Tuning Brett Elam bjelam@us.ibm.com - DB2 for z/os: Programmer Essentials for Designing, Building and Tuning April 4, 2013 DB2 for z/os: Programmer Essentials for Designing, Building and Tuning Information Management

More information

The Rise of the (Modelling) Bots: Towards Assisted Modelling via Social Networks

The Rise of the (Modelling) Bots: Towards Assisted Modelling via Social Networks The Rise of the (Modelling) Bots: Towards Assisted Modelling via Social Networks Sara Perez-Soler, Esther Guerra, Juan de Lara, Francisco Jurado 2017 Presented by Laura Walsh 1 Overview 1. Background &

More information

When Communities of Interest Collide: Harmonizing Vocabularies Across Operational Areas C. L. Connors, The MITRE Corporation

When Communities of Interest Collide: Harmonizing Vocabularies Across Operational Areas C. L. Connors, The MITRE Corporation When Communities of Interest Collide: Harmonizing Vocabularies Across Operational Areas C. L. Connors, The MITRE Corporation Three recent trends have had a profound impact on data standardization within

More information

What Are We Building?

What Are We Building? Presentation Agenda Introduction Overview of the Transport Layer Respective Responsibilities Overview of Extensible Markup Language (XML) SDS Data Exchange Specification Schedule 1 What Are We Building?

More information

Chapter 1: Introduction. Chapter 1: Introduction

Chapter 1: Introduction. Chapter 1: Introduction Chapter 1: Introduction Database System Concepts, 5th Ed. See www.db-book.com for conditions on re-use Chapter 1: Introduction Purpose of Database Systems View of Data Database Languages Relational Databases

More information

Use of Semantic Technologies at Eli Lilly and Company. J Phil Brooks Information Consultant, SE Data Team Discover IT Eli Lilly and Company

Use of Semantic Technologies at Eli Lilly and Company. J Phil Brooks Information Consultant, SE Data Team Discover IT Eli Lilly and Company Use of Semantic Technologies at Eli Lilly and Company J Phil Brooks Information Consultant, SE Data Team Discover IT Eli Lilly and Company Notable Semantic Projects at Lilly Discovery Metadata Integration

More information

A Collective, Probabilistic Approach to Schema Mapping

A Collective, Probabilistic Approach to Schema Mapping A Collective, Probabilistic Approach to Schema Mapping Angelika Kimmig, Alex Memory, Renée Miller, Lise Getoor ILP 2017 (published at ICDE 2017) 1 Context: Data Exchange & source emp id company 1 Alice

More information

Out of the UML box: Intuitive and Data-driven Modelling Tools for INSPIRE

Out of the UML box: Intuitive and Data-driven Modelling Tools for INSPIRE Out of the UML box: Intuitive and Data-driven Modelling Tools for INSPIRE Thorsten Reitz, wetransform GmbH 15.09.2017 INSPIRE Conference 2017, Strasbourg, France Is UML bad? Observations: UML is a very

More information

Introduction to Information Systems

Introduction to Information Systems Table of Contents 1... 2 1.1 Introduction... 2 1.2 Architecture of Information systems... 2 1.3 Classification of Data Models... 4 1.4 Relational Data Model (Overview)... 8 1.5 Conclusion... 12 1 1.1 Introduction

More information

Reducing Consumer Uncertainty

Reducing Consumer Uncertainty Spatial Analytics Reducing Consumer Uncertainty Towards an Ontology for Geospatial User-centric Metadata Introduction Cooperative Research Centre for Spatial Information (CRCSI) in Australia Communicate

More information

A Mapping Model for Transforming Nested XML Documents

A Mapping Model for Transforming Nested XML Documents IJCSNS International Journal of Computer Science and Network Security, VOL.6 No.2A, February 2006 83 A Mapping Model for Transforming Nested XML Documents Gang Qian, and Yisheng Dong, Department of Computer

More information

SDTM Validation Rules in XQuery

SDTM Validation Rules in XQuery SDTM Validation Rules in XQuery FH-Prof. Dr. Jozef Aerts Univ. Appl. Sciences FH Joanneum Graz, Austria Can you understand the following validation rule (part 1)? SDTM Validation Rules in XQuery Jozef

More information

Querying Multiple Bioinformatics Information Sources: Can Semantic Web Research Help?

Querying Multiple Bioinformatics Information Sources: Can Semantic Web Research Help? Querying Multiple Bioinformatics Information Sources: Can Semantic Web Research Help? David Buttler, Matthew Coleman 1, Terence Critchlow 1, Renato Fileto, Wei Han, Ling Liu, Calton Pu, Daniel Rocco, Li

More information

CULTURAL DOCUMENTATION: THE CLIO SYSTEM. Panos Constantopoulos. University of Crete and Foundation of Research and Technology - Hellas

CULTURAL DOCUMENTATION: THE CLIO SYSTEM. Panos Constantopoulos. University of Crete and Foundation of Research and Technology - Hellas CULTURAL DOCUMENTATION: THE CLIO SYSTEM Panos Constantopoulos University of Crete and Foundation of Research and Technology - Hellas Institute of Computer Science Foundation of Research and Technology

More information

Exploring and Exploiting the Biological Maze. Presented By Vidyadhari Edupuganti Advisor Dr. Zoe Lacroix

Exploring and Exploiting the Biological Maze. Presented By Vidyadhari Edupuganti Advisor Dr. Zoe Lacroix Exploring and Exploiting the Biological Maze Presented By Vidyadhari Edupuganti Advisor Dr. Zoe Lacroix Motivation An abundance of biological data sources contain data about scientific entities, such as

More information

Enabling the Future of Connectivity. HITEC 2016 Tech Talk

Enabling the Future of Connectivity. HITEC 2016 Tech Talk Enabling the Future of Connectivity HITEC 2016 Tech Talk Who is OpenTravel? Founded in 1999 by companies in ALL verticals of travel industry who demanded a common language At the dawn of today s online

More information

HDF Product Designer: A tool for building HDF5 containers with granule metadata

HDF Product Designer: A tool for building HDF5 containers with granule metadata The HDF Group HDF Product Designer: A tool for building HDF5 containers with granule metadata Lindsay Powers Aleksandar Jelenak, Joe Lee, Ted Habermann The HDF Group Data Producer s Conundrum 2 HDF Features

More information

Real World Data Governance- Part 1

Real World Data Governance- Part 1 Real World Data Governance- Part 1 Day in the Life of a Business Steward Jesse Lambert and Jack Spivak, TopQuadrant Inc. November 30, 2017 Today s Program TopBraid EDG: A Day in the Life of a Business

More information

An Automation Framework for ns-3

An Automation Framework for ns-3 Dr. L. Felipe Perrone, Bryan C. Ward, and Andrew H. Hallagan Department of Computer Science Bucknell University March 14, 2010 Motivation Network simulation is no easy business. One must: Build a model

More information

RDF for Life Sciences

RDF for Life Sciences RDF for Life Sciences Presentation to Oracle Life Sciences User Group June 23, 2004 John Wilbanks World Wide Web Consortium (W3C) What is the W3C? Founded in 1994 by Tim Berners-Lee Develops common protocols

More information

Metamodel Matching: Experiments and Comparison

Metamodel Matching: Experiments and Comparison Metamodel Matching: Experiments and Comparison Denivaldo Lopes Federal University of Maranhão (UFMA) São Luís, Brazil Email: dlopes@dee.ufma.br Slimane Hammoudi ESEO Angers, France Email: shammoudi@eseo.fr

More information

EUROPEANA METADATA INGESTION , Helsinki, Finland

EUROPEANA METADATA INGESTION , Helsinki, Finland EUROPEANA METADATA INGESTION 20.11.2012, Helsinki, Finland As of now, Europeana has: 22.322.604 Metadata (related to a digital record) in CC0 3.698.807 are in the Public Domain 697.031 Digital Objects

More information

Provenance Management in Databases under Schema Evolution

Provenance Management in Databases under Schema Evolution Provenance Management in Databases under Schema Evolution Shi Gao, Carlo Zaniolo Department of Computer Science University of California, Los Angeles 1 Provenance under Schema Evolution Modern information

More information

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT EUDAT A European Collaborative Data Infrastructure Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT OpenAire Interoperability Workshop Braga, Feb. 8, 2013 EUDAT Key facts

More information

The Semantic Planetary Data System

The Semantic Planetary Data System The Semantic Planetary Data System J. Steven Hughes 1, Daniel J. Crichton 1, Sean Kelly 1, and Chris Mattmann 1 1 Jet Propulsion Laboratory 4800 Oak Grove Drive Pasadena, CA 91109 USA {steve.hughes, dan.crichton,

More information

Transactive Energy Case Study. Ron Melton, Battelle Pacific Northwest Division

Transactive Energy Case Study. Ron Melton, Battelle Pacific Northwest Division Transactive Energy Case Study Ron Melton, Battelle Pacific Northwest Division 1 Pacific Northwest Demonstration Project What: $178M, ARRA-funded, 5-year demonstration 60,000 metered customers in 5 states

More information

Algebraic Model Management: A Survey

Algebraic Model Management: A Survey Algebraic Model Management: A Survey Patrick Schultz 1, David I. Spivak 1, and Ryan Wisnesky 2 1 Massachusetts Institute of Technology 2 Categorical Informatics, Inc. Abstract. We survey the field of model

More information

XML Applications. Introduction Jaana Holvikivi 1

XML Applications. Introduction Jaana Holvikivi 1 XML Applications Introduction 1.4.2009 Jaana Holvikivi 1 Outline XML standards Application areas 1.4.2009 Jaana Holvikivi 2 Basic XML standards XML a meta language for the creation of languages to define

More information

Course Contents: 1 Business Objects Online Training

IQ Online training facility offers Business Objects online training by trainers who have expert knowledge in the Business Objects and proven record of training hundreds of students Our Business Objects

SEEK: Scalable Extraction of Enterprise Knowledge

Joachim Hammer Dept. of CISE University of Florida 26-Feb-2002 1 Project Overview Faculty Joachim Hammer Mark Schmalz Computer Science Students Sangeetha

On the Integration of Autonomous Data Marts

Luca Cabibbo and Riccardo Torlone Dipartimento di Informatica e Automazione Università di Roma Tre {cabibbo,torlone}@dia.uniroma3.it Abstract We address the

SharePoint Development Web Development Generate from Usage. Cloud Development Windows Development Office Development

Silverlight Tools SharePoint Development Web Development Generate from Usage New WPF Editor Multi-core Development Cloud Development Windows Development Office Development Customizable IDE UI Test Automation

CS425 Fall 2016 Boris Glavic Chapter 1: Introduction

Modified from: Database System Concepts, 6th Ed. See www.db-book.com for conditions on re-use Textbook: Chapter 1 1.2 Database Management System (DBMS)

On using database techniques for generating ontology mappings

Carlos R. Rivero carlosrivero@us.es Inma Hernández inmahernandez@us.es David Ruiz druiz@us.es Rafael Corchuelo corchu@us.es Abstract In the

Expose Existing z Systems Assets as APIs to extend your Customer Reach

Unlocking mainframe assets for mobile and cloud applications Asit Dan z Services API Management, Chief Architect asit@us.ibm.com Insert

Logic and Databases. Lecture 4 - Part 2. Phokion G. Kolaitis. UC Santa Cruz & IBM Research - Almaden

Alternative Semantics of Queries Bag Semantics We focused on the containment problem for conjunctive

Mutalyzer webservices. Jeroen F. J. Laros Leiden Genome Technology Center Department of Human Genetics Center for Human and Clinical Genetics

Introduction Mutalyzer: a curational tool for Locus Specific

Database Management Systems (CPTR 312)

Preliminaries Me: Raheel Ahmad Ph.D., Southern Illinois University M.S., University of Southern Mississippi B.S., Zakir Hussain College, India Contact: Science 116,

Improving Productivity

On Demand Insurance Business Problems 1. We lose customers because we process new policy applications too slowly. 2. Our claims processing is time-consuming and inefficient. 3. We

SEXTANT 1. Purpose of the Application

Sextant has been used in the domains of Earth Observation and Environment by presenting its browsing and visualization capabilities using a number of link geospatial

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

INSPIRE 2010, KRAKOW Dr. Arif Shaon, Dr. Andrew Woolf (e-science, Science and Technology Facilities Council, UK) 3

Fedora Commons: Taking on the Challenge of the Next Generation of Scholarly Communication

Sandy Payette Executive Director Fedora Commons November 7, 2007 DLF, Philadelphia, PA Scholarship and Research

Introduction. What's jorca?

jorca is a Java desktop Client able to efficiently access different type of web services repositories mapping resources metadata over a general virtual definition to support

An Ontology-based Framework for XML Semantic Integration

Isabel F. Cruz ifc@cs.uic.edu University of Illinois at Chicago Huiyong Xiao hxiao@cs.uic.edu University of Illinois at Chicago Feihong Hsu fhsu@cs.uic.edu

Web Ontology Language for Service (OWL-S) The idea of Integration of web services and semantic web

Introduction OWL-S is an ontology, within the OWL-based framework of the Semantic Web, for describing

Data Integration and Data Warehousing Database Integration Overview

Sergey Stupnikov Institute of Informatics Problems, RAS ssa@ipi.ac.ru Outline Information Integration Problem Heterogeneous Information

Interactive Machine Learning (IML) Markup of OCR Generated Text by Exploiting Domain Knowledge: A Biodiversity Case Study

Several digitization projects such as Google books are involved in scanning millions