Approaches & Languages for Schema Transformation

Similar documents
Conceptual schema matching with the Ontology Mapping Language: requirements and evaluation

Out of the UML box: Intuitive and Data-driven Modelling Tools for INSPIRE

Workshop Data Modelling [en]

Challenges in Geospatial Data Harmonisation:

Framework specification, logical architecture, physical architecture, requirements, use cases.

A5.2-D3 [3.6] HUMBOLDT Processing Components General Model and Implementations

Ontology-based Architecture Documentation Approach

A5.2-D3 [3.5] Workflow Design and Construction Service Component Specification. Eva Klien (FHG), Christine Giger (ETHZ), Dániel Kristóf (FOMI)

Observations and Measurements

7/13/2011. Make Data Available in a standard format through standard services. INSPIRE Initiative. FME 2011: Makes you the Superhero!

MERGING BUSINESS VOCABULARIES AND RULES

Cataloguing GI Functions provided by Non Web Services Software Resources Within IGN

MDA & Semantic Web Services Integrating SWSF & OWL with ODM

DESIGN AND EVALUATION OF A GENERIC METHOD FOR CREATING XML SCHEMA. 1. Introduction

Semantic Web Domain Knowledge Representation Using Software Engineering Modeling Technique

Chapter 8: Enhanced ER Model

Semantic Interoperability. Being serious about the Semantic Web

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web

How to Create a European INSPIRE Compliant Data Specification. Anja Hopfstock, BKG (Germany) Morten Borrebæk, SK (Norway)

Development of Technical Guidance for the INSPIRE Transformation Network Service. "State Of The Art Analysis"

A Loose Coupling Approach for Combining OWL Ontologies and Business Rules

Data Interoperability An Introduction

Closing the INSPIRE Implementation Gap by Contributing to SDI Technology Development

PROJECT PERIODIC REPORT

Extending SOA Infrastructure for Semantic Interoperability

The coastal data in the regional and national territorial data repertory. Genova 24 Aprile 2012 Anna Cerrato Regione Liguria

Model Driven Ontology: A New Methodology for Ontology Development

Design Patterns for Description-Driven Systems

Knowledge Centric Systems Engineering

Automating Geodatabase Creation with Geoprocessing

Rui Reis, Maria José Vale, Marcelo Ribeiro, Bruno Meneses Geospatial World Forum 2016, May 2016, Rotterdam

GeoSmartCity Workshop Katowice October 26 th 2016

International Organization for Standardization Technical Committee 211 (ISO/TC211)

Automated REA (AREA): a software toolset for a machinereadable resource-event-agent (REA) ontology specification

Knowledge Integration Environment

An Architecture for Semantic Enterprise Application Integration Standards

DEVELOPING A NEW GEOGRAPHICAL OBJECT DATABASE. EXPERIENCES FROM IDEA TO DELIVERING DATASETS TOP10NL

Index. Business processes 409. a philosophy of maximum access 486 abstract service management metamodel

DATA VALIDATION AGAINST SCHEMA AND SOURCE DATA

Utility Network Management in ArcGIS: Migrating Your Data to the Utility Network. John Alsup & John Long

Application of the Catalogue and Validator tools in the context of Inspire Alberto Belussi, Jody Marca, Mauro Negri, Giuseppe Pelagatti

INSPIRE-compliant and easy-to-use GeoModel Editor. Jan Schulze Althoff Dr. Christine Giger Prof. Dr. Lorenz Hurni

ELF download services

Ontology-based Model Transformation

Implementing the Army Net Centric Data Strategy in a Service Oriented Environment

Semantic Information Modeling for Federation (SIMF)

ArchiMate symbols for relating system elements

Web Ontology Language for Service (OWL-S) The idea of Integration of web services and semantic web

Use case 1 INSPIRE harmonization of existing Energy Performance Certificate datasets Phase 1

Detailed analysis + Integration plan

Object Oriented Programming

Technical implementation of INSPIRE: feed-back from experiences

Transformative characteristics and research agenda for the SDI-SKI step change:

To Vector, or to Raster? Coverage Processing and Publishing for INSPIRE Annex II/III

HealthCyberMap: Mapping the Health Cyberspace Using Hypermedia GIS and Clinical Codes

Publishing WWII aerial photographs in geographical and library information systems

Future Directions for SysML v2 INCOSE IW MBSE Workshop January 28, 2017

* Corresponding Author

PODS Lite. Technical Overview and Guide

Semantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent

Nathalie Delattre, National Geographic Institute of Belgium

ECLIPSE MODELING PROJECT

Ontology Servers and Metadata Vocabulary Repositories

Semantic Web Infusion Roadmap V1.0 Gap Analysis 1.6

Topology in the Geodatabase: An Introduction

Advanced Parcel Editing. Amy Andis Tim Hodson

Building a missing item in INSPIRE: The Re3gistry

Semantic Infrastructure and Platforms for Geospatial Services: A report from European Projects 4 th International Workshop on Semantic and

Extension and integration of i* models with ontologies

Heterogeneous Missions Accessibility: Interoperability for Earth Observation

Extracting Ontologies from Standards: Experiences and Issues

INSPIRE & Linked Data: Bridging the Gap Part II: Tools for linked INSPIRE data

Validating services and data in an SDI

PUG List. Raster Analysis 3D Analysis Geoprocessing. Steve Kopp

PODS Lite version 1.1. Technical Overview and Guide

Semantic System Integration Incorporating Rulebased Semantic Bridges into BPEL Processes

Land Cover spatial datasets harmonization in Portugal using HALE

The Unified Modelling Language. Example Diagrams. Notation vs. Methodology. UML and Meta Modelling

SeMFIS: A Tool for Managing Semantic Conceptual Models

Envisioning Semantic Web Technology Solutions for the Arts

IT Infrastructure for BIM and GIS 3D Data, Semantics, and Workflows

IHO S-100 Framework. The Essence. WP / Task: Date: Author: hansc/dga Version: 0.6. Document name: IHO S-100 Framework-The Essence

AQD IPR pilot study programme on e-reporting

An Ontology-Based Methodology for Integrating i* Variants

InfraGML. Linking pin between Geo and BIM. 4 november 2014

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY. (An NBA Accredited Programme) ACADEMIC YEAR / EVEN SEMESTER

INSPIRE roadmap and architecture: lessons learned INSPIRE 2017

A Scenario for Business Benefit from Public Data

Lecture Notes UML UNIT-II. Subject: OOAD Semester: 8TH Course No: CSE-802

Semantic Web: Core Concepts and Mechanisms. MMI ORR Ontology Registry and Repository

D WSMO Data Grounding Component

SDI Workshop ESDIN Best Practices INSPIRE conference, Edinburgh. Arnulf Christl, Metaspatial

Reducing Consumer Uncertainty

Unit 6 - Software Design and Development LESSON 4 DATA TYPES

Web apps for INSPIRE: the ELISE Energy pilot example

The Eclipse Modeling Framework and MDA Status and Opportunities

HUMBOLDT Application Scenario: Protected Areas

BSC Smart Cities Initiative

The Plan4business Approach to Transfer Open Data into Real Estate Businesses

Mining the Biomedical Research Literature. Ken Baclawski

Transcription:

Approaches & Languages for Schema Transformation Findings of HUMBOLDT & follow-up Activities INSPIRE KEN Workshop on Schema Transformation Paris, France, 08.10.2013 Thorsten Reitz Esri R&D Center Zurich AG, data harmonisation panel Photo Credit: Petr Kratochvil

The data harmonisation panel 2 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Content What is the problem with Schema Transformation? Requirements A Classification of Approaches Declarative, Procedural Schema-driven, Instance-driven Visual, Textual Conceptual, Logical, Physical Comparison & Conclusion 3 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Schema Transformation Languages are Programming Languages A Schema Transformation project typically includes the following: Design (matching tables, UML) Development (Coding Xquery, building a Pipes-and-Filters Graph) Debugging (analysing ST behaviour) Testing (Validation & Acceptance Testing) Documentation (hopefully!) Iterate through these activities for updated/new datasets, new schemas 4 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

in a high-complexity environment! 5 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

REQUIREMENTS ON SCHEMA TRANSFORMATION (ST) IN INSPIRE 6

Requirements for ST approaches (I) Expressivity Six Levels defined in INSPIRE Schema Transformation Pilot [Beare 2010] 1 Renaming classes and attributes 2 Simple attribute derivation 3 Aggregating input records 4 Complex derivation and dynamic type selection 5 Deriving values based on multiple features 6 Conflation and model generalisation 7 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Requirements for ST approaches (II) Verbosity How many statements are needed to define a mapping? Impacts maintenance, effectiveness Transparency and Traceability What happens in the transformation engine? Can I debug it? Tools/Implementations Performance, Reliability, Usability, 8 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Specific Requirements In INSPIRE Spatial harmonisation Edge Matching, Reference Systems, Multiple Representations Structure References, Aggregates & Composites, Inheritance Quality Metadata, geometric quality, classification accuracy,... Semantics Code Lists, Feature Types 9 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

CLASSIFICATION AND COMPARISON OF APPROACHES 10

Classification by Paradigm Declarative Approaches Describe the logic of a computation without describing its control flow. Leave optimization and actual execution order to the runtime engine. Examples: XSLT, EDOAL/gOML Procedural Approaches Describe a computation by giving its control flow through a series of functions. Examples: Python GeoProcessing Tool, FME Other Paradigms (Rule-based, Functional, Object-oriented, Aspect-Oriented, Agent-based ) 11 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Classification by Abstraction Level Horizontal Transformations Metamodel A Metamodel B OWL UML <<inst ant iat es>> <<inst ant iat es>> Vertical Transformations Conceptual Schema A OWL Ontology <<derived>> Logical Schema A GML App. Schema Conceptual Schema Alignment <<ext ends>> Transformation Definition Conceptual Schema B UML Model <<derived>> Logical Schema B GML App. Schema <<inst ant iat es>> <<derived>> <<inst ant iat es>> Instance Data A Instance Transformation Instance Data B 12 GML GML

Classification by Drivers Instance-driven Approaches Use information available in features (instances) to infer correct transformation functions to use Examples: FCA-Merge, XSLT, FME Schema-driven Approaches Use conceptual schema and define relations based on the elements of the schema Examples: PROMT, COMA++ (Protégé), EDOAL/gOML Combined Approaches Use information available on instance level to enhance the quality of an automatically derived mapping on the extension level 13 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Classification by Representation Graphical Textual 14 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Classification Name Abstraction Schema Language Paradigm(s) Primary Driver XSLT Logical XSD Declarative+ Instance Textual XQuery Logical XSD Procedural Instance Textual Primary Representation EDOAL/OML Conceptual OWL+* Declarative Schema Graphical RIF Conceptual * Declarative Instance Textual QVT/ATL Conceptual UML Both Schema Combined UML-T Conceptual UML Declarative Schema Graphical GeoKettle Logical Internal Procedural Instance Graphical Talend Logical Internal Procedural Instance Graphical FME Logical Internal Procedural Instance Graphical * Independent of a specific logical schema 15 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Comparison 1 Name Tools Domain Expressivity 2 Verbosity 3 Maturity 4 XSLT 10+ Universal + Spatial 80% 1114 5 4 XQuery 10+ Universal + Spatial 96% 392 4 EDOAL/gOML 2 Universal + Spatial 92% 91 3 RIF 3 Universal 80% 414 5 2 QVT/ATL 4+ Universal 76% 117 3 UML-T 1 Universal + Spatial 60% -.- 1 GeoKettle 1 Spatial 96% -.- 3 Talend 1 Universal + Spatial 92% -.- 3 FME 1 Spatial 100% 205 3 1 Research conducted in 2010-2011. 2 Coverage of functions listed in [Beare 2010], Appendix 2 3 Number of statements needed for a set of example projects 4 no. of implementations, user base, iterations, functionality 5 generated XSLT/RIF 16 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

CONCLUSION & DISCUSSION 18

Conclusion (I) What makes ST hard? Language Expressiveness & Complexity Most approaches are Turing complete you can build a First Person Shooter using them if you want to Schema complexity & data volume, variance Tooling Debugging/Inspection Limited collaboration Unconnected Documentation No maintainability or collaboration features 19 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Conclusion (II) - Selecting approaches How well does an approach cover all phases in my ST project? If only some steps are covered, which issues are introduced through combination of multiple approaches? How well can I maintain my ST project? As with any software, a project outcome will loose value quickly if it cannot be maintained How can I document my ST project? How do I explain to others what tricks I used to get the approach to do what I wanted? 20 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Discussion & Questions Are we happy with the standards and de-facto standards? Limited evidence of re-use and exchange of Schema Transformation projects Disagree with any of the classification/comparison items? Contact me! 21 Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013

Happy end Read the data harmonisation panel to learn more: http://blog.dhpanel.eu Contact the presenter: tr@xsdi.eu / treitz@esri.com Thorsten Reitz INSPIRE KEN Workshop, Paris, 08.10.2013