Hybrid-Type Extensions for Actor-Oriented Modeling (a.k.a. Semantic Data-types for Kepler) Shawn Bowers & Bertram Ludäscher

Similar documents
Workflow Exchange and Archival: The KSW File and the Kepler Object Manager. Shawn Bowers (For Chad Berkley & Matt Jones)

Where we are so far. Intro to Data Integration (Datalog, mediators, ) more to come (your projects!): schema matching, simple query rewriting

2/12/11. Addendum (different syntax, similar ideas): XML, JSON, Motivation: Why Scientific Workflows? Scientific Workflows

Chapter 3 Research Method

Workflow Fault Tolerance for Kepler. Sven Köhler, Thimothy McPhillips, Sean Riddle, Daniel Zinn, Bertram Ludäscher

Kepler: An Extensible System for Design and Execution of Scientific Workflows

Kepler Scientific Workflow and Climate Modeling

The Gigascale Silicon Research Center

Reducing Consumer Uncertainty

Automatic Transformation from Geospatial Conceptual Workflow to Executable Workflow Using GRASS GIS Command Line Modules in Kepler *

Scientific Workflows

System-Level Design Languages: Orthogonalizing the Issues

An Improving for Ranking Ontologies Based on the Structure and Semantics

Contents Contents 1 Introduction Entity Types... 37

ISSN: Supporting Collaborative Tool of A New Scientific Workflow Composition

Scientific Workflow Tools. Daniel Crawl and Ilkay Altintas San Diego Supercomputer Center UC San Diego

Institute of Automatics AGH University of Science and Technology, POLAND. Hybrid Knowledge Engineering.

Features and Requirements for an XML View Definition Language: Lessons from XML Information Mediation

Adding formal semantics to the Web

Semantic Web. Ontology Pattern. Gerd Gröner, Matthias Thimm. Institute for Web Science and Technologies (WeST) University of Koblenz-Landau

A Knowledge-Based Approach to Scientific Workflow Composition

Mapping provenance in ontologies

Accelerating the Scientific Exploration Process with Kepler Scientific Workflow System

The Open Group SOA Ontology Technical Standard. Clive Hatton

Opus: University of Bath Online Publication Store

KEPLER: Overview and Project Status

Review Sources of Architecture. Why Domain-Specific?

a paradigm for the Introduction to Semantic Web Semantic Web Angelica Lo Duca IIT-CNR Linked Open Data:

Dictionary Driven Exchange Content Assembly Blueprints

Framework specification, logical architecture, physical architecture, requirements, use cases.

A(nother) Vision of ppod Data Integration!?

7 th International Digital Curation Conference December 2011

SUMMARY: MODEL DRIVEN SECURITY

Wade Sheldon. Georgia Coastal Ecosystems LTER University of Georgia CUAHSI Virtual Workshop Field Data Management Solutions

Modeling Requirements

Web Services Annotation and Reasoning

Sliding Window Calculations on Streaming Data using the Kepler Scientific Workflow System

XML Systems & Benchmarks

Chapter 8: Enhanced ER Model

Table of Contents. iii

Smart Open Services for European Patients. Work Package 3.5 Semantic Services Definition Appendix E - Ontology Specifications

MDA & Semantic Web Services Integrating SWSF & OWL with ODM

Using Web Services and Scientific Workflow for Species Distribution Prediction Modeling 1

D WSMO Data Grounding Component

Architecture domain. Leonardo Candela. DL.org Autumn School Athens, 3-8 October th October 2010

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

UC Berkeley Mobies Technology Project

Bridging the Gap between Semantic Web and Networked Sensors: A Position Paper

Extension and integration of i* models with ontologies

Semantic Web. Tahani Aljehani

Web Ontology Language for Service (OWL-S) The idea of Integration of web services and semantic web

Using ontologies function management

The Formal Syntax and Semantics of Web-PDDL

The OWL API: An Introduction

Compositional Model Based Software Development

Design Process Overview. At Each Level of Abstraction. Design Phases. Design Phases James M. Bieman

Automatic Annotation of Web Services based on Workflow Definitions

Model driven Engineering & Model driven Architecture

Army Data Services Layer (ADSL) Data Mediation Providing Data Interoperability and Understanding in a

Alexander Haffner. RDA and the Semantic Web

Data Integration and Data Warehousing Database Integration Overview

The Ptolemy II Framework for Visual Languages

Model Driven Dynamic Composition of Web Services Flow for Business Process Integration

Raising the Level of Development: Models, Architectures, Programs

Semantic Interoperability of Dublin Core Metadata in Digital Repositories

Historization and Versioning of DDI-Lifecycle Metadata Objects

Automatic Generation of Workflow Provenance

0.1 Upper ontologies and ontology matching

358 cardinality 18, 55-57, 62, 65, 71, 75-76, 85, 87, 99, 122, 129, , 140, 257, , , 295, categorization 18, 26, 56, 71, 7

Update on 3R Project (RDA Toolkit Restructure and Redesign Project)

Linked Data and RDF. COMP60421 Sean Bechhofer

A Collaborative User-centered Approach to Fine-tune Geospatial

Scientific Workflow, Provenance, and Data Modeling Challenges and Approaches

INTRODUCTION Background of the Problem Statement of the Problem Objectives of the Study Significance of the Study...

edify Requirements for Evidence-based Templates in Electronic Case Report Forms Marco Schweitzer, Stefan Oberbichler

IBM Research Report. Overview of Component Services for Knowledge Integration in UIMA (a.k.a. SUKI)

An Archiving System for Managing Evolution in the Data Web

Provenance Information in the Web of Data

Automated Visualization Support for Linked Research Data

Workshop: Practice of CRM-Based Data Integration

Ontology Links in the Distributed Ontology Language (DOL)

Bridging Versioning and Adaptive Hypermedia in the Dynamic Web

Evolution of XML Applications

ER-to-Relational Mapping

XML Metadata Standards and Topic Maps

Kepler/pPOD: Scientific Workflow and Provenance Support for Assembling the Tree of Life

COMP718: Ontologies and Knowledge Bases

A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS

1. CONCEPTUAL MODEL 1.1 DOMAIN MODEL 1.2 UML DIAGRAM

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 95-96

Reproducible & Transparent Computational Science with Galaxy. Jeremy Goecks The Galaxy Team

UML Fundamental. OutLine. NetFusion Tech. Co., Ltd. Jack Lee. Use-case diagram Class diagram Sequence diagram

Experience in Using a Process Language to Define Scientific Workflow and Generate Dataset Provenance

Workpackage 15: DBE Business Modeling Language. Deliverable D15.5: BML Editor Final Release

Dynamic, Rule-based Quality Control Framework for Real-time Sensor Data

Component-Based Design of Embedded Control Systems

FIBO Metadata in Ontology Mapping

Using RDF to Model the Structure and Process of Systems

The HMatch 2.0 Suite for Ontology Matchmaking

EDEN An Epigraphic Web Database of Ancient Inscriptions

Transcription:

Hybrid-Type Extensions for Actor-Oriented Modeling (a.k.a. Semantic Data-types for Kepler) Shawn Bowers & Bertram Ludäscher University of alifornia, Davis Genome enter & S Dept. May, 2005 Outline 1. Hybrid Types 2. Hybrid Types and Scientific Workflow Design 3. Super Rapid Prototyping: The Sparrow Family of Languages 4. Next Steps: Adding a Hybrid-Type System to Kepler

Hybrid Types Hybrid Types: Superimposing Semantics Separation of oncerns: onventional Data Modeling (Structural Data Types) E.g., XML Schema / DTD, etc. onceptual Data Modeling (Semantic Types) Drawn from ontologies (expressed in Description Logic) apturing domain knowledge (e.g., biodiversity, ecology) Explicit (external) linkages (Semantic Annotations) Simple links (one concept per item) Links expressed as constraints (logical mappings)

Hybrid Types: Superimposing Semantics T1 := table meas Datatypes double a relational table of measurements T2 := list spp a list of s Hybrid Types: Superimposing Semantics SpeciesBiomass Measurement item.species prop.biomass loc.location Species biomass is a measure of the amount of biomass of a particular species within a location. Semtypes SpeciesommBiomass SpeciesBiomass loc.ommunity Species community biomass is a species biomass within a community. T1 := table meas SpeciesommBiomass table/x:meas => X:SpeciesommBiomass double each table/meas instance is a measurement

Hybrid Types: Superimposing Semantics SpeciesBiomass Measurement item.species prop.biomass loc.location Species biomass is a measure of the amount of biomass of a particular species within a location. Semtypes SpeciesommBiomass SpeciesBiomass loc.ommunity Species community biomass is a species biomass within a community. T1 := ommunity table meas SpeciesommBiomass table/x:meas, X/Y:site, X/Z:plot => X:SpeciesommBiomass, =f(y,z), (X,):loc, :ommunity, double Hybrid Types: Superimposing Semantics SpeciesBiomass Measurement item.species prop.biomass loc.location Species biomass is a measure of the amount of biomass of a particular species within a location. Semtypes SpeciesommBiomass SpeciesBiomass loc.ommunity Species community biomass is a species biomass within a community. SpeciesommBiomass T1 := table ommunity meas Species double table/x:meas, X/Y:site, X/Z:plot, X/U:spp => X:SpeciesommBiomass, =f(y,z), (X,):loc, :ommunity, (X,U):item, U:Species,

Hybrid Types: Superimposing Semantics SpeciesBiomass Measurement item.species prop.biomass loc.location Species biomass is a measure of the amount of biomass of a particular species within a location. Semtypes SpeciesommBiomass SpeciesBiomass loc.ommunity Species community biomass is a species biomass within a community. SpeciesommBiomass T1 := table ommunity meas Species Biomass double table/x:meas, X/Y:site, X/Z:plot, X/U:spp, X/B:bm => X:SpeciesommBiomass, =f(y,z), (X,):loc, :ommunity, (X,U):item, U:Species, (X,B):prop, B:Biomass. Hybrid Types: Superimposing Semantics Searching oncept-based, e.g., find all datasets containing biomass measurements Merging/Integrating ombining heterogeneous sources based on annotations oncatenate, Union (merge), Join, etc. Transforming onstruct mappings from schema S1 to S2 based on annotations Semantic Propagation Pushing semantic annotations through transformations/queries

Semantic Annotation Propagation apture I/O constraints Similar to unit type constraints an enable automated metadata creation (annotation propagation ) an help refine ontologies and existing annotations Semantic Annotation Propagation T1 := table meas double Port 1 Annotation table/x:meas, X/Y:site, X/Z:plot, X/U:spp, X/B:bm => X:SpeciesommBiomass, =f(y,z), (X,):loc, :ommunity, (X,U):item, U:Species, (X,B):prop, B:Biomass. p1::t1 p2::t2 seasonal species p3::t3 Actor I/O constraint (approx.) p3.obs(site, plot, spp, bm) :- p1.meas(site, plot, spp, bm), p2.spp(spp). T3 := table Port 2 hased Annotation table/x:obs, X/Y:site, X/Z:plot, obs X/U:spp, X/B:bm => X:SpeciesommBiomass, =f(y,z), (X,):loc, :ommunity, (X,U):item, U:Species, (X,B):prop, B:Biomass. double

Hybrid Types and Scientific Workflow Design Workflow Design Primitives End-to-End Workflow Design and Implementation Viewed as a series of primitive transformations Each takes a WF and produces a new WF an be combined to form design strategies Workflow Design W 0 W 1 W 2 W m W n t t t Workflow Implementation Top-Down Task Driven Data Driven Bottom-Up Structure Driven Output Driven Semantic Driven Input Driven

Workflow Design Primitives Semantic types and Actor Oriented Modeling: Actors and Workflows: can have semantic types conceptually describing their function Ports: can have semantic types conceptually describing what they consume and produce I/O onstraints: a general form of constraint between input and output (e.g., like unit constraints) approximating the function of an actor Basic Design Primitives Inherited from Ptolemy Basic Transformations Starting Workflow Resulting Workflow Resulting Workflow t 1 : Entity Introduction (actor or data connection) t 2 : Port Introduction t 3 : Datatype Refinement (s s, t t) s t s t s t t 4 : Hierarchical Abstraction t 5 : Hierarchical Refinement t 6 : Data onnection t 7 : Director Introduction

Additional (Planned) Design Primitives for Semantic Types Extended Transformations Starting Workflow Resulting Workflow t 9 : Actor Semantic Type Refinement (T T) T T t 10 : Port Semantic Type Refinement (, D D) t 11 : Annotation onstraint Refinement (α α) t 12 : I/O onstraint Strengthening (ψ ϕ ) t 13 : Data onnection Refinement α 1 s ϕ ψ Resulting Workflow D D D D α2 α 1 t s D α2 t α 1 s D α 2 t t 14 : Adapter Insertion t 15 : Actor Replacement f f t 16 : Workflow ombination (Map) f 1 f 2 f 1 f 2 Adapters for Semantic and Structural Incompatibility D D D Adapters may: be abstract (no impl.) 1 2 D 1 1 D 1 2 2 D 2 D be concrete bridge a semantic gap fix a structural mismatch f 1 f [S] 2 f 1 S T [S][S ] map f 2 [T] f 1 f [[S]] 2 f 1 S T [[S]][[S ]] map map f 2 [[T]] be generated automatically (e.g., Taverna s list mismatch ) be reused components (based on signatures)

Applying the Replacement Primitive f D f D D D f general replacement unsafe replacement context-sensitive replacement ( wiggle room ) f D f D f D D D D, overlap (e.g., ) D,D overlap (e.g., D D ) (e.g., ) D D (e.g., D D ) General replacement doesn t consider surrounding connections ontext-sensitive replacement gives more wiggle room by tuning the actors semtypes based on connections Workflow Elaboration Adapter insertion, replacement, and search provide a powerful mechanism for workflow elaboration : 1. Given an initial, user specified set of connected abstract actors 2. Repeatedly search for replacement concrete actors (atomic and composite) 3. At each step, insert adapters when necessary 4. Allow user to select returned workflows to be combined

Super Rapid Prototyping: The Sparrow Family of Languages The Sparrow Family of Language Basic Idea: Have both Machine and Human readable syntax Sparrow-DL Description logic Sparrow-DTD Datatypes, variant of XML DTDs Sparrow-Annotate onfiguring concepts; linking datatypes and ontologies Sparrow-SWF KSW-Based MoML Metadata Sparrow-Rule Fancy stuff, like type constraints (a la unit types), function approximation, and misc. other constraints

The Sparrow Family of Languages Sparrow-DTD Sparrow-SWF Sparrow-DL Sparrow-Toolkit Operations w1 Author Author ISBN ASIN AMSQuote a1 a3 a4 p1:: [] p1:: p2:: int p1:: P2:: {price=, cond=, seller={ }} Sparrow-Toolkit (example) Operations Is w1 semantically and/or structurally well typed? What can be semantically connected to a3? Insert abstract adapter between a3 and a4 What can replace (e.g., implement) the adapter?

Future Steps: Adding a Hybrid-Type System to Kepler urrent and Future Implementation oncept-based Actor Search Implemented as proof-ofconcept About to undergo major revision Additional operations slated for next Kepler Release (data search, actor-based port search, etc.) Biggest hallenges Building/searching a repository Making changes to MoML (see KSW) GUI changes Ontology management Workflow omponents (MoML) urn ids Semantic Annotations instance expressions Ontologies (OWL) Default + Other