Industrial Convergence and Semantic Interoperability

Similar documents
Semantic Metadata Tapping the Potential of Semantic Ontologies

ASSURING DATA INTEROPERABILITY THROUGH THE USE OF FORMAL MODELS OF VISA PAYMENT MESSAGES (Category: Practice-Oriented Paper)

David S. Frankel Lead Standards Architect Model Driven Systems SAP Labs

MDA Journal. BPMI and OMG: The BPM Merger A BPT COLUMN. David S. Frankel Lead Standards Architect - Model Driven Systems SAP Labs.

elements) and on the structure and representation of the information (i.e. the message format).

Model Driven Message Interoperability (MDMI): an Object Management Group (OMG) Standard

The Benefits of Strong Authentication for the Centers for Medicare and Medicaid Services

A Better Approach to Leveraging an OpenStack Private Cloud. David Linthicum

E-Guide WHAT WINDOWS 10 ADOPTION MEANS FOR IT

Chapter 1. Storage Concepts. CommVault Concepts & Design Strategies:

The Business Value of Metadata for Data Governance: The Challenge of Integrating Packaged Applications

E-Commerce Integration Meta-Framework Introduction (ECIMF-Intro) CEN/ISSS/WS-EC/ECIMF. Draft, version 0.3 November 28, 2001

Promoting Component Architectures in a Dysfunctional Organization

Losing Control: Controls, Risks, Governance, and Stewardship of Enterprise Data

Workshop: Modeling Scenarios Using Data. Fiona Charles BCS SIGIST 22-Sept-2009

developer.* The Independent Magazine for Software Professionals

MDA Journal. XML and MDA A BPT COLUMN. David S. Frankel. February 2005

APPLICATION OF A METASYSTEM IN UNIVERSITY INFORMATION SYSTEM DEVELOPMENT

A Beginners Guide to UML Part II

IBM Research Report. Model-Driven Business Transformation and Semantic Web

MDA Journal. Toward a Business Process Platform A BPT COLUMN. David S. Frankel Lead Standards Architect - Model Driven Systems SAP Labs.

idrm: Fixing the broken interface between design and manufacturing

Creating Ontology Chart Using Economy Domain Ontologies

doi> Digital Object Identifier

NIS Directive : Call for Proposals

Integrated Access Management Solutions. Access Televentures

1 Executive Overview The Benefits and Objectives of BPDM

Service-Oriented Architecture

Building Better Interfaces: HL7 Conformance Profiles

Where's the Beef from Enterprise Structured Content

4D CONSTRUCTION SEQUENCE PLANNING NEW PROCESS AND DATA MODEL

Alan J. Perlis - Epigrams on Programming

Models versus Ontologies - What's the Difference and where does it Matter?

CATCH ERRORS BEFORE THEY HAPPEN. Lessons for a mature data governance practice

The Bizarre Truth! Automating the Automation. Complicated & Confusing taxonomy of Model Based Testing approach A CONFORMIQ WHITEPAPER

Real Wireframes Get Real Results

Architecture Viewpoint Template for ISO/IEC/IEEE 42010

Global Reference Architecture: Overview of National Standards. Michael Jacobson, SEARCH Diane Graski, NCSC Oct. 3, 2013 Arizona ewarrants

Definition of Information Systems

Automate Transform Analyze

Report. Conceptual Framework for the DIAMONDS Project. SINTEF ICT Networked Systems and Services SINTEF A Unrestricted

WYSIWON T The XML Authoring Myths

Computer Aided Draughting and Design: Graded Unit 1

Software Review: Ruby Tabulation Software

Paying. on the go: Mobile payments slowly catch on in the United States

Category Theory in Ontology Research: Concrete Gain from an Abstract Approach

ETNO Reflection Document on the EC Proposal for a Directive on Network and Information Security (NIS Directive)

CAPACITY PLANNING FOR THE DATA WAREHOUSE BY W. H. Inmon

The Analysis and Proposed Modifications to ISO/IEC Software Engineering Software Quality Requirements and Evaluation Quality Requirements

FOCUS on Intrusion Detection: Re-synchronizing a NIDS Page 1 of 6

A Study of Future Internet Applications based on Semantic Web Technology Configuration Model

SOME TYPES AND USES OF DATA MODELS

Software Design Models, Tools & Processes. Lecture 6: Transition Phase Cecilia Mascolo

NEXT-GENERATION DATACENTER MANAGEMENT

A2A EAI. Overview and recommendations Data Transport. Jerome CAPIROSSI. people are keen to argue since they find themselves to be affected.

Foundations. The Golden Record is Not Enough: The Case For Data Orchestration. CPDAs Highly Recommend

Understanding Managed Services

High Performance Computing Prof. Matthew Jacob Department of Computer Science and Automation Indian Institute of Science, Bangalore

Effective Threat Modeling using TAM

Dictionary Driven Exchange Content Assembly Blueprints

Building Better Parametric Cost Models

SEPARATING FACT FROM FICTION MICROSOFT & POWER MANAGEMENT

IBM AND THE FUTURE OF SMART IT. February 26, 2008

NTAF IN PRACTICE. White Paper

Oracle Tuxedo. CORBA Technical Articles 11g Release 1 ( ) March 2010

Rational Software White paper

Advances In Data Integration: The No ETL Approach. Marcos A. Campos, Principle Consultant, The Cognatic Group. capsenta.com. Sponsored by Capsenta

Domain-Driven Development with Ontologies and Aspects

Storage Key Issues for 2017

Bachelor of Arts Program in Information Science

E-Guide CLOUDS ARE MORE SECURE THAN TRADITIONAL IT SYSTEMS -- AND HERE S WHY

The task or context will be familiar and involve few variable aspects. The techniques used will be familiar or commonly undertaken.

Protect Your Data the Way Banks Protect Your Money

Cisco Collaboration Optimization Services: Tune-Up for Peak Performance

PROTOCOL ENGINEERING BY HARTMUT KöNIG

INCOGNITO TOOLKIT: TOOLS, APPS, AND CREATIVE METHODS FOR REMAINING ANONYMOUS, PRIVATE, AND SECURE WHILE COMMUNICATING, PUBLISHING, BUYING,

Panda Security 2010 Page 1

Semantic Information Modeling for Federation (SIMF)

Response to the. ESMA Consultation Paper:

When Milliseconds Matter. The Definitive Buying Guide to Network Services for Healthcare Organizations

Even amidst the enforced frugality of a recession,

News English.com Ready-to-use ESL / EFL Lessons

KDD 10 Tutorial: Recommender Problems for Web Applications. Deepak Agarwal and Bee-Chung Chen Yahoo! Research

Presentation for the MARC Format Interest Group at the ALA Midwinter Meeting January 21, 2012 Kelley McGrath

Today s competitive marketplace is placing extraordinary demands upon customer service organizations, sales teams and call centers.

SAPtips. Journal. Creating a Well-Developed Master Data Management Solution in BW. August/September 2005 Volume III Issue 4. SAPtips.

Authentication Work stream FIGI Security Infrastructure and Trust Working Group. Abbie Barbir, Chair

A Simple Case Study of a Grid Performance System

Standard Business Rules Language: why and how? ICAI 06

INTERNATIONAL STANDARD

Enabling Performance & Stress Test throughout the Application Lifecycle

DEVELOPMENT OF A VISUAL TOOL FOR DYNAMIC SIMULATION PROGRAM HVACSIM+ *2 Yamatake Building Systems Co., Ltd. Tokyo , Japan

UML enabling the Content Framework

Digital Libraries at Virginia Tech

EBOOK THE BEGINNER S GUIDE TO DESIGN VERIFICATION AND DESIGN VALIDATION FOR MEDICAL DEVICES

Practical Model-Driven Development with the IBM Software Development Platform

WHITE PAPER. ENSURING SECURITY WITH OPEN APIs. Scott Biesterveld, Lead Solution Architect Senthil Senthil, Development Manager IBS Open APIs

THE IMPACT OF SECURITY ON APPLICATION DEVELOPMENT. August prevoty.com. August 2015

MODEL COMPLAINTS SYSTEM AND POLICY THE OMBUDSMAN'S GUIDE TO DEVELOPING A COMPLAINT HANDLING SYSTEM

Benefits and Costs of Structured Data. August 11, Secretary Securities and Exchange Commission 100 F Street, NE Washington, DC

Transcription:

Industrial Convergence and Semantic Interoperability Syntactic Interoperability... 1 From Simple to Complex... 2 Where is the Semantic Knowledge?... 2 Semantic Interoperability... 3 Technology That Can Help... 4 Industrial Convergence...4 Afterword... 5 Up to now, the software industry has managed to avoid facing up to the low level of semantic interoperability in our data processing systems. Current macro economic trends requiring more sharing of data across vertical markets threaten to push our systems to the breaking point if we do not address this shortcoming soon. This article takes a close look at this problem, and explains why there is reason to be cautiously optimistic that we can get a handle on it. Syntactic Interoperability In order to explain what semantic interoperability is all about, I find it useful to first explain the notion of syntactic interoperability, because syntactic interoperability is a necessary precondition for semantic interoperability. Syntactic interoperability is the ability of a set of components to understand the syntax of the communication that flows among them. For example, consider what happens when a client sends a message to a general ledger service, requesting that a debit of $200 be posted to account number 4601. Syntactic interoperability guarantees that, when the client specifies that 200 is the dollar amount and that 4601 is the account number, the general ledger system does not misinterpret the message and think that 4601 is the dollar amount and 200 is the account number. Syntactic interoperability also makes it possible for the client and the service to agree that 256.77 is not a valid account number, because the account number must be an integer. The computer industry has made good progress achieving syntactic interoperability. Various distributed computing protocols, such as Web Services, CORBA, and so on, make it possible for components written in different programming languages and running over different operating systems on different machines to communicate with a shared understanding of the syntax of the data that they exchange during the course of their communications. 1

From Simple to Complex However, we don t always have the luxury of dealing with messages as simple as our example, containing just an account number and amount. Industry groups and standards bodies define messages containing considerably more numerous and complex data elements. For example, in the financial services industry, an industrial strength debit transfer message contains multiple blocks of data, including a block of data usually called something like Debtor Party. The Debtor Party block includes multiple data elements, including one that is named something along the lines of Identification_Issuer_Name_Proprietary. 1 Conceptually, this data element represents the name of the issuer of a proprietary identification for a debtor party. The element incorporates a number of semantic concepts reflected by the terms identification, issuer, name, proprietary, debtor, and party. In addition to those fine-grained concepts, it incorporates the following coarser-grained concepts that combine the more granular concepts: Debtor party Proprietary identification Issuer name Identification Issuer Thus, the data element in question is based on a combination of these semantic concepts. Where is the Semantic Knowledge? In current practice, software developers hard-wire their understanding of these semantic concepts into programs that construct and deconstruct a debit transfer message. This semantic knowledge is buried deep in the code and is not manifest in any machine-readable metadata. Of course, there are multiple formats for debit transfers in use within the financial services industry. Issuing new message formats does not mean that system managers rip out and replace applications that use older formats, which is exceedingly costly. Furthermore, various domains within the finance industry deal with debits, and each of their associated standards bodies defines different message formats. Thus, there are multiple message definitions that come into play in a single financial transaction. As messages cross subsystem boundaries while moving through the financial supply chain, a debit transfer message encoded in one format often has to be translated to a message in another format. So the old problem arises: How do we map one format to another? The task of mapping one message format to another falls to a human analyst. The formats that the analyst must map are often lengthy and complicated. Some messages have scores of complex fields. The human analyst does the best she can. But the amount of help that our computer systems provide to help her figure out what the mapping should be is quite limited. State of the art data mapping tools display both formats on the screen and allow the analyst to graphically draw connections and write expressions to indicate what should map to what according to what rules. These tools also are good at generating translation code, once the analyst has figured out what the mapping should be and has entered the rules into the tool. But as far as these mapping tools are concerned, our example complex data element simply has an opaque name (Identification_Issuer_Name_Proprietary), a data type (Text), and a cardinality (zero to one). There is no metadata about the underlying semantic concepts that the mapping tool can grab onto in order to provide guidance as to what should map to what. If you think that it should be easy for a person to work out how to map a field in one complex format to another just by visually inspecting the formats, you probably have never tried to do it. Not only is it enormously time consuming, but it also is error prone. Subtle mistakes that occur in 1 This example is loosely based on one of the ISO 20022 payment messages. Copyright 2006 David S. Frankel. All Rights Reserved. www.bptrends.com 2

data transformations at subsystem boundaries within the financial supply chain are common and cost the involved parties serious money. Even if our erstwhile analyst is fortunate enough to have access to good documentation of the message formats written in English or in some human language in which she is fluent, the sheer size and complexity of the formats means the process is fraught with opportunities for honest mistakes. Semantic Interoperability These mistakes are a consequence of a low degree of semantic interoperability. Semantic interoperability is the ability to coordinate the functioning of a set of components to coordinate their functioning based on a shared understanding of the meaning of the communications that flow among them. In our debit transfer example, the communications involve the passing of messages, as well as the transformation of those messages from one format to another. The level of semantic interoperability is based on the degree to which components at various points in the financial supply chain have a common understanding of the meaning of the data that flows through the chain. The reason that syntactic interoperability is a necessary precondition for semantic interoperability is that it is not possible to agree on the meaning of a piece of data if the parties to a financial transaction cannot even agree on whether the data is a dollar amount, an integer. or a piece of text. As I have pointed out, message definitions currently in use in industry today have little if any machine readable metadata that manifests the underlying semantic concepts. Therefore, whatever degree of semantic interoperability that our data processing systems possess relies on hard-wired code and on the best efforts of human data mapping analysts who must wade through the complexity largely unassisted. The lower the degree of semantic interoperability, the more friction in the system. Studies indicate that a one percent reduction in such friction in electronic payment systems could add a few percent to global GDP. 2 Even if such projections are too optimistic by several fold, it should still be evident that quite modest gains in semantic interoperability could produce substantial returns. Thus, in attacking this problem, we don t have to strive for fully automated semantic interoperability. Complete automation of all mapping decisions is probably not attainable, certainly not in the foreseeable future. Humans will have to be involved in these decisions. Moreover, even a goal of, say, 50 percent automation may be too high to shoot for at this time, given the technical challenges and the resulting high cost of achieving such a target. But we can certainly do better than we are doing today, and the evidence indicates that moderate improvements will be well worth the effort. If message definitions for electronic commerce include the right kind of metadata, then each data element essentially carries a semantic map of itself that reveals, in a structured way, the underlying concepts on which the element is based. Tools can use this metadata to provide a reasonable degree of assistance to analysts trying to work out the right data mappings. Such tools can analyze the elements internal semantic maps and suggest (not dictate) to the analyst what the proper mappings to other elements might be. This approach semi-automates the mapping. It also is helpful to the analyst to be able to read an element s internal semantic map as she carries out the non-automated aspects of the mapping process. This approach is all the more effective to the extent that data elements underlying semantic concepts are reused from libraries of semantic concepts that have been defined and registered by a duly constituted registration authority. The more that the same concepts start showing up in different elements internal semantic maps, the more that tools can find probable mapping 2 Joseph N. Bugajski, Response to Payments RFI, Visa International Payments Association, August 22, 2004, OMG document finance/04-08-07. Copyright 2006 David S. Frankel. All Rights Reserved. www.bptrends.com 3

relationships. Of course, there are issues with getting different communities aligned around common concepts; that is one of the key reasons why, realistically, we expect only modest results. Incidentally, service-oriented systems benefit as much as message-oriented systems from the semi-automation because they pass messages around too. Moreover, the electronic payment domain on which my examples focus is only one of many e-commerce domains suffering from friction that could be partially alleviated. Technology That Can Help The technologies exist to make limited yet useful advances in semantic interoperability. From the standpoint of technical feasibility, therefore, it is now a matter of engineering to work out how to use these technologies together in a way that scales and provides practical improvements. The UN/CEFACT Core Components (CCTS) initiative and Semantic Web technologies, along with generic modeling tools, give us the basic toolkit we need. 3 The tougher challenge has been getting industry groups who define electronic commerce interoperability standards to recognize the semantic interoperability problem. However, progress is in the air, due to the pressure of inexorable economic forces. Industrial Convergence Low levels of semantic interoperability have generated substantial economic friction for some time now, but the increasing overlap among vertical markets is severely exacerbating the problem to the point where a number of industry standards groups are starting to genuinely understand that we can no longer afford to ignore the issue. It is bad enough when there is only a low degree of semantic interoperability among a single standards body s own published messages, where all of those messages target the same vertical market. It is of an order of magnitude more difficult to deal with a paucity of semantic interoperability when messages devised for different vertical markets need to work together, because it is more difficult to find analysts who understand multiple markets. The economy is rife with examples of the breakdown of traditional borderlines among vertical markets. Here are just a few sample data points: Today, 33 percent of the components in an automobile are high tech components. So if you think that RosettaNet messages for high tech have no bearing on the automotive industry, think again. IBM recently spent $745 million to acquire Telelogic. Telelogic s model-driven development tools target the embedded systems space. The walls are coming down between the embedded systems and enterprise systems worlds, and IBM is voting that way with its wallet. HIIMMS, the Healthcare Information and Management Systems Society, is spearheading an initiative to put together a health information exchange in Southeast Michigan. Major stakeholders include the Big 3 automobile companies, which are feeling a particularly acute amount of pain due to health care costs. Financial institutions that are directly involved in healthcare payments are also stakeholders. This initiative could have ramifications beyond the Detroit area. In an economy that is gradually blurring the barriers between vertical industries, the demands for interoperability increase dramatically. Keeping up with the message formats for multiple markets 3 For those who are familiar with these technologies: Some ontology experts have expressed reservations about certain aspects of CCTS. However, there are some new developments already in motion that I believe can overcome these problems. I will say more about that in future editions of MDA Journal. Copyright 2006 David S. Frankel. All Rights Reserved. www.bptrends.com 4

and understanding how they interrelate is daunting. This specter is putting semantic interoperability on industry radar screens. It is too early to declare victory, but the pain level has reached a threshold such that we who have been pushing for better semantic interoperability are getting a hearing. Quite a number of vertical market groups have embarked upon or are seriously considering semantic interoperability initiatives. Now it is time for us to execute. Afterword In future issues of MDA Journal, I will examine the technologies that hold out hope of modest but significant improvements in semantic interoperability, looking at their strengths and weaknesses and how best to use them. I will also describe some of the semantic interoperability initiatives in progress in industry. Additionally, I will consider the impact of semantic interoperability on Business Process Management. David Frankel is Lead Standards Architect for Model-Driven Systems at SAP Labs. He has over 25 years of experience as a programmer, architect, and technical strategist. He is the author of the book, Model-Driven Architecture : Applying MDA to Enterprise Computing. He also is lead editor of the book The MDA Journal. He served several terms as a member of the Architecture Board of the Object Management Group (OMG), the body that manages the MDA standards, and he has co-authored a number of industry standards. Recently he has been publishing and speaking about the role of model-driven systems in enterprise SOA and has been promoting the Business Process Expert community at www.sap.bpx.com. Copyright 2006 David S. Frankel. All Rights Reserved. www.bptrends.com 5