Web services and workflow creation

Size: px
Start display at page:

Download "Web services and workflow creation"

Transcription

1 Web services and workflow creation Internal Version: 2 Editors: Marc Kemps-Snijders

2 WG2.7 Web services and workflow creation 2

3 Web services and workflow creation CLARIN-2010-D2R-7b EC FP7 project no Deliverable: D2R-7b - Deadline: T36 Responsible: Peter Wittenburg Contributing Partners: MPI Contributing Members: all rights reserved by MPI for Psycholinguistics on behalf of CLARIN WG2.7 Web services and workflow creation 3

4 Scope of the Document This document describes the goals and requirements of web services and workflow systems that could be used by all CLARIN members and beyond, i.e. a functioning system could be used by other communities as well. Stepwise all CLARIN centers would need to introduce these requirements in their operational environment to come to a proper landscape of resources, services and tools where various instances can and will be created/operated at various places. This document will be discussed in the appropriate working groups and in the Executive Board. It will be subject of regular adaptations dependent on the progress in CLARIN. CLARIN References Centers Types CLARIN February 2009 Persistent and Unique Identifiers CLARIN February 2009 Centers CLARIN February 2009 Language Resource and Technology Federation CLARIN February 2009 Metadata Infrastructure for Language Resource and Technology CLARIN February 2009 Report on web services CLARIN March 2009 Requirement specification web services and workflow Systems CLARIN June 2009 Integration of LR into web service infrastructure CLARIN-2009-D5R-3a December 2009 WG2.7 Web services and workflow creation 4

5 Contents Contents Executive Summary Introduction Terminology Definitions Acronyms Related Documents Goals and activities Architecture overview Introduction Wrapper architecture CSB architecture Models Class model Sequence diagram Interface Algorithms Unit level test cases Bibliography WG2.7 Web services and workflow creation 5

6 Executive Summary In this document we want to specify a detailed design on the part of the CLARIN infrastructure that deals with web service invocations and metadata and provenance information. It identifies all collaborating components and sub systems and provides the necessary base line information that is needed for the implementation process. This documents builds on the earlier Web Service and Workflow Requirements specification document [CLARIN ] and takes into account the following requirements that have been laid down there: 1. The CLARIN infrastructure is an open infrastructure and must provide ease of use of integrating existing web services into the CLARIN landscape. 2. The CLARIN infrastructure must be able support multiple communication protocols. SOAP and REST are considered the main communication protocols for web services, although the infrastructure should be open to extensions to other protocols as well if the need arises. 3. The CLARIN infrastructure must be able to support multiple transport protocols. Although the current web services primarily communicate via HTTP, the need may arise to support other protocols as well. An example of this is FTP for file transfer. 4. Each web service must be registered at a CLARIN compliant web service registry. All service metadata is to be registered according to the CLARIN MetaData Infrastructure (CMDI) guidelines, that is metadata is to be provided in stand-off XML documents using the CLARIN MetaData model. This is obligatory in CLARIN for all activities. 5. Functional metadata of resources and services is to be described through the data categories specified in the metadata profile of ISOcat. Amongst others this will cater for profile matching. 6. The result of each web service invocation must be associated with CLARIN metadata. This metadata is to be provided in an automated manner through the metadata component. CLARIN must make a standard basic metadata component available which is capable of delivering at least the basic level of required CLARIN metadata to reduce the amount of work needed to integrate existing web services into the CLARIN infrastructure. 7. CLARIN must ensure that users have access to temporal workspaces where they can carry out their operations and store created temporary data in a temporary fashion. 8. Each web service invocation needs to be associated with provenance data. Provenance data must be stored through a provenance component. CLARIN must make a basic standard component available which is capable of storing provenance data in a uniform manner to reduce the amount of work needed to integrate existing web services into the CLARIN infrastructure. 12. At first instance a wrapper functionality will be provided within CLARIN for test purposes and prototyping. But CLARIN will also need to study the CSB solution which will consist of action code, configuration files and library dependencies. This may be packaged as a single deployable unit. The packaging and deployment strategy needs to be worked out. WG2.7 Web services and workflow creation 6

7 1. Introduction This document describes detailed design of all components associated with the invocation of web services and metadata and provenance data generation. It takes into account the requirements as described in the Web Services and Workflow requirements document[clarin ]. Every resource in CLARIN is described using metadata following the CMDI recommendation. Similarly, each web service in CLARIN is also described using CMDI 1, using a specialized metadata profile for web services. For each resource that is produced as a result of a web service invocation, as done in NLP pipelines, the resource must be associated with appropriate metadata and is initially stored in a user s private workspace. Association with metadata is key to provide ease of use of the CLARIN centers and allows users to quickly publish their resources to a designated center without the need to create metadata from scratch. Provenance data is needed for reproducibility and traceability of the results. Two possible architectures are discussed, wrapper and CSB, which have previously been introduced in the afore mentioned CLARIN requirements document. These have in common that they both play the role of ServiceDelegate as is introduced in this document. 1 WG2.7 Web services and workflow creation 7

8 2. Terminology 2.1. Definitions AAI [Stanica 2006] Authentication and Authorization infrastructure An infrastructure that provides Authentication and Authorization Services. The minimum service components include Identity and Privilege Management with respect to users and resources. Archive [CiTER] repository dedicated to the long-term preservation of the associated data IP [Stanica 2006] Identity provider An entity in an AAI that performs Identity Management. Metadata [Guenter 2004] structured information that describes, explains, locates, and otherwise makes it easier to retrieve and use an information resource. Metadata registry [Guenter 2004] registry a formal system for the documentation of the element sets, descriptions, semantics, and syntax of one or more metadata schemes Provenance data Information that provides a traceable record of the origin and source of a resource Resource [Berners-Lee 2005] The term "resource" is used in a general sense for whatever might be identified by a URI. Familiar examples include an electronic document, an image, a source of information with a consistent purpose (e.g., "today's weather report for Los Angeles"), a service (e.g., an HTTP-to-SMS gateway), and a collection of other resources. A resource is not necessarily accessible via the Internet; e.g., human beings, corporations, and bound books in a library can also be resources. Likewise, abstract concepts can be resources, such as the operators and operands of a mathematical equation, the types of a relationship (e.g., "parent" or "employee"), or numeric values (e.g., zero, one, and infinity). Repository [CiTER] facility that provides reliable access to managed digital resources SOA [Mackenzie 2006] Service Oriented architecture A paradigm for organizing and utilizing distributed capabilities that may be under the control of different ownership domains. It provides a uniform means to offer, discover, interact with and use capabilities to produce desired effects consistent with measurable preconditions and expectations. SP [Stanica 2006] Service provider An entity that provides access to a service based on federated authentication. Web service [Brown 2004] A web service is a software system designed to support interoperable machine-to-machine interaction over a network. It has an interface described in a machine-processable format. WG2.7 Web services and workflow creation 8

9 Workflow[Wulong 2001] Workflow is a term used to describe the tasks, procedural steps, organizations or people involved, required input and output information, and tools needed for each step in a business process 2.2 Acronyms Reference Abbreviation of Link [APA] Alliance for Permanent Access [ARK] Archival Resource Key [BPEL] Business Process Execution Language [CGN] Corpus Gesproken Nederlands [CLARIN] Common Language and Technology Infrastructure [DAM-LR] Distributed Access Management for Language Resources [DC] Dublin Core [DCAM] [DCR] Data Category Registry [DC-DS-XML] [DC-TEXT] [DEISA] Distributed European Infrastructure for Supercomputing Applications [DFKI] Deutsche Forschungszentrum für Künstliche Intelligenz [DFKITR] Deutsche Forschungszentrum für Künstliche Intelligenz Tool Registry [DFN] Deutsches Forschungsnetz [DOBES] [DOBES] Dokumentation Bedrohter Sprachen [DOI] Digital Object Identifier [D-SPIN] Deutsche Sprachressourcen-Infrastruktur [EAD] Encoded Archival Description, chival_description&oldid= [ebxml] e-business XML [EGEE] Enabling Grids for E-sciencE [EGI] European Grid Initiative [EAGLES] Expert advisory Group on Language Engineering Standards [e-irg] e-infrastructure Reflection Group [ELDA UC] Universal Catalogue [ENABLER] [ESF] European Science Foundation Second Learner Study =PA1&lpg=PA1&dq=esf+Second+learner+perdue&s ource=bl&ots=wki3guqqp6&sig=n7qswy3stxvd 06nMfAzY7GBbm9w&hl=de&sa=X&oi=book_result& resnum=3&ct=result#ppp1,m1 [FIDAS] Fieldwork Data Sustainability Project [HS] Handle System [ICONCLASS] [INTERA] Integrated European language data Repository Area [ISOcat] WG2.7 Web services and workflow creation 9

10 [LAF] Linguistic Annotation Framework [LIRICS] Linguistic Infrastructure for Interoperable Resources and Systems [LMF] Lexical Markup Framework [MAF] [METATAG] [METS] [MILE] [MPEG7] [NLSR] [OAIS] [OASIS] Morhosyntactic Annotation Framework Metadata Encoding and Transmission Standard Open Archival Information System Organization for the Advancement of Structured Information Standards nt&oldid= n_system [ODD] One Document Does all [OLAC] Open Language Archives Community [PMH] Protocol for Metadata Harvesting [REST] Representational State Transfer _arch_style.htm [SCHEMAS] [SHIBBOLETH] Shibboleth [SIMPLESAML]SimpleSAMLphp [SOAP] Simple Object Access Protocol [SRU] Search/Retrieve via URL [SRW] Search/Retrieve Web Service vice 7&oldid= [SYNAF] Syntactic Annotation Framework [TEI] Text Encoding Initiative [UDDI] Universal Description Discovery and Integration [WADL] Web Application Description Language [WSDL] Web Services Description Language [Z39.50] WG2.7 Web services and workflow creation 10

11 2.2 Related Documents [CLARIN_NEWS_4] CLARIN news letter Dec [CLARIN_WS_NOTE] CLARIN note on web services [D-SPIN_PRES] D-SPIN workshop report and Presentations [CLARIN_MD_SHRT] CLARIN Component Metadata Shortguide [CLARIN ] [CLARIN ] [CLARIN ] [CLARIN ] [CLARIN ] [CLARIN ] Centers Types CLARIN February documents/web-service-presentations-at-the-wp2- workshop-in-oxford ShortGuide.pdf Persistent and Unique Identifiers CLARIN February Centers CLARIN February Language Resource and Technology Federation CLARIN February 2009 Metadata Infrastructure for Language Resource and Technology CLARIN February 2009 Report on web services CLARIN March WG2.7 Web services and workflow creation 11

12 3. Goals and activities CLARIN is devoted to build a persistent integrated and interoperable infrastructure that will facilitate the access and the combination of language resources and tools/web services for those researchers that are working with language material in some form, in particular of course the humanities and social sciences. To achieve this goal CLARIN started a number of different activities. At the integration level all technologies that allow users to make resources visible and to bring them together are in focus: federation technologies, persistent identifiers, metadata, registries, workspaces and portals. At the interoperability level we are faced with the problems of exploiting the virtually integrated resources and tools/services, i.e. typically issues such as structural and semantic interoperability need to be solved to allow users for example to build and execute short and long chains of operations which we will call workflow chains. All integration aspects which are largely independent of the internal structure and semantics of resources and tools have been dealt with already by other requirements specification documents and are currently being worked out in prototypical form. They are normative for all CLARIN activities. This design document deals with the interoperability aspects as they occur in service oriented architectures. Here we want to give two examples for such scenarios: a structured search operating across a virtual collection of heterogeneously tagged and encoded annotations for example requires to have a mapping between the related tag and encoding sets; the creation of a workflow chain of operations requires that the input resource(s) need to provide the format and encoding that the following operation can work on. In this document we want to specify the design of a number of key components in the service oriented architecture which CLARIN is envisaging. This includes aspects that have to do with resources, service invocation, metadata and provenance. While work package 5 will deal with these issues in detail from the linguistic point of view, work package 2 is focusing on the computer science aspects. The design aspects introduced in this document are generic in nature and as such not tied to specific linguistic encoding formats. The aspects covered in this document will however continuously need be revised against any further requirements that come out of work package 2 or 5 and may result in updated versions of this document. WG2.7 Web services and workflow creation 12

13 4. Architecture overview 4.1 Introduction The CLARIN requirements specification for web services and workflow[clarin ] provides a description of the interactions involved in web service invocation. The following characteristics are considered of importance: Invocation of web services is considered to be data driven, i.e. users will generally supply a resource they want to invoke a service on and are interested in (part) of the information that results from the service invocation. In CLARIN all resources are associated with separate metadata which may link to provenance data if resources are the result of previous processing steps. After service invocation a new resource is created which is again to be associated with appropriate metadata. After service invocation the metadata description of a resource must contain appropriate provenance information containing sufficient information to allow reinvocation of the service in the future. Gathering of metadata and provenance data are crucial elements of service invocations to provide well described, traceable resources. The ability to generate metadata and provenance data for service requests relies on the following components to be part of the CLARIN infrastructure: Metadata generation component, generates metadata for the generated resources Provenance data generation component, generates provenance data for the service invocation Transformers, transforms data between different resource formats, tag sets and annotation level types. Transformers From the CLARIN deliverable Integration of LR into web service infrastructure it becomes clear that there is a wide spread in output formats for the services made available within CLARIN. Proprietary formats remain to be developed, even within the CLARIN framework and it is unforeseeable for the near future that a quick convergence to standard pivot formats can be achieved. And although the development of the proprietary TCF format in the D-Spin project has produced some interesting results in terms of integrating services from a number of partners there are currently no signs which indicate widespread adoptance of this strategy. As a consequence point to point conversions present the only viable transformation scenario available for the upcoming constriction phase. Although this may not be seen as problematic given the amount of services available this significantly adds to the overhead and development effort to prepare resources generated from one service to be usable as input for another one and will continue to rise as the number of service is increasing. To make one service interoperable with all others transformers will need to be produced for each service which is capable of producing resources that can be consumed by the service and also for each service which may consume the produced result. As presented in the CLARIN Requirements for web services and workflow deliverable[clarin ], use of a pivot model greatly reduces the amount of transformers needed to support all possible transformation scenarios in the service infrastructure. For the purpose of this document transformers are either considered to be individual web services, meaning that they behave as any web service and are subject to the metadata and provenance strategies. Alternatively, they are considered to be integrated as part of a web service, when invoking a web service the transformation is considered to be carried out as part of a web service. Given this, transformers do not need to be considered specifically in this document. The component diagram shown below shows the structural relations between the various components that play a role in this part of the infrastructure. WG2.7 Web services and workflow creation 13

14 Figure 1: Component diagram for web service invocations. Each component may both provide and require interfaces to other components. An interface is an abstraction of one or more methods and zero or more attributes that should define a cohesive set of behaviors. Provided interfaces are modeled using the lollipop notation and required interfaces are modeled using the socket notation. ServiceDelegate The ServiceDelegate acts as a service abstraction. It provides and abstraction and thus hides the implementation of the WebService and other associated components to external clients. It also reduces the coupling between the client and all components involved. The ServiceDelegate receives messages from external clients, invokes the intended web service on their behalf, takes care that for the resulting resource(s) appropriate metadata is generated and that relevant provenance data is generated. The communication diagram below displays the interactions between the components in more detail. PIDResolver The PIDResolver is capable of returning the location of a (metadata) resource of a given PID, which is assumed to be a URI in the figure above. This is a standard web service that must be provided by any of the PID providers such as the centers associated with EPIC (European Persistent Identifier Consortium). EPIC will use handles which may be resolved using the resolver located at Resource repository This component describes a Resource Repository which within CLARIN will typically be of Type A, B or C. For centers of type C the resources must be accessible through the metadata descriptions, but access to the resources may only be descriptive in nature [CLARIN ]. For the Resource Repository described in this document it is assumed however that resources are machine accessible through the locations described in the metadata description which in some cases limits the interaction possibilities for the C type centers. The Resource Repository will thus provide an machine interface for accessing and retrieving resources. Access to the resource may be restricted due to authorization restrictions set on the resource. These are maintained by the resource provider and may be determined by the resource provider on the basis of the authentication details of the user provided upon login to the CLARIN infrastructure. Workspace repository The role of the Workspace repository component is to store resources that are associated with a user s work in progress and as such is not part of any of the archives yet. These resources may be transient in nature or may in time be persisted as part of any of the center s archives in the infrastructure. Metadata mut be WG2.7 Web services and workflow creation 14

15 associated to each of the resources in the Workspace repository. The Workspace Repository may be associated with other types of information as well that are related a researcher s private workspace, such as tool preferences. Detailed design of the Workspace component is however not within the scope of this document. As an initial approach it is assumed that one workspace will be assigned to each user so that the workspace may be associated with the identity of the user which is established upon login to the infrastructure. Metadata repository The Metadata repository represents the central repository where all metadata records for all centers are harvested. As described in [CLARIN ] these records will be harvested as CMDI metadata records trough the OAI-PMH protocol. Metadata documents are freely accessible by any requesting party. Metadata component The purpose of the metadata component is to generate metadata for the resources which are produced through service invocations. Given a single resource serving as input for a service invocation the metadata component must decide whether to generate a brand new metadata description for the resulting resource or reuse and extend the metadata description of the original resource. The distinction to be made is basedon the metadata of the WebService through the /AnnotationStandoff/ data category (see figure 3). This data category indicated whether the annotation created by the service is produced in an inline or standoff manner with the understanding that standoff implies that the resulting annotation is stored as a separate resource. In case of an inline annotation the metadata of the original resource may be preserved and extended with information relevant for the service invocation, such as the /AnnotationLevelType/ 2 produced by the service. Provenance component The purpose of the Provenance component is to provide information on the source of the resource, i.e. all relevant information that is used to trace the origin of the resource and its methods of production. Provenance data accounts for proper acknowledgement of the sources of the resource and must record sufficient information to allow for reproducibility of the resource. For web services the minimum amount of information to be recorded is which service and method is being invoked and the parameters used to invoke the service method with. Web Service This component describes a generic Web Service which, in real life, may be a corpus cleaning tool, a POS tagger or any other type of web service that is available within the CLARIN infrastructure. The metadata descriptions of these web services are stored in the Metadata Repository and are described in CMDI. The figure below displays the structure of the technical metadata of the proposed CMDI profile. This CMDI profile may be further extended with descriptive elements intended for human end users. This includes information such as the organization responsible for the web service, contact persons or descriptions of the functionality of the web service. The CMDI profile in figure 3 has been validated against the D-Spin metadata descriptions which makes it possible to provide a profile matching strategy as currently implemented in the D-Spin project. The Metadata component described above relies heavily on this profile since it will use it to extract information on the resources generated by the web service. 2 The /AnnotationLevelType/ specifies the types of annotation levels provided by the resource. WG2.7 Web services and workflow creation 15

16 Figure 2: Collaboration diagram for web service invocations. The collaboration pattern between these components is as follows: 1. The Web Service client wants to invoke a WebService and sends a message to the ServiceDelegate to invoke the service on his behalf. The message contains basic information on the web service to invoke, the operation, parameter information and possibly a reply address in case the request is an asynchronous one. The message format can be based on the WS-Addressing[WSAddressing] specification. 2. The WebService to invoke is encoded as a PID pointing to the metadata description of the web service. The PIDResolver will resolve this PID to a location of the metadata description. The PIDResolver will in fact be invoked multiple time during the process, that is, whenever a PID needs to be resolved. For clarity reasons the interaction with the PIDResolver is only shown onece in this diagram. 3. The metadata for the WebService can be retrieved using the location as returned by the PIDResolver. Metadata descriptions of web services will always be stored in the central metadata repository. Web services will be described using the CMDI metadata specification for web services as shown in figure 3. From this metadata description the ServiceDelegate can extract the list of parameters that are expected to be found for the operation to perform. Parameters specifications may contain references to resources, these will contain TechnicalMetadata information describing the types of resource that are expecte. In the message that is sent by the client references to resources are not made directly, but rather through the metadata PID of the resource. This allows the ServiceDelegate to retrieve the associated metadata of the resource and perform a last minute check on the TechnicalMetadata characteristics of the resource against those which are expected for the given parameters. WG2.7 Web services and workflow creation 16

17 Figure 3: CMDI metadata profile for web services. 4. From the message the PIDs of the metadata descriptions of resources will be extracted. The PIDresolver is used to provide the exact locations of the metadata descriptions for each of the resources. This metadata may either be located at the central metadata repository and resource metadata and associated resources may either be obtained from the metadata repository and resource repository or from the Workspace repository, depending whether the resource is one from a recognized resource center or from a user s private workspace. A1. The metadata resides in a CLARIN center. The metadata repository is requested to provide the metadata of the WebService. A2. The resource PID is read from the metadata, the PIDResolver is contacted to obtain the location of the resource and the resource is requested from the Resource repository. alternatively: B1 The metadata resides in a user s workspace and is requested from that workspace B2 The resource PID is read from the metadata, the PIDResolver is contacted to obtain the location of the resource and the resource is requested from the Workspace Repository. Metadata and resources are thus retrieved in a uniform manner using the location information supplied by the PIDResolver. 5. The endpoint of the WebService is extracted from the service s metadata. For each of the parameters, the metadata PIDs of the resources are substituted by the contents of the corresponding resource.next, the service is invoked and the result is returned to the ServiceDelegate. 6. The ServiceDelegate requests the metadata component to compose a metadata description for a resulting resource. For the construction of this, the Metadata component will use the WebService metadata description which provides TechnicalMetadata for the output parameters. Metadata construction is based on the principle that if the TechnicalMetadata for a resulting resource (of a paremeter) indicates that the resource is a standoff annotation in a separated resource(!) as indicated through the /annotationstandoff/ data category then a brand new metadata description for the resource is to be generated. The TechnicalMetadata component is copied in its entirety into the new metadata description. If /annotationstandoff/ indicates that the that the annotations are added to the same file, then the metadata description of the original resource is taken and only those information fragments from the TechnicalMetadata are added that do not already appear in the resource s original TechnicalMetadata section. This in particular pertains to the /AnnotationLevel/ WG2.7 Web services and workflow creation 17

18 information( see figure 4) which encodes which linguistic concepts are presented in the annotation file. For a tokenized document the TechnicalMetadata might look like: Figure 4: Example TechnicalMetada specification 7. The Provenance component is instructed to compose the provenance data for the resulting resource. This may consist of a copy of the original message that was sent by the client. This message contains all necessary information to invoke the same procedure again at a later stage. 8. Once the metadata and provenance data for the resulting resource has been obtained the metadata is stored in the user s private workspace. 9. The resulting resource is also stored in the user s private workspace Finally, the result as obtained from the WebService component is returned to the Web service client. The primary components of focus for this design are the ServiceDelegate, Metadata Component and the Provenance Component. Although the interfaces through which they interact with other components are subject of this design and places requirements on the design of these interfaces the detailed design of these other components falls outside the scope of this document. 4.2 Wrapper architecture One way to implement the ServiceDelegate component described above is to create a wrapper, encapsulating the web service. This means that for each web service an individual wrapper must be created and must itself be exposed as a web service. There is not direct interaction with the encapsulated web services by clients, but only through the exposed wrapper interface. It is therefore the wrapper that is described in the web service registry, rather than the native service. 4.3 CSB architecture The CSB architecture is a CLARIN specific implementation of an Enterprise Service Bus. The term Enterprise Service Bus refers here to the middleware design pattern, but also to ESB product such as JBossESB or Mule. The CSB will play the role of the ServiceDelegate as described in this document, Provenance and Metadata components are inserted into the CSB. IBM defines the ESB pattern as follows[hutch 2005] WG2.7 Web services and workflow creation 18

19 In the ESB pattern, rather than interacting directly, participants in a service interaction communicate through a bus that provides virtualization and management features that implement and extend the core definition of SOA. The IBM ESB pattern provides virtualization of: Location and identity: Participants need not know the location or identity of other participants. For example, requesters don't need to be aware that a request could be serviced by any of several providers. Service providers can be added or removed without disruption. Interaction protocol: Participants need not share the same communication protocol or interaction style. A request expressed as SOAP/HTTP may be serviced by a provider that only understands Java Remote Method Invocation (RMI). Interface: Requesters and providers don't need to agree on a common interface. The ESB reconciles differences by transforming request messages into a form expected by the provider. Qualities of (Interaction) Service (QoS): Participants declare their QoS requirements, including performance and reliability, authorization of requests, encryption/decryption of message contents, automatic auditing of service interactions, and how their requests should be routed (such as to available implementations based on workload distribution criteria). Policies that describe the QoS requirements and capabilities of requesters and providers may be fulfilled services themselves or fulfilled by the ESB compensating for mismatches. Figure 5: IBM's basic ESB pattern. In the initial CLARIN implementation not all of the points in IBM s definition will be covered at once. Depending upon the CLARIN needs functionality needs to be added to the ESB. Off the shelf ESB products such as JBossESB provide a standard web service invocation mechanisms that are able to invoke both SOAP services and Http end points. These are however configured to contact specific end points which are configured in the ESB setup. To be usable for CLARIN these mechanism need to be customized to allow invocation of any service in the CLARIN registry without the need to reconfigure the ESB for each service individually. It is expected that the invocation mechanisms can be easily modified to work with the CLARIN registry Since the ESB exposes its own end point and as such is not tied specifically to any particular web service, contrary to the wrapper described above, all information of the web service to invoke must be carried in a standard message format. WS-Addressing provides transport-neutral mechanisms to address web services and messages. An example fragment is shown below. Figure 6: Example fragment WS-addressing Using the information carried in the message, the CSB is able to locate the requested service ( as indicated in the To field) from the service registry and invoke the requested action(see Action field). Optionally a replyto may be used for asynchronous messaging. WG2.7 Web services and workflow creation 19

20 5. Models This section describes all classes and their relations. 5.1 Class model Figure 7: Class diagram for ServiceDelegate The class diagram shows the different relations of the ServiceDelegate to its surrounding components. The classes ServiceDelegate, Metada component, Provenance component and interface IMetadata component and IProvenance component all are part of the same sub system. The other classes are external systems which are used by the ServiceDelegate. The interfaces shown here describe the types of interaction with these systems in more detail and provide the only dependencies to these systems. The ServiceDelegate accesses the WebService to invoke the requested service operations on behalf of the client. Interaction with the metadata repository and resource repository are based on access to metadata and resources using the resolved PID URIs of these. Interaction with the PIDResolver only requires access to a method that is capable of resolving a PID to a URI, i.e. the location of a metada document or resource. Interaction with the Workspace repository only requires that it is possible to store metadata documents WG2.7 Web services and workflow creation 20

21 and resources there. In addition, access to these may be requested be requested from the Workspace repository in the same manner as indicated for Metadata repository and Resource repository to access and use resources for further processing purposes. Interfaces are described in more detail in chapter Sequence diagram The diagram below shows the sequence diagram for the interactions between the various components. Figure 8: Sequence diagram for ServiceDelegate interaction The individual steps for this sequence diagram are as follows: 1. The Web service client invokes the ServiceDelegate. The ServiceDelegate assesses which service to invoke. In case a wrapper achitecture is used, the ServiceDelegate will be targeted at one specific web service, in the CSB approach the web service to invoke must be obtained from the contents of the message sent to the CSB. Inc ase WS-Addressing is used to encode this in the message format the To element will contain the PID pointint to the web service. In either case however, the next few steps will be dedicated to retrieving the metadata description for the service. 2. The location of the metadata document for the web service to invoked is retrieved from the PIDResolver. 3. With the location of the metadata document, the metadata document is requested from the Metadata repository. 4. In this step the parameter information is gathered. For a wrapper architecture parameters can be directly exposed parameters of the external wrapper service interface. For the CSB these must be extracted from the message sent by the Web service client. WG2.7 Web services and workflow creation 21

22 Each of the parameters must also be described in the Parameter components of the CMDI Web Service Profile obtained in step 2. In the next steps these parameters are evaluated further. 5. If a parameter is associated with a resource then the CMDI Web Service profile will contain a TechnicalMetadata component describing the resource characteristics that are expected for the given parameter. This may be matched against the corresponding parameter sent by the Web service client. Resources are however not to b sent directly by the client. Rather, the PIDs of the resource s metadata are sent. In this step, the location for the metadata documents for each of these resources is requested though the PIDResolver. 6. The metadata documents for the resources are retrieved here. These metadata documents each contain TechnicalMetadata component as well. This provides the possibility for the ServiceDelegate to check whether the resource requirements for operating the web service are met by the resources. If, for example, a web service specifies that it expects a document of /characterencoding/ = UTF-8 and a /mimetype/ = text/xml then these datacategories should at least be represented in the TechnicalMetadata of the resource s metadata description. 7. Next, the WebService itself is invoked and the result is returned. The result is captured by the ServiceDelegate. 8. From the WebService metadata s output parameter descriptions those parameter descriptions are evaluated which indicate the presence of a resource there. These may be matched with the result obtained from the previous step and identify the resources in the return message. It is assumed at this point that the full resource is part of the return message, allthough this may be extended to include URI s, for example, as well. If a web service employs this strategy then it must be mentioned in the WebService s metadata. If the TechnicalMetadata of the output parameter indicates that it uses a /annotationstandoff/ strategy, by which is meant that resources are annotated in a standoff manner in a seperate file, then for each of the resources identified in the result a new Metadata document may be created. The TechnicalMetadata characterizing each of these resources can be duplicated in full from corresponding the TechnicalMetadata descriptions of the WebService s output parameters. If the output parameter indicates that it is not standoff then this means that the metadata from the resource at the input can be extended with the TechnicalMetadata information specified for the output parameter. There may be cases however where multiple resources serve as input for a web service and an output parameter declares to be /annotationstandoff/. Here, a decission needs to be made which of the input resources will serve as the basis, i.e. to which resource are the annotations added. In these cases the relation between input and output parameter must be expressed explicitly to make it possible to determine which of the metadata documents of the input parameter resources is to be used as the basis of the metadata document of the a resulting resource. 9. The resulting resources are stored in the Workspace repository. As a consequence each will be assigned a PID which is added to the metadata description of the resources. 10. The metadata documents of the resulting resources are stored in the Workspace repository. Each of these will be assigned a PID which will be gathered. WG2.7 Web services and workflow creation 22

23 11. One of the last steps in the process involves the creation of the necessary provenance data. One of the most important pieces of information to be maintained is service identifier and the parameters used to invoke the WebService with. As indicated before, the service identifier corresponds to the WebService s metadata PID. For a wrapper this refers to the metadata descrption of the web service it encapsulates, in the CSB architecture this may be obtained from the message information sent to the ESB. To ensure consist encoding of the provenenance information service identifier and parameter information are to be encoded the same for both architectures. It makes sense to use the same encoding used in the CSB architecture to transfer the message from the Web service client to the CSB, which serves as a ServiceDelegate in this case. WS-addressing appears to be usable for this, which means that the provenance data will consist of WS-Addressing fragment describing the input. It also makes sense to store the result information as part of the provenance data. When added to the metadata of a generated resource it provides information on the results of all the output parameters that were produced using the web service, not only the parameter which produced this resource. This makes it possible not only to determine whether a resource may be reproduced using a specific service, but it also allows to determine whether other output parameters produced by the service are still the same. The format of this information can again be derived from the input format used to store provenance input information, e.g. WS-addressing, but is more limited in scope. Here, only parameter information needs to be stored. The replyto field of the original message already identifies the receiver of the result. Another aspect is that the information on the resources is to be encoded using the metadata PID in the provenance data. If complete resources are passed back to the client as part of the result then the amount of information that is then stored as part of the provenance data wold become enormous. At the same time, the resulting resources have already been complemented with the necessary metadata and stored in the user s private workspace. So, the provenance data for the output parameters becomes limited in size by using these PIDs. If a user at some point decides to remove a resource from their workspace then the situation occurs that a the results of a future reinvocation can no longer be compared to the newly generated resource. If reprodicibilty of results is an issue then these resources must be maintained persistently, for example by moving them into an established archive that is part of the CLARIN infrastructure. Finally, the provenance data may be extended with additional information, for example QoS(Quality of Service) information, such as execution time, as measured by the Service Delegate or exception information that was raised during service invocation. 12. After the provenance data has been added to the metadata of each of the resources the metadata documents are restored in the Workspace repository. At this point no version control on the metadata doccuments is expected. It is also assumed here that provenance data is embedded in the metadata document. Alternatively an external provenance store may be used and a PID for locating the provenance data may be added to the metadata document 13. The result is returned to the Web service client. Note that the result that is being returned to the client is the same as if the client had invokded the WebService directly. WG2.7 Web services and workflow creation 23

24 In the sequence diagram shown above it is assumed that the interaction is synchronous in nature. In many cases however, the interaction is expected to be asynchonous. This may be dealt with using appropriate callback handlers exposed by the ServicedDelegate which processes incoming (asynchonous) response, performs steps 9 through 12 and returns the result to a callback handler exposed by the WebService client. A more detailled discussion on this may be found in chapter Interface This section describes the implementation of the interfaces. IPIDResolver The IPIDResolver interface describes an interface that makes it possible to resolve a specified PID to a (URI) location. Although it is realized at this stage that the signature of the interface may not correspond directly to the IResourceStorage The IResourceStorage is an interface providing the necessary functionality to store a resource. The only component for which this applies is the Workspace repository which will store a resource in a user s private workspace. Once AAI is in place the user s identity can be attained, but this currently remains an open issue. IMetadataStorage The IResourceStorage interface provides the necessary functionality to store metadata documents. The only component to which this applies is the Workspace repository. As with resources, metadata documents are stored in the user s private workspace which depends upon the possibility to retrieve the user s identity. IMetadata component The IMetadata component interface provides the interface that will construct metadata documents for created resources. The information that needs to be passed are the WebService s metadata description and the metadata descriptions of the original resources involved as parameters in the process, i.e. if the /annotationstandoff/ mode of the WebService is false. IProvenance component The IProvenance component interface provides the necessary functionality to construct provenance data based on the message content supplied by the client and the result of the service invocation. 7. Algorithms This section describes the algorithms used in more detail which have not already been describedd elsehere in the document. Asynchonous messaging The ability to handle asynchronous messaging is one of the key requirements for the ServiceDelegate. Assuming WS-addressing is used as the message format, an example excerpt from a message is shown below. Here, the MessageID provides a message identifier that may be used to uniquely identify the message. This is supplied by the client. The From field provides the source reference point and indicates whereh the message came from. The To field provides the destination URI and indicates where the message should go to. Here, as well as in the From field, the URI (hdl:1839/..) points to the metadata document of the service. Action conveys WG2.7 Web services and workflow creation 24

25 the action related to the message, This can be used to identify the operation to be invoked upon receiving a request message and here points to the Operation section in the service s metadata document. The ReplyTo field provides the reply end point reference and indicates where the reply message is to be sent to. If not present, the message is usually sent back to the endpoint the request orginated from. When the ServiceDelegate receives this message it will call the service identified in the To and Action fields. It will insert a callback service of its own to allow for post processing and identfy itself as the caller in the From field. The resulting message may look like this: The MessageID in this message represents an internal messageid maintained by the ServiceDelegate which allows it to retrieve the original message. Additionally, a RelatesTo field may be added to this message to convey the ID of the original message, along with the relationtype, but this is optional. The WebService being called upon will return the result back to the ServiceDelegate on the end point reference that it exposes in the ReplyToField. When the ServiceDelegate receives the result it will contain the messageid sent earlier allowing for retrieval of the original message, which may have been persisted in a local datastore. In general, asynchronous messages should be stored persistently since the response time is not known in advance. After the necessary post processing for metadata and provenance data generation, the recipient of the result as specified in the original message s ReplyTo field can be contacted with the result. 8. Unit level test cases This section provides a description of unit level test cases. The system can be tested against against a skeleton web service which takes an arbitrary document as input and simply returns the document as is which will act as the WebService component described in this document. The PIDResolver will can also be implemented as a simple skeleton resolver which only returns the PID. The PID for testing is taken to be a file location identifiying the metadata document or resource. The metadata documents for the resources and service are prepared by hand. These should, as a minimum, only include the TechnicalMetadata information. The system can be tested by sending small messages into the ServiceDelegate which subsequently invokes the interaction patterns that have been discussed in this document. Special attention is given here to the /annotationstandoff/ datacategory of the WebService s TechncialMetadata controlling the metadata creation process. WG2.7 Web services and workflow creation 25

A first set of integrated web services

A first set of integrated web services A first set of integrated web services Milestone D2.R7.1 2010-02-12 Version: 2 Editors: Marc Kemps-Snijders www.clarin.eu WG2.7 A first set of integrated web services 2 A first set of integrated web services

More information

Implementing a Ground Service- Oriented Architecture (SOA) March 28, 2006

Implementing a Ground Service- Oriented Architecture (SOA) March 28, 2006 Implementing a Ground Service- Oriented Architecture (SOA) March 28, 2006 John Hohwald Slide 1 Definitions and Terminology What is SOA? SOA is an architectural style whose goal is to achieve loose coupling

More information

Introduction to Web Services & SOA

Introduction to Web Services & SOA References: Web Services, A Technical Introduction, Deitel & Deitel Building Scalable and High Performance Java Web Applications, Barish Service-Oriented Programming (SOP) SOP A programming paradigm that

More information

(9A05803) WEB SERVICES (ELECTIVE - III)

(9A05803) WEB SERVICES (ELECTIVE - III) 1 UNIT III (9A05803) WEB SERVICES (ELECTIVE - III) Web services Architecture: web services architecture and its characteristics, core building blocks of web services, standards and technologies available

More information

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT EUDAT A European Collaborative Data Infrastructure Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT OpenAire Interoperability Workshop Braga, Feb. 8, 2013 EUDAT Key facts

More information

Introduction to Web Services & SOA

Introduction to Web Services & SOA References: Web Services, A Technical Introduction, Deitel & Deitel Building Scalable and High Performance Java Web Applications, Barish Web Service Definition The term "Web Services" can be confusing.

More information

ActiveVOS Technologies

ActiveVOS Technologies ActiveVOS Technologies ActiveVOS Technologies ActiveVOS provides a revolutionary way to build, run, manage, and maintain your business applications ActiveVOS is a modern SOA stack designed from the top

More information

Agent-Enabling Transformation of E-Commerce Portals with Web Services

Agent-Enabling Transformation of E-Commerce Portals with Web Services Agent-Enabling Transformation of E-Commerce Portals with Web Services Dr. David B. Ulmer CTO Sotheby s New York, NY 10021, USA Dr. Lixin Tao Professor Pace University Pleasantville, NY 10570, USA Abstract:

More information

Common Language Resources and Technology Infrastructure REVISED WEBSITE

Common Language Resources and Technology Infrastructure REVISED WEBSITE REVISED WEBSITE Responsible: Dan Cristea Contributing Partners: UAIC, FFGZ, DFKI, UIB/Unifob The ultimate objective of CLARIN is to create a European federation of existing digital repositories that include

More information

Next-Generation SOA Infrastructure. An Oracle White Paper May 2007

Next-Generation SOA Infrastructure. An Oracle White Paper May 2007 Next-Generation SOA Infrastructure An Oracle White Paper May 2007 Next-Generation SOA Infrastructure INTRODUCTION Today, developers are faced with a bewildering array of technologies for developing Web

More information

Services Oriented Architecture and the Enterprise Services Bus

Services Oriented Architecture and the Enterprise Services Bus IBM Software Group Services Oriented Architecture and the Enterprise Services Bus The next step to an on demand business Geoff Hambrick Distinguished Engineer, ISSW Enablement Team ghambric@us.ibm.com

More information

Oracle SOA Suite 10g: Services Orchestration

Oracle SOA Suite 10g: Services Orchestration Oracle University Contact Us: 01 800 214 0697 Oracle SOA Suite 10g: Services Orchestration Duration: 5 Days What you will learn This course deals with the basic concepts of Service Orchestration (SOA)

More information

Managing very large Multimedia Archives and their Integration into Federations

Managing very large Multimedia Archives and their Integration into Federations Managing very large Multimedia Archives and their Integration into Federations Daan Broeder, Eric Auer, Marc Kemps-Snijders, Han Sloetjes, Peter Wittenburg, Claus Zinn 1 1 Max-Planck-Institute for Psycholinguistics,

More information

Managing Learning Objects in Large Scale Courseware Authoring Studio 1

Managing Learning Objects in Large Scale Courseware Authoring Studio 1 Managing Learning Objects in Large Scale Courseware Authoring Studio 1 Ivo Marinchev, Ivo Hristov Institute of Information Technologies Bulgarian Academy of Sciences, Acad. G. Bonchev Str. Block 29A, Sofia

More information

Integration Framework. Architecture

Integration Framework. Architecture Integration Framework 2 Architecture Anyone involved in the implementation or day-to-day administration of the integration framework applications must be familiarized with the integration framework architecture.

More information

Realisation of SOA using Web Services. Adomas Svirskas Vilnius University December 2005

Realisation of SOA using Web Services. Adomas Svirskas Vilnius University December 2005 Realisation of SOA using Web Services Adomas Svirskas Vilnius University December 2005 Agenda SOA Realisation Web Services Web Services Core Technologies SOA and Web Services [1] SOA is a way of organising

More information

Using JBI for Service-Oriented Integration (SOI)

Using JBI for Service-Oriented Integration (SOI) Using JBI for -Oriented Integration (SOI) Ron Ten-Hove, Sun Microsystems January 27, 2006 2006, Sun Microsystems Inc. Introduction How do you use a service-oriented architecture (SOA)? This is an important

More information

Incorporating applications to a Service Oriented Architecture

Incorporating applications to a Service Oriented Architecture Proceedings of the 5th WSEAS Int. Conf. on System Science and Simulation in Engineering, Tenerife, Canary Islands, Spain, December 16-18, 2006 401 Incorporating applications to a Service Oriented Architecture

More information

Nuno Freire National Library of Portugal Lisbon, Portugal

Nuno Freire National Library of Portugal Lisbon, Portugal Date submitted: 05/07/2010 UNIMARC in The European Library and related projects Nuno Freire National Library of Portugal Lisbon, Portugal E-mail: nuno.freire@bnportugal.pt Meeting: 148. UNIMARC WORLD LIBRARY

More information

Enterprise SOA Experience Workshop. Module 8: Operating an enterprise SOA Landscape

Enterprise SOA Experience Workshop. Module 8: Operating an enterprise SOA Landscape Enterprise SOA Experience Workshop Module 8: Operating an enterprise SOA Landscape Agenda 1. Authentication and Authorization 2. Web Services and Security 3. Web Services and Change Management 4. Summary

More information

METADATA INTERCHANGE IN SERVICE BASED ARCHITECTURE

METADATA INTERCHANGE IN SERVICE BASED ARCHITECTURE UDC:681.324 Review paper METADATA INTERCHANGE IN SERVICE BASED ARCHITECTURE Alma Butkovi Tomac Nagravision Kudelski group, Cheseaux / Lausanne alma.butkovictomac@nagra.com Dražen Tomac Cambridge Technology

More information

Persistent and unique Identifiers

Persistent and unique Identifiers Persistent and unique Identifiers 2009-2-4 Version: 4 Editors: Daan Broeder, Malte Dreyer, Marc Kemps-Snijders, Andreas Witt, Marc Kupietz, Peter Wittenburg www.clarin.eu The ultimate objective of CLARIN

More information

FLAT: A CLARIN-compatible repository solution based on Fedora Commons

FLAT: A CLARIN-compatible repository solution based on Fedora Commons FLAT: A CLARIN-compatible repository solution based on Fedora Commons Paul Trilsbeek The Language Archive Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands Paul.Trilsbeek@mpi.nl Menzo

More information

Topics on Web Services COMP6017

Topics on Web Services COMP6017 Topics on Web Services COMP6017 Dr Nicholas Gibbins nmg@ecs.soton.ac.uk 2013-2014 Module Aims Introduce you to service oriented architectures Introduce you to both traditional and RESTful Web Services

More information

(Some) Standards in the Humanities. Sebastian Drude CLARIN ERIC RDA 4 th Plenary, Amsterdam September 2014

(Some) Standards in the Humanities. Sebastian Drude CLARIN ERIC RDA 4 th Plenary, Amsterdam September 2014 (Some) Standards in the Humanities Sebastian Drude CLARIN ERIC RDA 4 th Plenary, Amsterdam September 2014 1. Introduction Overview 2. Written text: the Text Encoding Initiative (TEI) 3. Multimodal: ELAN

More information

D-SPIN. D-SPIN Report 2.1: Formation of Centres

D-SPIN. D-SPIN Report 2.1: Formation of Centres D-SPIN D-SPIN Report 2.1: Formation of Centres June 2009 D-SPIN, BMBF-FKZ: 01UG0801B Deliverable: R2.1: Formation of Centres Responsible: Peter Wittenburg Editors: Peter Wittenburg 2 Contents 1. Introduction...4

More information

Metadata and Encoding Standards for Digital Initiatives: An Introduction

Metadata and Encoding Standards for Digital Initiatives: An Introduction Metadata and Encoding Standards for Digital Initiatives: An Introduction Maureen P. Walsh, The Ohio State University Libraries KSU-SLIS Organization of Information 60002-004 October 29, 2007 Part One Non-MARC

More information

C exam. IBM C IBM WebSphere Application Server Developer Tools V8.5 with Liberty Profile. Version: 1.

C exam.   IBM C IBM WebSphere Application Server Developer Tools V8.5 with Liberty Profile. Version: 1. C9510-319.exam Number: C9510-319 Passing Score: 800 Time Limit: 120 min File Version: 1.0 IBM C9510-319 IBM WebSphere Application Server Developer Tools V8.5 with Liberty Profile Version: 1.0 Exam A QUESTION

More information

Metadata Infrastructure for Language Resources and Technology

Metadata Infrastructure for Language Resources and Technology Metadata Infrastructure for Language Resources and Technology 2009-02-04 - Version 5 Editors: Daan Broeder, Bertrand Gaiffe, Maria Gavrilidou, Erhard Hinrichs, Lothar Lemnitzer, Dieter Van Uytvanck, Andreas

More information

Workshop on Web of Services for Enterprise Computing

Workshop on Web of Services for Enterprise Computing Workshop on Web of Services for Enterprise Computing Fujitsu Submission v0.2 Authors: Jacques Durand Tom Rutt Hamid BenMalek Acknowledgements: Masahiko Narita Paul A. Knapp 1. The Great Divide The fundamental

More information

Goal: Offer practical information to help the architecture evaluation of an SOA system. Evaluating a Service-Oriented Architecture

Goal: Offer practical information to help the architecture evaluation of an SOA system. Evaluating a Service-Oriented Architecture Evaluating a Service-Oriented Architecture Paulo Merson, SEI with Phil Bianco, SEI Rick Kotermanski, Summa Technologies May 2007 Goal: Offer practical information to help the architecture evaluation of

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 4, Jul-Aug 2015

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 4, Jul-Aug 2015 RESEARCH ARTICLE OPEN ACCESS Multi-Lingual Ontology Server (MOS) For Discovering Web Services Abdelrahman Abbas Ibrahim [1], Dr. Nael Salman [2] Department of Software Engineering [1] Sudan University

More information

Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure

Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure Twan Goosen 1 (CLARIN ERIC), Nuno Freire 2, Clemens Neudecker 3, Maria Eskevich

More information

National Identity Exchange Federation. Terminology Reference. Version 1.0

National Identity Exchange Federation. Terminology Reference. Version 1.0 National Identity Exchange Federation Terminology Reference Version 1.0 August 18, 2014 Table of Contents 1. INTRODUCTION AND PURPOSE... 2 2. REFERENCES... 2 3. BASIC NIEF TERMS AND DEFINITIONS... 5 4.

More information

CLARIN s central infrastructure. Dieter Van Uytvanck CLARIN-PLUS Tools & Services Workshop 2 June 2016 Vienna

CLARIN s central infrastructure. Dieter Van Uytvanck CLARIN-PLUS Tools & Services Workshop 2 June 2016 Vienna CLARIN s central infrastructure Dieter Van Uytvanck CLARIN-PLUS Tools & Services Workshop 2 June 2016 Vienna CLARIN? Common Language Resources and Technology Infrastructure Research Infrastructure for

More information

Semantic SOA - Realization of the Adaptive Services Grid

Semantic SOA - Realization of the Adaptive Services Grid Semantic SOA - Realization of the Adaptive Services Grid results of the final year bachelor project Outline review of midterm results engineering methodology service development build-up of ASG software

More information

D-SPIN Report R2.2b: The German Resource Landscape and a Portal

D-SPIN Report R2.2b: The German Resource Landscape and a Portal D-SPIN Report R2.2b: The German Resource Landscape and a Portal February 2010 D-SPIN, BMBF-FKZ: 01UG0801A Deliverable: R2.2: The German Language Resource Landscape and a Portal Responsible: Peter Wittenburg

More information

1. General requirements

1. General requirements Title CLARIN B Centre Checklist Version 6 Author(s) Peter Wittenburg, Dieter Van Uytvanck, Thomas Zastrow, Pavel Straňák, Daan Broeder, Florian Schiel, Volker Boehlke, Uwe Reichel, Lene Offersgaard Date

More information

BEAAquaLogic. Service Bus. JPD Transport User Guide

BEAAquaLogic. Service Bus. JPD Transport User Guide BEAAquaLogic Service Bus JPD Transport User Guide Version: 3.0 Revised: March 2008 Contents Using the JPD Transport WLI Business Process......................................................2 Key Features.............................................................2

More information

Information Quality & Service Oriented Architecture

Information Quality & Service Oriented Architecture Information Quality & Oriented Architecture Presentation for the MIT IQ Industry Symposium July 17, 2007 Dave Becker The MITRE Corporation Approved for Public Release; Distribution Unlimited. (070837)

More information

RESTful Web service composition with BPEL for REST

RESTful Web service composition with BPEL for REST RESTful Web service composition with BPEL for REST Cesare Pautasso Data & Knowledge Engineering (2009) 2010-05-04 Seul-Ki Lee Contents Introduction Background Design principles of RESTful Web service BPEL

More information

Towards a Telecommunication Service Oriented Architecture

Towards a Telecommunication Service Oriented Architecture Towards a Telecommunication Service Oriented Architecture Paolo Falcarin Jian Yu Politecnico di Torino, Italy paolo.falcarin@polito.it, jian.yu@polito.it Abstract Web Services are often used for providing

More information

SDMX self-learning package No. 3 Student book. SDMX-ML Messages

SDMX self-learning package No. 3 Student book. SDMX-ML Messages No. 3 Student book SDMX-ML Messages Produced by Eurostat, Directorate B: Statistical Methodologies and Tools Unit B-5: Statistical Information Technologies Last update of content February 2010 Version

More information

Introduction to XML. Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University

Introduction to XML. Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University Introduction to XML Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University http://gear.kku.ac.th/~krunapon/xmlws 1 Topics p What is XML? p Why XML? p Where does XML

More information

WSIA and WSRP are new Web

WSIA and WSRP are new Web Written by Eilon Reshef WSIA and WSRP are new Web services standards that enable businesses to create user-facing, visual, and interactive Web services that organizations can easily plug-and-play into

More information

Centres Network Formation

Centres Network Formation Centres Network Formation 2009-02-04 - Version: 9 Editors: Dirk Roorda, Dieter van Uytvanck Peter Wittenburg, Martin Wynne The ultimate objective of CLARIN is to create a European federation of existing

More information

Persistent Identifiers for Audiovisual Archives and Cultural Heritage

Persistent Identifiers for Audiovisual Archives and Cultural Heritage Persistent Identifiers for Audiovisual Archives and Cultural Heritage Hennie Brugman Technical coordinator CATCHPlus Max-Planck-Institute for Psycholinguistics Netherlands Institute for Sound and Vision

More information

Identity Provider for SAP Single Sign-On and SAP Identity Management

Identity Provider for SAP Single Sign-On and SAP Identity Management Implementation Guide Document Version: 1.0 2017-05-15 PUBLIC Identity Provider for SAP Single Sign-On and SAP Identity Management Content 1....4 1.1 What is SAML 2.0.... 5 SSO with SAML 2.0.... 6 SLO with

More information

Quality - The Key to Successful SOA. Charitha Kankanamge WSO2 February 2011

Quality - The Key to Successful SOA. Charitha Kankanamge WSO2 February 2011 Quality - The Key to Successful SOA Charitha Kankanamge WSO2 February 2011 WSO2 Founded in 2005 by acknowledged leaders in XML, Web Services Technologies & Standards and Open Source Producing entire middleware

More information

Introduction to XML 3/14/12. Introduction to XML

Introduction to XML 3/14/12. Introduction to XML Introduction to XML Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University http://gear.kku.ac.th/~krunapon/xmlws 1 Topics p What is XML? p Why XML? p Where does XML

More information

Naming & Design Requirements (NDR)

Naming & Design Requirements (NDR) The Standards Based Integration Company Systems Integration Specialists Company, Inc. Naming & Design Requirements (NDR) CIM University San Francisco October 11, 2010 Margaret Goodrich, Manager, Systems

More information

Oracle Developer Day

Oracle Developer Day Oracle Developer Day Sponsored by: Track # 1: Session #2 Web Services Speaker 1 Agenda Developing Web services Architecture, development and interoperability Quality of service Security, reliability, management

More information

Service Interface Design RSVZ / INASTI 12 July 2006

Service Interface Design RSVZ / INASTI 12 July 2006 Architectural Guidelines Service Interface Design RSVZ / INASTI 12 July 2006 Agenda > Mandatory standards > Web Service Styles and Usages > Service interface design > Service versioning > Securing Web

More information

Implementing a Variety of Linguistic Annotations

Implementing a Variety of Linguistic Annotations Implementing a Variety of Linguistic Annotations through a Common Web-Service Interface Adam Funk, Ian Roberts, Wim Peters University of Sheffield 18 May 2010 Adam Funk, Ian Roberts, Wim Peters Implementing

More information

IT6801-SERVICE ORIENTED ARCHITECTURE

IT6801-SERVICE ORIENTED ARCHITECTURE ST.JOSEPH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING IT 6801-SERVICE ORIENTED ARCHITECTURE UNIT I 2 MARKS 1. Define XML. Extensible Markup Language(XML) is a markup language

More information

J2EE APIs and Emerging Web Services Standards

J2EE APIs and Emerging Web Services Standards J2EE APIs and Emerging Web Services Standards Session #4 Speaker Title Corporation 1 Agenda J2EE APIs for Web Services J2EE JAX-RPC APIs for Web Services JAX-RPC Emerging Web Services Standards Introduction

More information

Semantic-Based Web Mining Under the Framework of Agent

Semantic-Based Web Mining Under the Framework of Agent Semantic-Based Web Mining Under the Framework of Agent Usha Venna K Syama Sundara Rao Abstract To make automatic service discovery possible, we need to add semantics to the Web service. A semantic-based

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 6, Nov-Dec 2015

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 6, Nov-Dec 2015 RESEARCH ARTICLE OPEN ACCESS Middleware Interoperability using SOA for Enterprise Business Application T Sathis Kumar Assistant Professor Department of Computer Science and Engineering Saranathan College

More information

BPEL Research. Tuomas Piispanen Comarch

BPEL Research. Tuomas Piispanen Comarch BPEL Research Tuomas Piispanen 8.8.2006 Comarch Presentation Outline SOA and Web Services Web Services Composition BPEL as WS Composition Language Best BPEL products and demo What is a service? A unit

More information

Lesson 14 SOA with REST (Part I)

Lesson 14 SOA with REST (Part I) Lesson 14 SOA with REST (Part I) Service Oriented Architectures Security Module 3 - Resource-oriented services Unit 1 REST Ernesto Damiani Università di Milano Web Sites (1992) WS-* Web Services (2000)

More information

2.2 What are Web Services?

2.2 What are Web Services? Chapter 1 [Author s Note: This article is an excerpt from our upcoming book Web Services: A Technical Introduction in the Deitel Developer Series. This is pre-publication information and contents may change

More information

Integrating Legacy Assets Using J2EE Web Services

Integrating Legacy Assets Using J2EE Web Services Integrating Legacy Assets Using J2EE Web Services Jonathan Maron Oracle Corporation Page Agenda SOA-based Enterprise Integration J2EE Integration Scenarios J2CA and Web Services Service Enabling Legacy

More information

1Z

1Z 1Z0-451 Passing Score: 800 Time Limit: 4 min Exam A QUESTION 1 What is true when implementing human reactions that are part of composite applications using the human task component in SOA 11g? A. The human

More information

Semantic Web. Semantic Web Services. Morteza Amini. Sharif University of Technology Fall 94-95

Semantic Web. Semantic Web Services. Morteza Amini. Sharif University of Technology Fall 94-95 ه عا ی Semantic Web Semantic Web Services Morteza Amini Sharif University of Technology Fall 94-95 Outline Semantic Web Services Basics Challenges in Web Services Semantics in Web Services Web Service

More information

Executing Evaluations over Semantic Technologies using the SEALS Platform

Executing Evaluations over Semantic Technologies using the SEALS Platform Executing Evaluations over Semantic Technologies using the SEALS Platform Miguel Esteban-Gutiérrez, Raúl García-Castro, Asunción Gómez-Pérez Ontology Engineering Group, Departamento de Inteligencia Artificial.

More information

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector Relais Culture Europe mfoulonneau@relais-culture-europe.org Community report A community report on

More information

Dynamic Service Discovery

Dynamic Service Discovery Dynamic Service Discovery A position paper for the W3C Workshop on Web Services for Enterprise Computing, by Kinga Dziembowski of Gestalt-LLC. My position Service Discovery in the dynamic and transient

More information

Sentinet for BizTalk Server SENTINET

Sentinet for BizTalk Server SENTINET Sentinet for BizTalk Server SENTINET Sentinet for BizTalk Server 1 Contents Introduction... 2 Sentinet Benefits... 3 SOA and API Repository... 4 Security... 4 Mediation and Virtualization... 5 Authentication

More information

Interoperability for Digital Libraries

Interoperability for Digital Libraries DRTC Workshop on Semantic Web 8 th 10 th December, 2003 DRTC, Bangalore Paper: C Interoperability for Digital Libraries Michael Shepherd Faculty of Computer Science Dalhousie University Halifax, NS, Canada

More information

Enterprise Architecture Deployment Options. Mark Causley Sandy Milliken Sue Martin

Enterprise Architecture Deployment Options. Mark Causley Sandy Milliken Sue Martin Enterprise Architecture Deployment Options Mark Causley Sandy Milliken Sue Martin GIS is Being Implemented in Many Settings Organization Business to Business Department Workgroup GIS is Moving to the Enterprise

More information

Annotation Science From Theory to Practice and Use Introduction A bit of history

Annotation Science From Theory to Practice and Use Introduction A bit of history Annotation Science From Theory to Practice and Use Nancy Ide Department of Computer Science Vassar College Poughkeepsie, New York 12604 USA ide@cs.vassar.edu Introduction Linguistically-annotated corpora

More information

Content Management for the Defense Intelligence Enterprise

Content Management for the Defense Intelligence Enterprise Gilbane Beacon Guidance on Content Strategies, Practices and Technologies Content Management for the Defense Intelligence Enterprise How XML and the Digital Production Process Transform Information Sharing

More information

JAVA COURSES. Empowering Innovation. DN InfoTech Pvt. Ltd. H-151, Sector 63, Noida, UP

JAVA COURSES. Empowering Innovation. DN InfoTech Pvt. Ltd. H-151, Sector 63, Noida, UP 2013 Empowering Innovation DN InfoTech Pvt. Ltd. H-151, Sector 63, Noida, UP contact@dninfotech.com www.dninfotech.com 1 JAVA 500: Core JAVA Java Programming Overview Applications Compiler Class Libraries

More information

METEOR-S Process Design and Development Tool (PDDT)

METEOR-S Process Design and Development Tool (PDDT) METEOR-S Process Design and Development Tool (PDDT) Ranjit Mulye LSDIS Lab, University of Georgia (Under the Direction of Dr. John A. Miller) Acknowledgements Advisory Committee Dr. John A. Miller (Major

More information

Overview SENTINET 3.1

Overview SENTINET 3.1 Overview SENTINET 3.1 Overview 1 Contents Introduction... 2 Customer Benefits... 3 Development and Test... 3 Production and Operations... 4 Architecture... 5 Technology Stack... 7 Features Summary... 7

More information

Semantic Web. Semantic Web Services. Morteza Amini. Sharif University of Technology Spring 90-91

Semantic Web. Semantic Web Services. Morteza Amini. Sharif University of Technology Spring 90-91 بسمه تعالی Semantic Web Semantic Web Services Morteza Amini Sharif University of Technology Spring 90-91 Outline Semantic Web Services Basics Challenges in Web Services Semantics in Web Services Web Service

More information

Lesson 5 Web Service Interface Definition (Part II)

Lesson 5 Web Service Interface Definition (Part II) Lesson 5 Web Service Interface Definition (Part II) Service Oriented Architectures Security Module 1 - Basic technologies Unit 3 WSDL Ernesto Damiani Università di Milano Controlling the style (1) The

More information

NEXOF-RA NESSI Open Framework Reference Architecture IST- FP

NEXOF-RA NESSI Open Framework Reference Architecture IST- FP NEXOF-RA NESSI Open Framework Reference Architecture IST- FP7-216446 Deliverable D7.4 RA Specification Sample Siemens AG HP Engineering Thales Due date of deliverable: 01/03/2009 Actual submission date:

More information

describe the functions of Windows Communication Foundation describe the features of the Windows Workflow Foundation solution

describe the functions of Windows Communication Foundation describe the features of the Windows Workflow Foundation solution 1 of 9 10/9/2013 1:38 AM WCF and WF Learning Objectives After completing this topic, you should be able to describe the functions of Windows Communication Foundation describe the features of the Windows

More information

This document is a preview generated by EVS

This document is a preview generated by EVS INTERNATIONAL STANDARD ISO 25577 Second edition 2013-12-15 Information and documentation MarcXchange Information et documentation MarcXchange Reference number ISO 25577:2013(E) ISO 2013 ISO 25577:2013(E)

More information

Getting Started with Web Services

Getting Started with Web Services Getting Started with Web Services Getting Started with Web Services A web service is a set of functions packaged into a single entity that is available to other systems on a network. The network can be

More information

Metadata and DCR. <CMD_Component /> Dieter Van Uytvanck. Max Planck Institute for Psycholinguistics

Metadata and DCR. <CMD_Component /> Dieter Van Uytvanck. Max Planck Institute for Psycholinguistics Metadata and DCR Dieter Van Uytvanck Max Planck Institute for Psycholinguistics Dieter.VanUytvanck@mpi.nl Overview Traditional metadata Component metadata Data categories The big picture

More information

Best practices in the design, creation and dissemination of speech corpora at The Language Archive

Best practices in the design, creation and dissemination of speech corpora at The Language Archive LREC Workshop 18 2012-05-21 Istanbul Best practices in the design, creation and dissemination of speech corpora at The Language Archive Sebastian Drude, Daan Broeder, Peter Wittenburg, Han Sloetjes The

More information

The Design of a DLS for the Management of Very Large Collections of Archival Objects

The Design of a DLS for the Management of Very Large Collections of Archival Objects Session: VLDL Architectures The Design of a DLS for the Management of Very Large Collections of Archival Objects Maristella Agosti, Nicola Ferro and Gianmaria Silvello Information Management Research Group

More information

A prototype infrastructure for DSpin

A prototype infrastructure for DSpin A prototype infrastructure for DSpin - DSpin/Clarin, LLS, prototype, toolwrappers, roadmap - Volker Boehlke University of Leipzig Institut für Informatik Some facts about me - iba Consulting Gesellschaft

More information

Identität und Autorisierung als Grundlage für sichere Web-Services. Dr. Hannes P. Lubich IT Security Strategist

Identität und Autorisierung als Grundlage für sichere Web-Services. Dr. Hannes P. Lubich IT Security Strategist Identität und Autorisierung als Grundlage für sichere Web-Services Dr. Hannes P. Lubich IT Security Strategist The Web Services Temptation For every $1 spent on software $3 to $5 is spent on integration

More information

Extending SOA Infrastructure for Semantic Interoperability

Extending SOA Infrastructure for Semantic Interoperability Extending SOA Infrastructure for Semantic Interoperability Wen Zhu wzhu@alionscience.com ITEA System of Systems Conference 26 Jan 2006 www.alionscience.com/semantic Agenda Background Semantic Mediation

More information

Web Services in Cincom VisualWorks. WHITE PAPER Cincom In-depth Analysis and Review

Web Services in Cincom VisualWorks. WHITE PAPER Cincom In-depth Analysis and Review Web Services in Cincom VisualWorks WHITE PAPER Cincom In-depth Analysis and Review Web Services in Cincom VisualWorks Table of Contents Web Services in VisualWorks....................... 1 Web Services

More information

Implementing the Army Net Centric Data Strategy in a Service Oriented Environment

Implementing the Army Net Centric Data Strategy in a Service Oriented Environment Implementing the Army Net Centric Strategy in a Service Oriented Environment Michelle Dirner Army Net Centric Strategy (ANCDS) Center of Excellence (CoE) Service Team Lead RDECOM CERDEC SED in support

More information

Chapter 17 Web Services Additional Topics

Chapter 17 Web Services Additional Topics Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 17 Web Services Additional Topics Prof. Dr.-Ing. Stefan Deßloch

More information

<Insert Picture Here> Click to edit Master title style

<Insert Picture Here> Click to edit Master title style Click to edit Master title style Introducing the Oracle Service What Is Oracle Service? Provides visibility into services, service providers and related resources across the enterprise

More information

Working towards a Metadata Federation of CLARIN and DARIAH-DE

Working towards a Metadata Federation of CLARIN and DARIAH-DE Working towards a Metadata Federation of CLARIN and DARIAH-DE Thomas Eckart Natural Language Processing Group University of Leipzig, Germany teckart@informatik.uni-leipzig.de Tobias Gradl Media Informatics

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure EUDAT Towards a pan-european Collaborative Data Infrastructure Damien Lecarpentier CSC-IT Center for Science, Finland CESSDA workshop Tampere, 5 October 2012 EUDAT Towards a pan-european Collaborative

More information

Technical Overview. Version March 2018 Author: Vittorio Bertola

Technical Overview. Version March 2018 Author: Vittorio Bertola Technical Overview Version 1.2.3 26 March 2018 Author: Vittorio Bertola vittorio.bertola@open-xchange.com This document is copyrighted by its authors and is released under a CC-BY-ND-3.0 license, which

More information

Metadata Catalogue Issues. Daan Broeder Max-Planck Institute for Psycholinguistics

Metadata Catalogue Issues. Daan Broeder Max-Planck Institute for Psycholinguistics Metadata Catalogue Issues Daan Broeder Max-Planck Institute for Psycholinguistics Introduction Methods of registering resources Metadata Making metadata interoperable Exposing metadata Facilitating resource

More information

AIM Enterprise Platform Software IBM z/transaction Processing Facility Enterprise Edition 1.1.0

AIM Enterprise Platform Software IBM z/transaction Processing Facility Enterprise Edition 1.1.0 z/tpf EE V1.1 z/tpfdf V1.1 TPF Toolkit for WebSphere Studio V3 TPF Operations Server V1.2 IBM Software Group TPF Users Group Spring 2007 TPF Users Group Spring 2007 z/tpf Web Services Update Name: Barry

More information

Position Paper on the Definition of SOA-RM

Position Paper on the Definition of SOA-RM 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 Position Paper on the Definition of SOA-RM Authors: C. Matthew MacKenzie (mattm@adobe.com), Duane A.

More information

Distribution and web services

Distribution and web services Chair of Software Engineering Carlo A. Furia, Bertrand Meyer Distribution and web services From concurrent to distributed systems Node configuration Multiprocessor Multicomputer Distributed system CPU

More information

Cisco Unified Presence 8.0

Cisco Unified Presence 8.0 Cisco Unified Presence 8.0 Cisco Unified Communications Solutions unify voice, video, data, and mobile applications on fixed and mobile networks, enabling easy collaboration every time from any workspace.

More information

Service Oriented Architectures Visions Concepts Reality

Service Oriented Architectures Visions Concepts Reality Service Oriented Architectures Visions Concepts Reality CSC March 2006 Alexander Schatten Vienna University of Technology Vervest und Heck, 2005 A Service Oriented Architecture enhanced by semantics, would

More information