Analysis Exchange Framework Terms of Reference December 2016

Similar documents
Cybersecurity-Related Information Sharing Guidelines Draft Document Request For Comment

Metadata Framework for Resource Discovery

Solving the Enterprise Data Dilemma

OG0-091 Q&As TOGAF 9 Part 1

Microsoft SharePoint Server 2013 Plan, Configure & Manage

DRS Policy Guide. Management of DRS operations is the responsibility of staff in Library Technology Services (LTS).

Strategy & Planning: Data Governance & Data Quality

BPS Suite and the OCEG Capability Model. Mapping the OCEG Capability Model to the BPS Suite s product capability.

Vendor: The Open Group. Exam Code: OG Exam Name: TOGAF 9 Part 1. Version: Demo

Unified Governance for Amazon S3 Data Lakes

Information Quality & Service Oriented Architecture

Accelerate Your Enterprise Private Cloud Initiative

LESSONS LEARNED FROM THE INDIANA UNIVERSITY ELECTRONIC RECORDS PROJECT. How to Implement an Electronic Records Strategy

DATA SHEET RSA NETWITNESS PLATFORM PROFESSIONAL SERVICES ACCELERATE TIME-TO-VALUE & MAXIMIZE ROI

An introduction to Headless Content Management Systems

IBM Resilient Incident Response Platform On Cloud

IBM Resilient Incident Response Platform On Cloud

ITC Vision for Campus IT: February 28, 2018

Cyber Defense Maturity Scorecard DEFINING CYBERSECURITY MATURITY ACROSS KEY DOMAINS

21ST century enterprise. HCL Technologies Presents. Roadmap for Data Center Transformation

IDC MarketScape: Worldwide Datacenter Transformation Consulting and Implementation Services 2016 Vendor Assessment

Commonwealth of Pennsylvania - Justice Network

IBM Resilient Incident Response Platform On Cloud

Reviewed by ADM(RS) in accordance with the Access to Information Act. Information UNCLASSIFIED.

Stewarding NOAA s Data: How NCEI Allocates Stewardship Resources

The Open Group SOA Ontology Technical Standard. Clive Hatton

Advanced Solutions of Microsoft SharePoint Server 2013

Cyber Partnership Blueprint: An Outline

Hybrid WAN Operations: Extend Network Monitoring Across SD-WAN and Legacy WAN Infrastructure

Agenda. Bibliography

Vision for PACE Data Environment, PACE Quantum and the Common Data Set. October 21, 2015

Do Not Track Compromise Proposal

OpenChain Specification Version 1.3 (DRAFT)

IBM Spectrum Protect Plus

GEOSS Data Management Principles: Importance and Implementation

Oracle Application Server 10g Integration Interconnect. An Oracle Technical White Paper January 2005

VIDEO PROCEDURE. The Vice-President, Advancement is responsible for significant decisions relating to UNSW s external-facing video presence.

PREPARE FOR TAKE OFF. Accelerate your organisation s journey to the Cloud.

Assessment of the progress made in the implementation of and follow-up to the outcomes of the World Summit on the Information Society

Promoting accountability and transparency of multistakeholder partnerships for the implementation of the 2030 Agenda

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours

The Proposed Road Centerline Standard for Minnesota Overview and Frequently Asked Questions

REQUEST FOR PROPOSALS: ARTIST TRUST WEBSITE REDESIGN

Advanced Solutions of Microsoft SharePoint 2013

RFP/RFI Questions for Managed Security Services. Sample MSSP RFP Template

STEP Data Governance: At a Glance

Europeana Core Service Platform

2009/7 Assessment of the progress made in the implementation of and follow-up to the outcomes of the World Summit on the Information Society

Verto What s New? Training User Guide

Response to Industry Canada Consultation Developing a Digital Research Infrastructure Strategy

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

Module 3. Overview of TOGAF 9.1 Architecture Development Method (ADM)

Information Security and Service Management. Security and Risk Management ISSM and ITIL/ITSM Interrelationship

strategy IT Str a 2020 tegy

Calne Without Parish Council. IT Strategy

EXIN BCS SIAM Foundation. Sample Exam. Edition

National Antimicrobial Resistance Surveillance

Proposed Capability-Based Reference Architecture for Real-Time Network Defense

DoD Software Assurance (SwA) Update

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment

Update on the Government of Canada s Information Technology Transformation Plan

Enterprise Architecture Method

Integrating your CX, ERP and HCM Clouds with your On-premises Applications CON7012

Information empowerment for your evolving data ecosystem

WHO Secretariat Dr Oleg Chestnov Assistant Director-General Noncommunicable Diseases and Mental Health

What s New in Ideagen Pentana 4.2

How a Metadata Repository enables dynamism and automation in SDTM-like dataset generation

Academic Editor Tutorial

Six Weeks to Security Operations The AMP Story. Mike Byrne Cyber Security AMP

ISAO SO Product Outline

Arkansas MAV Conservation Delivery Network

How to Evaluate a Next Generation Mobile Platform

I D C T E C H N O L O G Y S P O T L I G H T

April 17, Ronald Layne Manager, Data Quality and Data Governance

Understanding the workplace of the future. Artificial Intelligence series

Time2Track User Manual. SJP Interns Site Supervisors

Data Protection for Virtualized Environments

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

Providing a Rapid Response to Meltdown and Spectre for Hybrid IT. Industry: Computer Security and Operations Date: February 2018

How to choose the right Data Governance resources. by First San Francisco Partners

ForeScout ControlFabric TM Architecture

OpenChain Conformance 2016-H1 Specification

The University of Queensland

Agency Guide for FedRAMP Authorizations

OPTIMIZATION MAXIMIZING TELECOM AND NETWORK. The current state of enterprise optimization, best practices and considerations for improvement

PERFORM FOR HPE CONTENT MANAGER

Subject indexing in SBN. Maria Lucia Di Geso ICCU

The Open Group Professional Certification Program. Accreditation Requirements

A VO-friendly, Community-based Authorization Framework

Business Analysis in Practice

THE JOURNEY OVERVIEW THREE PHASES TO A SUCCESSFUL MIGRATION ADOPTION ACCENTURE IS 80% IN THE CLOUD

Cisco CloudCenter Solution Use Case: Application Migration and Management

The Value of Migrating from Cisco Tidal Horizon to Cisco Process Orchestrator

Faculty Release Notes. Release 8 January 2008

Datameer Big Data Governance. Bringing open-architected and forward-compatible governance controls to Hadoop analytics

NAI Mobile Application Code

Meeting PCI DSS 3.2 Compliance with RiskSense Solutions

PCORI Online: Awardee User Guide Research Awards

PODS Lite. Executive Summary

Well Lifecycle: Workflow Automation

Transcription:

Analysis Exchange Framework Terms of Reference December 2016 Approved for Public Release; Distribution Unlimited. Case Number 16-4653 The views, opinions and/or findings contained in this report are those of The MITRE Corporation and should not be construed as an official government position, policy, or decision, unless designated by other documentation. 2016 The MITRE Corporation. All rights reserved.

This page intentionally left blank. 1

Background The Analytic Technology Industry Roundtable has produced several studies focusing on analytic architectures, in order to gain a better understanding and ground truth for collaboration opportunity. The Roundtable produced studies on industry analytic capabilities, and use cases in 2016 [1, 2, 3, 4]. These studies can benefit the government while encouraging collaboration across industry partners. In this paper, we articulate the core architectural components of an Analysis Exchange Service 1 intended to support the desired collaboration. The exchange design is based upon several months of engagement, study, and work with the Industry Roundtable Working Group. This paper describes the common exchange that the Roundtable will build. The paper describes how MITRE and industry will engage to realize this model. In order to achieve a successful prototype for the Analysis Exchange Service and a supporting Analysis Exchange Model, industry and MITRE will be working together in partnership with government input. Analysis Exchange Service a network-enabled repository for results from external analytic services, fostering industry and government collaboration by facilitating knowledge fusion and downstream analysis. Analysis Exchange Model the ontology capable of representing adapted analytic results from various sources and intended to support the use case(s) generated for the prototype Analysis Exchange Service. This document expands on these components and describes the path toward implementing them. This will be a key Roundtable effort into FY17. This document also articulates the details about what Roundtable s tasks will be (Figure 1) and why. Government also will have a role in this effort, specifically in providing strategic input and long-term requirements for how it can evolve to solve their hard problems. Tasks Use case maturation Analytic tool selection Data selection Exchange service APIs Publish-subscribe service Data model / ontology Semi-persistent result store Adapters Analytic output guidance Testing the workflow Figure 1 1 In our original material, this was referred to as the Common Exchange Service. 2

Charter This study will: Provide an architecture to support collaboration, whether through a publish-subscribe service or a more involved orchestration of externally based analytics. Provide a data model with the supporting ontology that can be publicly released and serve as a lingua franca for analytic results (this data model, the Analysis Exchange Model, is described in more detail in the next section). Provide the templates, or at least the pattern of activity, to allow for the adaptation of different industry analytic results into a common data model. Study Products Each of the goals specified above introduces different products that require development, either from scratch or by applying and extending existing capabilities. These include the following: The application programming interfaces (APIs) to allow for interaction with the Analysis Exchange Service. A publish-subscribe service that acts as a basis for the exchange and can be the foundation for orchestration. An ontology serving as the underlying data model and rules for reasoning for the Analysis Exchange Model. A semi-persistent store for the analytic results mapped into the Analysis Exchange Model. Template adapters for migrating analytic results into the Analysis Exchange Model. There are also several dependencies of note, including: Maturing and finalizing the use case. Selecting analytic tools, systems, and capabilities for the use case, which represents the industry collaboration at the tool level. Selecting and collecting the input data sets for the use case (for example, social media data, images, or plaintext). Assisting and guiding the creation of the adapters between analytic results and the Analysis Exchange Model. Testing, evaluating, and studying the analytic workflow in how well it fulfils the use case. The remainder of this document explores the charter goals and planned study products in greater depth. 3

Analysis Exchange Service The Analysis Exchange Service (as depicted in Figure 2) is a network-enabled repository and acts as a hub between the automated analytic stacks from different data and analytic providers and the analyst customers who make use of the artifacts, knowledge, and results of the analytic processes. The goal of the Analysis Exchange Service is to achieve fusion of the artifacts and knowledge it holds so that different industry automated analytics can contribute to a final product without compromising their internal analytic technology. Further analysis of what the Analysis Exchange Service holds is possible downstream of this service. In all, the Analysis Exchange Service will encourage new collaborative capabilities between industry partners in the service of their government customers. Figure 2: Exchange for automated analytic collaboration across industry and government. 4

As specified above, there are necessary components to develop as part of the Analysis Exchange Service. The application programming interfaces, which correspond to the transfer service depicted in Figure 2, are the entry point into the architecture that can be controlled following an ad hoc publish-subscribe pattern; these can also be extended to allow workflow orchestration to call on external analytics and potentially internal analytics eventually. The internal store retains artifacts for as long as the use case requires. Further, the results in this store are defined by a consistent data model that conforms to an ontology built for the supported use case, which seeks to mitigate ambiguities between different representations that the external analytics might exhibit. Adapters are required to make the format and possible semantic conversion into this model, which ensures that downstream analysts receive a consistent picture of the results regardless of the source. Activities that are supported by the publish-subscribe model include 1) pipelines of analytic activity that execute when informed through the subscription that the results they expect are available in the repository, 2) aggregation from multiple sources into a final product, and 3) alteration, or updates, of results previously submitted. These activities align to the data flows depicted in Figure 3, as first expressed in the MITRE BlueRidge project. Later sections will expand on how these pieces fit together and how each will be implemented over the course of FY17. Analysis Exchange Model The Analysis Exchange Model is the ontology that defines the space of analytic results which are adapted to conform to it. This aligns to elements that recur throughout enterprises where analytic architectures are being explored, namely the need for a common representation and consistent definition of knowledge objects that make up the results. While the Analysis Exchange Service can store results in their raw output form, an Analysis Exchange Model and an evolving library of adaptation software that keeps pace with changing native models means useful interactions can spin up more quickly. Additionally, it does not assume the burden of adaptation necessarily has to fall to one partner or the other. Adaptation software can be provided by companies should they wish to retain a close hold on proprietary processes and models or they can be engineered by outside parties if the issue is one of aligning a published standard with the Analysis Exchange 5

Figure 3: Flows of data (or results) as originally presented in MITRE BlueRidge work Model. In this case, we intend to design the Analysis Exchange Model and representative adapters to establish the template for future adapter development. While it would be ideal to use a single comprehensive ontology that could draw on and reason over results across varied domains for any conceivable use case, to our knowledge no such ontology exists. Rather than making the attempt to design such an ontology or import an existing ontology to serve in this capacity, it is preferable to begin with a narrower use case and develop (or import) a data model and ontology that fits the parameters of the kind of results that are likely 6

to be available from the different analytics and desirable for the use case. Depending on the scope, existing ontologies with broad coverage might serve as an upper ontology, but we plan to create a tailored, more domain-specific ontology. Components APIs for the Analysis Exchange Service The principal goal of the Analysis Exchange Service is to bridge capabilities between different industry partners in service of government customers, in particular analysts. The application programming interfaces for the Analysis Exchange Service are the principal ways industry partners and government customers will be able to access the Analysis Exchange Service, covering activities such as uploading results, collecting results to ingest in downstream automated analytics, and retrieving the final results that represent the outcome of a complete activity, whether as an instance of a repeatable workflow or a single ad hoc experiment bridging multiple analytic capabilities. It is part of the tasking to create all the necessary APIs to perform the actions described above. These APIs can be the foundation of providing a publish-subscribe service or more robust centralized orchestration that calls on external analytics to execute. These APIs will be written in Java, but they can be extended to other languages based on a languageindependent RESTful implementation. Eventually, there can also be a capability in the APIs to allow for partners and customers to execute internal analytic processes in the Analysis Exchange Service, but the initial vision expects the analytics to be those provided by industry partners and to remain external. Publish-Subscribe Service and Orchestration Following a publish-subscribe pattern for communication is a straightforward method that imposes a minimal burden. This model expects that when analytic artifacts are made available, the Analysis Exchange Service will publish that these artifacts. Therefore, subscribers awaiting certain expected analytic artifacts will be notified when the artifacts that interest them, either for a single experiment or part of a regular workflow, are available. 7

It is part of the tasking to implement this publish-subscribe service. While a publish-subscribe pattern typically assumes an underlying structure of clients and a central server, we can use a framework 2 to change this structure into one that is peer-to-peer. This allows it to act as a messaging service between the Analysis Exchange Service and the external systems when updates occur. We expect that partners will subscribe to the artifacts published by the Analysis Exchange Service as well as provide the artifacts that the Analysis Exchange Service makes available. Who acts in what capacity should be agreed upon as part of defining an initial use case. Also part of this task is developing a listener that ensures events are noticed, where events are any kind of relevant notification about a change in the system or data or any requests that are part of a workflow. Eventually, this publish-subscribe service can also provide the basis for more complex internal orchestration, where a centralized control calls on both external and internal analytics to execute on different stages of artifacts to produce the final results. The initial vision intends to keep this simple, but with the potential to be extended to greater responsibilities held by the Analysis Exchange Service, perhaps using an open source scientific workflow application as the basis for the orchestration. The listener we described in the publish-subscribe pattern could also be used in this more central orchestration. Ontology and Data Model An underlying need in a successful Analysis Exchange Service is achieving a seamless integration across varied analytic results in support of a use case. These results are often in semantic models or can be made to conform to them. We envision applying an ontology (namely a set of defined classes, relationships, and rules) that describe the elements relevant to the use case, which allows for a consistent treatment of the knowledge and the capacity for reasoning across the rules that describe how different statements of knowledge affect one another. Developing this ontology, or the Analysis Exchange Model as described above, is one of the main tasks of Roundtable effort. This ontology will be integrated with the store that will be populated with instances of knowledge by the adapters. 2 For instance, Apache Kafka 8

The store and adapters are both described below. The principal dependency here is a clear use case. While we can begin design and make changes to fit an evolving use case, having a direction of the space the resulting knowledge occupies is important to know what to include in the ontology s classes, relationships, and rules and what to exclude as irrelevant. Store While analytic artifacts published by the Analysis Exchange Service conform to the Analysis Exchange Model, they also need a place to be stored internally. This store can be permanent, volatile, or somewhere in between depending on the use case. In some cases, all that is of interest is timely data whereas in others, aggregation over successive uploads provides the needed picture to downstream analytics and analysts. There can also be degrees of volatility, where all knowledge is at least retained temporarily and some is always retained long term. Regardless, this suggests a knowledge base that can potentially scale if considerable artifacts are produced. Implementing this knowledge store is a task to be performed in concert with the ontology development. We expect this requires both a reasoning engine and persistent store. The length of retention will be determined by what fits the use case. Template Adapters One of the more serious challenges to inter-industry and government collaboration is the disconnect in formats and models of analytic results. In order for analytics to operate in concert, whether they ingest one another s results as input or synthesize a final product collectively, they must have a common language. This is what the Analysis Exchange Model does, but results must be mapped into (and in some cases out of) the Analysis Exchange Model, as we know that we cannot mandate that every analytic will produce (or consume) our model. This is where adapters become essential. Adapters convert the results from their raw analytic results into a format conforming to the Analysis Exchange Model and vice versa if the use case dictates. This also means that there must be adapters for each analytic involved, or at least each analytic result type involved (formats can potentially span different analytics). Developing and implementing template adapters is another task. While adapters can be bi-directional (going from raw analytic results to a format conforming to the Analysis Exchange Model and from the Analysis Exchange Model to a format ingestible by external analytics), we intend to only start with adapters that map from analytic results into the Analysis 9

Exchange Model, unless the use case requires otherwise. The intent is to make the adapters necessary to execute and demonstrate the use case; these adapters can eventually become the basis for the continual development of more adapters for other analytic results. This requires a use case and ongoing collaboration to ensure we understand the analytic output and are preserving the semantics in the adaptation. Conclusion Achieving both the Analysis Exchange Service and the Analysis Exchange Model requires simultaneous development and implementation by the Roundtable of several different elements, including the APIs, the publish-subscribe service, the data model and its underlying ontology, a semi-persistent store for results, and adapters. Each of these ensures that the Analysis Exchange Service can be a central hub for collaboration, but it also levies certain dependencies that have been discussed throughout the proposal. These dependencies include maturing and finalizing the use case, selecting analytic tools, selecting the input data sets relevant to the use case, testing the workflow to ensure it fulfils the use case, and ongoing guidance of the development of the Analysis Exchange Service, particularly the adapters which must properly preserve the content of the analytic results. Between the tasks and the dependencies, the goal of achieving an analytic architecture fostering collaboration can be achieved. This can also be expanded and extended in the future by following the process when applying it to new use cases or further enriching the control of the Analysis Exchange Service by allowing it to handle orchestration centrally. 10

References [1] A. Etches, C. Brown and J. Stultz, "Analytics and Use Cases," 8 November 2016. [Online]. Available: http://www2.mitre.org/public/analytictechnology/pdfs/analytics_and_use_cases_study_ibm_sas_11_25_16.pdf. [2] B. Choung and M. Chandler, "Review and Study on Industry Collaboration," 8 November 2016. [Online]. Available: http://www2.mitre.org/public/analytictechnology/pdfs/industry_architecture_survey_and_review_study_palantir_11_25_16.pdf. [3] R. Winder, J. Jubinski and A. O'Hanlon, "Analytic Architecture Survey and Review," 8 November 2016. [Online]. Available: http://www2.mitre.org/public/analytic-technology/pdfs/architecture-survey- Review.pdf. [4] W. Niehaus, B. Adams, S. Panzer and L. Hanson, "Alignment of Capabilities," 8 November 2016. [Online]. Available: http://www2.mitre.org/public/analytic-technology/papers.html. 11