Dependable Computing Clouds for Cyber-Physical Systems

Similar documents
Adaptive System Infrastructure for Ultra-Large. Large-Scale Systems. SMART Conference, Thursday, March 6 th, 2008

Meeting the Challenges of Ultra-Large

Vortex Whitepaper. Intelligent Data Sharing for the Business-Critical Internet of Things. Version 1.1 June 2014 Angelo Corsaro Ph.D.

I D C T E C H N O L O G Y S P O T L I G H T. V i r t u a l and Cloud D a t a Center Management

Next Generation Privilege Identity Management

Wednesday, March 20, 2013, Middleware Dr. Douglas C. Schmidt

That Set the Foundation for the Private Cloud

DEFINING SECURITY FOR TODAY S CLOUD ENVIRONMENTS. Security Without Compromise

Institute for Software Integrated Systems Vanderbilt University Nashville, Tennessee

Enable Infrastructure Beyond Cloud

Wireless Environments

Overview SENTINET 3.1

Adapting Enterprise Distributed Real-time and Embedded (DRE) Pub/Sub Middleware for Cloud Computing Environments

A Data-Centric Approach for Modular Assurance Abstract. Keywords: 1 Introduction

Using Distributed Intelligence to Aid MicroGrid and Distribution Technology Deployment

Part III: Evaluating the Business Value of the Hybrid Cloud

Large Scale Computing Infrastructures

Transform to Your Cloud

VMware Cloud Application Platform

BUILDING the VIRtUAL enterprise

Why Converged Infrastructure?

ALCATEL-LUCENT ENTERPRISE DATA CENTER SWITCHING SOLUTION Automation for the next-generation data center

Next-Generation HCI: Fine- Tuned for New Ways of Working

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

VMware vsphere 4. The Best Platform for Building Cloud Infrastructures

CONNECTING THE CLOUD WITH ON DEMAND INFRASTRUCTURE

Demystifying the Cloud With a Look at Hybrid Hosting and OpenStack

Internet of Things (IoT) CSE237A

DRAM and Storage-Class Memory (SCM) Overview

The Economic Benefits of a Cooperative Control Wireless LAN Architecture

02 - Distributed Systems

Composable Infrastructure for Public Cloud Service Providers

Why Microsoft Azure is the right choice for your Public Cloud, a Consultants view by Simon Conyard

Rethinking VDI: The Role of Client-Hosted Virtual Desktops. White Paper Virtual Computer, Inc. All Rights Reserved.

Lesson 14: Cloud Computing

Real-time & Embedded Systems Workshop July 2007 Building Successful Real-time Distributed Systems in Java

Cisco Unified Computing System Delivering on Cisco's Unified Computing Vision

Outline. Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems

Weapon Systems Open Architecture Overview

Data-Centric Architecture for Space Systems

Designing and debugging real-time distributed systems

INFS 214: Introduction to Computing

Upgrade Your MuleESB with Solace s Messaging Infrastructure

Air Force Institute of Technology

Hitachi Enterprise Cloud Container Platform

Toward a Memory-centric Architecture

Public, Private, or Hybrid Cloud

Build application-centric data centers to meet modern business user needs

Cloud Security Gaps. Cloud-Native Security.

BUILD A BUSINESS CASE

VMworld 2015 Track Names and Descriptions

FLUID COMPUTING. ARC FORUM, India Ricky Watts Director of Industrial Solutions, Wind River IN A SOFTWARE-DEFINED WORLD

Transforming the Internal IT Landscape with APIs. Scott Cranton Director, Application Platform SAs April 2018

Lossless 10 Gigabit Ethernet: The Unifying Infrastructure for SAN and LAN Consolidation

From MDD back to basic: Building DRE systems

Network Implications of Cloud Computing Presentation to Internet2 Meeting November 4, 2010

IT your way - Hybrid IT FAQs

Priya Narasimhan. Assistant Professor of ECE and CS Carnegie Mellon University Pittsburgh, PA

Cloud Computing: Making the Right Choice for Your Organization

UNCLASSIFIED. R-1 Program Element (Number/Name) PE D8Z / Software Engineering Institute (SEI) Applied Research. Prior Years FY 2013 FY 2014

IBM POWER SYSTEMS: YOUR UNFAIR ADVANTAGE

Deterministic Ethernet & Unified Networking

Multicore Computing and Scientific Discovery

W H I T E P A P E R U n l o c k i n g t h e P o w e r o f F l a s h w i t h t h e M C x - E n a b l e d N e x t - G e n e r a t i o n V N X

3. Quality of Service

Accelerate Your Enterprise Private Cloud Initiative

A Predictable RTOS. Mantis Cheng Department of Computer Science University of Victoria

Connecting your Microservices and Cloud Services with Oracle Integration CON7348

THE VMTURBO CLOUD CONTROL PLANE

AT&T Flow Designer. Current Environment

How can the Future Internet

MASERGY S MANAGED SD-WAN

02 - Distributed Systems

Data Consistency with SPLICE Middleware. Leslie Madden Chad Offenbacker Naval Surface Warfare Center Dahlgren Division

Enterprise Cloud Computing. Eddie Toh Platform Marketing Manager, APAC Data Centre Group Cisco Summit 2010, Kuala Lumpur

Intelligent Event Processing in Quality of Service (QoS) Enabled Publish/Subscribe (Pub/Sub) Middleware

Vortex OpenSplice. Python DDS Binding

Functional Requirements for Grid Oriented Optical Networks

Huawei Agile Controller. Agile Controller 1

Data Model Considerations for Radar Systems

The Future of Virtualization. Jeff Jennings Global Vice President Products & Solutions VMware

Reconstruct to re-energize

Government IT Modernization and the Adoption of Hybrid Cloud

Achieving Digital Transformation: FOUR MUST-HAVES FOR A MODERN VIRTUALIZATION PLATFORM WHITE PAPER

Enabling Cloud Adoption. Addressing the challenges of multi-cloud

Cato Cloud. Software-defined and cloud-based secure enterprise network. Solution Brief

Assistant Professor, School of Computer Applications,Career Point University,Kota, Rajasthan, India Id

Software-Defined Networking (SDN) Overview

AN APPLICATION-CENTRIC APPROACH TO DATA CENTER MIGRATION

The Future of Virtualization Desktop to the Datacentre. Raghu Raghuram Vice President Product and Solutions VMware

XBS Application Development Platform

Angelo Corsaro, Ph.D. Chief Technology Officer! OMG DDS Sig Co-Chair PrismTech

Christopher Covert. Principal Product Manager Enterprise Solutions Group. Copyright 2016 Symantec Endpoint Protection Cloud

How the WAN is Driving Digital Transformation. An IDC InfoBrief, Sponsored by

Cisco Unified Data Center Strategy

Infrastructure Middleware (Part 1): Hardware Abstraction Layer (HAL)

Vortex Whitepaper. Simplifying Real-time Information Integration in Industrial Internet of Things (IIoT) Control Systems

EBOOK. FROM DISASTER RECOVERY TO ACTIVE-ACTIVE: NuoDB AND MULTI-DATA CENTER DEPLOYMENTS

Data center interconnect for the enterprise hybrid cloud

Transcription:

Dependable Computing Clouds for Cyber-Physical Systems Dependability Issues in Cloud Computing (DISCCO) Workshop October 11 th, 2012 Douglas C. Schmidt d.schmidt@vanderbilt.edu Institute for Software Integrated Systems Vanderbilt University Nashville, TN http://www.dre.vanderbilt.edu/~schmidt/dcc4cps.pdf

Outline of Presentation Context & terminology Prior R&D progress Current R&D challenges A promising solution Upcoming event Concluding remarks 2

Outline of Presentation Context & terminology Prior R&D progress Current R&D trends & challenges A promising solution Upcoming event Concluding remarks 3

Overview of Cyber-Physical Systems A cyber-physical system (CPS) features a tight combination of & coordination between the system s computational & physical elements A CPS often involves networked processing elements that control physical, chemical, or biological processes or devices In a CPS the right answer delivered too late becomes the wrong answer i.e., dependability has a temporal dimension (& increasingly a security dimension) CPSs are increasingly distributed systems 4 rather than standalone devices

Cloud computing involves the delivery of computing as a service rather than as a product Essentially a form of outsourcing for certain hardware & software components & activities Cloud offerings enable economies of scale via multi-tentancy & elasticity Overview of Cloud Computing Typically run atop shared (& virtualized) hardware, storage, data access, middleware, software, etc. Cloud services don t require end-user knowledge of the physical location & configuration of the computing/comm infrastructure delivering the services Similar to utilities, such as power grids, datacom/telecom, water, sewer, etc. Key characteristics of clouds for 5 CPS are multi-tenancy & elasticity

Outline of Presentation Context & terminology Prior R&D progress Current R&D trends & challenges A promising solution Upcoming event Concluding remarks 6

Prior R&D Progress for Cyber-Physical Systems From this design paradigm Nav Air Frame WTS AP SPLnner Cyclic Exec FLIR IFF The designs of legacy CPSs tend to be: Stovepiped Proprietary Brittle & non-adaptive Expensive to develop & evolve Vulnerable Problem: Small changes can break 7 nearly anything & everything

Prior R&D Progress for Cyber-Physical Systems and this operational paradigm Real-time QoS requirements for legacy CPSs: Ensure predictable end-to-end QoS, e.g., Minimize latency, jitter, & footprint Bound priority inversions Allocate & manage resources statically & avoid sharing Utility Utility Curve Broken Works Resources Hard Requirements Problem: Lack of any resource 8 can break nearly everything

Prior R&D Progress for Cyber-Physical Systems and this operational paradigm Real-time QoS requirements for legacy CPSs: Ensure predictable end-to-end QoS, e.g., Minimize latency, jitter, & footprint Bound priority inversions Allocate & manage resources statically & avoid sharing This is not at all what we think 9 of as a computing cloud!

Prior R&D Progress for Cyber-Physical Systems to this design paradigm Air Frame Event Channel AP Nav WTS The designs of today s leading-edge CPSs tend to be more: Replication Service PLanner IFF FLIR Object Request Broker Layered & componentized Standard & COTS Robust to failures & adaptive to operating conditions Cost effective to evolve & retarget Result: changing requirements & environments 10 can be handled more flexibly

Prior R&D Progress for Cyber-Physical Systems and this operational paradigm Resources Result: better support for operations 11 with scarce/contended resources Utility Desired Utility Curve Working Range Ensure acceptable end-to-end QoS, e.g., Softer Requirements Minimize latency, jitter, & footprint Minimize priority inversions Resources are allocated/managed dynamically & can be shared

Prior R&D Progress for Cyber-Physical Systems and this operational paradigm Ensure acceptable end-to-end QoS, e.g., Minimize latency, jitter, & footprint Minimize priority inversions Resources are allocated/managed dynamically & can be shared Some of these CPS computing environments 12 have much in common with clouds

New Challenge: Ultra-Large-Scale Cyber-Physical Systems Key problem space challenges Limited communication bandwidth Connectivity is intermittent & connections are noisy Processing & storage capacity are limited & power/weight limits affect usage patterns Must adapt to unanticipated workflows Frequent dynamic network topology & membership changes Key solution space challenges Enormous accidental & inherent complexities Continuous evolution & change Highly heterogeneous platform, language, & tool environments 13 Mapping problem space requirements to solution space artifacts is very hard!

New Challenge: Ultra-Large-Scale Cyber-Physical Systems Key problem space challenges Limited communication bandwidth, Connectivity is intermittent & connections are noisy Processing & storage capacity are limited & power/weight limits affect usage patterns Must adapt to unanticipated workflows Frequent dynamic network topology & membership changes Key solution space challenges Enormous accidental & inherent complexities Continuous evolution & change Highly heterogeneous platform, language, & tool environments 14 Cyber-Physical Systems-of-Systems well beyond the scope of clouds today

Outline of Presentation Context & terminology Prior R&D progress Current R&D trends & challenges A promising solution Upcoming event Concluding remarks 15

Convenient Trend: Elastic Hardware Platforms Elastic hardware based on multi-core & distributed-core architectures now available at reasonable prices Elastic hardware has potential to substantially accelerate performance by parallelizing application workloads & autoscaling data processing at runtime Goal is to add/utilize more hardware without changing application business logic or configurations Current focus of elastic hardware is largely on web hosting applications in public cloud environments Elastic hardware is necessary but not 16 sufficient for elastic CPS applications

Impediments to Elastic Hardware for CPSs Inadequate programming models Complicated & obtrusive APIs Can t use hardware effectively & scalably Inadequate knowledge of real-time, concurrency, & networking e.g., high probability of race conditions, deadlocks, priority inversion, & missed deadlines Inadequate mechanisms to transition seamlessly from multi- to distributedcore environments Inadequate at scale quality-of-service (QoS) support e.g., lack of control over key QoS impacting resource usage & end-toend data deliver semantics Some impediments affect many types 17 of systems, some mostly affect CPSs

Key Research Challenges for Elastic CPSs Precise auto-scaling of resources with a systemwide focus Flexible optimization algorithms to balance real-time constraints with cost & other goals Improved fault-tolerance fail-over that supports real-time requirements Data provisioning & load balancing algorithms that use physical properties of computations These challenges are hard enough for 18 non-elastic cyber-physical systems!

Key Research Challenges for Elastic CPSs Precise auto-scaling of resources with a systemwide focus Lack of predictable workload patterns makes it hard to inform cloud providers of resource requirements Current state-of-the-art in auto-scaling algorithms manage services in isolation CPSs require auto-scaling algorithms to operate on end-to-end task chains Physical stability & safety properties may require exceedingly complex analyses e.g., reachability of hybrid cyber-physical states CPU utilization 19

Key Research Challenges for Elastic CPSs Flexible optimization algorithms to balance realtime constraints with cost & other goals CPS deployments must be schedulable on all resources acquired from cloud providers to ensure real-time response times while optimizing desired objective functions e.g., minimizing costs Principled means are needed to co-schedule or perform admission control & eviction of mixedcriticality task sets deployed on cloud resources Multi-dimensional Resource Management Cost 20

Key Research Challenges for Elastic CPSs Improved fault-tolerance fail-over that supports real-time requirements Although some cloud platforms tolerate faults for provisioned resources, this is insufficient for CPSs where real-time faulttolerance of end-to-end task chains must be met simultaneously while minimizing costs The complex & stochastic nature of some CPSs means that reasoning about consequences of faults is an important open research area 21

Key Research Challenges for Elastic CPSs Data provisioning & load balancing algorithms that use physical properties of computations CPSs generate load on a cloud computing environment due to physical stimuli To build most scalable & high-performance systems, algorithms & techniques are needed to exploit physical characteristics of data & computation to improve the distribution of work in a cloud environment cache affinity geographic associations social network linkages power consumption 22

Outline of Presentation Context & terminology Prior R&D progress Current R&D trends & challenges A promising solution Upcoming event Concluding remarks We need a holistic solution that provides 23 an elastic CPS software infrastructure

Requirements for Elastic CPS Software Infrastructure Flexibility loosely coupled components that can be analyzed, replaced, reused, distributed, & parallelized dependably Adaptability provide APIs that adapt to existing code, rather than always having to adapt code to an API Uniformity seamless (ideally standards-based) support for multi-core & distributed-core Scalability static & dynamic load balancing ensures best & dependable utilization of available elastic hardware resources Dependability Low Latency Dynamic Discovery Middle ware Load Balancing Data Distribution Middleware is at the heart of elastic 24 CPS software infrastructure

Key Layers of CPS Software Infrastructure Provide mechanisms to manage endsystem resources, e.g., CPU scheduling, file systems, memory management, & IPC Domain-Specific Services Common Middleware Services Distribution Middleware Host Infrastructure Middleware Operating Systems & Protocols 25

Key Layers of CPS Software Infrastructure Encapsulates & enhances native OS mechanisms to create reusable network programming components Domain-Specific Services Common Middleware Services Distribution Middleware Host Infrastructure Middleware Operating Systems & Protocols 26

Key Layers of CPS Software Infrastructure Defines higher-level distributed programming models whose reusable APIs & components automate & extend native OS capabilities Domain-Specific Services Common Middleware Services Distribution Middleware Host Infrastructure Middleware Operating Systems & Protocols 27

Key Layers of CPS Software Infrastructure Augment distribution middleware by defining higher-level domainindependent services that focus on programming business logic Domain-Specific Services Common Middleware Services Distribution Middleware Host Infrastructure Middleware Operating Systems & Protocols 28

Key Layers of CPS Software Infrastructure Tailored to the requirements of particular domains, such as SCADA, C4ISR, avionics, vehtronics, air traffic mnagement, aerospace, etc. Domain-Specific Services Common Middleware Services Distribution Middleware Host Infrastructure Middleware Operating Systems & Protocols 29

Promising Elastic CPS Middleware: DDS The OMG Data Distribution Service (DDS) promotes a pattern language that leads to loosely coupled, polyglot, evolvable, scalable, high performance, & dependable, CPSs DDS supports flat, relational, & OO information modeling DDS global data space allows apps to read/write data anonymously/ asynchronously, decoupled in space & time DDS QoS-enabled pub/sub model allows apps to produce/consume information into/from the global data space Global Data Space DDS mainly provides distribution middleware 30 & common middleware services

Promising Elastic CPS Middleware: DDS DDS provides a rich set of QoS policies that control of resource usage, end-to-end data delivery, & data availability, e.g.: Batching Priority Deadline Data Durability Redundancy Data History en.wikipedia.org/wiki/data_distribution_service 31 has a good DDS overview

Promising Elastic CPS Middleware: DDS DDS provides a rich set of QoS policies that control of resource usage, end-to-end data delivery, & data availability, e.g.: Batching Priority Deadline Data Durability Redundancy Data History Bridges are available across technologies to expose relevant data to over 80 communication protocols, without imposing changes into existing systems www.slideshare.net/angelo.corsaro/cloud-messaging-with-opensplice-dds 32

DDS is an OMG standard that itself is based on many associated open standards All mainstream DDS implementations are now available in opensource form Promising Elastic CPS Middleware: DDS DDS is used in many CPS research projects & production systems http://portals.omg.org/ dds provides more info on DDS open-source activities & projects 33

Outline of Presentation Context & terminology Prior R&D progress Current R&D trends & challenges A promising solution Upcoming event Concluding remarks 34

Upcoming NSF Workshop on CC4CPS www.isis.vanderbilt.edu/ workshops/cc4cps contains information on an upcoming NSF workshop on Computing Clouds for Cyber- Physical Systems (CC4CPS) Held March 14 & 15 th, 2012 in Arlington, VA Short position papers are due by November 30 th, 2012 NSF will pay for ~$1k of travel expenses for each participant 35

Outline of Presentation Context & terminology Prior R&D progress Current R&D trends & challenges A promising solution Upcoming event Concluding remarks 36

Concluding Remarks Despite promising advances in elastic hardware, deploying CPSs in cloud environments is hard without adequate support from elastic software infrastructure It s unlikely that public clouds will work for mission-critical (ultra-large-scale) cyber-physical systems The standards-based DDS middleware ecosystem provides key open-source building-blocks for creating a dependable elastic software infrastructure for CPSs Big breakthroughs often happen when what is suddenly possible meets what is desperately necessary Thomas Friedman 37

Concluding Remarks Despite promising advances in elastic hardware, deploying CPSs in cloud environments is hard without adequate support from elastic software infrastructure It s unlikely that public clouds will work for mission-critical (ultra-large-scale) cyber-physical systems The standards-based DDS middleware ecosystem provides key open-source building-blocks for creating a dependable elastic software infrastructure for CPSs Big breakthroughs often happen when what is suddenly possible meets what is desperately necessary Thomas Friedman www.coursera.org/course/posa 38

Further Reading Ultra-large-scale (ULS) systems are sociotechnical ecosystems comprised of softwarereliant systems, people, policies, cultures, & economics that have unprecedented scale: # of software & hardware elements # of connections & interdependencies # of computational elements # of purposes & perception of purposes # of routine processes & emergent behaviors # of (overlapping) policy domains & enforceable mechanisms # of people involved in some way Amount of data stored, accessed, & manipulated etc www.sei.cmu.edu/uls See blog.sei.cmu.edu for more discussions 39 of software R&D activities

Further Reading NRC Report Critical Code: Software Producibility for Defense (2010) The report focuses on ensuring the DoD has the technical capacity & workforce to design, produce, assure, & evolve innovative software-reliant systems in a predictable manner, while effectively managing risk, cost, schedule, & complexity Sponsored by Office of the Secretary of Defense (OSD) with assistance from the National Science Foundation (NSF), & Office of Naval Research (ONR), www.nap.edu/openbook.php?record_id=12979&page=r1 See blog.sei.cmu.edu for more discussions 40 of software R&D activities