CIS : Computational Reproducibility

Similar documents
CIS : Computational Reproducibility

CIS : Computational Reproducibility

Virtualization. Starting Point: A Physical Machine. What is a Virtual Machine? Virtualization Properties. Types of Virtualization

Virtualization. ! Physical Hardware Processors, memory, chipset, I/O devices, etc. Resources often grossly underutilized

Introduction to Virtual Machines. Carl Waldspurger (SB SM 89 PhD 95) VMware R&D

VMware Overview VMware Infrastructure 3: Install and Configure Rev C Copyright 2007 VMware, Inc. All rights reserved.

VMware - VMware vsphere: Install, Configure, Manage [V6.7]

Virtualization. Dr. Yingwu Zhu

OPS-9: Fun With Virtualization. John Harlow. John Harlow. About John Harlow

IBM Bluemix compute capabilities IBM Corporation

7 Things ISVs Must Know About Virtualization

Understanding Virtual System Data Protection

VMware vsphere: Fast Track [V6.7] (VWVSFT)

How it can help your organisation

VMware vsphere 6.5: Install, Configure, Manage (5 Days)

Distributed Systems COMP 212. Lecture 18 Othon Michail

VEMBU VS VEEAM Why Vembu is Better. VEMBU TECHNOLOGIES

Virtualization. ...or how adding another layer of abstraction is changing the world. CIS 399: Unix Skills University of Pennsylvania.

Chapter 5 C. Virtual machines

VMware vsphere: Fast Track. System administrators System engineers. Cursusduur: 5 Dagen Cursuscode: VSFT Version: 6.7. Beschrijving: Doelgroep:

Hyper-Convergence De-mystified. Francis O Haire Group Technology Director

[VMICMV6.5]: VMware vsphere: Install, Configure, Manage [V6.5]

Virtualization with Arcserve Unified Data Protection

CS 470 Spring Virtualization and Cloud Computing. Mike Lam, Professor. Content taken from the following:

Data Centers and Cloud Computing

VMWARE VSPHERE: FAST TRACK V6.7 (EDU-VSFT67)

Securing your Virtualized Datacenter. Charu Chaubal Senior Architect, Technical Marketing 6 November, 2008

LINUX CONTAINERS. Where Enterprise Meets Embedded Operating Environments WHEN IT MATTERS, IT RUNS ON WIND RIVER

Data Centers and Cloud Computing. Slides courtesy of Tim Wood

WHITE PAPER SEPTEMBER VMWARE vsphere AND vsphere WITH OPERATIONS MANAGEMENT. Licensing, Pricing and Packaging

Data Centers and Cloud Computing. Data Centers

Verron Martina vspecialist. Copyright 2012 EMC Corporation. All rights reserved.

VMware vsphere: Install, Configure, Manage (vsphere ICM 6.7)

Better Security with Virtual Machines

Virtualization. Application Application Application. MCSN - N. Tonellotto - Distributed Enabling Platforms OPERATING SYSTEM OPERATING SYSTEM

Data Protection for Virtualized Environments

Spring 2017 :: CSE 506. Introduction to. Virtual Machines. Nima Honarmand

VMware Enterprise Desktop Solutions: What s NEW with VMware View 3. John Hinkle Professional Services Practice principal February 18 th, 2009

VMware vsphere: Install, Configure, Manage plus Optimize and Scale- V 6.5. VMware vsphere 6.5 VMware vcenter 6.5 VMware ESXi 6.

Enterprise X-Architecture 5th Generation And VMware Virtualization Solutions

Devops, Docker and Security. John

NexentaStor VVOL

VMware vsphere 5.5 Advanced Administration

Unisys SafeGuard Solutions

HPC learning using Cloud infrastructure

Pavel Anni Oracle Solaris 11 Feature Map. Slide 2

Arcserve Solutions for Amazon Web Services (AWS)

LINUX Virtualization. Running other code under LINUX

HWDSB s Journey to the Cloud

Dell EMC Unity: Built-In Hybrid Cloud & Software Defined Capabilities. Wei Chen Product Technologist Midrange & Entry Systems

A Better Approach to Leveraging an OpenStack Private Cloud. David Linthicum

Availability for the modern datacentre Veeam Availability Suite v9.5

VMware vsphere with ESX 6 and vcenter 6

ARCSERVE UNIFIED DATA PROTECTION

Deploying Software Defined Storage for the Enterprise with Ceph. PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu

The vsphere 6.0 Advantages Over Hyper- V

About John Harlow. About BravePoint. OPS- 4: Fun With Virtualization

Table of Contents 1.1. Overview. Containers, Docker, Registries vsphere Integrated Containers Engine

Potpuna virtualizacija od servera do desktopa. Saša Hederić Senior Systems Engineer VMware Inc.

STREAMLINING THE DELIVERY, PROTECTION AND MANAGEMENT OF VIRTUAL DESKTOPS. VMware Workstation and Fusion. A White Paper for IT Professionals

Symantec Reference Architecture for Business Critical Virtualization

The Realities of Virtualization

Application Centric Microservices Ken Owens, CTO Cisco Intercloud Services. Redhat Summit 2015

Specialist Technology Architect, Data Protection Version 1.0 Certification Description

What s new. James De Clercq (RealDolmen) Timothy Dewin (Veeam Software)

Hybrid Cloud Data Protection & Storage

CAS8490 Delivering Recovery as a Service (RaaS) November 2014

Master Services Agreement:

Introduction. Application Versions. Virtual Machine Defined. Other Definitions. Tech Note 656 Building Wonderware Solution Architectures on VMware

An overview of virtual machine architecture

Quick Prototyping+CI with LXC and Puppet

Paragon Protect & Restore

Module 1: Virtualization. Types of Interfaces

CIS : Scalable Data Analysis

Foundation for Cloud Computing with VMware vsphere 4

Virtual Machines Disco and Xen (Lecture 10, cs262a) Ion Stoica & Ali Ghodsi UC Berkeley February 26, 2018

Virtual Volumes FAQs First Published On: Last Updated On:

Introduction to Virtualization. From NDG In partnership with VMware IT Academy

VSPHERE: FAST TRACK V6.5 (EDU-VSFT65)

Symantec Backup Exec 2012 OEM FAQ

Docker and HPE Accelerate Digital Transformation to Enable Hybrid IT. Steven Follis Solutions Engineer Docker Inc.

"Software-defined storage Crossing the right bridge"

Tools: Versioning. Dr. David Koop

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December

Introduction to Cloud Computing

Optimize Your SQL Server 2008/R2 Migration for Cost and Availability

HP Virtual Desktop Infrastructure with VMware View Overview

VMware Workstation 5 Lab. New Features and Capabilities: Multiple Snapshots, Teams, Clones, Video Capture and More

VMware vsphere with ESX 4 and vcenter

Securing the Data Center against

Distributed File System Support for Virtual Machines in Grid Computing

VxRail: Level Up with New Capabilities and Powers GLOBAL SPONSORS

Veeam Availability Solution for Cisco UCS: Designed for Virtualized Environments. Solution Overview Cisco Public

VMWARE PROTECTION WITH DELL EMC NETWORKER 9

How Symantec Backup solution helps you to recover from disasters?

LINUX, WINDOWS(MCSE),

Detail the learning environment, remote access labs and course timings

VMware ESX Server 3i. December 2007

Consolidated Disaster Recovery. Paul Kangro Applied Technology Strategiest

Back To The Future - VMware Product Directions. Andre Kemp Sr. Product Marketing Manager Asia - Pacific

Transcription:

CIS 602-01: Computational Reproducibility Containers Dr. David Koop

Virtual Machines Software Abstraction - Behaves like hardware - Encapsulates all OS and application state Virtualization Layer - Extra level of indirection - Decouples hardware, OS - Enforces isolation - Multiplexes physical hardware across VMs [via E. de Lara] 2

Virtualization Properties Isolation - Fault isolation - Performance isolation Encapsulation - Cleanly capture all VM state - Enables VM snapshots, clones Portability - Independent of physical hardware - Enables migration of live, running VMs Interposition - Transformations on instructions, memory, I/O - Enables transparent resource overcommitment, encryption, compression, replication 3

Types of Virtualization Native/Bare metal (Type 1) - Higher performance - ESX, Xen, HyperV Hosted (Type 2) - Easier to install - Leverage host s device drivers - VMware Workstation, Parallels [http://itechthoughts.wordpress.com/tag/full-virtualization/ via E. de Lara] 4

Virtual Machine Uses Software Testing: Test multiple configurations on one computer Migration: if a server fails, move the virtual machine elsewhere Cross-environment work: Windows on Linux Enterprise support: upgrade via image Education: concentrate on math/programming rather than install Custom prototypes: try-before-you-buy [B. Howe, 2014] 5

Approaches to disseminating software high effort required by experimenter low controlled environments extensive documentation raw code and data extensive documentation controlled environments raw code and data virtual machines virtual machines low high effort required by those who only reproduce the experiments low high effort required by those who reuse and extend the results [B. Howe, 2014] 6

Improving Reproducibility Capturing more variables Fewer constraints on research methods On-Demand Backups Virtual Machines as Citable Publications Code, Data, Environment + Resources Automatic Upgrades Competitive, Elastic Pricing Reproducibility for Complex Architectures Unfettered Collaborative Experiments Data-intensive Computing Cost Sharing A Foundation for Single-Payer Funding Compatibility with Other Approaches [B. Howe, 2014] 7

Remaining Challenges Cost Culture Provenance Reuse [B. Howe, 2014] 8

Non-challenges Security Licensing Vendor Lock-In and Long-Term Preservation [B. Howe, 2014] 9

Ocean Appliance Example Ship the entire machine instead of trying to configure an existing machine with all of the new software Easier, cheaper, and safer to build the box in the lab and hand it out for free than to work with the ships admin to get our software running. Modern analog: Easier to build and distribute a virtual appliance than it is to support installation of your software. [B. Howe, 2014] 10

Virtualization and the Cloud Virtualization = Code + Data + Environment Cloud = Virtualization + Resources + Services Cloud allows on-demand resources, centralized maintenance, supply+demand Computation done near the data (cannot FTP data around for many datasets due to size and transfer costs) [B. Howe, 2014] 11

Project Find some papers that you may be interested in reproducing Do a survey of the material that is available for each paper: - Code? Is the code under version control? - Data? Is it clear how to process or understand the data? Is there metadata? - Virtual machine or container? Does the hardware/software that deals with these still work? - Provenance? Do we have a record of the steps taken in producing a result? How complete is it? 12

Project If you are interested in a topic that aligns with reproducibility, please email me/talk to me about your ideas For example, if you are working on a research project that could incorporate reproducibility Formal Specification Online http://www.cis.umassd.edu/~dkoop/cis602/project.html Due Monday, November 7 13

Introduction to Docker Docker, Inc.

The Problem Matrix [Docker, Inc., 2016] 15

The Solution: Containers [Docker, Inc., 2016] 16

Containers vs. Virtual Machines [Docker, Inc., 2016] 17

Containers vs. Virtual Machines [D. Merkel, 2014] 18

Related: Package Management & Deployment Examples: - Anaconda for Python - Gems for Ruby - apt-get, yum, etc. for Linux distributions 19

Containers and Reproducibility What are the benefits of containers over virtual machines with respect to reproducibility? Do containers address all of the problems we are concerned with? What issues remain? 20