Modern systems: multicore issues

By Paul Grubbs
Portions of this talk were taken from Deniz Altinbuken's talk on Disco in 2009: http://www.cs.cornell.edu/courses/cs6410/2009fa/lectures/09-multiprocessors.ppt

What papers will we be discussing?
- The Multikernel: A new OS architecture for scalable multicore systems. Andrew Baumann, Paul Barham, Pierre-Evariste Dagand, Tim Harris, Rebecca Isaacs, Simon Peter, Tim Roscoe, Adrian Schüpbach, and Akhilesh Singhania. Proceedings of the Twenty-Second ACM Symposium on Operating Systems Principles (Big Sky, Montana, United States), ACM, 2009.
- Disco: Running Commodity Operating Systems on Scalable Multiprocessors. Edouard Bugnion, Scott Devine, and Mendel Rosenblum. 16th ACM Symposium on Operating Systems Principles (SOSP), October 1997, pages 143-156.

High-level context
- General-purpose operating systems must run efficiently on many different architectures
  - Multiprocessing
  - Non-uniform memory access (NUMA)
  - (Cache coherence?)
- Commodity, general-purpose OSs are not designed to do this
- Rewriting them should be avoided
- Related work: Exokernels (1995), SPIN (1995)

Disco: Running Commodity Operating Systems on Scalable Multiprocessors
Edouard Bugnion, Scott Devine, and Mendel Rosenblum
What is the problem being considered?
- Multiprocessing requires extensive OS rewrites
- NUMA is hard and requires even more rewrites

What is the authors' solution to this problem?
- A new twist on an old idea: virtual machine monitors (VMMs)
- VMMs updated for the multiprocessing era

Disco vs. exokernels?
- An exokernel leaves resource management to applications
  - It only multiplexes physical resources
  - Disco virtualizes them
- Disco can run commodity OSs with little or no modification
  - It is more difficult to run commodity OSs on exokernels

Disco vs. VM/370?
- Both are virtual machine monitors
- VM/370 maps virtual disks to physical disk partitions
  - Disco uses shared copy-on-write disks to decrease storage overhead
- Disco supports ccNUMA multiprocessors
  - It heavily optimizes for NUMA and shared-memory access

(picture taken from Disco paper)

Abstractions of hardware
- Virtual CPU
- Virtualized physical memory
- Virtualized I/O devices

Virtual CPUs
- No emulation of most instructions: code runs directly on the hardware CPU
  - Exception: privileged operations (TLB updates, device access) must be emulated by Disco
- Disco keeps a process-table-like entry for each vCPU for fast emulation
- A vCPU scheduler time-shares the physical CPUs among vCPUs
- Compare to Xen paravirtualization?
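The split between direct execution and emulation on the virtual CPU slide can be sketched as a toy trap-and-emulate loop. This is a minimal illustration, not Disco's implementation: the opcode names, the `VCpu` class, and the fields of `saved_state` are all hypothetical.

```python
# Hypothetical opcode names standing in for privileged operations.
PRIVILEGED = {"tlb_write", "device_io"}


class VCpu:
    """Per-vCPU record kept by the monitor (akin to a process table entry)."""

    def __init__(self, vcpu_id):
        self.vcpu_id = vcpu_id
        self.saved_state = {"tlb": {}, "io_log": []}  # virtualized privileged state
        self.direct = 0    # instructions that ran without monitor involvement
        self.emulated = 0  # instructions that trapped into the monitor


def emulate(vcpu, op, arg):
    """Monitor-side handler: apply the privileged operation to the vCPU's state."""
    vcpu.emulated += 1
    if op == "tlb_write":
        virtual_page, physical_page = arg
        vcpu.saved_state["tlb"][virtual_page] = physical_page
    elif op == "device_io":
        vcpu.saved_state["io_log"].append(arg)


def run(vcpu, instructions):
    """Run unprivileged instructions 'directly'; trap on privileged ones."""
    for op, arg in instructions:
        if op in PRIVILEGED:
            emulate(vcpu, op, arg)
        else:
            vcpu.direct += 1  # stands in for native execution on the real CPU
```

In this sketch only the two privileged operations pay the cost of entering the monitor, which is the point of running everything else natively.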

Virtualized physical memory
- Offers a uniform memory abstraction to commodity OSs on top of the multiprocessor's ccNUMA memory
  - Dynamic page migration/replication
- A small change to the OS: Disco allocates shared memory regions that multiple VMs can access
  - Example: a database with a shared buffer cache
- Drawback: redundant OS/application code
  - Solution: transparent sharing of redundant read-only pages, such as kernel code
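One way to picture the uniform memory abstraction with dynamic page migration is a monitor-side table that maps each guest "physical" page to a machine page on some NUMA node, and remaps it when another node keeps missing on it. The sketch below is an assumption-laden toy: the `Monitor` class, the per-node allocator, and `MIGRATE_THRESHOLD` are hypothetical, and the threshold value is not taken from the paper.

```python
from collections import Counter

MIGRATE_THRESHOLD = 3  # hypothetical policy knob, not a value from the paper


class Monitor:
    def __init__(self):
        self.pmap = {}                  # (vm, guest physical page) -> (node, machine page)
        self.remote_misses = Counter()  # (vm, guest page, accessing node) -> miss count
        self.next_free = Counter()      # node -> next unused machine page on that node

    def _alloc(self, node):
        machine_page = self.next_free[node]
        self.next_free[node] += 1
        return machine_page

    def map_page(self, vm, page, node):
        self.pmap[(vm, page)] = (node, self._alloc(node))

    def access(self, vm, page, cpu_node):
        """Record an access; migrate the page once a remote node is hot enough."""
        home, _ = self.pmap[(vm, page)]
        if cpu_node != home:
            key = (vm, page, cpu_node)
            self.remote_misses[key] += 1
            if self.remote_misses[key] >= MIGRATE_THRESHOLD:
                # Remap to the accessing node; the guest's page number is unchanged.
                self.pmap[(vm, page)] = (cpu_node, self._alloc(cpu_node))
                stale = [k for k in self.remote_misses if k[:2] == (vm, page)]
                for k in stale:
                    del self.remote_misses[k]
        return self.pmap[(vm, page)]
```

The guest OS never sees the remapping: it keeps using the same page number while the monitor changes which node backs it.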

Virtualized I/O devices
- No real device virtualization
  - Special VMM-specific device drivers are added to the OS kernel
- Pages are handled using copy-on-write
  - Works well for read-only data
- Persistent disks are mounted on only one VM
  - Other VMs read those disks using NFS
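The copy-on-write handling of shared disks can be illustrated with a small model in which all VMs read from one shared base image and a write creates a private copy of only the modified block. The `CowDisk` class and its method names are hypothetical, and the block contents are made up for the example.

```python
class CowDisk:
    """Toy copy-on-write disk: one shared read-only base, per-VM private blocks."""

    def __init__(self, base_blocks):
        self.base = base_blocks  # shared image: block number -> bytes (never modified)
        self.private = {}        # (vm, block number) -> bytes written by that VM

    def read(self, vm, block):
        key = (vm, block)
        if key in self.private:
            return self.private[key]  # this VM has diverged on this block
        return self.base[block]       # otherwise every VM shares the base copy

    def write(self, vm, block, data):
        # The write goes to a private copy, so other VMs keep seeing the base data.
        self.private[(vm, block)] = data
```

Storage grows only with the number of blocks each VM actually modifies, which is why the approach works best for mostly read-only data.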

How do they assess the quality of their solution?
- FLASH didn't exist yet, so they used a simulator (SimOS)
  - They weren't able to simulate the machine particularly well
  - No benchmarks for long-running or complicated processes
  - Disco's resource sharing policies were only superficially tested
- They focused on four use cases
  - Parallel compilation of GNU chess application
  - Verilog simulation of hardware
  - Raytracing

Thoughts/Questions?
- Do you prefer Disco's virtualization approach or hardware multiplexing, e.g. Exokernels? Which do you think is better?
- Disco makes support for commodity OSs a first-class goal. Is this desirable? Does it lead to suboptimal design decisions?
- In OS research, is it necessary to preserve backwards compatibility?
- Does not having a real machine to test on hurt the paper?
- What did you really like about this paper?
- What did you really not like about this paper?

The Multikernel: A new OS architecture for scalable multicore systems
Andrew Baumann, Paul Barham, Pierre-Evariste Dagand, Tim Harris, Rebecca Isaacs, Simon Peter, Tim Roscoe, Adrian Schüpbach, and Akhilesh Singhania
- What is the problem being considered?
  - Diversity in systems, diversity in cores, diversity in multiprocessor architectures
- What is the authors' solution to this problem?
  - A new OS structure: the multikernel
- How do they assess the quality of their solution?
  - Various benchmarks for cache coherence and RPC overhead

Three key ideas:
1. Make all inter-core communication explicit.
2. Make OS structure hardware-neutral.
3. View state as replicated instead of shared.

1. Make all inter-core communication explicit.
- Inter-core communication uses explicit messages
  - Avoids shared memory
- Multiprocessors look more and more like networks
- Using messages allows easy pipelining/batching
  - Makes interconnect use more efficient
- Enables automated analysis/formal verification
  - Calculi for reasoning about concurrency
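Explicit messaging with pipelining can be sketched using two queues as stand-ins for per-core message channels. This is only an illustration: Python threads and `queue.Queue` are assumptions here, not the transport a multikernel actually uses, and the names `core_main` and `run_demo` are hypothetical.

```python
import queue
import threading


def core_main(inbox, outbox):
    """'Core 1': owns its own state and changes it only in response to messages."""
    total = 0
    while True:
        msg = inbox.get()
        if msg is None:  # sentinel: no more requests
            break
        total += msg
    outbox.put(total)    # the reply is itself an explicit message


def run_demo(values):
    """'Core 0': pipelines requests without waiting for per-message replies."""
    to_core1 = queue.Queue()
    to_core0 = queue.Queue()
    worker = threading.Thread(target=core_main, args=(to_core1, to_core0))
    worker.start()
    for v in values:
        to_core1.put(v)  # sender keeps going; nothing is shared except the channel
    to_core1.put(None)
    result = to_core0.get()
    worker.join()
    return result
```

Calling `run_demo([1, 2, 3])` sends three requests back to back and then blocks once for a single reply, which is the batching pattern the slide refers to.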

2. Make OS structure hardware-neutral.
- Separate OS structure from physical instantiation: abstraction!
  - Only message transport and hardware interfaces are machine-specific
  - Minimizes code changes to the OS
- Separate IPC protocols from hardware implementation
  - Performance/extensibility benefits

3. View state as replicated instead of shared.
- Shared state is accessed as a local replica
  - Shared state is kept consistent through messages
  - Consistency requirements are tunable using different protocols
- Reduces interconnect traffic and synchronization overhead
- Tolerates CPU failures
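Replicated state can be pictured as one private copy per core, with updates shipped as messages and reads served locally. The sketch below uses a plain broadcast with no agreement step, which is a deliberate simplification and weaker than the protocols a real multikernel would choose for strongly consistent state; `CoreReplica`, `update`, and the example key are hypothetical.

```python
from collections import deque


class CoreReplica:
    """One core's private copy of some OS state, plus its message inbox."""

    def __init__(self, core_id):
        self.core_id = core_id
        self.state = {}
        self.inbox = deque()

    def read(self, key):
        return self.state.get(key)  # purely local: no interconnect traffic

    def deliver_pending(self):
        """Apply queued update messages, in arrival order."""
        while self.inbox:
            key, value = self.inbox.popleft()
            self.state[key] = value


def update(origin, replicas, key, value):
    """Apply locally, then send the change to every other replica as a message."""
    origin.state[key] = value
    for replica in replicas:
        if replica is not origin:
            replica.inbox.append((key, value))
```

Until a core drains its inbox it keeps answering reads from its old copy, which is the consistency trade-off that the tunable protocols on the slide are meant to control.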

(Taken from the Multikernel paper)

Thoughts/questions?
- Relying on distributed protocols for consistency of shared state
  - Good idea/bad idea? Why?
- Multikernels do not target support for commodity OSs
  - Good idea/bad idea? Why?
- Is their system-as-network model accurate?
  - Should the interconnect be treated like other communication channels?
- What did you really like about this paper?
- What did you really not like about this paper?

What connects these two papers?
- Multiprocessing!
- NUMA!