Low overhead virtual machines tracing in a cloud infrastructure

Similar documents
Host-Assisted Virtual Machine Tracing and Analysis

PERFORMANCE ANALYSIS OF CLOUD COMPUTING PLATFORMS

1. What is Cloud Computing (CC)? What are the Pros and Cons of CC? Technologies of CC 27

KVM 在 OpenStack 中的应用. Dexin(Mark) Wu

KVM Weather Report. Amit Shah SCALE 14x

Container Adoption for NFV Challenges & Opportunities. Sriram Natarajan, T-Labs Silicon Valley Innovation Center

Red Hat Enterprise Virtualization Hypervisor Roadmap. Bhavna Sarathy Senior Technology Product Manager, Red Hat

Profiling: Understand Your Application

What is KVM? KVM patch. Modern hypervisors must do many things that are already done by OSs Scheduler, Memory management, I/O stacks

Build Cloud like Rackspace with OpenStack Ansible

Linux Foundation Collaboration Summit 2010

Spring 2017 :: CSE 506. Introduction to. Virtual Machines. Nima Honarmand

Course Review. Hui Lu

INTEGRATING HPFS IN A CLOUD COMPUTING ENVIRONMENT

System Wide Tracing User Need

Monitoring and Analyzing Virtual Machines Resource Overcommitment Detection and Virtual Machine Classification

Status Update About COLO FT

A Userspace Packet Switch for Virtual Machines

Running Analytics and Real-Time Monitoring of OpenStack Swift Cluster. Sreedhar Varma Vedams Inc.

Virtualization. ...or how adding another layer of abstraction is changing the world. CIS 399: Unix Skills University of Pennsylvania.

OPENSTACK: THE OPEN CLOUD

HKG : OpenAMP Introduction. Wendy Liang

EDGE COMPUTING & IOT MAKING IT SECURE AND MANAGEABLE FRANCK ROUX MARKETING MANAGER, NXP JUNE PUBLIC

Large Scale Debugging

GRNET Cloud Services

How Container Runtimes matter in Kubernetes?

Cloud & container monitoring , Lars Michelsen Check_MK Conference #4

KVM PERFORMANCE OPTIMIZATIONS INTERNALS. Rik van Riel Sr Software Engineer, Red Hat Inc. Thu May

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries.

The speed of containers, the security of VMs

Virtual Machines. Part 2: starting 19 years ago. Operating Systems In Depth IX 1 Copyright 2018 Thomas W. Doeppner. All rights reserved.

Live block device operations in QEMU

Ceph vs Swift Performance Evaluation on a Small Cluster. edupert monthly call Jul 24, 2014

Recovering Disk Storage Metrics from low level Trace events

Understanding The Performance of DPDK as a Computer Architect

Changpeng Liu. Cloud Storage Software Engineer. Intel Data Center Group

CSC 5930/9010 Cloud S & P: Virtualization

Lecture 7. Xen and the Art of Virtualization. Paul Braham, Boris Dragovic, Keir Fraser et al. 16 November, Advanced Operating Systems

Virtualization and Performance

Zhang Chen Zhang Chen Copyright 2017 FUJITSU LIMITED

Virtualisation: The KVM Way. Amit Shah

Real-Time KVM for the Masses Unrestricted Siemens AG All rights reserved

Reducing CPU usage of a Toro Appliance

Using (Suricata over) PF_RING for NIC-Independent Acceleration

Next-Generation Cloud Platform

LinuxCon 2010 Tracing Mini-Summit

Efficient and Large Scale Program Flow Tracing in Linux. Alexander Shishkin, Intel

LinuxCon North America 2016 Investigating System Performance for DevOps Using Kernel Tracing

Accelerating NVMe I/Os in Virtual Machine via SPDK vhost* Solution Ziye Yang, Changpeng Liu Senior software Engineer Intel

Knut Omang Ifi/Oracle 20 Oct, Introduction to virtualization (Virtual machines) Aspects of network virtualization:

RAMP-White / FAST-MP

Nested Virtualization and Server Consolidation

The Challenges of X86 Hardware Virtualization. GCC- Virtualization: Rajeev Wankar 36

Low-Overhead Ring-Buffer of Kernel Tracing in a Virtualization System

PCP: Ingest and Export

Hypervisor security. Evgeny Yakovlev, DEFCON NN, 2017

Outline. INF3190:Distributed Systems - Examples. Last week: Definitions Transparencies Challenges&pitfalls Architecturalstyles

I/O Systems. Amir H. Payberah. Amirkabir University of Technology (Tehran Polytechnic)

Changpeng Liu. Senior Storage Software Engineer. Intel Data Center Group

The Architecture of Virtual Machines Lecture for the Embedded Systems Course CSD, University of Crete (April 29, 2014)

Measuring zseries System Performance. Dr. Chu J. Jong School of Information Technology Illinois State University 06/11/2012

Libvirt: a virtualization API and beyond

SR-IOV Support for Virtualization on InfiniBand Clusters: Early Experience

Performance of Kernels in Virtual Machines: An Introduction to KVM Hypervisor

Cloud environment with CentOS, OpenNebula and KVM

User Workspace Management

Cloud Computing Virtualization

Chapter 13: I/O Systems

QuartzV: Bringing Quality of Time to Virtual Machines

LinuxCon North America 2012

viommu/arm: full emulation and virtio-iommu approaches Eric Auger KVM Forum 2017

CIS 21 Final Study Guide. Final covers ch. 1-20, except for 17. Need to know:

EE 660: Computer Architecture Cloud Architecture: Virtualization

Diagnosis via monitoring & tracing

Survey on Cloud Infrastructure Service: OpenStack Compute

Live Migration of Virtualized Edge Networks: Analytical Modeling and Performance Evaluation

Virtualizaton: One Size Does Not Fit All. Nedeljko Miljevic Product Manager, Automotive Solutions MontaVista Software

RDMA-like VirtIO Network Device for Palacios Virtual Machines

Agilio CX 2x40GbE with OVS-TC

OpenStack and OpenDaylight, the Evolving Relationship in Cloud Networking Charles Eckel, Open Source Developer Evangelist

Improve VNF safety with Vhost-User/DPDK IOMMU support

Operating Systems 4/27/2015

24-vm.txt Mon Nov 21 22:13: Notes on Virtual Machines , Fall 2011 Carnegie Mellon University Randal E. Bryant.

MidoNet Scalability Report

LAS16-TR06 Remoteproc & rpmsg development. Bjorn Andersson

Increase KVM Performance/Density

DPDK Summit 2016 OpenContrail vrouter / DPDK Architecture. Raja Sivaramakrishnan, Distinguished Engineer Aniket Daptari, Sr.

Abstract. Testing Parameters. Introduction. Hardware Platform. Native System

PageForge: A Near-Memory Content- Aware Page-Merging Architecture

for Multi-Services Gateways

Applying Polling Techniques to QEMU

Xen and the Art of Virtualiza2on

Evolution of the netmap architecture

High-performance aspects in virtualized infrastructures

Mental models for modern program tuning

1 Virtualization Recap

International Journal of Advance Engineering and Research Development. DPDK-Based Implementation Of Application : File Downloader

Hypervisors on ARM Overview and Design choices

Multimedia Streaming. Mike Zink

Developing cloud infrastructure from scratch: the tale of an ISP

Transcription:

Low overhead virtual machines tracing in a cloud infrastructure Mohamad Gebai Michel Dagenais Dec 7, 2012 École Polytechnique de Montreal

Content Area of research Current tracing: LTTng vs ftrace / virtio trace Preliminary work Main project proposal References 2

Area of research Tracing virtualized machines in a cloud context 3

Current VM tracing LTTng tools 2.1: support for network streaming Sending trace data between the guest and the host over the network Might not be the best suited approach for the guest and the host to communicate trace data Usage of network bandwidth IP based communications may be inadvertently blocked by the virtual machine administrator 4

Virtio http://www.ibm.com/developerworks/linux/library/l virtio/figure2.gif Based on paravirtualization : the operating system knows it is running in a VM Virtio: a set of device drivers for virtual machines virtio net (network adapter) virtio balloon (dynamic allocation and disallocation of the guest's memory) virtio serial (char device communication channel) 5

Virtio trace Device driver for tracing since Linux kernel version 3.7 Sending traces from guest to host without copying Low overhead Guest OS tracing tool Trace data No memory copying Ring buffer read( ) virtio trace splice( ) virtio serial FIFO qemu Host OS 6

Preliminary work Running benchmarks on native VM, with ftrace (using virtio trace), with LTTng (using streaming) Compare the guest to host communication overhead between ftrace and LTTng 7

Main project proposal Improve the performance and extent of virtual machine tracing Adapt the best features of existing kernel ring buffers for LTTng and ftrace Insure uniform clock sources between kernel and user space applications in physical and virtual machines Trace and analyze VM specific behavior such as page sharing with KSM (kernel same page merging) 8

References http://www.virtualbox.org/manual/ch06.html http://www.ibm.com/developerworks/linux/library/l virtio/index.html http://log.amitshah.net/tag/virtio serial/ https://events.linuxfoundation.org/images/stories/pdf/lcjp2012_yunomae.pdf https://lkml.org/lkml/2012/7/23/511 https://lkml.org/lkml/2012/6/5/143 9

Large Scale Data Center Monitoring and Debugging Infrastructure Julien Desfossez Michel Dagenais Dec 7, 2012 École Polytechnique de Montreal

Content Problem definition State of the art Proposed approach Early work References 11

Problem Definition Large scale data centers : hosting, cloud computing, scientific computing, etc Virtualized data centers Monitoring inefficient and costly (/proc reading, locks and system calls) Mostly high level metrics (CPU, memory, load, etc) or application level monitoring without correlation to the OS or physical layer No virtualisation/hypervisor metrics 12

Monitoring Multiple aspects: Availability testing Load measurement Trending Reporting 13

State of the Art Commercial products add Cloud to their softwares titles Most of commercial systems focus on the interface and a predefined set of tests Reuse the same technology as monitoring a single machine and the associated services Focus mainly on capacity planning with high level information 14

State of the Art Swift Recon : middle man metrics collector Common metrics : Load average, sockets stats, /proc/meminfo, etc Object storage dedicated metrics : MD5 of each ring file most recent object replication time count of each type of quarantined files: account, container, or object. Count of async_pendings (deferred container updates) on disk 15

State of the Art Rackspace Cloud Monitoring API to access monitoring data High level remote tests HTTP requests to pages TCP connect Banner check 16

State of the Art RockSteady Metric analysis and correlation engine User defined metrics Sending/receiving done with message queueing (RabbitMQ) Optional graphing engine (Graphite) Complex Event Processing to extract root cause from event by integrating multiple data sources 17

18

Proposed Approach Tracing can be used as an efficient monitoring backend (Master's results) LTTng + Perf PMU counters Extract metrics from trace data Define metrics relevant for large scale data centers and virtualized environments Generate statistics and detailled usage reports 19

Proposed Approach Dynamically adjustable level of information (high level metrics down to detailled in application behavior) Streaming and live analysis Aggregation on intermediate nodes (depending on topology) 20

Target environment OpenStack KVM LXC cgroups 21

Early Work Basic metrics computation and reporting (LTTngTop during Master) Streaming traces (lttng tools 2.1) Index generation Live analysis (in progress) 22

References OpenStack Manual chapter 6, OpenStack Object Storage Monitoring Proven Practice: Metrics for Virtualization Management http://communities.vmware.com/docs/doc 11375?decorator Intel Virtualization Performance Metrics http://software.intel.com/en us/articles/virtualization performa Google Rocksteady https://code.google.com/p/rocksteady/ 23