CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 17 Database Systems as a Cloud Service

Similar documents
ECE Enterprise Storage Architecture. Fall ~* CLOUD *~. Tyler Bletsch Duke University

Cloud & container monitoring , Lars Michelsen Check_MK Conference #4

Lecture 7: Data Center Networks

Module Day Topic. 1 Definition of Cloud Computing and its Basics

Cloud Computing Lecture 4

Introduction to data centers

CSE544 Database Architecture

Workshop Report: ElaStraS - An Elastic Transactional Datastore in the Cloud

DISTRIBUTED SYSTEMS [COMP9243] Lecture 8a: Cloud Computing WHAT IS CLOUD COMPUTING? 2. Slide 3. Slide 1. Why is it called Cloud?

Flash in a Hybrid Cloud World. How Cloud Shift will affect flash in the Data Center Steve Knipple: Cloud Shift Advisors

The Windows Azure Platform: A Perspective

CPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University

Course 20533B: Implementing Microsoft Azure Infrastructure Solutions

Introduction To Cloud Computing

Cloud Programming. Programming Environment Oct 29, 2015 Osamu Tatebe

The Windows Azure Platform: A Perspective

Copyright Heraflux Technologies. Do not redistribute or copy as your own. 1

CHEM-E Process Automation and Information Systems: Applications

Next Generation Storage for The Software-Defned World

Lecture 09: VMs and VCS head in the clouds

Computing as a Service

Distributed Systems. 31. The Cloud: Infrastructure as a Service Paul Krzyzanowski. Rutgers University. Fall 2013

The Intersection of Cloud & Solid State Storage

Aaron J. Elmore. Research Areas. Education. Grants, Honors, and Awards. Publications

Manohar Punna. Azure Database Migration Choosing the Right Tier

Large-Scale Web Applications

CLOUD COMPUTING ABSTRACT

Introduction to Cloud Computing

Configuring and Operating a Hybrid Cloud with Microsoft Azure Stack

Javier Villegas. Azure SQL Server Managed Instance

Introduction to Cloud Computing and Virtual Resource Management. Jian Tang Syracuse University

Where We Are. Review: Parallel DBMS. Parallel DBMS. Introduction to Data Management CSE 344

BI ENVIRONMENT PLANNING GUIDE

Cloud Computing and Service-Oriented Architectures

Cloud Computing. Vania Marangozova-Martin. ibd.forge.imag.fr

UVA HPC & BIG DATA COURSE. Cloud Computing. Adam Belloum

DEEP DIVE INTO CLOUD COMPUTING

What is Cloud Computing? Cloud computing is the dynamic delivery of IT resources and capabilities as a Service over the Internet.

SOFTWARE DEFINED STORAGE

Course Overview. ECE 1779 Introduction to Cloud Computing. Marking. Class Mechanics. Eyal de Lara

Cloud Computing TDM Session

ENERGY EFFICIENT VIRTUAL MACHINE INTEGRATION IN CLOUD COMPUTING

CSE 544 Principles of Database Management Systems

Episerver Digital Experience Cloud Norge Thechforum 2017

CSE 344 JANUARY 3 RD - INTRODUCTION

Examining Public Cloud Platforms

Introduction to Data Management CSE 344. Lecture 1: Introduction

Cloud Computing 4/17/2016. Outline. Cloud Computing. Centralized versus Distributed Computing Some people argue that Cloud Computing. Cloud Computing.

Introduction to Database Systems CSE 444. Lecture 1 Introduction

Virtual Machines. Jinkyu Jeong Computer Systems Laboratory Sungkyunkwan University

Performance Isolation in Multi- Tenant Relational Database-asa-Service. Sudipto Das (Microsoft Research)

SEEM3450 Engineering Innovation and Entrepreneurship

A Holistic View of Telco Clouds

Cloud Computing. Ennan Zhai. Computer Science at Yale University

OPENSTACK: THE OPEN CLOUD

Developing Enterprise Cloud Solutions with Azure

Why Choose MS Azure?

Advanced Technologies of SharePoint 2016

Cloud Computing: The Next Wave. Matt Jonson Connected Architectures Lead Cisco Systems US and Canada Partner Organization

How to Keep UP Through Digital Transformation with Next-Generation App Development

Finished Application e.g. SalesForce.com. Software as a Service. Application Code Publish your app to the cloud. Platform as a Service

Cloud Computing. What is cloud computing. CS 537 Fall 2017

Cloud Computing Concepts, Models, and Terminology

Programowanie w chmurze na platformie Java EE Wykład 1 - dr inż. Piotr Zając

Cloud Computing. Technologies and Types

How CloudEndure Works

COMP6511A: Large-Scale Distributed Systems. Windows Azure. Lin Gu. Hong Kong University of Science and Technology Spring, 2014

CLOUD AND AWS TECHNICAL ESSENTIALS PLUS

Cloud Computing Technologies and Types

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 14 NoSQL

STATE OF MODERN APPLICATIONS IN THE CLOUD

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)?

Microsoft Advanced Technologies of SharePoint 2016

2014 VMware Inc. All rights reserved.

Technical Brief. Adding Zadara Storage to VMware Cloud on AWS

NetApp Clustered Data ONTAP 8.2 Storage QoS Date: June 2013 Author: Tony Palmer, Senior Lab Analyst

Quality-Assured Cloud Bandwidth Auto-Scaling for Video-on-Demand Applications

The End of Storage. Craig Nunes. HP Storage Marketing Worldwide Hewlett-Packard

Cloud & AWS Essentials Agenda. Introduction What is the cloud? DevOps approach Basic AWS overview. VPC EC2 and EBS S3 RDS.

CPET 581 Cloud Computing: Technologies and Enterprise IT Strategies

Modelos de Negócio na Era das Clouds. André Rodrigues, Cloud Systems Engineer

Data Centers and Cloud Computing

How CloudEndure Works

Cloud Computing. DB Special Topics Lecture (10/5/2012) Kyle Hale Maciej Swiech

Introduction to Cloud Computing. [thoughtsoncloud.com] 1

Private cloud to support Infrastructure þÿ a s a S e r v i c e I a a S

How CloudEndure Disaster Recovery Works

Demystifying the Cloud With a Look at Hybrid Hosting and OpenStack

Practical MySQL Performance Optimization. Peter Zaitsev, CEO, Percona July 02, 2015 Percona Technical Webinars

An Integration and Load Balancing in Data Centers Using Virtualization

Capacity Management for Hybrid IT

Deploying enterprise applications on Dell Hybrid Cloud System for Microsoft Cloud Platform System Standard

CSE 344 JULY 9 TH NOSQL

Cluster-Based Computing

The 7 deadly sins of cloud computing [2] Cloud-scale resource management [1]

Azure Stack: The hybrid cloud revolution

The Applications We See The Most

Course Overview This five-day course will provide participants with the key knowledge required to deploy and configure Microsoft Azure Stack.

Introduction to Cloud Computing

Migrating Enterprise Applications to the Cloud Session 672. Leighton L. Nelson

Transcription:

CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2015 Lecture 17 Database Systems as a Cloud Service

Final Project Presentations Presentation Logistics Where: CSE 403 When 1pm-5pm on Monday, March 16 th Use the Doodle to pick a slot Check course website for detailed schedule Each team gets 10 min 8 min to talk and 2 min for questions (strictly enforced) Please try to attend all the talks Grading One grade for the whole team But I recommend that each team member speaks at least briefly CSE 544 - Magda Balazinska, Winter 2015 2

Recommended Presentation Format Introduction and problem statement (2min) What problem are you trying to solve? Mention your related work here Introduce a running example and use it in the rest of the talk Approach (3min) How are you trying to solve the problem? Why did you pick this solution and not another? Preliminary evaluation (2min) What did you find out? What are your results? Conclusion and future work (1min) CSE 544 - Magda Balazinska, Winter 2015 3

Final Project Report Write your final report like a mini conference paper Abstract: 1 short paragraph Introduction: 1 page [The intro is a summary of the paper] Section 2: Background, problem, or motivation: 0.5 page Describe the application domain, the data, the problem Section 3: Approach: 2 pages Section 4: Evaluation: 2 pages Section 5: Related work: 0.25 page Section 6: Conclusion: 1 paragraph Length: Aim for 6 pages but ok to spill over CSE 544 - Magda Balazinska, Winter 2015 4

Course Evaluation https://uw.iasystem.org/survey/141298 Did you find the homeworks useful? HW1: I already got some comments: change the dataset, offer an alternative to installing PostgreSQL, spend less time on data munging. HW2 and HW3? Any suggestions? Did you like the selection of lecture topics? Good mix of background/fundamentals and more recent topics? Did the project help you to learn about data management and the research process? Overall workload? Was it ok? Did you like having a guest speaker? Any other comments are welcome! Thanks! CSE 544 - Magda Balazinska, Winter 2015 5

References Aaron J. Elmore, Sudipto Das, Alexander Pucher, Divyakant Agrawal, Amr El Abbadi, and Xifeng Yan. 2013. Characterizing tenant behavior for placement and crisis mitigation in multitenant DBMSs. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data (SIGMOD '13). CSE 544 - Magda Balazinska, Winter 2015 6

Cloud Computing A definition Style of computing in which dynamically scalable and often virtualized resources are provided as a service over the Internet Basic idea Developer focuses on application logic Infrastructure and data hosted by someone else in their cloud Hence all operations tasks handled by cloud service provider Some history "computation may someday be organized as a public utility (John McCarthy 1960) 1996 Hotmail Software as a Service 1999 Salesforce.com offers enterprise-class Software as a Service 2006 Amazon Web Services with EC2 And now it s commonly used CSE 544 - Magda Balazinska, Winter 2015 7

Levels of Service Infrastructure as a Service (IaaS) Virtual machines, storage, and networking Example Amazon EC2 Platform as a Service (PaaS) Execution runtime, database, web server, development tools, Example Microsoft Azure, Google App Engine Software as a Service (SaaS) Entire applications Example Google Docs CSE 544 - Magda Balazinska, Winter 2015 8

How About Data Management as a Service? Running a DBMS is challenging Need to hire a skilled database administrator (DBA) Need to provision machines (hardware, software, configuration) Problems: If business picks up, may need to scale quickly Workload varies over time Solution: Use a DBMS service All machines are hosted in service provider s data centers Data resides in those data centers Pay-per-use policy Elastic scalability No administration! CSE 544 - Magda Balazinska, Winter 2015 9

Multitenancy Problem Given a DBMS as a cloud service, how to support multiple tenants? 1. Each tenant runs in its own virtual machine(s) For example Amazon RDS 2. Tenants share the same DBMS instances For example SQL Azure 3. Tenants data is stored in a single table For example Force.com CSE 544 - Magda Balazinska, Winter 2015 10

Tenant Placement Many tenants need less than the capacity of one machine How to consolidate many tenants on a few servers? Also called tenant packing Question 1: Which tenants can be placed together? Want to avoid interference One challenge is that tenant workloads vary over time Question 2: How many tenants can we place together? Trade-off between over-provisioning and over-booking CSE 544 - Magda Balazinska, Winter 2015 11

Tenant Migration When conditions change and SLOs are violated Need to move tenants Which tenant to move? How to perform the migration with minimum disruption? CSE 544 - Magda Balazinska, Winter 2015 12

Paper Discussion Delphi: Self-managing controller for a multitenant DBMS Pythia: Learn behavior through observation Tenant behavior Node behavior Uses database-level attributes Assigns a class to each tenant and determines which tenant classes can be colocated Assigns classes to packings: good, good with underutilized resources, or bad CSE 544 - Magda Balazinska, Winter 2015 13

Tenant Model DBMS-agnostic database-level performance measures Write percent (insert, delete, updates) Avg operation complexity: avg nb of pages accessed by tx Percent cache hits Buffer pool size: nb pages allocated to tenant Database size Throughput (transactions per second) Tenant labels D: Disk IOPS, T: Throughout, and O: Operation complexity Each resource type range is split into buckets Tenant labels: DS-TS-OS CSE 544 - Magda Balazinska, Winter 2015 14

Node Model One feature per node: packing vector One cell per tenant class Value in cell is the number of tenants of that class Model learns mapping From feature vector To quality of packing: under, good, over CSE 544 - Magda Balazinska, Winter 2015 15

Crisis Detection and Mitigation Periodically collect a snapshot of system state For each snapshot, classify tenants Tenant class is aggregate class over time-window W Example: {0.8c j, 0.2c k } If packing is bad, use hill-climbing to find a good packing Consider all potential migrations of one tenant Perform the move that yields the largest improvement Naïve cost function minimizes the number of nodes labeled as over Not good because algorithm tends to overload one node completely Better cost function assigns a confidence to each node of being over Consider only nodes with high confidence of being over Minimize the weighted sum of tenants being on an overloaded node Continue until cannot improve any more CSE 544 - Magda Balazinska, Winter 2015 16