Power-Aware Throughput Control for Database Management Systems

Similar documents
Power-Aware Throughput Control for Database Management Systems

Making Databases Green: An Energy-Aware Software Approach

Performance Isolation in Multi- Tenant Relational Database-asa-Service. Sudipto Das (Microsoft Research)

Quantifying Load Imbalance on Virtualized Enterprise Servers

Database Replication in Tashkent. CSEP 545 Transaction Processing Sameh Elnikety

Modeling VM Performance Interference with Fuzzy MIMO Model

Adaptive QoS Control Beyond Embedded Systems

Cost and Performance benefits of Dell Compellent Automated Tiered Storage for Oracle OLAP Workloads

SRM-Buffer: An OS Buffer Management SRM-Buffer: An OS Buffer Management Technique toprevent Last Level Cache from Thrashing in Multicores

Managing Performance Variance of Applications Using Storage I/O Control

TEMPERATURE MANAGEMENT IN DATA CENTERS: WHY SOME (MIGHT) LIKE IT HOT

Dynamic Power Optimization for Higher Server Density Racks A Baidu Case Study with Intel Dynamic Power Technology

Last Class: Processes

Nowadays data-intensive applications play a

IBM B2B INTEGRATOR BENCHMARKING IN THE SOFTLAYER ENVIRONMENT

A Critical Review on Concept of Green Databases

Autonomic Workload Execution Control Using Throttling

Sandor Heman, Niels Nes, Peter Boncz. Dynamic Bandwidth Sharing. Cooperative Scans: Marcin Zukowski. CWI, Amsterdam VLDB 2007.

Autonomic Resource Management for Virtualized Database Hosting Systems

Data Processing on Modern Hardware

SRM-Buffer: An OS Buffer Management Technique to Prevent Last Level Cache from Thrashing in Multicores

Implementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd

Three Paths to Better Business Decisions

Toward Energy-efficient and Fault-tolerant Consistent Hashing based Data Store. Wei Xie TTU CS Department Seminar, 3/7/2017

MC-Safe: Multi-Channel Real-time V2V Communication for Enhancing Driving Safety

Accelerate Applications Using EqualLogic Arrays with directcache

A Comparative Study of Microsoft Exchange 2010 on Dell PowerEdge R720xd with Exchange 2007 on Dell PowerEdge R510

Balancing Fairness and Efficiency in Tiered Storage Systems with Bottleneck-Aware Allocation

MCC-DB: Minimizing Cache Conflicts in Multi-core Processors for Databases

Using Global Behavior Modeling to improve QoS in Cloud Data Storage Services

Cross-layer Optimization for Virtual Machine Resource Management

Deployment Planning and Optimization for Big Data & Cloud Storage Systems

Lenovo Database Configuration for Microsoft SQL Server TB

TOWARD PREDICTABLE PERFORMANCE IN SOFTWARE PACKET-PROCESSING PLATFORMS. Mihai Dobrescu, EPFL Katerina Argyraki, EPFL Sylvia Ratnasamy, UC Berkeley

A Study on Optimally Co-scheduling Jobs of Different Lengths on CMP

Towards Energy Proportionality for Large-Scale Latency-Critical Workloads

Revolutionizing the Datacenter Join the Conversation #OpenPOWERSummit

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

Adapting Mixed Workloads to Meet SLOs in Autonomic DBMSs

Towards Energy-Efficient Reactive Thermal Management in Instrumented Datacenters

Lecture Topics. Announcements. Today: Uniprocessor Scheduling (Stallings, chapter ) Next: Advanced Scheduling (Stallings, chapter

In-Memory Data Management for Enterprise Applications. BigSys 2014, Stuttgart, September 2014 Johannes Wust Hasso Plattner Institute (now with SAP)

Automated Control for Elastic Storage Harold Lim, Shivnath Babu, Jeff Chase Duke University

Enlightening the I/O Path: A Holistic Approach for Application Performance

Experimental Study of Virtual Machine Migration in Support of Reservation of Cluster Resources

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage

Erik Riedel Hewlett-Packard Labs

Lenovo Database Configuration

Catalogic DPX TM 4.3. ECX 2.0 Best Practices for Deployment and Cataloging

Reference Architectures for designing and deploying Microsoft SQL Server Databases in Active System800 Platform

Shen, Tang, Yang, and Chu

Measuring VDI Fitness and User Experience Technical White Paper

PYTHIA: Improving Datacenter Utilization via Precise Contention Prediction for Multiple Co-located Workloads

Coordinating Liquid and Free Air Cooling with Workload Allocation for Data Center Power Minimization

ayaz ali Micro & Macro Scheduling Techniques Ayaz Ali Department of Computer Science University of Houston Houston, TX

IN recent years, power control (also called power capping)

Microsoft IT Leverages its Compute Service to Virtualize SharePoint 2010

MODELING OF SMART GRID TRAFFICS USING NON- PREEMPTIVE PRIORITY QUEUES

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

SoftNAS Cloud Performance Evaluation on Microsoft Azure

Achieving Fair or Differentiated Cache Sharing in Power-Constrained Chip Multiprocessors

Preserving I/O Prioritization in Virtualized OSes

Quality Contracts for Real-Time Enterprises

Biology, Physics, Mathematics, Sociology, Engineering, Computer Science, Etc

Rack-scale Data Processing System

SoftNAS Cloud Performance Evaluation on AWS

Dell Reference Configuration for Large Oracle Database Deployments on Dell EqualLogic Storage

A Network-aware Scheduler in Data-parallel Clusters for High Performance

Lenovo Database Configuration

Upgrade to Microsoft SQL Server 2016 with Dell EMC Infrastructure

DISCRETE TIME ADAPTIVE LINEAR CONTROL

System identification in automatic database memory tuning

Performance Analysis and Prediction for distributed homogeneous Clusters

Application-aware Cross-layer Virtual Machine Resource Management

Visual Analysis of Lagrangian Particle Data from Combustion Simulations

Performance Implications of Storage I/O Control Enabled NFS Datastores in VMware vsphere 5.0

The Case for Heterogeneous HTAP

Dell PowerEdge R910 SQL OLTP Virtualization Study Measuring Performance and Power Improvements of New Intel Xeon E7 Processors and Low-Voltage Memory

Best Practices for Deploying a Mixed 1Gb/10Gb Ethernet SAN using Dell EqualLogic Storage Arrays

EMC Business Continuity for Microsoft Applications

vsphere Resource Management Update 2 VMware vsphere 5.5 VMware ESXi 5.5 vcenter Server 5.5

Index. ADEPT (tool for modelling proposed systerns),

IBM Emulex 16Gb Fibre Channel HBA Evaluation

The Impact of SSD Selection on SQL Server Performance. Solution Brief. Understanding the differences in NVMe and SATA SSD throughput

Gaining Insights into Multicore Cache Partitioning: Bridging the Gap between Simulation and Real Systems

CS Project Report

Adaptive Resync in vsan 6.7 First Published On: Last Updated On:

Data Transformation and Migration in Polystores

Intel Solid State Drive Data Center Family for PCIe* in Baidu s Data Center Environment

LEoNIDS: a Low-latency and Energyefficient Intrusion Detection System

Correlation based File Prefetching Approach for Hadoop

VISUAL QUALITY ASSESSMENT CHALLENGES FOR ARCHITECTURE DESIGN EXPLORATIONS. Wen-Fu Kao and Durgaprasad Bilagi. Intel Corporation Folsom, CA 95630

Performance Characterization of the Dell Flexible Computing On-Demand Desktop Streaming Solution

Microsoft SQL Server 2012 Fast Track Reference Configuration Using PowerEdge R720 and EqualLogic PS6110XV Arrays

Database Solutions Engineering. Best Practices for running Microsoft SQL Server and Microsoft Hyper-V on Dell PowerEdge Servers and Storage

ENERGY-EFFICIENT VISUALIZATION PIPELINES A CASE STUDY IN CLIMATE SIMULATION

Accelerating Pointer Chasing in 3D-Stacked Memory: Challenges, Mechanisms, Evaluation Kevin Hsieh

Scheduling the Intel Core i7

Model-Driven Geo-Elasticity In Database Clouds

Accelerating Microsoft SQL Server Performance With NVDIMM-N on Dell EMC PowerEdge R740

Transcription:

Power-Aware Throughput Control for Database Management Systems Zichen Xu, Xiaorui Wang, Yi-Cheng Tu * The Ohio State University * The University of South Florida Power-Aware Computer Systems (PACS) Lab

Problem Overview and Motivation Data centers are energy starving Database management system (DBMS) is one of the major services in data centers Control the DBMS throughput to save energy Load Balanced performance: performance control 2 Max Load Setpoint t Power saving to the setpoint: low power hardware states

Power-Aware Throughput Control (PAT) Goal: minimize power consumption of a database system while maintaining its desired performance Challenges Energy and DBMS: power consumption minimization Control and DBMS: performance guarantee Load t Contributions: Energy profiling in DBMS Setpoint Active Power Feedback control design in DBMS 3 Energy t

Power-Aware Throughput Control (PAT) Goal: minimize power consumption of a database system while maintaining its desired performance Challenges Energy and DBMS: power consumption minimization Control and DBMS: performance guarantee Load t Contributions: Energy profiling in DBMS Setpoint Active Power Feedback control design in DBMS 4 Energy t

Characterization Study Memory Low power state in memory may not be a good choice for power saving in DBMS workloads 5

Less than 20% power savings Characterization Study Memory At least 90% performance degradation Low power state in memory may not be a good choice for power saving in DBMS workloads 6

Characterization Study CPU CPU provides great power-saving and the relationship can be approximated as a linear model 7

Characterization Study DBMS Workloads The ratio of I/O intensive queries (λ) is the key in the workload statistics for power control 8

Characterization Study DBMS Workloads The ratio of I/O intensive queries (λ) is the key in the workload statistics for power control 9

Characterization Study DBMS Workloads The ratio of I/O intensive queries (λ) is the key in the workload statistics for power control 10

Characterization Study Insights Low power state in CPU provides great opportunities for power-saving DBMS throughput and CPU frequency can be approximated as a linear model rr = AA ff + BB The ratio of I/O intensive queries (λ) is the key in the workload statistics for power control rr = λaa ff + BB 11

Framework Query Workload DBMS PAT Control Plant Query Queue System Util. Monitor Query Resource Estimator R S + r( i 1) - Throughput Monitor Workload Statistics r(i) New CPU State Workload Classifier (Input Module) λ PI Controller f CPU Power State Modulator Actuator (i) Controller 12

Fuzzy Workload Classifier One fuzzy rule base Compute the membership of query to each learnt cluster The procedure of workload classification Evaluation of antecedents Implication calculation of consequents Aggregation result of consequents Query Cluster Statistic Evaluation Implication Vector Weight yes Update the ratio of Aggregation I/O intensive queries ( λ) no 13 Implication Confidence

The Controller Design System model rr ii = λaa ff ii + BB Control model (PI control) Zero steady-state error Short settling time Stability a System Model Control Model Transfer Pole Function Placement Control Period t, Max Overshoot, etc. 14 K p K I

Methodology Testbed Hardware: a DELL PowerEdge R710 with 12-core Intel Xeon E5645, 16GB memory and 1TB storage Software: Redhat 5 (V3.0.0) + PostgreSQL (V8.3.18) Workload: TPC and SDSS Power measurement: WattsUp power meter System Contention Ideal: running DBMS alone Competing: running DBMS with other processes with equal priority. Preemptive: running DBMS with other processes with higher priority. 15

Baselines State-of-the-Practice Baselines NORMAL: no power management, performance first SPEEDSTEP: Intel TM build-in power management State-of-the-Art Baselines TRADITION: Open-loop control based on known workload statistics HEURISTIC: Ad-hoc control based on performancepower model SCTRL: System-level power control for DBMS performance 16

A Snapshot of PAT in Competing Setup Power Throughput 17

How PAT Performs in All Scenarios PAT saves energy cost (15% more than the SPEEDSTEP, 51.3% more than NORMAL) 18

A Snapshot of Comparison Power Throughput 19

Comparison with Other Control Methods Power Throughput PAT has the smallest overshoot and the best energy saving 20

Conclusion Minimizing power consumption of DBMS under a user-specified performance bound Controller design on the system characteristics is the solution Empirically study the relationship between energy and DBMS processing PAT: a feedback control framework with system characteristics Thanks 21

Acknowledge This research was supported by the US National Science Foundation (NSF) grants IIS-1117699, IIS-1156435, and CSR-1143607 22

Related Work Control in DBMS Feedback Control in DBMS [Tu et al. VLDB 2007] Other Applications Power Shaving Power Over-commitment Proportional Energy Consumption 23

A Snapshot of PAT in Ideal Setup Power Throughput 24

A Snapshot of PAT in Competing Setup Power Throughput 25

Backup: Errors From Modeling FWC could produce almost 10% error in the system, The designed maximum overshoot is 40% in PAT. 26

Backup Characterization Study Energy and DBMS DBMS throughput and CPU frequency can be approximated as a linear model 27

Backup: Rule-based Workload Classifier 28

Outline Problem Overview & Motivation Characterization Study: DBMS and Energy Overall Framework Fuzzy Workload Classifier The Controller Design Evaluation Conclusion 29

Fuzzy Workload Classifier One fuzzy rule base If [d i CPU, d i I/O] ϵ cluster X j, then [u i CPU, u i I/O] = M j [d i CPU, d i I/O] + N j The procedure of workload classification Evaluation: [d i CPU, d i I/O] -> [u i CPU, u i I/O] per X j Implication calculation: Σ(p j) p j -> t Σ(p j ) i j of X j Aggregation result: [ t i j u i CPU, t i j u i I/O] T 30